2.2. Implementation of frequency models
Extreme water levels are estimated using a method of statistical fitting and extrapolation of extremes. Only the main points of the method are outlined here. For more information, please refer to [
14]. The calculations were performed using the R environment because it is well known nowadays well.
This study focuses on the statistical analysis of extreme water levels of lake Nokoue, sourced from the Institute of Hydrology and Oceanology Research of Benin (IRHOB). An extreme water level is defined as the maximum observed value within a year. These values were extracted from a series spanning from 1997 to 2022.
The categorization of flood risk hazard thresholds is based on the classification of the standardized water height index. This categorization was made possible through a transposition of daily data used by McKee et al. (2002) [
15]. By normalizing the water height series of lake Nokoue, these thresholds were determined (Totin et al., 2016; WMO, 2012) [
16,
17]. Water height anomaly indices and McKee’s classification were used to characterize flood thresholds. The risk categories (limited, moderate, significant, and critical) are shown in
Table 1. The occurrence period of flood water heights associated with flood risk threshold indices is determined using the Gumbel distribution with the method of linear moments.
This table categorizes daily maximum water heights into different flood risk levels (limited, moderate, significant, and critical) based on the standardized water height index.
The experimental probabilities associated with the observations were calculated using formula “(1)” with the Weibull formula, which aims to obtain unbiased exceedance probabilities for all distributions [
18].
where
is the exceedance probability of the maximum water level, iii is the rank of the height in the series, and
is the size of the series consisting of the annual maximum water levels.
The Mann-Kendall, Wald-Wolfowitz, and Wilcoxon tests were used respectively for stationarity, independence, and homogeneity. The p-value represents the risk of error if we consider that the null hypothesis
(the hypothesis that the sample is stationary, homogeneous, and independent) is not true. The maximum acceptable value for the risk of error is set at 5%. If the p-value is less than 5%, there is less than a one in five chance of being wrong in considering that the series of extreme water levels is not independent, stationary, and homogeneous [
19].
A parametric distribution law is fitted to the extreme water levels. Adopting a distribution law to study and describe maximum water levels is undoubtedly the most critical step, introducing the greatest uncertainties [
20,
21]. Il est prudent de tester d’autres lois de distribution appartenant au domaine asymptotique des évènements extrêmes. Diverses pistes ont contribué à faciliter ce choix, mais il n’existe malheureusement pas de méthode universelle et infaillible [
22]. It is prudent to test other distribution belonging to the asymptotic domain of extreme events. Various approaches can help facilitate this choice, but unfortunately, there is no universal and infallible method. [
23]: The Generalized Extreme Value (GEV) distribution, the Gumbel distribution, and the Generalized Pareto (GPA) distribution are all types of generalized extreme value laws that are often used to model extreme events, such as river or lake floods. A comparative study of the performance of these recommended distributions by [
24] is the best approach for justifying the choice of a distribution. The linear moments method available in the (lmomco) package, based on negative logarithmic likelihood, was used for parameter estimation. The distribution functions of the three laws used in this article are as follows “(2)” to “(4)”:
where
for the Gumbel distribution, where
is the scale parameter and
is the location parameter.
With et
GPA : Let
be a random variable with distribution function
and
be a threshold value. The random variable
pour
follows the conditional distribution function:
The quality of the statistical extrapolation of extreme events is assessed using linear moments diagrams, Taylor diagrams, cumulative distribution functions, and the root mean square error (RMSE), as these methods are more practical and powerful compared to the χ² test, Bayesian Information Criterion (BIC), and Akaike Information Criterion (AIC).
par la formule de The linear moments diagram (LM) is based on the combination of skewness coefficients (
) and kurtosis coefficients (
) to graphically assess which distribution best fits the sample observations. Constructing the diagram requires knowledge of the function relating
to
through the formula of “(6)”:
With aj : polynomial approximation coefficient. The L-moments package in the R programming language was used to represent the kurtosis coefficients as a function of skewness coefficients and the experimental characteristic of the sample.
The root mean square error (RMSE) is a method for objectively evaluating the performance of models. It provides a measure of the average magnitude of prediction errors, with lower values indicating better model accuracy. It is formulated as follows “(7)”: