1. Introduction
The establishment of a reliable hydrological monitoring system is crucial for many applications, such as drought monitoring, flood prevention, water balance calculations and water resources management [
1]. When properly installed and maintained, such monitoring systems can provide precise real-time data on hydrological variables, including precipitation, river discharge, and ground water levels [
2]. However, given the high costs of maintaining accurate measurement networks, the implementation of such an observatory remains challenging, particularly in remote and hard-to-access sub-tropical regions that experience extreme weather conditions (WMO 2022, 1987 ; Sivakumar 2017; Sivapalan et al. 2003). These challenges are further exacerbated by the risk of infrastructure damage as well as insufficient public involvement in water resources monitoring, resulting in limited, fragmented, and discontinuous hydrological monitoring networks that impede a comprehensive understanding of the hydrological processes and hinder proper water resources management. Therefore, it is increasingly urgent to design easy-to-maintain and cost-effective observation systems that minimize field interventions while ensuring reliable operation of hydrological observatories.
Recent advances in remote sensing techniques have led to the emergence of satellite-based, data reanalysis-based, model-based products as a potential solution to meet the needs of hydrological observatories [
7,
8]. They offer an attractive alternative, especially to ground-based rain gauges [
3]. Among recent satellite-based gridded precipitation products, the Global Precipitation Measurement Integrated Multi-satellitE Retrievals GPM IMERG v06 Final has garnered much attention due to its 20-year availability of precipitation data, fine spatial resolution (0.1° x 0.1° or ~11 x ~11 km²), and high temporal resolution (30 min) compared to other products (Huffman et al. 2020). Given their characteristics, IMERG products appear to be suitable data sources for forcing rainfall-runoff models used in a variety of hydrological applications, such as extreme event analysis and water resources management [
9,
10,
11].
Despite their high potential, IMERG products are subject to errors arising from various factors, such as the nature of the precipitation measurement [
12], sampling uncertainties [
13], retrieval algorithms, and variations in land surface properties [
14]. Furthermore, since IMERG incorporates ground-based rainfall measurements from the Global Precipitation Climatology Center (GPCC) through its multi-satellite algorithms, the product may be affected by greater uncertainties in tropical areas where there is a limited availability of rain gauges for calibration or validation [
15]. In addition, several researchers have highlighted the limitations of IMERG, which include low accuracy representation of the spatial variability of rainfall, as well as both overestimation and underestimation of certain rainfall classes depending on the study area [
16,
17]. Despite these limitations, the research community continues to explore the applications of IMERG in the hydrological domain, especially in countries with poor rainfall monitoring networks.
While a significant amount of research has been conducted to quantify errors in IMERG products (e.g., Derin et al. 2021; Dezfuli et al. 2017), limited attention has been paid to understanding the implications of these errors on rainfall-runoff model forecasts. As hydrological models exhibit strong nonlinearity, errors in precipitation measurements can greatly affect the accuracy of flood simulations. The extent of this impact is contingent on the quality of input data and the interplay between different components of the model [
20]. For instance, some studies indicate that, conditional upon recalibrating the hydrological model, IMERG can produce satisfactory flow simulations [
10,
18]. Some research even suggests that IMERG can be as effective as using ground-based rainfall data [
21,
22,
23]. However, in other cases, hydrological models relying on IMERG products as input data exhibited inadequate performance despite repeated model calibration [
24,
25]. The main reasons for the lower performance of IMERG-data driven models have been attributed to significant biases between IMERG and ground-based observations. Hence, there is a need to recalibrate the hydrological model when using IMERG products to better account for IMERG rainfall-runoff model interactions. Furthermore, the inherently lower accuracy of IMERG rainfall data compared to those derived from ground-based networks exacerbates these challenges [
26].
It is worth emphasizing that no rainfall product can ensure excellent performance in hydrology due to various factors such as study scale and spatial coverage of the rainfall data, location, time scale, basin characteristics, choice of hydrological model and process representation, and model calibration algorithms [
27,
28]. For instance, although satellite- and gauged-based products such as IMERG have demonstrated their efficacy in estimating rainfall patterns compared to ground-based measurements in remote areas [
29], knowledge gaps remain regarding their reliability in hydrological applications, particularly in tropical areas with complex terrain and frequent extreme rainfall events. Moreover, most evaluations of the suitability of IMERG products for hydrological applications have focused on China, where extensive hydrometeorological monitoring equipment is available for conducting validation studies [
24,
25,
27,
28]. For the vast African continent, however, reliable data are scarce, and evaluations have often been conducted at large spatial (e.g., country level or larger) or temporal (e.g., annual or longer) scales, and on large river basins. Hence, given that many water management issues are addressed at smaller scales, it is crucial to also evaluate the effectiveness of rainfall products from different sources for hydrological observatory applications for small or meso-scale watersheds.
The overall aim of this study was therefore to evaluate the usefulness of ground-based rain gauge data (RG) and remote sensing IMERG products for streamflow estimation. More specifically, the study assesses the performance of the semi-distributed SWAT (Soil and Water Assessment Tool) model at predicting streamflow using either different configurations of a ground-based rain gauge network or the IMERG product. The study was conducted in the Sahafihitry watershed (ca. 200 km²) located in northeastern Madagascar. This tropical watershed experiences high seasonality in rainfall frequency, intensity and duration (Ramahaimandimby et al. 2022). Heavy precipitation, including those accompanying large cyclone events which tend to occur every other year on average, threatens human activity and causes natural disasters such as floods and landslides [
30,
31]. Data collection and measurement in this region are particularly challenging due to rugged terrain and impassable roads during the rainy season. Furthermore, a lack of awareness among the local population can make equipment vulnerable to vandalism. These factors complicate establishing and maintaining long-term hydrological observatories. By assessing different sources of rainfall data and considering the trade-offs between data cost and quality, this study can also help decision-makers select the most appropriate data sources and hydrological observatory designs for their specific needs, ultimately contributing to the sustainable management of water resources in Madagascar and other regions facing similar challenges.
Author Contributions
Conceptualization, Z.R. and C.B.; Formal analysis, Z.R. and C.B.; Funding acquisition, M.V.; Investigation, Z.R.; Methodology, Z.R. and C.B.; Project administration, M.V.; Software, Z.R.; Supervision, A.R. and C.B.; Validation, Z.R., A.R. and C.B.; Visualization, Z.R., A.R., M.V. and C.B.; Writing—original draft, Z.R.; Writing—review and editing, Z.R., M.V. and C.B. All authors have read and agreed to the published version of the manuscript.
Figure 1.
Geographical context of the Sahafihitry catchment in Northeastern Madagascar: topography and location of weather station, additional rain gauges and river discharge monitoring station within the watershed. Grids correspond to the grid of the IMERG gridded precipitation dataset.
Figure 1.
Geographical context of the Sahafihitry catchment in Northeastern Madagascar: topography and location of weather station, additional rain gauges and river discharge monitoring station within the watershed. Grids correspond to the grid of the IMERG gridded precipitation dataset.
Figure 2.
Characteristics of the Sahafihitry catchment: (a) 2020 land cover, (b) soil type, (c) slope.
Figure 2.
Characteristics of the Sahafihitry catchment: (a) 2020 land cover, (b) soil type, (c) slope.
Figure 4.
Cumulative rainfall for one rain gauge (1RG), two rain gauges (2RG), five rain gauges (5RG) and IMERG satellite-based precipitation data based on ‘wet’ rainy season (RS) and ‘dry’ rainy season (DS). The date on the X-axis is in MM/DD/YYYY format.
Figure 4.
Cumulative rainfall for one rain gauge (1RG), two rain gauges (2RG), five rain gauges (5RG) and IMERG satellite-based precipitation data based on ‘wet’ rainy season (RS) and ‘dry’ rainy season (DS). The date on the X-axis is in MM/DD/YYYY format.
Figure 5.
Cumulative rainfall (5RG) and discharge (Q_OBS) curves of the Sahafihitry watershed over the entire study period (a) and on an annual basis (b).
Figure 5.
Cumulative rainfall (5RG) and discharge (Q_OBS) curves of the Sahafihitry watershed over the entire study period (a) and on an annual basis (b).
Figure 6.
Plots of observed and simulated daily flows for the 5 rainfall-runoff scenarios (from the left to the right: 1RG, 2RG, 5RG, IMERG) for the calibration period (top, from 08/2018 to 07/2020) and validation period (bottom, 08/2020 – 07/2021). The last scenario (IMERG_in_5RG) corresponds to the model calibrated using 5 rain gauges but run with IMERG data for the validation period.
Figure 6.
Plots of observed and simulated daily flows for the 5 rainfall-runoff scenarios (from the left to the right: 1RG, 2RG, 5RG, IMERG) for the calibration period (top, from 08/2018 to 07/2020) and validation period (bottom, 08/2020 – 07/2021). The last scenario (IMERG_in_5RG) corresponds to the model calibrated using 5 rain gauges but run with IMERG data for the validation period.
Figure 7.
Comparison of observed (Q_OBS) and simulated hydrographs for the validation period (08/2020-07/2021) at the Sahafihitry station for the 5 rainfall-runoff scenarios. Model calibrated using one raingauge (Q_1RG), 2 gauges (Q_2RG), 5 gauges (Q_5RG) or IMERG data (Q_IMERG). The last scenario corresponds to the model calibrated using 5 rain gauges but run with IMERG data for the validation period (IMERG_in_5RG).
Figure 7.
Comparison of observed (Q_OBS) and simulated hydrographs for the validation period (08/2020-07/2021) at the Sahafihitry station for the 5 rainfall-runoff scenarios. Model calibrated using one raingauge (Q_1RG), 2 gauges (Q_2RG), 5 gauges (Q_5RG) or IMERG data (Q_IMERG). The last scenario corresponds to the model calibrated using 5 rain gauges but run with IMERG data for the validation period (IMERG_in_5RG).
Figure 8.
Root Mean Square Error (RMSE), Nash Sutcliffe Efficiency coefficient (NSE) and Kling-Gupta efficiency (KGE) for various classes of discharge for the 5 rainfall-runoff scenarios. Model calibrated using one rain gauge (1RG), 2 gauges (2RG), 5 gauges (5RG) or IMERG data (IMERG). The last scenario corresponds to the model calibrated using 5 rain gauges but run with IMERG data for the validation period (IMERG_in_5RG). Square markers refer to the calibration period, triangular markers refer to the validation period.
Figure 8.
Root Mean Square Error (RMSE), Nash Sutcliffe Efficiency coefficient (NSE) and Kling-Gupta efficiency (KGE) for various classes of discharge for the 5 rainfall-runoff scenarios. Model calibrated using one rain gauge (1RG), 2 gauges (2RG), 5 gauges (5RG) or IMERG data (IMERG). The last scenario corresponds to the model calibrated using 5 rain gauges but run with IMERG data for the validation period (IMERG_in_5RG). Square markers refer to the calibration period, triangular markers refer to the validation period.
Table 1.
Description of the parameters used in the calibration process, their possible range, the initial value as well as the optimum values after calibration using the 1RG, 2RG, 5RG and IMERG rainfall scenarios (see Section 2.4.3).
Table 1.
Description of the parameters used in the calibration process, their possible range, the initial value as well as the optimum values after calibration using the 1RG, 2RG, 5RG and IMERG rainfall scenarios (see Section 2.4.3).
Parameters |
Description |
Range |
Initial value |
1RG |
2RG |
5RG |
IMERG |
Min |
Max |
|
|
|
|
|
1: CN2 |
Multiplication factor for SCS runoff curve number |
0 |
1 |
0.5 |
0.506 |
0.563 |
0.545 |
0.834 |
2: SOL_AWC |
Multiplication factor for available water capacity of the soil layer |
0 |
1 |
1 |
0.774 |
0.649 |
0.686 |
0.879 |
3: GW_DELAY |
Groundwater delay (days) |
0 |
450 |
31 |
367 |
58 |
72 |
70 |
4: GWQMN |
Threshold depth of water in the shallow aquifer required for return flow to occur (mm) |
0 |
5000 |
1000 |
1097 |
922 |
4982 |
1572 |
5: SHALLST |
Initial depth of water in the shallow aquifer (mm) |
0 |
5000 |
1000 |
802 |
2442 |
1002 |
2697 |
6: DEEPST |
Initial depth of water in the deep aquifer (mm) |
0 |
10000 |
2000 |
8115 |
7725 |
5275 |
9555 |
7: GW_REVAP |
Groundwater “revap” coefficient |
0.02 |
0.2 |
0.02 |
0.081 |
0.105 |
0.113 |
0.120 |
8: RCHRG_DP |
Deep aquifer percolation fraction |
0 |
1 |
0.05 |
0.3 |
0.4 |
0.9 |
0.8 |
9: GWHT |
Initial groundwater height (m) |
0 |
25 |
1 |
6.4 |
22.0 |
12.0 |
6.3 |
10: GW_SPYLD |
Specific yield of the shallow aquifer (m3/m3) |
0 |
0.4 |
0.003 |
0.3 |
0.2 |
0.2 |
0.3 |
Table 3.
Cumulative rainfall and average coefficients of variation (CV) between the five ground-based rain gauges (5RG) and eight IMERG grids across the Sahafihitry watershed for three successive years.
Table 3.
Cumulative rainfall and average coefficients of variation (CV) between the five ground-based rain gauges (5RG) and eight IMERG grids across the Sahafihitry watershed for three successive years.
Period (MM/DD/YYYY) |
CV 5RG (%) |
Cumulative rainfall 5RG |
CV IMERG (%) |
Cumulative rainfall IMERG |
Period 1 (08/01/2018 – 07/31/2019) |
71.9 |
2100 |
70.2 |
1721 |
Period 2 (08/01/2019 – 07/31/2020) |
73.8 |
3350 |
62.9 |
2121 |
Period 3 (08/01/2020 – 07/03/2021) |
63.4 |
1806 |
60.3 |
1370 |