Evaluation of globally gridded precipitation data and satellite-based terrestrial water storage products  using hydrological drought recovery time

Çakan, Çağatay; Yımaz, M. Tuğrul; Dobslaw, Henryk; Ince, E. Sinem; Evrendilek, Fatih; Förste, Christoph; Yagci, Ali Levent

doi:https://doi.org/10.5194/hess-29-3359-2025

Articles | Volume 29, issue 14

https://doi.org/10.5194/hess-29-3359-2025

Articles | Volume 29, issue 14

Research article

30 Jul 2025

Research article |

| 30 Jul 2025

Evaluation of globally gridded precipitation data and satellite-based terrestrial water storage products using hydrological drought recovery time

Çağatay Çakan, M. Tuğrul Yımaz, Henryk Dobslaw, E. Sinem Ince, Fatih Evrendilek, Christoph Förste, and Ali Levent Yagci

Abstract

Accurate precipitation observations are crucial for understanding meteorological and hydrological processes. Most precipitation products rely on station-based observations, either directly or for bias-corrected satellite retrievals. To validate these station-based precipitation products, additional independent data sources are necessary. This study aims to assess the performance of the Global Precipitation Climatology Centre (GPCC) Full Data Monthly Product v2022 and Global Precipitation Climatology Project (GPCP) v3.2 Monthly Analysis Product by estimating the hydrological drought recovery time (DRT) from precipitation and the terrestrial water storage anomaly (TWSA) acquired from satellite gravimetry. This study also evaluates the drought monitoring performance of G3P and JPL mascon total water storage (TWS) monthly solutions from the Gravity Recovery and Climate Experiment (GRACE) and GRACE Follow-On (GRACE-FO) satellite missions. The current study employed two methods to estimate DRT and evaluated the consistency of DRT estimates by calculating the time difference in DRT values derived from the two methods. Globally and across all climate zones, GPCC and GPCP showed comparable performance in hydrological applications with no significant differences in the mean DRT estimates. For the TWS products, DRT estimates using JPL mascon were, on average, 2.6 months longer than those using G3P. However, G3P showed approximately 5.0 % higher consistency than JPL mascon globally and across each climate zone, suggesting its better suitability for more precise drought-related analyses. These findings indicate that G3P outperforms JPL mascon in aligning with precipitation products and offers better consistency in DRT estimation. These results provide valuable insights into the accuracy of precipitation and TWSA products by utilizing hydrological drought characteristics, enhancing our understanding of meteorological and hydrological processes.

Download & links

How to cite.

Received: 19 Aug 2024 – Discussion started: 03 Sep 2024 – Revised: 29 Apr 2025 – Accepted: 09 May 2025 – Published: 30 Jul 2025

1 Introduction

Precipitation is a pivotal element in the global water cycle. It provides freshwater to continental regions and thereby allows vegetation to flourish. Average precipitation amounts and the associated temporal distribution of rain events characterize climate zones and terrestrial ecosystems (Bayar et al., 2023; Lai et al., 2018). Too much or too little precipitation than usual, however, can have very severe impacts on the biosphere, agriculture, and human societies in general. The close monitoring of droughts (Barker et al., 2016; Lai et al., 2019; Wu et al., 2023; Xu et al., 2015) and floods (Belabid et al., 2019; Harris et al., 2007; Maggioni and Massari, 2018), as well as the prediction of precipitation at short, medium, and long forecast horizons (Akbari Asanjan et al., 2018; Senocak et al., 2023), remains a central focus of hydrometeorological research.

In situ observations from rain gauges are typically utilized to monitor precipitation (Barker et al., 2016; Wehbe et al., 2017; Wei et al., 2019). However, the distribution of gauge stations is often sparse and uneven, particularly over complex terrains where stations may be difficult to install and maintain (Wang et al., 2017). In contrast, satellite- and satellite-blended-based precipitation products derived from remote sensing instruments have made essential strides, offering varying spatiotemporal resolutions as a viable alternative to ground-based observations (Bai et al., 2019; Prakash et al., 2015; Wang et al., 2017; Wu et al., 2023). The Global Precipitation Climatology Centre (GPCC) and Global Precipitation Climatology Project (GPCP) are frequently used precipitation products with global coverage (Adler et al., 2003; Sun et al., 2018). GPCC represents ground-based precipitation observations, whereas GPCP is a combination of satellite and in situ station observations.

Products from both GPCC and GPCP have been frequently compared with each other and against a wide range of atmospheric reanalyses (e.g., Prakash et al., 2015). At regional scales, particularly in the tropics, good agreement has been observed between GPCC and GPCP (Negrón Juárez et al., 2009; Sun et al., 2018). Moreover, the spatial distribution of annual and seasonal rainfall climatology across western Africa has been found to be consistent between GPCC and GPCP (Lamptey, 2008). Although there are regional similarities, there are also distinct differences. GPCC outperformed GPCP against station-based precipitation data in China (Wang et al., 2017), demonstrated enhanced spatiotemporal representativeness of precipitation patterns in Iran (Darand and Khandu, 2020), and showed superior performance in the Sahel region based on statistical error metrics (Ali et al., 2005). In general, these studies evaluate precipitation products by comparing them with in situ observations. However, given that both datasets utilize ground-station-based observations, independence of the evaluation analyses becomes an important aspect. Accordingly, additional independent assessments may be needed for precipitation products that utilize observations such as GPCC and GPCP.

Drought monitoring is crucial since drought is one of the most destructive disasters, resulting from a significant decrease in a region's water resources over an extended period. It can have disastrous consequences for ecosystems, human health, agriculture, irrigation, and water supply (AghaKouchak et al., 2015; Ding et al., 2020; Mishra and Singh, 2010; Patz et al., 2014; Piao et al., 2010). Drought indices, such as the standardized precipitation index (SPI; McKee et al., 1993), the standardized precipitation evapotranspiration index (SPEI; Vicente-Serrano et al., 2010), the standardized runoff index (SRI; Shukla and Wood, 2008), and the standardized streamflow index (SSI; Vicente-Serrano et al., 2012) are utilized to characterize drought characteristics (e.g., frequency, severity, and recovery time). Meteorological droughts arise from insufficient precipitation, whereas hydrological droughts result from insufficient water storage (Behrangi et al., 2015; Keyantash and Dracup, 2002; Thomas et al., 2014). SPI focuses solely on precipitation data, while SPEI utilizes precipitation and evapotranspiration data. SSI hinges on runoff yield from the land surface, while SRI utilizes streamflow in river channels (Lai et al., 2019). Complex hydrological models utilize precipitation data for hydrological drought assessment based on SSI and SRI (Lai et al., 2018; Madadgar and Moradkhani, 2014). Alternatively, the water storage deficit can provide insights into hydrological drought without employing elaborate hydrological models (Thomas et al., 2014). It only requires measurements of the amount of water stored at or underneath the ground and is employed in the estimation of drought recovery time (DRT). By combining precipitation and terrestrial water storage (TWS) observations, it is even possible to predict the amount of precipitation that will be required to re-fill any storage deficit (Singh et al., 2021).

The satellite mission Gravity Recovery and Climate Experiment (GRACE) provides such measurements of TWS (Springer et al., 2017). GRACE was conducted jointly by the National Aeronautics and Space Administration (NASA) and the German Aerospace Center (DLR) from 2002 until 2017. Since 2018, GRACE Follow-On (GRACE-FO), the successor of GRACE, has been operated by NASA together with GFZ Helmholtz Centre for Geosciences to further extend the data record until present. Terrestrial water storage anomalies (TWSAs), encompassing all subsurface and surface water balance components, are obtained by measuring tiny irregularities in the orbits of two identical twin satellites that trail each other with a distance of roughly 200 km in polar orbit of initially 490 km altitude (Wahr et al., 2004). Temporal changes in the Earth's gravity field are computed from the comparison of observations from different times. Once atmospheric, oceanic, and geophysical effects are removed, the remaining signal on monthly-to-interannual scales reflects variations in TWS. The ready-to-use TWS data from the GRACE and GRACE-FO missions are made available either as spherical harmonic (SH) or mass concentration solutions (mascons). GRACE-based TWS has been used in the past to relate interannual variations in TWS to large-scale climate modes (Pfeffer et al., 2023) and to validate hydrological models (Döll et al., 2024). There have even been attempts to assimilate GRACE data into land-surface schemes (Eicker et al., 2014; Tangdamrongsub et al., 2021). Hence, GRACE and GRACE-FO are currently the most used datasets in global TWS.

GRACE and GRACE-FO TWS products could be used for an independent assessment of precipitation products by drought monitoring, serving as an alternative to assessments conducted with hydrological models (Beck et al., 2017; Gebrechorkos et al., 2024). Existing studies evaluate precipitation products using drought monitoring and focus on drought indices such as SPI and SPEI (Golian et al., 2019; Wei et al., 2019, 2021). However, more independent assessment studies using key parameters that encompass all subsurface and surface water balance components, such as TWS, are necessary to better understand the utility of precipitation products. This is particularly important for hydrological drought assessment, as spatial variability across different climate zones globally is still inadequately explored.

The current study aims to independently evaluate and compare frequently used global gridded precipitation products (i.e., GPCC and GPCP) by using the GRACE/GRACE-FO TWS data (i.e., the JPL mascon and G3P products) in order to assess drought conditions. The research evaluates the performance of the JPL mascon and G3P TWS products based on consistency in DRT estimate criteria. Both evaluations were conducted by estimating DRT based on TWSAs and required precipitation amount. Comparisons of the suitability of these precipitation and TWS products for global hydrological applications across various Köppen–Geiger climate zones enhance our understanding of the relationship between hydrological droughts, global precipitation, and TWS products through DRT estimates.

2 Methodology

2.1 Datasets

2.1.1 GPCC and GPCP precipitation

Established in 1989 by the World Meteorological Organization (WMO), the Global Precipitation Climatology Centre (GPCC) integrates monthly land precipitation data from various sources, including global telecommunication systems (GTSs), synoptic weather reports (SYNOPs), and monthly climate reports (CLIMATs). GPCC offers different precipitation products with varying spatiotemporal resolutions, such as the Full Data Monthly Product (GPCC FDM), the Monitoring Product, and the First Guess Monthly Product. Given its suitability for model verification and water cycle studies, this study utilized GPCC FDM v2022 (Schneider et al., 2022) to analyze the relationship between precipitation and TWS (Schneider et al., 2014). The GPCC FDM dataset provides monthly precipitation data at a spatial resolution of 0.5° from 1891 to 2020. The GPCC data were sourced from the Deutscher Wetterdienst (German Meteorological Service) website (https://opendata.dwd.de/climate_environment/GPCC/html/fulldata-monthly_ v2022_doi_download.html, last access: 27 May 2022).

The Global Precipitation Climatology Project (GPCP) is a combined satellite–gauge precipitation product overseen by the World Climate Research Program (WCRP) under its Global Water and Energy Experiment (GEWEX) Data and Assessment Panel (GDAP). It integrates rain gauge observations with satellite data to generate global precipitation estimates. For this study, we utilized GPCP v3.2 Satellite-Gauge (SG) Combined Data (Huffman et al., 2023). The monthly GPCP v3.2 dataset spans 1979 to present at a spatial resolution of 0.5°. The data are available from the Goddard Earth Sciences Data and Information Services Center (https://disc.gsfc.nasa.gov/datasets/GPCPMON_3.2/summary, last access: 10 October 2023).

2.1.2 TWS from GRACE/GRACE-FO

We analyzed the GRACE/GRACE-FO Level-3 products of the G3P (Güntner et al., 2024) and the JPL Release 6 mascon (Watkins et al., 2015; Wiese et al., 2023) TWS datasets to estimate water storage deficit. TWS comprises the sum of snow, ice, surface water, soil moisture, and groundwater. The G3P TWS data were acquired from the Gravis Information Service at the GFZ Helmholtz Centre for Geosciences (https://gravis.gfz.de/gws, last access: 28 July 2024). The JPL mascon TWS data were downloaded from the Virtual Directories of Earth Data CMR (https://cmr.earthdata.nasa.gov/virtual-directory/collections/C2536962485-POCLOUD/temporal/2002/04/16, last access: 25 December 2023). Both G3P and JPL mascon offer a higher spatial resolution (0.5°) than the spherical harmonic solutions and monthly data. However, datasets suffer from missing monthly data, particularly after 2011, due to satellite battery issues. To ensure consistent comparisons between continuous TWS and precipitation time series data, we filled the missing months in the time series by averaging the data from the previous and subsequent 2 months, resulting in a mean of 4 months (Andrew et al., 2017; Long et al., 2015). However, there is also a time gap between the GRACE and GRACE-FO missions, spanning July 2017 (i.e., the end of the science phase of the GRACE mission) to May 2018 (i.e., the launch of GRACE-FO). This period is left missing.

The JPL mascon TWS dataset represents anomalies relative to a long-term mean from January 2004 to December 2009, while the G3P TWS dataset uses a long-term mean from April 2002 to December 2020 as the baseline. To ensure consistent comparisons between the time series, we adjusted the baseline of the JPL mascon TWS to match that of the G3P TWS (Humphrey et al., 2023). This involved averaging each grid point over the period of April 2002 to December 2020 and subtracting it from the entire time series. Thus, this study utilized monthly TWS data from April 2002 to December 2020 for DRT analysis.

2.1.3 Köppen–Geiger climate classification

Globally, the Köppen–Geiger climate classification system, based on temperature and precipitation, is widely used for regional climate zonation by a diverse range of disciplines, such as climate research, physical geography, hydrology, agriculture, biology, and education (Kottek et al., 2006). In response to the need for current and well-documented global climate classifications, a new Köppen–Geiger climate map was released for a high-resolution (0.5°) depiction of global climates for the 1951–2000 period (Kottek et al., 2006). The present study utilized a more recent version of this dataset, covering the 1986–2010 period at a higher spatial resolution of 0.083° (Rubel et al., 2017). To achieve consistency with our TWS and precipitation data (grid resolution of 0.5°), the Köppen–Geiger climate classifications were re-gridded using bilinear interpolation. This study focused on the following five main Köppen–Geiger climate categories: equatorial, arid, warm temperate, snow, and polar, as illustrated in Fig. 1. The Köppen–Geiger climate classification scheme used in this study is available at https://koeppen-geiger.vu-wien.ac.at/present.htm.

https://hess.copernicus.org/articles/29/3359/2025/hess-29-3359-2025-f01

Figure 1Köppen–Geiger climate zone classification with the five main zones: equatorial (A), arid (B), warm temperate (C), snow (D), and polar (E).

2.2 Water balance equation

TWS changes are closely related to precipitation via the water balance equation:

\begin{matrix} (1) & d s / d t = P - ET - R, \end{matrix}

where $d s / d t$ is storage change with time, P is precipitation, ET is evapotranspiration, and R is runoff, all usually given in millimeter-equivalent water height per month (mm per month).

Any storage changes with time ( $d s / d t$ ) must be caused by a water flux, which might be vertically between the surface and atmosphere (P or ET). Horizontal fluxes at or underneath the Earth's surface are summarized as lateral runoff (R). Since gravity missions directly observe TWSAs relative to an (undefined) long-term mean value, they can be used to derive information about water fluxes at a wide range of timescales. In principle, the difference between two storage estimates separated by 30 d (the usual sampling of the GRACE solutions) allows us to derive quantitative information about the amount of precipitation that occurred during this time span.

2.3 Deviation of storage (dTWSA)

Understanding the magnitude of water deficits remains crucial for determining drought recovery timelines. These water deficits can be directly inferred from the variability in TWSA data (Thomas et al., 2014). First, TWSA is smoothed by a 3-month moving average filter to reduce the high-frequency noise (Singh et al., 2021), resulting in smoothed TWSA (sTWSA). Variations in water storage can be influenced by long-term factors, such as glacier mass accumulation and/or groundwater extraction. To isolate the impact of such long-term processes, we detrended the TWSA data for each grid by removing the linear trend of the relevant grid (Singh et al., 2021). Detrending removes the linear trend, essentially isolating the deviations from this long-term trend, which we refer to as deviation of storage (dTWSA):

\begin{matrix} (2) & {dTWSA}_{x, y, t} = {sTWSA}_{x, y, t} - trend ({sTWSA}_{x, y, t}), \end{matrix}

where ${sTWSA}_{x, y, t}$ is the smoothed TWSA at the x,y grid point and time t, and $trend ({sTWSA}_{x, y, t})$ is the trend of the smoothed TWSA at the x,y grid point and time t.

2.4 Cumulative detrended precipitation anomaly (cdPA)

To ensure consistency with dTWSA, we perform a temporal integration of the precipitation data. Then, cumulative precipitation anomalies (cPAs) were obtained by subtracting the mean precipitation observed from April 2002 to December 2020 (reference period) from the actual precipitation data for each product:

\begin{matrix} (3) & {cPA}_{x, y, t} = {cP}_{x, y, t} - \overline{{cP}_{x, y}}, \end{matrix}

where ${cP}_{x, y, t}$ is the cumulated precipitation at the x,y grid point and time t, and ${\overline{cP}}_{x, y}$ is the temporal mean of the cumulative precipitation at the x,y grid point.

Similar to dTWSA, we smoothed the cumulative precipitation anomalies (scPAs) derived from GPCC and GPCP by applying a 3-month moving average filter (Singh et al., 2021). This process effectively reduced short-term fluctuations or noise in the anomaly data. To isolate variations in precipitation anomalies from long-term trends, we further detrended the smoothed precipitation anomalies. This additional step reduced any remaining long-term trends in the precipitation patterns. Finally, the cumulative detrended precipitation anomaly (cdPA) data were obtained:

\begin{matrix} (4) & {cdPA}_{x, y, t} = {scPA}_{x, y, t} - trend ({scPA}_{x, y, t}), \end{matrix}

where ${scPA}_{x, y, t}$ is the smoothed precipitation anomalies at the x,y grid point and time t, and $trend ({scPA}_{x, y, t})$ is the trend of the smoothed precipitation anomalies at the x,y grid point and time t.

2.5 Relationship between dTWSA and cdPA

Variations in the key water fluxes (ET, R, and P) cause fluctuations in TWS, a crucial component of the water budget equation. This study leveraged the assumption of a constant relationship between precipitation and the combined evapotranspiration and runoff (ET + R) flux. This assumption allows us to infer potential variations in precipitation based on changes in TWSA.

This predictive model enables us to estimate the amount of precipitation necessary to balance a water storage deficit. This approach offers a valuable tool for understanding and managing water resources by directly linking precipitation dynamics to changes in TWSA. A linear relationship between dTWSA and cdPA was established to estimate the required precipitation amount as follows:

\begin{matrix} (5) & cdPA = β_{0} + β_{1} \cdot dTWSA + ε, \end{matrix}

where β₀ is the intercept, β₁ is the slope (regression coefficient), and ε represents the residual errors of the fit.

The units of cdPA and dTWSA are both in mm per month. Therefore, a β₁ value of 1 signifies that cdPA is well represented by dTWSA. In regions where β₁ equals 1, precipitation changes directly translate to storage variations. Conversely, regions with β₁ greater than 1 indicate that some of the local precipitation is immediately transported away and does not change the local storage. This suggests that the variability in storage data only partially captures all precipitation due to other hydrological processes (e.g., ET and R) in these regions. Regions with β₁ less than 1 suggest that a smaller amount of precipitation than needed is sufficient to address the storage deficit. In other words, there must be either additional inflow from other places that is phase locked with local rain events or severe positive biases in rainfall as seen by GPCC or GPCP, leading to an underestimation of the precipitation required based solely on storage variations (Singh et al., 2021).

Following the study of Singh et al. (2021), we estimated not only regression coefficients (i.e., β₀ and β₁), but also Pearson's correlation coefficient (r) between cdPA and dTWSA, as well as maximum drought length over each pixel, utilizing 19 years of monthly data (i.e., between 2002 and 2020). We classified the correlation coefficient values as follows: no or insignificant correlation (0.00–0.13), weak correlation (0.14–0.39), moderate correlation (0.40–0.69), and strong correlation (0.70–1.00). Here, a positive relationship is expected between cdPA and dTWSA such that positive (negative) precipitation anomalies should lead to increased (decreased) storage changes. Reverse signs in cdPA and dTWSA anomalies indicate weak or no linear relationship between the two variables. Accordingly, the study of Singh et al. (2021) eliminated the pixels that contain weak or no linear relationship between cdPA and dTWSA (i.e., r < 0, β₁ < 1, and maximum drought length < 5 months) from the global analyses. Sampling errors could cause fluctuations around 1, and random variability may cause some of the pixels to have β₁ values slightly less than 1, while these pixels may still have some considerable linear relationship. Accordingly, while the methodology of Singh et al. (2021) masked out regions with β₁ <1, in this study, regions with β₁ < 0 are masked out in addition to regions with r < 0 and maximum drought length < 5 months.

2.6 DRT

Both TWSA and P are used to quantify DRT. We closely follow the methodology given by Singh et al. (2021) and utilize two different methods to estimate DRT. The first method, based on storage deficit, utilizes only GRACE data to quantify DRT as the duration of the residuals of TWSA from its climatology. The second method is based on the required precipitation amount derived from both TWSA and precipitation datasets. In this approach, a drought is considered to end when the absolute required precipitation amount surpasses the observed precipitation amount. We thus utilize two different precipitation products (GPCC and GPCP), two different GRACE storage estimates (G3P and JPL mascon), and two different DRT estimation methods (storage deficit and required precipitation amount) to study the consequences of those processing choices for different climate zones as defined by the Köppen–Geiger climatology (in total, eight different DRTs are calculated).

2.6.1 DRT based on storage deficit

The deviation of TWSA from its climatology can offer valuable insights into drought characteristics. To calculate this deviation, we first create a reference point by averaging TWSA values for each month across the entire time series. For example, to establish the average January TWSA, we would calculate the mean of all January values in the data. This average monthly TWSA represented the climatology for that specific month. Subsequently, we calculated how much each TWSA data point deviated from this average climatology by subtracting the corresponding monthly climatology value. To identify drought events, we first calculated a climatology (long-term average) for TWSA data from both JPL mascon and G3P products using the time series from April 2002 to December 2020. This climatology serves as a reference point for typical TWSA conditions for each month. We then subtracted the corresponding monthly climatology value from each TWSA data point, resulting in residuals. Negative residuals indicate water storage deficits (Thomas et al., 2014). We classified periods with persistent negative residuals lasting longer than 3 consecutive months as drought events, signifying prolonged periods of below-average water storage (Singh et al., 2021). Negative residuals lasting less than 3 consecutive months were not classified as droughts. However, if a new negative residual period began within 1 month of a previous drought recovery, we considered them a continuation of the same drought event. This approach ensured a cohesive record of drought occurrences over time. By applying these criteria, we were able to establish a comprehensive inventory of drought characteristics for each grid point. This inventory served as the basis for our DRT estimation using the storage deficit method. This method analyzed the duration of negative residuals (i.e., smaller storage values than usual) of dTWSA at each location and time, thereby providing insights into the temporal patterns and the severity of drought events.

2.6.2 DRT based on required precipitation amount

The required precipitation amount is obtained from the linear relationship between dTWSA and cdPA. The storage deficit amount is represented by dTWSA, while cdPA, the output of this relationship, represents the required precipitation amount. To quantify the absolute required precipitation amount, the climatology of precipitation over each pixel was added back into the estimated required precipitation amount. The DRT estimation was then conducted by analyzing the duration during which the observed precipitation amount exceeded the absolute required precipitation amount for any given time and location (Singh et al., 2021). This approach allows for a comprehensive assessment of DRT dynamics across the different regions and periods.

2.7 Accuracy analysis

2.7.1 Consistency in DRT estimates

The consistency (level of agreement) between the two DRT estimates was quantified by assessing the differences in the timing obtained from both methods. In this context, consistency referred to the temporal difference between the estimated DRTs from each method, as categorized in Table 1. For example, if the time difference between the two methods fell within 1 or 2 months, the location was categorized as consistency category 1. By comparing the time differences between the DRT estimates from each method for each TWS–precipitation product, we were able to quantify the consistency between the two approaches. This analysis provides valuable insights into the reliability and robustness of the DRT estimates. In essence, it helped us assess how well the two methods converged on similar DRT values for the same locations.

Table 1Consistency categories in DRT estimates.

Download Print Version | Download XLSX

2.7.2 Calculated statistics

In our analyses, we use standard error (SE) as a measure of the uncertainty associated with the means of our datasets. A smaller SE value indicates a more precise estimate of the mean DRT over each pixel, which is often achieved with less variation in the data (Lee et al., 2015). The value of SE was separately calculated for each grid point and climate zone as follows:

\begin{matrix} (6) & {SE}_{x, y} = \frac{{SD}_{x, y}}{\sqrt{n_{x, y}}}, \end{matrix}

where SD_x,y is the standard deviation of DRT at the x,y grid point, and n_x,y is the length of the dataset at the x,y grid point. In addition to SE, we employed confidence intervals (CIs) to assess the uncertainty around the mean values of our datasets (Curran-Everett, 2008; Lee et al., 2015). CIs provide a range of values that are likely to contain the true population mean with a specified level of confidence (95% in our case) as follows:

\begin{matrix} (7) & {CI}_{x, y} = μ_{x, y} \mp 1.96 \cdot {SE}_{x, y}, \end{matrix}

where μ_x,y is the mean DRT at the x,y grid point, and SE is the standard error of DRT at the x,y grid point.

3 Results and discussions

3.1 Relationship between cdPA and dTWSA

Figure 2a illustrates the spatial distribution of correlation coefficients between dTWSA and cdPA for one selected data combination of dTWSA from G3P and cdPA from GPCP (from now on abbreviated as G3P–GPCP). G3P–GPCP was selected to show actual values since the coupled product had the highest correlation coefficient (0.30). All the correlation coefficient values less than 0.13 were not significant (p > 0.05; n= 216). Significant and moderate correlations were found over Australia (0.55), South America (0.46), and southern Africa (0.60), characterized by not only substantial water storage variations, but also dense in situ observing networks. Such relatively high correlations indicate substantial agreement in these areas. The negative correlations over polar regions (∼ 70 % of grids), where water storage decline is strong during and after the melting season without any direct relation to the incoming precipitation, are more than those over the other regions (∼ 10 % of grids). Similar disagreements were found in highly arid climates in northern Africa and central Asia, where water storage variations are minimal, and GRACE observations are most likely dominated by measurement noise.

In addition, Figure 2b–d show the spatial distribution of differences in these correlation coefficients for the other possible combinations relative to the results obtained for G3P–GPCP. When investigating the JPL mascon (Fig. 2c), we observed differences primarily in arid climates with generally smaller TWS variations and a consequently poor signal-to-noise ratio in the GRACE data, where processing choices (like spatially variable a priori constraints, as applied in the mascon) had a greater effect. GPCC (Fig. 2d, JPL mascon–GPCC) affected correlations to a larger extent than GPCP (Fig. 2c, JPL mascon–GPCP), in particular over places with less dense in situ networks. Given the standard deviation values of correlation differences (Fig. 2c and d) resulting from the use of GPCC instead of GPCP, the variability was higher in Fig. 2d (global average: 0.21) than in Fig. 2c (global average: 0.14). Higher correlations for GPCP confirm the added value of satellite observations in otherwise data-sparse regions (like the Congo Catchment in central Africa). However, we also find a number of places where GPCC fits GRACE better than GPCP does, which suggests that systematic deficits in satellite observations might also degrade the combined product in certain areas. Despite the differences identified above, we conclude that there is a strong relation between precipitation and storage monitored by satellite gravimetry, implying that GRACE observations should be used more frequently in the future for large-scale hydrometeorological research.

https://hess.copernicus.org/articles/29/3359/2025/hess-29-3359-2025-f02

Figure 2Representation of correlation coefficients between different dTWSA (i.e., G3P, JPL mascon) and cdPA (i.e., GPCP, GPCC) datasets. (a) Correlation coefficients obtained for dTWSA from G3P and cdPA from GPCP (G3P–GPCP). Differences in correlation coefficients relative to G3P–GPCP for (b) G3P–GPCC, (c) JPL mascon–GPCP, and (d) JPL mascon–GPCC. Note that regions with negative correlations were removed from subsequent analyses.

Figure 3a illustrates the spatial distribution of β₁ (Eq. 4), exceeding 0, for G3P–GPCP. We note that in certain regions of northern Africa, North America, and northeastern Asia, the β₁ value was less than 0. The values of TWS in these regions were likely influenced by factors other than precipitation, thereby making the link between precipitation and TWSAs less reliable. Most regions with polar climates (i.e., Köppen–Geiger Climate Zone E) exhibited β₁ below 0, indicating a weak link between precipitation anomalies and changes in TWS in these areas. Similar to the correlation analysis, a contrasting pattern emerged between the arid regions. North America's arid regions (i.e., Zone B) showed a pattern that was more comparable to Australia's arid regions. Africa's arid regions had a higher percentage of masked-out areas compared to those in Australia and North America. Additionally, the remaining areas in northern Africa's arid regions had lower β₁ values.

If we replace G3P with JPL mascon (Fig. 3c), we notice that β₁ values remain almost the same for the global average (mean difference of −0.01 in Fig. 3c). A quite similar pattern also emerges when using GPCC instead of GPCP (Fig. 3b), and the overall largest decreases in β₁ values are found for the G3P–GPCC combination (−0.06). In particular, using GPCP in Asia's snow zone and Australia's arid zone revealed more regions with β₁ closer to 1 than using GPCC. This suggests less necessity for additional variables to explain the relationship between precipitation anomalies and TWS changes in these regions when using GPCP instead of GPCC. In Europe's warm temperate zone, the use of the JPL mascon revealed more regions with β₁ larger than 3 compared to using G3P.

https://hess.copernicus.org/articles/29/3359/2025/hess-29-3359-2025-f03

Figure 3Representation of β₁ values between different dTWSA (i.e., G3P, JPL mascon) and cdPA (i.e., GPCP, GPCC) datasets. (a) β₁ values obtained for dTWSA from G3P and cdPA from GPCP (G3P–GPCP). Differences in β₁ values relative to G3P–GPCP for (b) G3P–GPCC, (c) JPL mascon–GPCP, and (d) JPL mascon–GPCC. Regions where the β₁ value is smaller than 0 are shown in white. Note that regions with β₁ values less than 1 were deemed as having a weak precipitation–storage relationship and were excluded from subsequent analyses.

Figure 4a displays a time series of dTWSA derived from both G3P and JPL mascon TWSA datasets in an example location in Australia (133.75° E, 16.75° S). Figure 4b illustrates the time series of cdPA derived from both GPCC and GPCP for the same point. These visualizations allow us to observe and analyze the fluctuations in water storage deviations and cdPA over time, providing insights into the dynamics of water availability and precipitation and potential drought recovery patterns. Close agreement is observed between the time series of G3P and JPL mascon and those of GPCC and GPCP, as well as between dTWSA and cdPA (average r= 0.65), as shown in Fig. 4.

https://hess.copernicus.org/articles/29/3359/2025/hess-29-3359-2025-f04

Figure 4Time series of (a) dTWSA obtained from both G3P and JPL mascon products and (b) cdPA obtained from both the GPCC and the GPCP precipitation products, each in an example location in Australia (133.75° E, 16.75° S).

Download

3.2 DRT estimates

The spatial distributions of mean DRT estimates based on storage deficit and required precipitation amount from the G3P–GPCP coupled product are shown in Figs. 5a and 6a, respectively. Figures 5b–d and 6b–d depict the spatial distribution of the differences between the mean DRT estimates derived from G3P–GPCP and G3P–GPCC and JPL mascon–GPCP and JPL mascon–GPCC, respectively, for both methods. The precipitation data are not utilized in calculating DRT estimates based on the storage deficit method (Fig. 5). However, they are used in the masking procedure for regions with weak or no linear relationship between cdPA and dTWSA. Although the unmasked regions have identical DRT values, the differences between Fig. 5a and b, as well as between Fig. 5c and d, arise from whether a region is masked or unmasked. Consequently, these figures are not identical. Even though the required precipitation method incorporated precipitation data into DRT estimates, the overall spatial patterns of mean DRT remained similar between G3P–GPCP and G3P–GPCC (Fig. 6b) as well as between JPL mascon–GPCP and JPL mascon–GPCC. Figures 5 and 6 indicate that both the mean and the spatial distributions of DRT estimates were consistent with each other using both methods.

https://hess.copernicus.org/articles/29/3359/2025/hess-29-3359-2025-f05

Figure 5Representation of mean drought recovery time (DRT) estimates based on the storage deficit method, obtained from different dTWSA (i.e., G3P, JPL mascon) and cdPA (i.e., GPCP, GPCC) datasets. (a) DRT estimated using dTWSA from G3P and cdPA from GPCP (G3P–GPCP). Differences in DRT relative to G3P–GPCP for (b) G3P–GPCC, (c) JPL mascon–GPCP, and (d) JPL mascon–GPCC.

https://hess.copernicus.org/articles/29/3359/2025/hess-29-3359-2025-f06

Figure 6Representation of mean drought recovery time (DRT) estimates based on the required precipitation amount method, obtained from different dTWSA (i.e., G3P, JPL mascon) and cdPA (i.e., GPCP, GPCC) datasets. (a) DRT estimated using dTWSA from G3P and cdPA from GPCP (G3P–GPCP). Differences in DRT relative to G3P–GPCP for (b) G3P–GPCC, (c) JPL mascon–GPCP, and (d) JPL mascon–GPCC.

Figure 5a and b, which utilize G3P as the TWS product with GPCC and GPCP for precipitation, respectively, reveal the highest mean DRT (50–60 months) estimates based on the storage deficit method in Iran and central Asia. Likewise, for the required precipitation amount method, Fig. 6a and b, which also utilize G3P as the TWS product with GPCC and GPCP for precipitation, respectively, show the highest mean DRT in the same regions. Both methods consistently identified Iran, central Asia, southeastern Australia, and northern Africa as the regions experiencing the highest mean DRT (50–60 months), as illustrated in Fig. 5c and d and Fig. 6c and d, which utilize JPL mascon as the TWS product with GPCC and GPCP for precipitation, respectively. The spatial correlation between Figs. 5a and 6a is 0.75. This shows a high level of spatial correlation between the two DRT estimates based on the two methods.

The other regions that exhibited high DRT estimates based on both methods and across all the product combinations are central and southern South America (∼ 40 months), central and southern Africa (∼ 45 months), eastern Australia (∼ 35 months), central and western North America (∼ 40 months), central Europe (∼ 35 months), and eastern Asia (∼ 30 months). Increasing global aridity and drought areas since the mid-20th century, mainly as a result of extensive dryness in eastern Australia and northern mid-latitude regions, as reported by Dai (2011), are consistent with the findings of high DRT estimates of this study. Eastern Australia (∼ 35 months) experienced more severe drought conditions than western Australia (∼ 20 months) based on both methods. Consistent with the results of this study, previous research has focused on monitoring droughts in regions with a history of severe, multi-month drought events, such as Iran, central Europe, central and western North America, southeast Australia, and central South America, and which have experienced higher drought severity compared to other regions (Dai, 2011; Madadgar and Moradkhani, 2014; Rubel et al., 2017; Wu et al., 2023). Over the Colorado River basin (Madadgar and Moradkhani, 2014), droughts of varying severity were observed from 2001 to 2004, lasting a total of 48 months for the period from 2000 to 2011. Our findings for the same region indicate a mean DRT of approximately 30 months. The results of this study (Figs. 5a and 6a) are consistent with those of Boergens et al. (2020), which found that central Europe is a drought-prone region, experiencing extreme drought during the consecutive summers of 2018 and 2019, with recovery taking over a year. Moreover, the mean DRT estimates obtained from both methods when using JPL mascon were greater than those when using G3P. As shown in Fig. 6, the close agreement between GPCC and GPCP regarding the spatial distribution of mean DRT estimates for both TWS products (G3P and JPL mascon) indicates that the choice of precipitation product (GPCC or GPCP) may not influence the overall spatial patterns of DRT estimates. Figures A1 and A2 given in the Appendix illustrate the spatial distributions of the SE of DRT estimates, which were similar to the spatial distributions of the mean DRT estimates. Regions with the highest mean DRT also exhibited the highest SE, indicating that those experiencing longer DRT periods showed greater variability in the DRT estimates.

The mean DRT estimates based on storage deficit and required precipitation amount for the Köppen–Geiger main climate zones using all the TWS–precipitation coupled products are shown in Fig. 7a and b, respectively. Error bars representing the 95 % confidence intervals for each zone indicate variability in the mean DRT estimates, while the “n” values show the number of grids per coupled product within each zone. For both methods, the polar (E) zone exhibited the highest mean DRT (18.1 months for storage deficit, 14.2 months for required precipitation amount). Except for the polar (E) zone, consistent with previous findings (Van Lanen et al., 2013), for both methods, the arid (B) zone exhibited the highest mean DRT (14.8 months for storage deficit, 12.9 months for required precipitation amount), while the equatorial (A) zone displayed the lowest (10.9 months for storage deficit, 9.7 months for required precipitation amount). Mean DRT estimates based on storage deficit and required precipitation amount were 13.9 months and 11.4 months in the warm temperate (C) zone, respectively, whereas they were 14.1 months and 10.0 months in the snow (D) zone, respectively. In particular, all the climate zones except the polar (E) zone displayed minimal variability (< 0.2 months) in the mean DRT (indicated by narrow 95 % confidence intervals), suggesting low uncertainty. Overall, the difference between G3P and JPL mascon is the highest in the arid (B, 3.8 months) and polar (E, 5.7 months) zones, whereas the differences in the other zones are smaller than those in the arid (B) and polar (E) zones. Figure A3a and A3b show SE for the DRT estimates based on storage deficit and required precipitation amount, respectively, across the Köppen–Geiger climate zones for all the TWS–precipitation coupled products. The polar (E) zone had the highest SE for both methods, while the lowest SE varied depending on the product. SEs for GPCC and GPCP were similar except in the polar (E) zone. The JPL mascon estimates had slightly larger SEs than G3P for both methods.

https://hess.copernicus.org/articles/29/3359/2025/hess-29-3359-2025-f07

Figure 7Average DRT estimates based on (a) storage deficit and (b) required precipitation amount for various climate zones from two different dTWSA (i.e., G3P and JPL mascon) and two different cdPA (i.e., GPCP and GPCC) products calculated for equatorial (A), arid (B), warm temperate (C), snow (D), and polar (E) climate zones, as given by the Köppen–Geiger classification.

Download

On average, DRT based on storage deficit was estimated as 13.9 months, whereas DRT based on required precipitation amount was estimated as 11.3 months. When considering TWS products, regardless of the precipitation products, DRT estimates using JPL mascon (14.2 months) are consistently higher than those using G3P (11.6 months) across all the climate zones and the global average. When considering the precipitation products, DRT estimates using GPCC and GPCP yielded similar values (12.9 months) regardless of the TWS products across all the climate zones and the global average. These findings suggest that GPCC and GPCP closely agree on DRT estimates, and the storage deficit derived from G3P was consistently lower than that from JPL mascon across all the zones.

3.3 Consistency in DRT estimates

Figure 8a shows the spatial distributions of the consistency categories (Table 1) for the DRT estimates from G3P–GPCP coupled products. Figure 8b–d illustrate the spatial distributions of the differences in consistency categories for the DRT estimates between G3P–GPCP and G3P–GPCC, JPL mascon–GPCP, and JPL mascon–GPCC, respectively. Most regions fell into consistency category 1 (high agreement), and the mean absolute difference between DRT estimates calculated from both methods is 1.9 months. The spatial patterns were similar across all the possible data combinations (Fig. 8a–d), including those using the different TWS (G3P vs. JPL mascon) and precipitation (GPCC vs. GPCP) products. This consistency was also observed in the mean DRT for all the pairs. As expected, the regions in category 4 (time difference > 9 months) had the highest mean DRT and SE in both methods (Table 2).

https://hess.copernicus.org/articles/29/3359/2025/hess-29-3359-2025-f08

Figure 8Representation of the consistency in DRT estimates (the class of the time difference in DRT values between two methods; see Table 1), obtained from the different dTWSA (i.e., G3P, JPL mascon) and cdPA (i.e., GPCP, GPCC) datasets. (a) Consistency using dTWSA from G3P and cdPA from GPCP (G3P–GPCP) and differences in consistency class relative to G3P–GPCP for (b) G3P–GPCC, (c) JPL mascon–GPCP, and (d) JPL mascon–GPCC.

Table 2Mean DRT and SE for the consistency categories.

Download Print Version | Download XLSX

Figure 9 shows the consistency levels in terms of the percentage of category 1 (time differences of 1–2 months) for the Köppen–Geiger climate zones using all the coupled products. The polar (E) zone had the lowest average consistency (74.9 %), while the equatorial (A) zone had the highest (97.8 %). Overall, 87.5 % of DRT estimates achieved category 1 consistency. The consistency rate of G3P (88.5 %) is higher than that of JPL mascon (83.5 %) across the climate zones and globally. The G3P–GPCP combination achieved the highest consistency (on average, 88.7 %), whereas JPL mascon–GPCP showed the lowest (on average, 82.8 %).

https://hess.copernicus.org/articles/29/3359/2025/hess-29-3359-2025-f09

Figure 9Percentage of DRT estimates whose consistency is category 1 for different climate zones using all the TWS–precipitation coupled products for climates characterized as equatorial (A), arid (B), warm temperate (C), snow (D), and polar (E) by the Köppen–Geiger classification.

Download

The choice of the precipitation product (GPCC vs. GPCP) exerted minimal impact on consistency (average absolute difference 1.4 %) when the same TWS product was used. Conversely, G3P led to higher DRT consistency (average absolute difference 5.0 %) than JPL mascon when the same precipitation product was used. The climate zones also influenced consistency. GPCC and GPCP showed similar consistency in the arid (B, 0.2 months) zone and in the snow (D, 0.1 months) zone and the most different consistency in the polar (E) zone (5.1 % difference). In contrast, G3P and JPL mascon had the most similar consistency in the equatorial (A) zone (1.3 % difference) and the highest difference in the arid (B) zone and the polar (E) zone (7.0 % difference).

3.4 Discussion

Both precipitation products provided similar global mean DRT estimates (∼ 12 months), with a high consistency rate of 87.5%, suggesting that GPCC and GPCP are both reliable for global hydrological applications. The largest discrepancy in mean DRT estimation (0.1 months) was observed in the polar (E) zone, with no significant difference in the snow (D) zone. The consistency of the two precipitation products differed by less than 1 % across all the climate zones. These results for the precipitation products highlight the robustness of these products across the diverse climate zones.

For the TWS products, the global mean DRT estimation using JPL mascon (13.8 months) was 2.6 months higher than that using G3P (11.4 months). G3P exhibited 5.0 % higher global consistency (90.0 %) than JPL mascon (85.0%), suggesting that it is better suited for analyzing hydrological drought characteristics, particularly in regions with extreme climate conditions, such as the polar (E) and arid (B) zones, where G3P outperformed JPL mascon by 7.0 % in consistency. The significant disparity in mean DRT estimation in the polar zone (13.2 months for G3P versus 18.9 months for JPL mascon) highlights the challenges of accurately representing water storage dynamics in high-latitude regions, possibly due to differences in how the two products handle ice and snow storage variability. Conversely, the smallest differences in DRT estimates and consistency in the equatorial (A) zone suggest that both TWS products perform effectively in regions with stable precipitation patterns. These findings reveal that while both precipitation products perform similarly, G3P outperforms JPL mascon in consistency and alignment with TWS and precipitation-based DRT. This suggests that G3P may provide a more accurate representation of terrestrial water storage dynamics in diverse climate zones.

While this study demonstrates the utility of DRT estimates derived from precipitation and GRACE/GRACE-FO TWSA data for evaluating global datasets, it is essential to discuss certain methodological and data-related limitations to appropriately contextualize the findings. First, we assumed a linear relationship between cdPA and dTWSA. However, this dynamic relationship may be disrupted by anthropogenic activities (e.g., groundwater extraction, dam construction, deforestation, and urbanization) and natural processes (e.g., evapotranspiration and runoff), which can modify the hydrological response independently of precipitation dynamics. These factors may delay the transfer of precipitation into storage components or reduce the volume that ultimately contributes to storage. Second, uncertainties inherent to GRACE/GRACE-FO data processing and precipitation products, stemming from sensor characteristics or model parameterizations, may introduce noise or systematic biases into DRT estimates and their consistency. Third, the simplified water balance framework assumes stable partitioning of precipitation into evapotranspiration and runoff. However, temporal variability in these processes, driven by factors such as temperature, vegetation dynamics, or soil moisture conditions, may weaken the cdPA and dTWSA relationship, particularly in energy-limited regions (e.g., high-evapotranspiration zones) or areas highly sensitive to runoff (e.g., snowmelt-dominated basins). Fourth, a spatial-scale mismatch remains between datasets: GRACE/GRACE-FO's coarse spatial resolution smooths fine-scale TWS variability, while spatially aggregated precipitation data may obscure localized hydrometeorological events (e.g., intense convective rainfall), thereby affecting the precipitation–TWS relationship in regions characterized by complex topography or localized weather systems. Finally, the occurrence of high regression coefficients (β₁ > 2) highlights the sensitivity of storage changes to precipitation inputs and suggests the presence of unmodeled nonlinearities or time lags in the hydrological response. The high β₁ values, indicating rapid precipitation removal via runoff or evapotranspiration, may reflect limitations in the linear model's ability to capture delayed storage responses or nonlinearities in specific hydrological regimes. Despite these limitations, this study provides a valuable framework for assessing global precipitation and TWS products via hydrological drought characteristics.

4 Summary and conclusions

TWS changes from one time epoch to another, as observed by satellite gravimetry, are closely related to the precipitation amount occurring during that time interval. The novel observing concept realized by the GRACE and GRACE-FO missions thus provides a unique opportunity to evaluate the frequently used global precipitation products on monthly or longer timescales. GRACE/GRACE-FO directly provides water storage anomalies, offering a novel approach to characterizing drought by assessing the storage deficit. The time required for drought recovery can be directly derived from the temporal evolution of these deficits (Singh et al., 2021), enabling the measurement of both drought duration and severity.

Our assessments reveal that both GPCC and GPCP products exhibited not only similar DRT estimates but also comparable consistency globally and across all the Köppen–Geiger climate zones. However, as noted, GPCP's reliance on satellite data enhances its utility in data-sparse regions, making it a more versatile choice in such contexts. However, this advantage does not extend to regions with dense in situ networks, where the inclusion of satellite data does not improve its performance.

For TWS products, the mean DRT estimates from JPL mascon were, on average, 2.6 months higher than those from G3P, globally and across all the Köppen–Geiger climate zones. However, G3P showed slightly higher consistency in the DRT estimates (5.0 % difference, globally) than JPL mascon. Furthermore, G3P demonstrated greater consistency than JPL mascon across all the Köppen–Geiger climate zones. These findings highlight G3P's reliability for applications requiring precise water storage anomaly data, such as drought monitoring.

The results of our study underline the potential value of GRACE/GRACE-FO for hydrometeorological research due to the strong relationship between precipitation and TWS changes. Its global coverage (although with its rather low spatial resolution) allows for the testing of different precipitation products – not only those derived from combinations of satellite and in situ observations (as done in this study) but also those from numerical weather prediction models and global atmospheric reanalyses. Both NASA and the European Space Agency (ESA) are currently working on future satellite gravity missions with even more precise sensors and different orbital configurations to further enhance the quality of satellite gravimetry products for hydrological applications.

Appendix A

Abbreviations
cdPA	Cumulative detrended precipitation anomaly
cPA	Cumulative precipitation anomaly
DRT	Drought recovery time
dTWSA	Deviation of storage
G3P	Global Gravity-based Groundwater Project
GPCC	Global Precipitation Climatology Centre
GPCC FDM	Global Precipitation Climatology Centre Full Data Monthly Product
GPCP	Global Precipitation Climatology Project
JPL mascons	Jet Propulsion Laboratory mass concentration solutions
scPA	Smoothed cumulative precipitation anomaly
TWS	Terrestrial water storage
TWSA	Terrestrial water storage anomaly

https://hess.copernicus.org/articles/29/3359/2025/hess-29-3359-2025-f10

Figure A1Display of the standard error in DRT estimates based on the storage deficit obtained from different dTWSA (i.e., G3P, JPL mascon) and cdPA (i.e., GPCP, GPCC) datasets. (a) Standard error for dTWSA from G3P and cdPA from GPCP (G3P–GPCP). Differences in standard error relative to G3P–GPCP for (b) G3P–GPCC, (c) JPL mascon–GPCP, and (d) JPL mascon–GPCC.

https://hess.copernicus.org/articles/29/3359/2025/hess-29-3359-2025-f11

Figure A2Display of the standard error in DRT estimates based on the required precipitation obtained from different dTWSA (i.e., G3P, JPL mascon) and cdPA (i.e., GPCP, GPCC) datasets. (a) Standard error for dTWSA from G3P and cdPA from GPCP (G3P–GPCP). Differences in standard error relative to G3P–GPCP for (b) G3P–GPCC, (c) JPL mascon–GPCP, and (d) JPL mascon–GPCC.

https://hess.copernicus.org/articles/29/3359/2025/hess-29-3359-2025-f12

Figure A3Standard error for average DRT estimates based on (a) storage deficit and (b) required precipitation amount for various climate zones from two different dTWSA (i.e., G3P and JPL mascon) and two different cdPA (i.e., GPCP and GPCC) products calculated for equatorial (A), arid (B), warm temperate (C), snow (D), and polar (E) climate zones, as given by the Köppen–Geiger classification.

Download

Data availability

The datasets used for this study are publicly available under the following links: JPL mascon GRACE and GRACE-FO TWS, https://doi.org/10.5067/TEMSC-3JC63 (Wiese et al., 2023); G3P GRACE and GRACE-FO TWS, https://doi.org/10.5880/G3P.2024.001 (Güntner et al., 2024); GPCC Full Data Monthly precipitation, https://doi.org/10.5676/DWD_GPCC/FD_M_V2022_050 (Schneider et al., 2022); GPCP v3.2 Satellite-Gauge (SG) Combined Data, https://doi.org/10.5067/MEASURES/GPCP/DATA305 (Huffman et al., 2022); and Köppen–Geiger climate classification scheme, https://koeppen-geiger.vu-wien.ac.at/present.htm (Climate Change and Infectious Diseases Group, 2017).

Author contributions

ÇÇ contributed to conceptualization, data curation, formal analysis, investigation, methodology, resources, software, validation, visualization, preparation of the original draft, and editing the paper according to the reviews from the co-authors. MTY contributed to funding acquisition, project administration, supervision, resources, and reviewing and editing the paper. HD contributed to conceptualization, methodology, supervision, and reviewing and editing the paper. ESI contributed to funding acquisition, project administration, supervision, and reviewing and editing the paper. FE, CF, and ALY contributed to funding acquisition and reviewing and editing the paper.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher’s note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.

Acknowledgements

We sincerely thank the reviewers and the editor for their valuable comments and suggestions, which greatly contributed to improving the quality of the paper. We thank the GFZ Helmholtz Centre for Geosciences and the Jet Propulsion Laboratory (JPL) for providing the GRACE and GRACE-FO datasets; the Deutscher Wetterdienst (DWD) for providing the GPCC Full Data Monthly Product; the Goddard Earth Sciences Data and Information Services Center (GES DISC) for providing the GPCP version 3.2 Satellite-Gauge Combined Precipitation dataset; and the University of Veterinary Medicine, Vienna, for providing an updated world map of the Köppen–Geiger climate classification.

Financial support

This research has been funded by the Scientific and Technological Research Council of Türkiye (grant no. 120R065) and the German Federal Ministry of Research, Technology and Space (grant no. 01DL22002) within the Turkish–German joint project SPACETIME (Investigation of the relationship between the spatiotemporal dynamics of terrestrial water and carbon storage with reference to climate change, land use, land cover, and land management through data-driven modeling over Turkey).

Review statement

This paper was edited by Alexander Gruber and reviewed by two anonymous referees.

References

Adler, R. F., Huffman, G. J., Chang, A., Ferraro, R., Xie, P.-P., Janowiak, J., Rudolf, B., Schneider, U., Curtis, S., Bolvin, D., Gruber, A., Susskind, J., Arkin, P., and Nelkin, E.: The Version-2 Global Precipitation Climatology Project (GPCP) Monthly Precipitation Analysis (1979–Present), J. Hydrometeorol., 4, 1147–1167, https://doi.org/10.1175/1525-7541(2003)004<1147:TVGPCP>2.0.CO;2, 2003.

AghaKouchak, A., Farahmand, A., Melton, F. S., Teixeira, J., Anderson, M. C., Wardlow, B. D., and Hain, C. R.: Remote sensing of drought: Progress, challenges and opportunities, Rev. Geophys., 53, 452–480, https://doi.org/10.1002/2014RG000456, 2015.

Akbari Asanjan, A., Yang, T., Hsu, K., Sorooshian, S., Lin, J., and Peng, Q.: Short-Term Precipitation Forecast Based on the PERSIANN System and LSTM Recurrent Neural Networks, J. Geophys. Res-Atmos., 123, 12543–12563, https://doi.org/10.1029/2018JD028375, 2018.

Ali, A., Amani, A., Diedhiou, A., and Lebel, T.: Rainfall Estimation in the Sahel. Part II: Evaluation of Rain Gauge Networks in the CILSS Countries and Objective Intercomparison of Rainfall Products, J. Appl. Meteorol., 44, 1707–1722, https://doi.org/10.1175/JAM2305.1, 2005.

Andrew, R. L., Guan, H., and Batelaan, O.: Large-scale vegetation responses to terrestrial moisture storage changes, Hydrol. Earth Syst. Sci., 21, 4469–4478, https://doi.org/10.5194/hess-21-4469-2017, 2017.

Bai, X., Wu, X., and Wang, P.: Blending long-term satellite-based precipitation data with gauge observations for drought monitoring: Considering effects of different gauge densities, J. Hydrol., 577, 124007, https://doi.org/10.1016/j.jhydrol.2019.124007, 2019.

Barker, L. J., Hannaford, J., Chiverton, A., and Svensson, C.: From meteorological to hydrological drought using standardised indicators, Hydrol. Earth Syst. Sci., 20, 2483–2505, https://doi.org/10.5194/hess-20-2483-2016, 2016.

Bayar, A. S., Yılmaz, M. T., Yücel, İ., and Dirmeyer, P.: CMIP6 Earth System Models Project Greater Acceleration of Climate Zone Change Due To Stronger Warming Rates, Earths Future, 11, e2022EF002972, https://doi.org/10.1029/2022EF002972, 2023.

Beck, H. E., Vergopolan, N., Pan, M., Levizzani, V., van Dijk, A. I. J. M., Weedon, G. P., Brocca, L., Pappenberger, F., Huffman, G. J., and Wood, E. F.: Global-scale evaluation of 22 precipitation datasets using gauge observations and hydrological modeling, Hydrol. Earth Syst. Sci., 21, 6201–6217, https://doi.org/10.5194/hess-21-6201-2017, 2017.

Behrangi, A., Nguyen, H., and Granger, S.: Probabilistic Seasonal Prediction of Meteorological Drought Using the Bootstrap and Multivariate Information, J. Appl. Meteorol. Clim., 54, 1510–1522, https://doi.org/10.1175/JAMC-D-14-0162.1, 2015.

Belabid, N., Zhao, F., Brocca, L., Huang, Y., and Tan, Y.: Near-Real-Time Flood Forecasting Based on Satellite Precipitation Products, Remote Sens.-Basel, 11, 252, https://doi.org/10.3390/rs11030252, 2019.

Boergens, E., Güntner, A., Dobslaw, H., and Dahle, C.: Quantifying the Central European Droughts in 2018 and 2019 With GRACE Follow-On, Geophys. Res. Lett., 47, e2020GL087285, https://doi.org/10.1029/2020GL087285, 2020.

Curran-Everett, D.: Explorations in statistics: standard deviations and standard errors, Adv. Physiol. Educ., 32, 203–208, https://doi.org/10.1152/advan.90123.2008, 2008.

Dai, A.: Drought under global warming: a review, WIRES Clim. Change., 2, 45–65, https://doi.org/10.1002/wcc.81, 2011.

Darand, M. and Khandu, K.: Statistical evaluation of gridded precipitation datasets using rain gauge observations over Iran, J. Arid. Environ., 178, 104172, https://doi.org/10.1016/j.jaridenv.2020.104172, 2020.

Ding, Y., Xu, J., Wang, X., Peng, X., and Cai, H.: Spatial and temporal effects of drought on Chinese vegetation under different coverage levels, Sci. Total Environ., 716, 137166, https://doi.org/10.1016/j.scitotenv.2020.137166, 2020.

Döll, P., Hasan, H. M. M., Schulze, K., Gerdener, H., Börger, L., Shadkam, S., Ackermann, S., Hosseini-Moghari, S.-M., Müller Schmied, H., Güntner, A., and Kusche, J.: Leveraging multi-variable observations to reduce and quantify the output uncertainty of a global hydrological model: evaluation of three ensemble-based approaches for the Mississippi River basin, Hydrol. Earth Syst. Sci., 28, 2259–2295, https://doi.org/10.5194/hess-28-2259-2024, 2024.

Eicker, A., Schumacher, M., Kusche, J., Döll, P., and Schmied, H. M.: Calibration/Data Assimilation Approach for Integrating GRACE Data into the WaterGAP Global Hydrology Model (WGHM) Using an Ensemble Kalman Filter: First Results, Surv. Geophys., 35, 1285–1309, https://doi.org/10.1007/s10712-014-9309-8, 2014.

Gebrechorkos, S. H., Leyland, J., Dadson, S. J., Cohen, S., Slater, L., Wortmann, M., Ashworth, P. J., Bennett, G. L., Boothroyd, R., Cloke, H., Delorme, P., Griffith, H., Hardy, R., Hawker, L., McLelland, S., Neal, J., Nicholas, A., Tatem, A. J., Vahidi, E., Liu, Y., Sheffield, J., Parsons, D. R., and Darby, S. E.: Global-scale evaluation of precipitation datasets for hydrological modelling, Hydrol. Earth Syst. Sci., 28, 3099–3118, https://doi.org/10.5194/hess-28-3099-2024, 2024.

Golian, S., Javadian, M., and Behrangi, A.: On the use of satellite, gauge, and reanalysis precipitation products for drought studies, Environ. Res. Lett., 14, 075005, https://doi.org/10.1088/1748-9326/ab2203, 2019.

Güntner, A., Sharifi, E., Hass, J., Boergens, E., Dahle, C., Dobslaw, H., Behzadpour, S., Boergens, E., Dahle, C., Dorigo, W., Dussailant, I., Flechtner, F., Jäggi, A., Kosmale, M., Luojus, K., Mayer-Gürr, T., Meyer, U., Preimesberger, W., Ruz Vargas, C., and Zemp, M.: Global Gravity-based Groundwater Product (G3P) (1.12), GFZ Data Services [data set], https://doi.org/10.5880/G3P.2024.001, 2024.

Harris, A., Rahman, S., Hossain, F., Yarborough, L., Bagtzoglou, A. C., and Easson, G.: Satellite-based Flood Modeling Using TRMM-based Rainfall Products, Sensors, 7, 3416–3427, https://doi.org/10.3390/s7123416, 2007.

Huffman, G. J., Behrangi, A., Bolvin, D. T., and Nelkin, E. J.: GPCP Version 3.2 Daily Precipitation Data Set, Greenbelt, Maryland, USA, Goddard Earth Sciences Data and Information Services Center (GES DISC) [data set], https://doi.org/10.5067/MEASURES/GPCP/DATA305, 2022.

Huffman, G. J., Adler, R. F., Behrangi, A., Bolvin, D. T., Nelkin, E. J., Gu, G., and Ehsani, M. R.: The New Version 3.2 Global Precipitation Climatology Project (GPCP) Monthly and Daily Precipitation Products, J. Climate, 36, 7635–7655, https://doi.org/10.1175/JCLI-D-23-0123.1, 2023.

Humphrey, V., Rodell, M., and Eicker, A.: Using Satellite-Based Terrestrial Water Storage Data: A Review, Surv. Geophys., 44, 1489–1517, https://doi.org/10.1007/s10712-022-09754-9, 2023.

Climate Change and Infectious Diseases Group: Köppen–Geiger climate classification, the University of Veterinary Medicine, Vienna, [data set], https://koeppen-geiger.vu-wien.ac.at/present.htm (last access: 8 April 2024), 2017.

Keyantash, J. and Dracup, J. A.: The Quantification of Drought: An Evaluation of Drought Indices, B. A. Meteorol. Soc., 83, 1167–1180, https://doi.org/10.1175/1520-0477-83.8.1167, 2002.

Kottek, M., Grieser, J., Beck, C., Rudolf, B., and Rubel, F.: World Map of the Köppen-Geiger climate classification updated, Meteorol. Z., 15, 259–263, https://doi.org/10.1127/0941-2948/2006/0130, 2006.

Lai, C., Li, J., Wang, Z., Wu, X., Zeng, Z., Chen, X., Lian, Y., Yu, H., Wang, P., and Bai, X.: Drought-Induced Reduction in Net Primary Productivity across Mainland China from 1982 to 2015, Remote Sens-Basel, 10, 1433, https://doi.org/10.3390/rs10091433, 2018.

Lai, C., Zhong, R., Wang, Z., Wu, X., Chen, X., Wang, P., and Lian, Y.: Monitoring hydrological drought using long-term satellite-based precipitation data, Sci. Total Environ., 649, 1198–1208, https://doi.org/10.1016/j.scitotenv.2018.08.245, 2019.

Lamptey, B. L.: Comparison of Gridded Multisatellite Rainfall Estimates with Gridded Gauge Rainfall over West Africa, J. Appl. Meteorol. Clim., 47, 185–205, https://doi.org/10.1175/2007JAMC1586.1, 2008.

Lee, D. K., In, J., and Lee, S.: Standard deviation and standard error of the mean, Korean J. Anesthesiol., 68, 220, https://doi.org/10.4097/kjae.2015.68.3.220, 2015.

Long, D., Yang, Y., Wada, Y., Hong, Y., Liang, W., Chen, Y., Yong, B., Hou, A., Wei, J., and Chen, L.: Deriving scaling factors using a global hydrological model to restore GRACE total water storage changes for China's Yangtze River Basin, Remote Sens. Environ., 168, 177–193, https://doi.org/10.1016/j.rse.2015.07.003, 2015.

Madadgar, S. and Moradkhani, H.: Spatio-temporal drought forecasting within Bayesian networks, J. Hydrol., 512, 134–146, https://doi.org/10.1016/j.jhydrol.2014.02.039, 2014.

Maggioni, V. and Massari, C.: On the performance of satellite precipitation products in riverine flood modeling: A review, J. Hydrol., 558, 214–224, https://doi.org/10.1016/j.jhydrol.2018.01.039, 2018.

McKee, T. B., Doesken, N. J., and Kleist, J.: The relationship of drought frequency and duration to time scales, Proceedings of the 8th Conference of Applied Climatology, 17–22 January, Anaheim, CA, American Meterological Society, Boston, MA, 179–184, https://climate.colostate.edu/pdfs/relationshipofdroughtfrequency.pdf (last access: 19 July 2025), 1993.

Mishra, A. K. and Singh, V. P.: A review of drought concepts, J. Hydrol., 391, 202–216, https://doi.org/10.1016/j.jhydrol.2010.07.012, 2010.

Negrón Juárez, R. I., Li, W., Fu, R., Fernandes, K., and De Oliveira Cardoso, A.: Comparison of Precipitation Datasets over the Tropical South American and African Continents, J. Hydrometeorol., 10, 289–299, https://doi.org/10.1175/2008JHM1023.1, 2009.

Patz, J. A., Frumkin, H., Holloway, T., Vimont, D. J., and Haines, A.: Climate Change: Challenges and Opportunities for Global Health, JAMA, 312, 1565, https://doi.org/10.1001/jama.2014.13186, 2014.

Pfeffer, J., Cazenave, A., Blazquez, A., Decharme, B., Munier, S., and Barnoud, A.: Assessment of pluri-annual and decadal changes in terrestrial water storage predicted by global hydrological models in comparison with the GRACE satellite gravity mission, Hydrol. Earth Syst. Sci., 27, 3743–3768, https://doi.org/10.5194/hess-27-3743-2023, 2023.

Piao, S., Ciais, P., Huang, Y., Shen, Z., Peng, S., Li, J., Zhou, L., Liu, H., Ma, Y., Ding, Y., Friedlingstein, P., Liu, C., Tan, K., Yu, Y., Zhang, T., and Fang, J.: The impacts of climate change on water resources and agriculture in China, Nature, 467, 43–51, https://doi.org/10.1038/nature09364, 2010.

Prakash, S., Gairola, R. M., and Mitra, A. K.: Comparison of large-scale global land precipitation from multisatellite and reanalysis products with gauge-based GPCC data sets, Theor. Appl. Climatol., 121, 303–317, https://doi.org/10.1007/s00704-014-1245-5, 2015.

Rubel, F., Brugger, K., Haslinger, K., and Auer, I.: The climate of the European Alps: Shift of very high resolution Köppen-Geiger climate zones 1800–2100, Meteorol. Z., 26, 115–125, https://doi.org/10.1127/metz/2016/0816, 2017.

Schneider, U., Becker, A., Finger, P., Meyer-Christoffer, A., Ziese, M., and Rudolf, B.: GPCC's new land surface precipitation climatology based on quality-controlled in situ data and its role in quantifying the global water cycle, Theor. Appl. Climatol., 115, 15–40, https://doi.org/10.1007/s00704-013-0860-x, 2014.

Schneider, U., Hänsel, S., Finger, P., Rustemeier, E., and Ziese, M.: GPCC Full Data Monthly Product Version 2022 at 0.5°: Monthly Land-Surface Precipitation from Rain-Gauges built on GTS-based and Historical Data, Global Precipitation Climatology Centre (GPCC) at Deutscher Wetterdienst [data set], https://doi.org/10.5676/DWD_GPCC/FD_M_V2022_050, 2022.

Senocak, A. U. G., Yilmaz, M. T., Kalkan, S., Yucel, I., and Amjad, M.: An explainable two-stage machine learning approach for precipitation forecast, J. Hydrol., 627, 130375, https://doi.org/10.1016/j.jhydrol.2023.130375, 2023.

Shukla, S. and Wood, A. W.: Use of a standardized runoff index for characterizing hydrologic drought, Geophys. Res. Lett., 35, 2007GL032487, https://doi.org/10.1029/2007GL032487, 2008.

Singh, A., Reager, J. T., and Behrangi, A.: Estimation of hydrological drought recovery based on precipitation and Gravity Recovery and Climate Experiment (GRACE) water storage deficit, Hydrol. Earth Syst. Sci., 25, 511–526, https://doi.org/10.5194/hess-25-511-2021, 2021.

Springer, A., Eicker, A., Bettge, A., Kusche, J., and Hense, A.: Evaluation of the Water Cycle in the European COSMO-REA6 Reanalysis Using GRACE, Water-Suisse, 9, 289, https://doi.org/10.3390/w9040289, 2017.

Sun, Q., Miao, C., Duan, Q., Ashouri, H., Sorooshian, S., and Hsu, K.: A Review of Global Precipitation Data Sets: Data Sources, Estimation, and Intercomparisons, Rev. Geophys., 56, 79–107, https://doi.org/10.1002/2017RG000574, 2018.

Tangdamrongsub, N., Jasinski, M. F., and Shellito, P. J.: Development and evaluation of 0.05° terrestrial water storage estimates using Community Atmosphere Biosphere Land Exchange (CABLE) land surface model and assimilation of GRACE data, Hydrol. Earth Syst. Sci., 25, 4185–4208, https://doi.org/10.5194/hess-25-4185-2021, 2021.

Thomas, A. C., Reager, J. T., Famiglietti, J. S., and Rodell, M.: A GRACE-based water storage deficit approach for hydrological drought characterization, Geophys. Res. Lett., 41, 1537–1545, https://doi.org/10.1002/2014GL059323, 2014.

Van Lanen, H. A. J., Wanders, N., Tallaksen, L. M., and Van Loon, A. F.: Hydrological drought across the world: impact of climate and physical catchment structure, Hydrol. Earth Syst. Sci., 17, 1715–1732, https://doi.org/10.5194/hess-17-1715-2013, 2013.

Vicente-Serrano, S. M., Beguería, S., López-Moreno, J. I., Angulo, M., and El Kenawy, A.: A New Global 0.5° Gridded Dataset (1901–2006) of a Multiscalar Drought Index: Comparison with Current Drought Index Datasets Based on the Palmer Drought Severity Index, J. Hydrometeorol., 11, 1033–1043, https://doi.org/10.1175/2010JHM1224.1, 2010.

Vicente-Serrano, S. M., López-Moreno, J. I., Beguería, S., Lorenzo-Lacruz, J., Azorin-Molina, C., and Morán-Tejeda, E.: Accurate Computation of a Streamflow Drought Index, J. Hydrol. Eng., 17, 318–332, https://doi.org/10.1061/(ASCE)HE.1943-5584.0000433, 2012.

Wahr, J., Swenson, S., Zlotnicki, V., and Velicogna, I.: Time-variable gravity from GRACE: First results, Geophys. Res. Lett., 31, 2004GL019779, https://doi.org/10.1029/2004GL019779, 2004.

Wang, Z., Zhong, R., and Lai, C.: Evaluation and hydrologic validation of TMPA satellite precipitation product downstream of the Pearl River Basin, China, Hydrol. Process., 31, 4169–4182, https://doi.org/10.1002/hyp.11350, 2017.

Watkins, M. M., Wiese, D. N., Yuan, D., Boening, C., and Landerer, F. W.: Improved methods for observing Earth's time variable mass distribution with GRACE using spherical cap mascons, J. Geophys. Res.-Sol. Ea., 120, 2648–2671, https://doi.org/10.1002/2014JB011547, 2015.

Wehbe, Y., Ghebreyesus, D., Temimi, M., Milewski, A., and Al Mandous, A.: Assessment of the consistency among global precipitation products over the United Arab Emirates, J. Hydrol.-Reg. Stud., 12, 122–135, https://doi.org/10.1016/j.ejrh.2017.05.002, 2017.

Wei, L., Jiang, S., Ren, L., Yuan, F., and Zhang, L.: Performance of Two Long-Term Satellite-Based and GPCC 8.0 Precipitation Products for Drought Monitoring over the Yellow River Basin in China, Sustainability-Basel, 11, 4969, https://doi.org/10.3390/su11184969, 2019.

Wei, L., Jiang, S., Ren, L., Wang, M., Zhang, L., Liu, Y., Yuan, F., and Yang, X.: Evaluation of seventeen satellite-, reanalysis-, and gauge-based precipitation products for drought monitoring across mainland China, Atmos. Res., 263, 105813, https://doi.org/10.1016/j.atmosres.2021.105813, 2021.

Wiese, D. N., Yuan, D.-N., Boening, C., Landerer, F. W., and Watkins, M. M.: JPL GRACE and GRACE-FO Mascon Ocean, Ice, and Hydrology Equivalent Water Height CRI Filtered Version RL06.1Mv03TS20, PO.DAAC, CA, USA [data set], https://doi.org/10.5067/TEMSC-3JC63, 2023.

Wu, X., Feng, X., Wang, Z., Chen, Y., and Deng, Z.: Multi-source precipitation products assessment on drought monitoring across global major river basins, Atmos. Res., 295, 106982, https://doi.org/10.1016/j.atmosres.2023.106982, 2023.

Xu, K., Yang, D., Yang, H., Li, Z., Qin, Y., and Shen, Y.: Spatio-temporal variation of drought in China during 1961–2012: A climatic perspective, J. Hydrol., 526, 253–264, https://doi.org/10.1016/j.jhydrol.2014.09.047, 2015.

Articles

Short summary

The study assesses the Global Precipitation Climatology Centre (GPCC) and Global Precipitation Climatology Project (GPCP) precipitation products by estimating hydrological drought recovery time (DRT) using satellite gravimetry data, Jet Propulsion Laboratory mass concentration solution (JPL mascon), and Global Gravity-based Groundwater Project (G3P) terrestrial water storage (TWS) products. The findings reveal that DRTs from GPCC and GPCP are comparable, and JPL mascon shows longer DRT, while G3P demonstrates greater consistency. These results contribute to a deeper understanding of precipitation and water storage dynamics and are essential for meteorological and hydrological research.