the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Joint calibration of multi-scale hydrological data sets using probabilistic water balance data fusion: methodology and application to the irrigated Hindon River Basin, India
Roya Mourad
Gerrit Schoups
Vinnarasi Rajendran
Wim Bastiaanssen
Hydrological data sets have vast potential for water resource management applications; however, they are subject to uncertainties. In this paper, we develop and apply a monthly probabilistic water balance data fusion approach for automatic bias correction and noise filtering of multi-scale hydrological data. The approach first calibrates the independent data sets by linking them through the water balance, resulting in hydrologically consistent estimates of precipitation (P), evaporation (E), storage (S), irrigation canal water imports (C), and river discharge (Q) that jointly close the basin-scale water balance. Next, the basin-scale results are downscaled to the pixel-scale, to generate calibrated ensembles of gridded Precipitation (P) and Evaporation (E) that reflect the basin-wide water balance closure constraints. An application to the irrigated Hindon River basin in India illustrates that the approach generates physically reasonable estimates of all basin-scale variables, with average standard errors decreasing in the following order: 21 mm month−1 for storage, 10 mm month−1 for evaporation, 7 mm month−1 for precipitation, 4 mm month−1 for irrigation canal water imports, and 2 mm month−1 for river discharge. Results show that updating the original independent data with water balance constraint information reduces uncertainties by inducing cross-correlations between all independent variables linked through the water balance. In addition, the introduced approach yields (i) hydrologically consistent gridded P and E estimates that fuse information from prior (original) data across different land use elements and (ii) statistically consistent random errors that reflect the model's confidence about P and E estimates at each grid cell. The analysis also shows a long-term decreasing trend in groundwater, which is better captured by the more severe decline from GRACE JPL mascon than GRACE Spherical Harmonic data. This finding points towards the possible sustainability issues for irrigation in the basin and requires further validation using piezometer groundwater-level measurements. Future opportunities exist to further constrain the generated water balance variables and their associated errors within process-based models and with additional data.
- Article
(18515 KB) - Full-text XML
- BibTeX
- EndNote
Under a changing climate and anthropogenic activities (Wada et al., 2010), it becomes urgent to accurately estimate the water balance components. Although distant from the Earth's surface, remote sensing (RS) satellites can uniquely provide information that translates into such estimates. For example, microwave and infrared techniques are used to estimate precipitation (Sun et al., 2018), evaporation can be computed using optical and thermal imagery (Zhang et al., 2016a), while changes in total water storage are obtained from the Gravity Recovery and Climate Experiment (GRACE) satellites (Wahr et al., 2004; Rodell et al., 2009). These advancements have emerged as powerful decision-support tools, pushing hydrology in new directions by providing valuable information on the spatial distribution of water availability, use, and consumption (Hessels et al., 2022; Sheffield et al., 2018; Karimi et al., 2013). Such information is often challenging to obtain through traditional in situ data collection. With the rapid increase in RS data availability, it is now theoretically possible to close the water balance. However, two challenges prevent this from being achieved, including the inconsistencies among various RS products for the same variable and the imbalances resulting from integrating the unique combinations of different water balance variables. These incoherences arise from the inherent data errors. Discrepancies in RS data, for example, come from different sources, such as errors in the retrieval algorithms (Maggioni et al., 2022), input data and parameters (Crosetto et al., 2001), or scale mismatch (Foken, 2008). While in situ data are known for their greatest reliability, they also come with their own errors. Uncertainties in river discharge, for example, could be related to flow conditions (Mcmillan et al., 2012), velocity sensors (e.g., calibration) (Horner et al., 2018), or estimation methods such as the rating curve (Kuczera, 1996). Benefiting from the complementary strengths of in situ and RS data and embracing their errors requires a three-pronged methodology. This methodology should maximize the prior information content of the data available from multiple RS sources for the same variable, quantify both systematic and random errors in each water balance term, and reduce these errors by exploiting all available information to generate calibrated estimates of spatially distributed water balance data. The following paragraphs review existing approaches in the literature used to handle these aspects.
Studies addressing the uncertainties in RS water balance data often rely on in situ data (treated as “ground-truth”) to evaluate and converge on the different data products. However, in situ data (e.g., evaporation) is sparsely measured, and even when these data exist, accessibility may be challenging. Under such conditions, alternative indirect approaches such as triple collocation (Tian and Peters-Lidard, 2010; Long et al., 2014; Massari et al., 2017; Yin and Park, 2021), uncertainty propagation (Hong et al., 2006; Cawse-Nicholson et al., 2020), and sensitivity analyses (Sobol, 2001) can be utilized. Other methods involve using the deviation from the ensemble mean, and the variability between the data sets as a proxy for uncertainty (Tian and Peters-Lidard, 2010; Sahoo et al., 2011; Munier et al., 2014; Zhang et al., 2018). Each of the previously summarized error estimation techniques has its inherent merits and shortcomings. In an attempt to capitalize on these strengths, Mourad et al. (2024) recently integrated limited in situ data and multiple error metrics, along with expert judgment, to quantify and partially reduce water balance data errors. However, the resulting water balance estimates require further conditioning on in situ data using additional constraining steps, which is a subject of the methodology section of the current study.
Several other studies went beyond quantifying uncertainties in RS data and sought to fully reduce them using the water balance as a constraint (Aires, 2014; Hobeichi et al., 2020; Luo et al., 2023; Munier et al., 2014; Pan et al., 2012; Pan and Wood, 2006; Rodell et al., 2015; Sahoo et al., 2011; Zhang et al., 2018, 2016b). The core idea of the existing closure methods is to either select a preferred single data set of each water balance variable based on prior knowledge about their quality, or merge multiple data for each water balance variable using fixed pre-quantified error estimates. This step is followed by a water balance correction method such as variational data assimilation (L'ecuyer and Stephens, 2002), constrained Kalman filter (Pan and Wood, 2006), or proportional redistribution (Abolafia-Rosenzweig et al., 2021). In this process, the values of the (un)merged water balance estimates are adjusted proportionally to their relative uncertainties or magnitudes, depending on the correction method used. Contrary to the previously summarized water balance closure approaches, which fixed data errors a priori, Schoups and Nasseri (2021) proposed improving data error estimation by treating them along with the water balance variables as unknown random variables. Instead of enforcing the closure sequentially, the authors demonstrate that the water balance variables can be calibrated simultaneously in a single probabilistic data fusion methodology. The underlying premise is that all variables are interconnected through the water balance. For instance, to calibrate precipitation data, the proposed approach uses statistical inference techniques to automatically fuse and adjust two information sources available on this variable, one coming from multiple estimates of precipitation RS data (original prior data), and the other sourced from other water balance data (predicted from the water balance). These techniques include an iterative smoother that involves multiple forward-backward passes over the timeseries. In the forward pass, the estimated precipitation carries information from previous and current months, while in the backward pass it holds information from the future months. A similar process happens when estimating the other water balance variables, yielding refined posterior estimates that combine information from all other months.
Further to the above literature, previous attempts have been made to extend the basin-wide water balance closure constraints to a finer scale of up to 0.25° pixel resolution (Barkhordari et al., 2025; Heberger et al., 2023; Pellet et al., 2019; Munier et al., 2014). Building on these previous efforts, our emphasis here is on generating detailed calibrated precipitation (0.05° resolution) and evaporation (250 m resolution) estimates that align with the basin-scale closure constraints. To that end, we intend to contribute to the literature by extending the existing basin-scale water balance data fusion method first introduced by Schoups and Nasseri (2021). The extended methodology presented here retains the original model's key advantages, namely: the ability to simultaneously calibrate the basin-wide water balance variables by exploiting data from the entire timeseries. An integral part of this new version is using the basin-wide posteriors to generate calibrated ensembles of spatially distributed precipitation and evaporation. This is achieved in a stepwise procedure, with the first step involving the formulation of ensemble-driven grid-scale error models, contrary to the models in Schoups and Nasseri (2021), which were formulated at the basin-scale and relied on an ensemble of two data sets. The second step is a constraint step carried out by the water balance data fusion at the basin scale, yielding basin-scale posteriors for all water balance variables, and a Kalman smoothing algorithm, yielding grid-scale posteriors for precipitation and evaporation. This approach ensures consistency across the two scales: when back-calculating the basin-scale estimates from the grid-scale posteriors via spatial averaging, we recover the inferred basin-scale posteriors from the water balance data fusion. Lastly, the new version of the methodology incorporates the following additional features: (i) accounting for surface water imports from irrigation canals, which might significantly affect the overall water budget in irrigated basins, and (ii) quantifying posterior cross-correlations between water balance variables, which is important for generating joint posterior samples. We apply the extended approach to an irrigated monsoon-influenced Hindon Basin, which suffers from unsustainable water use.
The remainder of this paper is structured as follows: Sect. 2 introduces the case study and describes the data sets used as input for the probabilistic water balance model presented in Sect. 3. Section 4 details how all unknowns in the water balance model are solved at the basin scale, yielding a closed water balance, and how these water balance constraints are transferred to the grid scale. The basin and grid-scale results are then presented in Sect. 5. The sensitivity of the water balance data fusion approach to the modeling decisions is then discussed in Sect. 6. Throughout the paper, we use the terms “priors” and “posteriors” to refer to the variables (i.e., water balance terms and error parameters) before and after calibration, respectively (i.e., before and after introducing the monthly water balance constraints).
The study focuses on the Hindon Basin in the northwestern Uttar Pradesh State of India (see Fig. 1). The Hindon basin covers a total area of about 4780 km2 and drains to the Galeta outlet, where a river discharge station is located. The rainfed Hindon River originates from the Shivalik Hills at the foothills of the Himalayas. It flows towards the south on the flat and alluvial surfaces of the basin. The river eventually drains into the Yamuna River (near Noida, downstream of Delhi), which, in turn, joins the Ganga River before it reaches the Bay of Bengal. The basin features a large-scale supply-based irrigation system with two major irrigation canals: the Eastern Yamuna Canal (EYC) and the Upper Ganga Canal (UGC). These canals are fed by two external reservoirs located outside the basin, namely the Bhimgoda Barrage constructed on the Ganga River and Hathni Kund Headwork on the Yamuna River. The share of the external canal water imports in the water balance is significant for this heavily irrigated basin, representing about 30 %–40 % of the annual precipitation (Fig. A1). In addition to supplying surface irrigation water to the study area, the canals contribute to groundwater recharge through seepage, forming several groundwater mounds near their vicinity. Hence, the canals approximately coincide with groundwater divides, limiting lateral groundwater flow across the eastern and western boundaries of the Hindon basin (Umar et al., 2008).
The canal system is operated in two irrigation seasons: the Kharif and Rabi seasons. Kharif season extends from March to September, in which the main irrigated crops are sugarcane and rice, and the Rabi season spans from October to March of the following year, where the principal crops are wheat and mustard. The irrigated croplands represent more than 85 % of the basin, while only about 10 % of the basin's area is covered by non-croplands (Fig. A2). Irrigation water is diverted from the canals to supply the basin through off-takes that serve fixed command areas with crops rotated between Rabi and Kharif crops (Fig. 1).
Figure 1Location of the Hindon basin in the Uttar Pradesh state of India (inset map); with a detailed view of the basin featuring its boundaries, topographic profile, location of the Galeta outlet, where a river discharge station that belongs to the Central Water Commission (CWC) network of India is located. The main map shows the irrigation scheme with reservoirs and canal system. Topographic basemap sources: Esri, USGS, FAO, NPS, GIS user community, and others | Powered by Esri.
Our goal is to estimate each term in the monthly water balance, written as:
Equation (1) combines the different water balance terms together, where St−1 and St are the total water storage in the basin at the start and end of month t (including surface storage, soil moisture, and groundwater), and are the basin-average precipitation and evaporation (including transpiration), Qt is the river discharge at the basin outlet for month t, and Ct is the total canal water import into the basin (sum of all intakes, see Fig. 1) for month t. The net lateral groundwater flows into or out of this basin are assumed to be negligible due to their small magnitude relative to the other water balance variables (Alam and Umar, 2013). All terms in Eq. (1) are expressed in units of mm of equivalent water depth. The following paragraphs describe the water balance data for the case study.
Each water balance component in Eq. (1) is independently derived, gathered, or measured either from satellite data, ground-based measurements, or both (Table 1). In the prior selection of the most reliable precipitation and evaporation ensembles (see Mourad et al., 2024), we followed other water balance closure studies that emphasize the use of earth observations over model outputs to minimize the effect of their related assumptions, except for the Multi-source Weighted-Ensemble Precipitation (MSWEP v2.8) product (Beck et al., 2019). MSWEP is an “optimal merging” of gauge observations, satellite observations, and reanalysis model output. Along with MSWEP, we use two other monthly gridded precipitation ensemble members: the Tropical Rainfall Measuring Mission (TRMM) (Huffman et al., 2007) and the NASA/JAXA Global Precipitation Measurement (GPM-IMERG) (Huffman et al., 2019). By combining precipitation radar, passive microwave, and infrared satellites with ground-based observations from the Global Precipitation Climatology Centre (GPCC), TRMM and GPM-IMERG provide a comprehensive precipitation estimate over a given area. Compared to single-band radar on TRMM, GPM dual-frequency precipitation radar provides a broader range of measurable precipitation rates and a better estimate of precipitation particle size (Hou et al., 2014). Spatially interpolated rain-gauge for the basin from Indian Meteorological Department (IMD) data set (Pai et al., 2014) is included in this analysis for comparison but not used in the model.
For gridded evaporation ensembles, we incorporate five members with diverse methodological approaches for estimating evaporation from remote sensing. One ET data set based on two-parallel Penman-Monteith (PM) models for both canopy and soil: the pyWaPOR-ET v2.6 (Bastiaanssen et al., 2012), two ET data sets that estimate ET from a single-source surface energy balance (SEB) model representing vegetation and soil in a combined energy balance, namely the Landsat Collection 2 Provisional Actual Evapotranspiration Science Product (Landsat-based SSEBop ET) (Senay, 2018; Senay et al., 2023) and the Surface Energy Balance Algorithm for Land (eeSEBAL product) (Bastiaanssen et al., 1993), in addition to a two-source SEB ALEXI-ET data set from the Atmosphere-Land Exchange Inverse model (Anderson et al., 2007, 2015). We also incorporate the CMRSET ET product based on the CSIRO MODIS Reflectance-based Scaling ET model (Guerschman et al., 2009) that uses vegetation indices and meteorological data for scaling the ET. All data sets are resampled to the same spatial resolution using bilinear interpolation. More details on the theoretical background and processing steps for the P and E data sets can be found in the companion paper (Mourad et al., 2024).
Table 1Monthly Water Balance Data. The data type column distinguishes between Remote Sensing (RS) and ground-based Measurements (GBM).
The total water storage variable data were obtained from the recent release of the monthly Jet Propulsion Laboratory (JPL-RL06) mascon solution. Each monthly total water storage estimate represents the surface mass anomaly relative to the baseline average over 2004–2009. This version relies on prior geophysical information to constrain the solution, eliminating the need for empirical de-striping filtering commonly used for post-processing traditional spherical harmonic gravity solutions. However, intrinsic to this product are biases, that is, leakage errors, and spatial smoothing that damps the “true” signal, especially in small-sized basins. A Coastline Resolution Improvement (CRI) filter has been applied in this version to reduce signal leakage errors across coastlines (Wiese et al., 2016). This data is also accompanied by scaling factors for optionally restoring the damped signal. However, these were not applied to the data used herein; instead, bias along with noise variance are modeled using all water balance data incorporated in this study (see Sect. 3.3). Another storage input data set is also included: GRACE Spherical Harmonic (CSR) solution (Sect. 6.1) (Swenson and Wahr, 2006; Landerer and Swenson, 2012).
The in situ data used in this analysis are obtained from governmental agencies, including the Central Water Commission (CWC) of India for streamflow data and the irrigation department of Uttar Pradesh for external canal water imports data. While river discharge data has no gaps, canal delivery data are constrained by their spatial and temporal coverage. The latter comes in two forms: irrigation schedules and actual flow measurements. We apply an extrapolation approach that combines both data sources to generate complete monthly estimates for all intakes (for more details on the adopted gap-filling technique, refer to Appendix A2).
As can be seen in Table 1, the native resolution of the individual data sets and between the water balance variables varies widely, posing a challenge in their merging process. In principle, the probabilistic water balance data model can be performed at any spatial resolution; however, for this study design, we try to preserve the native information as much as possible by choosing to resample all precipitation data sets to a common resolution of 0.05° and all evaporation data sets to a common resolution of 250 m. An alternative design would be to resample all water balance variables to the same resolution (e.g., 250 m), but this might introduce artefacts, for example, when up-sampling from coarse to high resolution.
Despite the variety of the above-explored data sources, two barriers hinder their potential use in hydrological applications. One occurs when integrating the unique data combinations describing each water balance variable into the basin-scale water balance. In any particular month, we end up with significant errors, i.e., either too much or too little water, resulting in a non-zero net balance (Fig. B1 in Appendix B). This non-closure or imbalance problem highlights the incoherences between the water balance data sets and the inconsistencies in the individual data sets, caused by various data errors. Another well-known challenge is the inconsistencies between the various data products for the same variable. For example, for the studied basin, grid-scale inter-product uncertainty is moderate for P, while E exhibits significant spatial variability across the whole basin (Mourad et al., 2024). These challenges motivate the distinction between the “calibrated” value of each water balance variable that satisfies the water balance closure at the basin scale and represents the error-corrected estimate at the grid level, and the “observed” values, which deviate from these values. In the probabilistic form of the water balance, each term is assigned a probabilistic data error model to quantify systematic and random differences between observed and modelled water balance variables. Parameters in these data error models are treated as unknown random variables with predefined prior distributions. In this regard, the overall probabilistic water balance takes the form of a Bayesian hierarchical model with two levels of uncertainty: one related to the error parameters and another linked to the water balance variables (Schoups and Nasseri, 2021). Figure 2 gives an overview of our approach, and the following subsections describe the parametric probabilistic relations used for each water balance variable (P, E, Q, C, and S).
Figure 2A factor graph showing random variables in circles and constraints between variables as squares for a single month. Three constraints are incorporated in the water balance data fusion method, including: a prior Gaussian distribution assigned to each water balance variable (squares attached to each variable), the basin-scale water balance (BWB) constraint that links all water balance variables together, and the spatial averaging constraint that links grid-scale precipitation (P) and evaporation (E) to basin-scale precipitation () and evaporation (). Computation of posteriors of individual variables proceeds in two steps. The first step is basin-scale water balance data fusion (BWB fusion, see Sect. 4.1). This step involves multiple forward (blue arrows along the edges) and backward (dotted blue arrows) passes over the data using the entire timeseries to compute the posteriors of all water balance variables that jointly close the water balance (Schoups and Nasseri, 2021). The second step computes grid-scale posteriors for precipitation and evaporation from their basin-scale posteriors using a Kalman smoothing algorithm (green arrow, see Sect. 4.2).
3.1 Precipitation and Evaporation error models
Contrary to Schoups and Nasseri (2021), we define prior error models for P and E at the grid scale instead of at the basin scale, followed by spatial averaging to derive corresponding priors at the basin scale. The latter are then used with water balance data fusion for estimating basin-scale posteriors (Sect. 4.1). The resulting basin-scale posteriors are finally transferred back to the grid scale (Sect. 4.2).
The first step in setting up a prior error model for the gridded P and E variables is to probabilistically characterize them with prior distributions. For these two independent spatial processes, we can write their grid-scale joint prior distribution for each month t as:
where Pt and Et are vectors containing unknown precipitation and evaporation values for all grid cells in the corresponding spatial field for each month t. The Gaussian distribution (Eq. 2) is specified with a mean vector and a block diagonal covariance matrix with zero cross-covariance to reflect that these two processes are assumed to be independent a priori. The mean vector consists of the prior means of P () and E (), for all spatial locations within the corresponding spatial domain of each month t, while the autocovariance of each variable P () and E () is the block-diagonal element of the covariance matrix. The decomposed form of the joint distribution for each individual variable is described in the following subsections.
3.1.1 Precipitation
For each month t, we typically have multiple gridded precipitation products, with unknown bias and random errors. We use the spread across different precipitation products to define a grid-scale precipitation error model for the prior means (mean vector in Eq. 2) and the prior standard deviations (vector of the square root of all diagonal entries of the autocovariance matrix in Eq. 2):
Equation (3) models the systematic bias in grid-scale precipitation for each month t by describing the grid-scale precipitation prior means as a weighted average of the minimum () and maximum () precipitation for each grid cell. The weight or bias parameter wP takes on an unknown value between 0 and 1, and is estimated from the data (see Sect. 4.1).
Random errors in grid-scale precipitation are modeled using Eq. (4). This model expresses the grid-scale prior standard deviations for each month t as a function of the difference between the maximum and minimum precipitation in each grid cell. A noise parameter (rP), taking a value between 0 and 1, is used to scale the standard deviations. This parameter is also estimated from the data.
To account for the effect of spatial correlation of the random error component (Eq. 4), we write the precipitation prior covariance matrix in terms of the grid-scale standard deviations and a grid-scale auto-correlation matrix:
where is a diagonal matrix containing the grid-scale values for all locations of the spatial field (Eq. 4), and RP is the correlation matrix that captures the spatial dependence structure. , where nP×nP is the matrix dimension, and nP equals 176, representing the total number of grid cell locations of the precipitation spatial domain. RP is jointly estimated from all precipitation data using an isotropic parametric correlation function with the following form (Handcock and Stein, 1993):
where Cℳ is the Matérn correlation function for variables separated by distance d. This correlation model is flexible and widely used, with two functions: gamma function Γ(.) and the modified Bessel function Kv(.) (Abramowitz and Stegun, 1968). Cℳ also consists of two unknown nonnegative parameters, namely the spatial correlation length scale ls and a spatial smoothness parameter ν. A value of ν approaching 0 indicates a rough spatial process, while the process is smoother when ν approaches infinity. Since the smoothness parameter is usually small in many applications (Chen et al., 2022), while it increases as the aggregation time increases (Sun et al., 2015), we choose a balanced value between a rough and smooth random field, i.e., ν, fixed at 1.5. On the other hand, the correlation length scale (ls) defines an average length scale on which grid cells are correlated with each other. In principle, ls ranges from 0 (the case of uncorrelated grid cells) and extends to a scale larger than the spatial domain length (the case of maximally correlated pixels). With no prior information on the ls parameter, we fix it at 50 km ( the basin's length from North to South). The sensitivity of the results to the fixed ls will be evaluated in Sect. 6.2.
Since water balance data fusion (Schoups and Nasseri, 2021) uses basin-scale error models, we derive these from the above-described grid-scale models by spatial averaging. Specifically, the basin-scale prior mean and variance in month t follow from Eqs. (3), (4), and (5):
where ϕP is the spatial averaging operator used to derive basin-scale mean and variance from grid-scale means and variances (i.e., nP×1 vector with each element equal to , where nP is the number of spatial locations in the precipitation spatial field). is the transpose of ϕP. We also used , where DP is a diagonal matrix containing the values (from Eq. 4) for all grid cells within the precipitation spatial field. All basin-averaged input quantities to Eqs. (7)–(8) are precomputed from the precipitation data sets, and the constant but unknown parameters wP and rP are estimated as part of the water balance data fusion (see Sect. 4.1).
Finally, the last two equations in the precipitation error model treat the basin-scale calibrated precipitation for each month t as a random draw from a truncated normal distribution. The truncation at zero ensures physical consistency (nonnegative precipitation).
3.1.2 Evaporation
As with precipitation, an evaporation error model with the following form is adopted:
The systematic bias in grid-scale evaporation is modeled with two spatial and time-invariant calibration parameters, namely: wE and fE. The parameter wE is a bias parameter or weight that interpolates between the monthly grid-scale evaporation extrema and . wE takes on a value between 0 and 1, and is estimated from the data (see Sect. 4.1). An additional scaling factor (fE) is incorporated to account for potential bias outside the observed grid-scale evaporation range, and is given a lognormal prior with mode at 1 (no bias) and a coefficient of variation CV of 50 %.
On the other hand, Eq. (12) quantifies the random errors in grid-scale evaporation, as the difference between the maximum and minimum evaporation in each grid cell. A noise parameter (rE), taking a value between 0 and 1, is used to scale the random errors. All parameters are solved as part of the water balance data fusion (Sect. 4).
The basin-scale error models are derived from the grid-based models defined above, using the same spatial averaging process applied to precipitation (Eqs. 7–8). The averaging formulas are then obtained as follows:
where DE is a diagonal matrix whose diagonal entries contain the values for all grid cells within the evaporation spatial domain. All inputs of Eqs. (13)–(14) are precomputed from the evaporation data sets. ϕE is the spatial averaging operator, and the RE term stands for the correlation matrix, which captures the spatial dependencies between the evaporation grid cells , where nE×nE is the matrix dimension, and nE equals 71059, representing the total number of grid cell locations of the evaporation spatial domain. For the large-sized evaporation data sets considered here, we parameterize the evaporation correlation matrix using a Matérn Gaussian Process kernel (Hensman et al., 2015) with fixed parameters ls at 50 km and ν at 1.5.
Similar to precipitation, the basin-scale calibrated evaporation for month t is treated as a random draw from a truncated normal distribution Eqs. (15)–(16). The truncation at zero ensures physical consistency (nonnegative evaporation).
3.2 River discharge and canal water import error models
We assume that data on river discharge (Qobs) and canal water imports (Cobs) are both unbiased with proportional random errors (Eq. 18). Both variables are modelled as truncated Gaussian variables (non-negative), with mean , standard deviation , and variance in month t given by:
where represents the observed value for each month for discharge and for canal water imports). quantifies the random errors in Qobs or Cobs, and is assumed proportional to the observed value, with a proportionality constant (or relative error) ax. We assume a 10 % relative error for monthly river discharge, i.e., aQ=0.1, and a 25 % relative error for canal water imports, i.e., aC=0.25.
3.3 Water Storage Error Model
The water storage error model relates the storage observations from the GRACE satellite () to the modeled storage (St). Due to the coarse resolution of GRACE data, the monthly basin-scale total water storage derived from these observations can be polluted by the water storage dynamics occurring outside the basin, that is, “leakage”. To account for the temporal and spatial mismatch between GRACE basin-scale water storage and the modeled storage caused by leakage errors, we employ the following noisy sine wave error model (Schoups and Nasseri, 2021):
The first equation models the systematic errors associated with GRACE observations. It captures cyclic patterns and seasonality in the data within the basin via time-invariant error parameters. These parameters describe the magnitude and the timing of the seasonal error: the amplitude A (mm) and phase δ (years), respectively.
The magnitude of random errors is modeled as an unknown time-invariant parameter (σS), which reflects errors that can arise from (a) any limitations caused by the sine wave models in capturing the calibrated water storage dynamics within the basin and (b) noise in GRACE solutions. In the above equations, ω is fixed at 2π radians per year, yielding a one-year sine wave and thus capturing the annual seasonal cycle. Other parameters are assigned vague prior distributions, where A follows a lognormal prior with mode at 30 mm and a CV of 200 %, σS a lognormal prior with mode equal to 10 mm and a CV of 200 %, and δ follows a flat logit-normal prior between 0 and 1 year with a location parameter μ=0 and scale parameter σ=1.4.
While the above prior error model relies on a single storage data set, we assess a posteriori (Sect. 6.1) the sensitivity of the P and E posterior distributions to the use of different storage inputs, including Mascon (JPL) and Spherical Harmonic (CSR) GRACE solutions, which for the basin studied here, exhibit different long-term trends.
The following subsections detail the two-step statistical inference framework used to solve the basin-scale probabilistic water balance model (Sect. 4.1) and then propagate the resulting basin-wide constraints to the grid level (Sect. 4.2).
4.1 Basin-scale
Following the description of the prior probabilistic relations (Sect. 3), the posteriors of water balance variables and error parameters are computed using a basin-scale inference technique. This technique consists of a double-loop method that combines two algorithms: a single-chain differential evolution variant of Markov Chain Monte Carlo (DE-MCMC) for computing the unknown parameters and Expectation Propagation (EP) for solving the unknown water balance variables. A brief overview of these algorithms is described in Appendix B, while a detailed explanation can be found in the original paper (Schoups and Nasseri, 2021). The computed posteriors provide hydrologically consistent water balance variables that jointly close the water balance. In the water balance data fusion process, the independent prior data (Sect. 3) are updated with the monthly water balance constraints. These constraints link all independent variables through the water balance, thereby inducing cross-correlation between them. For example, the term S in the water balance constraint (dropping the time index t for simplicity: ) linearly links all other terms. To maintain the water balance closure, the terms with opposite signs must covary together, and vice versa. In this paper, an additional post-processing step is applied to represent this interdependence structure between the water balance variables, i.e., compute their covariances. For a vector of posterior variables x (including the initial storage state S0 or St−1, , , Q, and C), a joint distribution can be written for each month t as: with a mean vector () holding the marginal posterior means of each variable (Eq. B2) and a 5×5 covariance matrix that encodes the relation between these variables. The diagonal element of this matrix contains each variable's marginal posterior variance, while the unknown covariances between each variable pair are the off-diagonal entries. To quantify these covariances, a message-passing algorithm that combines the water balance constraint and the probabilistic constraints in the form of marginal Gaussian distributions is applied (see Sect. B2 in Appendix B).
4.2 Grid-scale
Having computed the joint basin-scale and posteriors (Sect. 4.1), the goal is to translate these posteriors back to the grid-scale. When spatially averaged, the generated grid-scale posteriors (P and E) should recover the basin-scale posteriors. Such a relation between the two scales can be established for each month t as follows (Eq. 25):
where represents the spatial averaging operator mapping the grid-scale to the basin-scale P and E variables, ϕP and ϕE are defined earlier in Eqs. (7, 13).
For the above relation to hold (Eq. 25), we implement a Kalman smoothing algorithm. A key aspect of this algorithm is that it ties together all sources of information on P and E variables, detailed in previous sections, including: the basin and grid-scale joint priors (Sect. 3), along with basin-scale joint posteriors (Sect. 4.1), to generate their corresponding spatially distributed joint posteriors. Distinctly, the grid-scale prior moments (Eq. 2) are updated to reflect the difference between the basin-scale joint posterior and prior moments via the Kalman gain. In what follows, we start by describing the basin-scale joint prior and posterior distributions and then show how the grid-scale joint distributions are computed.
Before constraining the basin-scale spatially averaged and priors, their joint distribution for each month tcan be written with zero-cross covariances to reflect the independent structure between both variables:
where and are prior means, and are prior variances of and , respectively.
Once the water balance constraint is imposed within the basin-scale data fusion: (omitting t for simplicity), positive correlations between and posteriors emerge. This occurs because the water balance acts as a constraint on the priors (independent and linearly linking them in their posteriors (correlated and ). To maintain this linear constraint, that is, remains equal to , an adjustment that increases should be balanced by an adjustment that increases , and vice versa. This yields a basin-scale joint posterior distribution on the posterior variables and , with induced correlations, that is, non-zero cross covariances:
where and are the marginal posterior means, and are the marginal posterior variances of and , respectively, and is the posterior cross-covariance between and .
Kalman smoothing can then be used to calculate the joint posterior moments Eqs. (28)–(29) at the grid-scale from the above basin-scale prior moments Eqs. (26)–(27) and grid-scale prior moments (Eq. 2). These posterior moments are thus rendered as a weighted combination of grid and basin-scale moments:
where and are the grid-scale means, while and are autocovariances of Pt and Et posteriors, respectively. and stand for the posterior cross-covariance between both variables and its transpose, respectively.
A noteworthy feature of the Kalman smoothing gain (K) in the above equations is that it acts as a weighting matrix propagating basin-scale information to each individual P and E grid cell. It is computed using the following equation composed of three components:
The first component depicts the block diagonal covariance matrix of the grid-scale Pt and Et priors, the second is the transpose of the spatial averaging operator A, whereas the inverse of the diagonal matrix consisting of the basin-wide variances of and priors is the third component.
Here, we present the results of the two-step inference framework described in Sect. 4, starting with the basin-scale posteriors of all water balance variables (Sect. 5.1) and followed by the grid-scale posteriors of P and E (Sect. 5.2). These represent the baseline results that are based on specific choices of GRACE data (i.e., JPL mascon data) and spatial correlation lengths. The sensitivity of the results to these choices will be addressed in Sect. 6.
5.1 Basin-scale posteriors
Basin-scale posteriors for P (precipitation), E (evaporation), and C (canal water imports) are shown in Fig. 3 and discussed first. The data shows significant prior uncertainty with a wide range of precipitation and evaporation estimates. This is especially evident in the monsoon (June to September) for precipitation and across the whole seasonal cycle for evaporation (Fig. 3). Along with the prior observations, Fig. 3 also depicts the basin-scale posteriors for each water balance term, while Fig. 4 shows the corresponding error parameter posteriors. The results for Q are not included in Fig. 3 since the posterior for Q closely follows the river discharge data, and this data is tied to a non-disclosure agreement with the CWC of India.
The results in Fig. 3 show that the precipitation posterior mean lies between the and data bounds. This balanced position within the precipitation space is also translated into the inferred parameter wP, with a posterior distribution around values of ∼0.5 (Fig. 4). The noise parameter rP has a posterior distribution around 1, suggesting a minimal reduction in the prior uncertainty range for precipitation. An independent evaluation of these posterior estimates can be done by comparing them to the Indian Meteorological Department (IMD) data set (Pai et al., 2014), which is routinely used to evaluate precipitation products in India. Despite being treated as the “ground truth” in literature, the rain-gauge interpolated IMD product is not error-free, and therefore, a perfect match between the IMD data and the generated posterior mean is not anticipated. Figure C1 in Appendix C compares the seasonal and annual timeseries of the IMD to the posterior precipitation mean. Unlike the precipitation posterior mean, which tends to sit between observed data bounds, the IMD data set closely follows the baseline precipitation estimate (TRMM). The larger posterior estimates for P found here align with a recent study (Goteti and Famiglietti, 2024) that found systematic underestimation of precipitation by the IMD data product. Moving to evaporation, the computed posteriors in Fig. 3 only slightly differ from the priors, with posterior values of noise parameter rE centered at ∼0.95 (Fig. 3), implying that the modeled random error in this variable is as large as one-fourth of the absolute difference between and . This posterior uncertainty, however, reduces with a tighter data range corresponding to prior ensembles generated from only three data sets: eeSEBAL, pyWaPOR, and Landsat-SSEBop (specifically, from 2015 onwards). As for the estimated evaporation, it takes more or less a balanced weight between and , with an estimated wE posterior of ∼0.48 and a scaling parameter fE around 1 (no additional bias). While no direct evaporation measurements are available, the posterior estimates can be evaluated by comparing them with reference evapotranspiration and expected seasonal dynamics based on known cropping and irrigation practices. For example, the evaporation is expected to be lower than but equal to or higher than the for only a short period, either in June–July or August–October, when the (dominant) sugarcane crop coefficient (Kc) exceeds 1. In Fig. 3, the inferred posterior evaporation (green band) aligns with this expectation where it is consistently below , but is also around for the previously mentioned period (e.g., in 2006, 2012, 2019, and 2022). Besides sugarcane, rice is the second most important crop, commonly transplanted at the end of April or mid-May (Joseph and Ghosh, 2023). For this shallow-rooted crop, more frequent irrigation is required during its early stages (the first 30 d), coinciding with the period when the canal water supplies form a peak (at the onset of the monsoon) (Fig. C3 in Appendix C). The evaporation timeseries of rice is, thus, expected to form a peak, most likely in June, when its Kc value is higher than 1. Considering these two elements, the evaporation is expected to peak in June, as captured by the posterior evaporation mean (Fig. C2 in Appendix C).
Figure 3Monthly water balance posteriors (90 % credible intervals in green) for: (a) P (precipitation), (b) E (actual evapotranspiration), and (c) C (canal water imports), and their corresponding observations (dots). Reference evapotranspiration computed here using ERA5 meteorological input variables is shown as (ET0). The overbars of the labels in P and E plots indicate that these values are obtained through spatial averaging of the gridded data sets for each month t. Each year's label indicates the start of the year (January).
Figure 4Prior and posterior distributions of error parameters for: (a) evaporation, (b) storage, (c) precipitation, canal water import, and river discharge when using different GRACE storage data sets: CSR Spherical Harmonic (SH) and JPL Mascon (MAS) solutions.
The third row of Fig. 3 compares the prior and posterior estimates of the canal water imports (C). The prior estimates represented here are extrapolated from scarce data. Nevertheless, the extrapolation approach mostly relied on ground-based measurements whenever available. For periods with missing data, these were filled with the design discharge capacity multiplied by the operation time or the data average of years with similar conditions, depending on data availability per distributary. For this purpose, independent information from the Palmer Drought Severity Index (PDSI) index of the Terra Climate data set (Abatzoglou et al., 2018) is used as a proxy to classify the years between wet and dry, allowing for annual variations in the canal water supply. For example, in dry years, such as 2008, and 2016 to 2018, the surface water supplies are smaller in magnitude compared to all other years. Note also the increase in the water supplies from 2009 onwards, which aligns with the construction of the Deoband Parallel Canal that connects the Upper Ganga Canal (UGC) to the Eastern Yamuna Canal (EYC) (Fig. 1). Using other approaches to generate and extrapolate the prior canal water estimates does not significantly affect the posteriors of the other terms, as explored in Sect. 6.3. The posterior results, on the other hand, reflect the wide prior uncertainty (relative error of 25 %) we place around this extrapolated data. Generally, the posterior estimates follow both their priors and the seasonality of the precipitation term. Specifically, the supplies peak at the onset of the monsoon (June) due to increased water availability from precipitation, while smaller peaks are formed during the dry season (December and April) when water stored in the reservoirs is diverted for irrigation. As canal supplies are mainly driven by precipitation, it is reasonable to conclude that the canal water imports are smaller in magnitude compared to precipitation during the rainy months (July–September), representing about 5 % to 15 % of the monthly precipitation. It follows that the fixed prior uncertainty on this term is a justifiable approximation that does not strongly affect the other water balance estimates.
Finally, we note the interplay between all variables: water balance (P, E, C, and Q), weather, and crop variables. Despite relying on external water supplies, the basin has been experiencing a decreasing trend in the river discharge (Dwivedi and Yadav, 2025), coupled with increased evaporation from 2010 to 2022 (see the annual timeseries of the posterior mean in Fig. C2). The increase in posterior mean evaporation mirrors the increasing trend in the sugarcane yield and the leaf area index from 2011 (Fig. C4). This aligns with a period of increasing temperatures (see Fig. C4) and with the period when farmers started adopting new high-yielding sugarcane varieties in the basin, as reported by Indian Council of Agricultural Research (2022). The accelerating rate of evaporation in our estimates follows a similar trend in the Landsat-based SSEBop evaporation data set (Fig. C2). This shows, on the one hand, the usefulness of the water balance constraints in evaluating the data products, and on the other hand, that simplified ET models like SSEBop can sufficiently reproduce the evaporation dynamics.
The interplay is also pronounced in dry years such as 2009 and 2016, showing a decline in the posterior mean precipitation, together with the canal water imports. The limited availability of rainfall and surface water for irrigation during droughts, along with rising temperatures in recent years (Fig. C4), does not, however, depress the evaporation rates because crop water demand is met by increased groundwater pumping. The effect of unsustainable groundwater pumping is particularly notable in the dynamics and trend of basin water storage, which we discuss next.
Figure 5 compares prior (GRACE data) and posterior estimates of the basin's water storage S. The prior estimates in the top panel are obtained by predicting water storage from the water balance using the original, uncalibrated data sets for P, E, C, and Q. The different permutations in which these data can be combined results in 15 predicted water storage time series in Fig. 5a. We see that the GRACE data closely follow the seasonal dynamics of the prior predicted storages from water balance data. This conveys that both timeseries are approximately in phase but markedly different in magnitude and trend, due to bias and noise in each data set. As shown next, water balance data fusion combines all the data and corrects data errors to yield a closing water balance in each month.
Figure 5Monthly timeseries of GRACE-JPL Mascon observations and storage anomalies predicted from the water balance: (a) using 15 unique uncalibrated combinations of the input variables (S0, P, E, Q, C), and (b) using posterior (calibrated) estimates of the input variables (S0, P, E, Q, C) with (green band) and without (gray lines) accounting for posterior cross-correlation between the input variables.
The second panel of Fig. 5 depicts the calibrated storages, i.e., posteriors shown in green, after fusing all sources of information available on this variable. These posteriors combine three noisy and biased sources of information for storage S in a given month: the GRACE observation (if available for that month), the water balance constraint for the current month, and the water balance constraint for the next month (since S is the initial storage for the next month). Fusing all information together results in a narrower storage posterior (with less uncertainty) compared to the individual sources.
In this process, the introduced water balance constraints can themselves significantly contribute to reducing the posterior uncertainty of the inferred storage by inducing correlations between variables. This occurs because (noisily) observing S in this constraint linearly links all the a priori independent variables together. Such a relation is encoded in the coefficients of the variables in the water balance equation. For example, to maintain this physical relation, that is, S remains equal to , the water balance variables with opposite signs must covary together, and vice versa. Accounting for positive correlations between the variables of opposite signs and negative correlations between the variables with the same signs in the water balance can significantly reduce the variance of S. The effect of these posterior cross-correlations between water balance variables is shown in Fig. 5b. Note the smaller uncertainty in the final posterior estimates (green band), which account for posterior cross-correlations between the variables, compared to posterior samples (in grey) that do not account for these posterior cross-correlations (i.e., by independently sampling from the marginal posteriors of , , Q, and C).
5.2 Grid-scale posteriors
While grid-scale posteriors of P and E are obtained in each month, we present here detailed results for two months, which were chosen based on previous analysis (Mourad et al., 2024), namely (a) May 2009, representing a dry month before the monsoon with the highest air temperature and with significant differences between evaporation data products, and (b) July 2009, representing the peak rainy month with large differences between precipitation data products. To guide the interpretation of the spatially distributed posteriors, we appended the land use map (see Fig. A1).
The evaporation maps in Fig. 6 show that the posterior mean () follows the evaporation estimates available from the diverse range of the original gridded data products. For example, in May, the smallest spatial difference between the posterior evaporation mean and the observed evaporation over the irrigated croplands can be seen by SSEBop, with spatial locations for these land use elements having values higher or lower than the posterior mean by ± 25 mm month−1 (Fig. C5 in Appendix C). Over non-crop lands (urban areas and water bodies), the posterior mean has a better agreement with ALEXI and CMRSET, with differences ranging between −25 and 25 mm month−1. In July, SSEBop displayed the closest agreement with the posterior mean over almost all land use elements. The posterior mean also follows eeSEBAL observed evaporation estimates over irrigated croplands in the lower parts of the basin for this particular month. It is reassuring to see that the generated grid-scale posterior ensemble mean preserves the spatial patterns inherent in the data while combining evaporation information from all products across the different land use elements. This makes sense since the posterior mean is modeled as a weighted average of the minimum and maximum estimates at each grid cell, so that spatial patterns in the original data are maintained. In parallel, the Kalman gain in Eq. (30) distributes the basin-wide constraints to the individual E pixels by explicitly accounting for spatial autocorrelations encoded in the grid-scale covariance matrix (). The results in Fig. 6 also indicate that the posterior uncertainty () is smaller in the monsoon month (July) than in the dry month (May). This is expected as the data are in better agreement in the peak rainy month than in the dry month. Generally, posterior uncertainty is larger in areas with higher evaporation values (irrigated crop-lands), while non-cropland has both lower evaporation and smaller posterior uncertainty. As such, it can be safely concluded that the spatially distributed calibrated evaporation provides reasonable estimates with acceptable uncertainties that vary with the evaporation magnitude estimated at each pixel.
Figure 6Prior observations and estimated posteriors of grid-scale evaporation E before the monsoon (May 2009) and for a peak rainy month (July 2009); and denote the posterior mean and standard deviation of evaporation.
In Fig. 7, we see that the precipitation posterior mean falls between the gridded precipitation data products. The spatial difference maps between the grid-scale observations and the posterior mean show that TRMM observes lower precipitation (by −60 to −35 mm month−1), while MSWEP observes higher precipitation (by 25 to 48 mm month−1) than the posterior mean. On the other hand, GPM deviates by −21 to 60 mm month−1 from the posterior mean (Fig. C6 in Appendix C). As we have seen with the evaporation standard errors, the grid-scale posterior standard errors of precipitation are the highest at spatial locations where the difference across the products is the largest (upper part of the basin), and vice versa.
Figure 7Prior observations and estimated posteriors of grid-scale precipitation P for the peak rainy month (July 2009); and denote the posterior mean and standard deviation of precipitation.
Via Eq. (29), the posterior cross-correlation between water variables at the basin scale is propagated to the grid scale, and thus results in posterior cross-correlation between grid-scale P and E as well. The effect of this can be illustrated by computing derived variables that involve both P and E. An example is given in Fig. 8, which shows maps of the posterior mean and standard deviation of P−E for July 2009, the latter with and without posterior cross-correlation between P and E. Due to positive posterior cross-correlation between P and E (since they have opposite signs in the water balance), the posterior standard deviation of P−E decreases when accounting for this correlation, but the effect in Fig. 8 is relatively small, and is observed at some localized spots with the highest reduction in errors of up to 2.5 mm month−1. These spots correspond to the locations at which both P and E covary the most, with the cross-covariance subtracted during the computation of the P−E posterior variance. The reason for this relatively small effect is that the water balance constraints introduce other posterior cross-correlations as well, both at the basin scale (Table 2) and at the grid scale via nonlocal posterior cross-correlations between P and E induced by Eq. (29) (i.e., in the posterior, P in a grid cell becomes correlated with E in all other grid cells).
Figure 8Grid-scale posterior mean and standard deviation of P−E, and P and E covariance for July 2009. The posterior standard deviations are computed for the two cases: with (cc+) and without (cc−) accounting for posterior cross-correlations (cc) between P and E, while the difference between these two cases illustrates the impact of accounting for cross-correlations.
To evaluate the effect of ignoring the posterior correlation between P and E, we calculated the distribution of Q from the water balance given the posterior of all other variables for three cases: (i) ignoring all posterior correlation, (ii) ignoring only correlation between P and E, (iii) accounting for all correlation. We compare these distributions to a fourth scenario (iv) in which Q is obtained from uncalibrated water balance data. Writing the water balance (Eq. 1) in terms of Q results in 15 uncalibrated Q estimates generated from the unique combinations of the water balance data. The distribution for this case is characterized in terms of mean (average across the 15 uncalibrated values) and uncertainty (set as the spread across these values). The results in Fig. 9 indicate that neglecting the posterior cross-correlation between P and E and between all water balance variables, can substantially lead to higher uncertainties (wider distributions), compared to when cross-correlations between all variables are accounted for (narrow distribution). At the extreme end, the Q predicted from the uncalibrated water balance data demonstrated the highest level of uncertainty compared to all other cases, with physically inconsistent values (negative) and a mean value greatly deviating from the observed Q value. We repeat this experiment for all months and report the performance of the estimated Q from the different approaches compared to the observed Q estimates. We obtain a low RMSE value of the calibrated Q posterior mean (0.7 mm month−1) compared to Qwb from uncalibrated water balance data (74 mm month−1), which underscores the importance of (i) water balance data fusion for bias-correcting the original data products and (ii) accounting for posterior correlation, when water balance variables (P, E, C, S) are to be used for computing Q.
Figure 9Distribution of Q in comparison to the observed Qobs value (July 2009), for the following cases: (i) ignoring and accounting for posterior correlations between all water balance variables, (ii) ignoring only the posterior correlation between P and E, and (iii) Qwb obtained from uncalibrated water balance data. The labels and tick marks are not shown due to data sharing restrictions.
Table 2Basin-scale posterior covariance matrix of all water balance variables for July 2009, the diagonal elements represent the posterior variances (mm month−1) (shown in bold), while the off-diagonal entries represent the posterior covariances between the variables (mm2 month−2) (underlined values).
This section evaluates some of the modeling decisions and their impact on the results, specifically the choice of GRACE data set and the choice of spatial correlation lengths of P and E.
6.1 Water balance data fusion with different GRACE data
The Hindon basin studied here is experiencing groundwater depletion (Alam and Umar, 2013). Storage data from GRACE satellite has been used to estimate human-driven water storage changes, such as those caused by intensive irrigation. This data is either based on spherical harmonic coefficients or mascon basis function, which entails testing the sensitivity of the water balance closure to the chosen GRACE input. The baseline results presented in Sect. 5 are based on GRACE-JPL Mascon (MAS) solution. Here, we compare these results to those obtained using the CSR spherical harmonic (SH) solution.
Figure 10 compares the posterior distributions for storage and error parameters from both data sets. The corresponding error parameter posteriors are compared in Fig. 4. The two products show negative depleting trends with widely varying magnitudes of −43.3 mm year−1 from the SH and −88.7 mm year−1 from MAS. Looking at their corresponding posterior distributions, we can see that they both largely follow the GRACE data, but with an increase in the seasonal amplitude compared to the data. This is reflected in the inferred A parameter (row 2 in Fig. 4), with an estimated value of 90–100 mm for both cases. The storage posteriors, thus, restore the seasonal signal amplitude, which tends to be more severely attenuated in small-sized basins like the Hindon basin. Additionally, the other storage error parameters (phase δ and noise σS) distributions are well constrained compared to their vague prior distributions. A relatively small phase error (time lag) is obtained using the two data sets. However, the JPL-MAS data yields a smaller noise (53 mm) than the CSR-SH (60 mm) data. The fact that these error parameters are time-invariant and that the posteriors are computed using data from all months via iterative smoothing allows the probabilistic water balance model to fill in the data gaps present in GRACE data sets from 2011 onwards.
Figure 10Storage posteriors for two different GRACE data sets: GRACE-JPL Mascon (MAS) and CSR spherical harmonic (SH) solution.
Figure 4 also depicts the posterior distribution of the precipitation and evaporation error parameters for the two scenarios. The water balance data fusion run with the CSR spherical harmonic (SH) solution gives more weight to the lower precipitation and evaporation estimates. This is translated into the wP value of 0.15. Despite that, the posterior evaporation mean lay between the data limits (wE=0.5) for this case, the bias scaling factor (fE) masked out its effect (with estimated values smaller than 1), pushing the E posterior towards the lower evaporation end (i.e., the baseline evaporation limit). Figure C7 in Appendix C contains the detailed posterior plots for the water balance variables using the GRACE-SH data sets. Apparently, the GRACE-SH data set's storage dynamics don't match the other water balance variables in terms of water balance closure, at least with the applied P and E error models. Due to the smaller decreasing trend in the GRACE-SH storage data, the inferred evaporation estimates are, on average, about 15 % lower than the estimates obtained with the GRACE-MAS data. To consistently compare the data fit from both solutions, we normalize the likelihood by the number of observations in each data set. Consequently, the water balance model with GRACE-MAS data has a slightly larger likelihood (−5.76) than that with GRACE-SH (likelihood −6.0).
This analysis suggests that long-term groundwater depletion in the basin is possibly better captured by the (more severely declining) mascon data, which has important ramifications for the sustainability of irrigation practices in the basin. This conclusion, however, requires further verification with groundwater level trends from piezometer data.
6.2 Sensitivity to the assumed spatial correlation length-scale parameters
The posterior results in the preceding section are obtained by running the water balance data fusion with fixed Matérn parameters for both the evaporation and precipitation processes. Given the differences in spatial resolution of the analyzed data, the smoothness and correlation length-scale ls were specified with mid-range values of 1.5 and 50 km, respectively. To evaluate the sensitivity of the results to the chosen value for ls, we set the value of precipitation ls at 50 km and vary that of evaporation, and vice versa. Table 3 lists these prior conditions, along with the likelihood and average posterior standard deviation for each water balance variable. The results show stability of the posteriors and the likelihood values under all prior settings, except for the cases where precipitation and evaporation ls are low (e.g., 20 km). In these cases, the probabilistic water balance exhibits the poorest fit to the data (i.e., small likelihood values), and the basin-scale posterior uncertainty of either evaporation or precipitation tends to be underestimated. On the other hand, using mid-range to larger spatial correlation length scale values produces slightly different but substantively comparable posteriors and likelihood values. The relative insensitivity to ls suggests its value could potentially be chosen to a very large value to speed up the computations: in the limit of perfect spatial correlation, the formulas linking grid and basin scales simplify, which can result in much faster computations for large grids.
Table 3Average posterior standard deviation (mm month−1) for each water balance variable and likelihood values under different spatial correlation length-scale (ls) prior settings. The underlined values represent the results of the baseline setup of the water balance data fusion model, with GRACE-JPL Mascon data set and fixed correlation length scale at 50 km.
6.3 Sensitivity of the posteriors to the prior canal water import estimates
The baseline results presented in Sect. 5.1 are based on the canal water import data generated using a combination of canal design capacities multiplied by their operational time, and the data average of years with similar conditions (Appendix A, Sect. A2). Here, we evaluate the sensitivity of the probabilistic water balance model to the different approaches used to generate canal water estimates from little available data, compared to the one adopted in the baseline run (we refer to it here as scenario 1). To do so, we include two additional scenarios, one in which the prior canal estimates are computed using only the canal design capacities multiplied by their operational time obtained from irrigation schedules (scenario 2), and the other is based on the assumption that the total flows into the basin are 20 % of the monthly flows in the UGC main canal (scenario 3). For this evaluation, we assess the model fit given the data (likelihood value) and the average posterior standard deviation (mm month−1) for each water balance variable. The resulting water balance variables posteriors showed minimal variations with the different water supplies prior estimates (Table 4). Although the fit is better with scenarios 2 and 3, the posterior error for this term is the lowest for scenario 1. Generally, this analysis shows an insensitivity of water balance posteriors to the canal water data used, and indicates that the prior estimates presented in Sect. 5.1 are sufficient for the goal of constraining the other water balance variables.
Table 4Average posterior standard deviation (mm month−1) for each water balance variable and likelihood values with different canal water import prior estimates. The underlined values represent the results of the baseline setup of the water balance data fusion model, with GRACE-JPL Mascon data set and fixed correlation length scale at 50 km.
6.4 Extensions
The presented methodology is motivated by the availability of diverse water balance remote sensing data, with very few in situ data available for the Hindon basin, making independent validation of our estimates challenging. For this reason, we evaluated our results using soft validation techniques. For example, for evaporation, in the absence of in situ data, we compared the trend in our estimated evaporation with independent crop yield data, and the seasonal dynamics with known cropping and irrigation patterns (see Sect. 5.1). As for the precipitation posterior estimates, in Sect. 5.1 and Appendix C, these were compared to the spatially interpolated rain gauge dataset for the basin from the Indian Meteorological Department (IMD), keeping in mind the potential underestimation of precipitation by this dataset (Goteti and Famiglietti, 2024). Evaluating the total water storage estimates with independent data is more challenging. In a follow-up study, we will introduce separate rootzone and groundwater balance constraints, with the aim of estimating their contributions to the total water balance. At that point, it will become possible to use available remote sensing soil moisture data, as well as in-situ groundwater level data, for evaluation. Furthermore, adding these additional constraints and data will allow for updating the posterior estimates in this paper.
Additionally, the presented methodology is general and can be applied to other gauged river basins. For example, an application to multiple semi-arid basins was reported in Schoups and Nasseri (2021). The method has several advantages that allow this, including the straightforward and flexible implementation, as it consists of two separate parts: error models specification for each water balance variable that can be customized to fit specific settings, and the model solver that automatically computes the posterior distributions. In addition, the method is set up to rely on in situ and satellite data that inherently capture the hydrological processes. For example, the precipitation data sets estimate the precipitation phase (snow and rain), while evaporation data sets can be used to differentiate between the different land use classes. However, in snow-dominant basins where precipitation data sets might underrepresent this process, or in urban-dominant settings where coarse-resolution evaporation data products might overestimate evaporation, it may be valuable to tailor the error models to local conditions in order to improve the results. Perhaps, this could be achieved by complementing the precipitation error models with other satellite data sets, like the snow cover, snow depth, or temperature products, for better snow detection and mapping. Moreover, evaporation error models can be complemented with land use maps while also considering the use of high-resolution evaporation products or other data sets with improved evaporation estimates of heterogeneous urban surfaces. The major assumption here is that formulating the error models by exploiting ancillary information would allow it to solve for error parameters and water balance variables under varying climatic zones and settings. Alternatively, for this purpose, we could combine the data error models with hydrological models that explicitly account for detailed processes and differentiate between the different hydrological responses.
This paper presents a multi-scale monthly probabilistic water balance data fusion model for calibrating estimates of each component of the water balance. A key contribution of this paper is the calibration of gridded precipitation and evaporation estimates in an ensemble approach that fully exploits the prior information content of the data. To achieve this, the introduced methodology is applied at two scales: the basin scale and the grid scale. First, we formulate prior grid-scale and ensemble-based error models for precipitation and evaporation with unknown error parameters that describe the bias and random errors in their spatial fields. The spatially averaged inputs to the precipitation and evaporation error models, along with in situ data (river discharge and canal imports) and storage data from GRACE, then drive a basin-scale probabilistic water balance closure approach. In this approach, the water balance is treated as a Bayesian model by assigning prior distributions with unknown bias and noise parameters to each water balance variable. Combining these prior distributions with monthly water balance constraints results in posterior estimates of all parameters and variables that jointly close the water balance at the basin scale. The resulting basin-scale precipitation and evaporation posteriors and their cross-correlations are then used to update the prior grid-scale data from the first step. This is attained using a Kalman smoothing algorithm that ensures consistency between the grid-scale and the basin-scale estimates; that is, spatially averaging the calibrated gridded estimates yields monthly precipitation and evaporation values that jointly satisfy the water balance at the basin scale together with the other variables (C, Q, S).
We apply the introduced methodology to the Hindon Basin, a tributary of the Yamuna River that suffers from unsustainable irrigation practices relying on the local groundwater and imported surface water. The basin-scale results demonstrate that introducing an independent set of in situ data on surface water imports and river discharge, along with monthly water balance constraints, updates the prior information with new information, automatically adjusts different information sources for each water balance variable, while maintaining a closed water balance. The output of this study highlights the potential of the monthly water balance constraints in substantially reducing uncertainties by accounting for cross-correlations between all water balance variables. In addition, the model yields basin-wide posterior estimates of: (a) error parameters that are well constrained by all water balance data, and (b) consistent basin-scale water balance variables that jointly close the water balance. Transferring these basin-scale constraints to the grid scale results in: (a) posterior ensemble mean of precipitation and evaporation that fuses the pixel-wise information from extreme prior data bounds (min-max) and finds a balance between the two according to their relative errors, and (b) posterior random errors that reflect the model confidence about the location of the posterior mean at each grid cell.
Explicit to the introduced approach are the assumptions made about the data to characterize their associated errors in a parametric form. For instance, a range-based error model is used to quantify errors in the gridded precipitation and evaporation data. Here, the bias error is described as a weighted average of the minimum and maximum estimates at each spatial location. Whereas, the random error is quantified by scaling the maximum potential error (i.e., scaling one-quarter of the grid-scale full range). As an extension to this analysis, alternative data error models might be worth considering. E.g., a possible alternative is to use Gaussian mixture models that calibrate each data set individually, while still exploiting the water balance as a constraint.
As for the errors in a single GRACE data set, these are modeled using a noisy sine wave error model with unknown phase and amplitude error parameters that are used to correct for the dominant biases in the GRACE data (i.e., leakages). The sensitivity of the model results to the input data is assessed using two GRACE data sets: spherical harmonic (GRACE-CSR) and mascon solution (GRACE-JPL). This is done quantitatively by comparing the values of the inferred noise parameters (a smaller value is preferred) and that of the likelihood, as a larger likelihood corresponds to a better fit. While the GRACE-JPL demonstrated a better performance than GRACE-CSR based on these evaluation criteria, this could also be influenced by the assumptions surrounding the precipitation and evaporation error models. Ground-based piezometer data may also help resolve the difference between the downward trends in the two GRACE data sets for the studied basin, which is important for evaluating the sustainability of irrigation practices in the basin.
Finally, the random errors in the monthly river discharge and canal water import data are more or less fixed to relative errors. A larger prior uncertainty is assigned to the gap-filled canal import data (25 %) than to the river discharge (10 %). Regardless of the approach used to generate prior canal water import estimates from little available data, these were sufficient to constrain the other water balance terms. Additionally, the generated water balance estimates are not strongly affected by the assumed errors for the canal water and river discharge data, as these two variables are relatively small compared to the other monthly water balance variables (especially during rainy months when the two processes are driven by precipitation). We also examined the impact of fixing the spatial correlation length scale parameter on model fit and the posterior standard errors. The posterior results showed robustness and model fit improvement across mid-to-large values of prior parameter sets. This suggests that this parameter does not influence the model results in a large way.
A key aspect of the proposed methodology is that it is data-driven, exploiting the water balance constraint as an independent source of information. Therefore, the resulting calibrated data generated here can be used as baseline data sets for multiple applications, such as the validation or calibration of hydrological models, climate studies, and water accounting assessments. Nonetheless, updating the calibrated and spatially distributed estimates of precipitation and evaporation, and storage data generated herein with constraints describing the physical relations between detailed water balance variables (e.g., vertical and horizontal flows), along with additional in situ data (e.g., groundwater storage observations from piezometers), hold a great premise for synergetic and complementary use of all information sources. This will be the scope of a follow-up study for the investigated basin that aims to constrain water balance stocks and flows and their errors using grid-scale water balance constraints.
The following subsections summarize the available information on the Hindon basin land use elements (Fig. A1), share of canal water imports in the water balance (Fig. A2), canal network (Figs. A3 and A4), its operation, irrigation canal delivery data, and processing steps.
Figure A1Share of the canal water imports in the water balance, expressed as the percentage of the minimum and maximum remote sensing-based precipitation observations.
A1 Description of the canal irrigation system and its operation
Figures A3 and A4 provide an overview of the canal network's spatial distribution and design capacities. While these main canals run continuously, their distributaries are operated in rotation (Ahmad, 1991; Chaube et al., 2023). The canal distributaries in the Hindon basin are operated on an “on-off” basis as shown in Table A1. The irrigation schedules (also known as the rosters) are prepared for two irrigational seasons:
-
Kharif season in which the main irrigated crops are sugarcane, rice, cotton, maize, vegetables, and fodder crops
-
Rabi season in which the principal crops are wheat, barley, mustard, and peas.
Table A1Example of seasonal operational schedule (the ON-OFF basis) for Eastern Yamuna Canal (EYC) and Upper Ganga Canal (UGC) distributaries. Kharif spans from March to September 2021, while Rabi covers Oct 2021 to March 2022.
Figure A3A schematic of the UGC main canal and its distributaries with their respective discharge capacities (m3 s−1).
A2 Description of irrigation canal delivery data and processing steps
Constrained by the limited availability of data on water deliveries, we use an extrapolation approach to fill in the data gaps. The approach integrates two data sources, the irrigation schedules and the actual flow measurements from in-situ data, to produce a complete monthly timeseries. Explicit in this method is the assumption that the canal operations do not vary widely between years in which the conditions are similar. For instance, if years “A”, “B”, and “C” are classified as “dry years,” the canal operations don't differ much across these years. In other words, if year “A” has unknown estimates, these can be estimated from year “B” or the mean of the three years, depending on the number of years with canal data. Therefore, each group of distributaries is filled out based on two criteria: the data source and the number of years with available data. For the canals, such as the Deoband parallel, with actual flow measurements and ≥5 years of data availability, we solely rely on these data for extrapolation. To this end, we distinguish the years into three conditions (wet, dry, and normal years) using the Palmer Drought Severity Index (PDSI) Terra climate data set as a proxy. Then, every unknown year is filled with the long-term average of the years falling under the same category. For example, when a year such as 2015 is identified as “normal”, this year is filled with the mean of the canal delivery observed across all other “normal” years. A similar procedure was applied to the EYC lower intakes with actual flows from 2018 to 2022 to fill in missing data for similar years outside this period. For the other intakes with only a few years of data, however, we filled them differently. For instance, the lower intake points of the UGC with flow measurements for only two years, 2021 (dry) and 2022 (wet), we use these years as benchmarks for filling similar wet or dry years. For “normal” years on these intakes, we use the irrigation schedules instead. Explicitly, in cases with only a single-year irrigation schedule, the flows are obtained by multiplying the full design capacity by the operational time. This step produces conservative and monthly variable flow estimates due to the operational time variation of each distributary. Since the Deoband branch contributes a significant share of the total canal water supplies into the basin, we fill its missing data using the irrigation schedules to avoid the impacts of assuming no variation in operation across the years. In cases where single-year data is available for distributaries such as Bijwara and Baoli, we use the actual flows for this year (2018) as a reference to fill all other years. As for distributaries without data from design or actual flows, we assume these might not add much to the overall imports into the Hindon basin. At this stage, the canal delivery data is at the daily timestep. The last step is to aggregate to a monthly timescale, sum all intakes, and normalize by the basin area to obtain the gap-filled canal water imports expressed in water depth units (mm month−1).
B1 A major challenge arising from the use of uncalibrated remote sensing water balance data
The following figure shows that imbalance errors due to combining uncalibrated remote sensing products for each water balance variable can be significant ( (mm month−1)). This is related to errors associated with the data. Therefore, there is a need to quantify and reduce these uncertainties.
B2 Computing posteriors of individual variables, parameters, and posterior cross-correlations between water balance variables
This section briefly explains how all posteriors are computed. For the probabilistic water balance model described in Sect. 4.1 of the main text, we can represent its joint distribution as . This distribution consists of two parts: the monthly water balance variables x (comprising 5N + 1 variables including S0, , , Qt, Ct and St), where N is the number of months and is equal to 240 for 20 years of data considered in this paper, S0 is the initial basin water storage at the start of the first month and Sobs is the storage timeseries, and the parameter vector θ, including wP, rP, wE, fE, rE, aQ, bQ, aC, bC , σS, A, and δ. Our objective is to compute the full posterior distribution, which integrates all available information sources (Gaussian distributions) of θ and x.
The posterior of the parameter vector is expressed in the following form:
where p(θ) is the prior distribution for the parameters and corresponds to the product of the individual parameter priors defined in the previous basin-wide error model section, while the term p(Sobs|θ) refers to the parameter likelihood function. This function scores each set of bias parameters (e.g., wE, fE) and noise parameters (e.g., rE,σS). A large likelihood for the parameters is one that shifts the storage predicted from the water balance closer to the storage observations. As we show below, this likelihood also connects the DE-MCMC for parameter sampling and EP for posterior computations of water balance variables.
Figure B1Monthly timeseries of the imbalances (water balance errors) for each of the 15 unique uncalibrated combinations of the variables.
In the basin-wide inference setup, an outer MCMC loop iteratively proposes sets of parameter candidates (samples) using a non-parametric proposal (jumping) mechanism (Ter Braak and Vrugt, 2008). For each set of sampled parameters by MCMC, the EP (message passing) algorithm operates in an inner loop, computing the (a) unnormalized posterior density of the parameters proposed by MCMC (Eq. B1) and the conditional water balance posteriors . The normalizing constant of the conditional water balance posterior (p(Sobs|θ)) is the likelihood in Eq. (B1). EP evaluates the posterior odds of the model outputs given the sampled parameters, guiding the DE-MCMC decisions in accepting or rejecting new parameters. Compared to the standard single forward-backward algorithm, EP approximates the exact posterior distributions with Gaussian distributions sharing matching moments (mean and variance). It involves multiple back-and-forward passes over the data using the entire timeseries until all posteriors stabilize. A key aspect of the EP algorithm is its ability to efficiently handle non-Gaussian posteriors, which arise from physical non-negativity constraints we force on water balance variables (, , Q, C). Typically, a small number of iterations are sufficient to achieve convergence due to the mild non-Gaussianity induced by these constraints. The EP algorithm yields conditional water balance posteriors (conditioned on the Sobs data and the parameter vector θ). Instead of the conditional posterior distribution, we are interested in the marginal posterior distribution p(x|Sobs) over the individual water balance variables (S0, , , Q, and C). Such distributions are obtained by integrating (∼ averaging) over the parameter posterior distribution p(θ|Sobs), effectively accounting for the parameter uncertainty in the final water balance estimates:
This paragraph explains how to compute posterior cross-covariances between the different water balance variables given the posterior marginal distributions of each variable (as output by the water balance data fusion model). For this, we will need two building blocks to construct the joint precision matrix (Θ) equal to the inverse of the covariance matrix (V∗), namely: the variables' linear relation through the water balance coefficients extracted from the water balance constraint at each time step: and “uncorrelated priors” of each variable. The term “priors” reflects uncertainty before considering the relation of each variable with other variables. With the marginal distributions available at this point, we can back-calculate these priors such that they satisfy the marginals through a message-passing algorithm. This algorithm relies on decomposing the marginal precisions (τ) for each variable into: (a) prior precisions (messages) for each variable and (b) precisions (messages) sent to each variable from all other variables via the water balance. Specifically, in each month, we have two conditions: the water balance constraint and the probabilistic constraints in the form of marginal Gaussian distributions written separately for individual variables x as , with posterior mean and variance obtained from Eq. (27) in the main text. Expressed in terms of precisions, we can write the decomposed posterior precision of S (τS), for example, as: where τS→ is the prior precision from S, and τ→S is the precision to S from the water balance. The latter can be obtained from all other variables by propagating uncertainty through the linear water balance equation. As such, the relationship between the posterior and prior variances of S that fulfill the two defined conditions becomes:
We can repeat the above step for all other variables to create a linear system of six equations with six unknowns (prior variances) that we need to solve for:
Where are the posterior marginal variances of the variables: Storage S, initial storage state for the first month S0, , , Q, and C, respectively. While, and vC→ are their corresponding prior variances.
The resulting priors are used to generate the joint precision matrix Θ of S0, P, E, Q, and C, which combines two terms:
The first term represents a matrix that encodes how each variable varies with every other variable in the water balance. This term is scaled by the prior precision from S (τS→), whereas the second term is a diagonal matrix of the other individual water balance variables prior precisions. The inverse of the joint precision matrix gives the posterior covariance matrix between all variables (in a given month), which contains all posterior cross-correlations.
C1 Water balance priors and posteriors
This section displays the results of the seasonal and annual posterior means of the water balance variables compared to their corresponding prior data.
Figure C1Comparison of posterior mean () to the prior data sets describing the precipitation (P) variable in the Hindon basin, the top panel depicts (a) monthly mean P, and the bottom panel (b) shows the annual P time series.
C2 Comparison of individual gridded products to the posterior means
This section presents the spatial maps of the deviation of the individual precipitation and evaporation products from their posterior mean.
Figure C3Comparison of the monthly mean posterior mean (mC*) to the prior data describing the surface water imports (C) variable in the Hindon basin.
Figure C4Interplay between the annual: evaporation posterior mean (), air temperature variable from GLDAS Noah Land Surface Model v2.1 (Rodell et al., 2004), MODIS 15 Leaf Area Index (LAI) (Myneni et al., 2021), and district-wise sugarcane crop yield from International Crops Research Institute (2020) for the Semi-Arid Tropics database.
All software used to collect the data is available at https://doi.org/10.5281/zenodo.11148992 (Mourad and Schoups, 2024). The source code developed for this research is available via: https://zenodo.org/records/17348274 (last access: 14 October 2025).
RM: methodology, software, formal analysis, validation, visualisation, writing original draft, writing (review & editing). GS: conceptualization, methodology, software, formal analysis, validation, writing (review & editing), supervision, funding acquisition. VR: data provision, review. WB: funding acquisition, review.
The contact author has declared that none of the authors has any competing interests.
Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. The authors bear the ultimate responsibility for providing appropriate place names. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
The authors would like to thank Bas Mohrmann for coordinating and facilitating access to the river discharge and canal water import data used in this study. We also thank Ankur Rohel, Dikshant Bodana, Vivek Tiwari, and M. Sai Bargav Reddy for helping with canal water data access and entry.
This work is supported by the Dutch Research Council, Government of The Netherlands, through the Meridian Fund (project no. 482.20.303) and by the Department of Science & Technology, Government of India (project no. DST/TMDEWO/WTI/NWO/CGAW/2020/14), under the jointly funded Cleaning the Ganga and Agri-Water initiative and the Hindon Roots Sensing (HIROS) project: River Rejuvenation through Scalable Water- and Solute Balance Modelling and Informed Farmers' Actions.
This paper was edited by Zhongbo Yu and reviewed by two anonymous referees.
Abatzoglou, J. T., Dobrowski, S. Z., Parks, S. A., and Hegewisch, K. C.: TerraClimate, a high-resolution global dataset of monthly climate and climatic water balance from 1958–2015, Scientific data, 5, 1–12, https://doi.org/10.1038/sdata.2017.191, 2018.
Abolafia-Rosenzweig, R., Pan, M., Zeng, J., and Livneh, B.: Remotely sensed ensembles of the terrestrial water budget over major global river basins: An assessment of three closure techniques, Remote Sensing of Environment, 252, 112191, https://doi.org/10.1016/j.rse.2020.112191, 2021.
Abramowitz, M. and Stegun, I. A.: Handbook of mathematical functions with formulas, graphs, and mathematical tables, US Government printing office, https://archive.org/details/AandS-mono600?utm_source (last access: 1 February 2025), 1968.
Ahmad, F.: Canal Irrigation: Major Legal Issues, Journal of the Indian Law Institute, 33, 589–613, 1991.
Aires, F.: Combining datasets of satellite-retrieved products. Part I: Methodology and water budget closure, Journal of Hydrometeorology, 15, 1677–1691, https://doi.org/10.1175/JHM-D-13-0148.1, 2014.
Alam, F. and Umar, R.: Groundwater flow modelling of Hindon-Yamuna interfluve region, western Uttar Pradesh, Journal of the Geological Society of India, 82, 80–90, https://doi.org/10.1007/s12594-013-0113-8, 2013.
Anderson, M. C., Norman, J. M., Mecikalski, J. R., Otkin, J. A., and Kustas, W. P.: A climatological study of evapotranspiration and moisture stress across the continental United States based on thermal remote sensing: 2. Surface moisture climatology, Journal of Geophysical Research: Atmospheres, 112, https://doi.org/10.1029/2006JD007507, 2007.
Anderson, M. C., Zolin, C. A., Hain, C. R., Semmens, K., Yilmaz, M. T., and Gao, F.: Comparison of satellite-derived LAI and precipitation anomalies over Brazil with a thermal infrared-based Evaporative Stress Index for 2003–2013, Journal of Hydrology, 526, 287–302, https://doi.org/10.1016/j.jhydrol.2015.01.005, 2015.
Barkhordari, H., Dastjerdi, P. A., and Nasseri, M.: Development of a framework estimating regional gridded streamflow and actual evapotranspiration datasets: Fusing Budyko and water balance closure methods using remotely sensed ancillary data, Journal of Hydrology, 133456, https://doi.org/10.1016/j.jhydrol.2025.133456, 2025.
Bastiaanssen, W., Cheema, M., Immerzeel, W., Miltenburg, I., and Pelgrum, H.: Surface energy balance and actual evapotranspiration of the transboundary Indus Basin estimated from satellite measurements and the ETLook model, Water Resources Research, 48, https://doi.org/10.1029/2011WR010482, 2012.
Bastiaanssen, W. G. M., Hoekman, D. H., and Roebeling, R.: A methodology for the assessment of surface resistance and soil water storage variability at mesoscale based on remote sensing measurements. A case study with HAPEX-EFEDA data, Landbouwuniversiteit Wageningen0926-230X, 1993.
Beck, H. E., Wood, E. F., Pan, M., Fisher, C. K., Miralles, D. G., Van Dijk, A. I., McVicar, T. R., and Adler, R. F.: MSWEP V2 global 3-hourly 0.1 precipitation: methodology and quantitative assessment, Bulletin of the American Meteorological Society, 100, 473–500, https://doi.org/10.1175/BAMS-D-17-0138.1, 2019.
Cawse-Nicholson, K., Braverman, A., Kang, E. L., Li, M., Johnson, M., Halverson, G., Anderson, M., Hain, C., Gunson, M., and Hook, S.: Sensitivity and uncertainty quantification for the ECOSTRESS evapotranspiration algorithm–DisALEXI, International journal of applied earth observation and geoinformation, 89, 102088, https://doi.org/10.1016/j.jag.2020.102088, 2020.
Central Water Commission: Discahrge data for Galeta Station, Obtained from CWC, Government of India [data set], 2024.
Chaube, U. C., Pandey, A., and Singh, V. P.: Canal Irrigation Systems in India: Operation, Maintenance, and Management, Springer Nature, https://doi.org/10.1007/978-3-031-42812-8, 2023.
Chen, H., Ding, L., and Tuo, R.: Kernel packet: An exact and scalable algorithm for Gaussian process regression with Matérn correlations, Journal of machine learning research, 23, 1–32, https://doi.org/10.5555/3586589.3586716, 2022.
Crosetto, M., Ruiz, J. A. M., and Crippa, B.: Uncertainty propagation in models driven by remotely sensed data, Remote Sensing of Environment, 76, 373–385, https://doi.org/10.1016/S0034-4257(01)00184-5, 2001.
Dwivedi, P. and Yadav, B. K.: Inference of hydrological modelling and field-based monitoring on dynamics of heavy metals in water of Hindon Basin, Journal of Hydrology: Regional Studies, 60, 102488, https://doi.org/10.1016/j.ejrh.2025.102488, 2025.
Foken, T.: The energy balance closure problem: An overview, Ecological Applications, 18, 1351–1367, https://doi.org/10.1890/06-0922.1, 2008.
Goteti, G. and Famiglietti, J.: Extent of gross underestimation of precipitation in India, Hydrology and Earth System Sciences, 28, 3435–3455, https://doi.org/10.5194/hess-28-3435-2024, 2024.
Guerschman, J. P., Van Dijk, A. I., Mattersdorf, G., Beringer, J., Hutley, L. B., Leuning, R., Pipunic, R. C., and Sherman, B. S.: Scaling of potential evapotranspiration with MODIS data reproduces flux observations and catchment water balance observations across Australia, Journal of hydrology, 369, 107–119, https://doi.org/10.1016/j.jhydrol.2009.02.013, 2009.
Gumma, M. K., Thenkabail, P. S., Panjala, P., Teluguntla, P., Yamano, T., and Mohammed, I.: Multiple agricultural cropland products of South Asia developed using Landsat-8 30 m and MODIS 250 m data using machine learning on the Google Earth Engine (GEE) cloud and spectral matching techniques (SMTs) in support of food and water security, GIScience & Remote Sensing, 59, 1048–1077, https://doi.org/10.1080/15481603.2022.2088651, 2022.
Handcock, M. S. and Stein, M. L.: A Bayesian analysis of kriging, Technometrics, 35, 403–410, https://doi.org/10.1080/00401706.1993.10485354, 1993.
Heberger, M. G., Pellet, V., and Aires, F.: Improving Satellite Remote Sensing Estimates of the Global Terrestrial Hydrologic Cycle via Neural Network Modeling, Authorea Preprints, https://doi.org/10.22541/essoar.169049081.19217053/v1, 2023.
Hensman, J., Matthews, A., and Ghahramani, Z.: Scalable variational Gaussian process classification, Artificial intelligence and statistics, 351–360, 2015.
Hessels, T., Davids, J. C., and Bastiaanssen, W.: Scalable Water Balances from Earth Observations (SWEO): results from 50 years of remote sensing in hydrology, Water International, 47, 866–886, https://doi.org/10.1080/02508060.2022.2117896, 2022.
Hobeichi, S., Abramowitz, G., Contractor, S., and Evans, J.: Evaluating precipitation datasets using surface water and energy budget closure, Journal of Hydrometeorology, 21, 989–1009, https://doi.org/10.1175/JHM-D-19-0255.1, 2020.
Hong, Y., Hsu, K. L., Moradkhani, H., and Sorooshian, S.: Uncertainty quantification of satellite precipitation estimation and Monte Carlo assessment of the error propagation into hydrologic response, Water resources research, 42, https://doi.org/10.1029/2005WR004398, 2006.
Horner, I., Renard, B., Le Coz, J., Branger, F., McMillan, H., and Pierrefeu, G.: Impact of stage measurement errors on streamflow uncertainty, Water Resources Research, 54, 1952–1976, https://doi.org/10.1002/2017WR022039, 2018.
Hou, A. Y., Kakar, R. K., Neeck, S., Azarbarzin, A. A., Kummerow, C. D., Kojima, M., Oki, R., Nakamura, K., and Iguchi, T.: The global precipitation measurement mission, Bulletin of the American meteorological Society, 95, 701-722, https://doi.org/10.1175/BAMS-D-13-00164.1, 2014.
Huffman, G., Stocker, E., Bolvin, D., Nelkin, E., and Tan, J.: GPM IMERG Final Precipitation L3 1 month 0.1 degree × 0.1 degree V06, Goddard Earth Sciences Data and Information Services Center (GES DISC): Greenbelt, MD, USA, https://gpm.nasa.gov/data/directory (last access: 14 November 2024), 2019.
Huffman, G. J., Bolvin, D. T., Nelkin, E. J., Wolff, D. B., Adler, R. F., Gu, G., Hong, Y., Bowman, K. P., and Stocker, E. F.: The TRMM multisatellite precipitation analysis (TMPA): Quasi-global, multiyear, combined-sensor precipitation estimates at fine scales, Journal of hydrometeorology, 8, 38–55, https://doi.org/10.1175/JHM560.1, 2007.
Indian Council of Agricultural Research: Indian Council of Agricultural Research, Co 0238 – The Wonder Variety of Sugarcane: https://icar.org.in/en/node/4869 (last access: 14 October 2025), 2022.
International Crops Research Institute for the Semi-Arid Tropics: District Level Database (DLD): Crops dataset, http://data.icrisat.org/dld/src/crops.html (last access: 14 October 2025), 2020.
Irrigation & Water Resources Department: Canal discharge data for Hindon River Basin, Obtained from Irrigation & Water Resources Department, Uttar Pradesh, Government of India [data set], 2024.
Joseph, J. and Ghosh, S.: Representing Indian agricultural practices and paddy cultivation in the variable infiltration capacity model, Water Resources Research, 59, e2022WR033612, https://doi.org/10.1029/2022WR033612, 2023.
Karimi, P., Bastiaanssen, W. G., Molden, D., and Cheema, M. J. M.: Basin-wide water accounting based on remote sensing data: an application for the Indus Basin, Hydrology and Earth System Sciences, 17, 2473–2486, https://doi.org/10.5194/hess-17-2473-2013, 2013.
Kuczera, G.: Correlated rating curve error in flood frequency inference, Water Resources Research, 32, 2119–2127, https://doi.org/10.1029/96WR00804, 1996.
L'Ecuyer, T. S. and Stephens, G. L.: An estimation-based precipitation retrieval algorithm for attenuating radars, Journal of Applied Meteorology and Climatology, 41, 272–285, https://doi.org/10.1175/1520-0450(2002)041<0272:AEBPRA>2.0.CO;2, 2002.
Landerer, F. W. and Swenson, S.: Accuracy of scaled GRACE terrestrial water storage estimates, Water Resources Research, 48, https://doi.org/10.1029/2011WR011453, 2012.
Long, D., Longuevergne, L., and Scanlon, B. R.: Uncertainty in evapotranspiration from land surface modeling, remote sensing, and GRACE satellites, Water Resources Research, 50, 1131–1151, https://doi.org/10.1002/2013WR014581, 2014.
Luo, Z., Li, H., Zhang, S., Wang, L., Wang, S., and Wang, L.: A Novel Two-Step Method for Enforcing Water Budget Closure and an Intercomparison of Budget Closure Correction Methods Based on Satellite Hydrological Products, Water Resources Research, 59, e2022WR032176, https://doi.org/10.1029/2022WR032176, 2023.
Maggioni, V., Massari, C., and Kidd, C.: Errors and uncertainties associated with quasiglobal satellite precipitation products, in: Precipitation science, Elsevier, 377–390, https://doi.org/10.1016/B978-0-12-822973-6.00023-8, 2022.
Massari, C., Crow, W., and Brocca, L.: An assessment of the performance of global rainfall estimates without ground-based observations, Hydrology and Earth System Sciences , 21, 4347–4361, https://doi.org/10.5194/hess-21-4347-2017, 2017.
McMillan, H., Krueger, T., and Freer, J.: Benchmarking observational uncertainties for hydrology: rainfall, river discharge and water quality, Hydrological Processes, 26, 4078-4111, https://doi.org/10.1002/hyp.9384, 2012.
Mourad, R., Schoups, G., Bastiaanssen, W., and Kumar, D. N.: Expert-based prior uncertainty analysis of gridded water balance components: Application to the irrigated Hindon River Basin, India, Journal of Hydrology: Regional Studies, 55, 101935, https://doi.org/10.1016/j.ejrh.2024.101935, 2024.
Mourad, R. and Schoups, G.: Water Balance Prior Uncertainty Analysis Software, Zenodo [code], https://doi.org/10.5281/zenodo.11148992, 2024.
Munier, S., Aires, F., Schlaffer, S., Prigent, C., Papa, F., Maisongrande, P., and Pan, M.: Combining data sets of satellite-retrieved products for basin-scale water balance study: 2. Evaluation on the Mississippi Basin and closure correction model, Journal of Geophysical Research: Atmospheres, 119, 12100–112116, https://doi.org/10.1002/2014JD021953, 2014.
Myneni, R., Knyazikhin, Y., and Park, T.: MODIS/Terra Leaf Area Index/FPAR 8-Day L4 Global 500 m SIN Grid V061, NASA EOSDIS Land Processes Distributed Active Archive Center (DAAC), MOD15A12H [data set], 061, https://doi.org/10.5067/MODIS/MOD15A2H.061, 2021.
Pai, D., Rajeevan, M., Sreejith, O., Mukhopadhyay, B., and Satbha, N.: Development of a new high spatial resolution (0.25×0.25) long period (1901–2010) daily gridded rainfall data set over India and its comparison with existing data sets over the region, Mausam, 65, 1–18, https://doi.org/10.54302/mausam.v65i1.851, 2014.
Pan, M. and Wood, E. F.: Data assimilation for estimating the terrestrial water budget using a constrained ensemble Kalman filter, Journal of Hydrometeorology, 7, 534–547, https://doi.org/10.1175/JHM495.1, 2006.
Pan, M., Sahoo, A. K., Troy, T. J., Vinukollu, R. K., Sheffield, J., and Wood, E. F.: Multisource estimation of long-term terrestrial water budget for major global river basins, Journal of Climate, 25, 3191–3206, https://doi.org/10.1175/JCLI-D-11-00300.1, 2012.
Pellet, V., Aires, F., Munier, S., Fernández Prieto, D., Jordá, G., Dorigo, W. A., Polcher, J., and Brocca, L.: Integrating multiple satellite observations into a coherent dataset to monitor the full water cycle – application to the Mediterranean region, Hydrology and Earth System Sciences, 23, 465–491, https://doi.org/10.5194/hess-23-465-2019, 2019.
Rodell, M., Velicogna, I., and Famiglietti, J. S.: Satellite-based estimates of groundwater depletion in India, Nature, 460, 999–1002, https://doi.org/10.1038/nature08238, 2009.
Rodell, M., Beaudoing, H. K., L'ecuyer, T., Olson, W. S., Famiglietti, J. S., Houser, P. R., Adler, R., Bosilovich, M. G., Clayson, C. A., and Chambers, D.: The observed state of the water cycle in the early twenty-first century, Journal of Climate, 28, 8289–8318, https://doi.org/10.1175/JCLI-D-14-00555.1, 2015.
Rodell, M., Houser, P., Jambor, U., Gottschalck, J., Mitchell, K., Meng, C.-J., Arsenault, K., Cosgrove, B., Radakovich, J., and Bosilovich, M.: The global land data assimilation system, Bulletin of the American Meteorological society, 85, 381–394, https://doi.org/10.1175/BAMS-85-3-381, 2004.
Sahoo, A. K., Pan, M., Troy, T. J., Vinukollu, R. K., Sheffield, J., and Wood, E. F.: Reconciling the global terrestrial water budget using satellite remote sensing, Remote Sensing of Environment, 115, 1850–1865, https://doi.org/10.1016/j.rse.2011.03.009, 2011.
Schoups, G. and Nasseri, M.: GRACEfully closing the water balance: A data-driven probabilistic approach applied to river basins in Iran, Water Resources Research, 57, e2020WR029071, https://doi.org/10.1029/2020WR029071, 2021.
Senay, G. B.: Satellite psychrometric formulation of the Operational Simplified Surface Energy Balance (SSEBop) model for quantifying and mapping evapotranspiration, Applied Engineering in Agriculture, 34, 555–566, 10.13031/aea.12614, 2018.
Senay, G. B., Parrish, G. E., Schauer, M., Friedrichs, M., Khand, K., Boiko, O., Kagone, S., Dittmeier, R., Arab, S., and Ji, L.: Improving the Operational Simplified Surface Energy Balance Evapotranspiration Model Using the Forcing and Normalizing Operation, Remote Sensing, 15, 260, https://doi.org/10.3390/rs15010260, 2023.
Sheffield, J., Wood, E. F., Pan, M., Beck, H., Coccia, G., Serrat-Capdevila, A., and Verbist, K.: Satellite remote sensing for water resources management: Potential for supporting sustainable development in data-poor regions, Water Resources Research, 54, 9724–9758, https://doi.org/10.1029/2017WR022437, 2018.
Sobol, I. M.: Global sensitivity indices for nonlinear mathematical models and their Monte Carlo estimates, Mathematics and computers in simulation, 55, 271–280, https://doi.org/10.1016/S0378-4754(00)00270-6, 2001.
Sun, Q., Miao, C., Duan, Q., Ashouri, H., Sorooshian, S., and Hsu, K. L.: A review of global precipitation data sets: Data sources, estimation, and intercomparisons, Reviews of Geophysics, 56, 79–107, https://doi.org/10.1002/2017RG000574, 2018.
Sun, Y., Bowman, K. P., Genton, M. G., and Tokay, A.: A Matérn model of the spatial covariance structure of point rain rates, Stochastic Environmental Research and Risk Assessment, 29, 411–416, https://doi.org/10.1007/s00477-014-0923-2, 2015.
Swenson, S. and Wahr, J.: Post-processing removal of correlated errors in GRACE data, Geophysical research letters, 33, https://doi.org/10.1029/2005GL025285, 2006.
Ter Braak, C. J. and Vrugt, J. A.: Differential evolution Markov chain with snooker updater and fewer chains, Statistics and Computing, 18, 435–446, https://doi.org/10.1007/s11222-008-9104-9, 2008.
Tian, Y. and Peters-Lidard, C. D.: A global map of uncertainties in satellite-based precipitation measurements, Geophysical Research Letters, 37, https://doi.org/10.1029/2010GL046008, 2010.
Umar, R., Khan, M. M. A., Ahmed, I., and Ahmed, S.: Implications of Kali-Hindon inter-stream aquifer water balance for groundwater management in western Uttar Pradesh, Journal of earth system science, 117, 69–78, https://doi.org/10.1007/s12040-008-0014-1, 2008.
Wada, Y., Van Beek, L. P., Van Kempen, C. M., Reckman, J. W., Vasak, S., and Bierkens, M. F.: Global depletion of groundwater resources, Geophysical research letters, 37, https://doi.org/10.1029/2010GL044571, 2010.
Wahr, J., Swenson, S., Zlotnicki, V., and Velicogna, I.: Time-variable gravity from GRACE: First results, Geophysical Research Letters, 31, https://doi.org/10.1029/2004GL019779, 2004.
Yin, G. and Park, J.: The use of triple collocation approach to merge satellite-and model-based terrestrial water storage for flood potential analysis, Journal of Hydrology, 603, 127197, https://doi.org/10.1016/j.jhydrol.2021.127197, 2021.
Zhang, K., Kimball, J. S., and Running, S. W.: A review of remote sensing based actual evapotranspiration estimation, Wiley interdisciplinary reviews: Water, 3, 834–853, https://doi.org/10.1002/wat2.1168, 2016a.
Zhang, Y., Pan, M., and Wood, E. F.: On creating global gridded terrestrial water budget estimates from satellite remote sensing, Remote sensing and water resources, 59–78, https://doi.org/10.1007/978-3-319-32449-4_4, 2016b.
Zhang, Y., Pan, M., Sheffield, J., Siemann, A. L., Fisher, C. K., Liang, M., Beck, H. E., Wanders, N., MacCracken, R. F., Houser, P. R., Zhou, T., Lettenmaier, D. P., Pinker, R. T., Bytheway, J., Kummerow, C. D., and Wood, E. F.: A Climate Data Record (CDR) for the global terrestrial water budget: 1984–2010, Hydrology and Earth System Sciences, 22, 241–263, https://doi.org/10.5194/hess-22-241-2018, 2018.
- Abstract
- Introduction
- Case Study: Hindon Basin in Northern India
- Probabilistic water balance model
- Inference methods
- Results
- Discussion
- Conclusions
- Appendix A: Additional data
- Appendix B: Additional methods
- Appendix C: Additional results
- Code and data availability
- Author contributions
- Competing interests
- Disclaimer
- Acknowledgements
- Financial support
- Review statement
- References
- Abstract
- Introduction
- Case Study: Hindon Basin in Northern India
- Probabilistic water balance model
- Inference methods
- Results
- Discussion
- Conclusions
- Appendix A: Additional data
- Appendix B: Additional methods
- Appendix C: Additional results
- Code and data availability
- Author contributions
- Competing interests
- Disclaimer
- Acknowledgements
- Financial support
- Review statement
- References