the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Investigating sources of variability in closing the terrestrial water balance with remote sensing
Claire Irene Michailovsky
Bert Coerver
Marloes Mul
Graham Jewitt
Abstract. Remote sensing (RS) data is becoming an increasingly important source of information for water resources management as it provides spatially distributed data on water availability and use. However, in order to guide appropriate use of the data, it is important to understand the impact of the uncertainties of RS data on water resources studies. Previous studies have shown that the degree of closure of the water balance from remote sensing data is highly variable across basins and that different RS products vary in their levels of accuracy depending on climatological and geographical conditions.
In this paper we analyzed the water balance derived runoff from open access RS products for 591 catchments across the globe. We compared time-series of runoff estimated through a simplified water balance equation using 3 precipitation (CHIRPS, GPM and TRMM), 5 evapotranspiration (MODIS, SSEBop, GLEAM, CMRSET and SEBS) and 3 water storage change (GRACE-CSR, GRACE-JPL and GRACE-GFZ) RS datasets with monthly in situ discharge data for the period 2003–2016. Results were analyzed through the lens of 11 quantifiable catchment characteristics in order to investigate correlations between catchment characteristics and the quality of RS based water balance estimates of runoff, and whether specific products performed better than others in certain conditions.
The median Nash Sutcliffe Efficiency (NSE) for all gauges and all product combinations was −0.03, and only 43.3 % of the time-series reached positive NSE. A positive NSE could be obtained for 72.5 % of stations with at least one product combination, while the overall best performing product combination was positive for 53.8 % of stations. This confirms previous findings that the best performing products cannot be globally established. When investigating the results by catchment characteristic, all combinations tended to show similar correlations between catchment characteristics and quality of estimated runoff, with the exception of combinations using MODIS ET for which the correlation was frequently reversed. The combinations with the GPM precipitation product performed generally worse than the CHIRPS and TRMM data. However, this can be attributed to the fact that the GPM data is available at higher latitudes compared to the other products, where performance is generally poorer. When removing high latitude stations, this difference was eliminated and GPM and TRMM showed similar performance.
The results show the highest positive correlation between highly seasonal rainfall and runoff NSE. On the other hand, increasing snow cover, altitude and latitude all decreased the ability of the RS products to close the water balance. The catchment’s dominant climate zone was also found to be correlated with time series performance with the tropical areas providing the highest (median NSE = .11) and arid areas the lowest (median NSE = −0.09) NSE values. No correlation was found between catchment area and runoff NSE. The results point to the importance of further detailed studies on the uncertainties of the different data products and how these interact, as well as new approaches to using the data rather than simple water balance type approaches. Efforts to improve specific satellite products can also be better targeted using the results of this study.
Claire Irene Michailovsky et al.
Status: open (until 12 Jun 2023)
-
RC1: 'Comment on hess-2023-81', Anonymous Referee #1, 12 May 2023
reply
Closing water balance is an important issue for the hydrology community. This paper analyzed the water balance for 591 catchments across the globe to investigate correlations between catchment characteristics and the quality of RS based water balance estimates. It demonstrates whether specific products performed better than others in certain conditions. This will provide important referee for other scientists. However, to make their conclusion to be more rigorous, I suggest to include other dataset, which might influence some statements in this paper. The presentation is well structured and the quality of writing is good. I suggest it could be published after this revision.
This study uses SEBS ET data which is produced by the SEBS model updated by Chen et al. 2013. The SEBS ET data used in this study is produced by a monthly input data. This model version has problems over the high canopy which is already reported in Chen et al. 2019. Chen et al. 2019 paper has solved the low estimation of sensible heat flux over the forest area. After the model revision, a daily global ET data at 0.05 degree has been produced by Chen et al. 2021 JGR. The daily ET data has been shared through:
https://data.tpdc.ac.cn/en/data/df4005fb-9449-4760-8e8a-09727df9fe36/
This ET is a seamless data based on energy balance structure of SEBS. I suggested this dataset should be included in this analysis, since it has a high potential to change the conclusions on the dataset combination performance. In addition, the SEBS monthly ET has missing pixels over the Sahara, Arabian desert, Taiga in Canada and Rsussia. Meanwhile the updated daily ET data in Chen et al. 2021 is a gap-free daily ET data. Both ET dataset has the same spatial resolution. Hereby, I believe that this dataset will benefit this study.
Chen, X., Massman, W.J. and Su, B.Z., 2019. A column canopy-air turbulent diffusion method for different canopy structures. Journal of Geophysical Research: Atmospheres, 124: 488–506.
Chen, X., Su, Z., Ma, Y., Trigo, I. and Gentine, P., 2021. Remote Sensing of Global Daily Evapotranspiration based on a Surface Energy Balance Method and Reanalysis Data. Journal of Geophysical Research: Atmospheres, 126(16): e2020JD032873.
Citation: https://doi.org/10.5194/hess-2023-81-RC1 -
RC2: 'Comment on hess-2023-81', Anonymous Referee #2, 12 May 2023
reply
This work by Michailovsky et al. was motivated by the variability of the degree of closure of the water balance for different catchment characteristics and for the use of different RS products in previous studies. Their goal was twofold: (1) testing which combinations of RS products (precipitation, ET and water storage change products) best reproduce in situ measurements of discharge for multiple catchments, and (2) identifying catchment characteristics which can explain the achieved degree of closure of the water balance. They used 45 combinations of RS products, and used the water balance to compare the resulting monthly discharge against in situ measurements of 591 catchments. Finally, they used 11 quantifiable catchment characteristics to evaluate how well these monthly discharges match.
The article is of great interest to all those who (aim to) use remote sensing products for closing the water balance in order to estimate discharge, particularly in poorly gauged river basins. The article is well-structured and well-written. Given the relevance of this work, I recommend to publish this article after considering the comments below.
General comments
C1.I believe the authors could bring their analysis one step further by better discussing why some RS product combinations perform better than others for specific areas / catchment characteristics.
In order to understand why certain products perform better in certain catchments, one needs to know what are the general differences between the products for the same variable (i.e. measurement principle, data sources, idea of the algorithm, etc.). A brief explanation of the differences between each product could be added to paragraphs 2.1.1, 2.1.2 and 2.1.3. In chapter 3, the authors could refer to these differences when explaining why for example one ET product performs better than another ET product, in combination with the same P and dS/dT products, in certain types of catchments. This would be very valuable information for those who have to choose products to estimate discharge.
C2. Consider adding a Figure/Table which shows the best performing product combinations for certain common combinations of catchment characteristics. This would be most informative for water managers of ungauged basins.
C3. Most of the data sets used are products from missions that are not in orbit any more. I think it is relevant for the target users to describe what are the current / future missions and products, which are comparable to the missions used for the current analysis. Some of this information is given in a footnote in Table 1 (GRACE-FO, TMPA), but given its relevance for the users of this work, I think this deserves a paragraph in the text.
C4. Did you consider adding dominant soil type and/or subsurface bedrock depth to the catchment characteristics? These are important factors affecting the discharge of a catchment, particularly when looking at different continents.
Line-specific comments
C5. In line 77-103, a clear overview of previous related studies is given. However, I find the novelty of the current study a bit underexposed in lines 104-107. Both aims that are mentioned (i.e. to investigate both the ability of different combinations of RS products to reproduce in situ measurements of discharge, and to identify catchment characteristics that affect how well the closure of the water balance can be achieved) have been studies before. Here, it is not yet clear what the current study adds to the previous studies, and why this study is necessary. I recommend to formulate the gap in knowledge and added value of the current study more clearly in this section.
C6. L54 / L70: Equation 1 and 2 are exactly the same, while Eq. 2 should be a rewritten version of Eq.1.
C7. L66 “This is equivalent to the assumption that subsurface fluxes in and out of the basin are negligible”. Consider referring to Bouaziz et al. (2018), who disprove this assumption.
C8. L440: “100,000 km2”. Based on previous text, I assume this should be 10,000 km2.
C9. L233: “We selected 11 RS derived catchment characteristics… . These are summarized in Table 2 …” Table 2 only shows 10 RS catchment characteristics.
C10: L395: “This was unexpected as the GRACE data in particular is expected to perform better for larger catchments” Given the next sentence, I think this sentence is redundant.
C11. Fig. 5: The colors of SSEBOP and and GLEAM are too similar. It is difficult to distinguish the two. Please adjust the colors.
C12. Fig. 6: Consider revising the color scheme used. Make sure the colors for a positive correlation can be clearly distinguished from the colors for a negative correlation. Now, the green color representing a Pearson correlation of 0-0.25 is similar to the green/blue colors representing a negative correlation.
References
Bouaziz, L., Weerts, A., Schellekens, J., Sprokkereef, E., Stam, J., Savenije, H., & Hrachowitz, M. (2018). Redressing the balance: quantifying net intercatchment groundwater flows. Hydrology and Earth System Sciences, 22(12), 6415-6434.
Citation: https://doi.org/10.5194/hess-2023-81-RC2 -
RC3: 'Reviewer comments on Investigating sources or variability in closing the terrestrial balance with remote sensing (hess-2023-81)', Roelof Rietbroek, 17 May 2023
reply
Recommendation:
accept after (some) major revisions
General remarks:
The authors have investigated river discharge derived from various combinations of GRACE gridded products, in combination with precipitation and evapotranspiration products from several sources. The estimated discharge estimates are rigorously compared in terms of their Nash-Sutclife efficiency (NSE) with in situ data from the global river discharge centre. Furthermore, correlations between the resulting NSE and a list of catchment characteristics are compared to see if certain characteristics are linked to better performing discharge estimates.The authors find an overall median NSE close to zero while less than half of the combinations reach a positive NSE. In the most optimal case, when selecting the best performing combination for each catchment, 72.5 percent of the comparison exhibit a positive NSE. The analysis shows that a one-size-fits-all combination of GRACE and P, ET products can not be found.
I've found the paper interesting to read and I appreciate the rigorous approach to testing all possible data combinations. From my viewpoint, the paper is definitely suitable for publications after consideration of some issues (see below). I assume these are relatively easy to address, but since some may involve redoing some computations I still opted for a major revision.
Main issues:
* Relevant context on the GRACE spatial resolution and signal leakage is missing for the particular products used. In the study, gridded GRACE products have been used. This is in principle ok, but it the inherent spatial resolution of the GRACE derived total water storage is important and should be mentioned. The truncation (spherical harmonic) degree of the input gravity field solutions in combination with the strength of the applied spatial filtering will result in highly smoothed fields and signal attenuation (and contamination from nearby sources). I suspect that the smoothing may also contribute to the lacking correlation found between basins area and the obtainable NSE: below the typical GRACE resolution the results for the sub-catchments will result in essentially the same GRACE times series.
I see 3 ways out of this conundrum: (1) filter the P-ET data with a spatial filter which has similar smoothing as that was is applied GRACE, (2) try to restore the signal in GRACE (see e.g. Vishawakarma et al. 2016 below or similar references), or (3) leave it as is but write a disclaimer in the discussion on how this can effect the results.
Way (2) (then (1)) would be the best but it would essentially constitute a lot of work. I would find (3) also acceptable in light of the, likely larger, errors introduced through the P and ET estimates.* How are biases treated in R_estimated and R_observed? If biases are large, some of them may strongly influence the NSE, in particular for arid catchments (bias is larger compared to the standard deviation). Maybe a NSE variant can be computed where the biases are removed?
Note that at higher latitudes, errors in the Glacial isostatic adjustment trend-correction for GRACE may also manifest themselves as biases in the storage changes, although I suspect that this is not a big issue as the GIA models generally perform well over the Northern Hemisphere (where most of the considered catchments are)* North American nested-catchments may be disproportionally present in the catchment characteristics comparison. The authors mention that several catchments are nested-catchments of their larger parent groups. Is this somehow compensated when computing the correlations of the catchment characteristics? If not I wonder whether e.g. the Missisippi basin, which is well gauged may be overrepresented in the statistics. The authors may consider limiting the amount of subcatchments which are fed into the statistics, if this is an issue.
* Suggestion: Use histograms (cathment nr. on y versus NSE on x) to visualize and thus understand the distribution of the obtained NSE's. It would also help justify the computation of the correlation of the NSE (essentially a correlation of a correlation), which implicitly assumes that the computed NSE's are normally distributed.
Minor issues
* Please mention how aggregate catchment values are obtained from the gridded values. Are grid areas taken constant, or are they latitude weighted?
* I would like to ask the authors to consider releasing their analysis code, to improve reproducibility for others
References:Dutt Vishwakarma, B., Devaraju, B., Sneeuw, N., 2016. Minimizing the effects of filtering on catchment scale GRACE solutions. Water Resources Research 52, 5868–5890. https://doi.org/10.1002/2016WR018960
Citation: https://doi.org/10.5194/hess-2023-81-RC3 -
RC4: 'Comment on hess-2023-81', Anonymous Referee #4, 19 May 2023
reply
This reviewer appreciates the effort of the authors to investigate sources of variability in closing the terrestrial water balance with remote sensing. The novelty of this study is to link the catchment characteristics (and LCC, LU, and climate class) to the performance of different product combinations. Although the manuscript collects multi-source RS water cycle variable datasets, it is unclear why certain product combinations perform better than others. For example, it is unclear the impact of different spatial resolutions of different products on the final results.
This reviewer suggests the authors put more effort into clarifying the above perspective, for example:
- Clarifying and quantifying the potential impacts of using RS data from different spatial/temporal resolutions;
- In terms of quantifying such impacts, the triple-collocation type of approach could be applied here among multi-source RS data products to understand the relative errors between each other, which will provide further information for identifying the sources of variability in closing the terrestrial water balance using different product combinations.
Minor comments:
- Eq.2 looks exactly the same as Eq.1
- Line 36 "the results point to the importance ..." is confusing
Citation: https://doi.org/10.5194/hess-2023-81-RC4
Claire Irene Michailovsky et al.
Claire Irene Michailovsky et al.
Viewed
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
504 | 123 | 11 | 638 | 0 | 2 |
- HTML: 504
- PDF: 123
- XML: 11
- Total: 638
- BibTeX: 0
- EndNote: 2
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1