the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Achieving water budget closure through physical hydrological processes modelling: insights from a large-sample study
Abstract. Modern hydrology is embracing a data-intensive new era, information from diverse sources is currently providing support for hydrological inferences at broader scales. This results in a plethora of data reliability-related challenges that remain unsolved. The water budget non-closure is a widely reported phenomenon in hydrological and atmospheric systems. Many existing methods aim to enforce water budget closure constraints through data fusion and bias correction approaches, often neglecting the physical interconnections between water budget components. To solve this problem, this study proposes a Multisource Datasets Correction Framework grounded in Physical Hydrological Processes Modelling to enhance water budget closure, called PHPM-MDCF. The concept of decomposing the total water budget residuals into inconsistency and omission residuals is embedded in this framework to account for different residual sources. We examined the efficiency of PHPM-MDCF and the residuals distribution across 475 CONUS basins selected by hydrological simulation reliability. The results indicate that the inconsistency residuals dominate the total water budget residuals, exhibiting highly consistent spatiotemporal patterns. This portion of residuals can be significantly reduced through PHPM-MDCF correction and achieved satisfactory efficiency. The total water budget residuals have decreased by 49 % on average across all basins, with reductions exceeding 80 % in certain basins. The credibility of the correction framework was further verified through several noise experiments. In the end, we explored the potential factors influencing the distribution of residuals and found notable scale effects where residuals decrease with increasing basin area. This emphasizes the importance of carefully evaluating the water balance assumption when employing multisource datasets for hydrological inference in small basins.
- Preprint
(3706 KB) - Metadata XML
-
Supplement
(4063 KB) - BibTeX
- EndNote
Status: open (until 30 Sep 2024)
-
RC1: 'Comment on hess-2024-230', Anonymous Referee #1, 21 Aug 2024
reply
This paper emphasizes the issue of decreasing data confidence at the watershed scale in the era of big data, caused by the non-closure of water budget from multiple data sources. In their analysis, the total water budget residuals were quantitatively decomposed into two components, inconsistency and omission residuals, to account for different drivers of water budget non-closure phenomenon. This is an interesting addition, as previous studies have typically given little or only qualitative consideration to the water imbalance caused by omissions in the original water balance equation.
Attempting to close the water balance is valuable, both hydrological inference under climate change and hydrological modeling require data that satisfy the basic assumption of water balance. The PHPM-MDCF proposed in this work employ hydrological model to constrain multisource datasets, which is reasonable because hydrological models are well-known for their water balance capabilities. The correction also seems to be effective, which comes from the validation with results from large sample basins.
However, there are still some concerns that need to be explained in the response or addressed in the manuscript. The authors have observed the typical seasonal pattern of non-closure phenomena but lack corresponding explanations. In addition, although the authors decomposed the closure residuals into two parts, it seems that only the inconsistency residuals were corrected. What is the rationale behind this approach? Why were the omission residuals not corrected?
In summary, this paper is innovative and aligns with the interests of potential readers of the HESS. After careful consideration and revision, this work has the potential to make a significant contribution to this field. As they described, the underlying Bayesian philosophy is an approach for aligning our understanding of natural processes with real-world observations.
Major concerns:
- Sect. 4.1, the patterns of the Res are of interest to me. The authors identified typical spatial distributions and compared them with previous studies in Sect. 4.4, explaining these patterns through hydro-meteorological conditions and watershed area. From a physical perspective, this explanation is consistent with common sense and is sufficient for me. However, the temporal patterns of the Res are also of interest (Fig. 5). The authors should provide further explanation in this regard or compare them with previous studies, as this could offer valuable insights into the causes of the non-closure of water balance.
- From Fig. 6, it appears that the Res and Resi have been effectively corrected, but the Reso have not changed significantly. Is this merely a specific case for this basin or a general situation? If it is a general situation, dose this imply that PHPM-MDCF only corrects for Resi and does not account for Reso? I believe that further explanation of this treatment could improve the transparency of the methods used in the paper.
- Although the author has clearly articulated the main scientific problem of the paper, there are still areas that could be further improved, which I have listed in the specific issues.
Specific issues
- Line 22-25: According to the results, it seems that humid/wet basins are also prone to larger closure residuals, which needs to be emphasized here.
- Line 36-46: I believe this section should place greater emphasis on the issues of scale mismatch and difficulty in obtaining reference data.
- Line 58-60: It is recommended to cite the review by Beven (2002).
- Line 83-84: It is recommended to add references to support the argument.
- Line 119: "Res" does not appear to be in italics.
- Line 126-127: It is recommended to change it to: “(a) How can the total water budget residuals be quantitatively decomposed into inconsistency and omission residuals based on Eq. (3)?”
- Table1: The “period” should be “Original Period”.
- Figure5: The figure caption seems to contain an error. There are no other subfigures.
- Line 332-334: The argument here doesn't seem to correspond with the figure. Could it be that the figure has been updated?
- Line 418: add “which are” before “implemented”.
- Line424-426: Change the sentence to “The fact that simultaneous corrections of other variables during extreme runoff noise corrections did not significantly differ from OS-based corrections further enhances our confidence in PHPM-MDCF.”
- Line 417: It is necessary to further emphasize the issue of the non-closure phenomenon in humid regions.
- Figure 12: There seems to be a mistake with the R2 values.
- Line 639: Humid regions is a better expression.
Citation: https://doi.org/10.5194/hess-2024-230-RC1 -
AC1: 'Reply on RC1', Dengfeng Liu, 25 Aug 2024
reply
Dear reviewer,
Please find attached our detailed responses to your comments. We are grateful for your insightful feedback, which has been instrumental in refining and improving the manuscript.
-
RC2: 'Comment on hess-2024-230', Anonymous Referee #2, 25 Aug 2024
reply
The paper presents an interesting concept, and its organization and writing are well done. However, I have some differing views regarding the underlying assumptions and principles of the proposed method. My main comments are as follows:
Major Comments:
(1) I do not agree with the two underlying assumptions of the PHPM-MDCF method, nor with the significance of using Equation 4 to calculate omission errors. My main reasons are as follows:
Firstly, the errors in hydrological models are non-negligible and represent the sum of both omission errors and data errors, rather than omission errors alone. The paper assumes that hydrological models have no data errors (inconsistency errors) and only omission errors, which is evidently unreasonable. This assumption is particularly problematic because hydrological models are typically validated against observed runoff, often neglecting the validation of ET (Evapotranspiration) and TWSC (Terrestrial Water Storage Change) simulation accuracy. As a result, using Equation 4 to calculate omission errors is not justified. Due to the complexity of hydrological models and the impact of errors in driving variables, the water imbalance caused by errors in the hydrological model may be substantial. Even if the inputs to the hydrological model are observational data and the model itself is developed based on the principle of water budget, the primary contributor to water imbalance errors between input and output might still be data errors.
Secondly, the total residual is calculated using multiple sources of data, and omission errors are calculated using data that drive the hydrological model as per Equation 4. The difference between these is then used to calculate data inconsistency errors. However, this approach might introduce uncertainties due to data inconsistency.
(2) The validation of results should include a comparison between the PHPM-MDCF method and existing methods. The paper repeatedly emphasizes the inadequacy of current methods in distributing residuals, yet no comparison with existing methods is provided in the results to verify the accuracy of the PHPM-MDCF method. The goal of closing the water budget is to reduce residuals while improving the accuracy of water cycle variables. Therefore, the credibility of the model should not be judged solely by the reduction of residuals (Figure 6). A comparison with existing methods would be more convincing. I strongly recommend supplementing the results with a comparison against existing correction methods, particularly CKF, PR, and MCL methods. For instance, the accuracy of the datasets after calibration using these methods, including P (Precipitation), ET (Evapotranspiration), Q (Runoff), and TWSC (Terrestrial Water Storage Change).
(3) The description of the reference datasets is unclear. It is necessary to specify which observational system datasets were used for P (Precipitation), ET (Evapotranspiration), Q (Runoff), and TWSC (Terrestrial Water Storage Change), and why these datasets can be considered observational data. I recommend clarifying this in the text.
(4) Only a single product was selected for each water cycle variable. I believe that selecting multiple products is crucial for validating the proposed PHPM-MDCF method. This is because different datasets have different sources of error, leading to varying inconsistency residuals depending on the data combination. If the proposed method can be used to identify inconsistency residual error, using multiple data combinations would better verify the reliability of the proposed method in this study.
(5) In Step 2 at line 250, please explain why is it reasonable to allocate residuals based on the difference between simulated values and reference values? It is worth noting that the simulated ET (Evapotranspiration) and TWSC (Terrestrial Water Storage Change) by the hydrological model may not have been validated for accuracy and may contain significant uncertainties. If their errors are used to allocate residuals, substantial uncertainties could lead to unreasonable allocation of residuals to ET and TWSC. The formula for residual allocation needs to be supplemented. Additionally, if Step 3 determines that the residual allocation is unreasonable, can simply halving the residual solve the issue? The underlying principles need to be clarified, or an example should be provided.
(6) Please clearly state the scope and spatiotemporal scale of this study. Most studies investigate water budget closure at the monthly scale rather than the daily scale. Aside from data availability, I believe this is mainly due to larger data errors and the lag effect of hydrological processes at the daily scale. If this study focuses on water budget closure at the daily scale, how were these issues addressed?
(7) At line 320, it is necessary to explain the reasons behind the spatial distribution of Res. How does the difference in spatial patterns indicate that inconsistency residuals and omission residuals are driven by different factors? Please provide a detailed explanation. The most likely reason for Resi and Res having the same spatial pattern is that the former was calculated based on the latter. Their difference from Res0 is due to the different error sources used in calculating Res0 and Res, which does not necessarily demonstrate the reliability of the method for separating inconsistency residuals from omission residuals. Additionally, the residual values in Figure 4 differ significantly from those reported in previous studies. What is the reason for this discrepancy?
(8) In the multi-source dataset correction framework for achieving water budget closure, what is the rationale for setting the initial correction rate to 0.5? Why is the correction rate halved when the model produces unreliable simulations? Is there a potential proportional relationship between the adjustment of the correction rate and the magnitude of bias in unreliable simulations that could allow for more efficient correction rate adjustments? Additionally, what is the basis for setting the conditions for iteration and termination of the correction process as “the inconsistency residuals decreases to 10% of its initial value or the correction rate falls below 4%”?
Minor Comments:
(1) Please provide additional explanation on how Section 4.3.1 demonstrates the reliability of the PHPM-MDCF method.
(2) The paper does not validate the accuracy of the Reso, Resi, and Res separation method in the results.
(3) At line 310, can KGE ≥ −0.41 really indicate that the hydrological model accurately represents the observed hydrological system?
(4) In Figure 5, Reso is closer to 0. Can we attribute this to the principle of water budget in the development of the hydrological model, rather than merely to omission errors? Since Resi = Res - Reso, and Reso is relatively small, it is evident that the values and spatial patterns of Resi and Res are more similar. What does this imply?
(5) Please explain from a theoretical standpoint why the PHPM-MDCF method has such advantages over previous methods: “It suggests that the soft constraints based on physical hydrological processes will not lead to compensatory errors, as seen in traditional methods due to the rigid allocation of water budget residuals.”.
(6) I do not find this statement reasonable: “When the hydrological model calibrated against multiple variables measured by the multisource datasets and achieves reliable performance, we consider the simulation system approaching the measurement system.”.
(7) At line 255, please clarify the data sources for the observed values of P, ET, Q, and TWSC used in this study. Without this information, it is difficult to judge whether the deviation between the simulation system and the measurement system is calculated reasonably.
(8) I personally feel that the discussion in Section 5.1 would be more effective if it were more closely aligned with the scope of this study.
(9) The limitations discussed in Section 5.2 are not explained from a theoretical perspective. I hope that some convincing explanations can be supplemented from this standpoint.
(10) The structure of the article lacks a keywords section. Please add keywords.
(11) Please add references related to the water budget equation.
(12) The text states “as illustrated in Fig. 3” but the caption provided is “Figure 3”. The authors should ensure that all figure captions are consistent with the text descriptions. Please carefully check the rest of the article for similar errors and make the necessary corrections.
Citation: https://doi.org/10.5194/hess-2024-230-RC2 -
AC2: 'Reply on RC2', Dengfeng Liu, 05 Sep 2024
reply
Dear reviewer,
Please find attached our detailed responses to your comments. We sincerely appreciate your thorough review, which has significantly imporved the quality of our manuscript.
-
AC2: 'Reply on RC2', Dengfeng Liu, 05 Sep 2024
reply
-
CC1: 'Comment on hess-2024-230', Hui LIU, 02 Sep 2024
reply
In the context of fast development of measurement techniques, it is our mission to develop methods to leverage the advantages of the measured variables and thus promote the hydrological simulation. This study is a valuable try, which proposed a multisource datasets correction framework, the PHPM-MDCF, to achieve water budget closure with calibration of various variables. This experiment was carried out in 475 COUNS basins, showing great potential to reduce the inconsistency residuals.
Major concerns:
(1) There are PTRMM in both Eq. (5) and (6), then how do we reduce the inconsistency residuals brought by P in the water budget?
(2) This paper focuses on the terrestrial water balance (Eq. (1)). However, whether this framework is applicable to broader water balances, such as atmospheric water balance or local water balance, or if any adjustments are needed?
(3) Uncertainty plays a crucial role, and this study qualitatively address the uncertainty associated with the model structure. A pertinent question is whether this uncertainty can be quantified. While we know that validating this uncertainty through multiple models may be both challenging and unnecessary within the scope of the current work, it would be valuable if the authors could suggest potential avenues for future research and development.Minor comments:
(1) Please check carefully of the text, to avoid grammatic errors, e.g. km2 in Line 183.
(2) Line 75-76: The semantics are repetitive; it is recommended to delete “to ensure data consistency”.
(3) Line 80: “residuals” is more precise than “bias”.
(4) Line 131-134: It seems that these sentences should be changed to the past tense.
(5) Line 165: “One of the main aims” might be more appropriate.
(6) Line 167: This sentence should be in the past tense.
(7) Fig. 3: It is recommended to add further explanations in the caption of Fig. 3.
(8) Line 458: I suggest emphasizing the spatial distribution of water balance closure.
(9) Line 619: A “.” is missing before the “The major”.Citation: https://doi.org/10.5194/hess-2024-230-CC1 -
AC3: 'Reply on CC1', Dengfeng Liu, 05 Sep 2024
reply
Dear reviewer,
Please find attached our detailed responses to your comments. We greately appreciate your interest in our paper and your recognition of the importance of our work. We also thank you for your thorough review, which has significantly enhanced the quality of our manuscript.
-
AC4: 'Reply on CC1', Dengfeng Liu, 05 Sep 2024
reply
Dear reviewer,
Please find attached our detailed responses to your comments. We greately appreciate your interest in our paper and your recognition of the importance of our work. We also thank you for your thorough review, which has significantly enhanced the quality of our manuscript.
-
AC3: 'Reply on CC1', Dengfeng Liu, 05 Sep 2024
reply
Viewed
HTML | XML | Total | Supplement | BibTeX | EndNote | |
---|---|---|---|---|---|---|
299 | 74 | 150 | 523 | 65 | 12 | 3 |
- HTML: 299
- PDF: 74
- XML: 150
- Total: 523
- Supplement: 65
- BibTeX: 12
- EndNote: 3
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1