I want to thank the authors for thoroughly addressing the suggestions that I provided on an earlier version of this manuscript. Despite it has been improved considerably, I have a suite of specific comments that I would like the authors to address before this paper is accepted for publication.
Specific comments
1. L3: the authors clarified that they are referring to structural and parametric model uncertainties with “model deficiencies” in the response document, though not in the revised manuscript. Please include that clarification in the abstract.
2. L35: the authors clarified what they mean with “ensemble increments” in the response document, but not in the revised manuscript. Please include that clarification here.
3. L151, L185 and everywhere else: I do not think that a parameter perturbation approach is the same as a multiphysics approach. In order to achieve the latter, they should have used multiple model structures, which is indeed possible with Noah-MP and other modular modeling platforms like FUSE (Clark et al. 2008), SUMMA (Clark et al. 2015a,b), MARRMoT (Knoben et al. 2019), Raven (Craig et al. 2020), etc. Although I did not catch this in my first review, I think this is a critical point that needs to be revised to avoid confusion among readers.
4. L153-154: do you mean that increasing the variability in the ensemble can help your DA scheme to effectively estimate hydrological model states. If yes, please rewrite the text to reflect this.
5. L168-169: I recommend adding the calibration objective function as an equation in the paper. This is relevant information for the sake of reproducibility.
6. Equation 6: can the authors please clarify if (and how) this equation relates to the calculation of the model error covariance matrix P, and the observation error covariance matrix R?
7. Figure 2: it is quite odd having panel (a) on the right and (b) on the left. Please consider switching the order of those panels.
8. L304, 374, 416, 422, 605, 621 and everywhere else in the manuscript: please revise the use of the word “significantly”. This is a point I raised in my first review, and I think that needs to be addressed more carefully.
9. L393: please re-word this sentence to make it clear that NSE and KGE share the same range of variation.
10. L422-423: Are the authors referring to the shift in performance metrics between panels (a) and (b)? Note that panel (b) contains results for alpha = 0.1 (i.e., 10% weight to the dynamic component of the covariance, according to equation 7). Then why did the authors write that “outstanding performance can be achieved by incorporating only 10% of the hybrid covariance from the climatology”? Should it be the opposite (i.e., 10% from the dynamic component)?
Related to this point, the authors refer to “The introduction of climatological information with alpha = 0.1” in L434. However, according to equation (7) alpha = 0.1 means that you are giving a 10% weight to the dynamic component, and 90% to the climatological component (i.e., alpha = 0.1 means that the authors are introducing dynamic information and not the other way around). Based on this, I strongly recommend revisiting this interpretation of weights and/or correct equation (7).
11. L442: the authors refer to an “overestimation”, though it seems that there is underestimation because a large fraction of observations (nearly 40%) is larger than the simulated ensemble members according to the rank histogram. Please revise and re-word if needed.
12. The authors refer to an “improved estimate of the uncertainty”, though what they are getting is an improved ensemble spread based on the relative range of variation of the observations.
13. L457-458: This is still VERY hard to visualize from Figure 9, especially in the left panel. Consider decreasing the size of symbols for the hybrid configurations.
14. L461: the authors state that “The EnKF-OI schemes yield comparable correlations, with alpha = 0.5 consistently offering the best performance in both domains”. I do not think this is true for FL, where alpha = 0.7 offers the highest correlation value (according to Table 2).
15. Figure 13: all the descriptions provided by the authors regarding this figure are still very hard to visualize. Because of this and the length of the manuscript (which contains a tremendous amount of information), I would consider removing these results or sending them to supplementary material. In any case, the authors should make the final choice on this matter.
16. L556-558: please revise this sentence, because I do not see the consistent improvement that the authors describe when comparing red and blue boxplots, especially in panel (a).
17. L558: what is an outstanding score?
Suggested edits
18. L141: “using level pool scheme” -> “using a level pool scheme”.
19. L142-143: this sentence reads repetitive. Maybe just write "we use a channel, reservoir, and conceptual groundwater submodel of the NWM, following...".
20. L157-158: please place “distributions, e.g., gamma, inverse-gamma and exponential” between parentheses.
21. Caption of Figure 1: “Dotted box” -> “The dotted box”.
22. L170: “Summary statistics of the model statistics” -> “Summary model statistics”.
23. L170-171: awkward sentence. Maybe rewrite as “It should be noted that some model biases remain after the calibration process”.
24. L422 and everywhere else: please delete the word “outstanding”. There is no need to use bombastic adjectives.
25. 565: “until now” -> “so far”.
References
Clark, M. P., A. G. Slater, D. E. Rupp, R. A. Woods, J. A. Vrugt, H. V. Gupta, T. Wagener, and L. E. Hay, 2008: Framework for Understanding Structural Errors (FUSE): A modular framework to diagnose differences between hydrological models. Water Resour. Res., 44, W00B02, doi:10.1029/2007WR006735.
——, and Coauthors, 2015a: A unified approach for process‐based hydrologic modeling: 1. Modeling concept. Water Resour. Res., 51, 2498–2514, doi:10.1002/2015WR017198.
Clark, M. P., and Coauthors, 2015b: A unified approach for process-based hydrologic modeling: 2. Model implementation and case studies. Water Resour. Res., doi:10.1002/2015WR017200.
Craig, J. R., and Coauthors, 2020: Flexible watershed simulation with the Raven hydrological modelling framework. Environ. Model. Softw., 129, 104728, doi:10.1016/j.envsoft.2020.104728. https://doi.org/10.1016/j.envsoft.2020.104728.
Knoben, W. J. M., J. E. Freer, K. J. A. Fowler, M. C. Peel, and R. A. Woods, 2019: Modular Assessment of Rainfall–Runoff Models Toolbox (MARRMoT) v1.2: an open-source, extendable framework providing implementations of 46 conceptual hydrologic models as continuous state-space formulations. Geosci. Model Dev., 12, 2463–2480, doi:10.5194/gmd-12-2463-2019. |