the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Combining uncertainty quantification and entropy-inspired concepts into a single objective function for rainfall-runoff model calibration
Abstract. A novel metric for rainfall-runoff model calibration and performance assessment is proposed. By integrating entropy and mutual information concepts as well as uncertainty quantification through BLUECAT (likelihood-free approach), RUMI (Ratio of Uncertainty to Mutual Information) offers a robust framework for quantifying the shared information between observed and simulated stream flows. RUMI’s capabilities to calibrate rainfall-runoff models is demonstrated using the GR4J rainfall-runoff model over 99 catchments from various macroclimatic zones, ensuring a comprehensive evaluation. Four additional performance metrics and 50 hydrological signatures were also used for performance assessment. Key findings indicate that RUMI-based simulations provide more consistent and reliable results compared to the traditional Kling-Gupta Efficiency (KGE), with improved performance across multiple metrics and reduced variability. Additionally, RUMI includes uncertainty quantification as a core computation step, offering a more holistic view of model performance. This study highlights the potential of RUMI to enhance hydrological modelling through better performance metrics and uncertainty assessment, contributing to more accurate and reliable hydrological predictions.
- Preprint
(5571 KB) - Metadata XML
- BibTeX
- EndNote
Status: open (until 04 Mar 2025)
-
RC1: 'Comment on hess-2024-389', Keith Beven, 30 Jan 2025
reply
This is a really nice paper but needs a few issues of clarification in both the introduction and in the presentation of the methodology before publication in line with the comments in the MSS.
-
RC2: 'Comment on hess-2024-389 - Good paper', Salvatore Grimaldi, 13 Feb 2025
reply
The manuscript proposes an innovative metric (named RUMI) to support the calibration of hydrological models, based on a combination of two approaches: BLUECAT and Mutual Information (MI). The authors evaluate the performance of the proposed metric through an extensive case study analysis, comparing RUMI to an established metric, Kling-Gupta Efficiency (KGE).
The topic is particularly interesting, and the proposed approach is promising due to its potential for providing more effective calibration and incorporating uncertainty evaluation through the BLUECAT component.
While I am inclined to recommend publication, I have several comments and suggestions for the authors, detailed below.
General Comments
The main issue to address concerns the paper’s structure and organization.
Although the abstract clearly conveys the manuscript’s aim and content, the subsequent sections may leave the reader disoriented.Section-Specific Comments
Introduction:
The Introduction should be revised to focus more on the calibration problem, highlighting available metrics and their limitations. The new proposed metric should then be introduced, emphasizing its innovative aspects and added value. The two components, BLUECAT and Mutual Information, could be briefly mentioned at the end.Section 2.1:
The information on where the GR4J model is available could be moved to the Appendix, as it is not directly relevant to the manuscript. Instead, it would be more helpful to provide additional details about the model itself. Currently, we only know that it has four parameters and two storage modules. The calibration strategy is omitted and should be described, particularly how the two metrics (RUMI and KGE) are applied in the calibration process. It is also essential to confirm that the model is suitable for daily-scale applications and all spatial scales. For example, the reader might question whether the GR4J model performs well at a daily scale for a 35 km² watershed, which could undermine the relevance of the evaluation and comparison.Section 2.3:
The section title is not informative. A short introduction should be added to remind readers that RUMI is a combination of BLUECAT and MI.
Line 138: The text refers to a flowchart that is missing. Please include it or clarify the reference.Section 2.4:
Both the section title and its content should be revised for clarity. This section contains mixed information on data (which might be better placed in Section 2.2), the comparison metric KGE, and the evaluation methods. These topics should be better organized and clearly separated.Section 3 (Results):
This section could be better organized by providing a more detailed explanation of the plots. Consider using more informative and communicative visualizations.Lines 220–230: These lines are somewhat confusing. It might be more effective to first show the validation dataset and then focus on two specific years.
Table 2: Is this table necessary? Perhaps violin plots in Figure 4 would suffice. If kept, the table should include comments on specific values (e.g., 1755 and other notable values).
Table 3: The meaning of the values presented is unclear and needs further clarification.
Figure 5: The phrase "only for illustration purposes" in the caption is confusing. Consider keeping only subplots c.1, c.2, d.1, and d.2, as they show a clear performance difference between the two methods.
Section 4
This section needs to be improved for clarity and coherence. Emphasize the significance of the results and their practical implications.Section 5 :
The Conclusions should be rewritten. The first sentence belongs at the end, as a concluding remark. The section should summarize key findings and highlight future research directions.Additional Suggestions
To thoroughly evaluate the proposed RUMI, consider conducting a virtual test in which parameters are assigned to the model and then calibrated using both metrics (RUMI and KGE). The comparison would then focus on the parameter estimates rather than the streamflow time series. This is a common approach that helps isolate model-independent performance and avoids limiting the conclusions to the GR4J model.Citation: https://doi.org/10.5194/hess-2024-389-RC2
Data sets
Supplement to: Alvarez-Garreton, C et al. (2018): The CAMELS-CL dataset: catchment attributes and meteorology for large sample studies - Chile dataset. Hydrology and Earth System Sciences, 22(11), 5817-5846, https://doi.org/10.5194/hess-22-5817-2018 Camila Alvarez-Garreton et al. https://doi.org/10.1594/PANGAEA.894885
Model code and software
Codes for “Combining uncertainty quantification and entropy-inspired concepts into a single objective function for rainfall-runoff model calibration.” Alonso Pizarro, Demetris Koutsoyiannis, and Alberto Montanari https://doi.org/10.17605/OSF.IO/93N4R
Viewed
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
182 | 24 | 5 | 211 | 3 | 3 |
- HTML: 182
- PDF: 24
- XML: 5
- Total: 211
- BibTeX: 3
- EndNote: 3
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1