Articles | Volume 30, issue 11
https://doi.org/10.5194/hess-30-3439-2026
© Author(s) 2026. This work is distributed under the Creative Commons Attribution 4.0 License.
Technical note: Benchmarking large-domain model performance under sampling uncertainty
Download
- Final revised paper (published on 05 Jun 2026)
- Supplement to the final revised paper
- Preprint (discussion started on 02 Feb 2026)
- Supplement to the preprint
Interactive discussion
Status: closed
Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor
| : Report abuse
-
RC1: 'Comment on egusphere-2025-6460', Anonymous Referee #1, 13 Mar 2026
- AC1: 'Reply on RC1', Wouter Knoben, 06 Apr 2026
-
RC2: 'Comment on egusphere-2025-6460', Anonymous Referee #2, 16 Mar 2026
- AC2: 'Reply on RC2', Wouter Knoben, 06 Apr 2026
Peer review completion
AR – Author's response | RR – Referee report | ED – Editor decision | EF – Editorial file upload
ED: Publish subject to revisions (further review by editor and referees) (15 Apr 2026) by Ralf Loritz
AR by Wouter Knoben on behalf of the Authors (17 Apr 2026)
Author's response
Author's tracked changes
Manuscript
ED: Referee Nomination & Report Request started (20 Apr 2026) by Ralf Loritz
RR by Anonymous Referee #1 (20 May 2026)
RR by Anonymous Referee #2 (20 May 2026)
ED: Publish as is (21 May 2026) by Ralf Loritz
AR by Wouter Knoben on behalf of the Authors (21 May 2026)
Review of "Technical note: Separating signal from noise in large-domain hydrologic model evaluation - Benchmarking model performance" by Gründemann et al.
The technical note promotes the use of various benchmarks for model performance evaluation, particularly in a large-domain setting (or for large-sample studies) and includes a quantification of sampling uncertainty from different periods through bootstrapping of different hydrological years.
The manuscript is clearly, concisely written and well structured.
Before I can recommend publication, however, I would like to raise the following comments:
major comments:
- Since this note is all about the benchmarks, I thing two ingredients are missing:
1) Please add the benchmarks and their description to the main text and not just to the supplementary material and ensure that the abbreviations match those in the figures (or vice versa)
2) Each of the benchmarks is essentially a test of how well a model should minimally perform regarding a specific aspect. This is not discussed in detail in the manuscript, but I think providing some examples would really help promoting the use of various benchmarks from very simple ones targeting maybe the water balance to more complex ones. I would suggest extending the dicussion and conclusions accordingly and as well as adding this explanation regarding which aspect they are benchmarking in the table describing them.
- there is the sampling uncertainty, there is the model uncertainty, but what makes up these metrics are also affected by the uncertainty inherent in the observations. It would be worth reminding the reader that these can be considerably large and influential on the performance metric. For instance, for discharge, there is the rating curve uncertainty that is not constant but varies with the flows (see for instance Westerberg et al., 2011)
Line by line comments:
Abstract
L4 name at least some examples of what is meant by a simple benchmark, i.e. make it more specific
L5-7 these results are valid for the study region and basins and but not for other regions, please add that the data set is from the United Stated and maybe add even NWM
L9 ", though accounting..." this part of the sentence is not clear. Please rephrase.
Main text
L21-25 the words "score", "statistics","efficiency", "metrics" are used and they are used interchangeably. I would suggest using only one, where this is applicable and using it consistently throughout the manuscript
L22 "and more" remove (there is already "for example" in the same sentence)
L34 ... or further checks are required
L40 "can be " -> "is"
L120 since the benchmarks are the core of this note, Table S1 should be moved to the main text and the abbreviations adjusted accordingly.
L126 "as" -> "that"
L239 Supporting
L239 abbreviation was already introduced in L25
L257 "perform" missing?
L262 which benchmark? please add
L284 remove "and" before "snow"
Figure 2 in the upper panel the lines are not distinguishable in b&w print
Figure 3 Please add the written-out benchmarks in the caption so that the figure can stand-alone.
References
Westerberg, I., Guerrero, J. L., Seibert, J., Beven, K. J., & Halldin, S. (2011). Stage‐discharge uncertainty derived with a non‐stationary rating curve in the Choluteca River, Honduras. Hydrological Processes, 25(4), 603-613.