Articles | Volume 30, issue 11
https://doi.org/10.5194/hess-30-3439-2026
https://doi.org/10.5194/hess-30-3439-2026
Technical note
 | 
05 Jun 2026
Technical note |  | 05 Jun 2026

Technical note: Benchmarking large-domain model performance under sampling uncertainty

Gaby J. Gründemann, Wouter J. M. Knoben, Yalan Song, Katie van Werkhoven, and Martyn P. Clark

Related authors

Comparing multi-model mosaic and multi-model combination methods to simulate streamflow across the contiguous USA
Cyril Thébault, Wouter J. M. Knoben, Nans Addor, Andrew J. Newman, Diana Spieler, Nicolás A. Vásquez, Yalan Song, Gaby J. Gründemann, Shaun Carney, Mukesh Kumar, Katie van Werkhoven, Chaopeng Shen, Andrew W. Wood, and Martyn P. Clark
EGUsphere, https://doi.org/10.5194/egusphere-2025-6083,https://doi.org/10.5194/egusphere-2025-6083, 2026
Short summary
Technical note: How many models do we need to simulate hydrologic processes across large geographical domains?
Wouter J. M. Knoben, Ashwin Raman, Gaby J. Gründemann, Mukesh Kumar, Alain Pietroniro, Chaopeng Shen, Yalan Song, Cyril Thébault, Katie van Werkhoven, Andrew W. Wood, and Martyn P. Clark
Hydrol. Earth Syst. Sci., 29, 2361–2375, https://doi.org/10.5194/hess-29-2361-2025,https://doi.org/10.5194/hess-29-2361-2025, 2025
Short summary

Cited articles

Abdelkader, M., Temimi, M., and Ouarda, T. B.: Assessing the National Water Model’s Streamflow Estimates Using a Multi-Decade Retrospective Dataset across the Contiguous United States, Water, 15, 2319, https://doi.org/10.3390/w15132319, 2023. a
Arheimer, B., Pimentel, R., Isberg, K., Crochemore, L., Andersson, J. C. M., Hasan, A., and Pineda, L.: Global catchment modelling using World-Wide HYPE (WWH), open data, and stepwise parameter estimation, Hydrol. Earth Syst. Sci., 24, 535–559, https://doi.org/10.5194/hess-24-535-2020, 2020. a
Best, M. J., Abramowitz, G., Johnson, H. R., Pitman, A. J., Balsamo, G., Boone, A., Cuntz, M., Decharme, B., Dirmeyer, P. A., Dong, J., Ek, M., Guo, Z., Haverd, V., Van Den Hurk, B. J. J., Nearing, G. S., Pak, B., Peters-Lidard, C., Santanello, J. A., Stevens, L., and Vuichard, N.: The Plumbing of Land Surface Models: Benchmarking Model Performance, J. Hydrometeorol., 16, 1425–1442, https://doi.org/10.1175/JHM-D-14-0158.1, 2015. a
Beven, K.: Benchmarking hydrological models for an uncertain future, Hydrol. Process., 37, e14882, https://doi.org/10.1002/hyp.14882, 2023. a
Clark, M. P. and Shook, K.: gumboot: Bootstrap Analyses of Sampling Uncertainty in Goodness-of-Fit Statistics, R package version 1.0.1, https://github.com/CH-Earth/gumboot, (last access: 4 September 2024), 2021. a, b
Download
Short summary
The quality of large-domain hydrologic model simulations is often quantified with so-called accuracy metrics. Here we use simple benchmarks to provide relevant context for these accuracy metrics. Results show that areas where the model cannot beat the benchmarks do not always align with areas where the accuracy metrics are low. This suggests that model improvements are possible in regions that under more typical model evaluation approaches (i.e., without benchmarks) might not be obvious.
Share