Technical note: Benchmarking large-domain model performance under sampling uncertainty

Gründemann, Gaby J.; Knoben, Wouter J. M.; Song, Yalan; van Werkhoven, Katie; Clark, Martyn P.

doi:10.5194/hess-30-3439-2026

Articles | Volume 30, issue 11

https://doi.org/10.5194/hess-30-3439-2026

© Author(s) 2026. This work is distributed under
the Creative Commons Attribution 4.0 License.

https://doi.org/10.5194/hess-30-3439-2026

© Author(s) 2026. This work is distributed under
the Creative Commons Attribution 4.0 License.

Articles | Volume 30, issue 11

Technical note

|

05 Jun 2026

Technical note |

| 05 Jun 2026

Technical note: Benchmarking large-domain model performance under sampling uncertainty

Gaby J. Gründemann, Wouter J. M. Knoben, Yalan Song, Katie van Werkhoven, and Martyn P. Clark

Related authors

Comparing multi-model mosaic and multi-model combination methods to simulate streamflow across the contiguous USA

Cyril Thébault, Wouter J. M. Knoben, Nans Addor, Andrew J. Newman, Diana Spieler, Nicolás A. Vásquez, Yalan Song, Gaby J. Gründemann, Shaun Carney, Mukesh Kumar, Katie van Werkhoven, Chaopeng Shen, Andrew W. Wood, and Martyn P. Clark

Hydrol. Earth Syst. Sci., 30, 3945–3977, https://doi.org/10.5194/hess-30-3945-2026,https://doi.org/10.5194/hess-30-3945-2026, 2026

Short summary

Technical note: How many models do we need to simulate hydrologic processes across large geographical domains?

Wouter J. M. Knoben, Ashwin Raman, Gaby J. Gründemann, Mukesh Kumar, Alain Pietroniro, Chaopeng Shen, Yalan Song, Cyril Thébault, Katie van Werkhoven, Andrew W. Wood, and Martyn P. Clark

Hydrol. Earth Syst. Sci., 29, 2361–2375, https://doi.org/10.5194/hess-29-2361-2025,https://doi.org/10.5194/hess-29-2361-2025, 2025

Short summary

Comparing multi-model mosaic and multi-model combination methods to simulate streamflow across the contiguous USA

Cyril Thébault, Wouter J. M. Knoben, Nans Addor, Andrew J. Newman, Diana Spieler, Nicolás A. Vásquez, Yalan Song, Gaby J. Gründemann, Shaun Carney, Mukesh Kumar, Katie van Werkhoven, Chaopeng Shen, Andrew W. Wood, and Martyn P. Clark

Hydrol. Earth Syst. Sci., 30, 3945–3977, https://doi.org/10.5194/hess-30-3945-2026,https://doi.org/10.5194/hess-30-3945-2026, 2026

Short summary

Technical note: High Nash–Sutcliffe Efficiencies conceal poor simulations of interannual variance in seasonal regimes

Sacha W. Ruzzante, Wouter J. M. Knoben, Thorsten Wagener, Tom Gleeson, and Markus Schnorbus

Hydrol. Earth Syst. Sci., 30, 2337–2355, https://doi.org/10.5194/hess-30-2337-2026,https://doi.org/10.5194/hess-30-2337-2026, 2026

Short summary

Same Streamflow, Different Water Stories: The Hidden Impacts of Streamflow-Only Calibration in Distributed Hydrological Modeling

Nicolás A. Vásquez, Pablo A. Mendoza, Wouter Knoben, Martyn Clark, Tricia Stadnyk, and Naoki Mizukami

EGUsphere, https://doi.org/10.5194/egusphere-2026-1363,https://doi.org/10.5194/egusphere-2026-1363, 2026

Short summary

HESS Opinions: Applied hydrologic models in the era of machine learning – retain, revamp, reconcile, or replace?

Delanie Williams, Mukesh Kumar, Katie van Werkhoven, Martyn Clark, Christopher Wilson, and Paul Miller

EGUsphere, https://doi.org/10.5194/egusphere-2026-583,https://doi.org/10.5194/egusphere-2026-583, 2026

Short summary

Propagating Meteorological Uncertainty in Physically Based Mountain Snow Simulations

David R. Casson, Guoqiang Tang, Nicolás Vásquez, Andrew W. Wood, and Martyn P. Clark

EGUsphere, https://doi.org/10.5194/egusphere-2025-6066,https://doi.org/10.5194/egusphere-2025-6066, 2026

Short summary

Catchment Attributes and MEteorology for Large-Sample SPATially distributed analysis (CAMELS-SPAT): streamflow observations, forcing data and geospatial data for hydrologic studies across North America

Wouter J. M. Knoben, Cyril Thébault, Kasra Keshavarz, Laura Torres-Rojas, Nathaniel W. Chaney, Alain Pietroniro, and Martyn P. Clark

Hydrol. Earth Syst. Sci., 29, 5791–5833, https://doi.org/10.5194/hess-29-5791-2025,https://doi.org/10.5194/hess-29-5791-2025, 2025

Short summary

Improving streamflow simulation through machine learning-powered data integration and its potential for forecasting in the Western U.S.

Yuan Yang, Ming Pan, Dapeng Feng, Mu Xiao, Taylor Dixon, Robert Hartman, Chaopeng Shen, Yalan Song, Agniv Sengupta, Luca Delle Monache, and F. Martin Ralph

Hydrol. Earth Syst. Sci., 29, 5453–5476, https://doi.org/10.5194/hess-29-5453-2025,https://doi.org/10.5194/hess-29-5453-2025, 2025

Short summary

Technical note: How many models do we need to simulate hydrologic processes across large geographical domains?

Wouter J. M. Knoben, Ashwin Raman, Gaby J. Gründemann, Mukesh Kumar, Alain Pietroniro, Chaopeng Shen, Yalan Song, Cyril Thébault, Katie van Werkhoven, Andrew W. Wood, and Martyn P. Clark

Hydrol. Earth Syst. Sci., 29, 2361–2375, https://doi.org/10.5194/hess-29-2361-2025,https://doi.org/10.5194/hess-29-2361-2025, 2025

Short summary

On the predictability of turbulent fluxes from land: PLUMBER2 MIP experimental description and preliminary results

Gab Abramowitz, Anna Ukkola, Sanaa Hobeichi, Jon Cranko Page, Mathew Lipson, Martin G. De Kauwe, Samuel Green, Claire Brenner, Jonathan Frame, Grey Nearing, Martyn Clark, Martin Best, Peter Anthoni, Gabriele Arduini, Souhail Boussetta, Silvia Caldararu, Kyeungwoo Cho, Matthias Cuntz, David Fairbairn, Craig R. Ferguson, Hyungjun Kim, Yeonjoo Kim, Jürgen Knauer, David Lawrence, Xiangzhong Luo, Sergey Malyshev, Tomoko Nitta, Jerome Ogee, Keith Oleson, Catherine Ottlé, Phillipe Peylin, Patricia de Rosnay, Heather Rumbold, Bob Su, Nicolas Vuichard, Anthony P. Walker, Xiaoni Wang-Faivre, Yunfei Wang, and Yijian Zeng

Biogeosciences, 21, 5517–5538, https://doi.org/10.5194/bg-21-5517-2024,https://doi.org/10.5194/bg-21-5517-2024, 2024

Short summary

FROSTBYTE: a reproducible data-driven workflow for probabilistic seasonal streamflow forecasting in snow-fed river basins across North America

Louise Arnal, Martyn P. Clark, Alain Pietroniro, Vincent Vionnet, David R. Casson, Paul H. Whitfield, Vincent Fortin, Andrew W. Wood, Wouter J. M. Knoben, Brandi W. Newton, and Colleen Walford

Hydrol. Earth Syst. Sci., 28, 4127–4155, https://doi.org/10.5194/hess-28-4127-2024,https://doi.org/10.5194/hess-28-4127-2024, 2024

Short summary

Modular Assessment of Rainfall–Runoff Models Toolbox (MARRMoT) v2.1: an object-oriented implementation of 47 established hydrological models for improved speed and readability

Luca Trotter, Wouter J. M. Knoben, Keirnan J. A. Fowler, Margarita Saft, and Murray C. Peel

Geosci. Model Dev., 15, 6359–6369, https://doi.org/10.5194/gmd-15-6359-2022,https://doi.org/10.5194/gmd-15-6359-2022, 2022

Short summary

Cited articles

Abdelkader, M., Temimi, M., and Ouarda, T. B.: Assessing the National Water Model’s Streamflow Estimates Using a Multi-Decade Retrospective Dataset across the Contiguous United States, Water, 15, 2319, https://doi.org/10.3390/w15132319, 2023. a

Arheimer, B., Pimentel, R., Isberg, K., Crochemore, L., Andersson, J. C. M., Hasan, A., and Pineda, L.: Global catchment modelling using World-Wide HYPE (WWH), open data, and stepwise parameter estimation, Hydrol. Earth Syst. Sci., 24, 535–559, https://doi.org/10.5194/hess-24-535-2020, 2020. a

Best, M. J., Abramowitz, G., Johnson, H. R., Pitman, A. J., Balsamo, G., Boone, A., Cuntz, M., Decharme, B., Dirmeyer, P. A., Dong, J., Ek, M., Guo, Z., Haverd, V., Van Den Hurk, B. J. J., Nearing, G. S., Pak, B., Peters-Lidard, C., Santanello, J. A., Stevens, L., and Vuichard, N.: The Plumbing of Land Surface Models: Benchmarking Model Performance, J. Hydrometeorol., 16, 1425–1442, https://doi.org/10.1175/JHM-D-14-0158.1, 2015. a

Beven, K.: Benchmarking hydrological models for an uncertain future, Hydrol. Process., 37, e14882, https://doi.org/10.1002/hyp.14882, 2023. a

Clark, M. P. and Shook, K.: gumboot: Bootstrap Analyses of Sampling Uncertainty in Goodness-of-Fit Statistics, R package version 1.0.1, https://github.com/CH-Earth/gumboot, (last access: 4 September 2024), 2021. a, b