Technical Note: The divide and measure nonconformity – how metrics can mislead when we evaluate on different data partitions

Klotz, Daniel; Gauch, Martin; Kratzert, Frederik; Nearing, Grey; Zscheischler, Jakob

doi:10.5194/hess-28-3665-2024

Articles | Volume 28, issue 15

https://doi.org/10.5194/hess-28-3665-2024

© Author(s) 2024. This work is distributed under
the Creative Commons Attribution 4.0 License.

https://doi.org/10.5194/hess-28-3665-2024

© Author(s) 2024. This work is distributed under
the Creative Commons Attribution 4.0 License.

Articles | Volume 28, issue 15

Technical note

|

13 Aug 2024

Technical note |

| 13 Aug 2024

Technical Note: The divide and measure nonconformity – how metrics can mislead when we evaluate on different data partitions

Daniel Klotz, Martin Gauch, Frederik Kratzert, Grey Nearing, and Jakob Zscheischler

Related authors

The need for uncertainty: why probabilistic LSTMs are key to improving flood predictions and enabling learned warning rules

Sanika Baste, Sebastian Lerch, Daniel Klotz, and Ralf Loritz

EGUsphere, https://doi.org/10.5194/egusphere-2026-469,https://doi.org/10.5194/egusphere-2026-469, 2026

This preprint is open for discussion and under review for Hydrology and Earth System Sciences (HESS).

Short summary

How to deal w___ missing input data

Martin Gauch, Frederik Kratzert, Daniel Klotz, Grey Nearing, Deborah Cohen, and Oren Gilon

Hydrol. Earth Syst. Sci., 29, 6221–6235, https://doi.org/10.5194/hess-29-6221-2025,https://doi.org/10.5194/hess-29-6221-2025, 2025

Short summary

Unveiling the limits of deep learning models in hydrological extrapolation tasks

Sanika Baste, Daniel Klotz, Eduardo Acuña Espinoza, Andras Bardossy, and Ralf Loritz

Hydrol. Earth Syst. Sci., 29, 5871–5891, https://doi.org/10.5194/hess-29-5871-2025,https://doi.org/10.5194/hess-29-5871-2025, 2025

Short summary

Technical note: An approach for handling multiple temporal frequencies with different input dimensions using a single LSTM cell

Eduardo Acuña Espinoza, Frederik Kratzert, Daniel Klotz, Martin Gauch, Manuel Álvarez Chaves, Ralf Loritz, and Uwe Ehret

Hydrol. Earth Syst. Sci., 29, 1749–1758, https://doi.org/10.5194/hess-29-1749-2025,https://doi.org/10.5194/hess-29-1749-2025, 2025

Short summary

Analyzing the generalization capabilities of a hybrid hydrological model for extrapolation to extreme events

Eduardo Acuña Espinoza, Ralf Loritz, Frederik Kratzert, Daniel Klotz, Martin Gauch, Manuel Álvarez Chaves, and Uwe Ehret

Hydrol. Earth Syst. Sci., 29, 1277–1294, https://doi.org/10.5194/hess-29-1277-2025,https://doi.org/10.5194/hess-29-1277-2025, 2025

Short summary

Technical note: Data assimilation and autoregression for using near-real-time streamflow observations in long short-term memory networks

Grey S. Nearing, Daniel Klotz, Jonathan M. Frame, Martin Gauch, Oren Gilon, Frederik Kratzert, Alden Keefe Sampson, Guy Shalev, and Sella Nevo

Hydrol. Earth Syst. Sci., 26, 5493–5513, https://doi.org/10.5194/hess-26-5493-2022,https://doi.org/10.5194/hess-26-5493-2022, 2022

Short summary

The Great Lakes Runoff Intercomparison Project Phase 4: the Great Lakes (GRIP-GL)

Juliane Mai, Hongren Shen, Bryan A. Tolson, Étienne Gaborit, Richard Arsenault, James R. Craig, Vincent Fortin, Lauren M. Fry, Martin Gauch, Daniel Klotz, Frederik Kratzert, Nicole O'Brien, Daniel G. Princz, Sinan Rasiya Koya, Tirthankar Roy, Frank Seglenieks, Narayan K. Shrestha, André G. T. Temgoua, Vincent Vionnet, and Jonathan W. Waddell

Hydrol. Earth Syst. Sci., 26, 3537–3572, https://doi.org/10.5194/hess-26-3537-2022,https://doi.org/10.5194/hess-26-3537-2022, 2022

Short summary

More articles (9)

The need for uncertainty: why probabilistic LSTMs are key to improving flood predictions and enabling learned warning rules

Sanika Baste, Sebastian Lerch, Daniel Klotz, and Ralf Loritz

EGUsphere, https://doi.org/10.5194/egusphere-2026-469,https://doi.org/10.5194/egusphere-2026-469, 2026

This preprint is open for discussion and under review for Hydrology and Earth System Sciences (HESS).

Short summary

How to deal w___ missing input data

Martin Gauch, Frederik Kratzert, Daniel Klotz, Grey Nearing, Deborah Cohen, and Oren Gilon

Hydrol. Earth Syst. Sci., 29, 6221–6235, https://doi.org/10.5194/hess-29-6221-2025,https://doi.org/10.5194/hess-29-6221-2025, 2025

Short summary

Unveiling the limits of deep learning models in hydrological extrapolation tasks

Sanika Baste, Daniel Klotz, Eduardo Acuña Espinoza, Andras Bardossy, and Ralf Loritz

Hydrol. Earth Syst. Sci., 29, 5871–5891, https://doi.org/10.5194/hess-29-5871-2025,https://doi.org/10.5194/hess-29-5871-2025, 2025

Short summary

Wikimpacts 1.0: A new global climate impact database based on automated information extraction from Wikipedia

Ni Li, Wim Thiery, Shorouq Zahra, Mariana Madruga de Brito, Koffi Worou, Murathan Kurfalı, Seppe Lampe, Paul Muñoz, Clare Flynn, Camila Trigoso, Joakim Nivre, Jakob Zscheischler, and Gabriele Messori

EGUsphere, https://doi.org/10.5194/egusphere-2025-4891,https://doi.org/10.5194/egusphere-2025-4891, 2025

Short summary

GRDC-Caravan: extending Caravan with data from the Global Runoff Data Centre

Claudia Färber, Henning Plessow, Simon A. Mischel, Frederik Kratzert, Nans Addor, Guy Shalev, and Ulrich Looser

Earth Syst. Sci. Data, 17, 4613–4625, https://doi.org/10.5194/essd-17-4613-2025,https://doi.org/10.5194/essd-17-4613-2025, 2025

Short summary

Technical note: An approach for handling multiple temporal frequencies with different input dimensions using a single LSTM cell

Eduardo Acuña Espinoza, Frederik Kratzert, Daniel Klotz, Martin Gauch, Manuel Álvarez Chaves, Ralf Loritz, and Uwe Ehret

Hydrol. Earth Syst. Sci., 29, 1749–1758, https://doi.org/10.5194/hess-29-1749-2025,https://doi.org/10.5194/hess-29-1749-2025, 2025

Short summary

On the predictability of turbulent fluxes from land: PLUMBER2 MIP experimental description and preliminary results

Gab Abramowitz, Anna Ukkola, Sanaa Hobeichi, Jon Cranko Page, Mathew Lipson, Martin G. De Kauwe, Samuel Green, Claire Brenner, Jonathan Frame, Grey Nearing, Martyn Clark, Martin Best, Peter Anthoni, Gabriele Arduini, Souhail Boussetta, Silvia Caldararu, Kyeungwoo Cho, Matthias Cuntz, David Fairbairn, Craig R. Ferguson, Hyungjun Kim, Yeonjoo Kim, Jürgen Knauer, David Lawrence, Xiangzhong Luo, Sergey Malyshev, Tomoko Nitta, Jerome Ogee, Keith Oleson, Catherine Ottlé, Phillipe Peylin, Patricia de Rosnay, Heather Rumbold, Bob Su, Nicolas Vuichard, Anthony P. Walker, Xiaoni Wang-Faivre, Yunfei Wang, and Yijian Zeng

Biogeosciences, 21, 5517–5538, https://doi.org/10.5194/bg-21-5517-2024,https://doi.org/10.5194/bg-21-5517-2024, 2024

Short summary

Technical note: Data assimilation and autoregression for using near-real-time streamflow observations in long short-term memory networks

Grey S. Nearing, Daniel Klotz, Jonathan M. Frame, Martin Gauch, Oren Gilon, Frederik Kratzert, Alden Keefe Sampson, Guy Shalev, and Sella Nevo

Hydrol. Earth Syst. Sci., 26, 5493–5513, https://doi.org/10.5194/hess-26-5493-2022,https://doi.org/10.5194/hess-26-5493-2022, 2022

Short summary

Flood forecasting with machine learning models in an operational framework

Sella Nevo, Efrat Morin, Adi Gerzi Rosenthal, Asher Metzger, Chen Barshai, Dana Weitzner, Dafi Voloshin, Frederik Kratzert, Gal Elidan, Gideon Dror, Gregory Begelman, Grey Nearing, Guy Shalev, Hila Noga, Ira Shavitt, Liora Yuklea, Moriah Royz, Niv Giladi, Nofar Peled Levi, Ofir Reich, Oren Gilon, Ronnie Maor, Shahar Timnat, Tal Shechter, Vladimir Anisimov, Yotam Gigi, Yuval Levin, Zach Moshe, Zvika Ben-Haim, Avinatan Hassidim, and Yossi Matias

Hydrol. Earth Syst. Sci., 26, 4013–4032, https://doi.org/10.5194/hess-26-4013-2022,https://doi.org/10.5194/hess-26-4013-2022, 2022

Short summary

The Great Lakes Runoff Intercomparison Project Phase 4: the Great Lakes (GRIP-GL)

Juliane Mai, Hongren Shen, Bryan A. Tolson, Étienne Gaborit, Richard Arsenault, James R. Craig, Vincent Fortin, Lauren M. Fry, Martin Gauch, Daniel Klotz, Frederik Kratzert, Nicole O'Brien, Daniel G. Princz, Sinan Rasiya Koya, Tirthankar Roy, Frank Seglenieks, Narayan K. Shrestha, André G. T. Temgoua, Vincent Vionnet, and Jonathan W. Waddell

Hydrol. Earth Syst. Sci., 26, 3537–3572, https://doi.org/10.5194/hess-26-3537-2022,https://doi.org/10.5194/hess-26-3537-2022, 2022

Short summary

Towards a compound-event-oriented climate model evaluation: a decomposition of the underlying biases in multivariate fire and heat stress hazards

Roberto Villalobos-Herrera, Emanuele Bevacqua, Andreia F. S. Ribeiro, Graeme Auld, Laura Crocetti, Bilyana Mircheva, Minh Ha, Jakob Zscheischler, and Carlo De Michele

Nat. Hazards Earth Syst. Sci., 21, 1867–1885, https://doi.org/10.5194/nhess-21-1867-2021,https://doi.org/10.5194/nhess-21-1867-2021, 2021

Short summary

Cited articles

Addor, N., Newman, A. J., Mizukami, N., and Clark, M. P.: The CAMELS data set: catchment attributes and meteorology for large-sample studies, Hydrol. Earth Syst. Sci., 21, 5293–5313, https://doi.org/10.5194/hess-21-5293-2017, 2017. a, b

Beven, K.: Benchmarking hydrological models for an uncertain future, Hydrol. Process., 37, e14882, https://doi.org/10.1002/hyp.14882, 2023. a

Clark, M. P., Vogel, R. M., Lamontagne, J. R., Mizukami, N., Knoben, W. J., Tang, G., Gharari, S., Freer, J. E., Whitfield, P. H., Shook, K. R., and Papalexiou, S. M.: The abuse of popular performance metrics in hydrologic modeling, Water Resour. Res., 57, e2020WR029001, https://doi.org/10.1029/2020WR029001, 2021. a, b, c, d, e, f

Duc, L. and Sawada, Y.: A signal-processing-based interpretation of the Nash–Sutcliffe efficiency, Hydrol. Earth Syst. Sci., 27, 1827–1839, https://doi.org/10.5194/hess-27-1827-2023, 2023. a

Feng, D., Beck, H., Lawson, K., and Shen, C.: The suitability of differentiable, physics-informed machine learning hydrologic models for ungauged regions and climate change impact assessment, Hydrol. Earth Syst. Sci., 27, 2357–2373, https://doi.org/10.5194/hess-27-2357-2023, 2023. a