Articles | Volume 29, issue 5
https://doi.org/10.5194/hess-29-1277-2025
© Author(s) 2025. This work is distributed under
the Creative Commons Attribution 4.0 License.
Analyzing the generalization capabilities of a hybrid hydrological model for extrapolation to extreme events
Eduardo Acuña Espinoza
CORRESPONDING AUTHOR
Institute of Water and Environment, Karlsruhe Institute of Technology (KIT), Karlsruhe, Germany
Ralf Loritz
Institute of Water and Environment, Karlsruhe Institute of Technology (KIT), Karlsruhe, Germany
Frederik Kratzert
Google Research, Vienna, Austria
Daniel Klotz
Google Research, Vienna, Austria
Helmholtz Centre for Environmental Research (UFZ), Leipzig, Germany
Martin Gauch
Google Research, Zurich, Switzerland
Manuel Álvarez Chaves
Stuttgart Center for Simulation Science, Statistical Model-Data Integration, University of Stuttgart, Stuttgart, Germany
Uwe Ehret
Institute of Water and Environment, Karlsruhe Institute of Technology (KIT), Karlsruhe, Germany
Related authors
Manuel Álvarez Chaves, Eduardo Acuña Espinoza, Uwe Ehret, and Anneli Guthke
EGUsphere, https://doi.org/10.5194/egusphere-2025-1699, 2025
Short summary
This study evaluates hybrid hydrological models that combine physics-based and data-driven components, using Information Theory to measure their relative contributions. When testing conceptual models with LSTMs that adjust parameters over time, we found performance primarily comes from the data-driven component, with physics constraints adding minimal value. We propose a quantitative tool to analyse this behaviour and suggest a workflow for diagnosing hybrid models.
Eduardo Acuña Espinoza, Frederik Kratzert, Daniel Klotz, Martin Gauch, Manuel Álvarez Chaves, Ralf Loritz, and Uwe Ehret
Hydrol. Earth Syst. Sci., 29, 1749–1758, https://doi.org/10.5194/hess-29-1749-2025, 2025
Short summary
Long short-term memory (LSTM) networks have demonstrated state-of-the-art performance for rainfall-runoff hydrological modelling. However, most studies focus on predictions at a daily scale, limiting the benefits of sub-daily (e.g. hourly) predictions in applications like flood forecasting. In this study, we introduce a new architecture, multi-frequency LSTM (MF-LSTM), designed to use inputs of various temporal frequencies to produce sub-daily (e.g. hourly) predictions at a moderate computational cost.
Sanika Baste, Daniel Klotz, Eduardo Acuña Espinoza, Andras Bardossy, and Ralf Loritz
EGUsphere, https://doi.org/10.5194/egusphere-2025-425, 2025
Short summary
This study evaluates the extrapolation performance of Long Short-Term Memory (LSTM) networks in rainfall-runoff modeling, specifically under extreme conditions. The findings reveal that the LSTM cannot predict discharge values beyond a theoretical limit, which is well below the extremity of its training data. This behavior results from the LSTM's gating structures rather than saturation of cell states alone.
Ralf Loritz, Alexander Dolich, Eduardo Acuña Espinoza, Pia Ebeling, Björn Guse, Jonas Götte, Sibylle K. Hassler, Corina Hauffe, Ingo Heidbüchel, Jens Kiesel, Mirko Mälicke, Hannes Müller-Thomy, Michael Stölzle, and Larisa Tarasova
Earth Syst. Sci. Data, 16, 5625–5642, https://doi.org/10.5194/essd-16-5625-2024, 2024
Short summary
The CAMELS-DE dataset features data from 1582 streamflow gauges across Germany, with records spanning from 1951 to 2020. This comprehensive dataset, which includes time series of up to 70 years (median 46 years), enables advanced research on water flow and environmental trends and supports the development of hydrological models.
Eduardo Acuña Espinoza, Ralf Loritz, Manuel Álvarez Chaves, Nicole Bäuerle, and Uwe Ehret
Hydrol. Earth Syst. Sci., 28, 2705–2719, https://doi.org/10.5194/hess-28-2705-2024, 2024
Short summary
Hydrological hybrid models promise to merge the performance of deep learning methods with the interpretability of process-based models. One hybrid approach is the dynamic parameterization of conceptual models using long short-term memory (LSTM) networks. We explored this method to evaluate the effect of the flexibility given by LSTMs on the process-based part.
Sarah Quỳnh-Giang Ho and Uwe Ehret
Hydrol. Earth Syst. Sci., 29, 2785–2810, https://doi.org/10.5194/hess-29-2785-2025, 2025
Short summary
In this paper, we use models to demonstrate that even small flood reservoirs – which capture water to avoid floods downstream – can be repurposed to release water in drier conditions without affecting their ability to protect against floods. By capturing water and releasing it once levels are low, we show that reservoirs can greatly increase the water available in drought. Having more water available to the reservoir, however, is not necessarily better for drought protection.
Judith Nijzink, Ralf Loritz, Laurent Gourdol, Davide Zoccatelli, Jean François Iffly, and Laurent Pfister
Earth Syst. Sci. Data Discuss., https://doi.org/10.5194/essd-2024-482, 2025
Preprint under review for ESSD
Short summary
The CAMELS-LUX dataset (Catchment Attributes and MEteorology for Large-sample Studies – LUXembourg) contains hydrologic, meteorologic and thunderstorm formation relevant atmospheric time series of 56 Luxembourgish catchments (2004–2021). These catchments are characterized by a large physiographic variety on a relatively small scale in a homogeneous climate. The dataset can be applied for (regional) hydrological analyses.
Martin Gauch, Frederik Kratzert, Daniel Klotz, Grey Nearing, Deborah Cohen, and Oren Gilon
EGUsphere, https://doi.org/10.5194/egusphere-2025-1224, 2025
Short summary
Missing input data are one of the most common challenges when building deep learning hydrological models. We present and analyze different methods that can produce predictions when certain inputs are missing during training or inference. Our proposed strategies provide high accuracy while allowing for more flexible data handling and being robust to outages in operational scenarios.
Maria Staudinger, Anna Herzog, Ralf Loritz, Tobias Houska, Sandra Pool, Diana Spieler, Paul D. Wagner, Juliane Mai, Jens Kiesel, Stephan Thober, Björn Guse, and Uwe Ehret
EGUsphere, https://doi.org/10.5194/egusphere-2025-1076, 2025
Short summary
Four process-based and four data-driven hydrological models are compared using different training data. We found process-based models to perform better with small data sets but stop learning soon, while data-driven models learn longer. The study highlights the importance of memory in data and the impact of different data sampling methods on model performance. The direct comparison of these models is novel and provides a clear understanding of their performance under various data conditions.
Daniel Klotz, Peter Miersch, Thiago V. M. do Nascimento, Fabrizio Fenicia, Martin Gauch, and Jakob Zscheischler
Earth Syst. Sci. Data Discuss., https://doi.org/10.5194/essd-2024-450, 2025
Revised manuscript under review for ESSD
Short summary
Data availability is central to hydrological science. It is the basis for advancing our understanding of hydrological processes, building prediction models, and anticipatory water management. We present a data-driven daily runoff reconstruction product for natural streamflow. We name it EARLS: European aggregated reconstruction for large-sample studies. The reconstructions represent daily simulations of natural streamflow across Europe and cover the period from 1953 to 2020.
Ashish Manoj J, Ralf Loritz, Hoshin Gupta, and Erwin Zehe
Hydrol. Earth Syst. Sci. Discuss., https://doi.org/10.5194/hess-2024-375, 2024
Revised manuscript under review for HESS
Short summary
Traditional hydrological models typically operate in a forward mode, simulating streamflow and other catchment fluxes based on precipitation input. In this study, we explored the possibility of reversing this process—inferring precipitation from streamflow data—to improve flood event modelling. We then used the generated precipitation series to run hydrological models, resulting in more accurate estimates of streamflow and soil moisture.
Claudia Färber, Henning Plessow, Simon Mischel, Frederik Kratzert, Nans Addor, Guy Shalev, and Ulrich Looser
Earth Syst. Sci. Data Discuss., https://doi.org/10.5194/essd-2024-427, 2024
Revised manuscript accepted for ESSD
Short summary
Large-sample datasets are essential in hydrological science to support modelling studies and advance process understanding. Caravan is a community initiative to create a large-sample hydrology dataset of meteorological forcing data, catchment attributes, and discharge data for catchments around the world. This dataset is a subset of hydrological discharge data and station-based watersheds from the Global Runoff Data Centre (GRDC), which are covered by an open data policy.
Frederik Kratzert, Martin Gauch, Daniel Klotz, and Grey Nearing
Hydrol. Earth Syst. Sci., 28, 4187–4201, https://doi.org/10.5194/hess-28-4187-2024, 2024
Short summary
Recently, a special type of neural-network architecture became increasingly popular in hydrology literature. However, in most applications, this model was applied as a one-to-one replacement for hydrology models without adapting or rethinking the experimental setup. In this opinion paper, we show how this is almost always a bad decision and how using these kinds of models requires the use of large-sample hydrology data sets.
Andreas Auer, Martin Gauch, Frederik Kratzert, Grey Nearing, Sepp Hochreiter, and Daniel Klotz
Hydrol. Earth Syst. Sci., 28, 4099–4126, https://doi.org/10.5194/hess-28-4099-2024, 2024
Short summary
This work examines the impact of temporal and spatial information on the uncertainty estimation of streamflow forecasts. The study emphasizes the importance of data updates and global information for precise uncertainty estimates. We use conformal prediction to show that recent data enhance the estimates, even if only available infrequently. Local data yield reasonable average estimations but fall short for peak-flow events. The use of global data significantly improves these predictions.
Andrea L. Campoverde, Uwe Ehret, Patrick Ludwig, and Joaquim G. Pinto
Geosci. Model Dev. Discuss., https://doi.org/10.5194/gmd-2024-134, 2024
Revised manuscript not accepted
Short summary
We looked at how well the model WRF-Hydro performed during the 2018 drought event in the River Rhine basin, even though it is typically used for floods. We used the meteorological ERA5 reanalysis dataset to simulate River Rhine’s streamflow and adjusted the model using parameters and actual discharge measurements. We focused on Lake Constance, a key part of the basin, but found issues with the model’s lake outflow simulation. By removing the lake module, we obtained more accurate results.
Daniel Klotz, Martin Gauch, Frederik Kratzert, Grey Nearing, and Jakob Zscheischler
Hydrol. Earth Syst. Sci., 28, 3665–3673, https://doi.org/10.5194/hess-28-3665-2024, 2024
Short summary
The evaluation of model performance is essential for hydrological modeling. Using performance criteria requires a deep understanding of their properties. We focus on a counterintuitive aspect of the Nash–Sutcliffe efficiency (NSE) and show that if we divide the data into multiple parts, the overall performance can be higher than all the evaluations of the subsets. Although this follows from the definition of the NSE, the resulting behavior can have unintended consequences in practice.
Uwe Ehret and Pankaj Dey
Hydrol. Earth Syst. Sci., 27, 2591–2605, https://doi.org/10.5194/hess-27-2591-2023, 2023
Short summary
We propose the c-u-curve method to characterize dynamical (time-variable) systems of all kinds. U is for uncertainty and expresses how well a system can be predicted in a given period of time. C is for complexity and expresses how predictability differs between different periods, i.e. how well predictability itself can be predicted. The method helps to better classify and compare dynamical systems across a wide range of disciplines, thus facilitating scientific collaboration.
Patrick Ludwig, Florian Ehmele, Mário J. Franca, Susanna Mohr, Alberto Caldas-Alvarez, James E. Daniell, Uwe Ehret, Hendrik Feldmann, Marie Hundhausen, Peter Knippertz, Katharina Küpfer, Michael Kunz, Bernhard Mühr, Joaquim G. Pinto, Julian Quinting, Andreas M. Schäfer, Frank Seidel, and Christina Wisotzky
Nat. Hazards Earth Syst. Sci., 23, 1287–1311, https://doi.org/10.5194/nhess-23-1287-2023, 2023
Short summary
Heavy precipitation in July 2021 led to widespread floods in western Germany and neighboring countries. The event was among the five heaviest precipitation events of the past 70 years in Germany, and the river discharges exceeded by far the statistical 100-year return values. Simulations of the event under future climate conditions revealed a strong and non-linear effect on flood peaks: for +2 K global warming, an 18 % increase in rainfall led to a 39 % increase of the flood peak in the Ahr river.
Susanna Mohr, Uwe Ehret, Michael Kunz, Patrick Ludwig, Alberto Caldas-Alvarez, James E. Daniell, Florian Ehmele, Hendrik Feldmann, Mário J. Franca, Christian Gattke, Marie Hundhausen, Peter Knippertz, Katharina Küpfer, Bernhard Mühr, Joaquim G. Pinto, Julian Quinting, Andreas M. Schäfer, Marc Scheibel, Frank Seidel, and Christina Wisotzky
Nat. Hazards Earth Syst. Sci., 23, 525–551, https://doi.org/10.5194/nhess-23-525-2023, 2023
Short summary
The flood event in July 2021 was one of the most severe disasters in Europe in the last half century. The objective of this two-part study is a multi-disciplinary assessment that examines the complex process interactions in different compartments, from meteorology to hydrological conditions to hydro-morphological processes to impacts on assets and environment. In addition, we address the question of what measures are possible to generate added value to early response management.
Grey S. Nearing, Daniel Klotz, Jonathan M. Frame, Martin Gauch, Oren Gilon, Frederik Kratzert, Alden Keefe Sampson, Guy Shalev, and Sella Nevo
Hydrol. Earth Syst. Sci., 26, 5493–5513, https://doi.org/10.5194/hess-26-5493-2022, 2022
Short summary
When designing flood forecasting models, it is necessary to use all available data to achieve the most accurate predictions possible. This manuscript explores two basic ways of ingesting near-real-time streamflow data into machine learning streamflow models. The point we want to make is that when working in the context of machine learning (instead of traditional hydrology models that are based on bio-geophysics), it is not necessary to use complex statistical methods for injecting sparse data.
Ralf Loritz, Maoya Bassiouni, Anke Hildebrandt, Sibylle K. Hassler, and Erwin Zehe
Hydrol. Earth Syst. Sci., 26, 4757–4771, https://doi.org/10.5194/hess-26-4757-2022, 2022
Short summary
In this study, we combine a deep-learning approach that predicts sap flow with a hydrological model to improve soil moisture and transpiration estimates at the catchment scale. Our results highlight that hybrid-model approaches, combining machine learning with physically based models, are a promising way to improve our ability to make hydrological predictions.
Sella Nevo, Efrat Morin, Adi Gerzi Rosenthal, Asher Metzger, Chen Barshai, Dana Weitzner, Dafi Voloshin, Frederik Kratzert, Gal Elidan, Gideon Dror, Gregory Begelman, Grey Nearing, Guy Shalev, Hila Noga, Ira Shavitt, Liora Yuklea, Moriah Royz, Niv Giladi, Nofar Peled Levi, Ofir Reich, Oren Gilon, Ronnie Maor, Shahar Timnat, Tal Shechter, Vladimir Anisimov, Yotam Gigi, Yuval Levin, Zach Moshe, Zvika Ben-Haim, Avinatan Hassidim, and Yossi Matias
Hydrol. Earth Syst. Sci., 26, 4013–4032, https://doi.org/10.5194/hess-26-4013-2022, 2022
Short summary
Early flood warnings are one of the most effective tools to save lives and goods. Machine learning (ML) models can improve flood prediction accuracy but their use in operational frameworks is limited. The paper presents a flood warning system, operational in India and Bangladesh, that uses ML models for forecasting river stage and flood inundation maps and discusses the models' performances. In 2021, more than 100 million flood alerts were sent to people near rivers over an area of 470 000 km².
Juliane Mai, Hongren Shen, Bryan A. Tolson, Étienne Gaborit, Richard Arsenault, James R. Craig, Vincent Fortin, Lauren M. Fry, Martin Gauch, Daniel Klotz, Frederik Kratzert, Nicole O'Brien, Daniel G. Princz, Sinan Rasiya Koya, Tirthankar Roy, Frank Seglenieks, Narayan K. Shrestha, André G. T. Temgoua, Vincent Vionnet, and Jonathan W. Waddell
Hydrol. Earth Syst. Sci., 26, 3537–3572, https://doi.org/10.5194/hess-26-3537-2022, 2022
Short summary
Model intercomparison studies are carried out to test various models and compare the quality of their outputs over the same domain. In this study, 13 diverse model setups using the same input data are evaluated over the Great Lakes region. Various model outputs – such as streamflow, evaporation, soil moisture, and amount of snow on the ground – are compared using standardized methods and metrics. The basin-wise model outputs and observations are made available through an interactive website.
Jonathan M. Frame, Frederik Kratzert, Daniel Klotz, Martin Gauch, Guy Shalev, Oren Gilon, Logan M. Qualls, Hoshin V. Gupta, and Grey S. Nearing
Hydrol. Earth Syst. Sci., 26, 3377–3392, https://doi.org/10.5194/hess-26-3377-2022, 2022
Short summary
The most accurate rainfall–runoff predictions are currently based on deep learning. There is a concern among hydrologists that deep learning models may not be reliable in extrapolation or for predicting extreme events. This study tests that hypothesis. The deep learning models remained relatively accurate in predicting extreme events compared with traditional models, even when extreme events were not included in the training set.
Thomas Lees, Steven Reece, Frederik Kratzert, Daniel Klotz, Martin Gauch, Jens De Bruijn, Reetik Kumar Sahu, Peter Greve, Louise Slater, and Simon J. Dadson
Hydrol. Earth Syst. Sci., 26, 3079–3101, https://doi.org/10.5194/hess-26-3079-2022, 2022
Short summary
Despite the accuracy of deep learning rainfall-runoff models, we are currently uncertain of what these models have learned. In this study we explore the internals of one deep learning architecture and demonstrate that the model learns about intermediate hydrological stores of soil moisture and snow water, despite never having seen data about these processes during training. Therefore, we find evidence that the deep learning approach learns a physically realistic mapping from inputs to outputs.
Daniel Klotz, Frederik Kratzert, Martin Gauch, Alden Keefe Sampson, Johannes Brandstetter, Günter Klambauer, Sepp Hochreiter, and Grey Nearing
Hydrol. Earth Syst. Sci., 26, 1673–1693, https://doi.org/10.5194/hess-26-1673-2022, 2022
Short summary
This contribution evaluates distributional runoff predictions from deep-learning-based approaches. We propose a benchmarking setup and establish four strong baselines. The results show that accurate, precise, and reliable uncertainty estimation can be achieved with deep learning.
Alexander Sternagel, Ralf Loritz, Brian Berkowitz, and Erwin Zehe
Hydrol. Earth Syst. Sci., 26, 1615–1629, https://doi.org/10.5194/hess-26-1615-2022, 2022
Short summary
We present a (physically based) Lagrangian approach to simulate diffusive mixing processes on the pore scale beyond perfectly mixed conditions. Results show the feasibility of the approach for reproducing measured mixing times and concentrations of isotopes over pore sizes and that typical shapes of breakthrough curves (normally associated with non-uniform transport in heterogeneous soils) may also occur as a result of imperfect subscale mixing in a macroscopically homogeneous soil matrix.
Erwin Zehe, Ralf Loritz, Yaniv Edery, and Brian Berkowitz
Hydrol. Earth Syst. Sci., 25, 5337–5353, https://doi.org/10.5194/hess-25-5337-2021, 2021
Short summary
This study uses the concepts of entropy and work to quantify and explain the emergence of preferential flow and transport in heterogeneous saturated porous media. We found that the downstream concentration of solutes in preferential pathways implies a downstream declining entropy in the transverse distribution of solute transport pathways. Preferential flow patterns with lower entropies emerged within media of higher heterogeneity – a stronger self-organization despite a higher randomness.
Frederik Kratzert, Daniel Klotz, Sepp Hochreiter, and Grey S. Nearing
Hydrol. Earth Syst. Sci., 25, 2685–2703, https://doi.org/10.5194/hess-25-2685-2021, 2021
Short summary
We investigate how deep learning models use different meteorological data sets in the task of (regional) rainfall–runoff modeling. We show that performance can be significantly improved when using different data products as input and further show how the model learns to combine those meteorological inputs differently across time and space. The results are carefully benchmarked against classical approaches, showing the supremacy of the presented approach.
Martin Gauch, Frederik Kratzert, Daniel Klotz, Grey Nearing, Jimmy Lin, and Sepp Hochreiter
Hydrol. Earth Syst. Sci., 25, 2045–2062, https://doi.org/10.5194/hess-25-2045-2021, 2021
Short summary
We present multi-timescale Long Short-Term Memory (MTS-LSTM), a machine learning approach that predicts discharge at multiple timescales within one model. MTS-LSTM is significantly more accurate than the US National Water Model and computationally more efficient than an individual LSTM model per timescale. Further, MTS-LSTM can process different input variables at different timescales, which is important as the lead time of meteorological forecasts often depends on their temporal resolution.
Alexander Sternagel, Ralf Loritz, Julian Klaus, Brian Berkowitz, and Erwin Zehe
Hydrol. Earth Syst. Sci., 25, 1483–1508, https://doi.org/10.5194/hess-25-1483-2021, 2021
Short summary
The key innovation of the study is a method to simulate reactive solute transport in the vadose zone within a Lagrangian framework. We extend the LAST-Model with a method to account for non-linear sorption and first-order degradation processes during unsaturated transport of reactive substances in the matrix and macropores. Model evaluations using bromide and pesticide data from irrigation experiments under different flow conditions on various timescales show the feasibility of the method.
Elnaz Azmi, Uwe Ehret, Steven V. Weijs, Benjamin L. Ruddell, and Rui A. P. Perdigão
Hydrol. Earth Syst. Sci., 25, 1103–1115, https://doi.org/10.5194/hess-25-1103-2021, 2021
Short summary
Computer models should be as simple as possible but not simpler. Simplicity refers to the length of the model and the effort it takes the model to generate its output. Here we present a practical technique for measuring the latter by the number of memory visits during model execution by Strace, a troubleshooting and monitoring program. The advantage of this approach is that it can be applied to any computer-based model, which facilitates model intercomparison.
Ralf Loritz, Markus Hrachowitz, Malte Neuper, and Erwin Zehe
Hydrol. Earth Syst. Sci., 25, 147–167, https://doi.org/10.5194/hess-25-147-2021, 2021
Short summary
This study investigates the role and value of distributed rainfall in the runoff generation of a mesoscale catchment. We compare the performance of different hydrological models at different periods and show that a distributed model driven by distributed rainfall yields improved performances only during certain periods. We then step beyond this finding and develop a spatially adaptive model that is capable of dynamically adjusting its spatial model structure in time.
Stephanie Thiesen, Diego M. Vieira, Mirko Mälicke, Ralf Loritz, J. Florian Wellmann, and Uwe Ehret
Hydrol. Earth Syst. Sci., 24, 4523–4540, https://doi.org/10.5194/hess-24-4523-2020, 2020
Short summary
A spatial interpolator has been proposed for exploring the information content of the data in the light of geostatistics and information theory. It showed comparable results to traditional interpolators, with the advantage of presenting generalization properties. We discussed three different ways of combining distributions and their implications for the probabilistic results. By its construction, the method provides a suitable and flexible framework for uncertainty analysis and decision-making.
Uwe Ehret, Rik van Pruijssen, Marina Bortoli, Ralf Loritz, Elnaz Azmi, and Erwin Zehe
Hydrol. Earth Syst. Sci., 24, 4389–4411, https://doi.org/10.5194/hess-24-4389-2020, 2020
Short summary
In this paper we propose adaptive clustering as a new method for reducing the computational efforts of distributed modelling. It consists of identifying similar-acting model elements during the runtime, clustering them, running the model for just a few representatives per cluster, and mapping their results to the remaining model elements in the cluster. With the example of a hydrological model, we show that this saves considerable computation time, while largely maintaining the output quality.
Cited articles
Acuna Espinoza, E.: Analyzing the generalization capabilities of hybrid hydrological models for extrapolation to extreme events, Zenodo [code and data set], https://doi.org/10.5281/zenodo.14191623, 2024.
Acuña Espinoza, E., Loritz, R., Álvarez Chaves, M., Bäuerle, N., and Ehret, U.: To bucket or not to bucket? Analyzing the performance and interpretability of hybrid hydrological models with dynamic parameterization, Hydrol. Earth Syst. Sci., 28, 2705–2719, https://doi.org/10.5194/hess-28-2705-2024, 2024.
Addor, N., Newman, A. J., Mizukami, N., and Clark, M. P.: The CAMELS data set: catchment attributes and meteorology for large-sample studies, Hydrol. Earth Syst. Sci., 21, 5293–5313, https://doi.org/10.5194/hess-21-5293-2017, 2017.
Bárdossy, A. and Anwar, F.: Why do our rainfall–runoff models keep underestimating the peak flows?, Hydrol. Earth Syst. Sci., 27, 1987–2000, https://doi.org/10.5194/hess-27-1987-2023, 2023.
Bergström, S.: THE HBV MODEL – its structure and applications, Tech. rep., Sveriges Meteorologiska Och Hydrologiska Institut, https://www.smhi.se/en/publications/the-hbv-model-its-structure-and-applications-1.83591 (last access: 12 January 2025), 1992.
Di Baldassarre, G. and Montanari, A.: Uncertainty in river discharge observations: a quantitative analysis, Hydrol. Earth Syst. Sci., 13, 913–921, https://doi.org/10.5194/hess-13-913-2009, 2009.
Donoho, D.: 50 years of data science, J. Comput. Graph. Stat., 26, 745–766, https://doi.org/10.1080/10618600.2017.1384734, 2017.
Duan, Q., Sorooshian, S., and Gupta, V. K.: Optimal use of the SCE-UA global optimization method for calibrating watershed models, J. Hydrol., 158, 265–284, https://doi.org/10.1016/0022-1694(94)90057-4, 1994.
Ehret, U., van Pruijssen, R., Bortoli, M., Loritz, R., Azmi, E., and Zehe, E.: Adaptive clustering: reducing the computational costs of distributed (hydrological) modelling by exploiting time-variable similarity among model elements, Hydrol. Earth Syst. Sci., 24, 4389–4411, https://doi.org/10.5194/hess-24-4389-2020, 2020.
England Jr., J. F., Cohn, T. A., Faber, B. A., Stedinger, J. R., Thomas Jr., W. O., Veilleux, A. G., Kiang, J. E., and Mason Jr., R. R.: Guidelines for determining flood flow frequency – Bulletin 17C, Tech. rep., US Geological Survey, https://doi.org/10.3133/tm4B5, 2019.
Feng, D., Fang, K., and Shen, C.: Enhancing streamflow forecast and extracting insights using long-short term memory networks with data integration at continental scales, Water Resour. Res., 56, e2019WR026793, https://doi.org/10.1029/2019WR026793, 2020.
Feng, D., Liu, J., Lawson, K., and Shen, C.: Differentiable, learnable, regionalized process-based models with multiphysical outputs can approach state-of-the-art hydrologic prediction accuracy, Water Resour. Res., 58, e2022WR032404, https://doi.org/10.1029/2022WR032404, 2022.
Gauch, M., Kratzert, F., Klotz, D., Nearing, G., Lin, J., and Hochreiter, S.: Rainfall–runoff prediction at multiple timescales with a single Long Short-Term Memory network, Hydrol. Earth Syst. Sci., 25, 2045–2062, https://doi.org/10.5194/hess-25-2045-2021, 2021.
Hochreiter, S. and Schmidhuber, J.: Long short-term memory, Neural Comput., 9, 1735–1780, https://doi.org/10.1162/neco.1997.9.8.1735, 1997.
Höge, M., Scheidegger, A., Baity-Jesi, M., Albert, C., and Fenicia, F.: Improving hydrologic models for predictions and process understanding using neural ODEs, Hydrol. Earth Syst. Sci., 26, 5085–5102, https://doi.org/10.5194/hess-26-5085-2022, 2022.
Houska, T., Kraft, P., Chamorro-Chavez, A., and Breuer, L.: SPOTting model parameters using a ready-made python package, PLoS One, 10, e0145180, https://doi.org/10.1371/journal.pone.0145180, 2015.
Jiang, S., Zheng, Y., and Solomatine, D.: Improving AI system awareness of geoscience knowledge: symbiotic integration of physical approaches and deep learning, Geophys. Res. Lett., 47, e2020GL088229, https://doi.org/10.1029/2020GL088229, 2020.
Kingma, D. P. and Ba, J.: Adam: A method for stochastic optimization, arXiv [preprint], https://doi.org/10.48550/arXiv.1412.6980, 2014.
Kraft, B., Jung, M., Körner, M., Koirala, S., and Reichstein, M.: Towards hybrid modeling of the global hydrological cycle, Hydrol. Earth Syst. Sci., 26, 1579–1614, https://doi.org/10.5194/hess-26-1579-2022, 2022.
Kratzert, F., Klotz, D., Shalev, G., Klambauer, G., Hochreiter, S., and Nearing, G.: Towards learning universal, regional, and local hydrological behaviors via machine learning applied to large-sample datasets, Hydrol. Earth Syst. Sci., 23, 5089–5110, https://doi.org/10.5194/hess-23-5089-2019, 2019.
Kratzert, F., Gauch, M., Nearing, G., and Klotz, D.: NeuralHydrology — A Python library for Deep Learning research in hydrology, Journal of Open Source Software, 7, 4050, https://doi.org/10.21105/joss.04050, 2022.
Kratzert, F., Gauch, M., Klotz, D., and Nearing, G.: HESS Opinions: Never train a Long Short-Term Memory (LSTM) network on a single basin, Hydrol. Earth Syst. Sci., 28, 4187–4201, https://doi.org/10.5194/hess-28-4187-2024, 2024.
Lees, T., Buechel, M., Anderson, B., Slater, L., Reece, S., Coxon, G., and Dadson, S. J.: Benchmarking data-driven rainfall–runoff models in Great Britain: a comparison of long short-term memory (LSTM)-based models with four lumped conceptual models, Hydrol. Earth Syst. Sci., 25, 5517–5534, https://doi.org/10.5194/hess-25-5517-2021, 2021.
Loritz, R., Dolich, A., Acuña Espinoza, E., Ebeling, P., Guse, B., Götte, J., Hassler, S. K., Hauffe, C., Heidbüchel, I., Kiesel, J., Mälicke, M., Müller-Thomy, H., Stölzle, M., and Tarasova, L.: CAMELS-DE: hydro-meteorological time series and attributes for 1582 catchments in Germany, Earth Syst. Sci. Data, 16, 5625–5642, https://doi.org/10.5194/essd-16-5625-2024, 2024.
Martinez, G. F. and Gupta, H. V.: Toward improved identification of hydrological models: A diagnostic evaluation of the “abcd” monthly water balance model for the conterminous United States, Water Resour. Res., 46, W08507, https://doi.org/10.1029/2009WR008294, 2010.
Nash, J. E. and Sutcliffe, J. V.: River flow forecasting through conceptual models part I – a discussion of principles, J. Hydrol., 10, 282–290, https://doi.org/10.1016/0022-1694(70)90255-6, 1970.
Nearing, G. S., Kratzert, F., Sampson, A. K., Pelissier, C. S., Klotz, D., Frame, J. M., Prieto, C., and Gupta, H. V.: What role does hydrological science play in the age of machine learning?, Water Resour. Res., 57, e2020WR028091, https://doi.org/10.1029/2020WR028091, 2021.
Nevo, S., Morin, E., Gerzi Rosenthal, A., Metzger, A., Barshai, C., Weitzner, D., Voloshin, D., Kratzert, F., Elidan, G., Dror, G., Begelman, G., Nearing, G., Shalev, G., Noga, H., Shavitt, I., Yuklea, L., Royz, M., Giladi, N., Peled Levi, N., Reich, O., Gilon, O., Maor, R., Timnat, S., Shechter, T., Anisimov, V., Gigi, Y., Levin, Y., Moshe, Z., Ben-Haim, Z., Hassidim, A., and Matias, Y.: Flood forecasting with machine learning models in an operational framework, Hydrol. Earth Syst. Sci., 26, 4013–4032, https://doi.org/10.5194/hess-26-4013-2022, 2022.
Newman, A., Sampson, K., Clark, M., Bock, A., Viger, R. J., Blodgett, D., Addor, N., and Mizukami, M.: CAMELS: Catchment Attributes and MEteorology for Large-sample Studies, Version 1.2, UCAR/NCAR [data set], https://doi.org/10.5065/D6MW2F4D, 2022.
Newman, A. J., Clark, M. P., Sampson, K., Wood, A., Hay, L. E., Bock, A., Viger, R. J., Blodgett, D., Brekke, L., Arnold, J. R., Hopson, T., and Duan, Q.: Development of a large-sample watershed-scale hydrometeorological data set for the contiguous USA: data set characteristics and assessment of regional variability in hydrologic model performance, Hydrol. Earth Syst. Sci., 19, 209–223, https://doi.org/10.5194/hess-19-209-2015, 2015.
Reichstein, M., Camps-Valls, G., Stevens, B., Jung, M., Denzler, J., and Carvalhais, N.: Deep learning and process understanding for data-driven Earth system science, Nature, 566, 195–204, https://doi.org/10.1038/s41586-019-0912-1, 2019.
Shen, C., Laloy, E., Elshorbagy, A., Albert, A., Bales, J., Chang, F.-J., Ganguly, S., Hsu, K.-L., Kifer, D., Fang, Z., Fang, K., Li, D., Li, X., and Tsai, W.-P.: HESS Opinions: Incubating deep-learning-powered hydrologic science advances as a community, Hydrol. Earth Syst. Sci., 22, 5639–5656, https://doi.org/10.5194/hess-22-5639-2018, 2018.
Shen, C., Appling, A. P., Gentine, P., Bandai, T., Gupta, H., Tartakovsky, A., Baity-Jesi, M., Fenicia, F., Kifer, D., Li, L., Liu, X., Ren, W., Zheng, Y., Harman, C. J., Clark, M., Farthing, M., Feng, D., Kumar, P., Aboelyazeed, D., Rahmani, F., Song, Y., Beck, H. E., Bindas, T., Dwivedi, D., Fang, K., Höge, M., Rackauckas, C., Mohanty, B., Roy, T., Xu, C., and Lawson, K.: Differentiable modelling to unify machine learning and physical models for geosciences, Nature Reviews Earth and Environment, 4, 552–567, https://doi.org/10.1038/s43017-023-00450-9, 2023.
Slater, L. J., Arnal, L., Boucher, M.-A., Chang, A. Y.-Y., Moulds, S., Murphy, C., Nearing, G., Shalev, G., Shen, C., Speight, L., Villarini, G., Wilby, R. L., Wood, A., and Zappa, M.: Hybrid forecasting: blending climate predictions with AI models, Hydrol. Earth Syst. Sci., 27, 1865–1889, https://doi.org/10.5194/hess-27-1865-2023, 2023.
Tsai, W.-P., Feng, D., Pan, M., Beck, H., Lawson, K., Yang, Y., Liu, J., and Shen, C.: From calibration to parameter learning: harnessing the scaling effects of big data in geoscientific modeling, Nat. Commun., 12, 5988, https://doi.org/10.1038/s41467-021-26107-z, 2021.
US Geological Survey: National Water Information System data available on the World Wide Web (USGS Water Data for the Nation), https://doi.org/10.5066/F7P55KJN, 2016.
Virtanen, P., Gommers, R., Oliphant, T. E., Haberland, M., Reddy, T., Cournapeau, D., Burovski, E., Peterson, P., Weckesser, W., Bright, J., van der Walt, S. J., Brett, M., Wilson, J., Millman, K. J., Mayorov, N., Nelson, A. R. J., Jones, E., Kern, R., Larson, E., Carey, C. J., Polat, I., Feng, Y., Moore, E. W., VanderPlas, J., Laxalde, D., Perktold, J., Cimrman, R., Henriksen, I., Quintero, E. A., Harris, C. R., Archibald, A. M., Ribeiro, A. H., Pedregosa, F., van Mulbregt, P., Vijaykumar, A., Bardelli, A. P., Rothberg, A., Hilboll, A., Kloeckner, A., Scopatz, A., Lee, A., Rokem, A., Woods, C. N., Fulton, C., Masson, C., Häggström, C., Fitzgerald, C., Nicholson, D. A., Hagen, D. R., Pasechnik, D. V., Olivetti, E., Martin, E., Wieser, E., Silva, F., Lenders, F., Wilhelm, F., Young, G., Price, G. A., Ingold, G.-L., Allen, G. E., Lee, G. R., Audren, H., Probst, I., Dietrich, J. P., Silterra, J., Webber, J. T., Slavič, J., Nothman, J., Buchner, J., Kulick, J., Schönberger, J. L., de Miranda Cardoso, J. V., Reimer, J., Harrington, J., Rodríguez, J. L. C., Nunez-Iglesias, J., Kuczynski, J., Tritz, K., Thoma, M., Newville, M., Kümmerer, M., Bolingbroke, M., Tartre, M., Pak, M., Smith, N. J., Nowaczyk, N., Shebanov, N., Pavlyk, O., Brodtkorb, P. A., Lee, P., McGibbon, R. T., Feldbauer, R., Lewis, S., Tygier, S., Sievert, S., Vigna, S., Peterson, S., More, S., Pudlik, T., Oshima, T., Pingel, T. J., Robitaille, T. P., Spura, T., Jones, T. R., Cera, T., Leslie, T., Zito, T., Krauss, T., Upadhyay, U., Halchenko, Y. O., and Vázquez-Baeza, Y.: SciPy 1.0: fundamental algorithms for scientific computing in Python, Nat. Methods, 17, 261–272, https://doi.org/10.1038/s41592-019-0686-2, 2020.
Vrugt, J. A.: Markov chain Monte Carlo simulation using the DREAM software package: theory, concepts, and MATLAB implementation, Environ. Modell. Softw., 75, 273–316, https://doi.org/10.1016/j.envsoft.2015.08.013, 2016.
Westerberg, I. K. and McMillan, H. K.: Uncertainty in hydrological signatures, Hydrol. Earth Syst. Sci., 19, 3951–3968, https://doi.org/10.5194/hess-19-3951-2015, 2015.
Short summary
Data-driven techniques have shown the potential to outperform process-based models in rainfall–runoff simulations. Hybrid models, combining both approaches, aim to enhance accuracy and maintain interpretability. Expanding the set of test cases to evaluate hybrid models under different conditions, we test their generalization capabilities for extreme hydrological events.