Articles | Volume 25, issue 2
https://doi.org/10.5194/hess-25-1103-2021
© Author(s) 2021. This work is distributed under
the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
https://doi.org/10.5194/hess-25-1103-2021
© Author(s) 2021. This work is distributed under
the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Technical note: “Bit by bit”: a practical and general approach for evaluating model computational complexity vs. model performance
Steinbuch Centre for Computing, Karlsruhe Institute of Technology (KIT), Karlsruhe, Germany
Uwe Ehret
Institute of Water Resources and River Basin Management, Karlsruhe
Institute of Technology (KIT), Karlsruhe, Germany
Steven V. Weijs
Department of Civil Engineering, University of British Columbia,
Vancouver, Canada
Benjamin L. Ruddell
School of Informatics, Computing, and Cyber Systems, Northern Arizona University, Flagstaff, USA
Rui A. P. Perdigão
Meteoceanics Interdisciplinary Centre for Complex System Science,
Vienna, Austria
CCIAM, Centre for Ecology, Evolution and Environmental Changes,
Universidade de Lisboa, Lisbon, Portugal
Physics of Information and Quantum Technologies Group, Instituto de
Telecomunicações, Lisbon, Portugal
Related authors
No articles found.
Andrea L. Campoverde, Uwe Ehret, Patrick Ludwig, and Joaquim G. Pinto
EGUsphere, https://doi.org/10.5194/egusphere-2025-3988, https://doi.org/10.5194/egusphere-2025-3988, 2025
This preprint is open for discussion and under review for Natural Hazards and Earth System Sciences (NHESS).
Short summary
Short summary
Droughts are becoming more common in Europe. Our study used vast climate data to uncover extreme unseen low-water events. These simulations show the potential droughts becoming more severe and lasting longer than the damaging 2018 event, which impacted shipping and industry. This research highlights the urgent need for adaptation measures to prevent costly economic and ecological consequences for the Rhine's waterway.
Sarah Quỳnh-Giang Ho and Uwe Ehret
Hydrol. Earth Syst. Sci., 29, 2785–2810, https://doi.org/10.5194/hess-29-2785-2025, https://doi.org/10.5194/hess-29-2785-2025, 2025
Short summary
Short summary
In this paper, we use models to demonstrate that even small flood reservoirs – which capture water to avoid floods downstream – can be repurposed to release water in drier conditions without affecting their ability to protect against floods. By capturing water and releasing it once levels are low, we show that reservoirs can greatly increase the water available in drought. Having more water available to the reservoir, however, is not necessarily better for drought protection.
Manuel Álvarez Chaves, Eduardo Acuña Espinoza, Uwe Ehret, and Anneli Guthke
EGUsphere, https://doi.org/10.5194/egusphere-2025-1699, https://doi.org/10.5194/egusphere-2025-1699, 2025
Short summary
Short summary
This study evaluates hybrid hydrological models that combine physics-based and data-driven components, using Information Theory to measure their relative contributions. When testing conceptual models with LSTMs that adjust parameters over time, we found performance primarily comes from the data-driven component, with physics constraints adding minimal value. We propose a quantitative tool to analyse this behaviour and suggest a workflow for diagnosing hybrid models.
Eduardo Acuña Espinoza, Frederik Kratzert, Daniel Klotz, Martin Gauch, Manuel Álvarez Chaves, Ralf Loritz, and Uwe Ehret
Hydrol. Earth Syst. Sci., 29, 1749–1758, https://doi.org/10.5194/hess-29-1749-2025, https://doi.org/10.5194/hess-29-1749-2025, 2025
Short summary
Short summary
Long short-term memory (LSTM) networks have demonstrated state-of-the-art performance for rainfall-runoff hydrological modelling. However, most studies focus on predictions at a daily scale, limiting the benefits of sub-daily (e.g. hourly) predictions in applications like flood forecasting. In this study, we introduce a new architecture, multi-frequency LSTM (MF-LSTM), designed to use inputs of various temporal frequencies to produce sub-daily (e.g. hourly) predictions at a moderate computational cost.
Maria Staudinger, Anna Herzog, Ralf Loritz, Tobias Houska, Sandra Pool, Diana Spieler, Paul D. Wagner, Juliane Mai, Jens Kiesel, Stephan Thober, Björn Guse, and Uwe Ehret
EGUsphere, https://doi.org/10.5194/egusphere-2025-1076, https://doi.org/10.5194/egusphere-2025-1076, 2025
Short summary
Short summary
Four process-based and four data-driven hydrological models are compared using different training data. We found process-based models to perform better with small data sets but stop learning soon, while data-driven models learn longer. The study highlights the importance of memory in data and the impact of different data sampling methods on model performance. The direct comparison of these models is novel and provides a clear understanding of their performance under various data conditions.
Eduardo Acuña Espinoza, Ralf Loritz, Frederik Kratzert, Daniel Klotz, Martin Gauch, Manuel Álvarez Chaves, and Uwe Ehret
Hydrol. Earth Syst. Sci., 29, 1277–1294, https://doi.org/10.5194/hess-29-1277-2025, https://doi.org/10.5194/hess-29-1277-2025, 2025
Short summary
Short summary
Data-driven techniques have shown the potential to outperform process-based models in rainfall–runoff simulations. Hybrid models, combining both approaches, aim to enhance accuracy and maintain interpretability. Expanding the set of test cases to evaluate hybrid models under different conditions, we test their generalization capabilities for extreme hydrological events.
Daniel Kovacek and Steven Weijs
Earth Syst. Sci. Data, 17, 259–275, https://doi.org/10.5194/essd-17-259-2025, https://doi.org/10.5194/essd-17-259-2025, 2025
Short summary
Short summary
We made a dataset for British Columbia describing the terrain, soil, land cover, and climate of over 1 million watersheds. The attributes are often used in hydrology because they are related to the water cycle. The data are meant to be used for water resources problems that can benefit from lots of watersheds and their attributes. The data and instructions needed to build the dataset from scratch are freely available. The permanent home for the data is https://doi.org/10.5683/SP3/JNKZVT.
Noriaki Ohara, Andrew D. Parsekian, Benjamin M. Jones, Rodrigo C. Rangel, Kenneth M. Hinkel, and Rui A. P. Perdigão
The Cryosphere, 18, 5139–5152, https://doi.org/10.5194/tc-18-5139-2024, https://doi.org/10.5194/tc-18-5139-2024, 2024
Short summary
Short summary
Snow distribution characterization is essential for accurate snow water estimation for water resource prediction from existing in situ observations and remote-sensing data at a finite spatial resolution. Four different observed snow distribution datasets were analyzed for Gaussianity. We found that non-Gaussianity of snow distribution is a signature of the wind redistribution effect. Generally, seasonal snowpack can be approximated well by a Gaussian distribution for a fully snow-covered area.
Andrea L. Campoverde, Uwe Ehret, Patrick Ludwig, and Joaquim G. Pinto
Geosci. Model Dev. Discuss., https://doi.org/10.5194/gmd-2024-134, https://doi.org/10.5194/gmd-2024-134, 2024
Revised manuscript not accepted
Short summary
Short summary
We looked at how well the model WRF-Hydro performed during the 2018 drought event in the River Rhine basin, even though it is typically used for floods. We used the meteorological ERA5 reanalysis dataset to simulate River Rhine’s streamflow and adjusted the model using parameters and actual discharge measurements. We focused on Lake Constance, a key part of the basin, but found issues with the model’s lake outflow simulation. By removing the lake module, we obtained more accurate results.
Eduardo Acuña Espinoza, Ralf Loritz, Manuel Álvarez Chaves, Nicole Bäuerle, and Uwe Ehret
Hydrol. Earth Syst. Sci., 28, 2705–2719, https://doi.org/10.5194/hess-28-2705-2024, https://doi.org/10.5194/hess-28-2705-2024, 2024
Short summary
Short summary
Hydrological hybrid models promise to merge the performance of deep learning methods with the interpretability of process-based models. One hybrid approach is the dynamic parameterization of conceptual models using long short-term memory (LSTM) networks. We explored this method to evaluate the effect of the flexibility given by LSTMs on the process-based part.
Benjamin L. Ruddell and Richard Rushforth
Hydrol. Earth Syst. Sci., 28, 1089–1106, https://doi.org/10.5194/hess-28-1089-2024, https://doi.org/10.5194/hess-28-1089-2024, 2024
Short summary
Short summary
This study finds that bedroom cities show higher water productivity based on the standard efficiency benchmark of gallons per capita, but core cities that host large businesses show higher water productivity using a basket of economic values like taxes, payroll, and business revenues. Using a broader basket of water productivity benchmarks that consider more of the community’s socio-economic values and goals could inform more balanced and equitable water allocation decisions by policymakers.
Uwe Ehret and Pankaj Dey
Hydrol. Earth Syst. Sci., 27, 2591–2605, https://doi.org/10.5194/hess-27-2591-2023, https://doi.org/10.5194/hess-27-2591-2023, 2023
Short summary
Short summary
We propose the
c-u-curvemethod to characterize dynamical (time-variable) systems of all kinds.
Uis for uncertainty and expresses how well a system can be predicted in a given period of time.
Cis for complexity and expresses how predictability differs between different periods, i.e. how well predictability itself can be predicted. The method helps to better classify and compare dynamical systems across a wide range of disciplines, thus facilitating scientific collaboration.
Patrick Ludwig, Florian Ehmele, Mário J. Franca, Susanna Mohr, Alberto Caldas-Alvarez, James E. Daniell, Uwe Ehret, Hendrik Feldmann, Marie Hundhausen, Peter Knippertz, Katharina Küpfer, Michael Kunz, Bernhard Mühr, Joaquim G. Pinto, Julian Quinting, Andreas M. Schäfer, Frank Seidel, and Christina Wisotzky
Nat. Hazards Earth Syst. Sci., 23, 1287–1311, https://doi.org/10.5194/nhess-23-1287-2023, https://doi.org/10.5194/nhess-23-1287-2023, 2023
Short summary
Short summary
Heavy precipitation in July 2021 led to widespread floods in western Germany and neighboring countries. The event was among the five heaviest precipitation events of the past 70 years in Germany, and the river discharges exceeded by far the statistical 100-year return values. Simulations of the event under future climate conditions revealed a strong and non-linear effect on flood peaks: for +2 K global warming, an 18 % increase in rainfall led to a 39 % increase of the flood peak in the Ahr river.
Susanna Mohr, Uwe Ehret, Michael Kunz, Patrick Ludwig, Alberto Caldas-Alvarez, James E. Daniell, Florian Ehmele, Hendrik Feldmann, Mário J. Franca, Christian Gattke, Marie Hundhausen, Peter Knippertz, Katharina Küpfer, Bernhard Mühr, Joaquim G. Pinto, Julian Quinting, Andreas M. Schäfer, Marc Scheibel, Frank Seidel, and Christina Wisotzky
Nat. Hazards Earth Syst. Sci., 23, 525–551, https://doi.org/10.5194/nhess-23-525-2023, https://doi.org/10.5194/nhess-23-525-2023, 2023
Short summary
Short summary
The flood event in July 2021 was one of the most severe disasters in Europe in the last half century. The objective of this two-part study is a multi-disciplinary assessment that examines the complex process interactions in different compartments, from meteorology to hydrological conditions to hydro-morphological processes to impacts on assets and environment. In addition, we address the question of what measures are possible to generate added value to early response management.
Hossein Foroozand and Steven V. Weijs
Hydrol. Earth Syst. Sci., 25, 831–850, https://doi.org/10.5194/hess-25-831-2021, https://doi.org/10.5194/hess-25-831-2021, 2021
Short summary
Short summary
In monitoring network design, we have to decide what to measure, where to measure, and when to measure. In this paper, we focus on the question of where to measure. Past literature has used the concept of information to choose a selection of locations that provide maximally informative data. In this paper, we look in detail at the proper mathematical formulation of the information concept as an objective. We argue that previous proposals for this formulation have been needlessly complicated.
Cited articles
Akaike, H.: A new look at the statistical model identification, IEEE T. Automat. Control, 19, 716–723, https://doi.org/10.1109/TAC.1974.1100705, 1974.
Arkesteijn, L. and Pande, S.: On hydrological model complexity, its
geometrical interpretations and prediction uncertainty, Water Resour. Res., 49, 7048–7063, https://doi.org/10.1002/wrcr.20529, 2013.
Atkinson, S. E., Woods, R. A., and Sivapalan, M.: Climate and landscape
controls on water balance model complexity over changing timescales, Water
Resour. Res., 38, 50-51–50-17, https://doi.org/10.1029/2002wr001487, 2002.
Azmi, E.: KIT-HYD/model-evaluation: Release 1 (Version v1.0), Zenodo,
https://doi.org/10.5281/zenodo.4485876, 2021.
Bennett, N. D., Croke, B. F. W., Guariso, G., Guillaume, J. H. A., Hamilton,
S. H., Jakeman, A. J., Marsili-Libelli, S., Newham, L. T. H., Norton, J. P.,
Perrin, C., Pierce, S. A., Robson, B., Seppelt, R., Voinov, A. A., Fath, B.
D., and Andreassian, V.: Characterising performance of environmental models,
Environ. Model. Softw., 40, 1–20, https://doi.org/10.1016/j.envsoft.2012.09.011, 2013.
Bras, R. L.: Complexity and organization in hydrology: A personal view, Water Resour. Res., 51, 6532–6548, https://doi.org/10.1002/2015wr016958, 2015.
Castillo, A., Castelli, F., and Entekhabi, D.: An entropy-based measure of
hydrologic complexity and its applications, Water Resour. Res., 51, 5145–5160, https://doi.org/10.1002/2014wr016035, 2015.
Chaitin, G. J.: On the Length of Programs for Computing Finite Binary Sequences, J. ACM, 13, 547–569, https://doi.org/10.1145/321356.321363, 1966.
Cherkassky, V. and Mulier, F. M.: Learning from data: concepts, theory, and
methods. John Wiley & Sons, Hoboken, New Jersey, USA, 2007.
Cover, T. and Thomas, J. A.: Elements of Information Theory, Wiley Series in Telecommunications and Signal Processing, Wiley-Interscience, Hoboken, New Jersey, USA, 2006.
Darscheid, P., Guthke, A., and Ehret, U.: A Maximum-Entropy Method to Estimate Discrete Distributions from Samples Ensuring Nonzero Probabilities,
Entropy, 20, 601, https://doi.org/10.3390/e20080601, 2018.
Engelhardt, S., Matyssek, R., and Huwe, B.: Complexity and information
propagation in hydrological time series of mountain forest catchments, Eur.
J. Forest Res., 128, 621–631, https://doi.org/10.1007/s10342-009-0306-2, 2009.
Finger, D., Vis, M., Huss, M., and Seibert, J.: The value of multiple data
set calibration versus model complexity for improving the performance of
hydrological models in mountain catchments, Water Resour. Res., 51, 1939–1958, https://doi.org/10.1002/2014wr015712, 2015.
Forster, K., Meon, G., Marke, T., and Strasser, U.: Effect of meteorological
forcing and snow model complexity on hydrological simulations in the Sieber
catchment (Harz Mountains, Germany), Hydrol. Earth Syst. Sci., 18, 4703–4720, https://doi.org/10.5194/hess-18-4703-2014, 2014.
Gan, T. Y., Dlamini, E. M., and Biftu, G. F.: Effects of model complexity and structure, data quality, and objective functions on hydrologic modeling, J. Hydrol., 192, 81–103, 1997.
Gell-Mann, M.: What is complexity? Remarks on simplicity and complexity by the Nobel Prize-winning author of The Quark and the Jaguar, Complexity, 1,
16–19, https://doi.org/10.1002/cplx.6130010105, 1995.
Gong, W., Yang, D., Gupta, H. V., and Nearing, G.: Estimating information
entropy for hydrological data: One-dimensional case, Water Resour. Res., 50, 5003–5018, https://doi.org/10.1002/2014wr015874, 2014.
Grünwald, P. D.: The minimum description length principle, The MIT Press, Cambridge, MA, USA, 2007.
Gupta, H. V., Wagener, T., and Liu, Y.: Reconciling theory with observations: elements of a diagnostic approach to model evaluation, Hydrol. Process., 22, 3802–3813, https://doi.org/10.1002/hyp.6989, 2008.
Gupta, H. V., Kling, H., Yilmaz, K. K., and Martinez, G. F.: Decomposition of the mean squared error and NSE performance criteria: Implications for improving hydrological modelling, J. Hydrol., 377, 80–91,
https://doi.org/10.1016/j.jhydrol.2009.08.003, 2009.
Höge, M., Wöhling, T., and Nowak, W.: A Primer for Model Selection:
The Decisive Role of Model Complexity, Water Resour. Res., 54, 1688–1715, https://doi.org/10.1002/2017wr021902, 2018.
Hutton, C., Wagener, T., Freer, J., Han, D., Duffy, C., and Arheimer, B.: Most computational hydrology is not reproducible, so is it really science?,
Water Resour. Res., 52, 7548–7555, https://doi.org/10.1002/2016wr019285, 2016.
Jenerette, G. D., Barron-Gafford, G. A., Guswa, A. J., McDonnell, J. J., and
Villegas, J. C.: Organization of complexity in water limited ecohydrology,
Ecohydrology, 5, 184–199, https://doi.org/10.1002/eco.217, 2012.
Jovanovic, T., Garcia, S., Gall, H., and Mejia, A.: Complexity as a streamflow metric of hydrologic alteration, Stoch. Environ. Res. Risk A., 31, 2107–2119, https://doi.org/10.1007/s00477-016-1315-6, 2017.
Kirchner, J. W.: Getting the right answers for the right reasons: Linking
measurements, analyses, and models to advance the science of hydrology, Water Resour. Res., 42, W03S04, https://doi.org/10.1029/2005wr004362, 2006.
Knuth, K. H.: Optimal data-based binning for histograms and histogram-based
probability density models, Digit. Sig. Process., 95, 102581,
https://doi.org/10.1016/j.dsp.2019.102581, 2019.
Kolmogorov, A. N.: Three approaches to the quantitative definition of information, Int. J. Comput. Math., 2, 157–168, https://doi.org/10.1080/00207166808803030, 1968.
Kullback, S. and Leibler, R. A.: On Information and Sufficiency, Ann. Math.
Statist., 22, 79–86, https://doi.org/10.1214/aoms/1177729694, 1951.
Levin, D. and Syromyatnikov, E.: strace linux syscall tracer, available at: https://strace.io (last access: 2 March 2021), 2018.
McDonnell, J. J., Sivapalan, M., Vache, K., Dunn, S., Grant, G., Haggerty, R., Hinz, C., Hooper, R., Kirchner, J., Roderick, M. L., Selker, J., and
Weiler, M.: Moving beyond heterogeneity and process complexity: A new vision
for watershed hydrology, Water Resour. Res., 43, W07301, https://doi.org/10.1029/2006wr005467, 2007.
Nash, J. E. and Sutcliffe, J. V.: River flow forecasting through conceptual
models part I – A discussion of principles, J. Hydrology, 10, 282–290, 1970.
Orth, R., Staudinger, M., Seneviratne, S. I., Seibert, J., and Zappa, M.:
Does model performance improve with complexity? A case study with three
hydrological models, J. Hydrol., 523, 147–159, https://doi.org/10.1016/j.jhydrol.2015.01.044, 2015.
Ossola, A., Hahs, A. K., and Livesley, S. J.: Habitat complexity influences
fine scale hydrological processes and the incidence of stormwater runoff in
managed urban ecosystems, J. Environ. Manage., 159, 1–10,
https://doi.org/10.1016/j.jenvman.2015.05.002, 2015.
Pande, S. and Moayeri, M.: Hydrological Interpretation of a Statistical Measure of Basin Complexity, Water Resour. Res., 54, 7403–7416,
https://doi.org/10.1029/2018wr022675, 2018.
Pechlivanidis, I. G., Jackson, B., McMillan, H., and Gupta, H. V.: Robust
informational entropy-based descriptors of flow in catchment hydrology,
Hydrolog. Sci. J., 61, 1–18, https://doi.org/10.1080/02626667.2014.983516, 2016.
Perdigão, R. A. P.: Synergistic Dynamic Theory of Complex Coevolutionary
Systems, https://doi.org/10.46337/mdsc.5182, 2018.
Perdigão, R. A. P., Pires, C. A. L., and Hall, J.: Disentangling Nonlinear Spatiotemporal Controls on Precipitation: Dynamic Source Analysis and Predictability, https://doi.org/10.46337/mdsc.5273, 2019.
Rissanen, J.: Information and complexity in statistical modeling, Springer,
New York, USA, 2007.
Ruddell, B. L. and Kumar, P.: Ecohydrologic process networks: 1. Identification, Water Resour. Res., 45, W03419, https://doi.org/10.1029/2008wr007279, 2009.
Sang, Y. F., Wang, D., Wu, J. C., Zhu, Q. P., and Wang, L.: Wavelet-Based
Analysis on the Complexity of Hydrologic Series Data under Multi-Temporal
Scales, Entropy, 13, 195–210, https://doi.org/10.3390/e13010195, 2011.
Schöniger, A., Illman, W. A., Wöhling, T., and Nowak, W.: Finding the right balance between groundwater model complexity and experimental effort via Bayesian model selection, J. Hydrol., 531, 96–110,
https://doi.org/10.1016/j.jhydrol.2015.07.047, 2015.
Schoups, G., van de Giesen, N. C., and Savenije, H. H. G.: Model complexity
control for hydrologic prediction, Water Resour. Res., 44, W00B03,
https://doi.org/10.1029/2008wr006836, 2008.
Schwarz, G.: Estimating the Dimension of a Model, Ann. Statist., 6, 461–464,
https://doi.org/10.1214/aos/1176344136, 1978.
Sivakumar, B. and Singh, V. P.: Hydrologic system complexity and nonlinear
dynamic concepts for a catchment classification framework, Hydrol. Earth Syst. Sci., 16, 4119–4131, https://doi.org/10.5194/hess-16-4119-2012, 2012.
Sivakumar, B., Jayawardena, A. W., and Li, W. K.: Hydrologic complexity and
classification: a simple data reconstruction approach, Hydrol. Process., 21,
2713–2728, https://doi.org/10.1002/hyp.6362, 2007.
Sivapalan, M.: Process complexity at hillslope scale, process simplicity at the watershed scale: is there a connection?, Hydrol. Process., 17, 1037–1041, https://doi.org/10.1002/hyp.5109, 2003.
Solomonoff, R. J.: A formal theory of inductive inference. Part I, Inform. Control, 7, 1–22, https://doi.org/10.1016/S0019-9958(64)90223-2, 1964.
Solomonoff, R. J.: Complexity-based induction systems: Comparisons and
convergence theorems, IEEE T. Inform. Theor., 24, 422–432,
https://doi.org/10.1109/TIT.1978.1055913, 1978.
Weijs, S. V. and Ruddell, B. L.: Debates: Does Information Theory Provide a New Paradigm for Earth Science? Sharper Predictions Using Occam's Digital
Razor, Water Resour. Res., 56, e2019WR026471, https://doi.org/10.1029/2019wr026471, 2020.
Weijs, S. V., Schoups, G., and van de Giesen, N.: Why hydrological predictions should be evaluated using information theory, Hydrol. Earth
Syst. Sci., 14, 2545–2558, https://doi.org/10.5194/hess-14-2545-2010, 2010.
Zhou, Y., Zhang, Q., Li, K., and Chen, X. H.: Hydrological effects of water
reservoirs on hydrological processes in the East River (China) basin:
complexity evaluations based on the multi-scale entropy analysis, Hydrol.
Process., 26, 3253–3262, https://doi.org/10.1002/hyp.8406, 2012.
Short summary
Computer models should be as simple as possible but not simpler. Simplicity refers to the length of the model and the effort it takes the model to generate its output. Here we present a practical technique for measuring the latter by the number of memory visits during model execution by
Strace, a troubleshooting and monitoring program. The advantage of this approach is that it can be applied to any computer-based model, which facilitates model intercomparison.
Computer models should be as simple as possible but not simpler. Simplicity refers to the length...