Articles | Volume 28, issue 9
https://doi.org/10.5194/hess-28-2107-2024
© Author(s) 2024. This work is distributed under
the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
https://doi.org/10.5194/hess-28-2107-2024
© Author(s) 2024. This work is distributed under
the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Enhancing long short-term memory (LSTM)-based streamflow prediction with a spatially distributed approach
Qiutong Yu
CORRESPONDING AUTHOR
Department of Civil and Environmental Engineering, University of Waterloo, Waterloo, ON, Canada
Bryan A. Tolson
Department of Civil and Environmental Engineering, University of Waterloo, Waterloo, ON, Canada
Hongren Shen
Department of Civil and Environmental Engineering, University of Waterloo, Waterloo, ON, Canada
Ming Han
Water Resources, Ontario Power Generation Inc., Niagara Falls, ON, Canada
Juliane Mai
Department of Earth and Environmental Science, University of Waterloo, Waterloo, ON, Canada
Jimmy Lin
David R. Cheriton School of Computer Science, University of Waterloo, Waterloo, ON, Canada
Related authors
No articles found.
Maria Staudinger, Anna Herzog, Ralf Loritz, Tobias Houska, Sandra Pool, Diana Spieler, Paul D. Wagner, Juliane Mai, Jens Kiesel, Stephan Thober, Björn Guse, and Uwe Ehret
Hydrol. Earth Syst. Sci., 29, 5005–5029, https://doi.org/10.5194/hess-29-5005-2025, https://doi.org/10.5194/hess-29-5005-2025, 2025
Short summary
Short summary
Three process-based and four data-driven hydrological models are compared using different training data. We found that process-based models perform better with small datasets but stop learning soon, while data-driven models learn longer. The study highlights the importance of memory in data and the impact of different data sampling methods on model performance. The direct comparison of these models is novel and provides a clear understanding of their performance under various data conditions.
Robert Chlumsky, James R. Craig, and Bryan A. Tolson
Geosci. Model Dev., 18, 3387–3403, https://doi.org/10.5194/gmd-18-3387-2025, https://doi.org/10.5194/gmd-18-3387-2025, 2025
Short summary
Short summary
We aim to improve mapping of floods and present a new method for hydraulic modelling that uses a combination of novel geospatial analysis and existing hydraulic modelling approaches. This method is wrapped into a modelling software called Blackbird. We compared Blackbird with two other existing options for flood mapping and found that the Blackbird model outperformed both. The Blackbird model has the potential to support real-time and large-scale flood mapping applications in the future.
Raoul A. Collenteur, Ezra Haaf, Mark Bakker, Tanja Liesch, Andreas Wunsch, Jenny Soonthornrangsan, Jeremy White, Nick Martin, Rui Hugman, Ed de Sousa, Didier Vanden Berghe, Xinyang Fan, Tim J. Peterson, Jānis Bikše, Antoine Di Ciacca, Xinyue Wang, Yang Zheng, Maximilian Nölscher, Julian Koch, Raphael Schneider, Nikolas Benavides Höglund, Sivarama Krishna Reddy Chidepudi, Abel Henriot, Nicolas Massei, Abderrahim Jardani, Max Gustav Rudolph, Amir Rouhani, J. Jaime Gómez-Hernández, Seifeddine Jomaa, Anna Pölz, Tim Franken, Morteza Behbooei, Jimmy Lin, and Rojin Meysami
Hydrol. Earth Syst. Sci., 28, 5193–5208, https://doi.org/10.5194/hess-28-5193-2024, https://doi.org/10.5194/hess-28-5193-2024, 2024
Short summary
Short summary
We show the results of the 2022 Groundwater Time Series Modelling Challenge; 15 teams applied data-driven models to simulate hydraulic heads, and three model groups were identified: lumped, machine learning, and deep learning. For all wells, reasonable performance was obtained by at least one team from each group. There was not one team that performed best for all wells. In conclusion, the challenge was a successful initiative to compare different models and learn from each other.
Samah Larabi, Juliane Mai, Markus Schnorbus, Bryan A. Tolson, and Francis Zwiers
Hydrol. Earth Syst. Sci., 27, 3241–3263, https://doi.org/10.5194/hess-27-3241-2023, https://doi.org/10.5194/hess-27-3241-2023, 2023
Short summary
Short summary
The computational cost of sensitivity analysis (SA) becomes prohibitive for large hydrologic modeling domains. Here, using a large-scale Variable Infiltration Capacity (VIC) deployment, we show that watershed classification helps identify the spatial pattern of parameter sensitivity within the domain at a reduced cost. Findings reveal the opportunity to leverage climate and land cover attributes to reduce the cost of SA and facilitate more rapid deployment of large-scale land surface models.
Robert Chlumsky, Juliane Mai, James R. Craig, and Bryan A. Tolson
Hydrol. Earth Syst. Sci. Discuss., https://doi.org/10.5194/hess-2023-69, https://doi.org/10.5194/hess-2023-69, 2023
Revised manuscript not accepted
Short summary
Short summary
A blended model allows multiple hydrologic processes to be represented in a single model, which allows for a model to achieve high performance without the need to modify its structure for different catchments. Here, we improve upon the initial blended version by testing more than 30 blended models in twelve catchments to improve the overall model performance. We validate our proposed, updated blended model version with independent catchments, and make this version available for open use.
Richard Arsenault, Jean-Luc Martel, Frédéric Brunet, François Brissette, and Juliane Mai
Hydrol. Earth Syst. Sci., 27, 139–157, https://doi.org/10.5194/hess-27-139-2023, https://doi.org/10.5194/hess-27-139-2023, 2023
Short summary
Short summary
Predicting flow in rivers where no observation records are available is a daunting task. For decades, hydrological models were set up on these gauges, and their parameters were estimated based on the hydrological response of similar or nearby catchments where records exist. New developments in machine learning have now made it possible to estimate flows at ungauged locations more precisely than with hydrological models. This study confirms the performance superiority of machine learning models.
Juliane Mai, Hongren Shen, Bryan A. Tolson, Étienne Gaborit, Richard Arsenault, James R. Craig, Vincent Fortin, Lauren M. Fry, Martin Gauch, Daniel Klotz, Frederik Kratzert, Nicole O'Brien, Daniel G. Princz, Sinan Rasiya Koya, Tirthankar Roy, Frank Seglenieks, Narayan K. Shrestha, André G. T. Temgoua, Vincent Vionnet, and Jonathan W. Waddell
Hydrol. Earth Syst. Sci., 26, 3537–3572, https://doi.org/10.5194/hess-26-3537-2022, https://doi.org/10.5194/hess-26-3537-2022, 2022
Short summary
Short summary
Model intercomparison studies are carried out to test various models and compare the quality of their outputs over the same domain. In this study, 13 diverse model setups using the same input data are evaluated over the Great Lakes region. Various model outputs – such as streamflow, evaporation, soil moisture, and amount of snow on the ground – are compared using standardized methods and metrics. The basin-wise model outputs and observations are made available through an interactive website.
Michelle Viswanathan, Tobias K. D. Weber, Sebastian Gayler, Juliane Mai, and Thilo Streck
Biogeosciences, 19, 2187–2209, https://doi.org/10.5194/bg-19-2187-2022, https://doi.org/10.5194/bg-19-2187-2022, 2022
Short summary
Short summary
We analysed the evolution of model parameter uncertainty and prediction error as we updated parameters of a maize phenology model based on yearly observations, by sequentially applying Bayesian calibration. Although parameter uncertainty was reduced, prediction quality deteriorated when calibration and prediction data were from different maize ripening groups or temperature conditions. The study highlights that Bayesian methods should account for model limitations and inherent data structures.
Nicolas Gasset, Vincent Fortin, Milena Dimitrijevic, Marco Carrera, Bernard Bilodeau, Ryan Muncaster, Étienne Gaborit, Guy Roy, Nedka Pentcheva, Maxim Bulat, Xihong Wang, Radenko Pavlovic, Franck Lespinas, Dikra Khedhaouiria, and Juliane Mai
Hydrol. Earth Syst. Sci., 25, 4917–4945, https://doi.org/10.5194/hess-25-4917-2021, https://doi.org/10.5194/hess-25-4917-2021, 2021
Short summary
Short summary
In this paper, we highlight the importance of including land-data assimilation as well as offline precipitation analysis components in a regional reanalysis system. We also document the performance of the first multidecadal 10 km reanalysis performed with the GEM atmospheric model that can be used for seamless land-surface and hydrological modelling in North America. It is of particular interest for transboundary basins, as existing datasets often show discontinuities at the border.
Martin Gauch, Frederik Kratzert, Daniel Klotz, Grey Nearing, Jimmy Lin, and Sepp Hochreiter
Hydrol. Earth Syst. Sci., 25, 2045–2062, https://doi.org/10.5194/hess-25-2045-2021, https://doi.org/10.5194/hess-25-2045-2021, 2021
Short summary
Short summary
We present multi-timescale Short-Term Memory (MTS-LSTM), a machine learning approach that predicts discharge at multiple timescales within one model. MTS-LSTM is significantly more accurate than the US National Water Model and computationally more efficient than an individual LSTM model per timescale. Further, MTS-LSTM can process different input variables at different timescales, which is important as the lead time of meteorological forecasts often depends on their temporal resolution.
Juliane Mai, James R. Craig, and Bryan A. Tolson
Hydrol. Earth Syst. Sci., 24, 5835–5858, https://doi.org/10.5194/hess-24-5835-2020, https://doi.org/10.5194/hess-24-5835-2020, 2020
Cited articles
Addor, N., Newman, A. J., Mizukami, N., and Clark, M. P.: The CAMELS data set: catchment attributes and meteorology for large-sample studies, Hydrol. Earth Syst. Sci., 21, 5293–5313, https://doi.org/10.5194/hess-21-5293-2017, 2017.
Arsenault, R., Martel, J.-L., Brunet, F., Brissette, F., and Mai, J.: Continuous streamflow prediction in ungauged basins: long short-term memory neural networks clearly outperform traditional hydrological models, Hydrol. Earth Syst. Sci., 27, 139–157, https://doi.org/10.5194/hess-27-139-2023, 2023.
Bindas, T., Tsai, W. P., Liu, J., Rahmani, F., Feng, D., Bian, Y., Lawson, K., and Shen, C.: Improving River Routing Using a Differentiable Muskingum-Cunge Model and Physics-Informed Machine Learning, Water Resour. Res., 60, e2023WR035337, https://doi.org/10.1029/2023WR035337, 2024.
Camera, C., Bruggeman, A., Zittis, G., Sofokleous, I., and Arnault, J.: Simulation of extreme rainfall and streamflow events in small Mediterranean watersheds with a one-way-coupled atmospheric–hydrologic modelling system, Nat. Hazards Earth Syst. Sci., 20, 2791–2810, https://doi.org/10.5194/nhess-20-2791-2020, 2020.
Craig, J. R., Brown, G., Chlumsky, R., Jenkinson, R. W., Jost, G., Lee, K., Mai, J., Serrer, M., Sgro, N., Shafii, M., Snowdon, A. P., and Tolson, B. A.: Flexible watershed simulation with the Raven hydrological modelling framework, Environ. Model. Softw., 129, 104728, https://doi.org/10.1016/J.ENVSOFT.2020.104728, 2020.
Feng, D., Fang, K., and Shen, C.: Enhancing Streamflow Forecast and Extracting Insights Using Long-Short Term Memory Networks With Data Integration at Continental Scales, Water Resour. Res., 56, e2019WR026793, https://doi.org/10.1029/2019WR026793, 2020.
Frame, J. M., Kratzert, F., Raney, A., Rahman, M., Salas, F. R., and Nearing, G. S.: Post-Processing the National Water Model with Long Short-Term Memory Networks for Streamflow Predictions and Model Diagnostics, J. Am. Water Resour. As., 57, 885–905, https://doi.org/10.1111/1752-1688.12964, 2021.
Gasset, N., Fortin, V., Dimitrijevic, M., Carrera, M., Bilodeau, B., Muncaster, R., Gaborit, É., Roy, G., Pentcheva, N., Bulat, M., Wang, X., Pavlovic, R., Lespinas, F., Khedhaouiria, D., and Mai, J.: A 10 km North American precipitation and land-surface reanalysis based on the GEM atmospheric model, Hydrol. Earth Syst. Sci., 25, 4917–4945, https://doi.org/10.5194/hess-25-4917-2021, 2021.
Gauch, M., Kratzert, F., Klotz, D., Nearing, G., Lin, J., and Hochreiter, S.: Rainfall–runoff prediction at multiple timescales with a single Long Short-Term Memory network, Hydrol. Earth Syst. Sci., 25, 2045–2062, https://doi.org/10.5194/hess-25-2045-2021, 2021.
Gupta, H. V., Kling, H., Yilmaz, K. K., and Martinez, G. F.: Decomposition of the mean squared error and NSE performance criteria: Implications for improving hydrological modelling, J. Hydrol. (Amst), 377, 80–91, https://doi.org/10.1016/J.JHYDROL.2009.08.003, 2009.
Han, M., Shen, H., Tolson, B. A., Craig, J. R., Mai, J., Lin, S., Basu, N., and Awol, F.: North American Lake-River Routing Product v2.1, derived by BasinMaker GIS Toolbox, Zenodo [data set], https://doi.org/10.5281/ZENODO.4728185, 2020.
Han, M., Shen, H., Tolson, B. A., Craig, J. R., Mai, J., Lin, S. G. M., Basu, N. B., and Awol, S.: BasinMaker 3.0: A GIS toolbox for distributed watershed delineation of complex lake-river routing networks, Environ. Model. Softw., 164, 105688, https://doi.org/10.1016/J.ENVSOFT.2023.105688, 2023.
Hunt, K. M. R., Matthews, G. R., Pappenberger, F., and Prudhomme, C.: Using a long short-term memory (LSTM) neural network to boost river streamflow forecasts over the western United States, Hydrol. Earth Syst. Sci., 26, 5449–5472, https://doi.org/10.5194/hess-26-5449-2022, 2022.
Knoben, W. J. M., Freer, J. E., and Woods, R. A.: Technical note: Inherent benchmark or not? Comparing Nash–Sutcliffe and Kling–Gupta efficiency scores, Hydrol. Earth Syst. Sci., 23, 4323–4331, https://doi.org/10.5194/hess-23-4323-2019, 2019.
Kratzert, F., Klotz, D., Brenner, C., Schulz, K., and Herrnegger, M.: Rainfall–runoff modelling using Long Short-Term Memory (LSTM) networks, Hydrol. Earth Syst. Sci., 22, 6005–6022, https://doi.org/10.5194/hess-22-6005-2018, 2018.
Kratzert, F., Klotz, D., Shalev, G., Klambauer, G., Hochreiter, S., and Nearing, G.: Towards learning universal, regional, and local hydrological behaviors via machine learning applied to large-sample datasets, Hydrol. Earth Syst. Sci., 23, 5089–5110, https://doi.org/10.5194/hess-23-5089-2019, 2019.
Kratzert, F., Gauch, M., Nearing, G., and Klotz, D.: NeuralHydrology — A Python library for Deep Learning research in hydrology, J. Open Source Softw., 7, 4050, https://doi.org/10.21105/JOSS.04050, 2022.
Kratzert, F., Nearing, G., Addor, N., Erickson, T., Gauch, M., Gilon, O., Gudmundsson, L., Hassidim, A., Klotz, D., Nevo, S., Shalev, G., and Matias, Y.: Caravan – A global community dataset for large-sample hydrology, Sci. Data, 10, 61, https://doi.org/10.1038/S41597-023-01975-W, 2023.
Kratzert, F., Gauch, M., Klotz, D., and Nearing, G.: HESS Opinions: Never train an LSTM on a single basin, Hydrol. Earth Syst. Sci. Discuss. [preprint], https://doi.org/10.5194/hess-2023-275, in review, 2024.
Lees, T., Reece, S., Kratzert, F., Klotz, D., Gauch, M., De Bruijn, J., Kumar Sahu, R., Greve, P., Slater, L., and Dadson, S. J.: Hydrological concept formation inside long short-term memory (LSTM) networks, Hydrol. Earth Syst. Sci., 26, 3079–3101, https://doi.org/10.5194/hess-26-3079-2022, 2022.
Lehner, B., Verdin, K., and Jarvis, A.: New global hydrography derived from spaceborne elevation data, Eos T. Am. Geophys. Un., 89, 93–94, https://doi.org/10.1029/2008EO100001, 2008.
Liu, W., Yang, T., Sun, F., Wang, H., Feng, Y., and Du, M.: Observation-Constrained Projection of Global Flood Magnitudes With Anthropogenic Warming, Water Resour. Res., 57, e2020WR028830, https://doi.org/10.1029/2020WR028830, 2021.
Mai, J., Kornelsen, K. C., Tolson, B. A., Fortin, V., Gasset, N., Bouhemhem, D., Schäfer, D., Leahy, M., Anctil, F., and Coulibaly, P.: The Canadian Surface Prediction Archive (CaSPAr): A Platform to Enhance Environmental Modeling in Canada and Globally, B. Am. Meteorol. Soc., 101, E341–E356, https://doi.org/10.1175/BAMS-D-19-0143.1, 2020.
Mai, J., Shen, H., Tolson, B. A., Gaborit, É., Arsenault, R., Craig, J. R., Fortin, V., Fry, L. M., Gauch, M., Klotz, D., Kratzert, F., O'Brien, N., Princz, D. G., Rasiya Koya, S., Roy, T., Seglenieks, F., Shrestha, N. K., Temgoua, A. G. T., Vionnet, V., and Waddell, J. W.: The Great Lakes Runoff Intercomparison Project Phase 4: the Great Lakes (GRIP-GL), Hydrol. Earth Syst. Sci., 26, 3537–3572, https://doi.org/10.5194/hess-26-3537-2022, 2022a.
Mai, J., Shen, H., Tolson, B., Gaborit, É., Arsenault, R., Craig, J., Fortin, V., Fry, L., Gauch, M., Klotz, D., Kratzert, F., O'Brien, N., Princz, D., Rasiya Koya, S., Roy, T., Seglenieks, F., Shrestha, N., Temgoua, A., Vionnet, V., and Waddell, J.: The Great Lakes Runoff Intercomparison Project Phase 4: The Great Lakes (GRIP-GL), Federated Research Data Repository [code and data set], https://doi.org/10.20383/103.0598, 2022b.
Messager, M. L., Lehner, B., Grill, G., Nedeva, I., and Schmitt, O.: Estimating the volume and age of water stored in global lakes using a geo-statistical approach, Nat. Commun., 7, 1–11, https://doi.org/10.1038/ncomms13603, 2016.
Nevo, S., Morin, E., Gerzi Rosenthal, A., Metzger, A., Barshai, C., Weitzner, D., Voloshin, D., Kratzert, F., Elidan, G., Dror, G., Begelman, G., Nearing, G., Shalev, G., Noga, H., Shavitt, I., Yuklea, L., Royz, M., Giladi, N., Peled Levi, N., Reich, O., Gilon, O., Maor, R., Timnat, S., Shechter, T., Anisimov, V., Gigi, Y., Levin, Y., Moshe, Z., Ben-Haim, Z., Hassidim, A., and Matias, Y.: Flood forecasting with machine learning models in an operational framework, Hydrol. Earth Syst. Sci., 26, 4013–4032, https://doi.org/10.5194/hess-26-4013-2022, 2022.
Newman, A. J., Mizukami, N., Clark, M. P., Wood, A. W., Nijssen, B., and Nearing, G.: Benchmarking of a Physically Based Hydrologic Model, J. Hydrometeorol., 18, 2215–2225, https://doi.org/10.1175/JHM-D-16-0284.1, 2017.
Pokharel, S., Roy, T., and Admiraal, D.: Effects of mass balance, energy balance, and storage-discharge constraints on LSTM for streamflow prediction, Environ. Model. Softw., 166, 105730, https://doi.org/10.1016/J.ENVSOFT.2023.105730, 2023.
Shangguan, W., Dai, Y., Duan, Q., Liu, B., and Yuan, H.: A global soil data set for earth system modeling, J. Adv. Model. Earth Sy., 6, 249–263, https://doi.org/10.1002/2013MS000293, 2014.
Slater, L. J., Arnal, L., Boucher, M.-A., Chang, A. Y.-Y., Moulds, S., Murphy, C., Nearing, G., Shalev, G., Shen, C., Speight, L., Villarini, G., Wilby, R. L., Wood, A., and Zappa, M.: Hybrid forecasting: blending climate predictions with AI models, Hydrol. Earth Syst. Sci., 27, 1865–1889, https://doi.org/10.5194/hess-27-1865-2023, 2023.
Tang, S., Sun, F., Liu, W., Wang, H., Feng, Y., and Li, Z.: Optimal Postprocessing Strategies With LSTM for Global Streamflow Prediction in Ungauged Basins, Water Resour. Res., 59, e2022WR034352, https://doi.org/10.1029/2022WR034352, 2023.
Tao, L., He, X., Li, J., and Yang, D.: A multiscale long short-term memory model with attention mechanism for improving monthly precipitation prediction, J. Hydrol. (Amst), 602, 126815, https://doi.org/10.1016/J.JHYDROL.2021.126815, 2021.
Wang, Y. and Karimi, H. A.: Impact of spatial distribution information of rainfall in runoff simulation using deep learning method, Hydrol. Earth Syst. Sci., 26, 2387–2403, https://doi.org/10.5194/hess-26-2387-2022, 2022.
Wu, Z., Yin, H., He, H., and Li, Y.: Dynamic-LSTM hybrid models to improve seasonal drought predictions over China, J. Hydrol. (Amst), 615, 128706, https://doi.org/10.1016/J.JHYDROL.2022.128706, 2022.
Xie, J., Liu, X., Tian, W., Wang, K., Bai, P., and Liu, C.: Estimating Gridded Monthly Baseflow From 1981 to 2020 for the Contiguous US Using Long Short-Term Memory (LSTM) Networks, Water Resour. Res., 58, e2021WR031663, https://doi.org/10.1029/2021WR031663, 2022.
Yamazaki, D., Ikeshima, D., Sosa, J., Bates, P. D., Allen, G. H., and Pavelsky, T. M.: MERIT Hydro: A High-Resolution Global Hydrography Map Based on Latest Topography Dataset, Water Resour. Res., 55, 5053–5073, https://doi.org/10.1029/2019WR024873, 2019.
Yu, Q.: SR-LSTM: v1.0, Zenodo [code], https://doi.org/10.5281/zenodo.11115929, 2024.
Short summary
It is challenging to incorporate input variables' spatial distribution information when implementing long short-term memory (LSTM) models for streamflow prediction. This work presents a novel hybrid modelling approach to predict streamflow while accounting for spatial variability. We evaluated the performance against lumped LSTM predictions in 224 basins across the Great Lakes region in North America. This approach shows promise for predicting streamflow in large, ungauged basin.
It is challenging to incorporate input variables' spatial distribution information when...