Articles | Volume 30, issue 11
https://doi.org/10.5194/hess-30-3425-2026
© Author(s) 2026. This work is distributed under
the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
https://doi.org/10.5194/hess-30-3425-2026
© Author(s) 2026. This work is distributed under
the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Symbolic regression-based regionalization of baseflow separation parameter using catchment-scale characteristics
Yongen Lin
School of Geography and Planning, Sun Yat-sen University, Guangzhou, China
Carbon-Water Research Station in Karst Regions of Northern Guangdong, Sun Yat-sen University, Guangzhou, China
School of Geography and Planning, Sun Yat-sen University, Guangzhou, China
Guangdong Key Laboratory for Urbanization and Geo-simulation, Sun Yat-sen University, Guangzhou, China
Carbon-Water Research Station in Karst Regions of Northern Guangdong, Sun Yat-sen University, Guangzhou, China
Yiwen Mei
Carbon-Water Research Station in Karst Regions of Northern Guangdong, Sun Yat-sen University, Guangzhou, China
Jinxin Zhu
School of Geography and Planning, Sun Yat-sen University, Guangzhou, China
Carbon-Water Research Station in Karst Regions of Northern Guangdong, Sun Yat-sen University, Guangzhou, China
Huan Wu
School of Atmospheric Science, Sun Yat-sen University, Zhuhai, Guangdong, China
State Key Laboratory of Climate System Prediction and Risk Management, Nanjing University of Information Science and Technology, Nanjing, China
Shuo Wang
Department of Land Surveying and Geo-informatics, Hong Kong Polytechnic University, Kowloon, Hong Kong Special Administrative Region, China
Zhonghou Xu
Earth Sciences New Zealand, Hamilton 3216, New Zealand
Asaad Y. Shamseldin
Department of Civil and Environmental Engineering, The University of Auckland, Auckland 1010, New Zealand
deceased
Emmanouil N. Anagnostou
Department of Civil and Environmental Engineering, University of Connecticut, Storrs, USA
Related authors
No articles found.
Joë Pelmard, Emily Margaret Lane, Alice Harang, Cyprien Bosserelle, Zhonghou Xu, Rose Pearson, Trevor Carey-Smith, Graeme Smart, Jochen Bind, Raghunathan Senthil Kumar, Elvira Cheng, Giang Bui, Ryan Paulik, Conrad Zorn, and Liam Wotherspoon
Earth Syst. Sci. Data Discuss., https://doi.org/10.5194/essd-2026-56, https://doi.org/10.5194/essd-2026-56, 2026
Preprint under review for ESSD
Short summary
Short summary
This study presents a dataset of simulations reconstructing February 2023 floods caused by ex-Tropical Cyclone Gabrielle in eastern Aotearoa New Zealand. Combining rainfall and landscape data and numerical modelling, the dataset gives insights into how flooding developed over space and time across the modeled regions. The results aim to support researchers, communities and decision-makers in post-event analysis, risk assessment and planning, to better prepare for extreme weather events.
Ying Hu, Huan Wu, Yiwen Mei, Zhijun Huang, Aihui Wang, Chaoqun Li, and Bing Sui
EGUsphere, https://doi.org/10.5194/egusphere-2025-5032, https://doi.org/10.5194/egusphere-2025-5032, 2025
Short summary
Short summary
Our analysis of United States river data explains why more heavy rain has not led to more flash floods. Only 17.3 % of basins see more frequent events, and 6.5 % see sharper, more intense flash floods. A key finding is that the warming climate counteracts rainfall's impact, reducing potential increases in flood frequency by 3.6 % and intensity by 8.0 % since the 1980s. While national land cover has remained stable, local urbanization remains a crucial factor driving flood risk.
Lorenzo Alfieri, Andrea Libertino, Lorenzo Campo, Francesco Dottori, Simone Gabellani, Tatiana Ghizzoni, Alessandro Masoero, Lauro Rossi, Roberto Rudari, Nicola Testa, Eva Trasforini, Ahmed Amdihun, Jully Ouma, Luca Rossi, Yves Tramblay, Huan Wu, and Marco Massabò
Nat. Hazards Earth Syst. Sci., 24, 199–224, https://doi.org/10.5194/nhess-24-199-2024, https://doi.org/10.5194/nhess-24-199-2024, 2024
Short summary
Short summary
This work describes Flood-PROOFS East Africa, an impact-based flood forecasting system for the Greater Horn of Africa. It is based on hydrological simulations, inundation mapping, and estimation of population and assets exposed to upcoming river floods. The system supports duty officers in African institutions in the daily monitoring of hydro-meteorological disasters. A first evaluation shows the system performance for the catastrophic floods in the Nile River basin in summer 2020.
Qingliang Li, Gaosong Shi, Wei Shangguan, Vahid Nourani, Jianduo Li, Lu Li, Feini Huang, Ye Zhang, Chunyan Wang, Dagang Wang, Jianxiu Qiu, Xingjie Lu, and Yongjiu Dai
Earth Syst. Sci. Data, 14, 5267–5286, https://doi.org/10.5194/essd-14-5267-2022, https://doi.org/10.5194/essd-14-5267-2022, 2022
Short summary
Short summary
SMCI1.0 is a 1 km resolution dataset of daily soil moisture over China for 2000–2020 derived through machine learning trained with in situ measurements of 1789 stations, meteorological forcings, and land surface variables. It contains 10 soil layers with 10 cm intervals up to 100 cm deep. Evaluated by in situ data, the error (ubRMSE) ranges from 0.045 to 0.051, and the correlation (R) range is 0.866-0.893. Compared with ERA5-Land, SMAP-L4, and SoMo.ml, SIMI1.0 has higher accuracy and resolution.
Lorenzo Alfieri, Francesco Avanzi, Fabio Delogu, Simone Gabellani, Giulia Bruno, Lorenzo Campo, Andrea Libertino, Christian Massari, Angelica Tarpanelli, Dominik Rains, Diego G. Miralles, Raphael Quast, Mariette Vreugdenhil, Huan Wu, and Luca Brocca
Hydrol. Earth Syst. Sci., 26, 3921–3939, https://doi.org/10.5194/hess-26-3921-2022, https://doi.org/10.5194/hess-26-3921-2022, 2022
Short summary
Short summary
This work shows advances in high-resolution satellite data for hydrology. We performed hydrological simulations for the Po River basin using various satellite products, including precipitation, evaporation, soil moisture, and snow depth. Evaporation and snow depth improved a simulation based on high-quality ground observations. Interestingly, a model calibration relying on satellite data skillfully reproduces observed discharges, paving the way to satellite-driven hydrological applications.
Cited articles
Aksoy, H., Unal, N. E., and Pektas, A. O.: Smoothed minima baseflow separation tool for perennial and intermittent streams, Hydrol. Process., 22, 4467–4476, https://doi.org/10.1002/hyp.7077, 2008.
Apley, D. W. and Zhu, J.: Visualizing the Effects of Predictor Variables in Black Box Supervised Learning Models, J. Roy. Stat. Soc. B, 82, 1059–1086, 10.1111/rssb.12377, 2020.
Barnhart, T. B., Molotch, N. P., Livneh, B., Harpold, A. A., Knowles, J. F., and Schneider, D.: Snowmelt rate dictates streamflow, Geophys. Res. Lett., 43, 8006–8016, https://doi.org/10.1002/2016GL069690, 2016.
Beven, K. and Germann, P.: Macropores and water flow in soils revisited, Water Resour. Res., 49, 3071–3092, https://doi.org/10.1002/wrcr.20156, 2013.
Cartwright, I.: Implications of variations in stream specific conductivity for estimating baseflow using chemical mass balance and calibrated hydrograph techniques, Hydrol. Earth Syst. Sci., 26, 183–195, https://doi.org/10.5194/hess-26-183-2022, 2022.
Chadalawada, J., Herath, H. M. V. V., and Babovic, V.: Hydrologically Informed Machine Learning for Rainfall-Runoff Modeling: A Genetic Programming-Based Toolkit for Automatic Model Induction, Water Resour. Res., 56, e2019WR026933, https://doi.org/10.1029/2019WR026933, 2020.
Cranmer, M.: Interpretable machine learning for science with PySR and SymbolicRegression.jl, arXiv [preprint], https://doi.org/10.48550/arXiv.2305.01582, 2023.
Diebold, F. X. and Mariano, R. S.: Comparing Predictive Accuracy, J. Bus. Econ. Stat., 13, 253–263, 10.1080/07350015.1995.10524599, 1995.
Elsken, T., Metzen, J. H., and Hutter, F.: Neural architecture search: a survey, J. Mach. Learn. Res., 20, 1997–2017, 2019.
Feigl, M., Herrnegger, M., Klotz, D., and Schulz, K.: Function Space Optimization: A Symbolic Regression Method for Estimating Parameter Transfer Functions for Hydrological Models, Water Resour. Res., 56, e2020WR027385, 10.1029/2020WR027385, 2020.
Feigl, M., Thober, S., Schweppe, R., Herrnegger, M., Samaniego, L., and Schulz, K.: Automatic Regionalization of Model Parameters for Hydrological Models, Water Resour. Res., 58, e2022WR031966, https://doi.org/10.1029/2022WR031966, 2022.
Feng, D., Fang, K., and Shen, C.: Enhancing Streamflow Forecast and Extracting Insights Using Long-Short Term Memory Networks With Data Integration at Continental Scales, Water Resour. Res., 56, e2019WR026793, https://doi.org/10.1029/2019wr026793, 2020.
Garzon, L. F. L., Johnson, M. F., Mount, N., and Gomez, H.: Exploring the effects of catchment morphometry on overland flow response to extreme rainfall using a 2D hydraulic-hydrological model (IBER), J. Hydrol., 627, 130405, https://doi.org/10.1016/j.jhydrol.2023.130405, 2023.
Gnann, S. J., McMillan, H. K., Woods, R. A., and Howden, N. J. K.: Including Regional Knowledge Improves Baseflow Signature Predictions in Large Sample Hydrology, Water Resour. Res., 57, e2020WR028354, https://doi.org/10.1029/2020WR028354, 2021.
Godsey, S. E., Kirchner, J. W., and Tague, C. L.: Effects of changes in winter snowpacks on summer low flows: case studies in the Sierra Nevada, California, USA, Hydrol. Process., 28, 5048–5064, https://doi.org/10.1002/hyp.9943, 2014.
Gupta, H. V., Kling, H., Yilmaz, K. K., and Martinez, G. F.: Decomposition of the mean squared error and NSE performance criteria: Implications for improving hydrological modelling, J. Hydrol., 377, 80–91, 10.1016/j.jhydrol.2009.08.003, 2009.
Gustard, A., Bullock, A. D., and Dixon, J. M.: Low Flow Estimation in the United Kingdom, Institute of Hydrology, Wallingford, UK, ISBN 0948540451, 1992.
Häfner, D., Gemmrich, J., and Jochum, M.: Machine-guided discovery of a real-world rogue wave model, P. Natl. Acad. Sci. USA, 120, e2306275120, https://doi.org/10.1073/pnas.2306275120, 2023.
Hagedorn, B.: Hydrograph separation through multi objective optimization: Revealing the importance of a temporally and spatially constrained baseflow solute source, J. Hydrol., 590, 125349, https://doi.org/10.1016/j.jhydrol.2020.125349, 2020.
Hou, X., Xie, D., Feng, L., Shen, F., and Nienhuis, J. H.: Sustained increase in suspended sediments near global river deltas over the past two decades, Nat. Commun., 15, 3319, https://doi.org/10.1038/s41467-024-47598-6, 2024.
Humphrey, C. E., Solomon, D. K., Genereux, D. P., Gilmore, T. E., Mittelstet, A. R., Zlotnik, V. A., Zeyrek, C., Jensen, C. R., and MacNamara, M. R.: Using Automated Seepage Meters to Quantify the Spatial Variability and Net Flux of Groundwater to a Stream, Water Resour. Res., 58, e2021WR030711, https://doi.org/10.1029/2021WR030711, 2022.
Jenicek, M. and Ledvinka, O.: Importance of snowmelt contribution to seasonal runoff and summer low flows in Czechia, Hydrol. Earth Syst. Sci., 24, 3475–3491, https://doi.org/10.5194/hess-24-3475-2020, 2020.
Karpatne, A., Jia, X., and Kumar, V.: Knowledge-guided machine learning: Current trends and future prospects, arXiv [preprint], https://doi.org/10.48550/arXiv.2403.15989, 2024.
Klotz, D., Herrnegger, M., and Schulz, K.: Symbolic Regression for the Estimation of Transfer Functions of Hydrological Models, Water Resour. Res., 53, 9402–9423, https://doi.org/10.1002/2017wr021253, 2017.
Koza, J. R.: Genetic programming as a means for programming computers by natural selection, Stat. Comput., 4, 87–112, https://doi.org/10.1007/BF00175355, 1994.
Kratzert, F., Klotz, D., Shalev, G., Klambauer, G., Hochreiter, S., and Nearing, G.: Towards learning universal, regional, and local hydrological behaviors via machine learning applied to large-sample datasets, Hydrol. Earth Syst. Sci., 23, 5089–5110, https://doi.org/10.5194/hess-23-5089-2019, 2019.
Kronberger, G., de Franca, F. O., Burlacu, B., Haider, C., and Kommenda, M.: Shape-Constrained Symbolic Regression – Improving Extrapolation with Prior Knowledge, Evol. Comput., 30, 75–98, https://doi.org/10.1162/evco_a_00294, 2022.
Li, Q., Zhang, C., Wei, Z., Jin, X., Shangguan, W., Yuan, H., Zhu, J., Li, L., Liu, P., Chen, X., Yan, Y., and Dai, Y.: Advancing symbolic regression for earth science with a focus on evapotranspiration modeling, npj Climate and Atmospheric Science, 7, https://doi.org/10.1038/s41612-024-00861-5, 2024.
Li, X., Zhang, J., Li, D., Liu, X., Xu, J., and Yin, J.: UniSymNet: A Unified Symbolic Network Guided by Transformer, arXiv [preprint], https://doi.org/10.48550/arXiv.2505.06091, 2025.
Lin, Y., Mei, Y., and Wang, D.: Data and code archive for “Regionalization of Optimal Baseflow Separation using Catchment-scale Characteristics”, Zenodo [data set/code], https://doi.org/10.5281/zenodo.16924118, 2025.
Lin, Y., Mei, Y., Wang, D., Zhu, J., Wu, H., Wang, S., Yin, G., Gao, L., and N. Anagnostou, E.: Regionalization of Optimal Baseflow Separation using Catchment-scale Characteristics, Water Resour. Res., 62, https://doi.org/10.1029/2025WR040578, 2026.
Makke, N. and Chawla, S.: Interpretable scientific discovery with symbolic regression: a review, Artif. Intell. Rev., 57, https://doi.org/10.1007/s10462-023-10622-0, 2024.
McGlynn, B. L., McDonnell, J. J., Seibert, J., and Kendall, C.: Scale effects on headwater catchment runoff timing, flow sources, and groundwater-streamflow relations, Water Resour. Res., 40, https://doi.org/10.1029/2003WR002494, 2004.
McMahon, T. A. and Nathan, R. J.: Baseflow and transmission loss: A review, WIREs Water, 8, e1527, https://doi.org/10.1002/wat2.1527, 2021.
McMillan, H.: Linking hydrologic signatures to hydrologic processes: A review, Hydrol. Process., 34, 1393–1409, https://doi.org/10.1002/hyp.13632, 2020.
McMillan, H. K., Gnann, S. J., and Araki, R.: Large Scale Evaluation of Relationships Between Hydrologic Signatures and Processes, Water Resour. Res., 58, e2021WR031751, https://doi.org/10.1029/2021WR031751, 2022.
Mei, Y. and Anagnostou, E. N.: A hydrograph separation method based on information from rainfall and runoff records, J. Hydrol., 523, 636–649, https://doi.org/10.1016/j.jhydrol.2015.01.083, 2015.
Mei, Y., Zhu, J., and Wang, D.: Data and Code Archive for “Optimal Baseflow Separation through Chemical Mass Balance: Comparing the Usages of Two Tracers, Two Concentration Estimation Methods, and Four Baseflow Filters”, Zenodo [data set/code], https://doi.org/10.5281/zenodo.8388365, 2024a.
Mei, Y., Wang, D., Zhu, J., Tang, G., Cai, C., Shen, X., Hong, Y., and Zhang, X.: Optimal Baseflow Separation Through Chemical Mass Balance: Comparing the Usages of Two Tracers, Two Concentration Estimation Methods, and Four Baseflow Filters, Water Resour. Res., 60, https://doi.org/10.1029/2023wr036386, 2024b.
Nagy, E. D., Szilagyi, J., and Torma, P.: Calibrating the lyne-hollick filter for baseflow separation based on catchment response time, J. Hydrol., 638, https://doi.org/10.1016/j.jhydrol.2024.131483, 2024.
Noor, K., Marttila, H., Welker, J. M., Mustonen, K.-R., Kløve, B., and Ala-aho, P.: Snow sampling strategy can bias estimation of meltwater fractions in isotope hydrograph separation, J. Hydrol., 627, 130429, https://doi.org/10.1016/j.jhydrol.2023.130429, 2023.
Pelletier, A. and Andréassian, V.: Hydrograph separation: an impartial parametrisation for an imperfect method, Hydrol. Earth Syst. Sci., 24, 1171–1187, https://doi.org/10.5194/hess-24-1171-2020, 2020.
Piggott, A. R., Moin, S., and Southam, C.: A revised approach to the UKIH method for the calculation of baseflow [Une approche améliorée de la méthode de l'UKIH pour le calcul de l'écoulement de base], Hydrolog. Sci. J., 50, 920, https://doi.org/10.1623/hysj.2005.50.5.911, 2005.
Price, K.: Effects of watershed topography, soils, land use, and climate on baseflow hydrology in humid regions: A review, Prog. Phys. Geog., 35, 465–492, https://doi.org/10.1177/0309133311402714, 2011.
Rudin, C.: Stop Explaining Black Box Machine Learning Models for High Stakes Decisions and Use Interpretable Models Instead, Nat. Mach. Intell., 1, 206–215, https://doi.org/10.1038/s42256-019-0048-x, 2019.
Samaniego, L., Kumar, R., and Attinger, S.: Multiscale parameter regionalization of a grid-based hydrologic model at the mesoscale, Water Resour. Res., 46, https://doi.org/10.1029/2008WR007327, 2010.
Sheta, A., Abdel-raouf, A., Fraihat, K., and Baareh, A. K.: Evolutionary Design of a PSO-Tuned Multigene Symbolic Regression Genetic Programming Model for River Flow Forecasting, International Journal of Advanced Computer Science and Applications, 14, 2023, 10.14569/IJACSA.2023.0140489, 2023.
Sólyom, P. B. and Tucker, G. E.: The importance of the catchment area–length relationship in governing non-steady state hydrology, optimal junction angles and drainage network pattern, Geomorphology, 88, 84–108, https://doi.org/10.1016/j.geomorph.2006.10.014, 2007.
Song, W., Jiang, S., Camps-Valls, G., Williams, M., Zhang, L., Reichstein, M., Vereecken, H., He, L., Hu, X., and Shi, L.: Towards data-driven discovery of governing equations in geosciences, Commun. Earth Environ., 5, 589, https://doi.org/10.1038/s43247-024-01760-6, 2024.
Stewart, M. K.: Promising new baseflow separation and recession analysis methods applied to streamflow at Glendhu Catchment, New Zealand, Hydrol. Earth Syst. Sci., 19, 2587–2603, https://doi.org/10.5194/hess-19-2587-2015, 2015.
Stoelzle, M., Schuetz, T., Weiler, M., Stahl, K., and Tallaksen, L. M.: Beyond binary baseflow separation: a delayed-flow index for multiple streamflow contributions, Hydrol. Earth Syst. Sci., 24, 849–867, https://doi.org/10.5194/hess-24-849-2020, 2020.
Sundararajan, M. and Najmi, A.: The Many Shapley Values for Model Explanation, Proceedings of the 37th International Conference on Machine Learning, Proceedings of Machine Learning Research (PMLR), 119, https://proceedings.mlr.press/v119/sundararajan20b.html (last access: 29 May 2026), 2020.
Tan, X., Liu, B., and Tan, X.: Global Changes in Baseflow Under the Impacts of Changing Climate and Vegetation, Water Resour. Res., 56, e2020WR027349, https://doi.org/10.1029/2020WR027349, 2020.
Tarasova, L., Gnann, S., Yang, S., Hartmann, A., and Wagener, T.: Catchment characterization: Current descriptors, knowledge gaps and future opportunities, Earth-Sci. Rev., 252, https://doi.org/10.1016/j.earscirev.2024.104739, 2024.
Thorslund, J. and van Vliet, M. T. H.: A global dataset of surface water and groundwater salinity measurements from 1980–2019, Scientific Data, 7, 231, https://doi.org/10.1038/s41597-020-0562-z, 2020.
Wang, C., Gomez-Velez, J. D., and Wilson, J. L.: Dynamic coevolution of baseflow and multiscale groundwater flow system during prolonged droughts, J. Hydrol., 609, https://doi.org/10.1016/j.jhydrol.2022.127657, 2022.
Wilstrup, C. and Kasak, J.: Symbolic regression outperforms other models for small data sets, arXiv [preprint], https://doi.org/10.48550/arXiv.2103.15147, 2021.
Wu, S., Zhao, J., Wang, H., and Sivapalan, M.: Regional Patterns and Physical Controls of Streamflow Generation Across the Conterminous United States, Water Resour. Res., 57, https://doi.org/10.1029/2020wr028086, 2021.
Xie, J., Liu, X., Tian, W., Wang, K., Bai, P., and Liu, C.: Estimating Gridded Monthly Baseflow From 1981 to 2020 for the Contiguous US Using Long Short-Term Memory (LSTM) Networks, Water Resour. Res., 58, https://doi.org/10.1029/2021wr031663, 2022.
Xie, J., Liu, X., Wang, K., Yang, T., Liang, K., and Liu, C.: Evaluation of typical methods for baseflow separation in the contiguous United States, J. Hydrol., 583, https://doi.org/10.1016/j.jhydrol.2020.124628, 2020.
Xie, J., Liu, X., Jasechko, S., Berghuijs, W. R., Wang, K., Liu, C., Reichstein, M., Jung, M., and Koirala, S.: Majority of global river flow sustained by groundwater, Nat. Geosci., https://doi.org/10.1038/s41561-024-01483-5, 2024.
Yan, X., Sun, J., Huang, Y., Xia, Y., Wang, Z., and Li, Z.: Detecting and attributing the changes in baseflow in China's Loess Plateau, J. Hydrol., 617, 128957, https://doi.org/10.1016/j.jhydrol.2022.128957, 2023.
Zhang, J., Zhang, Y., Song, J., and Cheng, L.: Evaluating relative merits of four baseflow separation methods in Eastern Australia, J. Hydrol., 549, 252–263, https://doi.org/10.1016/j.jhydrol.2017.04.004, 2017.
Zhang, J., Zhang, Y., Song, J., Cheng, L., Kumar Paul, P., Gan, R., Shi, X., Luo, Z., and Zhao, P.: Large-scale baseflow index prediction using hydrological modelling, linear and multilevel regression approaches, J. Hydrol., 585, 124780, https://doi.org/10.1016/j.jhydrol.2020.124780, 2020.
Short summary
Understanding how baseflow contributes to river flow is essential for managing water resources. We studied a widely used method for separating baseflow and found that a key parameter was often estimated too simply. Using symbolic regression and data from 855 catchments, we uncovered new formulas that greatly improve accuracy and reveal how soil, snow, and catchment size jointly influence baseflow estimation.
Understanding how baseflow contributes to river flow is essential for managing water resources....