Articles | Volume 30, issue 4
https://doi.org/10.5194/hess-30-1077-2026
https://doi.org/10.5194/hess-30-1077-2026
Research article
 | 
24 Feb 2026
Research article |  | 24 Feb 2026

Sensitivity of hydrological machine learning prediction accuracy to information quantity and quality

Minhyuk Jeung, Younggu Her, Sang-Soo Baek, and Kwangsik Yoon

Related authors

Sensitivity of hydrological machine learning prediction accuracy to information quantity and quality
Minhyuk Jeung, Younggu Her, Sang-Soo Baek, and Kwangsik Yoon
Hydrol. Earth Syst. Sci. Discuss., https://doi.org/10.5194/hess-2024-284,https://doi.org/10.5194/hess-2024-284, 2024
Revised manuscript not accepted
Short summary

Cited articles

Adeola Fashae, O., Abiola Ayorinde, H., Oludapo Olusola, A., and Oluseyi Obateru, R.: Landuse and surface water quality in an emerging urban city, Appl. Water Sci., 9, 25, https://doi.org/10.1007/s13201-019-0903-2, 2019. 
Ahmad, I., Basheri, M., Iqbal, M. J., and Rahim, A.: Performance Comparison of Support Vector Machine, Random Forest, and Extreme Learning Machine for Intrusion Detection, IEEE Access, 6, 33789–33795, https://doi.org/10.1109/ACCESS.2018.2841987, 2018. 
Ahmed, S., Khalid, M., and Akram, U.: A method for short-term wind speed time series forecasting using Support Vector Machine Regression Model, 2017 6th International Conference on Clean Electrical Power (ICCEP), 27–29 June 2017, 190–195, https://doi.org/10.1109/ICCEP.2017.8004814, 2017. 
Aktan, S.: Application of machine learning algorithms for business failure prediction, Invest. Manage. And Financial Inno., 8, 52–65, 2011. 
Al-Mukhtar, M.: Random forest, support vector machine, and neural networks to modelling suspended sediment in Tigris River-Baghdad, Environ. Monit. Assess., 191, 673, https://doi.org/10.1007/s10661-019-7821-5, 2019. 
Download
Short summary
Machine learning (ML) techniques have become widely used due to the availability of large data repositories and advancements in computing resources and methods. Our study explored the connection between a model’s accuracy and the information content of input data. Results showed that the accuracy of three ML models significantly improved when high-quality input data were included. These findings highlight the importance of data quality in ML model training.
Share