Articles | Volume 24, issue 11
https://doi.org/10.5194/hess-24-5491-2020
© Author(s) 2020. This work is distributed under
the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
https://doi.org/10.5194/hess-24-5491-2020
© Author(s) 2020. This work is distributed under
the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Two-stage variational mode decomposition and support vector regression for streamflow forecasting
Ganggang Zuo
State Key Laboratory of Eco-hydraulics in Northwest Arid Region, Xi'an University of Technology, Xi'an, Shaanxi 710048, China
State Key Laboratory of Eco-hydraulics in Northwest Arid Region, Xi'an University of Technology, Xi'an, Shaanxi 710048, China
Ni Wang
CORRESPONDING AUTHOR
State Key Laboratory of Eco-hydraulics in Northwest Arid Region, Xi'an University of Technology, Xi'an, Shaanxi 710048, China
Yani Lian
State Key Laboratory of Eco-hydraulics in Northwest Arid Region, Xi'an University of Technology, Xi'an, Shaanxi 710048, China
Xinxin He
State Key Laboratory of Eco-hydraulics in Northwest Arid Region, Xi'an University of Technology, Xi'an, Shaanxi 710048, China
Cited articles
Abadi, M., Agarwal, A., Barham, P., Brevdo, E., Chen, Z., Citro, C., Corrado, G. S., Davis, A., Dean, J., Devin, M., Ghemawat, S., Goodfellow, I., Harp, A., Irving, G., Isard, M., Jia, Y., Jozefowicz, R., Kaiser, L., Kudlur, M., Levenberg, J., Mane, D., Monga, R., Moore, S., Murray, D., Olah, C., Schuster, M., Shlens, J., Steiner, B., Sutskever, I., Talwar, K., Tucker, P., Vanhoucke, V., Vasudevan, V., Viegas, F., Vinyals, O., Warden, P., Wattenberg, M., Wicke, M., Yu, Y., and Zheng, X.: TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems, arXiv [preprint], arXiv:1603.04467v2, 2016.
Abbott, M. B., Bathurst, J. C., Cunge, J. A., O'Connell, P. E., and
Rasmussen, J.: An introduction to the European Hydrological System –
Systeme Hydrologique Europeen, “SHE”, 1: History and philosophy of a
physically-based, distributed modelling system, J. Hydrol., 87,
45–59, https://doi.org/10.1016/0022-1694(86)90114-9, 1986.
Adamowski, J. and Sun, K.: Development of a coupled wavelet transform and
neural network method for flow forecasting of non-perennial rivers in
semi-arid watersheds, J. Hydrol., 390, 85–91,
https://doi.org/10.1016/j.jhydrol.2010.06.033, 2010.
Ashrafi, M., Chua, L. H. C., Quek, C., and Qin, X.: A fully-online
Neuro-Fuzzy model for flow forecasting in basins with limited data, J. Hydrol., 545, 424–435, https://doi.org/10.1016/j.jhydrol.2016.11.057, 2017.
Bai, Y., Chen, Z., Xie, J., and Li, C.: Daily reservoir inflow forecasting
using multiscale deep feature learning with hybrid models, J. Hydrol., 532, 193–206, https://doi.org/10.1016/j.jhydrol.2015.11.011, 2016.
Bergstra, J., Bardenet, R., Bengio, Y., and Kégl, B.: Algorithms for Hyper-Parameter Optimization, in: Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011, Proceedings of a meeting held 12–14 December 2011, Granada, Spain, edited by: Shawe-Taylor, J., Zemel, R. S., Bartlett, P. L., Pereira, F. C. N., Weinberger, K. Q., 2546–2554, 2011.
Beven, K.: Changing ideas in hydrology – The case of physically-based
models, J. Hydrol., 105, 157–172,
https://doi.org/10.1016/0022-1694(89)90101-7, 1989.
Binley, A. M., Beven, K. J., Calver, A., and Watts, L. G.: Changing
responses in hydrology: Assessing the uncertainty in physically based model
predictions, Water Resour. Res., 27, 1253–1261, https://doi.org/10.1029/91WR00130,
1991.
Castellano-Méndez, M., González-Manteiga, W., Febrero-Bande, M.,
Manuel Prada-Sánchez, J., and Lozano-Calderón, R.: Modelling of the
monthly and daily behaviour of the runoff of the Xallas river using
Box-Jenkins and neural networks methods, J. Hydrol., 296, 38–58,
https://doi.org/10.1016/j.jhydrol.2004.03.011, 2004.
Clark, M. P., Nijssen, B., Lundquist, J. D., Kavetski, D., Rupp, D. E.,
Woods, R. A., Freer, J. E., Gutmann, E. D., Wood, A. W., Brekke, L. D.,
Arnold, J. R., Gochis, D. J., and Rasmussen, R. M.: A unified approach for
process-based hydrologic modeling: 1. Modeling concept, Water Resour. Res.,
51, 2498–2514, https://doi.org/10.1002/2015WR017198, 2015.
D'Arcy, J.: Introducing SSA for Time Series Decomposition, Kaggle,
available at: https://www.kaggle.com/jdarcy/introducing-ssa-for-time-series-decomposition (last access: 20 November 2020), 2018.
Devia, G. K., Ganasri, B. P., and Dwarakish, G. S.: A Review on Hydrological
Models, Aquat. Pr., 4, 1001–1007, https://doi.org/10.1016/j.aqpro.2015.02.126,
2015.
Dragomiretskiy, K. and Zosso, D.: Variational Mode Decomposition, IEEE
Trans. Signal Process., 62, 531–544, https://doi.org/10.1109/TSP.2013.2288675, 2014.
Du, K., Zhao, Y., and Lei, J.: The incorrect usage of singular spectral
analysis and discrete wavelet transform in hybrid models to predict
hydrological time series, J. Hydrol., 552, 44–51,
https://doi.org/10.1016/j.jhydrol.2017.06.019, 2017.
Erdal, H. I. and Karakurt, O.: Advancing monthly streamflow prediction
accuracy of CART models using ensemble learning paradigms, J. Hydrol., 477, 119–128, https://doi.org/10.1016/j.jhydrol.2012.11.015, 2013.
Fang, W., Huang, S., Ren, K., Huang, Q., Huang, G., Cheng, G., and Li, K.:
Examining the applicability of different sampling techniques in the
development of decomposition-based streamflow forecasting models, J. Hydrol., 568, 534–550, https://doi.org/10.1016/j.jhydrol.2018.11.020, 2019.
Gai, L., Nunes, J. P., Baartman, J. E. M., Zhang, H., Wang, F., de Roo, A.,
Ritsema, C. J., and Geissen, V.: Assessing the impact of human interventions
on floods and low flows in the Wei River Basin in China using the LISFLOOD
model, Sci. Total Environ., 653, 1077–1094,
https://doi.org/10.1016/j.scitotenv.2018.10.379, 2019.
Grayson, R. B., Moore, I. D., and McMahon, T. A.: Physically based
hydrologic modeling: 2. Is the concept realistic?, Water Resour. Res., 28,
2659–2666, https://doi.org/10.1029/92WR01259, 1992.
Han, D., Cluckie, I. D., Karbassioun, D., Lawry, J., and Krauskopf, B.:
River Flow Modelling Using Fuzzy Decision Trees, Water Resour. Manag.,
16, 431–445, https://doi.org/10.1023/A:1022251422280, 2002.
Hastie, T., Friedman, J., and Tibshirani, R.: The Elements of Statistical
Learning: Data Mining, Inference, and Prediction, Second, Springer Series in
Statistics, Springer, New York, USA, 2009.
He, X., Luo, J., Zuo, G., and Xie, J.: Daily Runoff Forecasting Using a
Hybrid Model Based on Variational Mode Decomposition and Deep Neural
Networks, Water Resour. Manag., 33, 1571–1590,
https://doi.org/10.1007/s11269-019-2183-x, 2019.
He, X., Luo, J., Li, P., Zuo, G., and Xie, J.: A Hybrid Model Based on
Variational Mode Decomposition and Gradient Boosting Regression Tree for
Monthly Runoff Forecasting, Water Resour. Manag., 34, 865–884,
https://doi.org/10.1007/s11269-020-02483-x, 2020.
He, Z., Wen, X., Liu, H., and Du, J.: A comparative study of artificial
neural network, adaptive neuro fuzzy inference system and support vector
machine for forecasting river flow in the semiarid mountain region, J. Hydrol., 509, 379–386, https://doi.org/10.1016/j.jhydrol.2013.11.054, 2014.
Head, T., Kumar, M., Nahrstaedt, H., Louppe, G., and Shcherbatyi, I.: scikit-optimize/scikit-optimize: Zenodo, https://doi.org/10.5281/ZENODO.1157319, 2020.
Hosseini, S. M. and Mahjouri, N.: Integrating Support Vector Regression and
a geomorphologic Artificial Neural Network for daily rainfall-runoff
modeling, Appl. Soft Comput., 38, 329–345,
https://doi.org/10.1016/j.asoc.2015.09.049, 2016.
Huang, S., Chang, J., Huang, Q., and Chen, Y.: Monthly streamflow prediction
using modified EMD-based support vector machine, J. Hydrol., 511,
764–775, https://doi.org/10.1016/j.jhydrol.2014.01.062, 2014.
Hunter, J. D.: Matplotlib: A 2D Graphics Environment, Comput. Sci. Eng., 9, 90–95, https://doi.org/10.1109/MCSE.2007.55, 2007.
Jiang, R., Wang, Y., Xie, J., Zhao, Y., Li, F., and Wang, X.: Assessment of
extreme precipitation events and their teleconnections to El Niño
Southern Oscillation, a case study in the Wei River Basin of China,
Atmos. Res., 218, 372–384, https://doi.org/10.1016/j.atmosres.2018.12.015,
2019.
Jolliffe, I. T.: Principal Component Analysis, Springer, New York, USA, 2002.
Jung, Y.: Multiple predicting K-fold cross-validation for model selection,
J. Nonparametr. Stat., 30, 197–215,
https://doi.org/10.1080/10485252.2017.1404598, 2018.
Kirchner, J. W.: Getting the right answers for the right reasons: Linking
measurements, analyses, and models to advance the science of hydrology,
Water Resour. Res., 42, 2465, https://doi.org/10.1029/2005WR004362, 2006.
Kisi, O.: Wavelet regression model for short-term streamflow forecasting,
J. Hydrol., 389, 344–353, https://doi.org/10.1016/j.jhydrol.2010.06.013,
2010.
Kratzert, F., Klotz, D., Brenner, C., Schulz, K., and Herrnegger, M.: Rainfall–runoff modelling using Long Short-Term Memory (LSTM) networks, Hydrol. Earth Syst. Sci., 22, 6005–6022, https://doi.org/10.5194/hess-22-6005-2018, 2018.
Li, M., Wang, Q. J., Bennett, J. C., and Robertson, D. E.: A strategy to overcome adverse effects of autoregressive updating of streamflow forecasts, Hydrol. Earth Syst. Sci., 19, 1–15, https://doi.org/10.5194/hess-19-1-2015, 2015.
Liu, Z., Zhou, P., Chen, G., and Guo, L.: Evaluating a coupled discrete
wavelet transform and support vector regression for daily and monthly
streamflow forecasting, J. Hydrol., 519, 2822–2831,
https://doi.org/10.1016/j.jhydrol.2014.06.050, 2014.
Lopez, J. H.: The power of the ADF test, Econ. Lett., 57, 5–10,
https://doi.org/10.1016/S0165-1765(97)81872-1, 1997.
Luo, X., Yuan, X., Zhu, S., Xu, Z., Meng, L., and Peng, J.: A hybrid support
vector regression framework for streamflow forecast, J. Hydrol.,
568, 184–193, https://doi.org/10.1016/j.jhydrol.2018.10.064, 2019.
Maheswaran, R. and Khosa, R.: Wavelets-based non-linear model for real-time
daily flow forecasting in Krishna River, J. Hydroinform., 15,
1022–1041, https://doi.org/10.2166/hydro.2013.135, 2013.
Maity, R., Bhagwat, P. P., and Bhatnagar, A.: Potential of support vector
regression for prediction of monthly streamflow using endogenous property,
Hydrol. Process., 24, 917–923, https://doi.org/10.1002/hyp.7535, 2010.
McKinney, W.: Data Structures for Statistical Computing in Python, in: Proceedings of the 9th Python in Science Conference, edited by: van der Walt, S. and Millman, J., Austin, Texas, USA, 28 June–3 July, 56–61, 2010.
Meng, E., Huang, S., Huang, Q., Fang, W., Wu, L., and Wang, L.: A robust
method for non-stationary streamflow prediction based on improved EMD-SVM
model, J. Hydrol., 568, 462–478,
https://doi.org/10.1016/j.jhydrol.2018.11.015, 2019.
Minka, T. P.: Automatic Choice of Dimensionality for PCA, in: Proceedings of the 13th International Conference on Neural Information Processing Systems, edited by: Todd, L., Thomas, D., Volker, T., Denver, Colorado, USA, January 2000, NIPS’00, MIT Press, Cambridge, MA, USA, 577–583, 2000.
Mohammadi, K., Eslami, H. R., and Kahawita, R.: Parameter estimation of an
ARMA model for river flow forecasting using goal programming, J.
Hydrol., 331, 293–299, https://doi.org/10.1016/j.jhydrol.2006.05.017, 2006.
Mulvaney, T. J.: On the use of self-registering rain and flood gauges in
making observations of the relations of rainfall and of flood discharges in
a given catchment, Proceedings Institution of Civil Engineers, 4, 18–31,
1850.
Musa, A. B.: A comparison of l1-regularizion, PCA, KPCA and ICA for
dimensionality reduction in logistic regression, Int. J. Mach. Learn. Cyber., 5, 861–873, https://doi.org/10.1007/s13042-013-0171-7, 2014.
Nash, J. E. and Sutcliffe, J. V.: River flow forecasting through conceptual
models part I – A discussion of principles, J. Hydrol., 10,
282–290, https://doi.org/10.1016/0022-1694(70)90255-6, 1970.
Newman, A. J., Clark, M. P., Sampson, K., Wood, A., Hay, L. E., Bock, A., Viger, R. J., Blodgett, D., Brekke, L., Arnold, J. R., Hopson, T., and Duan, Q.: Development of a large-sample watershed-scale hydrometeorological data set for the contiguous USA: data set characteristics and assessment of regional variability in hydrologic model performance, Hydrol. Earth Syst. Sci., 19, 209–223, https://doi.org/10.5194/hess-19-209-2015, 2015.
Ng, A.: Machine learning yearning, available at: https://www.deeplearning.ai/machine-learning-yearning/ (last access: 20 November 2020), 2017.
Noori, R., Karbassi, A. R., Moghaddamnia, A., Han, D., Zokaei-Ashtiani, M.
H., Farokhnia, A., and Gousheh, M. G.: Assessment of input variables
determination on the SVM model performance using PCA, Gamma test, and
forward selection techniques for monthly stream flow prediction, J.
Hydrol., 401, 177–189, https://doi.org/10.1016/j.jhydrol.2011.02.021, 2011.
Nourani, V., Komasi, M., and Mano, A.: A Multivariate ANN-Wavelet Approach
for Rainfall–Runoff Modeling, Water Resour. Manag., 23, 2877,
https://doi.org/10.1007/s11269-009-9414-5, 2009.
Paniconi, C. and Putti, M.: Physically based modeling in catchment hydrology
at 50: Survey and outlook, Water Resour. Res., 51, 7090–7129,
https://doi.org/10.1002/2015WR017780, 2015.
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel,
O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J.,
Passos, A., Cournapeau, D., Brucher, M., Perrot, M., and Duchesnay, E.:
Scikit-learn: Machine Learning in Python, J. Mach. Learn.
Res., 12, 2825–2830, 2011.
Quilty, J., Adamowski, J., Khalil, B., and Rathinasamy, M.: Bootstrap
rank-ordered conditional mutual information (broCMI): A nonlinear input
variable selection method for water resources modeling, Water Resour. Res.,
52, 2299–2326, https://doi.org/10.1002/2015WR016959, 2016.
Quilty, J. and Adamowski, J.: Addressing the incorrect usage of
wavelet-based hydrological and water resources forecasting models for
real-world applications with best practices and a new forecasting framework,
J. Hydrol., 563, 336–353, https://doi.org/10.1016/j.jhydrol.2018.05.003,
2018.
Rasouli, K., Hsieh, W. W., and Cannon, A. J.: Daily streamflow forecasting
by machine learning methods with weather and climate inputs, J. Hydrol., 414/415, 284–293, https://doi.org/10.1016/j.jhydrol.2011.10.039, 2012.
Seo, Y., Kim, S., Kisi, O., and Singh, V. P.: Daily water level forecasting
using wavelet decomposition and artificial intelligence techniques, J. Hydrol., 520, 224–243, https://doi.org/10.1016/j.jhydrol.2014.11.050, 2015.
Shahriari, B., Swersky, K., Wang, Z., Adams, R. P., and de Freitas, N.:
Taking the Human Out of the Loop: A Review of Bayesian Optimization, Proc.
IEEE, 104, 148–175, https://doi.org/10.1109/JPROC.2015.2494218, 2016.
Singh, V. P.: Hydrologic modeling: progress and future directions, Geosci.
Lett., 5, 1145, https://doi.org/10.1186/s40562-018-0113-z, 2018.
Sivapragasam, C., Liong, S.-Y., and Pasha, M. F. K.: Rainfall and runoff
forecasting with SSA-SVM approach, J. Hydroinform., 3,
141–152, https://doi.org/10.2166/hydro.2001.0014, 2001.
Solomatine, D. P., Maskey, M., and Shrestha, D. L.: Instance-based learning
compared to other data-driven methods in hydrological forecasting, Hydrol.
Process., 22, 275–287, https://doi.org/10.1002/hyp.6592, 2008.
Stéfan, v. d. W., Colbert, S. C., and Varoquaux, G.: The NumPy Array: A
Structure for Efficient Numerical Computation: A Structure for Efficient
Numerical Computation, Comput. Sci. Eng., 13, 22–30,
https://doi.org/10.1109/MCSE.2011.37, 2011.
Stojković, M., Kostić, S., Plavšić, J., and Prohaska, S.: A
joint stochastic-deterministic approach for long-term and short-term
modelling of monthly flow rates, J. Hydrol., 544, 555–566,
https://doi.org/10.1016/j.jhydrol.2016.11.025, 2017.
Tan, Q.-F., Lei, X.-H., Wang, X., Wang, H., Wen, X., Ji, Y., and Kang,
A.-Q.: An adaptive middle and long-term runoff forecast model using EEMD-ANN
hybrid approach, J. Hydrol., 567, 767–780,
https://doi.org/10.1016/j.jhydrol.2018.01.015, 2018.
Tiwari, M. K. and Chatterjee, C.: Development of an accurate and reliable
hourly flood forecasting model using wavelet-bootstrap-ANN (WBANN) hybrid
approach, J. Hydrol., 394, 458–470,
https://doi.org/10.1016/j.jhydrol.2010.10.001, 2010.
Todini, E.: Hydrological catchment modelling: past, present and future, Hydrol. Earth Syst. Sci., 11, 468–482, https://doi.org/10.5194/hess-11-468-2007, 2007.
Valipour, M., Banihabib, M. E., and Behbahani, S. M. R.: Comparison of the
ARMA, ARIMA, and the autoregressive artificial neural network models in
forecasting the monthly inflow of Dez dam reservoir, J. Hydrol.,
476, 433–441, https://doi.org/10.1016/j.jhydrol.2012.11.017, 2013.
Vapnik, V., Golowich, S., and Smola, A.: Support vector method for function approximation, regression estimation, and signal processing, in: Advances in Neural Information Processing Systems, 9, edited by: Mozer, M., Jordan, M., and Petsche, T., MIT Press, Cambridge, MA, 281–287, 1997.
Woldemeskel, F., McInerney, D., Lerat, J., Thyer, M., Kavetski, D., Shin, D., Tuteja, N., and Kuczera, G.: Evaluating post-processing approaches for monthly and seasonal streamflow forecasts, Hydrol. Earth Syst. Sci., 22, 6257–6278, https://doi.org/10.5194/hess-22-6257-2018, 2018.
Wu, C. L., Chau, K. W., and Li, Y. S.: Predicting monthly streamflow using
data-driven models coupled with data-preprocessing techniques, Water Resour.
Res., 45, 1331, https://doi.org/10.1029/2007WR006737, 2009.
Wu, Z. and Huang, N. E.: Ensemble Empirical Mode Decomposition: a
Noise-Assisted Data Analysis Method, Adv. Adapt. Data Anal., 1, 1–41,
https://doi.org/10.1142/S1793536909000047, 2009.
Xie, T., Zhang, G., Hou, J., Xie, J., Lv, M., and Liu, F.: Hybrid
forecasting model for non-stationary daily runoff series: A case study in
the Han River Basin, China, J. Hydrol., 577, 123915,
https://doi.org/10.1016/j.jhydrol.2019.123915, 2019.
Xu, B., Zhou, F., Li, H., Yan, B., and Liu, Y.: Early fault feature
extraction of bearings based on Teager energy operator and optimal VMD, ISA
T., 86, 249–265, https://doi.org/10.1016/j.isatra.2018.11.010, 2019.
Yaseen, Z. M., Ebtehaj, I., Bonakdari, H., Deo, R. C., Danandeh Mehr, A.,
Mohtar, W. H. M. W., Diop, L., El-Shafie, A., and Singh, V. P.: Novel
approach for streamflow forecasting using a hybrid ANFIS-FFA model, J. Hydrol., 554, 263–276, https://doi.org/10.1016/j.jhydrol.2017.09.007, 2017.
Yu, P.-S., Chen, S.-T., and Chang, I.-F.: Support vector regression for
real-time flood stage forecasting, J. Hydrol., 328, 704–716,
https://doi.org/10.1016/j.jhydrol.2006.01.021, 2006.
Yu, S., Xu, Z., Wu, W., and Zuo, D.: Effect of land use types on stream
water quality under seasonal variation and topographic characteristics in
the Wei River basin, China, Ecol. Indic., 60, 202–212,
https://doi.org/10.1016/j.ecolind.2015.06.029, 2016.
Zhang, X., Peng, Y., Zhang, C., and Wang, B.: Are hybrid models integrated
with data preprocessing techniques suitable for monthly streamflow
forecasting? Some experiment evidences, J. Hydrol., 530, 137–152,
https://doi.org/10.1016/j.jhydrol.2015.09.047, 2015.
Zhang, Y. and Yang, Y.: Cross-validation for selecting a model selection
procedure, J. Econometrics, 187, 95–112,
https://doi.org/10.1016/j.jeconom.2015.02.006, 2015.
Zhao, X.-H. and Chen, X.: Auto Regressive and Ensemble Empirical Mode
Decomposition Hybrid Model for Annual Runoff Forecasting, Water Resour.
Manag., 29, 2913–2926, https://doi.org/10.1007/s11269-015-0977-z, 2015.
Zuo, G.: Code and data for “Two-stage Variational Mode Decomposition and Support Vector Regression for Streamflow Forecasting”, https://doi.org/10.17632/ybfvpgvvsj.4, 2020.
Zuo, G., Luo, J., Wang, N., Lian, Y., and He, X.: Decomposition ensemble
model based on variational mode decomposition and long short-term memory for
streamflow forecasting, J. Hydrol., 585, 124776,
https://doi.org/10.1016/j.jhydrol.2020.124776, 2020.
Zuo, W., Zhang, D., and Wang, K.: Bidirectional PCA with assembled matrix distance metric for image recognition, IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics a publication of the IEEE Systems, Man, and Cybernetics Society, 36, 863–872, https://doi.org/10.1109/TSMCB.2006.872274, 2006.
Short summary
A two-stage variational mode decomposition and support vector regression is designed to reduce the influence of boundary effects without removing or correcting boundary-affected decompositions. The proposed model significantly reduces the boundary effect consequences, saves modeling time and computation resources, barely overfits the calibration samples, and forecasts monthly runoff reasonably well compared to the benchmark models.
A two-stage variational mode decomposition and support vector regression is designed to reduce...