Articles | Volume 29, issue 14
https://doi.org/10.5194/hess-29-3055-2025
© Author(s) 2025. This work is distributed under the Creative Commons Attribution 4.0 License.
The value of hydroclimatic teleconnections for snow-based seasonal streamflow forecasting in central Asia
Download
- Final revised paper (published on 18 Jul 2025)
- Supplement to the final revised paper
- Preprint (discussion started on 26 Jul 2024)
Interactive discussion
Status: closed
Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor
| : Report abuse
-
RC1: 'Comment on hess-2024-174', Anonymous Referee #1, 16 Aug 2024
- AC1: 'Reply on RC1', Atabek Umirbekov, 18 Oct 2024
-
RC2: 'Comment on hess-2024-174', Anonymous Referee #2, 02 Sep 2024
- AC2: 'Reply on RC2', Atabek Umirbekov, 18 Oct 2024
Peer review completion
AR – Author's response | RR – Referee report | ED – Editor decision | EF – Editorial file upload
ED: Reconsider after major revisions (further review by editor and referees) (09 Nov 2024) by Hilary McMillan
AR by Atabek Umirbekov on behalf of the Authors (19 Jan 2025)
Author's response
Author's tracked changes
Manuscript
ED: Referee Nomination & Report Request started (20 Jan 2025) by Hilary McMillan
RR by Anonymous Referee #1 (27 Jan 2025)
RR by Anonymous Referee #2 (10 Feb 2025)
ED: Publish subject to minor revisions (review by editor) (11 Feb 2025) by Hilary McMillan
AR by Atabek Umirbekov on behalf of the Authors (28 Feb 2025)
Author's response
Author's tracked changes
Manuscript
ED: Publish as is (28 Mar 2025) by Hilary McMillan
AR by Atabek Umirbekov on behalf of the Authors (11 Apr 2025)
Manuscript
Review of ‘The value of hydroclimatic teleconnections for snow-based seasonal streamflow forecasting,’ Umirbekov et al., HESS Discussions
This submission summarizes the motivation for, and development, implementation, and performance of, a data-driven model for seasonal river runoff volume forecasting in Central Asia. The method uses as predictors a combination of SWE data from existing large-scale operational remote sensing and land surface modeling products, and indices of various atmosphere-ocean circulation patterns, as predictors; it employs a multi-model ensemble model structure, which has some machine learning elements; and is intended to serve as an actual operational forecasting tool for directly supporting water management decision-making.
My overall recommendation is for publication pending minor revisions.
The article will be an excellent contribution to HESS. The paper is succinct and well-organized. The study location considered has been historically understudied, though admittedly not so severely as some other regions of the developing world. The candidate predictor data combinations are geophysically sensible, but also to some degree original in the particular way they are used here. Though it is possible to quibble with certain aspects of the predictor selection process used, it is a reasonable and defensible approach for the task at hand. Its multi-model ensemble philosophy is fully consistent with a large body of evidence demonstrating its value, yet it is only one of a tiny handful of hydrologic modeling examples where several separate data-driven/statistical/machine learning modeling systems are used and their results pooled to form a best estimate, and it is also novel as implemented here. Unlike some research papers claiming to present a forecast model, this article makes a point of clearly confirming that the predictor datasets considered are available going forward on a near-real-time basis, a necessity if a modeling system is actually going to be useful for real-world operational forecasting. The submission also clearly identifies (e.g., lines 400 to 410) where, when, and why each of the candidate predictor datasets are or are not useful for water supply forecasting, which is crucially important for physical credibility of the forecasts and the systems and data generating them, and which is sometimes overlooked in research around data-driven models.
That said, I think a few improvements will be needed before the paper is ready for publication. I hope the following comments will be helpful to the authors if they move forward with submitting a revised manuscript:
1. Though in general the article is well-crafted, some passages are written poorly enough that their meaning is unclear. For example, the wrong word is used, or words are used incorrectly, or elaborate vocabulary or phrasing is used when simpler wording would do.
2. The literature review is not quite adequate. While there seems to be sufficient reference to prior work in the study area, this article is not just a case study, and HESS is an international journal. More broadly, the methods described here need to be placed in the wider and deeper context of previous work, not just locally but globally, in order for readers to understand its contributions and wider implications – the methods used here may be applicable in entirely different regions of the world. The wider literature does not need to be discussed in detail, nor do the methods used in this submission need to be compared against them, but the paper does need to leave some clues for readers about relevant prior publications. What stood out for me is that prior research (and practice) around seasonal water supply forecasting in western Canada and the western US, directly relevant to this study, has not been adequately acknowledged. Some points of particular note are the following (full citations are provided at the end of this review):
2.a. It would probably be helpful to note in the article that the type of seasonal river discharge volume forecast modeling considered here is widely referred to as “water supply forecasting” (WSF) in the western North American operational hydrology and water management communities. They don't need to use the term throughout the article, but just pointing out at the start of the paper that they're working on what is commonly called WSF will help readers connect the study to a large existing body of prior research and practice.
2.b. Contrary to what seems to be implied in this article, given the way certain passages are phrased and the sparseness of literature citations, combining teleconnection indices with snow data as inputs to statistical seasonal water supply forecasting models is neither new nor rare. It appears to have first been implemented decades ago (Garen, 1998) in the large-scale (hundreds of forecast locations) operational forecasting systems of the US Department of Agriculture’s Natural Resources Conservation Service (NRCS) (Perkins et al., 2009), predictions from which are a staple for water managers across the American West. These principal component regression models (Garen, 1992) have used a combination of SWE, accumulated precipitation, and in some cases antecedent streamflow and El Niño-Southern Oscillation indices (Garen, 1998) as predictors of seasonal river flow volumes; these methods have since be adopted widely across western North America by other operational forecast agencies. Furthermore, continued applied R&D on combined use of snowpack observations and teleconnection indices as input variables to statistical seasonal discharge forecast models has been continued by many, such as Gobena et al (2013) in western Canada, and Moradkhani and Meier (2010) and Regonda et al. (2006a, 2006b) in the western US, to give just a few examples. It has also been extended to discovering new climate prediction skill using nonlinear methods or new indices in areas where conventional linear teleconnections are weak, such as southern Oregon and northern California (Kennedy et al., 2009; see also Fleming and Dahlke 2014).
2.c. Though the computational model presented here appears to be novel, major elements of its philosophy and structure are strongly reminiscent of other recent advances in data-driven predictive modeling of seasonal river discharge volumes. Of note here is the multi model machine-learning metasystem (M4), which was developed for and is currently being operationally implemented by the US Department of Agriculture NRCS as its new western US-wide seasonal river discharge volume forecast model. This system has been run using in-situ SWE, precipitation, and antecedent streamflow data, as well as combinations of in-situ and remotely sensed snow data, as predictors (Fleming et al., 2021, 2024). It uses a multi-model ensemble approach in which six data-driven (statistical and machine learning) forecast systems are run independently and the results are pooled to form a best estimate, closely analogous to the modeling philosophy used in this HESS contribution. There are also significant differences between M4 and the method used in this submission, but citing M4 will better-place this HESS article’s contributions in the larger research and applications literature, and provide literature support to the methods the submitted paper uses. By the same token, some related exploratory work on methods for combining outputs from multiple data-driven seasonal river discharg forecast models by Najafi and Moradkhani (2016) should be cited in this regard as well.
2.d. The foregoing are just some examples I happen to be familiar with. I’d suggest that the authors scour the literature for other prior work, including work in other regions globally, that ought to be at least briefly cited in their revised paper.
3. The following are a few additional suggestions for improvements:
3.a. Line 35 and elsewhere: this paper distinguishes between what it calls “dynamic” vs. “statistical” approaches. This jargon tends to be used more in other (broadly related) disciplines like regional climate modeling, with “process-based” vs. “data-driven” being more common in the operational hydrology literature. Also, it’s usually “dynamical” not “dynamic”, and “data-driven” also tends to be preferable to “statistical” today because of the increasing popularity of machine learning techniques (including this submission). The authors can use “dynamical” and “statistical” if they like, but to better orient readers, including the operational water resource forecasting community, to which this article seems to be in part addressed, please provide some synonyms where the terms are first introduced (line 35). It could read something like “generated using either dynamical (process-based, physics-oriented) or statistical (data-driven including machine learning and conventional statistical) modeling approaches” or something similar.
3.b. Line 46, “statistical forecasts of seasonal streamflow often rely solely on accumulated snowpack.” Yes and no. Yes, data on winter-spring seasonal snowpack provides the primary source of predictive skill in data-driven forecast models of spring-summer river runoff volume in snowmelt-dominated rivers. But these models, in both the research literature and (in particular) in operational practice, at least in western North America, also almost always use additional predictor data types. Examples include wintertime accumulated precipitation, early-season precipitation, and at some locations, antecedent streamflow and/or El Niño indices. See point 2.b above.
3.c. Line 50: excellent point!
3.d. Lines 52-53 and elsewhere: if the authors want to call the Apr-Sep target period the “vegetation season,” that’s fine I suppose, but it’s not standard nomenclature. Typically this would be called either the “growing season,” looking at it from an agricultural water supply or broader ecological perspective, or the “runoff season”, looking at it from a hydrological perspective. And given that they call Nov-Mar the “cold season” rather than the “snowpack accumulation season”, it might also be more consistent to simply call Apr-Sept the “warm season.” Overall, “growing season” seems like it might be the best fit here?
3.e. Figure 1: this figure is good, but for a wide international readership, please provide an additional map showing the location of the study area within the larger geographic context of Eurasia.
3.f. Line 124: predictand, not predicant
3.g. Line 125: An 18 year data record – in other words, 18 samples - is pretty short; it’s enough to defensibly create one of these models, but just barely. Commensurate limitations to the authors’ ability to train model parameters and validate model predictions could be viewed as a source of uncertainty in this study; the counterargument, of course, is that with rapid climate change in mountain regions such as this study area, the statistical nonstationarity in a longer data record would have reduced its value anyway. This might be worth a sentence or two here. A brief explanation of why the record doesn’t go back further or continue to the present could be helpful to readers as well. My understanding of the political history of this region isn’t great, but I think this was part of the Soviet Union, which (its grave misdeeds notwithstanding) wasn’t too bad at keeping streamflow records, so one might have been forgiven for guessing that there might be some usable historical data here?
3.h. Lines 129-130: excellent point re: near-real time input data availability – this is a prerequisite for an operational forecasting model, and it’s sometimes overlooked in research articles.
3.i. Lines 173-175: a little more information about the constituent models (“base models”) is needed here. What link function was used in the GLM? And why were linear kernels used in the GP and SVR models? Does this imply that most of the base models are essentially variants of standard, multiple linear regression? If so, what are the pros and cons? Note that work in the western US has shown that the relationships between winter-spring hydroclimatic forcing and spring-summer runoff response in data-driven WSF models range from nearly linear to moderately nonlinear, with clear physical explanations for these inferred functional forms (see Fleming et al., 2021).
3.j. Lines 181-185: in defense of their methodological choice, which has no literature citations attached to it in the submission, the authors might wish to note that LOOCV is standard practice in western US WSF modeling; see references in point 2.b above.
3.k. Lines 187-190: to improve accessibility to a broad readership which may not be uniformly well-versed in machine learning, it might be helpful to add just a sentence or two, with an additional reference or two, explaining the concept of a meta-learner. It might also be helpful, in terms of connecting this concept to prior work in data-driven WSF, to refer to the work of Najafi and Moradkhani (2016) on exploring different methods for creating multi-model ensembles from the predictions of several data-driven models.
3.l. Line 205: in the context of operational hydrologic prediction models, data “assimilation” has a very specific connotation: formal methods for using new observational data, such as observed snowpack, to update the internal states, such as predicted snowpack, of a process-based (dynamical, physics-oriented) streamflow simulation model, often using fairly complex methods like ensemble Kalman filtering. It is not normally used to refer to the use of some particular data type, such as snow data, as an input predictor variable in a data-driven (statistical or machine-learning) streamflow model.
3.m. Lines 287-288: excellent point. The authors might wish to cite literature that backs up this result, such as the excellent overview article of Hagedorn et al. (2005) and the multi-model ensemble WSF modeling article of Fleming et al. (2021).
3.n. Figure 6: this a great illustration! I do have one question though: are all the base models used for the Vaksh and Kashkadarya rivers? It’s hard to tell from the figure panels.
3.o. Line 348, might suggest rephrasing this in a more specific way, such as “suggest that useful near-real time SWE estimates, suitable for operational seasonal river discharge volume forecasting, can be effectively”
3.p. Line 350: “and enlarge during the snow ablation phase” – confusing wording
3.q. Lines 365-370: the entire paragraph (except for the excellent final sentence) is muddled. Please rewrite more simply and clearly.
3.r. Line 373: “confirms this assumption” – what assumption?
3.s. Lines 398, “is assumingly reasoned by their compensation” – this is meaningless, please rewrite.
3.t. Lines 400-410: excellent points.
References:
Fleming SW, Dahlke HE. 2014. Parabolic northern-hemisphere river flow teleconnections to El Niño-Southern Oscillation and the Arctic Oscillation. Environmental Science Letters, 9, 104007, doi:10.1088/1748-9326/9/10/104007.
Fleming SW et al. 2021. Assessing the new Natural Resources Conservation Service water supply forecast model for the American West: a challenging test of explainable, automated, ensemble artificial intelligence. Journal of Hydrology, 602, 126782.
Fleming SW et al. 2024. Leveraging next-generation satellite remote sensing-based snow data to improve seasonal water supply predictions in a practical machine learning-driven river forecast system. Water Resources Research, 60, e2023WR035785, https://doi.org/10.1029/2023WR03578.
Garen DC. 1992. Improved techniques in regression-based streamflow volume forecasting. Journal of Water Resources Planning and Management, 118, 654-669.
Garen DC. 1998. ENSO indicators and long-range climate forecasts: usage in seasonal streamflow volume forecasting in the western United States, American Geophysical Union Fall Conference, San Francisco, CA.
Gobena AK et al. 2013. The role of large-scale climate modes in regional streamflow variability and implications for water supply forecasting: a case study of the Canadian Columbia Basin. Atmosphere-Ocean, 51, 380-391.
Hagedorn R et al., 2005. The rationale behind the success of multi-model ensembles in seasonal forecasting – I. basic concept. Tellus, 57A, 219-233.
Moradkhani H, Meier M. 2010. Long-lead water supply forecast using large-scale climate predictors and independent component analysis. Journal of Hydrologic Engineering, 15, 744-762.
Najafi MR, Moradkhani H. 2016. Ensemble combination of seasonal streamflow forecasts. Journal of Hydrologic Engineering, 21, 10.1061/(ASCE)HE.1943-5584.0001250.
Kennedy AM, Garen DC, Koch RW. 2009. The association between climate teleconnection indices and Upper Klamath seasonal streamflow: Trans-Niño index. Hydrological Processes, 23, 973-984.
Perkins TR et al. 2009. Innovative operational seasonal water supply forecasting technologies. Journal of Soil and Water Conservation, 64, 15-17.
Regonda SK et al. 2006a. A multimodel ensemble forecast framework: application to spring seasonal flows in the Gunnison River Basin. Water Resources Research, 42, doi:10.1029/2005WR004653.
Regonda SK et al. 2006b. A new method to produce categorical streamflow forecasts. Water Resources Research, 42, doi:10.1029/2006WR004984.