Articles | Volume 29, issue 20
https://doi.org/10.5194/hess-29-5453-2025
© Author(s) 2025. This work is distributed under
the Creative Commons Attribution 4.0 License.Improving streamflow simulation through machine learning-powered data integration and its potential for forecasting in the Western U.S.
Download
- Final revised paper (published on 21 Oct 2025)
- Preprint (discussion started on 30 Apr 2025)
Interactive discussion
Status: closed
Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor
| : Report abuse
-
RC1: 'Comment on egusphere-2025-1708', Anonymous Referee #1, 02 Jun 2025
- AC1: 'Reply on RC1', Yuan Yang, 04 Aug 2025
-
RC2: 'Comment on egusphere-2025-1708', Anonymous Referee #2, 23 Jun 2025
- AC2: 'Reply on RC2', Yuan Yang, 04 Aug 2025
Peer review completion
AR: Author's response | RR: Referee report | ED: Editor decision | EF: Editorial file upload
ED: Publish subject to revisions (further review by editor and referees) (08 Aug 2025) by Xing Yuan

AR by Yuan Yang on behalf of the Authors (16 Aug 2025)
Author's response
Author's tracked changes
Manuscript
ED: Referee Nomination & Report Request started (18 Aug 2025) by Xing Yuan
RR by Anonymous Referee #2 (26 Aug 2025)
RR by Anonymous Referee #1 (02 Sep 2025)

ED: Publish as is (04 Sep 2025) by Xing Yuan

AR by Yuan Yang on behalf of the Authors (09 Sep 2025)
This article presents a robust LSTM-based data integration framework for improving streamflow simulation in the Western U.S., through integrating lagged streamflow and SWE observations across daily and monthly timescales. The paper is well-structured, the experiments are comprehensive, and the findings are practically significant. However, several aspects require further clarification and refinement. General comments are as follows:
1. Is there any reference or justification for the criteria used to select snow-dominated basins?
2. In the model input processing, the three types of inputs, which include forcings, attributes and lagged observations, have different dimensionalities. How are these inputs aligned in terms of dimensions before being fed into the LSTM model? Please clarify the specific preprocessing or embedding strategies used to ensure compatibility across these input types.
3. The meaning of Equation (2) is unclear. Does this formulation represent single-step or multi-step prediction? Are the input variables provided in a sliding window? When estimating streamflow at the current time step, are lagged forcings also included, or are only the current forcings used as inputs?
4. Why is the mean of six model simulations used for the model evaluation? How was the number six determined, and can this sample size ensure the representativeness and stability of the evaluation results? Please clarify the rationality.
5. Figure 3 shows that streamflow estimations in several basins exhibit very low or even zero KGE values under different models and temporal scales. Please discuss the possible reasons for such poor model performance in these specific basins.
6. Please provide a more detailed explanation of the statement: “The compaction of FHV was less pronounced than that of FLV, likely due to the shorter timescales of peak flows and their lower dependence on memory compared to low flows.”
7. The paper does not provide any analysis or discussion regarding the KGE spatial patterns over the Western U.S. for experiments at the monthly scale but only evaluation for April to July. Please supplement the corresponding analysis.
8. The paper attributes the limited benefits from daily SWE integration to the prevalence of zero SWE values or potential data quality issues. However, it lacks an in-depth analysis of the error structure of the SWE dataset and its influence on model performance. It is recommended to supplement the current findings with additional analyses using higher-quality SWE datasets and to further investigate this hypothesis to provide stronger support for the explanation.
9. In the paragraph around line 285, it is generally expected that integrating lagged SWE data during the snowmelt seasons should bring certain benefits to snow-dominated regions. However, the paper reports that KGE improvements are minimal and RB performance is even worse when evaluated over all regions, which may lead to biased conclusions. It is recommended to conduct this analysis specifically for snow-dominated regions.
10. The streamflow simulations in the paper are conducted using observed forcings rather than predicted forcings. However, in an operational forecasting mode, predicted forcings is used. Therefore, when applying the proposed method in a forecasting mode, the claimed enhancements such as improving daily streamflow forecasts up to 10 days in advance or monthly forecasts up to six months cannot be guaranteed.
11. In addition to the explanation provided around line 319, another possible reason for the observed phenomenon is that integrating lagged SWE performs poorly in rain-dominated regions, which may lower the overall performance when evaluated across all basins. It is recommended to compare the performance of integrating lagged Q and SWE specifically within snow-dominated regions, and also conduct a comparative analysis within rain-dominated regions.
Specific comments:
(1) Why is Δ|RV−1| used in Figure 8(c) instead of directly showing Δ|RV| values?
(2) Please clearly specify which months are defined as the accumulation season and which are defined as the snowmelt season.
(3) It is recommended to include representative case studies of individual basins in the results section, such as time series plots, rather than relying solely on statistical boxplots.
(4) The results throughout the paper are presented primarily through figures. It is recommended to include data tables to provide a more quantitative presentation of the results.