<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" xml:lang="en" dtd-version="3.0">
  <front>
    <journal-meta><journal-id journal-id-type="publisher">HESS</journal-id><journal-title-group>
    <journal-title>Hydrology and Earth System Sciences</journal-title>
    <abbrev-journal-title abbrev-type="publisher">HESS</abbrev-journal-title><abbrev-journal-title abbrev-type="nlm-ta">Hydrol. Earth Syst. Sci.</abbrev-journal-title>
  </journal-title-group><issn pub-type="epub">1607-7938</issn><publisher>
    <publisher-name>Copernicus Publications</publisher-name>
    <publisher-loc>Göttingen, Germany</publisher-loc>
  </publisher></journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.5194/hess-25-2685-2021</article-id><title-group><article-title>A note on leveraging synergy in multiple meteorological data sets with deep learning for rainfall–runoff modeling</article-title><alt-title>Multi-forcing LSTM networks for streamflow modeling</alt-title>
      </title-group><?xmltex \runningtitle{Multi-forcing LSTM networks for streamflow modeling}?><?xmltex \runningauthor{F. Kratzert et al.}?>
      <contrib-group>
        <contrib contrib-type="author" corresp="yes" rid="aff1">
          <name><surname>Kratzert</surname><given-names>Frederik</given-names></name>
          <email>kratzert@ml.jku.at</email>
        <ext-link>https://orcid.org/0000-0002-8897-7689</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Klotz</surname><given-names>Daniel</given-names></name>
          
        <ext-link>https://orcid.org/0000-0002-9843-6798</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Hochreiter</surname><given-names>Sepp</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="yes" rid="aff2 aff3">
          <name><surname>Nearing</surname><given-names>Grey S.</given-names></name>
          <email>gsnearing@google.com</email>
        </contrib>
        <aff id="aff1"><label>1</label><institution>LIT AI Lab &amp; Institute for Machine Learning, Johannes Kepler University Linz, Linz, Austria</institution>
        </aff>
        <aff id="aff2"><label>2</label><institution>Google Research, Mountain View, CA, United States</institution>
        </aff>
        <aff id="aff3"><label>3</label><institution>Land, Air and Water Resources Department, University of California Davis, Davis, CA, USA</institution>
        </aff>
      </contrib-group>
      <author-notes><corresp id="corr1">Frederik Kratzert (kratzert@ml.jku.at) and Grey S. Nearing (gsnearing@google.com)</corresp></author-notes><pub-date><day>20</day><month>May</month><year>2021</year></pub-date>
      
      <volume>25</volume>
      <issue>5</issue>
      <fpage>2685</fpage><lpage>2703</lpage>
      <history>
        <date date-type="received"><day>13</day><month>May</month><year>2020</year></date>
           <date date-type="rev-request"><day>14</day><month>May</month><year>2020</year></date>
           <date date-type="rev-recd"><day>16</day><month>March</month><year>2021</year></date>
           <date date-type="accepted"><day>5</day><month>April</month><year>2021</year></date>
      </history>
      <permissions>
        <copyright-statement>Copyright: © 2021 Frederik Kratzert et al.</copyright-statement>
        <copyright-year>2021</copyright-year>
      <license license-type="open-access"><license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p></license></permissions><self-uri xlink:href="https://hess.copernicus.org/articles/25/2685/2021/hess-25-2685-2021.html">This article is available from https://hess.copernicus.org/articles/25/2685/2021/hess-25-2685-2021.html</self-uri><self-uri xlink:href="https://hess.copernicus.org/articles/25/2685/2021/hess-25-2685-2021.pdf">The full text article is available as a PDF file from https://hess.copernicus.org/articles/25/2685/2021/hess-25-2685-2021.pdf</self-uri>
      <abstract><title>Abstract</title>
    <p id="d1e124">A deep learning rainfall–runoff model can take multiple meteorological forcing products as input and learn to combine them in spatially and temporally dynamic ways. This is demonstrated with Long Short-Term Memory networks (LSTMs) trained over basins in the continental US, using the Catchment Attributes and Meteorological data set for Large Sample Studies (CAMELS). Using meteorological input from different data products (North American Land Data Assimilation System, NLDAS, Maurer, and Daymet) in a single LSTM significantly improved simulation accuracy relative to using only individual meteorological products. A sensitivity analysis showed that the LSTM combines precipitation products in different ways, depending on location, and also in different ways for the simulation of different parts of the hydrograph.</p>
  </abstract>
    </article-meta>
  </front>
<body>
      

      <?xmltex \hack{\allowdisplaybreaks}?>
<sec id="Ch1.S1" sec-type="intro">
  <label>1</label><title>Introduction</title>
      <p id="d1e138">All meteorological forcing data available for hydrological modeling are subject to errors and uncertainty. While temperature estimates between different data products are frequently similar, precipitation estimates are often subject to large disagreements <xref ref-type="bibr" rid="bib1.bibx6 bib1.bibx51" id="paren.1"><named-content content-type="pre">e.g.,</named-content></xref>. The most accurate precipitation data generally come from in situ gauges, which provide point-based measurements of rainfall events, which are complex spatial processes <xref ref-type="bibr" rid="bib1.bibx30" id="paren.2"><named-content content-type="pre">although, in certain cases, especially related to snow, modeled products might be better; e.g.,</named-content></xref>. However, large-scale hydrological models require spatial data (usually gridded), which are necessarily model-based products resulting from a combination of spatial interpolation and/or satellite retrieval algorithms, and, sometimes, process-based modeling. Every precipitation data product is based on different sets of assumptions that each potentially introduce different types of error and information loss. It is difficult to predict a priori how methodological choices in precipitation modeling or interpolation algorithms might lead to different types of disagreements in the resulting data products <xref ref-type="bibr" rid="bib1.bibx5 bib1.bibx38" id="paren.3"><named-content content-type="pre">e.g.,</named-content></xref>. As an example of the consequences of this difficulty, <xref ref-type="bibr" rid="bib1.bibx6" id="text.4"/> showed that no existing gridded meteorological product is uniformly better than all others over the continental United States (CONUS).</p>
      <p id="d1e159">The primary strategy for dealing with forcing uncertainty in hydrological modeling is to use ensembles of forcing products <xref ref-type="bibr" rid="bib1.bibx7" id="paren.5"><named-content content-type="pre">e.g.,</named-content></xref>. These can be ensembles of opportunity, or they can be drawn from probability distributions, and they can be combined either before (e.g., as precipitation) or after (e.g., as streamflow) being used in one or more hydrological models. In any case, it is generally not straightforward to predict how differences between different forcing products will translate into differences between hydrological model simulations <xref ref-type="bibr" rid="bib1.bibx60 bib1.bibx14 bib1.bibx40" id="paren.6"><named-content content-type="pre">e.g.,</named-content></xref>, and given that data quality among different products varies over space and time, it is difficult to design ensembling strategies that maximize the information or value of the forcing ensembles.</p>
      <p id="d1e172">However, unlike conceptual or process-based hydrological models, machine learning (ML) or deep learning (DL) can use multiple precipitation (and other meteorological) data products simultaneously. This means that it is not necessary<?pagebreak page2686?> to design a priori strategies to combine input forcing data or to combine the outputs of hydrological models forced with different data products. In principle, such models could learn to exploit potential nonlinear synergies in different (imperfect) precipitation data sets or any other type of model input. In particular, deep learning models that are able to learn spatiotemporally heterogeneous behaviors, such as those used by <xref ref-type="bibr" rid="bib1.bibx25 bib1.bibx26" id="text.7"/>, should be able to learn spatiotemporally dynamic effective mixing strategies in the way that they can leverage multiple input products in different locations and under different hydrological conditions. If successful, this could provide a simple and computationally efficient alternative to the ensembling strategies currently used for hydrological modeling.</p>
</sec>
<sec id="Ch1.S2">
  <label>2</label><title>Methods</title>
<sec id="Ch1.S2.SS1">
  <label>2.1</label><title>Data</title>
      <p id="d1e193">This study uses the Catchment Attributes and Meteorological data set for Large Sample Studies <xref ref-type="bibr" rid="bib1.bibx36 bib1.bibx2" id="paren.8"><named-content content-type="pre">CAMELS;</named-content></xref>. CAMELS contains basin-averaged daily meteorological forcing input derived from three different gridded data products for 671 basins across CONUS. The three forcing products are (i) Daymet <xref ref-type="bibr" rid="bib1.bibx50" id="paren.9"/>, (ii) Maurer <xref ref-type="bibr" rid="bib1.bibx32" id="paren.10"/>, and the (iii) North American Land Data Assimilation System <xref ref-type="bibr" rid="bib1.bibx59" id="paren.11"><named-content content-type="pre">NLDAS;</named-content></xref>. The former product has a 1 km <inline-formula><mml:math id="M1" display="inline"><mml:mo>×</mml:mo></mml:math></inline-formula> 1 km spatial resolution, and the latter two have a one-eighth of a degree (approximately 12.5 km <inline-formula><mml:math id="M2" display="inline"><mml:mo>×</mml:mo></mml:math></inline-formula> 12.5 km) spatial resolution. Although CAMELS includes 671 basins, to facilitate a direct comparison of results with previous studies, we used only the subset of 531 basins that were originally chosen for model benchmarking by <xref ref-type="bibr" rid="bib1.bibx37" id="text.12"/>, who removed all basins with an area greater than 2000 km<inline-formula><mml:math id="M3" display="inline"><mml:msup><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:math></inline-formula> and also all basins in which there was a discrepancy of more than 10 % between different methods of calculating the basin area.</p>
      <p id="d1e239"><xref ref-type="bibr" rid="bib1.bibx6" id="text.13"/> conducted a detailed analysis of eight different precipitation and surface temperature (daily max/min) data products, including the three used by CAMELS. Those authors compared gridded precipitation and temperature values to station data, using roughly 4000 weather stations across CONUS. Their findings were that “no data set was `best' everywhere and for all variables we analyzed” and “two products stood out in their overall tendency to be closest to (Maurer) and farthest from (NLDAS2) observed measurements.” Furthermore, they did not find a “clear relationship between the resolution of gridded products and their agreement with observations, either for average conditions … or extremes” and noted that the “high-resolution Daymet … data sets had the largest nationwide mean biases in precipitation.”</p>
      <p id="d1e244">Figure <xref ref-type="fig" rid="Ch1.F1"/> gives an example of disagreement between precipitation products in CAMELS that we hope to capitalize on by training a model with multiple forcing input. This figure shows the noisy relationship between the three precipitation products in a randomly selected basin (U.S. Geological Survey (USGS) ID 07359610). The idea is that DL should be able to mitigate the type of noise shown in the scatterplot in Fig. <xref ref-type="fig" rid="Ch1.F1"/>b.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F1" specific-use="star"><?xmltex \currentcnt{1}?><?xmltex \def\figurename{Figure}?><label>Figure 1</label><caption><p id="d1e254">Illustration of the relationship between three CAMELS precipitation products at a randomly selected basin (USGS ID 07359610). Panel <bold>(a)</bold> shows the first 100 d of precipitation data from all three products during the test period, and panel <bold>(b)</bold> shows scatter between the three products over the full test period. The scatter shown in <bold>(b)</bold> is the data uncertainty that we would like to mitigate. In this particular basin, there appears to be a 1 d shift between Daymet and Maurer, which is common in the CAMELS data set (this shift is apparent in 325 of the 531 basins; see Fig. 2).</p></caption>
          <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://hess.copernicus.org/articles/25/2685/2021/hess-25-2685-2021-f01.png"/>

        </fig>

      <p id="d1e272">Figure <xref ref-type="fig" rid="Ch1.F1"/>a shows a time shift between Daymet and Maurer precipitation in the same basin. This type of shift is common. <xref ref-type="bibr" rid="bib1.bibx6" id="text.14"/>, for example, reported that “[because] gridded products differ in how they define a calendar day (e.g., local time relative to Coordinated Universal Time), appropriate lag correlations were applied through cross-correlation analysis to account for the several-hour offset in daily station data.” We performed a lag-correlation analysis on the precipitation products in CAMELS and found a higher correlation between Daymet and Mauer when Mauer was lagged by 1 d in 325 (of 531) basins. Figure <xref ref-type="fig" rid="Ch1.F2"/> shows the percent difference between lagged vs. non-lagged correlations between Daymet and Maurer.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F2"><?xmltex \currentcnt{2}?><?xmltex \def\figurename{Figure}?><label>Figure 2</label><caption><p id="d1e284">Spatial distributions of lagged vs. non-lagged correlations between Daymet and Maurer test period precipitation. Positive values indicate that the 1 d lagged correlation is higher.</p></caption>
          <?xmltex \igopts{width=199.169291pt}?><graphic xlink:href="https://hess.copernicus.org/articles/25/2685/2021/hess-25-2685-2021-f02.png"/>

        </fig>

      <p id="d1e293">Each of the forcing products in CAMELS includes daily precipitation (millimeters per day) and maximum and minimum daily temperature (degrees Celsius), vapor pressure (Pascal), and surface radiation (watts per square meter). The original CAMELS data set hosted by the US National Center for Atmospheric Research <xref ref-type="bibr" rid="bib1.bibx36" id="paren.15"/> only contains daily mean temperatures for Maurer and NLDAS. CAMELS-relevant Maurer and NLDAS products, with daily minimum and maximum temperatures, are available from our HydroShare DOI (see the data availability section). We used all five meteorological variables from all three data products as input into the models. In addition to the three daily forcing data sets from CAMELS, we used the same 27 catchment attributes as <xref ref-type="bibr" rid="bib1.bibx25 bib1.bibx26" id="text.16"/>, which consist of topography, climate, vegetation, and soil descriptors <xref ref-type="bibr" rid="bib1.bibx1" id="paren.17"/>.
Prior to training any models, all input variables were normalized independently by subtracting the CONUS-wide mean and dividing by the CONUS-wide standard deviation.</p>
</sec>
<sec id="Ch1.S2.SS2">
  <label>2.2</label><title>Models</title>
      <?pagebreak page2687?><p id="d1e313">Long Short-Term Memory networks (LSTMs) are a type of recurrent neural network <xref ref-type="bibr" rid="bib1.bibx15 bib1.bibx17 bib1.bibx12" id="paren.18"/>. LSTMs have a state space that evolve through a set of input–state–output relationships. Gates, which are activated linear functions, control information flows from input and previous states to current state values (called an input gate), from current states to outputs (called an output gate), and also control the timescale of each element of the state vector (called a forget gate). States (called cell states) accumulate and store information over time, much like the states of a dynamical systems model. Technical details of the LSTM architecture have been described in several previous publications in hydrology journals, and we refer the reader to <xref ref-type="bibr" rid="bib1.bibx24" id="text.19"/> for a detailed explanation geared towards hydrologists.</p>
</sec>
<sec id="Ch1.S2.SS3">
  <label>2.3</label><title>Benchmarks</title>
      <p id="d1e330">Because all relevant benchmark models from previous studies (see, e.g., <xref ref-type="bibr" rid="bib1.bibx26" id="altparen.20"/>) were calibrated using only Maurer forcings, we produced a benchmark using the Sacramento Soil Moisture Accounting (SAC-SMA) model with multiple meteorological forcings. Following <xref ref-type="bibr" rid="bib1.bibx37" id="text.21"/>, we calibrated SAC-SMA using the dynamically dimensioned search (DDS) algorithm <xref ref-type="bibr" rid="bib1.bibx53" id="paren.22"/>, implemented in the SPOTPY optimization library <xref ref-type="bibr" rid="bib1.bibx18" id="paren.23"/>, using data from the training period in each basin. SAC-SMA was calibrated separately, <inline-formula><mml:math id="M4" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula> times with <inline-formula><mml:math id="M5" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula> different random seeds, in each basin for each of the three meteorological data products. This resulted in a total of 30 calibrated SAC-SMA models for each basin.</p>
      <p id="d1e370">To check our SAC-SMA calibrations, we compared the performance of our Maurer calibrations against a SAC-SMA model from the benchmark data set calibrated by <xref ref-type="bibr" rid="bib1.bibx37" id="text.24"/>. We used the (paired) Wilcoxon test to test for significance in any difference between the average, per basin, performance scores from our <inline-formula><mml:math id="M6" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula> different SAC-SMA calibrations with Maurer forcings vs. the SAC-SMA calibrations with Maurer forcings from <xref ref-type="bibr" rid="bib1.bibx37" id="text.25"/>. The <inline-formula><mml:math id="M7" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula> value of this test was <inline-formula><mml:math id="M8" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>≈</mml:mo><mml:mn mathvariant="normal">0.9</mml:mn></mml:mrow></mml:math></inline-formula>, meaning no significant difference.</p>
      <p id="d1e410">Results reported in Sect. <xref ref-type="sec" rid="Ch1.S3"/> used a simple average of these 30 SAC-SMA ensembles in each basin, which is what we found to be the most accurate overall. We also tested (not reported) a Bayesian model averaging strategy, with basin-specific likelihood weights chosen according to relative training performance of the SAC-SMA ensemble members, using Gaussian likelihoods with a wide range of variance parameters. We were not able to achieve an overall higher performance in the test period using an ensembling method more sophisticated than equal-weighted averaging. There are possibilities to potentially improve on this benchmark <xref ref-type="bibr" rid="bib1.bibx10 bib1.bibx31" id="paren.26"><named-content content-type="pre">e.g.,</named-content></xref>; however, as will be shown in Sect. <xref ref-type="sec" rid="Ch1.S3"/>, the difference between ensemble averaging and the multi-input LSTMs is large, and we would be surprised if any ensembling strategy could account for this difference.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F3"><?xmltex \currentcnt{3}?><?xmltex \def\figurename{Figure}?><label>Figure 3</label><caption><p id="d1e425">Test period comparison between single-forcing and multiple-forcing LSTM ensembles (<inline-formula><mml:math id="M9" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula>) over 531 CAMELS basins. All differences were statistically significant (<inline-formula><mml:math id="M10" display="inline"><mml:mrow><mml:mi mathvariant="italic">α</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.001</mml:mn></mml:mrow></mml:math></inline-formula>), with the exception of Daymet vs. Maurer (<inline-formula><mml:math id="M11" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>≈</mml:mo><mml:mn mathvariant="normal">0.08</mml:mn></mml:mrow></mml:math></inline-formula>) and NLDAS and Maurer vs. Maurer and Daymet (<inline-formula><mml:math id="M12" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>≈</mml:mo><mml:mn mathvariant="normal">0.4</mml:mn></mml:mrow></mml:math></inline-formula>)</p></caption>
          <?xmltex \igopts{width=227.622047pt}?><graphic xlink:href="https://hess.copernicus.org/articles/25/2685/2021/hess-25-2685-2021-f03.png"/>

        </fig>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T1" specific-use="star"><?xmltex \currentcnt{1}?><label>Table 1</label><caption><p id="d1e485">Description of the performance metrics (top part) and signatures (bottom part) considered in this study. For each signature, we derived a metric by computing the Pearson correlation between the signature of the observed flow and the signature of the simulated flow over all basins. Description of the signatures taken from <xref ref-type="bibr" rid="bib1.bibx3" id="text.27"/></p></caption><oasis:table frame="topbot"><oasis:tgroup cols="3">
     <oasis:colspec colnum="1" colname="col1" align="justify" colwidth="4cm"/>
     <oasis:colspec colnum="2" colname="col2" align="justify" colwidth="7cm"/>
     <oasis:colspec colnum="3" colname="col3" align="justify" colwidth="5cm"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Metric or signature</oasis:entry>
         <oasis:entry colname="col2">Description</oasis:entry>
         <oasis:entry colname="col3">Reference</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">NSE</oasis:entry>
         <oasis:entry colname="col2">Nash–Sutcliffe efficiency</oasis:entry>
         <oasis:entry colname="col3">Eq. (3) in <xref ref-type="bibr" rid="bib1.bibx35" id="text.28"/></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">KGE</oasis:entry>
         <oasis:entry colname="col2">Kling–Gupta efficiency</oasis:entry>
         <oasis:entry colname="col3">Eq. (9) in <xref ref-type="bibr" rid="bib1.bibx13" id="text.29"/></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Pearson <inline-formula><mml:math id="M13" display="inline"><mml:mi>r</mml:mi></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2">Pearson correlation between observed and simulated flow</oasis:entry>
         <oasis:entry colname="col3"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><inline-formula><mml:math id="M14" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula>–NSE</oasis:entry>
         <oasis:entry colname="col2">Ratio of standard deviations of observed and simulated flow</oasis:entry>
         <oasis:entry colname="col3">From Eq. (4) in <xref ref-type="bibr" rid="bib1.bibx13" id="text.30"/></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><inline-formula><mml:math id="M15" display="inline"><mml:mi mathvariant="italic">β</mml:mi></mml:math></inline-formula>–NSE</oasis:entry>
         <oasis:entry colname="col2">Ratio of the means of observed and simulated flow</oasis:entry>
         <oasis:entry colname="col3">From Eq. (10) in <xref ref-type="bibr" rid="bib1.bibx13" id="text.31"/></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">FHV</oasis:entry>
         <oasis:entry colname="col2">Top 2 % peak flow bias</oasis:entry>
         <oasis:entry colname="col3">Eq. (A3) in <xref ref-type="bibr" rid="bib1.bibx61" id="text.32"/></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">FLV</oasis:entry>
         <oasis:entry colname="col2">Bottom 30 % low flow bias</oasis:entry>
         <oasis:entry colname="col3">Eq. (A4) in <xref ref-type="bibr" rid="bib1.bibx61" id="text.33"/></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">FMS</oasis:entry>
         <oasis:entry colname="col2">Bias of the slope of the flow duration curve between the 20th and 80th percentile</oasis:entry>
         <oasis:entry colname="col3">Eq. (A2) <xref ref-type="bibr" rid="bib1.bibx61" id="text.34"/></oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Peak timing</oasis:entry>
         <oasis:entry colname="col2">Mean peak time lag (in days) between observed and simulated peaks</oasis:entry>
         <oasis:entry colname="col3">See Appendix <xref ref-type="sec" rid="App1.Ch1.S2"/></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Baseflow index</oasis:entry>
         <oasis:entry colname="col2">Ratio of mean daily baseflow to mean daily discharge</oasis:entry>
         <oasis:entry colname="col3">
                      <xref ref-type="bibr" rid="bib1.bibx29" id="text.35"/>
                    </oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Half-flow date (HFD) mean</oasis:entry>
         <oasis:entry colname="col2">Mean half-flow date (date on which the cumulative discharge, since October, first reaches half of the annual discharge)</oasis:entry>
         <oasis:entry colname="col3">
                      <xref ref-type="bibr" rid="bib1.bibx9" id="text.36"/>
                    </oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">High flow duration</oasis:entry>
         <oasis:entry colname="col2">Average duration of high-flow events (number of consecutive days <inline-formula><mml:math id="M16" display="inline"><mml:mi mathvariant="italic">&gt;</mml:mi></mml:math></inline-formula> 9 times the median daily flow)</oasis:entry>
         <oasis:entry colname="col3"><xref ref-type="bibr" rid="bib1.bibx8" id="text.37"/>; Table 2 in <xref ref-type="bibr" rid="bib1.bibx58" id="text.38"/></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">High flow frequency</oasis:entry>
         <oasis:entry colname="col2">Frequency of high-flow days (<inline-formula><mml:math id="M17" display="inline"><mml:mi mathvariant="italic">&gt;</mml:mi></mml:math></inline-formula> 9 times the median<?xmltex \hack{\hfill\break}?>daily flow)</oasis:entry>
         <oasis:entry colname="col3"><xref ref-type="bibr" rid="bib1.bibx8" id="text.39"/>; Table 2 in <xref ref-type="bibr" rid="bib1.bibx58" id="text.40"/></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Low flow duration</oasis:entry>
         <oasis:entry colname="col2">Average duration of low-flow events (number of consecutive days with <inline-formula><mml:math id="M18" display="inline"><mml:mi mathvariant="italic">&lt;</mml:mi></mml:math></inline-formula> 0.2 times the mean daily flow)</oasis:entry>
         <oasis:entry colname="col3"><xref ref-type="bibr" rid="bib1.bibx39" id="text.41"/>; Table 2 in <xref ref-type="bibr" rid="bib1.bibx58" id="text.42"/></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Low flow frequency</oasis:entry>
         <oasis:entry colname="col2">Frequency of low-flow days (<inline-formula><mml:math id="M19" display="inline"><mml:mi mathvariant="italic">&lt;</mml:mi></mml:math></inline-formula> 0.2 times the mean daily flow)</oasis:entry>
         <oasis:entry colname="col3"><xref ref-type="bibr" rid="bib1.bibx39" id="text.43"/>; Table 2 in <xref ref-type="bibr" rid="bib1.bibx58" id="text.44"/></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><inline-formula><mml:math id="M20" display="inline"><mml:mrow><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">5</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2">5 % flow quantile (low flow)</oasis:entry>
         <oasis:entry colname="col3"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><inline-formula><mml:math id="M21" display="inline"><mml:mrow><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">95</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2">95 % flow quantile (high flow)</oasis:entry>
         <oasis:entry colname="col3"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><inline-formula><mml:math id="M22" display="inline"><mml:mi>Q</mml:mi></mml:math></inline-formula> mean</oasis:entry>
         <oasis:entry colname="col2">Mean daily discharge</oasis:entry>
         <oasis:entry colname="col3"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Runoff ratio</oasis:entry>
         <oasis:entry colname="col2">Runoff ratio (ratio of mean daily discharge to mean daily precipitation, using Daymet precipitation)</oasis:entry>
         <oasis:entry colname="col3">Eq. (2) in <xref ref-type="bibr" rid="bib1.bibx44" id="text.45"/></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Slope flow duration curve (FDC)</oasis:entry>
         <oasis:entry colname="col2">Slope of the FDC  (between the log-transformed 33rd and 66th streamflow percentiles)</oasis:entry>
         <oasis:entry colname="col3">Eq. (3) in <xref ref-type="bibr" rid="bib1.bibx44" id="text.46"/></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Stream elasticity</oasis:entry>
         <oasis:entry colname="col2">Streamflow precipitation elasticity (sensitivity of streamflow to changes in precipitation at the annual timescale, using Daymet precipitation)</oasis:entry>
         <oasis:entry colname="col3">Eq. (7) in <xref ref-type="bibr" rid="bib1.bibx43" id="text.47"/></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Zero flow frequency</oasis:entry>
         <oasis:entry colname="col2">Frequency of days with zero discharge</oasis:entry>
         <oasis:entry colname="col3"/>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

      <?xmltex \floatpos{t}?><fig id="Ch1.F4" specific-use="star"><?xmltex \currentcnt{4}?><?xmltex \def\figurename{Figure}?><label>Figure 4</label><caption><p id="d1e896">Empirical cumulative density function of the NSE performance over the 531 basins of different SAC-SMA ensembles <bold>(a, c)</bold> and different LSTM ensembles <bold>(b, d)</bold>. Panels <bold>(a, b)</bold> show the entire range of the cumulative density function, while panels  <bold>(c, d)</bold> show the lower range in more detail. The red indicator lines mark the median NSE difference between the worst single-forcing ensemble and the multi-forcing ensemble of the LSTM and SAC-SMA, respectively.</p></caption>
          <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://hess.copernicus.org/articles/25/2685/2021/hess-25-2685-2021-f04.png"/>

        </fig>

</sec>
<sec id="Ch1.S2.SS4">
  <label>2.4</label><title>Experimental design</title>
      <p id="d1e925">We trained <inline-formula><mml:math id="M23" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula> LSTMs using (1) all of the three forcing products together, (2) for each pairwise combination of forcing products (Daymet and Maurer, Daymet and NLDAS, and Maurer and NLDAS), and (3) separately for all three forcing products individually.</p>
      <p id="d1e940">For each of these seven input configurations, we trained an ensemble of <inline-formula><mml:math id="M24" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula> different LSTMs with different randomly initialized weights. We report the statistics from averaging the simulated hydrographs from each of these 10-member ensembles (single model results are provided in Appendix <xref ref-type="sec" rid="App1.Ch1.S1"/>). Ensembles are used to account for the randomness inherent in the training procedure. The importance of using ensembles for this purpose was demonstrated by <xref ref-type="bibr" rid="bib1.bibx26" id="text.48"/>. Notice that ensembles are used here to mitigate a different type of uncertainty than when using<?pagebreak page2688?> ensembles for combining forcing products. In this case, the model learns how to (dynamically) combine forcing products, and ensembles are used for the same reason as proposed by <xref ref-type="bibr" rid="bib1.bibx37" id="text.49"/>, i.e., to account for randomness in the calibration and/or training.</p>
      <p id="d1e963">The training period was from 1 October 1999 to 30 September 2008 (9 years of training data for each catchment), and the test period was 1 October 1989 to 30 September 1999 (10 years of test data for each catchment). A single LSTM was trained on the combined training period of all 531 basins. Similar to previous studies <xref ref-type="bibr" rid="bib1.bibx25 bib1.bibx26" id="paren.50"/>, we used LSTMs with 256 memory cells and a dropout rate of 0.4 (40 %) in the fully connected layer that derives network predictions (streamflow) from LSTM output. All models were trained with a mini-batch size of 256 for 30 epochs, using the Adam optimizer <xref ref-type="bibr" rid="bib1.bibx21" id="paren.51"/> with an initial learning rate of <inline-formula><mml:math id="M25" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>, reduced to <inline-formula><mml:math id="M26" display="inline"><mml:mrow><mml:mn mathvariant="normal">5</mml:mn><mml:mi>e</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">4</mml:mn></mml:mrow></mml:math></inline-formula> after 20 epochs, and further reduced to <inline-formula><mml:math id="M27" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mi>e</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">4</mml:mn></mml:mrow></mml:math></inline-formula> after 25 epochs. All input were standardized to have zero mean and unit variance over all 531 catchments collectively. During model evaluation, negative predictions in the original value space were clipped to zero, i.e., no negative discharges. The loss function was the basin-averaged Nash–Sutcliffe efficiency <xref ref-type="bibr" rid="bib1.bibx26" id="paren.52"><named-content content-type="pre">NSE; see</named-content></xref>.</p>
</sec>
<?pagebreak page2689?><sec id="Ch1.S2.SS5">
  <label>2.5</label><title>Analysis</title>
      <p id="d1e1033">We examined the experiments described above with two types of analyses. The goal is to provide illustrations of how the LSTM leverages multiple forcing products in spatiotemporally dynamic ways.</p>
      <p id="d1e1036"><list list-type="bullet">
            <list-item>

      <p id="d1e1041"><italic>Analysis 1 – feature ablation</italic>. An ablation study removes parts of the network to gain a better understanding of the model. We adopted this procedure by removing the different meteorological forcing products in a step-wise fashion and subsequently comparing results using several performance metrics and hydrologic signatures (see Table <xref ref-type="table" rid="Ch1.T1"/>). To provide context, we also benchmarked the LSTMs against ensembles of SAC-SMA models (see Sect. <xref ref-type="sec" rid="Ch1.S2.SS3"/>).</p>
            </list-item>
            <list-item>

      <p id="d1e1053"><italic>Analysis 2 – sensitivity and contribution</italic>. We performed an input attribution analysis of the trained LSTM models to quantify how the trained LSTMs leverage different forcing products in different places and under different hydrologic conditions. We concentrated the sensitivity analysis on the precipitation input because (i) precipitation is consistently found to be the most important variable in rainfall–runoff modeling, which is also true for LSTMs <xref ref-type="bibr" rid="bib1.bibx11" id="paren.53"><named-content content-type="pre">see</named-content></xref>, and (ii) according to <xref ref-type="bibr" rid="bib1.bibx6" id="text.54"/>, there is little difference in other meteorological variables between these data products.</p>
            </list-item>
          </list></p>
      <p id="d1e1068">In addition, we performed an analysis that correlates estimated uncertainty in different precipitation products with LSTM performance to help understand in what sense the LSTM is using different precipitation data to mitigate data uncertainty directly. This analysis is presented in Appendix <xref ref-type="sec" rid="App1.Ch1.S3"/>.</p>
<sec id="Ch1.S2.SS5.SSS1">
  <label>2.5.1</label><title>Analysis 1 – feature ablation</title>
      <p id="d1e1080">All LSTM ensembles were trained using a squared-error loss function (the average of the basin-specific NSE values); however, we are interested in knowing how the models simulate different aspects of the hydrograph. As such, we report a collection of hydrologically relevant performance metrics outlined in Table <xref ref-type="table" rid="Ch1.T1"/>. These statistics include the standard time-averaged performance metrics (e.g., NSE and KGE) and comparisons between observed and simulated hydrologic signatures. The hydrologic signatures we report are the same ones used by <xref ref-type="bibr" rid="bib1.bibx3" id="text.55"/>. For each hydrologic signature, we computed the Pearson correlation between the signatures derived from observed discharge vs. those from simulated discharge in each basin. Correlation metrics were calculated on simulated vs. observed signatures in all basins.</p>
</sec>
<sec id="Ch1.S2.SS5.SSS2">
  <label>2.5.2</label><title>Analysis 2 – sensitivity and contribution</title>
      <p id="d1e1097">All neural networks (like LSTMs) are differentiable, almost everywhere, by design. Therefore, a gradient-based input contribution analysis seems natural. However, as discussed by <xref ref-type="bibr" rid="bib1.bibx48" id="text.56"/>, the naive solution of using local gradients does not provide reliable measures of sensitivity, since gradients might be flat even if the model response is heavily influenced by a particular input data source <xref ref-type="bibr" rid="bib1.bibx16" id="paren.57"><named-content content-type="pre">which is not necessarily a bad property; see, e.g.,</named-content></xref>. This is especially true in neural networks, where activation functions often include step changes over portions of the input space – for example, the sigmoid and hyperbolic tangent activation functions used by LSTMs have close-to-zero gradients at both extremes <xref ref-type="bibr" rid="bib1.bibx46 bib1.bibx48" id="paren.58"><named-content content-type="pre">see also</named-content></xref>.</p>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T2"><?xmltex \currentcnt{2}?><label>Table 2</label><caption><p id="d1e1116">Values of the benchmarking metrics from Table <xref ref-type="table" rid="Ch1.T1"/>. Bold values indicates the best model (<inline-formula><mml:math id="M28" display="inline"><mml:mrow><mml:mi mathvariant="italic">α</mml:mi><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.05</mml:mn></mml:mrow></mml:math></inline-formula>). Multiple bold values per row indicate no significant difference.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="3">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:thead>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">LSTM all forcing</oasis:entry>
         <oasis:entry colname="col3">SAC-SMA all forcing</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">ensemble (<inline-formula><mml:math id="M49" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col3">ensemble (<inline-formula><mml:math id="M50" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">30</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">NSE<inline-formula><mml:math id="M51" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">a</mml:mi></mml:msup></mml:math></inline-formula> (median)</oasis:entry>
         <oasis:entry colname="col2"><bold>0.821</bold></oasis:entry>
         <oasis:entry colname="col3">0.705</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">NSE<inline-formula><mml:math id="M52" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">a</mml:mi></mml:msup></mml:math></inline-formula> (mean)</oasis:entry>
         <oasis:entry colname="col2"><bold>0.783</bold></oasis:entry>
         <oasis:entry colname="col3">0.673</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">KGE<inline-formula><mml:math id="M53" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">b</mml:mi></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2"><bold>0.801</bold></oasis:entry>
         <oasis:entry colname="col3">0.650</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Pearson <inline-formula><mml:math id="M54" display="inline"><mml:mrow><mml:msup><mml:mi>r</mml:mi><mml:mi mathvariant="normal">c</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2"><bold>0.915</bold></oasis:entry>
         <oasis:entry colname="col3">0.861</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><inline-formula><mml:math id="M55" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula>–NSE<inline-formula><mml:math id="M56" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">d</mml:mi></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2"><bold>0.861</bold></oasis:entry>
         <oasis:entry colname="col3">0.742</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><inline-formula><mml:math id="M57" display="inline"><mml:mi mathvariant="italic">β</mml:mi></mml:math></inline-formula>–NSE<inline-formula><mml:math id="M58" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">e</mml:mi></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M59" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.028</oasis:entry>
         <oasis:entry colname="col3"><bold>0.024</bold></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">FHV<inline-formula><mml:math id="M60" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">f</mml:mi></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M61" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula><bold>13.818</bold></oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M62" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>23.863</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">FLV<inline-formula><mml:math id="M63" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">g</mml:mi></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2"><bold>41.277</bold></oasis:entry>
         <oasis:entry colname="col3"><bold>49.641</bold></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">FMS<inline-formula><mml:math id="M64" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">h</mml:mi></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M65" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula><bold>8.087</bold></oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M66" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>29.418</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Peak timing<inline-formula><mml:math id="M67" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">i</mml:mi></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2"><bold>0.370</bold></oasis:entry>
         <oasis:entry colname="col3">0.552</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table><table-wrap-foot><p id="d1e1133"><inline-formula><mml:math id="M29" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">a</mml:mi></mml:msup></mml:math></inline-formula> Nash–Sutcliffe efficiency; <inline-formula><mml:math id="M30" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mo>-</mml:mo><mml:mi mathvariant="normal">∞</mml:mi><mml:mo>,</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula>; values closer to 1 are desirable.
<inline-formula><mml:math id="M31" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">b</mml:mi></mml:msup></mml:math></inline-formula> Kling–Gupta efficiency; <inline-formula><mml:math id="M32" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mo>-</mml:mo><mml:mi mathvariant="normal">∞</mml:mi><mml:mo>,</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula>; values closer to 1 are desirable.
<inline-formula><mml:math id="M33" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">c</mml:mi></mml:msup></mml:math></inline-formula> Pearson correlation; <inline-formula><mml:math id="M34" display="inline"><mml:mrow><mml:mo>[</mml:mo><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula>; values closer to 1 are desirable.
<inline-formula><mml:math id="M35" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">d</mml:mi></mml:msup></mml:math></inline-formula> <inline-formula><mml:math id="M36" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula>–NSE decomposition; <inline-formula><mml:math id="M37" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo><mml:mi mathvariant="normal">∞</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>; values close to 1 are desirable.
<inline-formula><mml:math id="M38" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">e</mml:mi></mml:msup></mml:math></inline-formula> <inline-formula><mml:math id="M39" display="inline"><mml:mi mathvariant="italic">β</mml:mi></mml:math></inline-formula>–NSE decomposition; <inline-formula><mml:math id="M40" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mo>-</mml:mo><mml:mi mathvariant="normal">∞</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">∞</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>; values close to 0 are desirable.
<inline-formula><mml:math id="M41" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">f</mml:mi></mml:msup></mml:math></inline-formula> Top 2 % peak flow bias; <inline-formula><mml:math id="M42" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mo>-</mml:mo><mml:mi mathvariant="normal">∞</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">∞</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>; values close to 0 are desirable.
<inline-formula><mml:math id="M43" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">g</mml:mi></mml:msup></mml:math></inline-formula> 30 % low flow bias; <inline-formula><mml:math id="M44" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mo>-</mml:mo><mml:mi mathvariant="normal">∞</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">∞</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>; values close to 0 are desirable.
<inline-formula><mml:math id="M45" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">h</mml:mi></mml:msup></mml:math></inline-formula> Bias of FDC mid-segment slope; <inline-formula><mml:math id="M46" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mo>-</mml:mo><mml:mi mathvariant="normal">∞</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">∞</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>; values close to 0 are desirable.
<inline-formula><mml:math id="M47" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">i</mml:mi></mml:msup></mml:math></inline-formula> Lag of peak timing; <inline-formula><mml:math id="M48" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mo>-</mml:mo><mml:mi mathvariant="normal">∞</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">∞</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>; values close to 0 are desirable.</p></table-wrap-foot></table-wrap>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T3"><?xmltex \currentcnt{3}?><label>Table 3</label><caption><p id="d1e1717">Values of the correlation coefficients (over 531 basins) of the simulated vs. observed hydrological signatures from Table <xref ref-type="table" rid="Ch1.T1"/>. Bold values indicate the best model (<inline-formula><mml:math id="M68" display="inline"><mml:mrow><mml:mi mathvariant="italic">α</mml:mi><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.05</mml:mn></mml:mrow></mml:math></inline-formula>). Multiple bold values per row indicate no significant difference.</p></caption><oasis:table frame="topbot"><?xmltex \begin{scaleboxenv}{.92}[.92]?><oasis:tgroup cols="3">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:thead>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">LSTM all forcing</oasis:entry>
         <oasis:entry colname="col3">SAC-SMA all forcing</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">ensemble (<inline-formula><mml:math id="M69" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col3">ensemble (<inline-formula><mml:math id="M70" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">30</mml:mn></mml:mrow></mml:math></inline-formula>)</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">Baseflow index</oasis:entry>
         <oasis:entry colname="col2"><bold>0.93</bold></oasis:entry>
         <oasis:entry colname="col3">0.80</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">HFD mean</oasis:entry>
         <oasis:entry colname="col2"><bold>0.98</bold></oasis:entry>
         <oasis:entry colname="col3"><bold>0.96</bold></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">High flow duration</oasis:entry>
         <oasis:entry colname="col2"><bold>0.84</bold></oasis:entry>
         <oasis:entry colname="col3">0.72</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">High flow frequency</oasis:entry>
         <oasis:entry colname="col2"><bold>0.81</bold></oasis:entry>
         <oasis:entry colname="col3">0.68</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Low flow duration</oasis:entry>
         <oasis:entry colname="col2"><bold>0.50</bold></oasis:entry>
         <oasis:entry colname="col3">0.41</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Low flow frequency</oasis:entry>
         <oasis:entry colname="col2"><bold>0.79</bold></oasis:entry>
         <oasis:entry colname="col3">0.63</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><inline-formula><mml:math id="M71" display="inline"><mml:mrow><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">5</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2"><bold>0.96</bold></oasis:entry>
         <oasis:entry colname="col3">0.90</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><inline-formula><mml:math id="M72" display="inline"><mml:mrow><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">95</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2"><bold>0.99</bold></oasis:entry>
         <oasis:entry colname="col3"><bold>0.99</bold></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><inline-formula><mml:math id="M73" display="inline"><mml:mi>Q</mml:mi></mml:math></inline-formula> mean</oasis:entry>
         <oasis:entry colname="col2"><bold>1.00</bold></oasis:entry>
         <oasis:entry colname="col3">0.99</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Runoff ratio</oasis:entry>
         <oasis:entry colname="col2"><bold>0.99</bold></oasis:entry>
         <oasis:entry colname="col3">0.97</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Slope FDC</oasis:entry>
         <oasis:entry colname="col2"><bold>0.65</bold></oasis:entry>
         <oasis:entry colname="col3">0.62</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Stream elasticity</oasis:entry>
         <oasis:entry colname="col2"><bold>0.72</bold></oasis:entry>
         <oasis:entry colname="col3">0.67</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Zero flow frequency</oasis:entry>
         <oasis:entry colname="col2"><bold>0.03</bold></oasis:entry>
         <oasis:entry colname="col3">NaN</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup><?xmltex \end{scaleboxenv}?></oasis:table></table-wrap>

      <?xmltex \floatpos{t}?><fig id="Ch1.F5" specific-use="star"><?xmltex \currentcnt{5}?><?xmltex \def\figurename{Figure}?><label>Figure 5</label><caption><p id="d1e1995">Spatial distribution of the NSE differences between the three-forcing LSTM, relative to the best single-forcing model in each basin <bold>(a)</bold> and relative to each single-forcing model <bold>(b–d)</bold>. Positive (purple) values represent basins where the three-forcing LSTM improved over the single-forcing LSTM. Negative (brown) values reflect basins where the single-forcing LSTM had a higher NSE than the three-forcing LSTM. In total, the three-forcing LSTM was better than the best single-forcing model in 351 of 531 basins (66 %) and was better than each single-forcing model in 443 (83 %; Daymet), 456 (86 %; Maurer), and 472 (89 %; NLDAS) basins, respectively.</p></caption>
            <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://hess.copernicus.org/articles/25/2685/2021/hess-25-2685-2021-f05.png"/>

          </fig>

      <?xmltex \floatpos{t}?><fig id="Ch1.F6"><?xmltex \currentcnt{6}?><?xmltex \def\figurename{Figure}?><label>Figure 6</label><caption><p id="d1e2012">Time- and basin-averaged integrated gradients of one of the <inline-formula><mml:math id="M74" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula> multi-forcing LSTMs as a function of lag time (days before current streamflow  prediction) of the three precipitation products. Because of the time shift shown in Fig. <xref ref-type="fig" rid="Ch1.F2"/>, the model learned to ignore the Maurer input at the current time step.</p></caption>
            <?xmltex \igopts{width=236.157874pt}?><graphic xlink:href="https://hess.copernicus.org/articles/25/2685/2021/hess-25-2685-2021-f06.png"/>

          </fig>

      <p id="d1e2035"><xref ref-type="bibr" rid="bib1.bibx48" id="text.59"/> proposed a method of input attribution for neural networks which accounts for this lack of local sensitivity. This method is called integrated gradients. Integrated gradients are a path integral of the gradients from some baseline input value, <inline-formula><mml:math id="M75" display="inline"><mml:mrow><mml:msup><mml:mi>x</mml:mi><mml:mo>′</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula>, to the actual value of the input, <inline-formula><mml:math id="M76" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula>, as follows:
              <disp-formula id="Ch1.E1" content-type="numbered"><label>1</label><mml:math id="M77" display="block"><mml:mtable rowspacing="0.2ex" columnspacing="1em" class="split" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:msubsup><mml:mtext>IntegratedGrads</mml:mtext><mml:mi>i</mml:mi><mml:mi mathvariant="normal">approx</mml:mi></mml:msubsup><mml:mo>(</mml:mo><mml:mi mathvariant="bold">x</mml:mi><mml:mo>)</mml:mo><mml:mo>:</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msub><mml:mi mathvariant="bold">x</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:msup><mml:mi mathvariant="bold">x</mml:mi><mml:mo>′</mml:mo></mml:msup><mml:mi>i</mml:mi></mml:msub></mml:mrow><mml:mi>m</mml:mi></mml:mfrac></mml:mstyle><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>m</mml:mi></mml:munderover><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mo>∂</mml:mo><mml:mi>F</mml:mi><mml:mfenced close=")" open="("><mml:mover accent="true"><mml:mi mathvariant="bold">x</mml:mi><mml:mo mathvariant="normal" stretchy="false">̃</mml:mo></mml:mover></mml:mfenced></mml:mrow><mml:mrow><mml:mo>∂</mml:mo><mml:msub><mml:mover accent="true"><mml:mi mathvariant="bold">x</mml:mi><mml:mo stretchy="false" mathvariant="normal">̃</mml:mo></mml:mover><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mstyle><mml:msub><mml:mo mathsize="2.0em">|</mml:mo><mml:mrow><mml:mover accent="true"><mml:mi mathvariant="bold">x</mml:mi><mml:mo mathvariant="normal" stretchy="false">̃</mml:mo></mml:mover><mml:mo>=</mml:mo><mml:msup><mml:mi mathvariant="bold">x</mml:mi><mml:mo>′</mml:mo></mml:msup><mml:mo>+</mml:mo><mml:mstyle scriptlevel="+1"><mml:mfrac><mml:mi>k</mml:mi><mml:mi>m</mml:mi></mml:mfrac></mml:mstyle><mml:mo>(</mml:mo><mml:mi mathvariant="bold">x</mml:mi><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold">x</mml:mi><mml:mo>′</mml:mo></mml:msup><mml:mo>)</mml:mo></mml:mrow></mml:msub><mml:mo>.</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula></p>
      <p id="d1e2185">We used a value of zero precipitation everywhere
as the baseline for calculating integrated gradients with respect to the three different precipitation forcings (Daymet, Maurer, and NLDAS).
We calculated the integrated gradients of each daily streamflow estimate in each CAMELS basin during the 10-year test period with respect to precipitation input from the past 365 d (the look-back period of the LSTM). That is, on day <inline-formula><mml:math id="M78" display="inline"><mml:mrow><mml:mi>t</mml:mi><mml:mo>=</mml:mo><mml:mi>T</mml:mi></mml:mrow></mml:math></inline-formula>, we calculated <inline-formula><mml:math id="M79" display="inline"><mml:mrow><mml:mn mathvariant="normal">1095</mml:mn><mml:mo>=</mml:mo><mml:mn mathvariant="normal">3</mml:mn><mml:mo>⋅</mml:mo><mml:mn mathvariant="normal">365</mml:mn></mml:mrow></mml:math></inline-formula> integrated gradient values related to the three precipitation products. The relative integrated gradient values quantify how the LSTM combines precipitation products over time, over space, and<?pagebreak page2690?> also as a function of lag or lead time into the current streamflow prediction.
In theory, one has to take the “explaining away” effect into account when analyzing the decision process in models <xref ref-type="bibr" rid="bib1.bibx42 bib1.bibx57" id="paren.60"/>. However, we assume that, if evaluated over hundreds of basins and thousands of time steps, this effect is largely averaged out, and therefore, the analysis provides an indication of the actual information used by the model.</p>
</sec>
</sec>
</sec>
<sec id="Ch1.S3">
  <label>3</label><title>Results and discussion</title>
<sec id="Ch1.S3.SS1">
  <label>3.1</label><title>Analysis 1 – feature ablation</title>
      <p id="d1e2236">The feature ablation analysis compared NSE values over 10-year test periods from the CAMELS basins for the seven distinct input combinations. As shown in Fig. <xref ref-type="fig" rid="Ch1.F3"/>, the three-forcing LSTM ensemble had a median NSE value of 0.82<?pagebreak page2691?> for the 531 basins. The three-forcing model outperformed all two-forcing models. Similarly, all two-forcing models outperformed all single-forcing models (all improvements were statistically significant at <inline-formula><mml:math id="M80" display="inline"><mml:mrow><mml:mi mathvariant="italic">α</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.05</mml:mn></mml:mrow></mml:math></inline-formula> when using the Wilcoxon test). The best single-forcing LSTM had a median NSE of 0.77. This indicates that the LSTM was able to leverage unique information in the precipitation signals <xref ref-type="bibr" rid="bib1.bibx49" id="paren.61"><named-content content-type="pre">this is not an unusual finding in the context of machine learning; see, e.g.,</named-content></xref>. We also note that the single-forcing LSTM with Maurer input outperformed the single-forcing NLDAS model, which agrees with the results of <xref ref-type="bibr" rid="bib1.bibx6" id="text.62"/>, who showed that Maurer precipitation was generally more accurate than NLDAS precipitation.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F7"><?xmltex \currentcnt{7}?><?xmltex \def\figurename{Figure}?><label>Figure 7</label><caption><p id="d1e2263">Expansion of Fig. <xref ref-type="fig" rid="Ch1.F6"/> by individual basins and truncated at a lag of <inline-formula><mml:math id="M81" display="inline"><mml:mrow><mml:mi>s</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:math></inline-formula>. The relative importance of Daymet is shown in blue, Maurer in orange, and NLDAS in blue. The multi-forcing LSTM combined the precipitation products in different ways in different basins. Daymet is generally more important in high-number basins located in the Pacific Northwest.</p></caption>
          <?xmltex \igopts{width=227.622047pt}?><graphic xlink:href="https://hess.copernicus.org/articles/25/2685/2021/hess-25-2685-2021-f07.png"/>

        </fig>

      <p id="d1e2286">To put these results into context, Fig. <xref ref-type="fig" rid="Ch1.F4"/> compares all LSTMs against benchmark hydrology models, which are all ensembles of SAC-SMA models that were calibrated for each of the three different forcings. All LSTM models were better than all corresponding benchmark models through the entire cumulative distribution function (CDF) curve. The following points can be seen in Fig. <xref ref-type="fig" rid="Ch1.F4"/>.
First, the SAC-SMA sees a large improvement from using two-forcing products ensembles; this improvement was larger than the corresponding improvement in the LSTMs. However, adding calibrated SAC-SMA models from a third data product did not increase the performance by much (see, e.g., Fig. <xref ref-type="fig" rid="Ch1.F4"/>a, where the NLDAS and Daymet ensemble CDF overlaps, most of the time, with the three-forcing ensemble). In contrast, CDFs of the LSTM results show a constant improvement from one- to two-forcing models and from two- to three-forcing models.</p>
      <p id="d1e2296">Second, the difference between the worst single-forcing ensemble and the three-forcing ensemble is larger for the LSTM (<inline-formula><mml:math id="M82" display="inline"><mml:mi mathvariant="normal">Δ</mml:mi></mml:math></inline-formula>NSE <inline-formula><mml:math id="M83" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 0.074) than for the SAC-SMA (<inline-formula><mml:math id="M84" display="inline"><mml:mi mathvariant="normal">Δ</mml:mi></mml:math></inline-formula>NSE <inline-formula><mml:math id="M85" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 0.068). This difference could arise from the fact that the LSTM is better able to handle the data shift of the Maurer forcings that occurs in some of the basins (see Sect. <xref ref-type="sec" rid="Ch1.S3.SS2"/>), while this is impossible for the SAC-SMA ensemble.</p>

      <?xmltex \floatpos{p}?><fig id="Ch1.F8" specific-use="star"><?xmltex \currentcnt{8}?><?xmltex \def\figurename{Figure}?><label>Figure 8</label><caption><p id="d1e2331">The forcing product, with the highest overall contribution (sensitivity) in each basin <bold>(a)</bold>, averaged over the prediction time step and lag. The alpha value (opacity) of each dot on this map is a relative measure of the fraction of the total integrated gradients of all three precipitation products (summed over time, lag, and product) due to the highest-contributing product. Panel <bold>(b)</bold> shows that the total integrated gradient summed over all three precipitation products is highly correlated with total precipitation in the basin.</p></caption>
          <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://hess.copernicus.org/articles/25/2685/2021/hess-25-2685-2021-f08.png"/>

        </fig>

      <?xmltex \floatpos{p}?><fig id="Ch1.F9" specific-use="star"><?xmltex \currentcnt{9}?><?xmltex \def\figurename{Figure}?><label>Figure 9</label><caption><p id="d1e2348">Spatial distribution of the highest-ranked precipitation products at specific lags (different rows) over the whole hydrograph (left-hand column), and the rising and falling limbs of the hydrograph (center and right-hand columns, respectively), where blue circles denote Daymet, orange circles denote Maurer and green circles denote NLDAS. The takeaway from this figure is that the multi-forcing LSTM learns to combine the different products in different ways for different memory timescales in different basins and under different hydrological conditions. The alpha value (opacity) of each dot is a relative measure of the fraction of the total integrated gradients of all three precipitation products due to the highest-contributing product.</p></caption>
          <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://hess.copernicus.org/articles/25/2685/2021/hess-25-2685-2021-f09.png"/>

        </fig>

      <p id="d1e2357">Third, the worst-performing single-forcing LSTM ensemble (i.e., with NLDAS forcings) was significantly better (<inline-formula><mml:math id="M86" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">13</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>) than the whole <inline-formula><mml:math id="M87" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">30</mml:mn></mml:mrow></mml:math></inline-formula> SAC-SMA ensemble, which uses all three forcing products (i.e., the best SAC-SMA result that we found). In fact, even the average single LSTM (not the full <inline-formula><mml:math id="M88" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula> ensemble) trained with NLDAS forcings is as good as the <inline-formula><mml:math id="M89" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">30</mml:mn></mml:mrow></mml:math></inline-formula> SAC-SMA ensemble (see Appendix <xref ref-type="sec" rid="App1.Ch1.S1"/> for non-ensemble LSTM performances), and the average single LSTM (not the ensemble) trained with Maurer or Daymet forcings was significantly better (<inline-formula><mml:math id="M90" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">8</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>) than the <inline-formula><mml:math id="M91" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">30</mml:mn></mml:mrow></mml:math></inline-formula> SAC-SMA ensemble.</p>
      <?pagebreak page2692?><p id="d1e2455">Fourth, the ranking of the forcing products is not as clear for the SAC-SMA ensembles as it was the LSTM ensembles (there is more separation in the LSTM single-forcing CDFs than the SAC-SMA single-forcing CDFs). However, qualitatively, the same ranking is visible, i.e., that Daymet models are better than NLDAS or Maurer and that NLDAS and Daymet produce the best two-forcing results.</p>
      <p id="d1e2459">Tables <xref ref-type="table" rid="Ch1.T2"/> and <xref ref-type="table" rid="Ch1.T3"/> give benchmarking results from all metrics and signatures in Table <xref ref-type="table" rid="Ch1.T1"/>. The three-forcing LSTM significantly out-performed the three-forcing SAC-SMA ensemble in all metrics except <inline-formula><mml:math id="M92" display="inline"><mml:mi mathvariant="italic">β</mml:mi></mml:math></inline-formula>–NSE decomposition, where the SAC-SMA ensemble was better, and FLV, where the difference was not significant (see Table <xref ref-type="table" rid="Ch1.T2"/>). The three-forcing LSTM also significantly out-performed the three forcing SAC-SMA ensemble in all signatures (see Table <xref ref-type="table" rid="Ch1.T3"/>) except the HFD mean and the Q95, where the difference was not significant. Note that the LSTM – while generally providing the best model overall – has approximation difficulties towards the extreme lower end of the runoff distribution (low flow duration, low flow frequency, and zero flow frequency).</p>
      <p id="d1e2480">Figure <xref ref-type="fig" rid="Ch1.F5"/> shows the spatial distribution of the performance differences between the best single-forcing model and the three-forcing model in all basins. The three-forcing LSTM outperformed the single forcing LSTMs almost everywhere. Individual exceptions, where less is more do, however, exist (e.g., southern California). Concretely, if we compare the three-forcing model to the best performing single-forcing LSTM per basin, the three-forcing LSTM had a higher NSE in 66 % of the basins (351 of 531). When compared to each single-forcing LSTM separately, the three-forcing LSTM had a higher NSE in 443 (83 %; Daymet), 456 (86 %; Maurer), and 472 (89 %; NLDAS) basins, respectively.</p>
</sec>
<sec id="Ch1.S3.SS2">
  <label>3.2</label><title>Analysis 2 – sensitivity and contribution</title>
      <p id="d1e2493">Figure <xref ref-type="fig" rid="Ch1.F6"/> shows the time- and basin-averaged integrated gradient of one of the <inline-formula><mml:math id="M93" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula> multi-forcing LSTMs as a function of lead time. To reiterate the information above, the integrated gradient is a measure of input attribution or sensitivity, such that input with higher integrated gradients have a larger influence on model outputs. Integrated gradients shown in Fig. <xref ref-type="fig" rid="Ch1.F6"/> were averaged over all time steps in the test period and also over all basins. This figure shows the sensitivity of streamflow at time <inline-formula><mml:math id="M94" display="inline"><mml:mrow><mml:mi>t</mml:mi><mml:mo>=</mml:mo><mml:mi>T</mml:mi></mml:mrow></mml:math></inline-formula> to each of the three precipitation input at times <inline-formula><mml:math id="M95" display="inline"><mml:mrow><mml:mi>t</mml:mi><mml:mo>=</mml:mo><mml:mi>T</mml:mi><mml:mo>-</mml:mo><mml:mi>s</mml:mi></mml:mrow></mml:math></inline-formula>, where <inline-formula><mml:math id="M96" display="inline"><mml:mi>s</mml:mi></mml:math></inline-formula> is the lag value on the <inline-formula><mml:math id="M97" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula> axis. The main takeaways from this high-level illustration of input sensitivities are (1) that the sensitivity of current streamflow to precipitation decays with lead time (i.e., time before present) and (2) that the multi-forcing model has learned to ignore the Maurer input at the present time step. The reason for the latter is the time shift in the Maurer product (illustrated in Fig. <xref ref-type="fig" rid="Ch1.F2"/>).</p>
      <p id="d1e2557">Figure <xref ref-type="fig" rid="Ch1.F6"/> shows results from only one of <inline-formula><mml:math id="M98" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula> model repetitions; however, we performed an integrated gradient analysis on all <inline-formula><mml:math id="M99" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula> multi-input LSTMs (not shown), and the results were qualitatively similar. It is difficult to show all the results on the same figure because the values are relative; so, integrated gradients between two different models often have different absolute scales, and the results presented for a single model (in Fig. <xref ref-type="fig" rid="Ch1.F6"/>) are representative.</p>
      <p id="d1e2588">The multi-forcing LSTMs learned to combine the different precipitation products in spatiotemporally variable ways. Figure <xref ref-type="fig" rid="Ch1.F6"/> demonstrates the overall behavior of the multi-forcing LSTM. It is, however a highly condensed aggregate of a highly nonlinear system. As such, a lot of specific information is lost in that figure.</p>
      <?pagebreak page2694?><p id="d1e2593">Figure <xref ref-type="fig" rid="Ch1.F7"/> shows integrated gradients by basin and up to a lead time of <inline-formula><mml:math id="M100" display="inline"><mml:mrow><mml:mi>s</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:math></inline-formula> d prior to the present. The model largely ignores Maurer precipitation at the current time step in most basins (as was apparent in Fig. <xref ref-type="fig" rid="Ch1.F6"/>), but the ratio of the contributions of each product (averaged over the whole test period hydrograph) varies between basins. Figure <xref ref-type="fig" rid="Ch1.F7"/> shows the relative contributions of each precipitation product, but it is important to note that the overall importance of precipitation also varies between basin.</p>
      <p id="d1e2615">Figure <xref ref-type="fig" rid="Ch1.F8"/> shows the spatial distribution of the most sensitive precipitation contribution (averaged over the whole hydrograph in each basin) in Fig. <xref ref-type="fig" rid="Ch1.F8"/>a and the overall sensitivity to all three precipitation products combined in  Fig. <xref ref-type="fig" rid="Ch1.F8"/>b. The latter (total sensitivity to precipitation relative to all other input) is highly correlated with the total (or average) precipitation in the basin.</p>
      <p id="d1e2624">It is possible to break the spatial relationship down even further. The spatial distribution of the highest-ranked product as a function of the lag time for rising and falling limits is shown in Fig. <xref ref-type="fig" rid="Ch1.F9"/>. This figure shows some of the nuance in how the multi-forcing LSTM learned to combine the different precipitation products by distinguishing between different memory timescales in different basins for different hydrological conditions (i.e., rising and falling limbs of the hydrograph).</p>
</sec>
</sec>
<sec id="Ch1.S4" sec-type="conclusions">
  <label>4</label><title>Conclusions</title>
      <p id="d1e2639">The purpose of this paper is to show that LSTMs can leverage different precipitation products in spatiotemporally dynamic ways to improve streamflow simulations. These experiments show that there exist systematic and location- and time-specific differences between different precipitation products that can be learned and leveraged by deep learning. As might be expected, the LSTMs tested here tended to improve hydrological simulations more when there were larger disagreements between different precipitation estimates in a given basin (see Appendix <xref ref-type="sec" rid="App1.Ch1.S3"/>).</p>
      <p id="d1e2644">It is worth comparing these findings with classical conceptual and process-based hydrological models that treat precipitation estimate as a unique input. Current best practice for using multiple precipitation products is to run an ensemble of hydrological models, such that each forcing data set is treated independently. Deep learning models have the ability to use a larger number and variety of input than classical hydrology models, and in fact, DL models do not need input that represent any given hydrological variable or process and, therefore, have the potential to use less highly processed input data like remote sensing brightness temperatures, etc. Future work might focus on building runoff models that take as input the raw measurements that were used to create standard precipitation data products.</p>
      <p id="d1e2647"><?xmltex \hack{\newpage}?>Deep learning provides possibilities not only for improving the quality of regional <xref ref-type="bibr" rid="bib1.bibx26" id="paren.63"/> and even ungauged <xref ref-type="bibr" rid="bib1.bibx25" id="paren.64"/> simulations but also, potentially, for replacing large portions of ensemble-based strategies for uncertainty quantification <xref ref-type="bibr" rid="bib1.bibx7" id="paren.65"><named-content content-type="pre">e.g.,</named-content></xref> with multi-input models. There are many ways to deal with the uncertainty in traditional hydrological modeling workflows, but almost certainly, the most common approach is to use ensembles. Ensembles can be opportunistic – i.e., from a set of pre-existing models or data products – or constructed – i.e., sampled from a probability distribution – but in either case, the idea is to use variability to represent lack of perfect information. <xref ref-type="bibr" rid="bib1.bibx7" id="text.66"/> advocated using ensembles as hydrologic storylines, which would avoid the problem of the sparsity of sampling any explicit or implied probability distributions. No matter how ensembles are used, however, with conceptual and process-based hydrology models, each model takes one precipitation estimate (time series) as input. Multi-input DL models have the potential to provide a fundamentally different alternative for modeling under this kind of uncertainty, since DL models can learn how to combine different input in ways that leverage – in nonlinear ways – all data available to the full simulation task. Future work could focus on producing predictive probabilities with multi-input deep learning models.</p><?xmltex \hack{\clearpage}?>
</sec>

      
      </body>
    <back><app-group>

<?pagebreak page2695?><app id="App1.Ch1.S1">
  <?xmltex \currentcnt{A}?><label>Appendix A</label><title>Average LSTM single-model performance</title>

<?xmltex \floatpos{h!}?><table-wrap id="App1.Ch1.S1.T4"><?xmltex \hack{\hsize\textwidth}?><?xmltex \currentcnt{A1}?><label>Table A1</label><caption><p id="d1e2681">Average single LSTM performance over a variety of metrics. The average single-model performances is computed as the mean of the metric of the <inline-formula><mml:math id="M101" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula> model repetitions.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="8">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:colspec colnum="6" colname="col6" align="right"/>
     <oasis:colspec colnum="7" colname="col7" align="right"/>
     <oasis:colspec colnum="8" colname="col8" align="right"/>
     <oasis:thead>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">NLDAS</oasis:entry>
         <oasis:entry colname="col3">Maurer</oasis:entry>
         <oasis:entry colname="col4">Daymet</oasis:entry>
         <oasis:entry colname="col5">Maurer <inline-formula><mml:math id="M122" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col6">NLDAS <inline-formula><mml:math id="M123" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col7">NLDAS <inline-formula><mml:math id="M124" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col8">All</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5">Daymet</oasis:entry>
         <oasis:entry colname="col6">Maurer</oasis:entry>
         <oasis:entry colname="col7">Daymet</oasis:entry>
         <oasis:entry colname="col8">combined</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">NSE<inline-formula><mml:math id="M125" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">a</mml:mi></mml:msup></mml:math></inline-formula> (median)</oasis:entry>
         <oasis:entry colname="col2">0.72</oasis:entry>
         <oasis:entry colname="col3">0.73</oasis:entry>
         <oasis:entry colname="col4">0.74</oasis:entry>
         <oasis:entry colname="col5">0.77</oasis:entry>
         <oasis:entry colname="col6">0.77</oasis:entry>
         <oasis:entry colname="col7">0.79</oasis:entry>
         <oasis:entry colname="col8">0.80</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M126" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.003</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M127" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.003</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M128" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.002</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M129" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.003</oasis:entry>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M130" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.004</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M131" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.002</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M132" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.001</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">NSE<inline-formula><mml:math id="M133" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">a</mml:mi></mml:msup></mml:math></inline-formula> (mean)</oasis:entry>
         <oasis:entry colname="col2">0.68</oasis:entry>
         <oasis:entry colname="col3">0.70</oasis:entry>
         <oasis:entry colname="col4">0.70</oasis:entry>
         <oasis:entry colname="col5">0.73</oasis:entry>
         <oasis:entry colname="col6">0.74</oasis:entry>
         <oasis:entry colname="col7">0.75</oasis:entry>
         <oasis:entry colname="col8">0.76</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M134" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.003</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M135" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.006</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M136" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.002</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M137" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.003</oasis:entry>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M138" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.002</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M139" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.002</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M140" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.002</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">KGE<inline-formula><mml:math id="M141" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">b</mml:mi></mml:msup></mml:math></inline-formula> (median)</oasis:entry>
         <oasis:entry colname="col2">0.74</oasis:entry>
         <oasis:entry colname="col3">0.76</oasis:entry>
         <oasis:entry colname="col4">0.76</oasis:entry>
         <oasis:entry colname="col5">0.79</oasis:entry>
         <oasis:entry colname="col6">0.78</oasis:entry>
         <oasis:entry colname="col7">0.79</oasis:entry>
         <oasis:entry colname="col8">0.80</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M142" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.006</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M143" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.005</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M144" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.003</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M145" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.005</oasis:entry>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M146" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.008</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M147" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.005</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M148" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.004</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Pearson <inline-formula><mml:math id="M149" display="inline"><mml:mi>r</mml:mi></mml:math></inline-formula><inline-formula><mml:math id="M150" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">c</mml:mi></mml:msup></mml:math></inline-formula> (median)</oasis:entry>
         <oasis:entry colname="col2">0.86</oasis:entry>
         <oasis:entry colname="col3">0.87</oasis:entry>
         <oasis:entry colname="col4">0.88</oasis:entry>
         <oasis:entry colname="col5">0.89</oasis:entry>
         <oasis:entry colname="col6">0.89</oasis:entry>
         <oasis:entry colname="col7">0.90</oasis:entry>
         <oasis:entry colname="col8">0.90</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M151" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.002</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M152" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.002</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M153" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.002</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M154" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.001</oasis:entry>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M155" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> 0.001</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M156" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.001</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M157" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.001</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><inline-formula><mml:math id="M158" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula>–NSE<inline-formula><mml:math id="M159" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">d</mml:mi></mml:msup></mml:math></inline-formula> (median)</oasis:entry>
         <oasis:entry colname="col2">0.83</oasis:entry>
         <oasis:entry colname="col3">0.86</oasis:entry>
         <oasis:entry colname="col4">0.86</oasis:entry>
         <oasis:entry colname="col5">0.88</oasis:entry>
         <oasis:entry colname="col6">0.85</oasis:entry>
         <oasis:entry colname="col7">0.87</oasis:entry>
         <oasis:entry colname="col8">0.88</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M160" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.010</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M161" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.011</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M162" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.008</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M163" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.007</oasis:entry>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M164" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.007</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M165" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.005</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M166" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.008</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><inline-formula><mml:math id="M167" display="inline"><mml:mi mathvariant="italic">β</mml:mi></mml:math></inline-formula>–NSE<inline-formula><mml:math id="M168" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">e</mml:mi></mml:msup></mml:math></inline-formula> (median)</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M169" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.03</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M170" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.03</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M171" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.03</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M172" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.03</oasis:entry>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M173" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.03</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M174" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.03</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M175" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.02</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M176" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.005</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M177" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.004</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M178" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.004</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M179" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> 0.004</oasis:entry>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M180" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>  0.004</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M181" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.002</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M182" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.004</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">FHV<inline-formula><mml:math id="M183" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">f</mml:mi></mml:msup></mml:math></inline-formula> (median)</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M184" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>17.28</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M185" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>13.89</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M186" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>15.00</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M187" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>12.52</oasis:entry>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M188" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>14.20</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M189" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>13.15</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M190" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>11.91</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M191" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.904</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M192" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>1.217</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M193" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.504</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M194" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> 0.791</oasis:entry>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M195" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> 0.881</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M196" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.450</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M197" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.549</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">FLV<inline-formula><mml:math id="M198" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">g</mml:mi></mml:msup></mml:math></inline-formula> (median)</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M199" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.88</oasis:entry>
         <oasis:entry colname="col3">2.83</oasis:entry>
         <oasis:entry colname="col4">0.05</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M200" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>4.02</oasis:entry>
         <oasis:entry colname="col6">0.86</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M201" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>1.54</oasis:entry>
         <oasis:entry colname="col8">2.57</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M202" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> 7.637</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M203" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>5.403</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M204" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>6.056</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M205" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> 6.825</oasis:entry>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M206" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>5.499</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M207" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>6.955</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M208" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>4.072</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">FMS<inline-formula><mml:math id="M209" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">h</mml:mi></mml:msup></mml:math></inline-formula> (median)</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M210" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>9.44</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M211" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>7.31</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M212" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>5.96</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M213" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>5.60</oasis:entry>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M214" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>7.55</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M215" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>6.93</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M216" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>6.69</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M217" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>1.293</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M218" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>1.500</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M219" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> 1.234</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M220" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>1.241</oasis:entry>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M221" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>1.358</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M222" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> 0.911</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M223" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> 1.678</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Peak timing<inline-formula><mml:math id="M224" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">i</mml:mi></mml:msup></mml:math></inline-formula> (median)</oasis:entry>
         <oasis:entry colname="col2">0.46</oasis:entry>
         <oasis:entry colname="col3">0.49</oasis:entry>
         <oasis:entry colname="col4">0.46</oasis:entry>
         <oasis:entry colname="col5">0.44</oasis:entry>
         <oasis:entry colname="col6">0.42</oasis:entry>
         <oasis:entry colname="col7">0.41</oasis:entry>
         <oasis:entry colname="col8">0.41</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M225" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> 0.010</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M226" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.009</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M227" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.008</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M228" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.007</oasis:entry>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M229" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.007</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M230" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.009</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M231" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.015</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table><table-wrap-foot><p id="d1e2696"><inline-formula><mml:math id="M102" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">a</mml:mi></mml:msup></mml:math></inline-formula> Nash–Sutcliffe efficiency; <inline-formula><mml:math id="M103" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mo>-</mml:mo><mml:mi mathvariant="normal">∞</mml:mi><mml:mo>,</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula>; values closer to 1 are desirable.
<inline-formula><mml:math id="M104" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">b</mml:mi></mml:msup></mml:math></inline-formula> Kling–Gupta efficiency; <inline-formula><mml:math id="M105" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mo>-</mml:mo><mml:mi mathvariant="normal">∞</mml:mi><mml:mo>,</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula>; values closer to 1 are desirable.
<inline-formula><mml:math id="M106" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">c</mml:mi></mml:msup></mml:math></inline-formula> Pearson correlation; <inline-formula><mml:math id="M107" display="inline"><mml:mrow><mml:mo>[</mml:mo><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula>; values closer to 1 are desirable.
<inline-formula><mml:math id="M108" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">d</mml:mi></mml:msup></mml:math></inline-formula> <inline-formula><mml:math id="M109" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula>–NSE decomposition; <inline-formula><mml:math id="M110" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo><mml:mi mathvariant="normal">∞</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>; values close to 1 are desirable.
<inline-formula><mml:math id="M111" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">e</mml:mi></mml:msup></mml:math></inline-formula> <inline-formula><mml:math id="M112" display="inline"><mml:mi mathvariant="italic">β</mml:mi></mml:math></inline-formula>–NSE decomposition; <inline-formula><mml:math id="M113" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mo>-</mml:mo><mml:mi mathvariant="normal">∞</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">∞</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>; values close to 0 are desirable.
<inline-formula><mml:math id="M114" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">f</mml:mi></mml:msup></mml:math></inline-formula> Top 2 % peak flow bias; <inline-formula><mml:math id="M115" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mo>-</mml:mo><mml:mi mathvariant="normal">∞</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">∞</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>; values close to 0 are desirable.
<inline-formula><mml:math id="M116" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">g</mml:mi></mml:msup></mml:math></inline-formula> 30 % low flow bias; <inline-formula><mml:math id="M117" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mo>-</mml:mo><mml:mi mathvariant="normal">∞</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">∞</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>; values close to 0 are desirable.
<inline-formula><mml:math id="M118" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">h</mml:mi></mml:msup></mml:math></inline-formula> Bias of FDC mid-segment slope; <inline-formula><mml:math id="M119" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mo>-</mml:mo><mml:mi mathvariant="normal">∞</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">∞</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>; values close to 0 are desirable.
<inline-formula><mml:math id="M120" display="inline"><mml:msup><mml:mi/><mml:mi mathvariant="normal">i</mml:mi></mml:msup></mml:math></inline-formula> Lag of peak timing; <inline-formula><mml:math id="M121" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mo>-</mml:mo><mml:mi mathvariant="normal">∞</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">∞</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>; values close to 0 are desirable.
</p></table-wrap-foot></table-wrap>

<?xmltex \hack{\clearpage}?><?xmltex \floatpos{h!}?><table-wrap id="App1.Ch1.S1.T5"><?xmltex \hack{\hsize\textwidth}?><?xmltex \currentcnt{A2}?><label>Table A2</label><caption><p id="d1e4283">Average single LSTM performance across a range of different hydrological signatures. The derived metric for each signature is the Pearson correlation between the signature derived from the observed discharge vs. the signature derived from the simulated discharge. The average single-model performances are then reported as the mean value of the <inline-formula><mml:math id="M232" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula> model repetitions.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="8">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:colspec colnum="6" colname="col6" align="right"/>
     <oasis:colspec colnum="7" colname="col7" align="right"/>
     <oasis:colspec colnum="8" colname="col8" align="right"/>
     <oasis:thead>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">NLDAS</oasis:entry>
         <oasis:entry colname="col3">Maurer</oasis:entry>
         <oasis:entry colname="col4">Daymet</oasis:entry>
         <oasis:entry colname="col5">Maurer <inline-formula><mml:math id="M233" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col6">NLDAS <inline-formula><mml:math id="M234" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col7">NLDAS <inline-formula><mml:math id="M235" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col8">All</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5">Daymet</oasis:entry>
         <oasis:entry colname="col6">Maurer</oasis:entry>
         <oasis:entry colname="col7">Daymet</oasis:entry>
         <oasis:entry colname="col8">combined</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">Baseflow index</oasis:entry>
         <oasis:entry colname="col2">0.93</oasis:entry>
         <oasis:entry colname="col3">0.92</oasis:entry>
         <oasis:entry colname="col4">0.93</oasis:entry>
         <oasis:entry colname="col5">0.94</oasis:entry>
         <oasis:entry colname="col6">0.93</oasis:entry>
         <oasis:entry colname="col7">0.93</oasis:entry>
         <oasis:entry colname="col8">0.92</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M236" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.014</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M237" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.018</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M238" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.011</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M239" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.005</oasis:entry>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M240" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.013</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M241" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.009</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M242" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.018</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">HFD mean</oasis:entry>
         <oasis:entry colname="col2">0.95</oasis:entry>
         <oasis:entry colname="col3">0.97</oasis:entry>
         <oasis:entry colname="col4">0.97</oasis:entry>
         <oasis:entry colname="col5">0.97</oasis:entry>
         <oasis:entry colname="col6">0.97</oasis:entry>
         <oasis:entry colname="col7">0.97</oasis:entry>
         <oasis:entry colname="col8">0.97</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M243" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.004</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M244" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.003</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M245" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.002</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M246" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.002</oasis:entry>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M247" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.003</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M248" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.003</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M249" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.004</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">High flow duration</oasis:entry>
         <oasis:entry colname="col2">0.82</oasis:entry>
         <oasis:entry colname="col3">0.85</oasis:entry>
         <oasis:entry colname="col4">0.83</oasis:entry>
         <oasis:entry colname="col5">0.86</oasis:entry>
         <oasis:entry colname="col6">0.85</oasis:entry>
         <oasis:entry colname="col7">0.85</oasis:entry>
         <oasis:entry colname="col8">0.85</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M250" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.027</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M251" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.014</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M252" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.010</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M253" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.014</oasis:entry>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M254" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.014</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M255" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.008</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M256" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> 0.014</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">High flow frequency</oasis:entry>
         <oasis:entry colname="col2">0.82</oasis:entry>
         <oasis:entry colname="col3">0.82</oasis:entry>
         <oasis:entry colname="col4">0.82</oasis:entry>
         <oasis:entry colname="col5">0.82</oasis:entry>
         <oasis:entry colname="col6">0.81</oasis:entry>
         <oasis:entry colname="col7">0.81</oasis:entry>
         <oasis:entry colname="col8">0.79</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M257" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.013</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M258" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.014</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M259" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.016</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M260" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.016</oasis:entry>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M261" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.040</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M262" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.032</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M263" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.037</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Low flow duration</oasis:entry>
         <oasis:entry colname="col2">0.44</oasis:entry>
         <oasis:entry colname="col3">0.42</oasis:entry>
         <oasis:entry colname="col4">0.46</oasis:entry>
         <oasis:entry colname="col5">0.47</oasis:entry>
         <oasis:entry colname="col6">0.43</oasis:entry>
         <oasis:entry colname="col7">0.46</oasis:entry>
         <oasis:entry colname="col8">0.45</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M264" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.033</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M265" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.027</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M266" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.025</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M267" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.035</oasis:entry>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M268" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.018</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M269" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.015</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M270" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.039</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Low flow frequency</oasis:entry>
         <oasis:entry colname="col2">0.83</oasis:entry>
         <oasis:entry colname="col3">0.82</oasis:entry>
         <oasis:entry colname="col4">0.84</oasis:entry>
         <oasis:entry colname="col5">0.86</oasis:entry>
         <oasis:entry colname="col6">0.82</oasis:entry>
         <oasis:entry colname="col7">0.84</oasis:entry>
         <oasis:entry colname="col8">0.83</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M271" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.020</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M272" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.044</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M273" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.028</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M274" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.022</oasis:entry>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M275" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.027</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M276" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.021</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M277" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.043</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><inline-formula><mml:math id="M278" display="inline"><mml:mrow><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">5</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2">0.95</oasis:entry>
         <oasis:entry colname="col3">0.95</oasis:entry>
         <oasis:entry colname="col4">0.96</oasis:entry>
         <oasis:entry colname="col5">0.96</oasis:entry>
         <oasis:entry colname="col6">0.95</oasis:entry>
         <oasis:entry colname="col7">0.96</oasis:entry>
         <oasis:entry colname="col8">0.96</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M279" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.005</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M280" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.006</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M281" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.003</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M282" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.003</oasis:entry>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M283" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> 0.005</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M284" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.005</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M285" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.003</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><inline-formula><mml:math id="M286" display="inline"><mml:mrow><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">95</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2">0.99</oasis:entry>
         <oasis:entry colname="col3">0.99</oasis:entry>
         <oasis:entry colname="col4">0.98</oasis:entry>
         <oasis:entry colname="col5">0.99</oasis:entry>
         <oasis:entry colname="col6">0.99</oasis:entry>
         <oasis:entry colname="col7">0.99</oasis:entry>
         <oasis:entry colname="col8">0.99</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M287" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.001</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M288" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.001</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M289" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.001</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M290" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.001</oasis:entry>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M291" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.000</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M292" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.001</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M293" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.000</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><inline-formula><mml:math id="M294" display="inline"><mml:mi>Q</mml:mi></mml:math></inline-formula> mean</oasis:entry>
         <oasis:entry colname="col2">0.99</oasis:entry>
         <oasis:entry colname="col3">1.00</oasis:entry>
         <oasis:entry colname="col4">0.99</oasis:entry>
         <oasis:entry colname="col5">0.99</oasis:entry>
         <oasis:entry colname="col6">1.00</oasis:entry>
         <oasis:entry colname="col7">0.99</oasis:entry>
         <oasis:entry colname="col8">1.00</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M295" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.001</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M296" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.000</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M297" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.001</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M298" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.000</oasis:entry>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M299" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.000</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M300" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.000</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M301" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.000</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Runoff ratio</oasis:entry>
         <oasis:entry colname="col2">0.98</oasis:entry>
         <oasis:entry colname="col3">0.98</oasis:entry>
         <oasis:entry colname="col4">0.98</oasis:entry>
         <oasis:entry colname="col5">0.98</oasis:entry>
         <oasis:entry colname="col6">0.98</oasis:entry>
         <oasis:entry colname="col7">0.98</oasis:entry>
         <oasis:entry colname="col8">0.99</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M302" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.002</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M303" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.001</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M304" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.001</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M305" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.001</oasis:entry>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M306" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.001</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M307" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.001</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M308" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> 0.001</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Slope FDC</oasis:entry>
         <oasis:entry colname="col2">0.62</oasis:entry>
         <oasis:entry colname="col3">0.63</oasis:entry>
         <oasis:entry colname="col4">0.59</oasis:entry>
         <oasis:entry colname="col5">0.56</oasis:entry>
         <oasis:entry colname="col6">0.59</oasis:entry>
         <oasis:entry colname="col7">0.59</oasis:entry>
         <oasis:entry colname="col8">0.57</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M309" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.095</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M310" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.053</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M311" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.093</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M312" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.053</oasis:entry>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M313" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.061</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M314" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.091</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M315" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.096</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Stream elasticity</oasis:entry>
         <oasis:entry colname="col2">0.61</oasis:entry>
         <oasis:entry colname="col3">0.69</oasis:entry>
         <oasis:entry colname="col4">0.70</oasis:entry>
         <oasis:entry colname="col5">0.70</oasis:entry>
         <oasis:entry colname="col6">0.68</oasis:entry>
         <oasis:entry colname="col7">0.69</oasis:entry>
         <oasis:entry colname="col8">0.71</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M316" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.015</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M317" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.024</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M318" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.017</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M319" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.018</oasis:entry>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M320" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.025</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M321" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.032</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M322" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.021</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Zero flow frequency</oasis:entry>
         <oasis:entry colname="col2">0.30</oasis:entry>
         <oasis:entry colname="col3">0.42</oasis:entry>
         <oasis:entry colname="col4">0.27</oasis:entry>
         <oasis:entry colname="col5">0.33</oasis:entry>
         <oasis:entry colname="col6">0.33</oasis:entry>
         <oasis:entry colname="col7">0.31</oasis:entry>
         <oasis:entry colname="col8">0.28</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M323" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.101</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M324" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.097</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M325" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.088</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M326" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.080</oasis:entry>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M327" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.067</oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M328" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.086</oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M329" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula>0.085</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

<?xmltex \hack{\clearpage}?>
</app>

<?pagebreak page2697?><app id="App1.Ch1.S2">
  <?xmltex \currentcnt{B}?><label>Appendix B</label><title>Peak flow timing</title>
      <p id="d1e5692">To evaluate the model performance on the peak timing, we used the following procedure: first, we determined peaks in the observed runoff time series by locality search; that is, potential peaks are defined as local maxima. To reduce the number of peaks and filter out noise, the next step was an iterative process where, by pairwise comparison, only the maximum peak is kept until all peaks have at least a distance of 100 time steps to each other. The procedure is implemented in SciPy's find_peak function <xref ref-type="bibr" rid="bib1.bibx56" id="paren.67"/> and is used in the current work.</p>
      <p id="d1e5698">Second, we iterated over all peaks and searched for the corresponding peak in the simulated discharge time series. The simulated peak is defined as the highest discharge value inside a window of <inline-formula><mml:math id="M330" display="inline"><mml:mrow><mml:mo>±</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:math></inline-formula> d around the observed peak, and the peak timing error is the offset between the observed peak and the simulated peak. The resulting metric is the average offset over all peaks.</p>
</app>

<app id="App1.Ch1.S3">
  <?xmltex \currentcnt{C}?><label>Appendix C</label><title>Analysis of precipitation uncertainty</title>
      <p id="d1e5719">The goal of this supplementary analysis was to understand the relationship between precipitation uncertainty and improvements to streamflow simulations due to using multiple forcing data sets. Because we do not have access to  true precipitation values in each catchment, we used triple collocation to estimate precipitation uncertainty. Triple collocation is a statistical technique for estimating error variances of three or more noisy measurement sources without knowing the true values of the measured quantities <xref ref-type="bibr" rid="bib1.bibx47 bib1.bibx45" id="paren.68"/>. Its major assumption is that the error models are linear and independent between sources and, in particular, that all (three or more) measurement sources are each a combination of a scaled value of the true variable plus additive random noise, as follows:
          <disp-formula id="App1.Ch1.S3.E2" content-type="numbered"><label>C1</label><mml:math id="M331" display="block"><mml:mrow><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi mathvariant="italic">α</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:msub><mml:mi>T</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi mathvariant="italic">ε</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
        where <inline-formula><mml:math id="M332" display="inline"><mml:mrow><mml:msub><mml:mi>M</mml:mi><mml:mo>*</mml:mo></mml:msub></mml:mrow></mml:math></inline-formula> are measurement values (i.e., here the modeled precipitation values), subscript <inline-formula><mml:math id="M333" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula> represents the source (Daymet, Maurer, and NLDAS), and subscript <inline-formula><mml:math id="M334" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula> represents the time step in the test period (1 October 1989 to 30 September 1999). <inline-formula><mml:math id="M335" display="inline"><mml:mrow><mml:msub><mml:mi>T</mml:mi><mml:mo>*</mml:mo></mml:msub></mml:mrow></mml:math></inline-formula> is the unobserved true value of total precipitation in a given catchment on a given day, and <inline-formula><mml:math id="M336" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>*</mml:mo></mml:msub></mml:mrow></mml:math></inline-formula> are independent and identically distributed measurement errors from any distribution.</p>
      <p id="d1e5815">The linearity assumption is not appropriate for precipitation data, which are typically assumed to have multiplicative errors. Following <xref ref-type="bibr" rid="bib1.bibx4" id="text.69"/>, we assumed a multiplicative error model for all three precipitation sources and converted these to linear error models by working with the log-transformed precipitation data, as follows:

              <disp-formula specific-use="gather" content-type="numbered"><mml:math id="M337" display="block"><mml:mtable displaystyle="true"><mml:mlabeledtr id="App1.Ch1.S3.E3"><mml:mtd><mml:mtext>C2</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi mathvariant="italic">α</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:msup><mml:msub><mml:mi>T</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:msup><mml:mo>+</mml:mo><mml:msup><mml:mi>e</mml:mi><mml:mrow><mml:msub><mml:mi mathvariant="italic">ε</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>t</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:msup></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="App1.Ch1.S3.E4"><mml:mtd><mml:mtext>C3</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mi>ln⁡</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:msub><mml:mi mathvariant="italic">α</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:msub><mml:mi>T</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi mathvariant="italic">ε</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>.</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

          Standard triple collocation is then applied so that estimates of the error variances for each source are as follows:
          <disp-formula id="App1.Ch1.S3.E5" content-type="numbered"><label>C4</label><mml:math id="M338" display="block"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
        for all <inline-formula><mml:math id="M339" display="inline"><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>k</mml:mi></mml:mrow></mml:math></inline-formula>, where <inline-formula><mml:math id="M340" display="inline"><mml:mrow><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> is the covariance between the time series of source <inline-formula><mml:math id="M341" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula> and source <inline-formula><mml:math id="M342" display="inline"><mml:mi>j</mml:mi></mml:math></inline-formula>, and  <inline-formula><mml:math id="M343" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the variance of the distribution that each independent and identically distributed measurement <inline-formula><mml:math id="M344" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ε</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>t</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> is drawn from.</p>
      <p id="d1e6071">Additionally, extended triple collocation <xref ref-type="bibr" rid="bib1.bibx33" id="paren.70"/> allows us to derive the correlation coefficients between measurement sources and truth as follows:
          <disp-formula id="App1.Ch1.S3.E6" content-type="numbered"><label>C5</label><mml:math id="M345" display="block"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula></p>
      <p id="d1e6135">This triple collocation analysis was applied separately in each of the 531 CAMELS catchments to obtain basin-specific estimates of the error variances, <inline-formula><mml:math id="M346" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, and truth correlations, <inline-formula><mml:math id="M347" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, for each of the three precipitation products. Albeit the assumption that the forcing products have independent error structures (i.e., <inline-formula><mml:math id="M348" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ε</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>⟂</mml:mo><mml:mspace linebreak="nobreak" width="-0.125em"/><mml:mspace width="-0.125em" linebreak="nobreak"/><mml:mspace linebreak="nobreak" width="-0.125em"/><mml:mspace width="-0.125em" linebreak="nobreak"/><mml:mspace width="-0.125em" linebreak="nobreak"/><mml:mo>⟂</mml:mo><mml:msub><mml:mi mathvariant="italic">ε</mml:mi><mml:mrow><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>t</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>) is not met in our case, we expect the results to be robust enough for the purpose at hand.</p>
      <p id="d1e6196">Daymet typically produced lower NSE values in basins where triple collocation reported that the Daymet precipitation error variances were high. This is what we would expect, i.e., low model skill in basins with high precipitation error. However, we did not see similar patterns with the other two precipitation products (see Fig. <xref ref-type="fig" rid="App1.Ch1.S3.F10"/>), where the triple collocation error variances and truth correlation are plotted against the NSE scores of the single-source models. In fact, the NLDAS LSTM tended to perform worse in basins with lower precipitation error (as estimated by triple collocation).</p>

      <?xmltex \floatpos{t}?><fig id="App1.Ch1.S3.F10"><?xmltex \currentcnt{C1}?><?xmltex \def\figurename{Figure}?><label>Figure C1</label><caption><p id="d1e6203">Triple collocation error variances (<inline-formula><mml:math id="M349" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula> – Eq. <xref ref-type="disp-formula" rid="App1.Ch1.S3.E5"/>) and truth correlations (<inline-formula><mml:math id="M350" display="inline"><mml:mi mathvariant="italic">ρ</mml:mi></mml:math></inline-formula>  – Eq. <xref ref-type="disp-formula" rid="App1.Ch1.S3.E6"/>) plotted against NSE scores of the single-forcing LSTM models. <inline-formula><mml:math id="M351" display="inline"><mml:mi mathvariant="italic">ρ</mml:mi></mml:math></inline-formula> describes how much correlation there is between the given data product and the estimated truth, and <inline-formula><mml:math id="M352" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula> describes the estimated disagreement between a given data product and the other two data products. Daymet typically produces lower NSE values in basins where triple collocation reports that the precipitation error variances are high, whereas NLDAS produces lower NSE values in basins where triple collocation reports that the error variances are low. There is no apparent pattern in the Maurer data.</p></caption>
        <?xmltex \hack{\hsize\textwidth}?>
        <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://hess.copernicus.org/articles/25/2685/2021/hess-25-2685-2021-f10.png"/>

      </fig>

      <p id="d1e6247">A reason for this is shown in Fig. <xref ref-type="fig" rid="App1.Ch1.S3.F11"/>, which is an adapted version of Fig. <xref ref-type="fig" rid="App1.Ch1.S3.F10"/> that highlights a few high-skill, high triple-collocation-variance NLDAS basins in blue. These basins correspond to a cluster of basins in the Rocky Mountains (Fig. <xref ref-type="fig" rid="App1.Ch1.S3.F12"/>) where NLDAS has a low correlation with the other two products but still yields high-skill LSTM simulations. What is happening here is that triple collocation measures (dis)agreement between measurement sources rather than error variances directly. Thus, the results in Fig. <xref ref-type="fig" rid="App1.Ch1.S3.F10"/> that appear to show NLDAS forcing models tending to perform well in basins with high precipitation error is driven, in part, by the fact that there are a few basins in the Rockies where NLDAS disagrees with, but is generally better than, the other two products. What Fig. <xref ref-type="fig" rid="App1.Ch1.S3.F10"/> is really showing is the disagreement between precipitation estimates, and it is not necessarily the case that if one precipitation product disagrees with the others then this product contains more error. The LSTM is able to learn and account for this type of situation; it is not simply learning to trust one product over the others, and it is<?pagebreak page2698?> not simply learning to do something resembling a majority vote in each basin.</p>

      <?xmltex \floatpos{t}?><fig id="App1.Ch1.S3.F11"><?xmltex \currentcnt{C2}?><?xmltex \def\figurename{Figure}?><label>Figure C2</label><caption><p id="d1e6262">As in Fig. <xref ref-type="fig" rid="App1.Ch1.S3.F10"/>, the triple collocation error variances (<inline-formula><mml:math id="M353" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula> – Eq. <xref ref-type="disp-formula" rid="App1.Ch1.S3.E5"/>) and truth correlations (<inline-formula><mml:math id="M354" display="inline"><mml:mi mathvariant="italic">ρ</mml:mi></mml:math></inline-formula>  – Eq. <xref ref-type="disp-formula" rid="App1.Ch1.S3.E6"/>) are plotted against NSE scores of the single-forcing LSTM models. The coloring shows the anomalous NLDAS basins in blue and all others in red. For these basins, NLDAS has low correlation with the other two products but still yields high-skill simulations.</p></caption>
        <?xmltex \hack{\hsize\textwidth}?>
        <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://hess.copernicus.org/articles/25/2685/2021/hess-25-2685-2021-f11.png"/>

      </fig>

<?xmltex \hack{\clearpage}?><?xmltex \floatpos{h!}?><fig id="App1.Ch1.S3.F12"><?xmltex \currentcnt{C3}?><?xmltex \def\figurename{Figure}?><label>Figure C3</label><caption><p id="d1e6297">Spatial distribution of anomalous NLDAS basins shown in Fig. <xref ref-type="fig" rid="App1.Ch1.S3.F11"/> <bold>(a)</bold> compared with elevation of the CAMELS basins <bold>(b)</bold>.</p></caption>
        <?xmltex \hack{\hsize\textwidth}?>
        <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://hess.copernicus.org/articles/25/2685/2021/hess-25-2685-2021-f12.png"/>

      </fig>

      <?xmltex \floatpos{h!}?><fig id="App1.Ch1.S3.F13"><?xmltex \currentcnt{C4}?><?xmltex \def\figurename{Figure}?><label>Figure C4</label><caption><p id="d1e6318">Performance of single-input models relative to the total variance of log precipitation in each basin. The Daymet model tends to perform better in wetter basins (as the total Daymet variance increases), but the other two products have poor-performing basins in catchments with high precipitation variance.</p></caption>
        <?xmltex \hack{\hsize\textwidth}?>
        <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://hess.copernicus.org/articles/25/2685/2021/hess-25-2685-2021-f13.png"/>

      </fig>

      <?xmltex \floatpos{h!}?><fig id="App1.Ch1.S3.F14"><?xmltex \currentcnt{C5}?><?xmltex \def\figurename{Figure}?><label>Figure C5</label><caption><p id="d1e6331">Fractional increase in NSE from the three-forcing model relative to the single-forcing models plotted against the log determinant of the covariance matrix of all three (standardized, log-transformed) precipitation products (<inline-formula><mml:math id="M355" display="inline"><mml:mrow><mml:mo>|</mml:mo><mml:mi mathvariant="normal">Σ</mml:mi><mml:mo>|</mml:mo></mml:mrow></mml:math></inline-formula>). <inline-formula><mml:math id="M356" display="inline"><mml:mrow><mml:mo>|</mml:mo><mml:mi mathvariant="normal">Σ</mml:mi><mml:mo>|</mml:mo></mml:mrow></mml:math></inline-formula> increases when there is a larger disagreement between the three data sets, approximating the joint entropy of the three products. With the exception of the anomalous NLDAS basins (blue markers), the three-forcing model offers improvements with respect to the single-forcing models when there is larger disagreement between the three data sets. The three-forcing model learned to leverage synergy in these three precipitation products.</p></caption>
        <?xmltex \hack{\hsize\textwidth}?>
        <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://hess.copernicus.org/articles/25/2685/2021/hess-25-2685-2021-f14.png"/>

      </fig>

      <p id="d1e6366">Figure <xref ref-type="fig" rid="App1.Ch1.S3.F13"/> plots model performance against the individual variances of the precipitation products in each basin. This figure shows that the single-forcing Daymet LSTM tended to perform better in catchments with higher total precipitation variance (not triple collocation error variance). This is, again, not true for the other two models, where higher total variance was associated with a higher variance in model skill, indicating that a higher proportion of the total variance is likely due to measurement error.</p>
      <?pagebreak page2700?><p id="d1e6371"><?xmltex \hack{\clearpage}?>To analyze the synergy due to using all forcings in a single LSTM, we transposed the NSE improvements in each basin (due to using all three forcing products in the same LSTM) with the log determinant of the covariance matrix of all three (standardized, log-transformed) precipitation products (Fig. <xref ref-type="fig" rid="App1.Ch1.S3.F14"/>). The log determinant is a proxy for the joint entropy of the three (standardized, log-transformed) products and increases when there is larger disagreement between the three data sets. Unlike in Fig. <xref ref-type="fig" rid="App1.Ch1.S3.F13"/>, the variances in Fig. <xref ref-type="fig" rid="App1.Ch1.S3.F14"/> were calculated after removing the mean and overall variance of each log-transformed precipitation product so that the log determinant of the covariance is not affected by the overall magnitude of precipitation in each catchment (i.e., does not increase in wetter catchments). With the exception of the anomalous NLDAS basins, Fig. <xref ref-type="fig" rid="App1.Ch1.S3.F14"/> shows that the three-forcing model offered improvements with respect to the single-forcing models when there was larger disagreement between the three data sets. This indicates that there is value in diversity among precipitation data sets and that the LSTM can exploit this diversity.</p><?xmltex \hack{\clearpage}?>
</app>
  </app-group><notes notes-type="codeavailability"><title>Code availability</title>

      <p id="d1e6388">The code required to reproduce all LSTM results and figures is available at <ext-link xlink:href="https://doi.org/10.5281/zenodo.4738770" ext-link-type="DOI">10.5281/zenodo.4738770</ext-link>. The code for running and optimizing SAC-SMA is available from the “multi-inputs” branch at the following repository: <uri>https://github.com/Upstream-Tech/SACSMA-SNOW17.git</uri> <xref ref-type="bibr" rid="bib1.bibx52" id="paren.71"/>.</p>
  </notes><notes notes-type="dataavailability"><title>Data availability</title>

      <p id="d1e6403">The validation periods of all benchmark models used in this study are available at <ext-link xlink:href="https://doi.org/10.4211/hs.474ecc37e7db45baa425cdb4fc1b61e1" ext-link-type="DOI">10.4211/hs.474ecc37e7db45baa425cdb4fc1b61e1</ext-link> <xref ref-type="bibr" rid="bib1.bibx27" id="paren.72"/>. The extended Maurer forcings, including daily minimum and maximum temperature, are available at <ext-link xlink:href="https://doi.org/10.4211/hs.17c896843cf940339c3c3496d0c1c077" ext-link-type="DOI">10.4211/hs.17c896843cf940339c3c3496d0c1c077</ext-link> <xref ref-type="bibr" rid="bib1.bibx23" id="paren.73"/>. The extended NLDAS forcings, including daily minimum and maximum temperature, are available at <ext-link xlink:href="https://doi.org/10.4211/hs.0a68bfd7ddf642a8be9041d60f40868c" ext-link-type="DOI">10.4211/hs.0a68bfd7ddf642a8be9041d60f40868c</ext-link> <xref ref-type="bibr" rid="bib1.bibx22" id="paren.74"/>. Finally, the weights of the pre-trained models are available at <ext-link xlink:href="https://doi.org/10.5281/zenodo.4670268" ext-link-type="DOI">10.5281/zenodo.4670268</ext-link> <xref ref-type="bibr" rid="bib1.bibx28" id="paren.75"/>.</p>
  </notes><notes notes-type="authorcontribution"><title>Author contributions</title>

      <p id="d1e6434">FK had the idea to train LSTMs on multiple forcing products. FK, DK, and GSN designed all the experiments. FK trained the models and evaluated the results. GSN did the triple collocation analysis and the integrated gradients analysis. GSN supervised the paper from the hydrological perspective and SH from the machine learning perspective. GSN and SH shared the responsibility of the last authorship in their respective fields. All authors worked on the paper.</p>
  </notes><notes notes-type="competinginterests"><title>Competing interests</title>

      <p id="d1e6440">The authors declare that they have no conflict of interest.</p>
  </notes><ack><title>Acknowledgements</title><p id="d1e6446">Authors from the Johannes Kepler University acknowledge support from Bosch, ZF, Google (Faculty Research Award), the NVIDIA Corporation, GPU donations, LIT (grant no. LIT-2017-3-YOU-003), and FWF (grant no. P 28660-N31). Grey Nearing acknowledges support from the NASA Advanced Information Systems Technology program (award ID 80NSSC17K0541).</p><p id="d1e6448">The project relies heavily on open-source software. All programming was done in Python version 3.7 <xref ref-type="bibr" rid="bib1.bibx55" id="paren.76"/> and the associated libraries, including NumPy <xref ref-type="bibr" rid="bib1.bibx54" id="paren.77"/>, Pandas <xref ref-type="bibr" rid="bib1.bibx34" id="paren.78"/>, PyTorch <xref ref-type="bibr" rid="bib1.bibx41" id="paren.79"/>, SciPy <xref ref-type="bibr" rid="bib1.bibx56" id="paren.80"/>, Matplotlib <xref ref-type="bibr" rid="bib1.bibx20" id="paren.81"/>, xarray <xref ref-type="bibr" rid="bib1.bibx19" id="paren.82"/>, and SPOTPY <xref ref-type="bibr" rid="bib1.bibx18" id="paren.83"/>.</p></ack><notes notes-type="financialsupport"><title>Financial support</title>

      <p id="d1e6478">This research has been supported by the Global Water Futures project, the Bundesministerium für Bildung, Wissenschaft und Forschung (grant nos. LIT-2018-6-YOU-212, LIT-2017-3-YOU-003, LIT-2018-6-YOU-214, and LIT-2019-8-YOU-213), the Österreichische
Forschungsförderungsgesellschaft (grant nos. FFG-873979, FFG-872172, and FFG-871302), the European Commission's Horizon 2020 Framework Programme (AIDD; grant no. 956832), Janssen Pharmaceuticals, UCB Biopharma SRL, Merck Healthcare KGaA, the Audi.JKU Deep Learning Center, TGW Logistics Group GmbH, Silicon Austria Labs (SAL), FILL Gesellschaft mbH, Anyline GmbH, Google, ZF Friedrichshafen AG, Robert Bosch GmbH, Software Competence Center Hagenberg GmbH, TÜV Austria, NVIDIA Corporation, and Microsoft Canada.</p>
  </notes><notes notes-type="reviewstatement"><title>Review statement</title>

      <p id="d1e6484">This paper was edited by Dimitri Solomatine and reviewed by Thomas Lees and two anonymous referees.</p>
  </notes><ref-list>
    <title>References</title>

      <ref id="bib1.bibx1"><?xmltex \def\ref@label{{Addor et~al.(2017{\natexlab{a}})Addor, Newman, Mizukami, and
Clark}}?><label>Addor et al.(2017a)Addor, Newman, Mizukami, and
Clark</label><?label addor2017camels?><mixed-citation>Addor, N., Newman, A. J., Mizukami, N., and Clark, M. P.: The CAMELS data set: catchment attributes and meteorology for large-sample studies, Hydrol. Earth Syst. Sci., 21, 5293–5313, <ext-link xlink:href="https://doi.org/10.5194/hess-21-5293-2017" ext-link-type="DOI">10.5194/hess-21-5293-2017</ext-link>, 2017a.</mixed-citation></ref>
      <ref id="bib1.bibx2"><?xmltex \def\ref@label{{Addor et~al.(2017{\natexlab{b}})Addor, Newman, Mizukami, and
Clark}}?><label>Addor et al.(2017b)Addor, Newman, Mizukami, and
Clark</label><?label addor2017large?><mixed-citation>Addor, N., Newman, A. J., Mizukami, N., and Clark, M. P.: Catchment attributes
for large-sample studies, Boulder, CO, UCAR/NCAR,
<ext-link xlink:href="https://doi.org/10.5065/D6G73C3Q" ext-link-type="DOI">10.5065/D6G73C3Q</ext-link>, 2017b.</mixed-citation></ref>
      <ref id="bib1.bibx3"><?xmltex \def\ref@label{{Addor et~al.(2018)Addor, Nearing, Prieto, Newman, Le~Vine, and
Clark}}?><label>Addor et al.(2018)Addor, Nearing, Prieto, Newman, Le Vine, and
Clark</label><?label addor_ranking_2018?><mixed-citation>Addor, N., Nearing, G., Prieto, C., Newman, A. J., Le Vine, N., and Clark,
M. P.: A Ranking of Hydrological Signatures Based on Their
Predictability in Space, Water Resour. Res., 54, 8792–8812,
<ext-link xlink:href="https://doi.org/10.1029/2018WR022606" ext-link-type="DOI">10.1029/2018WR022606</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx4"><?xmltex \def\ref@label{{Alemohammad et~al.(2015)Alemohammad, McColl, Konings, Entekhabi, and
Stoffelen}}?><label>Alemohammad et al.(2015)Alemohammad, McColl, Konings, Entekhabi, and
Stoffelen</label><?label alemohammad_characterization_2015?><mixed-citation>Alemohammad, S. H., McColl, K. A., Konings, A. G., Entekhabi, D., and Stoffelen, A.: Characterization of precipitation product errors across the United States using multiplicative triple collocation, Hydrol. Earth Syst. Sci., 19, 3489–3503, <ext-link xlink:href="https://doi.org/10.5194/hess-19-3489-2015" ext-link-type="DOI">10.5194/hess-19-3489-2015</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx5"><?xmltex \def\ref@label{{Beck et~al.(2017)Beck, Vergopolan, Pan, Levizzani, van Dijk, Weedon,
Brocca, Pappenberger, Huffman, and Wood}}?><label>Beck et al.(2017)Beck, Vergopolan, Pan, Levizzani, van Dijk, Weedon,
Brocca, Pappenberger, Huffman, and Wood</label><?label beck201722products?><mixed-citation>Beck, H. E., Vergopolan, N., Pan, M., Levizzani, V., van Dijk, A. I. J. M., Weedon, G. P., Brocca, L., Pappenberger, F., Huffman, G. J., and Wood, E. F.: Global-scale evaluation of 22 precipitation datasets using gauge observations and hydrological modeling, Hydrol. Earth Syst. Sci., 21, 6201–6217, <ext-link xlink:href="https://doi.org/10.5194/hess-21-6201-2017" ext-link-type="DOI">10.5194/hess-21-6201-2017</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx6"><?xmltex \def\ref@label{{Behnke et~al.(2016)Behnke, Vavrus, Allstadt, Albright, Thogmartin,
and Radeloff}}?><label>Behnke et al.(2016)Behnke, Vavrus, Allstadt, Albright, Thogmartin,
and Radeloff</label><?label behnke2016evaluation?><mixed-citation>
Behnke, R., Vavrus, S., Allstadt, A., Albright, T., Thogmartin, W. E., and
Radeloff, V. C.: Evaluation of downscaled, gridded climate data for the
conterminous United States, Ecol. Appl., 26, 1338–1351, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx7"><?xmltex \def\ref@label{{Clark et~al.(2016)Clark, Wilby, Gutmann, Vano, Gangopadhyay, Wood,
Fowler, Prudhomme, Arnold, and Brekke}}?><label>Clark et al.(2016)Clark, Wilby, Gutmann, Vano, Gangopadhyay, Wood,
Fowler, Prudhomme, Arnold, and Brekke</label><?label clark2016characterizing?><mixed-citation>
Clark, M. P., Wilby, R. L., Gutmann, E. D., Vano, J. A., Gangopadhyay, S.,
Wood, A. W., Fowler, H. J., Prudhomme, C., Arnold, J. R., and Brekke, L. D.:
Characterizing uncertainty of the hydrologic impacts of climate change,
Current Climate Change Reports, 2, 55–64, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx8"><?xmltex \def\ref@label{{Clausen and Biggs(2000)}}?><label>Clausen and Biggs(2000)</label><?label clausen_flow_2000?><mixed-citation>Clausen, B. and Biggs, B.: Flow variables for ecological studies in temperate
streams: groupings based on covariance, J. Hydrol., 237, 184–197,
<ext-link xlink:href="https://doi.org/10.1016/S0022-1694(00)00306-1" ext-link-type="DOI">10.1016/S0022-1694(00)00306-1</ext-link>, 2000.</mixed-citation></ref>
      <ref id="bib1.bibx9"><?xmltex \def\ref@label{{Court(1962)}}?><label>Court(1962)</label><?label court_measures_1962?><mixed-citation>Court, A.: Measures of streamflow timing, J. Geophys. Res., 67,
4335–4339, <ext-link xlink:href="https://doi.org/10.1029/JZ067i011p04335" ext-link-type="DOI">10.1029/JZ067i011p04335</ext-link>, 1962.</mixed-citation></ref>
      <ref id="bib1.bibx10"><?xmltex \def\ref@label{{Duan et~al.(2007)Duan, Ajami, Gao, and Sorooshian}}?><label>Duan et al.(2007)Duan, Ajami, Gao, and Sorooshian</label><?label duan2007multi?><mixed-citation>
Duan, Q., Ajami, N. K., Gao, X., and Sorooshian, S.: Multi-model ensemble
hydrologic prediction using Bayesian model averaging, Adv. Water
Resour., 30, 1371–1386, 2007.</mixed-citation></ref>
      <ref id="bib1.bibx11"><?xmltex \def\ref@label{{Frame et~al.(2020)Frame, Nearing, Kratzert, and
Rahman}}?><label>Frame et al.(2020)Frame, Nearing, Kratzert, and
Rahman</label><?label frame2020post?><mixed-citation>Frame, J., Nearing, G., Kratzert, F., and Rahman, M.: Post processing the US
National Water Model with a Long Short-Term Memory network, J. Am. Water Resour. As.,
<ext-link xlink:href="https://doi.org/10.31223/osf.io/4xhac" ext-link-type="DOI">10.31223/osf.io/4xhac</ext-link>, in review,  2020.</mixed-citation></ref>
      <?pagebreak page2702?><ref id="bib1.bibx12"><?xmltex \def\ref@label{{Gers et~al.(2000)Gers, Schmidhuber, and Cummins}}?><label>Gers et al.(2000)Gers, Schmidhuber, and Cummins</label><?label gers1999learning?><mixed-citation>
Gers, F. A., Schmidhuber, J., and Cummins, F.: Learning to forget: continual prediction
with LSTM, Neural Comput., 12,  2451–2471, 2000.</mixed-citation></ref>
      <ref id="bib1.bibx13"><?xmltex \def\ref@label{{Gupta et~al.(2009)Gupta, Kling, Yilmaz, and
Martinez}}?><label>Gupta et al.(2009)Gupta, Kling, Yilmaz, and
Martinez</label><?label gupta2009decomposition?><mixed-citation>
Gupta, H. V., Kling, H., Yilmaz, K. K., and Martinez, G. F.: Decomposition of
the mean squared error and NSE performance criteria: Implications for
improving hydrological modelling, J. Hydrol., 377, 80–91, 2009.</mixed-citation></ref>
      <ref id="bib1.bibx14"><?xmltex \def\ref@label{{Henn et~al.(2018)Henn, Newman, Livneh, Daly, and
Lundquist}}?><label>Henn et al.(2018)Henn, Newman, Livneh, Daly, and
Lundquist</label><?label henn2018assessment?><mixed-citation>
Henn, B., Newman, A. J., Livneh, B., Daly, C., and Lundquist, J. D.: An
assessment of differences in gridded precipitation datasets in complex
terrain, J. Hydrol., 556, 1205–1219, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx15"><?xmltex \def\ref@label{{Hochreiter(1991)}}?><label>Hochreiter(1991)</label><?label hochreiter1991untersuchungen?><mixed-citation>
Hochreiter, S.: Untersuchungen zu dynamischen neuronalen Netzen, Diploma,
Technische Universität München, München, 91, 1991.</mixed-citation></ref>
      <ref id="bib1.bibx16"><?xmltex \def\ref@label{{Hochreiter and Schmidhuber(1997{\natexlab{a}})}}?><label>Hochreiter and Schmidhuber(1997a)</label><?label hochreiter1997flat?><mixed-citation>
Hochreiter, S. and Schmidhuber, J.: Flat minima, Neural Comput., 9, 1–42,
1997a.</mixed-citation></ref>
      <ref id="bib1.bibx17"><?xmltex \def\ref@label{{Hochreiter and Schmidhuber(1997{\natexlab{b}})}}?><label>Hochreiter and Schmidhuber(1997b)</label><?label hochreiter1997long?><mixed-citation>
Hochreiter, S. and Schmidhuber, J.: Long short-term memory, Neural Comput.,
9, 1735–1780, 1997b.</mixed-citation></ref>
      <ref id="bib1.bibx18"><?xmltex \def\ref@label{{Houska et~al.(2019)Houska, Kraft, Chamorro-Chavez, and
Breuer}}?><label>Houska et al.(2019)Houska, Kraft, Chamorro-Chavez, and
Breuer</label><?label houska2019spotpy?><mixed-citation>Houska, T., Kraft, P., Chamorro-Chavez, A. and Breuer, L.: SPOTting Model
Parameters Using a Ready-Made Python Package, PLoS ONE, 10, e0145180,
<ext-link xlink:href="https://doi.org/10.1371/journal.pone.0145180" ext-link-type="DOI">10.1371/journal.pone.0145180</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx19"><?xmltex \def\ref@label{{Hoyer and Hamman(2017)}}?><label>Hoyer and Hamman(2017)</label><?label hoyer2017xarray?><mixed-citation>Hoyer, S. and Hamman, J.: xarray: N-D labeled arrays and datasets in
Python, Journal of Open Research Software, 5, p. 10, <ext-link xlink:href="https://doi.org/10.5334/jors.148" ext-link-type="DOI">10.5334/jors.148</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx20"><?xmltex \def\ref@label{{Hunter(2007)}}?><label>Hunter(2007)</label><?label Hunter:2007?><mixed-citation>
Hunter, J. D.: Matplotlib: A 2D graphics environment, Comput. Sci.
Eng., 9, 90–95, 2007.</mixed-citation></ref>
      <ref id="bib1.bibx21"><?xmltex \def\ref@label{{Kingma and Ba(2014)}}?><label>Kingma and Ba(2014)</label><?label kingma2014adam?><mixed-citation>Kingma, D. P. and Ba, J.: Adam: A method for stochastic optimization, arXiv
[preprint],  <ext-link xlink:href="https://arxiv.org/abs/1412.6980">arXiv:1412.6980</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx22"><?xmltex \def\ref@label{Kratzert(2019a)}?><label>Kratzert(2019a)</label><?label Kratzertsingle?><mixed-citation>Kratzert, F.: Extended NLDAS forcings,  HydroShare, <ext-link xlink:href="https://doi.org/10.4211/hs.0a68bfd7ddf642a8be9041d60f40868c" ext-link-type="DOI">10.4211/hs.0a68bfd7ddf642a8be9041d60f40868c</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx23"><?xmltex \def\ref@label{Kratzert(2019b)}?><label>Kratzert(2019b)</label><?label Kratzertsingle2?><mixed-citation>Kratzert, F.: Extended Maurer forcings, HydroShare, <ext-link xlink:href="https://doi.org/10.4211/hs.17c896843cf940339c3c3496d0c1c077" ext-link-type="DOI">10.4211/hs.17c896843cf940339c3c3496d0c1c077</ext-link>, 2019b.</mixed-citation></ref>
      <ref id="bib1.bibx24"><?xmltex \def\ref@label{{Kratzert et~al.(2018)Kratzert, Klotz, Brenner, Schulz, and
Herrnegger}}?><label>Kratzert et al.(2018)Kratzert, Klotz, Brenner, Schulz, and
Herrnegger</label><?label kratzert2018rainfall?><mixed-citation>Kratzert, F., Klotz, D., Brenner, C., Schulz, K., and Herrnegger, M.: Rainfall–runoff modelling using Long Short-Term Memory (LSTM) networks, Hydrol. Earth Syst. Sci., 22, 6005–6022, <ext-link xlink:href="https://doi.org/10.5194/hess-22-6005-2018" ext-link-type="DOI">10.5194/hess-22-6005-2018</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx25"><?xmltex \def\ref@label{{Kratzert et~al.(2019{\natexlab{a}})Kratzert, Klotz, Herrnegger,
Sampson, Hochreiter, and Nearing}}?><label>Kratzert et al.(2019a)Kratzert, Klotz, Herrnegger,
Sampson, Hochreiter, and Nearing</label><?label kratzert2019toward?><mixed-citation>Kratzert, F., Klotz, D., Herrnegger, M., Sampson, A. K., Hochreiter, S., and
Nearing, G. S.: Toward Improved Predictions in Ungauged Basins: Exploiting
the Power of Machine Learning, Water Resour. Res., 55, 11344–11354,
<ext-link xlink:href="https://doi.org/10.1029/2019WR026065" ext-link-type="DOI">10.1029/2019WR026065</ext-link>, 2019a.</mixed-citation></ref>
      <ref id="bib1.bibx26"><?xmltex \def\ref@label{{Kratzert et~al.(2019{\natexlab{b}})Kratzert, Klotz, Shalev,
Klambauer, Hochreiter, and Nearing}}?><label>Kratzert et al.(2019b)Kratzert, Klotz, Shalev,
Klambauer, Hochreiter, and Nearing</label><?label kratzert2019towards?><mixed-citation>Kratzert, F., Klotz, D., Shalev, G., Klambauer, G., Hochreiter, S., and Nearing, G.: Towards learning universal, regional, and local hydrological behaviors via machine learning applied to large-sample datasets, Hydrol. Earth Syst. Sci., 23, 5089–5110, <ext-link xlink:href="https://doi.org/10.5194/hess-23-5089-2019" ext-link-type="DOI">10.5194/hess-23-5089-2019</ext-link>, 2019b.</mixed-citation></ref>
      <ref id="bib1.bibx27"><?xmltex \def\ref@label{Kratzert et al.(2019c)}?><label>Kratzert et al.(2019c)</label><?label Kratzert?><mixed-citation>Kratzert, F., Klotz, D., Hochreiter, S., and Nearing, G. S.: Benchmark models, HydroShare, <ext-link xlink:href="https://doi.org/10.4211/hs.474ecc37e7db45baa425cdb4fc1b61e1" ext-link-type="DOI">10.4211/hs.474ecc37e7db45baa425cdb4fc1b61e1</ext-link>, 2019c.</mixed-citation></ref>
      <ref id="bib1.bibx28"><?xmltex \def\ref@label{Kratzert et al.(2021)}?><label>Kratzert et al.(2021)</label><?label Kratzertzenodo?><mixed-citation>Kratzert, F., Klotz, D., Hochreiter, S., and Nearing, G. S.: Pre-trained models, Zenodo [data set],
<ext-link xlink:href="https://doi.org/10.5281/zenodo.4670268" ext-link-type="DOI">10.5281/zenodo.4670268</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx29"><?xmltex \def\ref@label{{Ladson et~al.(2013)Ladson, Brown, Neal, and
Nathan}}?><label>Ladson et al.(2013)Ladson, Brown, Neal, and
Nathan</label><?label ladson_standard_2013?><mixed-citation>
Ladson, A., Brown, R., Neal, B., and Nathan, R.: A standard approach to
baseflow separation using the Lyne and Hollick filter, Australian Journal of Water Resources, 17, , 25–34, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx30"><?xmltex \def\ref@label{{Lundquist et~al.(2019)Lundquist, Hughes, Gutmann, and
Kapnick}}?><label>Lundquist et al.(2019)Lundquist, Hughes, Gutmann, and
Kapnick</label><?label lundquist2019our?><mixed-citation>Lundquist, J., Hughes, M., Gutmann, E., and Kapnick, S.: Our skill in modeling
mountain rain and snow is bypassing the skill of our observational networks,
B. Am. Meteorol. Soc., 100, 2473–2490, <ext-link xlink:href="https://doi.org/10.1175/BAMS-D-19-0001.1" ext-link-type="DOI">10.1175/BAMS-D-19-0001.1</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx31"><?xmltex \def\ref@label{{Madadgar and Moradkhani(2014)}}?><label>Madadgar and Moradkhani(2014)</label><?label madadgar2014improved?><mixed-citation>
Madadgar, S. and Moradkhani, H.: Improved B ayesian multimodeling: Integration
of copulas and B ayesian model averaging, Water Resour. Res., 50,
9586–9603, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx32"><?xmltex \def\ref@label{{Maurer et~al.(2002)Maurer, Wood, Adam, Lettenmaier, and
Nijssen}}?><label>Maurer et al.(2002)Maurer, Wood, Adam, Lettenmaier, and
Nijssen</label><?label maurer2002long?><mixed-citation>
Maurer, E. P., Wood, A., Adam, J., Lettenmaier, D. P., and Nijssen, B.: A
long-term hydrologically based dataset of land surface fluxes and states for
the conterminous United States, J. Climate, 15, 3237–3251, 2002.</mixed-citation></ref>
      <ref id="bib1.bibx33"><?xmltex \def\ref@label{{McColl et~al.(2014)McColl, Vogelzang, Konings, Entekhabi, Piles, and
Stoffelen}}?><label>McColl et al.(2014)McColl, Vogelzang, Konings, Entekhabi, Piles, and
Stoffelen</label><?label mccoll_extended_2014?><mixed-citation>McColl, K. A., Vogelzang, J., Konings, A. G., Entekhabi, D., Piles, M., and
Stoffelen, A.: Extended triple collocation: Estimating errors and
correlation coefficients with respect to an unknown target: EXTENDED
TRIPLE COLLOCATION, Geophys. Res. Lett., 41, 6229–6236,
<ext-link xlink:href="https://doi.org/10.1002/2014GL061322" ext-link-type="DOI">10.1002/2014GL061322</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx34"><?xmltex \def\ref@label{{McKinney(2010)}}?><label>McKinney(2010)</label><?label McKinney2010?><mixed-citation>
McKinney, W.: Data Structures for Statistical Computing in Python, Proceedings
of the 9th Python in Science Conference, Austin, Texas, 28 June–3 July, 1697900, 51–56, 2010.</mixed-citation></ref>
      <ref id="bib1.bibx35"><?xmltex \def\ref@label{{Nash and Sutcliffe(1970)}}?><label>Nash and Sutcliffe(1970)</label><?label nash1970river?><mixed-citation>
Nash, J. E. and Sutcliffe, J. V.: River flow forecasting through conceptual
models part I – A discussion of principles, J. Hydrol., 10,
282–290, 1970.</mixed-citation></ref>
      <ref id="bib1.bibx36"><?xmltex \def\ref@label{{Newman et~al.(2014)Newman, Sampson, Clark, Bock, Viger, and
Blodgett}}?><label>Newman et al.(2014)Newman, Sampson, Clark, Bock, Viger, and
Blodgett</label><?label newman2014large?><mixed-citation>Newman, A., Sampson, K., Clark, M., Bock, A., Viger, R., and Blodgett, D.: A
large-sample watershed-scale hydrometeorological dataset for the contiguous
USA, Boulder, CO: UCAR/NCAR, <ext-link xlink:href="https://doi.org/10.5065/D6MW2F4D" ext-link-type="DOI">10.5065/D6MW2F4D</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx37"><?xmltex \def\ref@label{{Newman et~al.(2017)Newman, Mizukami, Clark, Wood, Nijssen, and
Nearing}}?><label>Newman et al.(2017)Newman, Mizukami, Clark, Wood, Nijssen, and
Nearing</label><?label newman2017benchmarking?><mixed-citation>
Newman, A. J., Mizukami, N., Clark, M. P., Wood, A. W., Nijssen, B., and
Nearing, G.: Benchmarking of a physically based hydrologic model, J.
Hydrometeorol., 18, 2215–2225, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx38"><?xmltex \def\ref@label{{Newman et~al.(2019)Newman, Clark, Longman, and
Giambelluca}}?><label>Newman et al.(2019)Newman, Clark, Longman, and
Giambelluca</label><?label newman2019methodological?><mixed-citation>
Newman, A. J., Clark, M. P., Longman, R. J., and Giambelluca, T. W.:
Methodological intercomparisons of station-based gridded meteorological
products: Utility, limitations, and paths forward, J.
Hydrometeorol., 20, 531–547, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx39"><?xmltex \def\ref@label{{Olden and Poff(2003)}}?><label>Olden and Poff(2003)</label><?label olden2003redundancy?><mixed-citation>Olden, J. D. and Poff, N. L.: Redundancy and the choice of hydrologic indices
for characterizing streamflow regimes, River Res. Appl., 19,
101–121, <ext-link xlink:href="https://doi.org/10.1002/rra.700" ext-link-type="DOI">10.1002/rra.700</ext-link>, 2003.</mixed-citation></ref>
      <ref id="bib1.bibx40"><?xmltex \def\ref@label{{Parkes et~al.(2019)Parkes, Higginbottom, Hufkens, Ceballos, Kramer,
and Foster}}?><label>Parkes et al.(2019)Parkes, Higginbottom, Hufkens, Ceballos, Kramer,
and Foster</label><?label parkes2019weather?><mixed-citation>Parkes, B., Higginbottom, T. P., Hufkens, K., Ceballos, F., Kramer, B., and
Foster, T.: Weather dataset choice introduces uncertainty to estimates of
crop yield responses to climate variability and change, Environ.
Res. Lett., 14, 124089, <ext-link xlink:href="https://doi.org/10.1088/1748-9326/ab5ebb" ext-link-type="DOI">10.1088/1748-9326/ab5ebb</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx41"><?xmltex \def\ref@label{{Paszke et~al.(2017)Paszke, Gross, Chintala, Chanan, Yang, DeVito,
Lin, Desmaison, Antiga, and Lerer}}?><label>Paszke et al.(2017)Paszke, Gross, Chintala, Chanan, Yang, DeVito,
Lin, Desmaison, Antiga, and Lerer</label><?label paszke2017automatic?><mixed-citation>
Paszke, A., Gross, S., Chintala, S., Chanan, G., Yang, E., DeVito, Z., Lin, Z.,
Desmaison, A., Antiga, L., and Lerer, A.: Pytorch: an imperative style, high-performance deep learning library,in:
Advances in Neural Information Processing Systems, 32,
8024–8035, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx42"><?xmltex \def\ref@label{{Pearl(1988)}}?><label>Pearl(1988)</label><?label pearl1988embracing?><mixed-citation>
Pearl, J.: Embracing causality in default reasoning, Artificial Intelligence,
35, 259–271, 1988.</mixed-citation></ref>
      <ref id="bib1.bibx43"><?xmltex \def\ref@label{{Sankarasubramanian et~al.(2001)Sankarasubramanian, Vogel, and
Limbrunner}}?><label>Sankarasubramanian et al.(2001)Sankarasubramanian, Vogel, and
Limbrunner</label><?label sankarasubramanian_climate_2001?><mixed-citation>Sankarasubramanian, A., Vogel, R. M., and Limbrunner, J. F.: Climate elasticity
of streamflow in the United States, Water Resour. Res., 37,
1771–1781, <ext-link xlink:href="https://doi.org/10.1029/2000WR900330" ext-link-type="DOI">10.1029/2000WR900330</ext-link>, 2001.</mixed-citation></ref>
      <ref id="bib1.bibx44"><?xmltex \def\ref@label{{Sawicz et~al.(2011)Sawicz, Wagener, Sivapalan, Troch, and
Carrillo}}?><label>Sawicz et al.(2011)Sawicz, Wagener, Sivapalan, Troch, and
Carrillo</label><?label sawicz_catchment_2011?><mixed-citation>Sawicz, K., Wagener, T., Sivapalan, M., Troch, P. A., and Carrillo, G.: Catchment classification: empirical analysis of hydrologic similarity based on catchment function in the eastern USA, Hydrol. Earth Syst. Sci., 15, 2895–2911, <ext-link xlink:href="https://doi.org/10.5194/hess-15-2895-2011" ext-link-type="DOI">10.5194/hess-15-2895-2011</ext-link>, 2011.</mixed-citation></ref>
      <?pagebreak page2703?><ref id="bib1.bibx45"><?xmltex \def\ref@label{{Scipal et~al.(2010)Scipal, Dorigo, and deJeu}}?><label>Scipal et al.(2010)Scipal, Dorigo, and deJeu</label><?label scipal2010triple?><mixed-citation>
Scipal, K., Dorigo, W., and deJeu, R.: Triple collocation—A new tool to
determine the error structure of global soil moisture products, in: 2010 IEEE
International Geoscience and Remote Sensing Symposium, Honolulu, HI, USA, 25—30 July 2010, 4426–4429, IEEE,
2010.</mixed-citation></ref>
      <ref id="bib1.bibx46"><?xmltex \def\ref@label{{Shrikumar et~al.(2016)Shrikumar, Greenside, Shcherbina, and
Kundaje}}?><label>Shrikumar et al.(2016)Shrikumar, Greenside, Shcherbina, and
Kundaje</label><?label shrikumar2016not?><mixed-citation>Shrikumar, A., Greenside, P., Shcherbina, A., and Kundaje, A.: Not just a black
box: Learning important features through propagating activation differences,
arXiv [preprint],    <ext-link xlink:href="https://arxiv.org/abs/1605.01713">arXiv:1605.01713</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx47"><?xmltex \def\ref@label{{Stoffelen(1998)}}?><label>Stoffelen(1998)</label><?label stoffelen1998toward?><mixed-citation>
Stoffelen, A.: Toward the true near-surface wind speed: Error modeling and
calibration using triple collocation, J. Geophys. Res.-Oceans, 103, 7755–7766, 1998.</mixed-citation></ref>
      <ref id="bib1.bibx48"><?xmltex \def\ref@label{{Sundararajan et~al.(2017)Sundararajan, Taly, and
Yan}}?><label>Sundararajan et al.(2017)Sundararajan, Taly, and
Yan</label><?label Sundararajan2017?><mixed-citation>Sundararajan, M., Taly, A., and Yan, Q.: Axiomatic attribution for deep
networks, in: Proceedings of the 34th International Conference on Machine
Learning-Volume 70,  3319–3328, available at: <uri>http://proceedings.mlr.press/v70/sundararajan17a.html</uri> (last access: 13 May 2020), 2017.</mixed-citation></ref>
      <ref id="bib1.bibx49"><?xmltex \def\ref@label{{Sutton(2019)}}?><label>Sutton(2019)</label><?label sutton2019bitter?><mixed-citation>Sutton, R.: The bitter lesson, Incomplete Ideas (blog), available at: <uri>http://www.incompleteideas.net/IncIdeas/BitterLesson.html</uri> (last access: 13 May 2020),  2019.</mixed-citation></ref>
      <ref id="bib1.bibx50"><?xmltex \def\ref@label{{Thornton et~al.(1997)Thornton, Running, White
et~al.}}?><label>Thornton et al.(1997)Thornton, Running, White
et al.</label><?label thornton1997generating?><mixed-citation>
Thornton, P. E., Running, S. W., White, M. A.: Generating surfaces of
daily meteorological variables over large regions of complex terrain, J.
Hydrol., 190, 214–251, 1997.</mixed-citation></ref>
      <ref id="bib1.bibx51"><?xmltex \def\ref@label{{Timmermans et~al.(2019)Timmermans, Wehner, Cooley, O'Brien, and
Krishnan}}?><label>Timmermans et al.(2019)Timmermans, Wehner, Cooley, O'Brien, and
Krishnan</label><?label timmermans2019evaluation?><mixed-citation>
Timmermans, B., Wehner, M., Cooley, D., O'Brien, T., and Krishnan, H.: An
evaluation of the consistency of extremes in gridded precipitation data sets,
Clim. Dynam., 52, 6651–6670, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx52"><?xmltex \def\ref@label{ Upstream-Tech(2020)}?><label> Upstream-Tech(2020)</label><?label data?><mixed-citation>Upstream-Tech:
SACSMA-SNOW17, available at: <uri>https://github.com/Upstream-Tech/SACSMA-SNOW17.git</uri>, last access: 11 July 2020.</mixed-citation></ref>
      <ref id="bib1.bibx53"><?xmltex \def\ref@label{{Tolson and Shoemaker(2007)}}?><label>Tolson and Shoemaker(2007)</label><?label tolson2007dynamically?><mixed-citation>Tolson, B. A. and Shoemaker, C. A.: Dynamically dimensioned search algorithm
for computationally efficient watershed model calibration, Water Resour. Res., 43, W01413, <ext-link xlink:href="https://doi.org/10.1029/2005WR004723" ext-link-type="DOI">10.1029/2005WR004723</ext-link>, 2007.</mixed-citation></ref>
      <ref id="bib1.bibx54"><?xmltex \def\ref@label{{{Van Der Walt} et~al.(2011){Van Der Walt}, Colbert, and
Varoquaux}}?><label>Van Der Walt et al.(2011)Van Der Walt, Colbert, and
Varoquaux</label><?label VanDerWalt2011?><mixed-citation>
Van Der Walt, S., Colbert, S. C., and Varoquaux, G.: The NumPy array: A
structure for efficient numerical computation, Comput. Sci.
Eng., 13, 22–30, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx55"><?xmltex \def\ref@label{{van Rossum(1995)}}?><label>van Rossum(1995)</label><?label VanRossum1995?><mixed-citation>van Rossum, G.: Python tutorial, Technical Report CS-R9526, Tech. rep.,
Centrum voor Wiskunde en Informatica (CWI), Amsterdam, 1995.
 </mixed-citation></ref><?xmltex \hack{\newpage}?>
      <ref id="bib1.bibx56"><?xmltex \def\ref@label{{{Virtanen} et~al.(2020){Virtanen}, {Gommers}, {Oliphant},
{Haberland}, {Reddy}, {Cournapeau}, {Burovski}, {Peterson}, {Weckesser},
{Bright}, {van der Walt}, {Brett}, {Wilson}, {Jarrod Millman}, {Mayorov},
{Nelson}, {Jones}, {Kern}, {Larson}, {Carey}, {Polat}, {Feng}, {Moore}, {Vand
erPlas}, {Laxalde}, {Perktold}, {Cimrman}, {Henriksen}, {Quintero}, {Harris},
{Archibald}, {Ribeiro}, {Pedregosa}, {van Mulbregt}, and
{Contributors}}}?><label>Virtanen et al.(2020)Virtanen, Gommers, Oliphant,
Haberland, Reddy, Cournapeau, Burovski, Peterson, Weckesser,
Bright, van der Walt, Brett, Wilson, Jarrod Millman, Mayorov,
Nelson, Jones, Kern, Larson, Carey, Polat, Feng, Moore, Vand
erPlas, Laxalde, Perktold, Cimrman, Henriksen, Quintero, Harris,
Archibald, Ribeiro, Pedregosa, van Mulbregt, and
Contributors</label><?label 2020SciPy-NMeth?><mixed-citation>Virtanen, P., Gommers, R., Oliphant, T. E., Haberland, M., Reddy, T.,
Cournapeau, D., Burovski, E., Peterson, P., Weckesser, W., Bright,
J., van der Walt, S. J., Brett, M., Wilson, J., Jarrod Millman, K.,
Mayorov, N., Nelson, A. R. J., Jones, E., Kern, R., Larson, E.,
Carey, C., Polat, İ., Feng, Y., Moore, E. W., Vand erPlas, J.,
Laxalde, D., Perktold, J., Cimrman, R., Henriksen, I., Quintero,
E. A., Harris, C. R., Archibald, A. M., Ribeiro, A. H., Pedregosa,
F., van Mulbregt, P., and Contributors, S. . .: SciPy 1.0: Fundamental
Algorithms for Scientific Computing in Python, Nat. Methods, 17, 261–272,
<ext-link xlink:href="https://doi.org/10.1038/s41592-019-0686-2" ext-link-type="DOI">10.1038/s41592-019-0686-2</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx57"><?xmltex \def\ref@label{{Wellman and Henrion(1993)}}?><label>Wellman and Henrion(1993)</label><?label wellman1993explaining?><mixed-citation>
Wellman, M. P. and Henrion, M.: Explaining'explaining away', IEEE T.
Pattern Anal., 15, 287–292, 1993.</mixed-citation></ref>
      <ref id="bib1.bibx58"><?xmltex \def\ref@label{{Westerberg and McMillan(2015)}}?><label>Westerberg and McMillan(2015)</label><?label westerberg_uncertainty_2015?><mixed-citation>Westerberg, I. K. and McMillan, H. K.: Uncertainty in hydrological signatures, Hydrol. Earth Syst. Sci., 19, 3951–3968, <ext-link xlink:href="https://doi.org/10.5194/hess-19-3951-2015" ext-link-type="DOI">10.5194/hess-19-3951-2015</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx59"><?xmltex \def\ref@label{{Xia et~al.(2012)Xia, Mitchell, Ek, Sheffield, Cosgrove, Wood, Luo,
Alonge, Wei, Meng et~al.}}?><label>Xia et al.(2012)Xia, Mitchell, Ek, Sheffield, Cosgrove, Wood, Luo,
Alonge, Wei, Meng et al.</label><?label xia2012continental?><mixed-citation>Xia, Y.,  Mitchell, K.,  Ek, M., Sheffield, J., Cosgrove, B.,  Wood, E.,  Luo, L.,  Alonge, C., Wei, H., Meng, J.,  Livneh, B.,  Lettenmaier, D.,  Koren, V.,  Duan,
Q.,  Mo, K.,  Fan, Y., and Mocko, D.: Continental-scale water and energy
flux analysis and validation for the North American Land Data Assimilation
System project phase 2 (NLDAS-2): 1. Intercomparison and application of model
products, J. Geophys. Res.-Atmos., 117,  D03109, <ext-link xlink:href="https://doi.org/10.1029/2011JD016048" ext-link-type="DOI">10.1029/2011JD016048</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx60"><?xmltex \def\ref@label{{Yilmaz et~al.(2005)Yilmaz, Hogue, Hsu, Sorooshian, Gupta, and
Wagener}}?><label>Yilmaz et al.(2005)Yilmaz, Hogue, Hsu, Sorooshian, Gupta, and
Wagener</label><?label yilmaz2005intercomparison?><mixed-citation>
Yilmaz, K. K., Hogue, T. S., Hsu, K.-L., Sorooshian, S., Gupta, H. V., and
Wagener, T.: Intercomparison of rain gauge, radar, and satellite-based
precipitation estimates with emphasis on hydrologic forecasting, J.
Hydrometeorol., 6, 497–517, 2005.</mixed-citation></ref>
      <ref id="bib1.bibx61"><?xmltex \def\ref@label{{Yilmaz et~al.(2008)Yilmaz, Gupta, and Wagener}}?><label>Yilmaz et al.(2008)Yilmaz, Gupta, and Wagener</label><?label yilmaz2008process?><mixed-citation>Yilmaz, K. K., Gupta, H. V., and Wagener, T.: A process-based diagnostic
approach to model evaluation: Application to the NWS distributed hydrologic
model, Water Resour. Res., 44, W09417, <ext-link xlink:href="https://doi.org/10.1029/2007WR006716" ext-link-type="DOI">10.1029/2007WR006716</ext-link>, 2008.</mixed-citation></ref>

  </ref-list></back>
    <!--<article-title-html>A note on leveraging synergy in multiple meteorological data sets with deep learning for rainfall–runoff modeling</article-title-html>
<abstract-html><p>A deep learning rainfall–runoff model can take multiple meteorological forcing products as input and learn to combine them in spatially and temporally dynamic ways. This is demonstrated with Long Short-Term Memory networks (LSTMs) trained over basins in the continental US, using the Catchment Attributes and Meteorological data set for Large Sample Studies (CAMELS). Using meteorological input from different data products (North American Land Data Assimilation System, NLDAS, Maurer, and Daymet) in a single LSTM significantly improved simulation accuracy relative to using only individual meteorological products. A sensitivity analysis showed that the LSTM combines precipitation products in different ways, depending on location, and also in different ways for the simulation of different parts of the hydrograph.</p></abstract-html>
<ref-html id="bib1.bib1"><label>Addor et al.(2017a)Addor, Newman, Mizukami, and
Clark</label><mixed-citation>
Addor, N., Newman, A. J., Mizukami, N., and Clark, M. P.: The CAMELS data set: catchment attributes and meteorology for large-sample studies, Hydrol. Earth Syst. Sci., 21, 5293–5313, <a href="https://doi.org/10.5194/hess-21-5293-2017" target="_blank">https://doi.org/10.5194/hess-21-5293-2017</a>, 2017a.
</mixed-citation></ref-html>
<ref-html id="bib1.bib2"><label>Addor et al.(2017b)Addor, Newman, Mizukami, and
Clark</label><mixed-citation>
Addor, N., Newman, A. J., Mizukami, N., and Clark, M. P.: Catchment attributes
for large-sample studies, Boulder, CO, UCAR/NCAR,
<a href="https://doi.org/10.5065/D6G73C3Q" target="_blank">https://doi.org/10.5065/D6G73C3Q</a>, 2017b.
</mixed-citation></ref-html>
<ref-html id="bib1.bib3"><label>Addor et al.(2018)Addor, Nearing, Prieto, Newman, Le Vine, and
Clark</label><mixed-citation>
Addor, N., Nearing, G., Prieto, C., Newman, A. J., Le Vine, N., and Clark,
M. P.: A Ranking of Hydrological Signatures Based on Their
Predictability in Space, Water Resour. Res., 54, 8792–8812,
<a href="https://doi.org/10.1029/2018WR022606" target="_blank">https://doi.org/10.1029/2018WR022606</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib4"><label>Alemohammad et al.(2015)Alemohammad, McColl, Konings, Entekhabi, and
Stoffelen</label><mixed-citation>
Alemohammad, S. H., McColl, K. A., Konings, A. G., Entekhabi, D., and Stoffelen, A.: Characterization of precipitation product errors across the United States using multiplicative triple collocation, Hydrol. Earth Syst. Sci., 19, 3489–3503, <a href="https://doi.org/10.5194/hess-19-3489-2015" target="_blank">https://doi.org/10.5194/hess-19-3489-2015</a>, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib5"><label>Beck et al.(2017)Beck, Vergopolan, Pan, Levizzani, van Dijk, Weedon,
Brocca, Pappenberger, Huffman, and Wood</label><mixed-citation>
Beck, H. E., Vergopolan, N., Pan, M., Levizzani, V., van Dijk, A. I. J. M., Weedon, G. P., Brocca, L., Pappenberger, F., Huffman, G. J., and Wood, E. F.: Global-scale evaluation of 22 precipitation datasets using gauge observations and hydrological modeling, Hydrol. Earth Syst. Sci., 21, 6201–6217, <a href="https://doi.org/10.5194/hess-21-6201-2017" target="_blank">https://doi.org/10.5194/hess-21-6201-2017</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib6"><label>Behnke et al.(2016)Behnke, Vavrus, Allstadt, Albright, Thogmartin,
and Radeloff</label><mixed-citation>
Behnke, R., Vavrus, S., Allstadt, A., Albright, T., Thogmartin, W. E., and
Radeloff, V. C.: Evaluation of downscaled, gridded climate data for the
conterminous United States, Ecol. Appl., 26, 1338–1351, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib7"><label>Clark et al.(2016)Clark, Wilby, Gutmann, Vano, Gangopadhyay, Wood,
Fowler, Prudhomme, Arnold, and Brekke</label><mixed-citation>
Clark, M. P., Wilby, R. L., Gutmann, E. D., Vano, J. A., Gangopadhyay, S.,
Wood, A. W., Fowler, H. J., Prudhomme, C., Arnold, J. R., and Brekke, L. D.:
Characterizing uncertainty of the hydrologic impacts of climate change,
Current Climate Change Reports, 2, 55–64, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib8"><label>Clausen and Biggs(2000)</label><mixed-citation>
Clausen, B. and Biggs, B.: Flow variables for ecological studies in temperate
streams: groupings based on covariance, J. Hydrol., 237, 184–197,
<a href="https://doi.org/10.1016/S0022-1694(00)00306-1" target="_blank">https://doi.org/10.1016/S0022-1694(00)00306-1</a>, 2000.
</mixed-citation></ref-html>
<ref-html id="bib1.bib9"><label>Court(1962)</label><mixed-citation>
Court, A.: Measures of streamflow timing, J. Geophys. Res., 67,
4335–4339, <a href="https://doi.org/10.1029/JZ067i011p04335" target="_blank">https://doi.org/10.1029/JZ067i011p04335</a>, 1962.
</mixed-citation></ref-html>
<ref-html id="bib1.bib10"><label>Duan et al.(2007)Duan, Ajami, Gao, and Sorooshian</label><mixed-citation>
Duan, Q., Ajami, N. K., Gao, X., and Sorooshian, S.: Multi-model ensemble
hydrologic prediction using Bayesian model averaging, Adv. Water
Resour., 30, 1371–1386, 2007.
</mixed-citation></ref-html>
<ref-html id="bib1.bib11"><label>Frame et al.(2020)Frame, Nearing, Kratzert, and
Rahman</label><mixed-citation>
Frame, J., Nearing, G., Kratzert, F., and Rahman, M.: Post processing the US
National Water Model with a Long Short-Term Memory network, J. Am. Water Resour. As.,
<a href="https://doi.org/10.31223/osf.io/4xhac" target="_blank">https://doi.org/10.31223/osf.io/4xhac</a>, in review,  2020.
</mixed-citation></ref-html>
<ref-html id="bib1.bib12"><label>Gers et al.(2000)Gers, Schmidhuber, and Cummins</label><mixed-citation>
Gers, F. A., Schmidhuber, J., and Cummins, F.: Learning to forget: continual prediction
with LSTM, Neural Comput., 12,  2451–2471, 2000.
</mixed-citation></ref-html>
<ref-html id="bib1.bib13"><label>Gupta et al.(2009)Gupta, Kling, Yilmaz, and
Martinez</label><mixed-citation>
Gupta, H. V., Kling, H., Yilmaz, K. K., and Martinez, G. F.: Decomposition of
the mean squared error and NSE performance criteria: Implications for
improving hydrological modelling, J. Hydrol., 377, 80–91, 2009.
</mixed-citation></ref-html>
<ref-html id="bib1.bib14"><label>Henn et al.(2018)Henn, Newman, Livneh, Daly, and
Lundquist</label><mixed-citation>
Henn, B., Newman, A. J., Livneh, B., Daly, C., and Lundquist, J. D.: An
assessment of differences in gridded precipitation datasets in complex
terrain, J. Hydrol., 556, 1205–1219, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib15"><label>Hochreiter(1991)</label><mixed-citation>
Hochreiter, S.: Untersuchungen zu dynamischen neuronalen Netzen, Diploma,
Technische Universität München, München, 91, 1991.
</mixed-citation></ref-html>
<ref-html id="bib1.bib16"><label>Hochreiter and Schmidhuber(1997a)</label><mixed-citation>
Hochreiter, S. and Schmidhuber, J.: Flat minima, Neural Comput., 9, 1–42,
1997a.
</mixed-citation></ref-html>
<ref-html id="bib1.bib17"><label>Hochreiter and Schmidhuber(1997b)</label><mixed-citation>
Hochreiter, S. and Schmidhuber, J.: Long short-term memory, Neural Comput.,
9, 1735–1780, 1997b.
</mixed-citation></ref-html>
<ref-html id="bib1.bib18"><label>Houska et al.(2019)Houska, Kraft, Chamorro-Chavez, and
Breuer</label><mixed-citation>
Houska, T., Kraft, P., Chamorro-Chavez, A. and Breuer, L.: SPOTting Model
Parameters Using a Ready-Made Python Package, PLoS ONE, 10, e0145180,
<a href="https://doi.org/10.1371/journal.pone.0145180" target="_blank">https://doi.org/10.1371/journal.pone.0145180</a>, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib19"><label>Hoyer and Hamman(2017)</label><mixed-citation>
Hoyer, S. and Hamman, J.: xarray: N-D labeled arrays and datasets in
Python, Journal of Open Research Software, 5, p. 10, <a href="https://doi.org/10.5334/jors.148" target="_blank">https://doi.org/10.5334/jors.148</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib20"><label>Hunter(2007)</label><mixed-citation>
Hunter, J. D.: Matplotlib: A 2D graphics environment, Comput. Sci.
Eng., 9, 90–95, 2007.
</mixed-citation></ref-html>
<ref-html id="bib1.bib21"><label>Kingma and Ba(2014)</label><mixed-citation>
Kingma, D. P. and Ba, J.: Adam: A method for stochastic optimization, arXiv
[preprint],  <a href="https://arxiv.org/abs/1412.6980" target="_blank">arXiv:1412.6980</a>, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib22"><label>Kratzert(2019a)</label><mixed-citation>
Kratzert, F.: Extended NLDAS forcings,  HydroShare, <a href="https://doi.org/10.4211/hs.0a68bfd7ddf642a8be9041d60f40868c" target="_blank">https://doi.org/10.4211/hs.0a68bfd7ddf642a8be9041d60f40868c</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib23"><label>Kratzert(2019b)</label><mixed-citation>
Kratzert, F.: Extended Maurer forcings, HydroShare, <a href="https://doi.org/10.4211/hs.17c896843cf940339c3c3496d0c1c077" target="_blank">https://doi.org/10.4211/hs.17c896843cf940339c3c3496d0c1c077</a>, 2019b.
</mixed-citation></ref-html>
<ref-html id="bib1.bib24"><label>Kratzert et al.(2018)Kratzert, Klotz, Brenner, Schulz, and
Herrnegger</label><mixed-citation>
Kratzert, F., Klotz, D., Brenner, C., Schulz, K., and Herrnegger, M.: Rainfall–runoff modelling using Long Short-Term Memory (LSTM) networks, Hydrol. Earth Syst. Sci., 22, 6005–6022, <a href="https://doi.org/10.5194/hess-22-6005-2018" target="_blank">https://doi.org/10.5194/hess-22-6005-2018</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib25"><label>Kratzert et al.(2019a)Kratzert, Klotz, Herrnegger,
Sampson, Hochreiter, and Nearing</label><mixed-citation>
Kratzert, F., Klotz, D., Herrnegger, M., Sampson, A. K., Hochreiter, S., and
Nearing, G. S.: Toward Improved Predictions in Ungauged Basins: Exploiting
the Power of Machine Learning, Water Resour. Res., 55, 11344–11354,
<a href="https://doi.org/10.1029/2019WR026065" target="_blank">https://doi.org/10.1029/2019WR026065</a>, 2019a.
</mixed-citation></ref-html>
<ref-html id="bib1.bib26"><label>Kratzert et al.(2019b)Kratzert, Klotz, Shalev,
Klambauer, Hochreiter, and Nearing</label><mixed-citation>
Kratzert, F., Klotz, D., Shalev, G., Klambauer, G., Hochreiter, S., and Nearing, G.: Towards learning universal, regional, and local hydrological behaviors via machine learning applied to large-sample datasets, Hydrol. Earth Syst. Sci., 23, 5089–5110, <a href="https://doi.org/10.5194/hess-23-5089-2019" target="_blank">https://doi.org/10.5194/hess-23-5089-2019</a>, 2019b.
</mixed-citation></ref-html>
<ref-html id="bib1.bib27"><label>Kratzert et al.(2019c)</label><mixed-citation>
Kratzert, F., Klotz, D., Hochreiter, S., and Nearing, G. S.: Benchmark models, HydroShare, <a href="https://doi.org/10.4211/hs.474ecc37e7db45baa425cdb4fc1b61e1" target="_blank">https://doi.org/10.4211/hs.474ecc37e7db45baa425cdb4fc1b61e1</a>, 2019c.
</mixed-citation></ref-html>
<ref-html id="bib1.bib28"><label>Kratzert et al.(2021)</label><mixed-citation>
Kratzert, F., Klotz, D., Hochreiter, S., and Nearing, G. S.: Pre-trained models, Zenodo [data set],
<a href="https://doi.org/10.5281/zenodo.4670268" target="_blank">https://doi.org/10.5281/zenodo.4670268</a>, 2021.
</mixed-citation></ref-html>
<ref-html id="bib1.bib29"><label>Ladson et al.(2013)Ladson, Brown, Neal, and
Nathan</label><mixed-citation>
Ladson, A., Brown, R., Neal, B., and Nathan, R.: A standard approach to
baseflow separation using the Lyne and Hollick filter, Australian Journal of Water Resources, 17, , 25–34, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib30"><label>Lundquist et al.(2019)Lundquist, Hughes, Gutmann, and
Kapnick</label><mixed-citation>
Lundquist, J., Hughes, M., Gutmann, E., and Kapnick, S.: Our skill in modeling
mountain rain and snow is bypassing the skill of our observational networks,
B. Am. Meteorol. Soc., 100, 2473–2490, <a href="https://doi.org/10.1175/BAMS-D-19-0001.1" target="_blank">https://doi.org/10.1175/BAMS-D-19-0001.1</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib31"><label>Madadgar and Moradkhani(2014)</label><mixed-citation>
Madadgar, S. and Moradkhani, H.: Improved B ayesian multimodeling: Integration
of copulas and B ayesian model averaging, Water Resour. Res., 50,
9586–9603, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib32"><label>Maurer et al.(2002)Maurer, Wood, Adam, Lettenmaier, and
Nijssen</label><mixed-citation>
Maurer, E. P., Wood, A., Adam, J., Lettenmaier, D. P., and Nijssen, B.: A
long-term hydrologically based dataset of land surface fluxes and states for
the conterminous United States, J. Climate, 15, 3237–3251, 2002.
</mixed-citation></ref-html>
<ref-html id="bib1.bib33"><label>McColl et al.(2014)McColl, Vogelzang, Konings, Entekhabi, Piles, and
Stoffelen</label><mixed-citation>
McColl, K. A., Vogelzang, J., Konings, A. G., Entekhabi, D., Piles, M., and
Stoffelen, A.: Extended triple collocation: Estimating errors and
correlation coefficients with respect to an unknown target: EXTENDED
TRIPLE COLLOCATION, Geophys. Res. Lett., 41, 6229–6236,
<a href="https://doi.org/10.1002/2014GL061322" target="_blank">https://doi.org/10.1002/2014GL061322</a>, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib34"><label>McKinney(2010)</label><mixed-citation>
McKinney, W.: Data Structures for Statistical Computing in Python, Proceedings
of the 9th Python in Science Conference, Austin, Texas, 28 June–3 July, 1697900, 51–56, 2010.
</mixed-citation></ref-html>
<ref-html id="bib1.bib35"><label>Nash and Sutcliffe(1970)</label><mixed-citation>
Nash, J. E. and Sutcliffe, J. V.: River flow forecasting through conceptual
models part I – A discussion of principles, J. Hydrol., 10,
282–290, 1970.
</mixed-citation></ref-html>
<ref-html id="bib1.bib36"><label>Newman et al.(2014)Newman, Sampson, Clark, Bock, Viger, and
Blodgett</label><mixed-citation>
Newman, A., Sampson, K., Clark, M., Bock, A., Viger, R., and Blodgett, D.: A
large-sample watershed-scale hydrometeorological dataset for the contiguous
USA, Boulder, CO: UCAR/NCAR, <a href="https://doi.org/10.5065/D6MW2F4D" target="_blank">https://doi.org/10.5065/D6MW2F4D</a>, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib37"><label>Newman et al.(2017)Newman, Mizukami, Clark, Wood, Nijssen, and
Nearing</label><mixed-citation>
Newman, A. J., Mizukami, N., Clark, M. P., Wood, A. W., Nijssen, B., and
Nearing, G.: Benchmarking of a physically based hydrologic model, J.
Hydrometeorol., 18, 2215–2225, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib38"><label>Newman et al.(2019)Newman, Clark, Longman, and
Giambelluca</label><mixed-citation>
Newman, A. J., Clark, M. P., Longman, R. J., and Giambelluca, T. W.:
Methodological intercomparisons of station-based gridded meteorological
products: Utility, limitations, and paths forward, J.
Hydrometeorol., 20, 531–547, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib39"><label>Olden and Poff(2003)</label><mixed-citation>
Olden, J. D. and Poff, N. L.: Redundancy and the choice of hydrologic indices
for characterizing streamflow regimes, River Res. Appl., 19,
101–121, <a href="https://doi.org/10.1002/rra.700" target="_blank">https://doi.org/10.1002/rra.700</a>, 2003.
</mixed-citation></ref-html>
<ref-html id="bib1.bib40"><label>Parkes et al.(2019)Parkes, Higginbottom, Hufkens, Ceballos, Kramer,
and Foster</label><mixed-citation>
Parkes, B., Higginbottom, T. P., Hufkens, K., Ceballos, F., Kramer, B., and
Foster, T.: Weather dataset choice introduces uncertainty to estimates of
crop yield responses to climate variability and change, Environ.
Res. Lett., 14, 124089, <a href="https://doi.org/10.1088/1748-9326/ab5ebb" target="_blank">https://doi.org/10.1088/1748-9326/ab5ebb</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib41"><label>Paszke et al.(2017)Paszke, Gross, Chintala, Chanan, Yang, DeVito,
Lin, Desmaison, Antiga, and Lerer</label><mixed-citation>
Paszke, A., Gross, S., Chintala, S., Chanan, G., Yang, E., DeVito, Z., Lin, Z.,
Desmaison, A., Antiga, L., and Lerer, A.: Pytorch: an imperative style, high-performance deep learning library,in:
Advances in Neural Information Processing Systems, 32,
8024–8035, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib42"><label>Pearl(1988)</label><mixed-citation>
Pearl, J.: Embracing causality in default reasoning, Artificial Intelligence,
35, 259–271, 1988.
</mixed-citation></ref-html>
<ref-html id="bib1.bib43"><label>Sankarasubramanian et al.(2001)Sankarasubramanian, Vogel, and
Limbrunner</label><mixed-citation>
Sankarasubramanian, A., Vogel, R. M., and Limbrunner, J. F.: Climate elasticity
of streamflow in the United States, Water Resour. Res., 37,
1771–1781, <a href="https://doi.org/10.1029/2000WR900330" target="_blank">https://doi.org/10.1029/2000WR900330</a>, 2001.
</mixed-citation></ref-html>
<ref-html id="bib1.bib44"><label>Sawicz et al.(2011)Sawicz, Wagener, Sivapalan, Troch, and
Carrillo</label><mixed-citation>
Sawicz, K., Wagener, T., Sivapalan, M., Troch, P. A., and Carrillo, G.: Catchment classification: empirical analysis of hydrologic similarity based on catchment function in the eastern USA, Hydrol. Earth Syst. Sci., 15, 2895–2911, <a href="https://doi.org/10.5194/hess-15-2895-2011" target="_blank">https://doi.org/10.5194/hess-15-2895-2011</a>, 2011.
</mixed-citation></ref-html>
<ref-html id="bib1.bib45"><label>Scipal et al.(2010)Scipal, Dorigo, and deJeu</label><mixed-citation>
Scipal, K., Dorigo, W., and deJeu, R.: Triple collocation—A new tool to
determine the error structure of global soil moisture products, in: 2010 IEEE
International Geoscience and Remote Sensing Symposium, Honolulu, HI, USA, 25—30 July 2010, 4426–4429, IEEE,
2010.
</mixed-citation></ref-html>
<ref-html id="bib1.bib46"><label>Shrikumar et al.(2016)Shrikumar, Greenside, Shcherbina, and
Kundaje</label><mixed-citation>
Shrikumar, A., Greenside, P., Shcherbina, A., and Kundaje, A.: Not just a black
box: Learning important features through propagating activation differences,
arXiv [preprint],    <a href="https://arxiv.org/abs/1605.01713" target="_blank">arXiv:1605.01713</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib47"><label>Stoffelen(1998)</label><mixed-citation>
Stoffelen, A.: Toward the true near-surface wind speed: Error modeling and
calibration using triple collocation, J. Geophys. Res.-Oceans, 103, 7755–7766, 1998.
</mixed-citation></ref-html>
<ref-html id="bib1.bib48"><label>Sundararajan et al.(2017)Sundararajan, Taly, and
Yan</label><mixed-citation>
Sundararajan, M., Taly, A., and Yan, Q.: Axiomatic attribution for deep
networks, in: Proceedings of the 34th International Conference on Machine
Learning-Volume 70,  3319–3328, available at: <a href="http://proceedings.mlr.press/v70/sundararajan17a.html" target="_blank"/> (last access: 13 May 2020), 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib49"><label>Sutton(2019)</label><mixed-citation>
Sutton, R.: The bitter lesson, Incomplete Ideas (blog), available at: <a href="http://www.incompleteideas.net/IncIdeas/BitterLesson.html" target="_blank"/> (last access: 13 May 2020),  2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib50"><label>Thornton et al.(1997)Thornton, Running, White
et al.</label><mixed-citation>
Thornton, P. E., Running, S. W., White, M. A.: Generating surfaces of
daily meteorological variables over large regions of complex terrain, J.
Hydrol., 190, 214–251, 1997.
</mixed-citation></ref-html>
<ref-html id="bib1.bib51"><label>Timmermans et al.(2019)Timmermans, Wehner, Cooley, O'Brien, and
Krishnan</label><mixed-citation>
Timmermans, B., Wehner, M., Cooley, D., O'Brien, T., and Krishnan, H.: An
evaluation of the consistency of extremes in gridded precipitation data sets,
Clim. Dynam., 52, 6651–6670, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib52"><label> Upstream-Tech(2020)</label><mixed-citation>
Upstream-Tech:
SACSMA-SNOW17, available at: <a href="https://github.com/Upstream-Tech/SACSMA-SNOW17.git" target="_blank"/>, last access: 11 July 2020.
</mixed-citation></ref-html>
<ref-html id="bib1.bib53"><label>Tolson and Shoemaker(2007)</label><mixed-citation>
Tolson, B. A. and Shoemaker, C. A.: Dynamically dimensioned search algorithm
for computationally efficient watershed model calibration, Water Resour. Res., 43, W01413, <a href="https://doi.org/10.1029/2005WR004723" target="_blank">https://doi.org/10.1029/2005WR004723</a>, 2007.
</mixed-citation></ref-html>
<ref-html id="bib1.bib54"><label>Van Der Walt et al.(2011)Van Der Walt, Colbert, and
Varoquaux</label><mixed-citation>
Van Der Walt, S., Colbert, S. C., and Varoquaux, G.: The NumPy array: A
structure for efficient numerical computation, Comput. Sci.
Eng., 13, 22–30, 2011.
</mixed-citation></ref-html>
<ref-html id="bib1.bib55"><label>van Rossum(1995)</label><mixed-citation>
van Rossum, G.: Python tutorial, Technical Report CS-R9526, Tech. rep.,
Centrum voor Wiskunde en Informatica (CWI), Amsterdam, 1995.

</mixed-citation></ref-html>
<ref-html id="bib1.bib56"><label>Virtanen et al.(2020)Virtanen, Gommers, Oliphant,
Haberland, Reddy, Cournapeau, Burovski, Peterson, Weckesser,
Bright, van der Walt, Brett, Wilson, Jarrod Millman, Mayorov,
Nelson, Jones, Kern, Larson, Carey, Polat, Feng, Moore, Vand
erPlas, Laxalde, Perktold, Cimrman, Henriksen, Quintero, Harris,
Archibald, Ribeiro, Pedregosa, van Mulbregt, and
Contributors</label><mixed-citation>
Virtanen, P., Gommers, R., Oliphant, T. E., Haberland, M., Reddy, T.,
Cournapeau, D., Burovski, E., Peterson, P., Weckesser, W., Bright,
J., van der Walt, S. J., Brett, M., Wilson, J., Jarrod Millman, K.,
Mayorov, N., Nelson, A. R. J., Jones, E., Kern, R., Larson, E.,
Carey, C., Polat, İ., Feng, Y., Moore, E. W., Vand erPlas, J.,
Laxalde, D., Perktold, J., Cimrman, R., Henriksen, I., Quintero,
E. A., Harris, C. R., Archibald, A. M., Ribeiro, A. H., Pedregosa,
F., van Mulbregt, P., and Contributors, S. . .: SciPy 1.0: Fundamental
Algorithms for Scientific Computing in Python, Nat. Methods, 17, 261–272,
<a href="https://doi.org/10.1038/s41592-019-0686-2" target="_blank">https://doi.org/10.1038/s41592-019-0686-2</a>, 2020.
</mixed-citation></ref-html>
<ref-html id="bib1.bib57"><label>Wellman and Henrion(1993)</label><mixed-citation>
Wellman, M. P. and Henrion, M.: Explaining'explaining away', IEEE T.
Pattern Anal., 15, 287–292, 1993.
</mixed-citation></ref-html>
<ref-html id="bib1.bib58"><label>Westerberg and McMillan(2015)</label><mixed-citation>
Westerberg, I. K. and McMillan, H. K.: Uncertainty in hydrological signatures, Hydrol. Earth Syst. Sci., 19, 3951–3968, <a href="https://doi.org/10.5194/hess-19-3951-2015" target="_blank">https://doi.org/10.5194/hess-19-3951-2015</a>, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib59"><label>Xia et al.(2012)Xia, Mitchell, Ek, Sheffield, Cosgrove, Wood, Luo,
Alonge, Wei, Meng et al.</label><mixed-citation>
Xia, Y.,  Mitchell, K.,  Ek, M., Sheffield, J., Cosgrove, B.,  Wood, E.,  Luo, L.,  Alonge, C., Wei, H., Meng, J.,  Livneh, B.,  Lettenmaier, D.,  Koren, V.,  Duan,
Q.,  Mo, K.,  Fan, Y., and Mocko, D.: Continental-scale water and energy
flux analysis and validation for the North American Land Data Assimilation
System project phase 2 (NLDAS-2): 1. Intercomparison and application of model
products, J. Geophys. Res.-Atmos., 117,  D03109, <a href="https://doi.org/10.1029/2011JD016048" target="_blank">https://doi.org/10.1029/2011JD016048</a>, 2012.
</mixed-citation></ref-html>
<ref-html id="bib1.bib60"><label>Yilmaz et al.(2005)Yilmaz, Hogue, Hsu, Sorooshian, Gupta, and
Wagener</label><mixed-citation>
Yilmaz, K. K., Hogue, T. S., Hsu, K.-L., Sorooshian, S., Gupta, H. V., and
Wagener, T.: Intercomparison of rain gauge, radar, and satellite-based
precipitation estimates with emphasis on hydrologic forecasting, J.
Hydrometeorol., 6, 497–517, 2005.
</mixed-citation></ref-html>
<ref-html id="bib1.bib61"><label>Yilmaz et al.(2008)Yilmaz, Gupta, and Wagener</label><mixed-citation>
Yilmaz, K. K., Gupta, H. V., and Wagener, T.: A process-based diagnostic
approach to model evaluation: Application to the NWS distributed hydrologic
model, Water Resour. Res., 44, W09417, <a href="https://doi.org/10.1029/2007WR006716" target="_blank">https://doi.org/10.1029/2007WR006716</a>, 2008.
</mixed-citation></ref-html>--></article>
