<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" xml:lang="en" dtd-version="3.0">
  <front>
    <journal-meta><journal-id journal-id-type="publisher">HESS</journal-id><journal-title-group>
    <journal-title>Hydrology and Earth System Sciences</journal-title>
    <abbrev-journal-title abbrev-type="publisher">HESS</abbrev-journal-title><abbrev-journal-title abbrev-type="nlm-ta">Hydrol. Earth Syst. Sci.</abbrev-journal-title>
  </journal-title-group><issn pub-type="epub">1607-7938</issn><publisher>
    <publisher-name>Copernicus Publications</publisher-name>
    <publisher-loc>Göttingen, Germany</publisher-loc>
  </publisher></journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.5194/hess-22-5243-2018</article-id><title-group><article-title>Value of uncertain streamflow observations for<?xmltex \hack{\break}?> hydrological modelling</article-title><alt-title>Value of uncertain streamflow observations for hydrological modelling</alt-title>
      </title-group><?xmltex \runningtitle{Value of uncertain streamflow observations for hydrological modelling}?><?xmltex \runningauthor{S. Etter et al.}?>
      <contrib-group>
        <contrib contrib-type="author" corresp="yes" rid="aff1">
          <name><surname>Etter</surname><given-names>Simon</given-names></name>
          <email>simon.etter@geo.uzh.ch</email>
        <ext-link>https://orcid.org/0000-0002-7553-9102</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Strobl</surname><given-names>Barbara</given-names></name>
          
        <ext-link>https://orcid.org/0000-0001-5530-4632</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1 aff2">
          <name><surname>Seibert</surname><given-names>Jan</given-names></name>
          
        <ext-link>https://orcid.org/0000-0002-6314-2124</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>van Meerveld</surname><given-names>H. J. Ilja</given-names></name>
          
        <ext-link>https://orcid.org/0000-0002-7547-3270</ext-link></contrib>
        <aff id="aff1"><label>1</label><institution>Department of Geography, University of Zurich, Winterthurerstrasse
190, 8057 Zurich, Switzerland</institution>
        </aff>
        <aff id="aff2"><label>2</label><institution>Department of Aquatic Sciences and Assessment,
Swedish University of Agricultural Sciences,<?xmltex \hack{\break}?> P.O. Box 7050, 75007 Uppsala, Sweden.</institution>
        </aff>
      </contrib-group>
      <author-notes><corresp id="corr1">Simon Etter (simon.etter@geo.uzh.ch)</corresp></author-notes><pub-date><day>15</day><month>October</month><year>2018</year></pub-date>
      
      <volume>22</volume>
      <issue>10</issue>
      <fpage>5243</fpage><lpage>5257</lpage>
      <history>
        <date date-type="received"><day>28</day><month>June</month><year>2018</year></date>
           <date date-type="rev-request"><day>11</day><month>July</month><year>2018</year></date>
           <date date-type="rev-recd"><day>20</day><month>September</month><year>2018</year></date>
           <date date-type="accepted"><day>24</day><month>September</month><year>2018</year></date>
      </history>
      <permissions>
        
        
      <license license-type="open-access"><license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p></license></permissions><self-uri xlink:href="https://hess.copernicus.org/articles/22/5243/2018/hess-22-5243-2018.html">This article is available from https://hess.copernicus.org/articles/22/5243/2018/hess-22-5243-2018.html</self-uri><self-uri xlink:href="https://hess.copernicus.org/articles/22/5243/2018/hess-22-5243-2018.pdf">The full text article is available as a PDF file from https://hess.copernicus.org/articles/22/5243/2018/hess-22-5243-2018.pdf</self-uri>
      <abstract>
    <p id="d1e117">Previous studies have shown that hydrological models can be parameterised using a
limited number of streamflow measurements. Citizen science projects can
collect such data for otherwise ungauged catchments but an important question
is whether these observations are informative given that these streamflow
estimates will be uncertain. We assess the value of inaccurate streamflow estimates for calibration of a
simple bucket-type runoff model for six Swiss catchments. We pretended that
only a few observations were available and that these were affected by
different levels of inaccuracy. The level of inaccuracy was based on a
log-normal error distribution that was fitted to streamflow estimates of 136
citizens for medium-sized streams. Two additional levels of inaccuracy, for
which the standard deviation of the error distribution was divided by 2 and
4, were used as well. Based on these error distributions,
random errors were added to the measured hourly streamflow data. New time
series with different temporal resolutions were created from these synthetic
streamflow time series. These included scenarios with one observation each
week or month, as well as scenarios that are more realistic for crowdsourced
data that generally have an irregular distribution of data points throughout
the year, or focus on a particular season. The model was then calibrated for
the six catchments using the synthetic time series for a dry, an average and
a wet year. The performance of the calibrated models was evaluated based on
the measured hourly streamflow time series. The results indicate that
streamflow estimates from untrained citizens are not informative for model
calibration. However, if the errors can be reduced, the estimates are
informative and useful for model calibration. As expected, the model
performance increased when the number of observations used for calibration
increased. The model performance was also better when the observations were
more evenly distributed throughout the year. This study indicates that
uncertain streamflow estimates can be useful for model calibration but that
the estimates by citizen scientists need to be improved by training or more
advanced data filtering before they are useful for model calibration.</p>
  </abstract>
    </article-meta>
  </front>
<body>
      

<sec id="Ch1.S1" sec-type="intro">
  <title>Introduction</title>
      <p id="d1e127">The application of hydrological models usually requires several years of
precipitation, temperature and streamflow data for calibration, but these
data are only available for a limited number of catchments. Therefore,
several studies have addressed the question: how many data points are needed to
calibrate a model for a catchment? Yapo et al. (1996) and Vrugt et
al. (2006), using stable parameters as a criterion for satisfying model
performance, concluded that most of the information to calibrate a model is
contained in 2–3 years of continuous streamflow data and that no more value
is added when using more than 8 years of data. Perrin et al. (2007), using
the Nash–Sutcliffe efficiency criterion (NSE), showed that streamflow data
for 350 randomly sampled days out of a 39-year period were sufficient to
obtain robust model parameter values for two bucket-type models, TOPMO, which
is derived from TOPMODEL concepts (Michel et al., 2003), and GR4J (Perrin et
al., 2003). Brath et al. (2004), using the volume error, relative peak error and
time-to-peak error, concluded that at
least 3 months of continuous data were<?pagebreak page5244?> required to obtain a reliable
calibration. Other studies have shown that discontinuous streamflow data can
be informative for constraining model parameters (Juston et al., 2009; Pool
et al., 2017; Seibert and Beven, 2009; Seibert and McDonnell, 2015). Juston
et al. (2009) used a multi-objective calibration that included groundwater
data and concluded that the information content of a subset of 53 days of
streamflow data was the same as for the 1065 days of data from which the subset was
drawn. Seibert and Beven (2009), using the NSE criterion, found that model
performance reached a plateau for 8–16 streamflow measurements collected
throughout a 1-year period. They furthermore showed that the use of
streamflow data for one event and the corresponding recession resulted in a
similar calibration performance as the six highest measured
streamflow values during a 2-month period.</p>
      <p id="d1e130">These studies had different foci and used different model performance
metrics, but nevertheless their results are encouraging for the calibration
of hydrological models for ungauged basins based on a limited number of
high-quality measurements. However, the question remains: how informative
are low(er)-quality data? An alternative approach to high-quality streamflow
measurements in ungauged catchments is to use citizen science. Citizen
science has been proven to be a valuable tool to collect (Dickinson et al., 2010) or analyse (Koch and Stisen, 2017) various kinds of
environmental data, including hydrological data (Buytaert et al., 2014). Citizen science
approaches use simple methods to enable a large number of citizens to
collect data and allow local communities to contribute data to support
science and environmental management. Citizen science approaches can be
particularly useful in light of the declining stream gauging networks (Ruhi et al., 2018; Shiklomanov et al.,
2002) and to complement the existing monitoring networks. However, citizen
science projects that collect streamflow or stream level data in flowing
water bodies are still rare. Examples are the CrowdHydrology project (Lowry and Fienen, 2013), SmartPhones4Water in Nepal (Davids et al., 2018)
and a project in Kenya (Weeser et al., 2018), which all ask citizens to read
stream levels at staff gauges and to send these via an app or as a text
message to a central database. Estimating streamflow is obviously more
challenging than reading levels from a staff gauge but citizens can apply
the stick or float method, where they measure the time it takes for a
floating object (e.g. a small stick) to travel a given distance to estimate
the flow velocity. Combined with estimates for the width and the average
depth of the stream, this allows them to obtain a rough estimate of the
streamflow. However, these streamflow estimates may be so inaccurate that
they are not useful for model calibration. It is therefore necessary to not
only evaluate the requirements of hydrological models in terms of the amount
and temporal resolution of data, but also in terms of the achievable quality
by the citizen scientists before starting a citizen science project.</p>
      <p id="d1e133">The effects of rating curve uncertainty on model calibration (e.g.
McMillan et al., 2010; Horner et al., 2018) and the value of sparse datasets (Davids et al., 2017) have been
quantified in recent studies. However, the potential value of sparse
datasets in combination with large uncertainties (such as those from
crowdsourced streamflow estimates) has not been evaluated so far. Therefore,
the aim of this study was to determine the effects of observation
inaccuracies on the calibration of bucket-type hydrological models when only
a limited number of observations are available. The specific objectives of
this paper are to determine (i) whether the streamflow estimates from citizen
scientists are informative for model calibration or if these errors need to
be reduced (e.g. through training) to become useful and (ii) how the timing of
the streamflow observations affects the calibration of a hydrological model.
The latter is important for citizen science projects, as it provides
guidance on whether it is useful to encourage citizens to contribute
streamflow observations during a specific time of the year.</p>
</sec>
<sec id="Ch1.S2">
  <title>Methods</title>
      <p id="d1e142">To assess the potential value of crowdsourced streamflow estimates for
hydrological model calibration, the HBV (Hydrologiska Byråns
Vattenbalansavdelning) model (Bergström, 1976) was calibrated
against streamflow time series for six Swiss catchments, as well as
for different subsets of the data that represent citizen science data in terms
of errors and temporal resolution. Similar to the approach used in several
recent studies (Ewen
et al., 2008; Finger et al., 2015; Fitzner et al., 2013; Haberlandt and
Sester, 2010; Seibert and Beven, 2009), we pretended that only a small
subset of the data were available for model calibration. In addition,
various degrees of inaccuracy were assumed. The value of these data for
model calibration was then evaluated by comparing the model performance for
these subsets of data to the performance of the model calibrated with the
complete measured streamflow time series.</p>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T1" specific-use="star"><caption><p id="d1e148">Characteristics of the six Swiss catchments used in this study. For
the location of the study catchments, see Fig. 1. Long-term averages are for
the period 1974–2014, except for Verzasca for which the long-term average is
for the 1990–2014 period. Regime types are classified according to
Aschwanden and Weingartner (1985).</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="8">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="left"/>
     <oasis:colspec colnum="4" colname="col4" align="left"/>
     <oasis:colspec colnum="5" colname="col5" align="left"/>
     <oasis:colspec colnum="6" colname="col6" align="left"/>
     <oasis:colspec colnum="7" colname="col7" align="left"/>
     <oasis:colspec colnum="8" colname="col8" align="left"/>
     <oasis:thead>
       <oasis:row>
         <oasis:entry namest="col1" nameend="col2">Catchment </oasis:entry>
         <oasis:entry colname="col3">Murg</oasis:entry>
         <oasis:entry colname="col4">Guerbe</oasis:entry>
         <oasis:entry colname="col5">Allenbach</oasis:entry>
         <oasis:entry colname="col6">Riale</oasis:entry>
         <oasis:entry colname="col7">Mentue</oasis:entry>
         <oasis:entry colname="col8">Verzasca</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6">di Calneggia</oasis:entry>
         <oasis:entry colname="col7"/>
         <oasis:entry colname="col8"/>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">Gauging station</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3">Waengi</oasis:entry>
         <oasis:entry colname="col4">Belp</oasis:entry>
         <oasis:entry colname="col5">Adelboden</oasis:entry>
         <oasis:entry colname="col6">Cavergno,</oasis:entry>
         <oasis:entry colname="col7">Yvonand La</oasis:entry>
         <oasis:entry colname="col8">Lavertezzo,</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">(FOEN station</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3">(2126)</oasis:entry>
         <oasis:entry colname="col4">Mülimatt</oasis:entry>
         <oasis:entry colname="col5">(2232)</oasis:entry>
         <oasis:entry colname="col6">Pontit</oasis:entry>
         <oasis:entry colname="col7">Mauguettaz</oasis:entry>
         <oasis:entry colname="col8">Campiòi</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">number)</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4">(2159)</oasis:entry>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6">(2356)</oasis:entry>
         <oasis:entry colname="col7">(2369)</oasis:entry>
         <oasis:entry colname="col8">(2605)</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Area (km<inline-formula><mml:math id="M3" display="inline"><mml:msup><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3">79</oasis:entry>
         <oasis:entry colname="col4">117</oasis:entry>
         <oasis:entry colname="col5">29</oasis:entry>
         <oasis:entry colname="col6">24</oasis:entry>
         <oasis:entry colname="col7">105</oasis:entry>
         <oasis:entry colname="col8">186</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Elevation</oasis:entry>
         <oasis:entry colname="col2">Min</oasis:entry>
         <oasis:entry colname="col3">465</oasis:entry>
         <oasis:entry colname="col4">522</oasis:entry>
         <oasis:entry colname="col5">1297</oasis:entry>
         <oasis:entry colname="col6">885</oasis:entry>
         <oasis:entry colname="col7">445</oasis:entry>
         <oasis:entry colname="col8">490</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">(m a.s.l.)</oasis:entry>
         <oasis:entry colname="col2">Max</oasis:entry>
         <oasis:entry colname="col3">1035</oasis:entry>
         <oasis:entry colname="col4">2176</oasis:entry>
         <oasis:entry colname="col5">2762</oasis:entry>
         <oasis:entry colname="col6">2921</oasis:entry>
         <oasis:entry colname="col7">927</oasis:entry>
         <oasis:entry colname="col8">2864</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Regime type</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3">Pluvial-</oasis:entry>
         <oasis:entry colname="col4">Pluvial-</oasis:entry>
         <oasis:entry colname="col5">Nival-alpin</oasis:entry>
         <oasis:entry colname="col6">Nival-</oasis:entry>
         <oasis:entry colname="col7">Pluvial-</oasis:entry>
         <oasis:entry colname="col8">Nivo-pluvial-</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3">inférieur</oasis:entry>
         <oasis:entry colname="col4">superieur</oasis:entry>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6">méridional</oasis:entry>
         <oasis:entry colname="col7">jurassien</oasis:entry>
         <oasis:entry colname="col8">méridional</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Min–max</oasis:entry>
         <oasis:entry colname="col2">Dry year</oasis:entry>
         <oasis:entry colname="col3">0.29–1.61</oasis:entry>
         <oasis:entry colname="col4">0.44–1.93</oasis:entry>
         <oasis:entry colname="col5">0.40–2.48</oasis:entry>
         <oasis:entry colname="col6">0.13–3.22</oasis:entry>
         <oasis:entry colname="col7">0.22–2.37</oasis:entry>
         <oasis:entry colname="col8">0.16–2.92</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Pardé</oasis:entry>
         <oasis:entry colname="col2">Average year</oasis:entry>
         <oasis:entry colname="col3">0.58–2.16</oasis:entry>
         <oasis:entry colname="col4">0.61–1.65</oasis:entry>
         <oasis:entry colname="col5">0.39–2.44</oasis:entry>
         <oasis:entry colname="col6">0.09–2.84</oasis:entry>
         <oasis:entry colname="col7">0.23–2.66</oasis:entry>
         <oasis:entry colname="col8">0.23–3.17</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">coefficients</oasis:entry>
         <oasis:entry colname="col2">Wet year</oasis:entry>
         <oasis:entry colname="col3">0.34–1.69</oasis:entry>
         <oasis:entry colname="col4">0.42–2.14</oasis:entry>
         <oasis:entry colname="col5">0.32–2.12</oasis:entry>
         <oasis:entry colname="col6">0.10–3.48</oasis:entry>
         <oasis:entry colname="col7">0.35–2.39</oasis:entry>
         <oasis:entry colname="col8">0.26–2.64</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">Long-term</oasis:entry>
         <oasis:entry colname="col3">0.68–1.34</oasis:entry>
         <oasis:entry colname="col4">0.77–1.39</oasis:entry>
         <oasis:entry colname="col5">0.35–2.70</oasis:entry>
         <oasis:entry colname="col6">0.14–2.70</oasis:entry>
         <oasis:entry colname="col7">0.46–1.57</oasis:entry>
         <oasis:entry colname="col8">0.23–2.22</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Annual</oasis:entry>
         <oasis:entry colname="col2">Dry year</oasis:entry>
         <oasis:entry colname="col3">0.72</oasis:entry>
         <oasis:entry colname="col4">0.37</oasis:entry>
         <oasis:entry colname="col5">0.86</oasis:entry>
         <oasis:entry colname="col6">1.30<inline-formula><mml:math id="M4" display="inline"><mml:msup><mml:mi/><mml:mn mathvariant="normal">1</mml:mn></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col7">0.41</oasis:entry>
         <oasis:entry colname="col8">0.98</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">runoff :</oasis:entry>
         <oasis:entry colname="col2">Average year</oasis:entry>
         <oasis:entry colname="col3">0.55</oasis:entry>
         <oasis:entry colname="col4">0.48</oasis:entry>
         <oasis:entry colname="col5">1.73<inline-formula><mml:math id="M5" display="inline"><mml:msup><mml:mi/><mml:mn mathvariant="normal">1</mml:mn></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col6">1.38<inline-formula><mml:math id="M6" display="inline"><mml:msup><mml:mi/><mml:mn mathvariant="normal">1</mml:mn></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col7">0.52</oasis:entry>
         <oasis:entry colname="col8">0.66</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">rainfall</oasis:entry>
         <oasis:entry colname="col2">Wet year</oasis:entry>
         <oasis:entry colname="col3">0.56</oasis:entry>
         <oasis:entry colname="col4">0.54</oasis:entry>
         <oasis:entry colname="col5">0.78</oasis:entry>
         <oasis:entry colname="col6">0.98</oasis:entry>
         <oasis:entry colname="col7">0.50</oasis:entry>
         <oasis:entry colname="col8">1.32<inline-formula><mml:math id="M7" display="inline"><mml:msup><mml:mi/><mml:mn mathvariant="normal">1</mml:mn></mml:msup></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">ratio</oasis:entry>
         <oasis:entry colname="col2">Long-term</oasis:entry>
         <oasis:entry colname="col3">0.56</oasis:entry>
         <oasis:entry colname="col4">0.57</oasis:entry>
         <oasis:entry colname="col5">0.94</oasis:entry>
         <oasis:entry colname="col6">1.06<inline-formula><mml:math id="M8" display="inline"><mml:msup><mml:mi/><mml:mn mathvariant="normal">1</mml:mn></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col7">0.38</oasis:entry>
         <oasis:entry colname="col8">0.9</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Long-term mean</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3">1.84</oasis:entry>
         <oasis:entry colname="col4">2.75</oasis:entry>
         <oasis:entry colname="col5">1.23</oasis:entry>
         <oasis:entry colname="col6">1.43</oasis:entry>
         <oasis:entry colname="col7">1.64</oasis:entry>
         <oasis:entry colname="col8">10.76</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">annual streamflow</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6"/>
         <oasis:entry colname="col7"/>
         <oasis:entry colname="col8"/>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">(m<inline-formula><mml:math id="M9" display="inline"><mml:msup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msup></mml:math></inline-formula> s<inline-formula><mml:math id="M10" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6"/>
         <oasis:entry colname="col7"/>
         <oasis:entry colname="col8"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Weather stations</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3">Aadorf-</oasis:entry>
         <oasis:entry colname="col4">Plaffeien,</oasis:entry>
         <oasis:entry colname="col5">Adelboden</oasis:entry>
         <oasis:entry colname="col6">Robiei</oasis:entry>
         <oasis:entry colname="col7">Mathod,</oasis:entry>
         <oasis:entry colname="col8">Acquarossa,</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3">Taenikon,</oasis:entry>
         <oasis:entry colname="col4">Bern-</oasis:entry>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6"/>
         <oasis:entry colname="col7">Pully</oasis:entry>
         <oasis:entry colname="col8">Cimetta,</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3">Hörnli</oasis:entry>
         <oasis:entry colname="col4">Zollikofen</oasis:entry>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6"/>
         <oasis:entry colname="col7"/>
         <oasis:entry colname="col8">Magadino,</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6"/>
         <oasis:entry colname="col7"/>
         <oasis:entry colname="col8">Piotta</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table><table-wrap-foot><p id="d1e151"><inline-formula><mml:math id="M1" display="inline"><mml:msup><mml:mi/><mml:mn mathvariant="normal">1</mml:mn></mml:msup></mml:math></inline-formula>In Verzasca, Allenbach and Riale die Calneggia there are some
streamflow : rainfall ratios <inline-formula><mml:math id="M2" display="inline"><mml:mi mathvariant="italic">&gt;</mml:mi></mml:math></inline-formula> 1 because the weather
stations are located outside the catchment and precipitation is highly
variable in alpine terrain.</p></table-wrap-foot></table-wrap>

<sec id="Ch1.S2.SS1">
  <title>HBV model</title>
      <p id="d1e922">The HBV model was originally developed at the Hydrologiska Byråns
Vattenbalansavdelning unit at the Swedish Meteorological and Hydrological
Institute (SMHI) by Bergström (1976). The HBV model is a
bucket-type model that represents snow, soil, groundwater and stream routing
processes in separate routines. In this study, we used the version HBV-light (Seibert and Vis, 2012).</p>
</sec>
<sec id="Ch1.S2.SS2">
  <title>Catchments</title>
      <p id="d1e931">The HBV-light model was set up for six 24–186 km<inline-formula><mml:math id="M11" display="inline"><mml:msup><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:math></inline-formula> catchments in
Switzerland (Table 1 and
Fig. 1). The catchments were selected based on
the following criteria: (i) there is little anthropogenic influence, (ii) they
are gauged at a single location, (iii) they have reliable streamflow data during
high flow and low flow conditions (i.e. no complete freezing<?pagebreak page5245?> during winter
and a cross section that allows accurate streamflow measurement at low
flows) and (iv) there are no glaciers. The six selected catchments
(Table 1) represent different streamflow regime
types (Aschwanden and Weingartner, 1985). The snow-dominated
highest elevation catchments (Allenbach and Riale di Calneggia) have the
largest seasonality in streamflow, i.e. the biggest differences between the
long-term maximum and minimum Pardé coefficients, followed by the rain- and snow-dominated Verzasca catchment. The rain-dominated catchments (Murg,
Guerbe and Mentue) have the lowest seasonal variability in streamflow
(Table 1). The mean elevation of the catchments
varies from 652 to 2003 m a.s.l. (Table 1). The
elevation range of each individual catchment was divided into 100 m elevation
bands for the simulations.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F1"><caption><p id="d1e945">Location of the six study catchments in Switzerland. Shading
indicates whether the catchment is located on the north or south side of the
Alps. See Table 1 for the characteristics of the
study catchments.</p></caption>
          <?xmltex \igopts{width=241.848425pt}?><graphic xlink:href="https://hess.copernicus.org/articles/22/5243/2018/hess-22-5243-2018-f01.png"/>

        </fig>

</sec>
<sec id="Ch1.S2.SS3">
  <title>Measured data</title>
      <p id="d1e961">Hourly runoff time series (based on 10 min measurements) for the six
study catchments were obtained from the Federal Office for the Environment
(FOEN; see Table 1 for the gauging station
numbers). The average hourly areal precipitation amounts were extracted for
each study catchment from the gridded CombiPrecip dataset from MeteoSwiss (Sideris et al., 2014). This dataset
combines gauge and radar precipitation measurements at an hourly timescale
and 1 km<inline-formula><mml:math id="M12" display="inline"><mml:msup><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:math></inline-formula> spatial resolution and is available for the time period since 2005.</p>
      <p id="d1e973">We used hourly temperature data from the automatic monitoring network of
MeteoSwiss (see Table 1 for the stations) and
applied a gradient of <inline-formula><mml:math id="M13" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">6</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M14" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula>C per 1000 m to adjust the temperature
of each weather station to the mean elevation of the catchment. Within the
HBV model, the temperature was then adjusted for the different elevation
bands using a calibrated lapse rate.</p>
      <p id="d1e995">As recommended by Oudin et al. (2005), potential
evapotranspiration was calculated using the temperature-based potential
evapotranspiration model of McGuinness and Bordne (1972) using
the day of the year, the latitude and the temperature. This rather
simplistic approach was considered<?pagebreak page5246?> sufficient because this study focused on
differences in model performance relative to a benchmark calibration.</p>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T2" specific-use="star"><caption><p id="d1e1001">The calibration years (second most extreme and second closest to
average years) and validation years (most extreme and closest to average
years) for each catchment. The numbers in parentheses are the ranks over the
period 1974–2014 (or 1990–2014 for Verzasca).</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="7">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:colspec colnum="6" colname="col6" align="right"/>
     <oasis:colspec colnum="7" colname="col7" align="right"/>
     <oasis:thead>
       <oasis:row>
         <oasis:entry colname="col1">Year</oasis:entry>
         <oasis:entry colname="col2">Murg</oasis:entry>
         <oasis:entry colname="col3">Guerbe</oasis:entry>
         <oasis:entry colname="col4">Allenbach</oasis:entry>
         <oasis:entry colname="col5">Riale di</oasis:entry>
         <oasis:entry colname="col6">Mentue</oasis:entry>
         <oasis:entry colname="col7">Verzasca</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">character</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5">Calneggia</oasis:entry>
         <oasis:entry colname="col6"/>
         <oasis:entry colname="col7"/>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row rowsep="1">
         <oasis:entry namest="col1" nameend="col7">Calibration </oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Wet</oasis:entry>
         <oasis:entry colname="col2">2007 (3)</oasis:entry>
         <oasis:entry colname="col3">2007 (2)</oasis:entry>
         <oasis:entry colname="col4">2007 (4)</oasis:entry>
         <oasis:entry colname="col5">2009 (11)</oasis:entry>
         <oasis:entry colname="col6">2014 (7)</oasis:entry>
         <oasis:entry colname="col7">2011 (4)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Dry</oasis:entry>
         <oasis:entry colname="col2">2013 (8)</oasis:entry>
         <oasis:entry colname="col3">2011 (8)</oasis:entry>
         <oasis:entry colname="col4">2009 (11)</oasis:entry>
         <oasis:entry colname="col5">2012 (8)</oasis:entry>
         <oasis:entry colname="col6">2010 (4)</oasis:entry>
         <oasis:entry colname="col7">2013 (5)</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Average</oasis:entry>
         <oasis:entry colname="col2">2008 (6)</oasis:entry>
         <oasis:entry colname="col3">2008 (17)</oasis:entry>
         <oasis:entry colname="col4">2013 (7)</oasis:entry>
         <oasis:entry colname="col5">2013 (2)</oasis:entry>
         <oasis:entry colname="col6">2006 (6)</oasis:entry>
         <oasis:entry colname="col7">2007 (7)</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry namest="col1" nameend="col7">Validation </oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Wet</oasis:entry>
         <oasis:entry colname="col2">2014 (1)</oasis:entry>
         <oasis:entry colname="col3">2014 (1)</oasis:entry>
         <oasis:entry colname="col4">2014 (1)</oasis:entry>
         <oasis:entry colname="col5">2008 (9)</oasis:entry>
         <oasis:entry colname="col6">2007 (1)</oasis:entry>
         <oasis:entry colname="col7">2008 (1)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Dry</oasis:entry>
         <oasis:entry colname="col2">2009 (7)</oasis:entry>
         <oasis:entry colname="col3">2013 (5)</oasis:entry>
         <oasis:entry colname="col4">2012 (9)</oasis:entry>
         <oasis:entry colname="col5">2006 (5)</oasis:entry>
         <oasis:entry colname="col6">2009 (3)</oasis:entry>
         <oasis:entry colname="col7">2010 (4)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Average</oasis:entry>
         <oasis:entry colname="col2">2011 (4)</oasis:entry>
         <oasis:entry colname="col3">2006 (13)</oasis:entry>
         <oasis:entry colname="col4">2011 (6)</oasis:entry>
         <oasis:entry colname="col5">2011 (1)</oasis:entry>
         <oasis:entry colname="col6">2013 (2)</oasis:entry>
         <oasis:entry colname="col7">2006 (4)</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

</sec>
<sec id="Ch1.S2.SS4">
  <title>Selection of years for model calibration and validation</title>
      <p id="d1e1244">The model was calibrated for an average, a dry and a wet year to investigate
the influence of wetness conditions and the amount of streamflow on the
calibration results. The years were selected based on the total streamflow
during summer (July–September). The driest and the wettest years of the
period 2006–2014 were selected based on the smallest and largest sum of
streamflow during the summer. The average streamflow years were selected
based on the proximity to the mean summer streamflow for all the years
1974–2014 (1990–2014 for Verzasca). For each catchment the years that were
the 2nd-closest to the mean summer streamflow for all years, as well as
the years with the second lowest and second highest streamflow sum were
chosen for model calibration (see Table 2). We did this separately for each
catchment because for each catchment a different year was dry, average or
wet. For the validation, we chose the year closest to the mean summer
streamflow and the years with the lowest and the highest total summer
streamflow (see Table 2). We used each of the parameter sets obtained from
calibration for the dry, average or wet years to validate the model for each
of the three validation years, resulting in nine validation combinations for each
catchment (and each dataset, as described below).</p><?xmltex \hack{\newpage}?>
</sec>
<sec id="Ch1.S2.SS5">
  <title>Transformation of datasets to resemble citizen science data
quality</title>
<sec id="Ch1.S2.SS5.SSS1">
  <title>Errors in crowdsourced streamflow observations</title>
      <p id="d1e1259">Strobl et al. (2018) asked 517 participants to estimate streamflow based on
the stick method at 10 streams in Switzerland. Here we use the estimates for
the medium-sized streams Töss, Sihl and Schanzengraben in the Canton of
Zurich and the Magliasina in Ticino (<inline-formula><mml:math id="M15" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">136</mml:mn></mml:mrow></mml:math></inline-formula>), which had a similar streamflow
range at the time of the estimations (2.6–28 m<inline-formula><mml:math id="M16" display="inline"><mml:msup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msup></mml:math></inline-formula> s<inline-formula><mml:math id="M17" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>) as the mean
annual streamflow of the six streams used for this study
(1.2–10.8 m<inline-formula><mml:math id="M18" display="inline"><mml:msup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msup></mml:math></inline-formula> s<inline-formula><mml:math id="M19" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>). We calculated the streamflow from the
estimated width, depth and flow velocities using a factor of 0.8 to adjust
the surface flow velocity to the average velocity (Harrelson et al., 1994).
The resulting streamflow estimates were normalised by dividing them by the
measured streamflow. We then combined the normalised estimates of all four
rivers and log-transformed the relative estimates. A normal distribution with a mean of 0.12 and a standard
deviation of 1.30 fits the distribution of the log-transformed relative
estimates well (standard error of the mean: 0.11, standard
error of the standard deviation: 0.08; Fig. 2).</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F2"><caption><p id="d1e1318">Fit of the normal distribution to the frequency distribution of the
log-transformed relative streamflow estimates (ratio of the estimated
streamflow and the measured streamflow).</p></caption>
            <?xmltex \igopts{width=241.848425pt}?><graphic xlink:href="https://hess.copernicus.org/articles/22/5243/2018/hess-22-5243-2018-f02.png"/>

          </fig>

      <p id="d1e1327">To create synthetic datasets with data quality characteristics that
represent the observed crowdsourced streamflow estimates, we assumed that
the errors in the streamflow estimates are uncorrelated (as they are likely
provided by different people). For each time step, we randomly selected a
relative error value from the log-normal distribution of the relative
estimates (Fig. 2) and multiplied the measured
streamflow with this relative error. To simulate the effect of training and
to obtain time series with different data quality, two additional streamflow
time series were created using a standard deviation divided by 2 (standard
deviation of 0.65) and by 4 (standard deviation of 0.33). This reduces
the spread in the data (but does not change the small systematic
overestimation of the streamflow), so large outliers are still
possible, but are less likely. To summarise, we tested the following four
cases.</p>
      <p id="d1e1330"><list list-type="bullet">
              <list-item>

      <?pagebreak page5247?><p id="d1e1335"><italic>No error</italic>: the data measured by the FOEN, assumed to be (almost) error-free, the
benchmark in terms of quality.</p>
              </list-item>
              <list-item>

      <p id="d1e1343"><italic>Small error</italic>: random errors according to the log-normal distribution of the snapshot
campaigns with the standard deviation divided by 4.</p>
              </list-item>
              <list-item>

      <p id="d1e1351"><italic>Medium error</italic>: random errors according to the log-normal distribution of the surveys with the standard
deviation divided by 2.</p>
              </list-item>
              <list-item>

      <p id="d1e1359"><italic>Large error</italic>: typical errors of citizen scientists, i.e. random errors according to the
log-normal distribution of errors from the surveys.</p>
              </list-item>
            </list></p>
</sec>
<sec id="Ch1.S2.SS5.SSS2">
  <title>Filtering of extreme outliers</title>
      <p id="d1e1372">Usually some form of quality control is used before citizen science data are
analysed. Here, we used a very simple check to remove unrealistic outliers
from the synthetic datasets. This check was based on the likely minimum and
maximum streamflow for a given catchment area. We defined an upper limit of
possible streamflow values as a function of the catchment area using the dataset
of maximum streamflow from 1500 Swiss catchments provided by Scherrer AG, Hydrologie und Hochwasserschutz (2017). To
account for the different precipitation intensities north and south of the
Alps, different curves were created for the catchments on each side of the
Alps. All streamflow observations, i.e. modified streamflow measurements,
above the maximum observed streamflow for a particular catchment size
including a 20 % buffer (Fig. S1), were replaced by the value of the maximum streamflow for a
catchment of that size. This affected less than 0.5 % of all data points.
A similar procedure was used for low flows based on a dataset of the FOEN
with the lowest recorded mean streamflows over 7 days but this resulted
in no replacements.</p>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T3"><caption><p id="d1e1378">Weights assigned to specific seasons, days and times of the day for
the random selection of data points for Crowd52 and Crowd12. The weights for
each hour were multiplied and normalised. We then used them as probabilities
for the individual hours. For times without daylight the probability was set
to zero.</p></caption><oasis:table frame="topbot"><?xmltex \begin{scaleboxenv}{0.85}[0.85]?><oasis:tgroup cols="3">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry namest="col1" nameend="col2" align="center">Variable  </oasis:entry>
         <oasis:entry colname="col3">Weight</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row rowsep="1">
         <oasis:entry namest="col1" nameend="col2">Season </oasis:entry>
         <oasis:entry colname="col3"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">December–February</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3">2</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">March–May/September–November</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3">6</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">June–August</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3">10</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry namest="col1" nameend="col2">Day </oasis:entry>
         <oasis:entry colname="col3"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Saturdays–Sundays</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3">3</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Monday–Friday</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3">1</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry namest="col1" nameend="col2">Time </oasis:entry>
         <oasis:entry colname="col3"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Times when people have breaks</oasis:entry>
         <oasis:entry colname="col2">06:00–08:00,</oasis:entry>
         <oasis:entry colname="col3">3</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">12:00–13:00,</oasis:entry>
         <oasis:entry colname="col3"/>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">17:00–21:00</oasis:entry>
         <oasis:entry colname="col3"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Times with daylight in winter</oasis:entry>
         <oasis:entry colname="col2">08:00–16:00</oasis:entry>
         <oasis:entry colname="col3">1</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">(December–February)</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Times with daylight in spring/fall</oasis:entry>
         <oasis:entry colname="col2">07:00–19:00</oasis:entry>
         <oasis:entry colname="col3">1</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">(March–May/September–November):</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Times with daylight in summer</oasis:entry>
         <oasis:entry colname="col2">06:00–21:00</oasis:entry>
         <oasis:entry colname="col3">1</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">(June–August)</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Other times (depending on season)</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3">0</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup><?xmltex \end{scaleboxenv}?></oasis:table></table-wrap>

</sec>
<sec id="Ch1.S2.SS5.SSS3">
  <title>Temporal resolution of the observations</title>
      <p id="d1e1604">Data entries from citizen scientists are not as regular as data from sensors
with a fixed temporal resolution. Therefore, we decided to test eight
scenarios with a different temporal resolution and distribution
of the data throughout the year to simulate different patterns in citizen
contributions.</p>
      <p id="d1e1607"><list list-type="bullet">
              <list-item>

      <p id="d1e1612"><italic>Hourly</italic>: one data point per hour (<inline-formula><mml:math id="M20" display="inline"><mml:mrow><mml:mn mathvariant="normal">8760</mml:mn><mml:mo>≤</mml:mo><mml:mi>n</mml:mi><mml:mo>≤</mml:mo><mml:mn mathvariant="normal">8784</mml:mn></mml:mrow></mml:math></inline-formula>, depending on the
year).</p>
              </list-item>
              <list-item>

      <?pagebreak page5248?><p id="d1e1636"><italic>Weekly</italic>: one data point per week, every Saturday, randomly between 06:00 and 20:00
(<inline-formula><mml:math id="M21" display="inline"><mml:mrow><mml:mn mathvariant="normal">52</mml:mn><mml:mo>≤</mml:mo><mml:mi>n</mml:mi><mml:mo>≤</mml:mo><mml:mn mathvariant="normal">53</mml:mn></mml:mrow></mml:math></inline-formula>).</p>
              </list-item>
              <list-item>

      <p id="d1e1660"><italic>Monthly</italic>: one data point per month on the 15th of the month, randomly between
06:00 and 20:00 (<inline-formula><mml:math id="M22" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">12</mml:mn></mml:mrow></mml:math></inline-formula>).</p>
              </list-item>
              <list-item>

      <p id="d1e1680"><italic>IntenseSummer</italic>: one data point every other day from July until September, randomly between
06:00 and 20:00 (<inline-formula><mml:math id="M23" display="inline"><mml:mrow><mml:mo>∼</mml:mo><mml:mn mathvariant="normal">15</mml:mn></mml:mrow></mml:math></inline-formula> observations per month, <inline-formula><mml:math id="M24" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">46</mml:mn></mml:mrow></mml:math></inline-formula>).</p>
              </list-item>
              <list-item>

      <p id="d1e1710"><italic>WeekendSummer</italic>: one data point each Saturday and each Sunday between May and October,
randomly between 06:00 and 20:00 (<inline-formula><mml:math id="M25" display="inline"><mml:mrow><mml:mn mathvariant="normal">52</mml:mn><mml:mo>≤</mml:mo><mml:mi>n</mml:mi><mml:mo>≤</mml:mo><mml:mn mathvariant="normal">54</mml:mn></mml:mrow></mml:math></inline-formula>).</p>
              </list-item>
              <list-item>

      <p id="d1e1735"><italic>WeekendSpring</italic>: one data point on each Saturday and each Sunday between March and August,
randomly between 06:00 and 20:00 (<inline-formula><mml:math id="M26" display="inline"><mml:mrow><mml:mn mathvariant="normal">52</mml:mn><mml:mo>≤</mml:mo><mml:mi>n</mml:mi><mml:mo>≤</mml:mo><mml:mn mathvariant="normal">54</mml:mn></mml:mrow></mml:math></inline-formula>).</p>
              </list-item>
              <list-item>

      <p id="d1e1759"><italic>Crowd52</italic>: 52 random data points during daylight (in order to be comparable to the Weekly,
IntenseSummer, WeekendSummer and WeekendSpring time
series).</p>
              </list-item>
              <list-item>

      <p id="d1e1767"><italic>Crowd12</italic>: 12 random data points during daylight (comparable to the Monthly data).</p>
              </list-item>
            </list></p>
      <p id="d1e1774">Except for the hourly data, these scenarios were based on our own experiences
within the CrowdWater project (<uri>https://www.crowdwater.ch</uri>, last access:
3 October 2018) and information from the CrowdHydrology project (Lowry and
Fienen, 2013). The hourly dataset was included to test the effect of errors
when the temporal resolution of the data is optimal (i.e. by comparing
simulations for the models calibrated with the hourly FOEN data and those
calibrated with hourly data with errors). In the two scenarios Crowd52 and
Crowd12, with random intervals between data points, we assigned higher
probabilities for periods when people are more likely to be outdoors (i.e.
higher probabilities for summer than winter, higher probabilities for
weekends than weekdays, higher probabilities outside office hours; Table 3).
Times without daylight (dependent on the season) were always excluded. We
used the same selection of days, including the same times of the day for each
of the four different error groups, years and catchments to allow comparison
of the different model results.</p>
</sec>
</sec>
<sec id="Ch1.S2.SS6">
  <title>Model calibration</title>
      <p id="d1e1787">For each of the 1728 cases (6 catchments, 3 calibration years, 4 error
groups, 8 temporal resolutions), the HBV model was calibrated by optimising
the overall consistency performance P<inline-formula><mml:math id="M27" display="inline"><mml:msub><mml:mi/><mml:mi mathvariant="normal">OA</mml:mi></mml:msub></mml:math></inline-formula> (Finger et al., 2011)
using a genetic optimisation algorithm (Seibert, 2000). The overall
consistency performance P<inline-formula><mml:math id="M28" display="inline"><mml:msub><mml:mi/><mml:mi mathvariant="normal">OA</mml:mi></mml:msub></mml:math></inline-formula> is the mean of four objective
functions with an optimum value of 1: (i) NSE, (ii) the NSE for the logarithm
of streamflow, (iii) the volume error and (iv) the mean absolute relative
error (MARE). The parameters were
calibrated within their typical ranges (see Table S1 in the Supplement.). To
consider parameter uncertainty, the calibration was performed 100 times,
which resulted in 100 parameter sets for each case. For each case, the
preceding year was used for the warm-up period. For the Crowd52 and Crowd12
time series, we used 100 different random selections of times, whereas for the regularly spaced time series the same times
were used for each case.</p>
</sec>
<sec id="Ch1.S2.SS7">
  <title>Model validation and analysis of the model results</title>
      <p id="d1e1814">The 100 parameters from the calibration for each case were used to run the
model for the validation years (Table 2). For each case (i.e. each catchment,
year, error magnitude and temporal resolution), we determined the median
validation P<inline-formula><mml:math id="M29" display="inline"><mml:msub><mml:mi/><mml:mi mathvariant="normal">OA</mml:mi></mml:msub></mml:math></inline-formula> for the 100 parameter sets for each validation
year. We analysed the validation results of all years combined and for all
nine combinations of dry, mean and wet years separately.</p>
      <p id="d1e1826">Because the focus of this study was on the value of limited inaccurate
streamflow observations for model calibration, i.e. the difference in the
performance of the models calibrated with the synthetic data series compared
to the performance of the models calibrated with hourly FOEN data, all model
validation performances are expressed relative to the average P<inline-formula><mml:math id="M30" display="inline"><mml:msub><mml:mi/><mml:mi mathvariant="normal">OA</mml:mi></mml:msub></mml:math></inline-formula> of
the model calibrated with the hourly FOEN data (our upper benchmark,
representing the fully informed case when continuous high quality streamflow
data are available). A relative P<inline-formula><mml:math id="M31" display="inline"><mml:msub><mml:mi/><mml:mi mathvariant="normal">OA</mml:mi></mml:msub></mml:math></inline-formula> of 1 indicates that the model
performance is as good as the performance of the model calibrated with the
hourly FOEN data, whereas lower P<inline-formula><mml:math id="M32" display="inline"><mml:msub><mml:mi/><mml:mi mathvariant="normal">OA</mml:mi></mml:msub></mml:math></inline-formula> values indicate a poorer
performance.</p>
      <p id="d1e1856">In humid climates, the input data (precipitation and temperature) often
dictate that model simulations can not be too far off as long as the water
balance is respected (Seibert et al., 2018). To assess
the value of limited inaccurate streamflow data for model calibration
compared to a situation without any streamflow data, a lower benchmark (Seibert et al., 2018) was used. Here, the lower
benchmark was defined as the median performance of the model ran with 1000
random parameters sets. By running the model with 1000 randomly chosen
parameter sets, we represent a situation where no streamflow data for
calibration are available and the model is driven only by the temperature
and precipitation data. We used 1000 different parameter sets to cover most
of the model variability due to the different parameter combinations. The
Mann–Whitney U test was used to evaluate whether the median P<inline-formula><mml:math id="M33" display="inline"><mml:msub><mml:mi/><mml:mi mathvariant="normal">OA</mml:mi></mml:msub></mml:math></inline-formula> for a
specific error group and temporal resolution of the data was significantly
different from the median P<inline-formula><mml:math id="M34" display="inline"><mml:msub><mml:mi/><mml:mi mathvariant="normal">OA</mml:mi></mml:msub></mml:math></inline-formula> for the lower benchmark (i.e. the model
runs with random parameters). We furthermore checked for differences in
model performance for models calibrated with the same data errors but
different temporal resolutions using a Kruskal–Wallis test. By applying a
Dunn–Bonferroni post hoc test (Bonferroni, 1936;
Dunn, 1959, 1961), we analysed which of the validation results were
significantly different from each other.</p>
      <?pagebreak page5249?><p id="d1e1877">The random generation of the 100 crowdsourced-like datasets (i.e. the
Crowd52 and Crowd12 scenario) for each of the catchments and year characteristics resulted
in time series with a different number of high flow estimates. In order to
find out whether the inclusion of more high flow values resulted in a better
validation performance, we defined the threshold for high flows as the
streamflow value that was exceeded 10 % of the time in the hourly FOEN
streamflow dataset. The Crowd52 and Crowd12 datasets were then divided into a group that
had more than the expected 10 % high flow observations and a group that
had fewer high flow observations. To determine if more high flow data
improve model performance, the Mann–Whitney U test was used to compare the
relative median P<inline-formula><mml:math id="M35" display="inline"><mml:msub><mml:mi/><mml:mi mathvariant="normal">OA</mml:mi></mml:msub></mml:math></inline-formula> of the two groups.</p>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T4" specific-use="star"><caption><p id="d1e1893">Median and the full range of the overall consistency performance P<inline-formula><mml:math id="M36" display="inline"><mml:msub><mml:mi/><mml:mi mathvariant="normal">OA</mml:mi></mml:msub></mml:math></inline-formula> scores for the upper benchmark (hourly FOEN data).
The P<inline-formula><mml:math id="M37" display="inline"><mml:msub><mml:mi/><mml:mi mathvariant="normal">OA</mml:mi></mml:msub></mml:math></inline-formula> values for the
dry, average and wet calibration years were used as the upper benchmarks for
the evaluation based on the year character (Figs. 6 and S2 in the
Supplement); the values in the “overall median” column were used as the
benchmarks in the overall median performance evaluation shown in Fig. 4.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="5">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="center"/>
     <oasis:colspec colnum="3" colname="col3" align="center"/>
     <oasis:colspec colnum="4" colname="col4" align="center"/>
     <oasis:colspec colnum="5" colname="col5" align="center"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Calibration year</oasis:entry>
         <oasis:entry colname="col2">Dry</oasis:entry>
         <oasis:entry colname="col3">Average</oasis:entry>
         <oasis:entry colname="col4">Wet</oasis:entry>
         <oasis:entry colname="col5">Overall median</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry rowsep="1" namest="col1" nameend="col4">Validation wet year </oasis:entry>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Upper benchmark</oasis:entry>
         <oasis:entry colname="col2">0.63</oasis:entry>
         <oasis:entry colname="col3">0.65</oasis:entry>
         <oasis:entry colname="col4">0.66</oasis:entry>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry rowsep="1" colname="col1"/>
         <oasis:entry rowsep="1" colname="col2">(0.19–0.79)</oasis:entry>
         <oasis:entry rowsep="1" colname="col3">(0.36–0.8)</oasis:entry>
         <oasis:entry rowsep="1" colname="col4">(0.45–0.8)</oasis:entry>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Lower benchmark</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3">0.34</oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry rowsep="1" colname="col1"/>
         <oasis:entry rowsep="1" colname="col2"/>
         <oasis:entry rowsep="1" colname="col3">(<inline-formula><mml:math id="M38" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.02</mml:mn></mml:mrow></mml:math></inline-formula>–0.47)</oasis:entry>
         <oasis:entry rowsep="1" colname="col4"/>
         <oasis:entry colname="col5">Upper benchmark</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry rowsep="1" namest="col1" nameend="col4">Validation average year </oasis:entry>
         <oasis:entry colname="col5">0.61</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Upper benchmark</oasis:entry>
         <oasis:entry colname="col2">0.59</oasis:entry>
         <oasis:entry colname="col3">0.61</oasis:entry>
         <oasis:entry colname="col4">0.53</oasis:entry>
         <oasis:entry colname="col5">(0.19–0.83)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry rowsep="1" colname="col1"/>
         <oasis:entry rowsep="1" colname="col2">(0.49–0.64)</oasis:entry>
         <oasis:entry rowsep="1" colname="col3">(0.45–0.78)</oasis:entry>
         <oasis:entry rowsep="1" colname="col4">(0.36–0.77)</oasis:entry>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Lower benchmark</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3">0.36</oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5">Lower benchmark</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry rowsep="1" colname="col1"/>
         <oasis:entry rowsep="1" colname="col2"/>
         <oasis:entry rowsep="1" colname="col3">(0.03–0.59)</oasis:entry>
         <oasis:entry rowsep="1" colname="col4"/>
         <oasis:entry colname="col5">0.34</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry rowsep="1" namest="col1" nameend="col4">Validation dry year </oasis:entry>
         <oasis:entry colname="col5">(<inline-formula><mml:math id="M39" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.02</mml:mn></mml:mrow></mml:math></inline-formula>–0.59)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Upper benchmark</oasis:entry>
         <oasis:entry colname="col2">0.51</oasis:entry>
         <oasis:entry colname="col3">0.59</oasis:entry>
         <oasis:entry colname="col4">0.53</oasis:entry>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry rowsep="1" colname="col1"/>
         <oasis:entry rowsep="1" colname="col2">(0.35–0.71)</oasis:entry>
         <oasis:entry rowsep="1" colname="col3">(0.41–0.83)</oasis:entry>
         <oasis:entry rowsep="1" colname="col4">(0.23–0.74)</oasis:entry>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Lower benchmark</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3">0.35</oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3">(0.09–0.52)</oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

      <?xmltex \floatpos{t}?><fig id="Ch1.F3" specific-use="star"><caption><p id="d1e2194">Examples of streamflow time series used for calibration
with small, medium and large errors and different temporal resolutions (Weekly,
Crowd52 and WeekendSpring) for the Mentue in 2010. Large error: adjusted FOEN data with errors
resulting from the log-normal distribution fitted to the streamflow
estimates from citizen scientists (see Fig. 2).
Medium error: same as large error, but the standard deviation of the log-normal distribution was divided by 2. Small error: same as the large error,
but the standard deviation of the log-normal distribution was divided by 4.
The grey line represents the measured streamflow, and the dots the derived time
series of streamflow observations. Note that especially in the large error
category some dots lie outside the figure margins.</p></caption>
          <?xmltex \igopts{width=327.206693pt}?><graphic xlink:href="https://hess.copernicus.org/articles/22/5243/2018/hess-22-5243-2018-f03.png"/>

        </fig>

      <?xmltex \floatpos{t}?><fig id="Ch1.F4" specific-use="star"><caption><p id="d1e2205">Box plots of the median model performance relative to the upper
benchmark for all datasets. The grey rectangles around the boxes indicate
non-significant differences in median model performance compared to the
lower benchmark with random parameter sets. The box represents the 25th
and 75th percentile, the thick horizontal line represents the median, the whiskers
extend to 1.5 times the interquartile range below the 25th percentile
and above the 75th percentile and the dots represent the outliers. The
numbers at the bottom indicate the number of outliers beyond the figure
margins; <inline-formula><mml:math id="M40" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> is the number of streamflow observations used for model
calibration. The result of the hourly benchmark FOEN dataset has some spread
because the results of the 100 parameters sets were divided by their median
performance. A relative P<inline-formula><mml:math id="M41" display="inline"><mml:msub><mml:mi/><mml:mi mathvariant="normal">OA</mml:mi></mml:msub></mml:math></inline-formula> of 1 indicates that the model performance
is as good as the performance of the model calibrated with the hourly FOEN
data (upper benchmark).</p></caption>
          <?xmltex \igopts{width=312.980315pt}?><graphic xlink:href="https://hess.copernicus.org/articles/22/5243/2018/hess-22-5243-2018-f04.png"/>

        </fig>

</sec>
</sec>
<sec id="Ch1.S3">
  <title>Results</title>
<sec id="Ch1.S3.SS1">
  <title>Upper benchmark results</title>
      <p id="d1e2242">The model was able to reproduce the measured streamflow reasonably well when
the complete and unchanged hourly FOEN datasets were used for calibration,
although there were also a few exceptions. The average validation
P<inline-formula><mml:math id="M42" display="inline"><mml:msub><mml:mi/><mml:mi mathvariant="normal">OA</mml:mi></mml:msub></mml:math></inline-formula> was 0.61 (range: 0.19–0.83; Table 4). The validation
performance was poorest for the Guerbe (validation P<inline-formula><mml:math id="M43" display="inline"><mml:mrow><mml:msub><mml:mi/><mml:mi mathvariant="normal">OA</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.19</mml:mn></mml:mrow></mml:math></inline-formula>)
because several high flow peaks were missed or underestimated by the model
for the wet validation year. Similarly, the validation for the Mentue for the
dry validation year resulted in a low P<inline-formula><mml:math id="M44" display="inline"><mml:msub><mml:mi/><mml:mi mathvariant="normal">OA</mml:mi></mml:msub></mml:math></inline-formula> (0.23) because a very
distinct peak at the end of the year was missed and summer low flows were
overestimated. The third lowest P<inline-formula><mml:math id="M45" display="inline"><mml:msub><mml:mi/><mml:mi mathvariant="normal">OA</mml:mi></mml:msub></mml:math></inline-formula> value was also for the Guerbe
(dry validation year) but already had a P<inline-formula><mml:math id="M46" display="inline"><mml:msub><mml:mi/><mml:mi mathvariant="normal">OA</mml:mi></mml:msub></mml:math></inline-formula> of 0.35. Six out of
the nine lowest P<inline-formula><mml:math id="M47" display="inline"><mml:msub><mml:mi/><mml:mi mathvariant="normal">OA</mml:mi></mml:msub></mml:math></inline-formula> values were for dry validation years.
Validation for wet years for the models calibrated with data from wet years
resulted in the best validation results (i.e. highest P<inline-formula><mml:math id="M48" display="inline"><mml:msub><mml:mi/><mml:mi mathvariant="normal">OA</mml:mi></mml:msub></mml:math></inline-formula> values;
Table 4).</p>
</sec>
<sec id="Ch1.S3.SS2">
  <title>Effect of errors on the model validation results</title>
      <p id="d1e2320">Not surprisingly, increasing the errors in the streamflow data used for
model calibration led to a decrease in the model performance
(Fig. 4). For the small error category, the
median validation performance was better than the lower benchmark for all
temporal resolutions (Fig. 4 and Table S2). For the medium
error category, the median validation performance was also better than the
lower benchmark for all scenarios, except for the Crowd12 dataset. For the model
calibrated with the dataset with large errors, only the Hourly dataset was
significantly better than the lower benchmark (Table 5).</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F5"><caption><p id="d1e2325">Results (<inline-formula><mml:math id="M49" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula> values) of the Kruskal–Wallis with Bonferroni post hoc test to determine
the significance of the difference in the median model performance for the
data with different temporal resolutions within each data quality group (no
error <bold>a</bold>, small error <bold>b</bold>, medium error <bold>c</bold>, and
large error <bold>d</bold>). Blue shades represent the p values. White triangles
indicate <inline-formula><mml:math id="M50" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula> values <inline-formula><mml:math id="M51" display="inline"><mml:mi mathvariant="italic">&lt;</mml:mi></mml:math></inline-formula> 0.05 and white stars indicate p values
that, when adjusted for multiple comparisons, are still <inline-formula><mml:math id="M52" display="inline"><mml:mi mathvariant="italic">&lt;</mml:mi></mml:math></inline-formula> 0.05.</p></caption>
          <?xmltex \igopts{width=241.848425pt}?><graphic xlink:href="https://hess.copernicus.org/articles/22/5243/2018/hess-22-5243-2018-f05.png"/>

        </fig>

</sec>
<sec id="Ch1.S3.SS3">
  <title>Effect of the data resolution on the model validation results</title>
      <p id="d1e2381">The Hourly measurement scenario resulted in the best validation performance for
each error group, followed by the Weekly data, and then usually the Crowd52 data
(Fig. 4). Although the median validation
performance of the models calibrated with the Weekly datasets was better than for
the Crowd52 dataset for all error cases, the difference was only statistically
significant for the no error category (Fig. 5).</p>
      <p id="d1e2384">The validation performance of the models calibrated with the Weekly and Crowd52 datasets
was better than for the scenarios focused on spring and summer observations
(WeekendSpring, WeekendSummer and IntenseSummer). The median model performance for the Weekly dataset was significantly
better than the datasets focusing on spring and summer for the no, small and
medium error groups. The median performance of the Crowd52 dataset was only
significantly better than all three measurement scenarios focusing on spring
or summer for the small error case (Fig. 5). The
model validation performance for the WeekendSummer and IntenseSummer scenarios decreased faster with
increasing errors compared to the Weekly, Crowd52 or WeekendSpring datasets
(Fig. 4). The median validation P<inline-formula><mml:math id="M53" display="inline"><mml:msub><mml:mi/><mml:mi mathvariant="normal">OA</mml:mi></mml:msub></mml:math></inline-formula> for the
models calibrated with the WeekendSpring observations was better than for the models
calibrated with the WeekendSummer and IntenseSummer datasets but the differences were only significant
for the small, medium and large error groups. The differences in the model
performance results for the observation strategies that focussed on summer
(IntenseSummer and WeekendSummer) were not significant for any of the error groups
(Fig. 5).</p>
      <p id="d1e2396">The median model performance for the regularly spaced Monthly datasets with 12
observations was similar to the median performance for the three datasets
focusing on summer with 46–54 measurements (WeekendSpring, WeekendSummer and IntenseSummer), except for the case of
large errors for which the monthly dataset performed worse. The irregularly
spaced Crowd12 time series resulted in the worst model performance for each error
group, but the difference from the performance for the regularly spaced
Monthly data was only significant for the dataset with large errors (Fig. 5).</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F6" specific-use="star"><caption><p id="d1e2401">Median model validation performance for the datasets calibrated and
validated both in a dry year and in a wet year. Each horizontal line
represents the median model performance for one catchment. The black bold
line represents the median for the six catchments. The grey rectangles around
the boxes indicate non-significant differences in median model performance
for the six catchments compared to the lower benchmark with random
parameters. The numbers at the bottom indicate the number of outliers beyond
the figure margins. For the individual P<inline-formula><mml:math id="M54" display="inline"><mml:msub><mml:mi/><mml:mi mathvariant="normal">OA</mml:mi></mml:msub></mml:math></inline-formula> values of the upper
benchmark (no error–Hourly dataset) in the different calibration and
validation years, see Table 4.</p></caption>
          <?xmltex \igopts{width=312.980315pt}?><graphic xlink:href="https://hess.copernicus.org/articles/22/5243/2018/hess-22-5243-2018-f06.png"/>

        </fig>

</sec>
<sec id="Ch1.S3.SS4">
  <title>Effect of errors and data resolution on the parameter ranges</title>
      <p id="d1e2425">For most parameters the spread in the optimised parameter values was
smallest for the upper benchmark. The spread in the parameter values
increased with increasing errors in the data used for calibration,
particularly for MAXBAS (the routing parameter) but also for some other
parameters (e.g. TCALT, TT and BETA). However, for some parameters (e.g.
CFMAX, FC, and SFCF) the range in the optimised parameter values was mainly
affected by the temporal resolution of the data and the number of data
points used for calibration. It should be noted though that the changes in
the range of model parameters differed significantly for the different
catchments and the trends were not very clear.</p><?xmltex \hack{\newpage}?>
</sec>
<?pagebreak page5250?><sec id="Ch1.S3.SS5">
  <title>Influence of the calibration and validation year and number of high flow
data points on the model performance</title>
      <p id="d1e2436">The influence of the validation year on the model performance was larger than
the effect of the calibration year (Figs. 6 and S2). In general model
performance was poorest for the dry validation years. The model performances
of all datasets with fewer observations or bigger errors than the Hourly
datasets without errors were not significantly better than the lower
benchmark for the dry validation years, except for Crowd52 in the no error
group when calibrated with data from a wet year. However, even for the wet
validation years some observation scenarios of the no error and small error
group did not lead to significantly better model validation results compared
to the median validation performance for the random parameters.
Interestingly, the IntenseSummer dataset in the no error group resulted in a
very good performance when the model was calibrated for a dry year and also
validated in a dry year compared to its performance in the other calibration
and validation year combinations. The median model performance was however
not significantly better than the lower benchmark due to the low
performance for the Guerbe and
Allenbach (outliers beyond figure margins in Fig. 6). The validation results
for these two catchments were the worst for all the no
error–IntenseSummer datasets for all calibration and validation year
combinations.</p>
      <p id="d1e2439">For 13 out of the 18 catchment and year combinations, the Crowd52 datasets
with fewer than 10 % high streamflow data points led to a better
validation performance than the Crowd52 datasets with more high streamflow
data points. For six of them, the difference in model performance was
significant. For none of the five cases where more high flow data points led
to a better model performance was the difference significant. Also when the
results were analysed by year character or catchment, there was no
improvement when more high flow values were included in the calibration
dataset.</p>
</sec>
</sec>
<sec id="Ch1.S4">
  <title>Discussion</title>
<sec id="Ch1.S4.SS1">
  <title>Usefulness of inaccurate streamflow data for hydrological model
calibration</title>
      <p id="d1e2454">In this study, we evaluated the information content of streamflow estimates
by citizen scientists for calibration of a bucket-type hydrological model
for six Swiss catchments. While the hydroclimatic conditions, the model or the calibration approach might be
different in other studies, these results should be applicable for a wide
range of cases. However, for physically based spatially distributed models
that are usually not calibrated automatically, the use of limited streamflow
data would probably benefit from a different calibration approach.
Furthermore, our results might not be applicable in arid catchments
where rivers become dry for some periods of the year because the linear
reservoirs used in the HBV model are not appropriate for such systems.</p>
      <?pagebreak page5251?><p id="d1e2457"><?xmltex \hack{\newpage}?>Streamflow estimates by citizens are sometimes very
different from the measured values, and the individual estimates can be
disinformative for model calibration (Beven, 2016; Beven and Westerberg, 2011).
The results show that if the streamflow estimates by citizen scientists were
available at a high temporal resolution (hourly), these data would still be informative
for the calibration of a bucket-type hydrological model despite their high
uncertainties. However, observations with such a high resolution are very
unlikely to be obtained in practice. All scenarios with error distributions
that represent the estimates from citizen scientists with fewer observations
were no better than the lower benchmark (using random parameters). With
medium errors, however, and one data point per week on average or regularly
spaced monthly data, the data were informative for model parameterisation.
Reducing the standard deviation of the error distribution by a factor of
4 led to a significantly improved model performance compared to the lower benchmark for all the observation
scenarios.</p>
      <p id="d1e2461">A reduction in the errors of the streamflow estimates could be achieved by
training of citizen scientists (e.g. videos), improved information about
feasible ranges for stream depth, width and velocity, or examples of
streamflow values for well-known streams. Filtering of extreme outliers can
also reduce the spread of the estimates. This could be done with existing
knowledge of feasible streamflow values for a catchment of a given area or
the amount of rainfall right before the estimate is made to determine if
streamflow is likely to be higher or lower than for the previous estimate.
More<?pagebreak page5252?> detailed research is necessary to test the effectiveness of such
methods.</p>
      <p id="d1e2464">Le Coz et al. (2014) reported an
uncertainty in stage–discharge streamflow measurements of around 5 %–20 %. McMillan et al. (2012) summarised streamflow
uncertainties from stage–discharge relationships in a more detailed review
and gave a range of <inline-formula><mml:math id="M55" display="inline"><mml:mrow><mml:mo>±</mml:mo><mml:mn mathvariant="normal">50</mml:mn></mml:mrow></mml:math></inline-formula> %–100 % for low flows, <inline-formula><mml:math id="M56" display="inline"><mml:mrow><mml:mo>±</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula> %–20 %
for medium or high (in-bank) flows and <inline-formula><mml:math id="M57" display="inline"><mml:mrow><mml:mo>±</mml:mo><mml:mn mathvariant="normal">40</mml:mn></mml:mrow></mml:math></inline-formula> % for out-of-bank
flows. The errors for the most extreme outliers in the citizen estimates are
considerably higher, and could differ up to a factor of 10 000 from the
measured value in the most extreme but rare cases
(Fig. 2). Even with reduced standard deviations
of the error distribution by a factor of 2 or 4, the observations in
the most extreme cases can still differ by a factor of 100 and 10. The
percentage of data points that differed from the measured value by more than 200 % was 33 % for the large error group,
19 % for the medium error group and 4 % for the small error group. Only
3 % of the data points were more than 90 % below the measured value in the large error
group and 0 % for both in the medium and small error classes. If such
observations are used for model calibration without filtering, they are seen
as extreme floods or droughts, even if the actual conditions may be close to
average flow. Beven and Westerberg (2011) suggest isolating
periods of disinformative data. It is therefore beneficial to identify such
extreme outliers, independent of a model, e.g. with knowledge of feasible
maximum and minimum streamflow quantities, as used in this study, with the
help of the maximum regionalised specific streamflow values for a given
catchment area.</p>
</sec>
<sec id="Ch1.S4.SS2">
  <title>Number of streamflow estimates required for model calibration</title>
      <p id="d1e2503">In general, one would assume that the calibration of a model becomes better
when there are more data (Perrin et al.,
2007), although others have shown that the increase in model performance
plateaus after a certain number of measurements (Juston
et al., 2009; Pool et al., 2017; Seibert and Beven, 2009; Seibert and
McDonnell, 2015). In this study, we limited the length of the calibration
period to 1 year because in practice it may be possible to obtain a
limited number of measurements during a 1-year period for ungauged
catchments before the model results are needed for a certain application, as
has been assumed in previous studies (Pool et al., 2017; Seibert
and McDonnell, 2015). While a limited number of observations (12) was
informative for model calibration when the data uncertainties were limited,
the results of this study also suggest that the performance of bucket-type
models decreases faster with increasing errors when fewer data points are
available (i.e. there was a faster decline in model performance<?pagebreak page5253?> with
increasing errors for models calibrated with 12 data points than for the
models calibrated with 48–52 data points). This finding was most pronounced
when comparing the model performance for the small and medium error
groups (Fig. 4). These findings can be explained by the compensating
effect of the number of observations and their accuracy because the random
errors for the inaccurate data average out when a large number of
observations are used, as long as the data do not have a large bias.</p>
</sec>
<sec id="Ch1.S4.SS3">
  <title>Best timing of streamflow estimates for model calibration</title>
      <p id="d1e2512">The performance of the parameter sets depended on the timing and the error
distribution of the data used for model calibration. The model performance
was generally better if the observations were more evenly spread throughout
the year. For example, for the cases of no and small errors, the performance
of the model calibrated with the Monthly dataset with 12 observations was better
than for the IntenseSummer and WeekendSummer scenarios with 46–54 observations. Similarly, the less
clustered observation scenarios performed better than the more clustered
scenarios (i.e. Weekly vs. Crowd52, Monthly
vs. Crowd12, Crowd52 vs. IntenseSummer, etc.). This suggests that more regularly
distributed data over the year lead to a better model calibration.
Juston et al. (2009) compared
different subsamples of hydrological data for a 5.6 km<inline-formula><mml:math id="M58" display="inline"><mml:msup><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:math></inline-formula> Swedish
catchment and found that including inter-annual variability in the data used
for the calibration of the HBV model reduced the model uncertainties. More
evenly distributed observations throughout the year might represent more of
the within-year streamflow variability and therefore result in improved
model performance. This is good news for using citizen science data for
model calibration as it suggests that the timing is not as important as the
number of observations because it is likely much easier to get observations
throughout the year than during specific periods or flow conditions.</p>
      <p id="d1e2524">When comparing the WeekendSpring, WeekendSummer and IntenseSummer datasets, it seems that it was in most cases more
beneficial to include data from spring rather than summer. This tendency was
more pronounced with increasing data errors. The reason for this might be
that the WeekendSpring scenario includes more snowmelt or rain-on-snow event peaks, in
addition to usually higher baseflow, and therefore contains more information
on the inter-annual variability in streamflow.</p>
      <p id="d1e2527">By comparing different variations of 12 data points to calibrate the HBV
model, Pool et al. (2017) found
that a dataset that contains a combination of different maximum (monthly,
yearly etc.) and other flows in model calibration led to the best model
performance but also that the differences in performance for the different datasets
covering the range of flows were small. In our study we did not specifically
focus on the high or low flow data points, and therefore did not have
datasets that contained only high flow estimates, which would be very
difficult to obtain with citizen science data. However, our findings
similarly show that for model calibration for catchments with seasonal
variability in streamflow it is beneficial to obtain data for different
magnitudes of flow. Furthermore, we found that data points during relatively
dry periods are beneficial for validation or prediction in another year and
might even be beneficial for years with the same characteristics, as was
shown for the improved validation performance of the IntenseSummer dataset compared to
the other datasets when data from dry years were used for calibration
(Fig. 6).</p>
</sec>
<sec id="Ch1.S4.SS4">
  <title>Effects of different types of years on model calibration and
validation</title>
      <p id="d1e2536">The calibration year, i.e. the year in which the observations were made, was
not decisive for the model performance. Therefore, a model calibrated with
data from a dry year can still be useful for simulations for an average or
wet year. This also means that data in citizen science projects can be
collected during any year and that these data are useful for simulating
streamflow for most years, except the driest years. However, model
performance did vary significantly for the different validation years. The
results during dry validation years were almost never significantly better
than the<?pagebreak page5254?> lower benchmark (Fig. S2). This might be due to the objective function that was used in
this study. Especially the NSE was lower for dry years because the flow
variance (i.e. the denominator in the equation) is smaller when there is a
larger variation in streamflow. Also, these results are based on six median
model performances, and therefore, outliers have a big influence on the
significance of results (Fig. S2).</p>
      <p id="d1e2539">Lidén and Harlin (2000) used the HBV-96 model by Lindström et al. (1997) with changes suggested by
Bergström et al. (1997) for four catchments in
Europe, Africa and South America. They achieved better model results for
wetter catchments and argued that during dry years evapotranspiration plays
a bigger role and therefore the model performance is more sensitive to
inaccuracies in the simulation of the evapotranspiration processes. The fact
that we used a very simple method to calculate the potential
evapotranspiration (McGuinness and Bordne, 1972) might also
explain why the model performed less well during dry years.</p>
      <p id="d1e2542">The model parameterisation, obtained from calibration using the
IntenseSummer dataset, resulted in a surprisingly good performance for the validation for
a more extreme dry year for four out of the six catchments. For the two
catchments for which the performance for the IntenseSummer dataset was poor (Guerbe and
Allenbach), the weather stations are located outside the catchment
boundaries. Especially during dry periods missed streamflow peaks due to
misrepresentation of precipitation can affect model performance a lot. The
fact that always one of these two catchments had the worst model performance
for all the no error–IntenseSummer runs furthermore indicates that the July–September
period might not be suitable to represent characteristic runoff events for
these catchments. The bad performance for these two catchments for the
IntenseSummer–no error run with calibration and validation in the dry year resulted in
the insignificant improvement in model performance compared to the lower
benchmark. Because the wetness of a year was based on the summer streamflow,
these findings suggest that data obtained during times of low flow result
in improved validation performance during dry years compared to data
collected during other times (Fig. S2). This suggests that if the interest is in understanding the
streamflow response during very dry years, it is important to obtain data
during the dry period. To test this hypothesis, more detailed analyses are
needed.</p>
</sec>
<sec id="Ch1.S4.SS5">
  <title>Recommendations for citizen science projects</title>
      <p id="d1e2552">Our results show that streamflow estimates from citizens are not informative
for hydrological model calibration, unless the errors in the estimates can be
reduced through training or advanced filtering of the data to reduce the
errors (i.e. to reduce the number of extreme outliers). In order to make
streamflow estimates useful, the standard deviation of the error distribution of the estimates needs
to be reduced by a<?pagebreak page5255?> factor of 2.
Gibson and Bergman (1954) suggest that errors in distance estimates can be
reduced from 33 % to 14 % with very little training. These findings
are encouraging, although their tests covered distances larger than 365 m
(400 yards) and the widths of the medium-sized rivers for which the
streamflow was estimated were less than 40 m (Strobl et al., 2018). Options
for training might be tutorial videos, as well as lists with values for the
width, average depth and flow velocity of well-known streams (Strobl et al.,
2018). In order to determine the effect of training on streamflow estimates,
further research has to be done because especially the depth estimates were
inaccurate (Strobl et al., 2018).</p>
      <p id="d1e2555">The findings of this study suggest the following recommendations for citizen
science projects that want to use streamflow estimates:</p>
      <p id="d1e2558"><list list-type="bullet">
            <list-item>

      <p id="d1e2563"><italic>Collect as many data points as possible</italic>. In this study hourly data always led to
the best model performance. It is therefore beneficial to collect as many
data points as possible. Because it is unlikely that hourly data are obtained, we suggest
to aim for (on average) one observation per week. Provided that the standard
deviation of the streamflow estimates can be reduced by a factor of 2, 52
observations (as in the Crowd52 data series) are informative for model calibration.
Therefore, it is essential to invest in advertisement of a project and to
find suitable locations where many people can potentially contribute, as
well as to communicate to the citizen scientists that it is beneficial to
submit observations regularly.</p>
            </list-item>
            <list-item>

      <p id="d1e2571"><italic>Encourage observations throughout the year</italic>. To further improve the model
performance, or to allow for greater errors, it is beneficial to have
observations at all types of flow conditions during the year, rather than
during a certain season.</p>
            </list-item>
          </list></p>
      <p id="d1e2578">Observations during high streamflow conditions were in most cases not more
informative than flows during other times of the year. Efforts to ask
citizens to submit observations during specific flow conditions (e.g. by
sending reminders to the citizen observers) do not seem to be very effective in
light of the above findings. It is rather more beneficial to remind them to
submit observations regularly.</p>
      <p id="d1e2582">Instead of focussing on training to reduce the errors in the streamflow
estimates, an alternative approach for citizen science projects is to switch
to a parameter that is easier to estimate, such as stream levels (Lowry and
Fienen, 2013). Recent studies successfully used daily stream-level data
(Seibert and Vis, 2016) and stream-level class data (van Meerveld et al.
2017) to calibrate hydrological models, and other
studies have demonstrated the potential
value of crowdsourced stream level data for providing information on, e.g.
baseflow (Lowry and Fienen, 2013), or for improving flood forecasts
(Mazzoleni et al., 2017). However, further research is needed to determine if
real crowdsourced stream-level (class) data are informative for the
calibration of hydrological models.</p>
</sec>
</sec>
<sec id="Ch1.S5" sec-type="conclusions">
  <title>Conclusions</title>
      <p id="d1e2593">The results of this study extend previous studies on the value of limited
hydrological data for hydrological model calibration or the best timing of
streamflow measurements for model calibration (Juston et al., 2009;
Pool et al., 2017; Seibert and McDonnell, 2015) that did not consider
observation errors. This is an important aspect, especially when considering
citizen science approaches to obtain streamflow data. Our results show that
inaccurate streamflow data can be useful for model calibration, as long as
the errors are not too large. When the distribution of errors in the
streamflow data represented the distribution of the errors in the streamflow
estimates from citizen scientists, this information was not informative for
model calibration (i.e. the median performance of the models calibrated with
these data was not significantly better than the median performance of the
models with random parameter values). However, if the standard deviation of
the estimates is reduced by a factor of 2, then the (less) inaccurate data
would be informative for model calibration. We furthermore demonstrated
that realistic frequencies for citizen science projects (one observation on
average per week or month) can be informative for model calibration. The
findings of studies such as the one presented here provide important
guidance on the design of citizen science projects as well as other observation approaches.</p>
</sec>

      
      </body>
    <back><notes notes-type="dataavailability">

      <p id="d1e2600">The data are
available from FOEN (streamflow) and MeteoSwiss (precipitation and
temperature). The HBV software is available at
<uri>https://www.geo.uzh.ch/en/units/h2k/Services/HBV-Model.html</uri> (Seibert and Vis, 2012) or from jan.seibert@geo.uzh.ch.</p>
  </notes><app-group>
        <supplementary-material position="anchor"><p id="d1e2606">The supplement related to this article is available online at: <inline-supplementary-material xlink:href="https://doi.org/10.5194/hess-22-5243-2018-supplement" xlink:title="pdf">https://doi.org/10.5194/hess-22-5243-2018-supplement</inline-supplementary-material>.</p></supplementary-material>
        </app-group><notes notes-type="authorcontribution">

      <p id="d1e2615">While JS and IvM had the initial idea, the concrete
study design was based on input from all authors. SE and BS conducted the
field surveys to determine the typical errors in streamflow estimates. The simulations and analyses were
performed by SE. The writing of the manuscript was led by SE; all co-authors
contributed to the writing.</p>
  </notes><notes notes-type="competinginterests">

      <p id="d1e2621">The authors declare that they have no conflict of
interest.</p>
  </notes><ack><title>Acknowledgements</title><?pagebreak page5256?><p id="d1e2627">We thank all citizen scientists who participated in the field surveys, as
well as the Swiss Federal Office for the Environment for providing the
streamflow data, MeteoSwiss for providing the weather data, Maria Staudinger,
Jan Schwanbeck and Scherrer AG for the permission to use their datasets and
the reviewers for the useful comments. This project was funded by the Swiss
National Science Foundation (project CrowdWater).<?xmltex \hack{\newline}?><?xmltex \hack{\newline}?> Edited by: Nadav Peleg<?xmltex \hack{\newline}?> Reviewed by: two
anonymous referees</p></ack><ref-list>
    <title>References</title>

      <ref id="bib1.bib1"><label>1</label><mixed-citation>
Aschwanden, H. and Weingartner, R.: Die Abflussregimes der Schweiz,
Geographisches Institut der Universität Bern, Abteilung Physikalische
Geographie, Gewässerkunde, Bern, Switzerland, 1985.</mixed-citation></ref>
      <ref id="bib1.bib2"><label>2</label><mixed-citation>Bergström, S.: Development and application of a conceptual runoff model for
Scandinavian catchments, Sveriges Meteorologiska och Hydrologiska Institut
(SMHI), Norrköping, Sweden, available at:
<uri>https://www.researchgate.net/publication/255274162_Development_and_Application_of_a_Conceptual_Runoff_Model_for_Scandinavian_Catchments</uri>
(last access: 3 October 2018), 1976.</mixed-citation></ref>
      <ref id="bib1.bib3"><label>3</label><mixed-citation>
Bergström, S., Carlsson, B., Grahn, G., and Johansson, B.: A More
Consistent Approach to Watershed Response in the HBV Model, Vannet i Nord.,
4, 1997.</mixed-citation></ref>
      <ref id="bib1.bib4"><label>4</label><mixed-citation>Beven, K.: Facets of uncertainty: epistemic uncertainty, non-stationarity,
likelihood, hypothesis testing, and communication, Hydrol. Sci. J., 61,
1652–1665, <ext-link xlink:href="https://doi.org/10.1080/02626667.2015.1031761" ext-link-type="DOI">10.1080/02626667.2015.1031761</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bib5"><label>5</label><mixed-citation>Beven, K. and Westerberg, I.: On red herrings and real herrings:
disinformation and information in hydrological inference, Hydrol. Process.,
25, 1676–1680, <ext-link xlink:href="https://doi.org/10.1002/hyp.7963" ext-link-type="DOI">10.1002/hyp.7963</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bib6"><label>6</label><mixed-citation>
Bonferroni, C. E.: Teoria statistica delle classi e calcolo delle
probabilità, st. Super. di Sci. Econom. e Commerciali di Firenze,
Istituto superiore di scienze economiche e commerciali, Florence, Italy, 62
pp., 1936.</mixed-citation></ref>
      <ref id="bib1.bib7"><label>7</label><mixed-citation>Brath, A., Montanari, A., and Toth, E.: Analysis of the effects of different
scenarios of historical data availability on the calibration of a
spatially-distributed hydrological model, J. Hydrol., 291, 232–253,
<ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2003.12.044" ext-link-type="DOI">10.1016/j.jhydrol.2003.12.044</ext-link>, 2004.</mixed-citation></ref>
      <ref id="bib1.bib8"><label>8</label><mixed-citation>Buytaert, W., Zulkafli, Z., Grainger, S., Acosta, L., Alemie, T. C.,
Bastiaensen, J., De BiÃv̈re, B., Bhusal, J., Clark, J., Dewulf, A.,
Foggin, M., Hannah, D. M., Hergarten, C., Isaeva, A., Karpouzoglou, T.,
Pandeya, B., Paudel, D., Sharma, K., Steenhuis, T., Tilahun, S., Van Hecken,
G., and Zhumanova, M.: Citizen science in hydrology and water resources:
opportunities for knowledge generation, ecosystem service management, and
sustainable development, Front. Earth Sci., 2, 21 pp.,
<ext-link xlink:href="https://doi.org/10.3389/feart.2014.00026" ext-link-type="DOI">10.3389/feart.2014.00026</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bib9"><label>9</label><mixed-citation>Davids, J. C., van de Giesen, N., and Rutten, M.: Continuity vs. the
Crowd – Tradeoffs Between Continuous and Intermittent Citizen Hydrology
Streamflow Observations, Environ. Manage., 60, 12–29,
<ext-link xlink:href="https://doi.org/10.1007/s00267-017-0872-x" ext-link-type="DOI">10.1007/s00267-017-0872-x</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bib10"><label>10</label><mixed-citation>Davids, J. C., Rutten, M. M., Shah, R. D. T., Shah, D. N., Devkota, N.,
Izeboud, P., Pandey, A., and van de Giesen, N.: Quantifying the connections
– linkages between land-use and water in the Kathmandu Valley, Nepal,
Environ. Monit. Assess., 190, 17 pp., <ext-link xlink:href="https://doi.org/10.1007/s10661-018-6687-2" ext-link-type="DOI">10.1007/s10661-018-6687-2</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bib11"><label>11</label><mixed-citation>Dickinson, J. L., Zuckerberg, B., and Bonter, D. N.: Citizen Science as an
Ecological Research Tool: Challenges and Benefits, Annu. Rev. Ecol. Evol.
Syst., 41, 149–172, <ext-link xlink:href="https://doi.org/10.1146/annurev-ecolsys-102209-144636" ext-link-type="DOI">10.1146/annurev-ecolsys-102209-144636</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bib12"><label>12</label><mixed-citation>Dunn, O. J.: Estimation of the Medians for Dependent Variables, Ann. Math.
Stat., 30, 192–197, <ext-link xlink:href="https://doi.org/10.1214/aoms/1177706374" ext-link-type="DOI">10.1214/aoms/1177706374</ext-link>, 1959.</mixed-citation></ref>
      <ref id="bib1.bib13"><label>13</label><mixed-citation>Dunn, O. J.: Multiple Comparisons among Means, J. Am. Stat. Assoc., 56,
52–64, <ext-link xlink:href="https://doi.org/10.1080/01621459.1961.10482090" ext-link-type="DOI">10.1080/01621459.1961.10482090</ext-link>, 1961.</mixed-citation></ref>
      <ref id="bib1.bib14"><label>14</label><mixed-citation>Ewen, T., Brönnimann, S., and Annis, J.: An extended Pacific-North
American index from upper-air historical data back to 1922, J. Climate, 21,
1295–1308, <ext-link xlink:href="https://doi.org/10.1175/2007JCLI1951.1" ext-link-type="DOI">10.1175/2007JCLI1951.1</ext-link>, 2008.</mixed-citation></ref>
      <ref id="bib1.bib15"><label>15</label><mixed-citation>Finger, D., Pellicciotti, F., Konz, M., Rimkus, S., and Burlando, P.: The
value of glacier mass balance, satellite snow cover images, and hourly
discharge for improving the performance of a physically based distributed
hydrological model, Water Resour. Res., 47, 14 pp.,
<ext-link xlink:href="https://doi.org/10.1029/2010WR009824" ext-link-type="DOI">10.1029/2010WR009824</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bib16"><label>16</label><mixed-citation>Finger, D., Vis, M., Huss, M., and Seibert, J.: The value of multiple data
set calibration versus model complexity for improving the performance of
hydrological models in mountain catchments, Water Resour. Res., 51,
1939–1958, <ext-link xlink:href="https://doi.org/10.1002/2014WR015712" ext-link-type="DOI">10.1002/2014WR015712</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bib17"><label>17</label><mixed-citation>Fitzner, D., Sester, M., Haberlandt, U., and Rabiei, E.: Rainfall Estimation
with a Geosensor Network of Cars – Theoretical Considerations and First
Results, Photogramm. Fernerkun., 2013, 93–103,
<ext-link xlink:href="https://doi.org/10.1127/1432-8364/2013/0161" ext-link-type="DOI">10.1127/1432-8364/2013/0161</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bib18"><label>18</label><mixed-citation>Gibson, E. J. and Bergman, R.: The effect of training on absolute estimation
of distance over the ground, J. Exp. Psychol., 48, 473–482,
<ext-link xlink:href="https://doi.org/10.1037/h0055007" ext-link-type="DOI">10.1037/h0055007</ext-link>, 1954.</mixed-citation></ref>
      <ref id="bib1.bib19"><label>19</label><mixed-citation>Haberlandt, U. and Sester, M.: Areal rainfall estimation using moving cars as
rain gauges – a modelling study, Hydrol. Earth Syst. Sci., 14, 1139–1151,
<ext-link xlink:href="https://doi.org/10.5194/hess-14-1139-2010" ext-link-type="DOI">10.5194/hess-14-1139-2010</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bib20"><label>20</label><mixed-citation>
Harrelson, C. C., Rawlins, C. L., and Potyondy, J. P.: Stream channel
reference sites: an illustrated guide to field technique, Department of
Agriculture, Forest Service, Rocky Mountain Forest and Range Experiment
Station location, Fort Collins, CO, US, 1994.</mixed-citation></ref>
      <ref id="bib1.bib21"><label>21</label><mixed-citation>Horner, I., Renard, B., Le Coz, J., Branger, F., McMillan, H. K., and
Pierrefeu, G.: Impact of Stage Measurement Errors on Streamflow Uncertainty,
Water Resour. Res., 54, 1952–1976, <ext-link xlink:href="https://doi.org/10.1002/2017WR022039" ext-link-type="DOI">10.1002/2017WR022039</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bib22"><label>22</label><mixed-citation>Juston, J., Seibert, J., and Johansson, P.: Temporal sampling strategies and
uncertainty in calibrating a conceptual hydrological model for a small boreal
catchment, Hydrol. Process., 23, 3093–3109, <ext-link xlink:href="https://doi.org/10.1002/hyp.7421" ext-link-type="DOI">10.1002/hyp.7421</ext-link>, 2009.</mixed-citation></ref>
      <ref id="bib1.bib23"><label>23</label><mixed-citation>Koch, J. and Stisen, S.: Citizen science: A new perspective to advance
spatial pattern evaluation in hydrology, PLoS One, 12, 1–20,
<ext-link xlink:href="https://doi.org/10.1371/journal.pone.0178165" ext-link-type="DOI">10.1371/journal.pone.0178165</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bib24"><label>24</label><mixed-citation>Le Coz, J., Renard, B., Bonnifait, L., Branger, F., and Le Boursicaud, R.:
Combining hydraulic knowledge and uncertain gaugings in the estimation of
hydrometric rating curves: A Bayesian approach, J. Hydrol., 509, 573–587,
<ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2013.11.016" ext-link-type="DOI">10.1016/j.jhydrol.2013.11.016</ext-link>, 2014.</mixed-citation></ref>
      <?pagebreak page5257?><ref id="bib1.bib25"><label>25</label><mixed-citation>Lidén, R. and Harlin, J.: Analysis of conceptual rainfall–runoff
modelling performance in different climates, J. Hydrol., 238, 231–247,
<ext-link xlink:href="https://doi.org/10.1016/S0022-1694(00)00330-9" ext-link-type="DOI">10.1016/S0022-1694(00)00330-9</ext-link>, 2000.</mixed-citation></ref>
      <ref id="bib1.bib26"><label>26</label><mixed-citation>Lindström, G., Johansson, B., Persson, M., Gardelin, M., and
Bergström, S.: Development and test of the distributed HBV-96
hydrological model, J. Hydrol., 201, 272–288,
<ext-link xlink:href="https://doi.org/10.1016/S0022-1694(97)00041-3" ext-link-type="DOI">10.1016/S0022-1694(97)00041-3</ext-link>, 1997.</mixed-citation></ref>
      <ref id="bib1.bib27"><label>27</label><mixed-citation>Lowry, C. S. and Fienen, M. N.: CrowdHydrology: Crowdsourcing Hydrologic Data
and Engaging Citizen Scientists, Ground Water, 51, 151–156,
<ext-link xlink:href="https://doi.org/10.1111/j.1745-6584.2012.00956.x" ext-link-type="DOI">10.1111/j.1745-6584.2012.00956.x</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bib28"><label>28</label><mixed-citation>Mazzoleni, M., Verlaan, M., Alfonso, L., Monego, M., Norbiato, D., Ferri, M.,
and Solomatine, D. P.: Can assimilation of crowdsourced data in hydrological
modelling improve flood prediction?, Hydrol. Earth Syst. Sci., 21, 839–861,
<ext-link xlink:href="https://doi.org/10.5194/hess-21-839-2017" ext-link-type="DOI">10.5194/hess-21-839-2017</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bib29"><label>29</label><mixed-citation>
McGuinness, J. and Bordne, E.: A comparison of lysimeter-derived potential
evapotranspiration with computed values, Agricultural Research Service –
United States Department of Agriculture Location, Washington D.C., 1972.</mixed-citation></ref>
      <ref id="bib1.bib30"><label>30</label><mixed-citation>McMillan, H., Freer, J., Pappenberger, F., Krueger, T., and Clark, M.:
Impacts of uncertain river flow data on rainfall-runoff model calibration and
discharge predictions, Hydrol. Process., 24, 1270–1284,
<ext-link xlink:href="https://doi.org/10.1002/hyp.7587" ext-link-type="DOI">10.1002/hyp.7587</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bib31"><label>31</label><mixed-citation>McMillan, H., Krueger, T., and Freer, J.: Benchmarking observational
uncertainties for hydrology: rainfall, river discharge and water quality,
Hydrol. Process., 26, 4078–4111, <ext-link xlink:href="https://doi.org/10.1002/hyp.9384" ext-link-type="DOI">10.1002/hyp.9384</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bib32"><label>32</label><mixed-citation>Michel, C., Perrin, C., and Andreassian, V.: The exponential store: a correct
formulation for rainfall – runoff modelling, Hydrol. Sci. J., 48, 109–124,
<ext-link xlink:href="https://doi.org/10.1623/hysj.48.1.109.43484" ext-link-type="DOI">10.1623/hysj.48.1.109.43484</ext-link>, 2003.</mixed-citation></ref>
      <ref id="bib1.bib33"><label>33</label><mixed-citation>Oudin, L., Hervieu, F., Michel, C., Perrin, C., Andréassian, V., Anctil,
F., and Loumagne, C.: Which potential evapotranspiration input for a lumped
rainfall-runoff model?, J. Hydrol., 303, 290–306,
<ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2004.08.026" ext-link-type="DOI">10.1016/j.jhydrol.2004.08.026</ext-link>, 2005.</mixed-citation></ref>
      <ref id="bib1.bib34"><label>34</label><mixed-citation>Perrin, C., Michel, C., and Andréassian, V.: Improvement of a
parsimonious model for streamflow simulation, J. Hydrol., 279, 275–289,
<ext-link xlink:href="https://doi.org/10.1016/S0022-1694(03)00225-7" ext-link-type="DOI">10.1016/S0022-1694(03)00225-7</ext-link>, 2003.</mixed-citation></ref>
      <ref id="bib1.bib35"><label>35</label><mixed-citation>Perrin, C., Ouding, L., Andreassian, V., Rojas-Serna, C., Michel, C., and
Mathevet, T.: Impact of limited streamflow data on the efficiency and the
parameters of rainfall-runoff models, Hydrol. Sci. J., 52, 131–151,
<ext-link xlink:href="https://doi.org/10.1623/hysj.52.1.131" ext-link-type="DOI">10.1623/hysj.52.1.131</ext-link>, 2007.</mixed-citation></ref>
      <ref id="bib1.bib36"><label>36</label><mixed-citation>Pool, S., Viviroli, D., and Seibert, J.: Prediction of hydrographs and
flow-duration curves in almost ungauged catchments: Which runoff measurements
are most informative for model calibration?, J. Hydrol., 554, 613–622,
<ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2017.09.037" ext-link-type="DOI">10.1016/j.jhydrol.2017.09.037</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bib37"><label>37</label><mixed-citation>Ruhi, A., Messager, M. L., and Olden, J. D.: Tracking the pulse of the
Earth's fresh waters, Nat. Sustain., 1, 198–203,
<ext-link xlink:href="https://doi.org/10.1038/s41893-018-0047-7" ext-link-type="DOI">10.1038/s41893-018-0047-7</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bib38"><label>38</label><mixed-citation>Scherrer AG: Verzeichnis grosser Hochwasserabflüsse in schweizerischen
Einzugsgebieten, Auftraggeber: Bundesamt für Umwelt (BAFU), Abteilung
Hydrologie, Reinach, 2017.
 </mixed-citation></ref><?xmltex \hack{\newpage}?>
      <ref id="bib1.bib39"><label>39</label><mixed-citation>Seibert, J.: Multi-criteria calibration of a conceptual runoff model using a
genetic algorithm, Hydrol. Earth Syst. Sci., 4, 215–224,
<ext-link xlink:href="https://doi.org/10.5194/hess-4-215-2000" ext-link-type="DOI">10.5194/hess-4-215-2000</ext-link>, 2000.</mixed-citation></ref>
      <ref id="bib1.bib40"><label>40</label><mixed-citation>Seibert, J. and Beven, K. J.: Gauging the ungauged basin: how many discharge
measurements are needed?, Hydrol. Earth Syst. Sci., 13, 883–892,
<ext-link xlink:href="https://doi.org/10.5194/hess-13-883-2009" ext-link-type="DOI">10.5194/hess-13-883-2009</ext-link>, 2009.</mixed-citation></ref>
      <ref id="bib1.bib41"><label>41</label><mixed-citation>Seibert, J. and McDonnell, J. J.: Gauging the Ungauged Basin?: Relative Value
of Soft and Hard Data, J. Hydrol. Eng., 20, A4014004-1–6,
<ext-link xlink:href="https://doi.org/10.1061/(ASCE)HE.1943-5584.0000861" ext-link-type="DOI">10.1061/(ASCE)HE.1943-5584.0000861</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bib42"><label>42</label><mixed-citation>Seibert, J. and Vis, M. J. P.: Teaching hydrological modeling with a
user-friendly catchment-runoff-model software package, Hydrol. Earth Syst.
Sci., 16, 3315–3325, <ext-link xlink:href="https://doi.org/10.5194/hess-16-3315-2012" ext-link-type="DOI">10.5194/hess-16-3315-2012</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bib43"><label>43</label><mixed-citation>Seibert, J. and Vis, M. J. P.: How informative are stream level observations
in different geographic regions?, Hydrol. Process., 30, 2498–2508,
<ext-link xlink:href="https://doi.org/10.1002/hyp.10887" ext-link-type="DOI">10.1002/hyp.10887</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bib44"><label>44</label><mixed-citation>Seibert, J., Vis, M. J. P., Lewis, E., and van Meerveld, H. J.: Upper and
lower benchmarks in hydrological modelling, Hydrol. Process., 32, 1120–1125,
<ext-link xlink:href="https://doi.org/10.1002/hyp.11476" ext-link-type="DOI">10.1002/hyp.11476</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bib45"><label>45</label><mixed-citation>Shiklomanov, A. I., Lammers, R. B., and Vörösmarty, C. J.: Widespread
decline in hydrological monitoring threatens Pan-Arctic Research, Eos, Trans.
Am. Geophys. Union, 83, 13–17, <ext-link xlink:href="https://doi.org/10.1029/2002EO000007" ext-link-type="DOI">10.1029/2002EO000007</ext-link>, 2002.</mixed-citation></ref>
      <ref id="bib1.bib46"><label>46</label><mixed-citation>Sideris, I. V., Gabella, M., Erdin, R., and Germann, U.: Real-time
radar-rain-gauge merging using spatio-temporal co-kriging with external
drift in the alpine terrain of Switzerland, Q. J. Roy. Meteor. Soc.,
140, 1097–1111, <ext-link xlink:href="https://doi.org/10.1002/qj.2188" ext-link-type="DOI">10.1002/qj.2188</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bib47"><label>47</label><mixed-citation>
Strobl, B., Etter, S., van Meerveld, I., and Seibert, J.: Accuracy of
Crowdsourced Streamflow and Stream Level Class Estimates, Hydrol. Sci. J.,
(special issue on hydrological data: opportunities and barriers), in review,
2018.</mixed-citation></ref>
      <ref id="bib1.bib48"><label>48</label><mixed-citation>van Meerveld, H. J. I., Vis, M. J. P., and Seibert, J.: Information content
of stream level class data for hydrological model calibration, Hydrol. Earth
Syst. Sci., 21, 4895–4905, <ext-link xlink:href="https://doi.org/10.5194/hess-21-4895-2017" ext-link-type="DOI">10.5194/hess-21-4895-2017</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bib49"><label>49</label><mixed-citation>Vrugt, J. A., Gupta, H. V., Dekker, S. C., Sorooshian, S., Wagener, T., and
Bouten, W.: Application of stochastic parameter optimization to the
Sacramento Soil Moisture Accounting model, J. Hydrol., 325, 288–307,
<ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2005.10.041" ext-link-type="DOI">10.1016/j.jhydrol.2005.10.041</ext-link>, 2006.</mixed-citation></ref>
      <ref id="bib1.bib50"><label>50</label><mixed-citation>Weeser, B., Stenfert Kroese, J., Jacobs, S. R., Njue, N., Kemboi, Z., Ran,
A., Rufino, M. C., and Breuer, L.: Citizen science pioneers in Kenya – A
crowdsourced approach for hydrological monitoring, Sci. Total Environ.,
631–632, 1590–1599, <ext-link xlink:href="https://doi.org/10.1016/j.scitotenv.2018.03.130" ext-link-type="DOI">10.1016/j.scitotenv.2018.03.130</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bib51"><label>51</label><mixed-citation>Yapo, P. O., Gupta, H. V., and Sorooshian, S.: Automatic calibration of
conceptual rainfall-runoff models: sensitivity to calibration data, J.
Hydrol., 181, 23–48, <ext-link xlink:href="https://doi.org/10.1016/0022-1694(95)02918-4" ext-link-type="DOI">10.1016/0022-1694(95)02918-4</ext-link>, 1996.</mixed-citation></ref>

  </ref-list></back>
    <!--<article-title-html>Value of uncertain streamflow observations for hydrological modelling</article-title-html>
<abstract-html><p>Previous studies have shown that hydrological models can be parameterised using a
limited number of streamflow measurements. Citizen science projects can
collect such data for otherwise ungauged catchments but an important question
is whether these observations are informative given that these streamflow
estimates will be uncertain. We assess the value of inaccurate streamflow estimates for calibration of a
simple bucket-type runoff model for six Swiss catchments. We pretended that
only a few observations were available and that these were affected by
different levels of inaccuracy. The level of inaccuracy was based on a
log-normal error distribution that was fitted to streamflow estimates of 136
citizens for medium-sized streams. Two additional levels of inaccuracy, for
which the standard deviation of the error distribution was divided by 2 and
4, were used as well. Based on these error distributions,
random errors were added to the measured hourly streamflow data. New time
series with different temporal resolutions were created from these synthetic
streamflow time series. These included scenarios with one observation each
week or month, as well as scenarios that are more realistic for crowdsourced
data that generally have an irregular distribution of data points throughout
the year, or focus on a particular season. The model was then calibrated for
the six catchments using the synthetic time series for a dry, an average and
a wet year. The performance of the calibrated models was evaluated based on
the measured hourly streamflow time series. The results indicate that
streamflow estimates from untrained citizens are not informative for model
calibration. However, if the errors can be reduced, the estimates are
informative and useful for model calibration. As expected, the model
performance increased when the number of observations used for calibration
increased. The model performance was also better when the observations were
more evenly distributed throughout the year. This study indicates that
uncertain streamflow estimates can be useful for model calibration but that
the estimates by citizen scientists need to be improved by training or more
advanced data filtering before they are useful for model calibration.</p></abstract-html>
<ref-html id="bib1.bib1"><label>1</label><mixed-citation>
Aschwanden, H. and Weingartner, R.: Die Abflussregimes der Schweiz,
Geographisches Institut der Universität Bern, Abteilung Physikalische
Geographie, Gewässerkunde, Bern, Switzerland, 1985.
</mixed-citation></ref-html>
<ref-html id="bib1.bib2"><label>2</label><mixed-citation>
Bergström, S.: Development and application of a conceptual runoff model for
Scandinavian catchments, Sveriges Meteorologiska och Hydrologiska Institut
(SMHI), Norrköping, Sweden, available at:
<a href="https://www.researchgate.net/publication/255274162_Development_and_Application_of_a_Conceptual_Runoff_Model_for_Scandinavian_Catchments" target="_blank">https://www.researchgate.net/publication/255274162_Development_and_Application_of_a_Conceptual_Runoff_Model_for_Scandinavian_Catchments</a>
(last access: 3 October 2018), 1976.
</mixed-citation></ref-html>
<ref-html id="bib1.bib3"><label>3</label><mixed-citation>
Bergström, S., Carlsson, B., Grahn, G., and Johansson, B.: A More
Consistent Approach to Watershed Response in the HBV Model, Vannet i Nord.,
4, 1997.
</mixed-citation></ref-html>
<ref-html id="bib1.bib4"><label>4</label><mixed-citation>
Beven, K.: Facets of uncertainty: epistemic uncertainty, non-stationarity,
likelihood, hypothesis testing, and communication, Hydrol. Sci. J., 61,
1652–1665, <a href="https://doi.org/10.1080/02626667.2015.1031761" target="_blank">https://doi.org/10.1080/02626667.2015.1031761</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib5"><label>5</label><mixed-citation>
Beven, K. and Westerberg, I.: On red herrings and real herrings:
disinformation and information in hydrological inference, Hydrol. Process.,
25, 1676–1680, <a href="https://doi.org/10.1002/hyp.7963" target="_blank">https://doi.org/10.1002/hyp.7963</a>, 2011.
</mixed-citation></ref-html>
<ref-html id="bib1.bib6"><label>6</label><mixed-citation>
Bonferroni, C. E.: Teoria statistica delle classi e calcolo delle
probabilità, st. Super. di Sci. Econom. e Commerciali di Firenze,
Istituto superiore di scienze economiche e commerciali, Florence, Italy, 62
pp., 1936.
</mixed-citation></ref-html>
<ref-html id="bib1.bib7"><label>7</label><mixed-citation>
Brath, A., Montanari, A., and Toth, E.: Analysis of the effects of different
scenarios of historical data availability on the calibration of a
spatially-distributed hydrological model, J. Hydrol., 291, 232–253,
<a href="https://doi.org/10.1016/j.jhydrol.2003.12.044" target="_blank">https://doi.org/10.1016/j.jhydrol.2003.12.044</a>, 2004.
</mixed-citation></ref-html>
<ref-html id="bib1.bib8"><label>8</label><mixed-citation>
Buytaert, W., Zulkafli, Z., Grainger, S., Acosta, L., Alemie, T. C.,
Bastiaensen, J., De BiÃv̈re, B., Bhusal, J., Clark, J., Dewulf, A.,
Foggin, M., Hannah, D. M., Hergarten, C., Isaeva, A., Karpouzoglou, T.,
Pandeya, B., Paudel, D., Sharma, K., Steenhuis, T., Tilahun, S., Van Hecken,
G., and Zhumanova, M.: Citizen science in hydrology and water resources:
opportunities for knowledge generation, ecosystem service management, and
sustainable development, Front. Earth Sci., 2, 21 pp.,
<a href="https://doi.org/10.3389/feart.2014.00026" target="_blank">https://doi.org/10.3389/feart.2014.00026</a>, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib9"><label>9</label><mixed-citation>
Davids, J. C., van de Giesen, N., and Rutten, M.: Continuity vs. the
Crowd – Tradeoffs Between Continuous and Intermittent Citizen Hydrology
Streamflow Observations, Environ. Manage., 60, 12–29,
<a href="https://doi.org/10.1007/s00267-017-0872-x" target="_blank">https://doi.org/10.1007/s00267-017-0872-x</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib10"><label>10</label><mixed-citation>
Davids, J. C., Rutten, M. M., Shah, R. D. T., Shah, D. N., Devkota, N.,
Izeboud, P., Pandey, A., and van de Giesen, N.: Quantifying the connections
– linkages between land-use and water in the Kathmandu Valley, Nepal,
Environ. Monit. Assess., 190, 17 pp., <a href="https://doi.org/10.1007/s10661-018-6687-2" target="_blank">https://doi.org/10.1007/s10661-018-6687-2</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib11"><label>11</label><mixed-citation>
Dickinson, J. L., Zuckerberg, B., and Bonter, D. N.: Citizen Science as an
Ecological Research Tool: Challenges and Benefits, Annu. Rev. Ecol. Evol.
Syst., 41, 149–172, <a href="https://doi.org/10.1146/annurev-ecolsys-102209-144636" target="_blank">https://doi.org/10.1146/annurev-ecolsys-102209-144636</a>, 2010.
</mixed-citation></ref-html>
<ref-html id="bib1.bib12"><label>12</label><mixed-citation>
Dunn, O. J.: Estimation of the Medians for Dependent Variables, Ann. Math.
Stat., 30, 192–197, <a href="https://doi.org/10.1214/aoms/1177706374" target="_blank">https://doi.org/10.1214/aoms/1177706374</a>, 1959.
</mixed-citation></ref-html>
<ref-html id="bib1.bib13"><label>13</label><mixed-citation>
Dunn, O. J.: Multiple Comparisons among Means, J. Am. Stat. Assoc., 56,
52–64, <a href="https://doi.org/10.1080/01621459.1961.10482090" target="_blank">https://doi.org/10.1080/01621459.1961.10482090</a>, 1961.
</mixed-citation></ref-html>
<ref-html id="bib1.bib14"><label>14</label><mixed-citation>
Ewen, T., Brönnimann, S., and Annis, J.: An extended Pacific-North
American index from upper-air historical data back to 1922, J. Climate, 21,
1295–1308, <a href="https://doi.org/10.1175/2007JCLI1951.1" target="_blank">https://doi.org/10.1175/2007JCLI1951.1</a>, 2008.
</mixed-citation></ref-html>
<ref-html id="bib1.bib15"><label>15</label><mixed-citation>
Finger, D., Pellicciotti, F., Konz, M., Rimkus, S., and Burlando, P.: The
value of glacier mass balance, satellite snow cover images, and hourly
discharge for improving the performance of a physically based distributed
hydrological model, Water Resour. Res., 47, 14 pp.,
<a href="https://doi.org/10.1029/2010WR009824" target="_blank">https://doi.org/10.1029/2010WR009824</a>, 2011.
</mixed-citation></ref-html>
<ref-html id="bib1.bib16"><label>16</label><mixed-citation>
Finger, D., Vis, M., Huss, M., and Seibert, J.: The value of multiple data
set calibration versus model complexity for improving the performance of
hydrological models in mountain catchments, Water Resour. Res., 51,
1939–1958, <a href="https://doi.org/10.1002/2014WR015712" target="_blank">https://doi.org/10.1002/2014WR015712</a>, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib17"><label>17</label><mixed-citation>
Fitzner, D., Sester, M., Haberlandt, U., and Rabiei, E.: Rainfall Estimation
with a Geosensor Network of Cars – Theoretical Considerations and First
Results, Photogramm. Fernerkun., 2013, 93–103,
<a href="https://doi.org/10.1127/1432-8364/2013/0161" target="_blank">https://doi.org/10.1127/1432-8364/2013/0161</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib18"><label>18</label><mixed-citation>
Gibson, E. J. and Bergman, R.: The effect of training on absolute estimation
of distance over the ground, J. Exp. Psychol., 48, 473–482,
<a href="https://doi.org/10.1037/h0055007" target="_blank">https://doi.org/10.1037/h0055007</a>, 1954.
</mixed-citation></ref-html>
<ref-html id="bib1.bib19"><label>19</label><mixed-citation>
Haberlandt, U. and Sester, M.: Areal rainfall estimation using moving cars as
rain gauges – a modelling study, Hydrol. Earth Syst. Sci., 14, 1139–1151,
<a href="https://doi.org/10.5194/hess-14-1139-2010" target="_blank">https://doi.org/10.5194/hess-14-1139-2010</a>, 2010.
</mixed-citation></ref-html>
<ref-html id="bib1.bib20"><label>20</label><mixed-citation>
Harrelson, C. C., Rawlins, C. L., and Potyondy, J. P.: Stream channel
reference sites: an illustrated guide to field technique, Department of
Agriculture, Forest Service, Rocky Mountain Forest and Range Experiment
Station location, Fort Collins, CO, US, 1994.
</mixed-citation></ref-html>
<ref-html id="bib1.bib21"><label>21</label><mixed-citation>
Horner, I., Renard, B., Le Coz, J., Branger, F., McMillan, H. K., and
Pierrefeu, G.: Impact of Stage Measurement Errors on Streamflow Uncertainty,
Water Resour. Res., 54, 1952–1976, <a href="https://doi.org/10.1002/2017WR022039" target="_blank">https://doi.org/10.1002/2017WR022039</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib22"><label>22</label><mixed-citation>
Juston, J., Seibert, J., and Johansson, P.: Temporal sampling strategies and
uncertainty in calibrating a conceptual hydrological model for a small boreal
catchment, Hydrol. Process., 23, 3093–3109, <a href="https://doi.org/10.1002/hyp.7421" target="_blank">https://doi.org/10.1002/hyp.7421</a>, 2009.
</mixed-citation></ref-html>
<ref-html id="bib1.bib23"><label>23</label><mixed-citation>
Koch, J. and Stisen, S.: Citizen science: A new perspective to advance
spatial pattern evaluation in hydrology, PLoS One, 12, 1–20,
<a href="https://doi.org/10.1371/journal.pone.0178165" target="_blank">https://doi.org/10.1371/journal.pone.0178165</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib24"><label>24</label><mixed-citation>
Le Coz, J., Renard, B., Bonnifait, L., Branger, F., and Le Boursicaud, R.:
Combining hydraulic knowledge and uncertain gaugings in the estimation of
hydrometric rating curves: A Bayesian approach, J. Hydrol., 509, 573–587,
<a href="https://doi.org/10.1016/j.jhydrol.2013.11.016" target="_blank">https://doi.org/10.1016/j.jhydrol.2013.11.016</a>, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib25"><label>25</label><mixed-citation>
Lidén, R. and Harlin, J.: Analysis of conceptual rainfall–runoff
modelling performance in different climates, J. Hydrol., 238, 231–247,
<a href="https://doi.org/10.1016/S0022-1694(00)00330-9" target="_blank">https://doi.org/10.1016/S0022-1694(00)00330-9</a>, 2000.
</mixed-citation></ref-html>
<ref-html id="bib1.bib26"><label>26</label><mixed-citation>
Lindström, G., Johansson, B., Persson, M., Gardelin, M., and
Bergström, S.: Development and test of the distributed HBV-96
hydrological model, J. Hydrol., 201, 272–288,
<a href="https://doi.org/10.1016/S0022-1694(97)00041-3" target="_blank">https://doi.org/10.1016/S0022-1694(97)00041-3</a>, 1997.
</mixed-citation></ref-html>
<ref-html id="bib1.bib27"><label>27</label><mixed-citation>
Lowry, C. S. and Fienen, M. N.: CrowdHydrology: Crowdsourcing Hydrologic Data
and Engaging Citizen Scientists, Ground Water, 51, 151–156,
<a href="https://doi.org/10.1111/j.1745-6584.2012.00956.x" target="_blank">https://doi.org/10.1111/j.1745-6584.2012.00956.x</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib28"><label>28</label><mixed-citation>
Mazzoleni, M., Verlaan, M., Alfonso, L., Monego, M., Norbiato, D., Ferri, M.,
and Solomatine, D. P.: Can assimilation of crowdsourced data in hydrological
modelling improve flood prediction?, Hydrol. Earth Syst. Sci., 21, 839–861,
<a href="https://doi.org/10.5194/hess-21-839-2017" target="_blank">https://doi.org/10.5194/hess-21-839-2017</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib29"><label>29</label><mixed-citation>
McGuinness, J. and Bordne, E.: A comparison of lysimeter-derived potential
evapotranspiration with computed values, Agricultural Research Service –
United States Department of Agriculture Location, Washington D.C., 1972.
</mixed-citation></ref-html>
<ref-html id="bib1.bib30"><label>30</label><mixed-citation>
McMillan, H., Freer, J., Pappenberger, F., Krueger, T., and Clark, M.:
Impacts of uncertain river flow data on rainfall-runoff model calibration and
discharge predictions, Hydrol. Process., 24, 1270–1284,
<a href="https://doi.org/10.1002/hyp.7587" target="_blank">https://doi.org/10.1002/hyp.7587</a>, 2010.
</mixed-citation></ref-html>
<ref-html id="bib1.bib31"><label>31</label><mixed-citation>
McMillan, H., Krueger, T., and Freer, J.: Benchmarking observational
uncertainties for hydrology: rainfall, river discharge and water quality,
Hydrol. Process., 26, 4078–4111, <a href="https://doi.org/10.1002/hyp.9384" target="_blank">https://doi.org/10.1002/hyp.9384</a>, 2012.
</mixed-citation></ref-html>
<ref-html id="bib1.bib32"><label>32</label><mixed-citation>
Michel, C., Perrin, C., and Andreassian, V.: The exponential store: a correct
formulation for rainfall – runoff modelling, Hydrol. Sci. J., 48, 109–124,
<a href="https://doi.org/10.1623/hysj.48.1.109.43484" target="_blank">https://doi.org/10.1623/hysj.48.1.109.43484</a>, 2003.
</mixed-citation></ref-html>
<ref-html id="bib1.bib33"><label>33</label><mixed-citation>
Oudin, L., Hervieu, F., Michel, C., Perrin, C., Andréassian, V., Anctil,
F., and Loumagne, C.: Which potential evapotranspiration input for a lumped
rainfall-runoff model?, J. Hydrol., 303, 290–306,
<a href="https://doi.org/10.1016/j.jhydrol.2004.08.026" target="_blank">https://doi.org/10.1016/j.jhydrol.2004.08.026</a>, 2005.
</mixed-citation></ref-html>
<ref-html id="bib1.bib34"><label>34</label><mixed-citation>
Perrin, C., Michel, C., and Andréassian, V.: Improvement of a
parsimonious model for streamflow simulation, J. Hydrol., 279, 275–289,
<a href="https://doi.org/10.1016/S0022-1694(03)00225-7" target="_blank">https://doi.org/10.1016/S0022-1694(03)00225-7</a>, 2003.
</mixed-citation></ref-html>
<ref-html id="bib1.bib35"><label>35</label><mixed-citation>
Perrin, C., Ouding, L., Andreassian, V., Rojas-Serna, C., Michel, C., and
Mathevet, T.: Impact of limited streamflow data on the efficiency and the
parameters of rainfall-runoff models, Hydrol. Sci. J., 52, 131–151,
<a href="https://doi.org/10.1623/hysj.52.1.131" target="_blank">https://doi.org/10.1623/hysj.52.1.131</a>, 2007.
</mixed-citation></ref-html>
<ref-html id="bib1.bib36"><label>36</label><mixed-citation>
Pool, S., Viviroli, D., and Seibert, J.: Prediction of hydrographs and
flow-duration curves in almost ungauged catchments: Which runoff measurements
are most informative for model calibration?, J. Hydrol., 554, 613–622,
<a href="https://doi.org/10.1016/j.jhydrol.2017.09.037" target="_blank">https://doi.org/10.1016/j.jhydrol.2017.09.037</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib37"><label>37</label><mixed-citation>
Ruhi, A., Messager, M. L., and Olden, J. D.: Tracking the pulse of the
Earth's fresh waters, Nat. Sustain., 1, 198–203,
<a href="https://doi.org/10.1038/s41893-018-0047-7" target="_blank">https://doi.org/10.1038/s41893-018-0047-7</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib38"><label>38</label><mixed-citation>
Scherrer AG: Verzeichnis grosser Hochwasserabflüsse in schweizerischen
Einzugsgebieten, Auftraggeber: Bundesamt für Umwelt (BAFU), Abteilung
Hydrologie, Reinach, 2017.

</mixed-citation></ref-html>
<ref-html id="bib1.bib39"><label>39</label><mixed-citation>
Seibert, J.: Multi-criteria calibration of a conceptual runoff model using a
genetic algorithm, Hydrol. Earth Syst. Sci., 4, 215–224,
<a href="https://doi.org/10.5194/hess-4-215-2000" target="_blank">https://doi.org/10.5194/hess-4-215-2000</a>, 2000.
</mixed-citation></ref-html>
<ref-html id="bib1.bib40"><label>40</label><mixed-citation>
Seibert, J. and Beven, K. J.: Gauging the ungauged basin: how many discharge
measurements are needed?, Hydrol. Earth Syst. Sci., 13, 883–892,
<a href="https://doi.org/10.5194/hess-13-883-2009" target="_blank">https://doi.org/10.5194/hess-13-883-2009</a>, 2009.
</mixed-citation></ref-html>
<ref-html id="bib1.bib41"><label>41</label><mixed-citation>
Seibert, J. and McDonnell, J. J.: Gauging the Ungauged Basin?: Relative Value
of Soft and Hard Data, J. Hydrol. Eng., 20, A4014004-1–6,
<a href="https://doi.org/10.1061/(ASCE)HE.1943-5584.0000861" target="_blank">https://doi.org/10.1061/(ASCE)HE.1943-5584.0000861</a>, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib42"><label>42</label><mixed-citation>
Seibert, J. and Vis, M. J. P.: Teaching hydrological modeling with a
user-friendly catchment-runoff-model software package, Hydrol. Earth Syst.
Sci., 16, 3315–3325, <a href="https://doi.org/10.5194/hess-16-3315-2012" target="_blank">https://doi.org/10.5194/hess-16-3315-2012</a>, 2012.
</mixed-citation></ref-html>
<ref-html id="bib1.bib43"><label>43</label><mixed-citation>
Seibert, J. and Vis, M. J. P.: How informative are stream level observations
in different geographic regions?, Hydrol. Process., 30, 2498–2508,
<a href="https://doi.org/10.1002/hyp.10887" target="_blank">https://doi.org/10.1002/hyp.10887</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib44"><label>44</label><mixed-citation>
Seibert, J., Vis, M. J. P., Lewis, E., and van Meerveld, H. J.: Upper and
lower benchmarks in hydrological modelling, Hydrol. Process., 32, 1120–1125,
<a href="https://doi.org/10.1002/hyp.11476" target="_blank">https://doi.org/10.1002/hyp.11476</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib45"><label>45</label><mixed-citation>
Shiklomanov, A. I., Lammers, R. B., and Vörösmarty, C. J.: Widespread
decline in hydrological monitoring threatens Pan-Arctic Research, Eos, Trans.
Am. Geophys. Union, 83, 13–17, <a href="https://doi.org/10.1029/2002EO000007" target="_blank">https://doi.org/10.1029/2002EO000007</a>, 2002.
</mixed-citation></ref-html>
<ref-html id="bib1.bib46"><label>46</label><mixed-citation>
Sideris, I. V., Gabella, M., Erdin, R., and Germann, U.: Real-time
radar-rain-gauge merging using spatio-temporal co-kriging with external
drift in the alpine terrain of Switzerland, Q. J. Roy. Meteor. Soc.,
140, 1097–1111, <a href="https://doi.org/10.1002/qj.2188" target="_blank">https://doi.org/10.1002/qj.2188</a>, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib47"><label>47</label><mixed-citation>
Strobl, B., Etter, S., van Meerveld, I., and Seibert, J.: Accuracy of
Crowdsourced Streamflow and Stream Level Class Estimates, Hydrol. Sci. J.,
(special issue on hydrological data: opportunities and barriers), in review,
2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib48"><label>48</label><mixed-citation>
van Meerveld, H. J. I., Vis, M. J. P., and Seibert, J.: Information content
of stream level class data for hydrological model calibration, Hydrol. Earth
Syst. Sci., 21, 4895–4905, <a href="https://doi.org/10.5194/hess-21-4895-2017" target="_blank">https://doi.org/10.5194/hess-21-4895-2017</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib49"><label>49</label><mixed-citation>
Vrugt, J. A., Gupta, H. V., Dekker, S. C., Sorooshian, S., Wagener, T., and
Bouten, W.: Application of stochastic parameter optimization to the
Sacramento Soil Moisture Accounting model, J. Hydrol., 325, 288–307,
<a href="https://doi.org/10.1016/j.jhydrol.2005.10.041" target="_blank">https://doi.org/10.1016/j.jhydrol.2005.10.041</a>, 2006.
</mixed-citation></ref-html>
<ref-html id="bib1.bib50"><label>50</label><mixed-citation>
Weeser, B., Stenfert Kroese, J., Jacobs, S. R., Njue, N., Kemboi, Z., Ran,
A., Rufino, M. C., and Breuer, L.: Citizen science pioneers in Kenya – A
crowdsourced approach for hydrological monitoring, Sci. Total Environ.,
631–632, 1590–1599, <a href="https://doi.org/10.1016/j.scitotenv.2018.03.130" target="_blank">https://doi.org/10.1016/j.scitotenv.2018.03.130</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib51"><label>51</label><mixed-citation>
Yapo, P. O., Gupta, H. V., and Sorooshian, S.: Automatic calibration of
conceptual rainfall-runoff models: sensitivity to calibration data, J.
Hydrol., 181, 23–48, <a href="https://doi.org/10.1016/0022-1694(95)02918-4" target="_blank">https://doi.org/10.1016/0022-1694(95)02918-4</a>, 1996.
</mixed-citation></ref-html>--></article>
