<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" dtd-version="3.0">
  <front>
    <journal-meta>
<journal-id journal-id-type="publisher">HESS</journal-id>
<journal-title-group>
<journal-title>Hydrology and Earth System Sciences</journal-title>
<abbrev-journal-title abbrev-type="publisher">HESS</abbrev-journal-title>
<abbrev-journal-title abbrev-type="nlm-ta">Hydrol. Earth Syst. Sci.</abbrev-journal-title>
</journal-title-group>
<issn pub-type="epub">1607-7938</issn>
<publisher><publisher-name>Copernicus GmbH</publisher-name>
<publisher-loc>Göttingen, Germany</publisher-loc>
</publisher>
</journal-meta>

    <article-meta>
      <article-id pub-id-type="doi">10.5194/hess-19-2925-2015</article-id><title-group><article-title>TopREML: a topological restricted maximum likelihood approach to
regionalize trended runoff signatures in stream networks</article-title>
      </title-group><?xmltex \runningtitle{TopREML -- runoff regionalization on stream networks}?><?xmltex \runningauthor{M.~F.~M\"{u}ller and S.~E.Thompson}?>
      <contrib-group>
        <contrib contrib-type="author" corresp="yes" rid="aff1">
          <name><surname>Müller</surname><given-names>M. F.</given-names></name>
          <email>marc.f.muller@gmail.com</email>
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Thompson</surname><given-names>S. E.</given-names></name>
          
        </contrib>
        <aff id="aff1"><institution>Department of Civil and Environmental Engineering, Davis Hall, University of California, Berkeley, CA, USA</institution>
        </aff>
      </contrib-group>
      <author-notes><corresp id="corr1">M. F. Müller (marc.f.muller@gmail.com)</corresp></author-notes><pub-date><day>24</day><month>June</month><year>2015</year></pub-date>
      
      <volume>19</volume>
      <issue>6</issue>
      <fpage>2925</fpage><lpage>2942</lpage>
      <history>
        <date date-type="received"><day>12</day><month>December</month><year>2014</year></date>
           <date date-type="rev-request"><day>29</day><month>January</month><year>2015</year></date>
           <date date-type="rev-recd"><day>01</day><month>May</month><year>2015</year></date>
           <date date-type="accepted"><day>03</day><month>June</month><year>2015</year></date>
      </history>
      <permissions>
<license license-type="open-access">
<license-p>This work is licensed under a Creative Commons Attribution 3.0 Unported License. To view a copy of this license, visit <ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/licenses/by/3.0/">http://creativecommons.org/licenses/by/3.0/</ext-link></license-p>
</license>
</permissions><self-uri xlink:href="https://hess.copernicus.org/articles/19/2925/2015/hess-19-2925-2015.html">This article is available from https://hess.copernicus.org/articles/19/2925/2015/hess-19-2925-2015.html</self-uri>
<self-uri xlink:href="https://hess.copernicus.org/articles/19/2925/2015/hess-19-2925-2015.pdf">The full text article is available as a PDF file from https://hess.copernicus.org/articles/19/2925/2015/hess-19-2925-2015.pdf</self-uri>


      <abstract>
    <p>We introduce topological restricted maximum likelihood (TopREML) as a method to predict runoff signatures in ungauged
basins. The approach is based on the use of linear mixed models with
spatially correlated random effects. The nested nature of streamflow networks
is taken into account by using water balance considerations to constrain the
covariance structure of runoff and to account for the stronger spatial
correlation between flow-connected basins. The restricted maximum likelihood
(REML) framework generates the best linear unbiased predictor (BLUP) of both
the predicted variable and the associated prediction uncertainty, even when
incorporating observable covariates into the model. The method was
successfully tested in cross-validation analyses on mean streamflow and
runoff frequency in Nepal (sparsely gauged) and Austria (densely gauged),
where it matched the performance of comparable methods in the prediction of
the considered runoff signature, while significantly outperforming them in
the prediction of the associated modeling uncertainty. The ability of TopREML to
combine deterministic and stochastic information to generate BLUPs of the
prediction variable and its uncertainty makes it a particularly versatile
method that can readily be applied in both densely gauged basins, where it
takes advantage of spatial covariance information, and data-scarce regions,
where it can rely on covariates, which are increasingly observable via remote-sensing technology.</p>
  </abstract>
    </article-meta>
  </front>
<body>
      

<sec id="Ch1.S1" sec-type="intro">
  <title>Introduction</title>
      <p>Regionalizing runoff and streamflow for the purposes of
making predictions in ungauged basins (PUB) continues to be one of the
major contemporary challenges in hydrology. At global, regional and local
scales only a small fraction of catchments are monitored for streamflow
<xref ref-type="bibr" rid="bib1.bibx4" id="paren.1"/>, and this fraction is at risk of decreasing
given the ongoing challenge of maintaining existing gauging stations
<xref ref-type="bibr" rid="bib1.bibx46" id="paren.2"/>. Reliable information about local streamflows is
essential for the management of water resources, especially in the context
of changing climate, ecosystem and demography; flow prediction
uncertainties are bound to propagate and lead to significantly suboptimal
design and management decisions
<xref ref-type="bibr" rid="bib1.bibx41 " id="paren.3"><named-content content-type="pre">e.g.,</named-content></xref>. Techniques are needed to
maximize the use of available data in data-scarce regions to accurately
predict streamflow, while providing a reliable estimate of the related
modeling uncertainty.</p>
<sec id="Ch1.S1.SS1">
  <title>Linear models</title>
      <p>There are a number of approaches to predicting runoff in ungauged catchments,
including process-based modeling <xref ref-type="bibr" rid="bib1.bibx32" id="paren.4"><named-content content-type="pre">e.g.,</named-content></xref>,
graphical methods based on the construction of isolines
<xref ref-type="bibr" rid="bib1.bibx2" id="paren.5"><named-content content-type="pre">e.g.,</named-content></xref>, and statistical approaches. Statistical
approaches are often implemented via linear regression, wherein the runoff
signature of interest is considered to be an unobservable random variable
correlated with observable features of both gauged and ungauged basins (e.g.,
rainfall, topography). Such linear models are well understood and widely
implemented, not only for PUB <xref ref-type="bibr" rid="bib1.bibx4" id="paren.6"><named-content content-type="pre">see review in </named-content><named-content content-type="post">p.83</named-content></xref>
but also across a wide variety of fields in the physical and social sciences
<xref ref-type="bibr" rid="bib1.bibx33" id="paren.7"><named-content content-type="pre">e.g.,</named-content></xref>.</p>
      <p>Spatial correlation is generally problematic for linear model predictions,
including the multiple regression approaches commonly used for
regionalization. For example if these models predict a hydrologic outcome,
<inline-formula><mml:math display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula>,
using a matrix <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold">X</mml:mi></mml:math></inline-formula> of observed features then the linear model has the form:
            <disp-formula id="Ch1.E1" content-type="numbered"><mml:math display="block"><mml:mrow><mml:mi>y</mml:mi><mml:mo>=</mml:mo><mml:mi mathvariant="bold">X</mml:mi><mml:mi mathvariant="italic">τ</mml:mi><mml:mo>+</mml:mo><mml:mi mathvariant="italic">η</mml:mi><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>
          Here <inline-formula><mml:math display="inline"><mml:mi mathvariant="italic">τ</mml:mi></mml:math></inline-formula> is an a priori unknown set of weights that represent the
influence of each external trend on the hydrological outcome being modeled.
The residuals, <inline-formula><mml:math display="inline"><mml:mi mathvariant="italic">η</mml:mi></mml:math></inline-formula>, are the observed variation of <inline-formula><mml:math display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula> that cannot be
explained by a linear relation with <inline-formula><mml:math display="inline"><mml:mi>X</mml:mi></mml:math></inline-formula>. If the residuals are
independent and identically distributed (iid), the best
linear unbiased predictions (BLUP) of both <inline-formula><mml:math display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula> and its uncertainty (i.e.,
<inline-formula><mml:math display="inline"><mml:mrow><mml:mtext>Var</mml:mtext><mml:mo>(</mml:mo><mml:mi>y</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>) can readily be obtained using ordinary least squares (OLS)
regression. Unfortunately, residuals are rarely  iid in hydrological
applications due to the spatial organization of hydrological processes around
the topology of river channel networks. This organization has the potential
to introduce non-random spatial correlations with a structure imposed by the
river network. To recover a suitable model in which residuals remain
independent requires that the model structure be altered to explicitly
account for the spatial and topological correlation in the residuals.</p>
</sec>
<sec id="Ch1.S1.SS2">
  <title>Spatial correlation models</title>
      <p>There are several techniques available to address spatially correlated data.
Within PUB, kriging-based geostatistical
methods <xref ref-type="bibr" rid="bib1.bibx10" id="paren.8"/> have been widely used
<xref ref-type="bibr" rid="bib1.bibx23 bib1.bibx20 bib1.bibx38 bib1.bibx39 bib1.bibx42" id="paren.9"><named-content content-type="pre">e.g.,</named-content></xref>.
In a geostatistical framework, a parametric function is used to model the
relationship between distance and covariance in observations. The ensuing
semi-variogram is assumed to be homogenous in space, and predictions at a
point are computed as a weighted sum of the available observations. The
weights are chosen to minimize the variance while meeting a given constraint
on the expected value of the prediction. In ordinary kriging for PUB applications,
this given constraint is simply the average of the streamflow signature as observed
in gauged catchments.  Ordinary kriging can also be extended as
“universal kriging” to include a linear combination of observable features
<xref ref-type="bibr" rid="bib1.bibx34" id="paren.10"/>. Kriging approaches are widely used to predict
spatially distributed point-scale processes like soil properties
<xref ref-type="bibr" rid="bib1.bibx17" id="paren.11"><named-content content-type="pre">e.g.,</named-content></xref> and climatic features
<xref ref-type="bibr" rid="bib1.bibx18" id="paren.12"><named-content content-type="pre">e.g.,</named-content></xref>. Although ordinary
kriging has also been used to interpolate runoff <xref ref-type="bibr" rid="bib1.bibx23" id="paren.13"><named-content content-type="pre">e.g.,</named-content></xref>,
the theoretical justification for this approach is less robust than for point-scale
processes. Runoff is organized around a topological network of stream channels,
and the covariance structure implied for prediction should reflect the higher
correlation between streamflow at watersheds that are “flow connected”
(i.e., share one or more subcatchments), compared to unconnected but spatially
proximate catchments. Currently, two broad classes of geostatistical
methods accommodate this network-aligned correlation structure.</p>
      <p>The first suite of methods posits the existence of an underlying point-scale
process, which is assumed to have a spatial auto-correlation structure that
allows kriging to be applied. Because the runoff point-scale process is only
observed as a spatially integrated measure made at specific gauged locations
along an organized network of streams, the spatial auto-correlation structure
of the point-scale process cannot itself be observed. Block-kriging
approaches
<xref ref-type="bibr" rid="bib1.bibx20 bib1.bibx38 bib1.bibx39" id="paren.14"/> infer the
semi-variogram of the (unobserved) point scale so as to best reproduce the
observed spatial correlation of the area-integrated runoff at the gauges – a
procedure known as regularization. The topology of the network is implicitly
accounted for by the fact that nested catchments have overlapping areas,
which affect the relation between observed (area integrated) and modeled
(point-scale) covariances. Yet, complex catchment shapes complicate the
regularization of semi-variograms, meaning that the estimation of the
point-scale process becomes analytically intractable and requires a
trial-and-error approach in most practical applications (e.g., Top-kriging;
<xref ref-type="bibr" rid="bib1.bibx42" id="text.15"/>). Top-kriging is an extension of the block-kriging
approach that accommodates non-stationary variables and short observation
records. Top-kriging provides an improved prediction method for hydrological
variables when compared to ordinary kriging or linear regression techniques
<xref ref-type="bibr" rid="bib1.bibx26 bib1.bibx49 bib1.bibx7" id="paren.16"/> and
was recently extended to account for deterministic trends
<xref ref-type="bibr" rid="bib1.bibx25" id="paren.17"/>. Top-kriging represents an important advance for
PUB, but it does have a few drawbacks: (i) the regularization process is
unintuitive, and requires extensive trial-and-error to determine both the
form of a suitable point-scale variogram, and its parameters; (ii) this
trial-and-error process is likely to be computationally expensive; (iii) like
all kriging techniques, the estimation of the variogram is challenging when
accounting for observable features: the presence of an unknown trend
coefficient and variogram leads to an underdetermined problem, making
consistent estimates for both challenging. <xref ref-type="bibr" rid="bib1.bibx10" id="text.18"><named-content content-type="post">p. 166</named-content></xref> showed that the presence of a trend tends to impose a spatially
inhomogeneous, negative bias on the estimated semi-variogram. The bias
increases quadratically with distance, meaning that estimates of the
long-range variance (the <italic>sill</italic>) are strongly impacted by the presence
of the trend, leading to an underestimation of the prediction uncertainty.
This bias, however, only marginally affects the prediction itself.</p>
      <p>Geomorphological considerations of the topology of a river network generally
focus on the channels, and lead to an intuitive conceptualization that
topological interpolation should focus on runoff correlations along flow
paths. The second type of approach embraces this topological structure. It
does not consider a point-scale runoff generation process, but instead models
the hillslope-scale runoff delivery process to the channel network as a
uni-dimensional directed tree <xref ref-type="bibr" rid="bib1.bibx11 bib1.bibx48" id="paren.19"/>.
Runoff correlation is expected to decrease with the distance along the stream
following a known parametric function. However, unlike Euclidian distances,
the streamwise distance does not have the necessary properties to provide a
solvable kriging system. This issue is addressed in <xref ref-type="bibr" rid="bib1.bibx11" id="text.20"/> and
<xref ref-type="bibr" rid="bib1.bibx47" id="text.21"/>, where streamflow is modeled as a random process
represented by a Brownian motion that starts at the trunk of the tree (i.e.,
the river mouth) moves upstream, bifurcates and evolves independently on each
branch. The resulting model only allows spatial dependence with points that
are upstream on the river network and provides a positive definite covariance
matrix that is estimated through restricted maximum likelihood (REML). Models
of this nature have been successfully tested on stream chemistry data
<xref ref-type="bibr" rid="bib1.bibx48" id="paren.22"/> and further developed to also allow spatial
auto-correlation among random variables on stream segments that do not share
flow, with potential applications to the modeling of the concentration of
upstream moving species (e.g., fishes or insects) <xref ref-type="bibr" rid="bib1.bibx47" id="paren.23"/>. While
these methods do not account for the streamflow generation process, they
avoid the conceptual and prediction uncertainty challenges confronted by
kriging techniques.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F1" specific-use="star"><caption><p>Conceptual flow propagation model. (i) Runoff is generated continuously
by a spatially distributed point process and drained to the stream
network. (ii) When monitored by stream gauges, runoff is spatially integrated
over the corresponding catchment and temporally averaged at the chosen observation
frequency (e.g., daily streamflow). (iii) The model conceptualizes the catchments
as isolated drainage areas (IDA) (A<inline-formula><mml:math display="inline"><mml:msup><mml:mi/><mml:mo>′</mml:mo></mml:msup></mml:math></inline-formula>, B<inline-formula><mml:math display="inline"><mml:msup><mml:mi/><mml:mo>′</mml:mo></mml:msup></mml:math></inline-formula>, and C<inline-formula><mml:math display="inline"><mml:msup><mml:mi/><mml:mo>′</mml:mo></mml:msup></mml:math></inline-formula>) representing the <italic>local</italic> runoff
contribution to each gauge. The flow actually measured at each gauge is the sum
of the upstream IDA.</p></caption>
          <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://hess.copernicus.org/articles/19/2925/2015/hess-19-2925-2015-f01.pdf"/>

        </fig>

</sec>
<sec id="Ch1.S1.SS3">
  <title>The topological restricted maximum likelihood approach</title>
      <p>Inspired by both types of approaches, here we present a method based on the
use of linear mixed models to generate a BLUP for hydrological variables on a
flow network. Rather than using a kriging estimator, we adopt a
REML framework
<xref ref-type="bibr" rid="bib1.bibx15 bib1.bibx35 bib1.bibx27" id="paren.24"/> to
estimate variance parameters. This reduces the bias on the semi-variogram by
allowing the variance to be estimated independently from the trend
coefficients <xref ref-type="bibr" rid="bib1.bibx10 bib1.bibx27" id="paren.25"/>. This use of a
REML framework to estimate a linear mixed effect model on a topological
support is termed topological restricted maximum likelihood (TopREML). The approach is based on the following conceptual
assumptions:
<list list-type="bullet"><list-item>
      <p><bold>Flow generation and propagation:</bold> similar to Top-kriging, runoff is
assumed to be generated at a point scale on the landscape, from where it is
routed to a channel and measured at a gauge (Fig. <xref ref-type="fig" rid="Ch1.F1"/>i). Runoff
observations made at any individual gauge (Fig. <xref ref-type="fig" rid="Ch1.F1"/>ii) can be
broken up into a <italic>local</italic> contribution, derived from a
never previously gauged catchment area, and an <italic>upstream</italic> contribution
that was previously observed at upstream gauge(s) along the channel (Fig. <xref ref-type="fig" rid="Ch1.F1"/>iii).
TopREML disaggregates all flow contributions into a
cascade of local components, as observed at each successive gauge, and uses
these characteristics to constrain the covariance structure of runoff and to
account for the stronger spatial correlations between flow-connected basins.</p></list-item><list-item>
      <p><bold>Treatment of time: </bold> for the local effects to form a suitable basis
for spatial interpolation, variations associated with temporal correlation
(e.g., travel time effects) need to be removed. This is achieved by
considering time-averaged streamflow data, with the proviso that the time
averaging window is much greater than the characteristic catchment and
channel response timescales. This treatment of time has several specific
consequences. First, TopREML is only suitable for the regionalization of
time-averaged and statistically stationary runoff properties (i.e.,
<italic>runoff signatures</italic>). Stationarity is necessary to ensure that the water
balance assumption used to separate local from upstream runoff contributions
is valid. However, as a consequence, TopREML cannot be used to interpolate
transient signatures, such as those associated with real-time forecasting.
Nor can it be used to describe runoff properties that are correlated over
timescales larger than the time averaging window. Because of the
stationarity assumption applied, all correlation arguments described in this
manuscript refer to the <italic>spatial</italic>, and not <italic>temporal</italic>, correlation
of the runoff signatures.</p></list-item><list-item>
      <p><bold>Network topology: </bold> network topology in TopREML also follows a
conceptual model that is similar to the model posited by Top-kriging.
Topology is conceptualized by area connectivity. That is, flow-connected
gauges are characterized by overlapping drainage areas. Unlike Top-kriging,
TopREML does not require information about a spatially random point process,
but solely relies on information measured at the gauges. It uses the
inter-centroidal Euclidian distance between drainage areas of the local flow
contributions at each gauge – the isolated drainage areas (IDA) – as
a distance metric to compute streamflow correlation. The underlying
assumption is that runoff signatures of local flow generation regions that
are close to each other (in Euclidian space) are more likely to be identical.
Although TopREML does not require that the characteristics of a point-scale
runoff generation process are known in order to support interpolation (a
necessary requirement for Top-kriging), the existence of such a point process
is consistent with the treatment of spatial correlation in TopREML. To
illustrate this consistency, a stylized example relating point-scale runoff
generation to the existence of a covariance structure that relates
flow-connected gauges is outlined in Appendix <xref ref-type="sec" rid="App1.Ch1.S1"/>.</p></list-item></list></p>
</sec>
<sec id="Ch1.S1.SS4">
  <title>Paper outline</title>
      <p>We first derive the TopREML estimator and its variance for mass conserving
(i.e., linearly aggregated) variables, with extensions to some
non-conservative variables (Sect. <xref ref-type="sec" rid="Ch1.S2"/>). We then apply the approach
in two case studies to evaluate its ability to predict mean runoff and runoff
frequency by comparison to other available interpolation techniques: Sects. <xref ref-type="sec" rid="Ch1.S3.SS1"/>
and <xref ref-type="sec" rid="Ch1.S4.SS1"/> present leave-one-out cross-validations in
Nepal (sparse gauges, significant trends) and Austria (dense gauge network,
no observed trends). In both cases, TopREML performed similarly to the best
alternative geostatistical method. We then use numerical simulations to
illustrate the effect of the two distinguishing features of TopREML: its
ability to properly predict runoff using highly nested networks of stream
gauges and its ability to properly estimate the prediction variance when
accounting for observable features (Sects. <xref ref-type="sec" rid="Ch1.S3.SS2"/> and <xref ref-type="sec" rid="Ch1.S4.SS2"/>).
Finally, we discuss the limits and delineate the context in which TopREML –
and geostatistical methods in general – can successfully be applied to
predict streamflow signatures in ungauged basins (Sect. <xref ref-type="sec" rid="Ch1.S5"/>).</p>
</sec>
</sec>
<sec id="Ch1.S2">
  <title>Theory</title>
<sec id="Ch1.S2.SS1">
  <title>Accounting for spatially correlated residuals</title>
      <p>Linear models can be used to make predictions about hydrological
variables along a network, provided that the models explicitly address the
effects of network structure. A mixed linear model approach provides a
suitable framework for this accounting. In this framework, the effects of
observable features on the hydrological outcome are assumed to be independent
of the network, and retain their influence independently, as so-called “fixed
effects”. The role of spatial structure is assumed to lead to correlation
specifically in the residuals <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold-italic">η</mml:mi></mml:math></inline-formula>. The residuals are split into two parts:
(i) one containing “random effects”, <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold-italic">u</mml:mi></mml:math></inline-formula>, that exhibit spatial correlation
along the flow network and (ii) a remaining, spatially independent, white
noise term, <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold-italic">ϵ</mml:mi></mml:math></inline-formula>, which does not have any spatial structure. With these
assumptions, the mixed linear model is written as:
<?xmltex \hack{\newpage\hspace*{-12mm}}?>

                <disp-formula id="Ch1.E2" content-type="numbered"><mml:math display="block"><mml:mrow><?xmltex \hack{\hbox\bgroup\fontsize{6.5}{6.5}\selectfont$\displaystyle}?><mml:mrow><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo>=</mml:mo><mml:msub><mml:munder><mml:mi mathvariant="bold">X</mml:mi><mml:mo mathvariant="normal">︸</mml:mo></mml:munder><mml:mtable class="array" columnalign="center"><mml:mtr><mml:mtd><mml:mtext>Trends:</mml:mtext></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mtext>Explanatory</mml:mtext></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mtext>variables</mml:mtext></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mrow><mml:mo>(</mml:mo><mml:mi>N</mml:mi><mml:mo>×</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:msub><mml:msub><mml:munder><mml:mi mathvariant="bold-italic">τ</mml:mi><mml:mo mathvariant="normal">︸</mml:mo></mml:munder><mml:mtable class="array" columnalign="center"><mml:mtr><mml:mtd><mml:mtext>Coefficients</mml:mtext></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mrow><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>×</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:munder><mml:mrow><mml:msub><mml:mi mathvariant="bold">I</mml:mi><mml:mi>N</mml:mi></mml:msub></mml:mrow><mml:mo mathvariant="normal">︸</mml:mo></mml:munder><mml:mtable class="array" columnalign="center"><mml:mtr><mml:mtd><mml:mtext>Identity</mml:mtext></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mtext>Matrix</mml:mtext></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mrow><mml:mo>(</mml:mo><mml:mi>N</mml:mi><mml:mo>×</mml:mo><mml:mi>N</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:msub><mml:msub><mml:munder><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo mathvariant="normal">︸</mml:mo></mml:munder><mml:mtable class="array" columnalign="center"><mml:mtr><mml:mtd><mml:mtext>Correlated</mml:mtext></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mtext>random</mml:mtext></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mtext>effects</mml:mtext></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mrow><mml:mo>(</mml:mo><mml:mi>N</mml:mi><mml:mo>×</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:munder><mml:mi mathvariant="bold-italic">ϵ</mml:mi><mml:mo mathvariant="normal">︸</mml:mo></mml:munder><mml:mtable class="array" columnalign="center"><mml:mtr><mml:mtd><mml:mtext>Residuals,</mml:mtext></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mtext>uncorrelated</mml:mtext></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mtext>errors</mml:mtext></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mrow><mml:mo>(</mml:mo><mml:mi>N</mml:mi><mml:mo>×</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:msub></mml:mrow><?xmltex \hack{$\egroup}?><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>

          To proceed, we assume that <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold-italic">u</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold-italic">ϵ</mml:mi></mml:math></inline-formula>
(and therefore <inline-formula><mml:math display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula>) are
normally distributed with zero mean and are independent from each other. The
variance associated with <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold-italic">ϵ</mml:mi></mml:math></inline-formula> is denoted <inline-formula><mml:math display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>, the variance of
<inline-formula><mml:math display="inline"><mml:mi mathvariant="bold-italic">u</mml:mi></mml:math></inline-formula> is assumed to be proportional to <inline-formula><mml:math display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> according to some ratio,
<inline-formula><mml:math display="inline"><mml:mi mathvariant="italic">ξ</mml:mi></mml:math></inline-formula>, and finally, <inline-formula><mml:math display="inline"><mml:mi>u</mml:mi></mml:math></inline-formula> is assumed to have a spatial dependence captured by a
correlation structure <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold">G</mml:mi></mml:math></inline-formula>, which is related to the spatial layout of gauges
along the river network and a distance parameter <inline-formula><mml:math display="inline"><mml:mi mathvariant="italic">ϕ</mml:mi></mml:math></inline-formula> (the correlation
range). Thus, the random effects can be specified as
            <disp-formula id="Ch1.E3" content-type="numbered"><mml:math display="block"><mml:mrow><mml:mfenced close="]" open="["><mml:mtable class="array" columnalign="center"><mml:mtr><mml:mtd><mml:mi mathvariant="bold-italic">u</mml:mi></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mi mathvariant="bold-italic">ϵ</mml:mi></mml:mtd></mml:mtr></mml:mtable></mml:mfenced><mml:mo>∼</mml:mo><mml:mi mathvariant="script">N</mml:mi><mml:mfenced open="(" close=")"><mml:mfenced open="[" close="]"><mml:mtable class="array" columnalign="center"><mml:mtr><mml:mtd><mml:mn mathvariant="normal">0</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn mathvariant="normal">0</mml:mn></mml:mtd></mml:mtr></mml:mtable></mml:mfenced><mml:mo>,</mml:mo><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mfenced open="[" close="]"><mml:mtable class="array" columnalign="center center"><mml:mtr><mml:mtd><mml:mrow><mml:mi mathvariant="italic">ξ</mml:mi><mml:mi mathvariant="bold">G</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ϕ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mn mathvariant="normal">0</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn mathvariant="normal">0</mml:mn></mml:mtd><mml:mtd><mml:mrow><mml:msub><mml:mi mathvariant="bold">I</mml:mi><mml:mi>N</mml:mi></mml:msub></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mfenced></mml:mfenced><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>
          To solve this mixed model, five unknowns must be found: <inline-formula><mml:math display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold-italic">ξ</mml:mi></mml:math></inline-formula>,
<inline-formula><mml:math display="inline"><mml:mi mathvariant="italic">ϕ</mml:mi></mml:math></inline-formula>, the fixed (<inline-formula><mml:math display="inline"><mml:mi mathvariant="bold-italic">τ</mml:mi></mml:math></inline-formula>) and random (<inline-formula><mml:math display="inline"><mml:mi mathvariant="bold-italic">u</mml:mi></mml:math></inline-formula>) effects. Once <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold-italic">τ</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold-italic">u</mml:mi></mml:math></inline-formula> are
known, the empirical best linear unbiased prediction (E-BLUP) of <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold-italic">y</mml:mi></mml:math></inline-formula> can be
made at ungauged locations <xref ref-type="bibr" rid="bib1.bibx27" id="paren.26"/>. The solution strategy
adopted here is to prescribe a parametric form for <inline-formula><mml:math display="inline"><mml:mrow><mml:mi mathvariant="bold">G</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ϕ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, allowing the
covariance structure along the network to be specified, and the likelihood
function for the model to be written in terms of <italic>all</italic> five unknowns.
Identifying the parameter values that optimize this model thus simultaneously
solves for the correlation structure, covariance parameters, fixed and random
effects. To proceed with the specification of <inline-formula><mml:math display="inline"><mml:mrow><mml:mi mathvariant="bold">G</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ϕ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, however, the form of
the covariance structure that arises along the network needs to be addressed.</p>
</sec>
<sec id="Ch1.S2.SS2">
  <title>Covariance structure of mass conserving variables</title>
      <p>In the linear mixed model framework, the propagation of
hydrological variables through the flow network introduces topological
effects into the covariance structure of that variable. Firstly, linearly
propagated variables, such as annual specific runoff, are discussed.
Nonlinearly propagating variables can in some cases be transformed to allow
the linear solutions to be used (as outlined in Sect. <xref ref-type="sec" rid="Ch1.S2.SS5"/>).
Consider a set of streamflow gauges monitoring a watershed as illustrated in
Fig. <xref ref-type="fig" rid="Ch1.F1"/>ii. Because of the nested nature of the river network,
the catchment area related to any upstream gauge is entirely included within
the area drained by all downstream gauges. To account for the network
structure, the catchment at any location along a stream can be subdivided
into the IDA that are <italic>monitored for the first time</italic> by an upstream gauge. This is illustrated in Fig. <xref ref-type="fig" rid="Ch1.F1"/>iii,
and leads to a subdivision into non-overlapping areas,
each associated with the most upstream gauge that monitors them. In making
this subdivision, it is implicitly assumed that the timescales at which a
hydrological variable is propagated in the channel are negligible compared
with the timescales on which hillslope effects operate (a generally valid
assumption for small to moderately sized watersheds; see
<xref ref-type="bibr" rid="bib1.bibx13" id="altparen.27"/>). IDAs can be associated with both gauged
locations and ungauged locations. In what follows, indices <inline-formula><mml:math display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math display="inline"><mml:mi>j</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula> and
<inline-formula><mml:math display="inline"><mml:mi>m</mml:mi></mml:math></inline-formula> are used to refer to gauged sites, while index <inline-formula><mml:math display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> refers to ungauged
sites where a prediction is to be made.</p>
      <p>With these assumptions, observations of <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> made at gauge <inline-formula><mml:math display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula> can be
expressed as a linear combination of contributions from the upstream IDAs:
            <disp-formula id="Ch1.E4" content-type="numbered"><mml:math display="block"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi><mml:mo>∈</mml:mo><mml:msub><mml:mtext>UP</mml:mtext><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:munderover><mml:msub><mml:mi>a</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:msubsup><mml:mi>y</mml:mi><mml:mi>k</mml:mi><mml:mo>′</mml:mo></mml:msubsup><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
          where <inline-formula><mml:math display="inline"><mml:mrow><mml:msubsup><mml:mi>y</mml:mi><mml:mi>k</mml:mi><mml:mo>′</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> is the contribution of the IDA related to gauge <inline-formula><mml:math display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula> (that is,
<inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is equivalent to <inline-formula><mml:math display="inline"><mml:mrow><mml:msubsup><mml:mi>y</mml:mi><mml:mi>i</mml:mi><mml:mo>′</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> only if there are no gauges upstream of gauge
<inline-formula><mml:math display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>); UP is the set of IDA monitored by gauges
that are located upstream of <inline-formula><mml:math display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>; <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>a</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>A</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mo>/</mml:mo><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>m</mml:mi><mml:mo>=</mml:mo><mml:mi>i</mml:mi></mml:mrow><mml:mtext>UP</mml:mtext></mml:msubsup><mml:msub><mml:mi>A</mml:mi><mml:mi>m</mml:mi></mml:msub><mml:mo>≤</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> is
the surface area of the drainage area <inline-formula><mml:math display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula> normalized by the total watershed
area upstream of gauge <inline-formula><mml:math display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>. The covariance between observations of <inline-formula><mml:math display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula> made
at different gauges can then be expressed as

                <disp-formula specific-use="align"><mml:math display="block"><mml:mtable displaystyle="true"><mml:mtr><mml:mtd><mml:mrow><mml:mtext>Cov</mml:mtext><mml:mfenced close=")" open="("><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mfenced></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:mtext>E</mml:mtext><mml:mfenced close="]" open="["><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:msub><mml:mi>y</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mfenced><mml:mo>-</mml:mo><mml:mtext>E</mml:mtext><mml:mfenced close="]" open="["><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mfenced><mml:mtext>E</mml:mtext><mml:mfenced close="]" open="["><mml:msub><mml:mi>y</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mfenced></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi><mml:mo>∈</mml:mo><mml:msub><mml:mtext>UP</mml:mtext><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:munderover><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>m</mml:mi><mml:mo>=</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi><mml:mo>∈</mml:mo><mml:msub><mml:mtext>UP</mml:mtext><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:munderover><mml:msub><mml:mi>a</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:msub><mml:mi>a</mml:mi><mml:mi>m</mml:mi></mml:msub><mml:mtext>E</mml:mtext><mml:mfenced close="]" open="["><mml:msubsup><mml:mi>y</mml:mi><mml:mi>k</mml:mi><mml:mo>′</mml:mo></mml:msubsup><mml:msubsup><mml:mi>y</mml:mi><mml:mi>m</mml:mi><mml:mo>′</mml:mo></mml:msubsup></mml:mfenced></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo>-</mml:mo><mml:mfenced close=")" open="("><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi><mml:mo>∈</mml:mo><mml:msub><mml:mtext>UP</mml:mtext><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:munderover><mml:msub><mml:mi>a</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mtext>E</mml:mtext><mml:mfenced open="[" close="]"><mml:msubsup><mml:mi>y</mml:mi><mml:mi>k</mml:mi><mml:mo>′</mml:mo></mml:msubsup></mml:mfenced></mml:mfenced><mml:mfenced open="(" close=")"><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>m</mml:mi><mml:mo>=</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi><mml:mo>∈</mml:mo><mml:msub><mml:mtext>UP</mml:mtext><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:munderover><mml:msub><mml:mi>a</mml:mi><mml:mi>m</mml:mi></mml:msub><mml:mtext>E</mml:mtext><mml:mfenced close="]" open="["><mml:msubsup><mml:mi>y</mml:mi><mml:mi>m</mml:mi><mml:mo>′</mml:mo></mml:msubsup></mml:mfenced></mml:mfenced></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>

            With <inline-formula><mml:math display="inline"><mml:mrow><mml:mtext>E</mml:mtext><mml:mfenced close="]" open="["><mml:msubsup><mml:mi>y</mml:mi><mml:mi>k</mml:mi><mml:mo>′</mml:mo></mml:msubsup><mml:msubsup><mml:mi>y</mml:mi><mml:mi>m</mml:mi><mml:mo>′</mml:mo></mml:msubsup></mml:mfenced><mml:mo>=</mml:mo><mml:mtext>Cov</mml:mtext><mml:mfenced open="(" close=")"><mml:msubsup><mml:mi>y</mml:mi><mml:mi>k</mml:mi><mml:mo>′</mml:mo></mml:msubsup><mml:msubsup><mml:mi>y</mml:mi><mml:mi>m</mml:mi><mml:mo>′</mml:mo></mml:msubsup></mml:mfenced><mml:mo>+</mml:mo><mml:mtext>E</mml:mtext><mml:mfenced open="[" close="]"><mml:msubsup><mml:mi>y</mml:mi><mml:mi>k</mml:mi><mml:mo>′</mml:mo></mml:msubsup></mml:mfenced><mml:mtext>E</mml:mtext><mml:mfenced open="[" close="]"><mml:msubsup><mml:mi>y</mml:mi><mml:mi>m</mml:mi><mml:mo>′</mml:mo></mml:msubsup></mml:mfenced></mml:mrow></mml:math></inline-formula>, we have
            <disp-formula id="Ch1.E5" content-type="numbered"><mml:math display="block"><mml:mrow><mml:mtext>Cov</mml:mtext><mml:mfenced open="(" close=")"><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mfenced><mml:mo>=</mml:mo><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi><mml:mo>∈</mml:mo><mml:msub><mml:mtext>UP</mml:mtext><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:munderover><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>m</mml:mi><mml:mo>=</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi><mml:mo>∈</mml:mo><mml:msub><mml:mtext>UP</mml:mtext><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:munderover><mml:msub><mml:mi>a</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:msub><mml:mi>a</mml:mi><mml:mi>m</mml:mi></mml:msub><mml:mtext>Cov</mml:mtext><mml:mfenced close=")" open="("><mml:msubsup><mml:mi>y</mml:mi><mml:mi>k</mml:mi><mml:mo>′</mml:mo></mml:msubsup><mml:mo>,</mml:mo><mml:msubsup><mml:mi>y</mml:mi><mml:mi>m</mml:mi><mml:mo>′</mml:mo></mml:msubsup></mml:mfenced><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
          where <inline-formula><mml:math display="inline"><mml:mrow><mml:mtext>Cov</mml:mtext><mml:mfenced close=")" open="("><mml:msubsup><mml:mi>y</mml:mi><mml:mi>k</mml:mi><mml:mo>′</mml:mo></mml:msubsup><mml:mo>,</mml:mo><mml:msubsup><mml:mi>y</mml:mi><mml:mi>m</mml:mi><mml:mo>′</mml:mo></mml:msubsup></mml:mfenced></mml:mrow></mml:math></inline-formula> is the covariance between the contributions of
sub-catchments <inline-formula><mml:math display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math display="inline"><mml:mi>m</mml:mi></mml:math></inline-formula>. By summing over UP in Eq. (<xref ref-type="disp-formula" rid="Ch1.E5"/>)
(rather then the complete set of available gauges), the model
assumes no correlation between runoff observed at flow-unconnected gauges.</p>
      <p>Here we assume that the area-averaged process <inline-formula><mml:math display="inline"><mml:mrow><mml:msup><mml:mi>y</mml:mi><mml:mo>′</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> is drawn from a second-order stationary random process, and that the covariance between <inline-formula><mml:math display="inline"><mml:mrow><mml:msubsup><mml:mi>y</mml:mi><mml:mi>k</mml:mi><mml:mo>′</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> and
<inline-formula><mml:math display="inline"><mml:mrow><mml:msubsup><mml:mi>y</mml:mi><mml:mi>m</mml:mi><mml:mo>′</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> will depend only on the relative position of sub-catchments <inline-formula><mml:math display="inline"><mml:mi>m</mml:mi></mml:math></inline-formula> and
<inline-formula><mml:math display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula>, given some specified correlation function <inline-formula><mml:math display="inline"><mml:mrow><mml:mi mathvariant="italic">ρ</mml:mi><mml:mo>(</mml:mo><mml:mo>⋅</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> of the distance
<inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>c</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mi>m</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> between the centroids of the two sub-catchments
<xref ref-type="bibr" rid="bib1.bibx10" id="paren.28"/>. We assume that this function is well
approximated by an exponential function <inline-formula><mml:math display="inline"><mml:mrow><mml:mi mathvariant="italic">ρ</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi>c</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mi mathvariant="italic">ϕ</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mi>e</mml:mi><mml:mi>x</mml:mi><mml:mi>p</mml:mi><mml:mo>(</mml:mo><mml:mo>-</mml:mo><mml:mi>c</mml:mi><mml:mo>/</mml:mo><mml:mi mathvariant="italic">ϕ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>. A
justification for this assumption, which reproduces the streamflow variances
observed in our case studies well (Fig. <xref ref-type="fig" rid="Ch1.F8"/>), is derived for
strongly idealized conditions in Appendix <xref ref-type="sec" rid="App1.Ch1.S1"/>. Finally, because the
observations made at the gauges represent an area-averaged process, the
averaging generates a nugget variance <inline-formula><mml:math display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> that is homogenous across
observations. The nugget consists of the variance of processes that are
spatially correlated over scales smaller than the sub-catchments (see
Appendix <xref ref-type="sec" rid="App1.Ch1.S1"/>) and of measurement errors at the gauges.</p>
      <p>With this background, the covariance matrix of <inline-formula><mml:math display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula> can be expressed as

                <disp-formula specific-use="align" content-type="numbered"><mml:math display="block"><mml:mtable displaystyle="true"><mml:mtr><mml:mtd><mml:mrow><mml:mtext>Cov</mml:mtext><mml:mfenced close=")" open="("><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mfenced></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:mi mathvariant="italic">ξ</mml:mi><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi><mml:mo>∈</mml:mo><mml:msub><mml:mtext>UP</mml:mtext><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:munderover><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>m</mml:mi><mml:mo>=</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi><mml:mo>∈</mml:mo><mml:msub><mml:mtext>UP</mml:mtext><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:munderover><mml:msub><mml:mi>a</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:msub><mml:mi>a</mml:mi><mml:mi>m</mml:mi></mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mfenced open="(" close=")"><mml:msub><mml:mi>c</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mi mathvariant="italic">ϕ</mml:mi></mml:mfenced><mml:mo>+</mml:mo><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:mtd></mml:mtr><mml:mlabeledtr id="Ch1.E6"><mml:mtd/><mml:mtd/><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>⋅</mml:mo><mml:mfenced close=")" open="("><mml:mi mathvariant="italic">ξ</mml:mi><mml:mi mathvariant="bold">U</mml:mi><mml:mo>[</mml:mo><mml:mi mathvariant="bold">A</mml:mi><mml:mo>⋄</mml:mo><mml:mi mathvariant="bold">R</mml:mi><mml:mo>]</mml:mo><mml:msup><mml:mi mathvariant="bold">U</mml:mi><mml:mi>T</mml:mi></mml:msup><mml:mo>+</mml:mo><mml:msub><mml:mi mathvariant="bold">I</mml:mi><mml:mi>N</mml:mi></mml:msub></mml:mfenced><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

            where <inline-formula><mml:math display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>=</mml:mo><mml:mtext>Var</mml:mtext><mml:mo>(</mml:mo><mml:msubsup><mml:mi>y</mml:mi><mml:mi>k</mml:mi><mml:mo>′</mml:mo></mml:msubsup><mml:mo>,</mml:mo><mml:msubsup><mml:mi>y</mml:mi><mml:mi>k</mml:mi><mml:mo>′</mml:mo></mml:msubsup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>U</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="bold">1</mml:mn><mml:mo mathvariant="italic">{</mml:mo><mml:mi>j</mml:mi><mml:mo>∈</mml:mo><mml:msub><mml:mtext>UP</mml:mtext><mml:mi>i</mml:mi></mml:msub><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula>,
<inline-formula><mml:math display="inline"><mml:mrow><mml:mi mathvariant="bold">A</mml:mi><mml:mo>=</mml:mo><mml:mi>a</mml:mi><mml:msup><mml:mi>a</mml:mi><mml:mi>T</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>, and where <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi mathvariant="italic">ρ</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi>c</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mi mathvariant="italic">ϕ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>. <inline-formula><mml:math display="inline"><mml:mrow><mml:mo>[</mml:mo><mml:mo>⋅</mml:mo><mml:mo>⋄</mml:mo><mml:mo>⋅</mml:mo><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula> denotes
the element-by-element matrix multiplication. The matrix <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold">G</mml:mi></mml:math></inline-formula> describing the
correlation between the random effects in Eq. (<xref ref-type="disp-formula" rid="Ch1.E3"/>) is finally
            <disp-formula id="Ch1.E7" content-type="numbered"><mml:math display="block"><mml:mrow><mml:mi mathvariant="bold">G</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ϕ</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mi mathvariant="bold">U</mml:mi><mml:mo>[</mml:mo><mml:mi mathvariant="bold">A</mml:mi><mml:mo>⋄</mml:mo><mml:mi mathvariant="bold">R</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ϕ</mml:mi><mml:mo>)</mml:mo><mml:mo>]</mml:mo><mml:msup><mml:mi mathvariant="bold">U</mml:mi><mml:mi>T</mml:mi></mml:msup><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>
          The topology of the network is described by the matrix <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold">U</mml:mi></mml:math></inline-formula>, which ensures
that only those catchments that are on the same sub-network (upstream or
downstream) of the considered gauge are utilized in the determination of the
covariance of <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold-italic">y</mml:mi></mml:math></inline-formula>. This spatial constraint comes at the expense of neglecting
potential correlations with neighboring catchments that are not
flow-connected, and the effects of this tradeoff are investigated in the
Monte Carlo experiment described in Sect. <xref ref-type="sec" rid="Ch1.S3.SS2"/>. The effect of
spatial proximity is addressed by use of the Euclidian distance between
catchment centroids (matrix <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold">R</mml:mi></mml:math></inline-formula>), and the effect of scale is accounted for by
weighting by the catchment area of the IDAs (matrix <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold">A</mml:mi></mml:math></inline-formula>).</p>
</sec>
<sec id="Ch1.S2.SS3">
  <title>REML estimation</title>
      <p>The restricted maximum likelihood approach partitions the likelihood of
<inline-formula><mml:math display="inline"><mml:mrow><mml:mi>y</mml:mi><mml:mo>∼</mml:mo><mml:mi mathvariant="script">N</mml:mi><mml:mfenced close=")" open="("><mml:mi mathvariant="bold">X</mml:mi><mml:mi mathvariant="bold-italic">τ</mml:mi><mml:mo>,</mml:mo><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ξ</mml:mi><mml:mi mathvariant="bold">G</mml:mi><mml:mo>+</mml:mo><mml:msub><mml:mi mathvariant="bold">I</mml:mi><mml:mi>N</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mfenced></mml:mrow></mml:math></inline-formula>
into two parts, one of which
is independent of <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold-italic">τ</mml:mi></mml:math></inline-formula> <xref ref-type="bibr" rid="bib1.bibx9" id="paren.29"/>. This allows the determination
of fixed effects and the variance parameters of the model (here <inline-formula><mml:math display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math display="inline"><mml:mi mathvariant="italic">ϕ</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math display="inline"><mml:mi mathvariant="italic">ξ</mml:mi></mml:math></inline-formula>)
to be undertaken separately. The variance parameters are then estimated by maximizing
the restricted log likelihood expression <xref ref-type="bibr" rid="bib1.bibx14" id="paren.30"/>

                <disp-formula specific-use="align" content-type="numbered"><mml:math display="block"><mml:mtable displaystyle="true"><mml:mtr><mml:mtd><mml:mrow><mml:msub><mml:mi mathvariant="italic">λ</mml:mi><mml:mi>R</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>,</mml:mo><mml:mi mathvariant="italic">ϕ</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="italic">ξ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:mo>-</mml:mo><mml:mfrac><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac><mml:mfenced close="" open="("><mml:mi>log⁡</mml:mi><mml:mtext>det</mml:mtext><mml:mfenced close=")" open="("><mml:msup><mml:mi mathvariant="bold">X</mml:mi><mml:mi>T</mml:mi></mml:msup><mml:msup><mml:mi mathvariant="bold">H</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mi mathvariant="bold">X</mml:mi></mml:mfenced><mml:mo>+</mml:mo><mml:mi>log⁡</mml:mi><mml:mtext>det</mml:mtext><mml:mfenced close=")" open="("><mml:mi mathvariant="bold">H</mml:mi></mml:mfenced></mml:mfenced></mml:mrow></mml:mtd></mml:mtr><mml:mlabeledtr id="Ch1.E8"><mml:mtd/><mml:mtd/><mml:mtd><mml:mrow><mml:mfenced open="." close=")"><mml:mo>+</mml:mo><mml:mi mathvariant="italic">ν</mml:mi><mml:mi>log⁡</mml:mi><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>+</mml:mo><mml:mfrac><mml:mn mathvariant="normal">1</mml:mn><mml:mrow><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:mfrac><mml:msup><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mi>T</mml:mi></mml:msup><mml:mi mathvariant="bold">P</mml:mi><mml:mi mathvariant="bold-italic">y</mml:mi></mml:mfenced><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

            where <inline-formula><mml:math display="inline"><mml:mrow><mml:mtext>det</mml:mtext><mml:mo>(</mml:mo><mml:mo>⋅</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is the matrix determinant operator, <inline-formula><mml:math display="inline"><mml:mrow><mml:mi mathvariant="italic">ν</mml:mi><mml:mo>=</mml:mo><mml:mi>N</mml:mi><mml:mo>-</mml:mo><mml:mi>k</mml:mi></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math display="inline"><mml:mrow><mml:mi mathvariant="bold">H</mml:mi><mml:mo>=</mml:mo><mml:msub><mml:mi mathvariant="bold">I</mml:mi><mml:mi>N</mml:mi></mml:msub><mml:mo>+</mml:mo><mml:mi mathvariant="italic">ξ</mml:mi><mml:mi mathvariant="bold">G</mml:mi></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math display="inline"><mml:mrow><mml:mi mathvariant="bold">P</mml:mi><mml:mo>=</mml:mo><mml:msub><mml:mi mathvariant="bold">I</mml:mi><mml:mi>N</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msup><mml:mi mathvariant="bold">WK</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:msup><mml:mi mathvariant="bold">W</mml:mi><mml:mi>T</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math display="inline"><mml:mrow><mml:mi mathvariant="bold">W</mml:mi><mml:mo>=</mml:mo><mml:mo>[</mml:mo><mml:mi mathvariant="bold">X</mml:mi><mml:mo>:</mml:mo><mml:msub><mml:mi mathvariant="bold">I</mml:mi><mml:mi>N</mml:mi></mml:msub><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula>,
<inline-formula><mml:math display="inline"><mml:mi mathvariant="bold">R</mml:mi></mml:math></inline-formula> is the correlation matrix in
Eq. (<xref ref-type="disp-formula" rid="Ch1.E7"/>) and <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold">K</mml:mi></mml:math></inline-formula> is the block matrix:

                <disp-formula id="Ch1.Ex6"><mml:math display="block"><mml:mrow><mml:mi mathvariant="bold">K</mml:mi><mml:mo>=</mml:mo><mml:mfenced open="[" close="]"><mml:mtable class="array" columnalign="center center"><mml:mtr><mml:mtd><mml:mrow><mml:msup><mml:mi mathvariant="bold">X</mml:mi><mml:mi>T</mml:mi></mml:msup><mml:mi mathvariant="bold">X</mml:mi></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:msup><mml:mi mathvariant="bold">X</mml:mi><mml:mi>T</mml:mi></mml:msup></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mi mathvariant="bold">X</mml:mi></mml:mtd><mml:mtd><mml:mrow><mml:msub><mml:mi mathvariant="bold">I</mml:mi><mml:mi>N</mml:mi></mml:msub><mml:mo>+</mml:mo><mml:msup><mml:mi mathvariant="italic">ξ</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:msup><mml:mi mathvariant="bold">G</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mfenced><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>

          The REML estimators <inline-formula><mml:math display="inline"><mml:mover accent="true"><mml:mrow><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover></mml:math></inline-formula> and <inline-formula><mml:math display="inline"><mml:mover accent="true"><mml:mi mathvariant="italic">ϕ</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover></mml:math></inline-formula> that maximize <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">λ</mml:mi><mml:mi>R</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>
can be obtained through numerical optimization.</p>
</sec>
<sec id="Ch1.S2.SS4">
  <title>E-BLUP and prediction variance at ungauged catchments</title>
      <p>Once the variance components <inline-formula><mml:math display="inline"><mml:mover accent="true"><mml:mi mathvariant="italic">ϕ</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover></mml:math></inline-formula> and <inline-formula><mml:math display="inline"><mml:mover accent="true"><mml:mi mathvariant="italic">ξ</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover></mml:math></inline-formula> are estimated, the
fixed effect coefficients <inline-formula><mml:math display="inline"><mml:mover accent="true"><mml:mi mathvariant="italic">τ</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover></mml:math></inline-formula> and the random effects <inline-formula><mml:math display="inline"><mml:mover accent="true"><mml:mi>u</mml:mi><mml:mo mathvariant="normal">̃</mml:mo></mml:mover></mml:math></inline-formula> can be
obtained by solving the linear system <xref ref-type="bibr" rid="bib1.bibx22" id="paren.31"/>:
            <disp-formula id="Ch1.E9" content-type="numbered"><mml:math display="block"><mml:mrow><mml:mi mathvariant="bold">K</mml:mi><mml:mo>(</mml:mo><mml:mover accent="true"><mml:mi mathvariant="italic">ϕ</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mo>,</mml:mo><mml:mover accent="true"><mml:mi mathvariant="italic">ξ</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mo>)</mml:mo><mml:mfenced close="]" open="["><mml:mtable class="array" columnalign="center"><mml:mtr><mml:mtd><mml:mover accent="true"><mml:mi mathvariant="bold-italic">τ</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mover accent="true"><mml:mi>u</mml:mi><mml:mo mathvariant="normal">̃</mml:mo></mml:mover></mml:mtd></mml:mtr></mml:mtable></mml:mfenced><mml:mo>=</mml:mo><mml:mfenced open="[" close="]"><mml:mtable class="array" columnalign="center"><mml:mtr><mml:mtd><mml:mrow><mml:mi mathvariant="bold">X</mml:mi><mml:mi mathvariant="bold-italic">y</mml:mi></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mi mathvariant="bold-italic">y</mml:mi></mml:mtd></mml:mtr></mml:mtable></mml:mfenced><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula></p>
      <p>The empirical best linear unbiased prediction of <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal">̃</mml:mo></mml:mover><mml:mi>n</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> at an ungauged
site <inline-formula><mml:math display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> can be computed by summing the fixed and random effect predictions
<xref ref-type="bibr" rid="bib1.bibx27" id="paren.32"/>
            <disp-formula id="Ch1.E10" content-type="numbered"><mml:math display="block"><mml:mrow><mml:mover accent="true"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mrow><mml:mo mathvariant="normal">̃</mml:mo></mml:mover><mml:mo>=</mml:mo><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>n</mml:mi><mml:mi>T</mml:mi></mml:msubsup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">τ</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mo>+</mml:mo><mml:mover accent="true"><mml:mrow><mml:msub><mml:mi>u</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mrow><mml:mo mathvariant="normal">̃</mml:mo></mml:mover><mml:mo>=</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>n</mml:mi></mml:msub><mml:mover accent="true"><mml:mi mathvariant="bold-italic">τ</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mo>+</mml:mo><mml:msubsup><mml:mi mathvariant="bold-italic">g</mml:mi><mml:mi>n</mml:mi><mml:mi>T</mml:mi></mml:msubsup><mml:msup><mml:mi mathvariant="bold">G</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo mathvariant="normal">̃</mml:mo></mml:mover><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
          where <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the vector of fixed covariates at ungauged site <inline-formula><mml:math display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">g</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> a
correlation vector between site <inline-formula><mml:math display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> and each gauge. Knowing <inline-formula><mml:math display="inline"><mml:mover accent="true"><mml:mi mathvariant="italic">ϕ</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover></mml:math></inline-formula>, <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">g</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>
can be readily obtained from the relative position of site <inline-formula><mml:math display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> and the gauges
in the river network.</p>
      <p>The variance of the TopREML prediction error can be expressed as

                <disp-formula specific-use="align" content-type="numbered"><mml:math display="block"><mml:mtable displaystyle="true"><mml:mlabeledtr id="Ch1.E11"><mml:mtd/><mml:mtd><mml:mrow><mml:mtext>Var</mml:mtext><mml:mfenced close=")" open="("><mml:msub><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal">̃</mml:mo></mml:mover><mml:mi>n</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mfenced></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:mtext>Var</mml:mtext><mml:mfenced open="(" close=")"><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>n</mml:mi><mml:mi>T</mml:mi></mml:msubsup><mml:mfenced open="(" close=")"><mml:mover accent="true"><mml:mi mathvariant="bold-italic">τ</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mo>-</mml:mo><mml:mi mathvariant="bold-italic">τ</mml:mi></mml:mfenced><mml:mo>+</mml:mo><mml:msubsup><mml:mi mathvariant="bold-italic">g</mml:mi><mml:mi>n</mml:mi><mml:mi>T</mml:mi></mml:msubsup><mml:msup><mml:mi mathvariant="bold">G</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mfenced open="(" close=")"><mml:mover accent="true"><mml:mi>u</mml:mi><mml:mo mathvariant="normal">̃</mml:mo></mml:mover><mml:mo>-</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi></mml:mfenced></mml:mfenced></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><?xmltex \hack{\hbox\bgroup\fontsize{9.0}{9.0}\selectfont$\displaystyle}?><mml:mo>=</mml:mo><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>n</mml:mi><mml:mi>T</mml:mi></mml:msubsup><mml:mtext>Var</mml:mtext><mml:mfenced close=")" open="("><mml:mover accent="true"><mml:mi mathvariant="italic">τ</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mo>-</mml:mo><mml:mi mathvariant="italic">τ</mml:mi></mml:mfenced><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>n</mml:mi></mml:msub><mml:mo>+</mml:mo><mml:msubsup><mml:mi mathvariant="bold-italic">g</mml:mi><mml:mi>n</mml:mi><mml:mi>T</mml:mi></mml:msubsup><mml:msup><mml:mi mathvariant="bold">G</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mtext>Var</mml:mtext><mml:mfenced close=")" open="("><mml:mover accent="true"><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo mathvariant="normal">̃</mml:mo></mml:mover><mml:mo>-</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi></mml:mfenced><mml:msup><mml:mi mathvariant="bold">G</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:msub><mml:mi mathvariant="bold-italic">g</mml:mi><mml:mi>n</mml:mi></mml:msub><?xmltex \hack{$\egroup}?></mml:mrow></mml:mtd></mml:mtr><mml:mlabeledtr id="Ch1.E12"><mml:mtd/><mml:mtd/><mml:mtd><mml:mrow><mml:mo>+</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>n</mml:mi><mml:mi>T</mml:mi></mml:msubsup><mml:mtext>Cov</mml:mtext><mml:mfenced close=")" open="("><mml:mover accent="true"><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo mathvariant="normal">̃</mml:mo></mml:mover><mml:mo>-</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>,</mml:mo><mml:mover accent="true"><mml:mi mathvariant="bold-italic">τ</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mo>-</mml:mo><mml:mi mathvariant="bold-italic">τ</mml:mi></mml:mfenced><mml:msup><mml:mi mathvariant="bold">G</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:msub><mml:mi mathvariant="bold-italic">g</mml:mi><mml:mi>n</mml:mi></mml:msub><mml:mo>.</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

            The covariance matrix of the error on <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold-italic">τ</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold-italic">u</mml:mi></mml:math></inline-formula> in Eq. (<xref ref-type="disp-formula" rid="Ch1.E11"/>)
can be expressed as a function of the inverted model matrix <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold">K</mml:mi></mml:math></inline-formula>
<xref ref-type="bibr" rid="bib1.bibx27" id="paren.33"/>:
            <disp-formula id="Ch1.E13" content-type="numbered"><mml:math display="block"><mml:mrow><mml:mtext>Cov</mml:mtext><mml:mfenced close=")" open="("><mml:mtable class="array" columnalign="center"><mml:mtr><mml:mtd><mml:mrow><mml:mover accent="true"><mml:mi mathvariant="bold-italic">τ</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mo>-</mml:mo><mml:mi mathvariant="bold-italic">τ</mml:mi></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mrow><mml:mover accent="true"><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo mathvariant="normal">̃</mml:mo></mml:mover><mml:mo>-</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mfenced><mml:mo>=</mml:mo><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:msup><mml:mi mathvariant="bold">K</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>
          This provides

                <disp-formula specific-use="align" content-type="numbered"><mml:math display="block"><mml:mtable displaystyle="true"><mml:mtr><mml:mtd><mml:mrow><mml:mtext>Var</mml:mtext><mml:msub><mml:mfenced close=")" open="("><mml:msub><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal">̃</mml:mo></mml:mover><mml:mi>n</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mfenced><mml:mo>-</mml:mo></mml:msub></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mfenced close="" open="("><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>n</mml:mi><mml:mi>T</mml:mi></mml:msubsup><mml:msubsup><mml:mi mathvariant="bold">K</mml:mi><mml:mn>11</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msubsup><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>n</mml:mi></mml:msub><mml:mo>+</mml:mo><mml:msubsup><mml:mi mathvariant="bold-italic">g</mml:mi><mml:mi>n</mml:mi><mml:mi>T</mml:mi></mml:msubsup><mml:msup><mml:mi mathvariant="bold">G</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:msubsup><mml:mi mathvariant="bold">K</mml:mi><mml:mn>22</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msubsup><mml:msup><mml:mi mathvariant="bold">G</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:msub><mml:mi mathvariant="bold-italic">g</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mfenced></mml:mrow></mml:mtd></mml:mtr><mml:mlabeledtr id="Ch1.E14"><mml:mtd/><mml:mtd/><mml:mtd><mml:mrow><mml:mfenced close=")" open="."><mml:mo>+</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:msubsup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>n</mml:mi><mml:mi>T</mml:mi></mml:msubsup><mml:msubsup><mml:mi mathvariant="bold">K</mml:mi><mml:mn>12</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msubsup><mml:msup><mml:mi mathvariant="bold">G</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:msub><mml:mi mathvariant="bold-italic">g</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mfenced><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

            where <inline-formula><mml:math display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="bold">K</mml:mi><mml:mn>11</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="bold">K</mml:mi><mml:mn>22</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="bold">K</mml:mi><mml:mn>12</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula> are <inline-formula><mml:math display="inline"><mml:mrow><mml:mi>k</mml:mi><mml:mo>×</mml:mo><mml:mi>k</mml:mi></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math display="inline"><mml:mrow><mml:mi>N</mml:mi><mml:mo>×</mml:mo><mml:mi>N</mml:mi></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math display="inline"><mml:mrow><mml:mi>k</mml:mi><mml:mo>×</mml:mo><mml:mi>N</mml:mi></mml:mrow></mml:math></inline-formula> partitions of the inverted <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold">K</mml:mi></mml:math></inline-formula> matrix. If <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold-italic">ϵ</mml:mi></mml:math></inline-formula> is
an error that is truly iid and does not affect the true value of <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>
(e.g., measurement errors), then Eq. (<xref ref-type="disp-formula" rid="Ch1.E14"/>) corresponds to the mean
square error of the TopREML prediction of <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>. If, by contrast, <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold-italic">ϵ</mml:mi></mml:math></inline-formula>
represents random variations of the true value of <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> that are correlated
over short distances (and so do not appear correlated in our data), then
<inline-formula><mml:math display="inline"><mml:mi mathvariant="italic">ϵ</mml:mi></mml:math></inline-formula> should be included in Eq. (<xref ref-type="disp-formula" rid="Ch1.E11"/>) and the prediction
variance becomes
            <disp-formula id="Ch1.E15" content-type="numbered"><mml:math display="block"><mml:mrow><mml:mtext>Var</mml:mtext><mml:msub><mml:mfenced open="(" close=")"><mml:msub><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal">̃</mml:mo></mml:mover><mml:mi>n</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mfenced><mml:mo>+</mml:mo></mml:msub><mml:mo>=</mml:mo><mml:mtext>Var</mml:mtext><mml:msub><mml:mfenced open="(" close=")"><mml:msub><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal">̃</mml:mo></mml:mover><mml:mi>n</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mfenced><mml:mo>-</mml:mo></mml:msub><mml:mo>+</mml:mo><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
          because <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold-italic">ϵ</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold-italic">u</mml:mi></mml:math></inline-formula> are independent. In reality <inline-formula><mml:math display="inline"><mml:mi mathvariant="italic">ϵ</mml:mi></mml:math></inline-formula> is likely
composed of both spatially correlated and iid error components and the
true variance will be somewhere between these two bounds
<xref ref-type="bibr" rid="bib1.bibx27" id="paren.34"/>.</p>
</sec>
<sec id="Ch1.S2.SS5">
  <title>Application to non-conservative variables</title>
      <p>Unlike mean specific runoff, numerous streamflow signatures
(e.g., runoff frequency or descriptors of the recession behavior) are
non-conservative and cannot be expressed as linear combinations of their
values in upstream sub-catchments. In such conditions the derivations in
Sect. <xref ref-type="sec" rid="Ch1.S2.SS2"/> cannot be applied and the correlation structure in
Eq. (<xref ref-type="disp-formula" rid="Ch1.E7"/>) will lead to biased REML predictions. The effect of the
network structure on streamflow can nonetheless be accounted if the
nonlinearities can be neglected or eliminated through algebraic
transformations.</p>
      <p>For instance, runoff frequency <inline-formula><mml:math display="inline"><mml:mi mathvariant="italic">λ</mml:mi></mml:math></inline-formula> is defined as the probability, on
daily timescales, that a gauge will record a positive increment in streamflow
<xref ref-type="bibr" rid="bib1.bibx6 bib1.bibx32" id="paren.35"/>. Provided all sub-basins are
large enough to significantly contribute to streamflow, a runoff pulse at
<italic>any</italic> of the upstream sub-basins causes a streamflow increase at the
gauge. Therefore, runoff frequency does not scale linearly through the river
network. It can nonetheless be shown (see Appendix <xref ref-type="sec" rid="App1.Ch1.S2"/>) that if runoff
pulses occur independently for each sub-basin, the logarithm of the
complement to runoff probability (i.e., ln<inline-formula><mml:math display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:mi mathvariant="italic">λ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>) propagates linearly
throughout the network, enabling the application of TopREML to predict runoff
probability at ungauged catchments.</p>
      <p>A similar reasoning can be applied to predict recession parameters. For
example, the exponential function <inline-formula><mml:math display="inline"><mml:mrow><mml:mi>Q</mml:mi><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:msub><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mi mathvariant="normal">exp</mml:mi><mml:mo>(</mml:mo><mml:mo>-</mml:mo><mml:msub><mml:mi>k</mml:mi><mml:mi mathvariant="normal">r</mml:mi></mml:msub><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is a widely used
approach to model base flow recession, where <inline-formula><mml:math display="inline"><mml:mrow><mml:mi>Q</mml:mi><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is the discharge at time
<inline-formula><mml:math display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> the peak discharge and <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>k</mml:mi><mml:mi mathvariant="normal">r</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> the recession constant which can be
considered to represent the inverse of the average response time in storage
<xref ref-type="bibr" rid="bib1.bibx51" id="paren.36"/>. Because expected values scale linearly, the
average response time at a gauge can be modeled as a linear combination of
the mean response times of the upstream IDAs. Therefore, although recession
constants themselves do not propagate linearly, their value in ungauged
basins can be estimated by taking the inverse of TopREML predictions of
average response times.</p>
</sec>
<sec id="Ch1.S2.SS6">
  <title>Implementation</title>
      <p>The computational implementation of TopREML in R <xref ref-type="bibr" rid="bib1.bibx37" id="paren.37"/>
is described in Appendix <xref ref-type="sec" rid="App1.Ch1.S3"/> and an operational script is provided as a
Supplement to this manuscript.  To run the script, two vector data sets (e.g.,
ESRI Shapefiles) are needed as inputs – one containing the catchments where
runoff is available and another containing the basins where predictions are to
be made. Catchment polygons and explanatory and predicted variables must be
provided as attributes of the vector polygons. The way in which the catchment
polygons are nested provides the topology of the stream network. TopREML uses
the Broyden–Fletcher–Goldfarb–Shanno (BFGS) algorithm <xref ref-type="bibr" rid="bib1.bibx53" id="paren.38"/> to maximize the restricted log
likelihood, though stochastic algorithms are required if a non-differentiable
(e.g., spherical) covariance function is selected. The selection of initial
values for <inline-formula><mml:math display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math display="inline"><mml:mi mathvariant="italic">ϕ</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math display="inline"><mml:mi mathvariant="italic">ξ</mml:mi></mml:math></inline-formula> is a key user input that may affect
the performance of optimization algorithms by causing them to converge to a
local extrema. We found that initial values of <inline-formula><mml:math display="inline"><mml:mrow><mml:mo>[</mml:mo><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">0</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="italic">ϕ</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="italic">ξ</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>]</mml:mo><mml:mo>=</mml:mo><mml:mo>[</mml:mo><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="normal">LM</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup><mml:mo>,</mml:mo><mml:mtext>E</mml:mtext><mml:msub><mml:mi>c</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula>
worked well in our case studies, with <inline-formula><mml:math display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="normal">LM</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula> the variance of the OLS
residuals of the linear model and <inline-formula><mml:math display="inline"><mml:mrow><mml:mtext>E</mml:mtext><mml:msub><mml:mi>c</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mi>m</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> the average distance between IDA
centroids.</p>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T1"><caption><p>Taxonomy of the compared regionalization approaches.</p></caption><oasis:table frame="topbot"><?xmltex \begin{scaleboxenv}{.82}[.82]?><oasis:tgroup cols="5">
     <oasis:colspec colnum="1" colname="col1" align="right" colsep="1"/>
     <oasis:colspec colnum="2" colname="col2" align="center"/>
     <oasis:colspec colnum="3" colname="col3" align="center"/>
     <oasis:colspec colnum="4" colname="col4" align="center"/>
     <oasis:colspec colnum="5" colname="col5" align="center"/>
     <oasis:thead>
       <oasis:row>  
         <oasis:entry colname="col1"/>  
         <oasis:entry colname="col2">Explanatory</oasis:entry>  
         <oasis:entry colname="col3">Spatial</oasis:entry>  
         <oasis:entry colname="col4">Network</oasis:entry>  
         <oasis:entry colname="col5">Unbiased</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">  
         <oasis:entry colname="col1"/>  
         <oasis:entry colname="col2">variables</oasis:entry>  
         <oasis:entry colname="col3">covariance</oasis:entry>  
         <oasis:entry colname="col4">topology</oasis:entry>  
         <oasis:entry colname="col5">variance</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>  
         <oasis:entry colname="col1">Sample mean</oasis:entry>  
         <oasis:entry colname="col2"/>  
         <oasis:entry colname="col3"/>  
         <oasis:entry colname="col4"/>  
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>  
         <oasis:entry colname="col1">Linear regression</oasis:entry>  
         <oasis:entry colname="col2">X</oasis:entry>  
         <oasis:entry colname="col3"/>  
         <oasis:entry colname="col4"/>  
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>  
         <oasis:entry colname="col1">Universal kriging</oasis:entry>  
         <oasis:entry colname="col2">X</oasis:entry>  
         <oasis:entry colname="col3">X</oasis:entry>  
         <oasis:entry colname="col4"/>  
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>  
         <oasis:entry colname="col1">Top-kriging</oasis:entry>  
         <oasis:entry colname="col2">X</oasis:entry>  
         <oasis:entry colname="col3">X</oasis:entry>  
         <oasis:entry colname="col4">X</oasis:entry>  
         <oasis:entry colname="col5"/>
       </oasis:row>
       <oasis:row>  
         <oasis:entry colname="col1">TopREML</oasis:entry>  
         <oasis:entry colname="col2">X</oasis:entry>  
         <oasis:entry colname="col3">X</oasis:entry>  
         <oasis:entry colname="col4">X</oasis:entry>  
         <oasis:entry colname="col5">X</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup><?xmltex \end{scaleboxenv}?></oasis:table></table-wrap>

      <?xmltex \floatpos{t}?><fig id="Ch1.F2"><caption><p>Empirical correlograms of the mean specific summer flow recorded at the
57 gauges of the Austrian data set. Distance has a different effect on the
correlation between flow-connected (black circles) and flow-unconnected (white
triangles) gauges. Both correlograms are well fitted by an exponential function
but the spatial correlation range doubles when gauges are flow connected. Both
empirical correlograms are constructed using 5 km bins.</p></caption>
          <?xmltex \igopts{width=170.716535pt}?><graphic xlink:href="https://hess.copernicus.org/articles/19/2925/2015/hess-19-2925-2015-f02.pdf"/>

        </fig>

<?xmltex \hack{\newpage}?>
</sec>
</sec>
<sec id="Ch1.S3">
  <title>Methods</title>
<sec id="Ch1.S3.SS1">
  <title>Case studies</title>
      <p>Observed streamflow data are used to evaluate the ability of
TopREML to predict streamflow signatures in ungauged basins. The assessment
is based on leave-one-out cross-validations, where the tested model is
applied to predict runoff at one basin based on observations from all the
other basins. After predicting runoff at all available basins in that manner,
the model is evaluated based on its mean absolute prediction error.
Streamflow variables from 57 catchments in Upper Austria
<xref ref-type="bibr" rid="bib1.bibx43" id="paren.39"/> and 52 catchments in Nepal
<xref ref-type="bibr" rid="bib1.bibx12" id="paren.40"/> are used in two
separate leave-one-out analyses. The location of the gauges is shown in
Fig. <xref ref-type="fig" rid="Ch1.F3"/>, and Table <xref ref-type="table" rid="Ch1.T2"/> provides a summary of
relevant catchment characteristics. Further details on the data sets are
provided in <xref ref-type="bibr" rid="bib1.bibx43" id="text.41"/> for Austria and
<xref ref-type="bibr" rid="bib1.bibx32" id="text.42"/> in Nepal. The two regions differ significantly
with respect to gauge density (high in Austria and low in Nepal) and in the
nature of the runoff signature and observable features. The Nepalese data set
provides specific runoff and wet season runoff frequency as well as gauge
elevation and bias-adjusted annual rainfall derived from the Tropical
Rainfall Measurement Mission 3B42v7 data set <xref ref-type="bibr" rid="bib1.bibx31" id="paren.43"/>. Gauge
elevation and annual rainfall are used as observable features for specific
runoff <xref ref-type="bibr" rid="bib1.bibx8" id="paren.44"/>. The Austrian data set was directly taken
from the <italic>rtop</italic> package <xref ref-type="bibr" rid="bib1.bibx43" id="paren.45"/>, where mean summer runoff
observations are provided to demonstrate Top-kriging. The Austrian data set
did not contain additional observable features and previous studies have
found spatial proximity to be a significantly better predictor of runoff than
catchment attributes in Austria <xref ref-type="bibr" rid="bib1.bibx30" id="paren.46"/>.</p>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T2" specific-use="star"><caption><p>Catchment characteristics of the case studies. </p></caption><oasis:table frame="topbot"><oasis:tgroup cols="9">
     <oasis:colspec colnum="1" colname="col1" align="right" colsep="1"/>
     <oasis:colspec colnum="2" colname="col2" align="center"/>
     <oasis:colspec colnum="3" colname="col3" align="left"/>
     <oasis:colspec colnum="4" colname="col4" align="left"/>
     <oasis:colspec colnum="5" colname="col5" align="left"/>
     <oasis:colspec colnum="6" colname="col6" align="left"/>
     <oasis:colspec colnum="7" colname="col7" align="left"/>
     <oasis:colspec colnum="8" colname="col8" align="left"/>
     <oasis:colspec colnum="9" colname="col9" align="left"/>
     <oasis:thead>
       <oasis:row rowsep="1">  
         <oasis:entry colname="col1"/>  
         <oasis:entry colname="col2"><inline-formula><mml:math display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula></oasis:entry>  
         <oasis:entry colname="col3"><inline-formula><mml:math display="inline"><mml:mi>Q</mml:mi></mml:math></inline-formula></oasis:entry>  
         <oasis:entry colname="col4"><inline-formula><mml:math display="inline"><mml:mi mathvariant="italic">λ</mml:mi></mml:math></inline-formula></oasis:entry>  
         <oasis:entry colname="col5"><inline-formula><mml:math display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula></oasis:entry>  
         <oasis:entry colname="col6"><inline-formula><mml:math display="inline"><mml:mi>c</mml:mi></mml:math></inline-formula></oasis:entry>  
         <oasis:entry colname="col7">Dpt</oasis:entry>  
         <oasis:entry colname="col8"><inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>P</mml:mi><mml:mi>y</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula></oasis:entry>  
         <oasis:entry colname="col9"><inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>z</mml:mi><mml:mtext>g</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>  
         <oasis:entry colname="col1">Nepal</oasis:entry>  
         <oasis:entry colname="col2">52</oasis:entry>  
         <oasis:entry colname="col3">1660</oasis:entry>  
         <oasis:entry colname="col4">0.42</oasis:entry>  
         <oasis:entry colname="col5">2121</oasis:entry>  
         <oasis:entry colname="col6">13.9</oasis:entry>  
         <oasis:entry colname="col7">10</oasis:entry>  
         <oasis:entry colname="col8">1683</oasis:entry>  
         <oasis:entry colname="col9">320</oasis:entry>
       </oasis:row>
       <oasis:row>  
         <oasis:entry colname="col1"/>  
         <oasis:entry colname="col2"/>  
         <oasis:entry colname="col3">(1062, 2228)</oasis:entry>  
         <oasis:entry colname="col4">(0.40, 0.46)</oasis:entry>  
         <oasis:entry colname="col5">(513, 5267)</oasis:entry>  
         <oasis:entry colname="col6">(9.2, 25.2)</oasis:entry>  
         <oasis:entry colname="col7"/>  
         <oasis:entry colname="col8">(1482, 1909)</oasis:entry>  
         <oasis:entry colname="col9">(507, 750)</oasis:entry>
       </oasis:row>
       <oasis:row>  
         <oasis:entry colname="col1">Austria</oasis:entry>  
         <oasis:entry colname="col2">57</oasis:entry>  
         <oasis:entry colname="col3">0.68</oasis:entry>  
         <oasis:entry colname="col4"/>  
         <oasis:entry colname="col5">68</oasis:entry>  
         <oasis:entry colname="col6">4.5</oasis:entry>  
         <oasis:entry colname="col7">8</oasis:entry>  
         <oasis:entry colname="col8"/>  
         <oasis:entry colname="col9"/>
       </oasis:row>
       <oasis:row>  
         <oasis:entry colname="col1"/>  
         <oasis:entry colname="col2"/>  
         <oasis:entry colname="col3">(0.42,1.43)</oasis:entry>  
         <oasis:entry colname="col4"/>  
         <oasis:entry colname="col5">(44, 136)</oasis:entry>  
         <oasis:entry colname="col6">(3.9, 6.3)</oasis:entry>  
         <oasis:entry colname="col7"/>  
         <oasis:entry colname="col8"/>  
         <oasis:entry colname="col9"/>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table><table-wrap-foot><p><inline-formula><mml:math display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula> is the number of catchments; <inline-formula><mml:math display="inline"><mml:mi>Q</mml:mi></mml:math></inline-formula> the specific runoff (mm yr<inline-formula><mml:math display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>) in
Nepal and the mean summer streamflow (m<inline-formula><mml:math display="inline"><mml:msup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn></mml:msup></mml:math></inline-formula> s) in Austria; <inline-formula><mml:math display="inline"><mml:mi mathvariant="italic">λ</mml:mi></mml:math></inline-formula> is the rainy
season runoff frequency (<inline-formula><mml:math display="inline"><mml:mrow><mml:msup><mml:mi>d</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>) in Nepal; <inline-formula><mml:math display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula> the catchment area in km<inline-formula><mml:math display="inline"><mml:msup><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:math></inline-formula>; <inline-formula><mml:math display="inline"><mml:mi>c</mml:mi></mml:math></inline-formula>
the distance in km between the centroids of IDA; Dpt the
depth of the stream network graph (i.e., the maximum number of flow-connected
gauges); <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>P</mml:mi><mml:mi>y</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> the annual rainfall in millimeters given by TRMM over Nepal and adjusted
according to  <xref ref-type="bibr" rid="bib1.bibx31" id="paren.47"/>; <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>z</mml:mi><mml:mtext>g</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> is the gauge elevation in
meters above sea level. Median values are provided with 25th and 75th
quantiles in parenthesis.</p></table-wrap-foot></table-wrap>

      <p>The predictive ability of TopREML was evaluated on (a) specific annual runoff
in Nepal, (b) wet season runoff frequency in Nepal and (c) average summer
streamflow in Austria. The performance of TopREML (TR) was compared to five
other widely used regionalization methods: sample mean (LM<inline-formula><mml:math display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:math></inline-formula>), linear
regression (LM), universal kriging (UK) and Top-kriging (TK). As shown
in Table <xref ref-type="table" rid="Ch1.T1"/>, these methods cover a wide spectrum of
incrementally specific assumptions and comparing them provides an assessment
of the value added by increased model complexity for regionalization of these
streamflow parameters. Code to implement all four methods is readily
available in R, with dedicated packages available for Top-kriging –
<italic>rtop</italic> – and universal kriging – <italic>gstat</italic>
<xref ref-type="bibr" rid="bib1.bibx36" id="paren.48"/>.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F3" specific-use="star"><caption><p>Location of the gauges and related catchments included in the cross-validation analyses in Upper Austria <bold>(a)</bold> and Nepal <bold>(b)</bold>. Coloring is semi-transparent
to emphasize overlapping catchment areas. Dark colors represent upstream catchments,
whose runoff is monitored by many gauges downstream. Light colors represent
downstream catchments with only few downstream gauges to monitor runoff.</p></caption>
          <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://hess.copernicus.org/articles/19/2925/2015/hess-19-2925-2015-f03.pdf"/>

        </fig>

<?xmltex \hack{\newpage}?>
</sec>
<sec id="Ch1.S3.SS2">
  <title>Numerical simulations</title>
<sec id="Ch1.S3.SS2.SSS1">
  <title>Network effects</title>
      <p>Conventional geostatistical methods predict runoff by weighing observations
from surrounding basins based on their geographic distance. TopREML also
incorporates the topology of the stream network by including or excluding
basins based on their flow-connectedness. This adds topological information
to the determination of the covariance structure of runoff, at the expense of
discarding information that could be derived from correlations between
spatially proximate regions that are not connected to the gauge of interest
by a flow path. Assessing the net benefits of accounting for network effects
requires being able to control the topology of the network, and thus requires
numerical simulations. A series of Monte Carlo experiments as described in
Fig. <xref ref-type="fig" rid="Ch1.F4"/> were run to simulate network complexity by varying the
number of flow-connected basins that are within (<inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">inner</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>) and beyond
(<inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">outer</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>) the predefined spatial auto-correlation range of the randomly
generated runoff. A non-topological geostatistical method like universal
kriging would include all basins within and exclude all basins beyond the
spatial auto-correlation range. We expect TopREML to outperform universal
kriging when the number of flow-connected basins beyond the auto-correlation
range increases and the number of connected basins within the auto-correlation
range decreases.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F4"><caption><p>Monte Carlo generation procedure: (i) a spatially correlated Gaussian
field with an exponential covariance function (mean = 30, partial sill = 8,
nugget = 2, range = 3) is generated along a 7 <inline-formula><mml:math display="inline"><mml:mo>×</mml:mo></mml:math></inline-formula> 7 irregular grid. The central
pixel (in black) represents the downstream-most catchment, where runoff is to
be predicted. Among the remaining pixels, 24 <italic>inner</italic> isolated drainage
areas (IDA) are within a radius of one spatial correlation range (dashed circle)
of the central pixel, and 24 <italic>outer</italic> pixels are beyond that
radius. (ii) A predefined number of <italic>inner</italic> and <italic>outer</italic> pixels are
randomly selected as part of the set of catchments that are flow-connected to
the central pixel. In the figure, all 24 <italic>inner</italic> pixels and 12 <italic>outer</italic>
pixels are selected and form the flow catchment outlined with a thick black
line. (iii) A tree graph is randomly generated (grey arrows) with its trunk at
the prediction pixel and branches passing through all the flow-connected pixels.
The random field generated in step one is aggregated along the tree by summing
the value of all lower order branches at each confluence. (iv) A new spatially
correlated field (mean = 1, partial sill = 0.15, nugget = 0, range = 0.5) is generated
at each pixel – that is the observed trend. The trend is multiplied by a
predefined trend coefficient (<inline-formula><mml:math display="inline"><mml:mrow><mml:mi mathvariant="italic">τ</mml:mi><mml:mo>=</mml:mo><mml:mn>10</mml:mn></mml:mrow></mml:math></inline-formula>) and added to the aggregated runoff
at each pixel – that is the observed runoff. (v) Based on the observed runoff
and (if applicable) trend at the 48 non-central pixels, TopREML and the
compared baseline method (Top-kriging or universal kriging) are used to predict
runoff at the central pixel. Prediction errors are recorded and the procedure
repeated 1000 times to get the mean and variance of the errors.</p></caption>
            <?xmltex \igopts{width=213.395669pt}?><graphic xlink:href="https://hess.copernicus.org/articles/19/2925/2015/hess-19-2925-2015-f04.pdf"/>

          </fig>

<?xmltex \hack{\newpage}?>
</sec>
<sec id="Ch1.S3.SS2.SSS2">
  <title>Variance estimation and observable features</title>
      <p>A key advantage of the REML framework is its ability to
avoid the downward bias in the covariance function that affects kriging-based
methods (including Top-kriging) when external trend coefficients are
simultaneously estimated. This bias particularly affects the prediction of
the variance. Again, empirical cross-validation analysis does not allow an
assessment of this bias, because the observation data sets used contained only
one observation per location. Numerical simulations, however, allow many
realizations of the underlying stochastic process to be made at each
location, and thus allow the prediction variance to be compared with the
numerical variance. We evaluate TopREML's ability to predict variances (and
therefore evaluate prediction uncertainties) at ungauged locations using the
Monte Carlo procedure on the synthetic catchments described in Fig. <xref ref-type="fig" rid="Ch1.F4"/>.
We construct the observed prediction uncertainty by taking the
standard deviation of the prediction errors across all 1000 Monte Carlo runs
and compare it to the square root of the median predicted variance. The
external trend is omitted from the model specification (i.e., it is <italic>not observed</italic>) in a first experiment, and explicitly included in the model in the
second experiment. We compare TopREML and Top-kriging based on their ability
to model prediction variance. We expect TopREML to provide a better estimate
of the variance than Top-kriging when accounting for observable features.
Because the trend is spatially correlated, omitting it in the model
specification adds a significant spatially correlated component to the error
and Eq. (<xref ref-type="disp-formula" rid="Ch1.E15"/>) should be used to predict the variance. Conversely,
including a trend in the model will cause the remaining error to mostly
consists of (spatially uncorrelated) residuals, so in this case Eq. (<xref ref-type="disp-formula" rid="Ch1.E14"/>) is used.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F5" specific-use="star"><caption><p>Results of the comparative cross-validation analyses of <bold>(a)</bold> specific
runoff and <bold>(b)</bold> wet season runoff frequency in Nepal, and (c) mean summer streamflow
in Austria. First row: box plots with the quartiles and 95 % confidence
intervals around the median of leave-one-out (LOO) absolute prediction errors.
Compared models are TopREML (TR), Top-kriging (TK), universal kriging (UK),
linear regression models (LM) and the sample mean (LM<inline-formula><mml:math display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:math></inline-formula>). Note that without
observable trends (<bold>b</bold> and <bold>c</bold>), LM and LM<inline-formula><mml:math display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:math></inline-formula> are equivalent. Second row: catchment
level performance of TopREML. Signatures predicted by TopREML for each catchment
in the leave-one-out cross-validation analysis are plotted against the corresponding
observed signature. Diagonal lines (<inline-formula><mml:math display="inline"><mml:mrow><mml:mi>x</mml:mi><mml:mo>=</mml:mo><mml:mi>y</mml:mi></mml:mrow></mml:math></inline-formula>) representing perfect fit are also displayed
for indicative purposes.</p></caption>
            <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://hess.copernicus.org/articles/19/2925/2015/hess-19-2925-2015-f05.pdf"/>

          </fig>

<?xmltex \hack{\newpage}?>
</sec>
</sec>
</sec>
<sec id="Ch1.S4">
  <title>Results</title>
<sec id="Ch1.S4.SS1">
  <title>Case studies</title>
      <p>Basin-level predictions of the considered signatures are
presented in Fig. <xref ref-type="fig" rid="Ch1.F5"/> for the three cross-validation analyses
described in Sect. <xref ref-type="sec" rid="Ch1.S3.SS1"/>. Figure <xref ref-type="fig" rid="Ch1.F5"/> also provides box plots summarizing the distribution of the ensuing cross-validation errors. In
the three analyses, the prediction errors related to TopREML were comparable
to the best alternate method: a linear model for annual specific runoff
(Nepal) and Top-kriging for runoff frequency (Nepal) and summer runoff
(Austria).</p>
      <p>Figure <xref ref-type="fig" rid="Ch1.F5"/>a presents results for annual specific runoff in
Nepal and shows that observable features play a significant role in the
prediction of runoff. The linear model showed a highly significant effect of
annual precipitation (<inline-formula><mml:math display="inline"><mml:mrow><mml:msubsup><mml:mover accent="true"><mml:mi mathvariant="italic">τ</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mtext>yearlyPrecip</mml:mtext><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="normal">LM</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:mn>0.99</mml:mn></mml:mrow></mml:math></inline-formula>, t-stat:
9.1) a moderately significant effect of altitude
(<inline-formula><mml:math display="inline"><mml:mrow><mml:msubsup><mml:mover accent="true"><mml:mi mathvariant="bold-italic">τ</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mtext>meanElev</mml:mtext><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="normal">LM</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:mn>0.39</mml:mn></mml:mrow></mml:math></inline-formula>, t-stat: 2.5) and an overall fit of
<inline-formula><mml:math display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>=</mml:mo><mml:mn>0.63</mml:mn></mml:mrow></mml:math></inline-formula>. The positive sign of the altitude coefficient can be attributed
to the effects of glacial melt on runoff, which are more significant at
higher altitudes, while the average effect of evapotranspiration explains the
negative and noisy intercept of <inline-formula><mml:math display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>313 mm yr<inline-formula><mml:math display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>. While including rainfall and
altitude in the model decreased the median absolute error by 43 % (LM to
LM<inline-formula><mml:math display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:math></inline-formula>), further increasing the complexity of the model by allowing for
spatial (UK) and topological effects (TK and TR) did not improve the
predictive performance: residuals from the linear regression appeared to be
correlated at a range shorter than the distance between the gauges in Nepal.
Indeed, fitting the empirical semi-variograms with exponential functions
revealed spatial correlation ranges that were on the order of the mean
distance between IDA centroids for annual streamflow (21.6 km), and
significantly below that distance (7.0 km) for the regression residuals.
Nonetheless, the lack of parsimony of TopREML did not appear to affect its
predictive performance, which almost perfectly reproduced the performance of
the linear model – the most parsimonious method.</p>
      <p>In contrast, the analysis revealed significant spatial effects for both
runoff frequency in Nepal, which has a much larger spatial correlation range
than annual streamflow (426 km – presumably set by meteorology and the
correlation range of storm events; <xref ref-type="bibr" rid="bib1.bibx31" id="text.49"/>), and summer
average streamflow in Austria, which has a range of 19.1 km but is sampled by
a much higher density of streamflow gauges than in Nepal. Allowing for
spatial correlation in the residuals (UK) decreased the median absolute
error by 11 % compared to the linear model (LM) for runoff frequency in
Nepal and 31 % for summer runoff in Austria. Accounting for topological
effects further reduced errors by 33 % (runoff frequency) and 40 % (summer
runoff) for both TopREML and Top-kriging methods.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F6" specific-use="star"><caption><p>Results of the Monte Carlo experiments. <bold>(a)</bold> and <bold>(b)</bold> display
the effect of network complexity on the performance of TopREML relative to universal
kriging. Network complexity is given as the ratio of basins <italic>beyond</italic>
(<inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">outer</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>) and <italic>within</italic> (<inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">inner</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>) the spatial correlation range that
are flow-connected  – minimum network complexity is modeled when <italic>no</italic> basins
beyond and <italic>all</italic> basins within the range are flow-connected. Relative
performance is computed at each Monte Carlo run as the difference in relative
prediction errors between universal kriging and TopREML (i.e., <inline-formula><mml:math display="inline"><mml:mrow><mml:mi mathvariant="normal">RE</mml:mi><mml:mo>[</mml:mo><mml:mi mathvariant="normal">UK</mml:mi><mml:mo>]</mml:mo><mml:mo>-</mml:mo><mml:mi mathvariant="sans-serif">RE</mml:mi><mml:mo>[</mml:mo><mml:mi mathvariant="sans-serif">TR</mml:mi><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula> on
<bold>(a)</bold> and <bold>(b)</bold>). The graphs display the expectation and standard
deviation of that difference over the 1000 Monte Carlo runs. <bold>(c)</bold> presents
the observed (grey boxes) and predicted (black error bars) standard deviation on
the prediction errors for Top-kriging (TK) and TopREML (TR). Note that the
slight downward biases that appear on the graph remain below 1 % of the
expected value of the predicted outcome.</p></caption>
          <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://hess.copernicus.org/articles/19/2925/2015/hess-19-2925-2015-f06.pdf"/>

        </fig>

<?xmltex \hack{\newpage}?>
</sec>
<sec id="Ch1.S4.SS2">
  <title>Numerical simulation</title>
      <p>Results from the Monte Carlo analysis are presented in Fig. <xref ref-type="fig" rid="Ch1.F6"/>,
showing the outcomes of the two numerical experiments
described in Sect. <xref ref-type="sec" rid="Ch1.S3.SS2"/>.</p>
      <p>Figure <xref ref-type="fig" rid="Ch1.F6"/>a and b shows the effect of network complexity on
the performance of TopREML relative to the baseline performance of universal
kriging. This effect is measured as the difference in the relative errors of
the two methods as a function of <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">outer</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, the ratio of basins
<italic>beyond</italic> the spatial correlation range of runoff that <italic>are</italic>
flow-connected, and <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">inner</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, the ratio of basins <italic>within</italic> range
that are <italic>not</italic> flow-connected. The effect is expected to increase with
<inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">outer</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and decrease with <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">inner</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, reaching zero when 100 % of
observed basins lie within the spatial correlation range and 0 % of the
basins beyond the range are flow-connected. In that case (not shown in the
figure), TopREML and universal kriging perform similarly and the mean
difference in the relative error of the two methods is zero.  Figure <xref ref-type="fig" rid="Ch1.F6"/>a
shows that the relative performance of TopREML improves
with the number of flow-connected catchments that are located beyond the
spatial correlation range, and which are therefore not properly accounted for
by universal kriging. Conversely, Fig. <xref ref-type="fig" rid="Ch1.F6"/>b shows that the
relative performance of TopREML decreases with decreasing network effects
within the spatial correlation range. A linear regression of the relative
performance of TopREML against <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">outer</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">inner</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> showed that both
trends are significant and in the expected direction. However, the positive
coefficient associated with <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">outer</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> (9.1, t-stat: 11.9) is larger in
absolute value and more statistically significant than the negative
coefficient associated with <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">inner</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> (<inline-formula><mml:math display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>2.6, t-stat: <inline-formula><mml:math display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>2.6), which suggests
that the benefits of including distant flow-connected basins outweigh the
costs of discarding nearby (but unconnected) IDAs.</p>
      <p>In Fig. <xref ref-type="fig" rid="Ch1.F6"/>c, the Monte Carlo analysis showed that model
uncertainty is well predicted by TopREML and strongly underestimated by
Top-kriging, both with and without considering an external trend. Including a
trend in the model reduces the prediction variance of TopREML – this effect
is expected because the variance explained by the trend is no longer included
in the modeling error <inline-formula><mml:math display="inline"><mml:mi mathvariant="italic">ϵ</mml:mi></mml:math></inline-formula>. The decrease in the prediction variance is
well modeled by TopREML, which predicts the observe model uncertainty almost
exactly.</p>
</sec>
</sec>
<sec id="Ch1.S5">
  <title>Discussion</title>
<sec id="Ch1.S5.SS1">
  <title>Performance of TopREML</title>
      <p>Cross-validation outcomes suggest that TopREML is an attractive operational
tool for predicting streamflow in ungauged basins. The method performs as well
as the best alternative approach in the prediction of the considered runoff
signatures in Nepal and Austria, and significantly outperforms Top-kriging in
the prediction of modeling uncertainties in the numerical analysis.
Two distinguishing features of TopREML are responsible for these encouraging
results. First, TopREML incorporates the topology of the stream network by
restricting correlations to runoff observed at flow-connected catchments.
This allows TopREML to explicitly model the higher correlation in streamflow
anticipated along channels, but comes at the expense of discarding
correlations with neighboring, but not flow-connected catchments. Such
correlations can, for instance, be driven by large-scale weather patterns.
This tradeoff was investigated in a Monte Carlo analysis showing that
modeling performance increases more rapidly when including distant
flow-connected basins (slope in Fig. <xref ref-type="fig" rid="Ch1.F6"/>a) than it decreases
when discarding nearby unconnected basins (slope in Fig. <xref ref-type="fig" rid="Ch1.F6"/>b).
Further, empirical correlograms of Austrian summer runoff (Fig. <xref ref-type="fig" rid="Ch1.F2"/>)
reveal significantly lower and shorter-ranged spatial
correlations when basins are not flow-connected. Both results suggest that
the benefit of accounting for network effects on correlations outweighs the
cost of losing some information on the correlation between unconnected
basins. Second, the REML framework provides an
unbiased estimation of variance parameters, even when accounting for
observable features. This allows TopREML to accurately predict modeling
uncertainties even for highly trended and auto-correlated runoff signatures,
as visible in the Monte Carlo analysis presented on Fig. <xref ref-type="fig" rid="Ch1.F6"/>c.
By contrast, the expected downward bias in the kriging estimation of partial
sills <xref ref-type="bibr" rid="bib1.bibx10" id="paren.50"/> is clearly visible in the underestimation
of prediction uncertainties by the Top-kriging method.</p>
      <p>TopREML also has considerably lower computational requirements than
Top-kriging, both in terms of input data and optimization complexity. Unlike
Top-kriging, where watershed polygons are necessary inputs for the
regularization procedure, vectors are not fundamentally indispensable for
TopREML. Indeed, TopREML does not rely on a distributed point process but
assumes homogenous IDAs. It follows that its only fundamental data
requirement is a table (i.e., a data.frame) of IDAs displaying the observed
regionalization variable and the area, centroid coordinates and network
position (i.e., own ID and downstream ID) of the IDA. When considering
runtime, both methods rely on numerical optimization, but Top-kriging uses it
to back calculate the point semi-variogram in its regularization procedure.
This may substantially increase the dimensionality of the optimization task,
depending on the grid resolution chosen for the discretization of the
catchment areas, which in turn has a highly significant effect on prediction
performances <xref ref-type="bibr" rid="bib1.bibx42" id="paren.51"/>. By contrast, the dimensionality of the
optimization in TopREML is driven by the number of catchments, not an
arbitrary grid. More importantly, TopREML admits a well-defined objective
function, the restricted likelihood, that is differentiable if the selected
variogram function is differentiable. This allows gradient optimization
methods to be used, which are much less computationally intensive than the
stochastic algorithm required by Top-kriging. The resampling analysis shown
in Appendix <xref ref-type="sec" rid="App1.Ch1.S3"/> suggests that TopREML reduces the computation runtime by
an order of magnitude, relative to the implementation of Top-kriging in the
<italic>rtop</italic> package, for comparable prediction performances.</p>
      <p>Despite these encouraging results, TopREML is subject to stringent linearity
assumptions on the nature of the regionalized runoff signature. The predicted
variable should aggregate linearly both on hillslope surfaces and at channel
junctions that are subject to mass conservation. This limitation also affects
block-kriging approaches, as pointed out by <xref ref-type="bibr" rid="bib1.bibx42" id="text.52"/>, who suggest
that Top-kriging can still be applied, <italic>in an approximate way</italic> on
non-conservative variables. Here we assert that hydrologic arguments can be
used to convert some non-conservative variables into linearly aggregating
processes using simple algebraic transformations. This theoretically more
robust approach was here successfully tested in a cross-validation analysis
of runoff frequency in Nepal.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F7"><caption><p>Global distribution of factors affecting model selection. <bold>(a)</bold> Spatial
repartition of the 8540 stream gauges indexed by the Global Runoff Data Center
<xref ref-type="bibr" rid="bib1.bibx16" id="paren.53"/>. <bold>(b)</bold> Dominant rainfall type: orographic rainfall are assumed to
occur in mountains, as defined by the United Nations Environment Programme
<xref ref-type="bibr" rid="bib1.bibx52" id="paren.54"/>, and have a typical range of 1–10 km <xref ref-type="bibr" rid="bib1.bibx1" id="paren.55"/>.
Convective rainfall are assumed dominant in region with a high frequency of lighting
strikes (<inline-formula><mml:math display="inline"><mml:mrow><mml:mo>≥</mml:mo><mml:mn>10</mml:mn><mml:mo>/</mml:mo></mml:mrow></mml:math></inline-formula>km<inline-formula><mml:math display="inline"><mml:msup><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:math></inline-formula> yr<inline-formula><mml:math display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>) as recorded by the TRMM satellite <xref ref-type="bibr" rid="bib1.bibx28" id="paren.56"/>
and have a typical scale of 10–100 km <xref ref-type="bibr" rid="bib1.bibx5 bib1.bibx44" id="paren.57"/>.
Frontal precipitations are assumed dominant in the remaining regions and have a
typical scale in excess of 100 km <xref ref-type="bibr" rid="bib1.bibx5 bib1.bibx54" id="paren.58"/>. <bold>(c)</bold> Drainage
density is estimated based on the  Hydro1k data set <xref ref-type="bibr" rid="bib1.bibx24" id="paren.59"/>, using
154 large basins <xref ref-type="bibr" rid="bib1.bibx50" id="paren.60"/> as units of analysis. Drainage densities are displayed
in three classes: low (0.01–0.025 km<inline-formula><mml:math display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>), medium (0.025–0.027 km<inline-formula><mml:math display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>) and
high (<inline-formula><mml:math display="inline"><mml:mo>&gt;</mml:mo></mml:math></inline-formula> 0.027 km<inline-formula><mml:math display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>).</p></caption>
          <?xmltex \igopts{width=236.157874pt}?><graphic xlink:href="https://hess.copernicus.org/articles/19/2925/2015/hess-19-2925-2015-f07.pdf"/>

        </fig>

<?xmltex \hack{\newpage}?>
</sec>
<sec id="Ch1.S5.SS2">
  <title>Model selection</title>
      <p>The regionalization methods assessed in the cross-validation analysis range
from simple linear regressions with strong independence assumptions, to
complex geostatistical methods that allow for both spatial and topological
correlations. Results indicate that while complex methods perform best in
general, there seems to be a threshold beyond which increasing the
complexity of the statistical method does not significantly improve the
prediction performance: while a linear model is better than a simple average
for the prediction of annual streamflow in Nepal (Fig. <xref ref-type="fig" rid="Ch1.F5"/>a),
accounting for spatial (UK) and topological (TR) correlation does not further
improve predictions. In that situation, parsimony prescribes selecting the
least complex of the best performing methods.</p>
      <p>Under these conditions, the selection of the optimal method is driven by the
interplay between the layout of the gauges and the spatial correlation range
of the considered runoff signature. A dense network of flow gauges is
necessary for geostatistical methods to properly estimate the semi-variogram
and improve on predictions from linear regressions – the case studies
suggest that the mean distance between the gauges must be on the order of
half the spatial correlation range of the runoff signature. Sparser gauge
densities do not allow geostatistical methods to capture spatial correlations
and their prediction is effectively driven by the deterministic components of
the model, i.e., the intercept and (when available) observable features.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F8"><caption><p>The probability density functions assumed in Eqs. (<xref ref-type="disp-formula" rid="App1.Ch1.E6"/>) and (<xref ref-type="disp-formula" rid="App1.Ch1.E2"/>) represent well the
case of adjacent ellipsoidal watersheds illustrated in <bold>(a)</bold>.  <bold>(b)</bold> displays
the histogram of distance between two random points <italic>within</italic> a watershed,
overlaid by a plot of Eq. (<xref ref-type="disp-formula" rid="App1.Ch1.E6"/>) with <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>a</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>a</mml:mi><mml:mi>D</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:math></inline-formula>.  <bold>(c)</bold> displays
the histogram of distance between one random point <italic>on each</italic> watershed,
overlaid by a plot of Eq. (<xref ref-type="disp-formula" rid="App1.Ch1.E2"/>) with <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>a</mml:mi><mml:mi>c</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:math></inline-formula>.</p></caption>
          <?xmltex \igopts{width=213.395669pt}?><graphic xlink:href="https://hess.copernicus.org/articles/19/2925/2015/hess-19-2925-2015-f08.pdf"/>

        </fig>

      <p>An interesting tradeoff arises if observable features are themselves
spatially correlated and explain a significant part of the spatial
correlation of the predicted variable. Including these observable features in
the model reduces the correlation scale of the residuals, possibly crossing
the threshold below which geostatistics are not the most parsimonious
approach. In Nepal, controlling for rainfall reduced the spatial correlation
range of annual streamflow from 21.6 to 7 km – well below the mean
distance between the gauges (13.9 km). In that case there is a tradeoff
between relying on observable features or variance information to make a
prediction, and parsimony and stationarity considerations come into play when
selecting the regionalization model. For instance, while parsimony generally
prescribes the use of observable features, a climate may be less stationary
– and therefore a less reliable external trend – than embedded geology or
geomorphology.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F9"><caption><p>Algorithmic chart of the provided TopREML implementation. Dashed frames
and arrows represent vector data and operations and the bold arrow represents the
step requiring numerical optimization. The complexity of the computational tasks
represented by the remaining plain arrows is driven by matrix inversion, which is
of polynomial complexity. In the figure, <inline-formula><mml:math display="inline"><mml:mi>X</mml:mi></mml:math></inline-formula> is a matrix of observed covariates
and <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold-italic">y</mml:mi></mml:math></inline-formula> a vector of outcomes measured at the available gauges, as defined in Eq. (<xref ref-type="disp-formula" rid="Ch1.E1"/>); <inline-formula><mml:math display="inline"><mml:mi mathvariant="bold-italic">x</mml:mi></mml:math></inline-formula> is
a vector of identical covariates observed at the prediction location. <inline-formula><mml:math display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math display="inline"><mml:mi>U</mml:mi></mml:math></inline-formula> and
<inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>c</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> are matrices of relative catchment areas, network topology and inter-centroidal
distances of the available gauges, as defined in Eq. (<xref ref-type="disp-formula" rid="Ch1.E6"/>); <inline-formula><mml:math display="inline"><mml:mi>a</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>U</mml:mi><mml:mi mathvariant="normal">out</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and
<inline-formula><mml:math display="inline"><mml:mrow><mml:msubsup><mml:mi>c</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow><mml:mi mathvariant="normal">out</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> are equivalent matrices for the prediction location. <inline-formula><mml:math display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math display="inline"><mml:mi mathvariant="italic">ϕ</mml:mi></mml:math></inline-formula>
and <inline-formula><mml:math display="inline"><mml:mi mathvariant="italic">ξ</mml:mi></mml:math></inline-formula> are estimated variance parameters as defined in Eq. (<xref ref-type="disp-formula" rid="Ch1.E3"/>); <inline-formula><mml:math display="inline"><mml:mi mathvariant="italic">τ</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math display="inline"><mml:mi>u</mml:mi></mml:math></inline-formula>
and <inline-formula><mml:math display="inline"><mml:mi>G</mml:mi></mml:math></inline-formula> are the estimated fixed and random effects (Eq. <xref ref-type="disp-formula" rid="Ch1.E11"/>) and
variance–covariance matrix (Eq. <xref ref-type="disp-formula" rid="Ch1.E7"/>); <inline-formula><mml:math display="inline"><mml:mi>g</mml:mi></mml:math></inline-formula> is the estimated covariance at
the prediction location (used in Eq. <xref ref-type="disp-formula" rid="Ch1.E13"/>). Finally, <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi mathvariant="normal">out</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math display="inline"><mml:mrow><mml:mtext>Var</mml:mtext><mml:mo>(</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi mathvariant="normal">out</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:mi>y</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> are
the predicted outcome and the related prediction variance.</p></caption>
          <?xmltex \igopts{width=241.848425pt}?><graphic xlink:href="https://hess.copernicus.org/articles/19/2925/2015/hess-19-2925-2015-f09.pdf"/>

        </fig>

      <?xmltex \floatpos{t}?><fig id="Ch1.F10"><caption><p>Leave-one-out cross-validation results for Austrian summer flow when
resampling a subset of the training gauges. Computational performances are represented
as the ratio of runtimes for TopREML against Top-kriging. Prediction performances are
represented as the ratio of relative errors. TopREML performances when using gradient-based and stochastic optimization algorithms are represented as circles and
triangles,
respectively. Points represent the median value and error bars represent 90 %
confidence intervals over 200 repetitions.</p></caption>
          <?xmltex \igopts{width=184.942913pt}?><graphic xlink:href="https://hess.copernicus.org/articles/19/2925/2015/hess-19-2925-2015-f10.pdf"/>

        </fig>

      <p>In general, geostatistical approaches improve on the prediction of ungauged
basins if the distance between the stream gauges is significantly smaller
than the spatial correlation scale of runoff. Favorable areas are
characterized by high drainage densities or localized rainfall, in addition
to a high density of streamflow gauges. All three variables are highly
heterogeneously distributed at a global scale, as seen on Fig. <xref ref-type="fig" rid="Ch1.F7"/>.
The multiplicity of local settings likely explains the large
diversity of existing regionalization methods and suggests that the selection
of the optimal regionalization approach has to be made locally.</p>
      <p>Lastly, the decreasing returns to improvements in the complexity of the model
also suggest that the performance of statistical methods for PUB is
ultimately bounded by the spatial heterogeneity of runoff generating
processes. Statistical methods resolve parts of that heterogeneity using the
spatial distribution of observable features (linear regressions) and/or based
on the analysis of the variance of a sample of the predicted variable
(geostatistics). Yet very important parts of the hydrological activity
related to storage and flow path characteristics take place underground: they
cannot be observed and included in the statistical models <xref ref-type="bibr" rid="bib1.bibx21" id="paren.61"/>.
This residual spatial heterogeneity can ultimately only be resolved through a
better understanding of the particular catchment processes governing runoff
in the considered region. Approaches coupling statistical regionalization
with process-based models that assimilate both a conceptual understanding of
catchment-scale processes and the random nature of runoff (e.g.,
<xref ref-type="bibr" rid="bib1.bibx6 bib1.bibx40 bib1.bibx32" id="altparen.62"/>) are
particularly promising.</p>
</sec>
</sec>
<sec id="Ch1.S6" sec-type="conclusions">
  <title>Conclusions</title>
      <p>We introduced TopREML as a method to predict runoff signatures in ungauged
basins. The approach takes into account the spatially correlated nature of
runoff and the nested character of streamflow networks. Unlike kriging
approaches, the restricted maximum likelihood (REML) estimators provide the
best linear unbiased predictor (BLUP) of both the predicted variable and the
associated prediction uncertainty, even when incorporating observable
features in the model.</p>
      <p>The method was successfully tested in cross-validation analyses on mass
conserving (mean streamflow) and non-conservative (runoff frequency) runoff
signatures in Nepal (sparsely gauged) and Austria (densely gauged), where it
matched the performance of the best alternative method: Top-kriging in
Austria and linear regression in Nepal. TopREML outperformed Top-kriging in
the prediction of uncertainty in Monte Carlo simulations and its performance
is robust to the inclusion of observable features.</p>
      <p><?xmltex \hack{\newpage}?>The ability of TopREML to combine deterministic (observable features) and
stochastic (covariance) information to generate a BLUP makes it a
particularly versatile method that can readily be applied in densely gauged
basins, where it takes advantage of spatial covariance information, as well
as data-scarce regions, where it can rely on covariates with spatial
distributions that are increasingly observable thanks to remote-sensing
technology. This flexibility, along with its ability to provide a reliable
estimate of the prediction uncertainty, offer considerable scope to use this
computationally inexpensive method for practical PUB applications.</p><?xmltex \hack{\clearpage}?>
</sec>

      
      </body>
    <back><app-group><app id="App1.Ch1.S1">
  <title>Covariance of a spatially averaged process</title>
      <p>The aim of this analysis is to explore the likely forms of a
correlation structure between spatially aggregated processes, given that the
underlying point-scale processes are also spatially correlated. In order to
maintain tractability, the analysis will consider a strongly idealized case.
While we anticipate deviations from the results in non-ideal situations, we
nonetheless interpret this idealized analysis as offering insight that
constrains the choice of correlation function in the TopREML analysis.</p>
      <p>Assuming that the underlying point-scale process <inline-formula><mml:math display="inline"><mml:mi>Y</mml:mi></mml:math></inline-formula> is conservative, the
aggregated process <inline-formula><mml:math display="inline"><mml:mrow><mml:msubsup><mml:mi>y</mml:mi><mml:mi>k</mml:mi><mml:mo>′</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> related to the subcatchment <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> of gauge <inline-formula><mml:math display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula> can
be expressed as

              <disp-formula id="App1.Ch1.Ex1"><mml:math display="block"><mml:mrow><mml:msubsup><mml:mi>y</mml:mi><mml:mi>k</mml:mi><mml:mo>′</mml:mo></mml:msubsup><mml:mo>=</mml:mo><mml:mfrac><mml:mn mathvariant="normal">1</mml:mn><mml:mrow><mml:msub><mml:mi>A</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:mfrac><mml:munder><mml:mo movablelimits="false">∫</mml:mo><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:munder><mml:mi>Y</mml:mi><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>)</mml:mo><mml:mi mathvariant="normal">d</mml:mi><mml:mi>x</mml:mi><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>

        where <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>A</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the area of <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>.</p>
      <p>To proceed, we make the assumption that the area of the drainage areas <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>
are approximately equal. While this is a strong constraint, under situations
where gauges are placed near confluences and where subcatchments for a given
stream ratio are adequately monitored by the gauge network, Horton scaling
ensures that the drainage areas are of a similar order of magnitude. Thus, we
will take <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>A</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mi>A</mml:mi><mml:mo>∀</mml:mo><mml:mi>k</mml:mi></mml:mrow></mml:math></inline-formula>. The subcatchments are further assumed to have
similar shapes and (by definition) do not overlap.</p>
      <p>Following <xref ref-type="bibr" rid="bib1.bibx10" id="text.63"/> (p. 68), the covariance between
two aggregated random variables <inline-formula><mml:math display="inline"><mml:mrow><mml:msubsup><mml:mi>y</mml:mi><mml:mi>k</mml:mi><mml:mo>′</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math display="inline"><mml:mrow><mml:msubsup><mml:mi>y</mml:mi><mml:mi>m</mml:mi><mml:mo>′</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> is expressed as a function
of the covariogram <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>C</mml:mi><mml:mi mathvariant="normal">P</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mo>⋅</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> of the underlying point-scale process:

              <disp-formula specific-use="align" content-type="numbered"><mml:math display="block"><mml:mtable displaystyle="true"><mml:mtr><mml:mtd><mml:mrow><mml:mtext>Cov</mml:mtext><mml:mfenced close=")" open="("><mml:msubsup><mml:mi>y</mml:mi><mml:mi>k</mml:mi><mml:mo>′</mml:mo></mml:msubsup><mml:mo>,</mml:mo><mml:msubsup><mml:mi>y</mml:mi><mml:mi>m</mml:mi><mml:mo>′</mml:mo></mml:msubsup></mml:mfenced></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:mfrac><mml:mn mathvariant="normal">1</mml:mn><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:mfrac><mml:munder><mml:mo movablelimits="false">∫</mml:mo><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:munder><mml:munder><mml:mo movablelimits="false">∫</mml:mo><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mi>m</mml:mi></mml:msub></mml:mrow></mml:munder><mml:msub><mml:mi>C</mml:mi><mml:mi mathvariant="normal">P</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mo>∣</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>∣</mml:mo><mml:mo>)</mml:mo><mml:mi mathvariant="normal">d</mml:mi><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mi mathvariant="normal">d</mml:mi><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:mtd></mml:mtr><mml:mlabeledtr id="App1.Ch1.E1"><mml:mtd/><mml:mtd/><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:munderover><mml:mo movablelimits="false">∫</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mi mathvariant="normal">∞</mml:mi></mml:munderover><mml:mi mathvariant="italic">ν</mml:mi><mml:mo>(</mml:mo><mml:mi>D</mml:mi><mml:mo>)</mml:mo><mml:msub><mml:mi>C</mml:mi><mml:mi mathvariant="normal">P</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi>D</mml:mi><mml:mo>)</mml:mo><mml:mi mathvariant="normal">d</mml:mi><mml:mi>D</mml:mi><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

          where <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mi>m</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> are the surfaces of subcatchments <inline-formula><mml:math display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math display="inline"><mml:mi>m</mml:mi></mml:math></inline-formula>, and
<inline-formula><mml:math display="inline"><mml:mrow><mml:mi mathvariant="italic">ν</mml:mi><mml:mo>(</mml:mo><mml:mi>D</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is the probability density function of the distance between randomly
chosen points within <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mi>m</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> – two identical and non-overlapping
shapes. Analytical expressions for <inline-formula><mml:math display="inline"><mml:mrow><mml:mi mathvariant="italic">ν</mml:mi><mml:mo>(</mml:mo><mml:mi>D</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> can be derived for simple
geometries <xref ref-type="bibr" rid="bib1.bibx29" id="paren.64"><named-content content-type="pre">e.g.,</named-content></xref>, although complex algebraic
expressions typically result. For analytical tractability we adopt a
simplified expression:
          <disp-formula id="App1.Ch1.E2" content-type="numbered"><mml:math display="block"><mml:mrow><mml:mi mathvariant="italic">ν</mml:mi><mml:mo>(</mml:mo><mml:mi>D</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mfenced open="{" close=""><mml:mtable columnspacing="1em" class="cases" rowspacing="0.2ex" columnalign="left left" framespacing="0em"><mml:mtr><mml:mtd><mml:mrow><mml:msub><mml:mi>a</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mi mathvariant="normal">exp</mml:mi><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mfenced close=")" open="("><mml:mo>-</mml:mo><mml:msub><mml:mi>a</mml:mi><mml:mi>D</mml:mi></mml:msub><mml:mi>D</mml:mi><mml:mo>+</mml:mo><mml:msub><mml:mi>a</mml:mi><mml:mi>c</mml:mi></mml:msub><mml:mi>c</mml:mi></mml:mfenced></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mtext>if</mml:mtext><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi>c</mml:mi><mml:mo>-</mml:mo><mml:msub><mml:mi>D</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>≤</mml:mo><mml:mi>D</mml:mi><mml:mo>≤</mml:mo><mml:mi>c</mml:mi><mml:mo>+</mml:mo><mml:msub><mml:mi>D</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn mathvariant="normal">0</mml:mn></mml:mtd><mml:mtd><mml:mrow><mml:mtext>otherwise</mml:mtext><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mfenced></mml:mrow></mml:math></disp-formula>
        which approximates distance frequency function of adjacent
elliptical subcatchments, as shown in Fig. <xref ref-type="fig" rid="Ch1.F8"/>. In Eq. (<xref ref-type="disp-formula" rid="App1.Ch1.E2"/>)
the parameters <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>a</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>a</mml:mi><mml:mi>D</mml:mi></mml:msub><mml:mo>&gt;</mml:mo><mml:msub><mml:mi>a</mml:mi><mml:mi>c</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>D</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>D</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> are positive
functions of <inline-formula><mml:math display="inline"><mml:mi>A</mml:mi></mml:math></inline-formula>, and <inline-formula><mml:math display="inline"><mml:mi>c</mml:mi></mml:math></inline-formula> is the distance between the centroids of the
subcatchments.</p>
      <p>We also assume that the underlying point-scale process is second-order
stationary and follows an exponential correlation function:
          <disp-formula id="App1.Ch1.E3" content-type="numbered"><mml:math display="block"><mml:mrow><mml:msub><mml:mi>C</mml:mi><mml:mi mathvariant="normal">P</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi>D</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="normal">p</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup><mml:mi mathvariant="normal">exp</mml:mi><mml:mfenced close=")" open="("><mml:mo>-</mml:mo><mml:msub><mml:mi>a</mml:mi><mml:mi mathvariant="normal">p</mml:mi></mml:msub><mml:mi>D</mml:mi></mml:mfenced><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
        where <inline-formula><mml:math display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="normal">p</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>a</mml:mi><mml:mi mathvariant="normal">p</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> are respectively the point variance and spatial
range of the process.</p>
      <p>Inserting Eqs. (<xref ref-type="disp-formula" rid="App1.Ch1.E2"/>) and (<xref ref-type="disp-formula" rid="App1.Ch1.E3"/>) into Eq. (<xref ref-type="disp-formula" rid="App1.Ch1.E1"/>) allows
the covariance of the two spatially aggregated random variables to also be
expressed as an exponential function of the distance <inline-formula><mml:math display="inline"><mml:mi>c</mml:mi></mml:math></inline-formula> between their
supports

              <disp-formula id="App1.Ch1.E4" content-type="numbered"><mml:math display="block"><mml:mrow><mml:msub><mml:mi>C</mml:mi><mml:mi>A</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi>c</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mi mathvariant="italic">ξ</mml:mi><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mi mathvariant="normal">exp</mml:mi><mml:mfenced open="(" close=")"><mml:mo>-</mml:mo><mml:mi mathvariant="italic">ϕ</mml:mi><mml:mi>c</mml:mi></mml:mfenced><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>

        where <inline-formula><mml:math display="inline"><mml:mrow><mml:mi mathvariant="italic">ξ</mml:mi><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="normal">p</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup><mml:msub><mml:mi>a</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mi>a</mml:mi><mml:mi mathvariant="normal">p</mml:mi></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi>a</mml:mi><mml:mi>D</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math display="inline"><mml:mrow><mml:mo>[</mml:mo><mml:mi mathvariant="normal">exp</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi>a</mml:mi><mml:mi mathvariant="normal">p</mml:mi></mml:msub><mml:msub><mml:mi>D</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi>a</mml:mi><mml:mi>D</mml:mi></mml:msub><mml:msub><mml:mi>D</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> – <inline-formula><mml:math display="inline"><mml:mrow><mml:mi mathvariant="normal">exp</mml:mi><mml:mo>(</mml:mo><mml:mo>-</mml:mo><mml:msub><mml:mi>a</mml:mi><mml:mi mathvariant="normal">p</mml:mi></mml:msub><mml:msub><mml:mi>D</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>a</mml:mi><mml:mi>D</mml:mi></mml:msub><mml:msub><mml:mi>D</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>)</mml:mo><mml:mo>]</mml:mo><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula>
and <inline-formula><mml:math display="inline"><mml:mrow><mml:mi mathvariant="italic">ϕ</mml:mi><mml:mo>=</mml:mo><mml:msub><mml:mi>a</mml:mi><mml:mi mathvariant="normal">p</mml:mi></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi>a</mml:mi><mml:mi>D</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>a</mml:mi><mml:mi>c</mml:mi></mml:msub><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula>. This exponential form was adopted in the covariance
derivation in the main text.</p>
      <p>We note that within this analysis, the spatial aggregation of the point-scale
process creates a nugget variance arising from spatial correlation scales
smaller than the subcatchments. The nugget variance can be derived (for this
idealized case) by considering the average covariance of points within the
catchments:

              <disp-formula specific-use="align" content-type="numbered"><mml:math display="block"><mml:mtable displaystyle="true"><mml:mtr><mml:mtd><mml:mrow><mml:mtext>Cov</mml:mtext><mml:mfenced open="(" close=")"><mml:msubsup><mml:mi>y</mml:mi><mml:mi>k</mml:mi><mml:mo>′</mml:mo></mml:msubsup><mml:mo>,</mml:mo><mml:msubsup><mml:mi>y</mml:mi><mml:mi>k</mml:mi><mml:mo>′</mml:mo></mml:msubsup></mml:mfenced></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:mfrac><mml:mn mathvariant="normal">1</mml:mn><mml:mrow><mml:msup><mml:mi>A</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:mfrac><mml:munder><mml:mo movablelimits="false">∫</mml:mo><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:munder><mml:munder><mml:mo movablelimits="false">∫</mml:mo><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:munder><mml:msub><mml:mi>C</mml:mi><mml:mi mathvariant="normal">P</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mo>∣</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>∣</mml:mo><mml:mo>)</mml:mo><mml:mi mathvariant="normal">d</mml:mi><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mi mathvariant="normal">d</mml:mi><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:mtd></mml:mtr><mml:mlabeledtr id="App1.Ch1.E5"><mml:mtd/><mml:mtd/><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:munderover><mml:mo movablelimits="false">∫</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mi mathvariant="normal">∞</mml:mi></mml:munderover><mml:msub><mml:mi mathvariant="italic">ν</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>(</mml:mo><mml:mi>D</mml:mi><mml:mo>)</mml:mo><mml:msub><mml:mi>C</mml:mi><mml:mi mathvariant="normal">P</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi>D</mml:mi><mml:mo>)</mml:mo><mml:mi mathvariant="normal">d</mml:mi><mml:mi>D</mml:mi><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

          where <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ν</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>(</mml:mo><mml:mi>D</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> now represents the probability density function (pdf) of the distance between two randomly
selected points within <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>:
          <disp-formula id="App1.Ch1.E6" content-type="numbered"><mml:math display="block"><mml:mrow><mml:mi mathvariant="italic">ν</mml:mi><mml:mo>(</mml:mo><mml:mi>D</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mfenced open="{" close=""><mml:mtable rowspacing="0.2ex" class="cases" columnspacing="1em" columnalign="left left" framespacing="0em"><mml:mtr><mml:mtd><mml:mrow><mml:msub><mml:mi>a</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mi mathvariant="normal">exp</mml:mi><mml:mfenced close=")" open="("><mml:mo>-</mml:mo><mml:msub><mml:mi>a</mml:mi><mml:mi>D</mml:mi></mml:msub><mml:mi>D</mml:mi></mml:mfenced></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mtext>if </mml:mtext><mml:mn mathvariant="normal">0</mml:mn><mml:mo>≤</mml:mo><mml:mi>D</mml:mi><mml:mo>≤</mml:mo><mml:msub><mml:mi>D</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn mathvariant="normal">0</mml:mn></mml:mtd><mml:mtd><mml:mrow><mml:mtext>otherwise</mml:mtext><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mfenced></mml:mrow></mml:math></disp-formula>
        where <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>D</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> is the maximum distance between two points within <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>. Again,
inserting Eqs. (<xref ref-type="disp-formula" rid="App1.Ch1.E6"/>) and (<xref ref-type="disp-formula" rid="App1.Ch1.E3"/>) into Eq. (<xref ref-type="disp-formula" rid="App1.Ch1.E5"/>), we get
the nugget variance resulting from spatial aggregation:

              <disp-formula id="App1.Ch1.E7" content-type="numbered"><mml:math display="block"><mml:mrow><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>A</mml:mi><mml:mo>,</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="normal">p</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup><mml:msub><mml:mi>a</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mi>a</mml:mi><mml:mi mathvariant="normal">p</mml:mi></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi>a</mml:mi><mml:mi>D</mml:mi></mml:msub></mml:mrow></mml:mfrac><mml:mfenced close="]" open="["><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:mi mathvariant="normal">exp</mml:mi><mml:mo>(</mml:mo><mml:mo>-</mml:mo><mml:msub><mml:mi>a</mml:mi><mml:mi mathvariant="normal">p</mml:mi></mml:msub><mml:msub><mml:mi>D</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>a</mml:mi><mml:mi>D</mml:mi></mml:msub><mml:msub><mml:mi>D</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>)</mml:mo></mml:mfenced><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>

        Therefore, under the aforementioned assumptions, catchment-scale variance
parameters <inline-formula><mml:math display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math display="inline"><mml:mi mathvariant="italic">ξ</mml:mi></mml:math></inline-formula> in Eq. (<xref ref-type="disp-formula" rid="Ch1.E6"/>) can be expressed in
terms of point-scale parameters:

              <disp-formula specific-use="align" content-type="numbered"><mml:math display="block"><mml:mtable displaystyle="true"><mml:mlabeledtr id="App1.Ch1.E8"><mml:mtd/><mml:mtd><mml:mrow><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="normal">p</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup><mml:msub><mml:mi>a</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mi>a</mml:mi><mml:mi mathvariant="normal">p</mml:mi></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi>a</mml:mi><mml:mi>D</mml:mi></mml:msub></mml:mrow></mml:mfrac><mml:mfenced open="[" close="]"><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:mi mathvariant="normal">exp</mml:mi><mml:mo>(</mml:mo><mml:mo>-</mml:mo><mml:msub><mml:mi>a</mml:mi><mml:mi mathvariant="normal">p</mml:mi></mml:msub><mml:msub><mml:mi>D</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>a</mml:mi><mml:mi>D</mml:mi></mml:msub><mml:msub><mml:mi>D</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>)</mml:mo></mml:mfenced><mml:mo>.</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="App1.Ch1.E9"><mml:mtd/><mml:mtd><mml:mi mathvariant="italic">ξ</mml:mi></mml:mtd><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mi mathvariant="normal">exp</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi>a</mml:mi><mml:mi mathvariant="normal">p</mml:mi></mml:msub><mml:msub><mml:mi>D</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi>a</mml:mi><mml:mi>D</mml:mi></mml:msub><mml:msub><mml:mi>D</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>)</mml:mo><mml:mo>-</mml:mo><mml:mi mathvariant="normal">exp</mml:mi><mml:mo>(</mml:mo><mml:mo>-</mml:mo><mml:msub><mml:mi>a</mml:mi><mml:mi mathvariant="normal">p</mml:mi></mml:msub><mml:msub><mml:mi>D</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>a</mml:mi><mml:mi>D</mml:mi></mml:msub><mml:msub><mml:mi>D</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:mi mathvariant="normal">exp</mml:mi><mml:mo>(</mml:mo><mml:mo>-</mml:mo><mml:msub><mml:mi>a</mml:mi><mml:mi mathvariant="normal">p</mml:mi></mml:msub><mml:msub><mml:mi>D</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>a</mml:mi><mml:mi>D</mml:mi></mml:msub><mml:msub><mml:mi>D</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:mfrac><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula></p>
</app>

<app id="App1.Ch1.S2">
  <title>Propagation of runoff frequency in a stream network</title>
      <p>We describe runoff occurrence as a binary random variable taking
the value of 1 if an increase in daily streamflow occurs and 0 otherwise. If
runoff events are uncorrelated in time, the random variable follows a
Bernouilli distribution with frequency <inline-formula><mml:math display="inline"><mml:mi mathvariant="italic">λ</mml:mi></mml:math></inline-formula>. At a given gauge on a given
day, the random variable takes a value of 0 if <italic>all</italic> of the upstream
gauges take a value of 0.</p>
      <p>In a simple situation with two upstream sub-basins described by the random
variables <inline-formula><mml:math display="inline"><mml:mi>X</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math display="inline"><mml:mi>Y</mml:mi></mml:math></inline-formula>, the frequency <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>P</mml:mi><mml:mi>Z</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> of the random variable
<inline-formula><mml:math display="inline"><mml:mrow><mml:mi>Z</mml:mi><mml:mo>=</mml:mo><mml:mi mathvariant="normal">max</mml:mi><mml:mo>(</mml:mo><mml:mi>X</mml:mi><mml:mo>,</mml:mo><mml:mi>Y</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> can be described as

              <disp-formula specific-use="align"><mml:math display="block"><mml:mtable displaystyle="true"><mml:mtr><mml:mtd><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mi>Z</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mi mathvariant="normal">!</mml:mi><mml:mi>X</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">!</mml:mi><mml:mi>Y</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mi mathvariant="normal">!</mml:mi><mml:mi>X</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mi mathvariant="normal">!</mml:mi><mml:mi>Y</mml:mi><mml:mo>∣</mml:mo><mml:mi mathvariant="normal">!</mml:mi><mml:mi>X</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mi mathvariant="normal">!</mml:mi><mml:mi>X</mml:mi></mml:mrow></mml:msub><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mi>Y</mml:mi><mml:mo>∣</mml:mo><mml:mi mathvariant="normal">!</mml:mi><mml:mi>X</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mi>X</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mi>Y</mml:mi><mml:mo>∣</mml:mo><mml:mi mathvariant="normal">!</mml:mi><mml:mi>X</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>

          where <inline-formula><mml:math display="inline"><mml:mrow><mml:mi mathvariant="normal">!</mml:mi><mml:mi>X</mml:mi></mml:mrow></mml:math></inline-formula> stands for the event <inline-formula><mml:math display="inline"><mml:mrow><mml:mi>X</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula>. Applying the law of total
probabilities to substitute <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mi>Y</mml:mi><mml:mo>∣</mml:mo><mml:mi mathvariant="normal">!</mml:mi><mml:mi>X</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> gives

              <disp-formula id="App1.Ch1.Ex6"><mml:math display="block"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mi>Z</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mi>X</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:mfenced close=")" open="("><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:mfrac><mml:mrow><mml:msub><mml:mi>P</mml:mi><mml:mi>Y</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mi>X</mml:mi></mml:msub><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mi>Y</mml:mi><mml:mo>∣</mml:mo><mml:mi>X</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mi>X</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mfenced><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>

        The covariance of <inline-formula><mml:math display="inline"><mml:mi>X</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math display="inline"><mml:mi>Y</mml:mi></mml:math></inline-formula> can be derived as

              <disp-formula id="App1.Ch1.Ex7"><mml:math display="block"><mml:mrow><mml:mtext>Cov</mml:mtext><mml:mi>X</mml:mi><mml:mo>,</mml:mo><mml:mi>Y</mml:mi><mml:mo>=</mml:mo><mml:mtext>E</mml:mtext><mml:mi>X</mml:mi><mml:mi>Y</mml:mi><mml:mo>-</mml:mo><mml:mtext>E</mml:mtext><mml:mi>X</mml:mi><mml:mtext>E</mml:mtext><mml:mi>Y</mml:mi><mml:mo>=</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mi>X</mml:mi></mml:msub><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mi>Y</mml:mi><mml:mo>∣</mml:mo><mml:mi>X</mml:mi></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mi>X</mml:mi></mml:msub><mml:msub><mml:mi>P</mml:mi><mml:mi>Y</mml:mi></mml:msub></mml:mrow></mml:math></disp-formula>

        with <inline-formula><mml:math display="inline"><mml:mrow><mml:mtext>E</mml:mtext><mml:mi>X</mml:mi><mml:mi>Y</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>⋅</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mi mathvariant="normal">!</mml:mi><mml:mi>X</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">!</mml:mi><mml:mi>Y</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>⋅</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mi mathvariant="normal">!</mml:mi><mml:mi>X</mml:mi><mml:mo>,</mml:mo><mml:mi>Y</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>⋅</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mi>X</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="normal">!</mml:mi><mml:mi>Y</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>⋅</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mi>X</mml:mi><mml:mo>,</mml:mo><mml:mi>Y</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mi>X</mml:mi></mml:msub><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mi>Y</mml:mi><mml:mo>∣</mml:mo><mml:mi>X</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>. Finally, substituting <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>P</mml:mi><mml:mi>X</mml:mi></mml:msub><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mi>Y</mml:mi><mml:mo>∣</mml:mo><mml:mi>X</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> for the
covariance expression, yields:

              <disp-formula specific-use="align"><mml:math display="block"><mml:mtable displaystyle="true"><mml:mtr><mml:mtd><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mi>Z</mml:mi></mml:msub></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mi>X</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:mfenced open="(" close=")"><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:mfrac><mml:mrow><mml:msub><mml:mi>P</mml:mi><mml:mi>Y</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:mfenced open="[" close="]"><mml:mtext>Cov</mml:mtext><mml:mi>X</mml:mi><mml:mo>,</mml:mo><mml:mi>Y</mml:mi><mml:mo>+</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mi>X</mml:mi></mml:msub><mml:msub><mml:mi>P</mml:mi><mml:mi>Y</mml:mi></mml:msub></mml:mfenced></mml:mrow><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mi>X</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mfenced></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mi>X</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mi>Y</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:mtext>Cov</mml:mtext><mml:mi>X</mml:mi><mml:mo>,</mml:mo><mml:mi>Y</mml:mi><mml:mo>.</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula></p>
      <p>Extending the above derivation to multiple sub-basins and neglecting the
covariance term leads to a linear relation between runoff frequencies at
gauge <inline-formula><mml:math display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula> and at upstream gauges in the following form:

              <disp-formula id="App1.Ch1.E10" content-type="numbered"><mml:math display="block"><mml:mrow><mml:mi mathvariant="normal">ln</mml:mi><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msub><mml:mi mathvariant="italic">λ</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:mo>≈</mml:mo><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>k</mml:mi><mml:mo>∈</mml:mo><mml:msub><mml:mtext>UP</mml:mtext><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:munderover><mml:mi>ln⁡</mml:mi><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:msub><mml:mi mathvariant="italic">λ</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>

        Thus, if runoff pulses occur independently for each sub-basin, TopREML can be
applied to ln<inline-formula><mml:math display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:mi mathvariant="italic">λ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> (setting <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi>a</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula>), to estimate runoff frequency at
ungauged sites.</p><?xmltex \hack{\newpage}?>
</app>

<app id="App1.Ch1.S3">
  <title>Computational performance of TopREML</title>
      <p>An algorithmic chart of TopREML, as implemented in the provided
script, is presented in Fig. <xref ref-type="fig" rid="Ch1.F9"/>. IDAs and the topology of the
stream network are extracted from the nested catchment using differential
overlay. TopREML uses the BFGS algorithm <xref ref-type="bibr" rid="bib1.bibx53" id="paren.65"/> to
maximize the restricted likelihood, with the option of using a stochastic
optimization algorithm (Simulated Annealing, <xref ref-type="bibr" rid="bib1.bibx3" id="altparen.66"/>)
if a non-differentiable (e.g., spherical) covariance function is selected.</p>
      <p>A resampling analysis was performed on Austrian data set to evaluate the
runtime and predictive performance of each method as a function of the
topological complexity of the considered region (as proxied by the size of
the considered sample of gauges) and the considered semi-variogram model. We
randomly selected one validation gauge, and resampled the remaining gauges
randomly (no repetition) to obtain the chosen sample size. The resampled
gauges were used to estimate summer flow at the validation gauges using
TopREML and Top-kriging, and successively assuming an exponential
(differentiable) and a spherical (non-differentiable) variogram. In each
case, relative error and runtime were recorded. This process was repeated 200
times for each sample size. Results (shown in Fig. <xref ref-type="fig" rid="Ch1.F10"/>) indicate
that the gradient-based optimization algorithm used by TopREML for the
differentiable (i.e., exponential) variogram reduces the computation runtime
by an order of magnitude, relative to the implementation of Top-kriging in
the rtop package. This computational advantage vanishes if a
non-differentiable (i.e., spherical) variogram must be used, which requires
stochastic optimization. The results also indicate that the relative
computational performance of TopREML improves with the number of gauges,
while its predictive performance remains constant and approximately
equivalent to Top-kriging.</p><?xmltex \hack{\clearpage}?><supplementary-material position="anchor"><p><bold>The Supplement related to this article is available online at <inline-supplementary-material xlink:href="http://dx.doi.org/10.5194/hess-19-2925-2015-supplement" xlink:title="zip">doi:10.5194/hess-19-2925-2015-supplement</inline-supplementary-material>.</bold></p></supplementary-material>
</app>
  </app-group><ack><title>Acknowledgements</title><p>The authors would like to thank Michèle Müller, Morgan Levy, David
Dralle, Gabrielle Boisramé and two anonymous reviewers for their helpful
review and comments. Data have been graciously provided by the Department of
Hydrology and Meteorology of Nepal, the HKH-FRIEND project and the
<italic>rtop</italic> package. The Swiss National Science Foundation are gratefully
acknowledged for funding (M. F. Müller). Publication made possible in part by
support from the Berkeley Research Impact Initative (BRII) sponsored by the
UC Berkeley Library.<?xmltex \hack{\newline}?><?xmltex \hack{\newline}?>
Edited by: J. Freer</p></ack><ref-list>
    <title>References</title>

      <ref id="bib1.bibx1"><label>Anders et al.(2006)Anders, Roe, Hallet, Montgomery, Finnegan, and
Putkonen</label><mixed-citation>
Anders, A. M., Roe, G. H., Hallet, B., Montgomery, D. R., Finnegan, N. J.,
and Putkonen, J.: Spatial patterns of precipitation and topography in the
Himalaya, Geol. Soc. Am. Special Papers, 398, 39–53, 2006.</mixed-citation></ref>
      <ref id="bib1.bibx2"><label>Bishop and Church(1992)</label><mixed-citation>
Bishop, G. D. and Church, M. R.: Automated approaches for regional runoff
mapping in the northeastern United States, J. Hydrol., 138,
361–383, 1992.</mixed-citation></ref>
      <ref id="bib1.bibx3"><label>Belisle(1992)</label><mixed-citation>
Belisle, C. J. P.: Convergence theorems for a class of simulated annealing
algorithms on Rd, J. Appl. Probab., 29,
885–895, 1992.</mixed-citation></ref>
      <ref id="bib1.bibx4"><label>Blöschl et al.(2013)Blöschl, Sivapalan, Wagener, Viglione and Savenije</label><mixed-citation>
Blöschl, G., Sivapalan, M., Wagener, T., Viglione, A., and Savenije, H.:
Runoff prediction in ungauged
basins: Synthesis across processes, places and scales, Cambridge University
Press, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx5"><label>Bosch et al.(1999)Bosch, Sheridan, and Davis</label><mixed-citation>
Bosch, D., Sheridan, J., and Davis, F.: Rainfall characteristics and spatial
correlation for the Georgia Coastal Plain, Trans. ASAE, 42,
1637–1644, 1999.</mixed-citation></ref>
      <ref id="bib1.bibx6"><label>Botter et al.(2007)Botter, Porporato, Rodriguez-Iturbe, and
Rinaldo</label><mixed-citation>
Botter, G., Porporato, A., Rodriguez-Iturbe, I., and Rinaldo, A.: Basin-scale
soil moisture dynamics and the probabilistic characterization of carrier
hydrologic flows: Slow, leaching-prone components of the hydrologic response,
Water Resour. Res., 43, W02417, doi:10.1029/2006WR005043, 2007.</mixed-citation></ref>
      <ref id="bib1.bibx7"><label>Castiglioni et al.(2011)Castiglioni, Castellarin, Montanari,
Skøien, Laaha, and Blöschl</label><mixed-citation>Castiglioni, S., Castellarin, A., Montanari, A., Skøien, J. O., Laaha, G.,
and Blöschl, G.: Smooth regional estimation of low-flow indices:
physiographical space based interpolation and top-kriging, Hydrol. Earth
Syst. Sci., 15, 715–727, <ext-link xlink:href="http://dx.doi.org/10.5194/hess-15-715-2011" ext-link-type="DOI">10.5194/hess-15-715-2011</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx8"><label>Chalise et al.(2003)Chalise, Kansakar, Rees, Croker, and
Zaidman</label><mixed-citation>
Chalise, S., Kansakar, S., Rees, G., Croker, K., and Zaidman, M.: Management
of   water resources and low flow estimation for the Himalayan basins of Nepal,
J. Hydrol., 282, 25–35, 2003.</mixed-citation></ref>
      <ref id="bib1.bibx9"><label>Corbeil and Searle(1976)</label><mixed-citation>
Corbeil, R. R. and Searle, S. R.: Restricted maximum likelihood (REML)
estimation of variance components in the mixed model, Technometrics, 18,
31–38, 1976.</mixed-citation></ref>
      <ref id="bib1.bibx10"><label>Cressie(1993)</label><mixed-citation>
Cressie, N.: Statistics for Spatial Data, Wiley, New York, NY, 1993.</mixed-citation></ref>
      <ref id="bib1.bibx11"><label>Cressie et al.(2006)Cressie, Frey, Harch, and
Smith</label><mixed-citation>
Cressie, N., Frey, J., Harch, B., and Smith, M.: Spatial prediction on a
river   network, J. Agr.  Biol. Environ. Stat.,
11, 127–150, 2006.</mixed-citation></ref>
      <ref id="bib1.bibx12"><label>Department of Hydrology and
Meteorology(2011)</label><mixed-citation>
Department of Hydrology and Meteorology: Daily Streamflow and Precipitation
Data,  Kathmandu, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx13"><label>D'Odorico and Rigon(2003)</label><mixed-citation>D'Odorico, P. and Rigon, R.: Hillslope and channel contributions to the
hydrologic response, Water Resour. Res., 39, 1113, <ext-link xlink:href="http://dx.doi.org/10.1029/2002WR001708" ext-link-type="DOI">10.1029/2002WR001708</ext-link>, 2003.</mixed-citation></ref>
      <ref id="bib1.bibx14"><label>Gilmour et al.(1995)Gilmour, Thompson, and Cullis</label><mixed-citation>
Gilmour, A., Thompson, R., and Cullis, B.: Average Information REML: an
efficient algorithm for variance parameter estimation in linear mixed models,
Biometrics, 51, 1440–1450, 1995.</mixed-citation></ref>
      <ref id="bib1.bibx15"><label>Gilmour et al.(2004)Gilmour, Cullis, Welham, Gogel, and
Thompson</label><mixed-citation>
Gilmour, A., Cullis, B., Welham, S., Gogel, B., and Thompson, R.: An
efficient
computing strategy for prediction in mixed linear models, Comput.
Stat. Data Anal., 44, 571–586, 2004.</mixed-citation></ref>
      <ref id="bib1.bibx16"><label>Global Runoff Data Center(2014)</label><mixed-citation>
Global Runoff Data Center: Global Runoff Data Base, Global Runoff Data
Centre. Koblenz, Federal Institute of Hydrology (BfG), 2014.</mixed-citation></ref>
      <ref id="bib1.bibx17"><label>Goovaerts(1999)</label><mixed-citation>
Goovaerts, P.: Geostatistics in soil science: state-of-the-art and
perspectives, Geoderma, 89, 1–45, 1999.</mixed-citation></ref>
      <ref id="bib1.bibx18"><label>Goovaerts(2000)</label><mixed-citation>
Goovaerts, P.: Geostatistical approaches for incorporating elevation into the
spatial interpolation of rainfall, J. Hydrol., 228, 113–129, 2000.</mixed-citation></ref>
      <ref id="bib1.bibx19"><label>Gottschalk(1993)</label><mixed-citation>
Gottschalk, L.: Interpolation of runoff applying objective methods,
Stoch.   Hydrol. Hydraul., 7, 269–281, 1993.</mixed-citation></ref>
      <ref id="bib1.bibx20"><label>Gottschalk et al.(2006)Gottschalk, Krasovskaia, Leblois, Sauquet
et al.</label><mixed-citation>Gottschalk, L., Krasovskaia, I., Leblois, E., and Sauquet, E.: Mapping mean
and variance of runoff in a river basin, Hydrol. Earth Syst. Sci., 10,
469–484, <ext-link xlink:href="http://dx.doi.org/10.5194/hess-10-469-2006" ext-link-type="DOI">10.5194/hess-10-469-2006</ext-link>, 2006.</mixed-citation></ref>
      <ref id="bib1.bibx21"><label>Gupta et al.(2013)Gupta, Blöschl, McDonnell, Savenije, Sivapalan,
Viglione, and Wagener</label><mixed-citation>
Gupta, H., Blöschl, G., McDonnell, J., Savenije, H., Sivapalan, M.,
Viglione, A., and Wagener, T.: Outcomes of Synthesis, in: Runoff Prediction
in Ungauged Basins: Synthesis across Processes, Places and Scales, edited by: Blöschl, G.,
Sivapalan, M., Wagener, T., Viglione, A., and Savenije, H., Cambridge
University Press, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx22"><label>Henderson(1975)</label><mixed-citation>
Henderson, C. R.: Best linear unbiased estimation and prediction under a
selection model, Biometrics, 31, 423–447, 1975.</mixed-citation></ref>
      <ref id="bib1.bibx23"><label>Huang and Yang(1998)</label><mixed-citation>
Huang, W.-C. and Yang, F.-T.: Streamflow estimation using Kriging, Water
Resour. Res., 34, 1599–1608, 1998.</mixed-citation></ref>
      <ref id="bib1.bibx24"><label>Hyd(2004)</label><mixed-citation>
HYDRO1k Elevation Derivative Database, U.S. Geological Survey Earth Resources
Observation and Science (EROS) Center, Sioux Falls, South Dakota, 2004.</mixed-citation></ref>
      <ref id="bib1.bibx25"><label>Laaha et al.(2013)Laaha, Skøien, Nobilis, and
Blöschl</label><mixed-citation>
Laaha, G., Skøien, J. O., Nobilis, F., and Blöschl, G.: Spatial
prediction of stream temperatures using Top-kriging with an external drift,
Environ. Modell. Assessm., 18, 671–683, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx26"><label>Laaha et al.(2014)Laaha, Skøien, and
Blöschl</label><mixed-citation>
Laaha, G., Skøien, J., and Blöschl, G.: Spatial prediction on river
networks: comparison of Top-kriging with regional regression, Hydrol.
Process., 28, 315–324, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx27"><label>Lark et al.(2006)Lark, Cullis, and Welham</label><mixed-citation>
Lark, R., Cullis, B., and Welham, S.: On spatial prediction of soil
properties
in the presence of a spatial trend: the empirical best linear unbiased
predictor (E-BLUP) with REML, Eur. J. Soil Sci., 57, 787–799,
2006.</mixed-citation></ref>
      <ref id="bib1.bibx28"><label>LIS(2011)</label><mixed-citation>
LIS Global lightning Image, NASA EOSDIS Global Hydrology Resource Center
(GHRC)
DAAC, Huntsville, AL, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx29"><label>Mathai(1999)</label><mixed-citation>
Mathai, A. M.: An introduction to geometrical probability: distributional
aspects with applications, Vol. 1, CRC Press, Boca Raton, FL, 1999.</mixed-citation></ref>
      <ref id="bib1.bibx30"><label>Merz and Blöschl(2005)</label><mixed-citation>
Merz, R. and Blöschl, G.: Flood frequency regionalisation – spatial
proximity vs. catchment attributes, J. Hydrol., 302, 283–306,
2005.</mixed-citation></ref>
      <ref id="bib1.bibx31"><label>Müller and Thompson(2013)</label><mixed-citation>
Müller, M. F. and Thompson, S. E.: Bias adjustment of satellite rainfall
data through stochastic modeling: Methods development and application to
Nepal, Adv. Water Resour., 60, 121–134, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx32"><label>Müller et al.(2014)Müller, Dralle, and
Thompson</label><mixed-citation>
Müller, M. F., Dralle, D. N., and Thompson, S. E.: Analytical model for
flow duration curves in seasonally dry climates, Water Resour. Res.,
50, 5510–5531, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx33"><label>Myers(1990)</label><mixed-citation>
Myers, R.: Classical and modern regression with applications,
PWS-KENT Publishing Company, 1990.</mixed-citation></ref>
      <ref id="bib1.bibx34"><label>Olea(1974)</label><mixed-citation>
Olea, R.: Optimal contour mapping using universal kriging, J.
Geophys. Res., 79, 695–702, 1974.</mixed-citation></ref>
      <ref id="bib1.bibx35"><label>Patterson and Thompson(1971)</label><mixed-citation>
Patterson, H. D. and Thompson, R.: Recovery of inter-block information when
block sizes are unequal, Biometrika, 58, 545–554, 1971.</mixed-citation></ref>
      <ref id="bib1.bibx36"><label>Pebesma(2004)</label><mixed-citation>
Pebesma, E. J.: Multivariable geostatistics in S: the gstat package,
Comput.  Geosci., 30, 683–691, 2004.</mixed-citation></ref>
      <ref id="bib1.bibx37"><label>R Core Team(2008)</label><mixed-citation>
R Core Team: R: A language and environment for statistical computing, R
Foundation Statistical Computing, 2008.</mixed-citation></ref>
      <ref id="bib1.bibx38"><label>Sauquet(2006)</label><mixed-citation>
Sauquet, E.: Mapping mean annual river discharges: geostatistical
developments   for incorporating river network dependencies, J. Hydrol., 331,
300–314, 2006.</mixed-citation></ref>
      <ref id="bib1.bibx39"><label>Sauquet et al.(2000)Sauquet, Gottschalk, and
Leblois</label><mixed-citation>
Sauquet, E., Gottschalk, L., and Leblois, E.: Mapping average annual runoff:
a
hierarchical approach applying a stochastic interpolation scheme,
Hydrol. Sci. J., 45, 799–815, 2000.</mixed-citation></ref>
      <ref id="bib1.bibx40"><label>Schaefli et al.(2013)Schaefli, Rinaldo, and
Botter</label><mixed-citation>
Schaefli, B., Rinaldo, A., and Botter, G.: Analytic probability distributions
for snow-dominated streamflow, Water Resour. Res., 49, 2701–2713,  2013.</mixed-citation></ref>
      <ref id="bib1.bibx41"><label>Sivapalan et al.(2014)Sivapalan, Konar, Srinivasan, Chhatre, Wutich,
Scott, Wescoat, and Rodríguez-Iturbe</label><mixed-citation>
Sivapalan, M., Konar, M., Srinivasan, V., Chhatre, A., Wutich, A., Scott, C.,
Wescoat, J., and Rodríguez-Iturbe, I.: Socio-hydrology: Use-inspired
water sustainability science for the Anthropocene, Earth's Future, 2,
225–230, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx42"><label>Skøien et al.(2006)Skøien, Merz, and
Blöschl</label><mixed-citation>Skøien, J. O., Merz, R., and Blöschl, G.: Top-kriging – geostatistics on
stream networks, Hydrol. Earth Syst. Sci., 10, 277–287,
<ext-link xlink:href="http://dx.doi.org/10.5194/hess-10-277-2006" ext-link-type="DOI">10.5194/hess-10-277-2006</ext-link>, 2006.</mixed-citation></ref>
      <ref id="bib1.bibx43"><label>Skøien et al.(2014)Skøien, Pebesma, and
Blöschl</label><mixed-citation>Skøien, J., Blöschl, G., Laaha, G., Pebesma, E., Parajka, J. and
Viglione, A.: <italic>rtop</italic>: An R package for interpolation of data with a
variable spatial support, with an example from river networks, Comput.
Geosci., 67, 180–190, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx44"><label>Smith et al.(2005)Smith, Gasiewski, Jackson, and
Wick</label><mixed-citation>Smith, D. F., Gasiewski, A. J., Jackson, D. L., and Wick, G. A.: Spatial
scales   of tropical precipitation inferred from TRMM microwave imager data,
Geoscience and Remote Sensing, IEEE Trans., 43, 1542–1551, 2005.
 </mixed-citation></ref><?xmltex \hack{\newpage}?>
      <ref id="bib1.bibx45"><label>Srinivasan et al.(2014)Srinivasan, Thompson, Madhyastha, Penny,
Jeremiah and Lele</label><mixed-citation>Srinivasan, V., Thompson, S., Madhyastha,
K., Penny, G., Jeremiah, K., and Lele, S.: Why is the Arkavathy River drying?
A multiple-hypothesis approach in a data-scarce region, Hydrol. Earth Syst.
Sci., 19, 1905–1917, <ext-link xlink:href="http://dx.doi.org/10.5194/hess-19-1905-2015" ext-link-type="DOI">10.5194/hess-19-1905-2015</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx46"><label>Stokstad(1999)</label><mixed-citation>
Stokstad, E.: Scarcity of rain, stream gages threatens forecasts, Science,
285,   1199–1200, 1999.</mixed-citation></ref>
      <ref id="bib1.bibx47"><label>Ver Hoef and Peterson(2010)</label><mixed-citation>
Ver Hoef, J. M. and Peterson, E. E.: A moving average approach for spatial
statistical models of stream networks, J. Am. Stat.
Assoc., 105, 6–18, 2010.</mixed-citation></ref>
      <ref id="bib1.bibx48"><label>Ver Hoef et al.(2006)Ver Hoef, Peterson, and
Theobald</label><mixed-citation>
Ver Hoef, J. M., Peterson, E., and Theobald, D.: Spatial statistical models
that use flow and stream distance, Environ. Ecol. Stat.,
13, 449–464, 2006.</mixed-citation></ref>
      <ref id="bib1.bibx49"><label>Viglione et al.(2013)Viglione, Parajka, Rogger, Salinas, Laaha,
Sivapalan, and Blöschl</label><mixed-citation>Viglione, A., Parajka, J., Rogger, M., Salinas, J. L., Laaha, G., Sivapalan, M.,
and Blöschl, G.: Comparative assessment of predictions in ungauged basins – Part 3:
Runoff signatures in Austria, Hydrol. Earth Syst. Sci., 17, 2263–2279, <ext-link xlink:href="http://dx.doi.org/10.5194/hess-17-2263-2013" ext-link-type="DOI">10.5194/hess-17-2263-2013</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx50"><label>Wot(2003)</label><mixed-citation>
Watersheds of the World, IUCN, IWMI, Rasmar Convention Bureau and WRI World
Resources Institute, Washington, DC, 2003.</mixed-citation></ref>
      <ref id="bib1.bibx51"><label>Wittenberg and Sivapalan(1999)</label><mixed-citation>
Wittenberg, H. and Sivapalan, M.: Watershed groundwater balance estimation
using streamflow recession analysis and baseflow separation, J.
Hydrol., 219, 20–33, 1999.</mixed-citation></ref>
      <ref id="bib1.bibx52"><label>WCM(2000)</label><mixed-citation>
World Conservation Monitoring Centre (UNEP-WCMC): Mountains and Forests in
Mountains, UNEP-WCMC, Cambridge, 2000.</mixed-citation></ref>
      <ref id="bib1.bibx53"><label>Wright and Nocedal(1999)</label><mixed-citation>
Wright, S. and Nocedal, J.: Numerical optimization, Vol. 2, Springer New
York, NY, 1999.</mixed-citation></ref>
      <ref id="bib1.bibx54"><label>Xu et al.(2014)Xu, Croke, and Hutchinson</label><mixed-citation>
Xu, T., Croke, B., and Hutchinson, M. F.: Identification of spatial and
temporal patterns of Australian daily rainfall under a changing climate, in:
Proceedings of the 7th International Congress on Environmental Modelling and
Software, edited by: Ames, D. P., Quinn, N. W. T., and Rizzoli, A. E., 2014.</mixed-citation></ref>

  </ref-list><app-group content-type="float"><app><title/>

    </app></app-group></back>
    </article>
