<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" xml:lang="en" dtd-version="3.0">
  <front>
    <journal-meta><journal-id journal-id-type="publisher">HESS</journal-id><journal-title-group>
    <journal-title>Hydrology and Earth System Sciences</journal-title>
    <abbrev-journal-title abbrev-type="publisher">HESS</abbrev-journal-title><abbrev-journal-title abbrev-type="nlm-ta">Hydrol. Earth Syst. Sci.</abbrev-journal-title>
  </journal-title-group><issn pub-type="epub">1607-7938</issn><publisher>
    <publisher-name>Copernicus Publications</publisher-name>
    <publisher-loc>Göttingen, Germany</publisher-loc>
  </publisher></journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.5194/hess-24-4109-2020</article-id><title-group><article-title>Estimation of annual runoff by exploiting long-term spatial patterns and short records within a geostatistical framework</article-title><alt-title>Geostatistical model for exploiting short records of runoff</alt-title>
      </title-group><?xmltex \runningtitle{Geostatistical model for exploiting short records of runoff}?><?xmltex \runningauthor{T. Roksvåg et al.}?>
      <contrib-group>
        <contrib contrib-type="author" corresp="yes" rid="aff1">
          <name><surname>Roksvåg</surname><given-names>Thea</given-names></name>
          <email>thea.roksvag@ntnu.no</email>
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Steinsland</surname><given-names>Ingelin</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff2">
          <name><surname>Engeland</surname><given-names>Kolbjørn</given-names></name>
          
        </contrib>
        <aff id="aff1"><label>1</label><institution>Norwegian University of Science and Technology, NTNU, Department of Mathematical Sciences, Trondheim, Norway</institution>
        </aff>
        <aff id="aff2"><label>2</label><institution>The Norwegian Water Resources and Energy Directorate, NVE, Oslo, Norway</institution>
        </aff>
      </contrib-group>
      <author-notes><corresp id="corr1">Thea Roksvåg (thea.roksvag@ntnu.no)</corresp></author-notes><pub-date><day>21</day><month>August</month><year>2020</year></pub-date>
      
      <volume>24</volume>
      <issue>8</issue>
      <fpage>4109</fpage><lpage>4133</lpage>
      <history>
        <date date-type="received"><day>8</day><month>August</month><year>2019</year></date>
           <date date-type="rev-request"><day>24</day><month>September</month><year>2019</year></date>
           <date date-type="rev-recd"><day>29</day><month>June</month><year>2020</year></date>
           <date date-type="accepted"><day>15</day><month>July</month><year>2020</year></date>
      </history>
      <permissions>
        <copyright-statement>Copyright: © 2020 Thea Roksvåg et al.</copyright-statement>
        <copyright-year>2020</copyright-year>
      <license license-type="open-access"><license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p></license></permissions><self-uri xlink:href="https://hess.copernicus.org/articles/24/4109/2020/hess-24-4109-2020.html">This article is available from https://hess.copernicus.org/articles/24/4109/2020/hess-24-4109-2020.html</self-uri><self-uri xlink:href="https://hess.copernicus.org/articles/24/4109/2020/hess-24-4109-2020.pdf">The full text article is available as a PDF file from https://hess.copernicus.org/articles/24/4109/2020/hess-24-4109-2020.pdf</self-uri>
      <abstract><title>Abstract</title>
    <p id="d1e104">In this article, we present a Bayesian geostatistical framework that is particularly suitable for interpolation of hydrological data when the available dataset is sparse and includes both long and short records of runoff. A key feature of the proposed framework is that several years of runoff are modelled simultaneously with two spatial fields: one that is common for all years under study that represents the runoff generation due to long-term (climatic) conditions and one that is year-specific. The climatic spatial field captures how short records of runoff from partially gauged catchments vary relative to longer time series from other catchments, and transfers this information across years. To make the Bayesian model computationally feasible and fast, we use integrated nested Laplace approximations (INLAs) and the stochastic partial differential equation (SPDE) approach to spatial modelling.</p>
    <p id="d1e107">The geostatistical framework is demonstrated by filling in missing values of annual runoff and by predicting mean annual runoff for around 200 catchments in Norway. The predictive performance is compared to top-kriging (interpolation method) and simple linear regression (record augmentation method). The results show that if the runoff is driven by processes that are repeated over time (e.g. orographic precipitation patterns), the value of including short records in the suggested model is large. For partially gauged catchments the suggested framework performs better than comparable methods, and one annual observation from the target catchment can lead to a 50 % reduction in root mean squared error (RMSE) compared to when no observations are available from the target catchment. We also find that short records safely can be included in the framework regardless of the spatial characteristics of the underlying climate, and down to record lengths of 1 year.</p>
  </abstract>
    </article-meta>
  </front>
<body>
      

<sec id="Ch1.S1" sec-type="intro">
  <label>1</label><title>Introduction</title>
      <p id="d1e119">Characteristic values for streamflow are used for various purposes in water resource management. High-flow indices or design flood estimates are needed for flood risk assessments and design of infrastructure and dams, low-flow indices are needed for assessment of environmental flow and reliability assessment of water supply, while mean annual flow is an important basis for water resource management and a key for design of water supply systems and allocation of water resources between stakeholders. Mean annual flow can also be used as a predictor for low-flow and high-flow indices <xref ref-type="bibr" rid="bib1.bibx48 bib1.bibx10" id="paren.1"/>.</p>
      <p id="d1e125">At locations with measurements, the streamflow indices can be estimated based on observations. However, streamflow is only measured at a limited number of locations, and in many applications we need to predict the streamflow indices at ungauged locations. This is a central problem in hydrology and known as the Prediction in Ungauged Basins problem <xref ref-type="bibr" rid="bib1.bibx4" id="paren.2"/>. Often it is of interest to estimate flow indices that represent the long-term average behaviour in a catchment. If this is the case, using only a few years of data from the target catchment might lead to biased estimates. The reason is climate variability over short timescales combined with sample uncertainty. Often a minimum record length is recommended for estimation of long-term indices, but a substantial part of the available streamflow gauges in the world have too short records to provide reliable estimates. These<?pagebreak page4110?> short data series can, however, provide useful information if they are used together with longer time series from other catchments <xref ref-type="bibr" rid="bib1.bibx28" id="paren.3"/>. Motivated by this, we propose a framework for runoff interpolation particularly suitable for datasets including data series of this type, more specifically runoff datasets including a mix of fully gauged catchments (with data available from the whole study period) and partially gauged catchments (with data available from a subset of the study period). We suggest a framework for runoff interpolation that unifies two commonly used statistical approaches for runoff estimation: geostatistical approaches and approaches for exploiting short records of data.</p>
      <p id="d1e134">Within the geostatistical framework, Gaussian random fields (GRFs) are often used to model hydrological phenomena that are continuous in space and/or time. The hydrological variable of interest is a GRF if a vector containing a random sample of length <inline-formula><mml:math id="M1" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> from the process follows a Gaussian distribution with mean vector <inline-formula><mml:math id="M2" display="inline"><mml:mi mathvariant="bold-italic">μ</mml:mi></mml:math></inline-formula> and covariance matrix <inline-formula><mml:math id="M3" display="inline"><mml:mi mathvariant="bold">Σ</mml:mi></mml:math></inline-formula> <xref ref-type="bibr" rid="bib1.bibx7" id="paren.4"/>. The elements in the covariance matrix are typically determined by a covariance function that is specified based on the pairwise distances between the <inline-formula><mml:math id="M4" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> target locations. For most environmental variables it is straightforward to compute these distances. However, for runoff-related variables the measure of distance is ambiguous because the observations are related to catchment areas, some of them nested, and not to point locations in space. Traditionally, this challenge has been solved by simply interpreting runoff as a point-referenced process linked to the catchment centroids or stream outlets (see e.g. <xref ref-type="bibr" rid="bib1.bibx32 bib1.bibx46 bib1.bibx1" id="altparen.5"/>). The problem with these methods is that they can lead to a violation of basic conservation laws, and several alternatives approaches are suggested for making an interpolation scheme that takes the nested structure of catchments into account <xref ref-type="bibr" rid="bib1.bibx40 bib1.bibx19 bib1.bibx43" id="paren.6"/>. In particular, the top-kriging approach suggested by <xref ref-type="bibr" rid="bib1.bibx43" id="text.7"/> has shown promising results for interpolation of hydrological variables <xref ref-type="bibr" rid="bib1.bibx49" id="paren.8"/>. In the top-kriging approach, information from a subcatchment is weighted more than information from a nearby non-overlapping catchment when performing runoff predictions for an ungauged catchment.</p>
      <p id="d1e181">In the literature, there exist several techniques to exploit short records of runoff, and these are known as record augmentation techniques. The first step in a record augmentation procedure is often to find one or several donor catchments with longer time series of runoff. The donor catchments are typically selected based on runoff correlation, catchment similarity, or proximity in space. By applying e.g. linear regression approaches and/or computing the correlation between time series, a relationship between the target catchment and the donor catchments is developed. Next, the longer time series from the donor catchment(s) are used to perform predictions for the target catchments for years/months/days without measurements (see e.g. <xref ref-type="bibr" rid="bib1.bibx13" id="altparen.9"/>, <xref ref-type="bibr" rid="bib1.bibx23" id="altparen.10"/>, <xref ref-type="bibr" rid="bib1.bibx31" id="altparen.11"/>, <xref ref-type="bibr" rid="bib1.bibx50" id="altparen.12"/> or <xref ref-type="bibr" rid="bib1.bibx28" id="altparen.13"/>). The regression and/or correlation analysis is performed based on runoff observations that are of the same type as the target flow index; i.e. for annual runoff, short records of annual runoff are used <xref ref-type="bibr" rid="bib1.bibx4" id="paren.14"/>.</p>
      <p id="d1e204">In this paper, we suggest a geostatistical Bayesian framework that represents a new way of exploiting short records of data. The framework is constructed to exploit long-term spatial patterns stored in sparse datasets, i.e. hydrological datasets with several missing values. A key feature of the suggested framework is that it simultaneously models several years of runoff. This is done by using two statistical spatial components or GRFs in the hydrological model: the first GRF is common for all years under study and models the long-term spatial variability of runoff. We denote this the climatic GRF as it represents the spatial variability over time, or what we refer to as the climate in the study area. In this context the term <italic>climate</italic> also includes the runoff generation due to catchment characteristics that are static, like elevation and slope. The other GRF is year-specific and models the annual discrepancy from the climate, and we denote this the annual or year-specific GRF. If we have a study area for which the spatial variability of runoff is stable over time, the climatic GRF will capture this tendency. Hence, it will also capture how short records of runoff vary relative to longer data series from other catchments. On the other hand, if there are no strong long-term trends present in the data, the year-specific GRF will dominate over the climatic GRF. For this scenario, short records from the target catchment(s) will have less impact on the final results. By adjusting the two spatial fields relative to each other, our method represents a way for detecting long-term trends and uses this to exploit short records in the runoff interpolation.</p>
      <p id="d1e210">The framework we suggest is flexible and can be used for any hydrological variable. However, its benefits are linked to exploiting long-term spatial trends in the data, and in order to work better than other interpolation methods, the hydrological variable of interest should be driven by processes that are repeated over time. For this reason, we develop our methodology for <italic>annual</italic> runoff. This is a flow index that often has a prominent spatial pattern over years, for example due to orographic precipitation and topography that creates weather divides. To describe study areas and/or variables like this, we hereby introduce the terms <italic>hydrologically spatially stable</italic> and <italic>hydrological spatial stability</italic>. For hydrologically spatially stable areas, the difference in runoff between two locations for a given year is close to the difference in runoff between these two locations any other year. Be aware that a hydrologically spatially stable area can both have large differences in annual runoff between two close locations and have large variability in annual runoff over years for a given location. The key property is that the underlying spatial pattern is preserved over time.</p>
      <p id="d1e222">While annual runoff represents a hydrologically spatially stable variable for many countries, the spatial pattern for monthly runoff is typically less stable. This is due to local<?pagebreak page4111?> weather patterns and the variability in the seasonality of snow accumulation and snowmelt. To demonstrate our methodology for a variable with less hydrological spatial stability, we therefore fit the framework to annual time series of monthly runoff. These predictions allow us to discuss how the approach might work in different regions.</p>
      <p id="d1e225">In the following presentation, we introduce two versions of our framework, i.e. two geostatistical models. The first model we propose is denoted the areal model and is particularly suitable for mass-conserved hydrological variables. It ensures that the water balance is preserved for the predicted runoff for any point in the landscape and defines the average runoff in a catchment as the average point runoff integrated over (nested) catchment areas. This way, the nested structure of catchments is taken into account, and the interpretation of covariance between two catchments is similar to the one of top-kriging. The areal model for annual runoff is already presented in <xref ref-type="bibr" rid="bib1.bibx37" id="text.15"/>, where its mass-conserving properties were demonstrated through an example from Voss in western Norway. The model's ability to exploit short data records was also indicated in <xref ref-type="bibr" rid="bib1.bibx37" id="text.16"/>, but the property was not tested for a larger dataset or compared to any existing methods. This is a key contribution of this article.</p>
      <p id="d1e234">As an alternative to the areal model, we also propose a model that defines runoff as a point-referenced process for which distances are measured between the catchment centroids. This model does not consider preservation of water balance, but on the other hand it can be used for any point-referenced environmental variable, and it is computationally faster than the areal model. This model is more similar to models that have been used traditionally in hydrology, and we denote this the centroid model. Both the areal model and the centroid model have the ability to exploit hydrological spatial stability but have different benefits, drawbacks and hence also areas of use. These are discussed and highlighted throughout the article.</p>
      <p id="d1e237">The main objective of this work is to present and evaluate the new geostatistical framework for exploiting short records and to compare its performance to top-kriging (interpolation method) and simple linear regression (record augmentation technique). In particular our goals are to
<list list-type="order"><list-item>
      <p id="d1e242">assess the two spatial models' ability to fill in missing annual observations of runoff for ungauged and partially gauged catchments; and<?xmltex \hack{\\}?></p></list-item><list-item>
      <p id="d1e247">assess the two spatial models' ability to predict <italic>mean</italic> annual runoff for a longer time period for catchments with varying record lengths.</p></list-item></list>
Through  (1) and (2) we also aim to
<list list-type="custom"><list-item><label>3.</label>
      <p id="d1e256">demonstrate the potential added value of including short records in the modelling, compared to when not using them or compared to when using traditional methods.</p></list-item></list></p>
      <p id="d1e260">The framework is evaluated by using annual and monthly runoff data from catchments in Norway. This dataset is presented in the section that follows (Sect. <xref ref-type="sec" rid="Ch1.S2"/>). Next, in Sect .<xref ref-type="sec" rid="Ch1.S3"/>, we briefly introduce relevant statistical background theory and notation. In Sect. <xref ref-type="sec" rid="Ch1.S4"/> the suggested model for annual runoff is presented, before evaluation scores and experimental set-up are presented in Sect. <xref ref-type="sec" rid="Ch1.S5"/>. Here, we have one experimental set-up for annual predictions (Sect. <xref ref-type="sec" rid="Ch1.S5.SS1"/>) and one set-up for mean annual predictions (Sect. <xref ref-type="sec" rid="Ch1.S5.SS2"/>). In Sect. <xref ref-type="sec" rid="Ch1.S6"/>, the results are presented before they are discussed in Sect. <xref ref-type="sec" rid="Ch1.S7"/>. Finally, we conclude in Sect. <xref ref-type="sec" rid="Ch1.S8"/>.</p>
</sec>
<sec id="Ch1.S2">
  <label>2</label><title>Study area</title>
      <p id="d1e290">The study is carried out by using a dataset from Norway provided by the Norwegian Water Resources and Energy Directorate (NVE). It originally consisted of <italic>daily</italic> runoff data from 1981 to 2010. To make the data suitable for an analysis, a data preparation procedure was performed to construct datasets for two purposes: for assessing the framework's ability to fill in missing annual data and for assessing the framework's ability to predict mean annual runoff.</p>
      <p id="d1e296">To make a cross-validation dataset for the experiments related to infill of missing annual data, the daily runoff data were aggregated to annual runoff data for hydrological years that start on 1 September and end on 31 August. We chose to consider a study period from 1996 to 2005: for this period we had the maximum number of fully gauged catchments, i.e. 180 catchments. These 180 fully gauged catchments have areas ranging from 13 to 15 500 <inline-formula><mml:math id="M5" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="normal">km</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> and median elevations from 85 to 1562 m a.s.l. Among these, none were significantly influenced by human activities in the time period of interest. Regulated catchments were removed from the original dataset.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F1" specific-use="star"><?xmltex \currentcnt{1}?><label>Figure 1</label><caption><p id="d1e312">Mean annual runoff (1996–2005) from 180 fully gauged catchments in Norway <bold>(a)</bold> and annual runoff observations from all 180 catchments and years <bold>(b)</bold>. These data are used to evaluate the framework's ability to fill in missing values for individual years. Thirty percent of the involved catchments are nested and most of these are located in southern Norway as visualized in <bold>(c)</bold>. In <bold>(c)</bold>, coloured catchments are subcatchments of at least one larger catchment, while the black catchments are not subcatchments of any larger catchments (but might contain one or two smaller catchments). In the visualization in <bold>(a)</bold>, subcatchments are plotted on top of larger catchments, and this is done throughout the article. The coordinate system used is EUREF89 – UTM33N (EPSG 25833). See Fig. <xref ref-type="fig" rid="Ch1.F7"/> for a closer image of the observed mean annual runoff in southern Norway (1996–2005).</p></caption>
        <?xmltex \igopts{width=455.244094pt}?><graphic xlink:href="https://hess.copernicus.org/articles/24/4109/2020/hess-24-4109-2020-f01.png"/>

      </fig>

      <p id="d1e340">Figure <xref ref-type="fig" rid="Ch1.F1"/>a and b show two visualizations of the annual data from the 180 Norwegian target catchments. We see a large spatial variability of runoff. The annual runoff (for individual years) ranges from 170 to 5050 mm yr<inline-formula><mml:math id="M6" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>, whereas the mean annual runoff ranges from 350 to 4230 mm yr<inline-formula><mml:math id="M7" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>, with the highest values of runoff in western Norway and more moderate values in the east and north. In total 53 of the 180 catchments were nested with at least one other catchment; i.e. the degree of nestedness is 30 %. Most of these are located in southern Norway, and the nested structure here is shown in Fig. <xref ref-type="fig" rid="Ch1.F1"/>c. The remaining 127 catchments did not overlap with any other catchment.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F2" specific-use="star"><?xmltex \currentcnt{2}?><label>Figure 2</label><caption><p id="d1e373">Seven catchments located in western Norway <bold>(a)</bold> and their annual runoff <bold>(b)</bold>. The seven time series in <bold>(b)</bold> are almost parallel (and barely cross), indicating that most of the spatial variability can be explained by long-term spatial patterns. This represents a good example of what we mean by hydrological spatial stability. </p></caption>
        <?xmltex \igopts{width=426.791339pt}?><graphic xlink:href="https://hess.copernicus.org/articles/24/4109/2020/hess-24-4109-2020-f02.png"/>

      </fig>

      <?pagebreak page4112?><p id="d1e391">In the Norwegian annual data in Fig. <xref ref-type="fig" rid="Ch1.F1"/>a we see an east–west pattern of runoff. This is mainly caused by orographic enhancement of frontal precipitation formed around extratropical cyclones. The orographic enhancement is driven by the steep mountains in western Norway that create a topographic barrier for the western wind belt, which transports moist air across the North Atlantic <xref ref-type="bibr" rid="bib1.bibx47" id="paren.17"/>. Due to the orographic enhancement, the maximum precipitation is observed at distances 30–70 km from the coast <xref ref-type="bibr" rid="bib1.bibx16" id="paren.18"/> and not necessarily at the highest elevations since the air dries out due to precipitation. The topography results in a spatial pattern of runoff that is stable over years, which means that there exist long-term spatial patterns in the data that can be exploited.</p>
      <p id="d1e402">Figure <xref ref-type="fig" rid="Ch1.F2"/> shows time series of annual runoff from seven catchments in the south-western part of the country. We see a year-to-year variation for all catchments that is quite large. However, the seven time series are almost parallel (and almost never cross), indicating that the difference in annual runoff between stations is approximately constant over time. Hence, this is a good example of what we mean by hydrological spatial stability. The tendency we see in Fig. <xref ref-type="fig" rid="Ch1.F2"/> is typical for the annual runoff in many of the areas in Norway.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F3" specific-use="star"><?xmltex \currentcnt{3}?><label>Figure 3</label><caption><p id="d1e411">Annual series of monthly runoff for January, April and June for the 7 catchments in Fig. <xref ref-type="fig" rid="Ch1.F2"/>a. The time series for January and April are less parallel compared to the time series for June and to the annual runoff (Fig. <xref ref-type="fig" rid="Ch1.F2"/>b). This suggests that the datasets from January and April represent a more hydrologically spatially unstable setting.</p></caption>
        <?xmltex \igopts{width=497.923228pt}?><graphic xlink:href="https://hess.copernicus.org/articles/24/4109/2020/hess-24-4109-2020-f04.png"/>

      </fig>

      <p id="d1e425">To illustrate the framework's properties for study areas and/or variables that are driven by more unstable weather patterns or hydrological processes, we also aggregated the daily runoff data to monthly runoff for the 180 catchments in Fig. <xref ref-type="fig" rid="Ch1.F1"/>a. From this we made annual time series of monthly runoff for 1996–2005 for 3 months: a winter month dominated by snow accumulation (January), a spring month with snowmelt (April) and a summer month dominated by rain (June). The annual observations of monthly runoff for the selected months are presented in Fig. <xref ref-type="fig" rid="Ch1.F4"/>, and we see that January has the lowest average runoff, whereas June has the highest. The variation in average monthly runoff describes a<?pagebreak page4113?> runoff regime, and in Norway the combination of snow accumulation, snowmelt, and evapotranspiration processes controls this regime <xref ref-type="bibr" rid="bib1.bibx20" id="paren.19"/>. Along the western coast, the winter weather is typically rainy with temperatures above the freezing point. In these regions the highest monthly runoff is observed in October–December. The colder areas are found in the interior of the country with winters dominated by snow accumulation. In these regions the highest monthly runoff is observed for the snowmelt season (May–June).</p>
      <p id="d1e435">Annual time series of monthly runoff from the seven selected catchments from Fig. <xref ref-type="fig" rid="Ch1.F2"/>a are shown in Fig. <xref ref-type="fig" rid="Ch1.F3"/>. We see that the spatial pattern is less stable on a monthly scale compared to the annual scale, particularly for January: the difference in monthly runoff between stations over time is not approximately constant for January, and the runoff in January hence represents a more hydrologically spatially unstable variable in Norway. For June, however, the hydrological spatial stability is higher.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F4" specific-use="star"><?xmltex \currentcnt{4}?><label>Figure 4</label><caption><p id="d1e444">Monthly runoff data (1996–2005) from 180 catchments in Norway for January, April and June. These are used to evaluate the framework's ability to fill in missing values for hydrological variables and/or study areas that are driven by more unstable weather patterns.</p></caption>
        <?xmltex \igopts{width=369.885827pt}?><graphic xlink:href="https://hess.copernicus.org/articles/24/4109/2020/hess-24-4109-2020-f03.png"/>

      </fig>

      <p id="d1e453">The cross-validation datasets described so far are used to assess the framework's ability to fill in missing annual observations for a 10-year period and to illustrate how the models behave for different hydrological settings. In addition, we also evaluate the framework's ability to predict <italic>mean</italic> annual runoff, which is a key hydrological signature. This is done for a 30-year period from 1981 to 2010. As we consider a longer time period for this assessment, a different subset of the original dataset was used: more specifically, annual data from 260 catchments located in southern Norway. These are shown in Fig. <xref ref-type="fig" rid="Ch1.F5"/>. Each of the 260 catchments in Fig. <xref ref-type="fig" rid="Ch1.F5"/> have at least one observation of annual runoff between 1981 and 2010, but only 83 of them are fully gauged in the time period of interest (i.e. have annual observations for all 30 years). Among the partially gauged catchments, the mean record length is 15, while the median record length is 13. Furthermore, 20 of the involved catchments only have one, two or three annual observations. As for the previously described datasets, we removed regulated catchments that were significantly influenced by human activity. Also note that we in this experiment only consider catchments from southern Norway. This is done to reduce the computational complexity of fitting 30 years of runoff simultaneously in a cross-validation setting.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F5"><?xmltex \currentcnt{5}?><label>Figure 5</label><caption><p id="d1e465">Mean annual runoff for 1981–2010 for 260 catchments in southern Norway, where only 83 of them are fully gauged (i.e. have annual data for each year in the study period). The 83 fully gauged catchments have black borders in the above plot. In addition, there are data available from 177 so-called partially gauged catchments. These have at least one annual observation between 1981 and 2010 and are visible as catchments without borders in the above figure. Among the 83 fully gauged catchments, 44 catchments are nested (53 %), while 39 catchments do not overlap with any other catchment in the dataset. Data from the catchments in this figure are used to evaluate the framework's ability to estimate <italic>mean annual runoff</italic>.</p></caption>
        <?xmltex \igopts{width=142.26378pt}?><graphic xlink:href="https://hess.copernicus.org/articles/24/4109/2020/hess-24-4109-2020-f05.png"/>

      </fig>

      <p id="d1e478">When using the data in Fig. <xref ref-type="fig" rid="Ch1.F5"/> to predict mean annual runoff, we do predictions by cross-validation for the 83 fully gauged catchments. However, data from both partially gauged and fully gauged catchments are included in the observation sample (see Sect. <xref ref-type="sec" rid="Ch1.S5.SS2"/>). For the 83 fully gauged catchments in Fig. <xref ref-type="fig" rid="Ch1.F5"/>, 53 % of the catchments were nested with a fully gauged or partially gauged catchment.</p>
</sec>
<sec id="Ch1.S3">
  <label>3</label><title>Statistical methodology</title>
      <p id="d1e495">In Sect. <xref ref-type="sec" rid="Ch1.S4"/> we present two Bayesian geostatistical models for runoff interpolation particularly suitable for sparse datasets containing several missing values. First, some statistical background is necessary.</p>
<sec id="Ch1.S3.SS1">
  <label>3.1</label><title>Bayesian statistics and hierarchal modelling</title>
      <p id="d1e507">The goal in hydrology is to learn about processes related to hydrological variables like daily rainfall, annual runoff or the 5th percentile flow. To gain knowledge about the different hydrological processes, relevant data are collected. There are always uncertainties related to the data that must be accounted for in an analysis, and which make a statistical analysis appropriate.</p>
      <?pagebreak page4114?><p id="d1e510">Assuming <inline-formula><mml:math id="M8" display="inline"><mml:mrow><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>=</mml:mo><mml:mo>(</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:msub><mml:mi>x</mml:mi><mml:mi>n</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is a vector consisting of hydrological variables of interest, e.g. the annual runoff at several locations for a specific year, the observation likelihood <inline-formula><mml:math id="M9" display="inline"><mml:mrow><mml:mi mathvariant="italic">π</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo>|</mml:mo><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> expresses how the data <inline-formula><mml:math id="M10" display="inline"><mml:mrow><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo>=</mml:mo><mml:mo>(</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>m</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> are connected to the truth <inline-formula><mml:math id="M11" display="inline"><mml:mi mathvariant="bold-italic">x</mml:mi></mml:math></inline-formula>. In the classical frequentist statistical approach, the variables in <inline-formula><mml:math id="M12" display="inline"><mml:mi mathvariant="bold-italic">x</mml:mi></mml:math></inline-formula> are considered unknown but fixed. In the Bayesian approach, however, <inline-formula><mml:math id="M13" display="inline"><mml:mi mathvariant="bold-italic">x</mml:mi></mml:math></inline-formula> is considered to be a quantity whose variation can be described by a probability distribution (see e.g. <xref ref-type="bibr" rid="bib1.bibx6" id="altparen.20"/>). Prior to the analysis, this probability distribution is expressed through what is called a prior distribution <inline-formula><mml:math id="M14" display="inline"><mml:mrow><mml:mi mathvariant="italic">π</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>. This is constructed based on expert knowledge about the variable(s) of interest. The goal of the Bayesian analysis is to update the prior distribution by using data. Through Bayes' formula, the so-called posterior distribution of <inline-formula><mml:math id="M15" display="inline"><mml:mi mathvariant="bold-italic">x</mml:mi></mml:math></inline-formula> is obtained:
            <disp-formula id="Ch1.E1" content-type="numbered"><label>1</label><mml:math id="M16" display="block"><mml:mrow><mml:mi mathvariant="italic">π</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>|</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mi mathvariant="italic">π</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>)</mml:mo><mml:mi mathvariant="italic">π</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo>|</mml:mo><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mi mathvariant="italic">π</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>∝</mml:mo><mml:mi mathvariant="italic">π</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>)</mml:mo><mml:mi mathvariant="italic">π</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo>|</mml:mo><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>)</mml:mo><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula></p>
      <p id="d1e711">Next, the marginal distribution <inline-formula><mml:math id="M17" display="inline"><mml:mrow><mml:mi mathvariant="italic">π</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>|</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> for <inline-formula><mml:math id="M18" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>∈</mml:mo><mml:mi mathvariant="bold-italic">x</mml:mi></mml:mrow></mml:math></inline-formula> can be integrated out, and a prediction of <inline-formula><mml:math id="M19" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> can be summarized through e.g. the mean, median or the mode of the posterior distribution <inline-formula><mml:math id="M20" display="inline"><mml:mrow><mml:mi mathvariant="italic">π</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>|</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>.</p>
      <p id="d1e782">If a complex process is under study, it is sometimes easier to model it by thinking of its mechanisms in a hierarchy of underlying processes or distributions <xref ref-type="bibr" rid="bib1.bibx3" id="paren.21"/>. The annual runoff <inline-formula><mml:math id="M21" display="inline"><mml:mi mathvariant="bold-italic">x</mml:mi></mml:math></inline-formula> can e.g. be thought of as a process that depends on some parameters <inline-formula><mml:math id="M22" display="inline"><mml:mi mathvariant="bold-italic">θ</mml:mi></mml:math></inline-formula> that express the spatial correlation between locations. Here, both <inline-formula><mml:math id="M23" display="inline"><mml:mi mathvariant="bold-italic">x</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M24" display="inline"><mml:mi mathvariant="bold-italic">θ</mml:mi></mml:math></inline-formula> are stochastic variables with prior (and posterior) distributions. A Bayesian model of this type is typically expressed as a three-stage hierarchical model where the first stage consists of the observation likelihood <inline-formula><mml:math id="M25" display="inline"><mml:mrow><mml:mi mathvariant="italic">π</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo>|</mml:mo><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">θ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, the second stage is the prior distribution <inline-formula><mml:math id="M26" display="inline"><mml:mrow><mml:mi mathvariant="italic">π</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>|</mml:mo><mml:mi mathvariant="bold-italic">θ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, often referred to as the latent model or process model, while the third stage is the prior distribution of the model parameters <inline-formula><mml:math id="M27" display="inline"><mml:mrow><mml:mi mathvariant="italic">π</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">θ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>. As before, Bayes' formula can be used to make inference about the variables of interest <inline-formula><mml:math id="M28" display="inline"><mml:mi mathvariant="bold-italic">x</mml:mi></mml:math></inline-formula> but also about the model parameters <inline-formula><mml:math id="M29" display="inline"><mml:mi mathvariant="bold-italic">θ</mml:mi></mml:math></inline-formula> given the set of observations <inline-formula><mml:math id="M30" display="inline"><mml:mi mathvariant="bold-italic">y</mml:mi></mml:math></inline-formula>. In this study we use a three-stage hierarchical Bayesian model to model annual runoff.</p>
</sec>
<sec id="Ch1.S3.SS2">
  <label>3.2</label><title>Gaussian random fields</title>
      <p id="d1e900">GRFs are commonly used to model environmental variables like precipitation, runoff and temperature or other phenomena that are continuous in space and/or time. In this analysis, the second stage of the Bayesian hierarchical model consists of GRFs that model the spatial dependency of runoff between catchments. A continuous field <inline-formula><mml:math id="M31" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:mi>x</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo><mml:mo>;</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>∈</mml:mo><mml:mi mathvariant="script">D</mml:mi><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula> defined on a spatial domain <inline-formula><mml:math id="M32" display="inline"><mml:mrow><mml:mi mathvariant="script">D</mml:mi><mml:mo>∈</mml:mo><mml:msup><mml:mi mathvariant="script">R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> is a GRF if for any collection of locations <inline-formula><mml:math id="M33" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mi>n</mml:mi></mml:msub><mml:mo>∈</mml:mo><mml:mi mathvariant="script">D</mml:mi></mml:mrow></mml:math></inline-formula> the vector <inline-formula><mml:math id="M34" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>)</mml:mo><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mi>x</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mi>n</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:msup><mml:mo>)</mml:mo><mml:mtext>T</mml:mtext></mml:msup></mml:mrow></mml:math></inline-formula> follows a multivariate normal distribution <xref ref-type="bibr" rid="bib1.bibx7" id="paren.22"/>, i.e. <inline-formula><mml:math id="M35" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>)</mml:mo><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mi>x</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mi>n</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:msup><mml:mo>)</mml:mo><mml:mtext>T</mml:mtext></mml:msup><mml:mo>∼</mml:mo><mml:mi mathvariant="script">N</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">μ</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold">Σ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> where <inline-formula><mml:math id="M36" display="inline"><mml:mi mathvariant="bold-italic">μ</mml:mi></mml:math></inline-formula> is a vector of expected values and <inline-formula><mml:math id="M37" display="inline"><mml:mi mathvariant="bold">Σ</mml:mi></mml:math></inline-formula> is the covariance matrix. The covariance matrix <inline-formula><mml:math id="M38" display="inline"><mml:mi mathvariant="bold">Σ</mml:mi></mml:math></inline-formula> defines the dependency structure in the spatial domain, and element <inline-formula><mml:math id="M39" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is typically constructed from a covariance function <inline-formula><mml:math id="M40" display="inline"><mml:mrow><mml:mi>C</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>. The dependency structure for a spatial process is often characterized by two parameters: the marginal variance <inline-formula><mml:math id="M41" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> and the range <inline-formula><mml:math id="M42" display="inline"><mml:mi mathvariant="italic">ρ</mml:mi></mml:math></inline-formula>. The marginal variance provides information about the spatial variability of the process of interest, while the range gives information about how the covariance between the process at two locations decays with distance. The range is defined as the distance at which the correlation between two locations in space has dropped to almost 0. If the range and the marginal variance are constant over the spatial domain, we have a stationary GRF.</p>
      <?pagebreak page4115?><p id="d1e1152">In this study, the involved GRFs have their dependency structure specified by a stationary Matérn covariance function that is given by
            <disp-formula id="Ch1.E2" content-type="numbered"><label>2</label><mml:math id="M43" display="block"><mml:mrow><mml:mi>C</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow><mml:mrow><mml:msup><mml:mn mathvariant="normal">2</mml:mn><mml:mrow><mml:mi mathvariant="italic">ν</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mi mathvariant="normal">Γ</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ν</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>(</mml:mo><mml:mi mathvariant="italic">κ</mml:mi><mml:mo>|</mml:mo><mml:mo>|</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>|</mml:mo><mml:mo>|</mml:mo><mml:msup><mml:mo>)</mml:mo><mml:mi mathvariant="italic">ν</mml:mi></mml:msup><mml:msub><mml:mi>K</mml:mi><mml:mi mathvariant="italic">ν</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="italic">κ</mml:mi><mml:mo>|</mml:mo><mml:mo>|</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>|</mml:mo><mml:mo>|</mml:mo><mml:mo>)</mml:mo><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>
          Here, <inline-formula><mml:math id="M44" display="inline"><mml:mrow><mml:mo>|</mml:mo><mml:mo>|</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>|</mml:mo><mml:mo>|</mml:mo></mml:mrow></mml:math></inline-formula> is the Euclidean distance between two locations <inline-formula><mml:math id="M45" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>∈</mml:mo><mml:msup><mml:mi mathvariant="script">R</mml:mi><mml:mi>d</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M46" display="inline"><mml:mrow><mml:msub><mml:mi>K</mml:mi><mml:mi mathvariant="italic">ν</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the modified Bessel function of the second kind and order <inline-formula><mml:math id="M47" display="inline"><mml:mrow><mml:mi mathvariant="italic">ν</mml:mi><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M48" display="inline"><mml:mrow><mml:mi mathvariant="normal">Γ</mml:mi><mml:mo>(</mml:mo><mml:mo>⋅</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is the gamma function and <inline-formula><mml:math id="M49" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> is the marginal variance that controls the spatial variability <xref ref-type="bibr" rid="bib1.bibx22" id="paren.23"/>. The parameter <inline-formula><mml:math id="M50" display="inline"><mml:mi mathvariant="italic">κ</mml:mi></mml:math></inline-formula> is a scale parameter, and it can be shown empirically that the spatial range can be expressed as <inline-formula><mml:math id="M51" display="inline"><mml:mrow><mml:mi mathvariant="italic">ρ</mml:mi><mml:mo>=</mml:mo><mml:msqrt><mml:mrow><mml:mn mathvariant="normal">8</mml:mn><mml:mi mathvariant="italic">ν</mml:mi></mml:mrow></mml:msqrt><mml:mo>/</mml:mo><mml:mi mathvariant="italic">κ</mml:mi></mml:mrow></mml:math></inline-formula>, where <inline-formula><mml:math id="M52" display="inline"><mml:mi mathvariant="italic">ρ</mml:mi></mml:math></inline-formula> is defined as the distance at which the correlation between two locations has dropped to 0.1. Using a Matérn GRF is convenient for computational reasons because it makes it possible to use the stochastic partial differential equation (SPDE) approach to spatial modelling from <xref ref-type="bibr" rid="bib1.bibx29" id="text.24"/> which is briefly described in Sect. <xref ref-type="sec" rid="Ch1.S4.SS3"/>.</p>
</sec>
<sec id="Ch1.S3.SS3">
  <label>3.3</label><title>Kriging and top-kriging</title>
      <p id="d1e1423">Within the geostatistical framework, kriging approaches have shown promising results for interpolation of hydrological variables (see e.g. <xref ref-type="bibr" rid="bib1.bibx19" id="altparen.25"/>, <xref ref-type="bibr" rid="bib1.bibx40" id="altparen.26"/> or <xref ref-type="bibr" rid="bib1.bibx32" id="altparen.27"/>). In kriging methods, the target variable is represented as a random field, typically a Gaussian random field <inline-formula><mml:math id="M53" display="inline"><mml:mrow><mml:mi>x</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> defined through a covariance structure and some unknown parameters. The process of interest is observed at <inline-formula><mml:math id="M54" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> locations <inline-formula><mml:math id="M55" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, and any unknown parameters can be estimated based on e.g. maximum likelihood procedures. Furthermore, to estimate the value of the variable <inline-formula><mml:math id="M56" display="inline"><mml:mrow><mml:mover accent="true"><mml:mi>x</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> at an unobserved location <inline-formula><mml:math id="M57" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> a weighted average of the observations is used, i.e.
            <disp-formula id="Ch1.E3" content-type="numbered"><label>3</label><mml:math id="M58" display="block"><mml:mrow><mml:mover accent="true"><mml:mi>x</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mn mathvariant="bold">0</mml:mn></mml:msub><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:munderover><mml:msub><mml:mi mathvariant="italic">λ</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mi>x</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
          where <inline-formula><mml:math id="M59" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">λ</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> are interpolation weights and <inline-formula><mml:math id="M60" display="inline"><mml:mrow><mml:mi>x</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> for <inline-formula><mml:math id="M61" display="inline"><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mi>n</mml:mi></mml:mrow></mml:math></inline-formula> are observed values. The interpolation weights are computed by assuming that <inline-formula><mml:math id="M62" display="inline"><mml:mrow><mml:mover accent="true"><mml:mi>x</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is the best linear unbiased estimator (BLUE) of <inline-formula><mml:math id="M63" display="inline"><mml:mrow><mml:mi>x</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>. That is, we determine <inline-formula><mml:math id="M64" display="inline"><mml:mrow><mml:mover accent="true"><mml:mi>x</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> by finding the weights that both minimize the mean squared error and that give zero mean expected error <xref ref-type="bibr" rid="bib1.bibx7" id="paren.28"/>. Note that the consequence of the latter is that the kriging weights are restricted to sum to 1, i.e. <inline-formula><mml:math id="M65" display="inline"><mml:mrow><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:msubsup><mml:msub><mml:mi mathvariant="italic">λ</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> if we assume that the process is homogeneous in space.</p>
      <p id="d1e1699">Further, to minimize the mean squared error of the kriging predictor in Eq. (<xref ref-type="disp-formula" rid="Ch1.E3"/>), the covariance function (or variogram) must be estimated and evaluated. The covariance function typically depends on the distance between the observations and the target locations, such that observations measured close to the target location <inline-formula><mml:math id="M66" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> are weighted more than observations further away. In many hydrological applications, the centroids of the catchments are used to compute the catchment distances <xref ref-type="bibr" rid="bib1.bibx32 bib1.bibx46" id="paren.29"/>, but as mentioned in the introduction this can lead to a violation of basic mass conservation laws. The reason is that streamflow variables are connected to (catchment) areas, not single point locations. Catchments are also organized into subcatchments, and this should be considered when computing the kriging weights.</p>
      <p id="d1e1718">The top-kriging approach suggested by <xref ref-type="bibr" rid="bib1.bibx43" id="text.30"/> is an example of a kriging method that takes the nested structure of catchments into account. In this method, the streamflow observations are interpreted as areal referenced, and the covariance is computed based on the pairwise distances between all grid nodes in a discretization of the involved catchments. This way, observations from a subcatchment can be weighted more than observations from nearby, non-overlapping catchments. Top-kriging is currently one of the leading methods for interpolation of hydrological variables <xref ref-type="bibr" rid="bib1.bibx49" id="paren.31"/> and is therefore chosen as a benchmark when we evaluate our new interpolation approach.</p>
</sec>
<sec id="Ch1.S3.SS4">
  <label>3.4</label><title>Methods for exploiting short records (record augmentation techniques)</title>
      <p id="d1e1736">The framework we suggest is both a framework for spatial interpolation and a framework for record augmentation. There exist several approaches for record augmentation for which many of them are based on developing a linear relationship between the target catchment and one or several catchments with longer time series of runoff <xref ref-type="bibr" rid="bib1.bibx13 bib1.bibx28 bib1.bibx31" id="paren.32"/>. One class of approaches is the maintenance of variance extension (MOVE) methods. MOVE methods are based on developing a linear relationship between the target catchment and the donor(s) catchment(s) by assuming that the sample mean and sample variance of runoff are maintained over time for the target catchment <xref ref-type="bibr" rid="bib1.bibx23" id="paren.33"/>. There are different ways the sample mean and sample variance can be estimated, giving different estimators for the predicted runoff. Another way to develop a linear relationship between a donor and a target catchment is to use simple linear regression <xref ref-type="bibr" rid="bib1.bibx23" id="paren.34"/>. In this article, we use simple linear regression as a benchmark method, in addition to top-kriging.</p>
      <?pagebreak page4116?><p id="d1e1748">Assume annual runoff is observed for year <inline-formula><mml:math id="M67" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mi>n</mml:mi></mml:mrow></mml:math></inline-formula> in the target catchment and that there exist annual runoff data from some other catchments for year <inline-formula><mml:math id="M68" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mi>n</mml:mi><mml:mo>+</mml:mo><mml:mi>m</mml:mi></mml:mrow></mml:math></inline-formula>. Simple linear regression is performed by first finding a so-called donor catchment for the catchment of interest. This can be e.g. the closest catchment in space or a catchment with similar catchment characteristics (elevation, annual precipitation, vegetation). Next, it is assumed that there is a linear relationship between the annual runoff in the target catchment and the donor catchment, <inline-formula><mml:math id="M69" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi mathvariant="italic">ϵ</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> for <inline-formula><mml:math id="M70" display="inline"><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mi>n</mml:mi></mml:mrow></mml:math></inline-formula>, where <inline-formula><mml:math id="M71" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the annual runoff in the target catchment, <inline-formula><mml:math id="M72" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the annual runoff in the donor catchment, <inline-formula><mml:math id="M73" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ϵ</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is normal distributed measurement error <inline-formula><mml:math id="M74" display="inline"><mml:mrow><mml:mi mathvariant="script">N</mml:mi><mml:mo>(</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> with fixed (but typically unknown) variance <inline-formula><mml:math id="M75" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>, and <inline-formula><mml:math id="M76" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M77" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> are coefficients that must be estimated. The linear relationship between the two catchments is developed by estimating <inline-formula><mml:math id="M78" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M79" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> by minimizing the sum of least squares, <inline-formula><mml:math id="M80" display="inline"><mml:mrow><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:msubsup><mml:mo>(</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>+</mml:mo><mml:mi mathvariant="italic">β</mml:mi><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:msup><mml:mo>)</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>. Next, the linear relationship can be used to estimate the runoff in the target catchment <inline-formula><mml:math id="M81" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>+</mml:mo><mml:mi>m</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> based on <inline-formula><mml:math id="M82" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>+</mml:mo><mml:mi>m</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> with corresponding uncertainty estimates.</p>
</sec>
</sec>
<sec id="Ch1.S4">
  <label>4</label><title>A geostatistical framework for exploiting long-term averages and short records</title>
      <p id="d1e2076">In this section we present the suggested Bayesian geostatistical framework for runoff interpolation. We start by developing a three-stage hierarchical model for annual runoff consisting of a process model, an observation likelihood and prior distributions as described in Sect. <xref ref-type="sec" rid="Ch1.S3.SS1"/>. Next, we highlight two model properties that make the suggested framework different from most other methods used for interpolation in hydrology (Sect. <xref ref-type="sec" rid="Ch1.S4.SS2"/>) and explain how the framework is made computationally feasible (Sect. <xref ref-type="sec" rid="Ch1.S4.SS3"/>).</p>
<sec id="Ch1.S4.SS1">
  <label>4.1</label><title>Hierarchical model for annual runoff</title>
<sec id="Ch1.S4.SS1.SSS1">
  <label>4.1.1</label><title>True annual runoff (process models)</title>
      <p id="d1e2099">Let the spatial process <inline-formula><mml:math id="M83" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:msub><mml:mi>q</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo><mml:mo>:</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>∈</mml:mo><mml:mi mathvariant="script">D</mml:mi><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula> denote the runoff-generating process at a point location <inline-formula><mml:math id="M84" display="inline"><mml:mi mathvariant="bold-italic">u</mml:mi></mml:math></inline-formula> in the spatial domain <inline-formula><mml:math id="M85" display="inline"><mml:mrow><mml:mi mathvariant="script">D</mml:mi><mml:mo>∈</mml:mo><mml:msup><mml:mi mathvariant="script">R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> in year <inline-formula><mml:math id="M86" display="inline"><mml:mi>j</mml:mi></mml:math></inline-formula>. The true annual runoff generated at point location <inline-formula><mml:math id="M87" display="inline"><mml:mi mathvariant="bold-italic">u</mml:mi></mml:math></inline-formula> in year <inline-formula><mml:math id="M88" display="inline"><mml:mi>j</mml:mi></mml:math></inline-formula> is modelled as
              <disp-formula id="Ch1.E4" content-type="numbered"><label>4</label><mml:math id="M89" display="block"><mml:mtable class="split" rowspacing="0.2ex" columnspacing="1em" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi mathvariant="normal">c</mml:mi></mml:msub><mml:mo>+</mml:mo><mml:mi>c</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo><mml:mspace linebreak="nobreak" width="1em"/><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mi>r</mml:mi><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mi mathvariant="italic">π</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi mathvariant="normal">c</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:mo>∼</mml:mo><mml:mi mathvariant="script">N</mml:mi><mml:mo>(</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo><mml:mo>(</mml:mo><mml:mn mathvariant="normal">10</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mn mathvariant="normal">000</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mtext>mm yr</mml:mtext><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:msup><mml:mo>)</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>)</mml:mo><mml:mo>;</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mi mathvariant="italic">π</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>|</mml:mo><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">β</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:mo>∼</mml:mo><mml:mi mathvariant="script">N</mml:mi><mml:mo>(</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">β</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mi mathvariant="italic">π</mml:mi><mml:mo>(</mml:mo><mml:mi>c</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo><mml:mo>|</mml:mo><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi mathvariant="normal">c</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>c</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:mo>∼</mml:mo><mml:mi mathvariant="normal">GRF</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi mathvariant="normal">c</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>c</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mi mathvariant="italic">π</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo><mml:mo>|</mml:mo><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi>x</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>x</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:mo>∼</mml:mo><mml:mi mathvariant="normal">GRF</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi>x</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>x</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
            where <inline-formula><mml:math id="M90" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi mathvariant="normal">c</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is an intercept common for all years <inline-formula><mml:math id="M91" display="inline"><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mi>r</mml:mi></mml:mrow></mml:math></inline-formula> that models the average runoff in the study area over time, while <inline-formula><mml:math id="M92" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is a year-specific intercept that models the annual discrepancy from the long-term average runoff. Likewise is <inline-formula><mml:math id="M93" display="inline"><mml:mrow><mml:mi>c</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> a spatial effect that models the long-term spatial variability of runoff that is caused by climatic conditions in the study area, while <inline-formula><mml:math id="M94" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is a year-specific spatial effect that models the spatial variability due to annual discrepancy from the climate. We emphasize that, in this context, climate is for simplicity used as a collective term that describes both runoff generation caused by long-term weather patterns <italic>and</italic> the runoff generation due to static catchment characteristics like e.g. elevation and slope. The two spatial effects are modelled as GRFs with zero mean and stationary Matérn covariance functions with <inline-formula><mml:math id="M95" display="inline"><mml:mrow><mml:mi mathvariant="italic">ν</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula>, given a range and a marginal variance parameter; <inline-formula><mml:math id="M96" display="inline"><mml:mrow><mml:mi>c</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> with range parameter <inline-formula><mml:math id="M97" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi mathvariant="normal">c</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and marginal variance <inline-formula><mml:math id="M98" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>c</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula>; and <inline-formula><mml:math id="M99" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> with range parameter <inline-formula><mml:math id="M100" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi>x</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and marginal variance <inline-formula><mml:math id="M101" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula>. Furthermore, the spatial fields <inline-formula><mml:math id="M102" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> for <inline-formula><mml:math id="M103" display="inline"><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mi>r</mml:mi></mml:mrow></mml:math></inline-formula> are assumed to be independent realizations, or replicates, of the same underlying field to increase the identifiability of the model parameters <xref ref-type="bibr" rid="bib1.bibx26" id="paren.35"/>. The same applies for the year-dependent intercepts <inline-formula><mml:math id="M104" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> that are all assigned a Gaussian prior <inline-formula><mml:math id="M105" display="inline"><mml:mrow><mml:mi mathvariant="script">N</mml:mi><mml:mo>(</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">β</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> given the variance parameter <inline-formula><mml:math id="M106" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">β</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula>. The intercept <inline-formula><mml:math id="M107" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi mathvariant="normal">c</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is assigned the weakly informative wide Gaussian prior <inline-formula><mml:math id="M108" display="inline"><mml:mrow><mml:mi mathvariant="script">N</mml:mi><mml:mo>(</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo><mml:mo>(</mml:mo><mml:mn mathvariant="normal">10</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mn mathvariant="normal">000</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mtext>mm yr</mml:mtext><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:msup><mml:mo>)</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>.</p>
      <p id="d1e2759">So far, runoff has been defined for point locations in space. However, runoff observations are linked to catchment areas, and we need to define the true annual runoff generated inside a catchment <inline-formula><mml:math id="M109" display="inline"><mml:mi mathvariant="script">A</mml:mi></mml:math></inline-formula>. We suggest two alternative models: the first model is denoted <italic>the areal model</italic>. For the areal model, the true annual runoff in catchment <inline-formula><mml:math id="M110" display="inline"><mml:mi mathvariant="script">A</mml:mi></mml:math></inline-formula> in year <inline-formula><mml:math id="M111" display="inline"><mml:mi>j</mml:mi></mml:math></inline-formula> is given by the average point runoff over the catchment area, i.e.
              <disp-formula id="Ch1.E5" content-type="numbered"><label>5</label><mml:math id="M112" display="block"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="script">A</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mn mathvariant="normal">1</mml:mn><mml:mrow><mml:mo>|</mml:mo><mml:mi mathvariant="script">A</mml:mi><mml:mo>|</mml:mo></mml:mrow></mml:mfrac></mml:mstyle><mml:munder><mml:mo movablelimits="false">∫</mml:mo><mml:mrow><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>∈</mml:mo><mml:mi mathvariant="script">A</mml:mi></mml:mrow></mml:munder><mml:msub><mml:mi>q</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo><mml:mi>d</mml:mi><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
            where <inline-formula><mml:math id="M113" display="inline"><mml:mrow><mml:mo>|</mml:mo><mml:mi mathvariant="script">A</mml:mi><mml:mo>|</mml:mo></mml:mrow></mml:math></inline-formula> is the catchment area and <inline-formula><mml:math id="M114" display="inline"><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is the point runoff from Eq. (<xref ref-type="disp-formula" rid="Ch1.E4"/>). Interpreting annual runoff as an integral of point runoff ensures that the water balance is approximately preserved for the posterior mean runoff for any point in the landscape. Thus, the areal model is a model for mass-conserved hydrological variables. It also gives a realistic representation of distances and hence also the correlation between the catchments under study (see Eq. <xref ref-type="disp-formula" rid="Ch1.E2"/>).</p>
      <p id="d1e2878">The second model for the annual runoff generated inside a catchment area is denoted <italic>the centroid model</italic>. For the centroid model, the true annual runoff inside a catchment <inline-formula><mml:math id="M115" display="inline"><mml:mi mathvariant="script">A</mml:mi></mml:math></inline-formula> in year <inline-formula><mml:math id="M116" display="inline"><mml:mi>j</mml:mi></mml:math></inline-formula> is given by
              <disp-formula id="Ch1.E6" content-type="numbered"><label>6</label><mml:math id="M117" display="block"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="script">A</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:msub><mml:mi>q</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mi mathvariant="script">A</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
            where <inline-formula><mml:math id="M118" display="inline"><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mi mathvariant="script">A</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is the point runoff from Eq. (<xref ref-type="disp-formula" rid="Ch1.E4"/>), and <inline-formula><mml:math id="M119" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mi mathvariant="script">A</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the centroid of catchment <inline-formula><mml:math id="M120" display="inline"><mml:mi mathvariant="script">A</mml:mi></mml:math></inline-formula>. This alternative does not provide a preservation of the water balance for the posterior mean predicted runoff and can be used for any point-referenced environmental variable. Distances are measured between catchment centroids, such that this method is more similar to the traditional kriging methods described in Sect. <xref ref-type="sec" rid="Ch1.S3.SS3"/>.</p>
</sec>
<sec id="Ch1.S4.SS1.SSS2">
  <label>4.1.2</label><title>Observation likelihood</title>
      <p id="d1e2984">The true annual runoff from Sect. <xref ref-type="sec" rid="Ch1.S4.SS1.SSS1"/> is observed with uncertainty through streamflow data from <inline-formula><mml:math id="M121" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> catchments which we denote <inline-formula><mml:math id="M122" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="script">A</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="script">A</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>. We use the following model for the observed runoff <inline-formula><mml:math id="M123" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> in catchment <inline-formula><mml:math id="M124" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="script">A</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> in year <inline-formula><mml:math id="M125" display="inline"><mml:mi>j</mml:mi></mml:math></inline-formula>:
              <disp-formula id="Ch1.E7" content-type="numbered"><label>7</label><mml:math id="M126" display="block"><mml:mtable columnspacing="1em" rowspacing="0.2ex" class="split" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:msub><mml:mi>Q</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="script">A</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:msub><mml:mi mathvariant="italic">ϵ</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>;</mml:mo><?xmltex \hspace{4mm}?><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mi>n</mml:mi><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="1em"/><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mi>r</mml:mi><mml:mo>.</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mi mathvariant="italic">π</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>|</mml:mo><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>y</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:mo>∼</mml:mo><mml:mi mathvariant="script">N</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi>Q</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="script">A</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mi>s</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>y</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup><mml:mo>)</mml:mo><mml:mo>.</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula></p>
      <p id="d1e3200">Here, <inline-formula><mml:math id="M127" display="inline"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="script">A</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is the true runoff from Eq. (<xref ref-type="disp-formula" rid="Ch1.E5"/>) if we use the areal model, or the true runoff from Eq. (<xref ref-type="disp-formula" rid="Ch1.E6"/>) if we use the centroid model. The error terms <inline-formula><mml:math id="M128" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ϵ</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> are identically, independently distributed as <inline-formula><mml:math id="M129" display="inline"><mml:mrow><mml:mi mathvariant="script">N</mml:mi><mml:mo>(</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo><mml:msub><mml:mi>s</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>y</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> given the parameter <inline-formula><mml:math id="M130" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>y</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula>, and we assume that each observation has its own uncertainty by scaling the variance parameter <inline-formula><mml:math id="M131" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>y</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula> with a fixed factor <inline-formula><mml:math id="M132" display="inline"><mml:mrow><mml:msub><mml:mi>s</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> that is further specified in Sect. <xref ref-type="sec" rid="Ch1.S4.SS1.SSS3"/>.</p>
      <?pagebreak page4117?><p id="d1e3315">Through the observation likelihood and the areal formulation of annual runoff from Eq. (<xref ref-type="disp-formula" rid="Ch1.E5"/>), the areal model puts (soft) constraints on the annual runoff over the catchment areas of the gauged catchments. This way the areal model is able to influence the model to distribute the observed annual runoff within the catchment areas and not only at certain gauging points which is what the centroid model does. This represents a potential benefit for the areal model compared to the centroid model when modelling runoff. However, imposing constraints on areas also comes with a computational cost.</p>
</sec>
<sec id="Ch1.S4.SS1.SSS3">
  <label>4.1.3</label><title>Prior models</title>
      <p id="d1e3328">According to the model specification in Sect. <xref ref-type="sec" rid="Ch1.S4.SS1.SSS1"/> and <xref ref-type="sec" rid="Ch1.S4.SS1.SSS2"/>, there are six model parameters in the suggested hierarchical model for annual runoff, i.e. <inline-formula><mml:math id="M133" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>y</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi mathvariant="normal">c</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>c</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi>x</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>x</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">β</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>. As we apply the Bayesian framework, these have to be given prior distributions, and we use knowledge based priors for most parameters. Note that since the priors are based on expert opinions about the study area, they are specific for the Norwegian dataset and should be modified before further use for other countries or environmental variables.</p>
      <p id="d1e3386">In the observation model for runoff in Eq. (<xref ref-type="disp-formula" rid="Ch1.E7"/>), each observation is allowed to have its own measurement uncertainty by scaling the variance parameter <inline-formula><mml:math id="M134" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>y</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula>, with a fixed scale <inline-formula><mml:math id="M135" display="inline"><mml:mrow><mml:msub><mml:mi>s</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>. This makes sense because the spatial variability of mean annual runoff in Norway is large, with values ranging from around 400 to 4000 mm yr<inline-formula><mml:math id="M136" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>, and heteroscedastic errors can be expected <xref ref-type="bibr" rid="bib1.bibx36" id="paren.36"/>. In the specification of the prior standard deviation <inline-formula><mml:math id="M137" display="inline"><mml:msqrt><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>y</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup><mml:msub><mml:mi>s</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:msqrt></mml:math></inline-formula>, we assume that the measurement uncertainty for runoff increases with the magnitude of the observed value <inline-formula><mml:math id="M138" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>.
Based on this we suggest the following scaling factors:
              <disp-formula id="Ch1.E8" content-type="numbered"><label>8</label><mml:math id="M139" display="block"><mml:mrow><mml:msub><mml:mi>s</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mo>(</mml:mo><mml:mn mathvariant="normal">0.025</mml:mn><mml:mo>⋅</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>/</mml:mo><mml:mn mathvariant="normal">1000</mml:mn><mml:msup><mml:mo>)</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
            where <inline-formula><mml:math id="M140" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> is the observed runoff in catchment <inline-formula><mml:math id="M141" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula> in year <inline-formula><mml:math id="M142" display="inline"><mml:mi>j</mml:mi></mml:math></inline-formula> in mm yr<inline-formula><mml:math id="M143" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>. The scaling factors are chosen based on what the data provider NVE believes are realistic standard deviations for the observed values, around <inline-formula><mml:math id="M144" display="inline"><mml:mrow><mml:mn mathvariant="normal">2.5</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="italic">%</mml:mi></mml:mrow></mml:math></inline-formula> of the observed runoff. They are scaled down by 1000 to achieve appropriate values for <inline-formula><mml:math id="M145" display="inline"><mml:mrow><mml:msub><mml:mi>s</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>y</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula>. For the variance parameter <inline-formula><mml:math id="M146" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>y</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula>, we use the penalized complexity prior (PC prior) suggested by <xref ref-type="bibr" rid="bib1.bibx42" id="text.37"/>. The PC prior is a prior constructed for the precision, i.e. the inverse of the variance, and the PC prior for the precision <inline-formula><mml:math id="M147" display="inline"><mml:mi mathvariant="italic">τ</mml:mi></mml:math></inline-formula> of a Gaussian effect <inline-formula><mml:math id="M148" display="inline"><mml:mrow><mml:mi mathvariant="script">N</mml:mi><mml:mo>(</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo><mml:msup><mml:mi mathvariant="italic">τ</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> has density
              <disp-formula id="Ch1.E9" content-type="numbered"><label>9</label><mml:math id="M149" display="block"><mml:mrow><mml:mi mathvariant="italic">π</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="italic">τ</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mi mathvariant="italic">λ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle><mml:msup><mml:mi mathvariant="italic">τ</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup><mml:mi>exp⁡</mml:mi><mml:mo>(</mml:mo><mml:mo>-</mml:mo><mml:mi mathvariant="italic">λ</mml:mi><mml:msup><mml:mi mathvariant="italic">τ</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup><mml:mo>)</mml:mo><mml:mo>,</mml:mo><mml:mspace width="1em" linebreak="nobreak"/><mml:mi mathvariant="italic">τ</mml:mi><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo><mml:mspace width="1em" linebreak="nobreak"/><mml:mi mathvariant="italic">λ</mml:mi><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
            where <inline-formula><mml:math id="M150" display="inline"><mml:mi mathvariant="italic">λ</mml:mi></mml:math></inline-formula> is a parameter that determines the penalty of deviating from a simpler base model. The parameter <inline-formula><mml:math id="M151" display="inline"><mml:mi mathvariant="italic">λ</mml:mi></mml:math></inline-formula> can be specified through a quantile <inline-formula><mml:math id="M152" display="inline"><mml:mi>u</mml:mi></mml:math></inline-formula> and a probability <inline-formula><mml:math id="M153" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula> by <inline-formula><mml:math id="M154" display="inline"><mml:mrow><mml:mi mathvariant="normal">Prob</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>&gt;</mml:mo><mml:mi>u</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mi mathvariant="italic">α</mml:mi></mml:mrow></mml:math></inline-formula>, where <inline-formula><mml:math id="M155" display="inline"><mml:mrow><mml:mi>u</mml:mi><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M156" display="inline"><mml:mrow><mml:mn mathvariant="normal">0</mml:mn><mml:mo>&lt;</mml:mo><mml:mi mathvariant="italic">α</mml:mi><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M157" display="inline"><mml:mrow><mml:mi mathvariant="italic">λ</mml:mi><mml:mo>=</mml:mo><mml:mo>-</mml:mo><mml:mi>ln⁡</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="italic">α</mml:mi><mml:mo>)</mml:mo><mml:mo>/</mml:mo><mml:mi>u</mml:mi></mml:mrow></mml:math></inline-formula>. Here, <inline-formula><mml:math id="M158" display="inline"><mml:mrow><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:msqrt><mml:mi mathvariant="italic">τ</mml:mi></mml:msqrt></mml:mrow></mml:math></inline-formula> is the standard deviation of this Gaussian distribution. In our case, we specify the PC prior for <inline-formula><mml:math id="M159" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>y</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> as
              <disp-formula id="Ch1.E10" content-type="numbered"><label>10</label><mml:math id="M160" display="block"><mml:mrow><mml:mtext>Prob</mml:mtext><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>y</mml:mi></mml:msub><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">1500</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="normal">mm</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">yr</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.1</mml:mn><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>
            Recall that <inline-formula><mml:math id="M161" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>y</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is scaled with <inline-formula><mml:math id="M162" display="inline"><mml:mrow><mml:msub><mml:mi>s</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> in the final uncertainty model such that a prior 95 % credible interval for the standard deviation <inline-formula><mml:math id="M163" display="inline"><mml:msqrt><mml:mrow><mml:mo>(</mml:mo><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>y</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup><mml:msub><mml:mi>s</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:msqrt></mml:math></inline-formula> for the observed runoff in catchment <inline-formula><mml:math id="M164" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="script">A</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> year <inline-formula><mml:math id="M165" display="inline"><mml:mi>j</mml:mi></mml:math></inline-formula> becomes <inline-formula><mml:math id="M166" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mn mathvariant="normal">0.04</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">6</mml:mn><mml:mo>)</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="italic">%</mml:mi></mml:mrow></mml:math></inline-formula> of the observed value <inline-formula><mml:math id="M167" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>. This is a quite strict prior that is chosen in order to influence the posterior observation uncertainty to be as low as possible. The reason behind this modelling choice is further described in Sect. <xref ref-type="sec" rid="Ch1.S4.SS2"/>. However, an observation uncertainty of 0.04 %–6 % of the observed value also corresponds quite well to what NVE knows about the measurement uncertainty for runoff in the study area. Percentages around <inline-formula><mml:math id="M168" display="inline"><mml:mrow><mml:mn mathvariant="normal">2.5</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="italic">%</mml:mi></mml:mrow></mml:math></inline-formula> are as mentioned realistic.</p>
      <p id="d1e3998">For the spatial ranges <inline-formula><mml:math id="M169" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi>x</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M170" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi mathvariant="normal">c</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and the marginal variances <inline-formula><mml:math id="M171" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M172" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>c</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula> for the Gaussian random fields <inline-formula><mml:math id="M173" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M174" display="inline"><mml:mrow><mml:mi>c</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, we use the joint informative PC prior suggested in <xref ref-type="bibr" rid="bib1.bibx15" id="text.38"/>. It is specified through the following probabilities and quantiles:

                  <disp-formula specific-use="align"><mml:math id="M175" display="block"><mml:mtable displaystyle="true"><mml:mtr><mml:mtd><mml:mstyle displaystyle="true" class="stylechange"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mi mathvariant="normal">Prob</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi>x</mml:mi></mml:msub><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">20</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mtext> km</mml:mtext><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.1</mml:mn><mml:mo>,</mml:mo><mml:mi mathvariant="normal">Prob</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>x</mml:mi></mml:msub><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">2000</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mtext>mm yr</mml:mtext><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.1</mml:mn><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mstyle class="stylechange" displaystyle="true"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mi mathvariant="normal">Prob</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi mathvariant="normal">c</mml:mi></mml:msub><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">20</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mtext> km</mml:mtext><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.1</mml:mn><mml:mo>,</mml:mo><mml:mi mathvariant="normal">Prob</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>c</mml:mi></mml:msub><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">2000</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mtext>mm yr</mml:mtext><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.1</mml:mn><mml:mo>.</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>

              The percentages and quantiles are chosen based on expert knowledge about the spatial variability in the area of interest. It is reasonable to assume that locations that are less than 20 km apart are correlated when it comes to runoff generation. In Norway the annual runoff varies from around 300 to 6000 mm yr<inline-formula><mml:math id="M176" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> such that a marginal standard deviation that is below 2000 mm yr<inline-formula><mml:math id="M177" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> is reasonable. The parameters of the climatic GRF <inline-formula><mml:math id="M178" display="inline"><mml:mrow><mml:mi>c</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> and the year-dependent GRF <inline-formula><mml:math id="M179" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> are given the same prior as it is difficult to identify whether the spatial variability mainly comes from climatic processes or from annual variations. We also want the data to decide which of the two effects that dominates in the study area, and in this way detect hydrological spatial stability or instability. Recall that the phrase <italic>hydrological spatial stability</italic> here is used to describe a variable and/or a study area that is characterized by an underlying spatial pattern that is repeated over time.</p>
      <p id="d1e4273">As specified in Sect. <xref ref-type="sec" rid="Ch1.S4.SS1.SSS1"/>, the year-specific intercepts <inline-formula><mml:math id="M180" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> for <inline-formula><mml:math id="M181" display="inline"><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mi>r</mml:mi></mml:mrow></mml:math></inline-formula> are all assigned the same Gaussian prior <inline-formula><mml:math id="M182" display="inline"><mml:mrow><mml:mi mathvariant="script">N</mml:mi><mml:mo>(</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">β</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> given the standard deviation parameter <inline-formula><mml:math id="M183" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">β</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>. The standard deviation <inline-formula><mml:math id="M184" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">β</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is given the PC prior from Eq. (<xref ref-type="disp-formula" rid="Ch1.E9"/>) specified by the wide prior <inline-formula><mml:math id="M185" display="inline"><mml:mrow><mml:mi>P</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">β</mml:mi></mml:msub><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">10</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="normal">m</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">yr</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.2</mml:mn></mml:mrow></mml:math></inline-formula>. With this prior, the prior <inline-formula><mml:math id="M186" display="inline"><mml:mrow><mml:mn mathvariant="normal">95</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="italic">%</mml:mi></mml:mrow></mml:math></inline-formula> credible interval is approximately <inline-formula><mml:math id="M187" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mn mathvariant="normal">0.002</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">40.5</mml:mn><mml:mo>)</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="normal">m</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">yr</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> for the standard deviation <inline-formula><mml:math id="M188" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">β</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> of <inline-formula><mml:math id="M189" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>.</p>
</sec>
<sec id="Ch1.S4.SS1.SSS4">
  <label>4.1.4</label><title>Feasible computation of catchment runoff for the areal model</title>
      <?pagebreak page4118?><p id="d1e4465">In the areal model in Eq. (<xref ref-type="disp-formula" rid="Ch1.E5"/>), the true runoff is modelled as the integral of point runoff over a catchment. To make the areal model computationally feasible, the integral is calculated by a finite sum over a discretization of the target catchment. More specifically, if <inline-formula><mml:math id="M190" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="script">L</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> denotes the discretization of catchment <inline-formula><mml:math id="M191" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="script">A</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, the annual runoff in catchment <inline-formula><mml:math id="M192" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="script">A</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> in year <inline-formula><mml:math id="M193" display="inline"><mml:mi>j</mml:mi></mml:math></inline-formula> is calculated as
              <disp-formula id="Ch1.E11" content-type="numbered"><label>11</label><mml:math id="M194" display="block"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="script">A</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mn mathvariant="normal">1</mml:mn><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mstyle><mml:munder><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>∈</mml:mo><mml:msub><mml:mi mathvariant="script">L</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:munder><mml:msub><mml:mi>q</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
            where <inline-formula><mml:math id="M195" display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the number of grid nodes in the discretization <inline-formula><mml:math id="M196" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="script">L</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>. In the discretization of the catchments it is important that a subcatchment shares grid nodes with its overlapping catchment(s) such that the water balance can be preserved. In our analysis, we use a regular grid with 4 km spacing. It is also important that the discretization of the study area is fine enough to capture the rapid changes in annual runoff in the study area. Otherwise, non-realistic results such as negative runoff can occur.</p>
</sec>
<sec id="Ch1.S4.SS1.SSS5">
  <label>4.1.5</label><title>Full model specification</title>
      <p id="d1e4601">Assuming that we observe runoff at <inline-formula><mml:math id="M197" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> stream gauges for <inline-formula><mml:math id="M198" display="inline"><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mi>r</mml:mi></mml:mrow></mml:math></inline-formula> years and that <inline-formula><mml:math id="M199" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="script">L</mml:mi><mml:mi mathvariant="script">D</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> contains all grid nodes in the discretization of the catchments <inline-formula><mml:math id="M200" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="script">L</mml:mi><mml:mrow><mml:msub><mml:mi mathvariant="script">A</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> for <inline-formula><mml:math id="M201" display="inline"><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mi>n</mml:mi></mml:mrow></mml:math></inline-formula>, the areal model in Sect. <xref ref-type="sec" rid="Ch1.S4.SS1.SSS1"/>–<xref ref-type="sec" rid="Ch1.S4.SS1.SSS4"/> can be summarized as the following hierarchical geostatistical model:
              <disp-formula id="Ch1.E12" content-type="numbered"><label>12</label><mml:math id="M202" display="block"><mml:mtable class="split" rowspacing="0.2ex" columnspacing="1em" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd><mml:mrow><mml:mi mathvariant="italic">π</mml:mi></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo>|</mml:mo><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>y</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:mo>∼</mml:mo><mml:munderover><mml:mo movablelimits="false">∏</mml:mo><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>r</mml:mi></mml:munderover><mml:munderover><mml:mo movablelimits="false">∏</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:munderover><mml:mo>(</mml:mo><mml:mi>I</mml:mi><mml:mo mathvariant="italic">{</mml:mo><mml:mi mathvariant="normal">Observation</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mi mathvariant="normal">is</mml:mi><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mi mathvariant="normal">available</mml:mi><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><?xmltex \hack{\hbox\bgroup\fontsize{9.5}{9.5}\selectfont$\displaystyle}?><mml:mo>⋅</mml:mo><mml:mi mathvariant="script">N</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi>Q</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="script">A</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mi>s</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>y</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>⋅</mml:mo><mml:mi>I</mml:mi><mml:mo mathvariant="italic">{</mml:mo><mml:mi mathvariant="normal">Observation</mml:mi><mml:mspace linebreak="nobreak" width="0.25em"/><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi mathvariant="normal">is</mml:mi><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi mathvariant="normal">missing</mml:mi><mml:mo mathvariant="italic">}</mml:mo><mml:mo>)</mml:mo><?xmltex \hack{$\egroup}?></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo>[</mml:mo><mml:mtext>Observation likelihood</mml:mtext><mml:mo>]</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mi mathvariant="italic">π</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>|</mml:mo><mml:mi mathvariant="bold-italic">θ</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mi mathvariant="italic">π</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:mi>c</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>)</mml:mo><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mi>c</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mi>m</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:mo>|</mml:mo><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi mathvariant="normal">c</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>c</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:mo>⋅</mml:mo><mml:mi mathvariant="italic">π</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi mathvariant="normal">c</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo>⋅</mml:mo><mml:munderover><mml:mo movablelimits="false">∏</mml:mo><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>r</mml:mi></mml:munderover><mml:mo>[</mml:mo><mml:mi mathvariant="italic">π</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>)</mml:mo><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mi>m</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:mo>|</mml:mo><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi>x</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>x</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:mo>⋅</mml:mo><mml:mi mathvariant="italic">π</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>|</mml:mo><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">β</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:mo>]</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo>[</mml:mo><mml:mtext>Latent Model</mml:mtext><mml:mo>]</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mi mathvariant="italic">π</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>y</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">θ</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mi mathvariant="italic">π</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi>x</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>x</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:mo>⋅</mml:mo><mml:mi mathvariant="italic">π</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi mathvariant="normal">c</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>c</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:mo>⋅</mml:mo><mml:mi mathvariant="italic">π</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">β</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:mo>⋅</mml:mo><mml:mi mathvariant="italic">π</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>y</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo>[</mml:mo><mml:mtext>Prior</mml:mtext><mml:mo>]</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
            where <inline-formula><mml:math id="M203" display="inline"><mml:mi mathvariant="bold-italic">y</mml:mi></mml:math></inline-formula> is a vector containing all runoff observations <inline-formula><mml:math id="M204" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> from all catchments <inline-formula><mml:math id="M205" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula> and years <inline-formula><mml:math id="M206" display="inline"><mml:mi>j</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M207" display="inline"><mml:mi mathvariant="bold-italic">x</mml:mi></mml:math></inline-formula> is a vector containing all latent variables, i.e. the intercepts <inline-formula><mml:math id="M208" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi mathvariant="normal">c</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M209" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and the GRFs <inline-formula><mml:math id="M210" display="inline"><mml:mrow><mml:mi>c</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo mathvariant="bold">⋅</mml:mo></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M211" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo mathvariant="bold">⋅</mml:mo></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> for all combinations of grid nodes <inline-formula><mml:math id="M212" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mi>m</mml:mi></mml:msub><mml:mo>∈</mml:mo><mml:msub><mml:mi mathvariant="script">L</mml:mi><mml:mi mathvariant="script">D</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and years <inline-formula><mml:math id="M213" display="inline"><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula>, …, <inline-formula><mml:math id="M214" display="inline"><mml:mi>r</mml:mi></mml:math></inline-formula>. Furthermore, <inline-formula><mml:math id="M215" display="inline"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="script">A</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is the true annual runoff that is modelled as a function of the latent field <inline-formula><mml:math id="M216" display="inline"><mml:mi mathvariant="bold-italic">x</mml:mi></mml:math></inline-formula>, while <inline-formula><mml:math id="M217" display="inline"><mml:mrow><mml:mi>I</mml:mi><mml:mo>(</mml:mo><mml:mo>⋅</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is an indicator function that is equal to 1 if its argument is true and 0 otherwise, allowing for missing data and short records of runoff. Finally, <inline-formula><mml:math id="M218" display="inline"><mml:mrow><mml:mi mathvariant="bold-italic">θ</mml:mi><mml:mo>=</mml:mo><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi>x</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>x</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi mathvariant="normal">c</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>c</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">β</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>. Together with <inline-formula><mml:math id="M219" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>y</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> it contains all model parameters.</p>
      <p id="d1e5383">The centroid model is summarized as a hierarchical model similarly, except that the true annual runoff <inline-formula><mml:math id="M220" display="inline"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="script">A</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is given by Eq. (<xref ref-type="disp-formula" rid="Ch1.E6"/>) instead of Eq. (<xref ref-type="disp-formula" rid="Ch1.E11"/>). This also means that the grid nodes <inline-formula><mml:math id="M221" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mi>m</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> in the above hierarchical model must be replaced by <inline-formula><mml:math id="M222" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mrow><mml:msub><mml:mi mathvariant="script">A</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mrow><mml:msub><mml:mi mathvariant="script">A</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>, i.e. the locations of the centroids of the <inline-formula><mml:math id="M223" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> catchments under study.</p>
      <p id="d1e5470">The purpose of Bayesian inference is to estimate the posterior distributions of the latent variables <inline-formula><mml:math id="M224" display="inline"><mml:mi mathvariant="bold-italic">x</mml:mi></mml:math></inline-formula> and the parameters <inline-formula><mml:math id="M225" display="inline"><mml:mi mathvariant="bold-italic">θ</mml:mi></mml:math></inline-formula> based on the observations <inline-formula><mml:math id="M226" display="inline"><mml:mi mathvariant="bold-italic">y</mml:mi></mml:math></inline-formula> as described in Sect. <xref ref-type="sec" rid="Ch1.S3.SS1"/>. In this study, the resulting distributions are used to quantify the variable of interest, the catchment runoff <inline-formula><mml:math id="M227" display="inline"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="script">A</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>. By Eq. (<xref ref-type="disp-formula" rid="Ch1.E6"/>) and Eq. (<xref ref-type="disp-formula" rid="Ch1.E11"/>) we see that the catchment runoff is determined by the point runoff <inline-formula><mml:math id="M228" display="inline"><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>)</mml:mo><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>q</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mi>m</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> which is again determined by the latent field <inline-formula><mml:math id="M229" display="inline"><mml:mi mathvariant="bold-italic">x</mml:mi></mml:math></inline-formula> through Eq. (<xref ref-type="disp-formula" rid="Ch1.E4"/>). This means that in the process of estimating the catchment runoff <inline-formula><mml:math id="M230" display="inline"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="script">A</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> we always estimate the point runoff <inline-formula><mml:math id="M231" display="inline"><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> and the latent field <inline-formula><mml:math id="M232" display="inline"><mml:mi mathvariant="bold-italic">x</mml:mi></mml:math></inline-formula> first. To clarify this process, consider Fig. <xref ref-type="fig" rid="Ch1.F10"/> that is presented later in the article. This shows the posterior mean runoff <inline-formula><mml:math id="M233" display="inline"><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, or <inline-formula><mml:math id="M234" display="inline"><mml:mrow><mml:mi mathvariant="italic">π</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>|</mml:mo><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> implicitly, for all points in the study area. From these point estimates, predictions for the <italic>areal</italic> model <inline-formula><mml:math id="M235" display="inline"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="script">A</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> are obtained by taking the average of <inline-formula><mml:math id="M236" display="inline"><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> over relevant grid nodes according to Eq. (<xref ref-type="disp-formula" rid="Ch1.E11"/>). For the centroid model, a catchment areal prediction <inline-formula><mml:math id="M237" display="inline"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="script">A</mml:mi></mml:mrow></mml:math></inline-formula>) is obtained by simply extracting the value of <inline-formula><mml:math id="M238" display="inline"><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mi mathvariant="script">A</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> at the catchment centroid <inline-formula><mml:math id="M239" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mi mathvariant="script">A</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> according to Eq. (<xref ref-type="disp-formula" rid="Ch1.E6"/>). From the point-referenced predictions in Fig. <xref ref-type="fig" rid="Ch1.F10"/> we this way obtain catchment predictions like the ones presented later in e.g. Fig. <xref ref-type="fig" rid="Ch1.F7"/>.</p>
      <p id="d1e5739">From the hierarchical formulation in Eq. (<xref ref-type="disp-formula" rid="Ch1.E12"/>) we also note that the framework takes the time dimension into account through multiplying the likelihood for annual runoff over different years <inline-formula><mml:math id="M240" display="inline"><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mi>r</mml:mi></mml:mrow></mml:math></inline-formula>. These years do not need to be consecutive, which allows for e.g. combining old measurements from closed stations with more recent data. Different years of data are connected through the constant climatic component (<inline-formula><mml:math id="M241" display="inline"><mml:mrow><mml:mi>c</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi mathvariant="normal">c</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>). Apart from this, there is no temporal dependency in the model that assumes correlation over time, and routing is not taken into account. This makes sense for our suggested application, as there is no prominent time dependency for annual runoff in Norway (see e.g. Fig. <xref ref-type="fig" rid="Ch1.F2"/>b). Routing effects can typically be neglected for time-aggregated runoff variables for longer timescales. For shorter timescales for which routing has an impact, other spatio-temporal models should be considered, for example the one in <xref ref-type="bibr" rid="bib1.bibx45" id="text.39"/>.</p>
</sec>
</sec>
<sec id="Ch1.S4.SS2">
  <label>4.2</label><title>Two model properties and contributions</title>
      <p id="d1e5797">In this section we highlight and describe two of the model properties that make the suggested framework different from top-kriging and other geostatistical interpolation methods that are used for hydrological applications.</p>
<sec id="Ch1.S4.SS2.SSS1">
  <label>4.2.1</label><title>Exploiting short records</title>
      <?pagebreak page4119?><p id="d1e5807">The first property we highlight is how the model is particularly suitable for exploiting short records of runoff, and this holds for both the areal model and the centroid model. This property is already briefly addressed in the introduction and is enabled because we simultaneously model several years of data with a spatial component <inline-formula><mml:math id="M242" display="inline"><mml:mrow><mml:mi>c</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> that is common for all years under study. The GRF <inline-formula><mml:math id="M243" display="inline"><mml:mrow><mml:mi>c</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> represents the long-term spatial variability of runoff. If most of the spatial variability can be explained by long-term patterns, the marginal variance parameter <inline-formula><mml:math id="M244" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>c</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula> will dominate over the marginal variance parameter <inline-formula><mml:math id="M245" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula> of the annual GRF <inline-formula><mml:math id="M246" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> (and the other model variances), i.e. <inline-formula><mml:math id="M247" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>c</mml:mi></mml:msub><mml:mo>≫</mml:mo><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>x</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>. Thus, a short record of runoff from an otherwise ungauged catchment will have a large impact also for predictions in years without data through <inline-formula><mml:math id="M248" display="inline"><mml:mrow><mml:mi>c</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>. On the other hand, if most of the annual runoff is explained by year-specific effects, <inline-formula><mml:math id="M249" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> will dominate over <inline-formula><mml:math id="M250" display="inline"><mml:mrow><mml:mi>c</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> and short records will not have a large impact on the final model. Hence, it is safe to include short records in the model regardless of the weather patterns in the study area.</p>
      <p id="d1e5945">Existing methods for exploiting short records are typically based on linear regression or computing the correlation between the runoff in the target catchment and one or several donor catchments, and in order to perform these procedures the short record must be of length larger than 1 <xref ref-type="bibr" rid="bib1.bibx13 bib1.bibx28" id="paren.40"/>. In the method we suggest it is possible to include a short record of length 1, and it is already shown for a smaller case study that this often is enough to see a large improvement in the predictability of (annual) runoff for certain climates <xref ref-type="bibr" rid="bib1.bibx37" id="paren.41"/>.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F6"><?xmltex \currentcnt{6}?><label>Figure 6</label><caption><p id="d1e5956">Conceptual, simplified figure of a river network, for which the involved catchments are discretized by 5 grid nodes <inline-formula><mml:math id="M251" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mn mathvariant="normal">5</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>, and each grid node represent one areal unit. Catchment <inline-formula><mml:math id="M252" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="script">A</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> contains all grid nodes <inline-formula><mml:math id="M253" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mn mathvariant="normal">5</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>, Catchment <inline-formula><mml:math id="M254" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="script">A</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> consists of grid nodes <inline-formula><mml:math id="M255" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M256" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>, while Catchment <inline-formula><mml:math id="M257" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="script">A</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> consists of grid nodes <inline-formula><mml:math id="M258" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mn mathvariant="normal">4</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M259" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mn mathvariant="normal">5</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>. Hence, this is a system of nested catchments where <inline-formula><mml:math id="M260" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="script">A</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> covers <inline-formula><mml:math id="M261" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="script">A</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M262" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="script">A</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>. Assume that there are observations of annual runoff at the outlet of catchment <inline-formula><mml:math id="M263" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="script">A</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> and catchment <inline-formula><mml:math id="M264" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="script">A</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>: <inline-formula><mml:math id="M265" display="inline"><mml:mrow><mml:mi>Q</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="script">A</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mn mathvariant="normal">2000</mml:mn></mml:mrow></mml:math></inline-formula> mm yr<inline-formula><mml:math id="M266" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> and <inline-formula><mml:math id="M267" display="inline"><mml:mrow><mml:mi>Q</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="script">A</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1000</mml:mn></mml:mrow></mml:math></inline-formula> mm yr<inline-formula><mml:math id="M268" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>. Catchment <inline-formula><mml:math id="M269" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="script">A</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> is ungauged. In order to fulfill water balance constraints of the areal model from Eq. (<xref ref-type="disp-formula" rid="Ch1.E11"/>), imposed by the likelihood in Eq. (<xref ref-type="disp-formula" rid="Ch1.E7"/>), the predicted mean annual runoff in catchment <inline-formula><mml:math id="M270" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="script">A</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> must be around 2667 mm yr<inline-formula><mml:math id="M271" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> if we assume a low observation uncertainty.</p></caption>
            <?xmltex \igopts{width=199.169291pt}?><graphic xlink:href="https://hess.copernicus.org/articles/24/4109/2020/hess-24-4109-2020-f06.png"/>

          </fig>

</sec>
<sec id="Ch1.S4.SS2.SSS2">
  <label>4.2.2</label><title>The water balance constraints of the areal model</title>
      <p id="d1e6263">The second property we highlight only holds for the areal model and is related to its mass-conserving properties and its ability to do more than smoothing: runoff is in Eq. (<xref ref-type="disp-formula" rid="Ch1.E11"/>) defined as a weighted sum of point runoff. Through Eq. (<xref ref-type="disp-formula" rid="Ch1.E11"/>) and the likelihood defined in Eq. (<xref ref-type="disp-formula" rid="Ch1.E7"/>), a (soft) constraint is put on the predicted annual runoff for the catchments for which we have observations. This also has the consequence that the suggested model allows us to predict values that are larger than any of the observed values in the area of interest. As a conceptual example, consider the river network in Fig. <xref ref-type="fig" rid="Ch1.F6"/>, where each black node represents one areal unit in the discretization of the catchments. The observed runoff is 1000 mm yr<inline-formula><mml:math id="M272" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> in the subcatchment and 2000 mm yr<inline-formula><mml:math id="M273" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> in the surrounding larger catchment. That means that the constraints imposed by the observation likelihood and Eq. (<xref ref-type="disp-formula" rid="Ch1.E11"/>) are the following:
              <disp-formula id="Ch1.E13" content-type="numbered"><label>13</label><mml:math id="M274" display="block"><mml:mtable columnspacing="1em" rowspacing="0.2ex" class="split" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mn mathvariant="normal">1000</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mtext>mm yr</mml:mtext><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:mo>(</mml:mo><mml:mi>q</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:mi>q</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>)</mml:mo><mml:mo>)</mml:mo><mml:mo>/</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mo>+</mml:mo><mml:mtext>uncertainty</mml:mtext></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mn mathvariant="normal">2000</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mtext>mm yr</mml:mtext><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:mo>(</mml:mo><mml:mi>q</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:mi>q</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:mi>q</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:mi>q</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mn mathvariant="normal">4</mml:mn></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mspace linebreak="nobreak" width="1em"/><mml:mo>+</mml:mo><mml:mi>q</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mn mathvariant="normal">5</mml:mn></mml:msub><mml:mo>)</mml:mo><mml:mo>)</mml:mo><mml:mo>/</mml:mo><mml:mn mathvariant="normal">5</mml:mn><mml:mo>+</mml:mo><mml:mtext>uncertainty</mml:mtext><mml:mo>.</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula></p>
      <p id="d1e6462">As described in Sect. <xref ref-type="sec" rid="Ch1.S4.SS1.SSS3"/> we use a quite strict prior on the observation uncertainty. This is done to try to force the above soft constraints to be stronger. By solving this system of equations, it can be shown that the predicted value in the ungauged catchment <inline-formula><mml:math id="M275" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="script">A</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> must be

                  <disp-formula specific-use="align"><mml:math id="M276" display="block"><mml:mtable displaystyle="true"><mml:mtr><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="script">A</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mi>q</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:mi>q</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mn mathvariant="normal">4</mml:mn></mml:msub><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:mi>q</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mn mathvariant="normal">5</mml:mn></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mn mathvariant="normal">3</mml:mn></mml:mfrac></mml:mstyle><mml:mo>+</mml:mo><mml:mtext>uncertainty</mml:mtext></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mstyle class="stylechange" displaystyle="true"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mo>=</mml:mo><mml:mn mathvariant="normal">2667</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mtext>mm yr</mml:mtext><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mo>+</mml:mo><mml:mtext>uncertainty</mml:mtext><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>

              where the above uncertainty term is determined by the observation uncertainties for a fixed set of ranges and marginal variances. Hence, as long as the observation uncertainties are sufficiently low, the predicted runoff in the unobserved area <inline-formula><mml:math id="M277" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="script">A</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> in Fig. <xref ref-type="fig" rid="Ch1.F6"/> is close to 2667 mm yr<inline-formula><mml:math id="M278" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> which is larger than any of the observed values. This example illustrates that the areal model is able to go beyond smoothing through its runoff constraints. This makes the model different from many other interpolation methods that only rely on spatial smoothing. For the above example, such methods would typically produce a prediction between 1000 and 2000 mm yr<inline-formula><mml:math id="M279" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>.</p>
      <p id="d1e6617">The full areal model is of course slightly more complicated than the simple example above, as prior distributions, covariance calculations and spatial ranges must be taken into account. However, the simple example illustrates the general idea of how the observation likelihood interprets the areal observations and constrains runoff. That the full areal model actually is able to conserve mass in practice is demonstrated for a real case example from Norway in <xref ref-type="bibr" rid="bib1.bibx37" id="text.42"/>.</p>
      <p id="d1e6623">The constraints in Eq. (<xref ref-type="disp-formula" rid="Ch1.E13"/>) also show how the areal model ensures consistent predictions over nested catchments: as the predicted runoff in the main catchment <inline-formula><mml:math id="M280" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="script">A</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> can be expressed as a weighted sum of the predicted runoff in all its subcatchments depending on catchment areas, i.e. as <inline-formula><mml:math id="M281" display="inline"><mml:mrow><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="script">A</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">2</mml:mn><mml:mn mathvariant="normal">5</mml:mn></mml:mfrac></mml:mstyle><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="script">A</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">3</mml:mn><mml:mn mathvariant="normal">5</mml:mn></mml:mfrac></mml:mstyle><mml:mover accent="true"><mml:mi>Q</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="script">A</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, the water balance can not be violated for the predicted runoff for any of the catchments in Fig. <xref ref-type="fig" rid="Ch1.F6"/>. This means that the equations in Eq. (<xref ref-type="disp-formula" rid="Ch1.E13"/>) correspond to water balance constraints.</p>
      <?pagebreak page4120?><p id="d1e6709">Compared to top-kriging, both top-kriging and the proposed method assume that the underlying variable is linearly aggregated and mass-conserved when performing covariance calculations. Top-kriging is also able to predict larger values than any of the observed values by allowing negative kriging weights. However, top-kriging does not use constraints to ensure that the mass balance is preserved over nested catchments, as in the above example. Consequently, the top-kriging predictions can more easily violate the water balance, which can have both benefits and drawbacks depending on the target variable and the problem we are trying to solve. Another hydrological model that uses water balance constraints, not unlike the proposed method, is the kriging approach in <xref ref-type="bibr" rid="bib1.bibx40" id="text.43"/>, where mass balance constraints are introduced as additional constraints in the kriging system of equations.</p>
</sec>
</sec>
<sec id="Ch1.S4.SS3">
  <label>4.3</label><title>Inference</title>
      <p id="d1e6724">In order to make the framework described in Sect. <xref ref-type="sec" rid="Ch1.S4"/> computationally feasible, some simplifications of the suggested models are necessary. In general, statistical inference on models including GRFs is slow when the number of target locations is large because matrix operations on dense covariance matrices are required. The computational complexity is particularly large for the areal model, because each grid node in the discretization of the catchments can be regarded as a new target location, and because it includes soft constraints. To solve the computational issues for the centroid and areal models, we utilize the fact that a GRF with a Matérn covariance function can be expressed as the solution of a specific SPDE <xref ref-type="bibr" rid="bib1.bibx29" id="paren.44"/>. This SPDE can be solved by using the finite-element method (see e.g. <xref ref-type="bibr" rid="bib1.bibx5" id="altparen.45"/>), and the result is a Gaussian Markov random field (GMRF). Working with GMRFs is convenient because GMRFs have precision matrices (inverse covariance matrices) that typically are sparse with many zero elements, and efficient algorithms are available for sparse matrix operations (see e.g <xref ref-type="bibr" rid="bib1.bibx38" id="altparen.46"/>). In this work, both GRFs <inline-formula><mml:math id="M282" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M283" display="inline"><mml:mrow><mml:mi>c</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> are approximated by GMRFs.</p>
      <p id="d1e6770">Another challenge with the suggested models is that we suggest Bayesian models that include a large number of parameters for which the marginal distributions must be estimated. Traditionally, Bayesian inference is done by using Markov chain Monte Carlo (MCMC) methods, but inference can be slow when the dimension of the problem is large <xref ref-type="bibr" rid="bib1.bibx17" id="paren.47"/>. These challenges are met by modelling runoff as a latent Gaussian model (LGM). That is, the latent part <inline-formula><mml:math id="M284" display="inline"><mml:mi mathvariant="bold-italic">x</mml:mi></mml:math></inline-formula> of the hierarchical model in Sect. <xref ref-type="sec" rid="Ch1.S4.SS1.SSS5"/> consists of only Gaussian distributions. More specifically, the prior distributions for <inline-formula><mml:math id="M285" display="inline"><mml:mrow><mml:mi>c</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M286" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> are modelled as GRFs, and the prior distributions for <inline-formula><mml:math id="M287" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M288" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi mathvariant="normal">c</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> are Gaussian given the model parameters (see the equations in Eq. <xref ref-type="disp-formula" rid="Ch1.E4"/>). This is convenient, because it allows us to use integrated nested Laplace approximations (INLAs) to make inference and predictions. INLA is a tool for making Bayesian inference for LGMs <xref ref-type="bibr" rid="bib1.bibx39" id="paren.48"/> and represents a fast and approximate alternative to MCMC algorithms. The INLA approach is based on approximating the Bayesian marginal distributions by using Laplace or other analytic approximations, and on numerical integration schemes. The main computational tool is the sparse matrix calculations described in <xref ref-type="bibr" rid="bib1.bibx38" id="text.49"/>, such that in order to work quickly, the latent field of the LGM should be a GMRF with a sparse precision matrix. This requirement is fulfilled through the SPDE approach as already outlined.</p>
      <p id="d1e6847">INLA in general provides approximations of very high accuracy for most models <xref ref-type="bibr" rid="bib1.bibx39 bib1.bibx30 bib1.bibx8 bib1.bibx24" id="paren.50"/> but has faced problems for some (more extreme) models with binomial or Poisson data <xref ref-type="bibr" rid="bib1.bibx14 bib1.bibx12" id="paren.51"/>. For Gaussian likelihoods, however, INLA is exact up to numerical integration error. As we use Gaussian likelihoods in this work, we can thus expect INLA to give reliable approximations. The SPDE approach also provides accurate approximations <xref ref-type="bibr" rid="bib1.bibx29 bib1.bibx24" id="paren.52"/>, but it is important that the mesh involved in the finite-element method computations is sufficiently dense relative to the spatial variability and range in the study area.</p>
      <p id="d1e6859">Because of the high computational speed and accuracy, the INLA and SPDE framework has become quite common to use within different fields of science. See for example <xref ref-type="bibr" rid="bib1.bibx27" id="text.53"/>, <xref ref-type="bibr" rid="bib1.bibx34" id="text.54"/>, <xref ref-type="bibr" rid="bib1.bibx51" id="text.55"/>, <xref ref-type="bibr" rid="bib1.bibx21" id="text.56"/>, <xref ref-type="bibr" rid="bib1.bibx25" id="text.57"/>, and <xref ref-type="bibr" rid="bib1.bibx2" id="text.58"/>. We refer to the R package <monospace>r-inla</monospace> for a user-friendly interface for applying INLA and the SPDE approach to spatial modelling. In particular, <xref ref-type="bibr" rid="bib1.bibx33" id="text.59"/> is recommended for a description of how a model with (catchment) areal data can be implemented in <monospace>r-inla</monospace>. Furthermore, we have made code for the centroid model available on <uri>http://www.github.com/tjroksva/runoffinterpolation</uri>  (last access: 29 January 2020; <ext-link xlink:href="https://doi.org/10.5281/zenodo.3630348" ext-link-type="DOI">10.5281/zenodo.3630348</ext-link>) with example data from the catchments in Fig. <xref ref-type="fig" rid="Ch1.F1"/>a.</p>
</sec>
</sec>
<sec id="Ch1.S5">
  <label>5</label><title>Model evaluation</title>
      <p id="d1e6909">The main objectives of this article are to (1) evaluate the new framework's ability to fill in missing annual runoff observations and to (2) predict mean annual runoff for catchments with varying record lengths. By this we also want to (3) demonstrate the potential added value of including short runoff records in the modelling compared to when not using them. In this section we present the experimental set-up and the evaluation criteria used to address our research questions.</p>
<?pagebreak page4121?><sec id="Ch1.S5.SS1">
  <label>5.1</label><title>Experimental set-up for infill of missing annual observations (1996–2005)</title>
      <p id="d1e6919">To assess the framework's ability to fill in missing values of annual runoff, we do interpolation of runoff for the 10 hydrological years 1996–2005 for the 180 fully gauged catchments shown in Fig. <xref ref-type="fig" rid="Ch1.F1"/>a. This is done both for series of annual runoff, and for the annual series of monthly runoff for January, April and June described in Sect. <xref ref-type="sec" rid="Ch1.S2"/>.</p>
      <p id="d1e6926">The annual time series of monthly runoff are included in the analysis in order to demonstrate the framework’s properties for hydrological variables or areas that are driven by more unstable hydrological processes. For the annual series of monthly runoff, the models from Sect. <xref ref-type="sec" rid="Ch1.S4"/> are specified as before: considering predictions for January, <inline-formula><mml:math id="M289" display="inline"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="script">A</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> in Eq. (<xref ref-type="disp-formula" rid="Ch1.E5"/>) represents the true runoff in January for catchment <inline-formula><mml:math id="M290" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="script">A</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, year <inline-formula><mml:math id="M291" display="inline"><mml:mi>j</mml:mi></mml:math></inline-formula>, such that the GRF <inline-formula><mml:math id="M292" display="inline"><mml:mrow><mml:mi>c</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> represents the long-term spatial variability in January. Likewise, the GRF <inline-formula><mml:math id="M293" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> represents the annual discrepancy from the climate in January, and <inline-formula><mml:math id="M294" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> is the observed runoff in January for catchment <inline-formula><mml:math id="M295" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="script">A</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> year <inline-formula><mml:math id="M296" display="inline"><mml:mi>j</mml:mi></mml:math></inline-formula>. The models for June and April are specified similarly, and for simplicity we use the same prior distributions for all experiments.</p>
      <p id="d1e7035">In our assessment of the framework's predictive performance for infill of missing annual observations, the three following methods are compared.</p>
      <p id="d1e7038"><list list-type="bullet">
            <list-item>

      <p id="d1e7043"><italic>Top-kriging.</italic> Spatial interpolation with top-kriging. For top-kriging each year (1996–2005) is interpolated independently of other years. Short records on an annual (or monthly) scale do not have an impact on years without data. The default covariance function (or variogram) in the R package <monospace>rtop</monospace> was fitted as this gave the most accurate results. This is a multiplication of a modified exponential and fractal variogram model, the same model as used in <xref ref-type="bibr" rid="bib1.bibx43" id="text.60"/>.</p>
            </list-item>
            <list-item>

      <p id="d1e7057"><italic>Areal model.</italic> Spatial interpolation with the model defined in Sect. <xref ref-type="sec" rid="Ch1.S4"/> with true annual runoff given by the areal model from Eq. (<xref ref-type="disp-formula" rid="Ch1.E11"/>). That is, the annual runoff in a catchment is interpreted as the average point runoff over the catchment area. All years are modelled simultaneously (1996–2005) such that short records of data can influence years without data.</p>
            </list-item>
            <list-item>

      <p id="d1e7069"><italic>Centroid model.</italic> Spatial interpolation with the model defined in Sect. <xref ref-type="sec" rid="Ch1.S4"/> with true annual runoff given by the centroid model from Eq. (<xref ref-type="disp-formula" rid="Ch1.E6"/>). That is, annual runoff is interpreted as a process linked to point locations in space (the catchment centroids), and not to catchment areas. All years are modelled simultaneously (1996–2005) such that short records of data can influence years without data.</p>
            </list-item>
          </list></p>
      <p id="d1e7081">The predictive performance of the three methods is evaluated by cross-validation: the 180 catchments in Norway were divided into 20 groups or folds, each containing nine catchments. In turn each group was left out, and annual or monthly runoff predictions were performed for these so-called target catchments by using observations from the catchments in the other groups. That is, we predict runoff for 1996–2005 for 9 target catchments at once by using data from the remaining 171 fully gauged catchments and repeat the process for all 20 cross-validation folds. To evaluate and compare the three methods described above, we do the following two tests.</p>
      <p id="d1e7084"><list list-type="bullet">
            <list-item>

      <p id="d1e7089"><italic>UG (ungauged).</italic> Assess the methods' ability to fill in missing values for ungauged catchments (denoted UG). That is, the target catchments are treated as totally ungauged, and all their observations are left out of the dataset when the predictions for 1996–2005 are performed.</p>
            </list-item>
            <list-item>

      <p id="d1e7097"><italic>PG (partially gauged).</italic> Assess the methods' ability to fill in missing values for partially gauged catchments (denoted PG). Each of the 9 target catchments in the cross-validation group is allowed to have one annual observation of runoff. That is, a short record of length 1 from the target catchment is included in the observation likelihood in addition to the full data series of runoff from the catchments in the other cross-validation folds. The short record is drawn randomly from the 10 years of observations available for each target catchment. We perform predictions for 9 partially gauged target catchments at once, for all 10 study years (for which 1 of them is observed for each catchment), and repeat the process for all 20 cross-validation folds.</p>
            </list-item>
          </list></p>
      <p id="d1e7104">To make the results comparable, we use the same cross-validation groups for both experiments (UG and PG) and methods (top-kriging, areal model and centroid model) and remove the same set of annual observations for PG across methods. For the PG case, we also compare our models to a method for exploiting short records from the target catchment. The method we choose for comparison is simple linear regression, and we perform linear regression for the PG case as follows.</p>
      <p id="d1e7107"><list list-type="bullet">
            <list-item>

      <p id="d1e7112"><italic>Linear regression.</italic> The closest catchment in terms of catchment centroid is used as a donor catchment, and only catchments outside the target catchment's cross-validation group can be considered. Two annual observations between 1996 and 2005 are randomly drawn from the target catchment, and data from the donor catchment and target catchment are used to fit a linear regression model on the form <inline-formula><mml:math id="M297" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi mathvariant="italic">ϵ</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>. Next, the model is fitted as described in Sect. <xref ref-type="sec" rid="Ch1.S3.SS4"/> and used to predict runoff for the target catchment for 1996–2005 (where two of the years are observed). The reason for using a short record of length 2 instead of 1 is that at least two observations are required to fit a linear regression model with uncertainty. Also note that we have omitted the intercept <inline-formula><mml:math id="M298" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> in the regression model, such that we only have two unknown variables (<inline-formula><mml:math id="M299" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M300" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>).<?pagebreak page4122?> We emphasize that estimating the variables statistically based on only two pairs of observations might not give the most reliable estimates, particularly when considering <inline-formula><mml:math id="M301" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>. However, the linear regression results can provide an intuition about the target variable and how correlated the observation locations are over time. A good performance for linear regression in this study suggests that the spatial pattern for the target variable is very stable and that a fair prediction can be obtained by simply using the ratio of runoff between a target catchment and a donor catchment for any chosen year to develop a linear relationship between the two catchments. Hence, a short record can be very valuable. Motivated by this, linear regression is treated as an indicator for when our geostatistical method can be expected to perform particularly well rather than as a recommended method for record augmentation when having only two observations.</p>
            </list-item>
          </list></p>
</sec>
<sec id="Ch1.S5.SS2">
  <label>5.2</label><title>Experimental set-up for predictions of mean annual runoff (1981–2010)</title>
      <p id="d1e7204">To assess the framework's ability to estimate mean annual runoff, we use annual data from 1981 to 2010 from the 260 catchments in Fig. <xref ref-type="fig" rid="Ch1.F5"/>. Recall that these catchments have at least one observation of mean annual runoff between 1981 and 2010, but only 83 of them are fully gauged. This was different from the experiments described in Sect. <xref ref-type="sec" rid="Ch1.S5.SS1"/>, where all the test catchments were fully gauged before the cross-validation was performed.</p>
      <p id="d1e7211">For this experiment, we again compare the performances of top-kriging and the areal and centroid models. The areal model and the centroid model are fitted for several years of annual runoff simultaneously, as before. As a predictor for the mean annual runoff, we use the posterior distribution of the climatic part of the model. This is given by <inline-formula><mml:math id="M302" display="inline"><mml:mrow><mml:mi>c</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mi mathvariant="script">A</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi mathvariant="normal">c</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> for the centroid model, where <inline-formula><mml:math id="M303" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mi mathvariant="script">A</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the centroid of the catchment <inline-formula><mml:math id="M304" display="inline"><mml:mi mathvariant="script">A</mml:mi></mml:math></inline-formula> of interest. For the areal model it is given by the average <inline-formula><mml:math id="M305" display="inline"><mml:mrow><mml:mi>c</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi mathvariant="normal">c</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> over the grid nodes <inline-formula><mml:math id="M306" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> in the discretization of the target catchment. Note that the climatic part of the model must be re-estimated for each experiment or cross-validation fold.</p>
      <p id="d1e7291">In order to interpolate mean annual runoff by using top-kriging, we have to compute the mean annual runoff based on the annual observations for all catchments before running the analysis. For catchments with less than 30 annual observations we use the average of the 1–29 available observations as an approximation for the mean annual runoff for 1981–2010. Next, the mean annual runoff is interpolated by using top-kriging where the uncertainty of the observations is specified as a function of record length. This is the suggested approach from <xref ref-type="bibr" rid="bib1.bibx43" id="text.61"/> for including short records in the top-kriging framework. We set the observation variance for a catchment with record length <inline-formula><mml:math id="M307" display="inline"><mml:mi>m</mml:mi></mml:math></inline-formula> to <inline-formula><mml:math id="M308" display="inline"><mml:mrow><mml:msup><mml:mover accent="true"><mml:mi mathvariant="italic">σ</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>/</mml:mo><mml:mi>m</mml:mi></mml:mrow></mml:math></inline-formula>, where <inline-formula><mml:math id="M309" display="inline"><mml:mover accent="true"><mml:mi mathvariant="italic">σ</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover></mml:math></inline-formula> is the average empirical standard deviation for the observed annual runoff taken over the 83 fully gauged catchments in our dataset, in this case <inline-formula><mml:math id="M310" display="inline"><mml:mrow><mml:mover accent="true"><mml:mi mathvariant="italic">σ</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mo>=</mml:mo><mml:mn mathvariant="normal">336</mml:mn></mml:mrow></mml:math></inline-formula> mm yr<inline-formula><mml:math id="M311" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>. For the top-kriging experiments, we fit the same covariance model as in Sect. <xref ref-type="sec" rid="Ch1.S5.SS1"/>.</p>
      <p id="d1e7362">The areal and centroid models and top-kriging are again evaluated by cross-validation. The 83 fully gauged catchments from Fig. <xref ref-type="fig" rid="Ch1.F5"/> were divided into four folds containing 20, 20, 20 and 23 catchments respectively, and in turn observations from each fold were removed and predicted. This was done for varying record lengths for the target catchments, more specifically when 0, 1, 3, 5 or 10 randomly drawn annual observations from the target catchments were included in the likelihood. We denote these settings UG, PG1, PG3, PG5 and PG10. Note that while we only are able to assess the predictive performance for the 83 fully gauged catchments in Fig. <xref ref-type="fig" rid="Ch1.F5"/>, data from the remaining 177 partially gauged catchments in Fig. <xref ref-type="fig" rid="Ch1.F5"/> are used in the observation sample. This is in addition to the data from the fully gauged catchments from the other folds.</p>
</sec>
<sec id="Ch1.S5.SS3">
  <label>5.3</label><title>Evaluation scores</title>
      <p id="d1e7379">To evaluate the predictions we use the root mean squared error (RMSE) and the continuous rank probability score (CRPS). Having <inline-formula><mml:math id="M312" display="inline"><mml:mi>m</mml:mi></mml:math></inline-formula> pairs of observations and predictions, the RMSE is computed as
            <disp-formula id="Ch1.Ex5"><mml:math id="M313" display="block"><mml:mrow><mml:mi mathvariant="normal">RMSE</mml:mi><mml:mo>=</mml:mo><mml:msqrt><mml:mrow><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mn mathvariant="normal">1</mml:mn><mml:mi>m</mml:mi></mml:mfrac></mml:mstyle><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>m</mml:mi></mml:munderover><mml:mo>(</mml:mo><mml:msubsup><mml:mi>y</mml:mi><mml:mi>j</mml:mi><mml:mo>*</mml:mo></mml:msubsup><mml:mo>-</mml:mo><mml:msubsup><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mi>j</mml:mi><mml:mo>*</mml:mo></mml:msubsup><mml:msup><mml:mo>)</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:msqrt><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
          where <inline-formula><mml:math id="M314" display="inline"><mml:mrow><mml:msubsup><mml:mi>y</mml:mi><mml:mi>j</mml:mi><mml:mo>*</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> is the observed value and <inline-formula><mml:math id="M315" display="inline"><mml:mrow><mml:msubsup><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mi>j</mml:mi><mml:mo>*</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> is the corresponding predicted value. In our analysis, the posterior mean is used as the predicted value for the areal and centroid models.</p>
      <p id="d1e7476">The CRPS is defined as
            <disp-formula id="Ch1.Ex6"><mml:math id="M316" display="block"><mml:mrow><mml:mi mathvariant="normal">CRPS</mml:mi><mml:mo>(</mml:mo><mml:mi>F</mml:mi><mml:mo>,</mml:mo><mml:msup><mml:mi>y</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:munderover><mml:mo movablelimits="false">∫</mml:mo><mml:mrow><mml:mo>-</mml:mo><mml:mi mathvariant="normal">∞</mml:mi></mml:mrow><mml:mi mathvariant="normal">∞</mml:mi></mml:munderover><mml:mo>(</mml:mo><mml:mi>F</mml:mi><mml:mo>(</mml:mo><mml:mi>s</mml:mi><mml:mo>)</mml:mo><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo mathvariant="italic">{</mml:mo><mml:msup><mml:mi>y</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>≤</mml:mo><mml:mi>s</mml:mi><mml:mo mathvariant="italic">}</mml:mo><mml:msup><mml:mo>)</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mi mathvariant="normal">d</mml:mi><mml:mi>s</mml:mi><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
          where <inline-formula><mml:math id="M317" display="inline"><mml:mrow><mml:mi>F</mml:mi><mml:mo>(</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is the predictive cumulative distribution and <inline-formula><mml:math id="M318" display="inline"><mml:mrow><mml:msup><mml:mi>y</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> is the actual observation <xref ref-type="bibr" rid="bib1.bibx18" id="paren.62"/>. For the methods we test (areal, centroid, top-kriging and linear regression), <inline-formula><mml:math id="M319" display="inline"><mml:mrow><mml:mi>F</mml:mi><mml:mo>(</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is a Gaussian distribution with mean equal to the predicted value and standard deviation equal to the standard deviation of the prediction.</p>
      <p id="d1e7590">For the experiments related to infill of individual years, the CRPS and RMSE are first computed for each of the 180 catchments in the dataset based on 10 pairs of predictions and observations. The average RMSE and CRPS over all catchments are used as a summary scores. For the experiments related to predictions of mean annual runoff, there is only one (mean annual) prediction for each catchment, and the RMSE<?pagebreak page4123?> and CRPS over all catchments are reported. Both the CRPS and the RMSE are negatively oriented such that low scores mean better predictions.</p>
      <p id="d1e7593">To be able to compare the RMSE and CRPS across methods, we use a paired Wilcoxon signed-rank test <xref ref-type="bibr" rid="bib1.bibx41" id="paren.63"/>. This is a non-parametric test that does not require normal distributed data. The null hypothesis of the test is that the median difference between pairs of data (in this case pairs of RMSE or CRPS values) follows a symmetric distribution around zero. The alternative hypothesis is that the difference between the data pairs does not follow a symmetric distribution around zero. If the null hypothesis is rejected, it indicates that one of the methods gives a significantly smaller RMSE or CRPS than another method.</p>
      <p id="d1e7600">In addition to the RMSE and the CRPS, we report the 95 % coverage of the experiments. The 95 % coverage is computed by calculating the amount of the actually observed runoff values that are within the corresponding 95 % posterior prediction intervals. Here, we make posterior prediction intervals for top-kriging and linear regression by assuming that the predictions are Gaussian. A 95 % coverage close to 0.95 is optimal and indicates that the model provides an accurate representation of the underlying uncertainty.</p>
      <p id="d1e7603">We also want to compare our mean annual runoff results with other studies of mean annual runoff, more specifically the studies collected in <xref ref-type="bibr" rid="bib1.bibx4" id="text.64"/>. In <xref ref-type="bibr" rid="bib1.bibx4" id="text.65"/>, the absolute normalized error (ANE) and the squared correlation coefficient (<inline-formula><mml:math id="M320" display="inline"><mml:mrow><mml:msup><mml:mi>r</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>) are used as evaluation scores. The ANE is computed as
            <disp-formula id="Ch1.E14" content-type="numbered"><label>14</label><mml:math id="M321" display="block"><mml:mrow><mml:mtext>ANE</mml:mtext><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mo>|</mml:mo><mml:msup><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mo>*</mml:mo></mml:msup><mml:mo>-</mml:mo><mml:msup><mml:mi>y</mml:mi><mml:mo>*</mml:mo></mml:msup><mml:mo>|</mml:mo></mml:mrow><mml:mrow><mml:msup><mml:mi>y</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
          where <inline-formula><mml:math id="M322" display="inline"><mml:mrow><mml:msup><mml:mi>y</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M323" display="inline"><mml:mrow><mml:msup><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> are the observed and predicted value as before. The ANE divides the absolute difference between the actual observation <inline-formula><mml:math id="M324" display="inline"><mml:mrow><mml:msup><mml:mi>y</mml:mi><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> and corresponding prediction <inline-formula><mml:math id="M325" display="inline"><mml:mrow><mml:msup><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mo>*</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> by the observed runoff and is therefore scale independent. An ANE close to zero corresponds to an accurate prediction.</p>
      <p id="d1e7714">Finally, the squared correlation coefficient between <inline-formula><mml:math id="M326" display="inline"><mml:mi>m</mml:mi></mml:math></inline-formula> pairs of observations and predictions is computed as
            <disp-formula id="Ch1.E15" content-type="numbered"><label>15</label><mml:math id="M327" display="block"><mml:mrow><mml:msup><mml:mi>r</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>=</mml:mo><mml:mo>(</mml:mo><mml:mtext>Cor</mml:mtext><mml:mo mathvariant="italic">{</mml:mo><mml:mo>(</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>m</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:mo>,</mml:mo><mml:mo>(</mml:mo><mml:msubsup><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mn mathvariant="normal">1</mml:mn><mml:mo>*</mml:mo></mml:msubsup><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:msubsup><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mi>m</mml:mi><mml:mo>*</mml:mo></mml:msubsup><mml:mo>)</mml:mo><mml:mo mathvariant="italic">}</mml:mo><mml:msup><mml:mo>)</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
          where Cor(<inline-formula><mml:math id="M328" display="inline"><mml:mrow><mml:mo>⋅</mml:mo><mml:mo>,</mml:mo><mml:mo>⋅</mml:mo></mml:mrow></mml:math></inline-formula>) denotes the Pearson correlation. An <inline-formula><mml:math id="M329" display="inline"><mml:mrow><mml:msup><mml:mi>r</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> close to 1 indicates a high correlation between the predicted and observed values.</p>
</sec>
</sec>
<sec id="Ch1.S6">
  <label>6</label><title>Results</title>
<sec id="Ch1.S6.SS1">
  <label>6.1</label><title>Predictions for individual years (1996–2005)</title>
      <p id="d1e7845">We now present the results related to the framework's ability to predict runoff for individual, missing years for the annual time series of annual and monthly runoff for a 10-year period (1996–2005). First, we present the results for the ungauged catchments (UG), before we proceed to the partially gauged catchments (PG) that have short records of length 1.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F7" specific-use="star"><?xmltex \currentcnt{7}?><label>Figure 7</label><caption><p id="d1e7850">Average posterior mean for <inline-formula><mml:math id="M330" display="inline"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="script">A</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, average posterior standard deviation for <inline-formula><mml:math id="M331" display="inline"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="script">A</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> and average <inline-formula><mml:math id="M332" display="inline"><mml:mi mathvariant="normal">RMSE</mml:mi></mml:math></inline-formula> for each catchment for predictions of missing annual observations in southern Norway for <inline-formula><mml:math id="M333" display="inline"><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula> for the areal model (A, left), the centroid model (C, middle) and top-kriging (TK, right) when the target catchments are treated as ungauged (UG). The observed mean annual runoff is also included as a reference (first plot).</p></caption>
          <?xmltex \igopts{width=369.885827pt}?><graphic xlink:href="https://hess.copernicus.org/articles/24/4109/2020/hess-24-4109-2020-f07.png"/>

        </fig>

<sec id="Ch1.S6.SS1.SSS1">
  <label>6.1.1</label><title>Infill for ungauged catchments (UG)</title>
      <p id="d1e7927">For the ungauged case (UG), the target catchments are treated as totally ungauged for the 10 study years 1996–2005, and missing values are predicted both for annual and monthly runoff. In Fig. <xref ref-type="fig" rid="Ch1.F7"/> the resulting average predicted annual runoff in southern Norway is presented for top-kriging, the areal model and the centroid model. The three methods give similar results for the posterior mean, and all are able to reproduce the true spatial pattern of annual runoff. Furthermore, the RMSE plots in Fig. <xref ref-type="fig" rid="Ch1.F7"/> show that the three methods succeed and fail for many of the same catchments. Here, we should keep in mind that the RMSE is scale dependent and might not give the best impression of the relative performance across the study area. However, we note that many of the catchments with high RMSE values typically are small catchments located in western Norway. We will come back to these catchments in Sect. <xref ref-type="sec" rid="Ch1.S6.SS1.SSS2"/> to see how the predictions here were affected when including a short record.</p>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T1" specific-use="star"><?xmltex \currentcnt{1}?><label>Table 1</label><caption><p id="d1e7939">Predictive performance for predictions of missing annual values in ungauged catchments (UG) and partially gauged catchments (PG) for the areal model, centroid model, top-kriging (TK) and simple linear regression (LR). The best performance in each row is marked in bold. The RMSE and CRPS were compared across methods by using a one-sided paired Wilcoxon signed-rank test for assessing the significance of the results. Results that were significantly better than other results are marked with stars. </p></caption><oasis:table frame="topbot"><oasis:tgroup cols="17">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:colspec colnum="6" colname="col6" align="right"/>
     <oasis:colspec colnum="7" colname="col7" align="left"/>
     <oasis:colspec colnum="8" colname="col8" align="right"/>
     <oasis:colspec colnum="9" colname="col9" align="right"/>
     <oasis:colspec colnum="10" colname="col10" align="right"/>
     <oasis:colspec colnum="11" colname="col11" align="right"/>
     <oasis:colspec colnum="12" colname="col12" align="left"/>
     <oasis:colspec colnum="13" colname="col13" align="right"/>
     <oasis:colspec colnum="14" colname="col14" align="right"/>
     <oasis:colspec colnum="15" colname="col15" align="right"/>
     <oasis:colspec colnum="16" colname="col16" align="right"/>
     <oasis:colspec colnum="17" colname="col17" align="left"/>
     <oasis:thead>
       <oasis:row>
         <oasis:entry namest="col1" nameend="col2" align="center"/>
         <oasis:entry rowsep="1" namest="col3" nameend="col6" align="center">RMSE (mm yr<inline-formula><mml:math id="M336" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>) </oasis:entry>
         <oasis:entry colname="col7"/>
         <oasis:entry rowsep="1" namest="col8" nameend="col11" align="center">CRPS (mm yr<inline-formula><mml:math id="M337" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>) </oasis:entry>
         <oasis:entry colname="col12"/>
         <oasis:entry rowsep="1" namest="col13" nameend="col16" align="center">Coverage 95 % </oasis:entry>
         <oasis:entry colname="col17"/>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Case</oasis:entry>
         <oasis:entry colname="col2">Dataset</oasis:entry>
         <oasis:entry colname="col3">Areal</oasis:entry>
         <oasis:entry colname="col4">Centr.</oasis:entry>
         <oasis:entry colname="col5">TK</oasis:entry>
         <oasis:entry colname="col6">LR</oasis:entry>
         <oasis:entry colname="col7"/>
         <oasis:entry colname="col8">Areal</oasis:entry>
         <oasis:entry colname="col9">Centr.</oasis:entry>
         <oasis:entry colname="col10">TK</oasis:entry>
         <oasis:entry colname="col11">LR</oasis:entry>
         <oasis:entry colname="col12"/>
         <oasis:entry colname="col13">Areal</oasis:entry>
         <oasis:entry colname="col14">Centr.</oasis:entry>
         <oasis:entry colname="col15">TK</oasis:entry>
         <oasis:entry colname="col16">LR</oasis:entry>
         <oasis:entry colname="col17"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">UG</oasis:entry>
         <oasis:entry colname="col2">Annual</oasis:entry>
         <oasis:entry colname="col3">337</oasis:entry>
         <oasis:entry colname="col4">343</oasis:entry>
         <oasis:entry colname="col5"><bold>310</bold><inline-formula><mml:math id="M338" display="inline"><mml:msup><mml:mi/><mml:mo>*</mml:mo></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col6">–</oasis:entry>
         <oasis:entry colname="col7"/>
         <oasis:entry colname="col8">242</oasis:entry>
         <oasis:entry colname="col9">249</oasis:entry>
         <oasis:entry colname="col10"><bold>225</bold><inline-formula><mml:math id="M339" display="inline"><mml:msup><mml:mi/><mml:mo>*</mml:mo></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col11">–</oasis:entry>
         <oasis:entry colname="col12"/>
         <oasis:entry colname="col13">0.92</oasis:entry>
         <oasis:entry colname="col14">0.91</oasis:entry>
         <oasis:entry colname="col15"><bold>0.94</bold></oasis:entry>
         <oasis:entry colname="col16">–</oasis:entry>
         <oasis:entry colname="col17"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">UG</oasis:entry>
         <oasis:entry colname="col2">January</oasis:entry>
         <oasis:entry colname="col3">39</oasis:entry>
         <oasis:entry colname="col4">37</oasis:entry>
         <oasis:entry colname="col5"><bold>36</bold><inline-formula><mml:math id="M340" display="inline"><mml:msup><mml:mi/><mml:mo>*</mml:mo></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col6">–</oasis:entry>
         <oasis:entry colname="col7"/>
         <oasis:entry colname="col8">26</oasis:entry>
         <oasis:entry colname="col9">25</oasis:entry>
         <oasis:entry colname="col10"><bold>24</bold><inline-formula><mml:math id="M341" display="inline"><mml:msup><mml:mi/><mml:mo>*</mml:mo></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col11">–</oasis:entry>
         <oasis:entry colname="col12"/>
         <oasis:entry colname="col13">0.92</oasis:entry>
         <oasis:entry colname="col14">0.89</oasis:entry>
         <oasis:entry colname="col15"><bold>0.93</bold></oasis:entry>
         <oasis:entry colname="col16">–</oasis:entry>
         <oasis:entry colname="col17"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">UG</oasis:entry>
         <oasis:entry colname="col2">April</oasis:entry>
         <oasis:entry colname="col3">38</oasis:entry>
         <oasis:entry colname="col4">38</oasis:entry>
         <oasis:entry colname="col5"><bold>37</bold></oasis:entry>
         <oasis:entry colname="col6">–</oasis:entry>
         <oasis:entry colname="col7"/>
         <oasis:entry colname="col8">25</oasis:entry>
         <oasis:entry colname="col9">25</oasis:entry>
         <oasis:entry colname="col10"><bold>24</bold></oasis:entry>
         <oasis:entry colname="col11">–</oasis:entry>
         <oasis:entry colname="col12"/>
         <oasis:entry colname="col13">0.89</oasis:entry>
         <oasis:entry colname="col14">0.85</oasis:entry>
         <oasis:entry colname="col15"><bold>0.93</bold></oasis:entry>
         <oasis:entry colname="col16">–</oasis:entry>
         <oasis:entry colname="col17"/>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">UG</oasis:entry>
         <oasis:entry colname="col2">June</oasis:entry>
         <oasis:entry colname="col3">87</oasis:entry>
         <oasis:entry colname="col4">96</oasis:entry>
         <oasis:entry colname="col5"><bold>82</bold> *</oasis:entry>
         <oasis:entry colname="col6">–</oasis:entry>
         <oasis:entry colname="col7"/>
         <oasis:entry colname="col8">59</oasis:entry>
         <oasis:entry colname="col9">67</oasis:entry>
         <oasis:entry colname="col10"><bold>56</bold><inline-formula><mml:math id="M342" display="inline"><mml:msup><mml:mi/><mml:mo>*</mml:mo></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col11">–</oasis:entry>
         <oasis:entry colname="col12"/>
         <oasis:entry colname="col13"><bold>0.91</bold></oasis:entry>
         <oasis:entry colname="col14">0.84</oasis:entry>
         <oasis:entry colname="col15"><bold>0.91</bold></oasis:entry>
         <oasis:entry colname="col16">–</oasis:entry>
         <oasis:entry colname="col17"/>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">PG</oasis:entry>
         <oasis:entry colname="col2">Annual</oasis:entry>
         <oasis:entry colname="col3"><bold>171</bold><inline-formula><mml:math id="M343" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>*</mml:mo><mml:mo>*</mml:mo></mml:mrow></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4">184<inline-formula><mml:math id="M344" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>*</mml:mo><mml:mo>*</mml:mo></mml:mrow></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5">290</oasis:entry>
         <oasis:entry colname="col6">178<inline-formula><mml:math id="M345" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>*</mml:mo><mml:mo>*</mml:mo></mml:mrow></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col7"/>
         <oasis:entry colname="col8"><bold>105</bold><inline-formula><mml:math id="M346" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>*</mml:mo><mml:mo>*</mml:mo></mml:mrow></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col9">113<inline-formula><mml:math id="M347" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>*</mml:mo><mml:mo>*</mml:mo></mml:mrow></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col10">201</oasis:entry>
         <oasis:entry colname="col11">240</oasis:entry>
         <oasis:entry colname="col12"/>
         <oasis:entry colname="col13"><bold>0.95</bold></oasis:entry>
         <oasis:entry colname="col14">0.94</oasis:entry>
         <oasis:entry colname="col15"><bold>0.95</bold></oasis:entry>
         <oasis:entry colname="col16">0.96</oasis:entry>
         <oasis:entry colname="col17"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">PG</oasis:entry>
         <oasis:entry colname="col2">January</oasis:entry>
         <oasis:entry colname="col3"><bold>30</bold><inline-formula><mml:math id="M348" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>*</mml:mo><mml:mo>*</mml:mo></mml:mrow></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4"><bold>30</bold><inline-formula><mml:math id="M349" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>*</mml:mo><mml:mo>*</mml:mo></mml:mrow></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5">33</oasis:entry>
         <oasis:entry colname="col6">61</oasis:entry>
         <oasis:entry colname="col7"/>
         <oasis:entry colname="col8"><bold>19</bold><inline-formula><mml:math id="M350" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>*</mml:mo><mml:mo>*</mml:mo></mml:mrow></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col9">20<inline-formula><mml:math id="M351" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>*</mml:mo><mml:mo>*</mml:mo></mml:mrow></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col10">21</oasis:entry>
         <oasis:entry colname="col11">88</oasis:entry>
         <oasis:entry colname="col12"/>
         <oasis:entry colname="col13">0.91</oasis:entry>
         <oasis:entry colname="col14">0.89</oasis:entry>
         <oasis:entry colname="col15">0.91</oasis:entry>
         <oasis:entry colname="col16"><bold>0.95</bold></oasis:entry>
         <oasis:entry colname="col17"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">PG</oasis:entry>
         <oasis:entry colname="col2">April</oasis:entry>
         <oasis:entry colname="col3"><bold>31</bold><inline-formula><mml:math id="M352" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>*</mml:mo><mml:mo>*</mml:mo></mml:mrow></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4">33<inline-formula><mml:math id="M353" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>*</mml:mo><mml:mo>*</mml:mo></mml:mrow></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5">35</oasis:entry>
         <oasis:entry colname="col6">50</oasis:entry>
         <oasis:entry colname="col7"/>
         <oasis:entry colname="col8"><bold>20</bold><inline-formula><mml:math id="M354" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>*</mml:mo><mml:mo>*</mml:mo></mml:mrow></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col9">21<inline-formula><mml:math id="M355" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>*</mml:mo><mml:mo>*</mml:mo></mml:mrow></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col10">22</oasis:entry>
         <oasis:entry colname="col11">94</oasis:entry>
         <oasis:entry colname="col12"/>
         <oasis:entry colname="col13">0.86</oasis:entry>
         <oasis:entry colname="col14">0.84</oasis:entry>
         <oasis:entry colname="col15"><bold>0.94</bold></oasis:entry>
         <oasis:entry colname="col16"><bold>0.96</bold></oasis:entry>
         <oasis:entry colname="col17"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">PG</oasis:entry>
         <oasis:entry colname="col2">June</oasis:entry>
         <oasis:entry colname="col3"><bold>55</bold><inline-formula><mml:math id="M356" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>*</mml:mo><mml:mo>*</mml:mo></mml:mrow></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4">63<inline-formula><mml:math id="M357" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>*</mml:mo><mml:mo>*</mml:mo></mml:mrow></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5">78</oasis:entry>
         <oasis:entry colname="col6">95</oasis:entry>
         <oasis:entry colname="col7"/>
         <oasis:entry colname="col8"><bold>35</bold><inline-formula><mml:math id="M358" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>*</mml:mo><mml:mo>*</mml:mo></mml:mrow></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col9">42<inline-formula><mml:math id="M359" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>*</mml:mo><mml:mo>*</mml:mo></mml:mrow></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col10">50</oasis:entry>
         <oasis:entry colname="col11">136</oasis:entry>
         <oasis:entry colname="col12"/>
         <oasis:entry colname="col13">0.90</oasis:entry>
         <oasis:entry colname="col14">0.84</oasis:entry>
         <oasis:entry colname="col15">0.93</oasis:entry>
         <oasis:entry colname="col16"><bold>0.96</bold></oasis:entry>
         <oasis:entry colname="col17"/>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table><table-wrap-foot><p id="d1e7942"><inline-formula><mml:math id="M334" display="inline"><mml:msup><mml:mi/><mml:mo>*</mml:mo></mml:msup></mml:math></inline-formula> The RMSE/CRPS is significantly lower than the RMSE/CRPS of the <italic>areal and centroid models</italic> on a 5 % significance level. <inline-formula><mml:math id="M335" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>*</mml:mo><mml:mo>*</mml:mo></mml:mrow></mml:msup></mml:math></inline-formula> The RMSE/CRPS is significantly lower than the RMSE/CRPS of <italic>top-kriging</italic> on a 5 % significance level.</p></table-wrap-foot></table-wrap>

      <p id="d1e8798">Considering the posterior standard deviation in Fig. <xref ref-type="fig" rid="Ch1.F7"/>, we notice that top-kriging and the areal model provide a similar quantification of the predictive uncertainty. Top-kriging and the areal model take the nestedness of catchments into account by interpreting the runoff data as areal referenced, providing a predictive standard deviation of runoff that varies with the size of the target catchment: Fig. <xref ref-type="fig" rid="Ch1.F7"/> shows that smaller catchments typically have a larger predictive uncertainty, which is reasonable. For the centroid model, runoff observations are point referenced and weighted independently of catchment size. Consequently, the predictive uncertainty only depends on how the centroids of the observed catchments are distributed in space, and decreases in areas where there are clusters of data. The predictive uncertainties provided by top-kriging and the areal method are thus more intuitive and realistic considering the process we are studying. The latter is also reflected in the coverage percentages presented in Table <xref ref-type="table" rid="Ch1.T1"/>. The coverages show the amount of actual observations that were captured by the corresponding 95 % prediction intervals, and these are slightly closer to 0.95 for top-kriging and the areal model compared to the centroid model.</p>
      <p id="d1e8808">Table <xref ref-type="table" rid="Ch1.T1"/> also presents the summary scores for the predictive performance for infill of missing values for ungauged catchments for all methods. According to the RMSE and CRPS, top-kriging is a better interpolation method than our two suggested methods for ungauged catchments. However, the boxplots in Fig. <xref ref-type="fig" rid="Ch1.F8"/> illustrate the distribution of RMSE for all catchments, and we see that on a monthly scale, the difference between top-kriging and the two other methods is quite low from a practical point of view. For January and April the differences are almost negligible.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F8" specific-use="star"><?xmltex \currentcnt{8}?><label>Figure 8</label><caption><p id="d1e8817">Distribution of RMSE (mm yr<inline-formula><mml:math id="M360" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> and mm month<inline-formula><mml:math id="M361" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>) for infill of missing values for all catchments and years (1996–2005) when the target catchments are treated as ungauged (UG) in the cross-validation for the areal, centroid and top-kriging methods. The lower and upper quartiles correspond to the first and third quartiles (the 25th and 75th percentiles), and the whiskers extend from the quartiles no further than <inline-formula><mml:math id="M362" display="inline"><mml:mrow><mml:mn mathvariant="normal">1.5</mml:mn><mml:mo>⋅</mml:mo></mml:mrow></mml:math></inline-formula> IQR, where IQR is the distance between the first and third quartiles. The same applies for all boxplots presented in this paper.</p></caption>
            <?xmltex \igopts{width=497.923228pt}?><graphic xlink:href="https://hess.copernicus.org/articles/24/4109/2020/hess-24-4109-2020-f08.png"/>

          </fig>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T2" specific-use="star"><?xmltex \currentcnt{2}?><label>Table 2</label><caption><p id="d1e8863">The posterior mode of the range parameters <inline-formula><mml:math id="M363" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi mathvariant="normal">c</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M364" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi>x</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and the marginal standard deviations <inline-formula><mml:math id="M365" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>c</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M366" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>x</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> of the climatic and the annual GRFs <inline-formula><mml:math id="M367" display="inline"><mml:mrow><mml:mi>c</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M368" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> for the areal model <bold>(a)</bold> and centroid model <bold>(b)</bold>. The posterior standard deviations of the parameters are shown in parenthesis as a measure of the uncertainty. The mode and standard deviations vary between the experiments and groups in the cross-validation, and the values given here are the mean over all folds and experiments (UG and PG). The spatial effect that dominates (annual or climatic) is marked in bold.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="5">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"><bold>(a)</bold> Areal model</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M369" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi mathvariant="normal">c</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> (km)</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M370" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi>x</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> (km)</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M371" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>c</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> (mm yr<inline-formula><mml:math id="M372" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M373" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>x</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> (mm yr<inline-formula><mml:math id="M374" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>)</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">Annual</oasis:entry>
         <oasis:entry colname="col2">58     (7)</oasis:entry>
         <oasis:entry colname="col3">476  (65)</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M375" display="inline"><mml:mn mathvariant="bold">880</mml:mn></mml:math></inline-formula> (56)</oasis:entry>
         <oasis:entry colname="col5">267 (23)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">January</oasis:entry>
         <oasis:entry colname="col2">31 (7)</oasis:entry>
         <oasis:entry colname="col3">247 (22)</oasis:entry>
         <oasis:entry colname="col4">72 (6)</oasis:entry>
         <oasis:entry colname="col5"><bold>83</bold> (4)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">April</oasis:entry>
         <oasis:entry colname="col2">77 (14)</oasis:entry>
         <oasis:entry colname="col3">239 (32)</oasis:entry>
         <oasis:entry colname="col4"><bold>75</bold> (6)</oasis:entry>
         <oasis:entry colname="col5">48 (3)</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">June</oasis:entry>
         <oasis:entry colname="col2">43    (5)</oasis:entry>
         <oasis:entry colname="col3">153 (22)</oasis:entry>
         <oasis:entry colname="col4"><bold>181</bold> (9)</oasis:entry>
         <oasis:entry colname="col5">75 (3)</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"><bold>(b)</bold> Centr. model</oasis:entry>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M376" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi mathvariant="normal">c</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> (km)</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M377" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi>x</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> (km)</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M378" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>c</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> (mm yr<inline-formula><mml:math id="M379" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col5"><inline-formula><mml:math id="M380" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>x</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> (mm yr<inline-formula><mml:math id="M381" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Annual</oasis:entry>
         <oasis:entry colname="col2">89     (12)</oasis:entry>
         <oasis:entry colname="col3">659  (77)</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M382" display="inline"><mml:mn mathvariant="bold">750</mml:mn></mml:math></inline-formula> (57)</oasis:entry>
         <oasis:entry colname="col5">263 (22)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">January</oasis:entry>
         <oasis:entry colname="col2">82 (15)</oasis:entry>
         <oasis:entry colname="col3">369 (44)</oasis:entry>
         <oasis:entry colname="col4">60 (6)</oasis:entry>
         <oasis:entry colname="col5"><bold>88</bold> (7)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">April</oasis:entry>
         <oasis:entry colname="col2">118 (19)</oasis:entry>
         <oasis:entry colname="col3">375 (51)</oasis:entry>
         <oasis:entry colname="col4"><bold>66</bold> (4)</oasis:entry>
         <oasis:entry colname="col5">52 (5)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">June</oasis:entry>
         <oasis:entry colname="col2">69    (9)</oasis:entry>
         <oasis:entry colname="col3">335 (47)</oasis:entry>
         <oasis:entry colname="col4"><bold>161</bold> (12)</oasis:entry>
         <oasis:entry colname="col5">71 (6)</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

</sec>
<?pagebreak page4124?><sec id="Ch1.S6.SS1.SSS2">
  <label>6.1.2</label><title>Infill for partially gauged catchments (PG)</title>
      <p id="d1e9316">For the partially gauged (PG) case, each target catchment is allowed to have a short record of length 1 for top-kriging and the areal and centroid models and length 2 for linear regression. Before we comment the results from the cross-validation in Table <xref ref-type="table" rid="Ch1.T1"/> and Fig. <xref ref-type="fig" rid="Ch1.F9"/>, we consider the posterior estimates of the range parameters (<inline-formula><mml:math id="M383" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi>x</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M384" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi mathvariant="normal">c</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>) and the marginal variance parameters (<inline-formula><mml:math id="M385" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>x</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M386" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>c</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>) of the year-specific GRF <inline-formula><mml:math id="M387" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> and the climatic GRF <inline-formula><mml:math id="M388" display="inline"><mml:mrow><mml:mi>c</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> for our four datasets. These are shown in Table <xref ref-type="table" rid="Ch1.T2"/> and indicate how much of the spatial variability that is captured by the climatic GRF relative to the annual GRF. In particular, if <inline-formula><mml:math id="M389" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>c</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> dominates over <inline-formula><mml:math id="M390" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>x</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, it suggests hydrological spatial stability.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F9" specific-use="star"><?xmltex \currentcnt{9}?><label>Figure 9</label><caption><p id="d1e9425">Distribution of RMSE (mm yr<inline-formula><mml:math id="M391" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> and mm month<inline-formula><mml:math id="M392" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>) for infill of missing values for all catchments and years (1996–2005) for the areal model, centroid model and top-kriging when the target catchments are treated as partially gauged (PG); i.e. a short record of length 1 from the target catchment is included in the observation likelihood in the cross-validation. Results for linear regression (LR) are also included here.</p></caption>
            <?xmltex \igopts{width=497.923228pt}?><graphic xlink:href="https://hess.copernicus.org/articles/24/4109/2020/hess-24-4109-2020-f09.png"/>

          </fig>

      <?xmltex \floatpos{t}?><fig id="Ch1.F10" specific-use="star"><?xmltex \currentcnt{10}?><label>Figure 10</label><caption><p id="d1e9460">From left to right: the climatic part of the model (common for all years), the annual (year-dependent) part of the model and the full model <inline-formula><mml:math id="M393" display="inline"><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> for annual runoff in 1997 and 1998 (mm yr<inline-formula><mml:math id="M394" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>). Note that the scales of the middle plots only cover 25 % of the scale of the other plots. We see that most of the spatial variability of annual runoff for 1997 and 1998 can be explained by climatic effects and that the climatic range <inline-formula><mml:math id="M395" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi mathvariant="normal">c</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is considerably smaller than the year-specific range <inline-formula><mml:math id="M396" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi>x</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>. The results above are produced by the centroid model, and plots similar to these are behind all results presented for the areal and centroid models in this article.</p></caption>
            <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://hess.copernicus.org/articles/24/4109/2020/hess-24-4109-2020-f10.png"/>

          </fig>

      <?pagebreak page4125?><p id="d1e9521">The estimates in Table <xref ref-type="table" rid="Ch1.T2"/> show that the hydrological spatial stability is largest for June and for annual runoff, as expected from the time series in Figs. <xref ref-type="fig" rid="Ch1.F2"/>b and <xref ref-type="fig" rid="Ch1.F3"/>. Here, the posterior mode for <inline-formula><mml:math id="M397" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>c</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is more than twice as large as the posterior mode for <inline-formula><mml:math id="M398" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>x</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> for both the areal and centroid models. Furthermore, we see in Table <xref ref-type="table" rid="Ch1.T2"/> that the climatic range <inline-formula><mml:math id="M399" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi mathvariant="normal">c</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is only around 12 % of the annual range <inline-formula><mml:math id="M400" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi>x</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>. In Fig. <xref ref-type="fig" rid="Ch1.F10"/> we have illustrated the spatial pattern these parameters give for annual predictions in 1997 and 1998 for the whole study area. We see that the annual runoff for 1997 and 1998 have the same spatial pattern, and that this spatial pattern mostly originates from <inline-formula><mml:math id="M401" display="inline"><mml:mrow><mml:mi>c</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, i.e. climatic conditions including catchment characteristics. The trend we see in Fig. <xref ref-type="fig" rid="Ch1.F10"/> can also be seen for the remaining 8 years in the dataset (1996, 1999–2005) as well as for June. A spatial pattern like this, with <inline-formula><mml:math id="M402" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>c</mml:mi></mml:msub><mml:mo>≫</mml:mo><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>x</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M403" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi mathvariant="normal">c</mml:mi></mml:msub><mml:mo>&lt;</mml:mo><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi>x</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, suggests that the information gain from neighbouring catchments further away is low for an ungauged catchment and that the potential information stored in short records is high.</p>
      <p id="d1e9632">For January however the situation is different: the posterior mode of <inline-formula><mml:math id="M404" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>x</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is larger than the posterior mode of <inline-formula><mml:math id="M405" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>c</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> for both the areal and centroid model. The parameters show that for January, year-specific effects explain a larger part of the spatial variability. This can be due to a more unstable hydrological setting with runoff driven by snow accumulation and snowmelt. For April, we have that <inline-formula><mml:math id="M406" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>c</mml:mi></mml:msub><mml:mo>&gt;</mml:mo><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>x</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, but <inline-formula><mml:math id="M407" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>c</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is less dominant than for June and for the annual data.</p>
      <p id="d1e9686">In the areal and centroid models, the inclusion of a short record changes the climatic spatial field <inline-formula><mml:math id="M408" display="inline"><mml:mrow><mml:mi>c</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, and hence the predictions can be considerably changed for the target catchment if the climatic effect is strong. The parameter values thus suggest that the gain of including short records is lower for
April and January compared to the other two datasets. This is confirmed by comparing the RMSE and CRPS for the areal and centroid models for the partially gauged case (PG) to the RMSE and CRPS obtained for the ungauged case (UG) in Table <xref ref-type="table" rid="Ch1.T1"/>. For all datasets, the RMSE and CRPS for our two models are reduced for PG compared to UG, but the reduction is lower for January and April than for June and the annual data. For the annual predictions, the RMSE and CRPS are reduced by more than <inline-formula><mml:math id="M409" display="inline"><mml:mrow><mml:mn mathvariant="normal">50</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="italic">%</mml:mi></mml:mrow></mml:math></inline-formula> when a short record of length 1 from the target catchment is included in the observation likelihood. The reduction for June is also remarkable (around 35 %–40 %), while the reduction for January<?pagebreak page4126?> and April is moderate (around 13 %–20 %). The results hold for both the areal and centroid models, but the areal model seems to be somewhat better than the centroid model in terms of exploiting short records of data from the target catchment. This is again related to the parameter estimates in Table <xref ref-type="table" rid="Ch1.T2"/>, where we see that <inline-formula><mml:math id="M410" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>c</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> dominates more over <inline-formula><mml:math id="M411" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>x</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> in the areal model than in the centroid model.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F11"><?xmltex \currentcnt{11}?><label>Figure 11</label><caption><p id="d1e9743">All observations for 1996–2005 compared to the corresponding predictions for the ungauged case (UG, left) and the partially gauged case (PG, right) for annual predictions <bold>(a)</bold> and for April <bold>(b)</bold>. The predictions are performed by the areal model. The straight line represents a perfect correspondence between prediction and observation.</p></caption>
            <?xmltex \igopts{width=241.848425pt}?><graphic xlink:href="https://hess.copernicus.org/articles/24/4109/2020/hess-24-4109-2020-f11.png"/>

          </fig>

      <p id="d1e9758">Considering the results for top-kriging in Table <xref ref-type="table" rid="Ch1.T1"/>, we only obtain a small reduction in the RMSE and CRPS for the partially gauged case (PG) compared to the ungauged case (UG). This is because top-kriging treats each year of data independently when considering infill of missing annual data. A reduction in RMSE and CRPS is only seen for the specific year with extra data. This is different from our framework, where several years of data are modelled simultaneously. The evaluation scores in Table <xref ref-type="table" rid="Ch1.T1"/> and the boxplots in Fig. <xref ref-type="fig" rid="Ch1.F9"/> clearly show that our two suggested methods outperform top-kriging for the partially gauged case for annual predictions and monthly predictions in June, which were the two timescales with the most hydrological spatial stability (<inline-formula><mml:math id="M412" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>c</mml:mi></mml:msub><mml:mo>≫</mml:mo><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>x</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>). For January and April the three models are more similar in predictive performance.</p>
      <p id="d1e9786">For the PG case, we also compare the areal and centroid models to simple linear regression. According to Table <xref ref-type="table" rid="Ch1.T1"/> and Fig. <xref ref-type="fig" rid="Ch1.F9"/> linear regression performs quite well for the annual data, which represent the most hydrologically spatially stable dataset. Linear regression actually provides the second lowest RMSE of all four methods for annual predictions. However, recall that a short record of length 2 from the target catchment is needed to use this method, while our areal model performs slightly better with a short record of length 1 (and observations from other neighbouring catchments). For January, April and June, linear regression is outperformed by the three other methods in terms of RMSE and CRPS (Table <xref ref-type="table" rid="Ch1.T1"/>).</p>
      <p id="d1e9795">To illustrate the possible gain of including (very) short records of data from the target catchment, we present four scatter plots that compare the predicted values produced by the areal model to the actual observations of runoff (Fig. <xref ref-type="fig" rid="Ch1.F11"/>). For the annual predictions in Fig. <xref ref-type="fig" rid="Ch1.F11"/>a, the predictions for PG<?pagebreak page4127?> are considerably more concentrated around the straight line that indicates a perfect fit, than the predictions for UG. There are similar results for June, whereas the difference between the ungauged and partially gauged case is not that prominent for April (see Fig. <xref ref-type="fig" rid="Ch1.F11"/>b) and January. Furthermore, the April scatter plots demonstrate that (very) short records do not lead to a poorer predictive performance, even if April is a month driven by more unstable hydrological patterns. The predictions are simply not substantially affected by the new data points that are included in the likelihood, as we can see in Fig. <xref ref-type="fig" rid="Ch1.F11"/>b. In our model, the risk of including very short records is low because climatic effects <inline-formula><mml:math id="M413" display="inline"><mml:mrow><mml:mi>c</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> are adjusted relative to year-specific effects <inline-formula><mml:math id="M414" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> by statistical inference. This way short records can safely be included in the modelling regardless of the underlying weather patterns and the degree of hydrological spatial stability.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F12"><?xmltex \currentcnt{12}?><label>Figure 12</label><caption><p id="d1e9840">Average posterior mean <inline-formula><mml:math id="M415" display="inline"><mml:mrow><mml:msub><mml:mi>Q</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="script">A</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> <bold>(a, b)</bold>, average posterior standard deviation <bold>(c, d)</bold> and RMSE <bold>(e, f)</bold> for <inline-formula><mml:math id="M416" display="inline"><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula> for predictions of missing annual observations for the areal model for the ungauged case (UG, <bold>a, c, e</bold>) and the partially gauged case (PG, <bold>b, d, f</bold>).</p></caption>
            <?xmltex \igopts{width=241.848425pt}?><graphic xlink:href="https://hess.copernicus.org/articles/24/4109/2020/hess-24-4109-2020-f12.png"/>

          </fig>

      <p id="d1e9902">In Fig. <xref ref-type="fig" rid="Ch1.F7"/> we saw that all three interpolation methods were able to reproduce the true spatial pattern of annual runoff when filling in missing annual values for ungauged catchments (UG). However, all three methods produced high RMSE values for some of the catchments. These were typically small catchments located on the western coast of Norway. Figure <xref ref-type="fig" rid="Ch1.F12"/> shows the impact of including a short record of length 1 for these catchments. It compares the annual predictions from the ungauged case (UG) to the annual predictions from the partially gauged case (PG) for the areal model. We see a large reduction in the RMSE for many of the catchments, and a (realistic) reduction of the posterior standard deviation. We also see that a few of the catchments obtain a decrease in predictive performance when short records are included, but the overall tendency is clear: the gain of including short records for annual predictions in Norway is high, and the suggested framework is able to exploit this property.</p>
</sec>
</sec>
<sec id="Ch1.S6.SS2">
  <label>6.2</label><title>Predictions of mean annual runoff (1981–2010)</title>
      <p id="d1e9918">So far, we have presented an evaluation of the framework's ability to fill in missing annual observations of runoff for a 10-year period (1996–2005). We now present the evaluation of the framework's ability to predict mean annual runoff for a 30-year period as a whole (1981–2010), as described in Sect. <xref ref-type="sec" rid="Ch1.S5.SS2"/>.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F13"><?xmltex \currentcnt{13}?><label>Figure 13</label><caption><p id="d1e9925">RMSE and CRPS as a function of record length (0, 1, 3, 5 and 10) for predictions of mean annual runoff for 1981–2010 for 83 fully gauged catchments in southern Norway.</p></caption>
          <?xmltex \igopts{width=241.848425pt}?><graphic xlink:href="https://hess.copernicus.org/articles/24/4109/2020/hess-24-4109-2020-f13.png"/>

        </fig>

      <p id="d1e9934">Figure <xref ref-type="fig" rid="Ch1.F13"/> shows the RMSE and CRPS for the predictions of mean annual runoff for top-kriging and the areal and centroid models as a function of record length (0, 1, 3, 5 and 10). The record length is the number of annual runoff observations available from the target catchment in the cross-validation. We find that top-kriging again performs best for<?pagebreak page4128?> the ungauged case (short record length 0), while the centroid model performs slightly better than the areal model for ungauged target catchments.
Furthermore, the RMSE and CRPS decrease with increasing record length for all three methods. However, Fig. <xref ref-type="fig" rid="Ch1.F13"/> shows that our areal and centroid models outperform top-kriging for record lengths larger than 0: the overall difference between our framework and top-kriging is around 30–60 mm yr<inline-formula><mml:math id="M417" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> in terms of RMSE, which is a considerable difference when the RMSE values are around 100–200 mm yr<inline-formula><mml:math id="M418" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>.</p>
      <p id="d1e9966">Furthermore, we notice the large increase in predictive performance when including a (very) short record of length 1 (PG1 in Fig. <xref ref-type="fig" rid="Ch1.F13"/>). The reduction in RMSE and CRPS is 45 %–50 % from the UG to the PG1 case for the areal and centroid models. These results are thus comparable to the results we obtained for the experiments related to infill of missing annual values (Sect. <xref ref-type="sec" rid="Ch1.S6.SS1"/>).</p>
      <p id="d1e9973">To be able to compare our findings with other studies, we also included plots of the ANE and the squared correlation coefficient (<inline-formula><mml:math id="M419" display="inline"><mml:mrow><mml:msup><mml:mi>r</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>) for the experiments. These can be found in Figs. <xref ref-type="fig" rid="Ch1.F14"/> and <xref ref-type="fig" rid="Ch1.F15"/>, and are referred to in the discussion (Sect. <xref ref-type="sec" rid="Ch1.S7.SS2"/>). Also, according to these scale-independent evaluation criteria, the overall results are that for ungauged catchments top-kriging performs best, while when there are short records available, our framework performs better.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F14" specific-use="star"><?xmltex \currentcnt{14}?><label>Figure 14</label><caption><p id="d1e9995">Absolute normalized error (ANE) for the areal model (A), centroid model (C) and top-kriging (TK) for predictions of mean annual runoff in ungauged catchments (UG, <bold>a</bold>) and in partially gauged catchments with short records of length 1 (PG1, <bold>b</bold>) and length 5 (PG5, <bold>c</bold>).</p></caption>
          <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://hess.copernicus.org/articles/24/4109/2020/hess-24-4109-2020-f14.png"/>

        </fig>

      <?xmltex \floatpos{t}?><fig id="Ch1.F15"><?xmltex \currentcnt{15}?><label>Figure 15</label><caption><p id="d1e10015">The squared correlation coefficient (<inline-formula><mml:math id="M420" display="inline"><mml:mrow><mml:msup><mml:mi>r</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>) for predictions of mean annual runoff for catchments in Norway with record length 0, 1, 3, 5 and 10.</p></caption>
          <?xmltex \igopts{width=113.811024pt}?><graphic xlink:href="https://hess.copernicus.org/articles/24/4109/2020/hess-24-4109-2020-f15.png"/>

        </fig>

</sec>
</sec>
<sec id="Ch1.S7">
  <label>7</label><title>Discussion</title>
      <p id="d1e10044">In this article we have presented a geostatistical framework particularly suitable for hydrological datasets that include short records of data. Here, we highlight four points for discussion: (1) the difference in performance across methods and study areas, (2) comparing the findings with other<?pagebreak page4129?> studies, (3) shortcomings of the suggested framework and (4) suggested areas of use.</p>
<sec id="Ch1.S7.SS1">
  <label>7.1</label><title>Difference in performance across methods and study areas</title>
      <p id="d1e10054">In our work, we evaluated two versions of our suggested framework by predicting annual runoff and mean annual runoff for Norway. The results showed that our areal-referenced method and our point- (or centroid-)referenced method gave very similar results in terms of posterior mean (see e.g. Figs. <xref ref-type="fig" rid="Ch1.F7"/> and <xref ref-type="fig" rid="Ch1.F13"/>). We did not find a trend describing when one of the methods performed better than the others. In prior to the analysis, we would expect the areal model to perform better than the centroid model for ungauged, nested catchments since the areal model takes the water balance and the nested structure of catchments into account. However, these properties did not have a notable impact on the predicted posterior mean runoff for this particular dataset. This is not an extraordinary result as similar results have been obtained by other studies that have compared top-kriging (areal-referenced approach) to ordinary kriging (point-referenced approach): the point-referenced approaches often perform similarly to the areal-referenced approaches <xref ref-type="bibr" rid="bib1.bibx11 bib1.bibx44" id="paren.66"/>.</p>
      <p id="d1e10064">A possible explanation for the similar performance of the centroid and areal model in this study is that the proportion of nested catchments in our datasets was relatively low: only 30 % of the catchments in Fig. <xref ref-type="fig" rid="Ch1.F1"/> were nested, while the percentage of nested catchments was 53 % among the fully gauged catchments in Fig. <xref ref-type="fig" rid="Ch1.F5"/>. Furthermore, most of the nested catchments only have one overlapping catchment. The water balance constraints of the areal model might be more important for datasets where there is a higher percentage of nested catchments in an area with high spatial variability. One example is shown in <xref ref-type="bibr" rid="bib1.bibx37" id="text.67"/>.</p>
      <p id="d1e10074">It is also possible that the water balance constraints of the areal model have some drawbacks. One example is if there is poor data quality for a subcatchment in the dataset. Then we impose an inaccurate but relatively strict constraint on the runoff in this catchment's drainage area. This will have an impact on the predictions for all overlapping catchments, and how the predicted runoff is distributed here. In this sense, the areal model is less flexible than the centroid model and requires better data quality.</p>
      <p id="d1e10077">The water balance constraints of the areal model also makes it computationally more expensive than the two other models. Top-kriging used around 1 min for the interpolation of mean annual runoff presented in Sect. <xref ref-type="sec" rid="Ch1.S6.SS2"/> for one cross-validation fold. The centroid model used around 30–40 min for the interpolation, but provided results for both mean annual runoff and runoff for 30 individual years at the same time. Its run time is thus similar to the run time of top-kriging per year. The areal model on the other hand used around 6–7 h to do the calculations on the same computational server. Hence, from a practical point of view, the centroid model might be the most convenient version of our suggested framework for many applications. However, note that if the posterior uncertainty is important, the areal model gives a more realistic representation of uncertainty than the centroid model (see Fig. <xref ref-type="fig" rid="Ch1.F7"/>). The centroid model also treats small and large catchments equally, which can be problematic for some applications and study areas.</p>
      <?pagebreak page4130?><p id="d1e10085">When considering predictions for ungauged catchments, the results showed that top-kriging provided better results than our two suggested models. Figure <xref ref-type="fig" rid="Ch1.F7"/> showed that the three methods failed and succeeded for many of the same catchments, but that our models failed slightly more than top-kriging on average. We also see an indication that our models fail more than top-kriging for ungauged catchments that are located further away from other catchments. See for example the catchment that is located south-east in Fig. <xref ref-type="fig" rid="Ch1.F7"/>. For ungauged catchments located far away from other catchments (relative to the spatial range), the predicted value will go towards the intercept <inline-formula><mml:math id="M421" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi mathvariant="normal">c</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> for our two Bayesian models. For top-kriging, the predicted value will always be a weighted sum of the observations from the neighbouring catchments. This can explain the difference in performance here. Apart from this, we do not find a pattern for which catchments top-kriging performs better (mean elevation, location and the magnitude of the observed value were investigated).</p>
      <p id="d1e10103">While top-kriging performed best for ungauged catchments, our framework outperformed top-kriging when there were some available data from the target catchment. This was the case both when predicting mean annual runoff, and runoff for individual years. The results showed that the potential gain of including (very) short records in the modelling in Norway was large. An explanation is that the annual runoff in Norway is mainly controlled by orographic precipitation. Since the orographic precipitation is driven by topography and westerly winds are dominating, the precipitation patterns are repeated each year and we obtain hydrological spatial stability with <inline-formula><mml:math id="M422" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>c</mml:mi></mml:msub><mml:mo>≫</mml:mo><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>x</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>. The mountains in Norway also lead to rapid weather changes in space, here expressed through a low climatic spatial range <inline-formula><mml:math id="M423" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi mathvariant="normal">c</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>. Consequently, the information gain from neighbouring catchments is often low for ungauged catchments, and information from the target catchment can be very valuable. It is also convenient that Norway has a humid climate where only around 10 %–20 % of the annual precipitation evaporates.</p>
      <p id="d1e10135">The evaluation based on annual time series of monthly runoff gave us an indication of how the framework can be expected to behave for other climates and countries: for areas where the annual runoff is driven by unstable weather patterns and hydrological processes, short records can not be expected to contribute to as large improvements in the predictions as for the Norwegian annual data (see the predictions for April in Fig. <xref ref-type="fig" rid="Ch1.F11"/>b). This might be the case for countries and areas where most of the runoff can be explained by convective precipitation, where the aridity index is large or for variables for which storage effects are significant. However, the monthly predictions for January and April also illustrated that we safely can include (very) short records in the model, even if year-specific effects explain most of the spatial variability of runoff. By this we have demonstrated that our models represent a framework for safe use of short records regardless of record length and climate and with the benefit that we do not need to consider the choice of donor catchment as in other comparable methods.</p>
      <p id="d1e10140">Norway is a country with a moderate gauging density. The framework has not been tested for a denser gauging density. We suppose that there is less to gain from including short records if the gauging density is large relative to the spatial range: here the information obtained from neighbouring catchments could be sufficient. However, a high density of gauged catchments and a close distance to neighbouring catchments do not always guarantee good predictability at an ungauged catchment <xref ref-type="bibr" rid="bib1.bibx35" id="paren.68"/>. It is for example often difficult to predict runoff in ungauged catchments that are very small and/or located close to weather divides. We believe that for such catchments, our method for including short records can be useful regardless of gauging density (as long as the study area is characterized by repeated runoff patterns over time).</p>
</sec>
<sec id="Ch1.S7.SS2">
  <label>7.2</label><title>Comparing the findings of this study with other studies</title>
      <p id="d1e10154">There exist several other studies of mean annual runoff in the literature, and some of them are compared in terms of the ANE in the chapter about annual runoff in <xref ref-type="bibr" rid="bib1.bibx4" id="text.69"/>. According to Figure 5.27 in <xref ref-type="bibr" rid="bib1.bibx4" id="text.70"/>, an ANE between 0.05 and 0.5 is a typical result for regions like Norway where the potential evapotranspiration is less than 40 % of the mean annual precipitation. Figure <xref ref-type="fig" rid="Ch1.F14"/> showed that the median ANE obtained for our suggested models is around 0.12 for ungauged catchments, i.e. in the lower range of ANE values in <xref ref-type="bibr" rid="bib1.bibx4" id="text.71"/>. When a short record of length 1 or 5 was available (PG1 and PG5), the median ANE was as low as 0.05 and 0.03 for our methods.</p>
      <p id="d1e10168">In Fig. 5.30 in <xref ref-type="bibr" rid="bib1.bibx4" id="text.72"/> there is also a subplot showing the ANE for predictions of mean annual runoff for ungauged catchments in Austria. Here, geostatistical models (top-kriging) and process-based models (conceptual hydrological models) provided the best predictions according to the ANE, with a median ANE around 0.1. The results we obtain in Fig. <xref ref-type="fig" rid="Ch1.F14"/> for the ungauged catchments are thus comparable to the results from Austria. This is reasonable as the Austrian climate is humid, like the Norwegian, and the western part of the country is dominated by mountains (the Alps) and has similar climate characteristics to Norway.</p>
      <p id="d1e10176">Furthermore, <xref ref-type="bibr" rid="bib1.bibx4" id="text.73"/> report an <inline-formula><mml:math id="M424" display="inline"><mml:mrow><mml:msup><mml:mi>r</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> (squared correlation coefficient) between 0.60 and 0.99 for studies done by cross-validation of around 250 catchments, or for studies using models based on spatial proximity like our suggested framework (Figure 5.25 and Figure 5.26 in <xref ref-type="bibr" rid="bib1.bibx4" id="altparen.74"/>). The <inline-formula><mml:math id="M425" display="inline"><mml:mrow><mml:msup><mml:mi>r</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> for our two models was shown in Fig. <xref ref-type="fig" rid="Ch1.F15"/>, and we see that it lies between 0.91 and 0.99. This is in the higher range of values obtained by comparable studies.</p>
</sec>
<sec id="Ch1.S7.SS3">
  <label>7.3</label><title>Shortcomings</title>
      <p id="d1e10217">In this article, we proposed two models for runoff that are Gaussian. However, runoff is truncated at zero and typically<?pagebreak page4131?> not Gaussian distributed which we also can see from the histograms in Fig. <xref ref-type="fig" rid="Ch1.F1"/>b and <xref ref-type="fig" rid="Ch1.F4"/>. The consequence of the Gaussian assumptions is that there is nothing in the models that prevents them from predicting negative runoff. Negative values appear for both the areal and centroid models due to the uncertainty given by <inline-formula><mml:math id="M426" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>y</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, but this is also a problem for the top-kriging technique. Another source of negative values is that the climatic part of the model (<inline-formula><mml:math id="M427" display="inline"><mml:mrow><mml:mi>c</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi mathvariant="normal">c</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>) can be negative in some areas. This is a fully valid result because the other model components could still ensure positive predictions for most catchments and years. However, it can become a problem if we are unlucky and the year-specific GRF does not make up for the negative climatic GRF for one specific year. To avoid negative values, it is possible to log transform the data before performing an analysis. However, this is only valid for the centroid model, as the log transform is not compatible with the linear aggregation performed by the areal model (Eq. <xref ref-type="disp-formula" rid="Ch1.E11"/>).</p>
      <p id="d1e10258">In the areal model, negative values also appear as a consequence of requiring preservation of water balance. If there are inconsistent or poor data over nested catchments, negative runoff in parts of a catchment can be the only option to fulfill the water balance requirements. To avoid negative runoff it is important that the discretization of the study area is fine enough to capture rapid changes in runoff over nested catchments. Catchments that are significantly influenced by human activities should also be removed from the analysis as these can influence both the water balance and the significance of the climatic field <inline-formula><mml:math id="M428" display="inline"><mml:mrow><mml:mi>c</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> relative to the annual field <inline-formula><mml:math id="M429" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">u</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>.</p>
      <p id="d1e10292">In our study, some negative values were produced for the monthly predictions, as we can see in Fig. <xref ref-type="fig" rid="Ch1.F11"/>b. However, this is not common and happened for only 1.2 % of the predictions of missing monthly data and for a few data points for the missing annual data for the areal and centroid models. For predictions of <italic>mean</italic> annual runoff, negative values almost never appear as such effects typically are averaged out. Note that unphysical results also appear for top-kriging and other interpolation methods, either in terms of violating the water balance or in terms of negative values. These model weaknesses should be noted such that the modeller is able to choose what is most important in a real modelling setting. In this case it is a choice between (1) avoiding negative values by log transforming the data before using top-kriging or the centroid model or (2) imposing water balance constraints through the areal model.</p>
</sec>
<sec id="Ch1.S7.SS4">
  <label>7.4</label><title>Suggested areas of use</title>
      <p id="d1e10309">Finally, we want to highlight what we think are the main areas of use for our suggested framework. First, our results showed that our main benefit compared to top-kriging was connected to exploiting short records from the <italic>target</italic> catchment. For this reason, we think that our method is suitable as a pre-processing method for making inference about the (mean) annual runoff in partially gauged catchments before doing a further analysis with other statistical tools or process-based models. One possible approach for runoff estimation could for example be a two-step procedure where we (i) use the centroid or areal model as a record augmentation technique to predict runoff for the partially gauged catchments in the dataset and (ii) use top-kriging to predict runoff in <italic>ungauged</italic> catchments. Here, the results from step (i) can be used as observed values in top-kriging together with data from fully gauged catchments. Differences in observation uncertainty between fully gauged and partially gauged catchments should here be taken into account.</p>
      <p id="d1e10318">Secondly, we see that the parameter values of the suggested model provides interesting information about the study area. More specifically, if the marginal variance of the climatic GRF <inline-formula><mml:math id="M430" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>c</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> dominates over the marginal variance of the year-specific GRF <inline-formula><mml:math id="M431" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>x</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, it suggests that the spatial variability is stable over time and that short records of runoff can have a large impact on the model, particularly if also <inline-formula><mml:math id="M432" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi mathvariant="normal">c</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is  small. This information can be used by decision makers to e.g. motivate the installation of a new (possibly temporary) gauging station as this might improve the long-term estimates only a year after installation for this catchment. Likewise can the model and its parameters be used to assess whether a gauging station is redundant and can be shut down. However, to exactly quantify the importance of a gauging station, all model variances (<inline-formula><mml:math id="M433" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>x</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M434" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>c</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M435" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">β</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M436" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>y</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula>) and ranges (<inline-formula><mml:math id="M437" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi>x</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M438" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi mathvariant="normal">c</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>) must be taken into account, as well as the distances between the donor catchments and the target catchment. Computing this gain is outside the scope of this article but an interesting topic for further research that is related to the field of decision theory and the <italic>value of information</italic> <xref ref-type="bibr" rid="bib1.bibx9" id="paren.75"/>.</p>
</sec>
</sec>
<sec id="Ch1.S8" sec-type="conclusions">
  <label>8</label><title>Conclusions</title>
      <p id="d1e10444">We have presented a geostatistical framework for estimating runoff by modelling several years of runoff data simultaneously by using one (climatic) spatial field that is common for all years under study and one (annual) spatial field that is year-specific. By this, we obtain a framework that is particularly suitable for runoff interpolation when the available data originate from a mixture of gauged and partially gauged catchments, and that can be used to estimate runoff at ungauged and partially gauged locations. We evaluated the framework by (1) its ability to fill in missing values of annual runoff and (2) its ability to predict mean annual runoff for ungauged and partially gauged catchments. The case study from Norway showed that the suggested framework performs better than top-kriging for catchments that have short records of data, both for predictions of mean annual runoff and when filling in missing annual values. For totally ungauged catchments, top-kriging performed best. We also (3) demonstrated the potential value of including short records in the modelling and found that the value of (very) short records was high in<?pagebreak page4132?> Norway: an average reduction of 50 % in the RMSE was reported when a short record of length 1 was available from the target catchment, compared to when no annual observations were available. The reason for the large reduction is that the annual runoff in Norway is mainly driven by hydrological processes that are repeated each year. For such areas, our methodology has its main benefits, and we can use it as a tool for motivating the installation of new gauging stations: the new gauging stations might improve the long-term estimates at the target catchments only a year after installation. Furthermore, the results also show that the framework represents safe use of short records down to record lengths of 1 year, regardless of the underlying climatic conditions in the area of interest.</p>
</sec>

      
      </body>
    <back><notes notes-type="codedataavailability"><title>Code and data availability</title>

      <p id="d1e10451">Example code for fitting the centroid model with example data is available on <uri>https://github.com/tjroksva/RunoffInterpolation</uri> (last access: 29 January 2020, <ext-link xlink:href="https://doi.org/10.5281/zenodo.3630348" ext-link-type="DOI">10.5281/zenodo.3630348</ext-link>, Roksvåg, 2020). The remaining data are available upon request.</p>
  </notes><notes notes-type="authorcontribution"><title>Author contributions</title>

      <p id="d1e10463">TR was the main author, responsible for writing the majority of the paper, came up with initial ideas for experimental design, did the implementation, carried out the analysis and made figures.
IS contributed to discussion throughout the work around ideas, analysis and discussion, suggested ideas for the experimental set-up and commented on the manuscript structure and content, and contributed to the writing of Sect. 6.
KE provided the hydrological data, contributed to the discussion, particularly around the hydrological context and questions related to the data, contributed to the writing of Sects. 1 and 2, and commented on the structure and content of the rest of the paper.</p>
  </notes><notes notes-type="competinginterests"><title>Competing interests</title>

      <p id="d1e10469">The authors declare that they have no conflict of interest.</p>
  </notes><ack><title>Acknowledgements</title><p id="d1e10476">We would like to thank Gregor Laaha, Jon Olav Skøien, Joris Beemster and one anonymous referee for in-depth reviews and valuable comments. The review process improved the quality of the paper.</p></ack><notes notes-type="financialsupport"><title>Financial support</title>

      <p id="d1e10481">This research has been supported by the The Research Council of Norway (grant no. 250362).</p>
  </notes><notes notes-type="reviewstatement"><title>Review statement</title>

      <p id="d1e10487">This paper was edited by Harrie-Jan Hendricks Franssen and reviewed by Gregor Laaha, Jon Olav Skøien, and one anonymous referee.</p>
  </notes><ref-list>
    <title>References</title>

      <ref id="bib1.bibx1"><label>Adamowski and Bocci(2001)</label><?label euclid11?><mixed-citation>Adamowski, K. and Bocci, C.: Geostatistical regional trend detection in river
flow data, Hydrol. Process., 15, 3331–3341, <ext-link xlink:href="https://doi.org/10.1002/hyp.1045" ext-link-type="DOI">10.1002/hyp.1045</ext-link>,
2001.</mixed-citation></ref>
      <ref id="bib1.bibx2"><label>Bakka et al.(2018)Bakka, Rue, Fuglstad, Riebler, Bolin, Illian,
Krainski, Simpson, and Lindgren</label><?label BakkaReview?><mixed-citation>Bakka, H., Rue, H., Fuglstad, G.-A., Riebler, A., Bolin, D., Illian, J.,
Krainski, E., Simpson, D., and Lindgren, F.: Spatial modeling with R-INLA: A
review, WIREs Computational Statistics, 10, e1443, <ext-link xlink:href="https://doi.org/10.1002/wics.1443" ext-link-type="DOI">10.1002/wics.1443</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx3"><label>Banerjee et al.(2004)Banerjee, Gelfand, and Carlin</label><?label hierarchical1?><mixed-citation>
Banerjee, S., Gelfand, A., and Carlin, B.: Hierarchical Modeling and Analysis
for Spatial Data, vol. 101 of Monographs on Statistics and Applied
Probability, Chapman &amp; Hall, Boca Raton, Florida, 2004.</mixed-citation></ref>
      <ref id="bib1.bibx4"><label>Blöschl et al.(2013)Blöschl, Sivapalan, Wagener, Viglione, and
Savenije</label><?label PUB?><mixed-citation>
Blöschl, G., Sivapalan, M., Wagener, T., Viglione, A., and Savenije, H.:
Runoff Prediction in Ungauged Basins: Synthesis across Processes,
Places and Scales, Camebridge University Press, Cambridge, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx5"><label>Brenner and Scott(2008)</label><?label FEM?><mixed-citation>
Brenner, S. and Scott, L.: The Mathematical Theory of Finite Element
Methods, 3rd Edition. Vol. 15 of Texts in Applied Mathematics,
Springer, New York, 2008.</mixed-citation></ref>
      <ref id="bib1.bibx6"><label>Casella and Berger(1990)</label><?label StatInf?><mixed-citation>
Casella, G. and Berger, R.: Statistical Inference, Duxbury Press, Belmont, 1990.</mixed-citation></ref>
      <ref id="bib1.bibx7"><label>Cressie(1993)</label><?label Cressie?><mixed-citation>
Cressie, N.: Statistics for spatial data, J. Wiley &amp; Sons, New York, 1993.</mixed-citation></ref>
      <ref id="bib1.bibx8"><label>Eidsvik et al.(2012)Eidsvik, Finley, Banerjee, and Rue</label><?label JoFast?><mixed-citation>Eidsvik, J., Finley, A. O., Banerjee, S., and Rue, H.: Approximate Bayesian
inference for large spatial datasets using predictive process models,
Comput. Stat. Data An., 56, 1362–1380,
<ext-link xlink:href="https://doi.org/10.1016/j.csda.2011.10.022" ext-link-type="DOI">10.1016/j.csda.2011.10.022</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx9"><label>Eidsvik et al.(2015)Eidsvik, Mukerji, and Bhattacharjya</label><?label voi?><mixed-citation>Eidsvik, J., Mukerji, T., and Bhattacharjya, D.: Value of Information in the
Earth Sciences: Integrating Spatial Modeling and Decision Analysis, Cambridge
University Press, Cambridge, <ext-link xlink:href="https://doi.org/10.1017/CBO9781139628785" ext-link-type="DOI">10.1017/CBO9781139628785</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx10"><label>Engeland and Hisdal(2009)</label><?label lowFlow?><mixed-citation>Engeland, K. and Hisdal, H.: A Comparison of Low Flow Estimates in Ungauged
Catchments Using Regional Regression and the HBV-Model, Water Resour.
Manag., 23, 2567–2586, <ext-link xlink:href="https://doi.org/10.1007/s11269-008-9397-7" ext-link-type="DOI">10.1007/s11269-008-9397-7</ext-link>, 2009.</mixed-citation></ref>
      <ref id="bib1.bibx11"><label>Farmer(2016)</label><?label ordvstk1?><mixed-citation>Farmer, W. H.: Ordinary kriging as a tool to estimate historical daily streamflow records, Hydrol. Earth Syst. Sci., 20, 2721–2735, <ext-link xlink:href="https://doi.org/10.5194/hess-20-2721-2016" ext-link-type="DOI">10.5194/hess-20-2721-2016</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx12"><label>Ferkingstad and Rue(2015)</label><?label Egil?><mixed-citation>Ferkingstad, E. and Rue, H.: Improving the INLA approach for approximate
Bayesian inference for latent Gaussian models, Electron. J.
Stat., 9, 2706–2731, <ext-link xlink:href="https://doi.org/10.1214/15-EJS1092" ext-link-type="DOI">10.1214/15-EJS1092</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx13"><label>Fiering(1963)</label><?label shortrec1?><mixed-citation>
Fiering, M.: Use of correlation to improve estimates of the mean and variance,
USGS Publications Warehouse, Washington, 1963.</mixed-citation></ref>
      <ref id="bib1.bibx14"><label>Fong et al.(2009)Fong, Rue, and Wakefield</label><?label InlaFast2?><mixed-citation>Fong, Y., Rue, H., and Wakefield, J.: Bayesian inference for generalized linear
mixed models, Biostatistics, 11, 397–412,
<ext-link xlink:href="https://doi.org/10.1093/biostatistics/kxp053" ext-link-type="DOI">10.1093/biostatistics/kxp053</ext-link>, 2009.</mixed-citation></ref>
      <ref id="bib1.bibx15"><label>Fuglstad et al.(2019)Fuglstad, Simpson, Lindgren, and Rue</label><?label PC2?><mixed-citation>Fuglstad, G.-A., Simpson, D., Lindgren, F., and Rue, H.: Constructing Priors
that Penalize the Complexity of Gaussian Random Fields, J.
Am. Stat. Assoc., 114, 445–452,
<ext-link xlink:href="https://doi.org/10.1080/01621459.2017.1415907" ext-link-type="DOI">10.1080/01621459.2017.1415907</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx16"><label>Førland(1979)</label><?label Forland?><mixed-citation>
Førland, E. J.: Nedbørens høydeavhengighet, Klima, 2, 3–24,  1979.</mixed-citation></ref>
      <ref id="bib1.bibx17"><label>Gamerman and Lopes(2006)</label><?label mcmc?><mixed-citation>
Gamerman, D. and Lopes, H. F.: Markov chain Monte Carlo: stochastic simulation
for Bayesian inference, 2 Edn., Chapman and Hall/CRC,   Boca Raton, 2006.</mixed-citation></ref>
      <ref id="bib1.bibx18"><label>Gneiting and Raftery(2007)</label><?label Gneiting?><mixed-citation>Gneiting, T. and Raftery, A. E.: Strictly Proper Scoring Rules,
Prediction, and Estimation, J. Am. Stat.
Assoc., 102, 359–378, <ext-link xlink:href="https://doi.org/10.1198/016214506000001437" ext-link-type="DOI">10.1198/016214506000001437</ext-link>, 2007.</mixed-citation></ref>
      <?pagebreak page4133?><ref id="bib1.bibx19"><label>Gottschalk(1993)</label><?label Gottschalk1993a?><mixed-citation>Gottschalk, L.: Correlation and covariance of runoff, Stoch. Hydrol.
Hydraul., 7, 85–101, <ext-link xlink:href="https://doi.org/10.1007/BF01581418" ext-link-type="DOI">10.1007/BF01581418</ext-link>, 1993.</mixed-citation></ref>
      <ref id="bib1.bibx20"><label>Gottschalk et al.(1979)Gottschalk, Jensen, Lundquist, Solantie,
and Tollan</label><?label Nordic?><mixed-citation>
Gottschalk, L., Jensen, J. L., Lundquist, D., Solantie, R., and Tollan, A.:
Hydrologic Regions in the Nordic Countries, Hydrol. Res.,  10, 273–286,
1979.</mixed-citation></ref>
      <ref id="bib1.bibx21"><label>Guillot et al.(2014)Guillot, Vitalis, le Rouzic, and
Gautier</label><?label genome?><mixed-citation>Guillot, G., Vitalis, R., le Rouzic, A., and Gautier, M.: Detecting correlation
between allele frequencies and environmental variables as a signature of
selection. A fast computational approach for genome-wide studies, Spat.
Stat., 8, 145–155, <ext-link xlink:href="https://doi.org/10.1016/j.spasta.2013.08.001" ext-link-type="DOI">10.1016/j.spasta.2013.08.001</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx22"><label>Guttorp and Gneiting(2006)</label><?label Matern?><mixed-citation>Guttorp, P. and Gneiting, T.: Studies in the history of probability and
statistics XLIX On the Matérn correlation family, Biometrika, 93,
989–995, <ext-link xlink:href="https://doi.org/10.1093/biomet/93.4.989" ext-link-type="DOI">10.1093/biomet/93.4.989</ext-link>, 2006.</mixed-citation></ref>
      <ref id="bib1.bibx23"><label>Hirsch(1982)</label><?label hirsch?><mixed-citation>
Hirsch, R. M.: A comparison of four record extension techniques, Water Resour.
Res., 8,  1081–1088, 1982.</mixed-citation></ref>
      <ref id="bib1.bibx24"><label>Huang et al.(2017)Huang, Malone, Minasny, Mcbratney, and
Triantafilis</label><?label evaluatingINLASPDE?><mixed-citation>Huang, J., Malone, B., Minasny, B., Mcbratney, A., and Triantafilis, J.:
Evaluating a Bayesian modelling approach (INLA-SPDE) for environmental
mapping, Sci.  Total Environ., 609, 621–632,
<ext-link xlink:href="https://doi.org/10.1016/j.scitotenv.2017.07.201" ext-link-type="DOI">10.1016/j.scitotenv.2017.07.201</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx25"><label>Ingebrigtsen et al.(2014)Ingebrigtsen, Lindgren, and
Steinsland</label><?label Rikke1?><mixed-citation>Ingebrigtsen, R., Lindgren, F., and Steinsland, I.: Spatial models with
explanatory variables in the dependence structure, Spat. Stat., 8, 20–38, <ext-link xlink:href="https://doi.org/10.1016/j.spasta.2013.06.002" ext-link-type="DOI">10.1016/j.spasta.2013.06.002</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx26"><label>Ingebrigtsen et al.(2015)Ingebrigtsen, Lindgren, Steinsland, and
Martino</label><?label Rikke2?><mixed-citation>Ingebrigtsen, R., Lindgren, F., Steinsland, I., and Martino, S.: Estimation of
a non-stationary model for annual precipitation in southern Norway using
replicates of the spatial field, Spat. Stat., 14, 338–364,
<ext-link xlink:href="https://doi.org/10.1016/j.spasta.2015.07.003" ext-link-type="DOI">10.1016/j.spasta.2015.07.003</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx27"><label>Khan and Warner(2018)</label><?label INLAwin1?><mixed-citation>
Khan, D. and Warner, M.: A Bayesian spatial and temporal modeling approach to
mapping geographic variation in mortality rates for subnational areas with
r-inla, Journal of Data Science, 18, 147–182, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx28"><label>Laaha and Blöschl(2005)</label><?label shortrec2?><mixed-citation>Laaha, G. and Blöschl, G.: Low flow estimates from short stream flow records
– a comparison of methods, J. Hydrol., 306, 264–286,
<ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2004.09.012" ext-link-type="DOI">10.1016/j.jhydrol.2004.09.012</ext-link>, 2005.</mixed-citation></ref>
      <ref id="bib1.bibx29"><label>Lindgren et al.(2011)Lindgren, Rue, and Lindström</label><?label SPDELindgren?><mixed-citation>Lindgren, F., Rue, H., and Lindström, J.: An explicit link between Gaussian
fields and Gaussian Markov random fields: the stochastic partial
differential equation approach, J. Roy. Stat. Soc.-B, 73, 423–498,
<ext-link xlink:href="https://doi.org/10.1111/j.1467-9868.2011.00777.x" ext-link-type="DOI">10.1111/j.1467-9868.2011.00777.x</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx30"><label>Martino et al.(2011)Martino, Akerkar, and Rue</label><?label MartinoFast?><mixed-citation>Martino, S., Akerkar, R., and Rue, H.: Approximate Bayesian Inference for
Survival Models, Scand. J. Stat., 38, 514–528,
<ext-link xlink:href="https://doi.org/10.1111/j.1467-9469.2010.00715.x" ext-link-type="DOI">10.1111/j.1467-9469.2010.00715.x</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx31"><label>Matalas and Jacobs(1964)</label><?label Matalas?><mixed-citation>Matalas, N. C. and Jacobs, B.: A correlation procedure for augmenting
hydrologic data, U.S. Geol. Surv. Prof. Pap., 434-E, <ext-link xlink:href="https://doi.org/10.3133/pp434E" ext-link-type="DOI">10.3133/pp434E</ext-link>, 1964.</mixed-citation></ref>
      <ref id="bib1.bibx32"><label>Merz and Blöschl(2005)</label><?label euclid2?><mixed-citation>Merz, R. and Blöschl, G.: Flood frequency regionalisation – spatial proximity
vs. catchment attributes, J. Hydrol., 302, 283–306,
<ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2004.07.018" ext-link-type="DOI">10.1016/j.jhydrol.2004.07.018</ext-link>, 2005.</mixed-citation></ref>
      <ref id="bib1.bibx33"><label>Moraga et al.(2017)Moraga, Cramb, Mengersen, and Pagano</label><?label areadata1?><mixed-citation>Moraga, P., Cramb, S. M., Mengersen, K. L., and Pagano, M.: A geostatistical
model for combined analysis of point-level and area-level data using INLA
and SPDE, Spat. Stat., 21, 27–41,
<ext-link xlink:href="https://doi.org/10.1016/j.spasta.2017.04.006" ext-link-type="DOI">10.1016/j.spasta.2017.04.006</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx34"><label>Opitz et al.(2018)Opitz, Huser, Bakka, and Rue</label><?label INLAextreme?><mixed-citation>Opitz, T., Huser, R., Bakka, H., and Rue, H.: INLA goes extreme: Bayesian tail
regression for the estimation of high spatio-temporal quantiles, Extremes,
21,  331–462, <ext-link xlink:href="https://doi.org/10.1007/s10687-018-0324-x" ext-link-type="DOI">10.1007/s10687-018-0324-x</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx35"><label>Patil and Stieglitz(2012)</label><?label neighCatch?><mixed-citation>Patil, S. and Stieglitz, M.: Controls on hydrologic similarity: role of nearby gauged catchments for prediction at an ungauged catchment, Hydrol. Earth Syst. Sci., 16, 551–562, <ext-link xlink:href="https://doi.org/10.5194/hess-16-551-2012" ext-link-type="DOI">10.5194/hess-16-551-2012</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx36"><label>Petersen-Øverleir(2004)</label><?label hetero?><mixed-citation>Petersen-Øverleir, A.: Accounting for heteroscedasticity in rating curve
estimates, J. Hydrol., 292, 173–181,
<ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2003.12.024" ext-link-type="DOI">10.1016/j.jhydrol.2003.12.024</ext-link>, 2004.</mixed-citation></ref>
      <ref id="bib1.bib1"><label>1</label><?label 1?><mixed-citation>Roksvåg, T.:
Code for implementing the centroid model, <ext-link xlink:href="https://doi.org/10.5281/zenodo.3630348" ext-link-type="DOI">10.5281/zenodo.3630348</ext-link>, last access: 29 January 2020.</mixed-citation></ref>
      <ref id="bib1.bibx37"><label>Roksvåg et al.(2020)Roksvåg, Steinsland, and Engeland</label><?label PandA?><mixed-citation>Roksvåg, T., Steinsland, I., and Engeland, K.: A geostatistical two field
model that combines point observations and nested areal observations, and
quantifies long-term spatial variability – A case study of annual runoff
predictions in the Voss area, <ext-link xlink:href="https://arxiv.org/abs/1904.02519v2">arXiv:1904.02519v2</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx38"><label>Rue and Held(2005)</label><?label GMRFbook?><mixed-citation>
Rue, H. and Held, L.: Gaussian Markov Random Fields: Theory and
Applications, vol. 104 of Monographs on Statistics and Applied
Probability, Chapman &amp; Hall, London, 2005.</mixed-citation></ref>
      <ref id="bib1.bibx39"><label>Rue et al.(2009)Rue, Martino, and Chopin</label><?label mainINLA?><mixed-citation>Rue, H., Martino, S., and Chopin, N.: Approximate Bayesian inference for
latent Gaussian models using integrated nested Laplace approximations,
J. Roy. Stat. Soc.-B,
71, 319–392, <ext-link xlink:href="https://doi.org/10.1111/j.1467-9868.2008.00700.x" ext-link-type="DOI">10.1111/j.1467-9868.2008.00700.x</ext-link>, 2009.</mixed-citation></ref>
      <ref id="bib1.bibx40"><label>Sauquet et al.(2000)Sauquet, Gottschalk, and Lebois</label><?label Sauquet2000?><mixed-citation>Sauquet, E., Gottschalk, L., and Lebois, E.: Mapping average annual runoff: A
hierarchical approach applying a stochastic interpolation scheme,
Hydrolog. Sci. J., 45, 799–815, <ext-link xlink:href="https://doi.org/10.1080/02626660009492385" ext-link-type="DOI">10.1080/02626660009492385</ext-link>,
2000.</mixed-citation></ref>
      <ref id="bib1.bibx41"><label>Siegel(1956)</label><?label wilcoxon?><mixed-citation>
Siegel, S.: Non-parametric statistics for the behavioral sciences,
McGraw-Hill, New York, 1956.</mixed-citation></ref>
      <ref id="bib1.bibx42"><label>Simpson et al.(2017)Simpson, Rue, Riebler, Martins, and
Sørbye</label><?label PC1?><mixed-citation>Simpson, D., Rue, H., Riebler, A., Martins, T. G., and Sørbye, S. H.:
Penalising Model Component Complexity: A Principled, Practical
Approach to Constructing Priors, Stat. Sci., 32, 1–28,
<ext-link xlink:href="https://doi.org/10.1214/16-STS576" ext-link-type="DOI">10.1214/16-STS576</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx43"><?xmltex \def\ref@label{{Sk{\o}ien et~al.(2006)Sk{\o}ien, Merz, and Bl\"{o}schl}}?><label>Skøien et al.(2006)Skøien, Merz, and Blöschl</label><?label topkriging?><mixed-citation>Skøien, J. O., Merz, R., and Blöschl, G.: Top-kriging - geostatistics on stream networks, Hydrol. Earth Syst. Sci., 10, 277–287, <ext-link xlink:href="https://doi.org/10.5194/hess-10-277-2006" ext-link-type="DOI">10.5194/hess-10-277-2006</ext-link>, 2006.</mixed-citation></ref>
      <ref id="bib1.bibx44"><label>Skøien et al.(2014)Skøien, Blöschl, Laaha, Pebesma, Parajka, and
Viglione</label><?label ordvstk2?><mixed-citation>Skøien, J., Blöschl, G., Laaha, G., Pebesma, E., Parajka, J., and Viglione,
A.: rtop: An R package for interpolation of data with a variable spatial
support, with an example from river networks, Comput.  Geosci., 67,
180–190, <ext-link xlink:href="https://doi.org/10.1016/j.cageo.2014.02.009" ext-link-type="DOI">10.1016/j.cageo.2014.02.009</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx45"><label>Skøien and Blöschl(2007)</label><?label tk2?><mixed-citation>Skøien, J. O. and Blöschl, G.: Spatiotemporal topological kriging of runoff
time series, Water Resour. Res., 43, W09419, <ext-link xlink:href="https://doi.org/10.1029/2006WR005760" ext-link-type="DOI">10.1029/2006WR005760</ext-link>, 2007.</mixed-citation></ref>
      <ref id="bib1.bibx46"><label>Skøien et al.(2003)Skøien, Blöschl, and Western</label><?label euclid3?><mixed-citation>Skøien, J. O., Blöschl, G., and Western, A. W.: Characteristic space scales
and timescales in hydrology, Water Resour. Res., 39, 1304,
<ext-link xlink:href="https://doi.org/10.1029/2002WR001736" ext-link-type="DOI">10.1029/2002WR001736</ext-link>, 2003.</mixed-citation></ref>
      <ref id="bib1.bibx47"><label>Stohl et al.(2008)Stohl, Forster, and Sodemann</label><?label Stohl?><mixed-citation>Stohl, A., Forster, C., and Sodemann, H.: Remote sources of water vapor forming
precipitation on the Norwegian west coast at 60<inline-formula><mml:math id="M439" display="inline"><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:math></inline-formula> N – a tale of
hurricanes and an atmospheric river, J. Geophys. Res.-Atmos., 113, D05102, <ext-link xlink:href="https://doi.org/10.1029/2007JD009006" ext-link-type="DOI">10.1029/2007JD009006</ext-link>, 2008.</mixed-citation></ref>
      <ref id="bib1.bibx48"><label>Sælthun et al.(1997)Sælthun, Tveito, Bøsnes, and
Roald</label><?label flomNVE?><mixed-citation>
Sælthun, N., Tveito, O., Bøsnes, T., and Roald, L.: Regional
flomfrekvensanalyse for norske vassdrag, Tech. Rep., Oslo, NVE, 1997.</mixed-citation></ref>
      <ref id="bib1.bibx49"><label>Viglione et al.(2013)Viglione, Parajka, Rogger, L. Salinas, Laaha,
Sivapalan, and Blöschl</label><?label Austriacompare?><mixed-citation>Viglione, A., Parajka, J., Rogger, M., Salinas, J. L., Laaha, G., Sivapalan, M., and Blöschl, G.: Comparative assessment of predictions in ungauged basins – Part 3: Runoff signatures in Austria, Hydrol. Earth Syst. Sci., 17, 2263–2279, <ext-link xlink:href="https://doi.org/10.5194/hess-17-2263-2013" ext-link-type="DOI">10.5194/hess-17-2263-2013</ext-link>, 2013.</mixed-citation></ref>
      <?pagebreak page4134?><ref id="bib1.bibx50"><label>Vogel and Stedinger(1985)</label><?label shortrec3?><mixed-citation>Vogel, R. M. and Stedinger, J. R.: Minimum variance streamflow record
augmentation procedures, Water Resour. Res., 21, 715–723,
<ext-link xlink:href="https://doi.org/10.1029/WR021i005p00715" ext-link-type="DOI">10.1029/WR021i005p00715</ext-link>, 1985.</mixed-citation></ref>
      <ref id="bib1.bibx51"><label>Yuan et al.(2017)Yuan, Bachl, Lindgren, Borchers, Illian, Buckland,
Rue, and Gerrodette</label><?label whales?><mixed-citation>Yuan, Y., Bachl, F., Lindgren, F., Borchers, D., Illian, J., Buckland, S., Rue,
H., and Gerrodette, T.: Point process models for spatio-temporal distance
sampling data from a large-scale survey of blue whales,   Ann. Appl. Stat., 11, 2270–2297, <ext-link xlink:href="https://doi.org/10.1214/17-AOAS1078" ext-link-type="DOI">10.1214/17-AOAS1078</ext-link>, 2017.</mixed-citation></ref>

  </ref-list></back>
    <!--<article-title-html>Estimation of annual runoff by exploiting long-term spatial patterns and short records within a geostatistical framework</article-title-html>
<abstract-html><p>In this article, we present a Bayesian geostatistical framework that is particularly suitable for interpolation of hydrological data when the available dataset is sparse and includes both long and short records of runoff. A key feature of the proposed framework is that several years of runoff are modelled simultaneously with two spatial fields: one that is common for all years under study that represents the runoff generation due to long-term (climatic) conditions and one that is year-specific. The climatic spatial field captures how short records of runoff from partially gauged catchments vary relative to longer time series from other catchments, and transfers this information across years. To make the Bayesian model computationally feasible and fast, we use integrated nested Laplace approximations (INLAs) and the stochastic partial differential equation (SPDE) approach to spatial modelling.</p><p>The geostatistical framework is demonstrated by filling in missing values of annual runoff and by predicting mean annual runoff for around 200 catchments in Norway. The predictive performance is compared to top-kriging (interpolation method) and simple linear regression (record augmentation method). The results show that if the runoff is driven by processes that are repeated over time (e.g. orographic precipitation patterns), the value of including short records in the suggested model is large. For partially gauged catchments the suggested framework performs better than comparable methods, and one annual observation from the target catchment can lead to a 50&thinsp;% reduction in root mean squared error (RMSE) compared to when no observations are available from the target catchment. We also find that short records safely can be included in the framework regardless of the spatial characteristics of the underlying climate, and down to record lengths of 1 year.</p></abstract-html>
<ref-html id="bib1.bib1"><label>Adamowski and Bocci(2001)</label><mixed-citation>
Adamowski, K. and Bocci, C.: Geostatistical regional trend detection in river
flow data, Hydrol. Process., 15, 3331–3341, <a href="https://doi.org/10.1002/hyp.1045" target="_blank">https://doi.org/10.1002/hyp.1045</a>,
2001.
</mixed-citation></ref-html>
<ref-html id="bib1.bib2"><label>Bakka et al.(2018)Bakka, Rue, Fuglstad, Riebler, Bolin, Illian,
Krainski, Simpson, and Lindgren</label><mixed-citation>
Bakka, H., Rue, H., Fuglstad, G.-A., Riebler, A., Bolin, D., Illian, J.,
Krainski, E., Simpson, D., and Lindgren, F.: Spatial modeling with R-INLA: A
review, WIREs Computational Statistics, 10, e1443, <a href="https://doi.org/10.1002/wics.1443" target="_blank">https://doi.org/10.1002/wics.1443</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib3"><label>Banerjee et al.(2004)Banerjee, Gelfand, and Carlin</label><mixed-citation>
Banerjee, S., Gelfand, A., and Carlin, B.: Hierarchical Modeling and Analysis
for Spatial Data, vol. 101 of Monographs on Statistics and Applied
Probability, Chapman &amp; Hall, Boca Raton, Florida, 2004.
</mixed-citation></ref-html>
<ref-html id="bib1.bib4"><label>Blöschl et al.(2013)Blöschl, Sivapalan, Wagener, Viglione, and
Savenije</label><mixed-citation>
Blöschl, G., Sivapalan, M., Wagener, T., Viglione, A., and Savenije, H.:
Runoff Prediction in Ungauged Basins: Synthesis across Processes,
Places and Scales, Camebridge University Press, Cambridge, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib5"><label>Brenner and Scott(2008)</label><mixed-citation>
Brenner, S. and Scott, L.: The Mathematical Theory of Finite Element
Methods, 3rd Edition. Vol. 15 of Texts in Applied Mathematics,
Springer, New York, 2008.
</mixed-citation></ref-html>
<ref-html id="bib1.bib6"><label>Casella and Berger(1990)</label><mixed-citation>
Casella, G. and Berger, R.: Statistical Inference, Duxbury Press, Belmont, 1990.
</mixed-citation></ref-html>
<ref-html id="bib1.bib7"><label>Cressie(1993)</label><mixed-citation>
Cressie, N.: Statistics for spatial data, J. Wiley &amp; Sons, New York, 1993.
</mixed-citation></ref-html>
<ref-html id="bib1.bib8"><label>Eidsvik et al.(2012)Eidsvik, Finley, Banerjee, and Rue</label><mixed-citation>
Eidsvik, J., Finley, A. O., Banerjee, S., and Rue, H.: Approximate Bayesian
inference for large spatial datasets using predictive process models,
Comput. Stat. Data An., 56, 1362–1380,
<a href="https://doi.org/10.1016/j.csda.2011.10.022" target="_blank">https://doi.org/10.1016/j.csda.2011.10.022</a>, 2012.
</mixed-citation></ref-html>
<ref-html id="bib1.bib9"><label>Eidsvik et al.(2015)Eidsvik, Mukerji, and Bhattacharjya</label><mixed-citation>
Eidsvik, J., Mukerji, T., and Bhattacharjya, D.: Value of Information in the
Earth Sciences: Integrating Spatial Modeling and Decision Analysis, Cambridge
University Press, Cambridge, <a href="https://doi.org/10.1017/CBO9781139628785" target="_blank">https://doi.org/10.1017/CBO9781139628785</a>, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib10"><label>Engeland and Hisdal(2009)</label><mixed-citation>
Engeland, K. and Hisdal, H.: A Comparison of Low Flow Estimates in Ungauged
Catchments Using Regional Regression and the HBV-Model, Water Resour.
Manag., 23, 2567–2586, <a href="https://doi.org/10.1007/s11269-008-9397-7" target="_blank">https://doi.org/10.1007/s11269-008-9397-7</a>, 2009.
</mixed-citation></ref-html>
<ref-html id="bib1.bib11"><label>Farmer(2016)</label><mixed-citation>
Farmer, W. H.: Ordinary kriging as a tool to estimate historical daily streamflow records, Hydrol. Earth Syst. Sci., 20, 2721–2735, <a href="https://doi.org/10.5194/hess-20-2721-2016" target="_blank">https://doi.org/10.5194/hess-20-2721-2016</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib12"><label>Ferkingstad and Rue(2015)</label><mixed-citation>
Ferkingstad, E. and Rue, H.: Improving the INLA approach for approximate
Bayesian inference for latent Gaussian models, Electron. J.
Stat., 9, 2706–2731, <a href="https://doi.org/10.1214/15-EJS1092" target="_blank">https://doi.org/10.1214/15-EJS1092</a>, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib13"><label>Fiering(1963)</label><mixed-citation>
Fiering, M.: Use of correlation to improve estimates of the mean and variance,
USGS Publications Warehouse, Washington, 1963.
</mixed-citation></ref-html>
<ref-html id="bib1.bib14"><label>Fong et al.(2009)Fong, Rue, and Wakefield</label><mixed-citation>
Fong, Y., Rue, H., and Wakefield, J.: Bayesian inference for generalized linear
mixed models, Biostatistics, 11, 397–412,
<a href="https://doi.org/10.1093/biostatistics/kxp053" target="_blank">https://doi.org/10.1093/biostatistics/kxp053</a>, 2009.
</mixed-citation></ref-html>
<ref-html id="bib1.bib15"><label>Fuglstad et al.(2019)Fuglstad, Simpson, Lindgren, and Rue</label><mixed-citation>
Fuglstad, G.-A., Simpson, D., Lindgren, F., and Rue, H.: Constructing Priors
that Penalize the Complexity of Gaussian Random Fields, J.
Am. Stat. Assoc., 114, 445–452,
<a href="https://doi.org/10.1080/01621459.2017.1415907" target="_blank">https://doi.org/10.1080/01621459.2017.1415907</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib16"><label>Førland(1979)</label><mixed-citation>
Førland, E. J.: Nedbørens høydeavhengighet, Klima, 2, 3–24,  1979.
</mixed-citation></ref-html>
<ref-html id="bib1.bib17"><label>Gamerman and Lopes(2006)</label><mixed-citation>
Gamerman, D. and Lopes, H. F.: Markov chain Monte Carlo: stochastic simulation
for Bayesian inference, 2 Edn., Chapman and Hall/CRC,   Boca Raton, 2006.
</mixed-citation></ref-html>
<ref-html id="bib1.bib18"><label>Gneiting and Raftery(2007)</label><mixed-citation>
Gneiting, T. and Raftery, A. E.: Strictly Proper Scoring Rules,
Prediction, and Estimation, J. Am. Stat.
Assoc., 102, 359–378, <a href="https://doi.org/10.1198/016214506000001437" target="_blank">https://doi.org/10.1198/016214506000001437</a>, 2007.
</mixed-citation></ref-html>
<ref-html id="bib1.bib19"><label>Gottschalk(1993)</label><mixed-citation>
Gottschalk, L.: Correlation and covariance of runoff, Stoch. Hydrol.
Hydraul., 7, 85–101, <a href="https://doi.org/10.1007/BF01581418" target="_blank">https://doi.org/10.1007/BF01581418</a>, 1993.
</mixed-citation></ref-html>
<ref-html id="bib1.bib20"><label>Gottschalk et al.(1979)Gottschalk, Jensen, Lundquist, Solantie,
and Tollan</label><mixed-citation>
Gottschalk, L., Jensen, J. L., Lundquist, D., Solantie, R., and Tollan, A.:
Hydrologic Regions in the Nordic Countries, Hydrol. Res.,  10, 273–286,
1979.
</mixed-citation></ref-html>
<ref-html id="bib1.bib21"><label>Guillot et al.(2014)Guillot, Vitalis, le Rouzic, and
Gautier</label><mixed-citation>
Guillot, G., Vitalis, R., le Rouzic, A., and Gautier, M.: Detecting correlation
between allele frequencies and environmental variables as a signature of
selection. A fast computational approach for genome-wide studies, Spat.
Stat., 8, 145–155, <a href="https://doi.org/10.1016/j.spasta.2013.08.001" target="_blank">https://doi.org/10.1016/j.spasta.2013.08.001</a>, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib22"><label>Guttorp and Gneiting(2006)</label><mixed-citation>
Guttorp, P. and Gneiting, T.: Studies in the history of probability and
statistics XLIX On the Matérn correlation family, Biometrika, 93,
989–995, <a href="https://doi.org/10.1093/biomet/93.4.989" target="_blank">https://doi.org/10.1093/biomet/93.4.989</a>, 2006.
</mixed-citation></ref-html>
<ref-html id="bib1.bib23"><label>Hirsch(1982)</label><mixed-citation>
Hirsch, R. M.: A comparison of four record extension techniques, Water Resour.
Res., 8,  1081–1088, 1982.
</mixed-citation></ref-html>
<ref-html id="bib1.bib24"><label>Huang et al.(2017)Huang, Malone, Minasny, Mcbratney, and
Triantafilis</label><mixed-citation>
Huang, J., Malone, B., Minasny, B., Mcbratney, A., and Triantafilis, J.:
Evaluating a Bayesian modelling approach (INLA-SPDE) for environmental
mapping, Sci.  Total Environ., 609, 621–632,
<a href="https://doi.org/10.1016/j.scitotenv.2017.07.201" target="_blank">https://doi.org/10.1016/j.scitotenv.2017.07.201</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib25"><label>Ingebrigtsen et al.(2014)Ingebrigtsen, Lindgren, and
Steinsland</label><mixed-citation>
Ingebrigtsen, R., Lindgren, F., and Steinsland, I.: Spatial models with
explanatory variables in the dependence structure, Spat. Stat., 8, 20–38, <a href="https://doi.org/10.1016/j.spasta.2013.06.002" target="_blank">https://doi.org/10.1016/j.spasta.2013.06.002</a>, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib26"><label>Ingebrigtsen et al.(2015)Ingebrigtsen, Lindgren, Steinsland, and
Martino</label><mixed-citation>
Ingebrigtsen, R., Lindgren, F., Steinsland, I., and Martino, S.: Estimation of
a non-stationary model for annual precipitation in southern Norway using
replicates of the spatial field, Spat. Stat., 14, 338–364,
<a href="https://doi.org/10.1016/j.spasta.2015.07.003" target="_blank">https://doi.org/10.1016/j.spasta.2015.07.003</a>, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib27"><label>Khan and Warner(2018)</label><mixed-citation>
Khan, D. and Warner, M.: A Bayesian spatial and temporal modeling approach to
mapping geographic variation in mortality rates for subnational areas with
r-inla, Journal of Data Science, 18, 147–182, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib28"><label>Laaha and Blöschl(2005)</label><mixed-citation>
Laaha, G. and Blöschl, G.: Low flow estimates from short stream flow records
– a comparison of methods, J. Hydrol., 306, 264–286,
<a href="https://doi.org/10.1016/j.jhydrol.2004.09.012" target="_blank">https://doi.org/10.1016/j.jhydrol.2004.09.012</a>, 2005.
</mixed-citation></ref-html>
<ref-html id="bib1.bib29"><label>Lindgren et al.(2011)Lindgren, Rue, and Lindström</label><mixed-citation>
Lindgren, F., Rue, H., and Lindström, J.: An explicit link between Gaussian
fields and Gaussian Markov random fields: the stochastic partial
differential equation approach, J. Roy. Stat. Soc.-B, 73, 423–498,
<a href="https://doi.org/10.1111/j.1467-9868.2011.00777.x" target="_blank">https://doi.org/10.1111/j.1467-9868.2011.00777.x</a>, 2011.
</mixed-citation></ref-html>
<ref-html id="bib1.bib30"><label>Martino et al.(2011)Martino, Akerkar, and Rue</label><mixed-citation>
Martino, S., Akerkar, R., and Rue, H.: Approximate Bayesian Inference for
Survival Models, Scand. J. Stat., 38, 514–528,
<a href="https://doi.org/10.1111/j.1467-9469.2010.00715.x" target="_blank">https://doi.org/10.1111/j.1467-9469.2010.00715.x</a>, 2011.
</mixed-citation></ref-html>
<ref-html id="bib1.bib31"><label>Matalas and Jacobs(1964)</label><mixed-citation>
Matalas, N. C. and Jacobs, B.: A correlation procedure for augmenting
hydrologic data, U.S. Geol. Surv. Prof. Pap., 434-E, <a href="https://doi.org/10.3133/pp434E" target="_blank">https://doi.org/10.3133/pp434E</a>, 1964.
</mixed-citation></ref-html>
<ref-html id="bib1.bib32"><label>Merz and Blöschl(2005)</label><mixed-citation>
Merz, R. and Blöschl, G.: Flood frequency regionalisation – spatial proximity
vs. catchment attributes, J. Hydrol., 302, 283–306,
<a href="https://doi.org/10.1016/j.jhydrol.2004.07.018" target="_blank">https://doi.org/10.1016/j.jhydrol.2004.07.018</a>, 2005.
</mixed-citation></ref-html>
<ref-html id="bib1.bib33"><label>Moraga et al.(2017)Moraga, Cramb, Mengersen, and Pagano</label><mixed-citation>
Moraga, P., Cramb, S. M., Mengersen, K. L., and Pagano, M.: A geostatistical
model for combined analysis of point-level and area-level data using INLA
and SPDE, Spat. Stat., 21, 27–41,
<a href="https://doi.org/10.1016/j.spasta.2017.04.006" target="_blank">https://doi.org/10.1016/j.spasta.2017.04.006</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib34"><label>Opitz et al.(2018)Opitz, Huser, Bakka, and Rue</label><mixed-citation>
Opitz, T., Huser, R., Bakka, H., and Rue, H.: INLA goes extreme: Bayesian tail
regression for the estimation of high spatio-temporal quantiles, Extremes,
21,  331–462, <a href="https://doi.org/10.1007/s10687-018-0324-x" target="_blank">https://doi.org/10.1007/s10687-018-0324-x</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib35"><label>Patil and Stieglitz(2012)</label><mixed-citation>
Patil, S. and Stieglitz, M.: Controls on hydrologic similarity: role of nearby gauged catchments for prediction at an ungauged catchment, Hydrol. Earth Syst. Sci., 16, 551–562, <a href="https://doi.org/10.5194/hess-16-551-2012" target="_blank">https://doi.org/10.5194/hess-16-551-2012</a>, 2012.
</mixed-citation></ref-html>
<ref-html id="bib1.bib36"><label>Petersen-Øverleir(2004)</label><mixed-citation>
Petersen-Øverleir, A.: Accounting for heteroscedasticity in rating curve
estimates, J. Hydrol., 292, 173–181,
<a href="https://doi.org/10.1016/j.jhydrol.2003.12.024" target="_blank">https://doi.org/10.1016/j.jhydrol.2003.12.024</a>, 2004.
</mixed-citation></ref-html>
<ref-html id="bib1.bib37"><label>1</label><mixed-citation>
Roksvåg, T.:
Code for implementing the centroid model, <a href="https://doi.org/10.5281/zenodo.3630348" target="_blank">https://doi.org/10.5281/zenodo.3630348</a>, last access: 29 January 2020.
</mixed-citation></ref-html>
<ref-html id="bib1.bib38"><label>Roksvåg et al.(2020)Roksvåg, Steinsland, and Engeland</label><mixed-citation>
Roksvåg, T., Steinsland, I., and Engeland, K.: A geostatistical two field
model that combines point observations and nested areal observations, and
quantifies long-term spatial variability – A case study of annual runoff
predictions in the Voss area, <a href="https://arxiv.org/abs/1904.02519v2" target="_blank">arXiv:1904.02519v2</a>, 2020.
</mixed-citation></ref-html>
<ref-html id="bib1.bib39"><label>Rue and Held(2005)</label><mixed-citation>
Rue, H. and Held, L.: Gaussian Markov Random Fields: Theory and
Applications, vol. 104 of Monographs on Statistics and Applied
Probability, Chapman &amp; Hall, London, 2005.
</mixed-citation></ref-html>
<ref-html id="bib1.bib40"><label>Rue et al.(2009)Rue, Martino, and Chopin</label><mixed-citation>
Rue, H., Martino, S., and Chopin, N.: Approximate Bayesian inference for
latent Gaussian models using integrated nested Laplace approximations,
J. Roy. Stat. Soc.-B,
71, 319–392, <a href="https://doi.org/10.1111/j.1467-9868.2008.00700.x" target="_blank">https://doi.org/10.1111/j.1467-9868.2008.00700.x</a>, 2009.
</mixed-citation></ref-html>
<ref-html id="bib1.bib41"><label>Sauquet et al.(2000)Sauquet, Gottschalk, and Lebois</label><mixed-citation>
Sauquet, E., Gottschalk, L., and Lebois, E.: Mapping average annual runoff: A
hierarchical approach applying a stochastic interpolation scheme,
Hydrolog. Sci. J., 45, 799–815, <a href="https://doi.org/10.1080/02626660009492385" target="_blank">https://doi.org/10.1080/02626660009492385</a>,
2000.
</mixed-citation></ref-html>
<ref-html id="bib1.bib42"><label>Siegel(1956)</label><mixed-citation>
Siegel, S.: Non-parametric statistics for the behavioral sciences,
McGraw-Hill, New York, 1956.
</mixed-citation></ref-html>
<ref-html id="bib1.bib43"><label>Simpson et al.(2017)Simpson, Rue, Riebler, Martins, and
Sørbye</label><mixed-citation>
Simpson, D., Rue, H., Riebler, A., Martins, T. G., and Sørbye, S. H.:
Penalising Model Component Complexity: A Principled, Practical
Approach to Constructing Priors, Stat. Sci., 32, 1–28,
<a href="https://doi.org/10.1214/16-STS576" target="_blank">https://doi.org/10.1214/16-STS576</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib44"><label>Skøien et al.(2006)Skøien, Merz, and Blöschl</label><mixed-citation>
Skøien, J. O., Merz, R., and Blöschl, G.: Top-kriging - geostatistics on stream networks, Hydrol. Earth Syst. Sci., 10, 277–287, <a href="https://doi.org/10.5194/hess-10-277-2006" target="_blank">https://doi.org/10.5194/hess-10-277-2006</a>, 2006.
</mixed-citation></ref-html>
<ref-html id="bib1.bib45"><label>Skøien et al.(2014)Skøien, Blöschl, Laaha, Pebesma, Parajka, and
Viglione</label><mixed-citation>
Skøien, J., Blöschl, G., Laaha, G., Pebesma, E., Parajka, J., and Viglione,
A.: rtop: An R package for interpolation of data with a variable spatial
support, with an example from river networks, Comput.  Geosci., 67,
180–190, <a href="https://doi.org/10.1016/j.cageo.2014.02.009" target="_blank">https://doi.org/10.1016/j.cageo.2014.02.009</a>, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib46"><label>Skøien and Blöschl(2007)</label><mixed-citation>
Skøien, J. O. and Blöschl, G.: Spatiotemporal topological kriging of runoff
time series, Water Resour. Res., 43, W09419, <a href="https://doi.org/10.1029/2006WR005760" target="_blank">https://doi.org/10.1029/2006WR005760</a>, 2007.
</mixed-citation></ref-html>
<ref-html id="bib1.bib47"><label>Skøien et al.(2003)Skøien, Blöschl, and Western</label><mixed-citation>
Skøien, J. O., Blöschl, G., and Western, A. W.: Characteristic space scales
and timescales in hydrology, Water Resour. Res., 39, 1304,
<a href="https://doi.org/10.1029/2002WR001736" target="_blank">https://doi.org/10.1029/2002WR001736</a>, 2003.
</mixed-citation></ref-html>
<ref-html id="bib1.bib48"><label>Stohl et al.(2008)Stohl, Forster, and Sodemann</label><mixed-citation>
Stohl, A., Forster, C., and Sodemann, H.: Remote sources of water vapor forming
precipitation on the Norwegian west coast at 60°&thinsp;N – a tale of
hurricanes and an atmospheric river, J. Geophys. Res.-Atmos., 113, D05102, <a href="https://doi.org/10.1029/2007JD009006" target="_blank">https://doi.org/10.1029/2007JD009006</a>, 2008.
</mixed-citation></ref-html>
<ref-html id="bib1.bib49"><label>Sælthun et al.(1997)Sælthun, Tveito, Bøsnes, and
Roald</label><mixed-citation>
Sælthun, N., Tveito, O., Bøsnes, T., and Roald, L.: Regional
flomfrekvensanalyse for norske vassdrag, Tech. Rep., Oslo, NVE, 1997.
</mixed-citation></ref-html>
<ref-html id="bib1.bib50"><label>Viglione et al.(2013)Viglione, Parajka, Rogger, L. Salinas, Laaha,
Sivapalan, and Blöschl</label><mixed-citation>
Viglione, A., Parajka, J., Rogger, M., Salinas, J. L., Laaha, G., Sivapalan, M., and Blöschl, G.: Comparative assessment of predictions in ungauged basins – Part 3: Runoff signatures in Austria, Hydrol. Earth Syst. Sci., 17, 2263–2279, <a href="https://doi.org/10.5194/hess-17-2263-2013" target="_blank">https://doi.org/10.5194/hess-17-2263-2013</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib51"><label>Vogel and Stedinger(1985)</label><mixed-citation>
Vogel, R. M. and Stedinger, J. R.: Minimum variance streamflow record
augmentation procedures, Water Resour. Res., 21, 715–723,
<a href="https://doi.org/10.1029/WR021i005p00715" target="_blank">https://doi.org/10.1029/WR021i005p00715</a>, 1985.
</mixed-citation></ref-html>
<ref-html id="bib1.bib52"><label>Yuan et al.(2017)Yuan, Bachl, Lindgren, Borchers, Illian, Buckland,
Rue, and Gerrodette</label><mixed-citation>
Yuan, Y., Bachl, F., Lindgren, F., Borchers, D., Illian, J., Buckland, S., Rue,
H., and Gerrodette, T.: Point process models for spatio-temporal distance
sampling data from a large-scale survey of blue whales,   Ann. Appl. Stat., 11, 2270–2297, <a href="https://doi.org/10.1214/17-AOAS1078" target="_blank">https://doi.org/10.1214/17-AOAS1078</a>, 2017.
</mixed-citation></ref-html>--></article>
