Introduction

HESS

Hydrology and Earth System Sciences

HESS

Hydrol. Earth Syst. Sci.

1607-7938

Copernicus Publications

Göttingen, Germany

10.5194/hess-22-929-2018

Monthly streamflow forecasting at varying spatial scales in the Rhine basin

Schick

Simon

simon.schick@giub.unibe.ch Rössler

Ole

https://orcid.org/0000-0003-2308-0907

Weingartner

Rolf

1Institute of Geography, University of Bern, Bern, Switzerland 2Oeschger Centre for Climate Change Research, University of Bern, Bern, Switzerland

Simon Schick (simon.schick@giub.unibe.ch)

2February2018

22 2 929942 28June2017 6July2017 6November2017 19December2017

This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit https://creativecommons.org/licenses/by/4.0/

This article is available from https://hess.copernicus.org/articles/22/929/2018/hess-22-929-2018.html

The full text article is available as a PDF file from https://hess.copernicus.org/articles/22/929/2018/hess-22-929-2018.pdf

Model output statistics (MOS) methods can be used to empirically relate an environmental variable of interest to predictions from earth system models (ESMs). This variable often belongs to a spatial scale not resolved by the ESM. Here, using the linear model fitted by least squares, we regress monthly mean streamflow of the Rhine River at Lobith and Basel against seasonal predictions of precipitation, surface air temperature, and runoff from the European Centre for Medium-Range Weather Forecasts. To address potential effects of a scale mismatch between the ESM's horizontal grid resolution and the hydrological application, the MOS method is further tested with an experiment conducted at the subcatchment scale. This experiment applies the MOS method to 133 additional gauging stations located within the Rhine basin and combines the forecasts from the subcatchments to predict streamflow at Lobith and Basel. In doing so, the MOS method is tested for catchments areas covering 4 orders of magnitude. Using data from the period 1981–2011, the results show that skill, with respect to climatology, is restricted on average to the first month ahead. This result holds for both the predictor combination that mimics the initial conditions and the predictor combinations that additionally include the dynamical seasonal predictions. The latter, however, reduce the mean absolute error of the former in the range of 5 to 12 %, which is consistently reproduced at the subcatchment scale. An additional experiment conducted for 5-day mean streamflow indicates that the dynamical predictions help to reduce uncertainties up to about 20 days ahead, but it also reveals some shortcomings of the present MOS method.

Introduction

Environmental forecasting at the subseasonal to seasonal timescale promises a basis for planning in e.g. energy production, agriculture, shipping, or water resources management. While the uncertainties of these forecasts are inherently large, they can be reduced when the quantity of interest is controlled by slowly varying and predictable phenomena. For example, the El Niño–Southern Oscillation plays an important role in predicting the atmosphere, and snow accumulation and melting often forms the backbone in predicting hydrological variables of the land surface .

In the case of streamflow forecasting, the ESP-revESP experiment proposed by provides a methodological framework to disentangle forecast uncertainty with respect to the initial conditions and the meteorological forcings. Being a retrospective simulation, the experiment consists of model runs where the initial conditions are assumed to be known and the meteorological forcing series are randomly drawn (ESP, Ensemble Streamflow Prediction) and vice versa (revESP, reverse Ensemble Streamflow Prediction). In this context the initial conditions refer to the spatial distribution, volume, and phase of water in the catchment at the date of prediction.

The framework allows for the estimation of the time range at which the initial conditions control the generation of streamflow: when the prediction error of the ESP simulation exceeds that of the revESP simulation, the meteorological forcings start to dominate the streamflow generation. Similarly, when the prediction error of the ESP simulation approaches the prediction error of the climatology (i.e. average streamflow used as naive prediction strategy), the initial conditions no longer control the streamflow generation.

In both cases this time range depends on the interplay between climatological features (e.g. transitions between wet and dry or cold and warm seasons) and catchment-specific hydrological storages (e.g. surface water bodies, soils, aquifers, and snow) and can vary from 0 up to several months . Indeed, this source of predictability is the rationale behind the application of the ESP approach in operational forecast settings, and it can be further exploited by conditioning on climate precursors e.g..

An emerging option for streamflow forecasting is the integration of seasonal predictions from earth system models (ESMs), i.e. coupled atmosphere–ocean–land general circulation models . Predictions from an ESM can be used threefold towards the aim of streamflow forecasting by

forcing a hydrological model with the predicted evolution of the atmosphere;

employing runoff simulated by the land surface model;

using the predicted states of the atmosphere, ocean, or land surface in a perfect prognosis or model output statistics context with the streamflow as the predictand.

The first approach requires a calibrated hydrological model for the region of interest. In order to correct a potential bias and to match the spatial and temporal resolution of the hydrological model, it further involves a postprocessing of the atmospheric fields. A postprocessing might also be applied to the streamflow forecasts to account for deficiencies of the hydrological model. See e.g. or for recent implementations of such a model chain.

In the second approach the land surface model takes the hydrological model's place with the difference that the atmosphere and land surface are fully coupled. Since the land surface component of ESMs often represents groundwater dynamics and the river routing in a simplified way , the simulated runoff might be fed to a routing model as e.g. in . To the best of our knowledge, this approach has not yet been tested with a specific focus on subseasonal or seasonal streamflow forecasting.

The third approach deals with developing an empirical prediction rule for streamflow. If the model-building procedure is based on observations only, the approach is commonly referred to as perfect prognosis (PP). On the other hand, the model might be built using the hindcast archive of a particular ESM (model output statistics, MOS). In both cases the final prediction rule is applied to the actual ESM outcome to forecast the quantity of interest. Therefore, MOS methods require the presence of a hindcast archive of the ESM involved, but can take systematic errors of the ESM into account .

Studies that map ESM output to streamflow with PP or MOS methods include multiple linear regression , principal components regression and canonical correlation analysis , generalised linear models , or artificial neural networks . Whatever the selected predictors, PP and MOS methods generally conduct the mapping across spatial scales. For example, if the catchment of interest falls below the grid scale of the ESM, PP and MOS methods implicitly perform a downscaling step. If the catchment covers several grid points, the method implicitly performs an upscaling.

The present study aims to take up this scale bridging and to test a MOS-based approach for monthly mean streamflow forecasting and a range of catchment areas. To analyse the limits of predictability and to aid interpretation, we first define predictor combinations motivated by the ESP-revESP framework. Next, seasonal predictions of precipitation, surface air temperature, and runoff from the European Centre for Medium-Range Weather Forecasts (ECMWF) are entered into the regression equation and the resulting forecast skill is estimated with respect to the ESP-inspired regression model.

The variation of the catchment area is borrowed from the concept of the “working scale” : given a particular target catchment, the regression models are applied at the catchment scale as well as at two levels of subcatchment scales. In the case of the latter, the resulting forecasts are combined in order to get a forecast at the outlet of the target catchment. By validating the combined forecasts of the subcatchments at the main outlet, any differences in the forecast quality can be attributed to the working scales.

This experiment is conducted for the Rhine River at Lobith and Basel in western Europe. Studies using subseasonal or seasonal climate predictions indicate for several parts of the Rhine basin moderate skill beyond the lead time of traditional weather forecasts. These studies apply the model chain as outlined above in approach number one: concerning catchments of the Alpine and High Rhine, estimate the skillful lead time for daily mean streamflow to lie between 1 and 2 weeks, which increases to about 1 month when focusing on low flows . Also for daily low flow report a sharp decrease in skill after 30 days for the Moselle River. For a set of French catchments show that weekly streamflow forecasts are improved for lead times up to about 1 month when using postprocessed precipitation predictions. advance spring mean streamflow forecasts for the French part of the Rhine basin with seasonal predictions of precipitation and surface air temperature.

As a compromise between skillful lead time and temporal resolution, we decide to focus on monthly mean streamflow at lead times of 0, 1, and 2 months. In order to resolve the monthly timescale and to test the MOS method at shorter time intervals, an experiment is further conducted for 5-day mean streamflow. Here, 0 lead time refers to forecasting one time interval ahead, while e.g. a 1-month lead time denotes a temporal gap of 1 month between the release of a forecast and its time of validity.

Strictly speaking, the present study deals with hindcasts or retrospective forecasts. However, for the sake of readability we use the terms forecast, hindcast, and prediction interchangeably. Below, Sect. introduces the study region, Sect. describes the data set, Sect. exposes the methodology in more detail, and in Sects. and the results are presented and discussed, respectively.

Study region

The Rhine River is situated in western Europe and discharges into the North Sea; in the south its basin is defined by the Alps. About 58 million people use the Rhine water for the purpose of navigation, hydropower, industry, agriculture, drinking water supply, and leisure . The present study focuses on two gauging stations: the first is located in Lobith near the Dutch–German border, the second in Basel in the tri-border region of France, Germany, and Switzerland.

Table lists some geographical attributes. The Rhine at Basel covers an area of approximately one-fifth of the Rhine at Lobith, whereas the mean elevation halves when going from Basel to Lobith. The negative minimum elevation of the Rhine at Lobith is due to a coal mine. Dominant land use classes are farmed areas and forests, but the Rhine at Basel proportionately includes more grassland, wasteland, surface water, and glacier.

Concerning the climatology of the period 1981–2011 (Fig. ), we observe that streamflow peaks at Lobith in winter and at Basel in early summer. Streamflow at Basel is dominated by snow accumulation in winter, subsequent snow melting in spring, and high precipitation in summer. At Lobith precipitation exhibits less variability and higher surface air temperature intensifies evaporation. Based on recent climate projections, it is expected that streamflow in the Rhine basin increases in winter, decreases in summer, and slightly decreases in its annual mean in the last third of the 21th century .

Geography of the Rhine River at Basel and Lobith according to , , and .

Lobith Basel Area (km2) 159 700 36 000 Gauging station (m a.s.l.) 20 250 Elevation min (m a.s.l.) -230 250 Elevation max (m a.s.l.) 4060 4060 Elevation mean (m a.s.l.) 490 1050 Farmed area (%) 47.7 36.8 Forest (%) 35.8 31.6 Grassland (%) 3.4 11.4 Urban area (%) 9.6 7.0 Wasteland (%) 1.8 8.2 Surface water (%) 1.4 4.0 Glacier (%) 0.3 1.0

Monthly area averages of streamflow, precipitation, and surface air temperature for the Rhine at Lobith and Basel with respect to the period 1981–2011 .

Data

Observations of river streamflow and gridded precipitation, surface air temperature, and runoff of the period 1981–2011 in daily resolution constitute the data set. Throughout the study gridded quantities get aggregated to (sub)catchment area averages.

Observations

The streamflow observations consist of a set of 135 time series in  m3s-1. These series as well as the corresponding catchment boundaries are provided by several public authorities and the Global Runoff Data Centre (; see also Data Availability section) and belong to catchments with nearly natural to heavily regulated streamflow.

The ENSEMBLES gridded observational data set in Europe (E-OBS, version 16.0) provides precipitation and surface air temperature on a 0.25∘ regular grid . These fields are based upon the interpolation of station data and are subject to inhomogeneities and biases. However, a comparison against meteorological fields derived from denser station networks attests to a high correlation . In the case of the Rhine basin an E-OBS tile approximately covers an area of 500 km2.

Dynamical seasonal predictions

Precipitation, surface air temperature, and runoff from ECMWF's seasonal forecast system 4 (S4) archive are on a 0.75∘ regular grid, amounting in the case of the Rhine basin to a tile area of about 4500 km2. The hindcast set consists of 15 members of which we take the ensemble mean. Runs of the coupled atmosphere–ocean–land model are initialised on the first day of each month and simulate the subsequent 7 months. Up to 2010, initial conditions are from ERA-Interim, and the year 2011 is based on the operational analysis.

The atmospheric model (IFS cycle 36r4) consists of 91 vertical levels with the top level at 0.01 hPa in the mesosphere. The horizontal resolution is truncated at TL255 and the temporal discretisation equals 45 min. The NEMO ocean model has 42 levels with a horizontal resolution of about 1∘. Sea ice is considered by using its actual extent from the analysis and relaxing it towards the climatology of the past 5 years .

The H-TESSEL land surface model implements four soil layers with an additional snow layer on the top. Interception, infiltration, surface runoff, and evapotranspiration are dealt with by dynamically separating a grid cell into fractions of bare ground, low and high vegetation, intercepted water, and shaded and exposed snow. In contrast, the soil properties of a particular layer are uniformly distributed within one grid cell. Vertical water movement in the soil follows Richards's equation with an additional sink term to allow for water uptake by plants. Runoff per grid cell equals the sum of surface runoff and open drainage at the soil bottom .

Method

The following subsections outline the experiment, which is individually conducted for both the Rhine at Lobith and Basel. Section details the predictor combinations and the regression strategy, Sect. introduces the variation of the catchment area, and Sect. illustrates the validation of the resulting hindcasts.

Model building

The predictand yi,j denotes observations of mean streamflow at a specific gauging site in m3s-1 for j=5,10,…,180 days, starting the first day of each calendar month i=1,…,12 in the period 1981–2011.

Predictor combinations

The set of predictors consists of variables that either precede or succeed the date of prediction i (Table ). The first model refRun (reference run) aims to estimate how well the regression works given the best available input data. The combinations named preMet (preceding meteorology) and subMet (subsequent meteorology) are constrained to precipitation and surface air temperature preceding and subsequent to the date of forecast, respectively.

The S4* combinations constitute the MOS method and consider the seasonal predictions from the S4 hindcast archive, where we use the asterisk as wildcard to refer to any of the S4P, S4T, S4PT, and S4Q models. The S4P and S4T models are used to separate the forecast quality with respect to precipitation and temperature. The S4Q model is tested as H-TESSEL does not implement groundwater dynamics and preceding precipitation and temperature might tap this source of predictability.

Predictor combinations consisting of (with respect to the date of prediction) preceding and subsequent precipitation (p), surface air temperature (t), and runoff (q); the numerical values are either from the E-OBS gridded data set or ECMWF's S4 hindcast archive.

Preceding Subsequent Model

ppre

tpre

psub

tsub

qsub

refRun E-OBS E-OBS E-OBS E-OBS – preMet E-OBS E-OBS – – – subMet – – E-OBS E-OBS – S4P E-OBS E-OBS S4 – – S4T E-OBS E-OBS – S4 – S4PT E-OBS E-OBS S4 S4 – S4Q E-OBS E-OBS – – S4

Regression

For a particular yi,j we first apply a correlation screening to select the optimal aggregation time ai,j for each predictor according to ai,j=arg⁡max⁡k∣cor(yi,j,xi,k)∣, where xi,k is one of the predictors from Table and k=-10,-20,…,-720 days in the case of ppre and tpre (backward in time relative to the date of prediction) and k=5,10,…,j days in the case of psub, tsub, and qsub (forward in time relative to the date of prediction). The limit of 720 days is chosen since larger values rarely get selected.

The ordinary least squares hyperplane is then used for prediction without any transformation, basis expansion, or interaction. However, model variance can be an issue: specifically for the preMet model from Table we expect the signal-to-noise ratio to be low for most of the predictands. In combination with the moderate sample size n=26 for model fitting (with respect to the cross-validation; see Sect. ), perturbations in the training set can lead to large changes in the predictor's time lengths ai,j and regression coefficients. In order to stabilise model variance, we draw 100 non-parametric bootstrap replicates of the training set, fit the model to these replicates, and combine the predictions by unweighted averaging .

Cross-validation

Each year with a buffer of 2 years (i.e. the two preceding and subsequent years) is left out and the regression outlined in Sect. is applied to the remaining years. The fitted models then predict the central years that have been left out. Buffering is used to avoid artificial forecast quality due to hydrometeorological persistence .

Lead time

Lead time is introduced by integrating the predicted y^i,j in time and taking differences with respect to j. For example monthly mean streamflow zi in July (i=7) is predicted with a lead time of 1 month according to z^7=(y^6,60⋅(30+31)⋅b-y^6,30⋅30⋅b)/(31⋅b), where b=24⋅60⋅60 s equals the number of seconds of 1 day and both y^ and z^ have unit m3s-1. For 0 lead time, we set z^i=y^i,30. Please note that the year 1981 needs to be dropped from the validation (Sect. ) since the length of the streamflow series prevents e.g. January 1981, with a lead time of 1 month, from being forecast.

Spatial levels

Contrasting the forecast quality of a given model for catchments separated in space inevitably implies a large number of factors, e.g. the geographic location (and thus the grid points of the ESM involved), the orography, or the degree to which streamflow is regulated. In order that these factors are held while screening through a range of catchment areas, we propose to vary the working scale within a particular target catchment.

Following this line of argumentation we apply the model-building procedure from Sect. to three distinct sets of subcatchments, which we term “spatial levels” (Table ). Spatial level 1 simply consists of the target catchment itself, i.e. the Rhine at Lobith and Basel. At spatial levels 2 and 3 we take additional gauging stations from within the Rhine basin, which naturally divide the basin into subcatchments.

For these subcatchments we have streamflow observations from the entire upstream area but not the actual subcatchment area itself. To arrive at an estimate of the water volume generated by the subcatchment, we equate the predictand yi,j to the difference of outflow and inflow of that subcatchment. For a particular date of prediction and spatial level, the sum of the resulting subcatchment forecasts zi^ then constitutes the final forecast for the Rhine at Lobith and Basel.

This procedure implies that we ignore the water travel time: first, when taking the differences of outflows and inflows and second, when summing up the subcatchment forecasts. While the former increases the observational noise, the latter does not affect the regression itself, but it adds a noise term to the final forecast at Lobith and Basel. As the statistical properties of the noise introduced by the water travel time are unknown, we only can argue that the results provide a lower bound of the forecast quality due to this methodological constraint.

Subcatchment division of the Rhine at Lobith and Basel. The median area covers 4 orders of magnitude.

Number of Area km2 subcatchments min median max Lobith level 1 1 – 159 700 – Lobith level 2 5 19 690 33 220 43 550 Lobith level 3 12 8284 13 040 17 610 Basel level 1 1 – 36 000 – Basel level 2 10 1871 2946 6346 Basel level 3 124 6 187 2654

Validation

The forecast quality of the regression models is analysed using the pairs of cross-validated monthly mean streamflow forecasts and observations (z^,z). These series cover the period 1982–2011 and have a sample size of n=360. In general the validation is based on the mean absolute error (MAE) and Pearson's correlation coefficient (ρ).

The first validation steps focus on the forecasts at Lobith and Basel and thus consider the sum of the subcatchment forecasts z^ per spatial level. The forecasts in the subcatchments itself are addressed in Sect. . Finally, the validation of the 5-day mean streamflow forecasts (Sect. ) complements the monthly analysis.

Benchmarks

Climatology and runoff simulated by H-TESSEL serve as benchmarks. The climatology is estimated using the arithmetic mean from the daily streamflow observations. After averaging in time, runoff from H-TESSEL gets post-calibrated via linear regression against the streamflow observations per spatial level. For both benchmarks the cross-validation scheme from Sect. is applied.

Taylor diagram

Taylor diagrams provide an instrument to contrast model performances. The plotting position of a particular model has a distance from the origin equal to the standard deviation of its forecasts z^ and is located on the line with an angle of incline ϕ=arccos(ρ). The plotting position of the observations z has a distance from the origin equal to the standard deviation of z and is located on the abscissa. The distance between these two plotting positions equals the root mean squared error with the unconditional bias E(Z^-Z) removed.

Statistical significance

In the case of the monthly analysis it turns out that the paired differences of absolute errors for a given lead time, spatial level, and reference model r d=∣z^r-z∣-∣z^S4*-z∣ no longer exhibit serial correlation and approximately follow a Gaussian distribution. Using the mean difference d¯, we then report the p values of the two-sided t-test with null hypothesis d¯=0 and alternative hypothesis d¯≠0. The sample autocorrelation functions and quantile plots against the Gaussian distribution of d for 0 lead time and r being the preMet model are included in the Supplement.

Skill

To evaluate whether a particular model m has skill with respect to a reference model r the MAE ratio s=1-MAEmMAEr is employed. For example, m could be a S4* model and r the preMet model. s=0.1 means that the model m lowers the MAE of model r by 10 %.

Subcatchments

To help in the interpretation of the forecast quality of the MOS method regarding the spatial levels at Lobith and Basel, we plot, in a qualitative manner, the MAE skill score (Eq. ) of the S4* and preMet models in space as well as against the subcatchment area, the median of the terrain roughness, the MAE skill score of the subMet with the preMet model as reference, and the MAE skill score of the refRun model with the climatology as reference.

The terrain roughness is included since the atmospheric flow in complex terrain is challenging to simulate and atmospheric general circulation models need to filter the topography according to their spatial resolution . The terrain roughness is defined as the difference of the maximum and minimum elevation value within a 3 × 3 pixel window . It is derived here from the digital elevation model , which has a horizontal resolution of 25 m.

Five-day mean streamflow

In order to predict 5-day mean streamflow, Eq. () is used with a step size of 5 days. However, the monthly dates of prediction impose some restrictions to the validation: first, it is not possible to derive regular time series at different lead times as in the monthly analysis. Furthermore, the distributional assumptions required for the statistical test from Sect. are not valid. The results of the 5-day mean streamflow experiment thus are restricted to a qualitative interpretation.

Results

The experiment spans several dimensions (i.e. Lobith versus Basel, dates of prediction, lead times, predictor combinations, spatial levels), so we frequently need to collapse one or several dimensions. The Supplement aims to complete the results as presented below.

Taylor diagram

Figure shows the Taylor diagrams for Lobith and Basel to get a global overview regarding the lead times, predictor combinations, and spatial levels. Accurate forecasts reproduce the standard deviation of the observations (thus lie on the circle with radius equal to the standard deviation of the observations) and also exhibit high correlation (so travel on this circle towards the observations on the abscissa). At a first glimpse the spatial levels do not introduce clear differences and most of the models mass at the same spots.

The benchmark climatology is outperformed at 0 lead time by all models. At longer lead times the subMet model pops up besides the refRun model and the remaining models approach climatology. For the refRun model we note a correlation of about 0.9 independently of the lead time while the observation's variability generally is underestimated.

For Lobith and 0 lead time we observe an elongated cluster, which comprises all models except the climatology and the refRun model. Some models score a higher correlation – a closer look would reveal that these are the S4P, S4PT, and S4Q models with H-TESSEL standing at the forefront.

Taylor diagrams for the benchmarks climatology and H-TESSEL and the predictor combinations from Table at Lobith (a) and Basel (b); n=360.

Date of prediction versus lead time

Figure takes a closer look at the clusters in Fig. with the example of the S4PT model and in addition breaks down the prediction skill into the different calendar months. Please note that the ordinate lists the calendar month and not the date of prediction – e.g. the top rows show the skill in predicting January's mean streamflow for lead times of 0 up to 2 months. Crosses indicate p values smaller than 0.05 when Eq. () is applied to the individual calendar months.

In general, the patterns repeat more or less along the spatial levels and the S4PT model only beats the reference models in the denominator of Eq. () at 0 lead time. An exception can be observed at Lobith for the month of June, for which the S4PT model most likely outperforms the climatology at 1-month lead time.

While significant differences between the S4PT and the preMet models are rare, the subMet model starts to outperform the S4PT model already at a lead time of 1 month. The comparison against the bias-corrected H-TESSEL runoff shows that the S4PT model might provide more accurate predictions for early summer, but not otherwise.

MAE skill score of the S4PT model with respect to the climatology, the preMet and subMet models, and bias corrected H-TESSEL runoff. The ordinate depicts the target calendar month and the abscissa the monthly lead time. Crosses indicate p values smaller than 0.05 for the null hypothesis “the reference model in the denominator and the S4PT model score an equal mean absolute error”; n=30.

Mean absolute error

In order to conclude the analysis of the monthly predictions at Lobith and Basel, Table reports the MAE at 0 lead time. Reading Table along the rows reveals a more or less consistent pattern: the refRun model approximately halves the MAE of the climatology; differences between the preMet, subMet, and S4T models are small; compared to the preMet model, the S4P, S4PT, and S4Q models lower the MAE by about 40 to 50 m3s-1 for Lobith and by about 10 to 20 m3s-1 for Basel; and H-TESSEL outperforms the S4* models in the case of Lobith, but not Basel. When reading Table along the columns, we generally note a decreasing MAE when going from spatial level 1 to spatial level 3 at Lobith. In the case of Basel, the MAE remains more or less constant except for the refRun model.

Focusing on the MOS method, Table lists the corresponding MAE skill score (Eq. ) of the S4* models using the preMet model as the reference. The p values for the null hypothesis “the preMet and S4* models score an equal mean absolute error” are listed in brackets. We see that the S4P, S4PT, and S4Q models score an error reduction ranging from 5 to 12 %. In the case of the S4T model an error reduction is either not existent (Lobith) or small (Basel), which comes along with high p values.

Mean absolute error at 0 lead time of the benchmarks climatology and H-TESSEL and the predictor combinations from Table , rounded to integers. All values have unit m3s-1; n=360.

Climatology H-TESSEL refRun preMet subMet S4P S4T S4PT S4Q Lobith level 1 633 421 336 501 502 455 500 453 451 Lobith level 2 633 415 299 484 498 429 482 435 443 Lobith level 3 633 415 285 480 498 424 476 428 444 Basel level 1 239 190 131 197 195 183 195 183 191 Basel level 2 239 185 115 197 193 179 192 178 187 Basel level 3 239 182 110 198 193 179 194 178 188

MAE skill score of the S4* models relative to the preMet model (Eq. , expressed in %) at 0 month lead time. The p values for the null hypothesis “the preMet and S4* models score an equal mean absolute error” are enclosed in brackets; n=360.

S4P S4T S4PT S4Q Lobith level 1 9 (< 0.01) 0 (0.85) 10 (< 0.01) 10 (< 0.01) Lobith level 2 11 (< 0.01) 0 (0.78) 10 (< 0.01) 8 (< 0.01) Lobith level 3 12 (< 0.01) 1 (0.55) 11 (< 0.01) 8 (< 0.01) Basel level 1 7 (0.01) 1 (0.36) 7 (0.01) 3 (0.07) Basel level 2 9 (< 0.01) 3 (0.02) 10 (< 0.01) 5 (< 0.01) Basel level 3 10 (< 0.01) 2 (0.08) 10 (< 0.01) 5 (< 0.01)

Subcatchments

Figure depicts the MAE skill score (Eq. ) for the S4PT model relative to the preMet model for each subcatchment at 0 lead time. If the MAE difference does not exhibit a p value smaller than 0.05 (Eq. ), the subcatchment is coloured in white. We observe that the MAE skill score takes values in the range of about -0.05 to 0.11 and both the lowest and highest scores occur at Basel and spatial level 3. Negative scores can only be found at Basel and spatial level 3, and positive skill tends to cluster in space.

MAE skill score of the S4PT model with respect to the preMet model for each subcatchment and 0 lead time. Subcatchments are only coloured when the p value for the null hypothesis “the preMet and S4PT models score an equal mean absolute error” is smaller than 0.05. In the bottom maps the main outlets at Lobith and Basel are marked with a white cross and open water surfaces are coloured in blue ; n=360.

The same skill scores from Fig. are contrasted in Fig. with the subcatchment area, the median of the terrain roughness, the MAE skill score of the subMet model relative to the preMet model, and the MAE skill score of the refRun model relative to the climatology. If the MAE difference of the S4PT and the preMet models does not exhibit a p value smaller than 0.05, the symbol is drawn with a reduced size. The horizontal lines depict the MAE skill scores from Table .

While the first two attributes concern the geography of the subcatchments, the third attribute indicates the relevance of the initial conditions for the subsequent generation of streamflow. The fourth attribute shows how well the S4PT model performs relative to the climatology as benchmark, when it has access to the best available input data.

The resulting patterns suggest that positive skill does not depend on the subcatchment area. On the other hand, a low terrain roughness and a weak relevance of the initial conditions seem to favour positive skill. The last row finally indicates that positive skill is restricted to subcatchments where the refRun model outperforms climatology. Roughly, a hypothetical relationship appears to strengthen from the top to the bottom plots.

MAE skill score of the S4PT model with respect to the preMet model for each subcatchment and 0 lead time, plotted against subcatchment attributes (see Sect. for details). Large symbols note a p value smaller than 0.05 for the null hypothesis “the preMet and S4PT models score an equal mean absolute error”. The horizontal lines indicate the corresponding skill per spatial level at Lobith and Basel; n=360.

Five day mean streamflow

Figure shows the correlation coefficient of the 5-day mean streamflow observations and corresponding predictions for all models and benchmarks up to a lead time of 45 days. We observe that the refRun model scores a correlation of about 0.8 with a slowly decreasing tendency towards longer lead times. Furthermore, the subMet model crosses the preMet model approximately in the second week; the preMet model approaches climatology within about 3 weeks; and the subMet model comes close to the refRun model in about 3 weeks.

In addition, we see that the bias-corrected H-TESSEL runoff starts rather cautiously, but it seems to slightly outperform the S4* models at longer lead times. While the S4T model is hardly distinguishable from the preMet model, the S4P, S4PT, and S4Q models appear to outperform the preMet model within the first 20 days (Lobith) and 15 days (Basel).

For the full range of lead times, the spatial levels introduce some clear differences (Fig. ): the refRun and subMet models are improved at longer lead times along the spatial levels. For lead times longer than about 50 days, the bias-corrected H-TESSEL runoff stays in close harmony with the climatology, while the S4* and preMet models start to score a smaller correlation instead. This effect seems to be mitigated at spatial levels 2 and 3.

Correlation coefficient of 5-day mean streamflow observations and predictions for lead times up to 45 days; n=360.

Correlation coefficient of 5-day mean streamflow observations and predictions for lead times up to 175 days; n=360.

Discussion Model building

In the case of the monthly streamflow, the refRun model ends up with a correlation of about 0.9 for all lead times, spatial levels, and both Lobith and Basel (Fig. ). Part of this correlation is also the annual cycle (Fig. ), which already leads to a correlation of about 0.5 when using the climatology as prediction rule. The forecasts from the refRun model do not fully reproduce the observations' variance, which might be improved with a transformation of the predictand . This option – along with predictors that more explicitly represent the initial conditions, e.g. lake levels, soil moisture content, or snow courses – preferably should be tested in a future study with a small number of catchments and longer time series.

For the 5-day mean streamflow the refRun model gets degraded. At short lead times the correlation amounts to about 0.8, while for longer lead times the correlation exhibits a decreasing trend. Either the present model formulation is less valid (especially for small values of j, say 5 or 10 days, the assumption of linearity might fail) or the scheme to introduce the lead time (Eq. ) is not appropriate for mean values of small time windows (e.g. the subtraction of streamflow volumes of 155 and 150 days only allows for small prediction errors). Since the final forecast values are not part of the regression equation, it is even possible to perform worse than climatology (Fig. ).

Spatial levels

The spatial levels can affect the forecast quality in two ways:

via the ignorance of the water travel time (Sect. )

or by the aggregation of the E-OBS and S4 fields at the catchment scale not being the appropriate spatial resolution (e.g. large-scale grid averages cancel any spatial variability, and for catchment areas below the grid scale a grid point does not necessarily contain information valid at the local scale).

However, clear differences between the spatial levels can only be observed for the 5-day streamflow predictions, where at spatial levels 2 and 3 the forecast quality is improved. Using local information of precipitation, surface air temperature, or runoff appears to compensate for the ignorance of the water travel time.

preMet-subMet

In the ESP-revESP framework is applied to the world's largest river basins using the global hydrological model PCRaster Global Water Balance (PCR-GLOBWB). Considering all calendar months and the Rhine at Lobith, the ESP simulation only outperforms the climatology at 0 lead time; the revESP simulation is outperformed at 0 lead time by both the ESP simulation and climatology; and at longer lead times the revESP simulation clearly outperforms both the ESP simulation and climatology. Therefore, the results of and those of the present study are mostly in line.

The analysis of the 5-day mean streamflow forecasts (Sect. ) further reveals that the crossover of the preMet and subMet models occurs approximately in the second week. However, this estimate ignores variations within the calendar year and should be considered as a rough guess since the regression method is far from being perfect in the case of the 5-day mean streamflow.

MOS method

In the case of the monthly mean streamflow forecasts at 0 lead time, the MOS method based on precipitation or runoff provides a smaller mean absolute error than the preMet model (Table ). Figure suggests that this error reduction at the monthly timescale arises from the predictions of the first 15 to 20 days. Here, it must be stressed that for the present regression strategy temperature subsequent to the date of prediction often is a weak predictor (regression coefficients of the refRun model at spatial level 1 are included in the Supplement). Thus, a rejection of the S4T model does not allow any inference about the forecast quality of surface air temperature itself.

Figure indicates that the subcatchment area is most likely not relevant to score positive skill; rather the S4PT model outperforms the preMet model in subcatchments where the terrain roughness and the relevance of the initial conditions are low. However, the terrain roughness and the relevance of the initial conditions are not independent attributes: Fig. shows that for small subcatchments in the Alpine region positive skill is rarely present (spatial levels 2 and 3 at Basel). These subcatchments generally exhibit a high terrain roughness as well as a high relevance of the initial conditions due to snow accumulation in winter and subsequent melting in spring and summer. A possible explanation could be that errors in the initial condition estimates outweigh the moderate skill contained in the seasonal climate predictions.

H-TESSEL

Within ECMWF's seasonal forecasting system S4, H-TESSEL aims to provide a lower boundary condition for the simulation of the atmosphere and consequently neither implements streamflow routing nor groundwater storage . However, H-TESSEL in combination with a linear bias correction often performs best (Table ).

The S4Q model, which has access to the same input data and in addition conditions on preceding precipitation and temperature, scores a lower forecast accuracy than H-TESSEL in the case of Lobith (Table ). This is most likely related to overfitting, which is not sufficiently smoothed by the model averaging (Sect. ).

Conclusions

The present study tests a model output statistics (MOS) method for monthly and 5-day mean streamflow forecasts in the Rhine basin. The method relies on the linear regression model fitted by least squares and uses predictions of precipitation and surface air temperature from the seasonal forecast system S4 of the European Centre for Medium-Range Weather Forecasts. Observations of precipitation and surface air temperature prior to the date of prediction are employed as a surrogate for the initial conditions. In addition, runoff simulated by the S4 land surface component, the H-TESSEL land surface model, is evaluated for its predictive power.

MOS methods often bridge the grid resolution of the dynamical model and the spatial scale of the actual predictand. In order to estimate how the forecast quality depends on the catchment area, a hindcast experiment for the period 1981–2011 is conducted that varies the working scale within the Rhine basin at Lobith and Basel. This variation is implemented by applying the MOS method to subcatchments and combining the resulting forecasts to predict streamflow at the main outlets at Lobith and Basel.

On average, the monthly mean streamflow forecasts based on the initial conditions are skillful with respect to the climatology at 0 lead time for both the Rhine at Lobith and Basel. The MOS method, which in addition has access to the dynamical seasonal predictions, further reduces the mean absolute error by about 5 to 12 % compared to the model that is constrained to the initial conditions. For lead times of 1 and 2 months the forecasts virtually reduce to climatology. These results hold for the entire range of tested subcatchment scales, meaning that effects of a scale mismatch between the horizontal grid resolution and the catchment area do not emerge. Applying the MOS method for 5-day mean streamflow finally results in a rather moderate forecast quality.

We conclude that the present model formulation – in particular the assumption of linearity – is valid for the monthly timescale, catchments with areas up to 160 000 km2, and water travel times similar to the Rhine river. However, the results also show that a simple linear bias correction of the runoff predicted by the H-TESSEL land surface model is hard to beat. Given the simplicity of a linear bias correction, we think that it could be worth further investigating runoff simulations from land surface components of earth system models for subseasonal to seasonal streamflow forecasting.

The regression approach from Sect. is implemented in an R package maintained on https://github.com/schiggo/SSO.

, , and are public data sets. Access to the ECMWF and GRDC archive must be requested. Data from the various public authorities as listed in the Acknowledgements are partly public.

The Supplement related to this article is available online at https://doi.org/10.5194/hess-22-929-2018-supplement.

The authors declare that they have no conflict of interest.

This article is part of the special issue “Sub-seasonal to seasonal hydrological forecasting”. It is not associated with a conference.

Acknowledgements

Streamflow series and catchment boundaries are provided by the following public authorities: the State Institute for the Environment, Measurements and Conservation Baden Wuerttemberg; the Bavarian Environmental Agency; the State of Vorarlberg; the Austrian Federal Ministry of Agriculture, Forestry, Environment and Water; and the Swiss Federal Office for the Environment. Further we acknowledge the E-OBS data set from the EU-FP6 project ENSEMBLES (http://ensembles-eu.metoffice.com/) and the data providers in the ECA&D project (www.ecad.eu) as well as the Copernicus data and information funded by the European Union (EU-DEM and CORINE). We also thank the Global Runoff Data Centre and the European Centre for Medium-Range Weather Forecasts for the access to the data archives. The reviews and comments by Joost Beckers, Kean Foster, and Fredrik Wetterhall substantially improved the manuscript. The study was funded by the Group of Hydrology, which is part of the Institute of Geography at the University of Bern, Bern, Switzerland. Edited by: Fredrik Wetterhall Reviewed by: Joost Beckers and Kean Foster

References Balsamo et al.(2009)Balsamo, Beljaars, Scipal, Viterbo, van den Hurk, Hirschi, and Betts

Balsamo, G., Beljaars, A., Scipal, K., Viterbo, P., van den Hurk, B., Hirschi, M., and Betts, A. K.: A Revised Hydrology for the ECMWF Model: Verification from Field Site to Terrestrial Water Storage and Impact in the Integrated Forecast System, J. Hydrometeorol., 10, 623–643, 10.1175/2008JHM1068.1, 2009.

Beckers et al.(2016)Beckers, Weerts, Tijdeman, and Welles

Beckers, J. V. L., Weerts, A. H., Tijdeman, E., and Welles, E.: ENSO-conditioned weather resampling method for seasonal ensemble streamflow prediction, Hydrol. Earth. Syst. Sc., 20, 3277–3287, 10.5194/hess-20-3277-2016, 2016.

Bennett et al.(2016)Bennett, Wang, Li, Robertson, and Schepen

Bennett, J. C., Wang, Q. J., Li, M., Robertson, D. E., and Schepen, A.: Reliable long-range ensemble streamflow forecasts: Combining calibrated climate forecasts with a conceptual runoff model and a staged error model, Water Resour. Res., 52, 8238–8259, 10.1002/2016WR019193, 2016.

Blöschl and Sivapalan(1995)

Blöschl, G. and Sivapalan, M.: Scale issues in hydrological modelling: A review, Hydrol. Process., 9, 251–290, 10.1002/hyp.3360090305, 1995.

Bosshard et al.(2014)Bosshard, Kotlarski, Zappa, and Schär

Bosshard, T., Kotlarski, S., Zappa, M., and Schär, C.: Hydrological Climate-Impact Projections for the Rhine River: GCM-RCM Uncertainty and Separate Temperature and Precipitation Effects, J. Hydrometeorol., 15, 697–713, 10.1175/JHM-D-12-098.1, 2014.

Breiman(1996)

Breiman, L.: Bagging Predictors, Mach. Learn., 24, 123–140, 10.1023/A:1018054314350, 1996.

Brunet et al.(1988)Brunet, Verret, and Yacowar

Brunet, N., Verret, R., and Yacowar, N.: An Objective Comparison of Model Output Statistics and “Perfect Prog” Systems in Producing Numerical Weather Element Forecasts, Weather Forecast., 3, 273–283, 10.1175/1520-0434(1988)003<0273:AOCOMO>2.0.CO;2, 1988.

Clark et al.(2015)Clark, Fan, Lawrence, Adam, Bolster, Gochis, Hooper, Kumar, Leung, Mackay, Maxwell, Shen, Swenson, and Zeng

Clark, M. P., Fan, Y., Lawrence, D. M., Adam, J. C., Bolster, D., Gochis, D. J., Hooper, R. P., Kumar, M., Leung, L. R., Mackay, D. S., Maxwell, R. M., Shen, C., Swenson, S. C., and Zeng, X.: Improving the representation of hydrologic processes in Earth System Models, Water Resour. Res., 51, 5929–5956, 10.1002/2015WR017096, 2015.

CORINE(2016)

CORINE: Corine Land Cover 2006 raster data CLC 2006, available at: https://land.copernicus.eu/pan-european/corine-land-cover, (last access: 29 December 2017), 2016.

Crochemore et al.(2016)Crochemore, Ramos, and Pappenberger

Crochemore, L., Ramos, M.-H., and Pappenberger, F.: Bias correcting precipitation forecasts to improve the skill of seasonal streamflow forecasts, Hydrol. Earth. Syst. Sc., 20, 3601–3618, 10.5194/hess-20-3601-2016, 2016.

Demirel et al.(2015)Demirel, Booij, and Hoekstra

Demirel, M. C., Booij, M. J., and Hoekstra, A. Y.: The skill of seasonal ensemble low-flow forecasts in the Moselle River for three different hydrological models, Hydrol. Earth. Syst. Sc., 19, 275–291, 10.5194/hess-19-275-2015, 2015.

E-OBS(2017)

E-OBS: Daily temperature and precipitation fields in Europe V.16, available at: http://www.ecad.eu/download/ensembles/ensembles.php, last access: 29 December 2017.

ECMWF(2017)

ECMWF: IFS Documentation, available at: http://www.ecmwf.int/en/forecasts/documentation-and-support/changes-ecmwf-model/ifs-documentation, last access: 29 December 2017.

EU-DEM(2016)

EU-DEM: Digital Elevation Model over Europe v1.0, available at: https://land.copernicus.eu/pan-european/satellite-derived-products/eu-dem (last access: 29 December 2017), 2016.

Foster and Uvo(2010)

Foster, K. L. and Uvo, C. B.: Seasonal streamflow forecast: a GCM multi-model downscaling approach, Hydrol. Res., 41, 503–507, 10.2166/nh.2010.143, 2010.

Fundel et al.(2013)Fundel, Jörg-Hess, and Zappa

Fundel, F., Jörg-Hess, S., and Zappa, M.: Monthly hydrometeorological ensemble prediction of streamflow droughts and corresponding drought indices, Hydrol. Earth. Syst. Sc., 17, 395–407, 10.5194/hess-17-395-2013, 2013.

GRDC(2016)

GRDC: The Global Runoff Data Centre (data base state June 2016), available at: http://www.bafg.de/GRDC/EN/Home/homepage_node.html (last access: 29 December 2017), 2016.

Haylock et al.(2008)Haylock, Hofstra, Klein Tank, Klok, Jones, and New

Haylock, M. R., Hofstra, N., Klein Tank, A. M. G., Klok, E. J., Jones, P. D., and New, M.: A European daily high-resolution gridded data set of surface temperature and precipitation for 1950–2006, J. Geophys. Res.-Atmos., 113, D20, 10.1029/2008JD010201, 2008.

Hofstra et al.(2009)Hofstra, Haylock, New, and Jones

Hofstra, N., Haylock, M., New, M., and Jones, P. D.: Testing E-OBS European high-resolution gridded data set of daily precipitation and surface temperature, J. Geophys. Res.-Atmos., 114, D21, 10.1029/2009JD011799, 2009.

Humphrey et al.(2016)Humphrey, Gibbs, Dandy, and Maier

Humphrey, G. B., Gibbs, M. S., Dandy, G. C., and Maier, H. R.: A hybrid approach to monthly streamflow forecasting: Integrating hydrological model outputs into a Bayesian artificial neural network, J. Hydrol., 540, 623–640, 10.1016/j.jhydrol.2016.06.026, 2016.

ICPR(2009)

ICPR: Internationally Coordinated Management Plan for the International River Basin District of the Rhine, International Commission for the Protection of the Rhine, 2009.

Jörg-Hess et al.(2015)Jörg-Hess, Griessinger, and Zappa

Jörg-Hess, S., Griessinger, N., and Zappa, M.: Probabilistic Forecasts of Snow Water Equivalent and Runoff in Mountainous Areas, J. Hydrometeorol., 16, 2169–2186, 10.1175/JHM-D-14-0193.1, 2015.

Maraun and Widmann(2015)

Maraun, D. and Widmann, M.: The representation of location by a regional climate model in complex terrain, Hydrol. Earth. Syst. Sc., 19, 3449–3456, 10.5194/hess-19-3449-2015, 2015.

Marcos et al.(2017)Marcos, Llasat, Quintana-Segui, and Turco

Marcos, R., Llasat, M. C., Quintana-Segui, P., and Turco, M.: Seasonal predictability of water resources in a Mediterranean freshwater reservoir and assessment of its utility for end-users, Sci. Total Environ., 575, 681–691, 10.1016/j.scitotenv.2016.09.080, 2017.

Michaelsen(1987)

Michaelsen, J.: Cross-Validation in Statistical Climate Forecast Models, J. Clim. Appl. Meteorol., 26, 1589–1600, 10.1175/1520-0450(1987)026<1589:CVISCF>2.0.CO;2, 1987.

Molteni et al.(2011)Molteni, Stockdale, Balmaseda, Balsamo, Buizza, Ferranti, Magnusson, Mogensen, Palmer, and Vitart

Molteni, F., Stockdale, T., Balmaseda, M., Balsamo, G., Buizza, R., Ferranti, L., Magnusson, L., Mogensen, K., Palmer, T., and Vitart, F.: The new ECMWF seasonal forecast system (System 4), European Centre for Medium Range Weather Forecasts, 2011.

National Academies(2016)

National Academies: Next Generation Earth System Prediction, The National Academies Press, 1st Edn., 10.17226/21873, 2016.

Orth and Seneviratne(2013)

Orth, R. and Seneviratne, S. I.: Predictability of soil moisture and streamflow on subseasonal timescales: A case study, J. Geophys. Res.-Atmos., 118, 10963–10979, 10.1002/jgrd.50846, 2013.

Pappenberger et al.(2010)Pappenberger, Cloke, Balsamo, Ngo-Duc, and Oki

Pappenberger, F., Cloke, H. L., Balsamo, G., Ngo-Duc, T., and Oki, T.: Global runoff routing with the hydrological component of the ECMWF NWP system, Int. J. Climatol., 30, 2155–2174, 10.1002/joc.2028, 2010.

Sahu et al.(2016)Sahu, Robertson, Boer, Behera, DeWitt, Takara, Kumar, and Singh

Sahu, N., Robertson, A. W., Boer, R., Behera, S., DeWitt, D. G., Takara, K., Kumar, M., and Singh, R. B.: Probabilistic seasonal streamflow forecasts of the Citarum River, Indonesia, based on general circulation models, Stoch. Env. Res. Risk A., 31, 1–12, 10.1007/s00477-016-1297-4, 2016.

Schick et al.(2016)Schick, Rössler, and Weingartner

Schick, S., Rössler, O., and Weingartner, R.: Comparison of cross-validation and bootstrap aggregating for building a seasonal streamflow forecast model, Proc. IAHS, 374, 159–163, 10.5194/piahs-374-159-2016, 2016.

Shukla et al.(2013)Shukla, Sheffield, Wood, and Lettenmaier

Shukla, S., Sheffield, J., Wood, E. F., and Lettenmaier, D. P.: On the sources of global land surface hydrologic predictability, Hydrol. Earth. Syst. Sc., 17, 2781–2796, 10.5194/hess-17-2781-2013, 2013.

Singla et al.(2012)Singla, Céron, Martin, Regimbeau, Déqué, Habets, and Vidal

Singla, S., Céron, J.-P., Martin, E., Regimbeau, F., Déqué, M., Habets, F., and Vidal, J.-P.: Predictability of soil moisture and river flows over France for the spring season, Hydrol. Earth. Syst. Sc., 16, 201–216, 10.5194/hess-16-201-2012, 2012.

Slater et al.(2017)Slater, Villarini, Bradley, and Vecchi

Slater, L. J., Villarini, G., Bradley, A. A., and Vecchi, G. A.: A dynamical statistical framework for seasonal streamflow forecasting in an agricultural watershed, Clim. Dynam., 49, 10.1007/s00382-017-3794-7, 2017.

Taylor(2001)

Taylor, K. E.: Summarizing multiple aspects of model performance in a single diagram, J. Geophys. Res.-Atmos., 106, 7183–7192, 10.1029/2000JD900719, 2001.

Torma et al.(2015)Torma, Giorgi, and Coppola

Torma, C., Giorgi, F., and Coppola, E.: Added value of regional climate modeling over areas characterized by complex terrain–Precipitation over the Alps, J. Geophys. Res.-Atmos., 120, 3957–3972, 10.1002/2014JD022781, 2015.

van Dijk et al.(2013)van Dijk, Peña Arancibia, Wood, Sheffield, and Beck

van Dijk, A. I. J. M., Peña Arancibia, J. L., Wood, E. F., Sheffield, J., and Beck, H. E.: Global analysis of seasonal streamflow predictability using an ensemble prediction system and observations from 6192 small catchments worldwide, Water Resour. Res., 49, 2729–2746, 10.1002/wrcr.20251, 2013.

Wang et al.(2012)Wang, Shrestha, Robertson, and Pokhrel

Wang, Q. J., Shrestha, D. L., Robertson, D. E., and Pokhrel, P.: A log-sinh transformation for data normalization and variance stabilization, Water Resour. Res., 48, 5, 10.1029/2011WR010973, 2012.

Wilson et al.(2007)Wilson, O'Connell, Brown, Guinan, and Grehan

Wilson, M. F. J., O'Connell, B., Brown, C., Guinan, J. C., and Grehan, A. J.: Multiscale Terrain Analysis of Multibeam Bathymetry Data for Habitat Mapping on the Continental Slope, Mar. Geod., 30, 3–35, 10.1080/01490410701295962, 2007.

Wood and Lettenmaier(2008)

Wood, A. W. and Lettenmaier, D. P.: An ensemble approach for attribution of hydrologic prediction uncertainty, Geophys. Res. Lett., 35, 14, 10.1029/2008GL034648, 2008.

Yossef et al.(2013)Yossef, Winsemius, Weerts, van Beek, and Bierkens

Yossef, N. C., Winsemius, H., Weerts, A., van Beek, R., and Bierkens, M. F. P.: Skill of a global seasonal streamflow forecasting system, relative roles of initial conditions and meteorological forcing, Water Resour. Res., 49, 4687–4699, 10.1002/wrcr.20350, 2013.

Yuan et al.(2015a)Yuan, Roundy, Wood, and Sheffield

Yuan, X., Roundy, J. K., Wood, E. F., and Sheffield, J.: Seasonal Forecasting of Global Hydrologic Extremes: System Development and Evaluation over GEWEX Basins, B. Am. Meteorol. Soc., 96, 1895–1912, 10.1175/BAMS-D-14-00003.1, 2015a.

Yuan et al.(2015b)Yuan, Wood, and Ma

Yuan, X., Wood, E. F., and Ma, Z.: A review on climate-model-based seasonal hydrologic forecasting: physical understanding and system development, Wiley Interdisciplinary Reviews: Water, 2, 523–536, 10.1002/wat2.1088, 2015b.

</app></app-group></back> </article>