**Research article**
07 Mar 2019

**Research article** | 07 Mar 2019

# Climate influences on flood probabilities across Europe

Eva Steirou Lars Gerlitz Heiko Apel Xun Sun and Bruno Merz

^{1},

^{1},

^{1},

^{2,3},

^{1,4}

**Eva Steirou et al.**Eva Steirou Lars Gerlitz Heiko Apel Xun Sun and Bruno Merz

^{1},

^{1},

^{1},

^{2,3},

^{1,4}

^{1}Section Hydrology, GFZ German Research Center for Geosciences, Potsdam, 14473, Germany^{2}Key Laboratory of Geographic Information Science (Ministry of Education), East China Normal University, 200241, Shanghai, China^{3}Columbia Water Center, Earth Institute, Columbia University, New York, NY 10027, USA^{4}Institute of Environmental Science and Geography, University of Potsdam, Potsdam, 14476, Germany

^{1}Section Hydrology, GFZ German Research Center for Geosciences, Potsdam, 14473, Germany^{2}Key Laboratory of Geographic Information Science (Ministry of Education), East China Normal University, 200241, Shanghai, China^{3}Columbia Water Center, Earth Institute, Columbia University, New York, NY 10027, USA^{4}Institute of Environmental Science and Geography, University of Potsdam, Potsdam, 14476, Germany

**Correspondence**: Eva Steirou (esteirou@gfz-potsdam.de) and Xun Sun (xs2226@columbia.edu)

**Correspondence**: Eva Steirou (esteirou@gfz-potsdam.de) and Xun Sun (xs2226@columbia.edu)

Received: 08 Aug 2018 – Discussion started: 15 Aug 2018 – Revised: 27 Dec 2018 – Accepted: 02 Feb 2019 – Published: 07 Mar 2019

The link between streamflow extremes and climatology has been widely studied in recent decades. However, a study investigating the effect of large-scale circulation variations on the distribution of seasonal discharge extremes at the European level is missing. Here we fit a climate-informed generalized extreme value (GEV) distribution to about 600 streamflow records in Europe for each of the standard seasons, i.e., to winter, spring, summer and autumn maxima, and compare it with the classical GEV distribution with parameters invariant in time. The study adopts a Bayesian framework and covers the period 1950 to 2016. Five indices with proven influence on the European climate are examined independently as covariates, namely the North Atlantic Oscillation (NAO), the east Atlantic pattern (EA), the east Atlantic–western Russian pattern (EA/WR), the Scandinavia pattern (SCA) and the polar–Eurasian pattern (POL).

It is found that for a high percentage of stations the climate-informed model is preferred to the classical model. Particularly for NAO during winter, a strong influence on streamflow extremes is detected for large parts of Europe (preferred to the classical GEV distribution for 46 % of the stations). Climate-informed fits are characterized by spatial coherence and form patterns that resemble relations between the climate indices and seasonal precipitation, suggesting a prominent role of the considered circulation modes for flood generation. For certain regions, such as northwestern Scandinavia and the British Isles, yearly variations of the mean seasonal climate indices result in considerably different extreme value distributions and thus in highly different flood estimates for individual years that can also persist for longer time periods.

- Article
(10691 KB) -
Supplement
(1833 KB) - BibTeX
- EndNote

The understanding of extreme streamflow is a key issue for infrastructure design, flood risk management and (re-)insurance, and the estimation of flood probabilities has been a focus of scientific debate in recent decades. Traditionally, streamflow has been analyzed with regard to associated hydro-climatic processes acting at the catchment scale. In recent years many studies have additionally focused on the link between local streamflow and larger-scale climate mechanisms, extending beyond the catchment boundaries (Merz et al., 2014). An early example can be found in Hirschboeck (1988), who provides a detailed explanation of relationships between floods and synoptic patterns in the USA. Large-scale atmospheric patterns acting at global or continental scales have been shown to significantly influence flood magnitude and frequency at the local and regional scale. Regional in this context refers to the joint consideration of several gauges. For example, Kiem et al. (2003) stratified a regional flood index in Australia according to quantiles of the El Niño–Southern Oscillation (ENSO) index and showed that La Niña events are associated with a distinctly higher flood risk compared with El Niño events. Ward et al. (2014) found that peak discharges are strongly influenced by ENSO for a large fraction of catchments across the globe. Delgado et al. (2012) detected a dependence between the variance of the annual maximum flow at stations along the Mekong River and the intensity of the western Pacific monsoon.

This perception of climate-influenced extremes has been incorporated into flood frequency analysis by including climatic variables as covariates of extreme value distribution parameters. It is therefore assumed that the probability density function (pdf) of streamflow is not constant in time but it is conditioned on external variables. This framework, usually called nonstationary, can be particularly useful for hydro-climatic studies since the influence of the climatic phenomena on the distribution of the hydrological target variable, such as extreme streamflow, can be considered (Sun et al., 2014). This means that the whole distribution as well as certain parts of the target variable distribution, such as the tails, can be assessed by including the influence of the large-scale climate phenomenon and used for flood risk management or reinsurance purposes. This conditional or nonstationary frequency analysis has been popularized in the field of hydrology and flood research in recent years. Different covariate types have been examined for their influence on flood extremes, e.g., time (e.g., Delgado et al., 2010; Sun et al., 2015), snow cover indices (Kwon et al., 2008), reservoir indices (López and Francés, 2013; Silva et al., 2017), population measures (Villarini et al., 2009) and large-scale atmospheric and oceanic fields and indices (Delgado et al., 2014; Renard and Lall, 2014). A review of nonstationary approaches for local frequency analyses is given by Khaliq et al. (2006), while some of their limitations are discussed by Koutsoyiannis and Montanari (2015), Serinaldi and Kilsby (2015) and Serinaldi et al. (2018).

In this study, we focus on the European continent and the relation between
streamflow extremes and the large-scale atmospheric circulation. The
European climate is mainly influenced by pressure patterns acting at the
broader region covering Europe and the northern Atlantic. In particular,
five circulation modes have been shown to significantly modify the moisture
fluxes into the European domain: the North Atlantic Oscillation (NAO), the
east Atlantic (EA), the east Atlantic–western Russia (EA/WR), the
Scandinavia (SCA) and the polar–Eurasian (POL) patterns (Bartolini
et al., 2010; Casanueva et al., 2014; Rust et al., 2015; Steirou et al.,
2017). These patterns represent the first five pressure modes north of
50^{∘} N, derived by means of a rotated principle component analysis
of monthly mean 500 hPa geopotential height fields (Barnston and
Livezey, 1987). The modes indicate the position and magnitude of large-scale
atmospheric waves and thus control the strength and location of the northern
hemispheric jetstream. All modes are characterized by a particular pattern
of large-scale winds and moisture fluxes and strongly affect near-surface
climate conditions over vast parts of the Northern Hemisphere. Particularly
NAO has been shown to significantly influence the European winter climate:
its positive state has been linked to positive (negative) anomalies of
moisture fluxes, cyclone passages and precipitation over northern (southern)
Europe (Hurrell
and Deser, 2009; Wibig, 1999). A seasonal shift of the NAO pressure centers
and moisture fluxes towards the north during summer has been detected (Hurrell and Deser, 2009). EA,
often referred to as a southward-shifted NAO, is characterized by distinctly
defined geopotential height anomalies and an associated influence on
westerly moisture fluxes and local climate conditions over Great Britain (Comas-Bru and McDermott, 2014;
Moore and Renfrew, 2012). EA/WR features two centers of action over central
Europe and central Russia. During its positive state, a planetary ridge is
located over northwestern Europe, and this reduces the advection of moist
air masses (Krichak and Alpert, 2005). SCA is particularly
active over northern Europe and triggers atmospheric blocking during its
positive phase (Bueh and Nakamura, 2007). POL
represents the strength of the pressure gradient between the polar regions
and the midlatitudes and thus controls the westerly circulation,
particularly over northern Europe (Claud et
al., 2007). Correlation maps, demonstrating links between these circulation
modes and seasonal precipitation and temperature, are included in the Supplement (Figs. S1–S4).

Apart from Northern Hemisphere modes, the El Niño–Southern Oscillation (ENSO) has been suggested to influence the European hydrology. Significant relations have been found with precipitation and different discharge indices (Guimarães Nobre et al., 2017; Mariotti et al., 2002; Steirou et al., 2017). However, in contrast to the above-described circulation modes, ENSO does not shape the European climate and hydrology directly, but rather indirectly through the regulation of the phase of other large-scale modes, such as the EA (Iglesias et al., 2014). Other patterns acting at a smaller scale, such as the Mediterranean Oscillation (MO) and the western Mediterranean Oscillation (WMO), have also been related with hydrological variables in Europe (Criado-Aldeanueva and Soto-Navarro, 2013; Dünkeloh and Jacobeit, 2003; Martin-Vide and Lopez-Bustins, 2006). However, such modes seem to have limited importance at the continental scale.

While the relation between European hydrology and large-scale circulation has attracted much attention and has been widely studied, only a few studies have adopted a conditional flood frequency framework for the investigation of climate–flood interactions. Villarini et al. (2012) conducted a frequency analysis of annual maximum and peak-over-threshold discharge in Austria with NAO as a covariate. López and Francés (2013) examined maximum annual flows in Spain conditioned on the principal components of four winter climate modes: NAO, AO, MO and WMO. Still, a comprehensive study on streamflow extremes at the European scale has not been conducted.

Thus, this study aims at a large-scale investigation of circulation–streamflow interactions for the entire European continent by adopting a flood frequency framework. We examine seasonal streamflow maxima from more than 600 gauges covering the entire European continent and particularly investigate the influence of the five major pressure modes that directly affect the European climate: NAO, EA, EA/WR, SCA and POL. In order to quantify the effect of important hydro-climatological processes for the streamflow regimes, we investigate contemporaneous relationships only, without considering any time lags. We identify regions with a consistent influence of each particular circulation index in order to explain the spatial coherence of flood frequency. The analysis is conducted at a seasonal scale in order to better account for the intra-annual variations of the circulation characteristics and the associated seasonal shift of climate–streamflow relationships. A Bayesian framework is adopted for the flood frequency analysis because of its advantages concerning the quantification and interpretation of uncertainty. Furthermore, prior information about hydrologic extremes exists in the literature and can be used for inference.

## 2.1 Streamflow data and circulation indices

The time period of our analysis is from 1950 to 2016, defined by the overlap
between streamflow data and circulation indices. Daily streamflow data for
the European continent were received from GRDC (Global Runoff Data Centre).
From this dataset, gauges with record lengths of at least 50 years after
1950 and with a catchment area larger than 200 km^{2} were selected. Small
catchments are not considered, as they may be more prone to local phenomena,
which could blur the large-scale atmospheric influence. In total, 649 stations covering northern and central Europe with the exception of Poland are
considered. Due to the underrepresentation of southern Europe, additional
data from other sources satisfying the abovementioned criteria are included
in the analysis. Five time series with monthly maximum discharges were
obtained for Spain and one station with daily discharge was provided for
Portugal. For details about these additional stations the reader is referred
to Mediero
et al. (2014, 2015). Finally, one record with daily streamflow data was
provided for Pontelagoscuro in Italy (Alessio Domeneghetti, personal
communication, 2017). For each station, the maximum value of mean daily streamflow
is derived for the four standard boreal seasons: winter (DJF), spring,
(MAM), summer (JJA) and autumn (SON). Seasons with more than 20 % missing
values are not considered. Overall 586 records in winter, 604 in spring, 599
in summer and 597 for the autumn season are utilized for the analysis.

Time series of monthly circulation indices for the period 1950–2016 were retrieved from the Climate Prediction Center (CPC) of the National Oceanic and Atmospheric Administration (NOAA) (http://www.cpc.ncep.noaa.gov/data/teledoc/telecontents.shtml, last access: May 2017). We make use of the five indices mentioned in the introduction, namely, the NAO, EA, EA/WR, SCA and POL patterns. Seasonal mean climate indices are used for the adjustment of the extreme value distribution; however, we also examine whether the results differ if monthly values (in accordance with the observed flood date) are considered as covariate. The time series of the seasonal indices, along with their running mean for a 10-year window, are shown in Fig. S5. Histograms showing the distribution of mean circulation indices for each season are provided in Fig. S6.

## 2.2 Flood frequency analysis – competing models

The GEV distribution with parameters invariant in time and with parameters conditioned on
the climate indices are fitted to the seasonal maximum streamflow data. For
the climate-informed models the condition of independent and identically
distributed observations of the classical GEV distribution is relaxed to include
parameters conditioned on time-varying covariates (Katz et
al., 2002). For the two types of models we use the terms “classical model”
instead of stationary model and “climate-informed model” rather than
“nonstationary model”. It has been suggested that if covariates have a
stochastic structure and no deterministic component, the resulting
distribution is not truly nonstationary (Montanari
and Koutsoyiannis, 2014; van Montfort and van Putten, 2002; Serinaldi and
Kilsby, 2015). As our climate covariates have no distinguishable
deterministic component (not shown), it is consequently not clear if they
result in nonstationary models. Here each streamflow gauge is handled
independently and site-specific parameters are derived. Let *Y*(*t*) denote a
streamflow observation at time *t* and $\mathit{Y}=\left(Y\right({t}_{\mathrm{1}}),Y({t}_{\mathrm{2}})$, … ,*Y*(*t*_{n}))
denote the vector of streamflow observations at a specific site. Then for
the classical case the model is given as follows:

where ** θ** is the vector of length

*m*of (time-invariant) distribution parameters. The classical GEV distribution comprises

*m*=3 parameters: a location parameter

*μ*, a scale parameter

*σ*and a shape parameter

*ξ*.

In the Bayesian framework, the posterior pdf of the parameter vector is computed as follows, based on Bayes theorem:

where *f*(** θ**) is the prior pdf of distribution
parameters and

*f*(

**|**

*Y***) is the likelihood function:**

*θ*For the climate-informed distribution, parameters are assumed to be a
function *h*_{i} of the vector of time-varying climate covariates ** x**(

*t*). In the general case, Eq. (1) takes the following form:

with $\mathit{\theta}\left(t\right)=\left({\mathit{\theta}}_{\mathrm{1}}\right(t),{\mathit{\theta}}_{\mathrm{2}}(t),\phantom{\rule{0.125em}{0ex}}\mathrm{\dots}\phantom{\rule{0.125em}{0ex}},{\mathit{\theta}}_{m}(t\left)\right)$ the collection of *m* distribution parameters at time *t*, and

Here *β*_{i} is the vector of
(internal) parameters used in function *h*_{i} (not to be confused with
parameters *θ*_{i}).

The climate-informed GEV distribution is a generalization of the classical GEV distribution. The likelihood function is then defined as follows:

The function *h*_{i}, linking the distribution parameters with climate
covariates, is derived by means of a linear regression. The shape parameter
is assumed to be constant as its estimation includes large uncertainties,
even under the assumption of stationarity (Coles, 2001; Papalexiou and
Koutsoyiannis, 2013; Silva et al., 2017). A preliminary analysis considering
the effect of a covariate on both the location and scale parameter (see Sect. 2.3 below) did
not provide very different results than those for a
covariate on the location parameter only (not shown). Consequently and for
reasons of parsimony, we examine only conditional extreme value
distributions with a time-varying location parameter.

Conditional distributions of only one covariate at a time are derived, since
we are interested in the separate effect of each individual climate index on
flood quantiles. Based on the abovementioned assumptions concerning model
structure and the form of the function *h*_{i}, Eq. (5) can be simplified to the following:

where *μ*(*t*) is the varying location parameter, *μ*_{0} the location
intercept, *μ*_{1} the location slope and *x*(*t*) the single covariate
examined.

Consequently, the conditional GEV distribution comprises four parameters: scale and shape
parameters, and intercept *μ*_{0} and slope *μ*_{1} for the
location parameter. Since five different climate covariates *x*(*t*) are
investigated, we construct six different models (one classical and five
conditional) for each station and season. The posterior pdf of parameters in
Eq. (5) for both the classical and conditional model is estimated using a
No-U-Turn Sampler (NUTS) Hamiltonian Monte Carlo (HMC) approach
implemented in Rstan, the R interface to Stan (Stan Development Team, 2018).
NUTS is an extension to HMC, a Markov chain Monte Carlo (MCMC) algorithm
that avoids the random walk behavior and sensitivity to correlated
parameters which characterize many MCMC methods (Hoffman and Gelman, 2014).
Stan is a state-of-the-art platform for statistical modeling and
high-performance statistical computation.

For all covariates and seasons, models are fitted independently. No posterior distributions from the classical approach are used as priors for the climate-informed case. For all models, noninformative uniform priors are used for the location parameter (for both intercept and slope) and for the scale parameter, since no prior information is available. For the shape parameter an informative normal distribution with mean 0.093 and standard deviation 0.12 is used. This distribution is adopted from a global study of extreme rainfall by Papalexiou and Koutsoyiannis (2013), which, to our knowledge, summarizes an analysis of shape parameters using the largest number of stations with hydrological data worldwide. Although rainfall extremes may be characterized by slightly different shape parameters than those of streamflow, our informative prior is very close to the “geophysical prior” of Martins and Stedinger (2000), which is often used to restrict the range of shape parameters based on previous hydrological experience (Renard et al. 2013). The latter prior was not preferred because it is bounded to the interval (−0.5, 0.5), while the distribution of Papalexiou and Koutsoyiannis (2013) allows more extreme shape values with a low probability.

Five chains of 14 000 simulations, with the first half discarded as warmup period, are run for all parameters. Convergence is investigated by the potential scale reduction statistic $\widehat{R}$ (Gelman and Rubin, 1992). Following Gelman (1996), we assume convergence for values of $\widehat{R}$ below 1.2. Thinning is applied to the post-warmup simulations to remove autocorrelation. Every 10th value from all chains is kept, leading to a final sample of 3500 simulations for each model and season.

## 2.3 Model selection

We apply a two-step methodology to select the optimal model among the
classical and conditional competitors. First, we assess whether the covariates
have a significant effect on our extreme streamflow models by examining the
posterior distribution of the slope *μ*_{1} of the location parameters
(Eq. 7). Conditional models are considered significant if the
zero value is not included in the 90 % posterior interval of the slope
parameter (and thus not by means of a significance test). A second criterion
is additionally adopted in order to select the distribution with the best
performance by taking into consideration that complex models with more
parameters tend to fit the data better. The deviance information criterion
(DIC) (Spiegelhalter et al., 2002) is chosen for
model selection. The DIC was preferred against two more common tools, the
Akaike information criterion (AIC; Akaike, 1974) and the Bayesian
information criterion (BIC; Schwarz, 1978), because it is based on the
posterior distribution of the model parameters and thus includes parameter
uncertainties, while the AIC and BIC are based on maximum likelihood
estimates of parameters.

The deviance, used for the calculation of the DIC, is defined as follows:

where ** θ** is the parameter vector. The DIC
is then given by the following equation:

where $\stackrel{\mathrm{\u203e}}{D}$ is the expectation of the deviance with respect
to the posterior distribution, and ${p}_{D}=\stackrel{\mathrm{\u203e}}{D}-D\left(\stackrel{\mathrm{\u203e}}{\mathit{\theta}}\right)$ is the
effective number of parameters (penalty for model complexity, following
Spiegelhalter et al., 2002). $\stackrel{\mathrm{\u203e}}{\mathit{\theta}}$ is a vector of the expectation
of parameters *θ*. Models with smaller DIC values
are preferred.

Conditional models satisfying both criteria are preferred to the classical model. The model comparison is performed in two steps: first, for each station and season, each climate-informed competitor is pairwise compared to the classical GEV distribution. Subsequently, the model with the overall best performance is identified.

## 2.4 Conditional flood quantiles

In the classical or stationary approach one can define the *n*-year return
level as the high quantile of the examined variable for which the probability
of exceedance is 1∕*n*. In this case, the same probability of exceedance is
assigned to the same events in different years. The concept of return periods can then be introduced as the reciprocal
of the probability of exceedance of a specific value or return level of the
examined variable (Cooley, 2013). In engineering practice, return periods are
often used to communicate risk and are understood either as the expected time
interval at which the examined variable exceeds a certain threshold for the
first time (average occurrence interval) or as the average of the time
intervals between two exceedances of a given threshold (average recurrence
interval) (Volpi et al., 2015). When the parameters of the distribution vary
in time, as in the nonstationary or conditional frequency analysis, a
different probability of exceedance is assigned to different years. In this
case, the concept of return periods becomes less straightforward to define.
Thus, communicating risk by means of probabilities makes more sense (Cooley,
2013). Instead of the classical return levels the term “effective” return
levels has been introduced (Gilleland and Katz, 2016), which represents the
quantiles of the conditioned distribution under consideration of a particular
value of the covariate during a given year.

Here we assess whether the consideration of climatic drivers leads to a significant alteration of flood “effective” return levels or conditional quantiles in individual years. Differences of flood quantiles during years with high and medium values of the considered circulation indices are quantified. Since the model is linear, the effect of high and low covariate values on the extreme value distribution quantiles is approximately symmetric (it would be symmetric if the seasonal indices had a symmetric distribution around zero – see Fig. S6) and thus low covariate values are not considered. The 95th and 50th quantile of the considered climate index are chosen as high and medium index values, respectively. Index quantiles are calculated for the entire period 1950–2016.

From the No-U-Turn sampling after thinning, 3500 post-warmup sets of
parameters are obtained, each corresponding to a flood quantile (for a given
probability of exceedance). The median value of all 3500 flood quantiles is
chosen as a point estimate. The median estimate was preferred to the maximum
a posteriori (MAP) estimate because it is more representative of the
posterior distribution. Based on this approach, the percent relative
difference *Y*_{p} (%) of the
two flood quantiles for a particular probability of
exceedance *p*, corresponding to the high and medium climate index quantiles,
respectively, is calculated as follows:

where *y*_{p, h} is a
flood quantile for the probability *p*, incorporating a
high value of the considered climate index (95th quantile). *y*_{p, m} is
the quantile value for the same probability *p* under consideration of the
medium (50th quantile) climate index. The analysis is performed for
probability of exceedance of 0.02 (corresponding to the 50-year return
period of the classical case).

## 2.5 Uncertainty analysis

In the previous chapters an automatic methodology for the choice of an adequate model and a discussion of flood quantiles for different covariate values is presented. However, a visual comparison of point estimates and uncertainty intervals of the classical and conditional models can be useful, since it illustrates the differences but also the plausibility and possible drawbacks of the competing models. For this reason, we plot the time series of flood quantiles for a probability of exceedance of 0.02 for selected gauges and covariates based on both the classical and the climate-informed extreme value distribution. As discussed in the previous section, the median flood quantile for a probability of exceedance of 0.02 is chosen as point estimate (median quantile curve). Uncertainty of flood quantiles is quantified by means of posterior or credibility intervals, which are the Bayesian equivalent to frequentist confidence intervals, although there exist differences in the interpretation of the two types (Renard et al., 2013; Gelman et al., 2013).

## 3.1 Spatial patterns of competing models

For all seasonal indices climate-informed models are preferred over the classical distribution for a large number of stations; percentages of preferred models (based on both the DIC and the significance of the slope of the location parameter) are shown in Table 1 and spatial patterns are mapped in Figs. 1–2. The climate-informed fits form spatial clusters that resemble the correlations between the climate indices and average seasonal precipitation (Figs. S1–S4), while a relation with the correlations of seasonal mean temperature is not straightforward. Particularly for NAO a dipole pattern is evident in winter, with a positive influence on extreme discharge in northern and central Europe and a negative relationship south of the Alps (Fig. 1). The intra-annual shift of the NAO pressure centers is well captured. The positive influence of NAO on flood magnitudes during summer is only detected for northern Scandinavia (Fig. 2). Similar dipole structures, resembling the correlations with seasonal mean precipitation, are found for other indices. However, there are some deviations from the precipitation patterns. For example, contradicting results are found in Scandinavia during spring and summer for the SCA index. Scandinavian rivers usually have small catchments and are fed by snowmelt in particular in spring; subsequently, in this area, both temperature and precipitation are important for runoff generation. An opposite sign between correlations with precipitation and the slope of the location parameter can also be found during autumn in northeastern Germany for the EA index.

NAO is the covariate with the highest number of significant fits in winter (46 %) and autumn (31 %) and EA in spring (32 %) and summer (18 %). High percentages of preferred climate-informed models are also found for EA and SCA in winter, which is the season where most indices are characterized by their strongest influence on the European climate (Table 1). The worst overall results are found for EA/WR in spring (3 %) and POL in summer (7 %). It can be argued that these two latter cases could occur solely by chance or due to spatial correlation of nearby flood time series; however, results are coherent in space and cover large regions, which suggests a real influence of the circulation modes on the location parameter of the extreme value distributions, restricted though to certain subregions of Europe.

Similar spatial patterns are obtained from the same analysis if monthly covariates during the month of the seasonal discharge peaks are examined (Figs. S7–S8). Clusters of stations with positive or negative slopes of the location parameter agree with those for seasonal indices; however, in most cases the percentages of preferred fits are lower for the monthly covariates, with EA/WR in spring being an exception. In particular, the role of NAO in winter and autumn and of EA during the rest of the seasons is less pronounced in the monthly-scale analysis. NAO and SCA are the covariates with the highest number of preferred fits in spring and EA together with EA/WR in summer and autumn (Table 2). Regarding the spatial patterns of preferred fits, deviations from those for seasonal covariates can be found for EA/WR, SCA and POL during spring and summer.

For all indices examined, a percentage of stations between 5 % and 13 %, depending on the season and the covariate, are characterized by lower DIC for the climate-informed model, although the slope of the location parameter is not significant (illustrated as yellow points in Figs. 1 and 2). Only a few station records, up to three per season and index (not shown in Figs. 1 and 2), are characterized by higher DIC value for the climate-informed model without showing a significant slope. These results indicate that DIC is a weaker criterion for model selection than the slope significance at 10 % level.

In order to illustrate the spatial structure of the best models, the preferred model (classical or climate-informed) is mapped in Figs. 3 and 4 for each station for seasonal covariates. Spatial patterns do not resemble the pattern of significant fits for separate indices (Figs. 1, 2), since the influence of the selected climate modes on flood frequencies is overlapping for some regions and some of the indices are correlated for particular seasons (Table S1). Winter (summer) is the season with the highest (lowest) overall percentage of preferred climate-informed models: 77 % and 38 %, respectively. In winter, NAO is the most influential climate mode, being preferred over the other modes for 28 % of the gauges. The largest influence of NAO on flood frequencies is detected in central Europe, Great Britain, parts of Scandinavia and the Iberian Peninsula (Fig. 3). The first three regions also show a high fraction of SCA-influenced models, which points towards a joint effect of NAO and SCA during winter. The two indices are significantly correlated during this season (Table S1). EA is identified as the best covariate in winter for Great Britain. In spring an expansion of the EA influence towards central Europe is detected. The NAO influence is shifted to the south during the transition seasons (spring and autumn) and is completely dissolved in summer. Patterns for SCA are heterogeneous throughout the year. The same results but for monthly covariates are shown in Figs. S7 and S8. Spatial patterns resemble those for seasonal covariates. Percentages of preferred climate-informed models are included in Tables 1 and 2.

## 3.2 Conditional quantiles and uncertainty analysis

In the previous section it is shown that models with monthly covariates do not outperform those with seasonal covariates for most indices and seasons. Hence, quantiles of climate indices are calculated at the seasonal scale only (Table 3). Figures 5 and 6 show the relative differences of seasonal flood quantiles for a probability of exceedance of 0.02 between a (hypothetical) year with a climate index value equal to the 95th index quantile and a year with an index value equal to the median. For a probability of exceedance of 0.02, relative differences higher than 20 % and up to 22 % are detected in winter for NAO. For the rest of the seasons, maximum relative differences are lower than 20 % with highest values for EA/WR in autumn (marginally below 20 %). In spring and summer the highest value is considerably lower, between 11 % and 13 % for NAO and SCA in spring and EA and SCA in summer.

A difference of 5 %–10 % is quite common for NAO in winter. For example, a
station with a positive slope of the location parameter and a probability of
exceedance of 0.02 for a maximum seasonal discharge value of 600 m^{3} s^{−1} during years
characterized by a medium NAO index has an effective return
level between 630 and 660 m^{3} s^{−1} during years with a highly positive NAO
state. Particularly for Great Britain and Scandinavia, high relative
differences, positive or negative, are found in winter for different
indices. Differences of extreme discharge higher than 10 % are
characteristic for variations of the EA index in southeastern Britain and
for EA/WR in Norway. Some stations with high differences are also found in
Norway and northern Britain for NAO and SCA in spring. Summer is
characterized by low relative differences, below 5 % for most stations. On
the contrary, in autumn clusters of stations with medium to high
differences, positive or negative (higher than 5 % and locally exceeding
10 %), are found in Scandinavia for NAO and EA/WR; in all of northern
Europe for EA; and in the Alpine region, southern Great Britain and Norway
for SCA.

The high relative differences of flood quantiles could partly reflect differences in catchment size or unreasonable posterior values of the shape parameter. A link with catchment size was, however, not found (not shown). Posterior shapes for all seasons and indices were further analyzed. Summary statistics of the median shape from the posterior distribution of each fitted model are given in Table 4. Little deviation is observed for different models (classical or climate-informed) during the same season but some inter-season variation is present. No unreasonable values are observed, and thus we assume that the use of an informative prior distribution for shape adequately restricts the posterior distributions to reasonable limits.

The results for three selected gauges with high relative differences
*Y*_{0.02} are presented in detail. The selected stations cover different
characteristic combinations with regard to the investigated season and the
considered covariate. The time series of discharge values with a probability
of exceedance of 0.02 are illustrated for the classical case and the
climate-informed case for the three indices with the lowest DIC (Fig. 7).
Conditional quantiles are calculated on a year-to-year basis, based on the
observed values of the selected climate indices. Details about the
streamflow gauges and the climate-informed fits are given in Tables 5 and 6,
respectively. Results show that the conditional and unconditional point
estimates and uncertainty bounds can differ considerably, particularly for
models with a high relative difference *Y*_{0.02} and a low DIC (subplots A1,
B1 and C1 in Fig. 7). Obviously results from the conditional models vary
with time. For example, for the station Asbro 3 in Sweden, strongly
different results are obtained by the classical and the NAO-conditional
model in winter, particularly for the period 1960–1970, which was dominated
by negative NAO conditions and reduced winter precipitation amounts over
northern Europe. The same applies for the station Teston in Great Britain
during the period 1960–1980, if EA is considered as a covariate. These
results show that the climate-informed models can modulate the estimated
flood risk for single years or longer periods and thus substantially deviate
from the estimation based on the classical distributions. For models
characterized by small relative differences or insignificant slopes of the
location parameter (subplots A3, B3 and C3), conditional uncertainty bounds
tend to converge to a straight line resembling the classical case. The
classical case is theoretically a subcase of the climate-informed model.
However, the two models are fitted independently and the two intervals do
not always overlap.

The uncertainty bounds of the climate-informed fits can be narrower or wider
than those of the classical model. They are also asymmetric, contrary to
uncertainty bounds that result from a method using a normal approximation.
Asymmetric intervals are associated with the shape parameter of the GEV distribution and
are not uncommon (see for example Zeng et al., 2017). The range of
uncertainty bounds reflects an interplay between model complexity and the
additional information provided by the more complex models. In Fig. 7,
uncertainty bounds are narrower in the case of the “best” conditional
models (e.g., subplot A1). Uncertainty increases when extrapolations are made
towards high and low index values. This can be more easily observed in Fig. 8. For the classical case, the range is about 94 m^{3} s^{−1}. For the
climate-informed case and NAO =0 (close to its median value) the range is
around 70 m^{3} s^{−1}. The range increases to 74 m^{3} s^{−1} for NAO =1 and
to 80 m^{3} s^{−1} for the most extreme observed NAO value (NAO $=-\mathrm{2.1}$). For
a NAO value around $\mathrm{3}/-\mathrm{3}$ the range of uncertainty bounds reaches that of the
classical model.

This study explored whether a climate-informed flood frequency analysis provides insights and can improve the estimation of flood probabilities at the European scale. A site-specific model using a Bayesian framework was developed, and five Euro-Atlantic circulation modes were investigated as potential covariates: the North Atlantic Oscillation (NAO), the east Atlantic pattern (EA), the east Atlantic–western Russian pattern (EA/WR), the Scandinavia pattern (SCA) and the polar–Eurasian pattern (POL). Streamflow was analyzed at a seasonal timescale in order to account for the variable influence of the circulation modes on the European climate during different seasons of the year. Covariates were averaged and examined at both seasonal and monthly scales, contemporaneous to the season or month of the seasonal streamflow maxima, respectively.

The developed climate-informed models were compared to the classical GEV distribution with time-invariant parameters. For most seasons and covariates investigated, the climate-informed models were preferred over the classical GEV distribution for a high percentage of stations (around 20 % on average), with best results found in winter for NAO and EA, in spring for EA, and in autumn for NAO (Table 1). Results were shown to be coherent in space, indicating that certain regions are influenced by particular circulation modes (Figs. 1–4). In winter 77 % of the stations were found to be influenced by one of the climate modes, which indicates high potential for an improvement of flood probability estimations by including climate information in extreme value statistics. On the contrary, less than half of the stations examined were significantly affected by at least one of the five large-scale indices during summer season, indicating a rather convective and nonpredictable precipitation regime (Table 1).

Based on the variability of the circulation indices, we identified regions that are characterized by preferred climate-informed fits and by steep slopes of the location parameter. For models with significant slopes, variations of the climate indices lead to highly varying flood quantile estimations for the same probability of exceedance. Particularly for northwestern Scandinavia and the British Isles, variations of the climate indices result in considerably different extreme value distributions and thus highly different flood estimates for individual years (Figs. 5–6). This difference in estimates could be partly a result of unreasonable posterior values of the shape parameter; however, the use of an informative prior distribution for shape adequately restricts the posterior distributions to reasonable limits. Plots of extreme streamflow under consideration of a probability of exceedance of 0.02 indicate that the deviation between the classical and climate-informed analysis concerns not only single years but can also persist for longer time periods (Fig. 7), which reflects the decadal-scale variability of NAO and other large-scale circulation indices (Fig. S5).

Although the circulation indices examined are characterized by high intra-seasonal variability, the seasonally averaged indices provided in most cases better fits compared with monthly values (Tables 1–2). This should be emphasized, since extreme precipitation events are most likely more closely related to monthly circulation states, which better represent the moisture fluxes into the target domain. On the contrary, the catchment wetness before the flood event is likely to be influenced by the seasonal mean circulation and the associated precipitation sums. Hence, our result suggests that the skill of climate-informed extreme value distributions is to a significant extent a consequence of the important link between catchment wetness and flooding. Thus we assume, in line with recent studies (Blöschl et al., 2017; Merz et al., 2018; Schröter et al., 2015), that in many regions of Europe, catchment wetness plays an important role for flood generation.

For the selection of the best model among the classical and climate-informed
models, two criteria were adopted: the DIC and the significance of the slope of the
location parameter *μ*_{1}. For all indices and seasons, the DIC
favored the climate-informed models over the classical distribution for a
larger number of stations compared to the slope significance. DIC has
received some criticism for not adequately penalizing complex models and
tending to choose overfitted models (Silva et al., 2017;
Spiegelhalter et al., 2014). Our results show that at least compared to the
slope significance, DIC is a weaker criterion for model selection. A
criterion comprising a higher penalty term for model complexity could
alternatively be adopted. A more conservative version of DIC has been
proposed by Ando (2011) but is not commonly used until today (Silva et al.,
2017).

The described methodology can be complemented in several ways.

- a.
*Regional framework*. In this study, a local, site-specific flood frequency model was developed. This model allowed spatial coherence in relations between streamflow extremes and large-scale atmospheric patterns to be identified. However, a shortcoming of this methodology is the high uncertainty of streamflow estimates for high probabilities of exceedance (corresponding for example to the 100- or 200-year flood). Instead of a local framework, a regional framework can alternatively be implemented. The latter, by considering all available streamflow information in a region, decreases uncertainty and offers the possibility of improving streamflow quantile estimation. - b.
*Alternative models*. A linear relationship was assumed between streamflow extremes and the large-scale atmospheric indices. This is a simplification of reality and some relations may be over- or underestimated due to existing nonlinearities in the climate–streamflow system. More complex models, in particular nonlinear models, would also be possible candidates for describing the relation between climate indexes and flood probabilities. However, with increasing model complexity, the chances of model overfitting also increase. In this study we assumed a symmetric influence of the positive and negative phases of the climate indices. However, an asymmetric relation may better describe the effect of certain climate modes on streamflow extremes. For example, Sun et al. (2014) used an asymmetric piecewise-linear regression to account for the different effects of El Niño and La Niña on rainfall extremes in southeastern Queensland, Australia. Furthermore, we also assumed a varying location parameter and constant scale parameter. A constant coefficient of variation as in Serago and Vogel (2018) would also be possible and as parsimonious as our model. In this case, a varying scale parameter linked to the location parameter would need to be implemented. - c.
*Number of covariates*. Single covariate models were developed, focusing on the separate effect of each individual climate mode. The methodology can be extended to a model considering several covariates at the same time. In that case, dependencies between the covariates, if existent, should be taken into consideration. López and Francés (2013) overcame this problem by using the principal components of climatic indices as covariates for the flood frequency analysis. This, however, increases the model complexity considerably and thus the chances of model overfitting. This needs to be considered in developing models with multiple covariates. - d.
*Contemporaneous and lagged relationships*. In this paper we considered contemporaneous relationships between streamflow extremes and pressure modes that directly shape the European climate and hydrology. However, lagged relationships may prove more useful for flood risk management and the (re-)insurance industry, since they would allow forecasts of temporal variable flood quantiles for the following month or season. The contemporaneous streamflow–covariate setup presented here can be used, together with a seasonal prediction of indices, for an ahead-of-season forecast of streamflow quantiles. In this case covariate uncertainty must also be considered. A second possibility is to operate the presented model in a forecast mode under consideration of different time lags between selected covariates and observed streamflow maxima. Our results suggest that catchment wetness has an important role in shaping seasonal maximum streamflow. In a follow-up study, we will systematically test the skill of various predictor variables, describing both the climate and catchment state, in forecasting runoff extremes in Europe.

The GRDC discharge dataset was obtained from the Global Runoff Data Centre, 56068 Koblenz, Germany (https://www.bafg.de/GRDC/EN, last access: October 2017), and is available upon request. Time series of monthly circulation indices were retrieved from the Climate Prediction Center (CPC) of the National Oceanic and Atmospheric Administration (NOAA) and can be accessed through http://www.cpc.ncep.noaa.gov/data/teledoc/telecontents.shtml (last access: May 2017). Additional discharge data from Spain and Portugal were provided upon request by Luis Mediero and for Pontelagoscuro, Italy, by Alessio Domeneghetti. Gridded pressure data were extracted from the NCEP/NCAR Reanalysis dataset and are provided through http://www.esrl.noaa.gov/psd/ (last access: February 2016). Gridded temperature and precipitation data were extracted from the CRU TS3.24 dataset from the climatic research unit (CRU, https://crudata.uea.ac.uk/cru/data/hrg/, last access: February 2016) of the University of East Anglia.

The supplement related to this article is available online at: https://doi.org/10.5194/hess-23-1305-2019-supplement.

BM conceived the original idea, and all co-authors designed the overall study. ES developed the model code with contributions from XS, performed the analysis and prepared the paper. All co-authors contributed to the interpretation of the results and writing of the paper.

The authors declare that they have no conflict of interest.

The authors are grateful to the three reviewers, Alberto Viglione, Elena
Volpi and Francesco Marra, for their helpful comments and suggestions that
substantially improved the paper. Alessio Domeneghetti is thanked for
providing unpublished discharge data from Italy and Luis Mediero for
providing discharge data from Spain and Portugal. Daniel Beiter is thanked
for his support in coding and parallel computing. Xun Sun is supported by
the National Key R&D Program of China (no. 2017YFE0100700) and Shanghai
Pujiang Program (no. 17PJ1402500). This study was conducted in the frame of
the projects “Conditional flood frequency analysis: exploring the link of
flood frequency to catchment state and climate variations” and “The link
of flood frequency to catchment state and climate variations”, two joint
research initiatives between AXA Global P&C and GFZ, Potsdam. The authors
wish to acknowledge the AXA Research Fund for financial support.

The article processing charges for this open-access

publication were covered by a Research

Centre of the Helmholtz Association.

Edited by: Nadav Peleg

Reviewed by: Elena Volpi, Francesco Marra, and Alberto Viglione

Akaike, H.: New look at statistical-model identification, IEEE T. Automat. Contr., 19, 716–723, 1974.

Ando, T.: Predictive bayesian model selection, Am. J. Math.-S., 1–2, 13–38, 2011.

Barnston, A. G. and Livezey, R. E.: Classification, seasonality and persistence of low-frequency atmospheric circulation patterns, Mon. Weather Rev., 115, 1083–1126, 1987.

Bartolini, E., Claps, P., and D'Odorico, P.: Connecting European snow cover variability with large scale atmospheric patterns, Adv. Geosci., 26, 93–97, https://doi.org/10.5194/adgeo-26-93-2010, 2010.

Blöschl, G., Hall, J., Parajka, J., Perdigão, R. A. P., Merz, B., Arheimer, B., Aronica, G. T., Bilibashi, A., Bonacci, O., Borga, M., Čanjevac, I., Castellarin, A., Chirico, G. B., Claps, P., Fiala, K., Frolova, N., Gorbachova, L., Gül, A., Hannaford, J., Harrigan, S., Kireeva, M., Kiss, A., Kjeldsen, T. R., Kohnová, S., Koskela, J. J., Ledvinka, O., Macdonald, N., Mavrova-Guirguinova, M., Mediero, L., Merz, R., Molnar, P., Montanari, A., Murphy, C., Osuch, M., Ovcharuk, V., Radevski, I., Rogger, M., Salinas, J. L., Sauquet, E., Šraj, M., Szolgay, J., Viglione, A., Volpi, E., Wilson, D., Zaimi, K., and Živković, N.: Changing climate shifts timing of European floods, Science, 357, 588–590, https://doi.org/10.1126/science.aan2506, 2017.

Bueh, C. and Nakamura, H.: Scandinavian pattern and its climatic impact, Q. J. Roy. Meteor. Soc., 133, 2117–2131, https://doi.org/10.1002/qj.173, 2007.

Casanueva, A., Rodríguez-Puebla, C., Frías, M. D., and González-Reviriego, N.: Variability of extreme precipitation over Europe and its relationships with teleconnection patterns, Hydrol. Earth Syst. Sci., 18, 709–725, https://doi.org/10.5194/hess-18-709-2014, 2014.

Claud, C., Duchiron, B., and Terray, P.: Associations between large-scale atmospheric circulation and polar low developments over the North Atlantic during winter, J. Geophys. Res.-Atmos., 112, 1–16, https://doi.org/10.1029/2006JD008251, 2007.

Coles, S.: An Introduction to Statistical Modeling of Extreme Values, Springer, London, 2001.

Comas-Bru, L. and McDermott, F.: Impacts of the EA and SCA patterns on the European twentieth century NAO-winter climate relationship, Q. J. Roy. Meteor. Soc., 140, 354–363, https://doi.org/10.1002/qj.2158, 2014.

Cooley, D.: Return periods and return levels under climate change, in Extremes in a Changing Climate, 97–114, Springer, Amsterdam, the Netherlands, 2013.

Criado-Aldeanueva, F. and Soto-Navarro, F. J.: The Mediterranean Oscillation Teleconnection Index: Station-Based versus Principal Component Paradigms, Adv. Meteorol., 2013, 1–10, https://doi.org/10.1155/2013/738501, 2013.

Delgado, J. M., Apel, H., and Merz, B.: Flood trends and variability in the Mekong river, Hydrol. Earth Syst. Sci., 14, 407–418, https://doi.org/10.5194/hess-14-407-2010, 2010.

Delgado, J. M., Merz, B., and Apel, H.: A climate-flood link for the lower Mekong River, Hydrol. Earth Syst. Sci., 16, 1533–1541, https://doi.org/10.5194/hess-16-1533-2012, 2012.

Delgado, J. M., Merz, B., and Apel, H.: Projecting flood hazard under climate change: an alternative approach to model chains, Nat. Hazards Earth Syst. Sci., 14, 1579–1589, https://doi.org/10.5194/nhess-14-1579-2014, 2014.

Dünkeloh, A. and Jacobeit, J.: Circulation dynamics of Mediterranean precipitation variability 1948–98, Int. J. Climatol., 23, 1843–1866, https://doi.org/10.1002/joc.973, 2003.

Gelman, A.: Inference and monitoring convergence, in: Markov chain Monte Carlo in practice, edited by: Gilks, W. R., Richardson, S., and Spiegelhalter, D. J., Chapman & Hall, New York, 131–143, 1996.

Gelman, A. and Rubin, D. B.: Inference from Iterative Simulation Using Multiple Sequences, Stat. Sci., 7, 457–472, https://doi.org/10.1214/ss/1177011136, 1992.

Gelman, A., Carlin, J. B., Stern, H. S., Dunson, D. B., Vehtari, A., and Rubin, D. B.: Bayesian Data Analysis, 3rd edition, Chapman & Hall/CRC, London, 2013.

Gilleland, E. and Katz, R. W.: extRemes 2.0: An Extreme Value Analysis Package in R, J. Stat. Softw., 72, https://doi.org/10.18637/jss.v072.i08, 2016.

Guimarães Nobre, G., Jongman, B., Aerts, J., and Ward, P. J.: The role of climate variability in extreme floods in Europe, Environ. Res. Lett., 12, 084012, https://doi.org/10.1088/1748-9326/aa7c22, 2017.

Hirschboeck, K. K.: Flood hydroclimatology, in: Flood geomorphology, edited by: Baker, V. R., 27–49, Wiley-Interscience, New York, 1988.

Hoffman, M. D. and Gelman, A.: The No-U-Turn sampler: adaptively setting path lengths in Hamiltonian Monte Carlo, J. Mach. Learn. Res., 15, 1593–1623, 2014.

Hurrell, J. W. and Deser, C.: North Atlantic climate variability: The role of the North Atlantic Oscillation, J. Marine Syst., 78, 28–41, https://doi.org/10.1016/j.jmarsys.2008.11.026, 2009.

Iglesias, I., Lorenzo, M. N., and Taboada, J. J.: Seasonal Predictability of the East Atlantic Pattern from Sea Surface Temperatures, edited by: Dias, J. M., PLoS One, 9, e86439, https://doi.org/10.1371/journal.pone.0086439, 2014.

Katz, R. W., Parlange, M. B., and Naveau, P.: Statistics of extremes in hydrology, Adv. Water Resour., 25, 1287–1304, https://doi.org/10.1016/S0309-1708(02)00056-8, 2002.

Khaliq, M. N., Ouarda, T. B. M. J., Ondo, J. C., Gachon, P., and Bobée, B.: Frequency analysis of a sequence of dependent and/or non-stationary hydro-meteorological observations: A review, J. Hydrol., 329, 534–552, https://doi.org/10.1016/j.jhydrol.2006.03.004, 2006.

Kiem, A. S., Franks, S. W., and Kuczera, G.: Multi-decadal variability of flood risk, Geophys. Res. Lett., 30, 1035, https://doi.org/10.1029/2002GL015992, 2003.

Koutsoyiannis, D. and Montanari, A.: Negligent killing of scientific concepts: the stationarity case, Hydrol. Sci. J., 60, 1174–1183, https://doi.org/10.1080/02626667.2014.959959, 2015.

Krichak, S. O. and Alpert, P.: Decadal trends in the east Atlantic-west Russia pattern and Mediterranean precipitation, Int. J. Climatol., 25, 183–192, https://doi.org/10.1002/joc.1124, 2005.

Kwon, H.-H., Brown, C., and Lall, U.: Climate informed flood frequency analysis and prediction in Montana using hierarchical Bayesian modeling, Geophys. Res. Lett., 35, L05404, https://doi.org/10.1029/2007GL032220, 2008.

López, J. and Francés, F.: Non-stationary flood frequency analysis in continental Spanish rivers, using climate and reservoir indices as external covariates, Hydrol. Earth Syst. Sci., 17, 3189–3203, https://doi.org/10.5194/hess-17-3189-2013, 2013.

Mariotti, A., Zeng, N., and Lau, K.-M.: Euro-Mediterranean rainfall and ENSO – a seasonally varying relationship, Geophys. Res. Lett., 29, 1621, https://doi.org/10.1029/2001GL014248, 2002.

Martins, E. S. and Stedinger, J. R.: Generalized maximum-likelihood generalized extreme-value quantile estimators for hydrologic data, Water Resour. Res., 36, 737–744, https://doi.org/10.1029/1999WR900330, 2000.

Martin-Vide, J. and Lopez-Bustins, J.-A.: The Western Mediterranean Oscillation and rainfall in the Iberian Peninsula, Int. J. Climatol., 26, 1455–1475, https://doi.org/10.1002/joc.1388, 2006.

Mediero, L., Santillán, D., Garrote, L., and Granados, A.: Detection and attribution of trends in magnitude, frequency and timing of floods in Spain, J. Hydrol., 517, 1072–1088, https://doi.org/10.1016/j.jhydrol.2014.06.040, 2014.

Mediero, L., Kjeldsen, T. R., Macdonald, N., Kohnova, S., Merz, B., Vorogushyn, S., Wilson, D., Alburquerque, T., Blöschl, G., Bogdanowicz, E., Castellarin, A., Hall, J., Kobold, M., Kriauciuniene, J., Lang, M., Madsen, H., Onuşluel Gül, G., Perdigão, R. A. P., Roald, L. A., Salinas, J. L., Toumazis, A. D., Veijalainen, N., and Þórarinsson, Ó.: Identification of coherent flood regions across Europe by using the longest streamflow records, J. Hydrol., 528, 341–360, https://doi.org/10.1016/j.jhydrol.2015.06.016, 2015.

Merz, B., Aerts, J., Arnbjerg-Nielsen, K., Baldi, M., Becker, A., Bichet, A., Blöschl, G., Bouwer, L. M., Brauer, A., Cioffi, F., Delgado, J. M., Gocht, M., Guzzetti, F., Harrigan, S., Hirschboeck, K., Kilsby, C., Kron, W., Kwon, H.-H., Lall, U., Merz, R., Nissen, K., Salvatti, P., Swierczynski, T., Ulbrich, U., Viglione, A., Ward, P. J., Weiler, M., Wilhelm, B., and Nied, M.: Floods and climate: emerging perspectives for flood risk assessment and management, Nat. Hazards Earth Syst. Sci., 14, 1921–1942, https://doi.org/10.5194/nhess-14-1921-2014, 2014.

Merz, B., Dung, N. V., Apel, H., Gerlitz, L., Schröter, K., Steirou, E., and Vorogushyn, S.: Spatial coherence of flood-rich and flood-poor periods across Germany, J. Hydrol., 559, 813–826, https://doi.org/10.1016/j.jhydrol.2018.02.082, 2018.

Montanari, A. and Koutsoyiannis, D.: Modeling and mitigating natural hazards: Stationary is immortal, Water Resour. Res., 50, 9748–9756, https://doi.org/10.1002/2014WR016092, 2014.

Moore, G. W. K. and Renfrew, I. A.: Cold European winters: interplay between the NAO and the East Atlantic mode, Atmos. Sci. Lett., 13, 1–8, https://doi.org/10.1002/asl.356, 2012.

Papalexiou, S. M. and Koutsoyiannis, D.: Battle of extreme value distributions: A global survey on extreme daily rainfall, Water Resour. Res., 49, 187–201, https://doi.org/10.1029/2012WR012557, 2013.

Renard, B. and Lall, U.: Regional frequency analysis conditioned on large-scale atmospheric or oceanic fields, Water Resour. Res., 50, 9536–9554, https://doi.org/10.1002/2014WR016277, 2014.

Renard, B., Sun, X., and Lang, M.: Bayesian methods for non-stationary extreme value analysis, in: Extremes in a Changing Climate: Detection, Analysis and Uncertainty, Water Science and Technology Library, edited by: AghaKouchak, A., Easterling, D., Hsu, K., Schubert, S., and Sorooshian, S., Springer, the Netherlands, 39–95, 2013.

Rust, H. W., Richling, A., Bissolli, P., and Ulbrich, U.: Linking teleconnection patterns to European temperature – a multiple linear regression model, Meteorol. Z., 24, 411–423, https://doi.org/10.1127/metz/2015/0642, 2015.

Schröter, K., Kunz, M., Elmer, F., Mühr, B., and Merz, B.: What made the June 2013 flood in Germany an exceptional event? A hydro-meteorological evaluation, Hydrol. Earth Syst. Sci., 19, 309–327, https://doi.org/10.5194/hess-19-309-2015, 2015.

Schwarz, G.: Estimating the dimension of a model, Ann. Stat, 6, 461–464, 1978.

Serago, J. M. and Vogel, R. M.: Parsimonious nonstationary flood frequency analysis, Adv. Water Resour., 112, 1–16, https://doi.org/10.1016/j.advwatres.2017.11.026, 2018.

Serinaldi, F. and Kilsby, C. G.: Stationarity is undead: Uncertainty dominates the distribution of extremes, Adv. Water Resour., 77, 17–36, https://doi.org/10.1016/j.advwatres.2014.12.013, 2015.

Serinaldi, F., Kilsby, C. G., and Lombardo, F.: Untenable nonstationarity: An assessment of the fitness for purpose of trend tests in hydrology, Adv. Water Resour., 111, 132–155, https://doi.org/10.1016/J.ADVWATRES.2017.10.015, 2018.

Silva, A. T., Portela, M. M., Naghettini, M., and Fernandes, W.: A Bayesian peaks-over-threshold analysis of floods in the Itajaí-açu River under stationarity and nonstationarity, Stoch. Env. Res. Risk. A, 31, 185–204, https://doi.org/10.1007/s00477-015-1184-4, 2017.

Spiegelhalter, D. J., Best, N. G., Carlin, B. P., and van der Linde, A.: Bayesian Measures of Model Complexity anf Fit, J. R. Stat. Soc. B Met., 64, 583–639, https://doi.org/10.1111/1467-9868.00353, 2002.

Spiegelhalter, D. J., Best, N. G., Carlin, B. P., and van der Linde, A.: The deviance information criterion: 12 years on (with discussion), J. R. Stat. Soc. B Met., 64, 485–493, 2014.

Stan Development Team: RStan: the R interface to Stan, R package version 2.18.2, available at: http://mc-stan.org, last access: November 2018.

Steirou, E., Gerlitz, L., Apel, H., and Merz, B.: Links between large-scale circulation patterns and streamflow in Central Europe: A review, J. Hydrol., 549, 484–500, https://doi.org/10.1016/j.jhydrol.2017.04.003, 2017.

Sun, X., Thyer, M., Renard, B., and Lang, M.: A general regional frequency analysis framework for quantifying local-scale climate effects: A case study of ENSO effects on Southeast Queensland rainfall, J. Hydrol., 512, 53–68, https://doi.org/10.1016/j.jhydrol.2014.02.025, 2014.

Sun, X., Lall, U., Merz, B., and Dung, N. V.: Hierarchical Bayesian clustering for nonstationary flood frequency analysis: Application to trends of annual maximum flow in Germany, Water Resour. Res., 51, 6586–6601, https://doi.org/10.1002/2015WR017117, 2015.

van Montfort, M. A. J. and van Putten, B.: A comment on modelling extremes?: Links between Multi-Component Extreme Value and General Extreme Value distributions, J. Hydrol., 41, 197–202, 2002.

Villarini, G., Smith, J. A., Serinaldi, F., Bales, J., Bates, P. D., and Krajewski, W. F.: Flood frequency analysis for nonstationary annual peak records in an urban drainage basin, Adv. Water Resour., 32, 1255–1266, https://doi.org/10.1016/j.advwatres.2009.05.003, 2009.

Villarini, G., Smith, J. A., Serinaldi, F., Ntelekos, A. A., and Schwarz, U.: Analyses of extreme flooding in Austria over the period 1951–2006, Int. J. Climatol., 32, 1178–1192, https://doi.org/10.1002/joc.2331, 2012.

Volpi, E., Fiori, A., Grimaldi, S., Lombardo, F., and Koutsoyiannis, D.: One hundred years of return period: Strengths and limitations, Water Resour. Res., 51, 8570–8585, https://doi.org/10.1002/2015WR017820, 2015.

Ward, P. J., Eisner, S., Flörke, M., Dettinger, M. D., and Kummu, M.: Annual flood sensitivities to El Niño-Southern Oscillation at the global scale, Hydrol. Earth Syst. Sci., 18, 47–66, https://doi.org/10.5194/hess-18-47-2014, 2014.

Wibig, J.: Precipitation in Europe in relation to circulation patterns at the 500 hPa level, Int. J. Climatol., 19, 253–269, https://doi.org/10.1002/(SICI)1097-0088(19990315)19:3<253::AID-JOC366>3.0.CO;2-0, 1999.

Zeng, H., Sun, X., Lall, U., and Feng, P.: Nonstationary extreme flood/rainfall frequency analysis informed by large-scale oceanic fields for Xidayang Reservoir in North China, Int. J. Climatol., 37, 3810–3820, https://doi.org/10.1002/joc.4955, 2017.