Informativeness of teleconnections in frequency analysis of  rainfall extremes

Magnini, Andrea; Pavan, Valentina; Castellarin, Attilio

doi:https://doi.org/10.5194/hess-29-5031-2025

Articles | Volume 29, issue 19

https://doi.org/10.5194/hess-29-5031-2025

Articles | Volume 29, issue 19

Research article

09 Oct 2025

Research article |

| 09 Oct 2025

Informativeness of teleconnections in frequency analysis of rainfall extremes

Andrea Magnini, Valentina Pavan, and Attilio Castellarin

Abstract

We propose an effective and reproducible framework to assess the informative content of teleconnections (or climate indices) for representing and modeling the frequency regime of rainfall extremes at regional scale. Our dataset consists of 680 annual maximum series of rainfall depth, with 1 and 24 h durations, located in Northern Italy. We compute at-site time series of L-moments (i.e., the mean and the L-coefficient of variation) through sliding time windows; then we discretize the study region into tiles, where L-moments time series are averaged. We compute the 30-years sliding mean for six teleconnections: North Atlantic Oscillation, Pacific Decadal Oscillation, East Atlantic – West Russia pattern (EA-WR), El Niño Southern Oscillation, Mediterranean Oscillation Index, and Western Mediterranean Oscillation Index (WeMOI). Then, we calculate L-moments-teleconnection Spearman correlations for single sites and for tiles with several resolutions, and retain correlations with p-values ≤ 0.05. We observe spatial patterns of strong correlation between several teleconnections and gridded L-moments. These spatial patterns are clearly visible at various tiles' resolutions, and may be used for setting up regional prediction models. The strongest influence is detected for the sliding mean on the WeMOI and EA-WR. Finally, we show a preliminary application of climate-informed regional frequency analysis, through a hierarchical framework, where the L-moments are modelled as functions of teleconnections. We observe high variability of teleconnection-driven predictions of rainfall percentiles, and an increase in overall goodness-of-fit of the climate-informed regional models relative to stationary models. Overall, our research suggests promising pathways for climate-informed local and regional frequency analysis of rainfall extremes, and describes a general method, that can be adapted to different geographical and climatic contexts, as well as environmental variables.

Download & links

Article (PDF, 7312 KB)

Supplement (371 KB)

Download & links

How to cite.

Received: 18 Oct 2024 – Discussion started: 28 Oct 2024 – Revised: 30 Jul 2025 – Accepted: 01 Aug 2025 – Published: 09 Oct 2025

1 Introduction

There is strong evidence that large-scale climate oscillations, also called climate indices or teleconnections, have a significant influence on a region’s climate (e.g., Bardossy and Plate, 1992; Bonsal and Shabbar, 2008; Rasouli et al., 2020). Several authors have investigated the link between teleconnections and the seasonal regime of precipitation, and found strong influence for monthly (e.g., Das et al., 2020; Romano et al., 2022) or 3-month (i.e., seasonal, e.g., Belkhiri and Krakauer, 2023; González-Pérez et al., 2022) cumulate rainfall and number of wet days (e.g., Ouachani et al., 2013; Ríos-Cornejo et al., 2015). Other authors focused on rainfall extremes, and showed how to exploit teleconnections to model the non-stationarity of the frequency regime of annual maxima (e.g., Cheng and AghaKouchak, 2014; Fauer and Rust, 2023; Ouarda et al., 2019; Ragno et al., 2018). Not only the parameters of the frequency distribution can be represented as a function of teleconnections (e.g., Johnson et al., 2025; El Adlouni and Ouarda, 2009), but also the distribution itself may change (Ouarda and Charron, 2019). Investigations on the balance between increased complexity and better reliability from non-stationary frequency analysis of rainfall extremes point out that the improvement can be worth the effort (Ouarda et al., 2020). Overall, the last findings appear to suggest the need for non-stationary frequency analysis of rainfall extremes, depending on teleconnections (e.g., Volpi et al., 2024; Nerantzaki and Papalexiou, 2022). Most of the studies on this topic show applications with a limited number of stations, where observations are abundant enough for fitting non-stationary local frequency analysis models. However, the dependence of the extreme rainfall regime on teleconnections may have regional patterns, instead of being specific for some isolated sites. In case this dependence could be described as a function of space, this would lead to non-stationary models for regional frequency analysis, similarly to the case of local frequency analysis.

The presence of regional structures in the dependence between teleconnections and rainfall has been investigated by several authors, focusing on monthly/seasonal or annual totals (Caroletti et al., 2021; Das et al., 2020; Ríos-Cornejo et al., 2015) or droughts (Romano et al., 2022). Differently, this field remains highly unexplored for rainfall extremes. This is a complex problem, as rainfall extremes have higher statistical and spatial complexity than seasonal/monthly rainfall totals or number of wet days. Thus, the correlation between teleconnections and extreme rainfall can vary significantly in sign, strength and significance across a region (see e.g., Jayaweera et al., 2023). In fact, climate may have strong local variations due to orography (Marra et al., 2021), which makes it difficult to understand which teleconnections are more relevant to a specific region. Moreover, the length and quality of the observed timeseries play an important role in the reliability of the obtained results (Martins and Stedinger, 2000; Nerantzaki and Papalexiou, 2022; Ouarda et al., 2020).

In this study, we propose a framework for assessing the informativeness of teleconnections in frequency analysis of hourly and daily rainfall extremes. In particular, we want to investigate (1) whether it is possible to delineate robust regional zonation of the dependence on teleconnections, and (2) what is the effect and suitability of teleconnection-informed regional frequency analysis. Accordingly, the study and proposed framework are structured in two parts: correlation analysis and regional frequency analysis. Our study area is North-Central Italy, where 680 timeseries with at least 30 years of records are available. We focus on annual maxima of precipitation with duration of 1 and 24 h. In the first part of the research, we consider six teleconnection patterns with proven influence on the rainfall regime in the study area (Caroletti et al., 2021; Criado-Aldeanueva and Soto-Navarro, 2020; Krichak et al., 2014): North Atlantic Oscillation (NAO), Pacific Decadal Oscillation (PDO), East Atlantic – West Russia pattern (EA-WR), El Niño Southern Oscillation (ENSO), Mediterranean Oscillation Index (MOI), and Western Mediterranean Oscillation Index (WeMOI).

Differently from other studies, we do not perform our correlation analysis on the raw timeseries of the teleconnections and annual maxima. Here, two strategies are simultaneously adopted, that is to aggregate the data temporally and spatially. First, we consider sliding time windows, which allows us to (a) account only for long-term variability components, and (b) consider the variation of the timeseries’ statistics during the recorded period. In particular, we consider two linear moments (or L-moments, see Hosking and Wallis, 1997): the mean and L-coefficient of variation (L-CV) for each station. Second, we divide the study region into tiles: within each tile, we average the at-site sample L-moments, in order to obtain timeseries of regional L-moments. Finally, we compute tile-wise the correlation between the timeseries of the L-moments and the rolling mean of the teleconnections and we define raster maps of the dependence of the mean and L-CV. Significance testing specifically addresses autocorrelation issues due to the use of sliding windows (Lun et al., 2023). In the second part, we fit polynomial relationships between the L-moments and the most influent teleconnections. By using a hierarchical approach (see e.g. Gabriele and Arnell, 1991; Castellarin et al., 2001), we define regional Generalized Extreme Value distributions (GEV, see Jenkinson, 1955) in a stationary and “doubly-stochastic” framework. The authors use the terminology “doubly-stochastic” for frequency models where the parameters depend on stochastic variables, as teleconnections (see Serinaldi and Kilsby, 2018). We test hypothesis that the GEV parameters of rainfall extremes depend on teleconnections (i.e., doubly-stochastic model) through ad-hoc Monte Carlo experiments that take spatial correlation into account (see e.g., Castellarin et al., 2024). Finally, the research is enriched by a critical discussion on the generality and reproducibility of the proposed methodology. It is shown that beside the choice of the study area, our methods are innovative and universally applicable.

2 Methodological framework

We propose an innovative and structured methodological framework for assessing the effectiveness of teleconnections-informed frequency analysis of rainfall extremes. The general methodology is structured into two phases. Some elements of the procedure, which include numerical parameters and specific functions, need to be adjusted according to the specific study case; for the sake of brevity, all of these elements will be referred to as “parameters”. The general methodology is represented in Fig. 1 and described in this Section, while the parametrization adopted for this study is detailed in Sect. 3.2.

https://hess.copernicus.org/articles/29/5031/2025/hess-29-5031-2025-f01

Figure 1Methodological framework: first phase (white background, panels a, b and c) and second phase (coloured background, panel d). Vectors (i.e., time series) are highlighted with bold, italic font, substeps are numbered and underlined.

Download

2.1 Phase 1: correlation analysis

The first part aims at (1) evaluating the possible relation between teleconnection indices and the local climate indices (i.e., extreme rainfall statistics), (2) investigating the spatial structure of the correlation, and (3) producing maps of the correlation structure over the study area. These objectives are obtained through three fundamental steps which we describe below. The first step is the definition of two sliding time windows (STWs). One STW is used for the teleconnections, with width w_tel years. Over this STW, the mean of the teleconnection μ_tel is computed. Another STW is used for the Annual Maximum Series (AMS), with width w_AMS years. Over this STW, the at-site mean (μ) and L-coefficient of variation (L-CV, see Hosking and Wallis, 1997) of the AMS of rainfall depths are computed (see Fig. 1a). Thus, for each gauging station (st) time series of the mean (μ_st) and L-CV (L-CV_st) are obtained; these have length $n - w_{AMS} + 1$ , where n is the number of years of observations for the considered site:

\begin{matrix} (1) & μ_{st} = {μ_{1, st}, μ_{2, st}, \dots, μ_{n - w_{AMS} + 1, st}} \\ (2) & L - C V_{st} = {{L-CV}_{1, st}, {L-CV}_{2, st}, \dots {L-CV}_{n - w_{AMS} + 1, st}} \end{matrix}

where μ_1,st, μ_2,st and $μ_{n - w_{AMS} + 1, st}$ represent the mean computed over the first, second and last time-steps defined by the STW at site st; the same notation is used for the L-CV. The value (i.e., mean or L-CV) computed at each time-step of the STW is conventionally assigned to the last year of the interval.

The second step is the discretization of the spatial domain into single tiles (or cells, or pixels) that do not overlap with each other. The spatial resolution is w_g. For each single tile (g), the timeseries μ_st and L-CV_st of the gauged sites within the tile are averaged yearly (see Fig. 1b). Thus, regional timeseries (μ_g and L-CV_g) for each tile are obtained:

\begin{matrix} (3) & μ_{g} = {\frac{\sum_{i = 1}^{n_{1}} μ_{1, i}}{n_{1}}, \frac{\sum_{i = 1}^{n_{2}} μ_{2, i}}{n_{2}}, \dots, \frac{\sum_{i = 1}^{n_{n - w_{AMS} + 1}} μ_{n - w_{AMS} + 1, i}}{n_{n - w_{AMS} + 1}}} \\ (4) & \begin{aligned} L - C V_{g} & = {\frac{\sum_{i = 1}^{n_{1}} {L-CV}_{1, i}}{n_{1}}, \frac{\sum_{i = 1}^{n_{2}} {L-CV}_{2, i}}{n_{2}}, \dots, \\ \frac{\sum_{i = 1}^{n_{n - w_{AMS} + 1}} {L-CV}_{n - w_{AMS} + 1, i}}{n_{n - w_{AMS} + 1}}} \end{aligned} \end{matrix}

where n₁, n₂, … $n - w_{AMS} + 1$ , are the numbers of stations with available rainfall statistics at steps 1,2, …, $n - w_{AMS} + 1$ of the STW.

The third step is the correlation analysis (see Fig. 1c). The correlation coefficient (cc) is computed for each tile between the averaged timeseries (μ_g and L-CV_g) and the rolling mean of the considered teleconnection (μ_tel). We adopt the Spearman coefficient as it accounts also for non-linear correlations, yet preliminary experiments showed very similar results with the Pearson correlation coefficient. The significance of cc is tested according to the methodology described in Lun et al. (2023), which takes into account the possible presence of spurious correlations, resulting from the use of sliding time windows. Only stations with p-value ≤ 0.05 are considered as significantly correlated (i.e., significance at 5 %).

Finally, we estimate the robustness, or reliability, of the detected correlation signals with an ad-hoc approach. This is an important step, since the definition of the parameters described above (i.e., w_tel, w_AMS, w_g) is necessarily affected by subjectivity and uncertainty. Then, the possible presence of spatial patterns of the correlation fields is investigated through the definition of correlation maps. The methodology adopted for correlation reliability assessment and correlation maps production is detailed in Sect. 3.2.

2.2 Phase 2: regional frequency analysis

The second part of the present study aims at assessing (1) the possible effect of teleconnections on the frequency of extreme rainfall events, and (2) the potential of considering teleconnections as covariates in doubly-stochastic frequency models. For the sake of generality, the Generalized Extreme Value distribution (GEV, see Jenkinson, 1955) is considered, given its flexibility and representativeness of the frequency regime of hydrological extremes (e.g., Papalexiou and Koutsoyiannis, 2013; Salinas et al., 2014). Nevertheless, the proposed framework is not limited to the case of the GEV distribution; on the contrary, it could be easily extended and adapted to the case in which alternative theoretical frequency distributions are considered and tested against each other. The cumulative distribution function of the GEV, F_GEV(x), depending on the location, scale and shape parameters (ξ, α, k), is defined as follows:

\begin{matrix} (5) & \begin{aligned} F_{GEV} (x) & = e^{- e^{- y}}, where \\ y = \{\begin{cases} - k^{- 1} \log_{e} [1 - k (x - ξ) / α], & k \neq 0 \\ (x - ξ) / α, & k = 0 \end{cases} \end{aligned} \end{matrix}

The hierarchical method for regional frequency analysis is adopted (see e.g. Gabriele and Arnell, 1991). Accordingly, for a given gauged site, the mean is computed from the at-site records, the L-CV exploits all the time series within a tile with resolution w_L-CV where the target site is included, and the L-CS exploits all the time series within a tile with resolution w_L-CS (where w_L-CV≤w_L-CS). The values adopted in the present study for w_L-CV and w_L-CS are detailed in Sect. 3.2. First, a stationary GEV is fit (GEV₀), where the mean over the whole time series is computed at-site, and the regional L-CV and L-CS are obtained as described in Hosking and Wallis (1997) as a weighted average over their respective tiles by referring to the complete sequences of annual maxima (see Fig. 1). Second, GEV distributions whose parameters depend on teleconnections are set up. Following the suggestion by Serinaldi and Kilsby (2018), we term these distributions as doubly-stochastic (DS) models, even if we acknowledge that the most common definition in the literature is “non-stationary” models (e.g., Volpi et al., 2024, and references therein). We consider three types of doubly-stochastic models. The first type, GEV₁, adopts the same L-CV and L-CS as the GEV₀, whereas the mean varies as a function of the best teleconnection index (selected in phase 1). This function is fitted at-site, according to the hierarchical regionalization framework. The choice for its shape, f(x), is detailed in Sect. 3.2. The second type, GEV₂, adopts the same mean and L-CS as the GEV₀, while the L-CV varies as a function f(x) of the best index. This is fitted on the w_L-CV-averaged timeseries (L-CV_g), according to the hierarchical regionalization framework. The third type, GEV₃, adopts the same L-CS as the GEV₀, while the mean and L-CV are obtained with the same methods as for the GEV₁ and GEV₂, respectively. Finally, the models are compared by means of the ratio of models' likelihood (RML). This metric is usually defined as “ratio of maximum likelihood”, and is very commonly used for this purpose (e.g., Ashkar and Aucoin, 2012; Ashkar and Ba, 2017). It is defined as in Ashkar and Ba (2017):

\begin{matrix} (6) & RML = \log_{e} (\frac{{LH}_{{GEV}_{DS}}}{{LH}_{{GEV}_{0}}}) \end{matrix}

Where LH $_{{GEV}_{0}}$ and LH $_{{GEV}_{DS}}$ are the likelihood of the observed timeseries computed with the stationary and doubly-stochastic models, respectively.
Since RML is asymptotically distributed as half of a Chi-squared variable with degrees of freedom equal to the difference of the number of parameters of the two models (e.g., see Bhattacharya and Burman, 2016; Coles, 2001), a lower threshold th_RML should be considered to conclude that GEV_DS is a better fit to the data. Given that significance is tested at 5 %, the theoretical significance threshold equals half of the 95th percentile of the corrispondent Chi-squared distribution. In the present study, th_RML values are evaluated with reference to the total number of parameters of the models, consisting of those of the frequency distribution and those of f(x). Furthermore, since in our opinion these two sets of parameters do not have the same weight (see also what observed by Laio et al., 2009), we have also determined th_RML values experimentally, through a set of Monte Carlo (MC) experiments that are described in Supplement Sect. S1. The thresholds obtained with the two methods, which depend on the choice of f(x) are shown in Sect. 3.2. Finally, we test field significance of possible doubly-stochastic spatial signals in presence of cross-correlation. In fact, the number of sites with doubly-stochastic rainfall regime could be inflated by the presence of intersite dependence (see also Castellarin, 2007; Vogel et al., 2001). For this reason, we define a significance test ad-hoc, based on the comparison of n_DS, the global number of sites where a given doubly-stochastic model outperforms the stationary model, with lower thresholds th_nDS. These thresholds are estimated through MC experiments that are described in Supplement Sect. S2.

3 Study region and parameterization of the procedure

3.1 Study region and data

The study area includes most of Northern and part of Central Italy, a region characterized by great climate variability (see Fig. 2a). Two main mountain ranges are present: the Alps in the North, with a maximum elevation of about 4000 m a.s.l., and the Apennines, crossing all along continental Italy, with a maximum elevation of ∼ 2100 m a.s.l. in the study area. The largest Italian plain, the Po plain, is located at the southern border of the Alps, following the course of the Po River from the Northwest to the Northeast, where low coasts are located. We select 680 gauged stations (Fig. 2a) from the I2-RED dataset (Mazzoglio et al., 2020 a) with a minimum of 30 years of data. Thus, all the selected timeseries should be sufficiently long to show variations of the frequency regime of rainfall extremes, if these are present (see also Renard et al., 2008; Ouarda et al., 2019). For each station, we consider time series of annual maximum cumulative rainfall over 1 and 24 consecutive hours, which represent distinct events: mainly convective the former, and mainly synoptic the latter. Data have been recorded between 1921 and 2020.

https://hess.copernicus.org/articles/29/5031/2025/hess-29-5031-2025-f02

Figure 2Elevation (a, in grey color scale) and location (b) of the study area. Length of the timeseries of sub-daily annual maximum rainfall depths (a, red color scale). Black lines: Italian administrative regions.

In the study, six teleconnections are considered; a detailed description of their nature is not reported here, since an interested reader can refer to the extensive literature cited in the text. Namely, these are the North Atlantic Oscillation (NAO, see Jones et al., 1997), the East Atlantic-West Russia (EA-WR) pattern (see Barnston and Livezey, 1987), the Pacific Decadal Oscillation (PDO, see Zhang et al., 1997), El Niño Southern Oscillation (ENSO, see Chen et al., 2019), Mediterranean Oscillation Index (MOI, see Conte et al., 1991), and Western Mediterranean Oscillation Index (WeMOI, see Martin-Vide and Lopez‐Bustins, 2006). All of these indices have demonstrated significant influences on local climate in various parts of Southern Europe and the Mediterranean region (e.g., Caroletti et al., 2021; Krichak et al., 2002; Krichak and Alpert, 2005; Krichak et al., 2014; Criado-Aldeanueva and Soto-Navarro, 2020). In particular, they show a strong relationship with seasonal and annual precipitation in the study area (e.g., Luppichini and Bini, 2025). However, their connection to extreme rainfall events in Italy has not been systematically investigated, unlike in other parts of Southern Europe (e.g., see Gonzalez‐Hidalgo et al., 2025, and references therein). Alpert et al. (2002) suggest a potential link between extreme rainfall in Italy and ENSO, while Caroletti et al. (2021) examine a specific area in Southern Italy (Calabria) and find a strong influence from WeMOI and EA–WR. Other studies focusing on droughts have identified significant roles of NAO and MOI in Calabria (Coscarelli et al., 2023), and NAO and WeMOI in Central Italy (Romano et al., 2022). Although PDO is recognized as a major global climate driver affecting temperature and precipitation, studies specifically addressing its direct influence on rainfall in Europe are relatively scarce. Nonetheless, some authors have reported significant correlations between PDO and winter precipitation in Europe (e.g., Zanchettin et al., 2008), as well as with extreme rainfall events in Southern Italy (e.g., Naples; (Diodato and Bellocchi, 2018)). Time series of NAO, EA-WR, PDO and ENSO are freely accessible from the NOAA Physical Sciences Laboratory data base available at https://psl.noaa.gov/data/climateindices/list/ (last access: 25 September 2025). MOI and WeMOI can be retrieved from the University of East Anglia's Climate Research Unit (CRU; https://crudata.uea.ac.uk/cru/data/moi/, last access: 25 September 2025).

3.2 Parameterization of the procedure for the study area

The parameterization of the methodology for the study area presented below results from several preliminary experiments. The implementation of the first part of the study requires the definition of w_tel, w_AMS, w_g, and the approach for the robustness analysis (i.e., the evaluation of the reliability of the correlation coefficients computed with w_tel, w_AMS and w_g; see Sect. 2.1). First, w_tel is set to 30 years. Preliminary experiments with smaller w_tel provided similar results, yet 30 years generally smooth out short interannual oscillations, while preserving the pluridecadal climate variability. Also, shorter oscillations (i.e., < 30 years) would not be of interest for the design of hydraulic structures, while longer ones (i.e., > 30 years) would be highly uncertain to detect, due to limited length of the timeseries. Second, w_AMS is set to 10 years. This choice is also a trade off between a minimum width for the computation of the L-CV and a minimum length of the timeseries of the rainfall statistics (i.e., μ_st and L-CV_st). In fact, on the one hand w_AMS< 10 years would lead to large sampling variability of local L-CV values. On the other hand, the μ_st and L-CV_st timeseries originated from an n-long annual maxima timeseries have length of $n - (w_{AMS} - 1)$ . This means that longer w_AMS would lead to shorter μ_st and L-CV_st, which in turn would lead to a smaller number of stations where the correlation between teleconnections and rainfall statistics can be reliable. Concerning the spatial resolution of the tile size, w_g, we consider four values: 0 km (i.e., considering the single gauged stations, with no spatial discretization), 15, 30 and 50 km. This multiple choice comes from a balance. On the one hand, L-CV computed over a 10 years time window may have low accuracy, which can be addressed by averaging L-CVs from several stations within large tiles (i.e., regional prediction). On the other hand, larger tiles may be less statistically homogeneous. Moreover, averaging L-statistics over large tiles may smooth the variability of the rainfall regime, hiding local patterns where the morphology is complex. Since there is no universal guideline for finding the right trade off, we decide to consider four different values for w_g. The suitability of these values is tested by means of the Hosking and Wallis (1997) heterogeneity measure for L-CV. The results of the test (not reported here for the sake of brevity) detected “definitely heterogeneous” tiles in a very limited number of cases for all the resolutions, confirming the viability of the homogeneity hypothesis. We assess the robustness of the correlation signal at each station st through a reliability index, ri_st, defined as the sum of multiple sign functions (sign):

\begin{matrix} (7) & \begin{aligned} {ri}_{st} & = sign ({cc}_{0 km, st}) + sign ({cc}_{15 km, st}) + sign ({cc}_{30 km, st}) \\ + sign ({cc}_{50 km, st}) \end{aligned} \end{matrix}

Where cc_0 km,st is the correlation computed at station st, while cc_15 km,st, cc_30 km,st and cc_50 km,st are the correlation coefficients relative to the tiles (with w_g 15, 30 and 50 km) where st is inserted. Non-significant correlations are considered as 0. This is considered to be a measure of the spatial coherence of the correlation signal, which varies between −4 and 4. The absolute value is the coherence of the correlations at different tiles. The sign (i.e., + or −) represents the sign of the prevailing correlation. For instance, a gauged station with ri 4 indicates a significant and positive correlation when calculated at-site, as well as for 15, 30, and 50 km aggregation. On the opposite, 0 represents areas with no significant correlation or where positive and negative correlations compensate with each other (e.g. positive correlation at-site and at 15 km resolution, and negative correlation at 30 and 50 km resolutions). Finally, ri is interpolated by ordinary kriging (Hengl, 2007) to produce reliable maps of the correlation field of the mean and L-CV for 1 and 24 h.

Regarding the second part of the study, w_L-CV, w_L-CS and f(x) need to be set. The selected resolution w_L-CV is 30 km, as it is a good trade off between accuracy of the prediction (i.e., aggregating at least two at-site L-CV timeseries) and an adequate representation of regional patterns and local variability. We set w_L-CS to 100 km, based on low spatial variability of the skewness parameter (e.g., Gabriele and Arnell, 1991; Claps et al., 2022) and improved accuracy. The function f(x) between teleconnections and L-statistics is shaped as a second-order polynomial function. This form has been selected because of its simplicity (i.e., only three parameters are needed) and adaptability to the empirical data, given that the observed dependence of extreme rainfall statistics on teleconnections is often non-linear. Even though other choices are possible, the aims of the present study are mainly demonstrative of the potential of the proposed approach. Thus, the nature and selection of the best function is not part of the main focus of our analyses.

The lower thresholds th_RML considered for RML significance test at 5 % (see Sect. 2.2) are reported in Table 1. Interestingly, our Monte Carlo experiments (see details in Supplement Sect. S1) suggest that RML's diagnostic power is not affected by the duration and teleconnection considered. Also, the resulting empirical th_RML values are much lower than the theoretical ones, and less affected by the number of parameters of f(x) (compare th_RML for GEV₃ with GEV₁ and GEV₂). While important for the general field of model evaluation, a detailed discussion of these results is beyond the scope of the present study. However, they present an interesting topic for further research.

Table 1RML significance thresholds, th_RML, at 5 %.

Download Print Version | Download XLSX

4 Results

4.1 Phase 1: correlation analysis

Table 2 reports the number of gauges (at-site), or tiles (resolutions from 15 to 30 km), showing statistically significant correlations at 5 % for each teleconnection and the two statistical moments considered here (i.e., the mean, μ, and the L-coefficient of variation, L-CV) for durations of 1 and 24 h.

Table 2Number of significant correlations for all the considered teleconnections and tiles' resolutions. The numbers stand for stations in columns “at-site” and for tiles in the other columns. For each case (i.e., each semi-column), the highest number is marked with underlined bold font, while the second highest has bold font.

Download Print Version | Download XLSX

The most correlated indices are WeMOI and the EA-WR for μ, and PDO and MOI for L-CV. ENSO shows the weakest influence on both statistics, and therefore is excluded from further analysis. The overall number of significant correlations is not sufficient to fully understand the dependence relationships of μ-teleconnection or L-CV-teleconnection, nor to compare the influence of various teleconnections. Instead, our main objective is the description of the spatial correlation field, consisting of the spatial patterns of significant correlations, and their robustness to varying the resolution of the spatial aggregation (i.e., tiles' size). This is represented through the interpolated reliability index, ri, in Fig. 3. As defined in Sects. 2.1 and 3.2, areas where ri≥3 or $ri \leq - 3$ indicate “consistent” correlation. Focusing on μ, it is confirmed that WeMOI and EA-WR have a strong influence, particularly concentrated in two major patterns of consistent negative correlation in the Gulf of Genoa and in the North-Eastern Alps (panels b, d, g and i of Fig. 3). These patterns are evident in the case of 1 h duration, where the North-East shows consistent correlation with all the indices (panels a–e of Fig. 3). For the 24 h duration the two patterns for WeMOI and EA-WR are more fragmented and less extended, even tough still present, while the other indices show only small areas with strong influence on μ. Focusing on L-CV, consistent correlation patterns for 1 h are highly heterogeneous, suggesting that regional GEV₂ models could be ineffective (panels k–o of Fig. 3). More extended, yet still restrained, patterns can be observed for 24 h, in particular in the western portion of the Gulf of Genoa and in the central region of the study area (see panels p–t in Fig. 3). It is difficult to establish which index is the most influent, but PDO and MOI show high similarities and promising homogeneous patterns, in agreement with Table 2.

https://hess.copernicus.org/articles/29/5031/2025/hess-29-5031-2025-f03

Figure 3Raster maps of reliability index (ri) of the correlation between teleconnections and mean (panels a–j), or L-CV (panels k–t) of AMS of rainfall depths with duration of 1 h (panels a–e and k–o) and 24 h (panels f–j and p–t).

The results obtained for WeMOI are presented in detail in Fig. 4, which reports statistically significant Spearman correlation coefficients at different spatial resolutions for μ (panels a–h) and L-CV (panels i–p), respectively. Several stations present statistically significant correlation values with WeMOI, with signs and amplitude changing depending on the site considered (panels a and e for μ, i and m for L-CV, Fig. 4). Aggregating stations into tiles reduces spatial heterogeneity and decreases the extension of significantly correlated areas, which allows to describe the geographical pattern of the correlation field (see other panels). Both for 1 and 24 h duration, the correlation fields of extreme rainfall with μ present consistent spatial patterns, with extended areas characterized by homogeneous values (panels a–h). On the contrary, the correlation field of L-CV is more sensitive to the change of tiles' resolution and presents smaller isolated hotspots (panels i–p). Similar results are observed also for the other teleconnections, but they are not reported for the sake of brevity.

https://hess.copernicus.org/articles/29/5031/2025/hess-29-5031-2025-f04

Figure 4Spearman correlation coefficient between WeMOI and the mean [L-CV] for AMS of rainfall depths with duration of 1 and 24 h: at-site (a [i] and e [m]) and for tiles of size 15 km (b [j] and f [n]), 30 km (c [k] and g [o]), and 50 km (d [l] and h [p]). Statistically significant (at 10 %) correlation coefficients are illustrated using a blue-red color scale. Red outlines highlight correlation significant at 5 % (i.e., the ones considered for the ri). Black outlines highlight tiles where only one station is present. In panels (a) and (i), gray circles represent non-significantly correlated stations.

4.2 Phase 2: Frequency analysis

Following the adopted framework (see Sects. 2.2 and 3.2), polynomial relations are fitted at-site (i.e., μ-teleconnection) and at 30 km-tiles (i.e., L-CV-teleconnection) to obtain the parameters for GEV₁ and GEV₂ distributions, respectively, and jointly for GEV₃. Then, the goodness-of-fit is compared with the stationary framework (i.e., GEV₀) with RML (Eq. 6). For the sake of brevity, the gauged sites where RML is higher than a given RML threshold (Table 1) will be hereinafter referred to as “doubly-stochastic sites”. The number of doubly-stochastic sites in each case, n_DS, is reported in Table 3, while Fig. 5 represents their location.

Table 3Number of doubly-stochastic sites (n_DS). I.e., stations where RML ≥th_RML. The highest number is marked with bold font in each line, corresponding to a specific duration and doubly stochastic framework. Emprical and theoretical threshold are reported in Table 1.

Download Print Version | Download XLSX

https://hess.copernicus.org/articles/29/5031/2025/hess-29-5031-2025-f05

Figure 5Doubly-stochastic sites observed for GEV₁ (panels a–i), GEV₂ (panels l–o) and GEV₃ (panels u–(d1)). Black triangles represent sites where theoretical th_RML> RML ≥ empirical th_RML. In red triangles, RML ≥ theoretical th_RML. Stations where RML < empirical th_RML (i.e., stationary sites) are represented with grey dots.

Regarding GEV₁, WeMOI has the highest number of doubly-stochastic sites for 1 h, followed by EA-WR and PDO, depending on the threshold for RML (3 or 1.5, respectively, see Table 3). For 24 h, WeMOI and EA-WR are confirmed as the most influent indices. Figure 5 is quite in agreement with Fig. 3, confirming that in the two regions of the Gulf of Genoa and North-East GEV₁ fits the AMS better than GEV₀. These two regions show non-stationarity signals with all the indices (see panels a–j), but particularly with the most influent ones (panels b, d, g, i). Regarding GEV₂, WeMOI has the highest number of doubly-stochastic sites both for 1 and 24 h, followed closely by the other indices (see Table 3). In this case, detecting clear geographical patterns is difficult, but it is possible to spot some sub-regions where the signal is strong for multiple indices. Some examples are the North-East for 1 h (see panels k–o of Fig. 5), and the western portion of the Gulf of Genoa (in agreement with Fig. 3) and the south-central portion of the Adriatic coast (see panels p–t of Fig. 5). Considering the theoretical threshold (i.e., 4.7) doubly-stochastic sites for GEV₃ have a very similar spatial configuration relative to GEV₁ or GEV₂, according to the case (see panels u–d1 of Fig. 5). Hence, the numbers of GEV₃ doubly-stochastic sites in Table 3 are close to the maximum value within GEV₁ and GEV₂ for each index and duration. Differently, when the empirical threshold is considered (i.e., 1.6), the results suggest that GEV₃ doubly-stochastic sites are much more than GEV₁ and GEV₂, and located over the whole study area. As mentioned in Sect. 2.2, the field significance of non-stationary (i.e., doubly-stochastic in this context) signals in presence of intersite dependence is tested by means of lower thresholds th_nDS to the number of doubly-stochastic sites n_DS (see details of the test in Supplement Sect. S2). While WeMOI and EA-WR are clearly very effective for GEV₁ and GEV₃ distributions, it is hard to unequivocably establish the best teleconnections for GEV₂. For coherence, the test in the present study is performed for WeMOI and EA-WR for all of the three models, and the results are reported in Table 4. Regarding GEV₁, the number of doubly-stochastic sites is not significant when considering 1.5 as th_RML (with the exception of WeMOI for 1 h), while the null hypothesis (i.e., the distribution is stationary) is always rejected with a threshold of 3. Differently, for GEV₂ and GEV₃ the null hypothesis is rejected with all the thresholds.

Table 4Number of doubly-stochastic sites (i.e., sites where RML≥th_RML) in the original dataset (n_DS), and significance thresholds against spatial dependence (i.e., empirical and theoretical th_nDS). For each case, corresponding to a given duration and doubly-stochastic framework, the highest number between n_DS and th_nDS is marked with bold font.

Download Print Version | Download XLSX

Figure 6 illustrates for a specific location an example of doubly-stochastic modeling. Panels d–f show frequency curves corresponding to single “realizations” of the teleconnection of interest (i.e. WeMOI in the example, colorscale from yellow to purple). Indeed, frequency curves are not theoretically consistent with the non-stationary framework. Nevertheless, the representation is very useful to visualize the degree of variability of design rainfall depth associated with alterations of the driving teleconnection.

https://hess.copernicus.org/articles/29/5031/2025/hess-29-5031-2025-f06

Figure 6hierarchical frequency analyses of 24 h rainfall annual maximum series at a given location where the hypothesis of doubly-stochasticity in the mean (μ) and L-CV resulted to be significal at 5 % level. Upper panels: considered station (a), polynomial function of μ (b) and L-CV depending on WeMOI (c). Lower panels: expected percentiles with given return periods in stationary (black line) and doubly-stochastic framework (colored scale lines), under the assumption that the mean depends on WeMOI (GEV₁, panel d), L-CV depends on the WeMOI (GEV₂, panel e), or both mean and L-CV depend on the WeMOI (GEV₃, panel f).

As clearly illustrated in Fig. 6, the variability of predicted rainfall percentiles associated with changes in the teleconnection may be very significant. It is worth noting that WeMOI changes in the example span across observed values, no extrapolation is performed. The prediction of the 100-year 24 h rainfall depth in the selected location is equal to 240 mm according to the stationary model (GEV₀), but may vary between 200 and 240 if the empirical relationship between WeMOI and L-CV is explicitly modelled (GEV₂), between 200 and 300 if the mean 24 h annual maximum rainfall depth becomes a function of WeMOI (GEV₁), and may be as low as 170 and as high as 300 if both mean and L-CV are expressed as functions of WeMOI (GEV₃).

5 Discussion

5.1 Spatial correlation fields

The results in Sect. 4.1 clearly show that WeMOI and EA-WR have a very strong influence on the mean of extreme precipitation over the study area. This was expected, as it is in line with the observations of Caroletti et al. (2021) for Southern Italy, and Romano et al. (2022) for Central Italy. In fact, WeMOI consists of the normalized difference in atmospheric pressure between Cadiz, in the South of Spain, and Padua, in Northern Italy, and thus, it describes the formation of precipitation systems over the Tyrrhenian Sea (Lopez-Bustins et al., 2020; Redolat et al., 2019; Martin-Vide and Lopez‐Bustins, 2006). Nevertheless, EA-WR pattern consists of four atmospheric anomalies extending from North Atlantic to Western Russia (Barnston and Livezey, 1987). Negative phases, which correspond to low-pressure anomalies over Europe and Northern China and high-pressure anomalies over the Central North Atlantic and North of the Caspian Sea, have been found to drive wetter conditions across Europe and the Mediterranean (e.g., Krichak and Alpert, 2005). Moreover, EA-WR is highly correlated with WeMOI, and hence is reasonable that the two teleconnections have similar influence. In particular, the presence of significant intense negative correlation values in the Gulf of Genoa and the North-East for the mean with WeMOI (panels b and g of Fig. 3) and EA-WR (panels d and i of Fig. 3) is consistent with the known patterns of precipitation regimes over North-Central Italy. In fact, intense daily precipitation values are expected over the Tyrrhenian Coast and North-Eastern Alps in the presence of intense southwesterly flows from the Mediterranean, typical during the autumn season and favored by large scale circulation anomalies associated with negative value of WeMOI and EA-WR. In the remaining portion of the study area, precipitation systems are more complex, as influenced by the passage of cut-of lows favoring precipitation over the Southern and Eastern portion of the Apennines area. In this case, correlation patterns with WeMOI and EA-WR are expected to be more fragmented. It is very interesting to notice that although the 1 h rainfall maxima are mostly linked to convective phenomena, often characterized by a very limited spatial scale, the correlation with WeMOI and EA-WR presents more extended geographical patterns than in the daily case (compare Figs. 3b and 3d with 3g 3i).

Regarding the L-CV, the absence of evident correlation patterns for hourly annual maxima confirms our knowledge about spatial variability of convective phenomena (Fig. 3k–o). Small and fragmented, mainly positive patterns are visible for daily rainfall (Fig. 3p–t), but a physical interpretation of the L-CV-teleconnection dependence is more complex. In general, the consistency of patterns with spatial aggregation is lower than for the mean, which may be partly due to higher uncertainty in at-site computation of L-CV within a 10-year time window. For this reason, we believe that for L-CV, a spatial aggregation at 30 km should be preferred for RFA. An important element to consider when commenting Fig. 3 is that only correlations with p-value ≤ 0.05 were considered in the computation of ri (Eq. 7). Even though this threshold value is the most commonly adopted in the literature, it is still subjective, and it could lead to discard some significant correlations. This is evident when looking at the correlations with p-value within 0.1 and 0.05 in Fig. 4 (i.e., tiles without red outlines). An additional source of subjectivity is the interpolation of ri with kriging, that depends on the choice of kriging type and semivariance function. Nevertheless, based on the results described in Sect. 4, the methods adopted generally lead to a useful characterization of the teleconnection-extreme rainfall correlation field, which can be used to drive frequency analyses. In fact, sub-regions with homogeneous rainfall-teleconnection dependence may be defined, in order to set specific doubly-stochastic RFA models. This option requires to define an objective grouping criterium, possibly based on a set of morphoclimatic characteristics (see e.g., the morphoclimatic characteristics adopted in Magnini et al., 2024); due its complexity, this problem deserves specific studies, and is not further investigated in the present study.

We estimate spatial correlation fields based on two major elements: temporal and spatial aggregation of the data. First, temporal aggregation through sliding time windows allows to consider the statistics of the extreme rainfall during time, instead of the rainfall depth themselves. In this way, it is possible to filter out inter-annual variability of the seasonality and magnitude of the annual maxima and focus on the decadal precipitation statistics. Second, spatial aggregation into tiles allows to obtain more reliable values of the rainfall statistics. This produces a smoothing local effect, that could be due to data fragmentation and noise and enhance geographical pattern recognition (see Fig. 4). The choice of w_g for spatial aggregation be carefully conducted to preserve the local climatology, depending on the morphology of the study area, and the density and location of the rain gauges. Generally, considering a number of different values for w_g is useful to analyze the reliability of the detected correlation patterns. A different approach for spatial aggregation could be the one described in Castellarin et al. (2024), where overlapping tiles are used. However, including each gauged site in multiple tiles could result in eccessive smoothing of the orographic effect over the correlation field, and strenghtening of spatial dependence. Thus, this solution should be preferred only in case of scarce density of gauges network. Alternatively, not strictly orthogonal grids could be considered, after preliminary analyses.

A key aspect of the proposed approach is its high adaptability. In fact, the same methodology with an appropriate parametrization could be used to study the influcence of teleconnections on several environmental variables, such as AMS of floods or temperature or wind. Moreover, raster maps of the correlation field (as the ones in Fig. 3) could be used as descriptors of the drivers of an environmental variable, and adopted as input of predictive raster-based models (e.g., for prediction of flood susceptibility, as in Magnini et al., 2023).

5.2 Doubly-stochastic regional frequency analysis

Looking at the results of the second phase of our study, two main points are of general interest. First, the range of variability of the expected percentiles with doubly-stochastic models (i.e., non-stationary models depending on teleconnections) is very wide, confirming what observed locally by other authors (Fig. 6). Second, the regional dependency of rainfall statistics on teleconnections can be successfully exploited locally for frequency analysis (see Table 3 and Fig. 5). This is a useful improvement over present literature, as it allows to obtain extreme rainfall statistics (i.e., μ_st and L-CV_st) even where observations are not locally available. The framework we adopted for doubly-stochastic RFA is based on the strong assumption that the same type of function (i.e., a polynomial function) can represent the teleconnection-statistic relationships within all the study area. Indeed, this approximation is sufficiently accurate in some stations, while being not adatp in others. This is probably also the reason why GEV₁ models outperform GEV₀ globally more often than GEV₂ models (Table 3). In fact, the polynomial approximation may fit the data better when these are collected locally (as for the μ_tel−μ_st case) than when they result from tile-wise averaging (as for the μ_tel-L-CV_t case). Nevertheless, it is encouraging that a significant number of sites with RML ≥ 3 is detected for GEV₁ and GEV₂ (see Table 4), despite the low-complexity of the function adopted for modelling the dependence on teleconnections. Accordingly, GEV₃ models derive their goodness or badness from the sum of GEV₁ and GEV₂ contributes, which leads them to be the most adaptable models. Therefore, a high number of doubly-stochastic sites is observed when 1.6 is considered as RML significance threshold, corresponding to a small penalty for the additional parameters of f(x) (compare 1.6 and 4.7 thresholds for GEV₃ in Table 3).

It is very interesting that the comparison of ri (essentially based on Spearman correlation) and RML highlights differences in the most influent indices (i.e., compare Table 2 with Table 3) and doubly-stochastic patterns (e.g., compare panels k–t of Figs. 3 and 5). A partial difference between the two metrics is expected and natural, mainly due to three reasons. First, ri takes into account both the significance of the correlation and its sign, while RML represents the goodness-of-fit, and hence it does not describe the nature of the teleconnection-rainfall dependence (i.e., increasing/decreasing). Thus, RML may lead to group sites whose dependence on the teleconnection has different sign. Second, RML is probably more similar to a single correlation (i.e., at-site in the GEV₁ case, and at 30 km-tiles in the GEV₂ case), while ri depends on correlations at several spatial aggregations, which may disagree. Third, also correlations that are not considered significant could lead to good doubly-stochastic models. Furthermore, some studies (e.g., Sugihara et al., 2012; Cappelli and Grimaldi, 2023; Zhang et al., 2004) suggest that correlation may fail in describing the causality of two variables, and that an approach based on a model's performance can be more accurate. However, we believe that in our case both the metrics are very useful, as they can help to delineate sub-regions with homogeneous teleconnection-rainfall dependence function (see also Sect. 5.1). As perfectly exemplified in our results, sometimes the presence of such regions is evident (e.g., the Gulf of Genoa and North-East for GEV₁, see Fig. 5), while in other cases it is hard to distinguish between small homogeneous regions and effects of spatial dependence (e.g., Adriatic coast and western Gulf of Genoa for GEV₂, see Fig. 5).

It is important to underline that the aim of the present research is to investigate the potential of teleconnections as independent variables in RFA models, and not to propose a specific method for RFA. Indeed, the RFA results depend on a number of parameters, including the widths of sliding time windows for temporal aggregation of the teleconnection indices (w_tel) and AMS (w_AMS), and the resolution for spatial aggregation (w_g). These can be set after a careful sensitivity analysis for defining the spatial field of the teleconnection-statistic correlation (see Sects. 2 and 4.1). The case of formalizing a function f(x) of extreme rainfall statistics depending on teleconnections is very different and much more complex. In fact, one should decide not only the shape of this function, but also the way its parameters vary in space and should be estimated, which may require the delineation of sub-regions (see considerations above). In our study, we adopted a simple framework, as this function has a limited number of parameters and the same shape (i.e., polynomial) in all the spatial domain. We showed a hierarchical RFA approach where the parameters of the polynomial functions are fitted at-site for μ and at 30 km-tiles for L-CV. Our analyses overall suggest that even with a simple RFA framework, the use of teleconnections as dependent variables to describe the extreme rainfall regime may increase the accuracy in frequency modelling. Different approaches are indeed possible. First, the best resolution for spatial aggregation and the shape of the teleconnection-statistic function should be carefully evaluated for each specific case. Second, a more complex teleconnection-statistics function could be defined. A possible approach is the one proposed by Magnini et al. (2024), which leverages neural networks' capabilities to obtain functions whose parameters depend on the location of the considered site and other morphoclimatic descriptors, or the one adopted by Machado et al. (2015), which exploits generalized additive models. Indeed, the implementation and discussion of more sophisticated RFA methods to exploit teleconnections' informative content is complex, and should be addressed by future studies.

Overall, our results suggest several pathways for future reserch about non-stationary frequency analysis, but two major points of interest can be underlined. First, teleconnection-AMS relationships may be defined regionally (i.e., pooling together measurements from several stations), instead of locally (i.e., considering separately single sites). With respect to a local approach, a regional approach (where it is possible) is less affected by measurement errors, noise, and outliers (Hosking and Wallis, 1997), and may lead to more accurate climate dependence functions. Second, the variation of the expected percentiles is very wide if a teleconnection-informed frequency model is considered. This may be taken into account when planning long- to midterm water management (e.g., reservoir regulation) and designing hydraulic structures.

6 Conclusions

A growing number of recent studies show how large scale climatic indices (or teleconnections) can be used as covariates to increase reliability of local frequency analysis of rainfall extremes across diverse geographical regions worldwide (e.g., Fauer and Rust, 2023; Ouarda et al., 2020; Ragno et al., 2018). It is theoretically possible to extend these methods to regional frequency analysis (RFA), but the teleconnection-extreme rainfall dependency at a regional scale should be first investigated. Beside its usefulness for correct estimation of the design rainfall for engineering applications, this topic is still not well addressed in the literature.

In the present study, we propose a framework to assess the link between teleconnections and the frequency regime of rainfall extremes at a regional scale, in order to perform climate-informed RFA. The approach is tested for a large and climatically diverse region in Northern Italy. Our dataset consists of 680 annual maximum series (AMS) of hourly and daily (i.e., 1 and 24 h durations) rainfall depth, recorded between 1921 and 2022. We select six climate indices, known to have significant correlation with local climate variability over the study area (Caroletti et al., 2021; Criado-Aldeanueva and Soto-Navarro, 2020; Romano et al., 2022): the North Atlantic Oscillation, Pacific Decadal Oscillation (PDO), East Atlantic – West Russia pattern, El Niño Southern Oscillation, Mediterranean Oscillation Index, and Western Mediterranean Oscillation Index (WeMOI). The main steps of the proposed framework can be summarized as follows. First, we define sliding time windows in order to obtain time series of pluridecadal averaged teleconnections and statistics of annual maxima. The latter consist of the sliding mean and L-coefficient of variation (L-CV) of AMS, which in our case are used to characterize the distribution of sub-daily rainfall extremes. Second, we discretize the study area into tiles where L-moments are averaged into regional predictions. Then, we evaluate the correlation of teleconnections with time series of spatially gridded L-moments. Finally, we show a preliminary application of climate-informed RFA of rainfall extremes, where L-moments are modelled as functions of the teleconnections singularly. These models are compared with the corresponding stationary models by means of goodness-of-fit metrics.

Our results show that the sliding mean of annual maxima (both hourly and daily) has a higher number of significant correlations with WeMOI and EA-WR than with the other indices. Moreover, the relationship between these indices and sliding mean of extreme rainfall shows clear spatial patterns across the study area, whose robustness is confirmed by their limited sensitivity to the chosen grid resolution and the partial agreement with previous studies (Caroletti et al., 2021; Romano et al., 2022). Similar patterns are found when considering the areas where the goodness-of-fit of the climate-informed regional models (where the mean depends on the WeMOI and EA-WR) outperforms the stationary approach. As well, this is coherent with the known spatial variability of precipitation regimes over the region. The relationship between the sliding L-CV and the teleconnections is more complex, since the number of significant correlations is lower, and dependence patterns are less extended and more heterogeneous. Nevertheless, the regional models where L-CV depends on the teleconnections outperforms the stationary approach in a significant number of stations. The proposed approach is simple and easily reproducible, yet it is new with respect to the existing literature. In fact, while most authors investigated the correlation between the teleconnections and the raw AMS, we consider the L-moments. This, in combination with spatial discretization of the domain, allows us to focus on the relationship between the teleconnections and the extreme rainfall regime, instead of the extreme values themselves, whose seasonality and interannual variability can affect the correlation analysis. Beside the preliminary nature of our RFA application, commonly used metrics (e.g., Ashkar and Ba, 2017) detect overall an increase in goodness-of-fit with respect to a stationary approach, in line with previous studies (Nerantzaki and Papalexiou, 2022). This shows that teleconnections may be useful covariates in a regional a framework. Overall, our research suggests promising pathways for climate-informed local and regional frequency analysis of rainfall extremes, and our methodology is highly adaptable to different environmental variables, such as floods and temperature.

Code availability

Part of the study (i.e., polynomial functions fitting and hierarchical regional frequency analysis) can be reproduced with a Python script available at the following Zenodo repository: https://doi.org/10.5281/zenodo.16610039 (Magnini et al., 2025). The repository also contains slightly altered data from 13 gauged sites, that are sufficient to run the script.

Data availability

The rainfall data used in the present study are part of the dataset I2-RED (Mazzoglio et al., 2020 a), available under authorization at the website https://doi.org/10.5281/zenodo.4269509 (Mazzoglio et al., 2020 b). Teleconnections indices data are available at https://psl.noaa.gov/data/climateindices/list/ (last access: 25 September 2025) and https://crudata.uea.ac.uk/cru/data/ (last access: 25 September 2025).

Supplement

The supplement related to this article is available online at https://doi.org/10.5194/hess-29-5031-2025-supplement.

Author contributions

AM: conceptualization, investigation, methodology, software, writing – original draft, writing – review and editing. VP: methodology, writing – original draft, writing – review and editing, validation. AC: conceptualization, methodology, writing – original draft, writing – review and editing, validation, supervision.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher’s note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Also, please note that this paper has not received English language copy-editing.

Acknowledgements

The Authors thankfully acknowledge the use of free and opensource software, in particular Python (Van Rossum and Drake, 2003) and R (R Core Team, 2024). Also, sincere gratitude is due to Paola Mazzoglio and Pierluigi Claps, for giving access to the dataset I2-RED (Mazzoglio et al., 2020 a).

Financial support

This research has been supported by the European Climate, Infrastructure and Environment Executive Agency (grant no. 101069928 – LIFE21-IPC-IT-LIFE CLIMAX PO).

Review statement

This paper was edited by Nadav Peleg and reviewed by two anonymous referees.

References

Alpert, P., Ben‐Gai, T., Baharad, A., Benjamini, Y., Yekutieli, D., Colacino, M., Diodato, L., Ramis, C., Homar, V., Romero, R., Michaelides, S., and Manes, A.: The paradoxical increase of Mediterranean extreme daily rainfall in spite of decrease in total values, Geophys. Res. Lett., 29, https://doi.org/10.1029/2001gl013554, 2002. a

Ashkar, F. and Aucoin, F.: Choice between competitive pairs of frequency models for use in hydrology: a review and some new results, Hydrol. Sci. J., 57, 1092–1106, https://doi.org/10.1080/02626667.2012.701746, 2012. a

Ashkar, F. and Ba, I.: Selection between the generalized Pareto and kappa distributions in peaks-over-threshold hydrological frequency modelling, Hydrol. Sci. J., 62, 1167–1180, https://doi.org/10.1080/02626667.2017.1302089, 2017. a, b, c

Bardossy, A. and Plate, E. J.: Space‐time model for daily rainfall using atmospheric circulation patterns, Water Resour. Res., 28, 1247–1259, https://doi.org/10.1029/91WR02589, 1992. a

Barnston, A. G. and Livezey, R. E.: Classification, Seasonality and Persistence of Low-Frequency Atmospheric Circulation Patterns, Mon. Weather Rev., 115, 1083–1126, https://doi.org/10.1175/1520-0493(1987)115<1083:CSAPOL>2.0.CO;2, 1987. a, b

Bhattacharya, P. K. and Burman, P.: Theory and Methods of Statistics, Elsevier, https://doi.org/10.1016/C2014-0-02379-9, 2016. a

Belkhiri, L. and Krakauer, N.: Quantifying the effect of climate variability on seasonal precipitation using Bayesian clustering approach in Kebir Rhumel Basin, Algeria, Stoch. Environ. Res. Risk Assess., 37, 3929–3943, https://doi.org/10.1007/s00477-023-02488-z, 2023. a

Bonsal, B. and Shabbar, A.: Impacts of Large-Scale Circulation Variability on Low Streamflows over Canada: A Review, Can. Water Resour. J., 33, 137–154, https://doi.org/10.4296/cwrj3302137, 2008. a

Cappelli, F. and Grimaldi, S.: Feature importance measures for hydrological applications: insights from a virtual experiment, Stoch. Environ. Res. Risk Assess., 37, 4921–4939, https://doi.org/10.1007/s00477-023-02545-7, 2023. a

Caroletti, G. N., Coscarelli, R., and Caloiero, T.: A sub‐regional approach to the influence analysis of teleconnection patterns on precipitation in Calabria (southern Italy), Int. J. Climatol., 41, 4574–4586, https://doi.org/10.1002/joc.7087, 2021. a, b, c, d, e, f, g

Castellarin, A.: Probabilistic envelope curves for design flood estimation at ungauged sites, Water Resour. Res., 43, 2005WR004384, https://doi.org/10.1029/2005WR004384, 2007. a

Castellarin, A., Burn, D. H., and Brath, A.: Assessing the effectiveness of hydrological similarity measures for flood frequency analysis, J. Hydrol., 241, 270–285, https://doi.org/10.1016/S0022-1694(00)00383-8, 2001. a

Castellarin, A., Magnini, A., Kyaw, K. K., Ciavaglia, F., Bertola, M., Blöschl, G., Volpi, E., Claps, P., Viglione, A., Marinelli, A., and Vogel, R. M.: Frequency of Italian Record-Breaking Floods over the Last Century (1911–2020), Atmosphere, 15, 865, https://doi.org/10.3390/atmos15070865, 2024. a, b

Chen, N., Thual, S., and Hu, S.: El Niño and the Southern Oscillation: Observation, in: Reference Module in Earth Systems and Environmental Sciences, Elsevier, B978012409548911766X, https://doi.org/10.1016/B978-0-12-409548-9.11766-X, 2019. a

Cheng, L. and AghaKouchak, A.: Nonstationary Precipitation Intensity-Duration-Frequency Curves for Infrastructure Design in a Changing Climate, Sci. Rep., 4, 7093, https://doi.org/10.1038/srep07093, 2014. a

Claps, P., Ganora, D., and Mazzoglio, P.: Rainfall regionalization techniques, in: Rainfall, Elsevier, 327–350, https://doi.org/10.1016/B978-0-12-822544-8.00013-5, 2022. a

Coles, S.: An introduction to statistical modeling of extreme values, Springer series in statistics, Springer, London, New York, ISBN 9781852334598, 2001. a

Conte, M., Giuffrida, A., and Tedesco, S.: The Mediterranean Oscillation [L'Oscillazione Mediterranea], Mem. Soc. Geogr. Ital., 46, 115–124, 1991. a

Coscarelli, R., Caloiero, T., Filice, E., Marsico, L., and Rotundo, R.: Meteorological Drought Characterization in the Calabria Region (Southern Italy), Climate, 11, 160, https://doi.org/10.3390/cli11080160, 2023. a

Criado-Aldeanueva, F., and Soto-Navarro, J.: Climatic Indices over the Mediterranean Sea: A Review, Appl. Sci., 10, 5790, https://doi.org/10.3390/app10175790, 2020. a, b, c

Das, J., Jha, S., and Goyal, M. K.: On the relationship of climatic and monsoon teleconnections with monthly precipitation over meteorologically homogenous regions in India: Wavelet & global coherence approaches, Atmos. Res., 238, 104889, https://doi.org/10.1016/j.atmosres.2020.104889, 2020. a, b

Diodato, N., and Bellocchi, G.: Using Historical Precipitation Patterns to Forecast Daily Extremes of Rainfall for the Coming Decades in Naples (Italy), Geosciences, 8, 293, https://doi.org/10.3390/geosciences8080293, 2018. a

El Adlouni, S. and Ouarda, T. B. M. J.: Joint Bayesian model selection and parameter estimation of the generalized extreme value model with covariates using birth‐death Markov chain Monte Carlo, Water Resour. Res., 45, 2007WR006427, https://doi.org/10.1029/2007WR006427, 2009. a

Fauer, F. S. and Rust, H. W.: Non-stationary large-scale statistics of precipitation extremes in central Europe, Stoch. Environ. Res. Risk Assess., 37, 4417–4429, https://doi.org/10.1007/s00477-023-02515-z, 2023. a, b

Gabriele, S. and Arnell, N.: A hierarchical approach to regional flood frequency analysis, Water Resour. Res., 27, 1281–1289, https://doi.org/10.1029/91WR00238, 1991. a, b, c

Gonzalez‐Hidalgo, J. C., Beguería, S., Peña‐Angulo, D., and Blanco, V. T.: Catalogue and Analysis of Extraordinary Precipitation Events in the Spanish Mainland, 1916–2022, Int. J. Climatol., 45, https://doi.org/10.1002/joc.8785, 2025. a

González-Pérez, A., Álvarez-Esteban, R., Penas, A., and Del Río, S.: Analysis of recent rainfall trends and links to teleconnection patterns in California (U.S.), J. Hydrol., 612, 128211, https://doi.org/10.1016/j.jhydrol.2022.128211, 2022. a

Hengl, T.: A Practical Guide to Geostatistical Mapping of Environmental Variables, EUR 22904 EN, Office for Official Publications of the European Communities, Luxembourg, ISBN 9789279069048, 2007. a

Hosking, J. R. M. and Wallis, J. R.: Regional Frequency Analysis: An Approach Based on L-Moments, 1st ed., Cambridge University Press, https://doi.org/10.1017/CBO9780511529443, 1997. a, b, c, d, e

Jayaweera, L., Wasko, C., Nathan, R., and Johnson, F.: Non-stationarity in extreme rainfalls across Australia, J. Hydrol., 624, 129872, https://doi.org/10.1016/j.jhydrol.2023.129872, 2023. a

Jenkinson, A. F.: The frequency distribution of the annual maximum (or minimum) values of meteorological elements, Q. J. R. Meteorol. Soc., 81, 158–171, https://doi.org/10.1002/qj.49708134804, 1955. a, b

Jones, P. D., Jonsson, T., and Wheeler, D.: Extension to the North Atlantic oscillation using early instrumental pressure observations from Gibraltar and south-west Iceland, Int. J. Climatol., 17, 1433–1450, https://doi.org/10.1002/(SICI)1097-0088(19971115)17:13<1433::AID-JOC203>3.0.CO;2-P, 1997. a

Johnson, K., Smithers, J., Schulze, R., and Kjeldsen, T.: Non-stationary frequency analysis of extreme rainfall events on the east coast of KwaZulu-Natal, South Africa, Hydrol. Sci. J., 70, 849–859, https://doi.org/10.1080/02626667.2025.2464233, 2025. a

Krichak, S. O. and Alpert, P.: Decadal trends in the east Atlantic-west Russia pattern and Mediterranean precipitation, Int. J. Climatol., 25, 183–192, https://doi.org/10.1002/joc.1124, 2005. a, b

Krichak, S. O., Kishcha, P., and Alpert, P.: Decadal trends of main Eurasian oscillations and the Eastern Mediterranean precipitation, Theor. Appl. Climatol., 72, 209–220, https://doi.org/10.1007/s007040200021, 2002. a

Krichak, S. O., Breitgand, J. S., Gualdi, S., and Feldstein, S. B.: Teleconnection–extreme precipitation relationships over the Mediterranean region, Theor. Appl. Climatol., 117, 679–692, https://doi.org/10.1007/s00704-013-1036-4, 2014. a, b

Laio, F., Di Baldassarre, G., and Montanari, A.: Model selection techniques for the frequency analysis of hydrological extremes, Water Resour. Res., 45, 2007WR006666, https://doi.org/10.1029/2007WR006666, 2009. a

Lopez-Bustins, J. A., Arbiol-Roca, L., Martin-Vide, J., Barrera-Escoda, A., and Prohom, M.: Intra-annual variability of the Western Mediterranean Oscillation (WeMO) and occurrence of extreme torrential precipitation in Catalonia (NE Iberia), Nat. Hazards Earth Syst. Sci., 20, 2483–2501, https://doi.org/10.5194/nhess-20-2483-2020, 2020. a

Lun, D., Fischer, S., Viglione, A., and Blöschl, G.: Significance testing of rank cross-correlations between autocorrelated time series with short-range dependence, J. Appl. Stat., 50, 2934–2950, https://doi.org/10.1080/02664763.2022.2137115, 2023. a, b

Luppichini, M. and Bini, M.: Evolution of rainfall in Italy over the last 200 years: Interactions between climate indices and global warming, Atmos. Res., 326, 108276, https://doi.org/10.1016/j.atmosres.2025.108276, 2025. a

Machado, M. J., Botero, B. A., López, J., Francés, F., Díez-Herrero, A., and Benito, G.: Flood frequency analysis of historical flood data under stationary and non-stationary modelling, Hydrol. Earth Syst. Sci., 19, 2561–2576, https://doi.org/10.5194/hess-19-2561-2015, 2015. a

Magnini, A., Lombardi, M., Bujari, A., Mattivi, P., Shustikova, I., Persiano, S., Patella, M., Bitelli, G., Bellavista, P., Lo Conti, F., Tirri, A., Bagli, S., Mazzoli, P., and Castellarin, A.: Geomorphic flood hazard mapping: from floodplain delineation to flood hazard characterization, Hydrol. Sci. J., 68, 2388–2403, https://doi.org/10.1080/02626667.2023.2269909, 2023. a

Magnini, A., Lombardi, M., Ouarda, T. B. M. J., and Castellarin, A.: AI-driven morphoclimatic regional frequency modelling of sub-daily rainfall-extremes, J. Hydrol., 631, 130808, https://doi.org/10.1016/j.jhydrol.2024.130808, 2024. a, b

Magnini, A., Pavan, V., and Castellarin, A.: Supplement code – Informativeness of teleconnections in frequency analysis of rainfall extremes, Zenodo [code], https://doi.org/10.5281/zenodo.16610039, 2025. a

Marra, F., Armon, M., Borga, M., and Morin, E.: Orographic Effect on Extreme Precipitation Statistics Peaks at Hourly Time Scales, Geophys. Res. Lett., 48, e2020GL091498, https://doi.org/10.1029/2020GL091498, 2021. a

Martins, E. S. and Stedinger, J. R.: Generalized maximum‐likelihood generalized extreme‐value quantile estimators for hydrologic data, Water Resour. Res., 36, 737–744, https://doi.org/10.1029/1999WR900330, 2000. a

Martin-Vide, J. and Lopez‐Bustins, J.: The Western Mediterranean Oscillation and rainfall in the Iberian Peninsula, Int. J. Climatol., 26, 1455–1475, https://doi.org/10.1002/joc.1388, 2006. a, b

Mazzoglio, P., Butera, I., and Claps, P.: I2-RED: A Massive Update and Quality Control of the Italian Annual Extreme Rainfall Dataset, Water, 12, 3308, https://doi.org/10.3390/w12123308, 2020a. a, b, c

Mazzoglio, P., Butera, I., and Claps, P.: Improved Italian – Rainfall Extreme Dataset (I2-RED), Zenodo [data set], https://doi.org/10.5281/zenodo.4269509, 2020b. a

Nerantzaki, S. D. and Papalexiou, S. M.: Assessing extremes in hydroclimatology: A review on probabilistic methods, J. Hydrol., 605, 127302, https://doi.org/10.1016/j.jhydrol.2021.127302, 2022. a, b, c

Ouachani, R., Bargaoui, Z., and Ouarda, T.: Power of teleconnection patterns on precipitation and streamflow variability of upper Medjerda Basin, Int. J. Climatol., 33, 58–76, https://doi.org/10.1002/joc.3407, 2013. a

Ouarda, T. B. M. J. and Charron, C.: Changes in the distribution of hydro-climatic extremes in a non-stationary framework, Sci. Rep., 9, 8104, https://doi.org/10.1038/s41598-019-44603-7, 2019. a

Ouarda, T. B. M. J., Yousef, L. A., and Charron, C.: Non‐stationary intensity‐duration‐frequency curves integrating information concerning teleconnections and climate change, Int. J. Climatol., 39, 2306–2323, https://doi.org/10.1002/joc.5953, 2019. a, b

Ouarda, T. B. M. J., Charron, C., and St‐Hilaire, A.: Uncertainty of stationary and nonstationary models for rainfall frequency analysis, Int. J. Climatol., 40, 2373–2392, https://doi.org/10.1002/joc.6339, 2020. a, b, c

Papalexiou, S. M. and Koutsoyiannis, D.: Battle of extreme value distributions: A global survey on extreme daily rainfall, Water Resour. Res., 49, 187–201, https://doi.org/10.1029/2012WR012557, 2013. a

Ragno, E., AghaKouchak, A., Love, C. A., Cheng, L., Vahedifard, F., and Lima, C. H. R.: Quantifying Changes in Future Intensity‐Duration‐Frequency Curves Using Multimodel Ensemble Simulations, Water Resour. Res., 54, 1751–1764, https://doi.org/10.1002/2017WR021975, 2018. a, b

Rasouli, K., Scharold, K., Mahmood, T. H., Glenn, N. F., and Marks, D.: Linking hydrological variations at local scales to regional climate teleconnection patterns, Hydrol. Process., 34, 5624–5641, https://doi.org/10.1002/hyp.13982, 2020. a

R Core Team: R: A language and environment for statistical computing, R Foundation for Statistical Computing, Vienna, Austria [software], https://www.r-project.org/ (last access: 25 September 2025), 2024. a

Redolat, D., Monjo, R., Lopez-Bustins, J. A., and Martin-Vide, J.: Upper-Level Mediterranean Oscillation index and seasonal variability of rainfall and temperature, Theor. Appl. Climatol., 135, 1059–1077, https://doi.org/10.1007/s00704-018-2424-6, 2019. a

Renard, B., Lang, M., Bois, P., Dupeyrat, A., Mestre, O., Niel, H., Sauquet, E., Prudhomme, C., Parey, S., Paquet, E., Neppel, L., and Gailhard, J.: Regional methods for trend detection: Assessing field significance and regional consistency, Water Resour. Res., 44, 2007WR006268, https://doi.org/10.1029/2007WR006268, 2008. a

Ríos-Cornejo, D., Penas, Á., Álvarez-Esteban, R., and Del Río, S.: Links between teleconnection patterns and precipitation in Spain, Atmos. Res., 156, 14–28, https://doi.org/10.1016/j.atmosres.2014.12.012, 2015. a, b

Romano, E., Petrangeli, A. B., Salerno, F., and Guyennon, N.: Do recent meteorological drought events in central Italy result from long‐term trend or increasing variability?, Int. J. Climatol., 42, 4111–4128, https://doi.org/10.1002/joc.7487, 2022. a, b, c, d, e, f

Salinas, J. L., Castellarin, A., Kohnová, S., and Kjeldsen, T. R.: Regional parent flood frequency distributions in Europe – Part 2: Climate and scale controls, Hydrol. Earth Syst. Sci., 18, 4391–4401, https://doi.org/10.5194/hess-18-4391-2014, 2014. a

Serinaldi, F. and Kilsby, C. G.: Unsurprising Surprises: The Frequency of Record‐breaking and Overthreshold Hydrological Extremes Under Spatial and Temporal Dependence, Water Resour. Res., 54, 6460–6487, https://doi.org/10.1029/2018WR023055, 2018. a, b

Sugihara, G., May, R., and Ye, H.: Detecting causality in complex ecosystems, Science, 338, 496–500, https://doi.org/10.1126/science.1227079, 2012. a

Van Rossum, G. and Drake, F. L.: Python Language Reference Manual, A Python manual, Network Theory Limited, ISBN 0954161785, 2003. a

Vogel, R. M., Zafirakou‐Koulouris, A., and Matalas, N. C.: Frequency of record‐breaking floods in the United States, Water Resour. Res., 37, 1723–1731, https://doi.org/10.1029/2001WR900019, 2001. a

Volpi, E., Grimaldi, S., Aghakouchak, A., Castellarin, A., Chebana, F., Papalexiou, S. M., Aksoy, H., Bárdossy, A., Cancelliere, A., Chen, Y., Deidda, R., Haberlandt, U., Eris, E., Fischer, S., Francés, F., Kavetski, D., Rodding Kjeldsen, T., Kochanek, K., Langousis, A., Mediero Orduña, L., Montanari, A., Nerantzaki, S. D., Ouarda, T. B. M. J., Prosdocimi, I., Ragno, E., Rajulapati, C. R., Requena, A. I., Ridolfi, E., Sadegh, M., Schumann, A., and Sharma, A.: The legacy of STAHY: milestones, achievements, challenges, and open problems in statistical hydrology, Hydrol. Sci. J., 69, 1913==1949, https://doi.org/10.1080/02626667.2024.2385686, 2024. a, b

Zanchettin, D., Franks, S. W., Traverso, P., and Tomasino, M.: On ENSO impacts on European wintertime rainfalls and their modulation by the NAO and the Pacific multi‐decadal variability described through the PDO index, Int. J. Climatol., 28, 995–1006, https://doi.org/10.1002/joc.1601, 2008. a

Zhang, X., Zwiers, F. W., and Li, G.: Monte Carlo Experiments on the Detection of Trends in Extreme Values, J. Climate, 17, 1945–1952, https://doi.org/10.1175/1520-0442(2004)017<1945:mceotd>2.0.co;2, 2004. a

Zhang, Y., Wallace, J. M., and Battisti, D. S.: ENSO-like Interdecadal Variability: 1900–93, J. Climate, 10, 1004–1020, https://doi.org/10.1175/1520-0442(1997)010<1004:ELIV>2.0.CO;2, 1997. a

Articles

Short summary

This study describes a new methodology to identify regional structures in the dependence of extreme rainfall on global climate indices. The study area is North-Central Italy, but the methods are highly adaptable to other regions. We observe that while multiple climate indices show influence, the Western Mediterranean Oscillation Index and East Atlantic West Russia pattern appear to have the strongest effects, with regional structures that align with other studies. We also show that using these indices may improve estimation of extreme rainfall depth with a given probability.