<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" xml:lang="en" dtd-version="3.0"><?xmltex \makeatother\@nolinetrue\makeatletter?>
  <front>
    <journal-meta><journal-id journal-id-type="publisher">HESS</journal-id><journal-title-group>
    <journal-title>Hydrology and Earth System Sciences</journal-title>
    <abbrev-journal-title abbrev-type="publisher">HESS</abbrev-journal-title><abbrev-journal-title abbrev-type="nlm-ta">Hydrol. Earth Syst. Sci.</abbrev-journal-title>
  </journal-title-group><issn pub-type="epub">1607-7938</issn><publisher>
    <publisher-name>Copernicus Publications</publisher-name>
    <publisher-loc>Göttingen, Germany</publisher-loc>
  </publisher></journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.5194/hess-24-473-2020</article-id><title-group><article-title>Numerical investigation on the power of parametric and nonparametric tests for trend detection in annual maximum series</article-title><alt-title>Numerical investigation on the power of parametric and nonparametric tests</alt-title>
      </title-group><?xmltex \runningtitle{Numerical investigation on the power of parametric and nonparametric tests}?><?xmltex \runningauthor{V.~Totaro et al.}?>
      <contrib-group>
        <contrib contrib-type="author" corresp="yes">
          <name><surname>Totaro</surname><given-names>Vincenzo</given-names></name>
          <email>vincenzo.totaro@poliba.it</email>
        </contrib>
        <contrib contrib-type="author" corresp="no">
          <name><surname>Gioia</surname><given-names>Andrea</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no">
          <name><surname>Iacobellis</surname><given-names>Vito</given-names></name>
          
        </contrib>
        <aff id="aff1"><institution>Dipartimento di Ingegneria Civile, Ambientale, del Territorio, Edile e di Chimica (DICATECh), <?xmltex \hack{\break}?> Politecnico di Bari, Bari, 70125, Italy</institution>
        </aff>
      </contrib-group>
      <author-notes><corresp id="corr1">Vincenzo Totaro (vincenzo.totaro@poliba.it)</corresp></author-notes><pub-date><day>29</day><month>January</month><year>2020</year></pub-date>
      
      <volume>24</volume>
      <issue>1</issue>
      <fpage>473</fpage><lpage>488</lpage>
      <history>
        <date date-type="received"><day>12</day><month>July</month><year>2019</year></date>
           <date date-type="rev-request"><day>8</day><month>August</month><year>2019</year></date>
           <date date-type="rev-recd"><day>13</day><month>November</month><year>2019</year></date>
           <date date-type="accepted"><day>3</day><month>December</month><year>2019</year></date>
      </history>
      <permissions>
        <copyright-statement>Copyright: © 2020 Vincenzo Totaro et al.</copyright-statement>
        <copyright-year>2020</copyright-year>
      <license license-type="open-access"><license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p></license></permissions><self-uri xlink:href="https://hess.copernicus.org/articles/24/473/2020/hess-24-473-2020.html">This article is available from https://hess.copernicus.org/articles/24/473/2020/hess-24-473-2020.html</self-uri><self-uri xlink:href="https://hess.copernicus.org/articles/24/473/2020/hess-24-473-2020.pdf">The full text article is available as a PDF file from https://hess.copernicus.org/articles/24/473/2020/hess-24-473-2020.pdf</self-uri>
      <abstract><title>Abstract</title>
    <p id="d1e97">The need to fit time series characterized by the presence of a trend or change points has generated increased interest in the investigation of nonstationary probability distributions in recent years. Considering that the available hydrological time series can be recognized as the observable part of a stochastic process with a definite probability distribution, two main topics can be tackled in this context: the first is related to the definition of an objective criterion for choosing whether the stationary hypothesis can be adopted, whereas the second regards the effects of nonstationarity on the estimation of distribution parameters and quantiles for an assigned return period and flood risk evaluation. Although the time series trend or change points are usually detected using nonparametric tests available in the literature (e.g., Mann–Kendall or CUSUM test), the correct selection of the stationary or nonstationary probability distribution is still required for design purposes. In this light, the focus is shifted toward model selection criteria; this implies the use of parametric methods, including all of the issues related to parameter estimation. The aim of this study is to compare the performance of parametric and nonparametric methods for trend detection, analyzing their power and focusing on the use of traditional model selection tools (e.g., the Akaike information criterion and the likelihood ratio test) within this context. The power and efficiency of parameter estimation, including the trend coefficient, were investigated via Monte Carlo simulations using the generalized extreme value distribution as the parent with selected parameter sets.</p>
  </abstract>
    </article-meta>
  </front>
<body>
      

      <?xmltex \hack{\newpage}?>
<sec id="Ch1.S1" sec-type="intro">
  <label>1</label><title>Introduction</title>
      <p id="d1e111">The long- and medium-term prediction of extreme hydrological events under
nonstationary conditions is one of the major challenges of our times.
Streamflow, as well as temporal rainfall and many other hydrological
phenomena, can be considered as stochastic processes (Chow, 1964), i.e., families of random variables with an assigned probability distribution, and time series are the observable part of this process. One of the main goals of extreme event frequency analysis is the estimation of distribution quantiles related to a certain non-exceedance probability. They are usually obtained after fitting a probabilistic model to observed data. As Koutsoyiannis and Montanari (2015) depicted in their historical review of the “concept of stationarity”, Kolmogorov, in 1931, “used the term stationary to describe a probability density function that is unchanged in time”, whereas Khintchine (1934) provided a formal definition of stationarity of a stochastic process.</p>
      <p id="d1e114">In this context, detecting the existence of time-dependence in a stochastic process should be considered a necessary task in the statistical analysis of recorded time series. Thus, several considerations should be made with respect to updating some important hydrological concepts while assuming that the non-exceedance probability varies with time or other covariates. For example, the return period may be reformulated in two different ways, the “expected waiting time” (EWT; Olsen et al., 1998) or the “expected number of events” (ENE; Parey et al., 2007, 2010), which lead to a different evaluation of quantiles within a nonstationary approach. As proved by Cooley (2013), the EWT and ENE are affected differently by nonstationarity, possibly producing ambiguity in engineering design practice (Du et al., 2015; Read and Vogel, 2015).<?pagebreak page474?> Salas and Obeysekera (2014) provided a detailed report regarding relationships between stationary and nonstationary EWT values within a parametric approach for the assessment of nonstationary conditions. In such a framework, a strong relevance is given to statistical tools for detecting
changes in non-normally distributed time series (Kundewicz and Robson, 2004).</p>
      <p id="d1e117">To date, the vast majority of research regarding climate change and the detection of nonstationary conditions has been developed using nonparametric
approaches. One of the most commonly used nonparametric measures of trend is Sen's
slope (Gocic and Trajkovic, 2013); however, a wide array of nonparametric
tests for detecting nonstationarity is available (e.g., Kundewicz and Robson, 2004). Statistical tests include the Mann–Kendall (MK; Mann, 1945; Kendall,
1975) and Spearman (Lehmann, 1975) tests for detecting trends, and the Pettitt (Pettitt, 1979) and CUSUM (Smadi and Zghoul, 2006) tests for change point detection. All of these tests are based on a specific null hypothesis and have to be performed for an assigned significance level. Nonparametric tests are usually preferred over parametric tests as they are distribution-free and do not require knowledge of the parent distribution. They are traditionally considered more suitable for the frequency analysis of extreme events with respect to parametric tests because they are less sensitive to the presence of outliers (Wang et al., 2005).</p>
      <p id="d1e120">In contrast, the use of null hypothesis significance tests for trend
detection has raised concerns and severe criticisms in a wide range of
scientific fields for many years (e.g., Cohen, 1994), as outlined by Vogel
et al. (2013). Serinaldi et al. (2018) provided an extensive critical review
focusing on logical flaws and misinterpretations often related to their
misuse.</p>
      <p id="d1e124">In general, the use of statistical tests involves different errors, such as
type I error (rejecting the null hypothesis when it is true) and type II error (accepting the null hypothesis when it is false). The latter is related to
the test power, i.e., the probability of rejecting the null hypothesis when
it is false; however, as recognized by a few authors (e.g., Milly et al., 2015;
Beven, 2016), the importance of the power has been largely overlooked
in Earth system science fields. Strong attention has always been paid to
the level of significance (i.e., type I error), although, as pointed out by
Vogel et al. (2013), “a type II error in the context of an infrastructure decision implies under-preparedness, which is often an error much more costly to society than the type I error (over-preparedness)”.</p>
      <p id="d1e127">Moreover, as already proven by Yue et al. (2002a), the power of the
Mann–Kendall test, despite its nonparametric structure, actually shows a
strong dependence on the type and parametrization of the parent
distribution.</p>
      <p id="d1e130">Using a parametric approach, the estimation of quantiles of an extreme event distribution requires the search for the underlying distribution and for time-dependant hydrological variables. If variables are time-dependent, they are “i/nid” (independent/non-identically distributed) and the model is considered nonstationary; otherwise, the variables are “iid” (independent, identically distributed) and the model is a stationary one (Montanari and Koutsoyiannis, 2014; Serinaldi and Kilsby, 2015).</p>
      <p id="d1e133">From this perspective, the detection of nonstationarity may exploit (besides
traditional statistical tests) well-known properties of model selection
tools. Even in this case, several measures and criteria are available for
selecting a best-fit model, such as the Akaike information criterion
(AIC; Akaike, 1974), the Bayesian information criterion (BIC; Schwarz, 1978), and the
likelihood ratio test (LR; Coles, 2001); the latter is suitable when dealing
with nested models.</p>
      <p id="d1e136">The purpose of this paper is to provide further insights into the use of
parametric and nonparametric approaches in the framework of extreme event
frequency analysis under nonstationary conditions. The comparison between
those different approaches is not straightforward. Nonparametric tests do
not require knowledge of the parent distribution, and their properties
strongly rely on the choice of the null hypothesis. Parametric methods for
model selection, in comparison, require the selection of the parent
distribution and the estimation of its parameters, but are not necessarily
associated with a specific null hypothesis. Nevertheless, in both cases, the
evaluation of the rejection threshold is usually based on a statistical
measure of trend that, under the null hypothesis of stationarity, follows a
specific distribution (e.g., the Gaussianity of the Kendall statistic for the MK nonparametric test, and the <inline-formula><mml:math id="M1" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">χ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> distribution of deviance statistic for the
LR parametric test).</p>
      <p id="d1e150">Considering the pros and cons of the different approaches, we believe that specific remarks should be made about the use of parametric and nonparametric methods for the analysis of extreme event series. For this purpose, we set up a numerical experiment to compare the performance of (1) the MK as a nonparametric test for trend detection, (2) the LR parametric test for model selection, and (3) the AIC<inline-formula><mml:math id="M2" display="inline"><mml:msub><mml:mi/><mml:mi>R</mml:mi></mml:msub></mml:math></inline-formula> parametric test, as defined in Sect. 2.3. In particular, the AIC<inline-formula><mml:math id="M3" display="inline"><mml:msub><mml:mi/><mml:mi>R</mml:mi></mml:msub></mml:math></inline-formula> is a measure for model selection, based on the AIC, whose distribution was numerically evaluated, under the null hypothesis of a stationary process, for comparison purposes with other tests.</p>
      <p id="d1e172">We aim to provide (i) a comparison of test power between the MK, LR, and
AIC<inline-formula><mml:math id="M4" display="inline"><mml:msub><mml:mi/><mml:mi>R</mml:mi></mml:msub></mml:math></inline-formula>; (ii) a sensitivity analysis of test power to parameters of a
known parent distribution used to generate sample data; and (iii) an analysis of
the influence of the sample size on the test power and the significance level.</p>
      <?pagebreak page475?><p id="d1e184">We conducted the analysis using Monte Carlo techniques; this entailed generating samples from parent populations assuming one of the most popular extreme
event distributions, the generalized extreme value (GEV; Jenkinson, 1955), with a
linear (and without any) trend in the position parameter. From the
samples generated, we numerically evaluated the power and significance level of tests
for trend detection, using the MK, LR, and AIC<inline-formula><mml:math id="M5" display="inline"><mml:msub><mml:mi/><mml:mi>R</mml:mi></mml:msub></mml:math></inline-formula> tests. For the latter, we also
checked the option of using the modified version of AIC, referred to as AIC<inline-formula><mml:math id="M6" display="inline"><mml:msub><mml:mi/><mml:mi>c</mml:mi></mml:msub></mml:math></inline-formula>, suggested by
Sugiura (1978) for smaller samples.</p>
      <p id="d1e205">Considering that parametric methods involve the estimation of the parent
distribution parameters, we also analyzed the efficiency of the maximum likelihood (ML) estimator by comparing the sample variability of the ML estimate of trend with the nonparametric Sen’s slope. Furthermore, we scoped the sample variability of the GEV parameters in the stationary
and nonstationary cases.</p>
</sec>
<sec id="Ch1.S2">
  <label>2</label><title>Methodological framework</title>
      <p id="d1e216">This section is divided into five parts. Sect. 2.1, 2.2, and 2.3 report the
main characteristics of the MK, LR, and AIC<inline-formula><mml:math id="M7" display="inline"><mml:msub><mml:mi/><mml:mi>R</mml:mi></mml:msub></mml:math></inline-formula> tests, respectively. In Sect. 2.4, the probabilistic model used for sample data generation, based on the
use of the GEV distribution, is described in the stationary and nonstationary cases. Finally, Sect. 2.5 outlines the procedure for the numerical
evaluation of the tests' power and significance level.</p>
<sec id="Ch1.S2.SS1">
  <label>2.1</label><title>The Mann–Kendall test</title>
      <p id="d1e235">Hydrological time series are often composed by non-normally independent
realizations of phenomena, and this characteristic makes the use of
nonparametric trend tests very attractive (Kundzewicz and Robson, 2004). The
Mann–Kendall test is a widely used rank-based tool for detecting monotonic,
and not necessarily linear, trends. Given a random variable z, and assigned
a sample of <inline-formula><mml:math id="M8" display="inline"><mml:mi>L</mml:mi></mml:math></inline-formula> independent data <inline-formula><mml:math id="M9" display="inline"><mml:mrow><mml:mi>z</mml:mi><mml:mo>=</mml:mo><mml:mo>(</mml:mo><mml:msub><mml:mi>z</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>, …, <inline-formula><mml:math id="M10" display="inline"><mml:mrow><mml:msub><mml:mi>z</mml:mi><mml:mi>L</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>), the Kendall <inline-formula><mml:math id="M11" display="inline"><mml:mi>S</mml:mi></mml:math></inline-formula> statistic (Kendall, 1975) can be defined as follows:
            <disp-formula id="Ch1.E1" content-type="numbered"><label>1</label><mml:math id="M12" display="block"><mml:mrow><mml:mi>S</mml:mi><mml:mo>=</mml:mo><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mrow><mml:mi>L</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:munderover><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mi>i</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>L</mml:mi></mml:munderover><mml:mi mathvariant="normal">sgn</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mi>z</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>z</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
          where “sgn” is the sign function.</p>
      <p id="d1e341">The null hypothesis of this test is the absence of any statistically
significant trend in the sample, whereas the presence of a trend represents an alternative hypothesis. Yilmaz and Perera (2014) reported that serial dependence can
lead to a more frequent rejection of the null hypothesis. For <inline-formula><mml:math id="M13" display="inline"><mml:mrow><mml:mi>L</mml:mi><mml:mo>≥</mml:mo><mml:mn mathvariant="normal">8</mml:mn></mml:mrow></mml:math></inline-formula>, Mann (1945) reported that Eq. (1) is approximatively a normal variable with a zero mean and variance which, in the presence of <inline-formula><mml:math id="M14" display="inline"><mml:mrow><mml:msub><mml:mi>t</mml:mi><mml:mi>m</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> ties of length <inline-formula><mml:math id="M15" display="inline"><mml:mi>m</mml:mi></mml:math></inline-formula>, can be expressed as
            <disp-formula id="Ch1.Ex1"><mml:math id="M16" display="block"><mml:mrow><mml:mi>V</mml:mi><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mi>L</mml:mi><mml:mo>(</mml:mo><mml:mi>L</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo><mml:mo>(</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mi>L</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">5</mml:mn><mml:mo>)</mml:mo><mml:mo>-</mml:mo><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>m</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:munderover><mml:msub><mml:mi>t</mml:mi><mml:mi>m</mml:mi></mml:msub><mml:mi>m</mml:mi><mml:mo>(</mml:mo><mml:mi>m</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo><mml:mo>(</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mi>m</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">5</mml:mn><mml:mo>)</mml:mo></mml:mrow><mml:mn mathvariant="normal">18</mml:mn></mml:mfrac></mml:mstyle><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>
          In practice, the Mann–Kendall test is performed using the <inline-formula><mml:math id="M17" display="inline"><mml:mi>Z</mml:mi></mml:math></inline-formula> statistic
            <disp-formula id="Ch1.Ex2"><mml:math id="M18" display="block"><mml:mrow><mml:mi>Z</mml:mi><mml:mo>=</mml:mo><mml:mfenced close="" open="{"><mml:mtable class="array" columnalign="left center"><mml:mtr><mml:mtd><mml:mstyle displaystyle="false"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mrow><mml:mi>S</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:msqrt><mml:mrow><mml:mi>V</mml:mi><mml:mo>(</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msqrt></mml:mfrac></mml:mstyle></mml:mstyle></mml:mtd><mml:mtd><mml:mrow><mml:mi>S</mml:mi><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn mathvariant="normal">0</mml:mn></mml:mtd><mml:mtd><mml:mrow><mml:mi>S</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mstyle displaystyle="false"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mrow><mml:mi>S</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:msqrt><mml:mrow><mml:mi>V</mml:mi><mml:mo>(</mml:mo><mml:mi>S</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:msqrt></mml:mfrac></mml:mstyle></mml:mstyle></mml:mtd><mml:mtd><mml:mrow><mml:mi>S</mml:mi><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mfenced><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
          which follows a standard normal distribution. Using this approach, it is
simple to evaluate the <inline-formula><mml:math id="M19" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula> value and compare it with an assigned level of
significance or, equivalently, to calculate the <inline-formula><mml:math id="M20" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="italic">α</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> threshold
value and compare it with <inline-formula><mml:math id="M21" display="inline"><mml:mi>Z</mml:mi></mml:math></inline-formula>, where <inline-formula><mml:math id="M22" display="inline"><mml:mrow><mml:msub><mml:mi>Z</mml:mi><mml:mi mathvariant="italic">α</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the (<inline-formula><mml:math id="M23" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:mi mathvariant="italic">α</mml:mi></mml:mrow></mml:math></inline-formula>) quantile of a standard normal distribution.</p>
      <p id="d1e602">Yue et al. (2002b) observed that autocorrelation in time series can
influence the ability of the MK test to detect trends. To avoid this problem, a
correct approach with respect to trend analysis should contemplate a preliminary check
for autocorrelation and, if necessary, the application of pre-whitening
procedures.</p>
      <p id="d1e605">A nonparametric tool for a reliable estimation of a trend in a time series with <inline-formula><mml:math id="M24" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula> pairs of data is the Sen's slope estimator (Sen, 1968), which is defined as the median of the set of slopes <inline-formula><mml:math id="M25" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">δ</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>:
            <disp-formula id="Ch1.E2" content-type="numbered"><label>2</label><mml:math id="M26" display="block"><mml:mrow><mml:msub><mml:mi mathvariant="italic">δ</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msub><mml:mi>z</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mi>z</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>-</mml:mo><mml:mi>k</mml:mi></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>,</mml:mo><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi>N</mml:mi><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
          where <inline-formula><mml:math id="M27" display="inline"><mml:mrow><mml:mi>i</mml:mi><mml:mo>&gt;</mml:mo><mml:mi>k</mml:mi></mml:mrow></mml:math></inline-formula>.</p>
</sec>
<sec id="Ch1.S2.SS2">
  <label>2.2</label><title>Likelihood ratio test</title>
      <p id="d1e703">The likelihood ratio statistical test allows for the comparison of two candidate models. As its name suggests, it is based on the evaluation of the likelihood function of different models.</p>
      <p id="d1e706">The LR test has been used multiple times (Tramblay et al., 2013; Cheng et
al., 2014; Yilmaz et al., 2014) to select between stationary and nonstationary models in the context of nested models. Given a stationary
model characterized by a parameter set <inline-formula><mml:math id="M28" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">θ</mml:mi><mml:mi mathvariant="normal">st</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and a
nonstationary model, with parameter set <inline-formula><mml:math id="M29" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">θ</mml:mi><mml:mi mathvariant="normal">ns</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, if
<inline-formula><mml:math id="M30" display="inline"><mml:mrow><mml:mi>l</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mover accent="true"><mml:mi mathvariant="italic">θ</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mi mathvariant="normal">st</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M31" display="inline"><mml:mrow><mml:mi>l</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mover accent="true"><mml:mi mathvariant="italic">θ</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mi mathvariant="normal">ns</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> are their respective maximized log likelihoods, the likelihood ratio test can be defined using the deviance statistic
            <disp-formula id="Ch1.E3" content-type="numbered"><label>3</label><mml:math id="M32" display="block"><mml:mrow><mml:mi>D</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mfenced close="]" open="["><mml:mrow><mml:mi>l</mml:mi><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi mathvariant="italic">θ</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mi mathvariant="normal">ns</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:mo>-</mml:mo><mml:mi>l</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi mathvariant="italic">θ</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mi mathvariant="normal">st</mml:mi></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:mfenced><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>
          <inline-formula><mml:math id="M33" display="inline"><mml:mi>D</mml:mi></mml:math></inline-formula> is (for large <inline-formula><mml:math id="M34" display="inline"><mml:mi>L</mml:mi></mml:math></inline-formula>) approximately <inline-formula><mml:math id="M35" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">χ</mml:mi><mml:mi>m</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula> distributed, with
<inline-formula><mml:math id="M36" display="inline"><mml:mrow><mml:mi>m</mml:mi><mml:mo>=</mml:mo><mml:mi mathvariant="normal">dim</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="italic">θ</mml:mi><mml:mi mathvariant="normal">ns</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:mo>-</mml:mo><mml:mi mathvariant="normal">dim</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="italic">θ</mml:mi><mml:mi mathvariant="normal">st</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> degrees of freedom. The null hypothesis of stationarity is rejected if <inline-formula><mml:math id="M37" display="inline"><mml:mrow><mml:mi>D</mml:mi><mml:mo>&gt;</mml:mo><mml:msub><mml:mi>C</mml:mi><mml:mi mathvariant="italic">α</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, where <inline-formula><mml:math id="M38" display="inline"><mml:mrow><mml:msub><mml:mi>C</mml:mi><mml:mi mathvariant="italic">α</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the (<inline-formula><mml:math id="M39" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:mi mathvariant="italic">α</mml:mi></mml:mrow></mml:math></inline-formula>) quantile of the <inline-formula><mml:math id="M40" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">χ</mml:mi><mml:mi>m</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula> distribution (Coles, 2001).</p>
      <p id="d1e927">Besides the analysis of power, we also checked (in Sect. 3.3) the approximation <inline-formula><mml:math id="M41" display="inline"><mml:mrow><mml:mi>D</mml:mi><mml:mo>∼</mml:mo><mml:msubsup><mml:mi mathvariant="italic">χ</mml:mi><mml:mi>m</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula> as a function of the sample size <inline-formula><mml:math id="M42" display="inline"><mml:mi>L</mml:mi></mml:math></inline-formula> for the evaluation of the level of significance.</p>
</sec>
<sec id="Ch1.S2.SS3">
  <label>2.3</label><title>Akaike information criterion ratio test</title>
      <p id="d1e962">Information criteria are useful tools for model selection. It is reasonable
to retain that the Akaike information criterion (AIC; Akaike, 1974) is the most
famous among these tools. Based on the Kullback–Leibler discrepancy measure, if
<inline-formula><mml:math id="M43" display="inline"><mml:mi mathvariant="italic">θ</mml:mi></mml:math></inline-formula> is the parameter set of a <inline-formula><mml:math id="M44" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula>-dimensional model
(<inline-formula><mml:math id="M45" display="inline"><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mi mathvariant="normal">dim</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="italic">θ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>), AIC is defined as
            <disp-formula id="Ch1.E4" content-type="numbered"><label>4</label><mml:math id="M46" display="block"><mml:mrow><mml:mi mathvariant="normal">AIC</mml:mi><mml:mo>=</mml:mo><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mi>l</mml:mi><mml:mo>(</mml:mo><mml:mover accent="true"><mml:mi mathvariant="italic">θ</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mi>k</mml:mi><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>
          The model that best fits the data has the lowest value of the AIC between candidates.
It is useful to observe that the term<?pagebreak page476?> proportional to the number of model
parameters allows one to account for the increase of the estimator variance as
the number of model parameters increases.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F1" specific-use="star"><?xmltex \currentcnt{1}?><label>Figure 1</label><caption><p id="d1e1033">An empirical distribution of AIC<inline-formula><mml:math id="M47" display="inline"><mml:msub><mml:mi/><mml:mi>R</mml:mi></mml:msub></mml:math></inline-formula> and the rejection threshold AIC<inline-formula><mml:math id="M48" display="inline"><mml:msub><mml:mi/><mml:mrow><mml:mi>R</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="italic">α</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> of the null hypothesis (stationary GEV parent).</p></caption>
          <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://hess.copernicus.org/articles/24/473/2020/hess-24-473-2020-f01.png"/>

        </fig>

      <p id="d1e1065">Sugiura (1978) observed that the AIC can lead to misleading results for small samples; thus, he proposed a new measure for the AIC:
            <disp-formula id="Ch1.E5" content-type="numbered"><label>5</label><mml:math id="M49" display="block"><mml:mrow><mml:msub><mml:mi mathvariant="normal">AIC</mml:mi><mml:mi>c</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mi>l</mml:mi><mml:mo>(</mml:mo><mml:mover accent="true"><mml:mi mathvariant="italic">θ</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mi>k</mml:mi><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mi>L</mml:mi><mml:mo>-</mml:mo><mml:mi>k</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
          where a second-order bias correction is introduced. Burnham and Anderson (2004) suggested only using this version when <inline-formula><mml:math id="M50" display="inline"><mml:mrow><mml:mi>L</mml:mi><mml:mo>/</mml:mo><mml:msub><mml:mi>k</mml:mi><mml:mi mathvariant="normal">max</mml:mi></mml:msub><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">40</mml:mn></mml:mrow></mml:math></inline-formula>, with <inline-formula><mml:math id="M51" display="inline"><mml:mrow><mml:msub><mml:mi>k</mml:mi><mml:mi mathvariant="normal">max</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> being the maximum number of parameters between the models compared. However, for larger <inline-formula><mml:math id="M52" display="inline"><mml:mi>L</mml:mi></mml:math></inline-formula>, AIC<inline-formula><mml:math id="M53" display="inline"><mml:msub><mml:mi/><mml:mi>c</mml:mi></mml:msub></mml:math></inline-formula> converges to AIC. For a quantitative comparison between the AIC and AIC<inline-formula><mml:math id="M54" display="inline"><mml:msub><mml:mi/><mml:mi>c</mml:mi></mml:msub></mml:math></inline-formula> in the extreme value stationary model selection framework, the reader is referred to Laio et al. (2009).</p>
      <p id="d1e1185">In order to select between stationary and nonstationary candidate models, we
use the ratio
            <disp-formula id="Ch1.E6" content-type="numbered"><label>6</label><mml:math id="M55" display="block"><mml:mrow><mml:msub><mml:mi mathvariant="normal">AIC</mml:mi><mml:mi>R</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msub><mml:mi mathvariant="normal">AIC</mml:mi><mml:mi mathvariant="normal">ns</mml:mi></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mi mathvariant="normal">AIC</mml:mi><mml:mi mathvariant="normal">st</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
          where the subscripts indicate the AIC value obtained for a stationary (st) and a nonstationary (ns) model, both fitted with maximum likelihood to the same data series.</p>
      <p id="d1e1217">Considering that the better fitting model has a lower AIC, if the time series
arises from a nonstationary process, the AIC<inline-formula><mml:math id="M56" display="inline"><mml:msub><mml:mi/><mml:mi>R</mml:mi></mml:msub></mml:math></inline-formula> should be less than 1; the opposite is true if the process is stationary.</p>
      <p id="d1e1229">In order to provide a rigorous comparison between the use of the MK, LR, and
AIC<inline-formula><mml:math id="M57" display="inline"><mml:msub><mml:mi/><mml:mi>R</mml:mi></mml:msub></mml:math></inline-formula>, we evaluated the AIC<inline-formula><mml:math id="M58" display="inline"><mml:msub><mml:mi/><mml:mrow><mml:mi>R</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="italic">α</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> threshold value
corresponding to the significance level <inline-formula><mml:math id="M59" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula> using numerical experiments.</p>
      <p id="d1e1262">More in detail, we adopted the following procedure:
<list list-type="order"><list-item>
      <p id="d1e1267"><inline-formula><mml:math id="M60" display="inline"><mml:mrow><mml:mi>N</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">10</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mn mathvariant="normal">000</mml:mn></mml:mrow></mml:math></inline-formula> samples are generated from a stationary GEV parent distribution, with known parameters;</p></list-item><list-item>
      <p id="d1e1285">for each of these samples the AIC<inline-formula><mml:math id="M61" display="inline"><mml:msub><mml:mi/><mml:mi>R</mml:mi></mml:msub></mml:math></inline-formula> is evaluated by fitting the
stationary and nonstationary GEV models described in Sect. 2.4, thus
providing its empirical distribution (see probability density function, pdf, in Fig. 1);</p></list-item><list-item>
      <p id="d1e1298">exploiting the empirical distribution of AIC<inline-formula><mml:math id="M62" display="inline"><mml:msub><mml:mi/><mml:mi>R</mml:mi></mml:msub></mml:math></inline-formula>, the threshold
associated with a significance level of <inline-formula><mml:math id="M63" display="inline"><mml:mrow><mml:mi mathvariant="italic">α</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.05</mml:mn></mml:mrow></mml:math></inline-formula> is numerically
evaluated. This value, AIC<inline-formula><mml:math id="M64" display="inline"><mml:msub><mml:mi/><mml:mrow><mml:mi>R</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="italic">α</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>, represents the threshold for
rejecting the null hypothesis of stationarity (which in these generations is true) in 5 % of the synthetic samples.</p></list-item></list>
This procedure was applied both for the AIC and AIC<inline-formula><mml:math id="M65" display="inline"><mml:msub><mml:mi/><mml:mi>c</mml:mi></mml:msub></mml:math></inline-formula>. The experiment was
repeated for a few selected sets of the GEV parameters, including different
trend values, and different sample lengths, as detailed in Sect. 3.</p><?xmltex \hack{\newpage}?>
</sec>
<sec id="Ch1.S2.SS4">
  <label>2.4</label><title>The GEV parent distribution</title>
      <p id="d1e1356">The cumulative distribution function of the generalized extreme value (GEV) distribution (Jenkinson,
1955) can be expressed as follows:
            <disp-formula id="Ch1.E7" content-type="numbered"><label>7</label><mml:math id="M66" display="block"><mml:mrow><?xmltex \hack{\hbox\bgroup\fontsize{9}{9}\selectfont$\displaystyle}?><mml:mi>F</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:mi>z</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="italic">θ</mml:mi><mml:mi mathvariant="normal">st</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:mo>=</mml:mo><mml:mfenced open="{" close=""><mml:mtable class="array" columnalign="left center"><mml:mtr><mml:mtd><mml:mrow><mml:mi>exp⁡</mml:mi><mml:mfenced close="}" open="{"><mml:mrow><mml:mo>-</mml:mo><mml:msup><mml:mfenced close="]" open="["><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>+</mml:mo><mml:mi mathvariant="italic">ε</mml:mi><mml:mfenced open="(" close=")"><mml:mstyle displaystyle="false"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mrow><mml:mi>z</mml:mi><mml:mo>-</mml:mo><mml:mi mathvariant="italic">ζ</mml:mi></mml:mrow><mml:mi mathvariant="italic">σ</mml:mi></mml:mfrac></mml:mstyle></mml:mstyle></mml:mfenced></mml:mrow></mml:mfenced><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mi mathvariant="italic">ε</mml:mi></mml:mrow></mml:msup></mml:mrow></mml:mfenced></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>≠</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mrow><mml:mi>exp⁡</mml:mi><mml:mfenced open="{" close="}"><mml:mrow><mml:mo>-</mml:mo><mml:mi>exp⁡</mml:mi><mml:mfenced close="]" open="["><mml:mrow><mml:mo>-</mml:mo><mml:mfenced close=")" open="("><mml:mstyle displaystyle="false"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mrow><mml:mi>z</mml:mi><mml:mo>-</mml:mo><mml:mi mathvariant="italic">ζ</mml:mi></mml:mrow><mml:mi mathvariant="italic">σ</mml:mi></mml:mfrac></mml:mstyle></mml:mstyle></mml:mfenced></mml:mrow></mml:mfenced></mml:mrow></mml:mfenced></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mfenced><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">0</mml:mn><?xmltex \hack{$\egroup}?><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
          where <inline-formula><mml:math id="M67" display="inline"><mml:mi mathvariant="italic">ζ</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M68" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula>, and <inline-formula><mml:math id="M69" display="inline"><mml:mi mathvariant="italic">ε</mml:mi></mml:math></inline-formula> are known as the position, scale, and shape parameters, respectively; <inline-formula><mml:math id="M70" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">θ</mml:mi><mml:mi mathvariant="normal">st</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mo>[</mml:mo><mml:mi mathvariant="italic">ζ</mml:mi></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M71" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M72" display="inline"><mml:mrow><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula> is a general and comprehensive way to express the parameter set in the stationary case. The flexibility of the GEV, which accounts for the Gumbel, Fréchet, and Weibull distributions as special cases (for <inline-formula><mml:math id="M73" display="inline"><mml:mrow><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M74" display="inline"><mml:mrow><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M75" display="inline"><mml:mrow><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> respectively) makes it eligible for a more general discussion about the implications of nonstationarity.</p>
      <p id="d1e1568">Traditional extreme value distributions can be used in a nonstationary
framework, modeling their parameters as function of time or other
covariates (Coles, 2001), producing <inline-formula><mml:math id="M76" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">θ</mml:mi><mml:mi mathvariant="normal">st</mml:mi></mml:msub><mml:mo>⟶</mml:mo><mml:msub><mml:mi mathvariant="italic">θ</mml:mi><mml:mi mathvariant="normal">ns</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mo>[</mml:mo><mml:msub><mml:mi mathvariant="italic">ζ</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M77" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M78" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ε</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula>.</p>
      <p id="d1e1622">In this study, only a deterministic linear dependence on the time <inline-formula><mml:math id="M79" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula> of the
position parameter <inline-formula><mml:math id="M80" display="inline"><mml:mi mathvariant="italic">ζ</mml:mi></mml:math></inline-formula> has been introduced, leading Eq. (7) to be expressed as follows:
            <disp-formula id="Ch1.E8" content-type="numbered"><label>8</label><mml:math id="M81" display="block"><mml:mrow><?xmltex \hack{\hbox\bgroup\fontsize{9}{9}\selectfont$\displaystyle}?><mml:mi>F</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:mi>z</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi mathvariant="italic">θ</mml:mi><mml:mi mathvariant="normal">ns</mml:mi></mml:msub></mml:mrow></mml:mfenced><mml:mo>=</mml:mo><mml:mfenced close="" open="{"><mml:mtable class="array" columnalign="left center"><mml:mtr><mml:mtd><mml:mrow><mml:mi>exp⁡</mml:mi><mml:mfenced close="}" open="{"><mml:mrow><mml:mo>-</mml:mo><mml:msup><mml:mfenced open="[" close="]"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>+</mml:mo><mml:mi mathvariant="italic">ε</mml:mi><mml:mfenced open="(" close=")"><mml:mstyle displaystyle="false"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mrow><mml:mi>z</mml:mi><mml:mo>-</mml:mo><mml:msub><mml:mi mathvariant="italic">ζ</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow><mml:mi mathvariant="italic">σ</mml:mi></mml:mfrac></mml:mstyle></mml:mstyle></mml:mfenced></mml:mrow></mml:mfenced><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mi mathvariant="italic">ε</mml:mi></mml:mrow></mml:msup></mml:mrow></mml:mfenced></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>≠</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mrow><mml:mi>exp⁡</mml:mi><mml:mfenced close="}" open="{"><mml:mrow><mml:mo>-</mml:mo><mml:mi>exp⁡</mml:mi><mml:mfenced open="[" close="]"><mml:mrow><mml:mo>-</mml:mo><mml:mfenced open="(" close=")"><mml:mstyle displaystyle="false"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mrow><mml:mi>z</mml:mi><mml:mo>-</mml:mo><mml:msub><mml:mi mathvariant="italic">ζ</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow><mml:mi mathvariant="italic">σ</mml:mi></mml:mfrac></mml:mstyle></mml:mstyle></mml:mfenced></mml:mrow></mml:mfenced></mml:mrow></mml:mfenced></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mfenced><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">0</mml:mn><?xmltex \hack{$\egroup}?></mml:mrow></mml:math></disp-formula>
          with
            <disp-formula id="Ch1.E9" content-type="numbered"><label>9</label><mml:math id="M82" display="block"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ζ</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi mathvariant="italic">ζ</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi mathvariant="italic">ζ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mi>t</mml:mi></mml:mrow></mml:math></disp-formula>
          and <inline-formula><mml:math id="M83" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">θ</mml:mi><mml:mi mathvariant="normal">ns</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mo>[</mml:mo><mml:msub><mml:mi mathvariant="italic">ζ</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M84" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ζ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M85" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M86" display="inline"><mml:mrow><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula>.</p>
      <p id="d1e1836">It is important to note that Eq. (8) is a more general way of defining the GEV
and has the property of degenerating into Eq. (7) for <inline-formula><mml:math id="M87" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ζ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula>; in
other words, Eq. (7) represents a nested model of Eq. (8) which would confirm the
suitability of the likelihood ratio test for model selection.</p>
      <p id="d1e1855">According to Muraleedharan et al. (2010), the first three moments of the GEV
distribution are as follows:

                <disp-formula specific-use="align" content-type="numbered"><mml:math id="M88" display="block"><mml:mtable displaystyle="true"><mml:mlabeledtr id="Ch1.E10"><mml:mtd><mml:mtext>10</mml:mtext></mml:mtd><mml:mtd><mml:mstyle displaystyle="true" class="stylechange"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mi mathvariant="normal">mean</mml:mi><mml:mo>=</mml:mo><mml:mi mathvariant="italic">ζ</mml:mi><mml:mo>+</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="italic">ε</mml:mi></mml:mfrac></mml:mstyle><mml:mfenced close=")" open="("><mml:mrow><mml:msub><mml:mi>g</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:mfenced><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>≠</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E11"><mml:mtd><mml:mtext>11</mml:mtext></mml:mtd><mml:mtd><mml:mstyle displaystyle="true" class="stylechange"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mi mathvariant="normal">variance</mml:mi><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow><mml:mrow><mml:msup><mml:mi mathvariant="italic">ε</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:mfrac></mml:mstyle><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mi>g</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>-</mml:mo><mml:msubsup><mml:mi>g</mml:mi><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:mfenced><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>≠</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>&lt;</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle><mml:mo>,</mml:mo><mml:mtext>and</mml:mtext></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E12"><mml:mtd><mml:mtext>12</mml:mtext></mml:mtd><mml:mtd><mml:mstyle class="stylechange" displaystyle="true"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mi mathvariant="normal">skewness</mml:mi><mml:mo>=</mml:mo><mml:mi mathvariant="normal">sgn</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>)</mml:mo><mml:mo>⋅</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msub><mml:mi>g</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn><mml:msub><mml:mi>g</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:msub><mml:mi>g</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>+</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:msubsup><mml:mi>g</mml:mi><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">3</mml:mn></mml:msubsup></mml:mrow><mml:mrow><mml:msup><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mi>g</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>-</mml:mo><mml:msubsup><mml:mi>g</mml:mi><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:mfenced><mml:mrow><mml:mn mathvariant="normal">3</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:mfrac></mml:mstyle><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>≠</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.25em"/><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>&lt;</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">3</mml:mn></mml:mfrac></mml:mstyle><mml:mo>.</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

            Here, <inline-formula><mml:math id="M89" display="inline"><mml:mrow><mml:msub><mml:mi>g</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mi mathvariant="normal">Γ</mml:mi><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:mi>k</mml:mi><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, with <inline-formula><mml:math id="M90" display="inline"><mml:mrow><mml:mi>k</mml:mi><mml:mo>∈</mml:mo><mml:msup><mml:mi>Z</mml:mi><mml:mo>+</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M91" display="inline"><mml:mrow><mml:mi mathvariant="normal">Γ</mml:mi><mml:mo>(</mml:mo><mml:mo>⋅</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, is the gamma function. It is worth noting that, following Eqs. (10)–(12), the trend in the position parameter only affects the mean, while the variance and skewness remain constant.</p>
      <p id="d1e2145">In this work, we used the maximum likelihood method (ML) to estimate the GEV
parameters from sample data. The ML allows one to treat <inline-formula><mml:math id="M92" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ζ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> as an independent parameter, as well as <inline-formula><mml:math id="M93" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ζ</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M94" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M95" display="inline"><mml:mi mathvariant="italic">ε</mml:mi></mml:math></inline-formula>. For this purpose, we exploited the “extRemes” R package (Gilleland and Katz, 2016).</p><?xmltex \hack{\newpage}?>
</sec>
<?pagebreak page477?><sec id="Ch1.S2.SS5">
  <label>2.5</label><title>Numerical evaluation of test power and significance level</title>
      <p id="d1e2193">The power of a test is related to the type II error and is the
probability of correctly rejecting the null hypothesis when it is false. In
particular, defining <inline-formula><mml:math id="M96" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula> (level of significance), the probability of a
type I error, and <inline-formula><mml:math id="M97" display="inline"><mml:mi mathvariant="italic">β</mml:mi></mml:math></inline-formula>, the probability of a type II error, we have a
power value of <inline-formula><mml:math id="M98" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:mi mathvariant="italic">β</mml:mi></mml:mrow></mml:math></inline-formula>. The maximum value of power is 1, which correspond to
<inline-formula><mml:math id="M99" display="inline"><mml:mrow><mml:mi mathvariant="italic">β</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula>, i.e., no probability of a type II error. In most applications, the
conventional values are <inline-formula><mml:math id="M100" display="inline"><mml:mrow><mml:mi mathvariant="italic">α</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.05</mml:mn></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M101" display="inline"><mml:mrow><mml:mi mathvariant="italic">β</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.2</mml:mn></mml:mrow></mml:math></inline-formula>, meaning that a 1-to-4 trade-off between <inline-formula><mml:math id="M102" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M103" display="inline"><mml:mi mathvariant="italic">β</mml:mi></mml:math></inline-formula> is accepted. Thus, in our experiment we always assumed a significance level of 0.05, and, for the following description of results and discussion, we considered a power level of less than 0.8 to be too low and, hence, unacceptable. In Sect. 4, we report further considerations regarding this choice. For each of the tests described in Sect. 2.1, 2.2, and 2.3, the power was numerically evaluated according to the following procedure:
<list list-type="order"><list-item>
      <p id="d1e2275"><inline-formula><mml:math id="M104" display="inline"><mml:mrow><mml:mi>N</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">2000</mml:mn></mml:mrow></mml:math></inline-formula> Monte Carlo synthetic series, each of length <inline-formula><mml:math id="M105" display="inline"><mml:mi>L</mml:mi></mml:math></inline-formula>, are generated using the
nonstationary GEV in Eqs. (8) and (9) as a parent distribution with a fixed parameter set <inline-formula><mml:math id="M106" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">θ</mml:mi><mml:mi mathvariant="normal">ns</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mo>[</mml:mo><mml:msub><mml:mi mathvariant="italic">ζ</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M107" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ζ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M108" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M109" display="inline"><mml:mrow><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula> with <inline-formula><mml:math id="M110" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ζ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>≠</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula>.</p></list-item><list-item>
      <p id="d1e2361">The threshold AIC<inline-formula><mml:math id="M111" display="inline"><mml:msub><mml:mi/><mml:mrow><mml:mi>R</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="italic">α</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> associated with a significance level of <inline-formula><mml:math id="M112" display="inline"><mml:mrow><mml:mi mathvariant="italic">α</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.05</mml:mn></mml:mrow></mml:math></inline-formula> is numerically evaluated, as described in Sect. 2.3, using
the corresponding parameter set <inline-formula><mml:math id="M113" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">θ</mml:mi><mml:mi mathvariant="normal">st</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mo>[</mml:mo><mml:msub><mml:mi mathvariant="italic">ζ</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M114" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M115" display="inline"><mml:mrow><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula> of the GEV parent distribution.</p></list-item><list-item>
      <p id="d1e2428">From these synthetic series, the power of the test is estimated as<disp-formula id="Ch1.Ex3"><mml:math id="M116" display="block"><mml:mrow><mml:mi mathvariant="normal">rejection</mml:mi><mml:mspace width="0.25em" linebreak="nobreak"/><mml:mi mathvariant="normal">rate</mml:mi><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">rej</mml:mi></mml:msub></mml:mrow><mml:mi>N</mml:mi></mml:mfrac></mml:mstyle><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>where <inline-formula><mml:math id="M117" display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">rej</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the number of series for which the null hypothesis is rejected, as in Yue et al. (2002a).</p></list-item></list>
<?xmltex \hack{\newpage}?><?xmltex \hack{\noindent}?>The same procedure, with <inline-formula><mml:math id="M118" display="inline"><mml:mrow><mml:mi>N</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">10</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mn mathvariant="normal">000</mml:mn></mml:mrow></mml:math></inline-formula>, was used in order to check the actual significance level of the test, which is the probability of type I error, i.e., the probability of rejecting the null hypothesis of stationarity when it is true. The task was performed by following the abovementioned steps 1 to 3 while replacing <inline-formula><mml:math id="M119" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">θ</mml:mi><mml:mi mathvariant="normal">ns</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> with <inline-formula><mml:math id="M120" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">θ</mml:mi><mml:mi mathvariant="normal">st</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> in step 1); in such a case, the rejection rate <inline-formula><mml:math id="M121" display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mi mathvariant="normal">rej</mml:mi></mml:msub><mml:mo>/</mml:mo><mml:mi>N</mml:mi></mml:mrow></mml:math></inline-formula> represents the actual level of significance <inline-formula><mml:math id="M122" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula>.</p>
      <p id="d1e2531">We used a reduced number of generations (<inline-formula><mml:math id="M123" display="inline"><mml:mrow><mml:mi>N</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">2000</mml:mn></mml:mrow></mml:math></inline-formula>) for the evaluation of
power as a good compromise between the quality of the results and computational time. <inline-formula><mml:math id="M124" display="inline"><mml:mrow><mml:mi>N</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">2000</mml:mn></mml:mrow></mml:math></inline-formula> was also used by Yue et al. (2002a).</p>
</sec>
</sec>
<sec id="Ch1.S3">
  <label>3</label><title>Sensitivity analysis, results, and discussion</title>
      <p id="d1e2567">A comparative evaluation of the tests' performance was carried out for different GEV parameter sets <inline-formula><mml:math id="M125" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">θ</mml:mi><mml:mi mathvariant="normal">ns</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, considering three values of <inline-formula><mml:math id="M126" display="inline"><mml:mi mathvariant="italic">ε</mml:mi></mml:math></inline-formula> (<inline-formula><mml:math id="M127" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.4</mml:mn></mml:mrow></mml:math></inline-formula>, 0, and 0.4) and three values of <inline-formula><mml:math id="M128" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula> (10, 15, and 20). The position parameter was always kept constant and equal to <inline-formula><mml:math id="M129" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ζ</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">40</mml:mn></mml:mrow></mml:math></inline-formula>. Then, for any possible pair of <inline-formula><mml:math id="M130" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M131" display="inline"><mml:mi mathvariant="italic">ε</mml:mi></mml:math></inline-formula> values, we considered <inline-formula><mml:math id="M132" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ζ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> ranging from <inline-formula><mml:math id="M133" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> to 1 with a step size of 0.1. Such a range of parameters represents a wide domain in the hydrologically feasible parameter space of annual maximum daily rainfall. Upper-bounded (<inline-formula><mml:math id="M134" display="inline"><mml:mrow><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>=</mml:mo><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.4</mml:mn></mml:mrow></mml:math></inline-formula>), EV1 (<inline-formula><mml:math id="M135" display="inline"><mml:mrow><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula>), and heavy-tailed (<inline-formula><mml:math id="M136" display="inline"><mml:mrow><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>=</mml:mo><mml:mo>+</mml:mo><mml:mn mathvariant="normal">0.4</mml:mn></mml:mrow></mml:math></inline-formula>) cases are included. Moreover, for each of these parameter sets <inline-formula><mml:math id="M137" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">θ</mml:mi><mml:mi mathvariant="normal">ns</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M138" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula> samples of different sizes (30, 50, and 70) were generated.</p>
      <?pagebreak page478?><p id="d1e2715">For a clear exposition of the results, this section is divided into four
subsections. In Sect. 3.1, we focus on the opportunity to use the AIC or
AIC<inline-formula><mml:math id="M139" display="inline"><mml:msub><mml:mi/><mml:mi>c</mml:mi></mml:msub></mml:math></inline-formula> for the evaluation of AIC<inline-formula><mml:math id="M140" display="inline"><mml:msub><mml:mi/><mml:mi>R</mml:mi></mml:msub></mml:math></inline-formula>; in Sect. 3.2, the
comparison of test power and its sensitivity analysis to the parent distribution parameters and the sample size is shown; in Sect. 3.3, the evaluation of the
level of significance for all tests and, in particular, the validity of the
<inline-formula><mml:math id="M141" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">χ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> approximation for the <inline-formula><mml:math id="M142" display="inline"><mml:mi>D</mml:mi></mml:math></inline-formula> statistic is discussed; and finally in Sect. 3.4, the numerical investigation of the sample variability of the
parameters is reported.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F2" specific-use="star"><?xmltex \currentcnt{2}?><label>Figure 2</label><caption><p id="d1e2756">Distributions of the differences between the power of AIC<inline-formula><mml:math id="M143" display="inline"><mml:msub><mml:mi/><mml:mi>R</mml:mi></mml:msub></mml:math></inline-formula> evaluated with the AIC and AIC<inline-formula><mml:math id="M144" display="inline"><mml:msub><mml:mi/><mml:mi>c</mml:mi></mml:msub></mml:math></inline-formula> for <inline-formula><mml:math id="M145" display="inline"><mml:mrow><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.4</mml:mn></mml:mrow></mml:math></inline-formula>.</p></caption>
        <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://hess.copernicus.org/articles/24/473/2020/hess-24-473-2020-f02.png"/>

      </fig>

<sec id="Ch1.S3.SS1">
  <label>3.1</label><?xmltex \opttitle{Evaluation of AIC${}_{{R}}$, with the AIC and AIC${}_{{c}}$}?><title>Evaluation of AIC<inline-formula><mml:math id="M146" display="inline"><mml:msub><mml:mi/><mml:mi>R</mml:mi></mml:msub></mml:math></inline-formula>, with the AIC and AIC<inline-formula><mml:math id="M147" display="inline"><mml:msub><mml:mi/><mml:mi>c</mml:mi></mml:msub></mml:math></inline-formula></title>
      <p id="d1e2821">Considering the nonstationary GEV four-parameter model, in order to satisfy
the relation <inline-formula><mml:math id="M148" display="inline"><mml:mrow><mml:mi>L</mml:mi><mml:mo>/</mml:mo><mml:msub><mml:mi>k</mml:mi><mml:mi mathvariant="normal">max</mml:mi></mml:msub><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">40</mml:mn></mml:mrow></mml:math></inline-formula> suggested by Burnham and Anderson (2004), a time series with a record length no less than 160 should be available. Following this simple reasoning, the AIC should be considered not to be applicable to any annual maximum series showing a changing point between the 1970s and 1980s (e.g., Kiely, 1999). In our numerical experiment, the second-order bias correction of Sugiura (1978) should always be used, as we have <inline-formula><mml:math id="M149" display="inline"><mml:mrow><mml:mi>L</mml:mi><mml:mo>/</mml:mo><mml:msub><mml:mi>k</mml:mi><mml:mi mathvariant="normal">max</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">70</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">4</mml:mn><mml:mo>=</mml:mo><mml:mn mathvariant="normal">17.5</mml:mn></mml:mrow></mml:math></inline-formula> for the nonstationary GEV for a maximum sample length of <inline-formula><mml:math id="M150" display="inline"><mml:mrow><mml:mi>L</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">70</mml:mn></mml:mrow></mml:math></inline-formula>. Nevertheless, we checked if using the AIC or AIC<inline-formula><mml:math id="M151" display="inline"><mml:msub><mml:mi/><mml:mi>c</mml:mi></mml:msub></mml:math></inline-formula> may affect results. For this purpose, we evaluated the percentage differences between the power of the AIC<inline-formula><mml:math id="M152" display="inline"><mml:msub><mml:mi/><mml:mi>R</mml:mi></mml:msub></mml:math></inline-formula> obtained by means of the AIC and AIC<inline-formula><mml:math id="M153" display="inline"><mml:msub><mml:mi/><mml:mi>c</mml:mi></mml:msub></mml:math></inline-formula> from synthetic series. In Fig. 2, the empirical probability density functions of such percentage differences, grouped according to sample length, are plotted for generations with <inline-formula><mml:math id="M154" display="inline"><mml:mrow><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.4</mml:mn></mml:mrow></mml:math></inline-formula> and different values of <inline-formula><mml:math id="M155" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula>. It is interesting to note that the error distribution only shows a regular and unbiased bell-shaped distribution for <inline-formula><mml:math id="M156" display="inline"><mml:mrow><mml:mi>L</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">70</mml:mn></mml:mrow></mml:math></inline-formula>. We then observe a small negative bias (about <inline-formula><mml:math id="M157" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.02</mml:mn></mml:mrow></mml:math></inline-formula> %) for <inline-formula><mml:math id="M158" display="inline"><mml:mrow><mml:mi>L</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">50</mml:mn></mml:mrow></mml:math></inline-formula>, while a bias of <inline-formula><mml:math id="M159" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.08</mml:mn></mml:mrow></mml:math></inline-formula> with a multi-peak and negatively skewed pdf is noted for <inline-formula><mml:math id="M160" display="inline"><mml:mrow><mml:mi>L</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">30</mml:mn></mml:mrow></mml:math></inline-formula>. The latter pdf also has a higher variance than the others. The purpose of this figure is to show that the difference between the power obtained with the AIC and the power obtained with the AIC<inline-formula><mml:math id="M161" display="inline"><mml:msub><mml:mi/><mml:mi>c</mml:mi></mml:msub></mml:math></inline-formula> is negligible. Different peaks in one curve (<inline-formula><mml:math id="M162" display="inline"><mml:mrow><mml:mi>L</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">30</mml:mn></mml:mrow></mml:math></inline-formula>) can be explained by the merging of sample errors obtained for different values of <inline-formula><mml:math id="M163" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula>. Similar results were obtained for all values of <inline-formula><mml:math id="M164" display="inline"><mml:mi mathvariant="italic">ε</mml:mi></mml:math></inline-formula>, which always provided very low differences and allow for the conclusion to be reached that the use of the AIC or AIC<inline-formula><mml:math id="M165" display="inline"><mml:msub><mml:mi/><mml:mi>c</mml:mi></mml:msub></mml:math></inline-formula> does not significantly affect the power of AIC<inline-formula><mml:math id="M166" display="inline"><mml:msub><mml:mi/><mml:mi>R</mml:mi></mml:msub></mml:math></inline-formula> for the cases examined. This follows the combined effect of the sample size (whose minimum value considered here is 30) and the limited difference in the number of parameters in the selected models. In the following, we will refer to and show only the plots obtained for the AIC<inline-formula><mml:math id="M167" display="inline"><mml:msub><mml:mi/><mml:mi>R</mml:mi></mml:msub></mml:math></inline-formula> in Eq. (6) with the AIC evaluated as in Eq. (4).</p>
</sec>
<sec id="Ch1.S3.SS2">
  <label>3.2</label><title>Dependence of the power on the parent distribution parameters and sample size</title>
      <p id="d1e3057">The effect of the parent distribution parameters and the sample size on the numerical evaluation of the power and significance level of the MK, LR, and AIC<inline-formula><mml:math id="M168" display="inline"><mml:msub><mml:mi/><mml:mi>R</mml:mi></mml:msub></mml:math></inline-formula> tests for different values of <inline-formula><mml:math id="M169" display="inline"><mml:mi mathvariant="italic">ε</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M170" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula>, and <inline-formula><mml:math id="M171" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ζ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> is shown in Fig. 3. The curves represent both the significance level, which is shown for
<inline-formula><mml:math id="M172" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ζ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> (true parent is the stationary GEV), and the power, which is shown for all other values <inline-formula><mml:math id="M173" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ζ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>≠</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> (true parent is the nonstationary GEV). Each panel in Fig. 3 shows the dependence of the power and
significance level of MK, LR and AIC<inline-formula><mml:math id="M174" display="inline"><mml:msub><mml:mi/><mml:mi>R</mml:mi></mml:msub></mml:math></inline-formula> on the trend coefficient for one set of parameter values and different sample sizes. In all panels, the test power strongly depends on the trend coefficient and sample size. This dependence is also affected by the
parent parameter values. In all cases, the power reaches 1 for a strong trend
and approaches 0.05 (the chosen level of significance) for a weak trend
(<inline-formula><mml:math id="M175" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ζ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> close to 0). In all combinations of the shape and scale
parameters (and especially for short samples) for a wide range of trend
values, the power exhibits values well below the conventional value of 0.8. The
curves' slope between 0.05 and 1 is sharp for long samples and gentle for
short samples. It also depends on the parameter set, with slopes
generally being gentler for higher values of the scale (<inline-formula><mml:math id="M176" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula>) and shape (<inline-formula><mml:math id="M177" display="inline"><mml:mi mathvariant="italic">ε</mml:mi></mml:math></inline-formula>) parameters of the parent distribution. A significant
difference in the power between the MK, LR, and AIC<inline-formula><mml:math id="M178" display="inline"><mml:msub><mml:mi/><mml:mi>R</mml:mi></mml:msub></mml:math></inline-formula> tests is observable when the sample size is smaller and even more so when the parent distribution is heavy-tailed (<inline-formula><mml:math id="M179" display="inline"><mml:mrow><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>=</mml:mo><mml:mo>+</mml:mo><mml:mn mathvariant="normal">0.4</mml:mn></mml:mrow></mml:math></inline-formula>).</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F3" specific-use="star"><?xmltex \currentcnt{3}?><label>Figure 3</label><caption><p id="d1e3185">Dependence of test power on the trend coefficient, sample size, scale, and shape of the parent parameters.</p></caption>
          <?xmltex \igopts{width=455.244094pt}?><graphic xlink:href="https://hess.copernicus.org/articles/24/473/2020/hess-24-473-2020-f03.png"/>

        </fig>

      <p id="d1e3194">In particular, for <inline-formula><mml:math id="M180" display="inline"><mml:mrow><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M181" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.4</mml:mn></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M182" display="inline"><mml:mrow><mml:mi>L</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">50</mml:mn></mml:mrow></mml:math></inline-formula>, 70, it is possible to
report a slightly larger power of LR with respect to the AIC<inline-formula><mml:math id="M183" display="inline"><mml:msub><mml:mi/><mml:mi>R</mml:mi></mml:msub></mml:math></inline-formula> and MK, but
values are very close to each other. However, the reciprocal position
of MK and AIC<inline-formula><mml:math id="M184" display="inline"><mml:msub><mml:mi/><mml:mi>R</mml:mi></mml:msub></mml:math></inline-formula> power curves is interesting; in fact, the AIC<inline-formula><mml:math id="M185" display="inline"><mml:msub><mml:mi/><mml:mi>R</mml:mi></mml:msub></mml:math></inline-formula> power is always
larger than that of the MK, except when <inline-formula><mml:math id="M186" display="inline"><mml:mrow><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>=</mml:mo><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.4</mml:mn></mml:mrow></mml:math></inline-formula>, for all values of the scale parameter.</p>
      <p id="d1e3274">A higher difference is found for a heavy-tailed parent distribution
(<inline-formula><mml:math id="M187" display="inline"><mml:mrow><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>=</mml:mo><mml:mo>+</mml:mo><mml:mn mathvariant="normal">0.4</mml:mn></mml:mrow></mml:math></inline-formula>). While LR still has the largest power value, the difference
with respect to <inline-formula><mml:math id="M188" display="inline"><mml:mrow><mml:mi>A</mml:mi><mml:mi>I</mml:mi><mml:msub><mml:mi>C</mml:mi><mml:mi>R</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> remains small and the MK power value almost always collapses to values smaller than 0.5.</p>
      <p id="d1e3306">The practical consequences of such patterns are very important and are discussed
in Sect. 4.</p>
</sec>
<sec id="Ch1.S3.SS3">
  <label>3.3</label><title>Sensitivity and evaluation of the actual significance level</title>
      <p id="d1e3317">We evaluated the threshold values (corresponding to a significance level of
0.05) for accepting/rejecting the null hypothesis of stationarity according
to the methodologies recalled in Sect. 2.1 and 2.2 for the MK and LR tests and
introduced in Sect. 2.3 for AIC<inline-formula><mml:math id="M189" display="inline"><mml:msub><mml:mi/><mml:mi>R</mml:mi></mml:msub></mml:math></inline-formula>. Based on these thresholds, we exploited the generation of series from a stationary model (<inline-formula><mml:math id="M190" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ζ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula>) in order to numerically evaluate the rate of rejection of the null hypothesis, i.e., the actual significance level of the tests considered in the numerical experiment, following the procedure described in Sect. 2.5.</p>
      <p id="d1e3344">Table 1 shows the numerical values of the actual level of significance,
obtained numerically, to be compared with the theoretical value of 0.05 for all of
the sets of parameters and sample sizes considered. Among the three measures
for trend detection, the LR shows the worst performance. The results in Table 1
show that the rejection rate of the (true) null hypothesis is systematically
higher than it should be, and it is also dependent on parent parameter values. This effect is exalted when the parent distribution has an upper boundary
(<inline-formula><mml:math id="M191" display="inline"><mml:mrow><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>=</mml:mo><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.4</mml:mn></mml:mrow></mml:math></inline-formula>) and for shorter series (<inline-formula><mml:math id="M192" display="inline"><mml:mrow><mml:mi>L</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">30</mml:mn></mml:mrow></mml:math></inline-formula>). In practice,<?pagebreak page479?> this
implies that when using the LR test, as described in Sect. 2.2, there is
a higher probability of rejecting the null hypothesis of stationarity
(if it is true) than expected or designed.</p>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T1" specific-use="star"><?xmltex \currentcnt{1}?><label>Table 1</label><caption><p id="d1e3376">The actual level of significance of the tests for different sample sizes, scales, and shapes of the parent parameters.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="12">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="center"/>
     <oasis:colspec colnum="3" colname="col3" align="center"/>
     <oasis:colspec colnum="4" colname="col4" align="center"/>
     <oasis:colspec colnum="5" colname="col5" align="left"/>
     <oasis:colspec colnum="6" colname="col6" align="center"/>
     <oasis:colspec colnum="7" colname="col7" align="center"/>
     <oasis:colspec colnum="8" colname="col8" align="center"/>
     <oasis:colspec colnum="9" colname="col9" align="left"/>
     <oasis:colspec colnum="10" colname="col10" align="center"/>
     <oasis:colspec colnum="11" colname="col11" align="center"/>
     <oasis:colspec colnum="12" colname="col12" align="center"/>
     <oasis:thead>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry rowsep="1" namest="col2" nameend="col4"><inline-formula><mml:math id="M193" display="inline"><mml:mrow><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>=</mml:mo><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.4</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"/>
         <oasis:entry rowsep="1" namest="col6" nameend="col8"><inline-formula><mml:math id="M194" display="inline"><mml:mrow><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col9"/>
         <oasis:entry rowsep="1" namest="col10" nameend="col12"><inline-formula><mml:math id="M195" display="inline"><mml:mrow><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>=</mml:mo><mml:mo>+</mml:mo><mml:mn mathvariant="normal">0.4</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M196" display="inline"><mml:mrow><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M197" display="inline"><mml:mrow><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">15</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M198" display="inline"><mml:mrow><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">20</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M199" display="inline"><mml:mrow><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M200" display="inline"><mml:mrow><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">15</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M201" display="inline"><mml:mrow><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">20</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col9"/>
         <oasis:entry colname="col10"><inline-formula><mml:math id="M202" display="inline"><mml:mrow><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col11"><inline-formula><mml:math id="M203" display="inline"><mml:mrow><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">15</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col12"><inline-formula><mml:math id="M204" display="inline"><mml:mrow><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">20</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row rowsep="1">
         <oasis:entry namest="col1" nameend="col12"><inline-formula><mml:math id="M205" display="inline"><mml:mrow><mml:mi>L</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">30</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">MK</oasis:entry>
         <oasis:entry colname="col2">0.048</oasis:entry>
         <oasis:entry colname="col3">0.047</oasis:entry>
         <oasis:entry colname="col4">0.047</oasis:entry>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6">0.047</oasis:entry>
         <oasis:entry colname="col7">0.050</oasis:entry>
         <oasis:entry colname="col8">0.050</oasis:entry>
         <oasis:entry colname="col9"/>
         <oasis:entry colname="col10">0.046</oasis:entry>
         <oasis:entry colname="col11">0.049</oasis:entry>
         <oasis:entry colname="col12">0.048</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">AIC<inline-formula><mml:math id="M206" display="inline"><mml:msub><mml:mi/><mml:mi>R</mml:mi></mml:msub></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2">0.050</oasis:entry>
         <oasis:entry colname="col3">0.046</oasis:entry>
         <oasis:entry colname="col4">0.052</oasis:entry>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6">0.051</oasis:entry>
         <oasis:entry colname="col7">0.052</oasis:entry>
         <oasis:entry colname="col8">0.045</oasis:entry>
         <oasis:entry colname="col9"/>
         <oasis:entry colname="col10">0.052</oasis:entry>
         <oasis:entry colname="col11">0.054</oasis:entry>
         <oasis:entry colname="col12">0.051</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">LR</oasis:entry>
         <oasis:entry colname="col2">0.104</oasis:entry>
         <oasis:entry colname="col3">0.103</oasis:entry>
         <oasis:entry colname="col4">0.115</oasis:entry>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6">0.061</oasis:entry>
         <oasis:entry colname="col7">0.064</oasis:entry>
         <oasis:entry colname="col8">0.060</oasis:entry>
         <oasis:entry colname="col9"/>
         <oasis:entry colname="col10">0.084</oasis:entry>
         <oasis:entry colname="col11">0.081</oasis:entry>
         <oasis:entry colname="col12">0.083</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry namest="col1" nameend="col12"><inline-formula><mml:math id="M207" display="inline"><mml:mrow><mml:mi>L</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">50</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">MK</oasis:entry>
         <oasis:entry colname="col2">0.050</oasis:entry>
         <oasis:entry colname="col3">0.047</oasis:entry>
         <oasis:entry colname="col4">0.046</oasis:entry>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6">0.044</oasis:entry>
         <oasis:entry colname="col7">0.047</oasis:entry>
         <oasis:entry colname="col8">0.050</oasis:entry>
         <oasis:entry colname="col9"/>
         <oasis:entry colname="col10">0.049</oasis:entry>
         <oasis:entry colname="col11">0.044</oasis:entry>
         <oasis:entry colname="col12">0.048</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">AIC<inline-formula><mml:math id="M208" display="inline"><mml:msub><mml:mi/><mml:mi>R</mml:mi></mml:msub></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2">0.053</oasis:entry>
         <oasis:entry colname="col3">0.053</oasis:entry>
         <oasis:entry colname="col4">0.046</oasis:entry>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6">0.051</oasis:entry>
         <oasis:entry colname="col7">0.051</oasis:entry>
         <oasis:entry colname="col8">0.057</oasis:entry>
         <oasis:entry colname="col9"/>
         <oasis:entry colname="col10">0.050</oasis:entry>
         <oasis:entry colname="col11">0.050</oasis:entry>
         <oasis:entry colname="col12">0.053</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">LR</oasis:entry>
         <oasis:entry colname="col2">0.079</oasis:entry>
         <oasis:entry colname="col3">0.078</oasis:entry>
         <oasis:entry colname="col4">0.074</oasis:entry>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6">0.060</oasis:entry>
         <oasis:entry colname="col7">0.063</oasis:entry>
         <oasis:entry colname="col8">0.063</oasis:entry>
         <oasis:entry colname="col9"/>
         <oasis:entry colname="col10">0.070</oasis:entry>
         <oasis:entry colname="col11">0.069</oasis:entry>
         <oasis:entry colname="col12">0.070</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry namest="col1" nameend="col12"><inline-formula><mml:math id="M209" display="inline"><mml:mrow><mml:mi>L</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">70</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">MK</oasis:entry>
         <oasis:entry colname="col2">0.050</oasis:entry>
         <oasis:entry colname="col3">0.052</oasis:entry>
         <oasis:entry colname="col4">0.054</oasis:entry>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6">0.052</oasis:entry>
         <oasis:entry colname="col7">0.051</oasis:entry>
         <oasis:entry colname="col8">0.047</oasis:entry>
         <oasis:entry colname="col9"/>
         <oasis:entry colname="col10">0.049</oasis:entry>
         <oasis:entry colname="col11">0.048</oasis:entry>
         <oasis:entry colname="col12">0.046</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">AIC<inline-formula><mml:math id="M210" display="inline"><mml:msub><mml:mi/><mml:mi>R</mml:mi></mml:msub></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2">0.047</oasis:entry>
         <oasis:entry colname="col3">0.051</oasis:entry>
         <oasis:entry colname="col4">0.051</oasis:entry>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6">0.058</oasis:entry>
         <oasis:entry colname="col7">0.058</oasis:entry>
         <oasis:entry colname="col8">0.052</oasis:entry>
         <oasis:entry colname="col9"/>
         <oasis:entry colname="col10">0.050</oasis:entry>
         <oasis:entry colname="col11">0.054</oasis:entry>
         <oasis:entry colname="col12">0.051</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">LR</oasis:entry>
         <oasis:entry colname="col2">0.069</oasis:entry>
         <oasis:entry colname="col3">0.069</oasis:entry>
         <oasis:entry colname="col4">0.073</oasis:entry>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6">0.063</oasis:entry>
         <oasis:entry colname="col7">0.065</oasis:entry>
         <oasis:entry colname="col8">0.058</oasis:entry>
         <oasis:entry colname="col9"/>
         <oasis:entry colname="col10">0.062</oasis:entry>
         <oasis:entry colname="col11">0.062</oasis:entry>
         <oasis:entry colname="col12">0.063</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

      <p id="d1e4023">Conversely, the performance of MK with respect to the designed level
of significance is less biased and is independent of the parameter set.
Similar good performance is trivially obtained for the AIC<inline-formula><mml:math id="M211" display="inline"><mml:msub><mml:mi/><mml:mi>R</mml:mi></mml:msub></mml:math></inline-formula>, whose
rejection threshold is numerically evaluated.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F4" specific-use="star"><?xmltex \currentcnt{4}?><label>Figure 4</label><caption><p id="d1e4037">Enlargement of the power test curves in the case (<inline-formula><mml:math id="M212" display="inline"><mml:mrow><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">15</mml:mn></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M213" display="inline"><mml:mrow><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>=</mml:mo><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.4</mml:mn></mml:mrow></mml:math></inline-formula>), with focus on the actual level of significance (<inline-formula><mml:math id="M214" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ζ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula>).</p></caption>
          <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://hess.copernicus.org/articles/24/473/2020/hess-24-473-2020-f04.png"/>

        </fig>

      <p id="d1e4087">The plot in Fig. 4 is displayed in order to focus on the actual value of the
level of significance and, in particular, on the LR approximation <inline-formula><mml:math id="M215" display="inline"><mml:mrow><mml:mi>D</mml:mi><mml:mo>∼</mml:mo><mml:msubsup><mml:mi mathvariant="italic">χ</mml:mi><mml:mi>m</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula> as a function of the sample length <inline-formula><mml:math id="M216" display="inline"><mml:mi>L</mml:mi></mml:math></inline-formula>. The difference between the theoretical and<?pagebreak page480?> numerical values of the significance level is represented by the distance between the bottom value of the curve (obtained for <inline-formula><mml:math id="M217" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ζ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula>, i.e., the stationary GEV model) and the chosen level of significance 0.05, represented by the horizontal dotted line. In particular, in Fig. 4, results for the parameter set (<inline-formula><mml:math id="M218" display="inline"><mml:mrow><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">15</mml:mn></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M219" display="inline"><mml:mrow><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>=</mml:mo><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.4</mml:mn></mml:mrow></mml:math></inline-formula>) show that the actual rate of rejection is always higher than the theoretical one and changes significantly with the sample size; this means that the <inline-formula><mml:math id="M220" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">χ</mml:mi><mml:mi>m</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula> approximation leads to a significant underestimation of the rejection threshold of the <inline-formula><mml:math id="M221" display="inline"><mml:mi>D</mml:mi></mml:math></inline-formula> statistic. Moreover, it seems that the LR power curves (in red) are shifted toward higher values as a consequence of the significance level overestimation, meaning that the LR test power is also overestimated due to the approximation <inline-formula><mml:math id="M222" display="inline"><mml:mrow><mml:mi>D</mml:mi><mml:mo>∼</mml:mo><mml:msubsup><mml:mi mathvariant="italic">χ</mml:mi><mml:mi>m</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula>. These results suggest the use of a numerical procedure for the LR test (such as that introduced for AIC<inline-formula><mml:math id="M223" display="inline"><mml:msub><mml:mi/><mml:mi>R</mml:mi></mml:msub></mml:math></inline-formula> in Sect. 2.3) for evaluating the <inline-formula><mml:math id="M224" display="inline"><mml:mi>D</mml:mi></mml:math></inline-formula> distribution and the rejection threshold.</p>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T2" specific-use="star"><?xmltex \currentcnt{2}?><label>Table 2</label><caption><p id="d1e4212">Actual level of significance of the AIC<inline-formula><mml:math id="M225" display="inline"><mml:msub><mml:mi/><mml:mi>R</mml:mi></mml:msub></mml:math></inline-formula> test for AIC<inline-formula><mml:math id="M226" display="inline"><mml:mrow><mml:msub><mml:mi/><mml:mrow><mml:mi>R</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="italic">α</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula>.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="12">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="center"/>
     <oasis:colspec colnum="3" colname="col3" align="center"/>
     <oasis:colspec colnum="4" colname="col4" align="center"/>
     <oasis:colspec colnum="5" colname="col5" align="left"/>
     <oasis:colspec colnum="6" colname="col6" align="center"/>
     <oasis:colspec colnum="7" colname="col7" align="center"/>
     <oasis:colspec colnum="8" colname="col8" align="center"/>
     <oasis:colspec colnum="9" colname="col9" align="left"/>
     <oasis:colspec colnum="10" colname="col10" align="center"/>
     <oasis:colspec colnum="11" colname="col11" align="center"/>
     <oasis:colspec colnum="12" colname="col12" align="center"/>
     <oasis:thead>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry rowsep="1" namest="col2" nameend="col4"><inline-formula><mml:math id="M227" display="inline"><mml:mrow><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>=</mml:mo><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.4</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"/>
         <oasis:entry rowsep="1" namest="col6" nameend="col8"><inline-formula><mml:math id="M228" display="inline"><mml:mrow><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col9"/>
         <oasis:entry rowsep="1" namest="col10" nameend="col12"><inline-formula><mml:math id="M229" display="inline"><mml:mrow><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>=</mml:mo><mml:mo>+</mml:mo><mml:mn mathvariant="normal">0.4</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><inline-formula><mml:math id="M230" display="inline"><mml:mrow><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M231" display="inline"><mml:mrow><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">15</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M232" display="inline"><mml:mrow><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">20</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M233" display="inline"><mml:mrow><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col7"><inline-formula><mml:math id="M234" display="inline"><mml:mrow><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">15</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M235" display="inline"><mml:mrow><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">20</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col9"/>
         <oasis:entry colname="col10"><inline-formula><mml:math id="M236" display="inline"><mml:mrow><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col11"><inline-formula><mml:math id="M237" display="inline"><mml:mrow><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">15</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col12"><inline-formula><mml:math id="M238" display="inline"><mml:mrow><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">20</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1"><inline-formula><mml:math id="M239" display="inline"><mml:mrow><mml:mi>L</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">30</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2">0.246</oasis:entry>
         <oasis:entry colname="col3">0.254</oasis:entry>
         <oasis:entry colname="col4">0.261</oasis:entry>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6">0.188</oasis:entry>
         <oasis:entry colname="col7">0.191</oasis:entry>
         <oasis:entry colname="col8">0.181</oasis:entry>
         <oasis:entry colname="col9"/>
         <oasis:entry colname="col10">0.220</oasis:entry>
         <oasis:entry colname="col11">0.221</oasis:entry>
         <oasis:entry colname="col12">0.215</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><inline-formula><mml:math id="M240" display="inline"><mml:mrow><mml:mi>L</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">50</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2">0.213</oasis:entry>
         <oasis:entry colname="col3">0.209</oasis:entry>
         <oasis:entry colname="col4">0.206</oasis:entry>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6">0.171</oasis:entry>
         <oasis:entry colname="col7">0.175</oasis:entry>
         <oasis:entry colname="col8">0.170</oasis:entry>
         <oasis:entry colname="col9"/>
         <oasis:entry colname="col10">0.188</oasis:entry>
         <oasis:entry colname="col11">0.207</oasis:entry>
         <oasis:entry colname="col12">0.195</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><inline-formula><mml:math id="M241" display="inline"><mml:mrow><mml:mi>L</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">70</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col2">0.192</oasis:entry>
         <oasis:entry colname="col3">0.192</oasis:entry>
         <oasis:entry colname="col4">0.201</oasis:entry>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6">0.168</oasis:entry>
         <oasis:entry colname="col7">0.169</oasis:entry>
         <oasis:entry colname="col8">0.173</oasis:entry>
         <oasis:entry colname="col9"/>
         <oasis:entry colname="col10">0.184</oasis:entry>
         <oasis:entry colname="col11">0.204</oasis:entry>
         <oasis:entry colname="col12">0.184</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

      <?xmltex \floatpos{t}?><fig id="Ch1.F5" specific-use="star"><?xmltex \currentcnt{5}?><label>Figure 5</label><caption><p id="d1e4611">AIC<inline-formula><mml:math id="M242" display="inline"><mml:msub><mml:mi/><mml:mrow><mml:mi>R</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="italic">α</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> thresholds for different parameter sets vs. sample size.</p></caption>
          <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://hess.copernicus.org/articles/24/473/2020/hess-24-473-2020-f05.png"/>

        </fig>

      <p id="d1e4634">Other considerations can be made regarding the use of AIC<inline-formula><mml:math id="M243" display="inline"><mml:msub><mml:mi/><mml:mi>R</mml:mi></mml:msub></mml:math></inline-formula>. As explained in
Sect. 2.3, we empirically evaluated the AIC<inline-formula><mml:math id="M244" display="inline"><mml:msub><mml:mi/><mml:mrow><mml:mi>R</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="italic">α</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> threshold value using numerical generations with a significance level 0.05 for each of
the parameter sets and sample sizes considered. Similar results were
obtained using the AIC<inline-formula><mml:math id="M245" display="inline"><mml:msub><mml:mi/><mml:mi>c</mml:mi></mml:msub></mml:math></inline-formula>, which are not shown for brevity. We found a
significant dependence of AIC<inline-formula><mml:math id="M246" display="inline"><mml:msub><mml:mi/><mml:mrow><mml:mi>R</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="italic">α</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> on the sample size. Figure 5
shows the AIC<inline-formula><mml:math id="M247" display="inline"><mml:msub><mml:mi/><mml:mrow><mml:mi>R</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="italic">α</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> curves obtained for each of the<?pagebreak page481?> parameter sets vs. sample size. It is also worth noting that all curves asymptotically
trend to 1 as <inline-formula><mml:math id="M248" display="inline"><mml:mi>L</mml:mi></mml:math></inline-formula> increases. This property is due to the structure of the AIC and the
peculiarity of the nested models used in this paper: while using a sample
generated with weak nonstationarity (i.e., when <inline-formula><mml:math id="M249" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ζ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>→</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> in
Eq. 9), the maximum likelihood of the model shown in Eq. 7, <inline-formula><mml:math id="M250" display="inline"><mml:mrow><mml:mi>l</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mover accent="true"><mml:mi mathvariant="italic">θ</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mi mathvariant="normal">st</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, tends toward <inline-formula><mml:math id="M251" display="inline"><mml:mrow><mml:mi>l</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mover accent="true"><mml:mi mathvariant="italic">θ</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mi mathvariant="normal">ns</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> of the model shown in Eq. 8, leaving only the bias correction (2<inline-formula><mml:math id="M252" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula> in equation 4) to discriminate between competing AIC values in model selection applications. As a consequence, the AIC<inline-formula><mml:math id="M253" display="inline"><mml:msub><mml:mi/><mml:mrow><mml:mi>R</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="italic">α</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> should always be lower than 1; however, when increasing the sample size, both the likelihood terms <inline-formula><mml:math id="M254" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mi>l</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mover accent="true"><mml:mi mathvariant="italic">θ</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mi mathvariant="normal">st</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M255" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mi>l</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mover accent="true"><mml:mi mathvariant="italic">θ</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mi mathvariant="normal">ns</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>
in Eq. (4) will also increase, pushing AIC<inline-formula><mml:math id="M256" display="inline"><mml:msub><mml:mi/><mml:mi>R</mml:mi></mml:msub></mml:math></inline-formula> toward the limit 1. Conversely, Fig. 5 shows that the threshold value AIC<inline-formula><mml:math id="M257" display="inline"><mml:msub><mml:mi/><mml:mrow><mml:mi>R</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="italic">α</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is
significantly smaller than 1 up to <inline-formula><mml:math id="M258" display="inline"><mml:mi>L</mml:mi></mml:math></inline-formula> values well beyond the length usually available in this kind of analysis. Hence, the numerical evaluation of the threshold has to be considered as a required task in order to provide an
assigned significance level to model selection. In contrast, the
simple adoption of the selection criteria AIC<inline-formula><mml:math id="M259" display="inline"><mml:mrow><mml:msub><mml:mi/><mml:mi>R</mml:mi></mml:msub><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> (i.e., AIC<inline-formula><mml:math id="M260" display="inline"><mml:mrow><mml:msub><mml:mi/><mml:mrow><mml:mi>R</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="italic">α</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula>) would correspond to an unknown significance level that is
dependent on the parent distribution and sample size. In order to highlight
this point, we evaluated the significance level <inline-formula><mml:math id="M261" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula> corresponding to
AIC<inline-formula><mml:math id="M262" display="inline"><mml:mrow><mml:msub><mml:mi/><mml:mrow><mml:mi>R</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="italic">α</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula>, following the procedure described in Sect. 2.5, by generating <inline-formula><mml:math id="M263" display="inline"><mml:mrow><mml:mi>N</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">10</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mn mathvariant="normal">000</mml:mn></mml:mrow></mml:math></inline-formula> synthetic series (from a stationary model) for any
parameter set and sample length. The results, provided in Table 2, show that <inline-formula><mml:math id="M264" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula> ranges between 0.16 and 0.26 in
the explored GEV parameter domain and mainly depends on the sample length and the shape parameter of the parent
distribution.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F6" specific-use="star"><?xmltex \currentcnt{6}?><label>Figure 6</label><caption><p id="d1e4944">Sample variability of ML-<inline-formula><mml:math id="M265" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ζ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M266" display="inline"><mml:mi mathvariant="italic">δ</mml:mi></mml:math></inline-formula> vs. the trend coefficient <inline-formula><mml:math id="M267" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ζ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>.</p></caption>
          <?xmltex \igopts{width=398.338583pt}?><graphic xlink:href="https://hess.copernicus.org/articles/24/473/2020/hess-24-473-2020-f06.png"/>

        </fig>

</sec>
<sec id="Ch1.S3.SS4">
  <label>3.4</label><title>Sample variability of parent distribution parameters</title>
      <p id="d1e4990">In our opinion, the results shown above, with respect to the performance of parametric and
nonparametric tests, are quite surprising and important. It
is proved that the preference widely accorded to nonparametric tests, due to the fact that
their statistics are allegedly independent from the parent distribution, is not
well founded. Conversely, the use of parametric procedures raises the
problem of correctly estimating the parent distribution and, for the purpose
of this paper, its parameters. Moreover, as the trend coefficient <inline-formula><mml:math id="M268" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ζ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> is a parameter of the parent distribution under nonstationary<?pagebreak page482?> conditions, the proposed parametric approach provides a maximum
likelihood-based estimation of the same trend coefficient, which is hereafter
referred to as ML-<inline-formula><mml:math id="M269" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ζ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>. For a comparison with nonparametric approaches, we
also evaluated the sample variability of the Sen's slope measure (<inline-formula><mml:math id="M270" display="inline"><mml:mi mathvariant="italic">δ</mml:mi></mml:math></inline-formula>)
of the imposed linear trend. Furthermore, in order to provide insights into these issues, we analyzed the
sample variability of the maximum likelihood estimates ML-<inline-formula><mml:math id="M271" display="inline"><mml:mi mathvariant="italic">ε</mml:mi></mml:math></inline-formula> and
ML-<inline-formula><mml:math id="M272" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula> (from the same sets of generations exploited above) for different parameter sets and sample lengths.</p>
      <p id="d1e5036">We evaluated sample variability <inline-formula><mml:math id="M273" display="inline"><mml:mrow><mml:mi>s</mml:mi><mml:mo>[</mml:mo><mml:mo>⋅</mml:mo><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula>, as the standard deviation of the ML estimates of parameter values obtained from synthetic series. In the upper panels of Fig. 6, we show <inline-formula><mml:math id="M274" display="inline"><mml:mi>s</mml:mi></mml:math></inline-formula>[ML-<inline-formula><mml:math id="M275" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ζ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>], and in the lower panels, we show the Sen's slope median <inline-formula><mml:math id="M276" display="inline"><mml:mrow><mml:mi>s</mml:mi><mml:mo>[</mml:mo><mml:mi mathvariant="italic">δ</mml:mi><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula>. In both cases, the sample variability of the linear trend is strongly dependent on sample size and independent from the true <inline-formula><mml:math id="M277" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ζ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> value in the range examined [<inline-formula><mml:math id="M278" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula>, 1]. It reaches high values for short samples and, in such cases, its dependence on the scale and shape parent parameters is also relevant. The ML estimation of the trend coefficient is always more efficient than Sen's slope, and this is observed for heavy-tailed distributions in particular.</p>
      <p id="d1e5107">In Fig. 7, we show the empirical distributions of the Sen's slope <inline-formula><mml:math id="M279" display="inline"><mml:mi mathvariant="italic">δ</mml:mi></mml:math></inline-formula>
and ML-<inline-formula><mml:math id="M280" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ζ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> estimates obtained from samples of size <inline-formula><mml:math id="M281" display="inline"><mml:mrow><mml:mi>L</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">30</mml:mn></mml:mrow></mml:math></inline-formula> with a
parent distribution characterized by <inline-formula><mml:math id="M282" display="inline"><mml:mrow><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">15</mml:mn></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M283" display="inline"><mml:mrow><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>=</mml:mo><mml:mo>[</mml:mo><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.4</mml:mn></mml:mrow></mml:math></inline-formula>, 0, 0.4<inline-formula><mml:math id="M284" display="inline"><mml:mo>]</mml:mo></mml:math></inline-formula>, providing visual information about the range of trend values that may result from a local evaluation. Similar results, characterized by smaller sample variability, as shown in Fig. 6, are obtained for <inline-formula><mml:math id="M285" display="inline"><mml:mrow><mml:mi>L</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">50</mml:mn></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M286" display="inline"><mml:mrow><mml:mi>L</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">70</mml:mn></mml:mrow></mml:math></inline-formula> and are not shown for brevity.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F7" specific-use="star"><?xmltex \currentcnt{7}?><label>Figure 7</label><caption><p id="d1e5203">Empirical distributions of <inline-formula><mml:math id="M287" display="inline"><mml:mi mathvariant="italic">δ</mml:mi></mml:math></inline-formula> and ML-<inline-formula><mml:math id="M288" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ζ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> evaluated from samples with <inline-formula><mml:math id="M289" display="inline"><mml:mrow><mml:mi>L</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">30</mml:mn></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M290" display="inline"><mml:mrow><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">15</mml:mn></mml:mrow></mml:math></inline-formula> vs. the trend coefficient <inline-formula><mml:math id="M291" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ζ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>.</p></caption>
          <?xmltex \igopts{width=426.791339pt}?><graphic xlink:href="https://hess.copernicus.org/articles/24/473/2020/hess-24-473-2020-f07.png"/>

        </fig>

      <p id="d1e5265">Figure 8 shows the sample variability of ML-<inline-formula><mml:math id="M292" display="inline"><mml:mi mathvariant="italic">ε</mml:mi></mml:math></inline-formula> and ML-<inline-formula><mml:math id="M293" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula>,
which is still independent of the true <inline-formula><mml:math id="M294" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ζ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> for values of
<inline-formula><mml:math id="M295" display="inline"><mml:mrow><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> and 0.4, whereas for the upper-bounded GEV distributions
(<inline-formula><mml:math id="M296" display="inline"><mml:mrow><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>=</mml:mo><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.4</mml:mn></mml:mrow></mml:math></inline-formula>) it shows a significant increase for higher values of <inline-formula><mml:math id="M297" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula> and high trend coefficients (<inline-formula><mml:math id="M298" display="inline"><mml:mrow><mml:mo>|</mml:mo><mml:msub><mml:mi mathvariant="italic">ζ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>|</mml:mo><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">0.5</mml:mn></mml:mrow></mml:math></inline-formula>). The randomness of results for <inline-formula><mml:math id="M299" display="inline"><mml:mrow><mml:mi>L</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">30</mml:mn></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M300" display="inline"><mml:mrow><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>=</mml:mo><mml:mo>[</mml:mo><mml:mn mathvariant="normal">15</mml:mn></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M301" display="inline"><mml:mrow><mml:mn mathvariant="normal">20</mml:mn><mml:mo>]</mml:mo></mml:mrow></mml:math></inline-formula> is probably due to the reduced efficiency of the algorithm that maximizes the <inline-formula><mml:math id="M302" display="inline"><mml:mi>log⁡</mml:mi></mml:math></inline-formula>-likelihood function for heavy-tailed distributions.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F8" specific-use="star"><?xmltex \currentcnt{8}?><label>Figure 8</label><caption><p id="d1e5391">Sample variability of ML-<inline-formula><mml:math id="M303" display="inline"><mml:mi mathvariant="italic">ε</mml:mi></mml:math></inline-formula> and ML-<inline-formula><mml:math id="M304" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula> vs. the trend coefficient <inline-formula><mml:math id="M305" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ζ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>.</p></caption>
          <?xmltex \igopts{width=426.791339pt}?><graphic xlink:href="https://hess.copernicus.org/articles/24/473/2020/hess-24-473-2020-f08.png"/>

        </fig>

      <?xmltex \floatpos{t}?><fig id="Ch1.F9" specific-use="star"><?xmltex \currentcnt{9}?><label>Figure 9</label><caption><p id="d1e5427">Empirical distributions of ML-<inline-formula><mml:math id="M306" display="inline"><mml:mi mathvariant="italic">ε</mml:mi></mml:math></inline-formula> evaluated for <inline-formula><mml:math id="M307" display="inline"><mml:mrow><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">15</mml:mn></mml:mrow></mml:math></inline-formula> from samples with <inline-formula><mml:math id="M308" display="inline"><mml:mrow><mml:mi>L</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">30</mml:mn></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M309" display="inline"><mml:mrow><mml:mi>L</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">70</mml:mn></mml:mrow></mml:math></inline-formula> vs. the trend coefficient <inline-formula><mml:math id="M310" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ζ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>.</p></caption>
          <?xmltex \igopts{width=398.338583pt}?><graphic xlink:href="https://hess.copernicus.org/articles/24/473/2020/hess-24-473-2020-f09.png"/>

        </fig>

      <?xmltex \floatpos{t}?><fig id="Ch1.F10" specific-use="star"><?xmltex \currentcnt{10}?><label>Figure 10</label><caption><p id="d1e5493">Empirical distributions of ML-<inline-formula><mml:math id="M311" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula> evaluated for <inline-formula><mml:math id="M312" display="inline"><mml:mrow><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">15</mml:mn></mml:mrow></mml:math></inline-formula> from samples with <inline-formula><mml:math id="M313" display="inline"><mml:mrow><mml:mi>L</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">30</mml:mn></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M314" display="inline"><mml:mrow><mml:mi>L</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">70</mml:mn></mml:mrow></mml:math></inline-formula> vs. the trend coefficient <inline-formula><mml:math id="M315" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ζ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>.</p></caption>
          <?xmltex \igopts{width=398.338583pt}?><graphic xlink:href="https://hess.copernicus.org/articles/24/473/2020/hess-24-473-2020-f10.png"/>

        </fig>

      <?pagebreak page484?><p id="d1e5556">In order to better analyze such patterns, for the scale and shape parent
parameters we also report the distribution of their empirical ML estimates
for different parameter sets vs. the true <inline-formula><mml:math id="M316" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ζ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> value used in
generation. The sample distribution of ML-<inline-formula><mml:math id="M317" display="inline"><mml:mi mathvariant="italic">ε</mml:mi></mml:math></inline-formula> for <inline-formula><mml:math id="M318" display="inline"><mml:mrow><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">15</mml:mn></mml:mrow></mml:math></inline-formula> is
shown in Fig. 9 for <inline-formula><mml:math id="M319" display="inline"><mml:mrow><mml:mi>L</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">30</mml:mn></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M320" display="inline"><mml:mrow><mml:mi>L</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">70</mml:mn></mml:mrow></mml:math></inline-formula>. The sample distribution of ML-<inline-formula><mml:math id="M321" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula> for <inline-formula><mml:math id="M322" display="inline"><mml:mrow><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">15</mml:mn></mml:mrow></mml:math></inline-formula> is shown in Fig. 10 for <inline-formula><mml:math id="M323" display="inline"><mml:mrow><mml:mi>L</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">30</mml:mn></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M324" display="inline"><mml:mrow><mml:mi>L</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">70</mml:mn></mml:mrow></mml:math></inline-formula>. The panels show
that the presence of a strong trend coefficient may produce significant loss
in the estimator efficiency, which is probably due to deviation from the normal
distribution of the sample estimates for long samples. This suggests
the need for more robust estimation procedures that provide higher
efficiency for estimates of <inline-formula><mml:math id="M325" display="inline"><mml:mi mathvariant="italic">ε</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M326" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula> in the case of a strong
observed trend. It should be highlighted that efficiency in the parameter
estimation increases with sample size for <inline-formula><mml:math id="M327" display="inline"><mml:mrow><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>=</mml:mo><mml:mo>[</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula>, 0.4<inline-formula><mml:math id="M328" display="inline"><mml:mo>]</mml:mo></mml:math></inline-formula>, whereas it decreases for both <inline-formula><mml:math id="M329" display="inline"><mml:mi mathvariant="italic">ε</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M330" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula> in the case of <inline-formula><mml:math id="M331" display="inline"><mml:mrow><mml:mi mathvariant="italic">ε</mml:mi><mml:mo>=</mml:mo><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.4</mml:mn><mml:mo>,</mml:mo></mml:mrow></mml:math></inline-formula> where the trend of the location parameter implies a shift in time of the distribution upper bound.</p>
</sec>
</sec>
<sec id="Ch1.S4" sec-type="conclusions">
  <label>4</label><title>Conclusions</title>
      <p id="d1e5733">The results shown have important practical implications. The dependence of
test power on the parent distribution parameters may significantly affect
results of both parametric and nonparametric tests, including the widely
used Mann–Kendall test.</p>
      <p id="d1e5736"><?xmltex \hack{\newpage}?>Considering the feasibility of the numerical evaluation of power, allowed by the
parametric approach, we observe that, while awareness of the crucial
role of type II error has been growing in recent years in the hydrological
literature, a common debate would deserve more development about which power
values should be considered acceptable. Such an issue is much more enhanced
in other scientific fields where the experimental design is traditionally
required to estimate the appropriate sample size to adequately support
results and conclusions. In psychological research, Cohen (1992) proposed 0.8 to be a conventional value of power to be used with level of significance of 0.05, thus lea<?pagebreak page485?>ding to a 4 : 1 ratio between the risk of type II and type I
error. The conventional value proposed by Cohen (1992) has been taken as a
reference by thousands of papers in social and behavioral sciences. In
pharmacological and medical research, depending on the real implications and the nature of the type II error, conventional values of power may be as high as 0.999. This was the value suggested by Lieber (1990) for testing a
treatment for patients' blood pressure. The author stated, while “guarding against cookbook application of statistical methods”, “it should also be noted that, at times, type II error may be more important to an investigator then type I error”.</p>
      <p id="d1e5740">We believe that, when selecting between stationary and nonstationary models
for extreme hydrological event prediction, a fair comparison between the
null and the alternative hypotheses of <inline-formula><mml:math id="M332" display="inline"><mml:mrow><mml:mi mathvariant="italic">α</mml:mi><mml:mo>=</mml:mo><mml:mi mathvariant="italic">β</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.05</mml:mn></mml:mrow></mml:math></inline-formula> should be utilized, which provides a power value of 0.95. In our discussion, we considered 0.8 to be a minimum threshold for acceptable power values.</p>
      <p id="d1e5759">For all of the generation sets and tests conducted, under the null hypothesis
of stationarity, the power has values ranging between the chosen
significance level (0.05) and 1 for large (and larger) ranges of the trend
coefficient. The test power always collapses to very low values for weak
(but climatically important) trend values (e.g., in the case of annual maximum
daily rainfall, <inline-formula><mml:math id="M333" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ζ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> was equal to 0.2 or 0.3 mm yr<inline-formula><mml:math id="M334" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>).
In the presence of a trend, the power is also affected by the scale and shape
parameters of the GEV parent distribution. This observation can be made with
reference to samples of all of the lengths considered in this paper (from 30 to
70 years of observations), but the use of smaller samples significantly
reduces the test power and dramatically extends the range of <inline-formula><mml:math id="M335" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ζ</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>
values for which the power is below the conventional value of 0.8. The use of
this sample size is not rare considering that significant trends due to
anthropic effects are typically investigated in periods following a changing
point often observed in the 1980s.</p>
      <p id="d1e5797">These results also imply that in spatial fields where the alternative
hypothesis of nonstationarity is true but the parent's parameters
(including the trend coefficient) and the sample length are variable in
space, the rate of rejection of the false null hypothesis may be highly
variable from site to site and the power, if left without control, de facto assumes random values in space. In other words, the probability of recognizing the alternative hypothesis of
nonstationarity as true from a single observed sample may unknowingly
change (between 0.05 and 1) from place to place. For small samples (e.g.,
<inline-formula><mml:math id="M336" display="inline"><mml:mrow><mml:mi>L</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">30</mml:mn></mml:mrow></mml:math></inline-formula> in our analysis) and heavy-tailed distributions, the power is always
very low for the entire investigated range of the trend coefficient.</p>
      <p id="d1e5812">Therefore, considering the high spatial variability of the parent distribution
parameters and the relatively short period of reliable and continuous
historical observations usually available, a regional assessment of trend
nonstationarity may suffer from the different probability of the rejection of
the null hypothesis of stationarity (when it is false).</p>
      <p id="d1e5815">These problems affect both parametric and
nonparametric tests (to slightly different degrees). While these considerations are generally applicable to
all of the tests considered, differences also emerge between them. For heavy-tailed parent distributions and smaller samples, the MK test power decreases
more rapidly than for the other tests considered. Low values of power are
already observable for <inline-formula><mml:math id="M337" display="inline"><mml:mrow><mml:mi>L</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">50</mml:mn></mml:mrow></mml:math></inline-formula>. The LR test slightly outperforms the AIC<inline-formula><mml:math id="M338" display="inline"><mml:msub><mml:mi/><mml:mi>R</mml:mi></mml:msub></mml:math></inline-formula> for small sample sizes and higher absolute values of the shape parameter. Nevertheless, the higher value of the LR power seems to be overestimated as a consequence of the <inline-formula><mml:math id="M339" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">χ</mml:mi><mml:mi>m</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:math></inline-formula> approximation for the <inline-formula><mml:math id="M340" display="inline"><mml:mi>D</mml:mi></mml:math></inline-formula> statistic distribution (see Sect. 3.3).</p>
      <p id="d1e5859">Results also suggest that the theoretical distribution of the LR test-statistic
based on the null hypothesis of stationarity may lead to a significant
increase in the rejection rate compared with the chosen level of significance,
i.e., an abnormal rate of rejection of the null hypothesis when it is true.
In this case, the use of numerical techniques, based on the implementation of synthetic
generations performed by exploiting a known parent distribution, should be
preferred.</p>
      <p id="d1e5862">In light of these results, we conclude that the assessment of the parent distribution and the choice of
the null hypothesis should be considered as fundamental preliminary tasks in trend detection on annual
maximum series. Therefore, it is advisable to make use of parametric tests by
numerically evaluating both the rejection threshold for the assigned
significance level and the power corresponding to alternative hypotheses.
This also requires the development of robust techniques for selecting the parent
distribution and estimating its parameters. To this end, the use of
a parametric measure such as the AIC<inline-formula><mml:math id="M341" display="inline"><mml:msub><mml:mi/><mml:mi>R</mml:mi></mml:msub></mml:math></inline-formula>, may take different choices for the parent distribution into account and, even more importantly, allow one to set the null hypothesis differently from the stationary case, based on a priori information.</p>
      <p id="d1e5874">The need for robust procedures to assess the parent distribution and its
parameters is also proven by the numerical simulations that we conducted. Sample
variability of parameters (including the trend coefficient) may increase
rapidly for series with <inline-formula><mml:math id="M342" display="inline"><mml:mi>L</mml:mi></mml:math></inline-formula> values as low as 30 years of the annual maxima. Moreover, we
observed that, in the case of high trends, numerical instability and
non-convergence of algorithms may affect the estimation procedure for upper-bounded and heavy-tailed distributions. Nevertheless, the sample variability
of the ML trend estimator was always found to be smaller than the Sen's slope
sample variability. Finally, it is worth noting that the nonparametric
Sen's slope method, applied to synthetic series, also showed dependence on the
parent distribution parameters, with sample variability being higher for heavy-tailed distributions.</p>
      <p id="d1e5885">This analysis shed light on important eventual flaws in the at-site analysis of climate change provided by nonparametric approaches. Both test
power and trend evaluation are affected by the parent distribution as is
also the case for parametric methods. It is not by chance, in our opinion,
that many technical studies that have recently been conducted around<?pagebreak page486?> the world provide
inhomogeneous maps of positive/negative trends and large areas of
stationarity characterized by weak trends that are not considered
statistically significant.</p>
      <p id="d1e5888">As already stated, an advantage of using parametric tests and numerical
evaluation of the test statistic distribution is given by the possibility of
assuming a null hypothesis based on a preliminary assessment of the parent
distribution, including trend detection via the evaluation of nonstationary
parameters. This could lead to a regionally homogeneous and controlled
assessment of both the significance level and the power in a fair mutual
relationship. With respect to the estimation of the parameters of the parent
distribution, results suggest that at-site analysis may provide highly
biased results. More robust procedures are necessary, such as hierarchic
estimation procedures (Fiorentino et al., 1987), and procedures that provide estimates of <inline-formula><mml:math id="M343" display="inline"><mml:mi mathvariant="italic">ε</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M344" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula> from detrended series (Strupczewski et al.,
2016; Kochanek et al., 2013).</p>
      <p id="d1e5905">As a final remark, concerning real data analysis, in our numerical experiment we showed that a weak linear trend in the mean suffices to reduce power to unacceptable values in some cases. However, we explored the simplest nonstationary working hypothesis by introducing a deterministic linear dependence of the location parameter of the parent distribution on time. Obviously, when making inference from real observed data, other sources of uncertainty may affect statistical inference (trend, heteroscedasticity, persistence, nonlinearity, and so on); moreover, if considering a nonstationary process with underlying deterministic dynamics, the process becomes non-ergodic, implying that statistical inference from sampled series is not representative of the process's ensemble properties (Koutsoyiannis and Montanari, 2015).</p>
      <p id="d1e5908">As a consequence, when considering a nonstationary stochastic process as being produced by a combination of a deterministic function and a stationary
stochastic process, other sources of information and deductive arguments
should be exploited in order to identify the physical mechanism underlying
such relationships. Also, in this case observed time series have a crucial
role in the calibration and validation of deterministic modeling; in other
words, they are important for confirming or disproving the model hypotheses.</p>
      <p id="d1e5911">In the field of frequency analysis of extreme hydrological events, considering the high spatial variability of the sample length, the trend coefficient, the scale, and the shape parameters, among others, physically based probability distributions could be further developed and exploited for the selection and assessment of the parent distribution in the context of nonstationarity and change detection. The physically based probability distributions we refer to are (i) those arising from stochastic compound processes introduced by Todorovic and Zelenhasic (1970), which also include the GEV (see Madsen et al., 1997) and the TCEV (Rossi et al., 1984), and (ii) the theoretically derived distributions following Eagleson (1972) whose parameters are provided by clear physical meaning and are usually estimated with the support of exogenous information in regional methods (e.g., Gioia et al., 2008; Iacobellis et al., 2011; see Rosbjerg et al., 2013 for a more extensive overview).</p>
      <p id="d1e5914">Hence, we believe that “learning from data” (Sivapalan, 2003), will remain
a key task for hydrologists in future years, as they face the challenge of
consistently identifying both deterministic and stochastic components of
change (Montanari et al., 2013). This involves crucial and interdisciplinary
research to develop suitable methodological frameworks for enhancing
physical knowledge and data exploitation, in order to reduce the overall
uncertainty of prediction in a changing environment.</p>
</sec>

      
      </body>
    <back><notes notes-type="dataavailability"><title>Data availability</title>

      <p id="d1e5921">No data sets were used in this article.</p>
  </notes><notes notes-type="authorcontribution"><title>Author contributions</title>

      <p id="d1e5927">All authors contributed in equal measure to all stages of the development and production of this paper.</p>
  </notes><notes notes-type="competinginterests"><title>Competing interests</title>

      <p id="d1e5933">The authors declare that they have no conflict of interest.</p>
  </notes><ack><title>Acknowledgements</title><p id="d1e5939">The authors thank the three anonymous reviewers and the editor Giuliano Di Baldassarre, who all helped to extend and the improve the paper.</p></ack><notes notes-type="financialsupport"><title>Financial support</title>

      <p id="d1e5944">The present investigation was partially carried out with support from the Puglia Region (POR Puglia FESR-FSE 2014–2020) through the “T.E.S.A.” – Tecnologie innovative per l'affinamento Economico e Sostenibile delle Acque reflue depurate rivenienti dagli impianti di depurazione di Taranto Bellavista e Gennarini – project.</p>
  </notes><notes notes-type="reviewstatement"><title>Review statement</title>

      <p id="d1e5950">This paper was edited by Giuliano Di Baldassarre and reviewed by three anonymous referees.</p>
  </notes><ref-list>
    <title>References</title>

      <ref id="bib1.bib1"><label>1</label><?label 1?><mixed-citation>Akaike, H.: A new look at the statistical model identification, IEEE T. Automat. Control, 19, 716–723, <ext-link xlink:href="https://doi.org/10.1109/TAC.1974.1100705" ext-link-type="DOI">10.1109/TAC.1974.1100705</ext-link>, 1974.</mixed-citation></ref>
      <ref id="bib1.bib2"><label>2</label><?label 1?><mixed-citation>Beven, K.: Facets of uncertainty: Epistemic uncertainty, non-stationarity,
likelihood, hypothesis testing, and communication, Hydrolog. Sci. J., 61, 1652–1665, <ext-link xlink:href="https://doi.org/10.1080/02626667.2015.1031761" ext-link-type="DOI">10.1080/02626667.2015.1031761</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bib3"><label>3</label><?label 1?><mixed-citation>
Burnham, K. P. and Anderson, D. R.: Model selection and multimodel inference, Springer, New York, 2004.</mixed-citation></ref>
      <ref id="bib1.bib4"><label>4</label><?label 1?><mixed-citation>Cheng, L., AghaKouchak, A., Gilleland, E., and Katz, R. W.: Non-stationary
extreme value analysis in a changing climate, Climatic Change, 127, 353–369, <ext-link xlink:href="https://doi.org/10.1007/s10584-014-1254-5" ext-link-type="DOI">10.1007/s10584-014-1254-5</ext-link>, 2014.</mixed-citation></ref>
      <?pagebreak page487?><ref id="bib1.bib5"><label>5</label><?label 1?><mixed-citation>
Chow, V. T. (Ed.): Statistical and probability analysis of hydrologic data, in: Handbook of applied hydrology, McGraw-Hill, New York, 8.1–8.97, 1964.</mixed-citation></ref>
      <ref id="bib1.bib6"><label>6</label><?label 1?><mixed-citation>
Cohen, J.: A power primer, Psycholog. Bull., 112, 155–159, 1992.</mixed-citation></ref>
      <ref id="bib1.bib7"><label>7</label><?label 1?><mixed-citation>Cohen, J.: The Earth Is Round (<inline-formula><mml:math id="M345" display="inline"><mml:mrow><mml:mi>p</mml:mi><mml:mo>&lt;</mml:mo><mml:mn>.05</mml:mn></mml:mrow></mml:math></inline-formula>), Am. Psychol., 49, 997–1003, 1994.</mixed-citation></ref>
      <ref id="bib1.bib8"><label>8</label><?label 1?><mixed-citation>
Coles, S.: An Introduction to Statistical Modeling of Extreme Values, Springer, London, 2001.</mixed-citation></ref>
      <ref id="bib1.bib9"><label>9</label><?label 1?><mixed-citation>Cooley, D.: Return Periods and Return Levels Under Climate Change, in:
Extremes in a Changing Climate, edited by: AghaKouchak, A., Easterling, D.,
Hsu, K., Schubert, S., and Sorooshian, S., Springer, Dordrecht, 97–113,
<ext-link xlink:href="https://doi.org/10.1007/978-94-007-4479-0_4" ext-link-type="DOI">10.1007/978-94-007-4479-0_4</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bib10"><label>10</label><?label 1?><mixed-citation>Du, T., Xiong, L., Xu, C. Y., Gippel, C. J., Guo, S., and Liu, P.: Return
period and risk analysis of nonstationary low-flow series under climate change, J. Hydrol., 527, 234–250, <ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2015.04.041" ext-link-type="DOI">10.1016/j.jhydrol.2015.04.041</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bib11"><label>11</label><?label 1?><mixed-citation>Eagleson, P. S.: Dynamics of flood frequency, Water Resour. Res., 8, 878–898,
<ext-link xlink:href="https://doi.org/10.1029/WR008i004p00878" ext-link-type="DOI">10.1029/WR008i004p00878</ext-link>, 1972.</mixed-citation></ref>
      <ref id="bib1.bib12"><label>12</label><?label 1?><mixed-citation>
Fiorentino, M., Gabriele, S., Rossi, F., and Versace, P.: Hierarchical approach for regional flood frequency analysis, in: Regional Flood Frequency
Analysis, edited by: Singh, V. P., D. Reidel, Norwell, Massachusetts, 35-49, 1987.</mixed-citation></ref>
      <ref id="bib1.bib13"><label>13</label><?label 1?><mixed-citation>
Gilleland, E. and Katz, R. W.: extRemes 2.0: An Extreme Value Analysis Package in R, J. Stat. Soft., 72, 1–39, 2016.</mixed-citation></ref>
      <ref id="bib1.bib14"><label>14</label><?label 1?><mixed-citation>Gioia, A., Iacobellis, V., Manfreda, S., and Fiorentino, M.: Runoff thresholds in derived flood frequency distributions, Hydrol. Earth Syst. Sci., 12, 1295–1307, <ext-link xlink:href="https://doi.org/10.5194/hess-12-1295-2008" ext-link-type="DOI">10.5194/hess-12-1295-2008</ext-link>, 2008.</mixed-citation></ref>
      <ref id="bib1.bib15"><label>15</label><?label 1?><mixed-citation>Gocic, M. and Trajkovic, S.: Analysis of changes in meteorological variables
using Mann-Kendall and Sen's slope estimator statistical tests in Serbia,
Global Planet. Change, 100, 172–182, <ext-link xlink:href="https://doi.org/10.1016/j.gloplacha.2012.10.014" ext-link-type="DOI">10.1016/j.gloplacha.2012.10.014</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bib16"><label>16</label><?label 1?><mixed-citation>Iacobellis, V., Gioia, A., Manfreda, S., and Fiorentino, M.: Flood quantiles estimation based on theoretically derived distributions: regional analysis in Southern Italy, Nat. Hazards Earth Syst. Sci., 11, 673–695, <ext-link xlink:href="https://doi.org/10.5194/nhess-11-673-2011" ext-link-type="DOI">10.5194/nhess-11-673-2011</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bib17"><label>17</label><?label 1?><mixed-citation>
Jenkinson, A. F.: The frequency distribution of the annual maximum (or minimum) values of meteorological elements, Q. J. Roy. Meteorol. Soc., 81,
158–171, 1955.</mixed-citation></ref>
      <ref id="bib1.bib18"><label>18</label><?label 1?><mixed-citation>
Kendall, M. G.: Rank Correlation Methods, 4th Edn., Charles Griffin, London,
UK, 1975.</mixed-citation></ref>
      <ref id="bib1.bib19"><label>19</label><?label 1?><mixed-citation>Khintchine, A.: Korrelationstheorie der stationären stochastischen
Prozesse, Math. Ann., 109, 604–615, <ext-link xlink:href="https://doi.org/10.1007/BF01449156" ext-link-type="DOI">10.1007/BF01449156</ext-link>, 1934.</mixed-citation></ref>
      <ref id="bib1.bib20"><label>20</label><?label 1?><mixed-citation>Kiely, G.: Climate change in Ireland from precipitation and streamflow
observations, Adv. Water Resour., 23, 141–151, <ext-link xlink:href="https://doi.org/10.1016/S0309-1708(99)00018-4" ext-link-type="DOI">10.1016/S0309-1708(99)00018-4</ext-link>, 1999.</mixed-citation></ref>
      <ref id="bib1.bib21"><label>21</label><?label 1?><mixed-citation>Kochanek, K., Strupczewski, W. G., Bogdanowicz, E., Feluch, W., and Markiewicz, I.: Application of a hybrid approach in nonstationary flood frequency analysis – a Polish perspective, Nat. Hazards Earth Syst. Sci. Discuss., 1, 6001–6024, <ext-link xlink:href="https://doi.org/10.5194/nhessd-1-6001-2013" ext-link-type="DOI">10.5194/nhessd-1-6001-2013</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bib22"><label>22</label><?label 1?><mixed-citation>Kolmogorov, A. N.: Über die analytischen Methoden in der Wahrscheinlichkeitsrechnung. Mathematische Annalen, 104, 415–458 (English translation: On analytical methods in probability theory, in: Kolmogorov, A. N., 1992. Selected Works of A. N. Kolmogorov – Volume 2, Probability Theory and Mathematical Statistics, edited by: Shiryayev, A. N., Kluwer, Dordrecht, the Netherlands, 62–108, <ext-link xlink:href="https://doi.org/10.1007/BF01457949" ext-link-type="DOI">10.1007/BF01457949</ext-link>, 1931.</mixed-citation></ref>
      <ref id="bib1.bib23"><label>23</label><?label 1?><mixed-citation>Koutsoyiannis, D. and Montanari A.: Negligent killing of scientific concepts: the stationarity case, Hydrolog. Sci. J., 60, 1174–1183,
<ext-link xlink:href="https://doi.org/10.1080/02626667.2014.959959" ext-link-type="DOI">10.1080/02626667.2014.959959</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bib24"><label>24</label><?label 1?><mixed-citation>Kundzewicz, Z. W. and Robson, A. J.: Change detection in hydrological records – a review of the methodology, Hydrolog. Sci. J., 49, 7–19,
<ext-link xlink:href="https://doi.org/10.1623/hysj.49.1.7.53993" ext-link-type="DOI">10.1623/hysj.49.1.7.53993</ext-link>, 2004.</mixed-citation></ref>
      <ref id="bib1.bib25"><label>25</label><?label 1?><mixed-citation>Laio, F., Baldassarre, G. D., and Montanari, A.: Model selection techniques
for the frequency analysis of hydrological extremes, Water Resour. Res.,
45, W07416, <ext-link xlink:href="https://doi.org/10.1029/2007wr006666" ext-link-type="DOI">10.1029/2007wr006666</ext-link>, 2009.</mixed-citation></ref>
      <ref id="bib1.bib26"><label>26</label><?label 1?><mixed-citation>
Lehmann, E. L.: Nonparametrics, Statistical Methods Based on Ranks, Holden-Day, Oxford, England, 1975</mixed-citation></ref>
      <ref id="bib1.bib27"><label>27</label><?label 1?><mixed-citation>Lieber, R. L.: Statistical significance and statistical power in hypothesis
testing, J. Orthop. Res., 8, 304–309, <ext-link xlink:href="https://doi.org/10.1002/jor.1100080221" ext-link-type="DOI">10.1002/jor.1100080221</ext-link>, 1990.</mixed-citation></ref>
      <ref id="bib1.bib28"><label>28</label><?label 1?><mixed-citation>Madsen, H., Rasmussen, P., and Rosbjerg, D.: Comparison of annual maximum
series and partial duration series for modelling exteme hydrological events:
1. At-site modeling, Water Resour. Res., 33, 747–757, <ext-link xlink:href="https://doi.org/10.1029/96WR03848" ext-link-type="DOI">10.1029/96WR03848</ext-link>, 1997.</mixed-citation></ref>
      <ref id="bib1.bib29"><label>29</label><?label 1?><mixed-citation>Mann, H. B.: Nonparametric tests against trend, Econometrica, 13, 245–259, <ext-link xlink:href="https://doi.org/10.2307/1907187" ext-link-type="DOI">10.2307/1907187</ext-link>, 1945.</mixed-citation></ref>
      <ref id="bib1.bib30"><label>30</label><?label 1?><mixed-citation>Milly, P. C. D., Betancourt, J., Falkenmark, M., Hirsch, R. M., Kundzewicz, Z. W., Lettenmaier, D. P., Stouffer, R. J., Dettinger, M. D., and Krysanova,
V.: On Critiques of “stationarity is Dead: Whither Water Management?”, Water Resour. Res., 51, 7785–7789, <ext-link xlink:href="https://doi.org/10.1002/2015WR017408" ext-link-type="DOI">10.1002/2015WR017408</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bib31"><label>31</label><?label 1?><mixed-citation>Montanari, A. and Koutsoyiannis, D.: Modeling and mitigating natural hazards: Stationarity is immortal!, Water Resour. Res., 50, 9748–9756,
<ext-link xlink:href="https://doi.org/10.1002/2014wr016092" ext-link-type="DOI">10.1002/2014wr016092</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bib32"><label>32</label><?label 1?><mixed-citation>
Montanari, A., Young, G., Savenije, H. H. G., Hughes, D., Wagener, T., Ren, L. L., Koutsoyiannis, D., Cudennec, C., Toth, E., Grimaldi, S., Blöschl, G., Sivapalan, M., Beven, K., Gupta, H., Hipsey, M., Schaefli, B., Arheimer, B., Boegh, E., Schymanski, S. J., Di Baldassarre, G., Yu, B., Hubert, P.,
Huang, Y., Schumann, A., Post, D., Srinivasan, V., Harman, C., Thompson, S.,
Rogger, M., Viglione, A., McMillan, H., Characklis, G., Pang, Z., and
Belyaev, V.: “Panta Rhei—Everything Flows”: Change in hydrology and
society – The IAHS Scientific Decade 2013–2022, Hydrolog. Sci. J., 58, 1256–1275, 2013.</mixed-citation></ref>
      <ref id="bib1.bib33"><label>33</label><?label 1?><mixed-citation>
Muraleedharan, G., Guedes Soares, C., and Lucas, C.: Characteristic and moment generating functions of generalised extreme value distribution (GEV),
in: Sea Level Rise, Coastal Engineering, Shorelines and Tides, Nova Science, New York, 2010.</mixed-citation></ref>
      <ref id="bib1.bib34"><label>34</label><?label 1?><mixed-citation>Olsen, J. R., Lambert, J. H., and Haimes, Y. Y.: Risk of extreme events under nonstationary conditions, Risk Anal., 18, 497–510,
<ext-link xlink:href="https://doi.org/10.1111/j.1539-6924.1998.tb00364.x" ext-link-type="DOI">10.1111/j.1539-6924.1998.tb00364.x</ext-link>, 1998.</mixed-citation></ref>
      <ref id="bib1.bib35"><label>35</label><?label 1?><mixed-citation>Parey, S., Malek, F., Laurent, C., and Dacunha-Castelle, D.: Trends and climate evolution: statistical approach for very high temperatures in
France, Climatic Change, 81, 331–352, <ext-link xlink:href="https://doi.org/10.1007/s10584-006-9116-4" ext-link-type="DOI">10.1007/s10584-006-9116-4</ext-link>, 2007.</mixed-citation></ref>
      <ref id="bib1.bib36"><label>36</label><?label 1?><mixed-citation>Parey, S., Hoang, T. T. H., and Dacunha-Castelle, D.: Different ways to compute temperature return levels in the climate change context, Environmetrics, 21, 698–718, <ext-link xlink:href="https://doi.org/10.1002/env.1060" ext-link-type="DOI">10.1002/env.1060</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bib37"><label>37</label><?label 1?><mixed-citation>Pettitt, A. N.: A non-parametric approach to the change-point problem, Appl.
Stat., 28, 126–135, <ext-link xlink:href="https://doi.org/10.2307/2346729" ext-link-type="DOI">10.2307/2346729</ext-link>, 1979.</mixed-citation></ref>
      <?pagebreak page488?><ref id="bib1.bib38"><label>38</label><?label 1?><mixed-citation>Read, L. K. and Vogel, R. M.: Reliability, return periods, and risk under
nonstationarity, Water Resour. Res., 51, 6381–6398,
<ext-link xlink:href="https://doi.org/10.1002/2015WR017089" ext-link-type="DOI">10.1002/2015WR017089</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bib39"><label>39</label><?label 1?><mixed-citation>Rosbjerg, D., Blöschl, G., Burn, D., Castellarin, A., Croke, B., Di Baldassarre, G., Iacobellis, V., Kjeldsen, T. R., Kuczera, G., Merz, R.,
Montanari, A., Morris, D., Ouarda, T. B. M. J., Ren, L., Rogger, M.,  Salinas, J. L., Toth, E., and Viglione, A.: Prediction of floods in ungauged basins, in: Runoff Prediction in Ungauged Basins: Synthesis across Processes, Places and Scales, edited by:  Blöschl, G., Sivapalan, M., Wagener, T., Viglione, A., and Savenije, H., Cambridge University Press, Cambridge, 189–226, <ext-link xlink:href="https://doi.org/10.1017/CBO9781139235761.012" ext-link-type="DOI">10.1017/CBO9781139235761.012</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bib40"><label>40</label><?label 1?><mixed-citation>Rossi, F., Fiorentino, M., and Versace, P.: Two-Component Extreme Value
Distribution for Flood Frequency Analysis, Water Resour. Res., 20, 847–856,
<ext-link xlink:href="https://doi.org/10.1029/WR020i007p00847" ext-link-type="DOI">10.1029/WR020i007p00847</ext-link>, 1984.</mixed-citation></ref>
      <ref id="bib1.bib41"><label>41</label><?label 1?><mixed-citation>Salas, J. D. and Obeysekera, J.: Revisiting the concepts of return period and risk for nonstationary hydrologic extreme events, J. Hydrol. Eng., 19, 554–568, <ext-link xlink:href="https://doi.org/10.1061/(ASCE)HE.1943-5584.0000820" ext-link-type="DOI">10.1061/(ASCE)HE.1943-5584.0000820</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bib42"><label>42</label><?label 1?><mixed-citation>
Schwarz, G.: Estimating the dimension of a model, Ann. Stat., 6, 461–464,
1978.</mixed-citation></ref>
      <ref id="bib1.bib43"><label>43</label><?label 1?><mixed-citation>Sen, P. K.: Estimates of the regression coefficient based on Kendall's tau,
J. Am. Stat. Assoc., 63, 1379–1389, <ext-link xlink:href="https://doi.org/10.2307/2285891" ext-link-type="DOI">10.2307/2285891</ext-link>, 1968.</mixed-citation></ref>
      <ref id="bib1.bib44"><label>44</label><?label 1?><mixed-citation>Serinaldi, F. and Kilsby, C. G.: Stationarity is undead: Uncertainty dominates the distribution of extremes, Adv. Water Resour., 77, 17–36,
<ext-link xlink:href="https://doi.org/10.1016/j.advwatres.2014.12.013" ext-link-type="DOI">10.1016/j.advwatres.2014.12.013</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bib45"><label>45</label><?label 1?><mixed-citation>Serinaldi, F., Kilsby, C. G., and Lombardo, F.: Untenable nonstationarity:
An assessment of the fitness for purpose of trend tests in hydrology, Adv.
Water Resour., 111, 132–155, <ext-link xlink:href="https://doi.org/10.1016/J.ADVWATRES.2017.10.015" ext-link-type="DOI">10.1016/J.ADVWATRES.2017.10.015</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bib46"><label>46</label><?label 1?><mixed-citation>
Sivapalan, M., Prediction in Ungauged Basins: A Grand Challenge for Theoretical Hydrology, Hydrol. Process., 17, 3163–3170, 2003.</mixed-citation></ref>
      <ref id="bib1.bib47"><label>47</label><?label 1?><mixed-citation>Smadi, M. M. and Zghoul A.: A sudden change in rainfall characteristics in
Amman, Jordan during the mid 1950s, Am. J. Environ. Sci., 2, 84–91,
<ext-link xlink:href="https://doi.org/10.3844/ajessp.2006.84.91" ext-link-type="DOI">10.3844/ajessp.2006.84.91</ext-link>, 2006.</mixed-citation></ref>
      <ref id="bib1.bib48"><label>48</label><?label 1?><mixed-citation>Strupczewski, W. G., Kochanek, K., Bogdanowicz, E., Markiewicz, I., and Feluch, W.: Comparison of two nonstationary flood frequency analysis methods
within the context of the variable regime in the representative polish rivers, Acta Geophys., 64, 206–236, <ext-link xlink:href="https://doi.org/10.1515/acgeo-2015-0070" ext-link-type="DOI">10.1515/acgeo-2015-0070</ext-link>, 2016.
</mixed-citation></ref><?xmltex \hack{\newpage}?>
      <ref id="bib1.bib49"><label>49</label><?label 1?><mixed-citation>Sugiura, N.: Further analysis of the data by Akaike's information criterion
and the finite corrections, Commun. Stat. Theor. Meth., A7, 13–26,
<ext-link xlink:href="https://doi.org/10.1080/03610927808827599" ext-link-type="DOI">10.1080/03610927808827599</ext-link>, 1978.</mixed-citation></ref>
      <ref id="bib1.bib50"><label>50</label><?label 1?><mixed-citation>Todorovic, P. and Zelenhasic, E.: A Stochastic Model for Flood Analysis, Water Resour. Res., 6, 1641–1648, <ext-link xlink:href="https://doi.org/10.1029/WR006i006p01641" ext-link-type="DOI">10.1029/WR006i006p01641</ext-link>, 1970.</mixed-citation></ref>
      <ref id="bib1.bib51"><label>51</label><?label 1?><mixed-citation>Tramblay, Y., Neppel, L., Carreau, J., and Najib, K.: Non-stationary frequency analysis of heavy rainfall events in southern France, Hydrolog. Sci. J., 58, 280–194, <ext-link xlink:href="https://doi.org/10.1080/02626667.2012.754988" ext-link-type="DOI">10.1080/02626667.2012.754988</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bib52"><label>52</label><?label 1?><mixed-citation>Vogel, R. M., Rosner, A., and Kirshen, P. H.: Brief Communication: Likelihood of societal preparedness for global change: trend detection, Nat. Hazards Earth Syst. Sci., 13, 1773–1778, <ext-link xlink:href="https://doi.org/10.5194/nhess-13-1773-2013" ext-link-type="DOI">10.5194/nhess-13-1773-2013</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bib53"><label>53</label><?label 1?><mixed-citation>
Wang, W., Van Gelder, P. H., and Vrijling, J. K.: Trend and stationarity
analysis for streamflow processes of rivers in Western Europe in the 20th Century, in: IWA International Conference on Water Economics, Statistics, and Finance, 8–10 July 2005, Rethymno, Greece, 2005.</mixed-citation></ref>
      <ref id="bib1.bib54"><label>54</label><?label 1?><mixed-citation>Yilmaz, A. G. and Perera, B. J. C.: Extreme rainfall nonstationarity
investigation and intensity–frequency–duration relationship, J. Hydrol. Eng., 19, 1160–1172, <ext-link xlink:href="https://doi.org/10.1061/(ASCE)HE.1943-5584.0000878" ext-link-type="DOI">10.1061/(ASCE)HE.1943-5584.0000878</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bib55"><label>55</label><?label 1?><mixed-citation>Yilmaz, A. G., Hossain, I., and Perera, B. J. C.: Effect of climate change
and variability on extreme rainfall intensity–frequency–duration relationships: a case study of Melbourne, Hydrol. Earth Syst. Sci., 18,
4065–4076, <ext-link xlink:href="https://doi.org/10.5194/hess-18-4065-2014" ext-link-type="DOI">10.5194/hess-18-4065-2014</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bib56"><label>56</label><?label 1?><mixed-citation>Yue, S., Pilon, P., and Cavadias, G.: Power of the Mann–Kendall and
Spearman's rho tests for detecting monotonic trends in hydrological series,
J. Hydrol., 259, 254–271, <ext-link xlink:href="https://doi.org/10.1016/S0022-1694(01)00594-7" ext-link-type="DOI">10.1016/S0022-1694(01)00594-7</ext-link>, 2002a.</mixed-citation></ref>
      <ref id="bib1.bib57"><label>57</label><?label 1?><mixed-citation>Yue, S., Pilon, P., Phinney, B., and Cavadias, G.: The influence of autocorrelation on the ability to detect trend in hydrological series, Hydrol. Process., 16, 1807–1829, <ext-link xlink:href="https://doi.org/10.1002/hyp.1095" ext-link-type="DOI">10.1002/hyp.1095</ext-link>, 2002b.</mixed-citation></ref>

  </ref-list></back>
    <!--<article-title-html>Numerical investigation on the power of parametric and nonparametric tests for trend detection in annual maximum series</article-title-html>
<abstract-html><p>The need to fit time series characterized by the presence of a trend or change points has generated increased interest in the investigation of nonstationary probability distributions in recent years. Considering that the available hydrological time series can be recognized as the observable part of a stochastic process with a definite probability distribution, two main topics can be tackled in this context: the first is related to the definition of an objective criterion for choosing whether the stationary hypothesis can be adopted, whereas the second regards the effects of nonstationarity on the estimation of distribution parameters and quantiles for an assigned return period and flood risk evaluation. Although the time series trend or change points are usually detected using nonparametric tests available in the literature (e.g., Mann–Kendall or CUSUM test), the correct selection of the stationary or nonstationary probability distribution is still required for design purposes. In this light, the focus is shifted toward model selection criteria; this implies the use of parametric methods, including all of the issues related to parameter estimation. The aim of this study is to compare the performance of parametric and nonparametric methods for trend detection, analyzing their power and focusing on the use of traditional model selection tools (e.g., the Akaike information criterion and the likelihood ratio test) within this context. The power and efficiency of parameter estimation, including the trend coefficient, were investigated via Monte Carlo simulations using the generalized extreme value distribution as the parent with selected parameter sets.</p></abstract-html>
<ref-html id="bib1.bib1"><label>1</label><mixed-citation>
Akaike, H.: A new look at the statistical model identification, IEEE T. Automat. Control, 19, 716–723, <a href="https://doi.org/10.1109/TAC.1974.1100705" target="_blank">https://doi.org/10.1109/TAC.1974.1100705</a>, 1974.
</mixed-citation></ref-html>
<ref-html id="bib1.bib2"><label>2</label><mixed-citation>
Beven, K.: Facets of uncertainty: Epistemic uncertainty, non-stationarity,
likelihood, hypothesis testing, and communication, Hydrolog. Sci. J., 61, 1652–1665, <a href="https://doi.org/10.1080/02626667.2015.1031761" target="_blank">https://doi.org/10.1080/02626667.2015.1031761</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib3"><label>3</label><mixed-citation>
Burnham, K. P. and Anderson, D. R.: Model selection and multimodel inference, Springer, New York, 2004.
</mixed-citation></ref-html>
<ref-html id="bib1.bib4"><label>4</label><mixed-citation>
Cheng, L., AghaKouchak, A., Gilleland, E., and Katz, R. W.: Non-stationary
extreme value analysis in a changing climate, Climatic Change, 127, 353–369, <a href="https://doi.org/10.1007/s10584-014-1254-5" target="_blank">https://doi.org/10.1007/s10584-014-1254-5</a>, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib5"><label>5</label><mixed-citation>
Chow, V. T. (Ed.): Statistical and probability analysis of hydrologic data, in: Handbook of applied hydrology, McGraw-Hill, New York, 8.1–8.97, 1964.
</mixed-citation></ref-html>
<ref-html id="bib1.bib6"><label>6</label><mixed-citation>
Cohen, J.: A power primer, Psycholog. Bull., 112, 155–159, 1992.
</mixed-citation></ref-html>
<ref-html id="bib1.bib7"><label>7</label><mixed-citation>
Cohen, J.: The Earth Is Round (<i>p</i> &lt; . 05), Am. Psychol., 49, 997–1003, 1994.
</mixed-citation></ref-html>
<ref-html id="bib1.bib8"><label>8</label><mixed-citation>
Coles, S.: An Introduction to Statistical Modeling of Extreme Values, Springer, London, 2001.
</mixed-citation></ref-html>
<ref-html id="bib1.bib9"><label>9</label><mixed-citation>
Cooley, D.: Return Periods and Return Levels Under Climate Change, in:
Extremes in a Changing Climate, edited by: AghaKouchak, A., Easterling, D.,
Hsu, K., Schubert, S., and Sorooshian, S., Springer, Dordrecht, 97–113,
<a href="https://doi.org/10.1007/978-94-007-4479-0_4" target="_blank">https://doi.org/10.1007/978-94-007-4479-0_4</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib10"><label>10</label><mixed-citation>
Du, T., Xiong, L., Xu, C. Y., Gippel, C. J., Guo, S., and Liu, P.: Return
period and risk analysis of nonstationary low-flow series under climate change, J. Hydrol., 527, 234–250, <a href="https://doi.org/10.1016/j.jhydrol.2015.04.041" target="_blank">https://doi.org/10.1016/j.jhydrol.2015.04.041</a>, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib11"><label>11</label><mixed-citation>
Eagleson, P. S.: Dynamics of flood frequency, Water Resour. Res., 8, 878–898,
<a href="https://doi.org/10.1029/WR008i004p00878" target="_blank">https://doi.org/10.1029/WR008i004p00878</a>, 1972.
</mixed-citation></ref-html>
<ref-html id="bib1.bib12"><label>12</label><mixed-citation>
Fiorentino, M., Gabriele, S., Rossi, F., and Versace, P.: Hierarchical approach for regional flood frequency analysis, in: Regional Flood Frequency
Analysis, edited by: Singh, V. P., D. Reidel, Norwell, Massachusetts, 35-49, 1987.
</mixed-citation></ref-html>
<ref-html id="bib1.bib13"><label>13</label><mixed-citation>
Gilleland, E. and Katz, R. W.: extRemes 2.0: An Extreme Value Analysis Package in R, J. Stat. Soft., 72, 1–39, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib14"><label>14</label><mixed-citation>
Gioia, A., Iacobellis, V., Manfreda, S., and Fiorentino, M.: Runoff thresholds in derived flood frequency distributions, Hydrol. Earth Syst. Sci., 12, 1295–1307, <a href="https://doi.org/10.5194/hess-12-1295-2008" target="_blank">https://doi.org/10.5194/hess-12-1295-2008</a>, 2008.
</mixed-citation></ref-html>
<ref-html id="bib1.bib15"><label>15</label><mixed-citation>
Gocic, M. and Trajkovic, S.: Analysis of changes in meteorological variables
using Mann-Kendall and Sen's slope estimator statistical tests in Serbia,
Global Planet. Change, 100, 172–182, <a href="https://doi.org/10.1016/j.gloplacha.2012.10.014" target="_blank">https://doi.org/10.1016/j.gloplacha.2012.10.014</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib16"><label>16</label><mixed-citation>
Iacobellis, V., Gioia, A., Manfreda, S., and Fiorentino, M.: Flood quantiles estimation based on theoretically derived distributions: regional analysis in Southern Italy, Nat. Hazards Earth Syst. Sci., 11, 673–695, <a href="https://doi.org/10.5194/nhess-11-673-2011" target="_blank">https://doi.org/10.5194/nhess-11-673-2011</a>, 2011.
</mixed-citation></ref-html>
<ref-html id="bib1.bib17"><label>17</label><mixed-citation>
Jenkinson, A. F.: The frequency distribution of the annual maximum (or minimum) values of meteorological elements, Q. J. Roy. Meteorol. Soc., 81,
158–171, 1955.
</mixed-citation></ref-html>
<ref-html id="bib1.bib18"><label>18</label><mixed-citation>
Kendall, M. G.: Rank Correlation Methods, 4th Edn., Charles Griffin, London,
UK, 1975.
</mixed-citation></ref-html>
<ref-html id="bib1.bib19"><label>19</label><mixed-citation>
Khintchine, A.: Korrelationstheorie der stationären stochastischen
Prozesse, Math. Ann., 109, 604–615, <a href="https://doi.org/10.1007/BF01449156" target="_blank">https://doi.org/10.1007/BF01449156</a>, 1934.
</mixed-citation></ref-html>
<ref-html id="bib1.bib20"><label>20</label><mixed-citation>
Kiely, G.: Climate change in Ireland from precipitation and streamflow
observations, Adv. Water Resour., 23, 141–151, <a href="https://doi.org/10.1016/S0309-1708(99)00018-4" target="_blank">https://doi.org/10.1016/S0309-1708(99)00018-4</a>, 1999.
</mixed-citation></ref-html>
<ref-html id="bib1.bib21"><label>21</label><mixed-citation>
Kochanek, K., Strupczewski, W. G., Bogdanowicz, E., Feluch, W., and Markiewicz, I.: Application of a hybrid approach in nonstationary flood frequency analysis – a Polish perspective, Nat. Hazards Earth Syst. Sci. Discuss., 1, 6001–6024, <a href="https://doi.org/10.5194/nhessd-1-6001-2013" target="_blank">https://doi.org/10.5194/nhessd-1-6001-2013</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib22"><label>22</label><mixed-citation>
Kolmogorov, A. N.: Über die analytischen Methoden in der Wahrscheinlichkeitsrechnung. Mathematische Annalen, 104, 415–458 (English translation: On analytical methods in probability theory, in: Kolmogorov, A. N., 1992. Selected Works of A. N. Kolmogorov – Volume 2, Probability Theory and Mathematical Statistics, edited by: Shiryayev, A. N., Kluwer, Dordrecht, the Netherlands, 62–108, <a href="https://doi.org/10.1007/BF01457949" target="_blank">https://doi.org/10.1007/BF01457949</a>, 1931.
</mixed-citation></ref-html>
<ref-html id="bib1.bib23"><label>23</label><mixed-citation>
Koutsoyiannis, D. and Montanari A.: Negligent killing of scientific concepts: the stationarity case, Hydrolog. Sci. J., 60, 1174–1183,
<a href="https://doi.org/10.1080/02626667.2014.959959" target="_blank">https://doi.org/10.1080/02626667.2014.959959</a>, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib24"><label>24</label><mixed-citation>
Kundzewicz, Z. W. and Robson, A. J.: Change detection in hydrological records – a review of the methodology, Hydrolog. Sci. J., 49, 7–19,
<a href="https://doi.org/10.1623/hysj.49.1.7.53993" target="_blank">https://doi.org/10.1623/hysj.49.1.7.53993</a>, 2004.
</mixed-citation></ref-html>
<ref-html id="bib1.bib25"><label>25</label><mixed-citation>
Laio, F., Baldassarre, G. D., and Montanari, A.: Model selection techniques
for the frequency analysis of hydrological extremes, Water Resour. Res.,
45, W07416, <a href="https://doi.org/10.1029/2007wr006666" target="_blank">https://doi.org/10.1029/2007wr006666</a>, 2009.
</mixed-citation></ref-html>
<ref-html id="bib1.bib26"><label>26</label><mixed-citation>
Lehmann, E. L.: Nonparametrics, Statistical Methods Based on Ranks, Holden-Day, Oxford, England, 1975
</mixed-citation></ref-html>
<ref-html id="bib1.bib27"><label>27</label><mixed-citation>
Lieber, R. L.: Statistical significance and statistical power in hypothesis
testing, J. Orthop. Res., 8, 304–309, <a href="https://doi.org/10.1002/jor.1100080221" target="_blank">https://doi.org/10.1002/jor.1100080221</a>, 1990.
</mixed-citation></ref-html>
<ref-html id="bib1.bib28"><label>28</label><mixed-citation>
Madsen, H., Rasmussen, P., and Rosbjerg, D.: Comparison of annual maximum
series and partial duration series for modelling exteme hydrological events:
1. At-site modeling, Water Resour. Res., 33, 747–757, <a href="https://doi.org/10.1029/96WR03848" target="_blank">https://doi.org/10.1029/96WR03848</a>, 1997.
</mixed-citation></ref-html>
<ref-html id="bib1.bib29"><label>29</label><mixed-citation>
Mann, H. B.: Nonparametric tests against trend, Econometrica, 13, 245–259, <a href="https://doi.org/10.2307/1907187" target="_blank">https://doi.org/10.2307/1907187</a>, 1945.
</mixed-citation></ref-html>
<ref-html id="bib1.bib30"><label>30</label><mixed-citation>
Milly, P. C. D., Betancourt, J., Falkenmark, M., Hirsch, R. M., Kundzewicz, Z. W., Lettenmaier, D. P., Stouffer, R. J., Dettinger, M. D., and Krysanova,
V.: On Critiques of “stationarity is Dead: Whither Water Management?”, Water Resour. Res., 51, 7785–7789, <a href="https://doi.org/10.1002/2015WR017408" target="_blank">https://doi.org/10.1002/2015WR017408</a>, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib31"><label>31</label><mixed-citation>
Montanari, A. and Koutsoyiannis, D.: Modeling and mitigating natural hazards: Stationarity is immortal!, Water Resour. Res., 50, 9748–9756,
<a href="https://doi.org/10.1002/2014wr016092" target="_blank">https://doi.org/10.1002/2014wr016092</a>, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib32"><label>32</label><mixed-citation>
Montanari, A., Young, G., Savenije, H. H. G., Hughes, D., Wagener, T., Ren, L. L., Koutsoyiannis, D., Cudennec, C., Toth, E., Grimaldi, S., Blöschl, G., Sivapalan, M., Beven, K., Gupta, H., Hipsey, M., Schaefli, B., Arheimer, B., Boegh, E., Schymanski, S. J., Di Baldassarre, G., Yu, B., Hubert, P.,
Huang, Y., Schumann, A., Post, D., Srinivasan, V., Harman, C., Thompson, S.,
Rogger, M., Viglione, A., McMillan, H., Characklis, G., Pang, Z., and
Belyaev, V.: “Panta Rhei—Everything Flows”: Change in hydrology and
society – The IAHS Scientific Decade 2013–2022, Hydrolog. Sci. J., 58, 1256–1275, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib33"><label>33</label><mixed-citation>
Muraleedharan, G., Guedes Soares, C., and Lucas, C.: Characteristic and moment generating functions of generalised extreme value distribution (GEV),
in: Sea Level Rise, Coastal Engineering, Shorelines and Tides, Nova Science, New York, 2010.
</mixed-citation></ref-html>
<ref-html id="bib1.bib34"><label>34</label><mixed-citation>
Olsen, J. R., Lambert, J. H., and Haimes, Y. Y.: Risk of extreme events under nonstationary conditions, Risk Anal., 18, 497–510,
<a href="https://doi.org/10.1111/j.1539-6924.1998.tb00364.x" target="_blank">https://doi.org/10.1111/j.1539-6924.1998.tb00364.x</a>, 1998.
</mixed-citation></ref-html>
<ref-html id="bib1.bib35"><label>35</label><mixed-citation>
Parey, S., Malek, F., Laurent, C., and Dacunha-Castelle, D.: Trends and climate evolution: statistical approach for very high temperatures in
France, Climatic Change, 81, 331–352, <a href="https://doi.org/10.1007/s10584-006-9116-4" target="_blank">https://doi.org/10.1007/s10584-006-9116-4</a>, 2007.
</mixed-citation></ref-html>
<ref-html id="bib1.bib36"><label>36</label><mixed-citation>
Parey, S., Hoang, T. T. H., and Dacunha-Castelle, D.: Different ways to compute temperature return levels in the climate change context, Environmetrics, 21, 698–718, <a href="https://doi.org/10.1002/env.1060" target="_blank">https://doi.org/10.1002/env.1060</a>, 2010.
</mixed-citation></ref-html>
<ref-html id="bib1.bib37"><label>37</label><mixed-citation>
Pettitt, A. N.: A non-parametric approach to the change-point problem, Appl.
Stat., 28, 126–135, <a href="https://doi.org/10.2307/2346729" target="_blank">https://doi.org/10.2307/2346729</a>, 1979.
</mixed-citation></ref-html>
<ref-html id="bib1.bib38"><label>38</label><mixed-citation>
Read, L. K. and Vogel, R. M.: Reliability, return periods, and risk under
nonstationarity, Water Resour. Res., 51, 6381–6398,
<a href="https://doi.org/10.1002/2015WR017089" target="_blank">https://doi.org/10.1002/2015WR017089</a>, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib39"><label>39</label><mixed-citation>
Rosbjerg, D., Blöschl, G., Burn, D., Castellarin, A., Croke, B., Di Baldassarre, G., Iacobellis, V., Kjeldsen, T. R., Kuczera, G., Merz, R.,
Montanari, A., Morris, D., Ouarda, T. B. M. J., Ren, L., Rogger, M.,  Salinas, J. L., Toth, E., and Viglione, A.: Prediction of floods in ungauged basins, in: Runoff Prediction in Ungauged Basins: Synthesis across Processes, Places and Scales, edited by:  Blöschl, G., Sivapalan, M., Wagener, T., Viglione, A., and Savenije, H., Cambridge University Press, Cambridge, 189–226, <a href="https://doi.org/10.1017/CBO9781139235761.012" target="_blank">https://doi.org/10.1017/CBO9781139235761.012</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib40"><label>40</label><mixed-citation>
Rossi, F., Fiorentino, M., and Versace, P.: Two-Component Extreme Value
Distribution for Flood Frequency Analysis, Water Resour. Res., 20, 847–856,
<a href="https://doi.org/10.1029/WR020i007p00847" target="_blank">https://doi.org/10.1029/WR020i007p00847</a>, 1984.
</mixed-citation></ref-html>
<ref-html id="bib1.bib41"><label>41</label><mixed-citation>
Salas, J. D. and Obeysekera, J.: Revisiting the concepts of return period and risk for nonstationary hydrologic extreme events, J. Hydrol. Eng., 19, 554–568, <a href="https://doi.org/10.1061/(ASCE)HE.1943-5584.0000820" target="_blank">https://doi.org/10.1061/(ASCE)HE.1943-5584.0000820</a>, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib42"><label>42</label><mixed-citation>
Schwarz, G.: Estimating the dimension of a model, Ann. Stat., 6, 461–464,
1978.
</mixed-citation></ref-html>
<ref-html id="bib1.bib43"><label>43</label><mixed-citation>
Sen, P. K.: Estimates of the regression coefficient based on Kendall's tau,
J. Am. Stat. Assoc., 63, 1379–1389, <a href="https://doi.org/10.2307/2285891" target="_blank">https://doi.org/10.2307/2285891</a>, 1968.
</mixed-citation></ref-html>
<ref-html id="bib1.bib44"><label>44</label><mixed-citation>
Serinaldi, F. and Kilsby, C. G.: Stationarity is undead: Uncertainty dominates the distribution of extremes, Adv. Water Resour., 77, 17–36,
<a href="https://doi.org/10.1016/j.advwatres.2014.12.013" target="_blank">https://doi.org/10.1016/j.advwatres.2014.12.013</a>, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib45"><label>45</label><mixed-citation>
Serinaldi, F., Kilsby, C. G., and Lombardo, F.: Untenable nonstationarity:
An assessment of the fitness for purpose of trend tests in hydrology, Adv.
Water Resour., 111, 132–155, <a href="https://doi.org/10.1016/J.ADVWATRES.2017.10.015" target="_blank">https://doi.org/10.1016/J.ADVWATRES.2017.10.015</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib46"><label>46</label><mixed-citation>
Sivapalan, M., Prediction in Ungauged Basins: A Grand Challenge for Theoretical Hydrology, Hydrol. Process., 17, 3163–3170, 2003.
</mixed-citation></ref-html>
<ref-html id="bib1.bib47"><label>47</label><mixed-citation>
Smadi, M. M. and Zghoul A.: A sudden change in rainfall characteristics in
Amman, Jordan during the mid 1950s, Am. J. Environ. Sci., 2, 84–91,
<a href="https://doi.org/10.3844/ajessp.2006.84.91" target="_blank">https://doi.org/10.3844/ajessp.2006.84.91</a>, 2006.
</mixed-citation></ref-html>
<ref-html id="bib1.bib48"><label>48</label><mixed-citation>
Strupczewski, W. G., Kochanek, K., Bogdanowicz, E., Markiewicz, I., and Feluch, W.: Comparison of two nonstationary flood frequency analysis methods
within the context of the variable regime in the representative polish rivers, Acta Geophys., 64, 206–236, <a href="https://doi.org/10.1515/acgeo-2015-0070" target="_blank">https://doi.org/10.1515/acgeo-2015-0070</a>, 2016.

</mixed-citation></ref-html>
<ref-html id="bib1.bib49"><label>49</label><mixed-citation>
Sugiura, N.: Further analysis of the data by Akaike's information criterion
and the finite corrections, Commun. Stat. Theor. Meth., A7, 13–26,
<a href="https://doi.org/10.1080/03610927808827599" target="_blank">https://doi.org/10.1080/03610927808827599</a>, 1978.
</mixed-citation></ref-html>
<ref-html id="bib1.bib50"><label>50</label><mixed-citation>
Todorovic, P. and Zelenhasic, E.: A Stochastic Model for Flood Analysis, Water Resour. Res., 6, 1641–1648, <a href="https://doi.org/10.1029/WR006i006p01641" target="_blank">https://doi.org/10.1029/WR006i006p01641</a>, 1970.
</mixed-citation></ref-html>
<ref-html id="bib1.bib51"><label>51</label><mixed-citation>
Tramblay, Y., Neppel, L., Carreau, J., and Najib, K.: Non-stationary frequency analysis of heavy rainfall events in southern France, Hydrolog. Sci. J., 58, 280–194, <a href="https://doi.org/10.1080/02626667.2012.754988" target="_blank">https://doi.org/10.1080/02626667.2012.754988</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib52"><label>52</label><mixed-citation>
Vogel, R. M., Rosner, A., and Kirshen, P. H.: Brief Communication: Likelihood of societal preparedness for global change: trend detection, Nat. Hazards Earth Syst. Sci., 13, 1773–1778, <a href="https://doi.org/10.5194/nhess-13-1773-2013" target="_blank">https://doi.org/10.5194/nhess-13-1773-2013</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib53"><label>53</label><mixed-citation>
Wang, W., Van Gelder, P. H., and Vrijling, J. K.: Trend and stationarity
analysis for streamflow processes of rivers in Western Europe in the 20th Century, in: IWA International Conference on Water Economics, Statistics, and Finance, 8–10 July 2005, Rethymno, Greece, 2005.
</mixed-citation></ref-html>
<ref-html id="bib1.bib54"><label>54</label><mixed-citation>
Yilmaz, A. G. and Perera, B. J. C.: Extreme rainfall nonstationarity
investigation and intensity–frequency–duration relationship, J. Hydrol. Eng., 19, 1160–1172, <a href="https://doi.org/10.1061/(ASCE)HE.1943-5584.0000878" target="_blank">https://doi.org/10.1061/(ASCE)HE.1943-5584.0000878</a>, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib55"><label>55</label><mixed-citation>
Yilmaz, A. G., Hossain, I., and Perera, B. J. C.: Effect of climate change
and variability on extreme rainfall intensity–frequency–duration relationships: a case study of Melbourne, Hydrol. Earth Syst. Sci., 18,
4065–4076, <a href="https://doi.org/10.5194/hess-18-4065-2014" target="_blank">https://doi.org/10.5194/hess-18-4065-2014</a>, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib56"><label>56</label><mixed-citation>
Yue, S., Pilon, P., and Cavadias, G.: Power of the Mann–Kendall and
Spearman's rho tests for detecting monotonic trends in hydrological series,
J. Hydrol., 259, 254–271, <a href="https://doi.org/10.1016/S0022-1694(01)00594-7" target="_blank">https://doi.org/10.1016/S0022-1694(01)00594-7</a>, 2002a.
</mixed-citation></ref-html>
<ref-html id="bib1.bib57"><label>57</label><mixed-citation>
Yue, S., Pilon, P., Phinney, B., and Cavadias, G.: The influence of autocorrelation on the ability to detect trend in hydrological series, Hydrol. Process., 16, 1807–1829, <a href="https://doi.org/10.1002/hyp.1095" target="_blank">https://doi.org/10.1002/hyp.1095</a>, 2002b.
</mixed-citation></ref-html>--></article>
