<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "https://jats.nlm.nih.gov/nlm-dtd/publishing/3.0/journalpublishing3.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article" dtd-version="3.0" xml:lang="en">
<front>
<journal-meta>
<journal-id journal-id-type="publisher">HESS</journal-id>
<journal-title-group>
<journal-title>Hydrology and Earth System Sciences</journal-title>
<abbrev-journal-title abbrev-type="publisher">HESS</abbrev-journal-title>
<abbrev-journal-title abbrev-type="nlm-ta">Hydrol. Earth Syst. Sci.</abbrev-journal-title>
</journal-title-group>
<issn pub-type="epub">1607-7938</issn>
<publisher><publisher-name>Copernicus Publications</publisher-name>
<publisher-loc>GΓΆttingen, Germany</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.5194/hess-14-1909-2010</article-id>
<title-group>
<article-title>Exploiting the information content of hydrological &apos;&apos;outliers&apos;&apos; for goodness-of-fit testing</article-title>
</title-group>
<contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Laio</surname>
<given-names>F.</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
</contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Allamano</surname>
<given-names>P.</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
</contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Claps</surname>
<given-names>P.</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
</contrib>
</contrib-group><aff id="aff1">
<label>1</label>
<addr-line>DITIC, Politecnico di Torino, Corso Duca degli Abruzzi 24,  10129 Torino, Italy</addr-line>
</aff>
<pub-date pub-type="epub">
<day>12</day>
<month>10</month>
<year>2010</year>
</pub-date>
<volume>14</volume>
<issue>10</issue>
<fpage>1909</fpage>
<lpage>1917</lpage>
<permissions>
<copyright-statement>Copyright: &#x000a9; 2010 F. Laio et al.</copyright-statement>
<copyright-year>2010</copyright-year>
<license license-type="open-access">
<license-p>This work is licensed under the Creative Commons Attribution 3.0 Unported License. To view a copy of this licence, visit <ext-link ext-link-type="uri"  xlink:href="https://creativecommons.org/licenses/by/3.0/">https://creativecommons.org/licenses/by/3.0/</ext-link></license-p>
</license>
</permissions>
<self-uri xlink:href="https://hess.copernicus.org/articles/14/1909/2010/hess-14-1909-2010.html">This article is available from https://hess.copernicus.org/articles/14/1909/2010/hess-14-1909-2010.html</self-uri>
<self-uri xlink:href="https://hess.copernicus.org/articles/14/1909/2010/hess-14-1909-2010.pdf">The full text article is available as a PDF file from https://hess.copernicus.org/articles/14/1909/2010/hess-14-1909-2010.pdf</self-uri>
<abstract>
<p>Validation of probabilistic models based on goodness-of-fit tests is an
essential step for the frequency analysis of extreme events. The outcome of
standard testing techniques, however, is mainly determined by the behavior of
the hypothetical model, &lt;i&gt;F&lt;/i&gt;&lt;sub&gt;&lt;i&gt;X&lt;/i&gt;&lt;/sub&gt;&lt;i&gt;(x)&lt;/i&gt;, in the central part of the distribution,
while the behavior in the tails of the distribution, which is indeed very
relevant in hydrological applications, is relatively unimportant for the
results of the tests. The maximum-value test, originally proposed as a
technique for outlier detection, is a suitable, but seldom applied, technique
that addresses this problem. The test is specifically targeted to verify if
the maximum (or minimum) values in the sample are consistent with the
hypothesis that the distribution &lt;i&gt;F&lt;/i&gt;&lt;sub&gt;&lt;i&gt;X&lt;/i&gt;&lt;/sub&gt;&lt;i&gt;(x)&lt;/i&gt; is the real parent distribution.
The application of this test is hindered by the fact that the critical values
for the test should be numerically obtained when the parameters of &lt;i&gt;F&lt;/i&gt;&lt;sub&gt;&lt;i&gt;X&lt;/i&gt;&lt;/sub&gt;&lt;i&gt;(x)&lt;/i&gt;
are estimated on the same sample used for verification, which is the standard
situation in hydrological applications. We propose here a simple,
analytically explicit, technique to suitably account for this effect, based
on the application of censored L-moments estimators of the parameters. We
demonstrate, with an application that uses artificially generated samples,
the superiority of this modified maximum-value test with respect to the
standard version of the test. We also show that the test has comparable or
larger power with respect to other goodness-of-fit tests (e.g., chi-squared
test, Anderson-Darling test, Fung and Paul test), in particular when dealing
with small samples (sample size lower than 20β25) and when the parent
distribution is similar to the distribution being tested.</p>
</abstract>
<counts><page-count count="9"/></counts>
</article-meta>
</front>
<body/>
<back>
<ref-list>
<title>References</title>
<ref id="ref1">
<label>1</label><mixed-citation publication-type="other" xlink:type="simple">Ahmad, M., Sinclair, C., and Spurr, B.: Assessment of flood frequency models using empirical distribution function statistics, Water Resour. Res., 24, 1323-1328, 1988.</mixed-citation>
</ref>
<ref id="ref2">
<label>2</label><mixed-citation publication-type="other" xlink:type="simple">Barnett, V. and Lewis, T.: Outliers in statistical data, Springer Series in Statistics, John Wiley and Sons, 1994.</mixed-citation>
</ref>
<ref id="ref3">
<label>3</label><mixed-citation publication-type="other" xlink:type="simple">Bayliss, A. and Reed, D.: The use of historical data in flood frequency estimation, Tech. rep., Centre for Ecology and Hydrology, 2001.</mixed-citation>
</ref>
<ref id="ref4">
<label>4</label><mixed-citation publication-type="other" xlink:type="simple">Bryson, M.: Heavy-tailed distributions: properties and tests, Technometrics, 16, 61β68, 1974.</mixed-citation>
</ref>
<ref id="ref5">
<label>5</label><mixed-citation publication-type="other" xlink:type="simple">Chowdhury, J., Stedinger, J., and Lu, L.: Goodness-of-fit tests for regional generalized extreme value flood distributions, Water Resour. Res., 27, 1765β1776, 1991.</mixed-citation>
</ref>
<ref id="ref6">
<label>6</label><mixed-citation publication-type="other" xlink:type="simple">D&apos;Agostino, R. and Stephens, M.: Goodness-of-Fit Techniques, Marcel Dekker Inc, New York, 1986.</mixed-citation>
</ref>
<ref id="ref7">
<label>7</label><mixed-citation publication-type="other" xlink:type="simple">Di&amp;nbsp;Baldassarre, G., Laio, F., and Montanari, A.: Design flood estimation using model selection criteria, Phys. Chem. Earth, 34(10β12), 606β611, &lt;a href=&quot;http://dx.doi.org/10.1016/j.pce.2008.10.066&quot;&gt;https://doi.org/10.1016/j.pce.2008.10.066&lt;/a&gt;, 2008.</mixed-citation>
</ref>
<ref id="ref8">
<label>8</label><mixed-citation publication-type="other" xlink:type="simple">Falk, M. and Reiss, R.: Independence of Order Statistics, Annals of Probability, 16, 854β862, 1988.</mixed-citation>
</ref>
<ref id="ref9">
<label>9</label><mixed-citation publication-type="other" xlink:type="simple">Fill, H. and Stedinger, J.: L-moment and probability plot correlation coefficient goodness-of-fit tests for the Gumbel distribution and impact of autocorrelation, Water Resour. Res., 31, 225β229, 1995.</mixed-citation>
</ref>
<ref id="ref10">
<label>10</label><mixed-citation publication-type="other" xlink:type="simple">Fiorentino, M., Versace, P., and Rossi, F.: Regional flood frequency estimation using the two-component extreme value distribution, Hydrolog. Sci. J., 30, 51β63, 1985.</mixed-citation>
</ref>
<ref id="ref11">
<label>11</label><mixed-citation publication-type="other" xlink:type="simple">Frances, F.: Using the TCEV distribution function with systematic and non-systematic data in a regional flood frequency analysis, Stoch. Hydrol. Hydraul., 12, 267β283, 1998.</mixed-citation>
</ref>
<ref id="ref12">
<label>12</label><mixed-citation publication-type="other" xlink:type="simple">Fung, K. and Paul, S.: Comparison of outlier detection procedures in Weibull or Extreme-Value distribution, Commun. Statist. Simula. Computa, 14, 895β917, 1985.</mixed-citation>
</ref>
<ref id="ref13">
<label>13</label><mixed-citation publication-type="other" xlink:type="simple">Grubbs, F.: Procedures for detecting outlying observations in samples, Technometrics, 11, 1β21, 1969.</mixed-citation>
</ref>
<ref id="ref14">
<label>14</label><mixed-citation publication-type="other" xlink:type="simple">Gumbel, E.: Discussion of the Papers of Messrs. Anscombe and Daniel, Technometrics, 2, 165β166, 1960.</mixed-citation>
</ref>
<ref id="ref15">
<label>15</label><mixed-citation publication-type="other" xlink:type="simple">Hershfield, D.: Estimating the probable maximum precipitation, J. Hydraul. Div. ASCE, 87(HY5), 99β106, 1961.</mixed-citation>
</ref>
<ref id="ref16">
<label>16</label><mixed-citation publication-type="other" xlink:type="simple">Hershfield, D.: Method for estimating probable maximum precipitation, J. Am. Water Works Assoc., 57, 965β972, 1965.</mixed-citation>
</ref>
<ref id="ref17">
<label>17</label><mixed-citation publication-type="other" xlink:type="simple">Hosking, J. and Wallis, J.: Regional Frequency Analysis: An Approach Based on {L}-Moments, Cambridge University Press, 1997.</mixed-citation>
</ref>
<ref id="ref18">
<label>18</label><mixed-citation publication-type="other" xlink:type="simple">Hosking, J., Wallis, J., and Wood, E.: Estimation of the Generalized Extreme Value distribution by the method of the probability weighted moments, Technometrics, 27, 251β261, 1985.</mixed-citation>
</ref>
<ref id="ref19">
<label>19</label><mixed-citation publication-type="other" xlink:type="simple">Kendall, M. and Stuart, A.: The Advanced Theory of Statistics, Charles Griffin and Company Limited, 1979.</mixed-citation>
</ref>
<ref id="ref20">
<label>20</label><mixed-citation publication-type="other" xlink:type="simple">Kottegoda, N. and Rosso, R.: Statistics, probability, and reliability for civil and environmental engineers, McGraw-Hill, International Edition, 1998.</mixed-citation>
</ref>
<ref id="ref21">
<label>21</label><mixed-citation publication-type="other" xlink:type="simple">Koutsoyiannis, D.: Probable maximum precipitation, \urlprefix&lt;a href=&quot;http://www.itia.ntua.gr/getfile/116/5/documents/2000HydrometP% MP.pdf&quot;&gt;http://www.itia.ntua.gr/getfile/116/5/documents/2000HydrometP% MP.pdf&lt;/a&gt;, 2000.</mixed-citation>
</ref>
<ref id="ref22">
<label>22</label><mixed-citation publication-type="other" xlink:type="simple">Laio, F.: Cramer-von Mises and Anderson-Darling goodness of fit tests for extreme value distributions with unknown parameters, Water Resour. Res., 40, W09308, &lt;a href=&quot;http://dx.doi.org/10.1029/2004WR003204&quot;&gt;https://doi.org/10.1029/2004WR003204&lt;/a&gt;, 2004.</mixed-citation>
</ref>
<ref id="ref23">
<label>23</label><mixed-citation publication-type="other" xlink:type="simple">Laio, F., Di&amp;nbsp;Baldassarre, G., and Montanari, A.: Model selection techniques for the frequency analysis of hydrological extremes, Water Resour. Res., 45, W07416, &lt;a href=&quot;http://dx.doi.org/10.1029/2007WR006666&quot;&gt;https://doi.org/10.1029/2007WR006666&lt;/a&gt;, 2009.</mixed-citation>
</ref>
<ref id="ref24">
<label>24</label><mixed-citation publication-type="other" xlink:type="simple">Laio, F., Allamano, P., and Claps, P.: Interactive comment on &quot;Exploiting the information content of hydrological &quot;outliers&quot; for goodness-of-fit testing&quot; by F. Laio et al., Hydrol. Earth Syst. Sci. Discuss., 7, C2227βC2230, 2010.</mixed-citation>
</ref>
<ref id="ref25">
<label>25</label><mixed-citation publication-type="other" xlink:type="simple">Mitosek, H., Strupczewski, W., and Singh, V.: Three procedures for selection of annual flood peak distribution, J. Hydrol., 323(1β4), 57β73, 2006.</mixed-citation>
</ref>
<ref id="ref26">
<label>26</label><mixed-citation publication-type="other" xlink:type="simple">Moore, D.: Goodness-of-Fit Techniques, chap. Tests of the chi-squared type, Marcel Dekker, New York, 1986.</mixed-citation>
</ref>
<ref id="ref27">
<label>27</label><mixed-citation publication-type="other" xlink:type="simple">Rossi, F., Fiorentino, M., and Versace, P.: Two-component extreme value distribution for flood frequency analysis, Water Resour. Res., 20, 847β856, 1984.</mixed-citation>
</ref>
<ref id="ref28">
<label>28</label><mixed-citation publication-type="other" xlink:type="simple">Stedinger, J., Vogel, R., and Foufoula-Georgiou, E.: Handbook of Hydrology, chap. 8: Frequency analysis of extreme events, McGraw-Hill, New York, 1992.</mixed-citation>
</ref>
<ref id="ref29">
<label>29</label><mixed-citation publication-type="other" xlink:type="simple">Strupczewski, W., Singh, V., and Weglarczyk, S.: Asymptotic bias of estimation methods caused by the assumption of false probability distributions, J. Hydrol., 258, 122β148, 2002.</mixed-citation>
</ref>
<ref id="ref30">
<label>30</label><mixed-citation publication-type="other" xlink:type="simple">Vogel, R.: The probability plot correlation coefficient test for the normal, lognormal, and Gumbel distributional hypotheses, Water Resour. Res., 22, 587β590, 1986.</mixed-citation>
</ref>
<ref id="ref31">
<label>31</label><mixed-citation publication-type="other" xlink:type="simple">Vogel, R. and McMartin, D.: Probability plot goodness-of-fit and skewness estimation procedures for the Pearson type 3 distribution, Water Resour. Res., 27, 3149β3158, 1991.</mixed-citation>
</ref>
<ref id="ref32">
<label>32</label><mixed-citation publication-type="other" xlink:type="simple">Wang, Q.: Unbiased estimation of probability weighted moments and partial probability weighted moments from systematic and historical flood information and their application to estimating the GEV distribution, J. Hydrol., 120, 115β124, 1990.</mixed-citation>
</ref>
<ref id="ref33">
<label>33</label><mixed-citation publication-type="other" xlink:type="simple">Wang, Q.: Approximate goodness-of-fit tests of fitted generalized extreme value distributions using LH moments, Water Resour. Res., 34, 3497β3502, 1998.</mixed-citation>
</ref>
</ref-list>
</back>
</article>