Exploiting the information content of hydrological ''outliers'' for goodness-of-fit testing

Laio, F.; Allamano, P.; Claps, P.

doi:https://doi.org/10.5194/hess-14-1909-2010

Articles | Volume 14, issue 10

https://doi.org/10.5194/hess-14-1909-2010

© Author(s) 2010. This work is distributed under
the Creative Commons Attribution 3.0 License.

Special issue:

Advances in statistical hydrology

https://doi.org/10.5194/hess-14-1909-2010

© Author(s) 2010. This work is distributed under
the Creative Commons Attribution 3.0 License.

Articles | Volume 14, issue 10

Research article

|

12 Oct 2010

Research article |

| 12 Oct 2010

Exploiting the information content of hydrological ''outliers'' for goodness-of-fit testing

F. Laio, P. Allamano, and P. Claps

Abstract. Validation of probabilistic models based on goodness-of-fit tests is an essential step for the frequency analysis of extreme events. The outcome of standard testing techniques, however, is mainly determined by the behavior of the hypothetical model, F_X(x), in the central part of the distribution, while the behavior in the tails of the distribution, which is indeed very relevant in hydrological applications, is relatively unimportant for the results of the tests. The maximum-value test, originally proposed as a technique for outlier detection, is a suitable, but seldom applied, technique that addresses this problem. The test is specifically targeted to verify if the maximum (or minimum) values in the sample are consistent with the hypothesis that the distribution F_X(x) is the real parent distribution. The application of this test is hindered by the fact that the critical values for the test should be numerically obtained when the parameters of F_X(x) are estimated on the same sample used for verification, which is the standard situation in hydrological applications. We propose here a simple, analytically explicit, technique to suitably account for this effect, based on the application of censored L-moments estimators of the parameters. We demonstrate, with an application that uses artificially generated samples, the superiority of this modified maximum-value test with respect to the standard version of the test. We also show that the test has comparable or larger power with respect to other goodness-of-fit tests (e.g., chi-squared test, Anderson-Darling test, Fung and Paul test), in particular when dealing with small samples (sample size lower than 20–25) and when the parent distribution is similar to the distribution being tested.

Received: 06 Jul 2010 – Discussion started: 22 Jul 2010 – Revised: 27 Sep 2010 – Accepted: 05 Oct 2010 – Published: 12 Oct 2010