Looking beyond general metrics for model comparison – lessons from an international model intercomparison study

de Boer-Euser, Tanja; Bouaziz, Laurène; De Niel, Jan; Brauer, Claudia; Dewals, Benjamin; Drogue, Gilles; Fenicia, Fabrizio; Grelier, Benjamin; Nossent, Jiri; Pereira, Fernando; Savenije, Hubert; Thirel, Guillaume; Willems, Patrick

doi:https://doi.org/10.5194/hess-21-423-2017

Articles | Volume 21, issue 1

© Author(s) 2017. This work is distributed under
the Creative Commons Attribution 3.0 License.

Research article | 25 Jan 2017

Abstract. International collaboration between research institutes and universities is a promising way to reach consensus on hydrological model development. Although model comparison studies are very valuable for international cooperation, they often do not yield clear new insights regarding the relevance of the modelled processes. We hypothesise that this is partly caused by model complexity and by the comparison methods used, which focus too much on good overall performance rather than on a variety of specific events. In this study, we use an approach that focuses on the evaluation of specific events and characteristics. Eight international research groups calibrated their hourly model on the Ourthe catchment in Belgium and carried out a validation in time for the Ourthe catchment and a validation in space for nested and neighbouring catchments. The same protocol was followed for each model, and an ensemble of best-performing parameter sets was selected. Although the models showed similar performances based on general metrics (i.e. the Nash–Sutcliffe efficiency), clear differences could be observed for specific events. We analysed the hydrographs of these specific events and conducted three types of statistical analyses on the entire time series: cumulative discharges, the empirical extreme value distribution of the peak flows, and flow duration curves for low flows. For the studied catchments, the results illustrate the relevance of including a very quick flow reservoir preceding the root zone storage to model peaks during low flows, and of including a slow reservoir in parallel with the fast reservoir to model the recession. This intercomparison enhanced the understanding of the hydrological functioning of the catchment, in particular for low flows, and made it possible to identify present knowledge gaps for other parts of the hydrograph. Above all, it helped to evaluate each model against a set of alternative models.
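As an illustration of two of the quantities named in the abstract, the following is a minimal Python sketch of the Nash–Sutcliffe efficiency and of an empirical flow duration curve. It is not taken from the study: the function names `nash_sutcliffe` and `flow_duration_curve` are hypothetical, and the Weibull plotting position rank / (n + 1) used for the exceedance probability is an assumption, since the abstract does not state which formula the authors applied.

```python
def nash_sutcliffe(observed, simulated):
    """Nash-Sutcliffe efficiency: 1 - sum((obs - sim)^2) / sum((obs - mean(obs))^2)."""
    if len(observed) != len(simulated) or len(observed) == 0:
        raise ValueError("series must be non-empty and of equal length")
    mean_obs = sum(observed) / len(observed)
    squared_error = sum((o - s) ** 2 for o, s in zip(observed, simulated))
    obs_variance = sum((o - mean_obs) ** 2 for o in observed)
    if obs_variance == 0:
        raise ValueError("observed series is constant; efficiency is undefined")
    return 1.0 - squared_error / obs_variance


def flow_duration_curve(discharge):
    """Return (exceedance probability, discharge) pairs, highest flow first.

    Assumption: Weibull plotting position, rank / (n + 1).
    """
    ordered = sorted(discharge, reverse=True)
    n = len(ordered)
    return [(rank / (n + 1), q) for rank, q in enumerate(ordered, start=1)]


if __name__ == "__main__":
    # Made-up discharge values, for demonstration only.
    obs = [1.0, 2.0, 3.0]
    print(nash_sutcliffe(obs, obs))              # perfect simulation
    print(nash_sutcliffe(obs, [2.0, 2.0, 2.0]))  # simulation equal to the observed mean
    print(flow_duration_curve([1.0, 3.0, 2.0]))
```

A single efficiency value summarises the whole series, whereas the low-flow end of the flow duration curve (high exceedance probabilities) isolates one part of the hydrograph, which is the kind of distinction the abstract draws between general metrics and specific characteristics.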

Received: 08 Jul 2016 – Discussion started: 20 Jul 2016 – Revised: 29 Nov 2016 – Accepted: 16 Dec 2016 – Published: 25 Jan 2017