Disinformative data in large-scale hydrological modelling

Kauffeldt, A.; Halldin, S.; Rodhe, A.; Xu, C.-Y.; Westerberg, I. K.

doi:https://doi.org/10.5194/hess-17-2845-2013

Articles | Volume 17, issue 7

https://doi.org/10.5194/hess-17-2845-2013

© Author(s) 2013. This work is distributed under
the Creative Commons Attribution 3.0 License.

https://doi.org/10.5194/hess-17-2845-2013

© Author(s) 2013. This work is distributed under
the Creative Commons Attribution 3.0 License.

Articles | Volume 17, issue 7

Research article

|

22 Jul 2013

Research article |

| 22 Jul 2013

Disinformative data in large-scale hydrological modelling

A. Kauffeldt, S. Halldin, A. Rodhe, C.-Y. Xu, and I. K. Westerberg

Abstract. Large-scale hydrological modelling has become an important tool for the study of global and regional water resources, climate impacts, and water-resources management. However, modelling efforts over large spatial domains are fraught with problems of data scarcity, uncertainties and inconsistencies between model forcing and evaluation data. Model-independent methods to screen and analyse data for such problems are needed. This study aimed at identifying data inconsistencies in global datasets using a pre-modelling analysis, inconsistencies that can be disinformative for subsequent modelling. The consistency between (i) basin areas for different hydrographic datasets, and (ii) between climate data (precipitation and potential evaporation) and discharge data, was examined in terms of how well basin areas were represented in the flow networks and the possibility of water-balance closure. It was found that (i) most basins could be well represented in both gridded basin delineations and polygon-based ones, but some basins exhibited large area discrepancies between flow-network datasets and archived basin areas, (ii) basins exhibiting too-high runoff coefficients were abundant in areas where precipitation data were likely affected by snow undercatch, and (iii) the occurrence of basins exhibiting losses exceeding the potential-evaporation limit was strongly dependent on the potential-evaporation data, both in terms of numbers and geographical distribution. Some inconsistencies may be resolved by considering sub-grid variability in climate data, surface-dependent potential-evaporation estimates, etc., but further studies are needed to determine the reasons for the inconsistencies found. Our results emphasise the need for pre-modelling data analysis to identify dataset inconsistencies as an important first step in any large-scale study. Applying data-screening methods before modelling should also increase our chances to draw robust conclusions from subsequent model simulations.

Received: 14 Dec 2012 – Discussion started: 14 Jan 2013 – Revised: 01 May 2013 – Accepted: 02 Jun 2013 – Published: 22 Jul 2013