Technical note: Comparing three different methods for allocating river points to coarse-resolution hydrological modelling grid cells

Godet, Juliette; Gaume, Eric; Javelle, Pierre; Nicolle, Pierre; Payrastre, Olivier

doi:https://doi.org/10.5194/hess-28-1403-2024

Articles | Volume 28, issue 6

https://doi.org/10.5194/hess-28-1403-2024

Articles | Volume 28, issue 6

Technical note

27 Mar 2024

Technical note |

| 27 Mar 2024

Technical note: Comparing three different methods for allocating river points to coarse-resolution hydrological modelling grid cells

Juliette Godet, Eric Gaume, Pierre Javelle, Pierre Nicolle, and Olivier Payrastre

Abstract

The allocation of points in a river network to pixels of a coarse-resolution hydrological modelling grid is a well-known issue, especially for hydrologists who use measurements at gauging stations to calibrate and validate distributed hydrological models. To address this issue, the traditional approach involves examining grid cells surrounding the considered river point and selecting the best candidate, based on distance and upstream drainage area as decision criteria. However, recent studies have suggested that focusing on basin boundaries rather than basin areas could prevent many allocation errors, even though the performance gain is rarely assessed. This paper compares different allocation methods and examines their relative performances. Three methods representing various families of methods have been designed: area-based, topology-based and contour-based methods. These methods are implemented to allocate 2580 river points to a 1 km hydrological modelling grid. These points are distributed along the entire hydrographic network of the French southeastern Mediterranean region, covering upstream drainage areas ranging from 5 to 3000 km². The results indicate that the differences between the methods can be significant, especially for small upstream catchment areas.

Download & links

Article (PDF, 3446 KB)

Download & links

How to cite.

Received: 04 Jul 2023 – Discussion started: 10 Jul 2023 – Revised: 08 Feb 2024 – Accepted: 13 Feb 2024 – Published: 27 Mar 2024

1 Introduction

In hydrology, rainfall–runoff model outputs are often compared to observed discharge series at gauging stations for calibration or evaluation purposes. Vector-based hydrological models are adequate to meet these objectives, because it is straightforward to locate a gauging station along the river network. However, when using gridded models, it is necessary to allocate each gauging station to a specific cell in the model grid. In the literature, terms such as “co-registering” (Fekete et al., 2002), “co-referencing” (Döll and Lehner, 2002), and “matching” (Wang et al., 2018) are also used to describe this process. The allocation of specific river points to a coarse-resolution cell can also be necessary when connecting the output of a hydrological distributed model (providing hydrographs or peak discharges on a grid) to a hydraulic model for inundation modelling. As an example, Dottori et al. (2017) developed a European-wide flood risk assessment system, based on the European Flood Awareness System (EFAS; see Thielen et al., 2009; Bartholmes et al., 2009). The discharge output of EFAS is provided on a grid of spatial resolution of 5 km, which needed to be downscaled to a resolution of 100 m in order to derive flood hazard maps at the pan-European scale. Dottori et al. (2015) opted for a basic method consisting of allocating the 100 m river pixel to the 5 km river cell containing this pixel. This approach has limitations, particularly when the two river networks defined at the 100 m and 5 km scales do not overlap.

Generally, the allocation of river points to a coarse-resolution grid for hydrological modelling relies on distance and upstream drainage area (UPA) error criteria (Döll and Lehner, 2002; Fekete et al., 2002; Lehner, 2012; Zhao et al., 2017; Sutanudjaja et al., 2018; Wang et al., 2018; Burek et al., 2020; Polcher et al., 2022). However, this process is also prone to errors, especially near confluences, where points of different branches of the river network may have similar UPAs. As a result, their allocation on the coarse-resolution grid can lead to assigning a point to the wrong hydrological grid cell and corresponding upstream watershed, based on a slightly better UPA fit or a slightly shorter distance (see Fig. 1 for an example).

https://hess.copernicus.org/articles/28/1403/2024/hess-28-1403-2024-f01

Figure 1Example of allocation process failure when based on distance and UPA error criteria: green river point is allocated to grid cell C8 instead of grid cell C6.

Download

Considering this possible limitation, efforts have been made to propose more effective protocols for allocating river points to hydrological grid cells. For instance, the methods proposed by Burek and Smilovic (2022) and Munier and Decharme (2022) combine distance and UPA error criteria with a comparison between the gauging station's basin boundaries, delineated on the basis of a fine-resolution digital elevation model (DEM), and the basin boundaries of the allocated cell based on the coarse-resolution hydrological grid. In both studies, the similarity between the watershed limits is characterised by the “Intersection over Union” index (Rezatofighi et al., 2019).

The idea of comparing basin boundaries had been previously considered, although in a slightly different context, namely the evaluation of hydrological grids obtained from an upscaling algorithm (i.e., transforming a fine-resolution grid into a coarser-resolution grid). Initially, upscaling algorithms were also guided by a comparison between the UPA values of “small pixels” and the corresponding upscaled “large cells” (Reed, 2003; Paz et al., 2006; Yamazaki et al., 2008; Eilander et al., 2021). Visual inspections were performed to identify obvious inconsistencies between coarse- and high-resolution river networks. To reduce these inconsistencies automatically, additional criteria have been proposed to complement the UPA criterion, such as the mean distance between river networks and the percentage within a buffer (Davies and Bell, 2009), the correctness index and the figure of merit (Li and Wong, 2010), and the watershed delineation percentages of consistency (Sousa and Paz, 2017).

In summary, numerous methods are available to achieve the objective of allocating a river point to a coarse-resolution grid cell. However, these methods have been developed in different contexts and have rarely been compared. This study aims at comparing the results obtained from three different types of methods for allocating a large number (2580) of river points to a coarse-resolution hydrological grid (1 km × 1 km). The first method belongs to the category of area-based methods and employs distance and UPA error criteria. The third method is a contour-based approach. The second method is a topological method based on proximity along the river network. Another unique aspect of this work is that it deals with a detailed river network that includes river points with small drainage areas (minimum of 5 km²), whereas most previous studies have been limited to the main river networks (catchments larger than 500 km² in Dottori et al., 2017, for instance). In this study, 1 km × 1 km is considered coarse resolution because the hydrological model is intended for the regional scale. However, the same problem could arise for hydrological modelling applied on a continental scale where the resolution will be coarser than 1 km (i.e. 5 to 10 km).

The paper is structured as follows. Section 2 presents the three tested allocation methods as well as the validation metrics. Section 3 provides an overview of the case study. Section 4 compares the results obtained with the three tested allocation methods, and Sect. 5 highlights the limitations of the study and summarises the key messages.

2 Allocation methods and validation metrics

In this section, we describe three methods that allow for the allocation of a river point to a coarse-resolution grid cell. The validation metrics used to compare the methods are described at the end of the section.

2.1 Method 1: area-based method

Area-based methods can be traced back to Döll and Lehner (2002), who proposed allocating river points to the coarse-resolution grid cells containing the points, provided that the relative difference between coarse and reference resolution UPAs did not exceed 5 %. This criterion led, in their case, to a manual reallocation of 35 % of the points. In order to automate the allocation procedure, Lehner (2012) proposed to select the grid cell within a 5 km radius search area around each river point (see Fig. 2) with the lowest value of a discrepancy criterion $D = RA + 2 R$ , where RA stands for the relative difference between UPAs that should not exceed 50 % and R for the distance between the point and the centre of the grid cell. In most other studies (Zhao et al., 2017; Sutanudjaja et al., 2018; Wang et al., 2018; Burek et al., 2020), RA is the only selection criterion, with the radius of the search area (1–25 km, depending on the spatial resolution) and the maximum acceptable RA value for a successful allocation (10 %–30 %) varying between studies. To maximise the proportion of allocated river points and to optimise the computation times, the approach proposed herein proceeds in three possible successive steps. At step 1, the closest grid cell verifying RA < 10 % and R< 3 cells is selected if it exists. If it does not, the maximum RA value is increased to 20 % at step 2 and 30 % at step 3. It can be noted that the proposed approach combines area and distance criteria. Also, if a maximum difference between UPAs of 30 % is a recurrent choice in the literature, e.g. Burek et al. (2020), regardless of the studied model resolutions, the distance criterion R< 3 is more study-dependent. In the present study, it appeared after some tests as a good compromise providing accurate results. However when using global-scale hydrological models and coarser grids, the value of R may have to be adjusted.

https://hess.copernicus.org/articles/28/1403/2024/hess-28-1403-2024-f02

Figure 2Illustration of Method 1: grid cell candidates for a specific river point. In this situation, the river point is allocated to the green grid cell with hatching.

2.2 Method 2: topology-based method

This second method requires a vector-based river network and the definition of coarse grid cell outlet points. The cells' outlet points are located and selected according to the iterative hydrography upscaling (IHU) method (Eilander et al., 2021), used to generate the coarse-resolution hydrological modelling grid (see Sect. 3.2). Each river point can then be connected to the closest upstream or downstream grid cell outlet point and hence allocated to the corresponding grid cell, provided that both points belong to the same river reach, i.e. are not separated by a network confluence (case of point P3 in Fig. 3). With this method, river points located between two confluences within the same grid cell cannot be allocated.

https://hess.copernicus.org/articles/28/1403/2024/hess-28-1403-2024-f03

Figure 3Illustration and limits of Method 2: connection of river points and cell outlet points (black dotted arrows) with an impossibility for point P3, located between two confluences in the same grid cell.

Download

2.3 Method 3: contour-based method

Several previous studies have stressed the importance of considering the consistency of watershed contours for the evaluation or optimisation of upscaling or allocation methods (Davies and Bell, 2009; Li and Wong, 2010; Sousa and Paz, 2017). Method 3 is similar to Method 1, except that it is not only based on the comparison between upstream watershed surfaces but also between upstream watershed contours. While Munier and Decharme (2022) and Burek and Smilovic (2022) did propose an allocation criterion based on a combination of an area-based and a contour-based criterion, it is proposed herein to base the selection of the appropriate grid cell for each river point based on a single criterion, namely the critical success index, CSI (see Eq. 1). The CSI is a standard score to compare surfaces, often used to compare flood inundation models for instance (Fleischmann et al., 2019; Hocini et al., 2021).

\begin{matrix} (1) & CSI = \frac{a}{a + b + c} \end{matrix}

where a (“HIT”, see Fig. 4) is the overlapping area between the reference upstream watershed of the considered river point and the upstream watershed of the candidate grid cell, defined on the coarse-resolution grid using the TAUDEM library (Tarboton, 1997). b is the area of the reference watershed not overlapping with the coarse grid watershed (“MISS”), and conversely, c is the area of the coarse grid watershed not overlapping with the reference watershed (“FALSE ALARM”). CSI is equal to 1 in the case of a perfect overlap and 0 when there is no overlap at all. It can be noted that the CSI has often been used with alternative denominations in previous studies, such as the “Intersection over Union” criterion (Munier and Decharme, 2022; Burek and Smilovic, 2022), the Figure of Merit (Li and Wong, 2010), or Fit Metric (Fleischmann et al., 2019).

The grid cell maximising the CSI value is selected among the nine cells closest to the considered river point. If a minimum CSI value is not reached (i.e. 0.4 for watershed areas under 10 km² and 0.6 otherwise), the search area is extended to the 7 × 7 = 49 closest grid cells. Tests carried out prior to this choice showed that extending the research area to the 5 × 5 = 25 closest grid cells did not allow most of the targeted cases to be corrected. The CSI thresholds can be adjusted (see Sect. 4.4), as well as the surface threshold and the extended research area, since they may depend on the model resolution.

https://hess.copernicus.org/articles/28/1403/2024/hess-28-1403-2024-f04

Figure 4Illustration of Method 3: CSI calculations for the nine candidate cells. In this situation, the river point is allocated to the candidate cell of the blue box scenario, which maximises the CSI (CSI = 0.80).

2.4 Evaluation metrics

The allocation procedures aims at relating points of the river network, often corresponding to a gauging station, to coarse grid cells of a distributed hydrological model, with the upstream watershed closest to the actual watershed or at least closest to the watershed delineated based on the finest available geographical data. Therefore, the CSI appears as the best suited score for the efficiency evaluation of allocation methods and will be used hereafter. The CSI is calculated based on the low-resolution catchment contours (reference contours). By construction, Method 3 should then lead to the best performances but at the cost of a higher implementation complexity and significantly larger computation times, as will be illustrated herein. Hence, the main question will be, how well do Methods 1 and 2 compare to Method 3?

3 Case study and data

The test area covers three departments in the Eastern Mediterranean region of France, with a total surface area exceeding 15 000 km² (Fig. 5). Two geographical datasets have been used to implement and compare the allocation methods: shapefiles of the river points to be allocated and their reference catchment boundaries and a raster describing the coarse-resolution grid specifically designed for the purpose of the study.

3.1 The river points to be allocated

The BNBV (Base Nationale des Bassins Versants) is a French reference GIS (geographic information system) layer describing the network of rivers with an upstream catchment area larger than 5 km², over the whole territory of France. It was produced by Organde et al. (2013), based on the processing of a hydrologically-validated 50 m resolution flow direction grid. It includes a vector description of the river reaches and identifies approximately 15 000 points of interest along the river network across mainland France. These points correspond to locations of gauging stations, urban areas, confluences and river mouths. Intermediate outlets were also added to ensure comprehensive coverage of the whole river network. The upstream watershed limits and areas are associated with each BNBV point.

Figure 5 displays the BNBV outlets and river reaches in the Eastern Mediterranean region. BNBV outlets located in the Étang de Berre area were excluded from our study due to the lack of meaningful flow direction in a lagoon. The region contains 2580 BNBV outlets that will be allocated to a coarse-resolution grid. The vector representation of the upstream catchment limits serves as the reference for the evaluation and for the implementation of Method 3, while the vector description of river reaches is essential for Method 2.

https://hess.copernicus.org/articles/28/1403/2024/hess-28-1403-2024-f05

Figure 5The 2580 BNBV outlets on the French Eastern Mediterranean region.

3.2 The coarse-resolution hydrological modelling grid

Regional gridded hydrological models are often implemented on 1 km resolution grids, aligning with the typical resolution of operational radar-based quantitative precipitation estimates. In this context, the 1 km × 1 km hydrological modelling grid was herein generated by upscaling the 50 m flow direction grid from the BNBV database using the IHU method (Eilander et al., 2021); see Fig. 6 to visualise the upscaling results. The IHU method incorporates principles from previous upscaling methods (Döll and Lehner, 2002; Fekete et al., 2001; Olivera et al., 2002; Paz et al., 2006; Wu et al., 2011) and has demonstrated superior performance compared to the benchmarks methods, e.g. the DMM method (Olivera et al., 2002) and EAM method (Yamazaki et al., 2008). Moreover, the IHU method is the only fully automated and open-source flow direction grid upscaling method known to the authors. After implementing the IHU method, we made minimal manual corrections to the flow direction grid. Only a small number of cells, approximately a dozen out of around 14 000 cells, required adjustments. These manual corrections were primarily made along the zone's borders, particularly near the coastline.

https://hess.copernicus.org/articles/28/1403/2024/hess-28-1403-2024-f06

Figure 6Initial 50 m river network vectorised for S> 5 km² (a) and 1 km upscaled surface accumulation (S> 5 km²) grid (b).

4 Results

4.1 Advantages and disadvantages of each method

Table 1 provides an overview of the general advantages and disadvantages of each method. Method 1 and 2 demonstrate superior performance in terms of computation times. The most time-consuming stage of Method 3 is the computation of the upstream catchment for each candidate cell, as well as its comparison with the reference catchment. The duration of these processes will naturally increase with the size of the considered case study and the number of points to be allocated. However, the computation times might not be considered a crucial issue, since the allocation of river points to grid cells needs to be performed only once, before hydrological modelling. Method replicability is probably a more important issue for this work. Method 2 is limited in its compatibility as it relies on the IHU upscaling, As a consequence, it makes this method inoperable in contexts where the coarse-resolution gridded network does not coincide with the river network (i.e. both networks may come from different data sources). On the other hand, Method 3 requires reference catchment boundaries. Furthermore, the confidence level varies between the methods. Method 1 relies solely on basin area information, which means that near confluences it may allocate a river point to a neighbouring grid cell with a similar basin area but belonging to a different catchment. Method 2 ensures that the river points and chosen grid cell belong to the same river reach, while Method 3 selects the cell with the most similar upstream catchment in terms of basin contour and location.

Table 1Advantages and disadvantages of each method.

Download Print Version | Download XLSX

Lastly, Method 3 guarantees the allocation of all river points, covering 100 % of the dataset, whereas the other two methods may not achieve complete allocation. For instance, river points similar to P3 in Fig. 3, i.e. points located between two confluences within the same grid cell, are not allocated with Method 2. These points represent about 2 % of the total number of points to be allocated in the considered region. One potential solution could consist of selecting, among the closest upstream or downstream outlets of grids, the one with the nearest upstream watershed area. Another solution would be to allocate P3 to both upstream cells (C3 and C5 in Fig. 3); however, in this paper we chose not to permit the allocation to several grid cells. In these cases, Method 2 detects that all allocations to a single grid cell will be imperfect, which is why it was not deemed essential to propose an allocation solution for these specific points for the sake of comparison.

4.2 Comparison of allocation performances

In order to compare the quality of allocation among the three methods, we initially examined the CSI statistics. However, it should be noted that while Method 1 and Method 3 successfully allocated all 2580 considered river points (BNBV outlets in the Eastern Mediterranean zone), Method 2 only allocated 2532 points (98 %). To ensure a fair comparison, these 48 points were excluded from the analysis. Figure 7a displays the histograms of CSI values for each method, along with the corresponding mean and median values.

https://hess.copernicus.org/articles/28/1403/2024/hess-28-1403-2024-f07

Figure 7Histograms of (a) CSI values and (b) UPA relative error for the three allocation methods.

Download

The results indicate that Method 1 has significantly lower performances compared to Methods 2 and 3. This discrepancy is primarily attributed to a high percentage of river points with very low CSI scores (less than 0.05) in Method 1. These low scores occur when a river point is allocated to a cell solely based on similar basin area, disregarding substantial differences in the shape and location of the upstream catchments (e.g. Fig. 1). In its state, Method 2 cannot allocate 100 % of the river outlets, even though it could with a small extension (see previous section). Method 3 was able to allocate the excluded outlets successfully, with a mean CSI of 0.7. Additionally, Fig. 7a highlights that Method 3 consistently yields a minimum CSI of around 0.25, whereas Method 2 falls below 0.05 (with 12 river points having CSI < 0.25). Figure 8 provides a more detailed comparison of CSI scores between Method 2 and Method 3.

https://hess.copernicus.org/articles/28/1403/2024/hess-28-1403-2024-f08

Figure 8Comparison of CSI scores for each outlet, between Methods 3 and 1 and between Methods 3 and 2.

Download

It clearly demonstrates the consistent superiority of Method 3 over Method 2. While the CSI differentials are generally small, the green circle in the figure highlights cases where the differential can be significant. These are characterised by river points with small upstream basin areas (less than 12 km²) located far from the nearest cell outlet point, resulting in notable differences in UPAs. Figure 9 shows an example of these situations, which are often complex configurations where different branches of the network are in close proximity to each other, often within the same grid cell. As a consequence, the upscaled flow directions cannot represent faithfully the river network, and all candidate cells would give a mediocre allocation result. However we argue that Method 3 will give the most correct allocations in these situations. Furthermore, Fig. 8 reveals that Method 3 effectively corrects the allocation errors made by Method 1 (indicated by orange circles in the figure). However, it also indicates that there are surprisingly few cases where Method 1 yields slightly better CSI scores than Method 3 (nine cases circled in blue). This suggests that the minimum CSI values defined in Sect. 2.3 for the two possible steps of Method 3 may not be optimal, a topic that will be further discussed in Sect. 4.4.

https://hess.copernicus.org/articles/28/1403/2024/hess-28-1403-2024-f09

Figure 9An example of high CSI differential between Methods 2 and 3 (basin area 12 km²). For the sake of comprehension, small (< 5 km²) tributaries were added to the vectorial network in this figure, explaining why the river outlet can be allocated with Method 2 despite being apparently located between two confluences within the same grid cell.

Finally, the difference in UPAs between each reference catchment and its corresponding coarse-resolution catchment was also calculated, even though we decided not to use this metric for the comparison of the three methods, in agreement with the many hydrologists (e.g. Davies and Bell, 2008) who have pointed out that an evaluation based on UPA comparison is highly uncertain. This thesis is supported by the results presented in Fig. 7b, which show a slightly smaller difference in upstream area for Method 1 than for Methods 2 and 3. Thus, comparing the three allocation methods based on this criterion only would be misleading, because it would not account for all the cases where a river point is allocated to a hydrological modelling cell describing a different upstream catchment if they have similar UPAs. However, these results show that the relative difference in UPAs remains limited (mostly lower than 15 %) for all three methods.

4.3 The influence of the upstream basin area

The analyses performed in this study also revealed that basin area plays a significant role in explaining the largest allocation errors. Specifically, among all the river points allocated using Method 1 and having a CSI lower than 0.05, 80 % have a basin area smaller than 9 km². Similarly, with Method 3, among all the river points with CSI scores lower than 0.6, 100 % have a basin area lower than 25 km² and 92 % have a basin area lower than 10 km².

The boxplot comparisons in Fig. 10 highlight two important observations. Firstly, Method 3 effectively prevents the largest allocation errors generated by Method 1. Secondly, these allocation errors predominantly occur for small catchment areas. For catchments larger than 100 km², CSI scores systematically exceed 0.75 for both methods. Consequently, Method 1 can be considered a reliable method when applied to the main river network, which explains its widespread usage in previous studies: for almost all the studied watersheds, the minimum catchment size in various previous research studies was between 100 and 10 000 km² (Fekete et al., 2002; Döll and Lehner, 2002; Sutanudjaja et al., 2018; Wang et al., 2018; Zhao et al., 2017; Burek et al., 2020; Polcher et al., 2022).

https://hess.copernicus.org/articles/28/1403/2024/hess-28-1403-2024-f10

Figure 10Boxplots of CSI values divided into classes of surfaces for Methods 1 and 3.

Download

In addition, Fig. 10 also confirms that, with Method 3, the lowest CSI scores are obtained for small catchments – the minimum CSI value is consistently higher than 0.7 for catchment sizes larger than 50 km². The low CSI values reflect the uncertainties affecting the boundaries of small basins defined on a 1 km resolution grid, particularly pronounced for narrow watersheds. However, it is important to note that the impact of low CSI scores on the representation of rainfall is relatively minor for small catchments compared to larger ones because of the limited variability of rainfall at scales of the order of a square kilometre.

4.4 Effect of the criterion conditioning Method 3 iteration

While Method 3 appears, without surprise, to be the most reliable method, it has also some shortcomings. One is the limited search zone on the coarse-resolution hydrological grid. The current implementation first considers only the 9 surrounding pixels and then extends the search area to 49 surrounding pixels under certain conditions. This approach was chosen to reduce the computation time, which is significantly longer for Method 3 compared to the other two methods. The criterion used for the second iteration depends on the catchment size, based on the assumption that the upstream drainage area (UPA) would influence the allocation results.

The second iteration of Method 3 is only activated for 70 outlets out of 2580. Extending the research area in the second iteration improved the allocation for only 10 out of these 70 outlets. However, the increase in CSI for these improved allocations was quite significant, with a median increase of 133 %. Moreover, it is obvious in Fig. 8 that Method 3 does not lead to the optimal allocation in some cases: see the nine cases circled in blue. A visual analysis indicates that the search area composed of the 49 surrounding grid cells would have solved the problem if the second iteration of Method 3 would have been activated. Increasing the threshold values for this activation to 0.55 (resp. 0.65) for catchment areas lower (resp. higher) than 10 km² would solve the problem encountered for these nine points in the present case study, but at the cost of higher computing times: 353 points processed in the second iteration of Method 3 instead of 70 for the initial threshold values.

5 Discussion and conclusions

This comparative study of methods allocating river points to coarse grid cells was driven by the shift in approach from area-based methods to contour-based methods. In this work, we compared these two categories of methods and introduced a new approach based on topological proximity.

5.1 Performance and scope of each method

The study results revealed that contour-based methods were more relevant and satisfying from a hydrological point of view, although costly in terms of computing time. The introduced topology-based method is a good compromise because it leads to similar allocation quality than the contour-based method. However, it is inoperable when the fine- and coarse-resolution river networks come from different data sources, since it requires the definition of cell outlet points as well as the vector-based description of the river network. Moreover, it cannot allocate as many points as the contour-based method.

The area-based method generated numerous allocation errors, which the contour-based method was able to address for a significant portion of them. However, upon closer examination, it was observed that the performance gap between both methods was more pronounced for small catchments, while being less significant for larger catchments (with S>100 km²). The area-based methods thus lead to satisfying results if we only consider river points with large UPAs compared to the grid cell resolution. Based on the results obtained, we would recommend a minimum factor of 100 between a river point's UPA and the resolution of the hydrological modelling grid for the application of an area-based method.

The transferability of the results outside the test area and to coarser resolutions (i.e. global hydrological models) is debatable, as there are many uncertainties and non-linearities in the representation of hydrological information at such larger scales and coarser resolutions. However, it is very likely that, with coarser resolution grids, allocation problems will increase and that errors related to area-based methods will impact larger catchments (i.e. larger than 100 km²). Even if this needs to be verified, the contour-based method will certainly remain more effective at coarser resolutions than the area-based method.

5.2 Remaining limitations of allocating river points to coarse grid cells

Even if allocation errors are reduced, some low CSI scores remain with contour-based methods, due to the inherent difficulty of representing the boundaries of small basins at a 1 km resolution. The situation illustrated in Fig. 9 is an example of configurations where different branches of the river system are crossing the same coarse grid cell, which makes it impossible to correctly represent the river network and basin contour with coarse flow directions. However, it is important to note that these inevitable errors resulting in low CSI scores are generally found on small catchments, and they are less problematic than low CSI scores on larger catchments because of the more limited variability of rainfall at smaller spatial scales.

A possible way to reduce these remaining errors could be to allocate river points to multiple grid cells, by either taking the sum of the upstream cells or the difference of the downstream cells, even though it could complicate the hydrological modelling at a later stage. Another approach that could circumvent the challenges faced would be to use hydrological models structured based on vectorial objects instead of regular grids. These models preserve the topology of river networks and allow seamless integration of observational data. However, vector-based modelling also introduces its own challenges related to data and computational requirements and the need for accurate input data.

Code and data availability

All data and codes used for this study are provided in an open-access format on the French public data platform data.gouv.fr: https://doi.org/10.57745/7GCCUN (Godet, 2023).

Author contributions

JG performed the computational work, helped by PN, and did most of the writing. PN, OP, EG and PJ supervised this work and reviewed the paper. JG, OP and EG replied to the reviewer comments.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher’s note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.

Acknowledgements

The BNBV database was provided by SCHAPI (Service central d'hydrométéorologie et d'appui à la prévision des inondations). The authors would like to thank Patrick Arnaud from INRAE Aix-en-Provence, who wrote and shared the core code used for performing the area-based method, and Nouhaila Mesbahi, who first worked on this method during her internship at Gustave Eiffel University in 2021.

Financial support

This research was performed within the framework of the MUFFINS project (ANR-21-CE04-0021-01).

Review statement

This paper was edited by Albrecht Weerts and reviewed by Francesco Dottori and two anonymous referees.

References

Bartholmes, J. C., Thielen, J., Ramos, M. H., and Gentilini, S.: The european flood alert system EFAS – Part 2: Statistical skill assessment of probabilistic and deterministic operational forecasts, Hydrol. Earth Syst. Sci., 13, 141–153, https://doi.org/10.5194/hess-13-141-2009, 2009. a

Burek, P. and Smilovic, M.: The use of GRDC gauging stations for calibrating large-scale hydrological models, Earth Syst. Sci. Data, 15, 5617–5629, https://doi.org/10.5194/essd-15-5617-2023, 2022. a, b, c

Burek, P., Satoh, Y., Kahil, T., Tang, T., Greve, P., Smilovic, M., Guillaumot, L., Zhao, F., and Wada, Y.: Development of the Community Water Model (CWatM v1.04) – a high-resolution hydrological model for global and regional assessment of integrated water resources management, Geosci. Model Dev., 13, 3267–3298, https://doi.org/10.5194/gmd-13-3267-2020, 2020. a, b, c, d

Davies, H. and Bell, V.: Assessment of methods for extracting low-resolution river networks from high-resolution digital data, Hydrolog. Sci. J., 54, 17–28, https://doi.org/10.1623/hysj.54.1.17, 2009. a, b

Dottori, F., Kalas, M., Salamon, P., Bianchi, A., Thielen Del Pozo, J., and Feyen, L.: A near real-time procedure for flood hazard mapping and risk assessment in Europe, in: E-proceedings of the 36th IAHR World Congress, https://emergency.copernicus.eu/mapping/ems/near- real-time-procedure-flood-hazard-mapping-and-risk-assess ment-europe (last access: 20 March 2024), 2015. a

Dottori, F., Kalas, M., Salamon, P., Bianchi, A., Alfieri, L., and Feyen, L.: An operational procedure for rapid flood risk assessment in Europe, Nat. Hazards Earth Syst. Sci., 17, 1111–1126, https://doi.org/10.5194/nhess-17-1111-2017, 2017. a, b

Döll, P. and Lehner, B.: Validation of a new global 30-min drainage direction map, J. Hydrol., 258, 214–231, https://doi.org/10.1016/S0022-1694(01)00565-0, 2002. a, b, c, d, e

Eilander, D., van Verseveld, W., Yamazaki, D., Weerts, A., Winsemius, H. C., and Ward, P. J.: A hydrography upscaling method for scale-invariant parametrization of distributed hydrological models, Hydrol. Earth Syst. Sci., 25, 5287–5313, https://doi.org/10.5194/hess-25-5287-2021, 2021. a, b, c

Fekete, B. M., Vörösmarty, C. J., and Lammers, R. B.: Scaling gridded river networks for macroscale hydrology: Development, analysis, and control of error, Water Resour. Res., 37, 1955–1967, https://doi.org/10.1029/2001WR900024, 2001. a

Fekete, B. M., Vörösmarty, C. J., and Grabs, W.: High-resolution fields of global runoff combining observed river discharge and simulated water balances, Global Biogeochem. Cy., 16, 15-1–15-10, https://doi.org/10.1029/1999GB001254, 2002. a, b, c

Fleischmann, A., Paiva, R., and Collischonn, W.: Can regional to continental river hydrodynamic models be locally relevant? A cross-scale comparison, Journal of Hydrology X, 3, 100027, https://doi.org/10.1016/j.hydroa.2019.100027, 2019. a, b

Godet, J.: Three methods for allocating river points to coarse resolution grid cells: R codes, case study data and results, Version V1, Recherche Data Gouv [data set], https://doi.org/10.57745/7GCCUN, 2023. a

Hocini, N., Payrastre, O., Bourgin, F., Gaume, E., Davy, P., Lague, D., Poinsignon, L., and Pons, F.: Performance of automated methods for flash flood inundation mapping: a comparison of a digital terrain model (DTM) filling and two hydrodynamic methods, Hydrol. Earth Syst. Sci., 25, 2979–2995, https://doi.org/10.5194/hess-25-2979-2021, 2021. a

Lehner, B.: Derivation of watershed boundaries for GRDC gauging stations based on the HydroSHEDS drainage network, GRDC Report Series, Report 41, https://grdc.bafg.de/GRDC/EN/02_srvcs/24_rprtsrs/report_41.html?nn=201764 (last access: 20 March 2024), 2012. a, b

Li, J. and Wong, D. W. S.: Effects of DEM sources on hydrologic applications, Comput. Environ. Urban, 34, 251–261, https://doi.org/10.1016/j.compenvurbsys.2009.11.002, 2010. a, b, c

Munier, S. and Decharme, B.: River network and hydro-geomorphological parameters at 1/12° resolution for global hydrological and climate studies, Earth Syst. Sci. Data, 14, 2239–2258, https://doi.org/10.5194/essd-14-2239-2022, 2022. a, b, c

Olivera, F., Lear, M. S., Famiglietti, J. S., and Asante, K.: Extracting low-resolution river networks from high-resolution digital elevation models, Water Resour. Res., 38, 1231, https://doi.org/10.1029/2001WR000726, 2002. a, b

Organde, D., Javelle, P., Ardilouze, C., and Lamblin, R.: Base nationale des bassins versants du SCHAPI, Tech. rep., HYDRIS Hydrologie, 2013. a

Paz, A. R., Collischonn, W., and Lopes da Silveira, A. L.: Improvements in large-scale drainage networks derived from digital elevation models, Water Resour. Res., 42, W08502, https://doi.org/10.1029/2005WR004544, 2006. a, b

Polcher, J., Schrapffer, A., Dupont, E., Rinchiuso, L., Zhou, X., Boucher, O., Mouche, E., Ottlé, C., and Servonnat, J.: Hydrological modelling on atmospheric grids; using graphs of sub-grid elements to transport energy and water, EGUsphere [preprint], https://doi.org/10.5194/egusphere-2022-690, 2022. a, b

Reed, S. M.: Deriving flow directions for coarse-resolution (1–4 km) gridded hydrologic modeling, Water Resour. Res., 39, 1238, https://doi.org/10.1029/2003WR001989, 2003. a

Rezatofighi, H., Tsoi, N., Gwak, J., Sadeghian, A., Reid, I., and Savarese, S.: Generalized Intersection Over Union: A Metric and a Loss for Bounding Box Regression, in: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 15–20 June 2019, IEEE, 658–666, https://doi.org/10.1109/CVPR.2019.00075, ISSN: 2575-7075, 2019. a

Sousa, T. M. I. and Paz, A. R.: How to evaluate the quality of coarse-resolution DEM-derived drainage networks, Hydrol. Process., 31, 3379–3395, https://doi.org/10.1002/hyp.11262, 2017. a, b

Sutanudjaja, E. H., van Beek, R., Wanders, N., Wada, Y., Bosmans, J. H. C., Drost, N., van der Ent, R. J., de Graaf, I. E. M., Hoch, J. M., de Jong, K., Karssenberg, D., López López, P., Peßenteiner, S., Schmitz, O., Straatsma, M. W., Vannametee, E., Wisser, D., and Bierkens, M. F. P.: PCR-GLOBWB 2: a 5 arcmin global hydrological and water resources model, Geosci. Model Dev., 11, 2429–2453, https://doi.org/10.5194/gmd-11-2429-2018, 2018. a, b, c

Tarboton, D. G.: A new method for the determination of flow directions and upslope areas in grid digital elevation models, Water Resour. Res., 33, 309–319, https://doi.org/10.1029/96WR03137, 1997. a

Thielen, J., Bartholmes, J., Ramos, M.-H., and de Roo, A.: The European Flood Alert System – Part 1: Concept and development, Hydrol. Earth Syst. Sci., 13, 125–140, https://doi.org/10.5194/hess-13-125-2009, 2009. a

Wang, F., Polcher, J., Peylin, P., and Bastrikov, V.: Assimilation of river discharge in a land surface model to improve estimates of the continental water cycles, Hydrol. Earth Syst. Sci., 22, 3863–3882, https://doi.org/10.5194/hess-22-3863-2018, 2018. a, b, c, d

Wu, H., Kimball, J. S., Mantua, N., and Stanford, J.: Automated upscaling of river networks for macroscale hydrological modeling, Water Resour. Res., 47, W03517, https://doi.org/10.1029/2009WR008871, 2011. a

Yamazaki, D., Masutomi, Y., Oki, T., and Kanae, S.: An Improved Upscaling Method to Construct a Global River Map, in: Proceedings of the 4th Asia-Pacific Hydrology and Water Resources (APHW) Conference, Beijing, China, 3–5 November 2008. a, b

Zhao, F., Veldkamp, T. I. E., Frieler, K., Schewe, J., Ostberg, S., Willner, S., Schauberger, B., Gosling, S. N., Schmied, H. M., Portmann, F. T., Leng, G., Huang, M., Liu, X., Tang, Q., Hanasaki, N., Biemans, H., Gerten, D., Satoh, Y., Pokhrel, Y., Stacke, T., Ciais, P., Chang, J., Ducharne, A., Guimberteau, M., Wada, Y., Kim, H., and Yamazaki, D.: The critical role of the routing scheme in simulating peak river discharge in global hydrological models, Environ. Res. Lett., 12, 075003, https://doi.org/10.1088/1748-9326/aa7250, 2017. a, b, c