Deep Learning Methods for Flood Mapping: A Review of Existing Applications and Future Research Directions
- 1Department of Water Management, Faculty of Civil Engineering and Geosciences, Delft University of Technology
- 2Department of Intelligent Systems, Faculty of Electrical Engineering, Mathematics and Computer Science, Delft University of Technology
- 3Department of Hydraulic Engineering, Faculty of Civil Engineering and Geosciences, Delft University of Technology
- 1Department of Water Management, Faculty of Civil Engineering and Geosciences, Delft University of Technology
- 2Department of Intelligent Systems, Faculty of Electrical Engineering, Mathematics and Computer Science, Delft University of Technology
- 3Department of Hydraulic Engineering, Faculty of Civil Engineering and Geosciences, Delft University of Technology
Abstract. Deep Learning techniques have been increasingly used in flood management to overcome the limitations of accurate, yet slow, numerical models, and to improve the results of traditional methods for flood mapping. In this paper, we review 58 recent publications to outline the state-of-the-art of the field, identify knowledge gaps, and propose future research directions. The review focuses on the type of deep learning models used for various flood mapping applications, the flood types considered, the spatial scale of the studied events, and the data used for model development. The results show that models based on convolutional layers are usually more accurate as they leverage inductive biases to better process the spatial characteristics of the flooding events. Models based on fully-connected layers, instead, provide accurate results when coupled with other statistical models. Deep learning models showed increased accuracy when compared to traditional approaches and increased speed when compared to numerical methods. While there exist several applications in flood susceptibility, inundation, and hazard mapping, more work is needed to understand how deep learning can assist real-time flood warning during an emergency, and how it can be employed to estimate flood risk. A major challenge lies in developing deep learning models that can generalize to unseen case studies. Furthermore, all reviewed models and their outputs, are deterministic, with limited considerations for uncertainties in outcomes and probabilistic predictions. The authors argue that these identified gaps can be addressed by exploiting recent fundamental advancements in deep learning or by taking inspiration from developments in other applied areas. Models based on graph neural networks and neural operators can work with arbitrarily structured data and thus should be capable of generalizing across different case studies and could account for complex interactions with the natural and built environment. Physics-based deep learning can be used to preserve the underlying physical equations resulting in more reliable speed-up alternatives for numerical models. Similarly, probabilistic models can be built by resorting to Deep Gaussian Processes or Bayesian neural networks.
Roberto Bentivoglio et al.
Status: open (extended)
-
RC1: 'Comment on hess-2022-83', Anonymous Referee #1, 05 Apr 2022
reply
I think the authors have properly adressed my comments in the previous version. I wish to thank the authors for their efforts in producing a paper that will actually also be useful in the classroom. I suggest accepting the paper after the minor comments below have been adressed:
line 380: please check this sentence, I think it is the other way round, MAE is more robust. See the small numerical experiments here: https://towardsdatascience.com/comparing-robustness-of-mae-mse-and-rmse-6d69da870828
Section 3.3.3 - I think it would be good to clarify in this section and maybe also earlier when introducing the different applications, that flood inundation in this paper refers to mapping applications (in contrast to forecasting of flood inundation based on some meteorological input, which will yield much lower scores)
line 570: The logical flow does not make sense. You start outlining limitations of the methodological approaches and then suddenly jump to new applications. It's not clear how these are connected. I would say these text blocks belong into separate subsections
line 804 - 809: Maybe this can be formulated a bit more soft. Initial suggestions for accounting for changing terrain etc. have been published, but there certainly is more work to be done in this direction.
-
AC1: 'Reply on RC1', Roberto Bentivoglio, 22 Apr 2022
reply
We thank the Reviewer for following up with the review process and for the additional comments provided. We address each of them in the following lines.
> I think the authors have properly addressed my comments in the previous version. I wish to thank the authors for their efforts in producing a paper that will actually also be useful in the classroom. I suggest accepting the paper after the minor comments below have been addressed:
> line 380: please check this sentence, I think it is the other way round, MAE is more robust. See the small numerical experiments here: https://towardsdatascience.com/comparing-robustness-of-mae-mse-and-rmse-6d69da870828
We thank the Reviewer for pointing this out. We agree with this consideration so we modified lines 382-383 as follows: “In general, MAE may be preferred to RMSE since the latter is heavily influenced by the presence of extreme outliers.”
> Section 3.3.3 - I think it would be good to clarify in this section and maybe also earlier when introducing the different applications, that flood inundation in this paper refers to mapping applications (in contrast to forecasting of flood inundation based on some meteorological input, which will yield much lower scores)
We added a sentence in section 3.3 to better clarify what we refer to as flood inundation maps: “We remind the reader that, in this paper, we refer to flood inundation as the process of mapping flooded and non-flooded areas from a picture of a flood”
Moreover, we added a link to such a definition in section 3.3.3 as follows: “As defined in section 3.3, flood inundation mapping determines which cells of the flood picture are represented as flooded or not.”
> line 570: The logical flow does not make sense. You start outlining limitations of the methodological approaches and then suddenly jump to new applications. It's not clear how these are connected. I would say these text blocks belong into separate subsections
We thank the Reviewer for this comment. As the previous version was confusing, we redistributed the limitations of each methodological approach throughout the results section, when each application is addressed. In this way, section 4.1 now focuses directly on new applications where DL could be applied.
We show in the following paragraphs how the limitations have been redistributed for each application (in italic).
Lines 439-443: “Satellite data and UAV imagery are both remote sensing data that represent a flood event seen from above. The main differences concern the scale, the resolution, and the availability. UAVs are applicable only for small areas but their resolution is higher than satellite data. UAVs can be readily used but may be unavailable in certain areas. On the other hand, satellite data is available worldwide but its frequency of observation can be limiting. Satellites can also struggle to extract information below clouded areas (e.g., Meraner et al., 2020).”
Lines 495-499: “Topographical data were the most frequent type of input. Many papers present a sensitivity analysis to determine which factors influenced the most the final results: on average, these were slope, land use, aspect, terrain curvature, and distance from the rivers (e.g., Khosravi et al., 2020; Fang et al., 2020a; Popa et al., 2019; Costache et al., 2020). A complete list of inputs is reported in the Appendix (Fig. B1). As there are several typologies of inputs, it is important to design an appropriate model to integrate heterogeneous environmental information.”
> line 804 - 809: Maybe this can be formulated a bit more soft. Initial suggestions for accounting for changing terrain etc. have been published, but there certainly is more work to be done in this direction.
We understand the point of the Reviewer. Hence, we modified the paragraph in lines 804-810 as follows:
“Current deep learning models struggle to generalize across different case studies and regions, implying that a new model must be created each time. Further problems occur when modelling the complex interactions with the natural and built environment. While some of the reviewed papers provide initial suggestions to tackle these issues, the community should invest more efforts in this direction. A possible solution to these problems is to use novel DL architectures that include meshes as learning frameworks. Mesh-based neural networks, such as graph neural networks and neural operators, can consider arbitrarily shaped domains and thus provide the required flexibility to generalize across case studies and model the effects of complex interactions.”
-
AC1: 'Reply on RC1', Roberto Bentivoglio, 22 Apr 2022
reply
-
RC2: 'Comment on hess-2022-83', Anonymous Referee #2, 07 Apr 2022
reply
This paper presents a review of the applications of deep learning models for ï¬ood inundation, susceptibility, and hazard mapping in a period between 2010 and 2021, leading to reviewing a total of 58 papers. The manuscript is well-rewritten and organized and provides information for international readers. However, two important weaknesses exist; these can both be strengthened relatively easily. (1) The objectives should be better presented. A numbered list of objectives would work better. (2) The conclusions section should be rewritten and further improved to address the objectives. This section should also include the limitations of the current work and suggestions for further research in the future. For example, the authors can suggest a more systematic Bibliometric/scientometric analysis and the use of several well-established metrics for future research. See https://www.mdpi.com/2071-1050/13/15/8261/htm are the references therein.
-
AC2: 'Reply on RC2', Roberto Bentivoglio, 22 Apr 2022
reply
>> We thank the Reviewer for the comments. We address each of them in the following lines.
> This paper presents a review of the applications of deep learning models for flood inundation, susceptibility, and hazard mapping in a period between 2010 and 2021, leading to reviewing a total of 58 papers. The manuscript is well-rewritten and organized and provides information for international readers. However, two important weaknesses exist; these can both be strengthened relatively easily. (1) The objectives should be better presented. A numbered list of objectives would work better.
>> We thank the reviewer for this suggestion. We have organized the main contributions of our paper in a list, in lines 70-76:
“The main insights from this paper can be summarized as follows:
- We identify common patterns and deduce general considerations based on the presented results while highlighting individual innovative approaches;
- We compare against traditional methods to further validate the benefits of employing DL models;
- We identify a series of current knowledge gaps and propose possible solutions to them drawing from recent advancements in DL.”
> (2) The conclusions section should be rewritten and further improved to address the objectives. This section should also include the limitations of the current work and suggestions for further research in the future. For example, the authors can suggest a more systematic Bibliometric/scientometric analysis and the use of several well-established metrics for future research. See https://www.mdpi.com/2071-1050/13/15/8261/htm are the references therein.
>> We thank the Reviewer for the suggestion. We have now modified the conclusions section to enforce a clearer connection between the research objectives and the conclusions we obtained, in lines 775-804:
“This paper presented a review of current applications of deep learning models for flood mapping. The chosen search criteria yielded a total of 58 papers published between 2010 and 2021. From our analysis we found common patterns that can be summarized as follows:
- Flood inundation, susceptibility, and hazard mapping were investigated using deep learning models. Flood inundation considers as the main data images of floods, mostly taken via satellite. The main and most accurate deep learning models were CNNs. In flood susceptibility, deep learning models consider several inputs, the most important being slope, land use, aspect, terrain curvature, and distance from the rivers. The main deep learning model used were MLPs, often in combination with other statistical techniques although CNNs provided more accurate results. Deep learning for flood hazard mapping generally involves developing surrogates of numerical models that estimate water depths in a study area. For this application, there are no deep learning model preferences. However, RNNs are preferable for spatio-temporal simulations.
- MLPs and CNNs were the most common type of deep learning model considered in flood mapping, while RNNs were less used. To overcome their lack of inductive biases and achieve good accuracy, MLPs are often coupled with other statistical techniques. On the other hand, thanks to their spatial and temporal inductive biases, CNNs and RNNs were found to regularly outperform other models.
- Most papers dealt with river and urban floods, while only a few works described applications for flash, coastal, and dam break floods. Case studies were mainly addressed at local or regional scales, arguably due to the availability of high-resolution data. Conversely, the community should further investigate the suitability of deep learning models for flood applications at larger scales.
- Concerning the development data, we found that models producing susceptibility and inundation maps rely on the availability of real flood observations. Instead, DL-based surrogate models for hazard mapping require target data from numerical simulations.
In terms of comparison with traditional and machine learning approaches we found that:
- Regardless of the application, results show that deep learning solutions outperform traditional approaches as well as other machine learning techniques.
- Deep learning models used for surrogate modeling provide significant speed-up (up to three orders of magnitude) while maintaining sufficient accuracy.
This review did not consider works featuring ML methods alone. Therefore, further research is needed to thoroughly compare ML against DL methods, especially with respect to explainability, generalization ability, and data requirements. This review also outlined several knowledge gaps, which can be addressed via deep learning to improve the state of the art of flood mapping. To solve these gaps we proposed possible solutions based on recent advances in fundamental machine learning research:”
>> Regarding the limitations of our work, we believe that our selection procedure has gathered enough relevant papers to carry out this critical review, as stated in lines 266-268. While the current literature in deep learning for flood mapping is probably not large enough for a bibliometric analysis (see Donthu et al., 2021), we thank the reviewer for recommending this approach. Indeed, we now report the suggested reference, mentioning the potential benefits of conducting a bibliometric analysis for a broader area of research, encompassing deep learning for water resources and natural hazards in lines 822-825:
“While our review draws insights for future research directions from the machine learning literature, further understanding may emerge from a broader review including deep learning applications across other water- and natural-hazard-related fields, and featuring a bibliometric analysis (Fazeli-Varzaneh et al., 2021). This approach may facilitate cross-fertilization between sister disciplines, especially with respect to the successful implementation of advanced deep learning methods for spatial analysis.”
References:
Donthu, N., Kumar, S., Mukherjee, D., Pandey, N. and Lim, W.M., 2021. How to conduct a bibliometric analysis: An overview and guidelines. Journal of Business Research, 133, pp.285-296.
Fazeli-Varzaneh, M., Bettinger, P., Ghaderi-Azad, E., Kozak, M., Mafi-Gholami, D. and Jaafari, A., 2021. Forestry research in the Middle East: A bibliometric analysis. Sustainability, 13(15), p.8261.
-
RC3: 'Reply on AC2', Anonymous Referee #2, 22 Apr 2022
reply
Thanks for addressing my comments and revising the manuscript. As far as I am concerned, the manuscript can be accepted for publication.
-
AC2: 'Reply on RC2', Roberto Bentivoglio, 22 Apr 2022
reply
-
RC4: 'Comment on hess-2022-83', Anonymous Referee #2, 22 Apr 2022
reply
The authors addressed my comments and suggestion and carefully revised the manuscript. As far as I am concerned, the manuscript can be accepted for publication.
Roberto Bentivoglio et al.
Roberto Bentivoglio et al.
Viewed
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
614 | 200 | 18 | 832 | 10 | 7 |
- HTML: 614
- PDF: 200
- XML: 18
- Total: 832
- BibTeX: 10
- EndNote: 7
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1