Integrating historical archives and geospatial data to revise  flood estimation equations for Philippine rivers

Hoey, Trevor B.; Tolentino, Pamela Louise M.; Guardian, Esmael; Perez, John Edward G.; Williams, Richard D.; Boothroyd, Richard; David, Carlos Primo C.; Paringit, Enrico C.

doi:10.5194/hess-29-6181-2025

Articles | Volume 29, issue 21

https://doi.org/10.5194/hess-29-6181-2025

© Author(s) 2025. This work is distributed under
the Creative Commons Attribution 4.0 License.

https://doi.org/10.5194/hess-29-6181-2025

© Author(s) 2025. This work is distributed under
the Creative Commons Attribution 4.0 License.

Articles | Volume 29, issue 21

Research article

|

11 Nov 2025

Research article |

| 11 Nov 2025

Integrating historical archives and geospatial data to revise flood estimation equations for Philippine rivers

Trevor B. Hoey, Pamela Louise M. Tolentino, Esmael Guardian, John Edward G. Perez, Richard D. Williams, Richard Boothroyd, Carlos Primo C. David, and Enrico C. Paringit

Download

Final revised paper (published on 11 Nov 2025)
Supplement to the final revised paper
Preprint (discussion started on 08 Jul 2024)
Supplement to the preprint

Interactive discussion

Status: closed

RC1:
'Comment on hess-2024-188', Anonymous Referee #1, 07 Aug 2024
This study evaluates 11 physical variables for index flood estimation across catchments in the Philippines, aiming to enhance flood estimation for ungauged catchments. The authors show significant effort in data collection and selection, and they present extensive analyses in this manuscript. Notice that the authors claim their study is applicable to ungauged catchments. However, if my understanding is correct, the analyses presented do not show anything regarding this applicability. While they propose using more local information to improve flood estimation—a common approach in many studies—they suggest this could benefit ungauged sites. Although this suggestion might be correct, it is overstated in the title since there are no relevant analyses or validation to support this claim. In addition, the manuscript has several critical issues regarding the quality: 1) Unclear Critical Information - There is confusion regarding the number and details of study sites, as well as incomplete or unclear descriptions of methodologies. 2) Lack of Novelty and Significant Findings - The framework lacks innovation and the findings are not particularly groundbreaking (as noted by the authors in line 527). 3) Quality and Clarity - The structure of the manuscript, along with its figures and tables (including captions), lacks quality and clarity. There are numerous mistakes throughout the document. I found it challenging to understand the authors' main points, both from the text and the figures. While the authors' efforts in conducting numerous analyses are commendable, they are strongly encouraged to improve the manuscript by enhancing its accuracy, clarity, and focus. Some specific comments (but not all) for improvement are listed below for reference:
How many catchments are analyzed exactly? Is it 513 or 466? The abstract states 513, but other parts of the manuscript (e.g., Figure 1, Table 2, line 172) suggest it is 466. Lines 164-172 are particularly confusing: how does 513 minus 205 result in 466 sites?

The catchment area sizes are analyzed in this study, but this information is missing in the data section.

Why do the catchment area groupings differ between Table 2 and Figure 3? The former uses four groups (100-200, 200-400, 400-800, <800), whereas the latter uses five groups (<25, 25-50, 50-250, 250-2500, >2500), with so different ranges.

Is the area grouping range rational? Comparing catchments across such varied groupings (<25, 25-50, 50-250, 250-2500, >2500) seems to be strange for representing hydrological responses.

The title of Section 5.1 is not coherent with its content. Moreover, it is difficult to discern the patterns the authors aim to show (lines 232-234) because Figure 3 is unclear. Improving the color and marker settings or changing the plot type (e.g., stacked bar plots) might be helpful.

Line 233: Figure 3 refers to area, not climate.

Line 233: All the others should point to Figure S5, not Figure S6.

There are references to regions 1-13 in many analyses, but no introduction or proper definition of these regions is provided. The first mention appears on line 78, without any definition.

Line 213 claims that the authors apply the FEH approach described earlier, but there is no earlier description. The term is introduced on line 50, but without sufficient detail.

Improper structure: Lines 220-227 fit better in the data section rather than the methods section.

Line 245: Why are only 71 sites analyzed here instead of the full 466 sites?

The figure captions should be more descriptive of the settings, but discussions of the results/patterns should not be included here.

Tables could be improved by adding delineation lines, clarifying headers (e.g., the last item in the first column of Table 2), and removing unnecessary information.

In summary, I acknowledge that such a study is needed for the selected country, as the authors claim there are no other similar studies to such an extent. While the analyses are comprehensive, the manuscript lacks sufficient clarity in its structure and critical information, which hampers its transferability and overall readability.
Citation: https://doi.org/10.5194/hess-2024-188-RC1
- AC2: 'Reply on RC1', Pamela Louise M. Tolentino, 01 Oct 2024
  
  Dear Editor Kumar and Referee no. 1,
  Thank you for your comments. We appreciate the feedback on our manuscript.
  Please find attached our responses to the comments.
  Thank you.
  
  Citation: https://doi.org/10.5194/hess-2024-188-AC2
RC2:
'Comment on hess-2024-188', Anonymous Referee #2, 27 Aug 2024
The paper by Hoey et al., entitled ‘Flood estimation for ungauged catchments in the Philippines’ aims at delivering design equations to estimate flood magnitudes and frequency in ungauged catchments across the Philippines.
The paper addresses an interesting topic, the design of equations to predict floods in data scarce region. The authors do enormous work to analyse the data. However, the methodology is not well defined/structured. Overall, the paper lacks clarity and needs to be restructured for better clarity.
General comments:
The title of the paper does not reflect the content of the paper appropriately. The authors propose and evaluate the index flood approach and multi-variate regression to estimate flood. The approach is evaluated only at catchments with data used to fit the regression equations. In a context of flood estimation in ungauged catchments, the paper is lacking the following steps: 1) a clear definition of the methodology used for regionalization to ungauged catchment, 2) a cross-validation to evaluate the performance of the designed equations in ungauged catchments. It is necessary to evaluate how well the proposed approach would perform in ungauged catchments. Therefore, I suggest the authors consider using some cross-validation approach (i.e., leave-one-out or k-cross validation) where a set of catchments are used to fit the design equations; then the remaining catchment(s) are used as pseudo-ungauged catchment(s) to evaluate the accuracy of the designed equations in ungauged catchments. In addition, the methodology used is not clearly described (see specific comments below). The paper could benefit from a flow diagram that describes the steps followed to estimate floods at ungauged catchments.
Specific comments:
The authors fit and compare the accuracy of the three distributions showing that the choice of the distribution influences flow estimates and no distribution performs well at all sites. Then, the GLO distribution is selected to predict high flow magnitude in section 4.2 with no justification of this choice. In addition, Equation 5 , the factorial standard error for the GLO distribution, is only applicable when number of records is at least 20 years. The study region has only 71 sites (line 245), with more than 20 years data, among the 466 sites retained for the analysis (Line 234) . Why choose the GLO distribution if it cannot be applicable to all sites? Why choose only GLO instead of using the best fitted distribution at each site as shown in table 2?

The last paragraph in section 4.1, the authors lists three ways the construct growth curves by combining the curves fitted from each site by region, climate type or catchment areas. Therefore, I am assuming that the resulting curves are regionalized curves where catchments from within each group(region or climate type, catchment area bin) would be represented by a single curve. In other words, catchments of each group would have a single Q_T value estimate from the regionalized growth curve. How the author use these results to perform a correlation analysis with catchment properties in Section 4.2 (Line 212: values of Q_T provided by the GLO analysis previously, were correlated with catchment properties”). ?

The authors use precipitation dataset covering only 17 years (from 1998-2015) which is not sufficient to derive climatology of the catchments. In addition, this dataset are not available for a period that overlaps with flows records period. As suggestion, the author could use precipitation datasets from reanalysis products (e.g. ERA5) which are available from 1940 to present around the globe. It would cover the same period as flow records for most catchments and provide sufficient long timeseries to derive climatology for precipitation variables used in Table 3.

Why the fitted curves from individual sites are not used to fit a relationship between flood magnitude and catchment area?

It is unclear how the individual short historical records are combined to generate large dataset that produce consistent results.

Results presented in Section 5.3.3 (Eq6a-6c) would be better presented in a table with equation and associated R² and RMSE in the same way as in Table 5.

Overall, the way the methodology and the results are presented is incoherent and lacks clarity.
Citation: https://doi.org/10.5194/hess-2024-188-RC2
- AC1: 'Reply on RC2', Pamela Louise M. Tolentino, 01 Oct 2024
  
  Dear Editor Kumar and Referee no. 2,
  Thank you for your comments. We appreciate the feedback on our manuscript.
  Please find attached our responses to the comments.
  Thank you.
  
  Citation: https://doi.org/10.5194/hess-2024-188-AC1

Peer review completion

AR: Author's response | RR: Referee report | ED: Editor decision | EF: Editorial file upload

ED: Reconsider after major revisions (further review by editor and referees) (14 Oct 2024) by Rohini Kumar

AR by Pamela Louise M. Tolentino on behalf of the Authors (20 Dec 2024) Author's response Author's tracked changes Manuscript

ED: Referee Nomination & Report Request started (20 Dec 2024) by Rohini Kumar

RR by Anonymous Referee #2 (29 Jan 2025)

RR by Anonymous Referee #3 (18 Feb 2025)

Suggestions for revision or reasons for rejection

# HESS-2024-188 Review

This manuscript titled `Integrating historical archives and geospatial data to revise flood estimation equations for Philippine rivers` is a `revision of Flood estimation for ungauged catchments in the Philippines` manuscript that has been submitted to Hydrology and Earth System Sciences.\
The authors have addressed the comments raised by the reviewers in the previous round of review.\
Therefore, it was time consuming to compare the previous and current versions of the manuscript.

The revised manuscript presents an ambitious effort to combine short-duration historical records from diverse sources into a single, coherent analysis for deriving flood design equations at a national scale.

I suggest the authors consider the following points in their revision:

## Major Comments

Although the authors mentioned uncertainty many times in the manuscript, they did not provide any quantification of the uncertainty in their results. This is a major issue that needs to be addressed.\
Also, in general, flood frequency analysis is not a proper method to estimate flood magnitude when you have limited data. Fitting a curve to 7–10 data points is not a reliable method to estimate flood magnitude.

### Abstract

While the abstract effectively conveys the general research objective and findings, in my opinion it may need some revisions to improve clarity and precision. When you start to mention R² and then express the added value of including the new variables, the sentence is not clear (L25-27). I suggest you revise it. It would benefit from a clearer statement regarding the limitations of the low R² values and the implications for design uncertainty.

### Introduction

The introduction provides an overview of the study. It briefly describes the importance of catchment area and mean annual rainfall as predictors of flood magnitude. The authors tried to highlight the impact of pooling data from available sources to improve flood estimation when the data are limited in time and space.\
I suggest merging the two middle paragraphs of the introduction to make it more concise and clear.

Also, the hypothesis and research questions of the study are not clearly stated in the introduction. It would be better to state them explicitly.

## Methodology

- In the section `Data sources`, the authors provide a detailed description of the data sources and the process of data collection. Different sources introduce distinct uncertainties and biases into the analysis. For example, in Figure 1, some sources (red and blue dots) are more concentrated in regions such as the north of the country, while in the west and south—where there is lower rainfall and lower contribution from tropical cyclones—we have no or only one source of data. This may introduce bias in the analysis. The authors should discuss this issue in the manuscript.
- More details regarding the screening criteria for data quality and the rationale behind the selected catchment properties would improve transparency. For example, three sources of data are used in the analysis; while they were recorded differently, in different periods of time, and likely with different measurement techniques, the method of merging these data should be discussed in the manuscript. It is highly likely that the quality of measurements before the 1980s is lower.

### Analysis Methods

- I am curious to know whether you ever tried to employ two peaks per year or any POT analysis to identify the peaks in the data, instead of only using the annual maximums. This approach would give you more freedom not only to select the highest peak in the year but also the second highest independent peak in the year. This could help you better understand the flood frequency in the region, as the second peak may occur in another season and allow you to better capture your basin’s behavior. Then, you could continue to determine Q_med of the new series of peaks.
- The manuscript provides a thorough description of the curve fitting using L-moments and the subsequent regression analyses. Yet, the discussion on the potential biases arising from combining data of varying quality and the choice of best-fit distributions (with respect to low R² values) deserves further elaboration. Moreover, since the study aims to estimate extreme floods, linear regression may not be the best approach. The authors should consider using a more robust method, such as quantile regression, to account for the non-linear relationship between the predictors and the response variable.
- What is the set threshold of low CvM p-values used to exclude data from the analysis in L183?

## Results

- The correlation approach in Table 4 does not lead to a new conclusion. The fact that a larger catchment area leads to a higher correlation is not a new finding. It is the same with the DPLBAR variable, the length of the streamflow network, and the mean annual rainfall. Therefore, your addition in Table 5 should be highlighted. I suggest restructuring the results section to emphasize the new findings of the study.
- Perhaps testing and illustrating your approach on only the new dataset as a test case would be a good idea to show the robustness of your approach. This will also help in understanding the uncertainty in the results.
- What would be the expected best R² value by adding the new variables? It would be better to have a benchmark to compare the results. What is the ideal R² value for flood frequency analysis in the region? Is the benchmark 0.92 in Papua New Guinea? You could randomly generate some synthetic data and try to estimate the flood frequency analysis to see the ideal R² value.
- As you mentioned, land use change is a major factor in flood frequency analysis, and you employed almost current land use data in the analysis. This is a significant challenge and limitation of the study.
- The abbreviations in this study are not mathematically scientific, such as AREA or RMED. It would be better to use the full names of the variables in the text and use better letters for the variables. For example, A for area, and R_m for RMED, and so on.
- Since the results are mainly presented on Q10 and they are not significantly appropriate for flood control and design, it would be better to include a discussion on the results and the limitations of the study.

## Discussion

- It would be valuable to discuss the limitations (e.g., stationarity assumptions, data quality issues, and land use change) more explicitly and to outline potential paths for future improvement, such as incorporating non-stationary models or enhancing continuous monitoring.
- Tropical cyclones were not part of your investigation; however, they play a role in the discussion.
- Climate change and spatiotemporal variability in the region are not discussed in the manuscript at all, despite the merged data varying over time.
- The discussion section is generally long. I suggest revising it to be brief, more concise, and clear. However, the current form is good for readers to understand the results and limitations of the study.
- The comparison with HEC-HMS modeling lends additional credibility, though the discussion might be expanded to explain the practical implications of the observed discrepancies between instantaneous peak flows and daily mean flow estimates.

## Conclusion

- The conclusion is well-structured and effectively summarizes the key findings of the study.

## Minor Comments

**Abstract:**

- L18: Split the long sentence `However, the global ...` into two sentences for clarity. The current sentence contains four commas.
- L23: What does `national and regional scales` mean? Are they two different scales?
- L25: The term `GIS-derived` is not needed here. You can simply say `geospatial catchment characteristics`.
- L30: There is a redundancy with the term `predictive equation` in the same sentence.

**Introduction:**

- L43: The sentence `The resulting equations ...` is not well connected to the previous sentence. The starting lines are quite good, but there is a gap between the first and second parts of the first paragraph.
- L79: A reference to Figure S1 is needed.
- Please provide a map of the available length of time series in the Philippines. This will help in understanding the data availability in the country (L80). Although the time period is indicated in Table 1, it is not clear whether the records are continuous or if there are gaps in the data. Alternatively, you can provide some sentences in the text to explain this issue.
- How do you define short time series? Is it less than 35 years? (L80). It would be better to provide a definition for short time series or a reference for the definition.
- L84: The `FEH` abbreviation has already been defined previously in L50.

**Data sources:**

- Figure 1: In the caption, it is mentioned that `the four climate types that have been identified for the Philippines (Coronas, 1920)`. Since the climate types were identified in the 1920s, is there any more recent climate type identification for this region? Given global warming and climate change, the climate types may have changed or been better defined in recent years.
- Figure 1: Please replot panels b and c and use discrete colors instead of gradient colors. Also, Figure 1C does not support any of your results except for a sentence in the conclusion. It would be better to remove it from the manuscript or integrate its insight into your interpretation.
- I suggest moving Table 1 to the supplementary material, as it is not necessary in the main text.

**Analysis Methods:**

- To achieve more consistency in the manuscript text, I suggest adding Q5 and Q50 in Figure 2, and so on, in your text.
- Since Table 2 does not show any relation between the size of the catchment, climate type, and the best-fit curve, is there any geographical pattern in the best-fit curve? For example, do catchments in the north of the country have the same best-fit curve? What if you plot the best-fit curve on the map of the country? Usually, subcatchments in the same basin may have the same best-fit curve since they are flow-connected.
- L204: The phrase `(Figure S1) show this pattern` is unclear. I have not seen this pattern in Figure S1. Please revise the text. The mentioned figure is `Administrative regions of the Philippines`. Since the numbering starts from north to south, it would be better to reorder the legend of the figure to follow the same order instead of alphabetical order.
- L208: The reference to Figure S2 is incorrect. It is currently written as `(Figures 2, S1)`; it should be `(Figure S2)`, as seen in the supplementary material. Also, the figure itself is not well plotted.
- The quality of Figure S3 is too low. It is not readable. Please revise it. The current figure overlays the main curves on top of each other. The area of concentration should be zoomed in to see the differences between the curves on the right part of the x-axis.
- The same applies to Figures S4 and S5. However, they are slightly better. I think these figures can be highlighted for regionalization since they are important for understanding how the curves differ by region, climate type, and catchment size. The current format does not help in understanding the differences between the regions.
- L247: As far as I know, we have free global DEMs with 30 m resolution. So why did you resample the DEM?

**Results:**

- L287: The phrase `This contrasts with Meigh’s` should be moved to the conclusion.
- Table 4: Instead of `NA`, write `-` in the table.
- Theoretically, your Q_med should be equivalent to your Q2 when you have a limited length of time series. If you look at Table 4, the columns Q_med and Q2 are almost identical. Also, the correlation is sensitive to the number of data points.
- Table 5: The alignment of the table is not correct. Please revise it, and make it more readable.
- Figure 5 and subsequent figures: Please elaborate on "normally distributed" in the figure caption, especially for subplots b, c, e, and f.
- Set a fixed significance level for the p-values in the text. In section 5.3.3 it is 0.05, while previously it was 0.01.
- Figure S7 must be revised. The current figure is not readable enough; it is a bit small, and the selected colors do not help the readers understand it. Also, since this figure has three parts, the main body of the manuscript does not support it well.

**Discussion:**

- Figure 8: It seems that the x-axis of panel b is not correct. Please revise it.

This study contributes to hydrological modeling by demonstrating how pooling individually short historical flood records—combined with high-resolution geospatial data—can produce nationally applicable flood estimation equations even in data-sparse tropical regions. The `Recommended design equations` section is a part that the authors may consider including in their analysis.\
I suggest authors consider the above points in their revision and I look forward to seeing the revised manuscript.

Referee Report: PDF

Hide

ED: Publish subject to revisions (further review by editor and referees) (19 Feb 2025) by Rohini Kumar

AR by Pamela Louise M. Tolentino on behalf of the Authors (02 Apr 2025) Author's response Author's tracked changes Manuscript

ED: Referee Nomination & Report Request started (12 Apr 2025) by Rohini Kumar

RR by Anonymous Referee #3 (11 Jun 2025)

Suggestions for revision or reasons for rejection

HESS-2024-188 Review on the Revised Version 3

This manuscript, titled "Integrating historical archives and geospatial data to revise flood estimation equations for Philippine rivers," presents a valuable contribution toward developing national-scale flood design equations in a data-scarce region. While the authors have addressed several reviewer comments, two central issues—particularly concerning uncertainty quantification—remain insufficiently resolved.

Major Comments:

- Uncertainty Quantification
The manuscript presents residual plots (Figures 5 and 6), residual mapping (Figure 7), and summary statistics, including R² and standard error (Tables 6 and 7), as indicators of uncertainty. While informative for model diagnostics, these are not substitutes for a formal uncertainty analysis.
A robust treatment of uncertainty in flood design requires:
- Confidence or prediction intervals for Q estimates (e.g., Q2, Q10, Q100)
- Sensitivity analysis to evaluate how changes in AREA, RMED, and other predictors affect outcomes
- Uncertainty propagation to assess how uncertainties in input data, model parameters, and structural assumptions influence the final flood estimates across regions

None of these are implemented in the current analysis.
For instance:
Dashed lines in Figure 5 are visual guides (1:1, 1:2, 2:1) and do not represent statistical intervals.
Figure 8 includes approximate 95% prediction intervals (±2σ) for selected points, but these are only applied to the DREAM model comparison and not to the core regression framework.
While steps such as data screening, exclusion of unreliable records, and residual inspection show good methodological care, they do not quantify how uncertainty in inputs translates into uncertainty in outputs. Without prediction intervals or sensitivity diagnostics, practitioners cannot assess the confidence of the resulting design estimates—an essential requirement for engineering or policy use.

Figures
Some supplementary figures (e.g., S3–S7) remain hard to interpret due to dense overlays or unclear legends. Improving their clarity would enhance the presentation and facilitate the interpretation of the results.
The figures lack visual consistency in terms of font style, axis scale, and label size. A uniform formatting standard across all figures would improve readability and presentation quality.

Minor Comments

- Methodological Choices
The continued reliance on linear regression—despite suggestions to consider more flexible or robust methods such as quantile regression—is a missed opportunity. Given the goal of estimating extremes and the relatively low R² values reported, this limitation should at least be acknowledged and addressed as a direction for future work.

- Data Limitations
The manuscript rightly notes that many sites have short records (7–10 years), and data are pooled across sources and decades. However, it does not assess how these factors affect flood frequency estimation or prediction accuracy. Similarly, the use of annual maxima overlooks possible seasonal structures or secondary peaks, which may lead to underestimation in some basins. Please reflect on these points in the manuscript text.

Overall Assessment
This study addresses a significant problem in a complex data environment and makes a valuable contribution to the field. However, the lack of quantitative uncertainty analysis and the quality of the figures restrict its practical applicability. I recommend a focused revision addressing the following:
- Inclusion of prediction intervals for design estimates
- Clarification or expansion of the uncertainty framework
- Consideration or discussion of alternative statistical approaches

With these revisions, the manuscript will be much stronger and more useful for both scientific and applied hydrology audiences.
I recommend further revision of these remaining points to address the gaps before publication.

Hide

ED: Publish subject to minor revisions (review by editor) (27 Jun 2025) by Rohini Kumar

AR by Pamela Louise M. Tolentino on behalf of the Authors (14 Aug 2025) Author's response Author's tracked changes Manuscript

ED: Publish as is (15 Aug 2025) by Rohini Kumar

AR by Pamela Louise M. Tolentino on behalf of the Authors (25 Aug 2025) Manuscript

Short summary

Estimating the sizes of flood events is critical for flood risk management and other activities. We used data from several sources in a statistical analysis of flood sizes for rivers in the Philippines. Flood size is mainly controlled by the size of the river catchment, along with the volume of rainfall. Other factors, such as land use, appear to play only minor roles in flood size. The results can be used to estimate flood size for any river in the country, alongside other local information.