Prediction of groundwater quality index to assess suitability for drinking purpose using averaged neural network and geospatial analysis

Ahn, Seok Hyun; Jeong, Do Hwan; Kim, MoonSu; Lee, Tae Kwon; Kim, Hyun-Koo

doi:10.5194/hess-2022-86

Preprints

https://doi.org/10.5194/hess-2022-86

Preprints

01 Jun 2022

| 01 Jun 2022

Status: this discussion paper is a preprint. It has been under review for the journal Hydrology and Earth System Sciences (HESS). The manuscript was not accepted for further review after discussion.

Prediction of groundwater quality index to assess suitability for drinking purpose using averaged neural network and geospatial analysis

Seok Hyun Ahn, Do Hwan Jeong, MoonSu Kim, Tae Kwon Lee, and Hyun-Koo Kim

Abstract. The aims of this study were to determine the groundwater quality index (GQI) using an averaged neural network and evaluate its field applicability with two-dimensional (2D) spatial analysis. The GQI was computed using 29 water quality parameters obtained at 3,552 portable groundwater wells used as drinking water sources. The GQI was divided into the following three grades: ‘worrisome’, <0.89 (20.1 % of the wells); ‘good’, 0.89–0.94 (62.8 %); and ‘very good’, >0.94 (17.1 %). Based on the random forest, the most important water quality parameters were general bacteria, turbidity and nitrate. The 2D spatial analysis confirmed notable differences in the GQI grades among regions. The 10-year long-term groundwater quality monitoring in the ‘worrisome’ grade showed the nitrate and chloride concentrations have continuously increased. These results indicate that the coupling of the GQI with 2D spatial analysis is a promising approach that can be applied in groundwater management and vulnerability assessment.

Received: 01 Mar 2022 – Discussion started: 01 Jun 2022

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Download & links

Preprint (PDF, 1293 KB)

Supplement (844 KB)

Download & links

Seok Hyun Ahn, Do Hwan Jeong, MoonSu Kim, Tae Kwon Lee, and Hyun-Koo Kim

Status: closed

RC1:
'Comment on hess-2022-86', Anonymous Referee #1, 17 Jun 2022
This paper applied GWI to evaluate groundwater quality of 3,552 portable groundwater wells based on the 29 water quality parameters. The authors claimed that ANN and SVM models yielded the best result for GWI prediction. The research has practical applications based on the 2D spatial analysis, but its presentation can be improved as well.

Please highlight the innovation of your study in Abstract.

Introduction: It is not clear why GQI is the selected index to evaluate groundwater quality. What are the advantages/potential of using this index?

Line 35: make “a WQI suitable” to “a suitable WQI”.

The Introduction section must be written on more quality way. The research gap should be delivered on more clear way with directed necessity for the conducted research work.

It seems the major contributions of this study are using 47 water quality parameters from 8326 wells to determine the groundwater quality index (GQI) using an averaged neural network and also investigate field applicability with two-dimensional (2D) spatial analysis. I strongly suggest to explain more about these contributions in introduction to enhance the quality of this paper over previous. The novelty of this work must be clearly addressed and discussed in Introduction section.

Table S1 is not available (line 99).

Line 114: provide a reference.

Unclear sentence (line 153): “The models used include averaged neural …”

Nothing is reported about the distribution of the data, about possible correlations between them. This is to be provided.

The methods are taken in “model setup” section is not explained in such a way that the strengths and weaknesses of the methods become visible to the reader. Likewise, no reasons are given as to why the selected ensemble methods are favorable for this issue, or why data are split into 5 subsets. It is better for the reader to directly use the cited literature to classify the methods.

When presenting the RF method for feature selection, no explanation is made about the method, advantages, and disadvantages, no comparison is made to other methods, such as AIC, Gamma Test.

The Korean groundwater quality standard for each parameter is not provided in the manuscript.

Again Fig. S1, Fig. S2, Fig. S3, Fig. S4 and Fig. S5 are not available.

Nothing is said about the results of boxplots (Fig.4).

The proposed models have different parameters and model structure. How the authors do model parameter, layers, nodes, etc determination? Up to what structure and function of ANN, SVM, and Naïve Bayes models causes these models have high performance?

Using binning method for special analysis is an applicable and useful result in this study.

There is not enough explanation on different models’ prediction and feature selection results.

Please cite the following reference:

Uncertainty analysis of water quality index (WQI) for groundwater quality evaluation: Application of Monte-Carlo method for weight allocation. Ecological Indicators, 117, 106653.
Citation: https://doi.org/10.5194/hess-2022-86-RC1
- AC1: 'Reply on RC1', Tae Kwon Lee, 21 Nov 2022
  
  # Reviewer 1
  
  This paper applied GWI to evaluate groundwater quality of 3,552 portable groundwater wells based on the 29 water quality parameters. The authors claimed that ANN and SVM models yielded the best result for GWI prediction. The research has practical applications based on the 2D spatial analysis, but its presentation can be improved as well.
  
  ○ Please highlight the innovation of your study in Abstract.
  Response:
  As the reviewer commented, the previous abstract was written too general. We revised abstract to emphasize the novelty and innovation of our study.
  
  ○ Introduction: It is not clear why GQI is the selected index to evaluate groundwater quality. What are the advantages/potential of using this index?
  Response:
  We agreed with reviewer’s comments. We revised the introduction to clearly show the advantage of GQI for the groundwater management.
  
  ○ Line 35: make “a WQI suitable” to “a suitable WQI”.
  Response:
  Corrected.
  
  ○ The Introduction section must be written on more quality way. The research gap should be delivered on more clear way with directed necessity for the conducted research work. It seems the major contributions of this study are using 47 water quality parameters from 8326 wells to determine the groundwater quality index (GQI) using an averaged neural network and also investigate field applicability with two-dimensional (2D) spatial analysis. I strongly suggest to explain more about these contributions in introduction to enhance the quality of this paper over previous. The novelty of this work must be clearly addressed and discussed in Introduction section.
  Response:
  We thank you for your thought comments. We revised the introduction to show not only the advantage of GQI but the potential of GQI coupled with 2-D spatial analysis for groundwater management.
  
  ○ Table S1 is not available (line 99).
  Response:
  You can find Table S1 in Supplement.
  
  ○ Line 114: provide a reference.
  Response:
  pH is included in the water quality parameters to be measured for groundwater quality, but no water quality standards of pH are presented to determine whether drinking is suitable. Thus it was excluded for GQI calculation because it was difficult to use for GQI, which used the distance from the water quality standard.
  ○ Unclear sentence (line 153): “The models used include averaged neural …”
  Response:
  We revised this sentence for the clarity.
  
  ○ Nothing is reported about the distribution of the data, about possible correlations between them. This is to be provided.
  Response:
  We thank you for the pointing out the issues we did not consider. We added the results of comparing individual water quality parameters of groundwater according to suitability in supplementary table and described in the results section. In addition, the results of correlation analysis between water quality parameters were described in the result section.
  
  ○ The methods are taken in “model setup” section is not explained in such a way that the strengths and weaknesses of the methods become visible to the reader. Likewise, no reasons are given as to why the selected ensemble methods are favorable for this issue, or why data are split into 5 subsets. It is better for the reader to directly use the cited literature to classify the methods.
  Response:
  We selected 10 classification machine learning models included in the R package ‘caret’ to predict the GQI grades, and selected ANN with the best classification performance as the final model. The ensemble method “Random forest” was used only for feature selection as it was not able to perform feature selection in ANN.
  Despite the importance of model setup, we have not written specific details about model setup, so I also agreed with the reviewer’s comments that we can provide inconvenience to readers. We added a supplemental table describing the model setup with detail information (e.g. parameter and function) with the references which help the reader understating the methods clearly.
  ○ When presenting the RF method for feature selection, no explanation is made about the method, advantages, and disadvantages, no comparison is made to other methods, such as AIC, Gamma Test.
  Response:
  RF is a powerful method to select the features that affect classification. We did not use RF to predict GQI grades, but only used it to select features that influence classification of GQI grades. As the reviewer concerned, the selected features may vary from model to model, but it is absolutely difficult to compare because the criteria or indicators selected are different according to the model. We added the supplemental results for variations in the explanatory power of the model according to the combination of features selected by the RF, instead of directly comparing them with other models. This result may be useful information for readers because information on the classification performance according to the number and combination of water quality parameters can be quantitatively known.
  
  ○ The Korean groundwater quality standard for each parameter is not provided in the manuscript.
  Response:
  You can find this information in Table S1.
  
  ○ Again Fig. S1, Fig. S2, Fig. S3, Fig. S4 and Fig. S5 are not available.
  Response:
  All these figures were in Supplement.
  
  ○ Nothing is said about the results of boxplots (Fig.4).
  Response:
  We sorry for the unintentional mistakes. We described in detail the statistical tests for the results.
  
  ○ The proposed models have different parameters and model structure. How the authors do model parameter, layers, nodes, etc determination? Up to what structure and function of ANN, SVM, and Naïve Bayes models causes these models have high performance?
  Response:
  R package ‘caret’ allow the researcher to test more than 200 classification models with near-automatic cross validation-bootstrapping and parameter tuning, and to find the best predictive model. We used default parameters in R package ‘caret’ without any modification. As reviewer suggested, the parameters used in each model were summarized in the supplemental table.
  
  ○ Using binning method for special analysis is an applicable and useful result in this study.
  Response:
  We appreciate your support for our approach.
  
  ○ There is not enough explanation on different models’ prediction and feature selection results.
  Response:
  It seems that we have not written enough explanations for the results of model prediction and feature selection. We described the both results in more detail in the result section.
  
  ○ Please cite the following reference:
  Uncertainty analysis of water quality index (WQI) for groundwater quality evaluation: Application of Monte-Carlo method for weight allocation. Ecological Indicators, 117, 106653.
  Response:
  We added this reference in introduction section.
  
  Citation: https://doi.org/10.5194/hess-2022-86-AC1
RC2:
'Comment on hess-2022-86', Anonymous Referee #2, 10 Nov 2022
The paper describes the application of data-driven models to predict the groundwater quality index for drinking purposes using multiple water quality parameters. Groundwater quality was assessed by 2D spatial analysis and long-term monitoring results. However, the manuscript needs to be further improved in terms of novelty, literature research, and data handling and interpretation.

Abstract: The authors need to explain more the novelty and importance of their work for international communities.

Introduction: The section must be substantially improved by citing new literature.

Lines 66-70: I disagree with the authors’ statement, there are several recent works about data-driven models predicting comprehensive groundwater quality (WQI, vulnerability, suitability for drinking water,...) as shown in the examples below. Search for recent studies and emphasize improvements of the authors’ research from them.

Prediction of groundwater quality using efficient machine learning technique. Chemosphere, 276, 130265. https://doi.org/10.1016/j.chemosphere.2021.130265

Advanced utilization of multi-learning algorithm: ensemble super learner to map groundwater potential for potable mineral water, Geocarto International, DOI: 10.1080/10106049.2022.2025921

Reliability evaluation of groundwater quality index using data-driven models. Environmental Science and Pollution Research, 29(6), 8174-8190.

Lines 80-83: There is a doubt that 'Analyzing long-term monitoring results can evaluate the accuracy of the groundwater pollution vulnerability.' Details comments are described in the results section.

Figure 1 needs to be redrawn. Clarify the location of the authors’ study area by representing neighboring countries, name of country (or region), and other essential elements of map presentation for the worldwide readers.

Lines 105-106: The authors removed 4,774 wells of groundwater quality that are inappropriate for drinking purposes. Considering the title of the manuscript, it is likely more effective to analyze both appropriate and inappropriate groundwater samples to assess suitability for drinking purpose. Is there any reason the authors removed unsuitable groundwater quality for drinking water?

Lines 151-153: In the section 2.3, the method to calculate GQI and to classify GQI into the grade was already developed. Then why the classification models are necessary? I think there is no reason to use learning models including averaged neural network, RF, SVM,… whose accuracy is not perfect. Isn’t classifying according to a given classification method the most accurate?

Section 2.4: References and details are required when describing the models.

Section 2.5 also needs to be described in more detail.

Lines 198-200: It is unclear why Chungcheongbuk-do was analyzed with high resolution unlike the other regions. Is there a specific reason?

Lines 278-288: The GQI grade was estimated using groundwater quality data sampled at one time for each well. Then, what is scientific basis of the statement that long-term trends confirm the reliability of the authors’ results? What is relation between increasing (or decreasing) trend and the grade? In addition, the authors only showed a long-term trend in only one area corresponding to each grade. I disagree with the authors on the following three points.

There is no relation between long-term trend and the GQI grade.

It is impossible to confirm the reliability of the authors’ results by checking only one area per each grade.

In Figure 6B, Site A (worrisome grade) showed increasing trend in chloride but the value is lower than Site C (very good grade) showing decreasing trend. It is not up to expectation.

Discussion: Most of the discussion is a repeat of the previous sections. More in-depth discussion is required.

Figure S4 is not referred in the manuscript.
Citation: https://doi.org/10.5194/hess-2022-86-RC2
- AC2: 'Reply on RC2', Tae Kwon Lee, 21 Nov 2022
  
  # Reviewer 2
  The paper describes the application of data-driven models to predict the groundwater quality index for drinking purposes using multiple water quality parameters. Groundwater quality was assessed by 2D spatial analysis and long-term monitoring results. However, the manuscript needs to be further improved in terms of novelty, literature research, and data handling and interpretation.
  
  ○ Abstract: The authors need to explain more the novelty and importance of their work for international communities.
  Response:
  As the reviewer commented, the previous abstract was written too general. We revised abstract to emphasize the novelty and innovation of our study.
  
  ○ Introduction: The section must be substantially improved by citing new literature.
  Response:
  We have partially replaced it with a new literature.
  
  ○ Lines 66-70: I disagree with the authors’ statement, there are several recent works about data-driven models predicting comprehensive groundwater quality (WQI, vulnerability, suitability for drinking water,...) as shown in the examples below. Search for recent studies and emphasize improvements of the authors’ research from them.
  
  Prediction of groundwater quality using efficient machine learning technique. Chemosphere, 276, 130265. https://doi.org/10.1016/j.chemosphere.2021.130265
  Advanced utilization of multi-learning algorithm: ensemble super learner to map groundwater potential for potable mineral water, Geocarto International, DOI: 10.1080/10106049.2022.2025921
  Reliability evaluation of groundwater quality index using data-driven models. Environmental Science and Pollution Research, 29(6), 8174-8190.
  Response:
  We are sorry for missing the recent studies. We revised the introduction sections to emphasize our novelty and innovation from the recent studies with the recent references.
  
  ○ Lines 80-83: There is a doubt that 'Analyzing long-term monitoring results can evaluate the accuracy of the groundwater pollution vulnerability.' Details comments are described in the results section.
  Response:
  This sentence was deleted because the results of our study were not only unsupportable but could also cause confusion in the overall direction of the manuscript.
  ○ Figure 1 needs to be redrawn. Clarify the location of the authors’ study area by representing neighboring countries, name of country (or region), and other essential elements of map presentation for the worldwide readers.
  Response:
  We modified the figure 1 to have more information with neighboring countries, and name of countries. And we zoomed Korean with the original Figure 1.
  
  ○ Lines 105-106: The authors removed 4,774 wells of groundwater quality that are inappropriate for drinking purposes. Considering the title of the manuscript, it is likely more effective to analyze both appropriate and inappropriate groundwater samples to assess suitability for drinking purpose. Is there any reason the authors removed unsuitable groundwater quality for drinking water?
  Response:
  We thank you for concerning the critical points. Legally, if even one of the groundwater quality indicators exceeds the water quality standard, the groundwater cannot be drink. Since it is already legally determined whether it is suitable for drinking in a dichotomous manner, it is meaningless to provide GQI information to citizens that has been judged to be inappropriate. Therefore, by providing GQI information on the groundwater suitability for drinking, citizens will be able to obtain comprehensive water quality information and determine the possibility of contamination of groundwater in advance. In addition, it will be possible to systematically manage groundwater quality by collecting the regional GQI.
  
  ○ Lines 151-153: In the section 2.3, the method to calculate GQI and to classify GQI into the grade was already developed. Then why the classification models are necessary? I think there is no reason to use learning models including averaged neural network, RF, SVM,… whose accuracy is not perfect. Isn’t classifying according to a given classification method the most accurate?
  Response:
  Of course, GQI graded can be decided by calculating GQI. Nevertheless, there is a reason we used the classification model to predict GQI grade. First, since it is practically difficult (e.g. cost and man power) to measure each of the 48 water quality parameters, it is important to select the minimum water quality indicators that can be determined by GQI grades. Classification model can select water quality parameters that can calculate GQI grades through feature selection. According to our results, we made high performance up to 90% (11 parameters) or 95% (14 parameters) even we used lower number of water quality parameters compared to models made using all parameters. The classification model will dramatically reduce the time, cost, and labor required to measure water quality. We added this information to the results to clearly present the advantage of using the classification model.
  
  ○ Section 2.4: References and details are required when describing the models.
  Response:
  We summarized the function and parameter of the models with the references in the supplemental table.
  
  ○ Section 2.5 also needs to be described in more detail.
  Response:
  We describe the function and parameter of the RF for clarity in section 2.5.
  
  ○ Lines 198-200: It is unclear why Chungcheongbuk-do was analyzed with high resolution unlike the other regions. Is there a specific reason?
  Response:
  Even we used the groundwater qualities from 3,552 wells and binning methods, it is still insufficient data considering a national scale (e.g. Republic of Korea). We were able to determine the GQI grades for some provinces less than 30% of total province area. We wanted to show the feasibility of combining GQI with spatial analysis by selecting Chungcheongbuk-do, where the GQI grades was determined in a significant area through the binning methods among the province. And we tried the to compare the GQI results with other independent long-term water quality results. Although we gave a brief reason selecting Chungcheongbuk-do shortly in result section in original version, we revised this sentence for clarity.
  
  ○ Lines 278-288: The GQI grade was estimated using groundwater quality data sampled at one time for each well. Then, what is scientific basis of the statement that long-term trends confirm the reliability of the authors’ results? What is relation between increasing (or decreasing) trend and the grade? In addition, the authors only showed a long-term trend in only one area corresponding to each grade. I disagree with the authors on the following three points.
  - There is no relation between long-term trend and the GQI grade.
  - It is impossible to confirm the reliability of the authors’ results by checking only one area per each grade.
  - In Figure 6B, Site A (worrisome grade) showed increasing trend in chloride but the value is lower than Site C (very good grade) showing decreasing trend. It is not up to expectation.
  Response:
  Due to the nature of groundwater, pollution is localized and residual, so it takes a very long time to observe the pollution of groundwater. As mentioned earlier, the GQI we developed was developed for potable groundwater, and through the binning method, we present information on the groundwater quality level of a specific area. The fact that the GQI of a particular region is selected as "worrisome" means that the pollution is not simply going on in one well of the region, but that the pollution is going on for a long time due to the pollution source in the region. Therefore, we think it is a very important approach to find the characteristics of GQI grades by comparing them with long-term water quality measurement data from national groundwater networks. Through this process, the GQI grades may not simply provide information on the current state of water quality, but can help rapidly select areas that require groundwater quality management.
  
  Of course, I also agree that selecting an area for each GQI grade and matching GQI grade with a long-term trend in national groundwater network may be considered insufficient to explain the characteristics of the grade. There are 688 wells for total national groundwater network, and 48 wells in Chungcheongbuk-do. Unfortunately, selecting an independent region (where the GQI rating results are not present on either side) to confirm the characteristics of the region's GQI grades is very limited in terms of the wells overlapping with that of the national groundwater network. We further described the contents of these technical difficulties in the discussion as future studies.
  As groundwater move slowly through an aquifer, the groundwater qualities should be determined by the type and concentration of the geological materials and nutrients in the aquifer. The change in groundwater quality is due to external environmental factors such as human activity, and in particular, nitrate are very likely to be factors of human activity around them. In the case of chloride, it may be high due to regional characteristics, but nitrate increases due to surrounding pollutants, and generally, the concentration of chloride increases at the same time. Simply increasing one water quality indicator does not increase GQI rapidly. Because GQI increases as multiple water quality indicators approach water quality standards, it is reasonable to analyze the characteristics of GQI ratings to consider the trends of Chloride and nitrate simultaneously.
  
  ○ Discussion: Most of the discussion is a repeat of the previous sections. More in-depth discussion is required.
  Response:
  We agreed that there is a more space to improve the discussion. We revised the discussion so that the proposed GQI could be more innovative and useful compared to previous studies.
  
  ○ Figure S4 is not referred in the manuscript.
  Response:
  We are sorry for the unintentional mistake. We added the Figure S4 in the proper positions.
  
  Citation: https://doi.org/10.5194/hess-2022-86-AC2

Status: closed

RC1:
'Comment on hess-2022-86', Anonymous Referee #1, 17 Jun 2022
This paper applied GWI to evaluate groundwater quality of 3,552 portable groundwater wells based on the 29 water quality parameters. The authors claimed that ANN and SVM models yielded the best result for GWI prediction. The research has practical applications based on the 2D spatial analysis, but its presentation can be improved as well.

Please highlight the innovation of your study in Abstract.

Introduction: It is not clear why GQI is the selected index to evaluate groundwater quality. What are the advantages/potential of using this index?

Line 35: make “a WQI suitable” to “a suitable WQI”.

The Introduction section must be written on more quality way. The research gap should be delivered on more clear way with directed necessity for the conducted research work.

It seems the major contributions of this study are using 47 water quality parameters from 8326 wells to determine the groundwater quality index (GQI) using an averaged neural network and also investigate field applicability with two-dimensional (2D) spatial analysis. I strongly suggest to explain more about these contributions in introduction to enhance the quality of this paper over previous. The novelty of this work must be clearly addressed and discussed in Introduction section.

Table S1 is not available (line 99).

Line 114: provide a reference.

Unclear sentence (line 153): “The models used include averaged neural …”

Nothing is reported about the distribution of the data, about possible correlations between them. This is to be provided.

The methods are taken in “model setup” section is not explained in such a way that the strengths and weaknesses of the methods become visible to the reader. Likewise, no reasons are given as to why the selected ensemble methods are favorable for this issue, or why data are split into 5 subsets. It is better for the reader to directly use the cited literature to classify the methods.

When presenting the RF method for feature selection, no explanation is made about the method, advantages, and disadvantages, no comparison is made to other methods, such as AIC, Gamma Test.

The Korean groundwater quality standard for each parameter is not provided in the manuscript.

Again Fig. S1, Fig. S2, Fig. S3, Fig. S4 and Fig. S5 are not available.

Nothing is said about the results of boxplots (Fig.4).

The proposed models have different parameters and model structure. How the authors do model parameter, layers, nodes, etc determination? Up to what structure and function of ANN, SVM, and Naïve Bayes models causes these models have high performance?

Using binning method for special analysis is an applicable and useful result in this study.

There is not enough explanation on different models’ prediction and feature selection results.

Please cite the following reference:

Uncertainty analysis of water quality index (WQI) for groundwater quality evaluation: Application of Monte-Carlo method for weight allocation. Ecological Indicators, 117, 106653.
Citation: https://doi.org/10.5194/hess-2022-86-RC1
- AC1: 'Reply on RC1', Tae Kwon Lee, 21 Nov 2022
  
  # Reviewer 1
  
  This paper applied GWI to evaluate groundwater quality of 3,552 portable groundwater wells based on the 29 water quality parameters. The authors claimed that ANN and SVM models yielded the best result for GWI prediction. The research has practical applications based on the 2D spatial analysis, but its presentation can be improved as well.
  
  ○ Please highlight the innovation of your study in Abstract.
  Response:
  As the reviewer commented, the previous abstract was written too general. We revised abstract to emphasize the novelty and innovation of our study.
  
  ○ Introduction: It is not clear why GQI is the selected index to evaluate groundwater quality. What are the advantages/potential of using this index?
  Response:
  We agreed with reviewer’s comments. We revised the introduction to clearly show the advantage of GQI for the groundwater management.
  
  ○ Line 35: make “a WQI suitable” to “a suitable WQI”.
  Response:
  Corrected.
  
  ○ The Introduction section must be written on more quality way. The research gap should be delivered on more clear way with directed necessity for the conducted research work. It seems the major contributions of this study are using 47 water quality parameters from 8326 wells to determine the groundwater quality index (GQI) using an averaged neural network and also investigate field applicability with two-dimensional (2D) spatial analysis. I strongly suggest to explain more about these contributions in introduction to enhance the quality of this paper over previous. The novelty of this work must be clearly addressed and discussed in Introduction section.
  Response:
  We thank you for your thought comments. We revised the introduction to show not only the advantage of GQI but the potential of GQI coupled with 2-D spatial analysis for groundwater management.
  
  ○ Table S1 is not available (line 99).
  Response:
  You can find Table S1 in Supplement.
  
  ○ Line 114: provide a reference.
  Response:
  pH is included in the water quality parameters to be measured for groundwater quality, but no water quality standards of pH are presented to determine whether drinking is suitable. Thus it was excluded for GQI calculation because it was difficult to use for GQI, which used the distance from the water quality standard.
  ○ Unclear sentence (line 153): “The models used include averaged neural …”
  Response:
  We revised this sentence for the clarity.
  
  ○ Nothing is reported about the distribution of the data, about possible correlations between them. This is to be provided.
  Response:
  We thank you for the pointing out the issues we did not consider. We added the results of comparing individual water quality parameters of groundwater according to suitability in supplementary table and described in the results section. In addition, the results of correlation analysis between water quality parameters were described in the result section.
  
  ○ The methods are taken in “model setup” section is not explained in such a way that the strengths and weaknesses of the methods become visible to the reader. Likewise, no reasons are given as to why the selected ensemble methods are favorable for this issue, or why data are split into 5 subsets. It is better for the reader to directly use the cited literature to classify the methods.
  Response:
  We selected 10 classification machine learning models included in the R package ‘caret’ to predict the GQI grades, and selected ANN with the best classification performance as the final model. The ensemble method “Random forest” was used only for feature selection as it was not able to perform feature selection in ANN.
  Despite the importance of model setup, we have not written specific details about model setup, so I also agreed with the reviewer’s comments that we can provide inconvenience to readers. We added a supplemental table describing the model setup with detail information (e.g. parameter and function) with the references which help the reader understating the methods clearly.
  ○ When presenting the RF method for feature selection, no explanation is made about the method, advantages, and disadvantages, no comparison is made to other methods, such as AIC, Gamma Test.
  Response:
  RF is a powerful method to select the features that affect classification. We did not use RF to predict GQI grades, but only used it to select features that influence classification of GQI grades. As the reviewer concerned, the selected features may vary from model to model, but it is absolutely difficult to compare because the criteria or indicators selected are different according to the model. We added the supplemental results for variations in the explanatory power of the model according to the combination of features selected by the RF, instead of directly comparing them with other models. This result may be useful information for readers because information on the classification performance according to the number and combination of water quality parameters can be quantitatively known.
  
  ○ The Korean groundwater quality standard for each parameter is not provided in the manuscript.
  Response:
  You can find this information in Table S1.
  
  ○ Again Fig. S1, Fig. S2, Fig. S3, Fig. S4 and Fig. S5 are not available.
  Response:
  All these figures were in Supplement.
  
  ○ Nothing is said about the results of boxplots (Fig.4).
  Response:
  We sorry for the unintentional mistakes. We described in detail the statistical tests for the results.
  
  ○ The proposed models have different parameters and model structure. How the authors do model parameter, layers, nodes, etc determination? Up to what structure and function of ANN, SVM, and Naïve Bayes models causes these models have high performance?
  Response:
  R package ‘caret’ allow the researcher to test more than 200 classification models with near-automatic cross validation-bootstrapping and parameter tuning, and to find the best predictive model. We used default parameters in R package ‘caret’ without any modification. As reviewer suggested, the parameters used in each model were summarized in the supplemental table.
  
  ○ Using binning method for special analysis is an applicable and useful result in this study.
  Response:
  We appreciate your support for our approach.
  
  ○ There is not enough explanation on different models’ prediction and feature selection results.
  Response:
  It seems that we have not written enough explanations for the results of model prediction and feature selection. We described the both results in more detail in the result section.
  
  ○ Please cite the following reference:
  Uncertainty analysis of water quality index (WQI) for groundwater quality evaluation: Application of Monte-Carlo method for weight allocation. Ecological Indicators, 117, 106653.
  Response:
  We added this reference in introduction section.
  
  Citation: https://doi.org/10.5194/hess-2022-86-AC1
RC2:
'Comment on hess-2022-86', Anonymous Referee #2, 10 Nov 2022
The paper describes the application of data-driven models to predict the groundwater quality index for drinking purposes using multiple water quality parameters. Groundwater quality was assessed by 2D spatial analysis and long-term monitoring results. However, the manuscript needs to be further improved in terms of novelty, literature research, and data handling and interpretation.

Abstract: The authors need to explain more the novelty and importance of their work for international communities.

Introduction: The section must be substantially improved by citing new literature.

Lines 66-70: I disagree with the authors’ statement, there are several recent works about data-driven models predicting comprehensive groundwater quality (WQI, vulnerability, suitability for drinking water,...) as shown in the examples below. Search for recent studies and emphasize improvements of the authors’ research from them.

Prediction of groundwater quality using efficient machine learning technique. Chemosphere, 276, 130265. https://doi.org/10.1016/j.chemosphere.2021.130265

Advanced utilization of multi-learning algorithm: ensemble super learner to map groundwater potential for potable mineral water, Geocarto International, DOI: 10.1080/10106049.2022.2025921

Reliability evaluation of groundwater quality index using data-driven models. Environmental Science and Pollution Research, 29(6), 8174-8190.

Lines 80-83: There is a doubt that 'Analyzing long-term monitoring results can evaluate the accuracy of the groundwater pollution vulnerability.' Details comments are described in the results section.

Figure 1 needs to be redrawn. Clarify the location of the authors’ study area by representing neighboring countries, name of country (or region), and other essential elements of map presentation for the worldwide readers.

Lines 105-106: The authors removed 4,774 wells of groundwater quality that are inappropriate for drinking purposes. Considering the title of the manuscript, it is likely more effective to analyze both appropriate and inappropriate groundwater samples to assess suitability for drinking purpose. Is there any reason the authors removed unsuitable groundwater quality for drinking water?

Lines 151-153: In the section 2.3, the method to calculate GQI and to classify GQI into the grade was already developed. Then why the classification models are necessary? I think there is no reason to use learning models including averaged neural network, RF, SVM,… whose accuracy is not perfect. Isn’t classifying according to a given classification method the most accurate?

Section 2.4: References and details are required when describing the models.

Section 2.5 also needs to be described in more detail.

Lines 198-200: It is unclear why Chungcheongbuk-do was analyzed with high resolution unlike the other regions. Is there a specific reason?

Lines 278-288: The GQI grade was estimated using groundwater quality data sampled at one time for each well. Then, what is scientific basis of the statement that long-term trends confirm the reliability of the authors’ results? What is relation between increasing (or decreasing) trend and the grade? In addition, the authors only showed a long-term trend in only one area corresponding to each grade. I disagree with the authors on the following three points.

There is no relation between long-term trend and the GQI grade.

It is impossible to confirm the reliability of the authors’ results by checking only one area per each grade.

In Figure 6B, Site A (worrisome grade) showed increasing trend in chloride but the value is lower than Site C (very good grade) showing decreasing trend. It is not up to expectation.

Discussion: Most of the discussion is a repeat of the previous sections. More in-depth discussion is required.

Figure S4 is not referred in the manuscript.
Citation: https://doi.org/10.5194/hess-2022-86-RC2
- AC2: 'Reply on RC2', Tae Kwon Lee, 21 Nov 2022
  
  # Reviewer 2
  The paper describes the application of data-driven models to predict the groundwater quality index for drinking purposes using multiple water quality parameters. Groundwater quality was assessed by 2D spatial analysis and long-term monitoring results. However, the manuscript needs to be further improved in terms of novelty, literature research, and data handling and interpretation.
  
  ○ Abstract: The authors need to explain more the novelty and importance of their work for international communities.
  Response:
  As the reviewer commented, the previous abstract was written too general. We revised abstract to emphasize the novelty and innovation of our study.
  
  ○ Introduction: The section must be substantially improved by citing new literature.
  Response:
  We have partially replaced it with a new literature.
  
  ○ Lines 66-70: I disagree with the authors’ statement, there are several recent works about data-driven models predicting comprehensive groundwater quality (WQI, vulnerability, suitability for drinking water,...) as shown in the examples below. Search for recent studies and emphasize improvements of the authors’ research from them.
  
  Prediction of groundwater quality using efficient machine learning technique. Chemosphere, 276, 130265. https://doi.org/10.1016/j.chemosphere.2021.130265
  Advanced utilization of multi-learning algorithm: ensemble super learner to map groundwater potential for potable mineral water, Geocarto International, DOI: 10.1080/10106049.2022.2025921
  Reliability evaluation of groundwater quality index using data-driven models. Environmental Science and Pollution Research, 29(6), 8174-8190.
  Response:
  We are sorry for missing the recent studies. We revised the introduction sections to emphasize our novelty and innovation from the recent studies with the recent references.
  
  ○ Lines 80-83: There is a doubt that 'Analyzing long-term monitoring results can evaluate the accuracy of the groundwater pollution vulnerability.' Details comments are described in the results section.
  Response:
  This sentence was deleted because the results of our study were not only unsupportable but could also cause confusion in the overall direction of the manuscript.
  ○ Figure 1 needs to be redrawn. Clarify the location of the authors’ study area by representing neighboring countries, name of country (or region), and other essential elements of map presentation for the worldwide readers.
  Response:
  We modified the figure 1 to have more information with neighboring countries, and name of countries. And we zoomed Korean with the original Figure 1.
  
  ○ Lines 105-106: The authors removed 4,774 wells of groundwater quality that are inappropriate for drinking purposes. Considering the title of the manuscript, it is likely more effective to analyze both appropriate and inappropriate groundwater samples to assess suitability for drinking purpose. Is there any reason the authors removed unsuitable groundwater quality for drinking water?
  Response:
  We thank you for concerning the critical points. Legally, if even one of the groundwater quality indicators exceeds the water quality standard, the groundwater cannot be drink. Since it is already legally determined whether it is suitable for drinking in a dichotomous manner, it is meaningless to provide GQI information to citizens that has been judged to be inappropriate. Therefore, by providing GQI information on the groundwater suitability for drinking, citizens will be able to obtain comprehensive water quality information and determine the possibility of contamination of groundwater in advance. In addition, it will be possible to systematically manage groundwater quality by collecting the regional GQI.
  
  ○ Lines 151-153: In the section 2.3, the method to calculate GQI and to classify GQI into the grade was already developed. Then why the classification models are necessary? I think there is no reason to use learning models including averaged neural network, RF, SVM,… whose accuracy is not perfect. Isn’t classifying according to a given classification method the most accurate?
  Response:
  Of course, GQI graded can be decided by calculating GQI. Nevertheless, there is a reason we used the classification model to predict GQI grade. First, since it is practically difficult (e.g. cost and man power) to measure each of the 48 water quality parameters, it is important to select the minimum water quality indicators that can be determined by GQI grades. Classification model can select water quality parameters that can calculate GQI grades through feature selection. According to our results, we made high performance up to 90% (11 parameters) or 95% (14 parameters) even we used lower number of water quality parameters compared to models made using all parameters. The classification model will dramatically reduce the time, cost, and labor required to measure water quality. We added this information to the results to clearly present the advantage of using the classification model.
  
  ○ Section 2.4: References and details are required when describing the models.
  Response:
  We summarized the function and parameter of the models with the references in the supplemental table.
  
  ○ Section 2.5 also needs to be described in more detail.
  Response:
  We describe the function and parameter of the RF for clarity in section 2.5.
  
  ○ Lines 198-200: It is unclear why Chungcheongbuk-do was analyzed with high resolution unlike the other regions. Is there a specific reason?
  Response:
  Even we used the groundwater qualities from 3,552 wells and binning methods, it is still insufficient data considering a national scale (e.g. Republic of Korea). We were able to determine the GQI grades for some provinces less than 30% of total province area. We wanted to show the feasibility of combining GQI with spatial analysis by selecting Chungcheongbuk-do, where the GQI grades was determined in a significant area through the binning methods among the province. And we tried the to compare the GQI results with other independent long-term water quality results. Although we gave a brief reason selecting Chungcheongbuk-do shortly in result section in original version, we revised this sentence for clarity.
  
  ○ Lines 278-288: The GQI grade was estimated using groundwater quality data sampled at one time for each well. Then, what is scientific basis of the statement that long-term trends confirm the reliability of the authors’ results? What is relation between increasing (or decreasing) trend and the grade? In addition, the authors only showed a long-term trend in only one area corresponding to each grade. I disagree with the authors on the following three points.
  - There is no relation between long-term trend and the GQI grade.
  - It is impossible to confirm the reliability of the authors’ results by checking only one area per each grade.
  - In Figure 6B, Site A (worrisome grade) showed increasing trend in chloride but the value is lower than Site C (very good grade) showing decreasing trend. It is not up to expectation.
  Response:
  Due to the nature of groundwater, pollution is localized and residual, so it takes a very long time to observe the pollution of groundwater. As mentioned earlier, the GQI we developed was developed for potable groundwater, and through the binning method, we present information on the groundwater quality level of a specific area. The fact that the GQI of a particular region is selected as "worrisome" means that the pollution is not simply going on in one well of the region, but that the pollution is going on for a long time due to the pollution source in the region. Therefore, we think it is a very important approach to find the characteristics of GQI grades by comparing them with long-term water quality measurement data from national groundwater networks. Through this process, the GQI grades may not simply provide information on the current state of water quality, but can help rapidly select areas that require groundwater quality management.
  
  Of course, I also agree that selecting an area for each GQI grade and matching GQI grade with a long-term trend in national groundwater network may be considered insufficient to explain the characteristics of the grade. There are 688 wells for total national groundwater network, and 48 wells in Chungcheongbuk-do. Unfortunately, selecting an independent region (where the GQI rating results are not present on either side) to confirm the characteristics of the region's GQI grades is very limited in terms of the wells overlapping with that of the national groundwater network. We further described the contents of these technical difficulties in the discussion as future studies.
  As groundwater move slowly through an aquifer, the groundwater qualities should be determined by the type and concentration of the geological materials and nutrients in the aquifer. The change in groundwater quality is due to external environmental factors such as human activity, and in particular, nitrate are very likely to be factors of human activity around them. In the case of chloride, it may be high due to regional characteristics, but nitrate increases due to surrounding pollutants, and generally, the concentration of chloride increases at the same time. Simply increasing one water quality indicator does not increase GQI rapidly. Because GQI increases as multiple water quality indicators approach water quality standards, it is reasonable to analyze the characteristics of GQI ratings to consider the trends of Chloride and nitrate simultaneously.
  
  ○ Discussion: Most of the discussion is a repeat of the previous sections. More in-depth discussion is required.
  Response:
  We agreed that there is a more space to improve the discussion. We revised the discussion so that the proposed GQI could be more innovative and useful compared to previous studies.
  
  ○ Figure S4 is not referred in the manuscript.
  Response:
  We are sorry for the unintentional mistake. We added the Figure S4 in the proper positions.
  
  Citation: https://doi.org/10.5194/hess-2022-86-AC2

Seok Hyun Ahn, Do Hwan Jeong, MoonSu Kim, Tae Kwon Lee, and Hyun-Koo Kim

Supplement

https://doi.org/10.5194/hess-2022-86-supplement

Seok Hyun Ahn, Do Hwan Jeong, MoonSu Kim, Tae Kwon Lee, and Hyun-Koo Kim

Viewed

Total article views: 1,580 (including HTML, PDF, and XML)

HTML	PDF	XML	Total	Supplement	BibTeX	EndNote
1,114	414	52	1,580	123	55	64

HTML: 1,114
PDF: 414
XML: 52
Total: 1,580
Supplement: 123
BibTeX: 55
EndNote: 64

Views and downloads (calculated since 01 Jun 2022)

Month	HTML	PDF	XML	Total
Jun 2022	233	34	7	274
Jul 2022	40	10	0	50
Aug 2022	15	6	0	21
Sep 2022	21	5	0	26
Oct 2022	17	9	1	27
Nov 2022	61	26	6	93
Dec 2022	18	17	0	35
Jan 2023	3	6	0	9
Feb 2023	21	13	2	36
Mar 2023	8	8	0	16
Apr 2023	15	11	2	28
May 2023	6	8	1	15
Jun 2023	9	9	1	19
Jul 2023	14	16	2	32
Aug 2023	9	11	1	21
Sep 2023	19	8	2	29
Oct 2023	9	15	0	24
Nov 2023	6	6	0	12
Dec 2023	10	8	0	18
Jan 2024	12	10	2	24
Feb 2024	8	5	3	16
Mar 2024	10	16	1	27
Apr 2024	13	3	5	21
May 2024	8	5	0	13
Jun 2024	6	5	1	12
Jul 2024	7	2	1	10
Aug 2024	9	2	0	11
Sep 2024	8	2	0	10
Oct 2024	6	5	0	11
Nov 2024	6	5	1	12
Dec 2024	5	1	0	6
Jan 2025	5	5	0	10
Feb 2025	6	5	0	11
Mar 2025	13	8	2	23
Apr 2025	6	13	2	21
May 2025	7	7	1	15
Jun 2025	24	10	1	35
Jul 2025	21	11	0	32
Aug 2025	67	8	2	77
Sep 2025	287	8	1	296
Oct 2025	20	21	1	42
Nov 2025	12	17	1	30
Dec 2025	14	14	2	30

Cumulative views and downloads (calculated since 01 Jun 2022)

Month	HTML	PDF	XML	Total
Jun 2022	233	34	7	274
Jul 2022	40	10	0	50
Aug 2022	15	6	0	21
Sep 2022	21	5	0	26
Oct 2022	17	9	1	27
Nov 2022	61	26	6	93
Dec 2022	18	17	0	35
Jan 2023	3	6	0	9
Feb 2023	21	13	2	36
Mar 2023	8	8	0	16
Apr 2023	15	11	2	28
May 2023	6	8	1	15
Jun 2023	9	9	1	19
Jul 2023	14	16	2	32
Aug 2023	9	11	1	21
Sep 2023	19	8	2	29
Oct 2023	9	15	0	24
Nov 2023	6	6	0	12
Dec 2023	10	8	0	18
Jan 2024	12	10	2	24
Feb 2024	8	5	3	16
Mar 2024	10	16	1	27
Apr 2024	13	3	5	21
May 2024	8	5	0	13
Jun 2024	6	5	1	12
Jul 2024	7	2	1	10
Aug 2024	9	2	0	11
Sep 2024	8	2	0	10
Oct 2024	6	5	0	11
Nov 2024	6	5	1	12
Dec 2024	5	1	0	6
Jan 2025	5	5	0	10
Feb 2025	6	5	0	11
Mar 2025	13	8	2	23
Apr 2025	6	13	2	21
May 2025	7	7	1	15
Jun 2025	24	10	1	35
Jul 2025	21	11	0	32
Aug 2025	67	8	2	77
Sep 2025	287	8	1	296
Oct 2025	20	21	1	42
Nov 2025	12	17	1	30
Dec 2025	14	14	2	30

Viewed (geographical distribution)

Total article views: 1,498 (including HTML, PDF, and XML) Thereof 1,498 with geography defined and 0 with unknown origin.

Country	#	Views	%

Latest update: 27 Dec 2025

Short summary

We collected water quality datasets including 29 water quality parameters and 3,552 wells of groundwater for drinking. A simple water quality index with averaged neural network model and geospatial analysis are sufficient to select priority groundwater quality management areas in South Korea. We believe that our study makes a significant contribution to the water resource management.


Total:	0
HTML:	0
PDF:	0
XML:	0