the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Covariance-based selection of parameters for particle filter data assimilation in soil hydrology
Abstract. Real-time in-situ measurements are increasingly used to improve the estimations of simulation models via data assimilation techniques such as particle filter. However, models that describe complex processes such as water flow contain a large number of parameters while the data available is typically very limited. In such situations, applying particle filter to a large, fixed set of parameters chosen a priori can lead to unstable behavior, i.e. inconsistent adjustment of some of the parameters that have only limited impact on the states that are being measured. To prevent this, in this study correlation-based variable selection is embedded in the particle filter, so that at each data assimilation step only a subset of the parameters is adjusted. More specifically, whenever measurements become available, the most influential (i.e., the most highly correlated) parameters are determined by correlation analysis, and only these are updated by particle filter. The proposed method was applied to a water flow model (Hydrus-1D) in which states (soil water contents) and parameters (soil hydraulic parameters) were updated via data assimilation. Two simulation case studies were conducted in order to demonstrate the performance of the proposed method. Overall, the proposed method yielded parameters and states estimates that were more accurate and more consistent than those obtained when adjusting all the parameters.
- Preprint
(814 KB) - Metadata XML
- BibTeX
- EndNote
Status: closed
-
RC1: 'Comment on hess-2021-295', Anonymous Referee #1, 30 Jun 2021
A one-dimensional exercise of data assimilation with homogeneous parameters and no benchmarking is of no interest to the readers.
Please, reconsider submitting a paper with higher complexity in the parameter description (two- or three-dimensional, heterogeneous parameters...) and benchmark it against other more commonly used approaches describing why the new method proposed should be adopted.
Citation: https://doi.org/10.5194/hess-2021-295-RC1 -
AC1: 'Reply on RC1', Alaa Jamal, 14 Sep 2021
We regret that you didn’t find much interest in our paper. Clearly, we agree that a one-dimensional model has severe limitations but we restricted ourselves to such a model in the current study for the following reason: the primary purpose of the study was to introduce the concept of real-time dynamic selection of parameters for data assimilation, which to the best of our knowledge has not be reported in previous works. We considered that a one-dimensional model, which is someone trackable, was sufficient to illustrate the approach.
With regard to comparing the proposed method with existing ones, as mentioned above to the best of our knowledge selecting dynamically which parameters should be estimated has not been suggested in the past and hence such a comparison is not relevant. However, we compared the proposed method to straightforward application of data assimilation involving all parameters in order to show that selecting dynamically a sub-set of parameters for estimation leads to improved results
Citation: https://doi.org/10.5194/hess-2021-295-AC1
-
AC1: 'Reply on RC1', Alaa Jamal, 14 Sep 2021
-
RC2: 'Comment on hess-2021-295', Anonymous Referee #2, 28 Jul 2021
Paper # hess-2021-295
Covariance-based selection of parameters for particle filter data assimilation in soil hydrology
by
Alaa Jamal and Raphael LinkerThe paper presents a new particle filter data assimilation method where only highly correlated parameters to the measures are updated by the particle filter. A combination of Monte Carlo Markov Chain method and Particle Filter with Genetic Algorithm (Jamal and Linker, 2020) is used for the estimation of the state variables and parameters. The method is then applied to a water flow model where soil water contents and hydraulic soil parameters are updated using data assimilation. The results are compared to the traditional particle filter method where all parameters are updated during data assimilation.
Comments:
1/The investigated test cases are synthetic. The authors should consider simulation of real field or laboratory cases.
2/The constitutive law describing the relationships between pressure head, conductivity and water content should be specified when describing the test studies. Is it the Mualem–van Genuchten model?
3/The parameter alpha should have a dimension [L-1].
4/The value of the parameter n should be greater than 1
5/Why the parameter n is fixed and not included in the inversion procedure? Both Mualem van Genuchten parameters alpha and n cannot be measured and both should be included in the estimation.
6/The interval of variation +-40% is acceptable for the saturated water content, but not for alpha nor KS which can vary at least by one order of magnitude. Their intervals should be significantly enlarged because of the usually lack of prior knowledge of their values.
7/Why the estimation of the tetaS in the second layer is better with C-GPFM than with CPFM? Note that the inverse can be observed with KS ? How can you explain that parameters of this layer although they are not highly correlated to the measures, they can be better estimated with C-GPFM ? I believe that the parameters in the second zone cannot be accurately estimated with neither of the methods.
8/Could you provide detailed explanation why C-GPFM is more consistent than GPFM?
9/All presented results (for all figures) should include uncertainty ranges for the estimated parameters and variables and the results should be discussed based on these uncertainties.
In sum, the conclusion is not well supported by the analysis. I can understand that C-GPFM can be more efficient than CPFM but I really don’t see why it could be more consistent or more accurate that CPFM. Further, the authors should consider real and not synthetic experiments.
I suggest major revisions.-
AC2: 'Reply on RC2', Alaa Jamal, 14 Sep 2021
1/ While testing the proposed procedure with a real field test should certainly be the next step, we believe such a test should be conducted only after the soundness of the approach has been established, which was the purpose of the present study. For this first step, working only with synthetic data has the main advantage that ground-truth values are available for both parameters and state variables (at all depths), so that convergence of the ensemble can be fully analyzed. It would not be possible to quantify the procedure performance in an objective fashion using a real measurements since (1) measurements are not perfect, (2) measurements would be available only at specific depths and (3) the “true” characteristics of the soil would of course not be known.
2/ The constitutive law was indeed the Mualem–van Genunchten model. This information will be added.
3/ This regretfully mistake will be corrected
4/ The parameters n and alpha were regretfully mistakenly substituted in Table 1. In addition, the parameters in Figure 3 should be n instead of alpha. This mistake will be corrected throughout the manuscript.
5/ None of the parameters was fixed a priori. All parameters were allowed to change, if the correlation analysis showed that such a change was “justified”. In the specific case studies reported in the manuscript, the parameter n was never selected for adjustment.
6/ In specific situations the uncertainty in Ksat could indeed be very large. However, particle filters such as used in the study tend to perform poorly when the uncertainty is very large, since it causes particle weights to become extremely small. This is most certainly a limitation of the approach (related to particle filter and not parameter selection) that should have been emphasized in the manuscript and we will correct this omission.
7/ The measurements were located close to the interface between top and middle layer, and middle and bottom layer, which explains why these measurements indeed conveyed some information about the properties of the middle layer. The fact that some of the parameters of the middle layer were indeed adjusted is a direct result of the fact that some correlation between the parameters of this layer and the measured states indeed existed during specific periods. We will add a Figure showing such correlation and expend the explanation.
8/ C-GPFM is more consistent, or provides more robust estimations, than GPFM because it prevents changes in parameters that are currently not influential but will become influential under other circumstance. To illustrate this, consider a period during which the water content remains close to field capacity. Clearly during this period the value of the residual water content is not important, but when applying GPFM (or any other particle filter) there is nothing that prevents the value of this (currently non-influential) parameter from being changed to values that are in fact worse than the current ones. If the adjusted model is then used on a dry period (where of course the value of residual water content influences the results), the performance of the adjusted model will be worse than the performance of the initial model. The proposed C-GPFM approach prevents this by applying a “don’t touch if not necessary” approach.
9/ We will add the uncertainties of the current states errors and current states to Figures 4 and 5, respectively. As mentioned in the paper (L162) parameters estimation is directly related to the state estimation so that states uncertainties provide information about parameters uncertainties as well.
Citation: https://doi.org/10.5194/hess-2021-295-AC2
-
AC2: 'Reply on RC2', Alaa Jamal, 14 Sep 2021
Status: closed
-
RC1: 'Comment on hess-2021-295', Anonymous Referee #1, 30 Jun 2021
A one-dimensional exercise of data assimilation with homogeneous parameters and no benchmarking is of no interest to the readers.
Please, reconsider submitting a paper with higher complexity in the parameter description (two- or three-dimensional, heterogeneous parameters...) and benchmark it against other more commonly used approaches describing why the new method proposed should be adopted.
Citation: https://doi.org/10.5194/hess-2021-295-RC1 -
AC1: 'Reply on RC1', Alaa Jamal, 14 Sep 2021
We regret that you didn’t find much interest in our paper. Clearly, we agree that a one-dimensional model has severe limitations but we restricted ourselves to such a model in the current study for the following reason: the primary purpose of the study was to introduce the concept of real-time dynamic selection of parameters for data assimilation, which to the best of our knowledge has not be reported in previous works. We considered that a one-dimensional model, which is someone trackable, was sufficient to illustrate the approach.
With regard to comparing the proposed method with existing ones, as mentioned above to the best of our knowledge selecting dynamically which parameters should be estimated has not been suggested in the past and hence such a comparison is not relevant. However, we compared the proposed method to straightforward application of data assimilation involving all parameters in order to show that selecting dynamically a sub-set of parameters for estimation leads to improved results
Citation: https://doi.org/10.5194/hess-2021-295-AC1
-
AC1: 'Reply on RC1', Alaa Jamal, 14 Sep 2021
-
RC2: 'Comment on hess-2021-295', Anonymous Referee #2, 28 Jul 2021
Paper # hess-2021-295
Covariance-based selection of parameters for particle filter data assimilation in soil hydrology
by
Alaa Jamal and Raphael LinkerThe paper presents a new particle filter data assimilation method where only highly correlated parameters to the measures are updated by the particle filter. A combination of Monte Carlo Markov Chain method and Particle Filter with Genetic Algorithm (Jamal and Linker, 2020) is used for the estimation of the state variables and parameters. The method is then applied to a water flow model where soil water contents and hydraulic soil parameters are updated using data assimilation. The results are compared to the traditional particle filter method where all parameters are updated during data assimilation.
Comments:
1/The investigated test cases are synthetic. The authors should consider simulation of real field or laboratory cases.
2/The constitutive law describing the relationships between pressure head, conductivity and water content should be specified when describing the test studies. Is it the Mualem–van Genuchten model?
3/The parameter alpha should have a dimension [L-1].
4/The value of the parameter n should be greater than 1
5/Why the parameter n is fixed and not included in the inversion procedure? Both Mualem van Genuchten parameters alpha and n cannot be measured and both should be included in the estimation.
6/The interval of variation +-40% is acceptable for the saturated water content, but not for alpha nor KS which can vary at least by one order of magnitude. Their intervals should be significantly enlarged because of the usually lack of prior knowledge of their values.
7/Why the estimation of the tetaS in the second layer is better with C-GPFM than with CPFM? Note that the inverse can be observed with KS ? How can you explain that parameters of this layer although they are not highly correlated to the measures, they can be better estimated with C-GPFM ? I believe that the parameters in the second zone cannot be accurately estimated with neither of the methods.
8/Could you provide detailed explanation why C-GPFM is more consistent than GPFM?
9/All presented results (for all figures) should include uncertainty ranges for the estimated parameters and variables and the results should be discussed based on these uncertainties.
In sum, the conclusion is not well supported by the analysis. I can understand that C-GPFM can be more efficient than CPFM but I really don’t see why it could be more consistent or more accurate that CPFM. Further, the authors should consider real and not synthetic experiments.
I suggest major revisions.-
AC2: 'Reply on RC2', Alaa Jamal, 14 Sep 2021
1/ While testing the proposed procedure with a real field test should certainly be the next step, we believe such a test should be conducted only after the soundness of the approach has been established, which was the purpose of the present study. For this first step, working only with synthetic data has the main advantage that ground-truth values are available for both parameters and state variables (at all depths), so that convergence of the ensemble can be fully analyzed. It would not be possible to quantify the procedure performance in an objective fashion using a real measurements since (1) measurements are not perfect, (2) measurements would be available only at specific depths and (3) the “true” characteristics of the soil would of course not be known.
2/ The constitutive law was indeed the Mualem–van Genunchten model. This information will be added.
3/ This regretfully mistake will be corrected
4/ The parameters n and alpha were regretfully mistakenly substituted in Table 1. In addition, the parameters in Figure 3 should be n instead of alpha. This mistake will be corrected throughout the manuscript.
5/ None of the parameters was fixed a priori. All parameters were allowed to change, if the correlation analysis showed that such a change was “justified”. In the specific case studies reported in the manuscript, the parameter n was never selected for adjustment.
6/ In specific situations the uncertainty in Ksat could indeed be very large. However, particle filters such as used in the study tend to perform poorly when the uncertainty is very large, since it causes particle weights to become extremely small. This is most certainly a limitation of the approach (related to particle filter and not parameter selection) that should have been emphasized in the manuscript and we will correct this omission.
7/ The measurements were located close to the interface between top and middle layer, and middle and bottom layer, which explains why these measurements indeed conveyed some information about the properties of the middle layer. The fact that some of the parameters of the middle layer were indeed adjusted is a direct result of the fact that some correlation between the parameters of this layer and the measured states indeed existed during specific periods. We will add a Figure showing such correlation and expend the explanation.
8/ C-GPFM is more consistent, or provides more robust estimations, than GPFM because it prevents changes in parameters that are currently not influential but will become influential under other circumstance. To illustrate this, consider a period during which the water content remains close to field capacity. Clearly during this period the value of the residual water content is not important, but when applying GPFM (or any other particle filter) there is nothing that prevents the value of this (currently non-influential) parameter from being changed to values that are in fact worse than the current ones. If the adjusted model is then used on a dry period (where of course the value of residual water content influences the results), the performance of the adjusted model will be worse than the performance of the initial model. The proposed C-GPFM approach prevents this by applying a “don’t touch if not necessary” approach.
9/ We will add the uncertainties of the current states errors and current states to Figures 4 and 5, respectively. As mentioned in the paper (L162) parameters estimation is directly related to the state estimation so that states uncertainties provide information about parameters uncertainties as well.
Citation: https://doi.org/10.5194/hess-2021-295-AC2
-
AC2: 'Reply on RC2', Alaa Jamal, 14 Sep 2021
Viewed
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
870 | 290 | 49 | 1,209 | 47 | 32 |
- HTML: 870
- PDF: 290
- XML: 49
- Total: 1,209
- BibTeX: 47
- EndNote: 32
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1