A Bayesian consistent dual ensemble Kalman filter for state-parameter estimation in subsurface hydrology

Ait-El-Fquih, Boujemaa; El Gharamti, Mohamad; Hoteit, Ibrahim

doi:https://doi.org/10.5194/hess-20-3289-2016

Articles | Volume 20, issue 8

https://doi.org/10.5194/hess-20-3289-2016

© Author(s) 2016. This work is distributed under
the Creative Commons Attribution 3.0 License.

https://doi.org/10.5194/hess-20-3289-2016

© Author(s) 2016. This work is distributed under
the Creative Commons Attribution 3.0 License.

Articles | Volume 20, issue 8

Research article

|

12 Aug 2016

Research article |

| 12 Aug 2016

A Bayesian consistent dual ensemble Kalman filter for state-parameter estimation in subsurface hydrology

Boujemaa Ait-El-Fquih, Mohamad El Gharamti, and Ibrahim Hoteit

Download

Final revised paper (published on 12 Aug 2016)
Preprint (discussion started on 01 Feb 2016)
Supplement to the preprint

Interactive discussion

Status: closed

AC: Author comment | RC: Referee comment | SC: Short comment | EC: Editor comment

- Printer-friendly version

- Supplement

RC1: 'minor revision but clarification of amount of novel material', Anonymous Referee #1, 08 Mar 2016
- AC1: 'Reply to Referee #1', Boujemaa Ait-El-Fquih, 30 Mar 2016
RC2: 'comments on hess-2015-544', Anonymous Referee #2, 14 Mar 2016
- AC2: 'Reply to Referee #2', Boujemaa Ait-El-Fquih, 30 Mar 2016

Peer-review completion

AR: Author's response | RR: Referee report | ED: Editor decision

ED: Reconsider after major revisions (12 Apr 2016) by Mauro Giudici

AR by Boujemaa Ait-El-Fquih on behalf of the Authors (23 Apr 2016) Author's response Manuscript

ED: Referee Nomination & Report Request started (25 Apr 2016) by Mauro Giudici

RR by Anonymous Referee #1 (12 May 2016)

RR by Anonymous Referee #3 (03 Jun 2016)

Suggestions for revision or reasons for rejection

General comments
----------------

This paper presents a new version of the Dual Ensemble Kalman Filter, namely the Dual-EnKF_OSA, and compares it against the more traditional Joint-EnKF and the Dual-EnKF.

The material is novel and the manuscript is within the scope of HESS.

Apart from some editorial remarks that I have reported below, I have some concerns about how the initialization of the ensemble members is implemented and how it is described in the manuscript (see detailed comments below).

Based on these considerations and on my comments below, I suggest accepting the manuscript after minor revisions.

Detailed comments
-----------------

Lines 51-55: I suggest relaxing this sentence and limiting the ability to handle model structure errors to the cases presented in the cited reference (Hendricks Franssen and Kinzelbach, 1998). In fact the EnKF has been proven to be ineffective when other model structure errors such as, e.g., uncertain variogram model parameters, need to be taken into account (see, e.g., Jafarpour and Tarrahi, 2011: http://dx.doi.org/10.1029/2010WR009090 - "Assessing the performance of the ensemble Kalman filter for subsurface flow data integration under variogram uncertainty").

Lines 124-125: is it necessary to assume that parameters and state variables are independent? This doesn't seem to be realistic to me. You also state that there must be consistency between model parameters and initial hydraulic head fields (lines 439-442).

Figure 1: I suggest adding the x and y axes labels with the corresponding units.

Table 2 (caption): please correct "variorum" in "variogram".

Lines 430-442: the procedure through which you initialize your ensemble members and the motivations for doing so are not clear to me.
In more details:
lines 430-431: what is the "mean hydraulic head of the reference run solution"? Spatial mean? Temporal mean?
line 432: "randomly select" from which set?
Regardless of the fact that the same procedure has already been used by Gharamti et al. (2014), I suggest that all this initialization methodology should be explained more clearly in the manuscript.

Equations 38-39: defined in this way, and consistently with the definition of vector x in equation (1), the two metrics AAE and AESP should refer to system states only. Subsequently, you employ AAE with reference to log-conductivies (e.g., in Figure 4, 5). Please consider revising this inconsistency.

Line 476 and 478, caption of Table 3: I don't understand the reason for using the word "mean" before AESP. Shouldn't the AESP be an averaged quantity? Please consider dropping the word "mean" in "mean AESP". Otherwise state more clearly what you intend with the word "mean" in this context.

Table 3: the results presented in Table 3 in my opinion are not adequately commented: the AESP indices related to log-conductivity do not increase with ensemble size as stated at lines 477. On the same line, it is not clear why the authors say "as expected".

Lines 504-506: Does this mean that updating the model variables is more expensive than running the forward model? Commonly, the updating step consists of a few algebraic equations that can be solved in a very short time, while the forward model run usually requires more time. Could you provide further details for this behavior?

Section 5.4 (and also in many other further instances): you probably mean that the standard deviation of the measurement error is, e.g., 0.10 m, not the measurement error itself.

Hide

ED: Publish subject to minor revisions (Editor review) (15 Jun 2016) by Mauro Giudici

AR by Boujemaa Ait-El-Fquih on behalf of the Authors (29 Jun 2016)

ED: Publish subject to technical corrections (10 Jul 2016) by Mauro Giudici

I appreciated the Authors' effort in the last revision of the manuscript, which was even more effective to improve the paper quality than the first revision.
The paper is now ready for publication, provided minor techncial corrections are introduced, as listed below.
Line 55. Modify "some forms of model errors", as this expression is not informative.
Line 384. The saturated thickness is given by b=h-z_{bot}, where z_{bot} is the height of the impermeable aquifer bottom. Details about z_{bot} must be given: is it constant (horizontal aquifer bottom) or variable? If variable, please, specify how.
Lines 393 & 394. The correction can be further improved. My (TeX) suggestion is: "with a geometric mean of $10^{-13}\,\mathrm{m/s}$, a variance of $Y = \log K$ equal to 1.5". The same holds for the caption of Figure 1.
Line 476. Substitute "frequency" with "period".
Line 495. I think it is preferable the expression "time-averaged AESP".
Line 541. Check the expression "as the frequency of observations in time decreases".
Line 622. Substitute "observations sampling frequency" with "sampling period".
Table 2. Erase parentheses around the measurement units.
Figure 1. Substitute the heading of the colour scale with "$Y = \log K$, for $K$ in m/s". The same correction in Figure 8.
Figure 2. Modify the heading of the colour scale in analogy to what has been described here above for Figure 1. Also, check the values and maps, because recharge q should be given in m/s, since the dimension of q are [L/T] (see line 391).
Figure 3. The change to the labels are not sufficient to make them clearly visible and readable. Please, improve the label formats.
Figure 4. Erase "-water" from the y-axis title of the upper plot. Use "log K (K in m/s)" for the y-axis titles of the lower plot.
Figure 5. Substitute x-axis title with "Observation sampling period (days)".
Figure 7. Substitute the x-axes titles with "Time (months)".

Hide

AR by Boujemaa Ait-El-Fquih on behalf of the Authors (18 Jul 2016) Author's response Manuscript

Short summary

We derive a new dual ensemble Kalman filter (EnKF) for state-parameter estimation. The derivation is based on the one-step-ahead smoothing formulation, and unlike the standard dual EnKF, it is consistent with the Bayesian formulation of the state-parameter estimation problem and uses the observations in both state smoothing and forecast. This is shown to enhance the performance and robustness of the dual EnKF in experiments conducted with a two-dimensional synthetic groundwater aquifer model.