Socio-hydrological data assimilation: analyzing human–flood interactions by model–data integration

Sawada, Yohei; Hanazaki, Risa

doi:https://doi.org/10.5194/hess-24-4777-2020

Articles | Volume 24, issue 10

https://doi.org/10.5194/hess-24-4777-2020

Articles | Volume 24, issue 10

Research article

05 Oct 2020

Research article |

| 05 Oct 2020

Socio-hydrological data assimilation: analyzing human–flood interactions by model–data integration

Yohei Sawada and Risa Hanazaki

Abstract

In socio-hydrology, human–water interactions are simulated by mathematical models. Although the integration of these socio-hydrological models and observation data is necessary for improving the understanding of human–water interactions, the methodological development of the model–data integration in socio-hydrology is in its infancy. Here we propose applying sequential data assimilation, which has been widely used in geoscience, to a socio-hydrological model. We developed particle filtering for a widely adopted flood risk model and performed an idealized observation system simulation experiment and a real data experiment to demonstrate the potential of the sequential data assimilation in socio-hydrology. In these experiments, the flood risk model's parameters, the input forcing data, and empirical social data were assumed to be somewhat imperfect. We tested if data assimilation can contribute to accurately reconstructing the historical human–flood interactions by integrating these imperfect models and imperfect and sparsely distributed data. Our results highlight that it is important to sequentially constrain both state variables and parameters when the input forcing is uncertain. Our proposed method can accurately estimate the model's unknown parameters – even if the true model parameter temporally varies. The small amount of empirical data can significantly improve the simulation skill of the flood risk model. Therefore, sequential data assimilation is useful for reconstructing historical socio-hydrological processes by the synergistic effect of models and data.

Download & links

Article (PDF, 3285 KB)

Supplement (1149 KB)

Download & links

How to cite.

Received: 16 Jan 2020 – Discussion started: 19 Feb 2020 – Revised: 07 Aug 2020 – Accepted: 14 Aug 2020 – Published: 05 Oct 2020

1 Introduction

Socio-hydrology is an emerging research field in which two-way feedback between social and water systems is investigated (Sivapalan et al., 2012, 2014). Understanding complex socio-hydrological phenomena contributes to solving water crises around the world. Socio-hydrology has been recognized as an important scientific grand challenge in meeting the United Nations' Sustainable Development Goals (Di Baldassarre et al., 2019).

The most popular approach to socio-hydrology is developing dynamic models which compute nonlinear interactions between humans and water. For instance, Di Baldassarre et al. (2013) developed a simplified model, which described human–flood interactions, to understand the levee effect in which high levees generate a false sense of security and induce social vulnerabilities to severe floods in communities (see also Viglione et al., 2014; Ciullo et al., 2017). Van Emmerik et al. (2014) developed a stylized model, which described two-way feedback between the environment and economic activities, to understand the historical competition for water between agricultural development and environment health in Australia (see also Roobavannan et al., 2017). Pande and Savenije (2016) modeled economic activities of smallholder farmers to analyze the agrarian crisis in Marathwada, India. While the socio-hydrological models described above assumed the existence of a single lumped decision maker, Yu et al. (2017) incorporated a collective action into their model and analyzed the dynamics of community-managed flood protection systems in coastal Bangladesh. Please refer to Di Baldassarre et al. (2019) for a comprehensive review of socio-hydrological modeling.

In addition to these modeling approaches, both qualitative and quantitative data related to socio-hydrological processes are important for understanding human–water interactions. For instance, Mostert (2018) revealed historical changes in river management, from water resources development to protection and restoration, by analyzing qualitative data. Dang and Konar (2018) applied econometric methods to analyze quantitative data in both human and water domains and quantified the causal relationship between trade openness and water use. Kreibich et al. (2017) performed a detailed case study analysis on paired floods, i.e. consecutive flood events which occurred in the same region with the second flood causing significantly lower damage. They found that the reduction in vulnerability played a key role in the successful adaptation to the second flood.

Although it is expected that the integration of model and data contributes to accurately understanding the socio-hydrological processes (Mount et al., 2016), the methodological development of the model–data integration in socio-hydrology is in its infancy. Generally, mathematical models can provide spatiotemporally continuous state variables and quantitative scenarios for future socio-hydrological developments. In addition, mathematical models can quantitatively provide possible scenarios unrealized in the real world, which gives insight to targeted processes (e.g., Viglione et al., 2014). The major limitation of socio-hydrological models is that they are often inaccurate due to the uncertainty in their input forcing, parameters, and descriptions of the processes. On the other hand, hydrological and social data are often more reliable than numerical models and can provide a more complete understanding of the socio-hydrological processes (e.g., Mostert, 2018), although data also have uncertainties. However, in many cases, relevant data in socio-hydrology are sparsely distributed so that it is difficult to completely reconstruct the historical socio-hydrological processes from data. The other limitation of the data-driven approach is that the quantification of the causal relationship cannot be easily done by empirical data only (e.g., Dang and Konar, 2018). Considering the advantages and disadvantages of model and data, previous studies used social statistics to calibrate and validate their socio-hydrological models (e.g., Barendrecht et al., 2019; Roobavannan et al., 2017; Ciullo et al., 2017; van Emmerik et al., 2014; Gonzales and Ajami, 2017).

In geosciences, sequential data assimilation has been widely used for the model–data integration. Data assimilation sequentially adjusts the predicted state variables and parameters of dynamic models by integrating observation data into models based on Bayes' theorem. Data assimilation has been widely applied to numerical weather prediction (e.g., Miyoshi and Yamane, 2007; Bauer et al., 2015; Poterjoy et al., 2019; Sawada et al., 2019), atmospheric reanalysis (e.g., Kobayashi et al., 2015; Hersbach et al., 2019), and hydrology and land surface modeling (e.g., Moradkhani et al., 2005; Sawada et al., 2015; Rasmussen et al., 2015; Lievens et al., 2017). The applicability of the data assimilation approach to socio-hydrological models has yet to be investigated.

In this study, we aim to develop the methodology of sequential data assimilation for the flood risk model proposed by Di Baldassarre et al. (2013). From a series of idealized experiments and a real data experiment in the city of Rome, we demonstrate the potential of data assimilation to accurately reconstruct the historical human–flood interactions. We focus on the case in which the socio-hydrological model's parameters, input forcing data, and social data are somewhat inaccurate.

2 Method

2.1 Model

In this study, we used a socio-hydrological flood risk model proposed by Di Baldassarre et al. (2013). This model conceptualizes human–flood interactions by a set of simple equations which describe the states of flood, economy, technology, politics, and society. Based on this original model of Di Baldassarre et al. (2013), many similar flood risk models have been proposed, validated, and applied (e.g., Viglione et al., 2014; Ciullo et al., 2017; Barendrecht et al., 2019). Here we briefly describe this model. Please refer to Di Baldassarre et al. (2013) for a complete description of this model.

The governing equations of the flood risk model are shown as follows:

\begin{array}{l} (1) & F = \{\begin{array}{lcl} 1 - \exp (- \frac{W + ξ_{H} H}{α_{H} D}) & if & W + ξ_{H} H > H \\ 0 & if & W + ξ_{H} H \leq H \end{array} \\ (2) & R = \{\begin{cases} ε_{T} (W + ξ_{H} H - H) if (F > 0) and \\ (F G > γ_{E} R \sqrt{G}) and (G - F G > γ_{E} R \sqrt{G}) \\ 0 otherwise \end{cases} \\ (3) & S = \{\begin{array}{lcl} α_{S} F & if & (R > 0) \\ F & if & (R = 0) \end{array} \\ (4) & \frac{d G}{d t} = ρ_{E} (1 - \frac{D}{λ_{E}}) G - Δ (Υ (t)) (F G + γ_{E} R \sqrt{G}) \\ (5) & \frac{d D}{d t} = (M - \frac{D}{λ_{P}}) \frac{φ_{P}}{\sqrt{G}} \\ (6) & \frac{d H}{d t} = Δ (Υ (t)) R - κ_{T} H \\ (7) & \frac{d M}{d t} = Δ (Υ (t)) S - μ_{S} M . \end{array}

This model has four state variables, namely G, D, H, and M. G(t) (L²) is the size of the human settlement, D(t) (L) is the distance of the center of the mass of the human settlement from the river, H(t) (L) is the flood protection level (or levee height), and M(t) (.) is the social awareness of the flood risk. The time step was set to annual.

Equation (1) calculates the intensity of the flooding events F(t) (.) from the high water level W(t) (L), the height of the levee H(t) (L), and the distance of the human settlement from the river D(t) (L). Equation (2) calculates R(t) (L), the amount by which the levees are raised in response to the flood event. There are three required conditions under which people decide to raise the levee. First, the flood event occurs. Second, the damage of the flood (FG) should be larger than the cost of raising the levee. Third, the cost of raising levee should be lower than the wealth remaining after the flooding. Equation (3) shows the magnitude of the psychological shock caused by the flood event S(t) (.). If the levee is raised, the psychological shock is assumed to be mitigated. Equation (4) explains the dynamics of G(t), the size of the human settlement or the wealth of the community. Following the notation of Di Baldassarre et al. (2013), Δ(Υ(t))=1, with the integral only when time, t, passes the time of the flooding event (F>0), otherwise Δ(Υ(t))=0. The term $FG + γ_{E} R \sqrt{G}$ (total cost of flood damage and construction of levees) appears only if a flood occurs. Equation (5) shows the dynamics of the distance of the center of the mass of the human settlement from the river D(t). When the social awareness of the flood risk is high, people tend to live far from the river. Equation (6) computes the dynamics of the flood protection level H(t), and Eq. (7) shows the dynamics of the social awareness of the flood risk M(t). The explanation of the parameters can be found in Table 1.

Table 1Parameters of the flood risk model.

Download Print Version | Download XLSX

2.2 Data assimilation

In this study, we used a sampling importance resampling particle filtering (SIRPF) algorithm as a method of data assimilation. The SIRPF algorithm has been widely used in hydrological data assimilation (e.g., Moradkhani et al., 2005; Qin et al., 2009; Sawada et al., 2015). Compared with the other data assimilation algorithms, such as the ensemble Kalman filter, SIRPF is robust against model nonlinearity and associated non-Gaussian error distribution. The disadvantage of SIRPF is that the infeasible computational resources are required if the numerical model is computationally expensive, which is not the case in the flood risk model.

The flood risk model can be formulated as a discrete state–space dynamic system as follows:

\begin{matrix} (8) & x (t + 1) = f (x (t), θ, u (t)) + q (t), \end{matrix}

where x(t) is the state variable (i.e., G, D, H, and M), θ is the model parameters, u(t) is the external forcing (i.e., the high water level), and q(t) is the noise process which represents the model error. In data assimilation, it is useful to formulate an observation process as follows:

\begin{matrix} (9) & y^{f} (t) = h (x (t)) + r (t), \end{matrix}

where y^f(t) is the simulated observation, h is the observation operator which maps the model's state variables into the observable variables, and r(t) is the noise process which represents the observation error.

The SIRPF algorithm is a Monte Carlo approximation of a Bayesian update of the state variables and parameters as follows:

\begin{matrix} (10) & \begin{aligned} p (x (t), θ |y^{o} (1 : t)) \\ \propto p (y^{o} (t) |x (t), θ) p (x (t), θ |y^{o} (1 : t - 1)), \end{aligned} \end{matrix}

where $p (x (t), θ |y^{o} (1 : t))$ is the posterior probability of the state variables x(t) and parameters θ given all observations up to time t y^o(1:t). The prior knowledge, $p (x (t), θ |y^{o} (1 : t - 1))$ , based on the model integration, is updated using the likelihood, which includes the new observation at time t p(y^o(t)|x(t),θ). In this study, we assumed that our observation error follows a Gaussian distribution so that the likelihood can be formulated as follows:

\begin{matrix} (11) & \begin{aligned} p (y^{o} (t) |x (t), θ) \equiv L (y^{o} (t), x (t), θ) = \frac{1}{\sqrt{det (2 π R)}} \\ \exp [- \frac{1}{2} {(y^{o} (t) - y^{f} (t))}^{T} R^{- 1} (y^{o} (t) - y^{f} (t))], \end{aligned} \end{matrix}

where R is the covariance matrix of the observation error process r(t). Prior knowledge of the state variables is approximated by the ensemble simulation as follows:

\begin{matrix} (12) & \begin{aligned} p (x (t) |y^{o} (1 : t - 1)) \\ \approx \frac{1}{N} \sum_{i = 1}^{N} δ [x (t) - f (x^{i} (t - 1), θ^{i}, u^{i} (t - 1))], \end{aligned} \end{matrix}

where N is the ensemble size, $x^{i}, θ^{i}, u^{i}$ are the realizations of the ensemble member i, and δ(.) is the Dirac delta function.

The posterior probability of the state variables and parameters can be approximated as follows:

\begin{array}{l} (13) & p (x (t) |y^{o} (1 : t)) \approx \sum_{i = 1}^{N} w (i) δ (x (t) - x^{i} (t)), \\ (14) & p (θ |y^{o} (1 : t)) \approx \sum_{i = 1}^{N} w (i) δ (θ - θ^{i}), \end{array}

where w(i) is the normalized weight for the realization of the ensemble member i and is calculated using the likelihood (see also Eq. 11).

\begin{matrix} (15) & w (i) = \frac{L (y^{o} (t), x^{i} (t), θ^{i})}{\sum_{k = 1}^{N} L (y^{o} (t), x^{k} (t), θ^{k})} . \end{matrix}

Note that Eqs. (13) and (14) update all state variables and parameters of the model although the weight is calculated using only observable variables. Therefore, it is not necessary to observe all state variables in order to update all system variables.

The implementation of SIRPF is as follows:

1.
Updating the model state variables from time t−1 to t using the ensemble simulation (Eqs. 8 and 12).
2.
Calculating the simulated observations for all ensembles (Eq. 9).
3.
Calculating the likelihood for each ensemble member (Eq. 11).
4.
Obtaining the weights for all ensembles (Eq. 15).
5.
Applying a resampling procedure according to the normalized weights. The normalized weights of ensemble i, w(i) can be recognized as the probability that the ensemble i is selected after resampling. Resampled state variables and parameters are defined as $x_{resamp}^{i}$ and $θ_{resamp}^{i}$ , respectively.
6.
Adding the perturbation to the ensembles of parameters (Moradkhani et al., 2005), since there are no mechanisms to increase the variance of parameters of ensemble members, as follows:
$\begin{array}{l} (16) & θ^{i} \leftarrow θ_{resamp}^{i} + ε^{i}, \\ (17) & ε^{i} \sim N (0, max (ω, s \times {Var}^{θ})), \end{array}$
where N(.) is the Gaussian distribution, Var^θ is the variance of θⁱ, and ω is the fixed hyperparameter (see Table 1 for its variable), which guarantees that the ensembles of parameters do not converge into a single value. s is an adaptively changed factor according to the effective ensemble size, N_eff.
$\begin{matrix} (18) & s = s_{0} (1 - {(\frac{N_{eff}}{N})}^{2}) \end{matrix}$
$\begin{matrix} (19) & N_{eff} = \frac{1}{\sum_{i = 1}^{N} w (i)}, \end{matrix}$
where s₀=0.05. The effective ensemble size is the measure of the diversity of ensembles. If the effective ensemble size becomes small, ensembles should be strongly perturbed in order to maintain the diversity of ensembles. A similar strategy has been used in many SIRPF systems (e.g., Moradkhani et al., 2005; Poterjoy et al., 2019).

3 Experiment design

3.1 Observation system simulation experiment

In this study, we performed three observation system simulation experiments (OSSEs). In the OSSE, we generated the synthetic truth of the state and flux variables by driving the flood risk model with the specified parameters and input. Then, we generated synthetic observations by adding the noise to this synthetic truth. Those synthetic observations were assimilated into the model by SIRPF. The performance of SIRPF was evaluated by comparing the estimated state variables by SIRPF with the synthetic truth. Model parameters used to generate the synthetic truth can be found in Table 1. They are identical to Di Baldassarre et al. (2013). The OSSE has been recognized as an important preliminary step for verifying the newly developed data assimilation systems (e.g., Moradkhani et al., 2005; Vrugt et al., 2013; Penny and Miyoshi 2016; Sawada et al., 2018).

The high water level for the synthetic truth was generated by the following:

\begin{matrix} (20) & W = min (v - 10, 0) . \end{matrix}

v follows the Gumbel distribution as follows:

\begin{matrix} (21) & p (v) = \frac{\exp (- \frac{v - μ}{β})}{β} \exp (- \exp (- (v - μ) β)), \end{matrix}

where μ=9 and β=2.5. Although our high water level is not identical to that of Di Baldassarre et al. (2013), the estimated trajectory of the state variables is similar to Di Baldassarre et al. (2013).

Synthetic observations were generated by adding the Gaussian white noise to the F, G, D, H, and M (see Sect. 2.1) of the synthetic truth. The mean of the Gaussian white noise was 0. The observation error, namely the standard deviation of the Gaussian white noise, was first set to 10 % of the synthetic true variables. Although this observation error is generally larger than that used in meteorology and hydrology, we further increased the observation error and tested the sensitivity of the observation error to the SIRPF algorithm's performance. We first assumed that all of the F, G, D, H, and M can be observed every 10 years or every 10 model integration steps. Then, we evaluated the sensitivity of the observation network (i.e., the observable variables and the observation intervals) to the SIRPF algorithm's performance. Although it is not straightforward to observe the social memory M, several previous studies obtained the proxy of the social memory from interview data (Barendrecht et al., 2019) and a number of Google searches (Gonzales and Ajami, 2017).

We used the ensemble mean of root mean square errors (mRMSEs) as an evaluation metric as follows:

\begin{array}{l} (22) & {RMSE}^{i} = \sqrt{\frac{1}{T} \sum_{t = 1}^{T} (x^{i} (t) - z (t))}, \\ (23) & mRMSE = \frac{1}{N} \sum_{i = 1}^{N} {RMSE}^{i}, \end{array}

where RMSEⁱ is root mean square error for ith ensemble, T is the computational period, xⁱ(t) is the simulated state variable of ensemble i at time t, and z(t) is the synthetic truth at time t.

3.1.1 Experiment 1: perfect model with uncertain high water levels

In the first OSSE, we assumed that there is no uncertainty in the model parameters. We used the same parameter variables as the synthetic truth run, and we did not perform the estimation of parameters. Our SIRPF updated only the state variables. Although the model had no uncertainty, it was assumed that the input data, i.e., the time series of the high water level, were uncertain. Lognormal multiplicative noise was added to the synthetic true high water level so that different ensemble members have different high water levels in the data assimilation experiment. The two parameters of the lognormal distribution, commonly called μ and σ, were set to 0 and 0.15, respectively.

3.1.2 Experiment 2: unknown model parameters and uncertain high water levels

In the second OSSE, we assumed that some of the synthetic true parameter values were unknown. The unknown parameters in experiment 2 were the cost of levee raising γ_E, the rate at which new properties can be built φ_P, the rate of decay of levees κ_T, and the memory loss rate μ_S (see Table 1). We selected these unknown parameters one by one from four equations of economics, politics, technology, and society to discuss how each state variable's observation affects the estimation of parameters across these four equations (see Sect. 2.1). We have no unknown parameters related to F (Eq. 1) since it is unlikely that the parameters in Eq. (1) are much more inaccurate than the other parameters. The parameters related to the flood are mainly determined by the topography of the flood plain so that the process described in Eq. (1) can be replaced by more accurate hydrodynamic models in the real-world case study. The initial parameter variables were assumed to be distributed in the bounded uniform distributions whose ranges are found in Table 1. The uncertainty of the simulation induced by the parameters' uncertainty is large enough to demonstrate the potential of data assimilation to minimize the simulation's uncertainty (see Sect. 4). Our SIRPF sequentially assimilated observations and estimated both state variables and parameters in experiment 2. The high water level data were uncertain, as in experiment 1.

3.1.3 Experiment 3: unknown and time-variant model parameters and uncertain high water levels

To further demonstrate the potential of sequential data assimilation in socio-hydrology, we assumed that the description of the model was biased in experiment 3. Here we assumed that two of the model parameters were temporally varied by the unknown dynamics. Specifically, the rate at which new properties can be built, φ_P, and the memory loss rate, μ_S, were temporally varied in experiment 3, as follows:

\begin{matrix} (24) & \begin{aligned} φ_{P} (t) \\ = \{\begin{cases} 5000 (t < 250) \\ 5000 + (t - 250) \times \frac{40 000 - 5000}{500} (250 \leq t < 750) \\ 40 000 (750 \leq t) . \end{cases} \end{aligned} \end{matrix}

\begin{matrix} (25) & \begin{aligned} μ_{S} (t) \\ = \{\begin{cases} 0.01 (t < 250) \\ 0.01 + (t - 250) \times \frac{0.10 - 0.01}{500} (250 \leq t < 750) \\ 0.10 (750 \leq t) . \end{cases} \end{aligned} \end{matrix}

In the data assimilation experiment, we assumed that the dynamics of φ_P and μ_S were unknown, and we integrated the flood risk model with time-invariant φ_P and μ_S. We evaluated if SIRPF could track this time-variant parameter and reveal the bias of the model's description. The cost of levee raising γ_E and the rate of decay of levees κ_T were assumed to be time-invariant unknown parameters, as they were in experiment 2. The cost of levee raising γ_E affects the state variables of the flood risk model mainly in the initial early years, and the gradual change in the rate of decay of levees κ_T has few impacts on the state variables. Therefore, we found that it is difficult to track the temporal change in these two parameters. The input forcing data, i.e., the high water level, were uncertain, as described in experiment 1.

3.2 Real data experiment

In addition to the OSSEs, we performed the real-world experiment in the city of Rome, Italy. Ciullo et al. (2017) collected real-world data and calibrated their flood risk model. Using the data collected by Ciullo et al. (2017), we performed the data assimilation experiment. It should be noted that the flood risk model of Ciullo et al. (2017) is different from our model (i.e., Di Baldassarre et al., 2013), although they are conceptually similar.

All the data were collected from Fig. 1 of Ciullo et al. (2017) by WebPlotDigitizer (https://automeris.io/WebPlotDigitizer/, last access: 18 September 2020). The observed high water level of the Tiber river was used as input forcing data (W). The levee height (H) and population (G) were used as the observation data assimilated into the flood risk model. In Ciullo et al. (2017), population values within the Tiber's floodplain were normalized by the theoretical maximum of the Tiber's floodplain population, which is estimated to the range between 10⁶ and 2×10⁶. Since our flood risk model needs the population values (not normalized values), we multiplied 1.5×10⁶ and the normalized values shown in Fig. 1 of Ciullo et al. (2017) to obtain the population size in the floodplain.

We added lognormal multiplicative noise to the observed high water level as we did in the OSSEs. The observation errors of levee height and population were set to 10 % and 25 % of the observed values, respectively. Since Ciullo et al. (2017) showed a large uncertainty in the estimation of the theoretical maximum population (see above), it is reasonable to assume that the estimation of the population values also has a relatively large uncertainty.

As in the second and third OSSEs, we have four unknown parameters in this real-world experiment. We used the same settings of the parameters as for the OSSEs, which are shown in Table 1, except for ξ_H, the proportion of the additional high water level due to levee heightening. In this real-world experiment, we set ξ_H=0 because the observed high water level includes the effects of levee heightening. This treatment is consistent with Ciullo et al. (2017; see their Table 2).

The initial conditions of H and M were set to 0. The initial conditions of D were obtained from the uniform distribution between 1000 and 5000. The initial conditions of G were obtained from the uniform distribution between 1500 and 50 000.

4 Results

4.1 Observation system simulation experiment

4.1.1 Experiment 1: perfect model with uncertain high water levels

Figure 1 shows the time series of the model variables calculated by 5000 ensembles with no data assimilation. Although the ensemble mean of the state variables is close to the synthetic truth, the ensembles have a large spread, especially for G. The uncertainty in the input forcing brings the uncertainty in the estimation of the historical socio-hydrological condition.

https://hess.copernicus.org/articles/24/4777/2020/hess-24-4777-2020-f01

Figure 1Time series of (a) the high water level W(t), (b) the flood protection level (or levee height) H(t), (c) the distance of the center of the mass of the human settlement from the river D(t), (d) the size of the human settlement G(t), (e) the intensity of flooding events F(t), and (f) the social awareness of the flood risk M(t) simulated by 5000 ensembles, with uncertain high water levels and no data assimilation, in experiment 1 (see Sect. 3.1.1). The time step is annual. Gray, red, and black lines are the ensemble members, their mean, and the synthetic truth, respectively.

Socio-hydrological data assimilation: analyzing human–flood interactions by model–data integration

2.1 Model

2.2 Data assimilation

3.1 Observation system simulation experiment

3.1.1 Experiment 1: perfect model with uncertain high water levels

3.1.2 Experiment 2: unknown model parameters and uncertain high water levels

3.1.3 Experiment 3: unknown and time-variant model parameters and uncertain high water levels

3.2 Real data experiment

4.1 Observation system simulation experiment

4.1.1 Experiment 1: perfect model with uncertain high water levels

4.1.2 Experiment 2: unknown model parameters and uncertain high water levels

4.1.3 Experiment 3: unknown and time-variant model parameters and uncertain high water levels

4.2 Real data experiment