the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
On the potential of variational calibration for a fully distributed hydrological model: application on a Mediterranean catchment
Maxime JayAllemand
Igor Gejadze
Patrick Arnaud
PierreOlivier Malaterre
JeanAlain Fine
Didier Organde
Calibration of a conceptual distributed model is challenging due to a number of reasons, which include fundamental (model adequacy and identifiability) and algorithmic (e.g., local search vs. global search) issues. The aim of the presented study is to investigate the potential of the variational approach for calibrating a simple continuous hydrological model (GRD; Génie Rural distributed involved in several flash flood modeling applications. This model is defined on a rectangular 1 km^{2} resolution grid, with three parameters being associated with each cell. The Gardon d'Anduze watershed (543 km^{2}) is chosen as the study benchmark. For this watershed, the discharge observations at five gauging stations, gridded rainfall and potentialevapotranspiration estimates are continuously available for the 2007–2018 period at an hourly time step.
In the variational approach one looks for the optimal solution by minimizing the standard quadratic cost function, which penalizes the misfit between the observed and predicted values, under some additional a priori constraints. The cost function gradient is efficiently computed using the adjoint model. In numerical experiments, the benefits of using the distributed against the uniform calibration are measured in terms of the model predictive performance, in temporal, spatial and spatiotemporal validation, both globally and for particular flood events. Overall, distributed calibration shows encouraging results, providing better model predictions and relevant spatial distribution of some parameters. The numerical stability analysis has been performed to understand the impact of different factors on the calibration quality. This analysis indicates the possible directions for future developments, which may include considering a nonGaussian likelihood and upgrading the model structure.
 Article
(5609 KB)  Fulltext XML
 BibTeX
 EndNote
Flash flood prediction remains a challenging task of modern hydrology due to a number of reasons. First, the heavy precipitation events (HPEs) leading to flash floods are difficult to forecast due to complexity of the processes involved (deep convection triggered by orographic lifting, lowlevel wind convergence and/or cold pools) (Ducrocq et al., 2016). Second, the hydrological response of the watershed is difficult to model, since it depends on many factors. These include the watershed properties (topography, geology and land cover) and its initial state, for example the soil moisture (Braud et al., 2016). For the western Mediterranean region, which is often affected by flash floods, the HyMeX program (Hydrological Cycle in the Mediterranean Experiment) offers a good opportunity to conduct multidisciplinary studies on the relevant subjects (Drobinski et al., 2014).
In order to better predict flash floods and reduce their potentially devastating impact, warning systems have been developed or are currently under development (Collier, 2007; Hapuarachchi et al., 2011; Gourley et al., 2017). The distributed hydrological models utilizing the rainfall radar measurements are widely implemented in such systems. These models take into account the spatial variability of the catchment properties and of the rainfall and are capable of predicting the discharge at ungauged locations. The latter is important for small or mediumsized watersheds which are not covered by an extensive gauging network (Borga et al., 2011). Among operational models presently used for flash flood prediction at a national scale, one could mention the CREST (Coupled Routing and Excess Storage) model in the USA developed by Wang et al. (2011) or the G2G (GridtoGrid) model (UK) from Bell et al. (2007). Those distributed models are often “conceptual”, because considering more complex “physically based” models may not be justified for the flash flood prediction purpose (Beven, 1989). Since the conceptual parameters are not directly observable, they have to be defined using calibration. However, due to a potentially significant number of cells or subcatchments, the calibration process has to deal with overparameterization and uniqueness (equifinality) issues (Beven, 1993, 2001). Another set of difficulties stems from a dubious adequacy of such models, in which case the very definitions of “overparameterization” and “overfitting” should be refined.
As noticed in De Lavenne et al. (2019), all existing calibration methods developed for the distributed hydrological models involve some sort of regularization strategy. One possible approach is the control set reduction. For example, for each distributed parameter one can try to evaluate a nonuniform spatial pattern from information about the catchment characteristics, including its geological formation, soil properties and land use (Anderson et al., 2006). Then, instead of calibrating the local values of parameters associated with each grid cell, one calibrates a few “superparameters” (additive constant, multiplier and power) that modify this pattern according to a chosen law (see, e.g., Pokhrel and Gupta, 2010). The same idea but in the multiscale setting is implemented in the multiscale parameter regionalization (MPR) method described in Samaniego et al. (2010). Other strategies can also include the use of additional data, as in Rakovec et al. (2016), where a satellitebased total water storage (TWS) anomaly is used to complement the discharge data. For a lowdimensional unknown vector one can use a variety of probabilistic or gradientfree methods to find the sought estimate. It has to be recognized, however, that evaluating useful spatial patterns from auxiliary information is a difficult task by itself. In the presented paper we investigate the possibility of calibrating the distributed parameters without considering any predefined spatial structures. Such a calibration problem falls into the category of highdimensional inverse problems which can be addressed through the appropriate data assimilation methods.
Methods of data assimilation (DA) have been engaged for several decades in geosciences, including meteorology, oceanography, river hydraulics and hydrology applications. These methods are used for estimating the driving conditions, states and/or parameters (calibration) of a dynamical model describing the evolution of natural phenomena. The estimates are conditioned on observations (usually incomplete) of a prototype system. Some early applications of DA in hydrology are described in the review paper of McLaughlin (1995). More recently the review paper of Liu et al. (2012) reports the progress and challenges of data assimilation applications in operational hydrological forecasting. It seems that the Kalman filtering has been recently the most popular DA method in hydrology (Sun et al., 2016). For instance, in Quesney et al. (2000) the extended Kalman filter is applied with a lumped conceptual rainfall–runoff model to estimate the soil moisture by assimilating the SAR (synthetic aperture radar) data. In Munier et al. (2014), the standard Kalman filter is applied with the semidistributed conceptual TGR (Transfer with the Génie Rural) model, where the discharge observations are assimilated to adjust the initial model states. It has been shown that the predictive performance depends on the degree of “spatialization” of the watershed and on the number of gauging stations engaged. In Sun et al. (2015), the extended Kalman filter is used with the distributed SWAT (soil and water assessment tool) model to improve flood prediction on the upstream Senegal River catchment. In this work, given the large number of state variables, only the spatially averaged lowresolution updates are estimated. This shows that for DA involving distributed models, scalable methods must be used (scalable algorithm is the one able to maintain the same efficiency when the workload grows). The choice of DA methods is, therefore, limited to the ensemble Kalman filter and the variational estimation.
In variational estimation, one looks for the minimum of the cost function dependent on the control vector (i.e., vector of unknown model inputs) using a gradientbased iterative process. The cost function itself represents the maximum a posteriori (MAP) estimator, which turns into the standard 4DVar (variational) cost function (Rabier and Courtier, 1992) under the Gaussian assumption. The key issue of variational estimation is the method used for computing the gradient. For a lowdimensional control vector the finitedifference approach can be used. For example, Abbaris et al. (2014) explored such a variationalestimation algorithm involving the lumped conceptual HBV (Hydrologiska Byråns Vattenbalansavdelning) model in operational setting. It has been used to update the soil moisture and the states of the routing tank reservoirs on some events. It has been shown that DA helps to improve the peak flow prediction; however the correct choice of the assimilation period and the forecast horizon is vital. In Thirel et al. (2010), the cost function is minimized iteratively using the BLUE (best linear unbiased estimator) formulation, which is equivalent to the “algebraic” form of the Gauss–Newton method. Here, DA is implemented involving the SIM (SAFRAN–ISBA–MODCOU; Système d'analyse fournissant des renseignements atmosphériques à la neige–Interaction SolBiosphèreAtmosphère–MODlisation COUplée) model. It has been shown that the improved estimate of the moisture of the soil layers leads to a significantly better discharge simulation. However, the genuine variationalestimation method relies on the adjoint model, which allows the precise (up to roundoff errors) gradient of the cost function to be computed in a single adjoint run. Then, different minimization methods can be applied. For example, in weather and ocean forecasting, where the models involved are computationally very expensive, the Gauss–Newton method (e.g., “incremental approach”) is used. This method (as any localsearch method) leads to a nearest local minimum in the vicinity of the prior guess. This could be a serious problem if the posterior distribution is multimodal. Certain past attempts with the localsearch methods in hydrology were not always successful, and several authors have reported that these methods fail to deliver the global optimal solution (Moradkhani and Sorooshian, 2009; Abbaspour et al., 2007). For highdimensional but relatively inexpensive models, the gradientenhanced globalsearch minimization methods can be considered (Laurent et al., 2019).
Using the variational estimation involving the adjoint is very common in atmospheric and oceanographic applications. But, in hydrology, only a very few cases have been actually reported. In particular, in Castaings et al. (2009) and Nguyen et al. (2016), the adjoint model has been generated or derived for the kinematicwave overland flow model with the source term including the rainfall as a driving condition and the infiltration term described by a Green–Ampt model (Castaings et al., 2009) or the Horton model (Nguyen et al., 2016). Since models are represented by a partial differential equation, this is a standard case for which a significant experience has been accumulated within the data assimilation community. The major difference between the two papers is that in Castaings et al. (2009) the adjoint has been generated by automatic differentiation applied to the existing MARINE (Modélisation de l'Anticipation du Ruissellement et des Inondations pour des évéNements Extrêmes) model, while in Nguyen et al. (2016) it is derived and implemented manually. In Castaings et al. (2009) the distributed parameters of the infiltration model has been calibrated considering a single flood event in an identical twin experiment framework, whereas in Nguyen et al. (2016) the author looks for a few global parameters considering two realistic events. In Seo et al. (2009) the adjoint is used for state updating of a lumped model, while in Lee et al. (2012) it is for state updating of a distributed model.
The aim of our study is to assess a set of parameters which may represent the spatially varying hydrological properties of a chosen watershed; thus the distributed model parameters have to be calibrated over a very long assimilation window (i.e., several years). With this purpose we develop a variationalcalibration method using the adjoint applied on a simple fully distributed model (GRD; Génie Rural distributed), involving a conceptual celltocell routing scheme. This scheme has been designed keeping in mind the differentiability requirement. The adjoint is obtained by automatic differentiation and manually optimized to provide the capacity to work for long time periods (up to several consecutive years) over large spatial areas, with fine resolution. This requires a memory efficient and fast code. The distributed parameters of the GRD model are calibrated over a French Mediterranean catchment, the Gardon d'Anduze, using rainfall radar data and the discharge data from the outlet gauge station. The discharge data from other gauge stations available in this catchment are used for crossvalidation (10 years, being split in two periods). Thus, the major questions addressed in this paper are: (a) can we, in principle, benefit from considering the spatially distributed set of coefficients given by the method instead of the uniform (homogeneous) set of coefficients, and, if so, to what extent? In particular, does it help to improve the discharge prediction over the catchment area including “ungauged” locations? (b) What are the major difficulties associated with this approach (insufficient data, structural deficiency of the model, identifiability issue, etc.), and what could possibly be done to improve the model predictive performance?
The paper is organized as follows. In Sect. 2.1 the GRD model used in this study is described. In Sect. 2.2 we present the variationalestimation algorithm adapted for the parameter calibration purpose. The testing benchmark is described in Sect. 2.3, and the testing methodology in Sect. 2.4. The results are presented in Sect. 3, followed by the “Discussion and conclusions” section.
2.1 Distributed rainfall–runoff model GRD
The GRD model is a conceptual distributed hydrological model designed for flash flood prediction (Javelle et al., 2010, 2016; Arnaud et al., 2011; Javelle et al., 2014). Since March 2017, it has been used operationally by the national French flash flood warning system called “Vigicrues Flash” (Javelle et al., 2019). The model version used in the present study has been specially developed for testing the potential of distributed calibration using the variational approach. It is defined on a regular 1 km^{2} grid and runs continuously at an hourly time step. For each time step the model input includes the gridded rainfall and potential evapotranspiration, and the output is the discharge field defined at the routing scheme nodes.
Our model incorporates some features from the GR (Génie Rural) model family, which include several lumped and semidistributed bucketstyle continuous models developed over the last 30 years at INRAE Antony (Institut national de recherche pour l'agriculture, l'alimentation et l'environnement). Those models have been extensively tested and have demonstrated good performance in various conditions and for different time steps (Perrin et al., 2003; Mouelhi et al., 2006; Lobligeois et al., 2014; Ficchì et al., 2016; Santos et al., 2018; Riboust et al., 2019; De Lavenne et al., 2019).
Let us consider a 2D spatial domain (basin, catchment and watershed) Ω covered by the rectangular grid. For each cell (pixel), the model involves the production and transfer reservoirs, characterized by capacities c_{p} and c_{t}, correspondingly, and the discharge generated within each cell is routed between cells with local routing velocity v (see Fig. 1). It means that only three parameters need to be defined for each cell. The integral flow, without distinction between surface, subsurface or groundwater flow, is simulated by the production and transfer reservoirs at the cell level and by the routing scheme at the intercell level. Obviously, such model conceptually describes the overall hydrological process rather than its physically meaningful components.
2.1.1 The water balance operator
Let P(t) be the local total rainfall (function of time t) and E(t) be the local total potential evapotranspiration. For each cell (pixel), a water balance function determines the effective rainfall P_{r}, i.e., the amount of rainfall that will participate to the flow. This function is defined via the following steps. First, the net rainfall P_{n} and the net potential evapotranspiration E_{n} are defined from the following equation:
Next, the production reservoir is filled by P_{p}, a part of P_{n}. Similarly, the production reservoir is emptied by E_{p}, a part of E_{n} representing the actual evapotranspiration. The variation of the reservoir level h_{p} is driven by the following differential equation (Edijatno, 1991):
Assuming a stepwise approximation of input variables P(t) and E(t), Eq. (2) is integrated over one time step Δt to obtain the amount P_{p} filling the reservoir and the amount E_{p} evaporated from it:
It should be noted that with this discretized temporal formulation, h_{p} is the reservoir level at the beginning of Δt and P_{p} and E_{p} are the volume of water gained or lost by the reservoir over Δt. At the end of Δt, h_{p} is updated by adding P_{p} or removing E_{p} before progressing to the next time step. Finally, P_{r} is the part of the net rainfall that does not enter into the production reservoir:
One can see that the state of the production reservoir h_{p} plays the role of the humidity state of the soil. An empty reservoir (h_{p}=0) means that the soil is completely dry: effective rainfall and evapotranspiration are equal to zero, and all amount of the net rainfall enters into the production reservoir (E_{p}=0, P_{p}=P_{n} and P_{r}=0). On the contrary, a full reservoir (h_{p}=c_{p}) means that the soil is completely saturated: no more rainfall enters into the production reservoir, the evapotranspiration is maximal and all the net rainfall produces the effective rainfall (E_{p}=E_{n}, P_{p}=0 and P_{r}=P_{n}).
2.1.2 The transfer operator
The effective rainfall P_{r}(t) fills the transfer reservoir characterized by state h_{t} and capacity c_{t}. The emission from the transfer reservoir during Δt gives the elementary discharge q. This transformation is modeled by a conservative operator which is derived from the differential equation describing the evolution of h_{t} under the mass conservation condition:
It has been noticed (Perrin et al., 2003) that Eq. (6) correctly replicates the flooding and drying processes for b=5. This is an empirical knowledge which has no physical justification. Assuming P_{r} is the impulse function, Eq. (6) is integrated over one time step Δt to obtain the expression for q:
More details about the production and the transfer reservoirs can be found in Perrin et al. (2003).
2.1.3 The routing scheme (cell to cell)
The total discharge (Q) in a cell is then obtained by routing through the catchment all the upstream elementary discharges (q). This routing is built on top of a digital elevation model which, for a given node, defines the flow direction. The routing nodes are placed at the center of the corresponding cells.
For the sake of simplicity we describe the routing model in the 1D setting. The total discharge from node i−1 to node i is delayed by time
where ${d}_{i\mathrm{1},i}$ and ${v}_{i\mathrm{1},i}$ are the distance and the routing velocity between the nodes, respectively. In the simplest implementation, the output discharge (more precisely, the mass over the time step Δt) is given as
where Q_{i} is the total discharge in cell i, Q_{i−1} is the total discharge in the neighboring upstream cell and q_{i} is the elementary discharge emitted from the transfer reservoir over time period δt at cell i. Note that in a 2D case, the second term of the righthand side of Eq. (9) could be a sum of a few contributions from direct neighboring cells, with their own values of d and v. Since no explicit expression for Q is provided, Q is not differentiable with respect to v. That is why the above formulation is not suitable for the variational approach, which requires the gradient of the cost function to be computed. In order to tackle this issue we represent the second term in Eq. (9) in the integral form as follows:
Next, instead of the δ function we use the unscaled Gaussian function, i.e.,
where
It is easy to see that function Eq. (11) explicitly depends on v_{i} via ω; therefore the gradient of Q_{i−1} with respect to v_{i} can be computed. Assuming Q(t) is a constant during a time step period Δt, Eq. (11) can be written in the discrete form as follows:
where
and L defines the finite time period (in terms of Δt) instead of the semiinfinite period considered in Eq. (10). For the given estimate of routing velocities ${v}_{i\mathrm{1},i}$, the coefficient β_{i,l} does not change with time and, therefore, can be precomputed and saved in memory. The spread coefficient σ models the diffusion. The value of parameter σ should not be lower than 0.5 to avoid numerical instability (i.e., when a small variation of the routing velocity results in a large variation of “delay”). Therefore, σ=0.5 is used in computations. In terms of using the exponential weights the presented routing model resembles the lagandroute (LR) model described in Laganier et al. (2014) and Tramblay et al. (2010). However, the Gaussian function represents the hydraulic response function in a more realistic way. Indeed, in the lagandroute method, the kernel function ω is discontinuous, being zero for ${t}^{\prime}>t\mathit{\tau}$. It means the outflow from cell i−1 arrives to cell i in a “shock” manner. If the Gaussian function is used, there is no discontinuity; i.e., outflow from cell i−1 arrives to cell i progressively. This scheme is more suitable for celltocell implementation, as it is more stable for direct modeling and the absence of discontinuity is a necessary condition to achieve the differentiability of the forward operator.
2.2 Variationalcalibration algorithm
Calibrating a distributed model is often difficult due to a number of reasons. First, the total number of the sought parameters can be quite large (high dimensionality). This strictly limits the choice of suitable inference methodologies. Second, there is an identifiability issue given the sparsity of observations in space, the information content of the test signal (rainfall variability) and, possibly, the chosen model structure. The first two can be partially compensated by increasing the observation period or observation frequency to better analyze the system dynamics.
For distributed models the variationalestimation algorithm could be a natural choice, since it is scalable, i.e., it works efficiently for a practically unlimited size of the control vector. That is why this method (branded as 4DVar) is commonly used in meteorology and oceanography for operational forecasting and reanalysis (Ledimet and Talagrand, 1986; Rabier and Courtier, 1992). The method provides the exact mode of the posterior distribution by minimizing the cost function defined over the given observation window. The key element of the method is the adjoint model, which provides the precise gradient of the cost function with respect to all elements of the control vector in a single run (Errico, 1997). This allows for the fastconverging gradientbased minimization methods to be used, such as the BFGS (Broyden–Fletcher–Goldfarb–Shanno) or the Newton type. Quite often, the need for development of the adjoint model becomes an obstacle for the practical implementation of this method. Heuristic methods such as the Nelder–Mead algorithm do not require the gradient to evaluate the descent directions but converge slowly and are not suitable for solving problems in high dimensions. The same is true as for the generalpurpose statistical methods such as the Markov chain Monte Carlo (e.g., Metropolis–Hastings algorithm), as well as for the methods specially designed for hydrology applications, such as SUFI2 (Sequential Uncertainty Fitting) (Abbaspour et al., 2007).
Let us consider the rainfall and potentialevapotranspiration fields P(x,t) and E(x,t), $\forall x\in \mathrm{\Omega}$. We represent the hydrological model in Sect. 2.1 as an operator A mapping the input fields P(x,t) and E(x,t) into the discharge Q_{k}(t) at the gauged nodes ${x}_{k}\in \mathrm{\Omega},\phantom{\rule{0.25em}{0ex}}k=\mathrm{1},{N}_{\mathrm{g}}$:
where $\mathit{h}(x,t)=\left({\mathit{h}}_{\mathrm{p}}\right(x,t),{\mathit{h}}_{\mathrm{t}}(x,t){)}^{\mathrm{T}}$ is the state vector, which includes the states of production and transfer reservoirs for all cells at time t; $\mathit{\theta}\left(x\right)=\left({c}_{\mathrm{p}}\right(x),{c}_{\mathrm{t}}(x),v(x){)}^{\mathrm{T}}$ is the parameter vector, which includes the corresponding capacities and the routing velocities at all routing nodes; and N_{g} is the number of gauged routing nodes, i.e., where discharge observations are available. If the observation period is much longer than the characteristic time of the system (which is the case for calibration or reanalysis), one can use the trivial initial state $\mathit{h}(x,\mathrm{0})=\mathrm{0}$, but consider the observation window $t\in ({t}^{\ast},T)$, where t^{∗} is the relaxation period. Given the observed inputs ${\mathit{P}}^{\ast}(x,{t}^{\prime}),\phantom{\rule{0.25em}{0ex}}{\mathit{E}}^{\ast}(x,{t}^{\prime}),\phantom{\rule{0.25em}{0ex}}{t}^{\prime}\in (\mathrm{0},t)$ and the output ${Q}_{k}^{\ast}\left(t\right)$, the calibration cost function can be defined as follows:
where B is the background error covariance; θ^{∗} is a prior guess on θ, which comes from special measurements, land expertise or a modeling; α is the regularization parameter; and a_{k} are the scaling constraints. If we consider
where $\langle {Q}_{k}^{\ast}\rangle $ is the temporal mean of ${Q}_{k}^{\ast}\left(t\right)$, then for each k the misfit term becomes 1−NSE, where NSE stands for the Nash–Sutcliffe efficiency criterion (Nash and Sutcliffe, 1970) widely used in hydrology. In essence, Eq. (15) is more or less a standard quadratic cost function similar to the one used in variational data assimilation (4DVar), where the weight α is introduced to mitigate the uncertainty in θ^{∗} and B. Let us note that this paper is focused on the parameter calibration problem involving long time series of observations.
We use additional constraints in the form
where θ_{min} and θ_{max} are the bounds which come from the empirical knowledge or physical considerations. Thus, the optimal estimate of the parameters $\widehat{\mathit{\theta}}$ is obtained from the condition
given constraints of Eqs. (14) and (16).
Matrix B can be represented in the form $\mathbf{B}={\mathit{\sigma}}_{\mathit{\theta}}\cdot \mathbf{I}\phantom{\rule{0.125em}{0ex}}\mathbf{C}\phantom{\rule{0.25em}{0ex}}{\mathit{\sigma}}_{\mathit{\theta}}\cdot \mathbf{I}$, where σ_{θ} is the vector of mean deviations of θ, C is the correlation matrix, I is the identity matrix and “⋅” stands for the elementwise (Hadamard) product. Next, the scaling of parameters is introduced such that $\mathit{\theta}={\mathit{\theta}}_{min}+\stackrel{\mathrm{\u0303}}{\mathit{\theta}}({\mathit{\theta}}_{max}{\mathit{\theta}}_{min})$ to ensure that $\mathrm{0}\le \stackrel{\mathrm{\u0303}}{\mathit{\theta}}\le \mathrm{1}$. Then, the penalty term in Eq. (15) takes the form
Assuming that $({\mathit{\theta}}_{max}{\mathit{\theta}}_{min})\cdot {\mathit{\sigma}}_{\mathit{\theta}}^{\mathrm{1}}=\mathrm{const}$, the cost function of Eq. (15) reads as follows:
given
The results presented in this paper correspond to the simplest approach to regularization: we assume that C=I, and the regularization parameter is chosen a priori as a small value ($\mathit{\alpha}={\mathrm{10}}^{\mathrm{4}}$) to ensure the formal wellposedness of the calibration problem. More sophisticated approaches for regularization (nontrivial correlation matrix C and the optimal choice of α using the Lcurve approach) have been utilized (JayAllemand et al., 2018) but are not presented in this paper for the sake of simplicity. In practice, the simplifications mentioned above lead to significantly more oscillating parameter fields, which does not seem to have a critical influence on the predictive performance of the model (in the openloop forecasting, at least).
Minimization of Eq. (18) given constraints of Eq. (19) is performed by LBFGSB (limitedmemory Broyden–Fletcher–Goldfarb–Shanno boundconstrained; Zhu et al., 1994). The minimization process can be written in the form
where J^{′}(θ_{i}) and H^{−1}(θ_{i}) are the gradient (with respect to $\stackrel{\mathrm{\u0303}}{\mathit{\theta}}$) and the limitedmemory inverse Hessian of Eq. (18) at point θ_{i}, respectively, γ is a descent step, i is the iteration number, and R is the gradient projection operator to account for the box constraints. Let us note that H^{−1}(θ_{i}) is directly computed inside the minimization algorithm in such a way that its norm is always bounded. This serves as an additional regularization; thus the solution $\widehat{\mathit{\theta}}$ is always bounded, even for α=0 in Eq. (18), i.e., even without the penalty term. The gradient J^{′}(θ_{i}) is obtained by solving the adjoint model. This model has been generated by the automatic differentiation engine Tapenade (Hascoet and Pascual, 2013), then manually optimized and, finally, verified using the standard gradient test.
The background value θ^{∗} is used both as a starting point for iterations and in the penalty term. Given the fact that the information content of the test signal (rainfall) and observations (discharge) may not be sufficient to uniquely identify the distributed coefficients, evaluating an appropriate θ^{∗} becomes an important issue. Thus, the overall calibration process involves two steps. In the first step we consider a uniform approximation: ${\mathit{c}}_{\mathrm{p}}\left(\mathit{x}\right)={\stackrel{\mathrm{\u203e}}{\mathit{c}}}_{\mathrm{p}}$, ${\mathit{c}}_{\mathrm{t}}\left(\mathit{x}\right)={\stackrel{\mathrm{\u203e}}{\mathit{c}}}_{\mathrm{t}}$ and $\mathit{v}\left(\mathit{x}\right)=\stackrel{\mathrm{\u203e}}{\mathit{v}}$, $\forall x\in \mathrm{\Omega}$. In this case (referred as “uniform calibration”) the sought vector $\stackrel{\mathrm{\u203e}}{\mathit{\theta}}=({\stackrel{\mathrm{\u203e}}{\mathit{c}}}_{\mathrm{p}},{\stackrel{\mathrm{\u203e}}{\mathit{c}}}_{\mathrm{t}},\stackrel{\mathrm{\u203e}}{\mathit{v}}{)}^{\mathrm{T}}$ consists of just three elements. For such a lowdimensional problem, obtaining the globally optimal estimate $\widehat{\stackrel{\mathrm{\u203e}}{\mathit{\theta}}}$ (i.e., the one which corresponds to the global minimum of Eq. 18) is feasible by a variety of methods. In particular, we use a simple globalminimization algorithm, the steepest descent method summarized in Edijatno (1991). In the second step we estimate the distributed parameters using the uniform estimate as a background, i.e., ${\mathit{\theta}}^{\ast}=\widehat{\stackrel{\mathrm{\u203e}}{\mathit{\theta}}}$. Here, three unknown parameters for each cell are estimated using the variational algorithm described above. This twostep algorithm is referred as “distributed calibration”.
Parameter bounds are defined for each step. Numerical and physical considerations enforce the lower bounds so that $({\mathit{c}}_{\mathrm{p}},{\mathit{c}}_{\mathrm{t}},\mathit{v})>\mathrm{0}$. For the uniform calibration, the upper bounds are chosen to preserve the model dynamics. For example, 5 m s^{−1} is used as the velocity upper bound, since above this value the flow delay does not decrease significantly. For the production and transfer reservoirs the upper bounds are set to 5000 and 2000 mm, respectively, since the higher values do not noticeably change the model dynamics (reservoir states remain almost constant in time). For the distributed calibration, bounds are recomputed as ${\mathit{\theta}}_{max}=K\widehat{\stackrel{\mathrm{\u203e}}{\mathit{\theta}}}$ and ${\mathit{\theta}}_{min}=\widehat{\stackrel{\mathrm{\u203e}}{\mathit{\theta}}}/K$. In particular, K=4 is used for results presented in this paper. Further increasing K does not seem to have any influence on the estimates.
2.3 Study area and data
A French watershed, the Gardon d'Anduze, has been considered for testing our model and calibration algorithm. Located in the western Mediterranean region, this catchment and its surrounding have been deeply studied in the framework of the HyMeX program (Drobinski et al., 2014) to understand the processes leading to flash floods. For instance, some studies exploited a number of very detailed field measurements during severe storm events (Braud et al., 2014; Vannier et al., 2014). Others tested hypotheses using physically distributed models, for instance the MARINE model (Roux et al., 2011; Garambois et al., 2013, 2015; Douinot et al., 2016, 2018) or the CéVeNnes (VCN) model (Braud et al., 2010; Vannier et al., 2016). A few conceptual distributed models were also considered in this area, such as those implemented into the ATHYS (Atelier Hydrologique Spatialisé) platform (Bouvier and DelClaux, 1996; Laganier et al., 2014; Tramblay et al., 2010).
The main properties of the Gardon d'Anduze are described in Darras (2015). In brief, this is a steep mountainous watershed with a dense hydrographic network spreading over 540 km^{2} in the eastern part of the Cévennes mountains (France). The difference in levels between the highest elevation point and Anduze is about 800 m, and the slope reaches 50 % in the upstream part. A metamorphic (schist) but fractured geological formation dominates the watershed. Water infiltrates very quickly (the saturated hydraulic conductivity is greater than 100 mm h^{−1}). The water circulation appends mainly underground, but with very short response times (less than 12 h). This area is governed by a transitional Mediterranean–oceanic climate with warm and dry summers, alleviated by the oceanic influence, followed by recurrent short, intense but persistent heavy rainfalls in autumn and winter, known as épisode méditerranéen, generating flash floods.
This watershed is well gauged: at least five stations with continuous data collection are operational here (see Fig. 2 and Table 1). For numerical experiments, the discharge data have been extracted from the HYDRO database of the French ministry in charge of the environment. Rainfall gridded data have been provided by Météo France. It is a pseudorealtime reanalysis (ANTILOPE – ANalyse par spaTIaLisation hOraire des PrEcipitations – J+1, i.e., available 1 d after the current date), merging radar rainfall estimation with in situ rain gauges. The gridded potential evapotranspiration is computed using interannual air temperature values and the formula developed by Oudin et al. (2005). The temperature data were provided by the SAFRAN reanalysis (Vidal et al., 2010). All the time series have been processed at an hourly time step over the continuous 2007–2018 period. All the gridded data are defined at a 1 km × 1 km spatial resolution, over the same 46 km × 40 km domain overlapping the watershed. Furthermore, the flow direction (eight directions) map and the flow accumulation map have been carefully checked, in order to ensure that every cell is connected to the correct downstream cell.
2.4 Investigating methodology
The variational algorithm described in Sect. 2.2 is applied to the hydrological model presented in Sect. 2.1, using the Gardon d'Anduze watershed as a benchmark. The number of “active” cells (i.e., included into the watershed) is 540, so the total number of parameters to be calibrated is 3 ⋅ 540.
The calibratedmodel validation step consists of checking the model predictive performance (referred as MPP) using the data not involved in calibration. That is, the full set of observations ${Q}_{k}^{\ast}\left(t\right),\phantom{\rule{0.25em}{0ex}}k=\mathrm{1},\mathrm{\dots},{N}_{\mathrm{g}},\phantom{\rule{0.25em}{0ex}}t\in (\mathrm{0},T)$ is divided in two complementary subsets: a calibration subset and validation subset. Since Q^{∗} depends on k (defining the spatial distribution of sensors) and t, we distinguish the temporal, spatial and spatiotemporal validation, following the split sample test defined by Klemes (1986). In particular, we divide the whole period in two parts: P1 from 1 January 2008 to 1 January 2013 and P2 from 1 January 2013 to 1 January 2018. Each period of P1 and P2 can be considered as a calibration or validation period. A model warmup of 1 year is performed before starting the simulations. We assume that 1 year is enough, according to recommendations by Perrin et al. (2003).
If data from a station are used in calibration, the corresponding catchment is called the “calibration catchment”; otherwise it is called the “validation catchment”.
Both the calibration quality and the MPP in validation are measured using the NSE criterion and the Kling–Gupta efficiency (KGE) criterion (Gupta et al., 2009).
The spatiotemporal measured discharge data are partitioned into calibration and validation complementary sets. Then, we refer to:
 a.
“temporal validation” if MPP is evaluated for all calibration catchments over the validation period
 b.
“spatial validation” for all validation catchments over the calibration period
 c.
“spatiotemporal validation” for all validation catchments over the validation period.
The following numerical experiments have been performed:
 1.
Calibration uniform5station and calibration distributed5sta. These are uniform and distributed calibration, respectively, using discharge data from all five gauging stations, with calibration periods P1 or P2. This is followed by temporal validation; i.e., the parameter estimate obtained using data from P1 is validated on data from P2, and vice versa. The appends uniform5sta and distributed5sta to the “temporal validation” in Fig. 3 indicate the relevant calibration procedure.
 2.
Calibration uniform1sta and calibration distributed1sta. These are uniform and distributed calibration, respectively, using discharge data from one downstream gauging station (Anduze), with calibration periods P1 or P2. This is followed by spatial validation using data from the remaining four gauging stations (Générargues, Mialet, SaintJean and Corbès), with calibration periods P1 or P2 and by spatiotemporal validation on data from the remaining four gauging stations but for validation periods P2 or P1.
 3.
Ensemble distributed1sta. This is multiple distributed calibration using discharge data from the Anduze gauging station, starting from different uniform priors.
The main purpose of experiments 1 and 2 is to quantify the anticipated MPP improvement, achieved in “distributed calibration”. Given the rainfall data, we compare the ability of the calibrated model to predict the discharge at observation points without using the discharge data for the validation period (experiment 1, temporal validation); then we compare the ability to resolve the spatial distribution of discharge in ungauged areas, given the discharge data for the validation period at the catchment outlet (experiment 2, spatial validation) and the ability to predict the discharge in ungauged areas and without using the discharge data for the validation period (experiment 2, spatiotemporal validation). Clearly, the latter is the ultimate performance test. The purpose of experiment 3 is to investigate the stability of the parameter estimates with respect to their priors. This is necessary to assess the impacts of equifinality and limitations related to the localsearch minimization.
In addition, the current MPP is analyzed for the eight major flood events for periods P1 and P2, listed in Table 2. Three other criteria from (Artigue et al., 2012) are used, which compare the magnitude and the synchronization of the modeled and observed flood peak.

The percentage peak discharge P_{D}:
$$\begin{array}{}\text{(21)}& {P}_{\mathrm{D}}=\mathrm{100}\times {\displaystyle \frac{{Q}_{max}}{{Q}_{max}^{\ast}}},\end{array}$$where ${Q}_{max}=Q\left({t}_{\mathrm{m}}\right)$ and ${Q}_{max}^{\ast}={Q}^{\ast}\left({t}_{\mathrm{m}}^{\ast}\right)$ are the predicted and the observed maximum discharges, respectively. These maximum values are achieved at different time instants t_{m} and ${t}_{\mathrm{m}}^{\ast}$, within the chosen time period.

The synchronous percentage of the peak discharge S_{PPD}:
$$\begin{array}{}\text{(22)}& {S}_{\text{PPD}}=\mathrm{100}\times {\displaystyle \frac{Q\left({t}_{\mathrm{m}}^{\ast}\right)}{{Q}_{max}^{\ast}}}.\end{array}$$ 
The peak delay P_{D}:
$$\begin{array}{}\text{(23)}& {P}_{\mathrm{D}}={t}_{\mathrm{m}}{t}_{\mathrm{m}}^{\ast}.\end{array}$$
3.1 Performance analysis
The results associated with experiments 1 and 2 are presented in Fig. 3 in a “statistical” form (as a distribution of the NSE criterion ranked in increasing order), without relation to the period or gauge station.
Figure 3a shows the results of the “uniform” and “distributed” calibration from experiment 1. All five gauging stations are involved in calibration over two periods; thus we have 5⋅2 calibration points. One can see that the distributed calibration allows for a much better approximation of the observed discharge than the uniform one. This result confirms that the data assimilation procedure works. It also suggests that hydrological properties of the catchment are not uniform and may be captured using distributed parameters. On the other hand, the mismatch between the predicted and observed discharges remains significant. Since the model looks overparameterized and the regularization parameter in Eq. (18) is very small, this indicates that a few issues, either alone or in combination, may be presented: corrupted data (the data used are not synthetic), a deficient model structure (such as an unaccounted sink term, for example) or a local minimum is reached instead of the global one. Some of these issues will be later discussed. Figure 3b shows the same results for experiment 1, but they are this time in temporal validation. One can see that the distributed calibration allows for achieving a better “global” temporal MPP than the uniform one.
Figure 3c shows the results of spatial validation of experiment 2 (i.e., calibrating only with data from the Anduze gauging station). The remaining four gauging stations are involved in validation for two time periods; thus we have 4⋅2 spatial validation points. The calibration results for the Anduze station are not presented graphically, but the NSE values achieved are (for P1 and P2, respectively) uniform of 0.82 and 0.78 and distributed of 0.93 and 0.88, whereas the KGE values are uniform of 0.81 and 0.71 and distributed of 0.89 and 0.69. One can see once again that the distributed calibration allows for clearly achieving a better global spatial MPP than the uniform one. Finally, Fig. 3d shows the results of spatiotemporal validation (experiment 2), evaluated over the period not used for calibration. As before, only data from the Anduze gauging station are used for calibration, but data from all five gauging stations from a different time period are used for validation, giving 5⋅2 validation points. Two of them (in “×”) show the corresponding temporal validation results at the Anduze gauge station; others (shown in “⊗”) are the spatiotemporal validation results at the remaining gauge stations. One can see that the spatiotemporal MPP is generally better if the distributed calibration is used.
3.2 Periodbased analysis
To get a more detailed view of the results of experiment 2, Fig. 4 relates each NSE value to the particular time period and gauging station. Here, Fig. 4a presents the NSE values calculated over the period P1, with parameters optimized over the same period (calibration); Fig. 4b presents the NSE values calculated over P1, with parameters optimized over P2 (validation). Conversely, Fig. 4c contains the NSE values calculated over the period P2, with parameters optimized over the same period; Fig. 4d presents the values calculated over P2, with parameters optimized over P1. All panels present results for the uniform and distributed parameters (blue and red, respectively). The similar figure has been obtained for experiment 1 but not presented here, since the results are also similar. Based on this figure the following observations can be made:
 1.
Considering the results obtained with uniform parameters (blue), the NSE values are always better (larger) when calculated over the period P1, whatever case is considered: calibration (+), spatial validation (o), temporal validation (×) or spatiotemporal validation (⊗).
 2.
Considering the results obtained with distributed parameters (red), better results are always obtained when the NSE values are calculated over P1 in calibration (+) and temporal validation (×). But in spatial validation (o), better results are obtained on P2, for two out of four upstream stations: Mialet and Corbès (V7124015 and V7135010, respectively).
 3.
Comparing the NSE values for the uniform (blue) and the distributed (red) calibration, we notice that for the latter we obtain better results, except in two cases: in spatial validation over P1 for Mialet and Corbès (V7124015 and V7135010) and in spatiotemporal validation over P2 for Corbès and SaintJean (V7135010 and V7135017). Note that in both cases the distributed parameters have been calibrated over P1.
To complete the analysis, the MPP obtained in spatiotemporal validation at Mialet (V7124015) and SaintJean (V7135017) – two stations identified as “particular” just above – have been studied in more detail, for 2⋅4 flood events described in Table 2, corresponding to the four biggest floods over each period. The corresponding hydrographs are plotted in Fig. 5, and the resulting MPP criteria of NSE, KGE, PPD, S_{PPD} and P_{D} are presented in Tables 3 and 4. Based on these results, the following conclusions can be drawn.
 1.
Flood events at those stations which occur during the period P1 are well simulated using both a uniform and distributed set of parameters calibrated over P2 ($\stackrel{\mathrm{\u203e}}{\text{NSE}}>\mathrm{0.70}$ and $\stackrel{\mathrm{\u203e}}{\text{KGE}}>\mathrm{0.60}$). In particular, the prediction of the event 3∕P1 is noticeably improved for the distributed calibration. The simulated flood peak is well synchronized with the observed one, though slightly shifted when the distributed calibration is used. Note that these minor shifts are not critical for the peak discharge prediction, as PPD and S_{PPD} remain similar.
 2.
In contrast, the prediction of flood events which occur during P2 is unsatisfactory. Moreover, the results are even worse for distributed calibration. Modeled flood peaks are severely underestimated at station SaintJean (V7135017) for all four events, both for uniform and distributed calibration. At station Mialet (V7124015) the peak discharge is overestimated for the distributed calibration.
These results are consistent with those from Fig. 4 in spatiotemporal validation (⊗), for the same two stations. A smaller cost achieved during calibration does not necessary imply a better MPP, indicating an excessive assimilation of errors associated with P1.
Figures 6 and 7 represent the maps of the calibrated parameters obtained in experiment 1 and experiment 2, respectively, whereas Table 5 provides the corresponding uniform values $\widehat{\stackrel{\mathrm{\u203e}}{\mathit{\theta}}}$ used as priors in the variationalestimation step. Comparing panels a–c and d–f in Fig. 7 corresponding to different time periods P1 and P2, one can notice a significant difference between the calibrated capacities c_{p} and c_{t}. The spatial variability of c_{p} and c_{t} is also higher when the calibration is performed on period P1. One can notice that the results of the uniform calibration are also different for both periods. Concerning the routing velocity v, the maps look rather similar. The velocity changes occur mainly at the cells located in the drainage (see Fig. 2), because the sensitivity of the cost function with respect to the velocities in these cells is the largest. The trend is that these velocities increase during calibration. Note that the velocities above 5 m s^{−1} do not affect significantly the delay between cells. Moreover, the routing velocities obtained are generally consistent with the fast hydrological response of the Gardon watershed (Darras, 2015). A few questions concerning the results in Figs. 6 and 7 should be answered. For example, why are the estimates associated with periods P1 and P2 so different? Is it an “algorithmic” issue (for example, due to the use of the localsearch minimization, uncorrelated priors, etc.), which can be resolved by improving the calibration algorithm, or a “fundamental” issue related to identifiability (which depends of the data available, properties of the model under investigation, and its adequacy regarding the real natural system and its physical characteristics)? We try to answer this question in the next section.
3.3 Stability analysis
An inverse problem is wellposed if a solution to the problem exists, is unique and is continuous with respect to the input data. Let us consider a mathematical model describing a certain physical phenomenon and assume that some variables of this phenomenon are partly observed. If the model is perfect (adequate) and observations are exact then, given the input, the model's output has to match observations. This implies that the minimum of a cost function penalizing the mismatch (mismatch functional) equals zero. Parameter calibration problems are often illposed in terms of the uniqueness (equifinality): there may exist a set of solutions for which the mismatch functional equals zero. However, the “true” value of parameters does exist as one element of this set. If the model involved is nonlinear with respect to its parameters, the mismatch functional may contain additional minimum points where its value is not zero. While the corresponding parameters do not belong to the set of solutions, in the localsearch minimization methods such points are interpreted as solutions. Adding a penalty (regularization) term to the mismatch functional allows one particular solution, which is the “closest” to the prior in terms of a chosen norm, to be defined. This makes the problem formally wellposed but, in practical terms, transforms the nonuniqueness issue into the issue of choosing the prior. Thus, investigating the stability of the estimates with respect to the priors is an important step in the design and validation of a calibration algorithm. In particular, such an analysis allows for the parts of the solution dominated by observations and by the prior to be distinguished.
A straightforward approach implies solving an ensemble of calibration problems involving random (or quasirandom) uniform priors subjected to box constraints of Eq. (16). Let us remember that the priors are used in two ways: as starting points for minimization and to define the penalty term. The influence of the latter has been neglected by choosing small α in Eq. (18). Let $\widehat{\mathit{\theta}}{}_{{\stackrel{\mathrm{\u203e}}{\mathit{\theta}}}_{\mathrm{l}}}$ be the parameter estimate conditioned on the uniform prior ${\stackrel{\mathrm{\u203e}}{\mathit{\theta}}}_{\mathrm{l}}$, $l=\mathrm{1},\mathrm{\dots},L$ from the ensemble of priors of size L. Then, the stability for each element of the parameter vector is measured by the standard deviation (SD) given as follows:
where $\langle \stackrel{\mathrm{\u203e}}{\mathit{\theta}}\rangle $ and $\langle \widehat{\mathit{\theta}}\rangle $ are the ensemble average of priors and estimates, respectively. One can see that the negative values of σ_{θ} correspond to the case when the solution tends to approach the same value for all priors; i.e., it is dominated (or stabilized) by observations, whereas the positive values of σ_{θ} are for the case when the solution is dominated by the prior.
In reality, the conceptual GRD model used in this study is a fairly crude approximation of the hydrological phenomenon. Thus, no solution can be considered as a “truth”, but it is rather as an interpretation of data, given the model and the judgment criteria (cost function). Moreover, the measured data (test signal and observations) are not perfect. These imperfections result in a “generalized observation error”. The stability analysis described above can also be applied in this case, though understanding of results is more difficult. For example, the ensemble average of the mismatch functional values (achieved in minimization) can be considered as a reference level. The parameter estimates which correspond to the values around this level can be considered as possible solutions, whereas the outliers must be discarded.
The stability analysis (experiment 3) has been performed for L=16 configurations of uniform priors ${\stackrel{\mathrm{\u203e}}{c}}_{\mathrm{p}}$ and ${\stackrel{\mathrm{\u203e}}{c}}_{\mathrm{t}}$ (see Table 6) with the same routing velocity $\stackrel{\mathrm{\u203e}}{v}$ = 5 m s^{−1}. The table shows the values of the cost function J(θ) of Eq. (18) for θ before and after minimization.
One can see that all minimization processes for a chosen assimilation period converge to a similar value of the cost function: for P1 the mean value is 0.055, and the SD is 0.002, i.e., about 3.6 % of the mean; for P2 the corresponding numbers are 0.129, 0.004 and 3.1 %. No obvious outliers have been observed, which indicates that the cost function surface is sufficiently regular and the issue of “local search vs. global search” is not critical. The latter is not surprising, since, considering the operators involved, the model seems to be mildly nonlinear.
The spatial distributions of σ_{θ} are presented in Fig. 8, where the stable areas of the estimated parameter fields, i.e., those influenced by observations, are shown in blue. The purpose of this analysis is to understand why the estimates associated with periods P1 and P2 are so different. In particular, the question rises concerning the high capacities obtained for P1; see red areas in Fig. 7a–c for c_{p} and darkyellow areas for c_{t}. In contrast, the capacities obtained for P2 are the lowest in the corresponding areas. However, comparing Figs. 7 and 8, we notice that these “questionable” areas in Fig. 7 largely coincide with the blue (stable) areas in Fig. 8. This clearly indicates that the corresponding estimates are determined by the observed data used in each case. Thus, one may suspect here the data information content issue, the data interpretation criteria (cost function) issue and/or the model adequacy issue, rather than algorithmic issues. This is the main conclusion drawn from the stability analysis. One can also notice that the estimates associated with period 2 (P2) are generally more stable than those associated with period 1 (P1), whereas the ensembleaverage cost function value achieved for P2 is larger than for P1, i.e., 0.129 compared to 0.055. While a better minimization result has been achieved for P1, it looks like more error has been assimilated in this case.
The validation results presented in Sects. 3.1 and 3.2 confirm that the distributed calibration globally improves the temporal and spatial MPP (measured by NSE) as compared to the uniform calibration. The conclusions are less definite with the spatiotemporal MPP, but this is the most difficult performance test. Moreover, the global improvement does not mean that any particular event could be better predicted, as follows from Sect. 3.2. We observe that, on the one hand, the capacity c_{p} and c_{t} estimates obtained using different calibration periods are quite different, whereas, on the other hand, for a chosen calibration period these estimates are relatively stable with respect to their priors. The latter means that the difference is rather due to different data involved in calibration than due to the algorithmic or identifiability problems (see Sect.3.3). Let us note that the crossvalidation experiments could help the best set of parameters to be selected. For example, the MPP analysis above suggests that the sets calibrated using data from P2 should be preferred.
It seems that the routing velocity v is the most stable among all estimated parameters: the velocity fields plotted in Figs. 6 and 7 remain similar for different experiments. Looking at the hydrographic network in Fig. 2, one can see that the velocities are generally much higher along the main drains than on the side slopes, which is in agreement with the true physical behavior of the system, though the routing scheme applied is conceptual. One can notice a few occasions where the velocities are lower in the drain: the Gardon de SaintJean, experiment 1 (P2), the southwestern tributary connected to Corbès, experiment 1 (P1) and the upstream Gardon catchment in experiment 2 (P2). This is likely to be a consequence of an overestimated uniform background, with $\stackrel{\mathrm{\u203e}}{v}=\mathrm{5}$ m s^{−1}.
Calibration of the distributed hydrological models is a difficult task, with the data information content issue, the data interpretation criteria issue and the model adequacy. Some of these issues will be addressed in the near future. For example, the use of the Gaussian likelihood which leads to the quadratic cost function of the type in Eq. (15) does not seem to be the best choice. Taking into account that the discharge itself is a positive variable and the discharge observation error is, most likely, an increasing function of discharge, using the gamma likelihood looks more appropriate.
It is evident that, because of its conceptual nature and simplicity, the GRD model has some structural limitations. Looking for a simple structural upgrade, which may help to improve the adequacy without increasing noticeably the dimensions and computational costs, is an important future task. Another one is to provide better priors. For instance, one can use the nonuniform priors. In particular, for the capacities c_{p} and c_{t} one could consider locally uniform values relevant to the given hydrogeological or soil occupation class. For the velocity, a higher value in the drainage network with respect to the accumulated flow surface has to be enforced. The abovementioned subjects will be addressed in immediate future research.
In this paper, the model has been calibrated using data from different time periods, and the crossvalidation experiments have helped to select the best optimal set of the distributed parameters. However, using a longer calibration period may not necessarily improve the model predictive performance due to the likely presence of very different hydrological regimes over the extended period, including some extreme cases. As a way forward, one could consider calibrating the model independently for different hydrological regimes. This approach is coherent to the idea of the “pooled analysis”. Dependent on the calibration strategy, different parameter sets could be used for the prediction purpose. In the operational flood forecasting context, the model parameters are likely to be fixed, whereas the realtime update of the model initial state should be considered instead. In that case, the initial states of the distributed reservoirs will serve as a control vector, and the assimilation will be performed over a relatively short assimilation window (comparable to the characteristic time of the system). Finally, a running ensemble of models with different calibrated sets of parameters could be considered. In the framework of a flash flood warning system design, the latter approach could be combined with an automatic predictive control strategy such as the treebased model (Ficchì et al., 2016).
In summary, the variational approach based on the adjoint model has proved its great computational efficiency and relevance for solving the calibration problem involving the distributed hydrological model GRD. Technically, this problem can be solved over long time periods and for large spatial areas. The difficulties discovered in this process are the fundamental issues of calibration, not related to the chosen method. The answer to the main research question formulated at the beginning of this paper is positive: it is possible and beneficial to calibrate and then use distributed parameters rather than uniform parameters. A variational algorithm involving adjoint sensitivities has proved to be an efficient tool for such calibration. The calibration quality is expected to be improved by using a more appropriate cost function and by enhancing the model structure. Overall, this means that the suggested research and hydrological forecasting tool development direction is quite promising.
Meteorological and streamflow data used in this study can be obtained from MétéoFrance and the French ministry in charge of environment, respectively. The simulation data that support the findings of this study have been obtained using the SMASH (Spatially distributed Modelling and ASsimilation for Hydrology) platform coded in FORTRAN. They are available from the second author on reasonable request.
MJA performed the computational work. PJ, IG and PA supervised this work. All the coauthors collaborated, interpreted the results, wrote the paper and replied to the comments from the reviewers.
The authors declare that they have no conflict of interest.
This article is part of the special issue “Hydrological cycle in the Mediterranean (ACP/AMT/GMD/HESS/NHESS/OS interjournal SI)”. It is not associated with a conference.
This research has been done in the framework of the 2010–2020 HyMex program. It contributes to the French national PICS (Prévision immédiate intégrée des impacts des crues soudaines) project. The authors wish to thank Etienne Leblois (INRAE Lyon) for having provided the flow direction map used in this study. The editor and the four anonymous referees are also thanked for their constructive comments and suggestions.
This research has been supported by the “Agence nationale de la recherche” (ANR–17–CE03–0011), the Institut national de recherche en sciences et technologies pour l'environnement et l'agriculture (IRSTEA, now INRAE), and by the French ministry in charge of the environment.
This paper was edited by Véronique Ducrocq and reviewed by four anonymous referees.
Abbaris, A., Dakhlaoui, H., Thiria, S., and Bargaoui, Z.: Variational data assimilation with the YAO platform for hydrological forecasting, P. Int. Ass. Hydrol. Sci., 364, 3–8, https://doi.org/10.5194/piahs36432014, 2014. a
Abbaspour, K. C., Yang, J., Maximov, I., Siber, R., Bogner, K., Mieleitner, J., Zobrist, J., and Srinivasan, R.: Modelling hydrology and water quality in the prealpine/alpine Thur watershed using SWAT, J. Hydrol., 333, 413–430, 2007. a, b
Anderson, R. M., Koren, V. I., and Reed, S. M.: Using SSURGO data to improve Sacramento Model a priori parameter estimates, J. Hydrol., 320, 103–116, https://doi.org/10.1016/j.jhydrol.2005.07.020, 2006. a
Arnaud, P., Lavabre, J., Fouchier, C., Diss, S., and Javelle, P.: Sensitivity of hydrological models to uncertainty in rainfall input, Hydrolog. Sci. J., 56, 397–410, 2011. a
Artigue, G., Johannet, A., Borrell, V., and Pistre, S.: Flash flood forecasting in poorly gauged basins using neural networks: case study of the Gardon de Mialet basin (southern France), Nat. Hazards Earth Syst. Sci., 12, 3307–3324, https://doi.org/10.5194/nhess1233072012, 2012. a
Bell, V. A., Kay, A. L., Jones, R. G., and Moore, R. J.: Development of a high resolution gridbased river flow model for use with regional climate model output, Hydrol. Earth Syst. Sci., 11, 532–549, https://doi.org/10.5194/hess115322007, 2007. a
Beven, K.: Changing ideas in hydrology – The case of physicallybased models, J. Hydrol., 105, 157–172, https://doi.org/10.1016/00221694(89)901017, 1989. a
Beven, K.: Prophecy, reality and uncertainty in distributed hydrological modelling, Adv. Water Res., 16, 41–51, 1993. a
Beven, K.: How far can we go in distributed hydrological modelling?, Hydrol. Earth Syst. Sci., 5, 1–12, https://doi.org/10.5194/hess512001, 2001. a
Borga, M., Anagnostou, E. N., Blöschl, G., and Creutin, J. D.: Flash flood forecasting, warning and risk management: the HYDRATE project, Environ. Sci. Policy, 14, 834–844, https://doi.org/10.1016/j.envsci.2011.05.017, 2011. a
Bouvier, C. and DelClaux, F.: ATHYS: a hydrological environment for spatial modelling and coupling with GIS, IAHS PublicationsSeries of Proceedings and ReportsIntern Assoc Hydrological Sciences, 235, 19–28, 1996. a
Braud, I., Roux, H., Anquetin, S., Maubourguet, M.M., Manus, C., Viallet, P., and Dartus, D.: The use of distributed hydrological models for the Gard 2002 flash flood event: Analysis of associated hydrological processes, J. Hydrol., 394, 162–181, https://doi.org/10.1016/j.jhydrol.2010.03.033, 2010. a
Braud, I., Ayral, P.A., Bouvier, C., Branger, F., Delrieu, G., Le Coz, J., Nord, G., Vandervaere, J.P., Anquetin, S., Adamovic, M., Andrieu, J., Batiot, C., Boudevillain, B., Brunet, P., Carreau, J., Confoland, A., DidonLescot, J.F., Domergue, J.M., Douvinet, J., Dramais, G., Freydier, R., Gérard, S., Huza, J., Leblois, E., Le Bourgeois, O., Le Boursicaud, R., Marchand, P., Martin, P., Nottale, L., Patris, N., Renard, B., Seidel, J.L., Taupin, J.D., Vannier, O., Vincendon, B., and Wijbrans, A.: Multiscale hydrometeorological observation and modelling for flash flood understanding, Hydrol. Earth Syst. Sci., 18, 3733–3761, https://doi.org/10.5194/hess1837332014, 2014. a
Braud, I., Borga, M., Gourley, J., Hurlimann Ziegler, M., Zappa, M., and Gallart, F.: Flash floods, hydrogeomorphic response and risk management, J. Hydrol., 541, 1–5, 2016. a
Castaings, W., Dartus, D., Le Dimet, F.X., and Saulnier, G.M.: Sensitivity analysis and parameter estimation for distributed hydrological modeling: potential of variational methods, Hydrol. Earth Syst. Sci., 13, 503–517, https://doi.org/10.5194/hess135032009, 2009. a, b, c, d
Collier, C.: Flash flood forecasting: What are the limits of predictability?, Q. J. Roy. Meteor. Soc., 133, 3–23, https://doi.org/10.1002/qj.29, 2007. a
Darras, T.: Flash flood forecasting by statistical learning, Ph.D. thesis, Université Montpellier, available at: https://tel.archivesouvertes.fr/tel01816929 (last access: 7 October 2020), 2015. a, b
De Lavenne, A., Andréassian, V., Thirel, G., Ramos, M.H., and Perrin, C.: A Regularization Approach to Improve the Sequential Calibration of a Semidistributed Hydrological Model, Water Resour. Res., 55, 8821–8839, https://doi.org/10.1029/2018WR024266, 2019. a, b
Douinot, A., Roux, H., Garambois, P.A., Larnier, K., Labat, D., and Dartus, D.: Accounting for rainfall systematic spatial variability in flash flood forecasting, J. Hydrol., 541, 359–370, 2016. a
Douinot, A., Roux, H., Garambois, P.A., and Dartus, D.: Using a multihypothesis framework to improve the understanding of flow dynamics during flash floods, Hydrol. Earth Syst. Sci., 22, 5317–5340, https://doi.org/10.5194/hess2253172018, 2018. a
Drobinski, P., Ducrocq, V., Alpert, P., Anagnostou, E., Béranger, K., Borga, M., Braud, I., Chanzy, A., Davolio, S., Delrieu, G., Estournel, C., Boubrahmi, N. F., Font, J., Grubišić, V., Gualdi, S., Homar, V., IvančanPicek, B., Kottmeier, C., Kotroni, V., Lagouvardos, K., Lionello, P., Llasat, M. C., Ludwig, W., Lutoff, C., Mariotti, A., Richard, E., Romero, R., Rotunno, R., Roussot, O., Ruin, I., Somot, S., TaupierLetage, I., Tintore, J., Uijlenhoet, R., and Wernli, H.: HyMeX: A 10Year Multidisciplinary Program on the Mediterranean Water Cycle, B. Am. Meteorol. Soc., 95, 1063–1082, https://doi.org/10.1175/BAMSD1200242.1, 2014. a, b
Ducrocq, V., Davolio, S., Ferretti, R., Flamant, C., Santaner, V. H., Kalthoff, N., Richard, E., and Wernli, H.: Introduction to the HyMeX Special Issue on “Advances in understanding and forecasting of heavy precipitation in the Mediterranean through the HyMeX SOP1 field campaign”, Q. J. Roy. Meteor. Soc., 142, 1–6, https://doi.org/10.1002/qj.2856, 2016. a
Edijatno: Mise au point d'un modèle élémentaire pluiedébit au pas de temps journalier, Ph.D. thesis, Université Louis Pasteur, ENGEES, Cemagref Antony, France, available at: https://webgr.inrae.fr/wpcontent/uploads/2012/07/1991EDIJATNOTHESE.pdf (last access: 7 October 2020), 1991. a, b
Errico, R. M.: What is an adjoint model?, B. Am. Meteorol. Soc., 78, 2577–2591, 1997. a
Ficchì, A., Perrin, C., and Andréassian, V.: Impact of temporal resolution of inputs on hydrological model performance: An analysis based on 2400 flood events, J. Hydrol., 538, 454–470, https://doi.org/10.1016/j.jhydrol.2016.04.016, 2016. a
Ficchì, A., Raso, L., Dorchies, D., Pianosi, F., Malaterre, P.O., Overloop, P.J. V., and JayAllemand, M.: Optimal Operation of the Multireservoir System in the Seine River Basin Using Deterministic and Ensemble Forecasts, J. Water Res. Pl., 142, 05015005, https://doi.org/10.1061/(ASCE)WR.19435452.0000571, 2016. a
Garambois, P. A., Roux, H., Larnier, K., Castaings, W., and Dartus, D.: Characterization of processoriented hydrologic model behavior with temporal sensitivity analysis for flash floods in Mediterranean catchments, Hydrol. Earth Syst. Sci., 17, 2305–2322, https://doi.org/10.5194/hess1723052013, 2013. a
Garambois, P.A., Roux, H., Larnier, K., Labat, D., and Dartus, D.: Parameter regionalization for a processoriented distributed model dedicated to flash floods, J. Hydrol., 525, 383–399, 2015. a
Gourley, J., Flamig, Z., Vergara, H., Kirstetter, P.E., Clark, R., Argyle, E., Arthur, A., Martinaitis, S., Terti, G., Erlingis, J., Hong, Y., and Howard, K.: The flash project Improving the Tools for Flash Flood Monitoring and Prediction across the United States, B. Am. Meteorol. Soc., 98, 361–372, https://doi.org/10.1175/BAMSD1500247.1, 2017. a
Gupta, H. V., Kling, H., Yilmaz, K. K., and Martinez, G. F.: Decomposition of the mean squared error and NSE performance criteria: Implications for improving hydrological modelling, J. Hydrol., 377, 80–91, https://doi.org/10.1016/j.jhydrol.2009.08.003, 2009. a
Hapuarachchi, H., Wang, Q., and Pagano, T.: A review of advances in flash flood forecasting, Hydrol. Process., 25, 2771–2784, https://doi.org/10.1002/hyp.8040, 2011. a
Hascoet, L. and Pascual, V.: The Tapenade Automatic Differentiation tool: principles, model, and specification, ACM T. Math. Software, 39, 20, https://doi.org/10.1145/2450153.2450158, 2013. a
Javelle, P., Fouchier, C., Arnaud, P., and Lavabre, J.: Flash flood warning at ungauged locations using radar rainfall and antecedent soil moisture estimations, J. Hydrol., 394, 267–274, 2010. a
Javelle, P., Demargne, J., Defrance, D., Pansu, J., and Arnaud, P.: Evaluating flashflood warnings at ungauged locations using postevent surveys: a case study with the AIGA warning system, Hydrolog. Sci. J., 59, 1390–1402, 2014. a
Javelle, P., Organde, D., Demargne, J., SaintMartin, C., de SaintAubin, C., Garandeau, L., and Janet, B.: Setting up a French national flash flood warning system for ungauged catchments based on the AIGA method, in: 3rd European Conference on Flood Risk Management (FLOODrisk 2016), 17–21 October 2016, Lyon, France, Proceedings, https://doi.org/10.1051/e3sconf/20160718010, 2016. a
Javelle, P., SaintMartin, C., Garandeau, L., and Janet, B.: Flash flood warnings: Recent achievements in France with the national Vigicrues Flash system, United Nations Office for Disaster Risk Reduction, Contributing Paper to the Global Assessment Report on Disaster Risk Reduction (GAR 2019), available at: https://www.undrr.org/publication/flashfloodwarningsrecentachievementsfrancenationalvigicruesflashsystem (last access: 7 October 2020), 2019. a
JayAllemand, M., Gejadze, I., Javelle, P., Organde, D., Fine, J.A., Patrick, A., and Malaterre, P.O.: Assimilation de données appliquée à un modèle pluiedébit distribué pour la prévision des crues, in: De la prévision des crues à la gestion de crise, Société hydaulique de France, Avignon, 2018. a
Klemes, V.: Operational testing of hydrological simulation models, Hydrolog. Sci. J., 31, 13–24, https://doi.org/10.1080/02626668609491024, 1986. a
Laganier, O., Ayral, P. A., Salze, D., and Sauvagnargues, S.: A coupling of hydrologic and hydraulic models appropriate for the fast floods of the Gardon River basin (France), Nat. Hazards Earth Syst. Sci., 14, 2899–2920, https://doi.org/10.5194/nhess1428992014, 2014. a, b
Laurent, L., Le Riche, R., Soulier, B., and Boucard, P.A.: An Overview of GradientEnhanced Metamodels with Applications, Arch. Comput. Method. E., 26, 61–106, 2019. a
Ledimet, F. and Talagrand, O.: Variational Algorithms For Analysis And Assimilation Of Meteorological Observations – Theoretical Aspects, Tellus A, 38, 97–110, https://doi.org/10.3402/tellusa.v38i2.11706, 1986. a
Lee, H., Seo, D.J., Liu, Y., Koren, V., McKee, P., and Corby, R.: Variational assimilation of streamflow into operational distributed hydrologic models: effect of spatiotemporal scale of adjustment, Hydrol. Earth Syst. Sci., 16, 2233–2251, https://doi.org/10.5194/hess1622332012, 2012. a
Liu, Y., Weerts, A. H., Clark, M., Hendricks Franssen, H.J., Kumar, S., Moradkhani, H., Seo, D.J., Schwanenberg, D., Smith, P., van Dijk, A. I. J. M., van Velzen, N., He, M., Lee, H., Noh, S. J., Rakovec, O., and Restrepo, P.: Advancing data assimilation in operational hydrologic forecasting: progresses, challenges, and emerging opportunities, Hydrol. Earth Syst. Sci., 16, 3863–3887, https://doi.org/10.5194/hess1638632012, 2012. a
Lobligeois, F., Andréassian, V., Perrin, C., Tabary, P., and Loumagne, C.: When does higher spatial resolution rainfall information improve streamflow simulation? An evaluation using 3620 flood events, Hydrol. Earth Syst. Sci., 18, 575–594, https://doi.org/10.5194/hess185752014, 2014. a
McLaughlin, D.: Recent developments in hydrologic data assimilation, Rev. Geophys., 33, 977–984, 1995. a
Moradkhani, H. and Sorooshian, S.: General review of rainfallrunoff modeling: model calibration, data assimilation, and uncertainty analysis, in: Hydrological modelling and the water cycle, edited by: Sorooshian, S., Hsu, K.L., Coppola, E. Tomassetti, B. Verdecchia, M., and Visconti, G., Springer, Berlin, Heidelberg, 1–24, https://doi.org/10.1007/9783540778431, 2009. a
Mouelhi, S., Michel, C., Perrin, C., and Andréassian, V.: Linking stream flow to rainfall at the annual time step: the Manabe bucket model revisited, J. Hydrol., 328, 283–296, 2006. a
Munier, S., Litrico, X., Belaud, G., and Perrin, C.: Assimilation of discharge data into semidistributed catchment models for shortterm flow forecasting: Case study of the Seine River basin, J. Hydrol. Eng., 20, 05014021, https://doi.org/10.1061/(ASCE)HE.19435584.0001054, 2014. a
Nash, J. E. and Sutcliffe, J. V.: River flow forecasting through conceptual models. Part I. A conceptual models discussion of principles., J. Hydrol., 10, 282–290, 1970. a
Nguyen, V. T., Georges, D., Besançon, G., and Zin, I.: Parameter estimation of a real hydrological system using an adjoint method, IFAC Papersonline, 49, 300–305, https://doi.org/10.1016/j.ifacol.2016.07.978, 2016. a, b, c, d
Oudin, L., Hervieu, F., Michel, C., Perrin, C., Andréassian, V., Anctil, F., and Loumagne, C.: Which potential evapotranspiration input for a lumped rainfall–runoff model?: Part 2 – Towards a simple and efficient potential evapotranspiration model for rainfall–runoff modelling, J. Hydrol., 303, 290–306, 2005. a
Perrin, C., Michel, C., and Andréassian, V.: Improvement of a parsimonious model for streamflow simulation, J. Hydrol., 279, 275289, https://doi.org/10.1016/S00221694(03)002257, 2003. a, b, c, d
Pokhrel, P. and Gupta, H. V.: On the use of spatial regularization strategies to improve calibration of distributed watershed models, Water Resour. Res., 46, W01505, https://doi.org/10.1029/2009WR008066, 2010. a
Quesney, A., Francois, C., Ottle, C., Hegarat, S., Loumagne, C., Normand, M., et al.: Sequential assimilation of SAR/ERS data in a lumped rainfallrunoff model with an extended Kalman filter, IAHS Publ., 267, 495–497, 2000. a
Rabier, F. and Courtier, P.: Fourdimensional assimilation in the presence of baroclinic instability, Q. J. Roy. Meteor. Soc., 118, 649–672, 1992. a, b
Rakovec, O., Kumar, R., Attinger, S., and Samaniego, L.: Improving the realism of hydrologic model functioning through multivariate parameter estimation, Water Resour. Res., 52, 7779–7792, https://www.scopus.com, 2016. a
Riboust, P., Thirel, G., Moine, N. L., and Ribstein, P.: Revisiting a Simple DegreeDay Model for Integrating Satellite Data: Implementation of SweSca Hystereses, J. Hydrol. Hydromech., 67, 70–81, 2019. a
Roux, H., Labat, D., Garambois, P.A., Maubourguet, M.M., Chorda, J., and Dartus, D.: A physicallybased parsimonious hydrological model for flash floods in Mediterranean catchments, Nat. Hazards Earth Syst. Sci., 11, 2567–2582, https://doi.org/10.5194/nhess1125672011, 2011. a
Samaniego, L., Kumar, R., and Attinger, S.: Multiscale parameter regionalization of a gridbased hydrologic model at the mesoscale, Water Resour. Res., 46, W05523, https://doi.org/10.1029/2008WR007327, 2010. a
Santos, L., Thirel, G., and Perrin, C.: Continuous statespace representation of a buckettype rainfallrunoff model: a case study with the GR4 model using statespace GR4 (version 1.0), Geosci. Model Dev., 11, 1591–1605, https://doi.org/10.5194/gmd1115912018, 2018. a
Seo, D.J., Cajina, L., Corby, R., and Howieson, T.: Automatic state updating for operational streamflow forecasting via variational data assimilation, J. Hydrol., 367, 255–275, https://doi.org/10.1016/j.jhydrol.2009.01.019, 2009. a
Sun, L., Nistor, I., and Seidou, O.: Streamflow data assimilation in SWAT model using Extended Kalman Filter, J. Hydrol., 531, 671–684, 2015. a
Sun, L., Seidou, O., Nistor, I., and Liu, K.: Review of the Kalmantype hydrological data assimilation, Hydrolog. Sci. J., 61, 2348–2366, 2016. a
Thirel, G., Martin, E., Mahfouf, J.F., Massart, S., Ricci, S., and Habets, F.: A past discharges assimilation system for ensemble streamflow forecasts over France – Part 1: Description and validation of the assimilation system, Hydrol. Earth Syst. Sci., 14, 1623–1637, https://doi.org/10.5194/hess1416232010, 2010. a
Tramblay, Y., Bouvier, C., Crespy, A., and Marchandise, A.: Improvement of flash flood modelling using spatial patterns of rainfall: A case study in southern France, in: Sixth World FRIEND Conference, the Global Change: Facing Risks and Threats to Water Resources, October 2010, Fez, Morocco, IAHS Publ. 340, 172–178, 2010. a, b
Vannier, O., Braud, I., and Anquetin, S.: Regional estimation of catchmentscale soil properties by means of streamflow recession analysis for use in distributed hydrological models, Hydrol. Process., 28, 6276–6291, https://doi.org/10.1002/hyp.10101, 2014. a
Vannier, O., Anquetin, S., and Braud, I.: Investigating the role of geology in the hydrological response of Mediterranean catchments prone to flashfloods: Regional modelling study and process understanding, J. Hydrol., 541, 158–172, https://doi.org/10.1016/j.jhydrol.2016.04.001, 2016. a
Vidal, J. P., Martin, E., Franchistéguy, L., Baillon, M., and Soubeyroux, J. M.: A 50year highresolution atmospheric reanalysis over France with the Safran system, Int. J. Climatol., 30, 1627–1644, https://doi.org/10.1002/joc.2003, 2010. a
Wang, J., Hong, Y., Li, L., Gourley, J. J., Khan, S. I., Yilmaz, K. K., Adler, R. F., Policelli, F. S., Habib, S., Irwn, D., et al.: The coupled routing and excess storage (CREST) distributed hydrological model, Hydrolog. Sci. J., 56, 84–98, 2011. a
Zhu, C., Byrd, R. H., Lu, P., and Nocedal, J.: LBFGSB: a limited memory FORTRAN code for solving bound constrained optimization problems, Technical Report No. NAM11, EECS Department, Northwestern University, Evanston, IL, 1994. a