Assimilation of wideswath altimetry water elevation anomalies to correct largescale river routing model parameters
 ^{1}LEGOS, 16 Avenue Edouard Belin, 31400 Toulouse, France
 ^{2}Jet Propulsion Laboratory, California Institute of Technology, Pasadena, CA, USA
 ^{3}CNRMGAME, MeteoFrance, 42 Avenue Gaspard Coriolis, 31000 Toulouse, France
 ^{4}CECI, Université de Toulouse, CERFACS, CNRS, 42 Avenue Gaspard Coriolis, 31057 Toulouse CEDEX 1, France
 ^{5}Magellium, 1 Rue Ariane, 31520 RamonvilleSaintAgne, France
 ^{a}now at: CSGroup, Space Business Unit, 31500 Toulouse, France
 ^{1}LEGOS, 16 Avenue Edouard Belin, 31400 Toulouse, France
 ^{2}Jet Propulsion Laboratory, California Institute of Technology, Pasadena, CA, USA
 ^{3}CNRMGAME, MeteoFrance, 42 Avenue Gaspard Coriolis, 31000 Toulouse, France
 ^{4}CECI, Université de Toulouse, CERFACS, CNRS, 42 Avenue Gaspard Coriolis, 31057 Toulouse CEDEX 1, France
 ^{5}Magellium, 1 Rue Ariane, 31520 RamonvilleSaintAgne, France
 ^{a}now at: CSGroup, Space Business Unit, 31500 Toulouse, France
Correspondence: Charlotte Marie Emery (charlotte.emery@jpl.nasa.com)
Hide author detailsCorrespondence: Charlotte Marie Emery (charlotte.emery@jpl.nasa.com)
Land surface models combined with river routing models are widely used to study the continental part of the water cycle. They give global estimates of water flows and storages, but they are not without nonnegligible uncertainties, among which inexact input parameters play a significant part. The incoming Surface Water and Ocean Topography (SWOT) satellite mission, with a launch scheduled for 2021 and with a required lifetime of at least 3 years, will be dedicated to the measuring of water surface elevations, widths and surface slopes of rivers wider than 100 m, at a global scale. SWOT will provide a significant number of new observations for river hydrology and maybe combined, through data assimilation, with globalscale models in order to correct their input parameters and reduce their associated uncertainty. Comparing simulated water depths with measured water surface elevations remains however a challenge and can introduce within the system large bias. A promising alternative for assimilating water surface elevations consists of assimilating water surface elevation anomalies which do not depend on a reference surface. The objective of this study is to present a data assimilation platform based on the asynchronous ensemble Kalman filter (AEnKF) that can assimilate synthetic SWOT observations of water depths and water elevation anomalies to correct the input parameters of a largescale hydrologic model over a 21 d time window. The study is applied to the ISBACTRIP model over the Amazon basin and focuses on correcting the spatial distribution of the river Manning coefficients. The data assimilation algorithm, tested through a set of observing system simulation experiments (OSSEs), is able to retrieve the true value of the Manning coefficients within one assimilation cycle much of the time (basinaveraged Manning coefficient root mean square error, RMSEn, is reduced from 33 % to [1 %–10 %] after one assimilation cycle) and shows promising perspectives with assimilating water anomalies (basinaveraged Manning coefficient RMSEn is reduced from 33 % to [1 %–2 %] when assimilating water surface elevation anomalies over 1 year), which allows us to overcome the issue of unknown bathymetry.
Global hydrological models (GHMs) are extensively exploited to study the continental component of the global water cycle (Doll et al., 2015; Sood and Smakhtin, 2015). Such models have been extensively developed over the past 2 decades in order to quantify freshwater flows and storage changes over continental surfaces (Bierkens, 2015). They are based on the coupling of a land surface model (LSM) with a river routing model (RRM). As an example, the ISBACTRIP (Decharme et al., 2019) hydrologic model results from the coupling of the ISBA LSM (Noilhan and Planton, 1989) and the TRIP RRM (Oki and Sud, 1998). LSMs simulate the energy and water balance at the soil–atmosphere–vegetation interface, while RRMs emulate the lateral transfer of freshwater toward the continent–ocean interface. The current study focuses on the river component of the terrestrial water cycle simulated by the RRM.
GHMs give a global view of the state of the water flow and storage at model spatial and temporal resolutions. Nonetheless, they suffer from multiple sources of uncertainties which are related to the model structure, the external forcing and the input parameters (Liu and Gupta, 2007; Renard et al., 2010). Model structure uncertainties initially arose from a lack of knowledge of the hydrologic processes or from simplifying assumptions made to limit simulation computational cost. Still, with the increase in computational power, models are more and more complex (Liu and Gupta, 2007; Melsen et al., 2016): they run at finer spatial resolution, they include new physical processes and they use an increasing number of fully distributed forcing and parameter datasets (Liu et al., 2012). This has led to an increase in the number of model input parameters and potentially inflates the model uncertainty in those parameters. Input parameters express the spatial and/or temporal properties of the system. The spatial scale of parameters measurable on the field may differ from the model scale, while other conceptual parameters are not directly observable and measurable on the field (Moradkhani et al., 2005; Melsen et al., 2016) and are inferred using a geomorphological empirical formula and/or indirect methods such as calibration (Gupta et al., 1998; Beven, 2012).
Another way in which to study the terrestrial water cycle is to use direct observations of the system. Most parts of the terrestrial water cycle are currently observed and measured from in situ or remote techniques (Sanoo et al., 2011; Vinukollu et al., 2011; Rodell et al., 2015). For the observations of rivers, in situ techniques measure river water elevations at gauge stations. In situ measurements are commonly very accurate and also frequent (i.e., subdaily), but their main limitation is their spatially sparse sampling and their decreasing number over recent decades at a global scale (International Association of Hydrological Sciences Ad Hoc Group on Global Water Sets et al., 2001). Coincidentally, remotely sensed data provided by satellite missions have increased quite significantly since the 90s and deliver effective river observations. The most common instrument operating to assess river water levels remains the nadir altimeter. Nadir altimetry gives localized water elevation measurements along the satellite ground track. Initially, altimeters were designed to monitor ocean topography, but their application has broadened to the observation of lakes (Cretaux et al., 2009), floodplains (Birkett et al., 2002) and, later on, rivers (Silva et al., 2010). However, their main limitation remains their limited spatial and temporal samplings: generally several days between two consecutive measurements at a limited number of locations. Besides, over continental surfaces, the signal is not always retrievable. Current river observations therefore provide a more accurate view of the river system than models, but they are quite limited by their sparse availability in space and time.
The incoming Surface Water and Ocean Topography (SWOT) mission, jointly developed by NASA, CNES, CSA and UKSA and scheduled for launch in 2021, will be dedicated to the observation of continental free surface water with a better spatial and temporal coverage than the current nadir missions (such as EnviSat, the JASON series or also Sentinel3A/B). SWOT's main payload, called KaRIn, for Kaband Radar INterferometer (Fjørtoft et al., 2014), will observe surfaces under two swaths of 50 km each separated by a nadir gap of 20 km and will have a nearglobal coverage. For hydrology, SWOT will observe rivers wider than 100 m as well as lakes and wetlands larger than 250 × 250 m^{2} within the latitudes 78^{∘} S and 78^{∘} N and with a revisit time of 21 d. SWOT will provide twodimensional images of water surface elevations with a vertical accuracy of 10 cm when averaged over 1 km^{2} of water area. Along with water surface elevation measurements in rivers, SWOT will also provide observations of river width, surface slope and estimates of discharge based on SWOT observations. SWOT will provide a significant amount of new data for surface hydrology. It will give an ensemble of constraints that will allow a better depiction of surface water in hydrological models. These new data could be combined or integrated into globalscale hydrological models in order to correct them and improve their performances and forecasting capabilities.
Data assimilation techniques are a set of mathematical methods which combine a physical model and related external measurements, taking their relative uncertainties into account. Data assimilation aims at improving the model's ability to forecast and/or emulate the physical system's evolution. For this purpose, data assimilation methods are built to correct either the model's outputs (state estimation) or the model's input parameters (parameter estimation or PE), and sometimes both simultaneously. Data assimilation for state estimation has been widely applied in meteorology and oceanography and is more and more developed for largescale terrestrial hydrology (Clark et al., 2008; Michailovsky et al., 2013; Paiva et al., 2013; Emery et al., 2018). Data assimilation for PE in hydrology was initially developed as a dynamic alternative to model calibration (Montzka et al., 2011; Panzeri et al., 2013; Ruiz et al., 2013; Shi et al., 2015). In most models, parameters are assumed to be constant in time, whereas, in reality, they may vary seasonally or under evolving climate and/or anthropogenic conditions. Sequential data assimilation can therefore help track model parameter variations in time (Kurtz et al., 2012; Deng et al., 2016; Pathiraja et al., 2016). PE is also used to retrieve conceptual parameters of hydrologic models such as friction coefficients (Pedinotti et al., 2014; Oubanas et al., 2018; Hafliger et al., 2019) or residence times of quick and slowflow reservoirs and partition of runoff excess (Vrugt et al., 2012; Pathiraja et al., 2016), which can not be directly measured.
Before launch, in the preparatory phase, observing system simulation experiments (OSSEs) can be performed in order to assess the benefits of assimilating SWOT data into a hydrological model and to evaluate the most adapted methodologies to assimilate these data into models. Several studies assimilating synthetic and/or simplified SWOTlike data have been published so as to evaluate the correction of river model state, namely river depth (Andreadis et al., 2007; Biancamaria et al., 2011), river storages (Munier et al., 2015) and river discharges (Andreadis and Schumann, 2014), at various scales. But also, several studies focused on the possibility of using SWOT data to retrieve critical river parameters such as river bathymetry (Durand et al., 2008; Yoon et al., 2012; Mersel et al., 2013) and/or riverbed roughness/friction coefficient (Pedinotti et al., 2014; Oubanas et al., 2018; Hafliger et al., 2019). Indeed, SWOT is a scientific mission with a 3year nominal lifetime. Therefore, SWOT observations will help to better calibrate hydrological models and to improve their performances even over time periods beyond its lifetime. Moreover, other studies using real remotesensing data have also been published and give insight into the challenges related to the assimilation of spaceborne products, such as Michailovsky et al. (2013), Michailovsky and BauerGottwein (2014) and Emery et al. (2018), which assimilate nadir radar altimetry data.
In the present study, a data assimilation framework is used to correct input parameters of the largescale ISBACTRIP model. More specifically, synthetic SWOT observations of water surface depths and anomalies are assimilated in order to correct the spatially distributed riverbed friction coefficients (or Manning coefficients). As SWOT will not directly measure water depths (it provides water elevation and the bathymetry is required to derive water depth), the purpose of this study is to evaluate the possibility of assimilating water elevation anomalies to correct the model's parameters and to assess how the assimilation performances are impacted, compared to the direct assimilation of water depths. Assimilating water elevation anomalies is done to overcome a potential lack of bathymetry data.
This study is presented as a complementary study to that of Emery et al. (2018), which is dedicated to the state estimation (river storage and discharge) of the same ISBACTRIP model, using real satellitebased discharge products. The choice of the roughness coefficient as a control variable was made following the results from the ISBACTRIP sensitivity analysis in Emery et al. (2016). In this preliminary study, the sensitivity of the simulated water depths and also anomalies to several river input parameters (such as riverbed width, depth, slope and also friction coefficient) was evaluated. The results showed that the highest sensitivity was in the Manning coefficient.
This study is, furthermore, also built on the conclusions from the work of Pedinotti et al. (2014). In our study, an ensemble Kalman filter (EnKF) is used (instead of the extended Kalman filter in Pedinotti et al., 2014) to better account for the nonlinearities of the system and to better estimate the model errors. Also, Pedinotti et al. (2014) chose to update the Manning coefficient distribution at the grid cell scale, and the question of equifinality arose (Beven and Freer, 2001) in their results. For the current study, it was decided to update the Manning coefficient distribution not at the gridcell resolution, but at a coarser zonal resolution, by applying multiplying correcting factors uniformly over each zone, identical to the one used in Emery et al. (2016). Finally, Pedinotti et al. (2014) used an assimilation window of 2 d. This configuration resulted in updated Manning coefficient time series displaying “unrealistic jumps” with a frequency of about 20 d associated with the orbit repeat cycle (longer than the 2 d window). To avoid this phenomenon, the present study uses an assimilation window of 21 d corresponding to the current SWOT orbit repeat cycle.
Section 2 will first give a description of the ISBACTRIP model used for this study. Section 3 will present the particular data assimilation method developed for this study and, finally, after presenting the assimilation strategy in Sect. 4, Sects. 5 and 6 will give the data assimilation results.
2.1 The ISBACTRIP largescale hydrological model
The ISBA model (Noilhan and Planton, 1989) is a LSM defined at global scale on a 0.5^{∘} × 0.5^{∘} regular mesh grid that establishes the energy and water budget over continental surfaces. This study operates the ISBA3L version based on a threelayer soil (Boone et al., 1999). The budget equations are solved separately on each grid cell. Still, largerscale spatial patterns in the radiative and precipitation forcing, the soil composition and the vegetation cover ensure spatial correlations between those cells (for more details, see Decharme et al., 2012, 2019). In particular, ISBA gives a diagnostic of the surface runoff (Q_{ISBA,sur}) and the gravitational drainage (Q_{ISBA,sub}, i.e., water percolating to the deep layers of the soil) later used as forcing inputs for the RRM denoted CTRIP.
The CTRIP model (Decharme et al., 2010, 2012, 2019), is defined on the same mesh grid as ISBA and follows a river network to laterally transfer water from one cell to another, down to the interface with the ocean (Oki and Sud, 1998). The study is based on the CTRIP version from Decharme et al. (2012) with three reservoirs, as illustrated in Fig. 1a. The water mass (kg) stored in a groundwater reservoir G and a floodplain reservoir F interacts with the water mass in the surface reservoir S representing the river. Only the surface reservoir S is related to the river network and fills with the surface runoff Q_{ISBA,sur}, the outflow from upstream cells and the delayed drainage Q_{ISBA,sub} by means of the groundwater reservoir. Occasionally, when the amount of water in the river exceeds a given threshold (defined by the water level in the reservoir), the river spills into the floodplains.
2.2 CTRIP parameters
Within a 0.5^{∘} × 0.5^{∘} cell, the surface reservoir is a unique river channel that may gather multiple real river branches. Its rectangular cross section is described by its slope s (–), its width W (m), its bankful depth H_{c} (m), its length L (m) and finally a Manning or friction coefficient N ($\mathrm{s}\phantom{\rule{0.125em}{0ex}}{\mathrm{m}}^{\mathrm{1}/\mathrm{3}}$) that assesses the reach resistance at the bottom of the river.
Each cell's elevation is deduced from the STN30p digital elevation model (http://daac.ornl.gov/ISLSCP_II/islscpii.shtml, last access: 20 April 2020). These elevations are then compared to determine the riverbed slope s. Global empirical geomorphologic relationships are used to define the river width W and bankful depth H_{c}. The arc length between grid cell centers, inflated by a meandering factor μ, results in the river reach length L. More details on these parameters can be found in Oki and Sud (1998) and Decharme et al. (2012).
The Manning coefficient N is generally more complicated to estimate. Following Maidment (1993), it should take values between 0.025 and 0.03 for natural streams and values between 0.075 and 0.1 for smaller and mountainous tributaries and also floodplains. Global studies can apply either a constant (Beighley et al., 2009; Biancamaria et al., 2009) or a spatially distributed (Decharme et al., 2012) Manning coefficient. However, it is ordinarily accepted that this parameter should vary in space and even in time across the river catchment. Consequently, CTRIP uses a spatially distributed Manning coefficient based on a simple linear relationship between the relative stream size in the current cell, denoted SO, and the size at the river mouth and the source cells, so that
SO is the stream size relative measure at the current cell, SO_{max} (whose value depends on the network depth) the same measure at the river mouth and SO_{min}=1 the measure at source cells (namely cells without any upstream cells, according to the river network). The Manning coefficient is then set to be constant in time while its spatial values decrease towards the river outlet (following the river network), with values between N_{min}=0.04 and N_{max}=0.06.
All these parameters are eventually essential to estimate the spatially and timevarying average crosssectional flow velocity in the surface reservoir v(t) following the Manning formula (Manning, 1891):
where h_{S} is the river water depth estimated from the river storage S by
and ρ is the water density. The flow velocity is ultimately used to estimate the discharge leaving the CTRIP cell:
As the definitions of most of these parameters are based on empirical relationships, we have to be aware that they inevitably have substantial uncertainties.
2.3 CTRIP implementation over the Amazon basin
In this study, we present an OSSE test case over the Amazon River basin, whose hydrology is carefully described in Molinier et al. (1993) and Wisser et al. (2010). This choice was motivated by the present work following and complementing studies over the same domain (Emery et al., 2016, 2018).
For ISBACTRIP, the Amazon basin is composed of a total number of 2028 cells. Based on the basin geomorphology and hydrology (Meade et al., 1991), the basin has been split into nine spatial regions. These zones, illustrated in Fig. 1b, were initially introduced in (Emery et al., 2016) and will be reexploited here within the application of data assimilation. For a detailed description of the zones, the reader can refer to Emery et al. (2016).
2.4 ISBACTRIP forcing
For the present study, ISBACTRIP needs external atmospheric forcing in order to run. Similarly to Emery et al. (2016), such data are provided by Global Soil Wetness Projet 3 (GSWP3, http://hydro.iis.utokyo.ac.jp/GSWP3, last access: 20 April 2020) at a 3hourly time resolution.
3.1 OSSE framework
In OSSEs, we introduce beforehand a reference configuration for the model input parameters that we will consider thereafter to be the truth. From those true parameters, we directly deduce the true run from a ISBACTRIP model integration. The synthetic observations used for data assimilation are obtained by perturbing the true observables (variables that are used as observations) using an error model that is representative of the real observation errors. The control variables (the model variables to be corrected with data assimilation) first guess is obtained by directly perturbing the true control variables. Control error also has to be chosen to be representative of the real modeling errors. OSSEs are prerequisite tests to ensure that the implementation of the EnKF algorithm is correct and adapted to the hydrologic problem under consideration (temporal/spatial length scales, sources of uncertainty, observation operator, etc.).
3.2 Data assimilation variables
3.2.1 Observation variables and their errors
The observation vector, denoted ${\mathit{y}}_{k}^{\mathrm{o}}$ at the assimilation cycle k, is composed of the n_{y} available observations at cycle k:
where ${y}_{k,j}^{\mathrm{o}}$, j=1 … n_{y} is the jth observation among the n_{y} at cycle k.
In the present study, the observed variables are water depths issued from a simplified SWOT simulator. Note that this simulator will produce water depths, while the real SWOT satellite will provide water elevations. As in Biancamaria et al. (2011) and Pedinotti et al. (2014), this SWOT simulator replicates SWOT spatiotemporal coverage. At a given date, the simulator selects the ISBACTRIP cells contained (at least 50 % of their area) in the SWOT ground tracks. Figure 2 shows some selected ISBACTRIP cells under the real swaths over the Amazon basin. The true run is used as a basis to get the true water depths ${\mathit{y}}_{k}^{t}$. Then, in order to generate the observation vector ${\mathit{y}}_{k}^{\mathrm{o}}$ from the extracted true water depths, each of them is randomly perturbed by adding a white noise characterized by a standard deviation σ^{o} so that
Using water depth observations is a strong simplification of the real SWOT product. Therefore, in order to take into account that SWOT will provide water elevations and not directly water depths, this study will look at the assimilation of both water depths and water anomalies. The method for generating these anomalies will be further detailed in Sect. 4.2.
The observation error is the addition of the measurement error and the representativeness error. The first is associated with inherent instrumental errors when processes are observed and the second represents the error introduced when the observed and simulated variables are not exactly the same (in nature or scale). Following the SWOT uncertainty requirements (Esteban Fernandez, 2017), SWOTlike water surface elevation measurements have a vertical accuracy of 10 cm (when averaged over a water area of 1 km^{2}). This uncertainty accounts for measurement errors due to the remotely sensed acquisition such as instrumental thermal noise, speckle, troposphere and ionosphere effects. Moreover, we omit error correlations along the swath so that observation errors follow a white noise model. Accounting for spatially correlated observation errors is an active research area in the field of data assimilation (Guillet et al., 2018), which is beyond the scope of demonstrating the feasibility of assimilating SWOTtype data. In the framework of OSSEs, observed and simulated water depths have the same scale as the ISBACTRIP model which is used to generate both. In the following, we assume therefore that there is no representativeness error related to the scale in the system. However, it is worth acknowledging that we should expect higher errors in water depths, compared to water elevations, as we do not know the bathymetry. Assimilation of water depths is performed as a benchmark against which assimilation of water anomalies will be compared. Ultimately, σ^{o} is chosen as being equal to 10 cm for all observed variables (i.e., both water depths and water elevation anomalies).
3.2.2 Control, observation space and their errors
The control vector is denoted by ${\mathit{x}}_{k}\in {\mathbb{R}}^{{n}_{x}}$. It includes the n_{x} uncertain variables to be estimated through the kth data assimilation cycle. The choice of the control variables determines the observation operator ℋ_{k}:
where y_{k} are the simulated observables; in other words, ℋ_{k} maps the control variables onto the observation space. They are then compared to the measured observations ${\mathit{y}}_{k}^{\mathrm{o}}$ during the data assimilation experiment. This difference is referred to as the innovation vector.
Following the conclusions from Emery et al. (2016), we determined that assimilating waterdepthlike observations would be efficient for the correction of the distribution of the river Manning coefficients. These coefficients are spatially distributed at the gridcell scale. However, from Pedinotti et al. (2014), equifinality issues were raised through the correcting of the distribution at this scale. They also affected its upstreamtodownstream spatial distribution. We chose to correct it therefore by applying multiplying factors defined at a coarser scale, namely at the scale of the nine hydrogeomorphological areas defined in Sect. 2.3 and illustrated in Fig. 1b. Within the same area, the Manning coefficient values are all identically modified by being multiplied by the same factor.
The control vector is composed therefore of the n_{x} multiplying factors N_{mult,i}, i=1 … n_{x}, applied to the correction of the spatial distribution of the river Manning coefficient:
giving n_{x}=9.
The observation operator ℋ_{k} maps the control variables (Manning coefficient dimensionless multiplying factors) into the observables (river water depths in meters) as follows:

first, apply the multiplying factors (x_{k}) to the Manning coefficient distribution;

then, apply the ISBACTRIP model ${\mathcal{M}}_{[k\mathrm{1},k]}$ over the assimilation window $[k\mathrm{1},k]$ to determine the model states that correspond to the perturbed Manning coefficient distribution;

afterwards, turn the CTRIP surface water storage into equivalent water depths following Eq. (3) (we denote by 𝒵_{k} the diagnostic operator turning the surface storage variable into the water depth variable);

finally, select the simulated water depths under the SWOT swath mask (we denote by 𝒮_{k} this operator).
The observation operator is therefore the composition of three operators:
Such a nonlinear observation operator ℋ_{k} is difficult to formulate explicitly, which is why we use an EnKF algorithm to estimate the Kalman gain in a statistical way.
3.3 The EnKF general formulation
In the EnKF framework, the model ${\mathcal{M}}_{[k\mathrm{1},k]}$ and observation ℋ_{k} operators are generally not linear. The main assumption for the EnKF is to use stochastic ensembles to represent first and secondorder moments (namely the means and the covariances) of the control variable errors (Evensen, 1994, 2003). Indeed, it is assumed that the distribution of the ensemble is similar to that of the error of the control vector, and it is also assumed that the probability density function (PDF) of the error is Gaussian and thus well described by its first and second moments. The background control variables ${\mathit{x}}_{k}^{\mathrm{b}}$ (the first guess) are therefore represented by an ensemble of n_{e} members:
To avoid the collapsing of the ensemble, the observation vector in Eq. (5) is randomized by adding a supplementary white noise with the same observation error standard deviation σ^{o} (Burgers et al., 1998) so that
An observation ensemble is generated:
Note that alternatives exist to the observation randomization chosen here. However, for the present study, we choose to use a full stochastic filter.
Finally, the EnKF analysis step is applied to each member of the ensemble so that
where K_{e,k} is the Kalman gain. It is built from the control and observation error covariance matrices P and R and the linearized observation operator H so that (see Appendix A for more details)
Figure 3 summarizes the general OSSE framework used for the present study. The figure reads from top to bottom and from left to right. An assimilation cycle $[k\mathrm{1},\phantom{\rule{0.33em}{0ex}}k]$ includes a forecast step where an ensemble of ISBACTRIP simulations is integrated, each member having a different spatially distributed Manning coefficient; an analysis step where the ensemble of Manning coefficients is corrected using synthetic observations through the Kalman filter update in Eq. (13); and a cycling step where the ISBACTRIP model is rerun with these analysis estimates to obtain updated model states.
3.4 SWOTbased data assimilation special feature
3.4.1 Choice of the assimilation window
We use a 21 d assimilation window corresponding to a SWOT orbit revisit period. During one assimilation window, every pixel under the observation mask is therefore observed at least once. However, this implies that new observations are available at times which differ from the update time. Such a case has already been addressed in several studies. The ensemble Kalman smoother (EnKS) for example, introduced by Evensen and Leeuwen (2000), is a direct extension of the EnKF. It consists of generating an update of the control variables taking into consideration the present and all past observations when a new observation is available. The EnKS is actually a sequential version of the ensemble smoother (Leeuwen and Evensen, 1996). The latter takes into consideration all past and future observations, but turned out to be less effective than the EnKF and the EnKS (Evensen, 1997; Evensen and Leeuwen, 2000). Alternatively, Hunt et al. (2004) developed the 4DEnKF (4D as in the 4DVAR variational assimilation methods; Talagrand and Courtier, 1987), which also assimilates observations available at different time steps. In the 4DEnKF, all model observations are expressed as a linear combination of the model observations at analysis time, and the problem is transformed into a classic EnKF problem. Similarly, Hunt et al. (2007) also presented an asynchronous version of the local ensemble transform Kalman filter (Bishop et al., 2001; Ott et al., 2004). In the framework of the present study, we apply an asynchronous ensemble Kalman filter (AEnKF) as described by Sakov et al. (2010) and Rakovec et al. (2015). The principle is to increase the dimension of the state in order to consider observations at past and analysis times. This increases the dimension of the matrices which contain covariances between observations available at different times. To our knowledge, the AEnKF has not been used for parameter estimation, as Sakov et al. (2010) and Rakovec et al. (2015) described the method for state estimation experiments.
3.4.2 The asynchronous EnKF
To start with, k represents the assimilation cycle index, but it needs to be distinguished from the day index (within the assimilation cycle), which is the time unit for the observations. We will then denote by k(i), i=1 … 21 the ith day in the current assimilation cycle.
On the ith day of the kth assimilation cycle, the n_{y,k(i)} observations are gathered in the vector ${\mathit{y}}_{k\left(i\right)}^{\mathrm{o}}$. The overall observation vector at cycle k, ${\mathit{y}}_{k}^{\mathrm{o}}$, then concatenates the 21 daily observation vectors ${\mathit{y}}_{k\left(i\right)}^{\mathrm{o}}$ so that
Similarly to the observation vector, the overall observation operator at the kth cycle, ℋ_{k}, is the concatenation of the daily observation operators ℋ_{k(i)}, defined from Eq. (9), but by considering the operator ℳ integrating the model between k(0) ($=(k\mathrm{1})\left(\mathrm{21}\right)$) and k(i) as well as the diagnostic and selection operators, a time step k(i), 𝒵_{k(i)} and 𝒮_{k(i)}.
The observation error covariance matrix R_{k} is the concatenation of the daily observation error covariance matrices:
where ${\mathbf{I}}_{{n}_{y,k\left(i\right)}}$ is the identity matrix of size ${n}_{y,k\left(i\right)}\times {n}_{y,k\left(i\right)}$. It turns out that
Following the same equations as the EnKF, the AEnKF generates, for each member l=1 … n_{e} of the control ensemble, an analysis control vector ${\mathit{x}}_{k}^{a,\left[l\right]}$.
3.4.3 Generation of the ensemble
To generate the background control ensemble, we solely stochastically perturb the variables within the control vector. Note that it amounts to the assumption that all other features of the forward model, e.g., the atmospheric forcings, the LSM structure and therefore the surface and subsurface runoff, are perfect. While this applies for OSSEs, such features are never perfect in realcase experiments. This assumption is further discussed in Sect. 7.
The ensemble of background control vectors ${\mathbf{X}}_{e,k}^{\mathrm{b}}$, of size n_{x}×n_{e}, is generated so that ${\mathit{x}}_{k}^{b,\left[l\right]}$, l=1 … n_{e} follows a Gaussian law of mean $\stackrel{\mathrm{\u203e}}{{\mathit{x}}_{k}^{\mathrm{b}}}$ and covariance matrix ${\mathbf{P}}_{e,k}^{\mathrm{b}}$. For the first assimilation cycle, the control variables' mean value $\stackrel{\mathrm{\u203e}}{{\mathit{x}}_{\mathrm{1}}^{\mathrm{b}}}$ is arbitrarily chosen as the openloop run input parameter (the openloop or free run is the model run without assimilation) and the background error covariance matrix ${\mathbf{P}}_{e,\mathrm{1}}^{\mathrm{b}}$ is a diagonal matrix defined as
with ${\mathbb{I}}_{{n}_{x}}$ the identity matrix of size n_{x}×n_{x} and σ^{b} the vector that gathers the initial control variable error standard deviation.
Once the analysis ensemble ${\mathbf{X}}_{e,k}^{a}$ is determined, the next step is to propagate the correction in time. In a PE framework, it is necessary to rerun the ensemble model runs during the current assimilation window with the analysis parameters as inputs. Then, the contribution of the updated parameters is propagated through the model, up to the end of the current assimilation window, and put into the model initial condition for the next assimilation cycle.
For the next assimilation cycles, the background mean estimate is set equal to the analysis mean estimate from the previous cycle:
There are different ways of defining ${\mathbf{P}}_{e,k}^{\mathrm{b}}$. One could choose to stochastically estimate ${\mathbf{P}}_{e,k\mathrm{1}}^{a}$ from the analysis ensemble at the previous cycle and use it as ${\mathbf{P}}_{e,k}^{\mathrm{b}}$. Contrary to state estimation experiments where the analysis error covariance matrix is propagated in time using the model along with the control variables, parameter estimation experiments use it directly as the background error covariance matrix as there is no dynamical model for the Manning coefficient. The issue with this approach is that the analysis ensemble variance can be strongly reduced and provide too small an ensemble spread to have efficient AEnKF updates in time. To ensure that enough uncertainty is maintained in the ensemble, one can maintain the initial background error covariance matrix through all cycles or impose a minimal value for the variance elements (see Sect. 5.4).
The background error crosscovariance matrix [PH^{T}]_{e,k} and covariance matrix [HPH^{T}]_{e,k} are directly built from the definition suggested by Evensen (2004), Moradkhani et al. (2005) and Durand et al. (2008); see Appendix A for more details. The matrices are of sizes ${n}_{x}\times {n}_{y,k}$ and ${n}_{y,k}\times {n}_{y,k}$, respectively. The elements in the error crosscovariance matrices result directly from the characterization of the background ensemble; i.e., the parameter uncertainties accounted for the generation of the control matrix ${\mathbf{X}}_{e,k}^{\mathrm{b}}$ and $\mathcal{H}\left({\mathbf{X}}_{e,k}^{\mathrm{b}}\right)$.
In the incoming experiments, the true control variables x^{t} are
and the a priori values at the first assimilation cycle $\stackrel{\mathrm{\u203e}}{{\mathit{x}}_{\mathrm{1}}^{\mathrm{b}}}$ are
We increase the Manning value in mountainous zones (zone 1 in the Andes and zones 8 and 9 over the shields) and lower the Manning value over the other zones with the lowest values (in zones 2 and 3) corresponding to the main stem. Both true and background values were chosen accordingly.
4.1 Sensitivity tests
During one EnKF assimilation cycle, the analysis potentially depends on the following parameters: model spinup, time period (high/low flow), size of the ensemble, and control error. Note that the observation error also has an impact on the analysis, but its value is already fixed for all subsequent experiments (see Sect. 5).
A first set of experiments (either model runs or data assimilation runs) will serve as sensitivity tests for the data assimilation platform with respect to the above features. During these sensitivity tests, the different features are tested individually. Table 1 details the range of variations for each tested feature.
4.2 Assimilation tests
Following the sensitivity tests, a set of three data assimilation experiments will be run and is presented in Table 2. The data assimilation experiments are divided into two categories: the first one uses water depths as observations, and the second considers water depth anomalies. All experiments are run across a year, corresponding to 17 assimilation cycles of 21 d.
The first experiment, denoted PE1, is configured from the aforementioned sensitivity test outcome. The parameters defining the experiment (spinup, starting date, ensemble size, control error) will be those which provide the best results in the sensitivity tests in Table 2. The reference level between the observed and simulated water depths is also the same. In other words, there is no bias in the observation. This first idealized experiment serves as a proofofconcept as the observation type matches exactly the type of the simulated variables. Consequently, with this experiment, we expect to retrieve the true value of the control variables and hence the correct water depths and discharges.
The next step is to head towards more realistic experiments by including new sources of uncertainties in the data assimilation system and seeing how to address them. In this context, two additional experiments denoted PE2 and PE3 will be carried out. As an example of new uncertainties, SWOT will in fact observe water elevations (water surface elevation as referenced to a geoid or an ellipsoid), whilst CTRIP produces water depths (water surface elevation as referenced to the bottom of the river bed). To perform data assimilation, one needs to convert CTRIP water depths (${h}_{S}^{\text{CTRIP}}$) into CTRIP water elevations (${H}_{\text{alti}}^{\text{CTRIP}}$) or inversely for SWOT. It is highly plausible that this operation induces a bias between the modeled and observed water elevations. A simplified example of such a situation is illustrated in Fig. 4. In this case, SWOT catches the right water elevation dynamic (as ${h}_{S}^{\text{SWOT}}$ and ${h}_{S}^{\text{CTRIP}}$ are equal), but the direct assimilation of SWOT water elevation ${H}_{\text{alti}}^{\text{SWOT}}$ will induce a bias as the elevations of the river bed (${H}_{\text{bed}}^{\text{SWOT}}$ and ${H}_{\text{bed}}^{\text{CTRIP}}$) are different between CTRIP and SWOT.
A solution for the handling of this issue is to assimilate water depth anomalies instead of water depths. The next data assimilation experiments, denoted PE2 and PE3, will therefore test the feasibility of assimilating anomalies. In these experiments, the water depth anomalies are generated by subtracting a timeaveraged reference water depth from the current water depth. For all runs (true, openloop or analysis), this timeaveraged reference water depth is computed as the mean (true, openloop or analysis) water depth over the year before the start of the assimilation window. It is therefore different for each member of the ensemble. Firstly, in experiment PE2, there will still be no bias between the observed and simulated river bathymetry to observe how the assimilation of anomalies performs. Similarly to PE1, we expect this experiment to be able to retrieve the true control and state variables. Finally, the last experiment, PE3, which introduces a constant relative bias between CTRIP and SWOT, will be carried out. For this experiment, we anticipate that the assimilation will still be able to retrieve the model state variables. The use of anomalies as observations should limit the impact of the inserted bias. We do not exclude the possibility however that it may be slightly echoed in the control variables.
5.1 Model spinup sensitivity tests
The objective of the spinup sensitivity tests is to evaluate the minimum spinup period required by the model before applying data assimilation. For this purpose, the model is run several times across 2 years, from 19 December 2006 to 22 December 2008, corresponding to 35 windows of 21 d (735 d).
A first simulation is run using the true Manning spatial distribution (see Eq. 18) over the 2year time period. We then run 18 additional simulations over the same period with a varying length of the spinup period (see Table 1). Initially, the simulation setup corresponds to the openloop configuration with the openloop Manning spatial distribution (see Eq. 19). At a given time during the first year, the Manning spatial distribution is instantaneously changed to the true distribution (see Eq. 18) and the model is run until the end of the 2 years with the true Manning spatial distribution. Table B1 summarizes, for each run, the date when the Manning coefficients are changed. The spinup period (expressed as a number of windows of 21 d) corresponds to the period between when the Manning distribution is changed and the start of the second year, i.e., 1 January 2008.
To evaluate the spinup impact, the relative difference between the reference run and the test runs is evaluated over the second year of simulation (from 1 January to 22 December 2008) and averaged over every window of 21 d. Figure 5 presents the results for the spinup sensitivity test. Each test run (on the x axis) is identified by its corresponding spinup period length (expressed as the number of windows of 21 d; see Table B1). We then count (on the y axis) the number of 21 d windows during which the relative error between the test run and the reference run is higher than a given threshold. We assume that the spinup period is long enough when this number is equal to 0. This number is evaluated from the basinaveraged relative difference and from the relative difference at the downstream station of Óbidos, both in terms of water depth and discharge. Note that in Fig. 5, when the number of spinup windows of 21 d is equal to 4 on the x axis, the change from the openloop Manning distribution to the true one is imposed on 9 October 2007. Similarly, when this number is equal to 10, the change is imposed on 5 June 2007. Note also that two thresholds are considered, 0.01 and 0.001. Basinaveraged results are not sensitive to this threshold. There are some differences at Óbidos; however, we retain the basinaveraged results to evaluate the required model spinup period.
From all of the results we conclude that a minimum spinup period of four windows of 21 cycles, i.e., 84 d, is required. This period corresponds to the basin concentration time or, in other terms, the required time for the river network to totally empty. In the following sensitivity tests, the model runs start on 9 October 2007, to be consistent with these results.
5.2 Data assimilation starting date sensitivity tests
To evaluate the impact of the starting date, a set of 17 onecycle long data assimilation experiments is carried out over the second running year. All experiments have the same general configuration, except for the initial date starting from 1 January 2008 and shifted by 21 d until 22 December 2008. This means that the last experiment starts on 2 December 2008. The performance of each experiment is evaluated by simply evaluating the spatial average difference between the analysis and the true Manning coefficients. Results presented in Fig. 6a and b indicate that there are no significant differences between the data assimilation experiments (for all 17 experiments, the error of the updated Manning coefficients with respect to the true value of the coefficients is below 5 %).
Note that the results in Fig. 6a–b show a slight increase in the errors for the experiment starting at the end of the year, presently from 9 September to 2 December 2008. This period corresponds to the lowflow season in the Amazon hydrological cycle. Concerning the sensitivity analysis results (Emery et al., 2016), the water depths showed a very low sensitivity to the Manning coefficient during the lowflow season. As a consequence of data assimilation, the EnKF is less effective in lowflow seasons in the correction of the Manning coefficient. The analysis relative error (in Fig. 6a) and the analysis ensemble dispersion (in Fig. 6b) are therefore higher in the lowflow season.
For all of the following sensitivity tests, we are only considering therefore oneassimilationcycle experiments, which will start on 1 January 2008.
5.3 Ensemble size sensitivity tests
The next sensitivity test is dedicated to the ensemble size n_{e}, a critical parameter of any EnKF algorithm. This parameter has to be high enough so as to accurately estimate the Kalman gain matrix but low enough to limit the computational cost (the higher n_{e}, the more model runs are required over each data assimilation window to obtain the analysis estimate of the Manning coefficients).
We consider different ensemble sizes through a oneassimilationcycle experiment; n_{e} varies between 10 and 200. Figure 6c compares the analysis Manning coefficient relative error for each ensemble size n_{e} as in Fig. 6a. Results show that the analysis relative error decreases when the ensemble size n_{e} increases. For an ensemble size n_{e} equal to 20, the analysis error is below 5 %. Also, for an ensemble size n_{e} higher than 50, the analysis relative error has converged to a constant value, while the analysis ensemble dispersion shown in Fig. 6d stabilizes.
These results indicate that the ensemble size for future data assimilation experiments should be at least equal to 20; we consider n_{e}=25 in the present study to limit computational time.
5.4 Model error standard deviation sensitivity tests
In this study, we only consider parameter estimation, implying that the background error covariance matrix is associated with the parameter space. We assume that the errors in the Manning coefficients are independent, so that the background error covariance matrix is initially specified as a diagonal matrix, where all diagonal elements correspond to the error variances in the spatially varying Manning coefficients and are equal to the same variance (σ^{b})^{2}, where σ^{b} is the background error standard deviation. Similarly to previous sensitivity tests, we study here the sensitivity of the data assimilation results to the value of σ^{b} through a oneassimilationcycle experiment. Figure 6c shows, in logarithmic scale on the x axis, the relative error of the updated Manning coefficient with respect to the true coefficient. The analysis error curve shows a decreasing behavior until σ^{b} is on the order of 0.4. It then increases again.
Note that the actual Manning coefficient error before data assimilation is equal to 0.33 (see the blue curve in Fig. 6c showing the openloop Manning coefficient error). Consistently, the best data assimilation results are obtained when σ^{b} provides a good approximation of the real error standard deviation. Note also that when σ^{b} becomes too small, data assimilation is less effective. The EnKF algorithm is known to be underdispersive. Therefore, for future data assimilation experiments, when updating the error covariance matrix from one cycle to another, we will need to make sure that the ensemble dispersion is high enough to cover possible model behavior over the forecast time window by imposing a minimum value for the error variance. Given the sensitivity test results, the minimum value for σ^{b} is set to 0.005. Thus, in the following data assimilation experiments, we use the analysis error covariance matrix as the background error covariance matrix for the next assimilation cycle while applying the minimum threshold value to the matrix diagonal terms.
We now present the results from the data assimilation experiments described in Table 2 and in Sect. 4.2. Recall that these experiments aim at correcting a set of nine multiplying factors applied to the Manning coefficient distribution and constant over nine hydrogeomorphological zones dividing the Amazon basin.
6.1 Assimilation of water depths (PE1)
Figure 7 gives, for each zone, the time evolution of the mean analysis control variable (red) with its dispersion (even though it is very narrow) compared to the truth (black) and the first guess (blue). Similarly, Fig. 8 shows, for each zone, the time evolution of the analysis water depth (red) compared to the truth (black) and the openloop (blue). To generate one plot per zone, we use, for each time step, the ensemble of water depths over all cells in the zone and estimate the median value, the first decile and the ninth decile. Furthermore, zoneaveraged normalized RMSEn statistics are given in Tables D1 and D2 in Appendix D.
In general, the PE1 experiment gives very good results as the analysis mean for each zone retrieves the true value with a very low dispersion. However, the data assimilation algorithm features spatially dependent behavior; see Fig. 7.

Firstly, the control variable for zones 1, 2, 3, 4, 5 and 9 converges instantaneously (in only one assimilation cycle) toward the true values and remains at these true values for all following cycles.

A similar behavior can be observed for zones 6, 7 and 8 from the first cycle to around the ninth cycle. For the remaining cycles, we notice an increase in the mean analysis estimate, along with the ensemble dispersion, until around the thirteenth cycle, and afterwards a decrease back to the true value.
These observations can be explained with the global sensitivity analysis results for water depths in Emery et al. (2016) and using Fig. 8.

Firstly, zones 1, 2 and 3 correspond to the river main stem, whilst zones 4, 5 and 9 correspond to the main leftbank tributaries, namely the Caquetá/Japurá River (zone 4) and the Negro River (zone 5). In these zones, as the Manning coefficient is directly corrected in the first assimilation cycle, the analysis water depths (red) overlap the true water depth (black). Furthermore, Fig. 8 for these zones shows that the openloop (blue) and true (black) water depths have a very similar variability in time but differ by a constant bias. The global sensitivity analysis results in these zones showed a constant firstorder sensitivity in time to the Manning coefficient all year long. This firstorder sensitivity means that the contribution of the Manning coefficient to the water depth is linear. Correcting the Manning coefficient in these zones equates therefore to correcting the bias between the openloop and the true water depths.

Subsequently, zones 6, 7 and 8 correspond to rightbank tributaries, namely the Juruá and Purus rivers (zone 6), the Madeira River (zone 7) and the Tapajós and Xingu rivers (zone 8). These rightbank tributary zones are characterized by a strong seasonal cycle (see Fig. 8, zones 6–8). By comparing the corresponding plots in Figs. 7 and 8, we notice that the period when the analysis control variable spreads from the truth corresponds to the lowflow season in these zones. According to the global sensitivity analysis results, water depths in these zones are less sensitive to the Manning coefficient in lowflow conditions. Additionally, there is very little water in the zones during this period and, consequently, the background control ensemble is not spread out enough for the EnKF to be efficient. Meanwhile, the EnKF still sees that the observations are higher than the model predictions (as seen with the positive innovations in these zones shown in Fig. E1). In order to increase the simulated water depth, the EnKF therefore corrects the Manning coefficient so that its value rises (a higher Manning coefficient means a slower flow velocity and then a higher simulated water depth). Finally, once the lowflow season ends, the analysis Manning coefficient converges back to the truth (see the last assimilation cycles).
6.2 Assimilation of water anomalies (PE2 and PE3)
The assimilation of anomalies has been tested over two experiments denoted PE2 and PE3 (see Table 2). Note that the observation error standard deviation σ^{o} remains equal to 10 cm as, with these experiments, we only aim at testing the feasibility of assimilating water depth anomalies. In the PE2 experiment, there is no difference of bathymetry between the simulated and observed water anomalies, whilst the river bankful depth is different in the PE3 experiment. Figure 9 gives, for each zone, the time evolution of the mean analysis control variable for PE2 (orange) and PE3 (purple) with their dispersion compared to the truth (black) and the first guess (blue). Again, zoneaveraged normalized RMSEn statistics for these experiments are given in Tables D1 and D2 in Appendix D.
The general configuration of experiments PE1 and PE2 is the same. The only difference between the two experiments is the nature of the observations: water depths for PE1 and water anomalies for PE2. Like experiment PE1, experiment PE2 (the orange line in Fig. 9) gives very good results. All control variables converge toward the true values more or less rapidly. The control variable for zones 4 to 8 instantaneously (in only one assimilation cycle) converges toward the true value, while the convergence is slower for the remaining zones as around five assimilation cycles are needed to retrieve the true value. This slower convergence for these zones can be explained by the fact that the magnitude of the observed water anomalies is generally smaller compared to the water depths assimilated in PE1. The ratio between the observation error and the observations themselves is also then smaller, resulting in a smaller EnKF gain. The control variable correcting increment is smaller for the anomalies therefore than for the water depth, and more cycles are needed to converge.
As for the PE3 experiment results – the purple line in Fig. 9 – the assimilation still gives good results, but not as good as in previous experiments. The control variables still instantaneously converge toward the truth in zones 4, 5 and 6. Concerning the other zones, there is no clear convergence towards the true value. Instead, the analysis control variables either get closer to but remain distinct from the true value (zones 2 and 3) or temporarily deviate from the truth during the experiment (zones 1, 7, 8 and 9). Still, despite the control variables not clearly converging towards the truth, the simulated water depths using the analysis control variables, presented in Fig. 10, display a very low deviation from the truth, confirming the general good performance of the data assimilation procedure.
Comparing the control variables and water depth time variations, it appears that the control variables are deviating from the truth mainly when water depths are decreasing, in between the highflow and lowflow seasons. During this period, the model goes from a state where floods occur to a state where there is no flood, particularly in zones 2–3 and 7–8 with a clear seasonal cycle. On the other hand, no flood event was spotted in zones 4, 5 and 6, where the best results were obtained. In ISBACTRIP, the activation/deactivation of the flooding scheme is triggered by the simulated water depth exceeding/becoming lower than the river bankful depth. However, in experiment PE3, this river bankful depth differs between the model and the observation because we artificially inserted a bias between the simulated and observed water depths. More specifically, the river bankful depth is lower in the model than in the observations. Therefore, the control variables deviating from the true value when the water depth is decreasing are indicative of the simulated water depths presenting floods when there is no flood in the observations. The activation of the flood scheme changes the dynamics of the water depth in the river. As part of the water in the river is spilled into the floodplains, waterlevel variations in the river are slower. The flooded model needs then a stronger variation of the Manning coefficient in order to catch the nonflooded observed water level. Ultimately, the stronger variations of the estimated Manning coefficient allow the retrieval of the true water depths.
The results presented here are preliminary investigations into the assimilation of a SWOT water surface elevations product into a largescale hydrological model. This study focused on the correction of a critical river parameter, here the river Manning coefficient.
For all the simulations, the Manning coefficient distribution is set to be constant in time. For each grid cell, one value of the Manning coefficient is used for the entire simulation. However, in reality, it is commonly accepted that this parameter could vary in time, depending on the seasonal cycle or also some extreme hydrological event such as large flooding events, which can even modify the bathymetry itself. The results showed that, for this OSSE, the data assimilation is able to converge quite quickly towards the true value. For example, for the leftbank tributary zones, namely zones 4 and 5, in every experiment, the associated control variable converges toward the true value in only one assimilation cycle. In a realcase experiment, we could expect to retrieve the temporal variations of the Manning coefficient from one assimilation cycle to another. The good performances of the assimilation platform are mainly related to the fact that, in the ISBACTRIP model, the water depth diagnostic variables are sensitive to the Manning coefficient (Emery et al., 2016). Simulated water depths are not then that sensitive to the Manning coefficient (e.g., in the rightbank tributary zones during the lowflow season), and the data assimilation performances slightly degrade. These results are specific to the ISBACTRIP model. To apply the same method to another model and even another region, one needs to first study the sensitivity of the (other) model to the (other) study region.
Secondly, the study investigates the potential of assimilating water surface anomalies instead of direct water surface elevations. The use of water surface anomalies is driven by the need to avoid potential bias between the control and the observed variables. Indeed, a bias will likely be introduced from a discrepancy between the elevation of the river bed in the model and in the observations with respect to a reference surface such as a geoid or an ellipsoid (see Fig. 4). Under the assumption that the water variations are the same between the model and the observations, the use of anomalies as observed variables should prevent this bias from affecting the results.
Another likely bathymetry error corresponds to errors in the river bankful depth, the river width and, more generally, representativeness errors due to the use of a simplified bathymetry. This type of error was artificially introduced by perturbing the model bankful depth in PE3. Specifically to ISBACTRIP, the river bankful depth controls when the model floods, which has a direct impact on the water depth dynamics. Background anomalies and observed anomalies may therefore present different dynamics where either the observed variables flood while the model variables do not or inversely. Experiment PE3 illustrated the effect of this bias on the variations of Manning coefficients. Instead of being maintained at their true value, their value slightly varied around the true values to account for the difference in dynamics between the model and the observations. However, one could expect even more variations in the updated control variables around the true value to increase if more and different errors in the bathymetry exist (which will likely happen with more realistic experiments).
Furthermore, realcase experiments may suffer from another type of bias originating from errors in the atmospheric forcing and in the surface and subsurface runoff provided by the LSM (i.e., ISBA). Both control the amount of water entering the river system. A basic idea to attenuate this issue would be to consider their uncertainties when generating the background ensemble. This approach may become limited when the errors in the forcing are very large. It may also lead to unrealistic, even nonphysical, updated Manning coefficient values. Besides, when correcting the model's parameters, we only redistribute the water volume within the basin, whilst such types of errors could actually require addition/withdrawal of water to/from the system. The potential solution would then be to include such forcing or LSM variables in the control vector or to update variables closer to the observations, including CTRIP's state variables such as the water storage. This would change the current framework to a dual stateparameter estimation approach.
Noting this, there may be an additional advantage in assimilating water anomalies instead of the direct water depths. Comparing the Kalman gain between the PE1 experiment (which assimilated direct water depths) and the PE2 and PE3 experiments (which assimilated water anomalies), the gain magnitude for the water anomalies is lower than the water depth gain magnitude. This is to be expected as the Kalman gain is stochastically estimated from an ensemble of model runs and the magnitude of the simulated water anomalies is lower than the simulated water depth magnitude. The consequence of this lower gain is that the correction applied to the control variable is also lower. If the convergence towards the true value takes more than one assimilation cycle, the divergence from it in the presence of bias is also diminished.
Beyond the bias issues, realdata assimilation configuration will raise the question of the unknown true parameter value, if it exists. Firstly, there will be no true estimates of the control variables with which to compare the assimilated simulations. The assimilation will be evaluated against the observed variables directly. Then, with the real data, model structure error will be introduced. To our knowledge, the model structure error is still a challenging error to estimate, and most data assimilation studies assume no model structure error. However, when using an ensemblebased model, a possibility for dealing with such structure error is to enrich the background ensemble by considering more uncertainties from variables that are not necessarily in the control vector (including errors in the forcing or parameters from both the LSM and the RRM). The capacity of such ensembles to tackle model structure errors can be tested using synthetic observations based on a different hydrological model.
Additionally, the real SWOT data will have a finer resolution than the synthetic SWOT data currently used. Still, the coarserresolution observations are found to provide information to constrain the model and to improve the value of the spatially varying Manning coefficients. Then, when moving to realdata assimilation experiments, we can consider averaging the finescale SWOT product over a coarse grid cell corresponding to an ISBACTRIP cell so that the resolutions of the observations and the model match.
Ultimately, heading towards more realistic experiments also implies more realistic representations of the observation errors. But more complex errors should be expected for the real SWOT product. Some correlated errors along the swath should be expected due to the instrument but also due to the motion of the satellite and delays due to propagation of the electromagnetic waves in the ionosphere and atmosphere. Nevertheless, as part of the mission science requirements, the sum of all errors should not exceed 10 cm when the measured data are averaged over 1 km^{2}. There should also be additional errors affecting the observations that can be described as “detectability errors”, such as “dark water”, “layover” and “false positive”. “Dark water” pixels will result in missing data and will not be included in the assimilation, and “layover” pixels will have a higher vertical error due to surrounding vegetation and topography, but should also be flagged (Biancamaria et al., 2016). Eventually, “false positive” pixels (i.e., pixels classified as water, whereas they correspond to land) will be the most complicated to anticipate. With these additional errors taken into account in the assimilation framework, one could expect a slower convergence of the control variables. Note that these aspects of the measurement errors are related to water surface elevation products.
This study presents a series of OSSEs that assimilates SWOTlike synthetic observations of water depths and anomalies into the ISBACTRIP largescale hydrological model in order to correct the spatially distributed Manning coefficient. The study is applied over the Amazon River basin. Prior to the actual data assimilation experiments, a series of sensitivity tests was conducted to study the sensitivity of the data assimilation performance to the different features of the EnKF, in particular the size of the ensemble. Then, three fullyear data assimilation experiments were run based on the outcomes of the sensitivity tests. For all three experiments, the assimilation was able to track back the true value of the Manning coefficient distribution.
The sensitivity tests successively studied the sensitivity of the data assimilation platform to model the spinup period, the experiment starting date through the hydrological year, the size of the ensemble for the EnKF and the initial control variable standard deviation. These tests showed first of all that a spinup of four windows of 21 d is sufficient for the transitional period due to a sudden change in the Manning coefficient distribution in the model. The second sensitivity test then demonstrated that the data assimilation performance is not clearly sensitive to the period of the hydrological year when the experiment is done. The next sensitivity test informed us that an ensemble of 25 members was enough to obtain good EnKF performances. Finally, the last sensitivity test studied the effect of the control variable error standard deviation, and the best performances were obtained for a prior standard deviation between 0.05 and 0.75, which corresponds to the order of magnitude of the actual error between the true and openloop control variables.
Using these results, we run three data assimilation experiments over approximately 1 year (the year 2008). The first experiment (PE1) assimilated direct pseudoobservations of water depths. Results showed the capability of the data assimilation algorithm to converge very quickly toward the true value, generally in only one assimilation cycle. Still, during the lowflow season, the assimilation was less effective in the zones with a clear seasonal cycle. This was explained by the fact that during this period, water depths are less sensitive to the Manning coefficient.
The other two experiments (PE2 and PE3) introduced and tested the assimilation of water surface anomalies. The anomalies were obtained by subtracting a yearlyaveraged water depth from the current water depth in both the model and the observations. The first water anomaly assimilation experiment (PE2) provided very good results, with all the control variables also converging towards their associated true values. However, the convergence was slightly slower than during the assimilation of the water depth (between one and five assimilation cycles). This is explained by a lower Kalman gain when updating the Manning coefficient.
The last experiment also assimilated water anomalies (PE3). For this particular experiment however a bias was artificially introduced in the river bathymetry. For this experiment, the assimilation was still able to get closer to the true value, but, for some zones like the mainstream zones, there was no convergence as the control variables kept varying around the true value. This phenomenon was explained by the detection of floods in the model but not in the observations. Still, the statistics of the Manning coefficient distribution and the simulated water depths after assimilation remain as improved compared to the openloop simulations. Ultimately, these two experiments demonstrated the feasibility of assimilating water surface anomalies to correct the Manning coefficient.
These experiments offer several perspectives. They mainly consist of approaching more realistic data assimilation experiments which take into account more sources of uncertainties between the model and the observations, such as correlated observation errors or uncertainties in the forcing and the LSM surface and subsurface runoff. To test the platform's limitations regarding the DEM/bathymetry bias issue, one can use simulated water surface elevations referenced to a geoid instead of water depths from the model or even assimilate water depths from another model where the bathymetry differs. As most applications generally require a good estimate of the river flow and river water volume, another lead of an investigation could maintain the SWOTbased OSSE framework but correct the simulated water storage and/or discharge, either as a single state estimation framework or as a dual state parameter estimation framework (similarly to dual discharge bathymetry inference methods developed by Oubanas et al., 2018, and Brisset et al., 2018, for some hydraulic models). Moreover, along with observations of water surface elevations, SWOT will also provide twodimensional maps of river widths and surface slopes. One can also study the possibility of assimilating such products to constrain other parameters such as the bankful depth that controls the model flooding scheme.
The background error crosscovariance matrices [PH^{T}]_{e,k} and [HPH^{T}]_{e,k} are defined based on Evensen (2004), Moradkhani et al. (2005), and Durand et al. (2008) so that
and
In those definitions, ${\mathbf{X}}_{e,k}^{\mathrm{b}}$ is the control matrix storing the n_{e} control vectors ${\mathit{x}}_{k}^{b,\left[l\right]},\phantom{\rule{0.33em}{0ex}}l=\mathrm{1}\phantom{\rule{0.25em}{0ex}}\mathrm{\dots}\phantom{\rule{0.25em}{0ex}}{n}_{\mathrm{e}}$ from the background ensemble such that
Next, $\mathcal{H}\left({\mathbf{X}}_{e,k}^{\mathrm{b}}\right)$ represents the same control matrix but mapped into the observation space:
Also, $\stackrel{\mathrm{\u203e}}{{\mathbf{X}}_{\u2022,k}^{\mathrm{b}}}$ and $\stackrel{\mathrm{\u203e}}{\mathcal{H}\left({\mathbf{X}}_{\u2022,k}^{\mathrm{b}}\right)}$ are the corresponding ensemble expectations such that
These vector dimensions are n_{x} and n_{y,k}, respectively. Finally, ${\mathbf{1}}_{{n}_{\mathrm{e}}}$ is a vector of size n_{e} containing only 1s.
Table B1 summarizes, for each run, the date when the Manning coefficients are changed. The spinup period (expressed as a number of windows of 21 d) corresponds to the period between when the Manning distribution is changed and the start of the second year, i.e., 1 January 2008.
At each grid cell of the study domain, we estimated the normalized root mean square error (RMSEn) before and after assimilation by comparing the openloop and mean analysis simulations, respectively, to the true simulation, for both the simulated water depth and discharge:
where the state variable V is either the discharge or the water depth, n is the time index, i is the gridcell index, the t superscript represents the “truth” and the ∗ superscript represents either the openloop or analysis ensemble average.
Tables D1 and D2 give these statistics for all experiments averaged over each control zone. Table D1 shows the water depth zoneaveraged RMSEn and Table D2 shows the discharge zoneaveraged RMSEn.
Figure E1 displays the evolution along the assimilation cycles of the averaged innovations. The sign of the innovation will drive the direction of the correction brought by the assimilation.

A positive innovation means that the observations are higher than the model. Physically, the simulated flow is too fast and the water leaves the river reservoir too quickly. This means that the river Manning coefficient needs to be increased to slow the flow.

A negative innovation means that the observations are lower than the model. Physically, the simulated flow is too slow and the water remains in the river reservoir. This means that the river Manning coefficient needs to be increased to accelerate the flow.
The CTRIP code is open source and is available as a part of the surface modeling platform called SURFEX, which can be downloaded at http://www.cnrmgamemeteo.fr/surfex/ (Decharme et al., 2012). SURFEX is updated approximately every 3 to 6 months and the CTRIP version presented in this paper is from SURFEX version 7.3. If more frequent updates are needed, please follow the procedure informing you of how to obtain an SVN or Git account in order to access realtime modifications of the code (see the instructions in the previous link). The ISBACTRIP model is coupled to the DA codes via the OpenPalm coupler available at http://www.cerfacs.fr/globc/PALM_WEB/ (Buis et al., 2006). To get the DA routines coupled to ISBACTRIP with OpenPalm, please directly contact Charlotte Marie Emery (charlotte.emery@jpl.nasa.gov) or Sylvain Biancamaria (sylvain.biancamaria@legos.obsmip.fr). To obtain the GSWP3 forcings, please refer to the following DOI: https://doi.org/10.20783/DIAS.501 (Kim, 2017).
CME designed and carried out the experiments under the supervision of SB and AB, as part of her PhD project. SB provided the SWOTbased observation mask used to generate the synthetic observations and AB provided support for the use of the ISBACTRIP hydrologic model. CME prepared the manuscript with contributions from all the coauthors.
The authors declare that they have no conflict of interest.
This work was supported by the CNES through a grant from the TerreOcéanSurfaces ContinentalesAtmosphère (TOSCA) committee assigned to the project entitled “Towards an improved understanding of the global hydrological cycle using SWOT measurements”. Charlotte Marie Emery received doctoral research support from a CNES/région MidiPyrénées grant. Charlotte Marie Emery and Cédric H. David received support from the Jet Propulsion Laboratory, California Institute of Technology, under a contract with NASA, including grants from the SWOT Science Team and the Terrestrial Hydrology Program.
This research has been supported by the Région Midi Pyrénées, the CNES, and CNES TOSCA, a grant from the SWOT Science Team: NASA ROSES 2015 SWOT Science Team (NNH15ZDA001NSWOT): Integration of SWOT Measurements into global terrestrial hydrologic models (15SWOT150014), and a grant from the NASA ROSES Terrestrial Hydrology Program: NASA ROSES 2017 Terrestrial Hydrology (NNH17ZDA001NTHP): Filling the Space/Time Gaps Between Surface Water Retrievals (17THP170012).
This paper was edited by Markus Hrachowitz and reviewed by Paul Bates, Claire I. Michailovsky, and Hessel Winsemius.
Andreadis, K. M. and Schumann, G. J. P.: Estimating the impact of satellite observations on the predictability of largescale hydraulic models, Adv. Water Res., 73, 44–54, https://doi.org/10.1016/j.advwatres.2014.06.006, 2014. a
Andreadis, K. M., Clark, E. A., Lettenmaier, D. P., and Alsdorf, D. E.: Prospects for river discharge and depth estimation through assimilation of swathaltimetry into a rasterbased hydrodynamics model, Geophys. Res. Lett., 34, L10403, https://doi.org/10.1029/2007GL029721, 2007. a
Beighley, R. E., Eggert, K. G., Dunne, T., He, Y., Gummadi, V., and Verdin, K. L.: Simulating hydrologic and hydraulic processed throughout the Amazon basin, Hydrol. Process., 23, 1221–1235, https://doi.org/10.1002/hyp.7252, 2009. a
Beven, K. and Freer, J.: Equifinality, data assimilation, and uncertainty estimation in mechanistic modelling of complex environmental systems using GLUE methodology, J. Hydrol., 249, 11–29, https://doi.org/10.1016/S00221694(01)004218, 2001. a
Beven, K. J.: Down to basics: runoff processes and the modelling of processes, in: RainfallRunoff Modelling, John Wiley and Sons, West Sussex, UK, chap. 1, 1–22, 2012. a
Biancamaria, S., Bates, P., Boone, A., and Mognard, N.: Largescale coupled hydrologic and hydraulic modelling of teh Ob river in Siberia, J. Hydrol., 379, 136–150, https://doi.org/10.1016/j.jhydrol.2009.09.054, 2009. a
Biancamaria, S., Durant, M., Andreadis, K. M., Bates, P. D., Boone, A., Mognard, N. M., Rodriguez, E., Alsdorf, D. E., Lettenmaier, D. P., and Clark, E. A.: Assimilation of virtual wide swath altimetry to improve Arctic river modeling, Remote Sens. Environ., 115, 373–381, https://doi.org/10.1016/j.rse.2010.09.008, 2011. a, b
Biancamaria, S., Lettenmaier, D. P., and Pavelsky, T. M.: The SWOT mission and its capabilities for land hydrology, Surv. Geophys., 37, 307–337, https://doi.org/10.1007/s107120159346y, 2016. a
Bierkens, M. F. P.: Global hydrology 2015: State, trends, and directions, Water Resour. Res., 51, 4923–4947, https://doi.org/10.1002/2015WR017173, 2015. a
Birkett, C. M., Mertes, L. A. K., Dunne, T., Costa, M. H., and Jasinski, M. J.: Surface water dynamics in the Amazon basin: Application of satellite radar altimetry, J. Geophys. Res., 107, L10403, https://doi.org/10.1029/2001JD000609, 2002. a
Bishop, C. H., Etherton, B. J., and Majumbar, S. J.: Adaptative sampling with the ensemble transform Kalman filter. Part I: Theoretical aspects, Mon. Weather Rev., 129, 420–436, https://doi.org/10.1175/15200493(2001)129<0420:ASWTET>2.0.CO;2, 2001. a
Boone, A., Calvet, J.C., and Noilhan, J.: Inclusion of a Third Soil Layer in a Land Surface Scheme Using the ForceRestore Method, J. Hydrometeorol., 38, 1611–1630, https://doi.org/10.1175/15200450(1999)038<1611:IOATSL>2.0.CO;2, 1999. a
Brisset, P., Monnier, J., Garambois, P.A., and Roux, H.: On the assimilation of altimetry data in 1D SaintVenant river models, Adv. Water Res., 119, 41–59, https://doi.org/10.1016/J.advwatres.2018.06.004, 2018. a
Buis, S., Piacentini, A., and Declat, D.: PALM: a computational framework for assembling highperformance computing applications, Concurrency Computat.: Pract. Exper., 18, 247–262, 2006 (data available at: http://www.cerfacs.fr/globc/PALM_WEB/, last access: 20 April 2020). a
Burgers, G., Leeuwen, P. J. V., and Evensen, G.: Analysis Scheme in the Ensemble Kalman Filter, Mon. Weather Rev., 126, 1719–1724, https://doi.org/10.1175/15200493(1998)126<1719:ASITEK>2.0.CO;2, 1998. a
Clark, M. P., Rupp, D. E., Woods, R. A., Zheng, X., Ibbitt, R. P., Slater, A. G., Schmidt, J., and Uddstrom, M. J.: Hydrological data assimilation with the ensemble Kalman filter: Use of streamflow observations to update states in a distributed hydrological model, Adv. Water Res., 31, 1309–1324, https://doi.org/10.1016/j.advwatres.2008.06.005, 2008. a
Cretaux, J.F., Calmant, S., Romanoski, V., Shabunin, A., Lyard, F., BergeNguyen, M., Cazenave, A., Hernandez, F., and Perosanz, F.: An absolute calibration site for radar altimeters in the continental domain: Lake Issykkul in Central Asia, J. Geodesy, 83, 723–735, https://doi.org/10.1007/s0019000802897, 2009. a
Decharme, B., Alkama, R., Douville, H., Becker, M., and Cazenave, A.: Global Evaluation of the ISBATRIP Continental Hydrological System. Part II: Uncertainties in River Routing Simulation Related to Flow Velocity and Groundwater Storage, J. Hydrometeorol., 11, 601–617, https://doi.org/10.1175/2010JHM1212.1, 2010. a
Decharme, B., Alkama, R., Papa, F., Faroux, S., Douville, H., and Prigent, C.: Global offline evaluation of the ISBATRIP flood model, Clim. Dynam., 38, 1389–1412, https://doi.org/10.1007/s0038201110549, 2012 (data available at: http://www.cnrmgamemeteo.fr/surfex/, last access: 20 April 2020). a, b, c, d, e, f
Decharme, B., Delire, C., Minvielle, M., Colin, J., Vergnes, J.‐P., Alias, A., Saint‐Martin, D., Séférian, R., Sénési, S., and Voldoire, A.: Recent changes in the ISBACTRIP land surface system for use in CNRMCM6 climate model and global offline hydrological applications, J. Adv. Model. Earth Sy., 11, 1207–1252, https://doi.org/10.1029/2018MS001545, 2019. a, b, c
Deng, C., Liu, P., Guo, S., Li, Z., and Wang, D.: Identification of hydrological model parameter variation using ensemble Kalman filter, Hydrol. Earth Syst. Sci., 20, 4949–4961, https://doi.org/10.5194/hess2049492016, 2016. a
Doll, P., Douville, H., Güntner, A., Schmied, H. M., and Wada, Y.: Modelling Freshwater Resources at the Global Scale: Challenges and Propects, Surv. Geophys., 37, 195–221, https://doi.org/10.1007/s1071201593431, 2015. a
Durand, M., Andreadis, K., Alsdorf, D., Lettenmaier, D., Moller, D., and Wilson, M.: Estimation of bathymetric depth and slope from data assimilation of swath altimetry into a hydrodynamic model, Geophys. Res. Lett., 35, L20401, https://doi.org/10.1029/2008GL034150, 2008. a, b, c
Emery, C. M., Biancamaria, S., Boone, A., Garambois, P.A., Ricci, S., Rochoux, M. C., and Decharme, B.: Temporal variancebased sensitivity analysis of the river routing component of the large scale hydrological model ISBATRIP: Application on the Amazon Basin, J. Hydrometeorol., 17, 3007–3027, https://doi.org/10.1175/JHMD160050.1, 2016. a, b, c, d, e, f, g, h, i, j, k
Emery, C. M., Paris, A., Biancamaria, S., Boone, A., Calmant, S., Garambois, P.A., and Santos da Silva, J.: Largescale hydrological model river storage and discharge correction using a satellite altimetrybased discharge product, Hydrol. Earth Syst. Sci., 22, 2135–2162, https://doi.org/10.5194/hess2221352018, 2018. a, b, c, d
Esteban Fernandez, D.: SWOT Project, Mission performance and error budget, Tech. rep., Jet Propulsion Laboratory, 2017. a
Evensen, G.: Sequential data assimilation with a nonlinear quasigeostropic model using Monte Carlo methods to forecast error statistics, J. Geophys. Res., 99, 10143–10162, https://doi.org/10.1029/94JC00572, 1994. a
Evensen, G.: Advanced data assimilation for strongly nonlinear dynamics, Mon. Weather Rev., 125, 1342–1354, https://doi.org/10.1175/15200493(1997)125<1342:ADAFSN>2.0.CO;2, 1997. a
Evensen, G.: The Ensemble Kalman Filter: theoretical formulation and practical implementation, Ocean Dynam., 53, 343–367, https://doi.org/10.1007/s1023600300369, 2003. a
Evensen, G.: Sampling strategies and square root analysis schemes for the EnKF, Ocean Dynam., 54, 539–560, https://doi.org/10.1007/s1023600400992, 2004. a, b
Evensen, G. and Leeuwen, P. V.: An ensemble kalman smoother for nonlinear dynamics, Mon. Weather Rev., 128, 1852–1867, https://doi.org/10.1175/15200493(2000)128<1852:AEKSFN>2.0.CO;2, 2000. a, b
Fjørtoft, R., Gaudin, J.M., Pourthie, N., Lalaurie, J.C., Mallet, A., Nouvel, J.F., MartinotLagarde, J., Oriot, H., Borderies, P., Ruiz, C., and Daniel, S.: KaRIn on SWOT: Characteristics of nearnadir Kaband interferometric SAR imagery, IEEE T. Geosci. Remote, 52, 2172–2185, https://doi.org/10.1109/TGRS.2013.2258402, 2014. a
Guillet, O., Weaver, A., Vasseur, X., Michel, M., Gratton, S., and Gürol, S.: Modelling spatially correlated observation errors in variational data assimilation using a diffusion operator on an unstructured mesh, Q. J. Roy. Meteor. Soc., 145, 1947–1967, https://doi.org/10.1002/qj.3537, 2018. a
Gupta, H. V., Sorooshian, S., and Yapo, P. O.: Toward improved calibration of hydrological models: multiple and noncommensurable measures of information, Water Resour. Res., 34, 751–763, https://doi.org/10.1029/97WR03495, 1998. a
Hafliger, V., Martin, E., Boone, A., Ricci, S., and Biancamaria, S.: Assimilation of synthetic SWOT river depths in a regional hydrometeorological model, Water, 11, 78, https://doi.org/10.3390/w11010078, 2019. a, b
Hunt, B., Kalnay, E., Kostelich, E. J., Ott, E., Patil, D. T., Sauer, T., Szunyogh, I., Yorke, J. A., and Zimin, A. V.: Fourdimensional ensemble Kalman filtering, Tellus, 56, 273–277, https://doi.org/10.1111/j.16000870.2004.00066.x, 2004. a
Hunt, B. R., Kostelich, E. J., and Szunyogh, I.: Efficient data assimilation for spatiotemporal chaos: A local ensemble transform Kalman filter, Physica D, 230, 112–126, https://doi.org/10.1016/j.physd.2006.11.008, 2007. a
International Association of Hydrological Sciences Ad Hoc Group on Global Water Sets, Vörösmarty, C., Askew, A., Grabs, W., Barry, R. G., Birkett, C., Döll, P., Goodison, B., Hall, A., Jenne, R., Kitaev, L., Landwehr, J., Keeler, M., Leavesley, G., Schaake, J., Strzepek, K., Sundarvel, S. S., Takeuchi, K., and Webster, F.: Global water data: a newly endangered species, EOS T. Am. Geophys. Un., 82, 54–58, https://doi.org/10.1029/01EO00031, 2001. a
Kim, H.: Global Soil Wetness Project Phase 3 Atmospheric Boundary Conditions (Experiment 1), Data set, Data Integration and Analysis System, https://doi.org/10.20783/DIAS.501, 2017. a
Kurtz, W., HendricksFrassen, H.J., and Vereecken, H.: Identification of timevariant river bed properties with Ensemble Kalman Filter, Water Resour. Res., 48, W10534, https://doi.org/10.1029/2011WR011743, 2012. a
Leeuwen, P. V. and Evensen, G.: Data assimilation and inverse methods in terms of a probabilistic formulation, Mon. Weather Rev., 124, 2898–2913, https://doi.org/10.1175/15200493(1996)124<2898:DAAIMI>2.0.CO;2, 1996. a
Liu, Y. and Gupta, H. V.: Uncertainty in hydrological modeling: Towards an integrated data assimilation framework, Water Resour. Res., 43, W07401, https://doi.org/10.1029/2006WR005756, 2007. a, b
Liu, Y., Weerts, A. H., Clark, M., Hendricks Franssen, H.J., Kumar, S., Moradkhani, H., Seo, D.J., Schwanenberg, D., Smith, P., van Dijk, A. I. J. M., van Velzen, N., He, M., Lee, H., Noh, S. J., Rakovec, O., and Restrepo, P.: Advancing data assimilation in operational hydrologic forecasting: progresses, challenges, and emerging opportunities, Hydrol. Earth Syst. Sci., 16, 3863–3887, https://doi.org/10.5194/hess1638632012, 2012. a
Maidment, D. R.: Handbook of Hydrology, McGraw Hill Professional, 1993. a
Manning, R.: On the flow of water in open channels and pipes, Institution of Civil Engineers of Ireland, 20, 161–207, 1891. a
Meade, R., Rayol, J., Conceicão, S. D., and Natividade, J.: Backwater Effects in the Amazon River Basin of Brazil, Environ. Geol. Water S., 18, 105–114, https://doi.org/10.1007/BF01704664, 1991. a
Melsen, L., Teuling, A., Torfs, P., Zappa, M., Mizukami, N., Clark, M., and Uijlenhoet, R.: Representation of spatial and temporal variability in largedomain hydrological models: case study for a mesoscale preAlpine basin, Hydrol. Earth Syst. Sci., 20, 2207–2226, https://doi.org/10.5194/hess2022072016, 2016. a, b
Mersel, M. K., Smith, L. C., Andreadis, K. M., and Durand, M. T.: Estimation of river depth from remotelysensed hydraulic relationship, Water Resour. Res., 49, 3165–3179, https://doi.org/10.1002/wrcr.20176, 2013. a
Michailovsky, C. I. and BauerGottwein, P.: Operational reservoir inflow forecasting with radar altimetry: the Zambezi case study, Hydrol. Earth Syst. Sci., 18, 997–1007, https://doi.org/10.5194/hess189972014, 2014. a
Michailovsky, C. I., Milzow, C., and BauerGottwein, P.: Assimilation of radar altimetry to a routing model of the Brahmaputra river, Water Resour. Res., 49, 4807–4816, https://doi.org/10.1002/wrcr.20345, 2013. a, b
Molinier, M., Guyot, J.L., Orstom, B., Guimarães, V., de Oliveira, E., and Dnaee, B.: Hydrologie du bassin de l'Amazone, in: Grands Bassins Fluviaux Périatlantiques, PEGIINSACNRSORSTOM, Paris, available at: http://horizon.documentation.ird.fr/exldoc/pleins_textes/pleins_textes_7/carton01/40102.pdf (last access: 4 May 2020), 335–345, 1993. a
Montzka, C., Moradkhani, H., Weihermüller, L., HendricksFranssen, H.J., Canty, M., and Vereecken, H.: Hydraulic parameter estimation by remotelysensed top soil moisture observations with the particle filter, J. Hydrol., 399, 410–421, https://doi.org/10.1016/j.jhydrol.2011.01.020, 2011. a
Moradkhani, H., Hsu, K.L., Gupta, H., and Sorooshian, S.: Uncertainty assessment of hydrologic model states and parameters: Sequential data assimilation using particle filter, Water Resour. Res., 41, W05012, https://doi.org/10.1029/2004WR003604, 2005. a, b, c
Munier, S., Polebistki, A., Brown, C., Belaud, G., and Lettenmaier, D. P.: SWOT data assimilation for operational reservoir management on the upper Niger river basin, Water Resour. Res., 51, 554–575, https://doi.org/10.1002/2014WR016157, 2015. a
Noilhan, J. and Planton, S.: A simple parameterization of land surface processes for meteorological models, Mon. Weather Rev., 117, 536–549, https://doi.org/10.1175/15200493(1989)117<0536:ASPOLS>2.0.CO;2, 1989. a, b
Oki, T. and Sud, Y. C.: Design of Total Integrating Pathways (TRIP)—A Global River Channel Network, Earth Interact., 2, 1–36, https://doi.org/10.1175/10873562(1998)002<0001:DOTRIP>2.3.CO;2, 1998. a, b, c
Ott, E., Hunt, B. R., Szunyogh, I., Kostelich, A. V. Z. A. J., Corazza, M., Kalnay, E., Patil, D. J., and Yorke, J. A.: A local ensemble Kalman filter for atmospheric data assimilation, Tellus A, 56, 415–428, https://doi.org/10.1111/j.16000870.2004.00076.x, 2004. a
Oubanas, H., Gejadze, I., Malaterre, P.O., Durand, M., Wei, R., Frasson, R. P. M., and Domeneghetti, A.: Discharge estimation in ungauged basins through variational data assimilation: the potential of the SWOT mission, Water Resour. Res., 54, 2405–2423, https://doi.org/10.1002/2017WR021735, 2018. a, b, c
Paiva, R. C. D., Buarque, D. C., Collischonn, W., Bonnet, M.P., Frappart, F., Calmant, S., and Mendes, C. A. B.: Large scale hydrological and hydrodynamic modeling of the Amazon River basin, Water Resour. Res., 49, 1226–1243, https://doi.org/10.1002/wrcr.20067, 2013. a
Panzeri, M., Riva, M., Guadagnini, A., and Neuman, S. P.: Data assimilation and parameter estimation via ensemble kalman filter coupled with stochastic moment equations of transient groudwater flow, Water Resour. Res., 49, 1334–1344, https://doi.org/10.1002/wrcr.20113, 2013. a
Pathiraja, S., Marshall, L., Sharma, A., and Moradkhani, H.: Detecting nonstationar hydrologic model parameters in a paired catchment system using data assimilation, Adv. Water Res., 94, 103–119, https://doi.org/10.1016/j.advwatres.2016.04.021, 2016. a, b
Pedinotti, V., Boone, A., Ricci, S., Biancamaria, S., and Mognard, N.: Assimilation of satellite data to optimize largescale hydrological model parameters: a case study for the SWOT mission, Hydrol. Earth Syst. Sci., 18, 4485–4507, https://doi.org/10.5194/hess1844852014, 2014. a, b, c, d, e, f, g, h
Rakovec, O., Weerts, A. H., Sumihar, J., and Uijlenhoet, R.: Operational aspects of asynchronous filtering for flood forecasting, Hydrol. Earth Syst. Sci., 19, 2911–2924, https://doi.org/10.5194/hess1929112015, 2015. a, b
Renard, B., Kavetski, D., Kuczera, G., Thyer, M., and Franks, S. W.: Understanding predictive uncertainty in hydrologic modeling: the challenge of identifying input and structural errors, Water Resour. Res., 46, W05521, https://doi.org/10.1029/2009WR008328, 2010. a
Rodell, M., Beaudoing, H. K., L'Ecuyer, T. S., Olson, W. S., Famiglietti, J. S., Houser, P. R., Adler, R., Bosilovich, M. G., Clayson, C. A., Chambers, D., Clark, E., Fetzer, E. J., Gao, X., Gu, G., Hilburn, K., Huffman, G. H., Lettenmaier, D. P., Liu, W. T., Roberton, F. R., Schlosser, C. A., Sheffield, J., and Wood, E. F.: The observed state of the water cycle in the early twentyfirst century, J. Climate, 28, 8289–8318, https://doi.org/10.1175/JCLID1400555.1, 2015. a
Ruiz, J. J., Pulido, M., and Miyoshi, T.: Estimating model parameters with ensemblebased data assimilation, J. Meteorol. Soc. Jpn., 91, 79–99, https://doi.org/10.2151/jmsj.2013201, 2013. a
Sakov, P., Evensen, G., and Bertino, L.: Asynchronous data assimilation with the EnKF, Tellus, 62, 24–29, https://doi.org/10.1111/j.16000870.2009.00417.x, 2010. a, b
Sanoo, A. K., Pan, M., Troy, T. J., Vinukollu, R. K., Sheffield, J., and Wood, E. F.: Reconciling the global terrestrial water budget using satellite remote sensing, Remote Sens. Environ., 115, 1850–1865, https://doi.org/10.1016/j.rse.2011.03.009, 2011. a
Shi, Y., Davis, K. J., Zhang, F., Duffy, C. J., and Yu, X.: Parameter estimation of physicallybased land surface model hydrologic model using an ensemble Kalman filter: a multivariate realdata experiment, Adv. Water Res., 83, 421–427, https://doi.org/10.1016/j.advwatres.2015.06.009, 2015. a
Silva, J. S. D., Calmant, S., Seyler, F., Filho, O. C. R., Cochonneau, G., and Mansur, W. J.: Water levels in the Amazon basin derived from the ERS 2 and ENVISAT radar altimetry missions, Remote Sens. Environ., 114, 2160–2181, https://doi.org/10.1016/j.rse.2010.04.020, 2010. a
Sood, A. and Smakhtin, V.: Global hydrological models: a review, Hydrolog. Sci. J., 60, 549–565, https://doi.org/10.1080/02626667.2014.950580, 2015. a
Talagrand, O. and Courtier, P.: Variational assimilation of meteorological observations with the adjoint vorticity equation, Part 1: Theory, Q. J. Roy. Meteor. Soc., 113, 1311–1328, https://doi.org/10.1002/qj.49711347812, 1987. a
Vinukollu, R. K., Meynadier, R., Sheffield, J., and Wood, E. F.: Multimodel, multisensor esti mates of global evapotranspiration: climatology, uncertainties and trents, Hydrol. Process., 25, 3993–4010, https://doi.org/10.1002/hyp.8393, 2011. a
Vrugt, J. A., ter Braak, C. J. F., Diks, C. G. H., and Shoups, G.: Hydrological data assimilation using particle Markov chain Monte Carlo simulation: Theory, concepts and applications, Adv. Water Res., 51, 457–478, https://doi.org/10.1016/j.advwatres.2012.04.002, 2012. a
Wisser, D., Fekete, B. M., Vörösmarty, C. J., and Schumann, A. H.: Reconstructing 20th century global hydrography: a contribution to the Global Terrestrial Network Hydrology (GTNH), Hydrol. Earth Syst. Sci., 14, 1–24, https://doi.org/10.5194/hess1412010, 2010. a
Yoon, Y., Durand, M., Merry, C. J., Clark, E. A., Andreadis, K. M., and Alsdorf, D. E.: Estimating river bathymetry from data assimilation of synthetic SWOT measurements, J. Hydrol., 464465, 363–375, https://doi.org/10.1016/j.jhydrol.2012.07.028, 2012. a
 Abstract
 Introduction
 Model
 Method: synthetic parameter estimation on ISBACTRIP
 Assimilation strategy
 Assimilation sensitivity tests
 Data assimilation results
 Discussions
 Conclusions
 Appendix A: Definition of error covariance matrices
 Appendix B: Spinup sensitivity test additional tables
 Appendix C: Sensitivity test results per zone
 Appendix D: Assimilation performances at the zone scales
 Appendix E: Assimilation results: additional figures
 Code and data availability
 Author contributions
 Competing interests
 Acknowledgements
 Financial support
 Review statement
 References
 Abstract
 Introduction
 Model
 Method: synthetic parameter estimation on ISBACTRIP
 Assimilation strategy
 Assimilation sensitivity tests
 Data assimilation results
 Discussions
 Conclusions
 Appendix A: Definition of error covariance matrices
 Appendix B: Spinup sensitivity test additional tables
 Appendix C: Sensitivity test results per zone
 Appendix D: Assimilation performances at the zone scales
 Appendix E: Assimilation results: additional figures
 Code and data availability
 Author contributions
 Competing interests
 Acknowledgements
 Financial support
 Review statement
 References