Simulating carbon and water ﬂuxes using a coupled process-based terrestrial biosphere model and joint assimilation of leaf area index and surface soil moisture

. Reliable modeling of carbon and water ﬂuxes is essential for understanding the terrestrial carbon and water cycles and informing policy strategies aimed at constraining carbon emissions and improving water use efﬁciency. We designed an assimilation framework (LPJ-Vegetation and soil moisture Joint Assimilation, or LPJ-VSJA) to improve gross primary production (GPP) and evapotranspiration (ET) estimates globally. The integrated model, LPJ-PM (LPJ-PT-JPL SM Model) as the underlying model, was coupled from the Lund–Potsdam–Jena Dynamic Global Vegetation

Abstract. Reliable modeling of carbon and water fluxes is essential for understanding the terrestrial carbon and water cycles and informing policy strategies aimed at constraining carbon emissions and improving water use efficiency. We designed an assimilation framework (LPJ-Vegetation and soil moisture Joint Assimilation, or LPJ-VSJA) to improve gross primary production (GPP) and evapotranspiration (ET) estimates globally. The integrated model, LPJ-PM (LPJ-PT-JPL SM Model) as the underlying model, was coupled from the Lund-Potsdam-Jena Dynamic Global Vegetation Model (LPJ-DGVM version 3.01) and a hydrology module (i.e., the updated Priestley-Taylor Jet Propulsion Laboratory model, PT-JPL SM ). Satellite-based soil moisture products derived from the Soil Moisture and Ocean Salinity (SMOS) and Soil Moisture Active and Passive (SMAP) and leaf area index (LAI) from the Global LAnd and Surface Satellite (GLASS) product were assimilated into LPJ-PM to improve GPP and ET simulations using a proper orthogonal decomposition (POD)-based ensemble four-dimensional variational assimilation method (PODEn4DVar). The joint assimilation framework LPJ-VSJA achieved the best model performance (with an R 2 ( coefficient of determination) of 0.91 and 0.81 and an ubRMSD (unbiased root mean square deviation) reduced by 40.3 % and 29.9 % for GPP and ET, respectively, compared with those of LPJ-DGVM at the monthly scale). The GPP and ET resulting from the assimilation demonstrated a better performance in the arid and semi-arid regions (GPP: R 2 = 0.73, ubRMSD = 1.05 g C m −2 d −1 ; ET: R 2 = 0.73, ubRMSD = 0.61 mm d −1 ) than in the humid and sub-dry humid regions (GPP: R 2 = 0.61, ubRMSD = 1.23 g C m −2 d −1 ; ET: R 2 = 0.66; ubRMSD = 0.67 mm d −1 ). The ET simulated by LPJ-PM that assimilated SMAP or SMOS data had a slight difference, and the SMAP soil moisture data performed better than SMOS data. Our global simulation modeled by LPJ-VSJA was compared with several global GPP and ET products (e.g., GLASS GPP, GOSIF GPP, GLDAS ET, and GLEAM ET) using the triple collocation (TC) method. Our products, especially ET, exhibited advantages in the overall error distribution (estimated error (µ): 3.4 mm per month; estimated standard deviation of µ: 1.91 mm per month). Our research showed that the assimilation of multiple datasets could reduce model uncertainties, while the model performance differed across regions and plant functional types. Our assimilation framework (LPJ-VSJA) can

Introduction
Gross primary production (GPP) and evapotranspiration (ET) are essential components of the carbon and water cycles. Carbon and water fluxes are inherently coupled on multiple spatial and temporal scales (Law et al., 2002;Sun et al., 2019;Waring and Running, 2010). Terrestrial biosphere models are the most sophisticated approach for providing a relatively detailed description of such interdependent relationships regarding water and carbon fluxes and understanding the response of terrestrial ecosystems to changes in atmospheric CO 2 and climate (Kaminski et al., 2017). Dynamic global vegetation models (DGVMs) are process-based dynamic terrestrial biosphere models, which can simulate water, carbon, and energy exchange between vegetation and the atmosphere under different conditions accounting for vegetation physiological processes and are widely used to estimate carbon and water fluxes of terrestrial vegetation. However, there are still large uncertainties in carbon and water flux estimates at regional to global scales. Both diagnostic and prognostic models show substantial differences in the magnitude and spatiotemporal patterns of GPP and ET. For example, the global annual GPP estimates exhibited a large range (130-169 Pg C yr −1 ) among 16 process-based terrestrial biosphere models (Anav et al., 2015). The global ET ranged from 70 000 to 75 000 km 3 yr −1 , and the uncertainty of regional or global ET estimates was up to 50 % of the annual mean ET value, especially in the semi-arid regions (Miralles et al., 2016). These uncertainties mainly arise from the forcing datasets, simplification of mechanisms or imperfect assumptions in processes, and uncertain parameters in the processed models and assimilation methods .
In the last 2 decades, remote sensing products have been assimilated into DGVMs to reduce the uncertainty in modeled carbon and water fluxes Scholze et al., 2017;Exbrayat et al., 2019). Data assimilation (DA) is an effective approach to reduce uncertainties in terrestrial biosphere models by integrating satellite products with models to constrain related parameters or state variables. A DA system contains four main components: a set of observations, an observation operator, an underlying model, and an assimilation method. The assimilation method considers the errors from both models and observations and reduces model uncertainties by minimizing a cost function. The ensemble Kalman filter (EnKF) has been widely applied in land surface process models for parameter optimization, which significantly improve simulations by periodically updating state variables (e.g., leaf area index (LAI) and soil moisture) using remote sensing data without altering the model structure (Rahman et al., 2022b;Bonan et al., 2020;Xu et al., 2021). Yet, the EnKF relies on the instantaneous observations to update the state variable at the current time and gives the predicted value at the next time based on the forward integration of the updated state variable. The four-dimensional variational method (4DVar) assimilation method can obtain the dynamic balance of the estimation in the time window when it is applied to the long-series forecast model (Bateni et al., 2014;Xu et al., 2019). In particular, the proper orthogonal decomposition (POD)-based ensemble 4DVAR assimilation method (referred to as PODEn4DVar) (Tian and Feng, 2015) requires relatively less computation and can simultaneously assimilate the observations at different time intervals. Meanwhile, it maintains the structural information of the four-dimensional space. This method has a satisfactory performance in land DA for carbon and water variables (Tian et al., 2009(Tian et al., , 2010 and can better estimate GPP and ET than EnKF (Ma et al., 2017).
Multiple sources of remote sensing data streams have been used to constrain models for assimilation. As a critical biophysical parameter of the land, leaf area index (LAI) is closely related to many land processes, such as photosynthesis, respiration, precipitation interception, ET, and surface energy exchange (Fang et al., 2019). LAI has a lot of impact on the simulation of carbon and water fluxes (Liu et al., 2018), and accurate LAI estimates can improve the simulations of the carbon and water fluxes Mu et al., 2007). He et al. (2021) assimilated land surface temperature and LAI observations into the 4DVar framework and improved ET and GPP estimates. Soil moisture is a major driving factor affecting vegetation production in arid ecosystems, especially in semi-arid areas . Introducing surface soil moisture (SSM) into the model can significantly improve GPP and ET simulation, particularly in water-limited areas (He et al., 2017;. The advancement of earth observation, machine learning, inversion algorithms, and computer technology has improved the accuracy of global LAI products and boosted model-data fusion studies (Fang et al., 2019;Kganyago et al., 2020;Xiao et al., 2017). The Advanced Very High-Resolution Radiometer (AVHRR) generates global LAI products with the longest historic record (since the early 1980s). The Global LAnd and Surface Satellite (GLASS) LAI product has been verified to have a better accuracy than that of MODIS and CYCLOPES and is more temporally continuous and spatially complete . Several recent studies showed that the assimilation of GLASS LAI into DGVMs enhanced the performance of the models in simulating carbon cycling (e.g., GPP and net ecosystem exchange -NEE) and hydrological (e.g., ET and SM) processes (Ling et al., 2019;Ma et al., 2017;Yan et al., 2016).
Microwave remote sensors are considered effective tools for measuring SM globally (Petropoulos et al., 2015). For example, SSM products have been derived from the Soil Moisture and Ocean Salinity (SMOS) and Soil Moisture Active and Passive (SMAP) satellites equipped with an L-band microwave instrument. The products from these satellites have been evaluated against in situ observations and other SSM products and overall have high accuracy (Burgin et al., 2017;Cui et al., 2018). Additionally, the SMAP performs better than SMOS and other SSM products (e.g., Advanced Scatterometer, ASCAT, and Advanced Microwave Scanning Radiometer 2, AMSR2) with an overall lower error and a higher correlation based on the verification with in situ SSM data from 231 sites (Cui et al., 2018;Kim et al., 2018). The assimilation of SMAP data can improve the simulation accuracy of carbon and water fluxes (He et al., 2017; and hydrological variables (surface soil moisture, root-zone soil moisture (RZSM), and streamflow) (Blyverket et al., 2019;Koster et al., 2018;Reichle et al., 2017). In addition, the assimilation of SMAP data performed slightly better than that of SMOS and ESA CCI data (Blyverket et al., 2019). In the nonlinear model or nonlinear observation operator, only simultaneous assimilation makes optimal use of observations . Therefore, a joint assimilation of SM and LAI can make full use of the two variables. From site Rüdiger et al., 2010; to regional assimilation (Ines et al., 2013), many studies showed that joint assimilation of vegetation parameters and SM can improve the simulation of the carbon and water cycles. Over small regions and at high spatial resolution, Xie et al. (2017) and Pan et al. (2019) showed that the joint assimilation of SM and LAI improved the accuracy of crop yield estimation using high-resolution satellite products from Sentinel-1 and Sentinel-2. At a large regional scale, Bonan et al. (2020) assimilated LAI and SSM together into the Interaction between Soil, Biosphere and Atmosphere (ISBA) land model and improved the modeled GPP, ET, and runoff in the Mediterranean region. Rahman et al. (2022b) jointly assimilated GLASS LAI and SMAP soil moisture to improve water and carbon flux simulations within the Noah-MP model over the continental United States domain. Albergel et al. (2020) jointly assimilated the ASCAT Soil Moisture Index (SMI) and LAI GEOV1 into ISBA through the Global Offline Land Data assimilation system LDAS-Monde to monitor extreme events such as drought and heat wave events. In conclusion, the Kalman filter and its variant methods are mostly used to implement joint assimilation methods at regional scales, which requires many kinds of observation data, and their accuracy directly affects the assimilation performance.
This study stems from the research studies discussed above and further explored the potential of jointly assimilating satellite LAI and soil moisture products globally. Specifically, it was the first time that an updated LPJ-DGVM model was used to jointly assimilate GLASS LAI and SMAP soil moisture for simulating global water and carbon fluxes. The latest global soil moisture datasets (SMOS and SMAP) were used, and the assimilation performance of these two observations was analyzed. Since previous work showed the importance of surface soil moisture in the semi-arid and arid areas, one of the specific objectives of our study is to compare the assimilation effect in the humid and arid areas and improve the understanding of the effect of surface soil moisture on vegetation activity in wet and dry zones. In addition, compared with the assimilation methods in previous studies (mostly using Kalman filter variants), the POD-En4DVar method was used, which greatly improves the computational efficiency.
2 LPJ-VSJA framework and assimilation strategy 2.1 Coupled model (LPJ-PM) for assimilation In this study, a coupled terrestrial biosphere model, LPJ-PM, was used to simulate daily GPP and ET by assimilating satellite-derived LAI and SSM. The LPJ-PM is coupled from LPJ-DGVM and PT-JPL SM . The original input data in PT-JPL SM were all inherited from LPJ-DGVM, with the exception of relative humidity (RH) and surface soil moisture (SMOS and SMAP), including the initial LAI calculated by the LPJ-DGVM or assimilated LAI obtained through the LAI assimilation scheme, canopy height, and the fraction of absorbed photosynthetic active radiation (fPAR). The detailed processes of the LPJ-PM have been described in , and the flow chart for the coupling is shown in Fig. 1.

LPJ-DGVM
The LPJ-DGVM is a process-oriented dynamic model, which considers mutual interaction of carbon and water cycling and is designed to simulate vegetation distribution and carbon, soil, and atmosphere fluxes (Sitch et al., 2003). For each plant functional type (PFT), the GPP is calculated by implementing coupled photosynthesis and water balance.
The canopy GPP is updated daily: where J C is the Rubisco limiting rate of photosynthesis, J E is the light-limiting rate of photosynthesis, and the empirical parameter θ represents the common limiting effect between the two terms. J E is related to APAR (absorbed photosynthetic active radiation, product of FPAR and PAR), while J C is related to Vcmax (canopy maximum carboxylation capacity, µmol CO 2 m −2 s −1 ): where C 1 and C 2 are determined by a variety of photosynthetic parameters and the intercellular partial pressure of CO 2 , which is related to atmospheric CO 2 content and further altered by leaf stomatal conductance (Sitch et al., 2003). APAR and FPAR are directly related to LAI. In the water cycle module, ET is calculated as the minimum of a plant-and soil-limited supply function (E supply ) and the atmospheric demand (E demand ) (Haxeltine and Prentice, 1996;Sitch et al., 2003). The soil structure is simplified  . The abbreviations in the model and the assimilation framework are explained in Table 1. to a "two-layer bucket" model (the top soil layer at a 0-50 cm depth and the bottom layer at a 50-100 cm depth).
In this module, it is assumed that the soil layer above 20 cm produces water through evaporation, and Wr 20 is the relative water content of the soil above 20 cm, which is used as the only soil water limit for calculating vegetation transpiration and soil evaporation. In the evapotranspiration estimation, the oversimplification of soil structure and the soil water limitation lead to a large error (Sitch et al., 2003), while LPJ-DGVM cannot directly assimilate surface soil water due to the limitation of soil layer stratification, and therefore, the satellite-derived SSM cannot be assimilated into LPJ-DGVM directly. The oversimplified soil structure and single soil moisture limitation inevitably lead to sizable uncertainty in ET simulation. Additionally, the monthly input caused a daily variation of the modeled SM, which was also not transmitted to the calculation of GPP and ET. Thus, the updated PT-JPL model (hereafter referred to as PT-JPL SM ) was coupled with LPJ-DGVM, and the model structure was modified so that SSM can be directly assimilated into the coupled model at the daily time step.

PT-JPL SM
In PT-JPL SM , three ET components are modeled: soil evaporation (E), vegetation transpiration (T ), and leaf evaporation (I ). The PT-JPL SM introduced a constraint (0-1, C RSM ) of SSM for T and E, which was used to avoid the implicit soil water control (represented by f SM = RH VPD ) in the PT-JPL model. Vegetation transpiration is calculated as follows: where w obs is the SMAP SSM, w pwp is the water content at the wilting point, and VWC is volumetric water content. w CR is a crucial parameter in characterizing the extent of SM restriction on ET, and w pwp_CH is the canopy height (CH)adjusted surface soil moisture wilting point and is related to the potential of roots capturing water from deeper sources to limit the transpiration rate and characterize the SM availabil-ity (Purdy et al., 2018;Evensen, 2004;Serraj et al., 1999). The specific formula is given in Purdy et al. (2018). Soil evaporation is calculated as follows: The proportion of available water limits the soil evapotranspiration to the maximum available water. This scalar was formulated to represent the relatively accurate extractable water content for the vegetation, determined by soil properties and the water available for evaporation, which is estimated via surface water constraints. The SMAP SSM was applied to model global ET using PT-JPL SM , and the results demonstrated the largest improvements for ET estimates in dry regions (Purdy et al., 2018). Due to the limitation of soil stratification in LPJ-DGVM, the model was coupled with an updated remote sensing ET algorithm in the PT-JPL SM that could better simulate ET in waterlimited regions than in humid regions (Purdy et al., 2018).

Assimilation scheme and experiment procedure
To improve the prediction capability of LPJ-PM, we designed three assimilation schemes: assimilating LAI only (LAI-only, output: ET LAI , GPP LAI ), assimilating SSM only (SSM-only, output: GPP SM , ET SM ), and joint assimilation of LAI and SSM (Joint LAI and SSM assimilation, output: ET joint , GPP joint ), i.e., LPJ-VSJA framework) to test the assimilation performance for simulating GPP and ET.
The proposed LPJ-VSJA framework consists of four main components: the model operator (the LPJ-PM), the observation operator (to establish the relation between the assimilation variable and the observed variable), the observation series (GLASS LAI and SMOS or SMAP products), and the assimilation algorithm (POD4DVar). With the surface soil moisture constraint in the PT-JPL SM , the LPJ-VSJA corrects the output fluxes (GPP and ET in this study).
The experiment consisted of six steps.
-Step 1. Initialize the LPJ-DGVM, and output the reference state variables without assimilation over the experimental period (2010-2018), referred to as the "control run" scenario. - Step 2. Implement three assimilation schemes respectively, and the results represent the assimilation integration state (daily GPP and ET assimilation results are referred to as the "GPP LAI " and "ET LAI " in the LAIonly scheme,"GPP SM " and "ET SM " in the SSM-only scheme, and "GPP joint " and "ET joint " in the Joint LAI and SSM assimilation scheme. This scenario used the same input data and model parameter scheme as the control run scenario. -Step 3. Evaluate GPP and ET results (three schemes) by comparing the parameters, R 2 ( coefficient of determination), BIAS, and ubRMSD (unbiased root mean square deviation), for conditions of without-DA (control run scenario) and with-DA states, and assess the assimilation performance of separate assimilation and joint assimilation to determine the optimal assimilation scheme for GPP and ET, respectively. - Step 4. Evaluate the in situ GPP and ET resulting from the assimilation where the sites are located in wet or dry regions by dividing these validation sites into four parts (humid, sub-dry humid, semi-arid, and arid regions), and this step was designed to assess the superiority of the proposed assimilation scheme in water-limited areas. - Step 5. Compare the ET assimilation performance by assimilating the SMOS data with that by assimilating the SMAP data.
-Step 6. Evaluate the simulated GPP and ET maps based on the optimal assimilation scheme against existing global flux products.

LAI-only assimilation scheme
In the LAI-only assimilation scheme, the observation operator determines the relationship between LAI and foliage projective cover (FPC) in the process model (Eq. 8), and the assimilated LAI will be propagated by energy transmission and ecosystem processes (e.g., photosynthesis, transpiration of vegetative process) in the dynamic model to improve GPP and ET simulations Mu et al., 2007). FPC, the vertically projected percentage of the land covered by foliage, regulates the rate of photosynthate conversion and transpiration. In this study, the GLASS LAI with 8 d interval for the period 2010-2018 was selected as the observation dataset for assimilation, and the FPC state variable was updated daily through running the LPJ-PM (FPC DA , GPP LAI , ET LAI in this study) as shown below: We set the model and observation errors at a given time as 20 % and 10 % (scale factor) of the LAI value and the observed LAI value, respectively. By verifying the assimilation performance (R 2 , RMSD, and BIAS) for different scale factors (f ) of model simulation and observations in the range of 0.05 to 0.40, taking a step size of 0.05 (a total of 64 combinations), the optimal scale factors (0.2 and 0.1) were determined . The model and observation errors was the LAI value multiplied by f . The model integration generation method described by Pipunic et al. (2008) was used to determine the minimum number of ensemble members required to achieve maximum efficiency, and the number of sets was 20.

SSM-only assimilation scheme
In this scheme, the SSM products (SMOS or SMAP) were assimilated into LPJ-PM to obtain more accurate ET (ET SM ) estimates in water-limited areas. The observation series was the SMOS or SMAP SSM product, and the observation operator was the PT-JPL SM model. The ET PM (see Table 1) was estimated by the coupled model (LPJ-PM) introducing SSM as a diagnostic variable. The ET values resulting from the assimilation were applied to compute the top layer SM (50 cm) at the next time step (a nonlinear soil water availability function described by Zhao et al., 2013), providing feedback for subsequent hydrologic and carbon cycle processes. Then, the updated SM values regulated the GPP simulation (output: GPP SM ). Different from other "constant" ET observations, the ET PM ("observation") at each time t was adjusted by absorbing intermediate variables updated after assimilation at time t − 1. The ET PM was shown to be better than ET simulated by LPJ-DGVM but not as good as that simulated by the model with SMAP SSM assimilated . Thus, it is also proven that this SSM assimilation scheme could improve the accuracy of ET simulations .
All assimilation simulations were conducted between January 2010 and December 2018. Between January 2010 and April 2015, SMOS data were used for assimilation, and after May 2015, both SMOS and SMAP data were used for assimilation. An assimilation scheme was conducted when RH and SMOS or SMAP SSM data existed simultaneously; otherwise, the original simulation of the LPJ-DGVM was conducted directly without adjustment of assimilation.
Similar to the LAI assimilation scheme, the model and observation errors were set as 15 % and 5 % of ET LPJ and ET PM , respectively (LPJ-PM was adopted before assimilation). The number of ensemble members was set to 50. The ET PM must be rescaled to the ET LPJ distribution via their corresponding cumulative probabilities using the cumulative distribution function (CDF) matching to avoid introducing any bias into the LPJ-VSJA system .

Joint LAI and SSM assimilation scheme
In this scheme, both LAI from GLASS and SSM from SMOS or SMAP were the observation datasets. The GLASS LAI was assimilated to obtain the FPC DA and ET LAI , and then the FPC DA served as input to LPJ-PM to simulate optimized ET PM , and the ET joint was generated using ET LAI and ET PM . Then, the SM (referred to as SM CO in Fig. S1 in the Supplement) updated by ET joint and the FPC DA were used as input to correct GPP (ET joint ).
Here, we applied the error regulation in the LAI-only scheme and maintained the error setting of the LAI observation and model simulation. Considering the transmission of integrated model error, we recalculated the model error of LPJ-PM after the LAI assimilation and set model and ob-servation errors of ET LAI and ET PM to be 15 % and 10 %, respectively.

POD-based ensemble 4D variational assimilation method
The proper orthogonal decomposition (POD)-based ensemble four-dimensional variational (4DVar) assimilation method (referred to as PODEn4DVar) (Tian and Feng, 2015) has the advantage of avoiding the calculation of adjoint patterns as its incremental analysis field, which can be represented linearly by the POD base (transformed OP, observing perturbation, and MP, model perturbation). Moreover, the PODEn4DVar can simultaneously assimilate multipletime observation data and provide flow-dependent (the flowdependent is the ensembles of forecasting statistical characteristics in the t time) error estimates of the background errors. It has shown advantages in terrestrial assimilation, the Tan-Tracker system (a Chinese carbon cycle dataassimilation system; in Chinese, "Tan" means carbon) and radar assimilation (Tian et al., 2010(Tian et al., , 2009(Tian et al., , 2014Zhang and Weng, 2015). By minimizing the following initial incremental format of the cost function in the 4DVar algorithm, an analysis field can be obtained as follows: Here, the is the model perturbation (MP) matrix, and y y 1 , y 2 , . . ., y N is the observation perturbation (OP) matrix with N samples. Following Rüdiger et al. (2010), the LAI perturbation was set to a fraction (0.001) of the LAI itself. The perturbation of ET PM and ET LPJ conforms to a Gaussian distribution with a mean of 0 and a specified covariance (10 % and 5 % of the ET PM and ET LPJ at time t). The subscript b represents the background field, the superscript T represents a transpose, H is the observation operator of the LAI-only assimilation scheme as described in Sect. 2.2.1, and the SSM-only assimilation scheme is the PT-JPL SM (described in Sect. 2.1.2). M is the forecast model (LPJ-PM in this study), B is the background error covariance, R is the observation error covariance, and obs denotes observation.
Assuming the approximately linear relationship between OP(y ) and MP(x ), POD decomposition and transformation were successively conducted for OP and MP. The transformed OP samples ( y = y 1 y 2 y n ) are orthogonal and independent, and the transformed MP samples ( x = x 1 x 2 x n ) are orthogonal to the corresponding OP samples, where n is the number of POD modes.
The manifestation of the background error covariance is the same as the ensemble Kalman filter (EnKF; Evensen, 2004), and the incremen- tal analysis x a was expressed by the x,n , and Finally, the optimal analysis x a is calculated as x a = x b + x,n˜ y y obs . The detailed derivation process of the algorithm is described by a previous study (Tian et al., 2011).
In the ensemble-based method (Evensen, 2004), the number of ensemble members is usually fewer than that of the observation data and the degrees of freedom of the model variables, and spurious long-range correlations occur between observation locations and model variables. A practical method, the localization technique, is applied to address this issue (Mitchell et al., 2002). The final incremental analysis is rewritten as where d h and d v are the horizontal and vertical distances between the spatial positions of state and observed variables, respectively; and d h,0 and d v,0 are the horizontal and vertical covariance localization Schur radii, respectively. The filtering function C 0 is expressed as where r is the radius of the filter. The assimilation algorithm is mainly divided into two steps: (1) prediction, by running LPJ-PM in the current assimilation window and generating simulation results and background field vectors, and (2) update, the algorithm being used to calculate the optimal assimilation increment x a and analysis solution x a . The simulation results and the initial conditions of the model in the current window are updated using the analysis solution. The updated initial con-ditions were applied for model LPJ-PM prediction, and the above process was repeated.

Validation method for assimilation performance
The R 2 (coefficient of determination), BIAS, and ubRMSD (unbiased root mean square deviation) between simulation and tower-based observations were applied for evaluation. In addition, a Taylor chart was also used to demonstrate the performance of two ET estimations with different SSM observations in terms of R, ubRMSD, and normalized standard deviation (NSD) on 2D plots, to display how closely the datasets matched observations in one diagram (Taylor, 2001). In the Taylor diagram, NSD represents the radial distance from the origin point and the correlation with the site observations as an angle in the polar plot. The ubRMSD is the distance between the observation and the model and is represented in the Taylor chart as a green semi-circular arc with point A as the center of the circle. The closer the model point to the reference point (Point A), the better the performance. This diagram is convenient and visual in evaluating multiple aspects of various models.
The error variance of GPP and ET products was estimated using the triple collocation (TC) approach (Stoffelen, 1998) to validate the global simulation in this study. The method has been extensively applied in the study of hydrology and oceanography (Caires and Sterl, 2003;Khan et al., 2018;O'Carroll et al., 2008;Stoffelen, 1998), particularly in SM studies (Chan et al., 2016;Kim et al., 2018). The TC provides a reliable platform for comparison of spatial assimilation results and in situ measurements. In this experiment, no calculation was performed on the non-vegetated areas, where the correlation was lower than 0.2, in order to have independent datasets and avoid correlated errors (crucial assumptions in TC) (Yilmaz and Crow, 2014).
In this study, the five products were divided into three product categories, including satellite products (MODIS and GOSIF GPP), reanalysis products (GLASS and GLDAS), and data-assimilation products (GLEAM ET and LPJ-VSJA) (C. . One product in each category was selected to form a group to calculate their error. The LPJ-VSJA product was set as the reference data.
For GPP products, GOSIF, GLASS, and LPJ-VSJA were treated as a group, and MODIS, GLASS and LPJ-VSJA were treated as another group to calculate the errors; the final errors were determined by the average of these two.
Similarly, to calculate the errors for ET, GLEAM, GLASS, and MODIS were chosen as a group; LPJ-VSJA, GLDAS, and MODIS were treated as a group; LPJ-VSJA, GLASS, and MODIS were considered a group. In order to reduce the influence of orthogonality hypothesis of error, the first and third groups are for indirect and effective comparison between LPJ-VSJA product and GLEAM product.
3 Experiment sites and data

Description of flux tower sites
We screened over 300 eddy covariance (EC) flux sites across the globe from the FLUXNET2015 (https://fluxnet.org/data/ fluxnet2015dataset/, last access: 10 April 2021), Ameri-Flux (https://ameriflux.lbl.gov/data-and-manifest/, last access: 10 April 2021), and the HeiHe River basin (https://doi.org/10.11888/Hydro.tpdc.270888, Xie, 2017) for the evaluation of assimilation performance over the period from January 2010 to December 2018. The in situ half-hourly latent heat flux (LE) and GPP data from the sites were aggregated into daily data. The daily gap-filled data were excluded if the percentage of gap-filled half-hourly values was more than 20 %. Then we corrected the data of energy non-closure using the Bowen ratio closure method (Twine et al., 2000) to improve the energy closure rate (Huang et al., 2015;Yang et al., 2020). The data were selected to cover the 2010-2018 period with at least 1 year of reliable data, and the result from the error of assimilation is relative to the LE value and seasonal variation (Purdy et al., 2018;Zou et al., 2017). It is essential to have available data every month during a 1-year period, and only days with less than 25 % missing data were processed per month . In addition, for flux tower data, the data were also excluded for the analysis if the SMAP/SMOS SSM data were not of good quality.
Finally, we identified a total of 105 sites across the globe encompassing five major biomes: grassland (18 for GPP and 19 for ET), savanna (11), shrubland (4), forest (49 and 53), and cropland (13 and 14). In the comparative analysis of the performance for simulating ET by assimilating SMOS and SMAP SSM data separately, we selected 46 AmeriFlux sites ( Fig. S3) with at least 1 year of reliable data from 2015 to 2018 based on the simultaneous availability of SMAP and SMOS data, including grassland (19), savanna (11), shrubland (5), forest (23), and cropland (7). Figures S2 and S3 illustrate the location and distribution of the 105 and 46 EC flux tower sites, respectively. A more detailed description is summarized in Table S1 in the Supplement.

Remote sensing datasets: LAI and SSM
The GLASS LAI product with an 8 d time step (8 d average) and 5 km resolution was derived from MODIS and CY-CLOPES surface reflectance and ground observations using general regression neural networks (GRNNs) (Xiao et al., , 2016. The verification of the product using the mean values of high-resolution LAI maps showed that the GLASS LAI values were closer to these high-resolution LAI maps (RMSD = 0.78 and R 2 = 0.81) (Xiao et al., 2016;. Therefore, the GLASS LAI product has satisfactory performance and can be assimilated into terrestrial biosphere models. The SMAP mission  and SMOS mission (Jacquette et al., 2010), the two dedicated soil moisture satellites currently in orbit equipped with L-band microwave instruments, provide SSM retrievals. We chose the SMOS-L2 product and the SMAP-L3-Enhanced product, which both provide global coverage every 3 d for soil depth of 5 cm. Only good-quality SMAP and SMOS data were used. The grid cells with water areas larger than 10 % and those with less than 50 % good-quality data in 1 year were masked out, which alleviates the undesirable model simulations caused by the decrease in SMAP retrieval accuracy (Chan et al., 2016;O'Neill et al., 2010). We only adopted the data with an uncertainty below 0.1 m 3 m −3 , in the actual range (0.00-0.6 m 3 m −3 ), and the temperature of the Land Surface Module (LSM) observation layer (the second layer) was higher than 2 • C (Blyverket et al., 2019).
The GLASS LAI, SMOS, and SMAP observations were resampled to 9 km for site simulation and 0.25 • for regional simulation. The 8 d average of GLASS LAI was assimilated for each day, and the SMAP or SMOS SSM was assimilated every 3 d.

Model-forcing and validation datasets
In this study, the meteorological, soil property, and CO 2 concentration datasets were used to drive the LPJ-PM. The climate-driven datasets used for the initialization of the LPJ-DGVM are the atmospheric CO 2 concentrations (1901-2018) of ice-core measurements and atmospheric observations at the Mauna Loa Observatory and CRU TS4.03 version Climate data from 1901 to 1930 provided by the Climatic Research Unit (CRU) of the Climate Laboratory, University of East Anglia, UK, including monthly precipitation, surface temperature, cloud cover and wet day. In the simulation period of 2010-2018, the Modern Era Retrospective-Analysis for Research and Applications Version 2 (MERRA-2) was adopted, and the variables used included precipitation, temperature, cloud cover, and relative humidity. Soil properties (including limited water content of vegetation at wilt-ing points, field capacity, and soil porosity) from the Harmonized World Soil Database (HWSD) V1.2 dataset (Wieder et al., 2014) were selected as inputs to the PT-JPLSM model. Table 2 provides the spatial and temporal characteristics of the model-forcing datasets in the LPJ-PM (submodule: LPJ-DGVM and PT-JPL SM ).
The GLASS LAI product, SMOS-L2 product, and the SMAP-L3-Enhanced product were assimilated to simulate GPP and ET. For site simulation, in order to maintain consistency with the SMAP Enhanced 3 Level product , model-forcing data were resampled to a 9 km spatial resolution based on EASE-2 projection grid. In the global spatial simulation, the model-forcing datasets were resampled to 0.25 • based on the bilinear method to ensure the consistency of spatial representation.
We used four global ET products and three global GPP products (S. Li and Xiao, 2019; that was resampled to 0.25 • to evaluate the performance of the model with the joint assimilation scheme. Table 3 shows the details of these GPP and ET products.

Performance of LPJ-PM for simulating GPP and
ET with the assimilation of LAI and soil moisture 4.1.1 Accuracy assessment of GPP for separate and joint assimilation In general, the R 2 between GPP LPJ and GPP OBS was above 0.4 at most of the sites (62 sites) and was relatively weak for some sites. The LAI assimilation improved the simulations at most sites (R 2 value increased at 82 sites), particularly for sites in the United States and Europe (Fig. S4). The R 2 improvement from the LAI assimilation (LAI-only assimilation) was superior to that from the SSM assimilation ( Fig. S4b and c). The performance of the joint assimilation was similar to that of LAI-only assimilation. Sites (Fig. S5a) that showed positive BIAS (GPP OBS -GPP LPJ ) were mainly distributed in the humid and sub-dry humid forest, grassland, and arid cropland regions, showing underestimation for GPP OBS . The assimilation improved the accuracy for overestimated sites, but there was no significant improvement for underestimated sites. The ubRMSD implied that the SSM assimilation alone had a better performance than the LAI assimilation alone, especially for sites in arid areas (Fig. 2). The analysis of the above three statistical measures (R 2 , BIAS, and ubRMSD) indicated that the accuracy of joint assimilation was much better than that of each separate assimilation. At the seasonal scale, all three assimilation schemes corrected the model trajectory and significantly improved the growing season simulations, especially for peak values (IT-Tor, US-NR1, and US-NE1) (Fig. 3). In addition, the lin-ear fitting of GPP joint and GPP OBS on a monthly scale was closer to 1 : 1 (y = 0.92x + 21.66, p < 0.001) than that of GPP LAI (y = 0.89x + 28.3, p < 0.001) and GPP SM (y = 0.86x + 41.70, p < 0.001) (Fig. S9). The results in Table S2 support the above analysis, and the joint assimilation showed advantages in overall accuracy in both arid and humid areas.
The residual analysis indicated that the three assimilation schemes for GPP (Fig. S11 left) were different. For the assimilation results, most of the errors were distributed around −70-60 g C m −2 per month. The high GPP OBS values were considerably underestimated. The maximum negative error reached 100 g C m −2 per month. The error distribution of GPP SM was more dispersed than that of GPP LAI and GPP joint . Among the residuals of these three schemes, GPP SM significantly overestimated the GPP OBS , mainly distributed in the 0-200 g C m −2 per month range. GPP LAI showed significant improvement in the overestimation of GPP OBS compared with GPP joint . In general, the GPP joint with the most concentrated error distribution had significant improvement.
After determining the optimal assimilation scheme (Joint LAI and SSM assimilation scheme), we evaluated the GPP LPJ and GPP joint at the site level (Fig. 4). The results showed that GPP joint performed better (R 2 = 0.83, ubRMSD = 1.15 g C m −2 d −1 ) than GPP LPJ (R 2 = 0.69, ubRMSD = 1.91 g C m −2 d −1 ). The noticeable underestimation in all PFTs and overestimation at most forest sites for GPP LPJ were corrected by joint assimilation (GPP joint ). Our joint assimilation methods had better performance in forests, shrublands, and grasslands than in croplands and savannas. Except for the cropland, the linear fitting results of other types were all below the 1 : 1 line, showing the overall underestimation. Superior performance in both the original simulation and assimilation occurred at shrubland (R 2 = 0.93, ubRMSD = 0.89 g C m −2 d −1 ) and grassland (R 2 = 0.97, ubRMSD = 0.83 g C m −2 d −1 ) sites. However, the standard deviation of GPP joint and GPP OBS at savanna sites was relatively large, and the GPP joint at several savanna sites was significantly underestimated.

Accuracy assessment of ET for separate and joint assimilation
In general, the coefficient of determination (R 2 ) between ET LPJ and ET OBS was generally over 0.4 (the simulations were superior to GPP LPJ ) (Fig. S6). ET LAI showed slightly higher R 2 , while some sites showed reduced values (41 sites). The ET SM and ET joint were significantly improved compared with the ET LAI . The R 2 increased considerably in Australia but declined at some sites in the United States after assimilation. For ubRMSD, ET joint performed better than ET SM and ET LAI . The SSM assimilation improved more in humid regions, while the ubRMSD of ET SM was slightly higher in South America (Fig. 5). In the original LPJ-DGVM simulation, the sites with a negative BIAS were mostly lo-   Martens et al. (2017) cated in the humid and sub-dry humid regions, while most of the sites in arid and semi-arid regions had underestimation (Fig. S7a, Table S3). The assimilation improved ET at some of the overestimated sites, but the underestimation over these sites showed little improvement. At the seasonal scale, the model simulations were able to capture the temporal trend of ET OBS , and joint assimilation significantly improved the simulation in the growing season (US-NR1, US-NE1); overall underestimation was observed for ET OBS , especially in winter (Fig. 6). Overall, the linear fitting of monthly ET joint and ET OBS was closer to 1 : 1 than that of ET LAI and ET SM (Fig. S6). The simulation accuracy of joint assimilation was better than that of each separate as- similation, and the performance of the SSM assimilation was better than that of the LAI assimilation.
The ET residual analysis (Fig. S11 right) indicated that the three assimilation scheme errors showed underestimation for ET OBS . In general, the error distribution of separate assimilations was more dispersed than that of the joint assimilation. Similar to the assimilation performance of GPP, ET joint and ET SM significantly improved the overestimation of ET OBS but did not significantly improve the underestimation. For the ET joint , most of the errors were distributed around −30-18 mm per month. The region with high ET OBS was considerably underestimated, and the maximum negative error reached −57 mm per month.
We also evaluated the ET assimilation results at the PFT scale (Fig. 7). The results showed that our ET values resulting from the assimilation performed better at the site level (R 2 = 0.77, ubRMSD = 0.65 mm d −1 ) than that of ET LPJ (R 2 = 0.67, ubRMSD = 0.95 mm d −1 ). Joint assimilation significantly reduced the errors of those shrubland sites with overestimation for ET OBS , and the site distribution was closer to the 1 : 1 line. Our assimilation methods had better performance in forest, savanna, and grassland ecosystems than in cropland and shrubland (Table S3). The linear fitting results of grassland and shrubland were all above the 1 : 1 line, showing overall overestimation. Although the original simulation and assimilation performance were superior at savanna sites (R 2 = 0.95, ubRMSD = 0.78 mm d −1 ), the standard deviations of ET joint and ET OBS at savanna sites were relatively large, which was similar to the GPP results at savanna sites.

Comparison of assimilation performance in semi-arid and arid regions with that in humid and sub-dry humid regions
During the period 2010-2014, monthly GPP joint and ET joint performed differently in humid and sub-dry humid regions and semi-arid and arid regions (Fig. 8, Tables S2, S3). Overall, the GPP and ET simulations had good consistency with the tower data in the two regions. For GPP joint , there was no significant difference in the correlation and fitting coefficients between the two regions. As for ET joint , the fitting results and R 2 values in the semi-arid and arid regions performed better than those in the humid and sub-dry humid regions, which also suggested the importance of SSM for ET estimation in water-limited areas.
On the daily scale, the original GPP simulations (GPP LPJ ) performed better in the semi-arid and arid regions than in the humid and sub-dry humid regions with higher R 2 and lower ubRMSD (Table S2). The R 2 and BIAS implied that the LAI assimilation alone had a better performance than the SSM assimilation alone. However, for sites in arid and semi-arid areas, the ubRMSD showed that the GPP SM improved better than GPP LAI , which both demonstrated SSM data are essential in water-limited regions. For GPP joint , the shrubland in the semi-arid and arid regions had the lowest R 2 values and the second lowest ubRMSD. The forest in the semi-arid and arid regions had the largest improvement after assimilation. In the humid and sub-dry humid regions, the GPP joint of the savanna  and cropland showed the largest improvement (R 2 increased by 64.7 % and 71.1 %, respectively; ubRMSD decreased by 47.0 % and 31.8 %, respectively). The grassland in the semiarid and arid regions had the highest R 2 , and the savanna by combining all indicators had the best assimilation results compared to other types in both regions. Similar to ET joint (Table S3), the ET LPJ in the semi-arid and arid regions was better than that in humid and subdry humid regions in terms of four evaluation indicators (ubRMSD decreased by 34.4 % in semi-arid and arid regions and the ubRMSD decreased by 30.9 % in humid and sub-dry humid regions compared with ET LPJ ). The R 2 and ubRMSD implied that the SSM assimilation alone had a better performance than the LAI assimilation alone, especially for sites in arid areas. and the BIAS showed that the ET LAI improved better than ET SM for sites in humid and sub-dry humid areas. The performance of the original simulation and assimilation of grassland sites in the semi-arid and arid regions was the best among all five PFTs.
To investigate the reasons for better assimilation performance in water-limited regions, we evaluated the GPP and ET simulated by the LPJ-PM according to R 2 , ubRMSD,  and BIAS (Fig. 9). Compared with the semi-arid and arid regions, the humid and sub-dry humid region had smaller R 2 mean, larger BIAS, and no significant difference in mean ubRMSD for GPP SM . In general, the evaluation results of joint assimilation for ET PM were generally consistent with those for GPP SM and GPP SM . ET PM showed underestimation, which was consistent with the underestimation in SSM assimilation. These results indicated that both GPP and ET modeled by LPJ-PM with joint assimilation were less stable Figure 9. Box plots of R 2 , ubRMSD, and BIAS for GPP SM (a) and ET PM (b). A represents the sites in arid and semi-arid areas, and B represents the sites in humid and dry sub-humid areas. and had a lower performance in the humid and sub-dry regions than in the semi-arid and arid regions.

Comparison of assimilation performance in assimilating SMOS and SMAP soil moisture data
The Taylor chart was used to compare the assimilation performance of ET SMAP and ET SMOS at 46 AmeriFlux sites (Fig. 10 left). The results showed that ET SMAP performed better than ET SMOS for most PFTs, except forest. Both ET SMAP and ET SMOS performed well for grassland (closer to point A), and there was little difference between R 2 and ubRMSD. The NSD of ET SMAP in grassland was 0.88, which was closer to 1 than that of ET SMOS . The assimilation of ET had a lower R 2 and higher ubRMSD (0.7-0.8) in forest than other PFTs, and the NSD of cropland and shrubland was lower than that of other PFTs (0.6-0.8), indicating that the assimilation for cropland and shrubland could not reproduce the variations in ET effectively. However, ET SMAP showed significant improvement in R 2 compared with ET SMOS for shrubland and cropland. The assimilation performance of ET SMAP and ET SMOS for savanna showed the greatest difference. In general, the ET SMAP and ET SMOS were slightly different, and the ET SMAP was more improved than ET SMOS . Figure 10 (right) shows the assimilation accuracy of ET SMOS and ET SMAP in different humid and arid regions. The ET SMAP had significant advantages for the four indicators. The R 2 of ET SMAP was higher than that of ET SMOS in all the areas. However, ET SMOS in some evaluation indicators showed a better performance than ET SMAP (BIAS in the humid region; ubRMSD in the sub-dry humid region). This may be due to the overall more humid nature of SMOS SSM than the SMAP SSM. Moreover, the sensitivity of deep soil moisture contributed more to the ET in humid areas than in the water-limited areas.

Global simulations of GPP and ET with joint assimilation of LAI and soil moisture data
To assess the spatial scalability of the LPJ-VSJA assimilation scheme, we simulated the global daily GPP and ET for 2010-2018 with a spatial resolution of 0.25 • . The original results simulated by the LPJ-DGVM and LPJ-VSJA were referred to as LPJ-DGVM GPP(ET) and LPJ-VSJA GPP(ET), respectively. We compared the annual spatial GPP and ET values and the error standard deviation of the LPJ-VSJA with several existing flux products. Figures 11 and 12 depict the spatial distribution of the annual mean and the differences between our simulation results and the global independent satellite-based products. The developed LPJ-VSJA GPP was the closest to GOSIF GPP (Li and Xiao, 2019) in most regions with the lowest spatial mean deviation (LPJ-VSJA-GOSIF) (27.9 g C m −2 yr −1 ), followed by GLASS GPP (51.2 g C m −2 yr −1 ) (Yuan et al., 2010), LPJ-DGVM (−73.4 g C m −2 yr −1 ), and MODIS GPP (93.1 g C m −2 yr −1 ). LPJ-VSJA had higher GPP values than GOSIF GPP in tropical regions, such as Amazonia, Central Africa, and Southeast Asia. In general, the annual mean and differences between MODIS, GOSIF GPP, LPJ-DGVM, and our LPJ-VSJA were in broad agreement (with higher R 2 ranging from 0.74 to 0.95).
LPJ-VSJA ET was the closest to GLEAM ET on the spatial average with the least spatial average deviation (−13.9 mm yr −1 ) and highest R 2 (0.88), followed by GLASS ET (−23.1 mm yr −1 and 0.82), GLDAS ET (−34.7 mm yr −1 and 0.73), LPJ-DGVM (−48.7 and 0.66 mm yr −1 ), and MODIS ET (−122.1 and 0.54 mm yr −1 ). Figure 13a-e represent the spatial error standard deviation (σ ) distribution of MODIS, GLASS, GOSIF, and LPJ-VSJA GPP, respectively. The graphs on the right side depict the corresponding histograms. The σ of the MODIS GPP was evenly distributed between 30 and 60 g C m −2 per month, while the average σ of other products was concentrated in 0-20 g C m −2 per month (90 %). The high errors of all products were concentrated in the hot and hu-mid areas of southern North America, eastern South America, humid and dry sub-humid areas of South Asia, and the savannas of Africa and Australia. The error histogram of GOSIF GPP and LPJ-DGVM GPP was in line with the normal distribution, with an average value of 8.3 and 22.4 g C m −2 per month. The GLASS GPP product had the lowest mean value (3.6 g C m −2 per month), followed by LPJ-VSJA (4.7 g C m −2 per month), but the error variance of the LPJ-VSJA product was the lowest, indicating the stability of the regional error (Table S4). Compared to the LPJ-DGVM, the joint assimilation results showed improvement in all regions (the average error reduced by 17.7 g C m −2 per month), especially in the humid regions of South Asia, Australia, and the United States. Our LPJ-VSJA GPP was generally proven to have high accuracy and stability for spatial analysis and could provide a reference for other model products. Figure 14a-f show the σ of MODIS, GLDAS, GLEAM, GLASS, and LPJ-VSJA ET (the units are millimeters per month), and the right graphs are the corresponding histograms. The σ values of GLDAS and LPJ-VSJA represented a normal distribution trend. Except for MODIS, GLASS, and LPJ-DGVM (0-60 mm per month), the σ of other products was generally between 0-20 mm per month. The simulation error was relatively smaller in the Northern Hemisphere than in the Southern Hemisphere, especially for GLASS ET and GLDAS ET. Significant improvements in joint assimilation were observed in the Northern Hemisphere (especially in the semi-arid areas of the western United States and savanna and cropland areas of central India) and African savanna areas, and the average error was reduced by 15.1 mm per month. In general, the error mean and variance of LPJ-VSJA and GLEAM products were relatively low (Table S4), and there was no apparent extremely high-value region in the error distribution. Among the five products, LPJ-VSJA had the lowest error mean and variance and the highest accuracy.

Advantage of joint assimilation for GPP and ET
The benefit of employing multiple data flows in an assimilation system is the complementarity of the data, which enables constraints on different components of the underlying process-based terrestrial biosphere model. Due to the interaction and feedback between the internal components of the model, the assimilation of multiple observations has a synergistic effect, and the integrated constraints are greater than the individual constraint (Kato et al., 2013). The advantage of our joint assimilation is that it can improve the simulation accuracy of both GPP and ET, especially ET, in arid and semi-arid regions.
In the GPP assimilation experiment, the performance of the LAI assimilation was better than that of the SSM assim- ilation possibly for two reasons: (1) the LPJ-VSJA is more controlled by LAI data because the ratio of assimilated LAI (daily input) to SSM observations (3 d interval input) is approximately 3 : 1, which makes the likelihood function biased to LAI data; and (2) the SM directly influences the simulation of ET, and the corresponding time function (computes the top layer SM -50 cm) used here by Zhao et al. (2013) will result in the error of the updated top SM and propagating the error to the GPP SM . In addition, the 8 d interval LAI has the capability to capture the temporal variability of phenology.
Current studies on terrestrial water and carbon flux assimilation mostly focus on the assimilation between a single model framework and observation results, lacking the fusion and comparison between multiple models. The processed models used in DA are simplifications and approximations of reality, and different models focus on different ecological processes. In this study, the updated ET module was integrated to compensate for the simplification of soil stratification and the lack of SM information in the hydrological module of the LPJ-DGVM. Therefore, the integration of multiple types of models and multi-source observation data (remotely sensed data, ecological inventory data; National Ecological Observatory Network, NEON; Keller et al., 2008) and other measurements (Desai et al., 2011;Hayes et al., 2012) is expected to more objectively and effectively simulate the real state of ecosystems.

Comparison of joint assimilation (LPJ-VSJA) and other models for GPP and ET across regions and vegetation types
Global GPP and ET for different products were calculated by multiplying the global mean GPP density flux by the global vegetation area (122.4 × 10 6 km 2 ) that originated from the MODIS land cover product (Friedl et al., 2010). The mean global GPP of the LPJ-VSJA (130.2 Pg C yr −1 ) was approximately 12 % lower than that of PML-V2 (145.8 Pg C yr −1 ) and 18 % higher than that of GLASS and MODIS, respectively (Table S6). The GPP values of LPJ-VSJA and GOSIF were the most similar. The GOSIF GPP was developed from gridded solar-induced chlorophyll fluorescence (SIF) using simple linear relationships between SIF and GPP. Our global LPJ-VSJA GPP estimates were within the currently most plausible 110-150 Pg C yr −1 range. As for ET, our results were similar to those of GLEAM ET and lower than those of PML-V2, GLDAS-2, and GLASS ET (∼ 72 000 km 2 yr −1 ). Joint assimilation improved the overestimation of LPJ-DGVM ET. At the daily scale, the estimation accuracy of PML-V2 and GLDAS-2 products, calibrated with flux tower data, was better than that of our estimates, which suggests underestimation of LPJ-VSJA ET in wet regions. It is likely because the SSM of SMAP or SMOS was underestimated in the wet region, or the influence of deep SM was underrepresented. According to Seneviratne Figure 11. Column A: spatial distribution of annual LPJ-VSJA GPP and other independent satellite-based datasets. Column B: spatial distribution of the difference between annual LPJ-VSJA GPP and other independent satellite-based datasets. Column C: scatter plots between these products. Black lines show the 1 : 1 line, and red lines show the regression fit. et al. (2010), satellite-based ET estimation approaches often overestimate ET in areas of arid and semi-arid climatic regimes in the magnitude of 0.50 to 3.00 mm d −1 . The poor performance of these models can largely be attributed to the lack of constraints of SSM or RZSM and more accurate vegetation parameters (Gokmen et al., 2012;Pardo et al., 2014). For instance, the monthly estimated ET modeled by the Penman-Monteith-Leuning (PML) model agreed with flux tower data well (R 2 = 0.77; BIAS = −9.7 %, approximately 0.2 mm d −1 ). Our annual ET simulations were lower than other products and slightly underestimated tower ET with a BIAS of 0.19 mm d −1 (ET OBS -ET joint ).
In general, GPP and ET had better assimilation performance in arid and semi-arid regions than in humid and sub-dry humid regions, likely because of the following reasons. First, the incorporation of SSM is more important for vegetation growth in water-limited areas. The module PT-JPL SM has been proven to have better performance in semi-arid and arid regions (Purdy et al., 2018). Our integrated model LPJ-PM also performed better in semi-arid and arid regions by assimilating SMAP soil moisture (Li et al.,Figure 12. Column A: spatial distribution of annual LPJ-VSJA ET and other independent satellite-based datasets. Column B: spatial distribution of the difference between annual LPJ-VSJA ET and other independent satellite-based datasets. Column C: scatter plots between these products are provided on the right of the difference maps. Black lines show the 1 : 1 line, and red lines show the regression fit. 2020). Second, the input performance, including SMOS and SMAP SSM products, is better in arid and temperate regions than in cold and humid regions (Zhang et al., 2019). Third, the vegetation types in humid regions are more complex and relatively less accurately simulated by the LPJ-DGVM within a single grid cell. For comparison, Zhang et al. (2020) used a data-driven upscaling approach to estimate GPP and ET in global semi-arid regions. This data-driven approach (R 2 = 0.79, RMSD = 1.13 g C m −2 d −1 ) had slightly higher performance in estimating GPP than our LPJ-VSJA (R 2 = 0.73 and RMSD = 1.14 g C m −2 d −1 ), and the datadriven method (R 2 = 0.72 and RMSD = 0.72 mm d −1 ) had identical performance for estimating ET with our LPJ-VSJA (R 2 = 0.73 and RMSD = 0.72 mm d −1 ). Our assimilation performance varied with PFT. The GPP and ET assimilation results of savanna sites performed well in both dry and wet regions, and those of shrubland sites showed the most remarkable improvement for simulations of LPJ-DGVM. The original simulation and assimilation performance of grassland sites in the semi-arid and arid regions were the best for all five PFTs. Consistent with our research, previous studies also showed better GPP or ET simulations for grassland, savannas, and shrublands biomes. For instance, Feng et al. (2015) validated five satellite-based ET algorithms for semi-arid ecosystems and concluded that all the models produced acceptable and relatively better results for most grassland, savanna, and shrubland sites. Yang et al. (2017) demonstrated that the GLEAM ET had a superior performance for the grassland sites. The GOSIF GPP demonstrated better simulation for grassland and woody savannas sites at 8 d time steps with higher R 2 (0.77 and 0.83, respectively) and lower RMSD (1.48 and 1.1 g C m −2 d −1 ) (Li and Xiao, 2019). In contrast, our LPJ-VSJA GPP showed an R 2 of 0.87 for grassland and 0.75 for savannas and an RMSD of 1.11 and 1.1 g C m −2 d −1 , respectively, in semi-arid and arid regions.

Uncertainty analysis of joint assimilation
Our validation results at both site and regional scales indicated that uncertainty existed in LPJ-VSJA daily GPP and ET estimates. The errors from the tower EC observations, model-driven data, model structure, error of satellite-based observations (e.g., LAI and SSM), and the spatial scale mismatch between the ground observed footprint size and satellite-derived footprint size were the vital factors affecting assimilation performance.
First, recent studies have revealed errors in the GLASS LAI and SMOS or SMAP SSM compared with ground measurements. By computing the RMSD and R 2 of each product, the GLASS LAI accuracy was clearly superior to that of MODIS and Four-Scale Geometric Optical Model-based LAI (FSGOM) in forests, and GLASS and FSGOM led to much higher annual GPP and ET estimates compared to MCD15 (Liu et al., 2018). The vegetation type (or land cover) misclassification caused 15 %-50 % differences in LAI retrieval (Fang and Liang, 2005;Gonsamo and Chen, 2011). Yan et al. (2016) calculated a RMSD of 0.18 for the GLASS LAI over a range of HeiHe River basin sites and used the error to improve the simulation of LAI and fluxes by assimilating GLASS LAI data. Previous studies reported an improvement in the performance of the SMOS and SMAP products (Lievens et al., 2015;Miernecki et al., 2014), which both provide an accuracy of 0.04 m 3 m −3 (Zhang et al., 2019). However, the actual observation error of these two products typically depends on the spatial location and time of the year (RMSD varying between 0.035 and 0.056 m 3 m −3 for several retrieval configurations) (Brocca et al., 2012). According to Purdy et al. (2018), the ET simulated by PT-JPL SM using the 9 km SM_L3_P_E data showed an inferior agreement (R 2 = 0.47) but a relatively low RMSD (0.77 mm d −1 ), due to the SMAP errors in the grid cell with soil heterogeneity and the climatological differences between model SM forecasts and SMAP SSM (Reichle and Koster, 2004). We rescaled the ET PM to the probability distribution of the ET LPJ through a cumulative distribution function (CDF) to correct the potential seasonal biases of ET PM before assimilation.
Second, there is large uncertainty in the influence of RZSM as the source of water available to plants (Albergel et al., 2008;Bonan et al., 2020). Our GPP results of irrigated sites were largely influenced by US-Ne1, an irrigated site. This site maintained high annual GPP in 2012 despite the drought (Fig. S4). However, the SMOS SSM in 2012 had a lower SSM annual mean than the site observations, likely because the detected soil layer (0-50 cm) of the site observation is deeper than that of the satellite retrieval, and the cumulative deep soil moisture due to the regular irrigation was higher than the SSM that could easily be vaporized during the drought period (Fig. S4). Therefore, the influence of deep SM of some cropland sites during the drought years induced large simulation errors and unsatisfactory assimilation performance. Moreover, some deep-rooted forests maintain a high LAI during drought by absorbing deep SM (> 2 m) and groundwater (Zhang et al., 2016). Thus, joint assimilation of the LAI and SSM may eliminate a portion of the underestimation of GPP of such vegetation in drought periods. Therefore, further research is needed on how to optimally utilize satellite SM data for improving GPP and ET simulations.
Third, the problem of mixed pixels and mismatches in the observation footprints may also have an influence on the accuracy of estimated GPP and ET. The 5 km spatial resolution of the GLASS LAI, 9 km of SMAP, and 25 km of SMOS products cannot capture the sub-grid-scale condition, especially in grid cells for complex land surfaces or strong soil heterogeneity. To ensure the consistency of the grid-cell representativeness for the LAI and SSM, the interpolation results in errors that propagate through the modeling and assimilation, causing the accumulation of output errors (Nijssen and Lettenmaier, 2004). Moreover, the shrubland in the LPJ-DGVM was most likely simulated as C 4 grassland in the hydrothermal condition of semi-arid and arid regions. In contrast, the shrubland tended to be hybrid vegetation types (grassland mixed with other types of forest vegetation) in the hydrothermal condition of humid and sub-dry humid regions, and the simulated canopy height is closer to the real condition of shrubland. This might also be the reason for the superior performance of ET LPJ and assimilation results of shrubland sites in humid and sub-dry humid regions.
When assimilating multiple data streams, all data streams could be in the same optimization (simultaneous assimilation) or use a sequential (step-by-step) approach. Mathematically, simultaneous optimization is optimal because strong parametric connections are maintained between different processes. However, complications may arise due to computational constraints related to the inversion of large matrices or the requirement of numerous simulations, particularly for global datasets (e.g., , and due to the "weight" of different data streams in the optimization (e.g., Wutzler and Carvalhais, 2014). This is particularly true when considering a regional-to global-scale, multiple-site optimization of a complex model that contains many parameters and which typically takes on the order of minutes to an hour to run a 1-year simulation. In practice, it is very difficult to define a probability distribution that properly characterizes the model structural uncertainty and observation errors, accounting for biases and non-Gaussian distributions. Nevertheless, a step-wise assimilation may be useful in dealing with possible inconsistencies on a temporary basis, since the parameter error covariance matrix must be propagated at each step. It is worth noting that the deviation between the model and observational data should be solved in the process of step-wise assimilation; for example, the joint assimilation in this study, the satellite observations, and model simulation were integrated using the CDF method so that the first step of the assimilation will strongly constrain the uncertainty of parameters related to phenology and carbon flux and propagate to the second step. Alternative solutions were found for water-related parameters through soil moisture, providing a better fit for all data streams. The sequence of assimilation is essential in the step-wise assimilation, and if the first observation contains a strong BIAS, then the associated error correlation will also propagate through the first assimilation. If the autocorrelation in the observation error, or the correlation between the data stream errors is not considered, it is likely that the posterior simulation has been overturned. That is, we overestimate the reduction in parametric uncertainty. If two observational data are less uncertain (i.e., high precision of observation data), the model of deviation is smaller (depending on the spatial scale and inversion method). Moreover, the correlation of these observations is stronger, and they contain enough spatiotemporal information to limit all the parameters' optimization accurately, and the step-wise assimilation performance is basically the same as that of simultaneous assimilation.

Conclusions
We developed an assimilation system LPJ-VSJA that integrates GLASS LAI, SMOS SSM, and SMAP SSM data to improve GPP and ET estimates globally. The system was designed to assimilate two SSM products (SMOS and SMAP) into the integrated model -LPJ-PM for both dry and humid regions through separate and joint assimilation. The results show that the joint constraints provided by vegetation and soil variable strategies improve model simulations. Both the original and joint assimilation results for GPP and ET in semi-arid and arid regions performed better than those in humid and sub-dry humid regions, and the LPJ-PM that emphasized the SSM information is more suitable for the waterlimited regions. For ET assimilation, the different SSM products influence assimilation performance, and SMAP SSM possesses a slight advantage in most vegetation types and in both dry and humid regions. Our global LPJ-VSJA GPP and ET products have relatively higher accuracy than other products, especially in water-limited regions with lower ET values.
Author contributions. SL and LZ designed the experiment and wrote the paper with support from all coauthors. SL and RM implemented the codes necessary for the experiments. JX contributed to the structure of the article and comparison of assimilation performance between the SMOS and SMAP experiments. XT provided the POD-En4DVAR method and the code. MY contributed to the validation and analysis of the results. All the authors contributed to the synthesis of results and key conclusions.