Information on the spatial and temporal patterns of surface carbon flux is crucial to understanding of source/sink mechanisms and projection of future atmospheric CO2 concentrations and climate. This study presents the construction and implementation of a terrestrial carbon cycle data assimilation system based on a dynamic vegetation and terrestrial carbon model Vegetation-Glob- al-Atmosphere-Soil (VEGAS) with an advanced assimilation algorithm, the local ensemble transform Kalman filter (LETKF, hereafter LETKF-VEGAS). An observing system simulation experiment (OSSE) framework was designed to evaluate the reliability of this system, and numerical experiments conducted by the OSSE using leaf area index (LAI) observations suggest that the LETKF -VEGAS can improve the estimations of leaf carbon pool and LAI significantly, with reduced root mean square errors and increased correlation coefficients with true values, as compared to a control run without assimilation. Furthermore, the LETKF-VEGAS has the potential to provide more accurate estimations of the net primary productivity (NPP) and carbon flux to atmosphere (CFta).
The understanding of carbon exchange between terrestrial ecosystems and the atmosphere can be improved through direct observations and experiments, as well as through modeling activities ( Yuan et al., 2009). Several terrestrial biosphere models, called forward models, have been developed to study the terrestrial carbon cycle and how the ecosystems respond to changes in climate and atmospheric CO2 concentration ( Chen et al., 1999; Zeng et al., 2005a; Thornton et al., 2007). However, forward modeling has large uncertainties due to various biased data sources and inaccurate parameterizations and parameters ( Piao et al., 2012).
Data assimilation has been applied to improve the terrestrial carbon cycle simulation by incorporating observational data into process-based models ( Wang et al., 2009; Luo et al., 2011). Rayner et al. (2005) developed a Carbon Cycle Data Assimilation System (CCDAS), which assimilated atmospheric CO2 concentration into a terrestrial biosphere model using the Bayesian approach. In addition, many studies have used this system to improve global carbon cycle simulation through assimilation of various data streams, mainly ground-based eddy covariance CO2 flux and remotely sensed data (e.g., the fraction of absorbed photosynthetically active radiation (fPAR)) on a global scale ( Scholze et al., 2007; Kato et al., 2013). However, the data assimilation frame used in the CCDAS needs to calculate the adjoint operator of the forecast model, and the computational cost would increase with adjoint.
Recently, a few studies have been conducted with encouraging results ( Mo et al., 2008; Gao et al., 2011) using an ensemble Kalman filter (EnKF) data assimilation algorithm, which does not require the adjoints of the model dynamics or observational operator, in the terrestrial carbon cycle data assimilation research. Unlike other EnKF schemes that assimilate observations serially, the local ensemble transform Kalman filter ( LETKF; Hunt et al., 2007) updates the local state variables of each grid point independently by assimilating all observations that may affect the state at that grid point. Moreover, the use of the localization approach in the LETKF makes this assimilation scheme highly parallel ( Szunyogh et al., 2007). Its computational efficiency, simplicity of implementation, and accuracy make the LETKF a particularly appealing EnKF scheme, and it has been used in the numerical weather forecast ( Miyoshi and Kunni, 2012) and atmospheric carbon cycle ( Kang et al., 2011). However, few studies have been conducted on the application of the LETKF algorithm in the terrestrial ecosystem research.
With this in mind, the LETKF algorithm is used to assimilate the leaf area index (LAI) observations into a dynamic vegetation and terrestrial carbon model, Vegetation-Global-Atmosphere-Soil (VEGAS; Zeng et al., 2004, 2005a, b), in this study. Our goal is to improve model-based terrestrial carbon cycle simulations (e.g., carbon pools and fluxes) through the assimilation of observational LAI data. Furthermore, we will investigate the effects of LAI observations on the terrestrial ecosystem components with different turnover times and the effect of vegetation types on assimilated results. In Section 2, a terrestrial carbon cycle data assimilation system based on the VEGAS model and the LETKF algorithm (hereafter LETKF-VEGAS) will be introduced. To evaluate the reliability and performance of the LETKF-VEGAS, an observational system simulation experiment (OSSE) frame- work will be designed and some preliminary results will be presented in Section 3.
The terrestrial carbon cycle data assimilation system LETKF-VEGAS consists of a forecast operator VEGAS to simulate the terrestrial carbon cycle, an observational operator to map the model state vector into observation vector space by providing model equivalents of the observed values ( Bouttier and Courtier, 1999), and an ensemble-based LETKF assimilation algorithm to optimize the state variables. A sketch of the LETKF-VEGAS is shown in Fig. 1a.
The VEGAS model ( Zeng et al., 2004, 2005a, b) can simulate the dynamics of vegetation growth and competition among different plant functional types (PFTs). It includes four PFTs (namely, broadleaf tree, needleleaf tree, cold grass, and warm grass), among which the competition is determined by climatic constraints and resource allocation strategy such as temperature tolerance and height-dependent shading. The vegetation module in VEGAS is coupled to a two-layer physical land surface model Simple-Land (SLand) ( Zeng et al., 2005a).
The VEGAS model includes five vegetation pools: leaf ( Cleaf), fine root ( Crootf), coarse root ( Crootc), sapwood ( Cwoods), and heartwood ( Cwoodh) and six soil carbon pools: metabolic litter ( Clmeta), structural litter ( Clstru), fast soil ( Csfast), intermediate soil ( Csmed), slow soil ( Csslow), and decomposer ( Cdcmp). The carbon balance of each carbon pool can be described using the following generic biomass equation, involvingpools and fluxes (sometimes termed as stocks and flows):
(1) |
(2) |
where C is the carbon pool size per unit area and Fin and Fout are the carbon flux into and out of the pool, respectively. The loss terms include the following: r represents respiration, t is turnover to other pools, b is background mortality due to natural loss or disturbance, s represents loss due to cold, drought and other stresses, f is loss due to fire. We use Eq. (1) to forecast dynamically the terrestrial carbon cycle.
The key carbon flux outputs from the VEGAS model are related as follows:
(3) |
(4) |
(5) |
where the gross primary productivity (GPP) is the total amount of carbon fixed the terrestrial ecosystem through photosynthesis; NPP is the net primary productivity; Ra and Rh are autotrophic respiration and heterotrophic respiration, respectively; and NEP is the net ecosystem production. We do not consider the contribution of anthropogenic effects to the VEGAS model; hence, the carbon flux to atmosphere (CFta) is referred to as the net ecosystem exchange (NEE).
The physical climate forcing input to VEGAS includes surface temperature, precipitation, atmospheric humidity, radiation forcing, and surface wind. For this study, the driving data of precipitation for VEGAS have been collected from a combination of the Climate Research Unit ( Mitchell and Jones, 2005) and the Xie and Arkin (1996) data set. Data on the surface air temperature and surface downward solar radiation have been taken from the National Aeronautics and Space Administration (NASA) Goddard Institute for Space Studies ( Hansen et al., 1999) and the Multi-scale Synthesis and Terrestrial Model Intercomparison Project (http://nacp.ornl.gov/MsTMIP.shtm), respectively. Seasonal climatology of humidity and wind speed were used in this study ( Qian et al., 2008).
The LETKF algorithm ( Hunt et al., 2007) is an advanced square-root EnKF data assimilation scheme, in which observations are assimilated simultaneously to update the ensemble mean, while the ensemble perturbations are updated by transforming the forecast perturbations through a transform matrix term. It assimilates all observations within a certain distance at all analysis grid points simultaneously ( Hunt et al., 2007). In the LETKF algorithm, the analysis ensemble mean
(6) |
(7) |
(8) |
(9) |
(10) |
where
In this study, we demonstrate a framework for the assimilation of LAI observations using the LETKF-VEGAS to update VEGAS-simulated carbon pools. As in the VEGAS model, there is a linear relationship between LAI and leaf carbon, which is expressed as follows:
(11) |
where
To evaluate the reliability of the LETKF-VEGAS, we conducted an OSSE using a set of synthetic data ( Gao et al., 2011; Kang et al., 2011). The OSSE framework was developed through twin experiments, in which the forecast model simulation, forced by the perfect initial condition (IC) and atmospheric forcing, was defined as the "nature" run and served as the true state of the OSSE to compare with the pure simulation or assimilation model run with imperfect IC and atmospheric forcing. Here, we assume that the prediction model and observation operator are perfect, an assumption that has been used in many earlier OSSE studies ( Tong and Xue, 2005; Xue et al., 2006; Kang et al., 2011). Model error will be an issue for future studies.
Figure 1b shows the flowchart of the OSSE. VEGAS was first spin-up run using the 1901 climate forcing repeatedly until the carbon pools reached equilibrium (steady state) ( Zeng et al., 2005a; Qian et al., 2008). This state was then used as the IC for the 1901-1992 run. For performing comparisons and analyses in this study, we focused only on the period of 1991-1992 in view of the reliability of LETKF-VEGAS. Here, the two-year (1991-1992) simulation was chosen as the nature run. The observational LAI data were generated from the nature run by selecting a 10-day (close to the temporal resolution of satellite-retrieved LAI, e.g., eight days for the Moderate Resolution Imaging Spectroradiometer (MODIS)) sampled time series of the data and adding random noise at a scale of 0.05 to represent observational errors. All the observations are assumed to be available during the experimental period.
Unlike the nature run, we chose an imperfect IC and atmospheric forcing to run VEGAS for the period of 1991-1992 in the control or assimilation experiments. The imperfect atmospheric forcing was generated using a climatological mean (without any interannual variation). The imperfect IC for the control run was chosen randomly from historical simulations. As the LETKF is one of the ensemble-based assimilation algorithms, the initial ensemble members (background) were also chosen by random sampling from historical long-term simulations ( Kang et al., 2011). It should be noted that VEGAS was operated on a daily time step at 2.5°×2.5° resolution for all experiments in this study. The focus of this study is the natural land-atmosphere fluxes. Therefore, land-use change is not included in the VEGAS model ( Zeng et al., 2004, 2005a).
The ensemble size ( N) is an important parameter in EnKF and should be large enough to ensure a correct estimate of the error covariance in the predicted model states ( Williams et al., 2005). However, a very large ensemble size may introduce a heavy computation burden. In this study, the ensemble size was set to 80 ( Mo et al., 2008).
We randomly chose a site (33.75°N, 91.25°W) in North America to evaluate the performance of the LETKF- VEGAS. The dominant PFT at this site is broadleaf tree and its climate type is temperate (Table 1). Figure 2 shows the time series of leaf carbon pool ( Cleaf), LAI, NPP, and CFta based on the data from nature run (TRUE), control run (CTRL), and assimilation model run (ASS). The observational LAI data (OBS) are also presented in Fig. 2b. It is readily found that the LETKF-VEGAS is capable of improving model-based leaf carbon pool due to the assimilation of LAI observations and of providing more accurate LAI estimations. Table 2 shows that the assimilated Cleaf is closer to true values with smaller root mean square error (RMSE = 8.4 g C m-2) and higher correlation ( r = 0.97), compared to the CTRL (RMSE = 20.3 g C m-2 and r = 0.50). LAI estimations from the LETKF-VEGAS also improved significantly due to their close relationship with Cleaf with reduced RMSE (0.19 g C m-2), as compared to CTRL (RMSE = 0.53 g C m-2) (Fig. 2b and Table 2). Moreover, the control run could not capture well the temporal variation of true LAI values ( r = 0.49) during the experimental period, which shows similar seasonal variations for the two years ( from 1991 to 1992) due to the effect of imperfect atmospheric forcing without any interannual variation. However, obvious improvement is observed in ASS with a higher correlation ( r = 0.95) with TRUE.
![]() | Table 1 Site characteristics of this study. |
However, the improvement in assimilated soil carbon pools is not as much as that in the leaf carbon pool (not shown in this study), which may be due to that observed LAI data contribute less information on the carbon pools with a slow turnover time ( Zeng et al., 2005b; Weng et al., 2011). As long-running forest stem surveys and tree ring data could provide an important constraint on C cycling of slow pools, how to implement the assimilation of these data in the LETKF-VEGAS will be a topic of our future studies.
Since carbon fluxes usually act as diagnostic variables in the VEGAS model, they are not changed by the analysis step of data assimilation in the LETKF-VEGAS and updated only by the model forecast using analyzed state variables from data assimilation. Here, we will discuss the effect of the assimilation of LAI observations on carbon fluxes using the LETKF-VEGAS. Due to an improvement in state variables (e.g., Cleaf), simulations of NPP and CFta made by the LETKF-VEGAS match true values better than the control run (Figs. 2c and 2d), with lower RMSEs of 0.49 g C m-2 d-1 (NPP) and 0.35 g C m-2 d-1 (CFta), respectively. It can be also found from Table 2 that assimilated NPP and CFta have higher correlation coeffi-cients ( r = 0.94 and 0.91, respectively) with true values, compared to those from control run ( r = 0.92 and 0.88, respectively).
![]() | Table 2 The root mean square errors (RMSEs) and correlation coefficients ( r) between model simulations and true values from nature run for the four variables ( Cleaf, LAI, NPP, and CFta) during the experimental period (1991-1992) at the broadleaf tree site (33.75°N, 91.25°W). |
To gain further insights into the effect of PFTs on the assimilated results, three other sites, which represent needleleaf tree (NLT), cold grass (CG), and warm grass (WG), were chosen to evaluate the performance of the LETKF-VEGAS under different vegetation types. Characteristics of these sites are presented in Table 1. Figure 3 shows the mean bias errors (MBEs), RMSEs, and correlation coefficients between the simulated (CTRL and ASS) daily Cleaf (Figs. 3a-c) and LAI (Figs. 3d-f) and true values from nature run over four PFTs from 1991 to 1992. It can easily be observed that the LETKF-VEGAS can improve the model simulations of Cleaf and LAI significantly for different PFTs. For the WG site, the assimilated Cleafshows a lower bias (1.6 g C m-2) and RMSE (1.9 g C m-2) (Figs. 3a and 3b) to true values, compared to the control run (MBE = 5.9 g C m-2 and RMSE = 12.8 g C m-2). Moreover, the LETKF-VEGAS has a higher correlation ( r = 0.93) with true values, while that with CTRL is only 0.58 (Fig. 3c). Similar results could be observed for LAI values at this site (Figs. 3d-f). It can also be seen from Figure 3 The mean bias errors (MBEs, RMSEs, and correlation coefficients between the simulated daily Cleaf (a-c) and LAI (d-f) and true values from nature run over four PFTs (defined in Table 1) from January 1991 to December 1992. CTRL is the control run without assimilation, and ASS is the assimilated results from the LETKF-VEGAS.
Fig. 3 that the LETKF-VEGAS has similar correlation coefficients with true values to the control run for both Cleaf and LAI at the NLT and CG sites. This may be due to that CTRL exhibits higher correlations with TRUE, which leads to a little improvement space for the LETKF- VEGAS. However, compared to CTRL, the LETKF- VEGAS reduces the biases significantly (Fig. 3). |
In this study, a terrestrial carbon cycle data assimilation system with the LETKF-VEGAS was developed, and an OSSE framework was designed to evaluate the reliability of the system. Preliminary results are promising as they show that the LETKF-VEGAS works effectively and is capable of improving significantly the model-based leaf carbon pool through the assimilation of observational LAI data over all four PFTs. However, observed LAI data have weak constraints on the estimates of other components of the carbon cycle with slow turnover times (e.g ., soil carbon pool), which leads to only slight improvement in these carbon pools. Due to the improvement of state variables (e.g., Cleaf), simulations of LAI and some key carbon fluxes (e.g., NPP and CFta) performed using the LETKF-VEGAS match true values better than control run simulations without assimilation.
As the design of the LETKF-VEGAS is flexible to include further streams of observational data (e.g., eddy flux measurements and satellite-derived fPAR), these data will be assimilated to VEGAS model through different observational operators using the LETKF-VEGAS ( Gao et al., 2011; Kato et al., 2013). In addition, parameters in the processed ecosystem model have large uncertainties due to the complexity of ecosystem processes, especially for those that could not be measured directly in the field ( Mo et al., 2008). Our future study will focus on the implementation of simultaneous optimizations of state variables and model parameters using the LETKF algorithm in the LETKF-VEGAS.
Acknowledgements. This research was supported by the National Natural Science Foundation of China (Grant No. 41305066), the Special Funds for Public Welfare of China (Grant No. GYHY201306045), and the National Basic Research Program of China (Grant Nos. 2010CB951101 and 2010CB428403). We would particularly like to thank Fang ZHAO, Dr. Eugenia KALNAY, Dr. Takemasa MIYOSHI, Dr. Ji-Sun KANG and Dr. Steven GREYBUSH for their helpful comments. We also thank two anonymous reviewers for constructive comments and suggestions, which have helped us in improving the paper.
1 |
|
2 |
|
3 |
|
4 |
|
5 |
|
6 |
|
7 |
|
8 |
|
9 |
|
10 |
|
11 |
|
12 |
|
13 |
|
14 |
|
15 |
|
16 |
|
17 |
|
18 |
|
19 |
|
20 |
|
21 |
|
22 |
|
23 |
|
24 |
|
25 |
|
26 |
|
27 |
|
28 |
|