Introduction and the need for diagnostics

ACP

Atmospheric Chemistry and Physics

ACP

Atmos. Chem. Phys.

1680-7324

Copernicus Publications

Göttingen, Germany

10.5194/acp-17-7405-2017

Diagnostic methods for atmospheric inversions of long-lived greenhouse gases

Michalak

Anna M.

michalak@stanford.edu Randazzo

Nina A.

Chevallier

Frédéric

https://orcid.org/0000-0002-4327-3813

1Department of Global Ecology, Carnegie Institution for Science, Stanford, California, USA 2Department of Earth System Science, Stanford University, Stanford, California, USA 3Laboratoire des Sciences du Climat et de l'Environnement, Gif-sur-Yvette, France

Anna M. Michalak (michalak@stanford.edu)

20June2017

17 12 74057421 8September2016 15September2016 12May2017 19May2017

This work is licensed under the Creative Commons Attribution 3.0 Unported License. To view a copy of this licence, visit https://creativecommons.org/licenses/by/3.0/

This article is available from https://acp.copernicus.org/articles/.html

The full text article is available as a PDF file from https://acp.copernicus.org/articles/.pdf

The ability to predict the trajectory of climate change requires a clear understanding of the emissions and uptake (i.e., surface fluxes) of long-lived greenhouse gases (GHGs). Furthermore, the development of climate policies is driving a need to constrain the budgets of anthropogenic GHG emissions. Inverse problems that couple atmospheric observations of GHG concentrations with an atmospheric chemistry and transport model have increasingly been used to gain insights into surface fluxes. Given the inherent technical challenges associated with their solution, it is imperative that objective approaches exist for the evaluation of such inverse problems. Because direct observation of fluxes at compatible spatiotemporal scales is rarely possible, diagnostics tools must rely on indirect measures. Here we review diagnostics that have been implemented in recent studies and discuss their use in informing adjustments to model setup. We group the diagnostics along a continuum starting with those that are most closely related to the scientific question being targeted, and ending with those most closely tied to the statistical and computational setup of the inversion. We thus begin with diagnostics based on assessments against independent information (e.g., unused atmospheric observations, large-scale scientific constraints), followed by statistical diagnostics of inversion results, diagnostics based on sensitivity tests, and analyses of robustness (e.g., tests focusing on the chemistry and transport model, the atmospheric observations, or the statistical and computational framework), and close with the use of synthetic data experiments (i.e., observing system simulation experiments, OSSEs). We find that existing diagnostics provide a crucial toolbox for evaluating and improving flux estimates but, not surprisingly, cannot overcome the fundamental challenges associated with limited atmospheric observations or the lack of direct flux measurements at compatible scales. As atmospheric inversions are increasingly expected to contribute to national reporting of GHG emissions, the need for developing and implementing robust and transparent evaluation approaches will only grow.

Introduction and the need for diagnostics

The ability to predict the trajectory of climate change requires a clear understanding of the historical and current emissions and uptake (i.e., surface fluxes) of long-lived greenhouse gases (GHGs), and chief among them carbon dioxide (CO2) and methane (CH4), over the Earth's land and ocean regions. For the natural components of the global budgets of these gases, understanding historical and contemporary flux patterns is needed for elucidating the biogeochemical processes that control flux variability and therefore the likely evolution of these fluxes under changing climate scenarios (e.g., Friedlingstein et al., 2014). The ability to constrain the anthropogenic components of greenhouse gas budget estimates, on the other hand, is becoming increasingly central to discussions aimed at setting emissions, or emissions reduction, targets at local to global scales (e.g., Pacala et al., 2010).

Direct monitoring of the fluxes of greenhouse gases is only feasible at a limited number of spatial and temporal scales, however. For example, point sources of anthropogenic emissions can be measured directly at discrete times (e.g., Allen et al., 2015; Subramanian et al., 2015; Zimmerle et al., 2015), while biospheric fluxes over land can be continuously monitored at plot scale (i.e., from a few hectares to a few km2, depending on sensor height) using the eddy covariance technique (e.g., Baldocchi et al., 2001; Law et al., 2002), and ocean fluxes can also be deduced locally from the difference between the partial pressure of CO2 measured in seawater and that in the overlying air (e.g., Takahashi et al., 1993, 2002). At the global scale, a network of observation sites tracks the global growth rate of atmospheric concentrations of greenhouse gases and gives broad insight into the temporal (e.g., seasonal, interannual) and spatial (e.g., hemispheric, latitudinal) signatures of net greenhouse gas emissions (e.g., Tans et al., 1990; Steele et al., 1992).

The target applications listed in the first paragraph, however, require an understanding of fluxes at intermediate scales, e.g., from urban to biome to national to continental. Direct observations of fluxes are not feasible at these scales, and gaining an understanding of flux budgets and controlling processes at these scales therefore invariably depends on a process of either “upscaling” small-scale flux observations or “downscaling” large-scale information provided by atmospheric concentration measurements. Upscaling strategies range from the implementation of mechanistic models calibrated using plot-scale flux observations (e.g., Richardson et al., 2012; Schaefer et al., 2012), to the development of statistical or machine learning approaches for elucidating dominant patterns (e.g., Beer et al., 2010; Jung et al., 2011), and to the combination of fine-scale flux measurements with activity data (e.g., fuel consumption for anthropogenic emissions, or burnt area for fire emissions) as the basis of emissions inventories (e.g., van der Werf et al., 2006; Jeong et al., 2014; Lyon et al., 2015). Downscaling strategies, on the other hand, most typically involve the solution of an inverse problem to elucidate spatially and temporally resolved flux information from upwind and downwind observations of atmospheric greenhouse gas abundance (e.g., Enting et al., 2002).

Inverse problems that couple atmospheric observations of greenhouse gas concentrations with an atmospheric chemistry and transport model in order to gain insights into underlying flux patterns have been used since the late 1980s (e.g., Enting and Mansbridge, 1989, 1991). While the observational network has expanded and the statistical and numerical methods have become more sophisticated (e.g., Ciais et al., 2010a; Michalak, 2013; Miller and Michalak, 2017; Houweling et al., 2017), the underlying principles have remained largely unchanged. Spatiotemporal flux patterns at the Earth's surface lead to spatial and temporal gradients in atmospheric concentrations of greenhouse gases. The inverse problem then amounts to using those gradients to recover information about the flux patterns. From a scientific perspective, an additional goal is often to also gain insight into the enviro-climatic factors driving these patterns (e.g., Gourdji et al., 2012; Fang and Michalak, 2015; Miller et al., 2014, 2016b). Although the principle is simple, the atmospheric inverse problem is ill-conditioned because the diffusive nature of atmospheric transport means that relatively small variations or errors in observed or modeled atmospheric concentrations can correspond to relatively large differences or errors in the inferred flux quantities and patterns. In addition, the atmospheric inverse problem is often under-determined because the sparse observational coverage precludes the possibility of resolving fluxes (spatially and temporally) at all the scales that are of scientific or policy interest as well as at all the scales to which atmospheric observations are locally sensitive.

Given the high scientific and policy value of accurate greenhouse gas budgets, the growing role of atmospheric inverse problems to obtain these budgets at relevant scales, and the inherent technical challenges associated with the solution of these inverse problems, it is imperative that objective approaches exist for evaluating the scientific value and accuracy of inverse modeling estimates of greenhouse gas fluxes. Here, we review diagnostics that have been implemented in recent studies and discuss their use in informing adjustments to model setup. We have structured the review in a manner that we hope will be useful to novices and specialists alike. We present a relatively comprehensive survey of recent approaches in order to provide a detailed representation of the state-of-the art for specialists. At the same time, we have organized the review around high-level categories in order to help guide researchers who are newer to the field and provide an entry point for further inquiry via the cited studies.

Fundamentally, the emphasis of diagnostic tools should be on the scientific value of insights that are based on the solution of an atmospheric inverse problem. This quality control approach (i.e., the evaluation of the flux estimates) also has to be complemented by quality assurance (i.e., the evaluation of the estimation process that yielded the flux estimates). Indeed, the solution of atmospheric inverse problems invariably involves a series of decision points including, but not limited to, (1) the choice of the atmospheric observations to be used; (2) the choice of the atmospheric chemistry and transport model to be implemented; (3) the choice of a statistical framework for defining an objective function that captures the relative contribution of atmospheric observations, the chemistry and transport model, and any prior information in informing flux patterns; and (4) the choice of a numerical framework for the solution of the inverse problem. Each of these choices will have a direct impact on estimates. It is therefore also imperative to have diagnostic tools that can evaluate the self-consistency of the modeling and statistical assumptions specific to the choices made in the setup of the inverse problem. In other words, at a minimum, the ultimate estimates must be consistent with the assumptions inherent to the specific modeling setup that was implemented.

Challenges of diagnosing atmospheric inversions

Having established the need for diagnostic tools to assess atmospheric inverse modeling results, the question then becomes one of identifying appropriate diagnostics, metrics, or benchmarks. As discussed in the last section, however, direct observation of greenhouse gas fluxes is not possible at the space and timescales targeted by atmospheric inversions. This is in part because inversion systems for long-lived greenhouse gases are run over time periods ranging from weeks to decades to capture the long dispersion times of tracers in the atmosphere and to capture temporal variability in fluxes. These long time spans are achieved at the expense of relatively coarse horizontal resolutions, ranging from tens of kilometers to one or more degrees, such that the large gap between flux measurements and inverse model scales precludes direct evaluation of inverse modeling results. This gap is filled only rarely by some regional inversions (e.g., Lauvaux et al., 2009; Meesters et al., 2012). This means that there is a basic lack of independent measures of flux to assess inverse modeling estimates.

Diagnostic tools used for assessing inverse modeling estimates must therefore rely on other indirect measures or information about the fluxes to be estimated. Such measures and information should, in principle, be independent from the information used in the solution of the original inverse problem. A natural choice might then be to use additional atmospheric concentration data not assimilated in the original inverse problem, because, as noted earlier, gradients in atmospheric greenhouse gas concentrations are themselves the result of underlying flux patterns. Given the ill-conditioned and typically under-determined nature of the atmospheric inverse problem, however, it is often desirable to use as much information (i.e., data) as possible to inform the initial solution of the inverse problem, in order to gain the deepest and most precise insights possible about flux patterns. This goal, however, is at odds with the desire to keep some independent flux-relevant observations for diagnosing the estimates obtained from the inversion. Although this problem is not unique to the solution of atmospheric inverse problems, it is certainly particularly salient in this context. Two examples follow.

In some ways, numerical weather forecasting (e.g., Kalnay, 2003) bears some resemblance to the flux estimation problem, as they both rely on atmospheric observations and a numerical representation of atmospheric dynamics. In both cases, the ability to diagnose the accuracy and precision of estimates is of high value. Key differences emerge upon closer examination, however. First, the target quantities predicted/estimated in numerical weather prediction, such as temperature, precipitation, and barometric pressure, are ones that can also be measured directly at a large number of locations, via both in situ and remote sensing observations, making a comparison to direct benchmarks feasible (e.g., ECMWF, 2016). Although it is technically true that in some cases a scale mismatch still occurs (e.g., a thermometer cannot measure the “average” temperature over a computational grid box), the quantities of interest are less likely to display the strong multi-scale heterogeneity that makes eddy covariance flux observations ill-suited for diagnosing grid-scale inverse-model-derived flux estimates at much coarser spatial resolution. Second, whereas atmospheric inverse problems aim to infer/estimate historical flux distributions that were never observed directly, the accuracy and precision of numerical weather forecast estimates can largely be verified, evaluated, and diagnosed simply by waiting for weather patterns to unfold. This is perhaps best illustrated through the long-standing comparisons of forecast skill among the world's weather forecasting bureaus (Simmons and Hollingsworth, 2002; WMO-LCDNV, 2016).

Another useful example is that of the development of retrieval algorithms for remote sensing observations of atmospheric constituents (e.g., Rodgers, 2000). Let us take as a prototypical example the process of obtaining estimates of column-integrated dry air mole fractions of atmospheric carbon dioxide (XCO2) from the spectrum of reflected sunlight measured by the Orbiting Carbon Observatory-2 (OCO-2) space-borne instrument (e.g., Crisp et al., 2012). In this case, the observations are radiances at specific wavelengths within the spectrum of reflected light, with a focus on specific absorption bands that are observed at high spectral resolution. The forward problem involves the solution of radiative transfer equations. The target variable of primary interest is XCO2. This problem has analogies to the flux estimation problem in that the column-integrated CO2 concentrations cannot be measured directly per se. A key difference, however, is that a number of validation data sets are available to help diagnose the retrieval algorithm (e.g., Osterman et al., 2011). These include, among others, observations from ground-based remote sensing instruments (that look up at the sun, rather than down at the Earth, e.g., Wunch et al., 2011) and targeted campaigns of in situ airborne observations that can capture CO2 concentration variability within a portion of the atmospheric column (e.g., Tadić et al., 2014; Frankenberg et al., 2016). Unlike in the flux estimation problem, there is no direct conflict between using these additional measurements for validation/diagnosis versus using them to directly inform the solution of the inverse problem itself, as there is no clear mechanism by which these additional observations could be routinely incorporated within the core retrieval algorithm, although they can be used for additional empirical bias correction.

Overall, then, while the need for diagnostics to evaluate the scientific validity and statistical self-consistency of flux estimates derived via the solution of atmospheric inverse problems is clear, this need poses very substantial challenges. These include the lack of independent measures of flux at comparable spatiotemporal scales and the inherent dilemma between using available atmospheric observations for estimation versus validation. These features make the process of developing and implementing diagnostics particularly challenging and fundamentally different from the challenges observed in other fields that might at first glance appear to be somewhat analogous.

Overview of existing diagnostics

Researchers have taken a number of approaches in tackling the challenges associated with the development of diagnostics that are both practical, given the unavoidable limitations in available data, and genuinely informative, in terms of assessing the accuracy and precision of flux estimates. Here we describe existing diagnostics that have been used as part of inverse modeling efforts. We focus primarily on diagnostics that evaluate the validity and self-consistency of the inversion setup, rather than on diagnostics designed to assess the information content of specific data sets. We also discuss how diagnostics are used to inform adjustments to model setup and the trade-offs inherent to alternative possible approaches to model evaluation. We focus primarily on examples from papers published between 2010 and 2016 and on papers that present recent applications of specific diagnostics rather than on the studies where these diagnostics were originally introduced. We do so in order to get a contemporary snapshot of approaches that are currently being used for diagnosing atmospheric inversions. The groupings of diagnostics are ordered here by starting with diagnostics that are most closely related to the actual scientific problem or question being targeted by the inversion to those that are most closely tied to the statistical and computational setup of the inversion framework itself. More fundamental overriding questions about the types of insights that the range of currently available diagnostics can (or cannot) actually provide are then discussed in Sect. 4.

Assessment against independent information

The most natural starting point for assessing the solution of an atmospheric inverse problem is through evaluation against independent information. Although, as discussed in earlier sections, direct observations of surface fluxes are seldom available at compatible scales, at least two additional avenues are available. The first is to evaluate flux estimates against unused atmospheric observations, whether from in situ monitoring or remote sensing. This is accomplished through the solution of the “forward” problem, which translates estimated fluxes into modeled atmospheric concentration fluctuations. The second is to compare estimates against any available large-scale scientific constraints. This approach can be challenging especially when large-scale constraints are themselves uncertain.

Evaluation against unused atmospheric observations

If any atmospheric observations are available that have not been used as a constraint in the solution of the inverse problem, they can be leveraged to evaluate final flux estimates. To do so, final flux estimates are used as an input into the atmospheric chemistry and transport model used as part of the inversion, and predicted concentrations at the times and locations of the additional available atmospheric observations are then compared to the measured concentrations. These additional observations can be of several types and inform the inversion setup in various ways, given differences in vertical information, spatial coverage, and precision.

Evaluating inversion results constrained by in situ observations using independent surface or satellite total column measurements can provide additional information about regional fluxes. The much broader spatial coverage of satellite observations makes it possible to assess flux estimates at large spatial scales and thus can help to identify large-scale spatial biases that are related to a lack of in situ coverage in some regions (e.g., biases in the latitudinal gradient or over land versus ocean; Lindqvist et al., 2015). However, it is important to note in the context of these comparisons that the satellite retrievals themselves may have regional biases, as will be discussed later.

Conversely, for inversions constrained by satellite observation of total column concentrations, evaluating results using in situ measurements can reveal errors in the column-constrained system's ability to reproduce surface fluxes, which can be related to aspects of the retrieval (such as biases) or to the transport model's representation of boundary layer dynamics (e.g., Locatelli et al., 2015; Cressot et al., 2014).

Comparisons to independent measurements can also be used to isolate transport errors from the other confounding errors. For example, comparing the total column mixing ratios simulated based on posterior flux estimates obtained using surface data to independent observations of total column mixing ratios can diagnose a transport model's skill in simulating the seasonality of the tropopause height and of the stratospheric partial column (e.g., Houweling et al., 2014). Performing this type of assessment for multiple inversions constrained by different types of measurements but using the same transport model can provide insight into whether seasonal biases in the inversion are caused by seasonal biases in an observing system or by seasonal biases in the transport model (e.g., Houweling et al., 2014). More generally, vertical transport bias can be assessed by comparing the vertical gradients of posterior vertical profiles to those of observed profiles (e.g., Pickett-Heaps et al., 2011; Saeki et al., 2013b; Liu and Bowman, 2016), because vertical gradients provide information about vertical mixing and convection.

More broadly, evaluation against all types of independent atmospheric observations provides an additional window into the degree to which estimated fluxes capture key features of the atmospheric signal, such as the seasonal cycle, latitudinal gradients, or regional patterns of concentrations (e.g., Zhang et al., 2014; Jiang et al., 2014; Díaz Isaac et al., 2014; Pandey et al., 2016; Liu and Bowman, 2016; Johnson et al., 2016).

Evaluation at aggregated scales against large-scale scientific constraints

The accuracy of inversion-derived flux estimates and the validity of the overall inversion framework can be assessed, at large scales, based on existing understanding of carbon cycle and atmospheric dynamics. This type of evaluation may involve comparisons of the inversion-derived estimates to existing information about flux magnitudes at large scales, about the overall direction of the net flux in a region (i.e., emission vs. uptake), or about flux seasonality. Care must be taken, however, for the approach not to become circular, i.e., for inversion results not to be evaluated by comparing them to assumed features of the very processes that the inversion is trying to inform.

In the simplest case, spatially aggregated posterior fluxes can be assessed based on expert knowledge of the system. For example, methane emissions in regions dominated by natural gas extraction, urbanization, wetlands, or cattle feedlots are expected to substantially outweigh soil methane uptake, and negative estimated emissions in such regions would point to errors in the inversion (e.g., Berchet et al., 2013). Similarly, global decadal atmospheric growth rates and latitudinal gradients of greenhouse gases are well constrained by long-term baseline observations (e.g., Conway et al., 1994), and posterior flux estimates can be evaluated against such large-scale constraints (e.g., Cressot et al., 2014). Evaluation against observed latitudinal gradients provides information not only about global total fluxes but can also inform the accuracy of the representation of interhemispheric transport, although more so for gases with limited uptake at the Earth surface (e.g., Thompson et al., 2014). This comparison is especially helpful when performed using both surface and upper-troposphere or total column concentrations, because this makes it possible to assess how both meridional and vertical transport are represented (e.g., Thompson et al., 2014).

More broadly, inversion-derived fluxes can be compared against independent estimates of fluxes for comparable regions, although the fact that both the inversion-derived and the independent estimates of fluxes are uncertain must be recognized. For example, the fraction of the global CO2 sink attributable to land versus ocean can be compared between inversions and independent model or mass-balance estimates (e.g., Le Quéré et al., 2015). For specific regions and periods, inversion results can also be compared against detailed inventory estimates of fluxes (e.g., Lauvaux et al., 2012; Schuh et al., 2013). A third example (noted already in Sect. 3.1.1) is the comparison of large-scale seasonal cycles of modeled trace gas concentrations to observations. For inversions constrained by remotely sensed data, checking for consistency in seasonal cycles between observations, estimates from a satellite-data-constrained inversion, and estimates from an in situ data-constrained inversion may draw attention to the need for seasonal bias correction in the observations, while also exploring other potential causes of regional or seasonal bias, such as seasonal biases in vertical transport (e.g., Houweling et al., 2014). Lastly, bottom-up studies also provide regional budget estimates at the annual or pluriannual scale that can be compared to inverse modeling results (e.g., Gourdji et al., 2012; Miller et al., 2013, 2014). The comparison may reveal convergence (e.g., Ciais et al., 2010b) or divergence (e.g., Chevallier et al., 2014; Miller et al., 2013, 2014) of the estimates. However, the attribution of any divergence remains subjective, given the uncertainty of the bottom-up estimates themselves (e.g., Chevallier et al., 2014; Reuter et al., 2014; Gourdji et al., 2012).

Finally, large dipoles in estimated fluxes between large regions can point to a lack of observational constraint for certain regions, to overfitting of the observations that do exist, and/or to biases in large-scale transport (e.g., Alexe et al., 2015; Nassar et al., 2011). The presence of flux dipoles can, however, also be representative of real spatial flux patterns, and sensitivity tests focusing on factors such as the coverage of observational constraints can help to evaluate such patterns in posterior fluxes (Cressot et al., 2014; Rivier et al., 2010). This point is also discussed in Sect. 3.3.

Statistical diagnostics of inversion results

Rather than comparing flux estimates against independent information directly, a second set of strategies focuses instead on assessing whether the prior and posterior flux estimates, uncertainties, and covariances are consistent with the assumptions built into the design of the implemented inversion framework. These strategies thereby focus on statistical self-consistency of the inversion setup and in this way can point to discrepancies that can signal unreliable results.

The majority of inverse modeling approaches used for greenhouse gas flux estimation leverage a combination of prior information and an observational constraint. Within the mathematical framework of the inversion, the uncertainty and spatiotemporal covariance structure of the prior information (i.e., prior error statistics), as well as the reliability with which the researchers expect to be able to reproduce the atmospheric observations (i.e., model-data-mismatch statistics), are represented through error covariances. These error covariances, the prior information, the observational data, and the chemistry and transport model are then also used to quantify the uncertainty associated with posterior estimates (see e.g., Rayner et al., 2016, for a detailed discussion). This framework provides an opportunity to evaluate the statistical self-consistency of the inversion setup.

For example, under the assumption of Gaussian and unbiased errors and for a given set of assumptions about error correlations, the sum of squared errors follows a chi-squared distribution with a known number of degrees of freedom; for this reason, posterior errors can be used to evaluate or scale assumed prior error variances (e.g., Michalak et al., 2005; Desroziers et al., 2006; Wu et al., 2013; Lauvaux et al., 2016; Cressot et al., 2014). In some cases, deviations between concentrations modeled based on posterior fluxes and atmospheric observations not included in the original inversion can be used for this purpose (e.g., Chevallier and O'Dell, 2013). This approach can also be used to assess how model-data-mismatch errors vary seasonally (e.g., Gourdji et al., 2012; Kim et al., 2011). Additionally, the very high resolution of some regional inversions and the availability of plot-scale flux measurements make it possible to validate the posterior uncertainty of fluxes directly in some cases (e.g., Broquet et al., 2013).

The spatial and temporal autocorrelation of posterior errors can also be used to inform model setup (Díaz Isaac et al., 2014) or to assess the identifiability of underlying fluxes (Yadav et al., 2016).

Other than assessing self-consistency, statistical diagnostics can also be used to quantify the error reduction (or information gain) made possible by the assimilation of atmospheric observations. In this approach, posterior uncertainties are compared to prior uncertainties. In cases where the explicit quantification of posterior flux uncertainties is prohibitively computationally expensive, it can also be approximated through approaches such as the use of a Monte Carlo ensemble of inversions in which model parameters are perturbed for each run (e.g., Chevallier et al., 2007; Cressot et al., 2014; Pandey et al., 2016). More simply, the deviations between atmospheric observations not included in the inversion and modeled concentrations based on posterior vs. prior fluxes can be used as a measure of error reduction (e.g., Liu and Bowman, 2016; Johnson et al., 2016; Lauvaux et al., 2016).

Sensitivity tests and analysis of robustness

The validity and robustness of inversion-derived estimates can also be assessed through sensitivity tests. These tests involve running additional inversions where one or several components have been altered. The most common of these are changes to the chemistry and transport model used to translate fluxes into atmospheric concentrations, changes to the set of atmospheric observations used to constrain flux estimates, and changes to the implemented statistical or computational framework. Examples of the latter include changes to prior estimates, boundary conditions, and flux spatiotemporal resolutions. Results shed light on the degree to which results are robust to specific implementation choices.

Chemistry and transport model

Recently, as inversions have become more sophisticated, transport model sensitivity tests have become more computationally expensive. As a result, it has become more difficult to assess the impact of model choice on inversion results (e.g., Gurney et al., 2002; Baker et al., 2006). Applications focusing exclusively on synthetic data are covered in Sect. 3.4, while here we present a few examples that included real observations.

Examining the effect of the choice of a chemistry and transport model can lead to various insights. For example, the transport model used by an inversion may be run using different boundary layer schemes to assess how the representation of vertical mixing affects the interpretation of assimilated data (e.g., Peters et al., 2009). Another aspect is the impact of the spatial resolution of the transport model and particularly the use of finer grids within mesoscale domains versus the coarser grids typical of global transport models. For example, including a finer-scale nested grid and changing the transport representation at these finer scales provides information about the effect of transport representation at scales finer than the grid scale of global transport models (e.g., Rivier et al., 2010). In addition, posterior meridional concentration gradients can be compared across inversions that use different global transport models to assess the effect of interhemispheric transport (e.g., Thompson et al., 2014).

The implementation of more than one transport model in a forward run can also shed light on consistent differences in the ability to represent observed atmospheric concentration signals, seasonal cycles of mixing ratios, or vertical profiles (e.g., Pillai et al., 2012; Díaz Isaac et al., 2014).

Atmospheric observations

Performing inversion sensitivity tests in which only the constraining observational data set is changed between inversions can shed light on the impact of various observations on flux estimates, and therefore on their relative information content with regard to underlying fluxes, and also makes it possible to assess the extent to which conclusions are robust to the choice of observations used to constrain the inversion.

For example, a major effort has been made to quantify the effects of including remotely sensed observations (specifically, satellite retrievals) as an additional constraint beyond in situ observations. This is distinct from the applications discussed in Sect. 3.1.1, where remote sensing observations were not included in the inversions but were instead used to evaluate inversion-derived flux estimates. Satellite data provide the benefit of broader spatial coverage than in situ measurements, potentially informing fluxes in regions not well constrained by current in situ networks. However, the informational value and robustness of the information provided by satellite observations is still the subject of ongoing research, and thus their use as constraints in inversions requires special consideration of the impacts of any potential biases. Several studies have included satellite total column or mixing ratio data as an additional constraint on a model otherwise constrained only by in situ concentration measurements to determine whether remotely sensed total column concentrations provide a significant amount of additional information (e.g., Alexe et al., 2015; Houweling et al., 2014; Nassar et al., 2011; Pandey et al., 2016; Saeki et al., 2013a). An inversion constrained only by in situ measurements may also be compared to an inversion constrained only by satellite measurements (e.g., Cressot et al., 2014). The spatial distribution and magnitude of fluxes and the source/sink status of particular regions are often the major posterior features compared between inversions constrained by different subsets of available data (e.g., Alexe et al., 2015; Cressot et al., 2014; Houweling et al., 2014; Nassar et al., 2011). The differences in the geographical flux patterns can be attributed through the use of various methods focusing on quantifying the information content and geographical coverage of satellite data. The relative information content of the different observational data sets can be quantified via the degrees of freedom (a metric based on posterior error covariances) provided to the inversion (see e.g., Rodgers, 2000), whereby data sets that represent a stronger constraint provide more degrees of freedom (e.g., Nassar et al., 2011). The constraint provided for specific regions by observations with extensive geographical coverage can also be qualitatively analyzed by creating visualizations of the sensitivity to fluxes from a certain region (e.g., Nassar et al., 2011). If satellite retrievals provide a large increase in coverage over a particular region, then this method may help to explain large changes in posterior fluxes in upwind areas.

In addition, the robustness of conclusions about flux distributions derived from satellite observations can be explored by using alternative sets of satellite-derived observations. Studies have checked for agreement in posterior fluxes for inversions run using different satellite instruments and retrieval algorithms (e.g., Alexe et al., 2015; Chevallier et al., 2014; Takagi et al., 2014). The effect of the bias correction scheme used for satellite retrieval post-processing has also been a subject of several sensitivity studies (e.g., Houweling et al., 2014; Alexe et al., 2015; Nassar et al., 2011; Cressot et al., 2014, Basu et al., 2013).

Sensitivity tests based on inversions constrained by different subsets of available observations have been used to examine the incremental gain in information obtained by expanding the in situ observation network. Such experiments can be used to estimate the uncertainty reduction (see Sect. 3.2) that could potentially be achieved by assimilating more observations over or downwind from poorly constrained regions as well as the effects of a more extensive observational network on the estimated spatial and temporal variability of fluxes (e.g., Butler et al., 2010; Saeki et al., 2013b; Kadygrov et al., 2015; Jiang et al., 2014; Peters et al., 2010). They can also be used to determine the value of episodic versus continuous observations (e.g., Peters et al., 2010). These sensitivity tests can also determine whether strong fluxes in some regions, such as the “dipoles” discussed in Sect. 3.1.2, are simply due to a relative lack of constraint for certain regions (e.g., Rivier et al., 2010).

Last, sensitivity tests have also been used to examine the potential role of bias of in situ measurements at a specific site. In such studies, an offset is added to specific observations, and the results of the control inversion and the inversion with the offset can be compared to determine the effect of potential biases on the posterior flux field (e.g., Peters et al., 2010; Masarie et al., 2011).

Statistical and computational framework

Sensitivity tests can be used to explore the impact of the statistical assumptions and computational framework used in inversions.

For example, the impact of assumptions about the statistical representation of prior errors and model-data-mismatch errors can be examined by performing multiple inversions, as can the impact of approaches aimed at optimizing these error statistics (e.g., Bousquet et al., 2011; Cressot et al., 2014; Wu et al., 2013; Ganesan et al., 2014; Berchet et al., 2013). Sensitivity tests may also be run on other statistical parameters such as the assumed correlation length of fluxes (Corazza et al., 2011).

Another key aspect of regional inversions that can be explored through sensitivity tests is the impact of the choice of a data set used to represent background concentrations of greenhouse gases entering the model domain. This can be done through the implementation of alternative boundary conditions and/or the exploration of the impact of uncertainty in individual sets of boundary conditions (e.g., Göckede et al., 2010b; Bréon et al., 2015; Schuh et al., 2010; Gourdji et al., 2012).

Similar to the case of boundary conditions, inversions aiming to isolate one component of greenhouse gas budgets (e.g., biospheric CO2 in the case of CO2 inversions) must rely on pre-existing estimates of other components of the budget (e.g., fossil fuel CO2 emissions). The impact of the choice of an estimate can be explored through sensitivity tests (e.g., Peylin et al., 2011; Peters et al., 2010).

The choice of a model or data set to be used as an a priori estimate in Bayesian inversions is another source of uncertainty in the inferred fluxes, particularly in areas where the observation constraint is weak. Inversions using alternative inventories or process-based models with different spatial and seasonal flux patterns as priors can be compared in terms of the spatial and temporal distributions of the posterior fluxes to assess the robustness of flux estimates (e.g., Kim et al., 2011; Göckede et al., 2010b; Bergamaschi et al., 2015; Corazza et al., 2011; Peters et al., 2010).

A final example is the use of sensitivity tests to explore the effect of the spatial and temporal aggregation and resolution of the unknown fluxes in the modeling framework. The impact of the choice of flux regions, model grid resolution, model grid nesting, or model time step can all be explored (e.g., Rivier et al., 2010; Göckede et al., 2010a; Kim et al., 2014; Peters et al., 2010).

Synthetic data experiments

Observing system simulation experiments (OSSEs) are studies in which synthetic observations are constructed at observation times and locations using a prescribed set of fluxes and a chemistry and transport model. These synthetic observations are then used instead of actual observations as data constraints on an inversion. OSSEs are particularly useful for diagnostics because the “true” transport and fluxes are known and can be manipulated. These types of studies constitute a necessary but certainly not sufficient condition for ensuring a good inversion setup, as many complexities of inversions using real observations can only be approximated within a synthetic data experiment context. OSSEs have become a key component of inversion model development, especially as models have become more complex.

Because the “true” fluxes are known in an OSSE, various metrics can be used to assess how well the inversion can recover fluxes. OSSEs can be used to quantify the magnitude and geographical distribution of uncertainty that stems from specific errors or assumptions in the inversion framework, such as transport model errors (e.g., Houweling et al., 2010; Berchet et al., 2015), spatiotemporal flux patterns within regions (e.g., Berchet et al. 2015), biased priors (e.g., Berchet et al., 2015), flux spatiotemporal resolutions (e.g., Wu et al., 2011), or parameter choices within computational data assimilation systems (e.g., Miyazaki et al., 2011, Chatterjee et al., 2012). Posterior flux errors and error covariances can be used to assess the impact of modeling simplifications or data limitations on the accuracy and precision of flux estimation (e.g., Berchet et al., 2015; Gourdji et al., 2010). OSSEs can also be used to understand sources of bias through a simple differencing of posterior and “true” fluxes (e.g., Locatelli et al., 2013; Thompson et al., 2011; Basu et al., 2016; Bloom et al., 2016). Similar tests can be run to determine the effects of observational biases and mistuning of error statistics on the accuracy of posterior estimates (e.g., Baker et al., 2010).

OSSEs can also be used to determine the sensitivity of inversions to transport errors. The model-data mismatch may be compared between an inversion that uses the “true” transport to calculate the sensitivity matrix versus that of an inversion that uses a different transport model (e.g., Chevallier et al., 2010; Houweling et al., 2010; Berchet et al., 2015; Locatelli et al., 2013). Assuming that the difference in performance between these two transport models is comparable to the difference between transport models used in real-data inversions, the inversion with inconsistent transport can be compared to the inversion with consistent transport to determine how much the inconsistencies in transport affect the inversion. A similar test can be conducted simply by adding transport or chemistry errors to the pseudo-observations for one run of the model (e.g., Gourdji et al., 2010; Baker et al., 2010; Thompson et al., 2011). In addition, the meteorological forcing field may be perturbed independently of the transport model itself to determine how the underlying meteorological assumptions affect the inversion; this is particularly important because the meteorology is often not optimized for transport runs (as noted by Berchet et al., 2015).

OSSEs are also useful for determining the sensitivity of the inversion to the choice of priors. Within a Bayesian inversion, perturbations of prior fluxes from the “true” fluxes in terms of spatial distribution, temporal distribution, and flux magnitude by region can be used for a synthetic data sensitivity test (e.g., Berchet et al., 2015). This type of study is useful for determining prior-related biases in cases when the bottom-up inventories for a particular trace gas in the model domain are highly uncertain.

OSSEs can also provide information about how much information can be obtained from the current observational network. Pseudo-observation sites and types of data (for example, mixing ratios, profiles, column averages, or isotopic signatures from flask samples) can be added or taken away from the inversion to determine how the density and distribution of observations affect the precision and accuracy of the posterior flux field (Villani et al., 2010; Miyazaki et al., 2011; Hungershoefer et al., 2010; Shiga et al., 2013; Basu et al., 2016; Bloom et al., 2016). In addition, the ability of existing monitoring network sites to detect specific types of fluxes or flux patterns can be explored, as well as the impact of various sources of uncertainty on detection (e.g., Shiga et al., 2014; Fang et al., 2014; Miller et al., 2016a). Such experiments can determine how much information about the true flux field is provided by an observational network. The uncertainty reduction from the prior to the posterior estimates (see Sect. 3.2 and 3.3.2) provides an overall metric for evaluating the information provided by hypothetical observations (e.g., Chevallier et al., 2010; Baker et al., 2010; Hungershoefer et al., 2010).

Finally, through sensitivity tests, OSSEs can help to determine optimal model resolution and observational averaging for obtaining the most accurate posterior fluxes. This has been done for model temporal resolution and observational temporal averaging (e.g., Gourdji et al., 2010). OSSEs can also be used to test the performance of the optimization of multi-scale grids, which can decrease computational costs relative to regularly spaced grids (e.g., Wu et al., 2011).

Evaluation of existing diagnostics

We have presented diagnostics as an approach to the needs of quality control and of quality assurance for atmospheric inversion systems. The diagnostics that were presented in Sect. 3, in many ways, address this question well. The diversity of diagnostics may even give the impression that they can compensate for the lack of direct independent validation measurements described in Sect. 2 and thereby ensure statistical optimality of inverse modeling systems. Indeed, even uncertain parameters (hyperparameters) of the prior and observation error covariance matrices are optimizable from the assimilated data (e.g., Sect. 3.3.3). In most cases, however, such an interpretation would be overly optimistic. The diagnostic approaches described in Sect. 3 provide a crucial toolbox for evaluating and improving flux estimates obtained through the solution of atmospheric inverse problems. Without diagnostics, it is impossible to assess whether flux estimates are reliable or to make sense of differences among alternative sets of estimates. At the same time, however, none of the presented approaches overcome the fundamental challenges described in Sect. 2. As such, the information provided by diagnostic tests must itself be taken with a proverbial “grain of salt”, and it is equally important to be aware of the aspects of an inversion that cannot be evaluated using existing diagnostics as it is to assess those that can.

The key information lies in available measurements; diagnostics can only help to reformulate this information by bringing to light the impact of specific assumptions, in the same way that the atmospheric inversion reformulates observed concentrations in terms of surface fluxes or that a retrieval scheme for an Earth-observing system reformulates the measured radiance information into a geophysical quantity. For instance, the principle of objectively tuning error statistics for atmospheric inversions (e.g., Michalak et al., 2004, 2005) ultimately relies on disentangling deviations between prior flux assumptions and observations into components attributable to prior uncertainty versus model-data-mismatch errors. The attribution to these two components of error is based on leveraging differences in their space-time structure, however, and is made easier when the two sources of error have features that are statistically distinct (e.g., Desroziers et al., 2005). Alternatively, some of the statistics may be well known from some other information source and can then play the role of a fixed point to deduce the other ones (e.g., Kuppel et al., 2013). It is important to remember, however, that diagnostics cannot bring original information to the problem, but rather provide a framework for interpreting available information. This is particularly obvious when no real measurements are assimilated (the synthetic data experiments of Sect. 3.4).

The interpretation of diagnostics is also complicated by the fact that many of them are not independent of the underlying assumptions of the inversion systems themselves (e.g., independence of prior errors from model-data-mismatch errors, uncorrelated nature of model-data-mismatch errors, linear observation operator, Gaussian error statistics). As a result, they may simply express the inadequacy of these assumptions rather than the misspecification of some particular component of the inversion setup. A common example is the inflation of observation error variances to compensate for neglecting observation error correlations, which yields a too-small model-data mismatch (see Sect. 3.2.2) that cannot be adequately resolved without removing the decorrelation hypothesis (e.g., Chevallier, 2007).

The comparison of inversion results with independent (un-assimilated) concentration measurements (Sect. 3.1.1) is also partly ambiguous, because an unknown fraction of the misfit is simply caused by the chemistry and transport model that simulates the independent measurements. Similarly, the interpretation of differences between inversion results and flux estimates from bottom-up inventories (Sect. 3.1.2) may revolve around estimating the uncertainty of the latter (see, e.g., the diverging conclusions of Chevallier et al., 2014, and Reuter et al., 2014, about the quality of the inferred carbon sink of Europe).

Sensitivity tests about some components of the inversion systems, like the chemistry and transport model (see Sect. 3.3.1), are implemented in an attempt to sample the same error statistics as those specified by the model-data-mismatch and prior error covariance matrices. In practice, however, they may instead reflect different opinions about the error statistics. For instance, intercomparisons of inversion results like those of Transcom (e.g., Gurney et al., 2002; Peylin et al., 2013) form “ensembles of convenience” rather than statistically coherent ensembles. They may underestimate the quality of state-of-the-art inversions (because some systems would underperform due to particularly coarse horizontal resolution or due to an outdated transport simulation configuration) as well as overestimate it (because the few participants cannot sample the whole uncertainty space). To represent inversion uncertainty, inversion intercomparisons should explore the space of uncertainty widely (e.g., the ensemble would not be limited to one particular source of information for its prior fluxes for a given source-sink process) and in a balanced way (e.g., the ensemble would not oversample marginally different versions of a single transport model at the expense of other transport model types). However, this goal is usually hampered by limited resources that favor existing setups over the design of systematic explorations of other plausible and defensible setups.

Overall then, satisfying the diagnostics described in Sect. 3 is, strictly speaking, neither a sufficient nor a necessary condition for optimality (see also the discussion in Talagrand, 2014). The degree of usefulness of diagnostics is proportional to the amount of information that is input to them; conversely, lack of independent information can lead to problems of equifinality, where similar apparent skill is achieved through widely different setups and assumptions. In some cases, the process of identifying and improving weak components of an inverse system itself represents an inference problem that may be ill-posed or under-determined. As a result, the interpretation of diagnostics itself often requires subjective expert knowledge.

Despite their ambiguity, however, the role and diversity of diagnostics has increased over the years, and this is an important and positive development. Indeed, the diagnostics described in Sect. 3 have proven their practical usefulness in understanding the behavior of inversion systems by providing a fresh perspective on inversion results. Moreover, they can reveal, or at least suggest, the presence of hidden flaws in inversion systems by shedding light on the symptoms of these flaws. As such, they form a critical basis for the credibility of the inversion approach to flux estimation. While existing diagnostics tools have limitations, some of which are unavoidable given the challenges described in Sect. 2, a careful review of the literature makes it clear that the implementation of diagnostics is a necessary step in the “exploration” of an inversion system.

Looking ahead

Atmospheric inversions are increasingly expected to contribute to national reporting of greenhouse gas emissions under future international treaties (see the discussions in Ogle et al., 2015, for biogenic emissions, Miller and Michalak, 2017, for anthropogenic emissions, and Wu et al., 2016, for urban emissions). The routine run of atmospheric inversion systems will necessitate reinforcing the robustness and the transparency of their process through commonly agreed upon quality insurance and quality control procedures. In practice, this implies systematically providing reliable associated uncertainty statistics together with the posterior fluxes and some evidence of the statistical consistency of these fluxes with the inversion assumptions. Such norms will have to rely on the systematic implementation of diagnostics of the type discussed here to a large extent, even for emerging applications like the quantification of urban emissions (McKain et al., 2012).

As we have seen in Sect. 4, many more measurements are needed to decrease diagnostics ambiguities. This requirement primarily relates to concentration measurements rather than flux measurements because scale mismatches usually hamper the comparison of inversions with the latter (see Sect. 2). A step in data density may be achieved by hypothetical low cost sensors (Wu et al., 2016) or from future satellite imagers (e.g., Rayner et al., 2014), provided these new data do not suffer from significant systematic errors. Efforts to substantially increase observational coverage are already underway (see, e.g., Climate-KIC, 2017, Ciais et al., 2015), but the feasibility of sufficiently limiting systematic errors remains to be demonstrated.

Interestingly, a (large) increase in the horizontal resolution of the inversion systems would also make it possible to incorporate direct flux measurements in the diagnostics, even when the targeted scales are coarser (see discussion in Sect. 2 and Lauvaux et al., 2009, or Meesters et al., 2012). Inversion systems could also be run at very high resolution for the express purpose of comparing estimates to flux measurements. The validation with accurate flux measurements would avoid some of the ambiguity imposed by the chemistry and transport models on the concentration-based diagnostics.

This would also open up new directions for diagnostics development. For example, direct comparison to flux observations would make it possible to better assess posterior uncertainties, for instance by building on diagnostics developed in the context of ensemble prediction systems – diagnostics that have not yet been used for atmospheric inversions (e.g., the reliability diagram of Talagrand et al., 1999). These ideas were explored, for example, by Broquet et al. (2013), using aggregates of flux measurements. Among other benefits, the direct validation of the posterior uncertainties would reveal possible departures from normality for flux errors, which may be especially important in the case of systematically positive emissions (e.g., Koohkan et al., 2013). Such diagnostics would certainly help to guide future developments of inversion systems.

Taken together, it is clear that the importance of developing and implementing carefully designed diagnostics for atmospheric inversions of long-lived greenhouse gases is only going to grow over time.

No data sets were used in this article.

The authors declare that they have no conflict of interest.

Acknowledgements

We acknowledge the support from the International Space Science Institute (ISSI). This publication is an outcome of the ISSI's Working Group on “Carbon Cycle Data Assimilation: How to consistently assimilate multiple data streams”. Support for Nina Randazzo was provided by the National Science Foundation under grant no. 1342076. Edited by: Marko Scholze Reviewed by: three anonymous referees

References 1

Alexe, M., Bergamaschi, P., Segers, A., Detmers, R., Butz, A., Hasekamp, O., Guerlet, S., Parker, R., Boesch, H., Frankenberg, C., Scheepmaker, R. A., Dlugokencky, E., Sweeney, C., Wofsy, S. C., and Kort, E. A.: Inverse modelling of CH4 emissions for 2010–2011 using different satellite retrieval products from GOSAT and SCIAMACHY, Atmos. Chem. Phys., 15, 113–133, 10.5194/acp-15-113-2015, 2015.

Allen, D. T., Pacsi, A. P., Sullivan, D. W., Zavala-Araiza, D., Harrison, M., Keen, K., Fraser, M. P., Hill, A. D., Sawyer, R. F., and Seinfeld, J. H.: Methane Emissions from Process Equipment at Natural Gas Production Sites in the United States: Pneumatic Controllers, Environ. Sci. Technol., 49, 633–640, 10.1021/es5040156, 2015.

Baker, D., Law, R., Gurney, K., Rayner, P., Peylin, P., Denning, A., Bousquet, P., Bruhwiler, L., Chen, Y., Ciais, P., Fung, I., Heimann, M., John, J., Maki, T., Maksyutov, S., Masarie, K., Prather, M., Pak, B., Taguchi, S., and Zhu, Z.: TransCom 3 inversion intercomparison: Impact of transport model errors on the interannual variability of regional CO2 fluxes, 1988–2003, Global Biogeochem. Cy., 20, GB1002, 10.1029/2004GB002439, 2006.

Baker, D. F., Bösch, H., Doney, S. C., O'Brien, D., and Schimel, D. S.: Carbon source/sink information provided by column CO2 measurements from the Orbiting Carbon Observatory, Atmos. Chem. Phys., 10, 4145–4165, 10.5194/acp-10-4145-2010, 2010.

Baldocchi, D., Falge, E., Gu, L. H., Olson, R., Hollinger, D., Running, S., Anthoni, P., Bernhofer, C., Davis, K., Evans, R., Fuentes, J., Goldstein, A., Katul, G., Law, B., Lee, X. H., Malhi, Y., Meyers, T., Munger, W., Oechel, W., Paw, K. T., Pilegaard, K., Schmid, H. P., Valentini, R., Verma, S., Vesala, T., Wilson, K., and Wofsy, S.: FLUXNET: A new tool to study the temporal and spatial variability of ecosystem-scale carbon dioxide, water vapor, and energy flux densities, B. Am. Meteorol. Soc., 82, 2415–2434, 10.1175/1520-0477(2001)082<2415:fantts>2.3.co;2, 2001.

Basu, S., Guerlet, S., Butz, A., Houweling, S., Hasekamp, O., Aben, I., Krummel, P., Steele, P., Langenfelds, R., Torn, M., Biraud, S., Stephens, B., Andrews, A., and Worthy, D.: Global CO2 fluxes estimated from GOSAT retrievals of total column CO2, Atmos. Chem. Phys., 13, 8695–8717, 10.5194/acp-13-8695-2013, 2013.

Basu, S., Miller, J. B., and Lehman, S.: Separation of biospheric and fossil fuel fluxes of CO2 by atmospheric inversion of CO2 and 14CO2 measurements: Observation System Simulations, Atmos. Chem. Phys., 16, 5665–5683, 10.5194/acp-16-5665-2016, 2016.

Beer, C., Reichstein, M., Tomelleri, E., Ciais, P., Jung, M., Carvalhais, N., Roedenbeck, C., Arain, M. A., Baldocchi, D., Bonan, G. B., Bondeau, A., Cescatti, A., Lasslop, G., Lindroth, A., Lomas, M., Luyssaert, S., Margolis, H., Oleson, K. W., Roupsard, O., Veenendaal, E., Viovy, N., Williams, C., Woodward, F. I., and Papale, D.: Terrestrial Gross Carbon Dioxide Uptake: Global Distribution and Covariation with Climate, Science, 329, 834–838, 10.1126/science.1184984, 2010.

Berchet, A., Pison, I., Chevallier, F., Bousquet, P., Conil, S., Geever, M., Laurila, T., Lavric, J., Lopez, M., Moncrieff, J., Necki, J., Ramonet, M., Schmidt, M., Steinbacher, M., and Tarniewicz, J.: Towards better error statistics for atmospheric inversions of methane surface fluxes, Atmos. Chem. Phys., 13, 7115–7132, 10.5194/acp-13-7115-2013, 2013.

Berchet, A., Pison, I., Chevallier, F., Bousquet, P., Bonne, J.-L., and Paris, J.-D.: Objectified quantification of uncertainties in Bayesian atmospheric inversions, Geosci. Model Dev., 8, 1525–1546, 10.5194/gmd-8-1525-2015, 2015.

Bergamaschi, P., Corazza, M., Karstens, U., Athanassiadou, M., Thompson, R. L., Pison, I., Manning, A. J., Bousquet, P., Segers, A., Vermeulen, A. T., Janssens-Maenhout, G., Schmidt, M., Ramonet, M., Meinhardt, F., Aalto, T., Haszpra, L., Moncrieff, J., Popa, M. E., Lowry, D., Steinbacher, M., Jordan, A., O'Doherty, S., Piacentino, S., and Dlugokencky, E.: Top-down estimates of European CH4 and N2O emissions based on four different inverse models, Atmos. Chem. Phys., 15, 715–736, 10.5194/acp-15-715-2015, 2015.

Bloom, A. A., Lauvaux, T., Worden, J., Yadav, V., Duren, R., Sander, S. P., and Schimel, D. S.: What are the greenhouse gas observing system requirements for reducing fundamental biogeochemical process uncertainty? Amazon wetland CH4 emissions as a case study, Atmos. Chem. Phys., 16, 15199–15218, 10.5194/acp-16-15199-2016, 2016.

Bousquet, P., Ringeval, B., Pison, I., Dlugokencky, E. J., Brunke, E.-G., Carouge, C., Chevallier, F., Fortems-Cheiney, A., Frankenberg, C., Hauglustaine, D. A., Krummel, P. B., Langenfelds, R. L., Ramonet, M., Schmidt, M., Steele, L. P., Szopa, S., Yver, C., Viovy, N., and Ciais, P.: Source attribution of the changes in atmospheric methane for 2006–2008, Atmos. Chem. Phys., 11, 3689–3700, 10.5194/acp-11-3689-2011, 2011.

Bréon, F. M., Broquet, G., Puygrenier, V., Chevallier, F., Xueref-Remy, I., Ramonet, M., Dieudonné, E., Lopez, M., Schmidt, M., Perrussel, O., and Ciais, P.: An attempt at estimating Paris area CO2 emissions from atmospheric concentration measurements, Atmos. Chem. Phys., 15, 1707–1724, 10.5194/acp-15-1707-2015, 2015.

Broquet, G., Chevallier, F., Bréon, F.-M., Kadygrov, N., Alemanno, M., Apadula, F., Hammer, S., Haszpra, L., Meinhardt, F., Morguí, J. A., Necki, J., Piacentino, S., Ramonet, M., Schmidt, M., Thompson, R. L., Vermeulen, A. T., Yver, C., and Ciais, P.: Regional inversion of CO2 ecosystem fluxes from atmospheric measurements: reliability of the uncertainty estimates, Atmos. Chem. Phys., 13, 9039–9056, 10.5194/acp-13-9039-2013, 2013.

Butler, M., Davis, K., Denning, A., and Kawa, S.: Using continental observations in global atmospheric inversions of CO2: North American carbon sources and sinks, Tellus B, 62, 550–572, 10.1111/j.1600-0889.2010.00501.x, 2010.

Chatterjee, A., Michalak, A. M., Anderson, J. L., Mueller, K. L., and Yadav, V.: Toward reliable ensemble Kalman filter estimates of CO2 fluxes, J. Geophys. Res.-Atmos., 117, D22306, 10.1029/2012JD018176, 2012.

Chevallier, F.: Impact of correlated observation errors on inverted CO2 surface fluxes from OCO measurements, Geophys. Res. Lett., 34, L24804, 10.1029/2007GL030463, 2007.

Chevallier, F. and O'Dell, C.: Error statistics of Bayesian CO2 flux inversion schemes as seen from GOSAT, Geophys. Res. Lett., 40, 1252–1256, 10.1002/grl.50228, 2013.

Chevallier, F., Bréon, F. M., and Rayner, P. J.: Contribution of the Orbiting Carbon Observatory to the estimation of CO2 sources and sinks: Theoretical study in a variational data assimilation framework, J. Geophys. Res., 112, D09307, 10.1029/2006JD007375, 2007.

Chevallier, F., Feng, L., Bosch, H., Palmer, P., and Rayner, P.: On the impact of transport model errors for the estimation of CO2 surface fluxes from GOSAT observations, Geophys. Res. Lett., 37, L21803, 10.1029/2010GL044652, 2010.

Chevallier, F., Palmer, P., Feng, L., Boesch, H., O'Dell, C., and Bousquet, P.: Toward robust and consistent regional CO2 flux estimates from in situ and spaceborne measurements of atmospheric CO2, Geophys. Res. Lett., 41, 1065–1070, 10.1002/2013GL058772, 2014.

Ciais, P., Rayner, P., Chevallier, F., Bousquet, P., Logan, M., Peylin, P., and Ramonet, M.: Atmospheric inversions for estimating CO2 fluxes: methods and perspectives, Climatic Change, 103, 69–92, 10.1007/s10584-010-9909-3, 2010a.

Ciais, P., Canadell, J., Luyssaert, S., Chevallier, F., Shvidenko, A., Poussi, Z., Jonas, M., Peylin, P., King, A., Schulze, E., Piao, S., Rödenbeck, C., Peters, W., and Bréon, F.: Can we reconcile atmospheric estimates of the Northern terrestrial carbon sink with land-based accounting?, Current Opinion in Environmental Sustainability, 2, 225–230, 10.1016/j.cosust.2010.06.008, 2010b.

Ciais, P., Crisp, D., van der Gon, H., Engelen, R., Heimann, M., Janssens-Maenhout, G., Rayner, P., and Scholze, M.: Towards a European Operational Observing System to Monitor Fossil CO2 emissions – Final Report from the expert group. European Commission, Copernicus Climate Change Service, avaialble at: http://www.copernicus.eu/sites/default/files/library/CO2_Report_22Oct2015.pdf (last access: 14 June 2017), 2015.

Climate-KIC: Miriade: Developing the market for the latest technology in greenhouse gas monitoring, available at: http://www.climate-kic.org/projects/miriade/, last access: 19 May 2017.

Conway, T., Tans, P., Waterman, L., and Thoning, K.: Evidence for interannual variability of the carbon cycle from the National Oceanic and Atmospheric Administration/Climate Monitoring and Diagnostics Laboratory Global Air Sampling Network, J. Geophys. Res.-Atmos., 99, 22831–22855, 10.1029/94JD01951, 1994.

Corazza, M., Bergamaschi, P., Vermeulen, A. T., Aalto, T., Haszpra, L., Meinhardt, F., O'Doherty, S., Thompson, R., Moncrieff, J., Popa, E., Steinbacher, M., Jordan, A., Dlugokencky, E., Brühl, C., Krol, M., and Dentener, F.: Inverse modelling of European N2O emissions: assimilating observations from different networks, Atmos. Chem. Phys., 11, 2381–2398, 10.5194/acp-11-2381-2011, 2011.

Cressot, C., Chevallier, F., Bousquet, P., Crevoisier, C., Dlugokencky, E. J., Fortems-Cheiney, A., Frankenberg, C., Parker, R., Pison, I., Scheepmaker, R. A., Montzka, S. A., Krummel, P. B., Steele, L. P., and Langenfelds, R. L.: On the consistency between global and regional methane emissions inferred from SCIAMACHY, TANSO-FTS, IASI and surface measurements, Atmos. Chem. Phys., 14, 577–592, 10.5194/acp-14-577-2014, 2014.

Crisp, D., Fisher, B. M., O'Dell, C., Frankenberg, C., Basilio, R., Bösch, H., Brown, L. R., Castano, R., Connor, B., Deutscher, N. M., Eldering, A., Griffith, D., Gunson, M., Kuze, A., Mandrake, L., McDuffie, J., Messerschmidt, J., Miller, C. E., Morino, I., Natraj, V., Notholt, J., O'Brien, D. M., Oyafuso, F., Polonsky, I., Robinson, J., Salawitch, R., Sherlock, V., Smyth, M., Suto, H., Taylor, T. E., Thompson, D. R., Wennberg, P. O., Wunch, D., and Yung, Y. L.: The ACOS CO2 retrieval algorithm – Part II: Global XCO2 data characterization, Atmos. Meas. Tech., 5, 687–707, 10.5194/amt-5-687-2012, 2012.

Desroziers, G., Berre, L., Chapnik, B., and Poli, P.: Diagnosis of observation, background and analysis-error statistics in observation space, Q. J. Roy. Meteor. Soc., 131, 3385–3396, 10.1256/qj.05.108, 2005.

Díaz Isaac, L. I., Lauvaux, T., Davis, K. J., Miles, N. L., Richardson, S. J., Jacobson, A. R., and Andrews, A. E.: Model-datacomparison of MCI field campaign atmospheric CO2 mole fractions, J. Geophys. Res.-Atmos., 119, 10536–10551, 10.1002/2014JD021593, 2014.

ECMWF: Quality of Our Forecasts, available at: http://www.ecmwf.int/en/forecasts/quality-our-forecasts (last access: 14 June 2017), 2016.

Enting, I. G.: Inverse Problems in Atmospheric Constituent Transport, Cambridge University Press, Cambridge, United Kingdom, 392 pp., 2002.

Enting, I. G. and Mansbridge, J. V.: Seasonal sources and sinks of atmospheric CO(2) Direct inversion of filtered data, Tellus B, 41, 111–126, 10.1111/j.1600-0889.1989.tb00129.x, 1989.

Enting, I. G. and Mansbridge, J. V.: Latitudinal distribution of sources and sinks of CO2 – results of an inversion study, Tellus B, 43, 156–170, 10.1034/j.1600-0889.1991.00010.x, 1991.

Fang, Y. and Michalak, A. M.: Atmospheric observations inform CO2 flux responses to enviroclimatic drivers, Global Biogeochem. Cy., 29, 555–566, 10.1002/2014gb005034, 2015.

Fang, Y., Michalak, A. M., Shiga, Y. P., and Yadav, V.: Using atmospheric observations to evaluate the spatiotemporal variability of CO2 fluxes simulated by terrestrial biospheric models, Biogeosciences, 11, 6985–6997, 10.5194/bg-11-6985-2014, 2014.

Frankenberg, C., Kulawik, S. S., Wofsy, S. C., Chevallier, F., Daube, B., Kort, E. A., O'Dell, C., Olsen, E. T., and Osterman, G.: Using airborne HIAPER Pole-to-Pole Observations (HIPPO) to evaluate model and remote sensing estimates of atmospheric carbon dioxide, Atmos. Chem. Phys., 16, 7867–7878, 10.5194/acp-16-7867-2016, 2016.

Friedlingstein, P., Meinshausen, M., Arora, V. K., Jones, C. D., Anav, A., Liddicoat, S. K., and Knutti, R.: Uncertainties in CMIP5 Climate Projections due to Carbon Cycle Feedbacks, J. Climate, 27, 511–526, 10.1175/jcli-d-12-00579.1, 2014.

Ganesan, A. L., Rigby, M., Zammit-Mangion, A., Manning, A. J., Prinn, R. G., Fraser, P. J., Harth, C. M., Kim, K.-R., Krummel, P. B., Li, S., Mühle, J., O'Doherty, S. J., Park, S., Salameh, P. K., Steele, L. P., and Weiss, R. F.: Characterization of uncertainties in atmospheric trace gas inversions using hierarchical Bayesian methods, Atmos. Chem. Phys., 14, 3855–3864, 10.5194/acp-14-3855-2014, 2014.

Göckede, M., Michalak, A., Vickers, D., Turner, D., and Law, B.: Atmospheric inverse modeling to constrain regional-scale CO2 budgets at high spatial and temporal resolution, J. Geophys. Res.-Atmos., 115, D15113, 10.1029/2009JD012257, 2010a.

Göckede, M., Turner, D., Michalak, A., Vickers, D., and Law, B.: Sensitivity of a subregional scale atmospheric inverse CO2 modeling framework to boundary conditions, J. Geophys. Res.-Atmos., 115, D24112, 10.1029/2010JD014443, 2010b.

Gourdji, S. M., Hirsch, A. I., Mueller, K. L., Yadav, V., Andrews, A. E., and Michalak, A. M.: Regional-scale geostatistical inverse modeling of North American CO2 fluxes: a synthetic data study, Atmos. Chem. Phys., 10, 6151–6167, 10.5194/acp-10-6151-2010, 2010.

Gourdji, S. M., Mueller, K. L., Yadav, V., Huntzinger, D. N., Andrews, A. E., Trudeau, M., Petron, G., Nehrkorn, T., Eluszkiewicz, J., Henderson, J., Wen, D., Lin, J., Fischer, M., Sweeney, C., and Michalak, A. M.: North American CO2 exchange: inter-comparison of modeled estimates with results from a fine-scale atmospheric inversion, Biogeosciences, 9, 457–475, 10.5194/bg-9-457-2012, 2012.

Gurney, K., Law, R., Denning, A., Rayner, P., Baker, D., Bousquet, P., Bruhwiler, L., Chen, Y., Ciais, P., Fan, S., Fung, I., Gloor, M., Heimann, M., Higuchi, K., John, J., Maki, T., Maksyutov, S., Masarie, K., Peylin, P., Prather, M., Pak, B., Randerson, J., Sarmiento, J., Taguchi, S., Takahashi, T., and Yuen, C.: Towards robust regional estimates of CO2 sources and sinks using atmospheric transport models, Nature, 415, 626–630, 10.1038/415626a, 2002.

Houweling, S., Aben, I., Breon, F.-M., Chevallier, F., Deutscher, N., Engelen, R., Gerbig, C., Griffith, D., Hungershoefer, K., Macatangay, R., Marshall, J., Notholt, J., Peters, W., and Serrar, S.: The importance of transport model uncertainties for the estimation of CO2 sources and sinks using satellite measurements, Atmos. Chem. Phys., 10, 9981–9992, 10.5194/acp-10-9981-2010, 2010.

Houweling, S., Krol, M., Bergamaschi, P., Frankenberg, C., Dlugokencky, E. J., Morino, I., Notholt, J., Sherlock, V., Wunch, D., Beck, V., Gerbig, C., Chen, H., Kort, E. A., Röckmann, T., and Aben, I.: A multi-year methane inversion using SCIAMACHY, accounting for systematic errors using TCCON measurements, Atmos. Chem. Phys., 14, 3991–4012, 10.5194/acp-14-3991-2014, 2014.

Houweling, S., Bergamaschi, P., Chevallier, F., Heimann, M., Kaminski, T., Krol, M., Michalak, A. M., and Patra, P.: Global inverse modeling of CH4 sources and sinks: an overview of methods, Atmos. Chem. Phys., 17, 235–256, 10.5194/acp-17-235-2017, 2017.

Hungershoefer, K., Breon, F.-M., Peylin, P., Chevallier, F., Rayner, P., Klonecki, A., Houweling, S., and Marshall, J.: Evaluation of various observing systems for the global monitoring of CO2 surface fluxes, Atmos. Chem. Phys., 10, 10503–10520, 10.5194/acp-10-10503-2010, 2010.

Jeong, S., Millstein, D., and Fischer, M. L.: Spatially Explicit Methane Emissions from Petroleum Production and the Natural Gas System in California, Environ. Sci. Technol., 48, 5982–5990, 10.1021/es4046692, 2014.

Jiang, F., Wang, H. M., Chen, J. M., Machida, T., Zhou, L. X., Ju, W. M., Matsueda, H., and Sawa, Y.: Carbon balance of China constrained by CONTRAIL aircraft CO2 measurements, Atmos. Chem. Phys., 14, 10133–10144, 10.5194/acp-14-10133-2014, 2014.

Johnson, M. S., Xi, X., Jeong, S., Yates, E. L., Iraci, L. T., Tanaka, T., Loewenstein, M., Tadic, J. M., and Fischer, M. L.: Investigating seasonal methane emissions in Northern California using airborne measurements and inverse modeling, J. Geophys. Res.-Atmos., 121, 13753–13767, 10.1002/2016JD025157, 2016.

Jung, M., Reichstein, M., Margolis, H. A., Cescatti, A., Richardson, A. D., Arain, M. A., Arneth, A., Bernhofer, C., Bonal, D., Chen, J., Gianelle, D., Gobron, N., Kiely, G., Kutsch, W., Lasslop, G., Law, B. E., Lindroth, A., Merbold, L., Montagnani, L., Moors, E. J., Papale, D., Sottocornola, M., Vaccari, F., and Williams, C.: Global patterns of land-atmosphere fluxes of carbon dioxide, latent heat, and sensible heat derived from eddy covariance, satellite, and meteorological observations, J. Geophys. Res.-Biogeo., 116, G00J07, 10.1029/2010jg001566, 2011.

Kadygrov, N., Broquet, G., Chevallier, F., Rivier, L., Gerbig, C., and Ciais, P.: On the potential of the ICOS atmospheric CO2 measurement network for estimating the biogenic CO2 budget of Europe, Atmos. Chem. Phys., 15, 12765–12787, 10.5194/acp-15-12765-2015, 2015.

Kalnay, E.: Atmospheric Modeling, Data Assimilation and Predictability, Cambridge University Press, Cambridge, United Kingdom, 2003.

Kim, H., Maksyutov, S., Glagolev, M., Machida, T., Patra, P., Sudo, K., and Inoue, G.: Evaluation of methane emissions from West Siberian wetlands based on inverse modeling, Environ. Res. Lett., 6, 035201, 10.1088/1748-9326/6/3/035201, 2011.

Kim, J., Kim, H., and Cho, C.: The effect of optimization and the nesting domain on carbon flux analyses in Asia using a carbon tracking system based on the ensemble Kalman filter, Asia-Pac. J. Atmos. Sci., 50, 327–344, 10.1007/s13143-014-0020-y, 2014.

Koohkan, M. R., Bocquet, M., Roustan, Y., Kim, Y., and Seigneur, C.: Estimation of volatile organic compound emissions for Europe using data assimilation, Atmos. Chem. Phys., 13, 5887–5905, 10.5194/acp-13-5887-2013, 2013.

Kuppel, S., Chevallier, F., and Peylin, P.: Quantifying the model structural error in carbon cycle data assimilation systems, Geosci. Model Dev., 6, 45–55, 10.5194/gmd-6-45-2013, 2013.

Lauvaux, T., Gioli, B., Sarrat, C., Rayner, P., Ciais, P., Chevallier, F., Noilhan, J., Miglietta, F., Brunet, Y., Ceschia, E., Dolman, H., Elbers, J., Gerbig, C., Hutjes, R., Jarosz, N., Legain, D., and Uliasz, M.: Bridging the gap between atmospheric concentrations and local ecosystem measurements, Geophys. Res. Lett., 36, L19809, 10.1029/2009GL039574, 2009.

Lauvaux, T., Miles, N., Deng, A., Richardson, S., Cambaliza, M., Davis, K., Gaudet, B., Gurney, K., Huang, J., O'Keefe, D., Song, Y., Karion, A., Oda, T., Patarasuk, R., Razlivanov, I., Sarmiento, D., Shepson, P., Sweeney, C., Turnbull, J., and Wu, K.: High-resolution atmospheric inversion of urban CO2 emissions during the dormant season of the Indianapolis Flux Experiment (INFLUX), J. Geophys. Res.-Atmos., 121, 5213–5236, 10.1002/2015JD024473, 2016.

Lauvaux, T., Schuh, A. E., Uliasz, M., Richardson, S., Miles, N., Andrews, A. E., Sweeney, C., Diaz, L. I., Martins, D., Shepson, P. B., and Davis, K. J.: Constraining the CO2 budget of the corn belt: exploring uncertainties from the assumptions in a mesoscale inverse system, Atmos. Chem. Phys., 12, 337–354, 10.5194/acp-12-337-2012, 2012.

Law, B. E., Falge, E., Gu, L., Baldocchi, D. D., Bakwin, P., Berbigier, P., Davis, K., Dolman, A. J., Falk, M., Fuentes, J. D., Goldstein, A., Granier, A., Grelle, A., Hollinger, D., Janssens, I. A., Jarvis, P., Jensen, N. O., Katul, G., Mahli, Y., Matteucci, G., Meyers, T., Monson, R., Munger, W., Oechel, W., Olson, R., Pilegaard, K., Paw, K. T., Thorgeirsson, H., Valentini, R., Verma, S., Vesala, T., Wilson, K., and Wofsy, S.: Environmental controls over carbon dioxide and water vapor exchange of terrestrial vegetation, Agr. Forest Meteorol., 113, 97–120, 10.1016/s0168-1923(02)00104-1, 2002.

Le Quéré, C., Moriarty, R., Andrew, R. M., Canadell, J. G., Sitch, S., Korsbakken, J. I., Friedlingstein, P., Peters, G. P., Andres, R. J., Boden, T. A., Houghton, R. A., House, J. I., Keeling, R. F., Tans, P., Arneth, A., Bakker, D. C. E., Barbero, L., Bopp, L., Chang, J., Chevallier, F., Chini, L. P., Ciais, P., Fader, M., Feely, R. A., Gkritzalis, T., Harris, I., Hauck, J., Ilyina, T., Jain, A. K., Kato, E., Kitidis, V., Klein Goldewijk, K., Koven, C., Landschützer, P., Lauvset, S. K., Lefèvre, N., Lenton, A., Lima, I. D., Metzl, N., Millero, F., Munro, D. R., Murata, A., Nabel, J. E. M. S., Nakaoka, S., Nojiri, Y., O'Brien, K., Olsen, A., Ono, T., Pérez, F. F., Pfeil, B., Pierrot, D., Poulter, B., Rehder, G., Rödenbeck, C., Saito, S., Schuster, U., Schwinger, J., Séférian, R., Steinhoff, T., Stocker, B. D., Sutton, A. J., Takahashi, T., Tilbrook, B., van der Laan-Luijkx, I. T., van der Werf, G. R., van Heuven, S., Vandemark, D., Viovy, N., Wiltshire, A., Zaehle, S., and Zeng, N.: Global Carbon Budget 2015, Earth Syst. Sci. Data, 7, 349–396, 10.5194/essd-7-349-2015, 2015.

Lindqvist, H., O'Dell, C. W., Basu, S., Boesch, H., Chevallier, F., Deutscher, N., Feng, L., Fisher, B., Hase, F., Inoue, M., Kivi, R., Morino, I., Palmer, P. I., Parker, R., Schneider, M., Sussmann, R., and Yoshida, Y.: Does GOSAT capture the true seasonal cycle of carbon dioxide?, Atmos. Chem. Phys., 15, 13023–13040, 10.5194/acp-15-13023-2015, 2015.

Liu, J. and Bowman, K.: A method for independent validation of surface fluxes from atmospheric inversion: Application to CO2, Geophys. Res. Lett., 43, 3502–3508, 10.1002/2016GL067828, 2016.

Locatelli, R., Bousquet, P., Chevallier, F., Fortems-Cheney, A., Szopa, S., Saunois, M., Agusti-Panareda, A., Bergmann, D., Bian, H., Cameron-Smith, P., Chipperfield, M. P., Gloor, E., Houweling, S., Kawa, S. R., Krol, M., Patra, P. K., Prinn, R. G., Rigby, M., Saito, R., and Wilson, C.: Impact of transport model errors on the global and regional methane emissions estimated by inverse modelling, Atmos. Chem. Phys., 13, 9917–9937, 10.5194/acp-13-9917-2013, 2013.

Locatelli, R., Bousquet, P., Saunois, M., Chevallier, F., and Cressot, C.: Sensitivity of the recent methane budget to LMDz sub-grid-scale physical parameterizations, Atmos. Chem. Phys., 15, 9765–9780, 10.5194/acp-15-9765-2015, 2015.

Lyon, D. R., Zavala-Araiza, D., Alvarez, R. A., Harriss, R., Palacios, V., Lan, X., Talbot, R., Lavoie, T., Shepson, P., Yacovitch, T. I., Herndon, S. C., Marchese, A. J., Zimmerle, D., Robinson, A. L., and Hamburg, S. P.: Constructing a Spatially Resolved Methane Emission Inventory for the Barnett Shale Region, Environ. Sci. Technol., 49, 8147–8157, 10.1021/es506359c, 2015.

Masarie, K. A., Pétron, G., Andrews, A., Bruhwiler, L., Conway, T. J., Jacobson, A. R., Miller, J. B., Tans, P. P., Worthy, D. E., and Peters, W.: Impact of CO2 measurement bias on CarbonTracker surface flux estimates, J. Geophys. Res., 116, D17305, 10.1029/2011JD016270, 2011.

McKain, K., Wofsy, S. C., Nehrkorn, T., Eluszkiewicz, J., Ehleringer, J. R., and Stephens, B.: Assessment of ground-based atmospheric observations for verification of greenhouse gas emissions from an urban region, P. Natl. Acad. Sci. USA, 109, 8423–8428, 10.1073/pnas.1116645109, 2012.

Meesters, A., Tolk, L., Peters, W., Hutjes, R., Vellinga, O., Elbers, J., Vermeulen, A., van der Laan, S., Neubert, R., Meijer, H., and Dolman, A.: Inverse carbon dioxide flux estimates for the Netherlands, J. Geophys. Res.-Atmos., 117, D20306, 10.1029/2012JD017797, 2012.

Michalak, A. M.: Atmospheric observations and inverse modeling approaches for identifying geographical sources and sinks of carbon, in: Land Use and the Carbon Cycle: Advances in Integrated Science, Management, and Policy, edited by: Brown, D. G., Robinson, D. T., French, N. H., and Reed, B. C., Cambridge University Press, New York, New York, 144–177, 2013.

Michalak, A., Bruhwiler, L., and Tans, P.: A geostatistical approach to surface flux estimation of atmospheric trace gases, J. Geophys. Res.-Atmos., 109, D14109, 10.1029/2003JD004422, 2004.

Michalak, A., Hirsch, A., Bruhwiler, L., Gurney, K., Peters, W., and Tans, P.: Maximum likelihood estimation of covariance parameters for Bayesian atmospheric trace gas surface flux inversions, J. Geophys. Res.-Atmos., 110, D24107, 10.1029/2005JD005970, 2005.

Miller, S., Wofsy, S., Michalak, A., Kort, E., Andrews, A., Biraud, S., Dlugokencky, E., Eluszkiewicz, J., Fischer, M., Janssens-Maenhout, G., Miller, B., Miller, J., Montzka, S., Nehrkorn, T., and Sweeney, C.: Anthropogenic emissions of methane in the United States, P. Natl. Acad. Sci. USA, 110, 20018–20022, 10.1073/pnas.1314392110, 2013.

Miller, S., Worthy, D., Michalak, A., Wofsy, S., Kort, E., Havice, T., Andrews, A., Dlugokencky, E., Kaplan, J., Levi, P., Tian, H., and Zhang, B.: Observational constraints on the distribution, seasonality, and environmental predictors of North American boreal methane emissions, Global Biogeochem. Cy., 28, 146–160, 10.1002/2013GB004580, 2014.

Miller, S. M. and Michalak, A. M.: Constraining sector-specific CO2 and CH4 emissions in the US, Atmos. Chem. Phys., 17, 3963–3985, 10.5194/acp-17-3963-2017, 2017.

Miller, S. M., Commane, R., Melton, J. R., Andrews, A. E., Benmergui, J., Dlugokencky, E. J., Janssens-Maenhout, G., Michalak, A. M., Sweeney, C., and Worthy, D. E. J.: Evaluation of wetland methane emissions across North America using atmospheric data and inverse modeling, Biogeosciences, 13, 1329–1339, 10.5194/bg-13-1329-2016, 2016a.

Miller, S., Miller, C., Commane, R., Chang, R., Dinardo, S., Henderson, J., Karion, A., Lindaas, J., Melton, J., Miller, J., Sweeney, C., Wofsy, S., and Michalak, A.: A multiyear estimate of methane fluxes in Alaska from CARVE atmospheric observations, Global Biogeochem. Cy., 30, 1441–1453, 10.1002/2016GB005419, 2016b.

Miyazaki, K., Maki, T., Patra, P., and Nakazawa, T.: Assessing the impact of satellite, aircraft, and surface observations on CO2 flux estimation using an ensemble-based 4-D data assimilation system, J. Geophys. Res.-Atmos., 116, D16306, 10.1029/2010JD015366, 2011.

Nassar, R., Jones, D. B. A., Kulawik, S. S., Worden, J. R., Bowman, K. W., Andres, R. J., Suntharalingam, P., Chen, J. M., Brenninkmeijer, C. A. M., Schuck, T. J., Conway, T. J., and Worthy, D. E.: Inverse modeling of CO2 sources and sinks using satellite observations of CO2 from TES and surface flask measurements, Atmos. Chem. Phys., 11, 6029–6047, 10.5194/acp-11-6029-2011, 2011.

Ogle, S. M., Davis, K. J., Lauvaux, T., Schuh, A., Cooley, D., West, T. O., Heath, L. S., Miles, N. L., Richardson, S. J., Breidt, F. J., Smith, J. E., McCarty, J. L., Gurney, K. R., Tans, P., and Denning, A. S.: An approach for verifying biogenic greenhouse gas emissions inventories with atmospheric CO2 concentration data, Environ. Res. Lett., 10, 034012, 10.1088/1748-9326/10/3/034012, 2015.

Osterman, G., Wennberg, P. O., Gunson, M., and Basilio, R.: OCO (Orbiting Carbon Observatory)-2 Science Validation Plan, National Aeronautics and Space Administration Jet Propulsion Laboratory California Institute of Technology, 2011.

Pacala, S. W., Breidenich, C., Brewer, P. G., Fung, I., Gunson, M. R., Heddle, G., Law, B., Marland, G., Paustian, K., Prather, M., Randerson, J. T., Tans, P., Wofsy, S. C., Linn, A. M., Sturdivant, J., Busalacchi, J., Atnonio J., Bierbaum, R. M., Carbone, R., Dabberdt, W. F., Dow, K., Forbes, G. S., Held, I., Lee, A., Pierrehumbert, R. T., Prather, K., Smith, K. R., Snow, J. T., Vonder Haar, T. H., Zeng, X., Meehl, G. A., Elfring, C., Geller, L., Kraucunas, I., McConnell, M., Walser, M., Warden, T., Casola, J., Gaskins, R., Weller, K., Brown, L., Greenway, R., Freeland, S.-A., Purcell, A., Payne, R., and Banskota, S.: Verifying Greenhouse Gas Emissions: Methods to Support International Climate Agreements, National Research Council of the National Academies, Washington, DC, 2010.

Pandey, S., Houweling, S., Krol, M., Aben, I., Chevallier, F., Dlugokencky, E. J., Gatti, L. V., Gloor, E., Miller, J. B., Detmers, R., Machida, T., and Röckmann, T.: Inverse modeling of GOSAT-retrieved ratios of total column CH4 and CO2 for 2009 and 2010, Atmos. Chem. Phys., 16, 5043–5062, 10.5194/acp-16-5043-2016, 2016.

Peters, W., Krol, M. C., Van Der Werf, G. R., Houweling, S., Jones, C. D., Hughes, J., Schaefer, K., Masarie, K. A., Jacobson, A. R., Miller, J. B., Cho, C. H., Ramonet, M., Schmidt, M., Ciattaglia, L., Apadula, F., Heltai, D., Meinhardt, F., Di Sarra, A. G., Piacentino, S., Sferlazzo, D., Aalto, T., Hatakka, J., Ström, J., Haszpra, L., Meijer, H. A. J., Van Der Laan, S., Neubert, R. E. M., Jordan, A., Rodó, X., Morguí, J. A., Vermeulen, A. T., Popa, E., Rozanski, K., Zimnoch, M., Manning, A. C., Leuenberger, M., Uglietti, C., Dolman, A. J., Ciais, P., Heimann, M., and Tans, P. P.: Seven years of recent European net terrestrial carbon dioxide exchange constrained by atmospheric observations, Glob. Change Biol., 16, 1317–1337, 10.1111/j.1365-2486.2009.02078.x, 2009.

Peylin, P., Houweling, S., Krol, M. C., Karstens, U., Rödenbeck, C., Geels, C., Vermeulen, A., Badawy, B., Aulagnier, C., Pregger, T., Delage, F., Pieterse, G., Ciais, P., and Heimann, M.: Importance of fossil fuel emission uncertainties over Europe for CO2 modeling: model intercomparison, Atmos. Chem. Phys., 11, 6607–6622, 10.5194/acp-11-6607-2011, 2011.

Peylin, P., Law, R. M., Gurney, K. R., Chevallier, F., Jacobson, A. R., Maki, T., Niwa, Y., Patra, P. K., Peters, W., Rayner, P. J., Rödenbeck, C., van der Laan-Luijkx, I. T., and Zhang, X.: Global atmospheric carbon budget: results from an ensemble of atmospheric CO2 inversions, Biogeosciences, 10, 6699–6720, 10.5194/bg-10-6699-2013, 2013.

Pickett-Heaps, C., Rayner, P., Law, R., Ciais, P., Patra, P., Bousquet, P., Peylin, P., Maksyutov, S., Marshall, J., Rödenbeck, C., Langenfelds, R., Steele, L., Francey, R., Tans, P., and Sweeney, C.: Atmospheric CO2 inversion validation using vertical profile measurements: Analysis of four independent inversion models, J. Geophys. Res.-Atmos., 116, D12305, 10.1029/2010JD014887, 2011.

Pillai, D., Gerbig, C., Kretschmer, R., Beck, V., Karstens, U., Neininger, B., and Heimann, M.: Comparing Lagrangian and Eulerian models for CO2 transport – a step towards Bayesian inverse modeling using WRF/STILT-VPRM, Atmos. Chem. Phys., 12, 8979–8991, 10.5194/acp-12-8979-2012, 2012.

Rayner, P. J., Utembe, S. R., and Crowell, S.: Constraining regional greenhouse gas emissions using geostationary concentration measurements: a theoretical study, Atmos. Meas. Tech., 7, 3285–3293, 10.5194/amt-7-3285-2014, 2014.

Rayner, P., Michalak, A. M., and Chevallier, F.: Fundamentals of Data Assimilation, Geosci. Model Dev. Discuss., 10.5194/gmd-2016-148, in review, 2016.

Reuter, M., Buchwitz, M., Hilker, M., Heymann, J., Schneising, O., Pillai, D., Bovensmann, H., Burrows, J. P., Bösch, H., Parker, R., Butz, A., Hasekamp, O., O'Dell, C. W., Yoshida, Y., Gerbig, C., Nehrkorn, T., Deutscher, N. M., Warneke, T., Notholt, J., Hase, F., Kivi, R., Sussmann, R., Machida, T., Matsueda, H., and Sawa, Y.: Satellite-inferred European carbon sink larger than expected, Atmos. Chem. Phys., 14, 13739–13753, 10.5194/acp-14-13739-2014, 2014.

Richardson, A. D., Anderson, R. S., Arain, M. A., Barr, A. G., Bohrer, G., Chen, G., Chen, J. M., Ciais, P., Davis, K. J., Desai, A. R., Dietze, M. C., Dragoni, D., Garrity, S. R., Gough, C. M., Grant, R., Hollinger, D. Y., Margolis, H. A., McCaughey, H., Migliavacca, M., Monson, R. K., Munger, J. W., Poulter, B., Raczka, B. M., Ricciuto, D. M., Sahoo, A. K., Schaefer, K., Tian, H., Vargas, R., Verbeeck, H., Xiao, J., and Xue, Y.: Terrestrial biosphere models need better representation of vegetation phenology: results from the North American Carbon Program Site Synthesis, Glob. Change Biol., 18, 566–584, 10.1111/j.1365-2486.2011.02562.x, 2012.

Rivier, L., Peylin, P., Ciais, P., Gloor, M., Rödenbeck, C., Geels, C., Karstens, U., Bousquet, P., Brandt, J., and Heimann, M.: Aerocarb Experimentalists: European CO2 fluxes from atmospheric inversions using regional and global transport models, Climatic Change, 103, 93–115 10.1007/s10584-010-9908-4, 2010.

Rodgers, C. D.: Inverse Methods for Atmospheric Sounding: Theory and Practice, Series on Atmospheric, Oceanic and Planetary Physics, World Scientific, 238 pp., 2000.

100

Saeki, T., Maksyutov, S., Saito, M., Valsala, V., Oda, T., Andres, R., Belikov, D., Tans, P., Dlugokencky, E., Yoshida, Y., Morino, I., Uchino, O., and Yokota, T.: Inverse Modeling of CO2 Fluxes Using GOSAT Data and Multi-Year Ground-Based Observations, Sola, 9, 45–50, 10.2151/sola.2013-011, 2013a.

101

Saeki, T., Maksyutov, S., Sasakawa, M., Machida, T., Arshinov, M., Tans, P., Conway, T., Saito, M., Valsala, V., Oda, T., Andres, R., and Belikov, D.: Carbon flux estimation for Siberia by inverse modeling constrained by aircraft and tower CO2 measurements, J. Geophys. Res.-Atmos., 118, 1100–1122, 10.1002/jgrd.50127, 2013b.

102

Schaefer, K., Schwalm, C. R., Williams, C., Arain, M. A., Barr, A., Chen, J. M., Davis, K. J., Dimitrov, D., Hilton, T. W., Hollinger, D. Y., Humphreys, E., Poulter, B., Raczka, B. M., Richardson, A. D., Sahoo, A., Thornton, P., Vargas, R., Verbeeck, H., Anderson, R., Baker, I., Black, T. A., Bolstad, P., Chen, J., Curtis, P. S., Desai, A. R., Dietze, M., Dragoni, D., Gough, C., Grant, R. F., Gu, L., Jain, A., Kucharik, C., Law, B., Liu, S., Lokipitiya, E., Margolis, H. A., Matamala, R., McCaughey, J. H., Monson, R., Munger, J. W., Oechel, W., Peng, C., Price, D. T., Ricciuto, D., Riley, W. J., Roulet, N., Tian, H., Tonitto, C., Torn, M., Weng, E., and Zhou, X.: A model-data comparison of gross primary productivity: Results from the North American Carbon Program site synthesis, J. Geophys. Res.-Biogeo., 117, G03010, 10.1029/2012jg001960, 2012.

103

Schuh, A. E., Denning, A. S., Corbin, K. D., Baker, I. T., Uliasz, M., Parazoo, N., Andrews, A. E., and Worthy, D. E. J.: A regional high-resolution carbon flux inversion of North America for 2004, Biogeosciences, 7, 1625–1644, 10.5194/bg-7-1625-2010, 2010.

104

Schuh, A., Lauvaux, T., West, T., Denning, A., Davis, K., Miles, N., Richardson, S., Uliasz, M., Lokupitiya, E., Cooley, D., Andrews, A., and Ogle, S.: Evaluating atmospheric CO2 inversions at multiple scales over a highly inventoried agricultural landscape, Glob. Change Biol., 19, 1424–1439, 10.1111/gcb.12141, 2013.

105

Shiga, Y., Michalak, A. M., Kawa, S. R., and Engelen, R. J.: In-situ CO2 monitoring network evaluation and design: A criterion based on atmospheric CO2 variability, J. Geophys. Res.-Atmos., 118, 2007–2018, 10.1002/jgrd.50168, 2013.

106

Shiga, Y., Michalak, A., Gourdji, S., Mueller, K., and Yadav, V.: Detecting fossil fuel emissions patterns from subcontinental regions using North American in situ CO2 measurements, Geophys. Res. Lett., 41, 4381–4388, 10.1002/2014GL059684, 2014.

107

Simmons, A. J. and Hollingsworth, A.: Some aspects of the improvement in skill of numerical weather prediction, Q. J. Roy. Meteor. Soc., 128, 647–677, 10.1256/003590002321042135, 2002.

108

Steele, L. P., Dlugokencky, E. J., Lang, P. M., Tans, P. P., Martin, R. C., and Masarie, K. A.: Slowing down of the global accumulation of atmospheric methane during the 1980s, Nature, 358, 313–316, 10.1038/358313a0, 1992.

109

Subramanian, R., Williams, L. L., Vaughn, T. L., Zimmerle, D., Roscioli, J. R., Herndon, S. C., Yacovitch, T. I., Floerchinger, C., Tkacik, D. S., Mitchell, A. L., Sullivan, M. R., Dallmann, T. R., and Robinson, A. L.: Methane Emissions from Natural Gas Compressor Stations in the Transmission and Storage Sector: Measurements and Comparisons with the EPA Greenhouse Gas Reporting Program Protocol, Environ. Sci. Technol., 49, 3252–3261, 10.1021/es5060258, 2015.

110

Tadić, J. M., Loewenstein, M., Frankenberg, C., Butz, A., Roby, M., Iraci, L. T., Yates, E. L., Gore, W., and Kuze, A.: A Comparison of In Situ Aircraft Measurements of Carbon Dioxide and Methane to GOSAT Data Measured Over Railroad Valley Playa, Nevada, USA, IEEE T. Geosci. Remote, 52, 7764-7774, 10.1109/tgrs.2014.2318201, 2014.

111

Takagi, H., Houweling, S., Andres, R., Belikov, D., Bril, A., Boesch, H., Butz, A., Guerlet, S., Hasekamp, O., Maksyutov, S., Morino, I., Oda, T., O'Dell, C., Oshchepkov, S., Parker, R., Saito, M., Uchino, O., Yokota, T., Yoshida, Y., and Valsala, V.: Influence of differences in current GOSAT XCO2 retrievals on surface flux estimation, Geophys. Res. Lett., 41, 2598–2605, 10.1002/2013GL059174, 2014.

112

Takahashi, T., Olafsson, J., Goddard, J. G., Chipman, D. W., and Sutherland, S. C.: Seasonal-variation of CO2 and nutrients in the high-latitude surface oceans – a comparative-study, Global Biogeochem. Cy., 7, 843–878, 10.1029/93gb02263, 1993.

113

Takahashi, T., Sutherland, S. C., Sweeney, C., Poisson, A., Metzl, N., Tilbrook, B., Bates, N., Wanninkhof, R., Feely, R. A., Sabine, C., Olafsson, J., and Nojiri, Y.: Global sea-air CO2 flux based on climatological surface ocean pCO(2), and seasonal biological and temperature effects, Deep-Sea Res. Pt. II, 49, 1601–1622, 10.1016/s0967-0645(02)00003-6, 2002.

114

Talagrand, O.: Errors. A posteriori diagnostics, in Advanced Data Assimilation for Geosciences Lecture Notes of the Les Houches School of Physics: Special Issue, June 2012, edited by: Blayo, É., Bocquet, M., Cosme, E., and Cugliandolo, L. F., Oxford University Press, 608 pp., 2014.

115

Talagrand, O., Vautard, R., and Strauss, B.: Evaluation of probabilistic prediction systems, in: Proceeding of workshop on predictability, 20–22 October 1997, European Centre for Medium-Range Weather Forecasts, Shinfield Park, Reading, Berkshire RG2 9AX, UK, 25 pp., available at: https://www.ecmwf.int/sites/default/files/elibrary/1997/12555-evaluation-probabilistic-prediction-systems.pdf (last access: 14 June 2017), 1999.

116

Tans, P. P., Fung, I. Y., and Takahashi, T.: Observational constraints on the global atmospheric CO2 budget, Science, 247, 1431–1438, 10.1126/science.247.4949.1431, 1990.

117

Thompson, R., Bousquet, P., Chevallier, F., Rayner, P., and Ciais, P.: Impact of the atmospheric sink and vertical mixing on nitrous oxide fluxes estimated using inversion methods, J. Geophys. Res.-Atmos., 116, D17307, 10.1029/2011JD015815, 2011.

118

Thompson, R. L., Ishijima, K., Saikawa, E., Corazza, M., Karstens, U., Patra, P. K., Bergamaschi, P., Chevallier, F., Dlugokencky, E., Prinn, R. G., Weiss, R. F., O'Doherty, S., Fraser, P. J., Steele, L. P., Krummel, P. B., Vermeulen, A., Tohjima, Y., Jordan, A., Haszpra, L., Steinbacher, M., Van der Laan, S., Aalto, T., Meinhardt, F., Popa, M. E., Moncrieff, J., and Bousquet, P.: TransCom N2O model inter-comparison – Part 2: Atmospheric inversion estimates of N2O emissions, Atmos. Chem. Phys., 14, 6177–6194, 10.5194/acp-14-6177-2014, 2014.

119

van der Werf, G. R., Randerson, J. T., Giglio, L., Collatz, G. J., Kasibhatla, P. S., and Arellano Jr., A. F.: Interannual variability in global biomass burning emissions from 1997 to 2004, Atmos. Chem. Phys., 6, 3423–3441, 10.5194/acp-6-3423-2006, 2006.

120

Villani, M. G., Bergamaschi, P., Krol, M., Meirink, J. F., and Dentener, F.: Inverse modeling of European CH4 emissions: sensitivity to the observational network, Atmos. Chem. Phys., 10, 1249–1267, 10.5194/acp-10-1249-2010, 2010.

121

WMO-LCDNV: WMO Lead Centre for Deterministic Forecast Verification, available at: http://apps.ecmwf.int/wmolcdnv/, last access: 10 June 2016.

122

Wu, L., Bocquet, M., Lauvaux, T., Chevallier, F., Rayner, P., and Davis, K.: Optimal representation of source-sink fluxes for mesoscale carbon dioxide inversion with synthetic data, J. Geophys. Res.-Atmos., 116, D21304, 10.1029/2011JD016198, 2011.

123

Wu, L., Bocquet, M., Chevallier, F., Lauvaux, T., and Davis, K.: Hyperparameter estimation for uncertainty quantification in mesoscale carbon dioxide inversions, Tellus B, 65, 20894, 10.3402/tellusb.v65i0.20894, 2013.

124

Wu, L., Broquet, G., Ciais, P., Bellassen, V., Vogel, F., Chevallier, F., Xueref-Remy, I., and Wang, Y.: What would dense atmospheric observation networks bring to the quantification of city CO2 emissions?, Atmos. Chem. Phys., 16, 7743–7771, 10.5194/acp-16-7743-2016, 2016.

125

Wunch, D., Toon, G. C., Blavier, J.-F. L., Washenfelder, R. A., Notholt, J., Connor, B. J., Griffith, D. W. T., Sherlock, V., and Wennberg, P. O.: The Total Carbon Column Observing Network, Philos. T. R. Soc. A, 369, 2087–2112, 10.1098/rsta.2010.0240, 2011.

126

Yadav, V., Michalak, A. M., Ray, J., and Shiga, Y. P.: A statistical approach for isolating fossil fuel emissions in atmospheric inverse problems, J. Geophys. Res.-Atmos., 121, 12490–12504, 10.1002/2016JD025642, 2016.

127

Zhang, H., Chen, B., van der Laan-Luijkx, I., Chen, J., Xu, G., Yan, J., Zhou, L., Fukuyama, Y., Tans, P., and Peters, W.: Net terrestrial CO2 exchange over China during 2001–2010 estimated with an ensemble data assimilation system for atmospheric CO2, J. Geophys. Res.-Atmos., 119, 3500–3515, 10.1002/2013JD021297, 2014.

128

Zimmerle, D., Williams, L., Vaughn, T., Quinn, C., Subramanian, R., Duggan, G., Willson, B., Opsomer, J., Marchese, A., Martinez, D., and Robinson, A.: Methane Emissions from the Natural Gas Transmission and Storage System in the United States, Environ. Sci. Technol., 49, 9374–9383, 10.1021/acs.est.5b01669, 2015.

</app></app-group></back> </article>