the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Development of a parametrised atmospheric NOx chemistry scheme to help quantify fossil fuel CO2 emission estimates
Auke Visser
Nicolas Bousserez
Success of the Paris Agreement relies on rapid reductions in fossil fuel CO2 (ffCO2) emissions. Atmospheric data can verify the ffCO2 reductions pledged by nations in their nationally determined contributions. However, estimating ffCO2 from atmospheric CO2 is challenging due to natural fluxes and varying backgrounds. One approach is to combine with nitrogen oxides (NOx = NO + NO2), which are co-emitted with CO2 during combustion. A key challenge in using NOx to estimate ffCO2 is the computational cost of modelling atmospheric photochemistry. Additionally, the NO2 : NO column ratio must be well understood to convert model NOx columns to NO2 columns for comparison with satellite data. We use random forest regression to parameterise NOx chemistry, relying only on meteorological parameters and NOx concentration. The regression is trained on outputs from a nested GEOS (Goddard Earth Observing System)-Chem model simulation for mainland Europe in 2019. We develop a monthly NOx chemistry parameterisation that performs well when tested on perturbed emission runs (R2 > 0.95) and on unseen meteorology for 2021 (R2 > 0.79). We also parameterise the NO2 : NO ratio (R2 > 0.99 on perturbed outputs, R2 > 0.92 on unseen meteorology). Additionally, we present an alternative method to predict NOx rates by scaling baseline NOx rates with changes in NOx concentration (R2 = 1.0 on perturbed outputs). Our models reproduce NO2 columns with minimal deviation from full-chemistry models, with reconstruction error smaller than the TROPOspheric Monitoring Instrument (TROPOMI) precision in over 99.9 % of cases, supporting robust ffCO2 inversion efforts. These results provide a robust framework for accurately estimating fossil fuel CO2 emissions from atmospheric data, enabling more reliable monitoring and verification of global emissions reductions.
- Article
(14402 KB) - Full-text XML
- BibTeX
- EndNote
Reaching net zero greenhouse gas emissions is a global goal, needed to curb further warming of our planet. Achieving that goal on a national scale requires accurate knowledge about fossil fuel emissions of CO2 (ffCO2) to verify a country's progress towards achieving their Nationally Determined Contributions under the Paris Agreement. But how can a country assess whether they are heading in the right direction? The default approach is to use national inventories that are compiled from energy statistics and emission factors but they are uncertain for various reasons, mainly associated with the veracity of the statistics and their spatial and temporal distributions and the default assumption of time-invariant emission factors (Kuenen et al., 2014; Hoesly et al., 2018b). Such “bottom-up” inventories are typically available with a delay of 2 years (Janssens-Maenhout et al., 2019) thereby introducing a temporal disconnect between climate action and results. The alternative “top-down”, data-driven approach uses Bayes' theory to infer CO2 emission estimates from observed changes in atmospheric CO2. This approach is also subject to uncertainties including errors in atmospheric transport models, sparse observational coverage, and background concentration estimation (Peylin et al., 2013; Andrew, 2020). One of the remaining challenges associated with this atmospheric approach is isolating the combustion and natural contributions to atmospheric CO2 (Oda et al., 2023). Various approaches have been proffered to address that challenge, which fall into two broad categories: spatial disaggregation of combustion (Shu and Lam, 2011; Liu et al., 2018) and natural fluxes and using an additional trace gas (Meijer et al., 1996; Lopez et al., 2013; Wenger et al., 2019; Super et al., 2020), associated exclusively with combustion or natural processes common to CO2. One such trace gas is NOx, but due to the large computational overhead of directly modelling the atmospheric NOx photochemistry, we endeavor to determine an alternative methodology to model NOx chemistry. Here we describe a parameterisation of tropospheric nitrogen oxide (NOx = NO + NO2) chemistry that effectively unlocks our ability to use NOx alongside CO2 to quantify ffCO2 estimates within an Bayesian inference framework, particularly in the context of an operational system.
Extracting energy from carbon-based fuels relies on breaking apart atomic bonds that form the molecular structure of the fuel, thereby releasing energy. This is achieved by combustion in which the fuel, composed primarily of hydrogen-carbon bonds, is oxidized by molecular oxygen (O2). Generally, more energy is released during combustion for fuels with a higher H : C ratio. The primary products of combustion are CO2 and water vapour. However, when combustion is inefficient – for example, due to insufficient O2 to fully oxidise the fuel – a wider range of compounds is released, depending on the composition of the fuel being burned. For many combustion processes, air is used to provide O2. While molecular nitrogen (N2) in air does not take part in the combustion reaction, the high temperatures involved can thermally dissociate N2 to facilitate the production of NO (and to a lesser extent NO2), which is subsequently co-emitted with the CO2 emissions. The advantage of using atmospheric NOx as a tracer of ffCO2 is its relatively short lifetime, on the order of hours to days, which means that we can link elevated NO2 satellite columns directly to their parent NOx emissions. Numerous studies are using observations of NOx and NO2 to constrain estimates of ffCO2 (Berezin et al., 2013; Lopez et al., 2013; Goldberg et al., 2019; Super et al., 2020). With the increasing availability of in situ and satellite measurements of atmospheric CO2, NO2 and other fossil-fuel tracers, deriving ffCO2 through multi-species model inversion techniques is becoming a widely used approach (Feng et al., 2009; Nayagam et al., 2023; Super et al., 2024; Wang et al., 2025). However, a key limitation of this method is the uncertainty in CO2 : NOx emission ratios, which vary by sector, fuel type, and combustion technology (Jiang et al., 2010; Wang et al., 2025) . Additional challenges include errors in atmospheric transport modelling, accurate representation of chemical processes, and limited observational coverage.
We present a methodology for parameterising NOx chemistry to reduce the associated computational overhead. We consider NOx because its constituents, NO and NO2, rapidly interconvert (Jacob, 1999). By modelling NOx as a proxy for the combined NO and NO2 we can save a considerable amount of computational time that would otherwise be spent on photochemical calculations (previously shown in Wu et al., 2023). To do this we need a model that can predict the net loss of NOx at each time step and grid point. The rate of decay of NOx is driven by a number of meteorological parameters (Nguyen et al., 2022) including, but not limited to, the irradiance from sunlight, air temperature and solar zenith angle. In this study, we develop a machine learning-based random forest regression model, trained on a full-chemistry version of the GEOS (Goddard Earth Observing System)-Chem atmospheric chemistry model, to accurately predict the atmospheric NOx rate of change using a small set of driving variables. We evaluate the robustness of our parameterised NOx chemistry using perturbed emissions on the order of those we typically employ in ensemble Kalman filter techniques. With atmospheric inversion methods in mind, atmospheric NOx emission estimates tend to be constrained by satellite column observations of NO2 (Napelenok et al., 2008; Zhao and Wang, 2009; Kemball-Cook et al., 2015) so our parameterised model must also be able to describe changes in NO2. We achieve this by developing a further random forest-based model, which can predict the species concentration NO2 : NO ratio.
Figure 1A schematic illustrates how NOx chemistry parameterisation models are integrated into GEOS-Chem for modelling of atmospheric NOx without a full chemistry scheme.
Figure 1 shows a schematic overview of the steps used to parameterise NOx chemistry and partitioning for efficient modelling of NO2 columns. The first stage involves running atmospheric simulations of NOx using offline chemistry rates, which are either predicted by random forest models (described in Sect. 2.2) or estimated through relative scaling (described in Sect. 2.3). In the second stage, the NOx output from these simulations is converted to NO2, enabling direct comparison with satellite observations such as TROPOMI NO2. This approach provides an efficient framework suitable for data assimilation applications.
In the next section, we describe the GEOS-Chem atmospheric chemistry transport model that we use to train our random forest models, the satellite observations of column NO2 that we use to evaluate our parameterised atmospheric chemistry model for NO2, and the approach we take to construct the random forest model. In Sect. 3, we report the performance of random forest models of atmospheric NOx and NO2 : NO, and evaluate the corresponding atmospheric NO2 columns using satellite data. We conclude the paper in Sect. 4.
Here, we describe the GEOS-Chem atmospheric transport model used to build our random forest regression models, the satellite column data we use to evaluate our parameterised model of atmospheric NOx chemistry, and details that describe how we develop our random forest regression models. A random forest regression model, or a constant lifetime scaling based approach can be used to predict the chemistry rates. The modelled NOx concentrations are then converted to NO2 using an additional random forest model. This efficient approach significantly reduces GEOS-Chem's computational cost for forward modelling of NO2 columns. This is particularly useful for high resolution data assimilation, allowing anthropogenic NOx emission perturbations to be compared with satellite NO2 observations, such as the TROPOspheric Monitoring Instrument (TROPOMI).
2.1 GEOS-Chem atmospheric chemistry transport model
We use version 14.2.2 of the GEOS-Chem atmospheric chemistry transport model (Bey et al., 2001) to describe the emissions, transport, and chemical production/loss of atmospheric NOx. For the purpose of our study, we use a nested version of the full chemistry model, centred over mainland Europe (32.75 to 61.25° N, −15 to 40° E) with 47 vertical levels, approximately 30 of which fall below the dynamic tropopause, where the first model layer has a depth of 130–180 m. The nested model runs with a horizontal spatial resolution of 0.25° × 0.3125°. Initial conditions and lateral boundary conditions to the nested domain were created from a consistent global version of the GEOS-Chem model run at 4° × 5°, with three-hourly output fields. We ran the model with a transport timestep of 5 min and a chemistry timestep of 10 min.
The model is driven by offline meteorology fields from the GEOS Forward Processing (GEOS-FP) product from the Global modelling and Assimilation Office (GMAO) at NASA Goddard Space Flight Center. GEOS-FP has a native horizontal resolution of 0.25° × 0.3125° with 72 vertical pressure levels and 3 h temporal resolution. To describe the emissions of NOx we used anthropogenic emissions from the Community Emissions Data System (CEDS) version 2 (Hoesly et al., 2018a, b), which provides NO emissions for anthropogenic combustion (industry, energy extraction), and non-combustion sources (agriculture, solvents), including surface transport and shipping. Aircraft emissions for NO and NO2 are taken from the Aviation Emissions Inventory Code (AEIC) (Simone et al., 2013). Pyrogenic emissions of NO are taken from the Global Fire Emissions Database (GFED) version 4.1 (Randerson et al., 2017).
GEOS-Chem's full-chemistry mechanism simulates atmospheric chemistry by explicitly solving a comprehensive network of chemical reactions, capturing the production, transformation, and loss of NOx and related species. NOx chemical loss is simulated through key reactions such as NO2 reacting with ozone (O3) to form NO3, hydroxyl radicals (OH) to produce nitric acid (HNO3), and hydroperoxyl radicals (HO2) to form peroxynitric acid (HNO4). Organic nitrate formation is included through the reactions of NO2 with methyl peroxy radicals (MO2) and methacryloyl peroxy radicals (MCO3), forming methyl peroxy nitrate (MPN) and peroxyacetyl nitrate (PAN), respectively. Additional loss occurs via NO3 reacting with NO2 to produce dinitrogen pentoxide (N2O5). Simultaneously, the model accounts for important regeneration pathways, including the thermal decomposition of N2O5 into NO3 and NO2, the breakdown of PAN to release NO2 and methacryloyl peroxy radicals (MCO3), and the photolysis of HNO4 to produce NO2 and HO2. Rapid NO to NO2 exchange is simulated through key reactions, including NO + O3 → NO2 + O2, which relies on ozone to oxidize NO, and NO + NO3 → 2 NO2, which occurs through the reaction of nitric oxide with nitrate radicals. Additionally, photochemical reactions driven by sunlight include NO2 + O2 + hv → NO + O3, where nitrogen dioxide photodissociates to form nitric oxide. The mechanism determines reaction rates using reaction rate coefficients that depend on temperature, pressure, and solar radiation, alongside environmental inputs like meteorological fields and species concentrations.
The average diurnal cycle of NOx chemical rate of change calculated from full-chemistry simulations is presented in Fig. A1 for the four seasons of the year. The shape of the diurnal cycle in the NOx tendency varies seasonally, influenced by changing sunlight intensity and atmospheric conditions. In winter, the net NOx loss peaks predominantly at night, when photolytic regeneration ceases and reservoir species like HNO3 and PAN accumulate, removing NOx from the reactive pool. During spring and autumn, while a nighttime peak loss remains, there is an additional peak of comparable magnitude in the morning around 09:00–10:00 local solar time (LST). In summer, the maximum net loss shifts to the early morning hours 07:00–08:00 LST, likely driven by rapid photochemical activity as sunlight increases. Meanwhile, by the afternoon we find episodes of net NOx production, reflecting stronger photolytic regeneration under high solar intensity. These seasonal and diurnal variations reflect complex interactions between photochemistry, emission patterns, and atmospheric transport, resulting in shifts of NOx sinks and sources throughout the day and year.
Table 1Input parameters selected through forward feature selection for random forest regression models predicting the NOx chemical net rate of change [molec. cm−3 s−1] and the NO2 : NO partitioning ratio.
The NOx concentration, the NOx chemical rates of change, and relevant meteorology were output at a temporal resolution of one hour. The chosen meteorological parameters are shown in Table 1. These were selected as they were all found to have a relationship with the net NOx chemical rate of change.
Figure 2(a) Sensitivity testing shows that the impact of 20 % emission perturbations on modelled NO2 columns is on the same order as the deviations between GEOS-Chem and TROPOMI. (b) The impact of emission perturbations on the NOx chemistry rate becomes negligible (< 1 % change, or ΔNOx rate < 9 × 103 molec. cm−3 s−1) above 3 km from the ground. Additionally, chemistry rate change is negligible in all cases where ΔNOx concentration < 5 × 104 molec. cm−3.
The model was run for the full year 2019 with baseline (unperturbed) NOx anthropogenic emissions taken from the CEDs emission inventory. This data was used to train the regression models. To further validate the regression model's performance under varying emissions, additional model runs were conducted with random perturbations applied to anthropogenic NOx emissions on the order of ±20 %. We chose this size of perturbation because a 20 % increase in emissions induces changes in NO2 columns on the same order of magnitude as the difference observed between GEOS-Chem and TROPOMI (as in Fig. 2a). These perturbed runs were performed for 10 d in January, April, July, and October. A model run for the year 2021 was also performed in order to test the regression performance for an unseen meterological period.
2.2 Random Forest regression modelling
We trained two random forest regressor models to predict the NOx net chemical rate of change, and the NO2 : NO partitioning ratio. Random forest models are an ensemble machine learning method, which combine the predictions of many decision trees to improve accuracy and reduce overfitting (Breiman, 2001). A decision tree is a simple predictive model that makes a series of splits in the data based on input variables. At each node, the algorithm chooses the predictor and threshold that best separate the data with respect to the target, continuing until each final branch (or “leaf”) gives a prediction. While a single tree is easy to interpret, it can overfit the data. Random forests address this by building a “forest” of many trees, each trained on a random subset of the data and predictors. This randomness ensures the trees capture diverse patterns, and averaging their outputs yields more robust predictions. Such an algorithm is well-suited to this study as, unlike traditional regression approaches, it does not require assumptions about linearity and can flexibly capture complex relationships and interactions between meteorological drivers and chemical tendencies. Additionally, random forests are relatively computationally efficient to train and can handle correlated predictor variables, making them well suited for large atmospheric datasets.
These models were built using the Sci-kit learn python package (Pedregosa et al., 2011). We evaluated model performance using the coefficient of determination (R²), which quantifies the proportion of variance explained by the model; the mean absolute error (MAE), which measures the mean magnitude of prediction errors; and the mean bias, which indicates the mean tendency of the model to overpredict or underpredict relative to observations. These are defined by the following equations, where yi are true values, are predicted values, is the mean of the true values, and N is the number of datapoints:
We separately trained both regression models for each month of the year, for which we report results from January, April, July, and October 2019. The models were developed using the NOx concentration, the spatial location and a range of meteorological variables as input parameters. We considered a total of 14 input parameters as predictors in the models, shown in Table 1.
To identify the most relevant features for the models, we performed a comprehensive forward selection wrapper procedure, which iteratively adds the feature that yields the largest improvement in mean absolute error until no further gain is observed. Figure A2a and b detail how the performance of the models changed as we added features for the prediction of chemistry rate and the partitioning ratio, respectively. Based on this procedure, we selected a set of nine features for the chemistry rate model, and eight features for the partitioning ratio model (presented in Table 1). Five of the parameters; air pressure, air density, height of the planetary boundary layer, and the mixing ratio of ozone (O3) and the mixing ratio of carbon monoxide (CO), were consistently excluded from all models during feature selection. The respective importance of each feature across both models for the 4 months studied are plotted in Fig. A2c. For the chemistry rate prediction, the NOx concentration and the solar zenith angle are consistently emerge as the most important predictorss, contributing around 70 % of the total feature importance in the model. In the ratio prediction, solar zenith angle, altitude, and temperature are the primary predictors during the colder months (January and October), while temperature alone serves as the dominant predictor in the warmer months (April and July). Additionally, the impact on model performance of removing each of the 14 parameters in turn is presented in Fig. A2d. The individual relationship between the nine selected predictors and the NOx chemistry rate of change are shown in Fig. A3.
To avoid unnecessarily complex models, we tuned the model hyperparameter values to optimise the trade-off between computational efficiency and prediction accuracy. Specifically, we conducted a grid search across the four main hyperparameters in the random forest regression model: the number of trees (estimators), maximum tree depth, maximum number of leaf nodes, and maximum number of features considered at each split. We selected each hyper-parameter as the value at which performance plateaued, defined here as the point beyond which further increases in the parameter resulted in less than a 2 % improvement in model performance. The results of the tuning are presented in Fig. A4. The final optimised model achieved a prediction time of 6 ms per sample, providing a good balance between accuracy and computational cost. In addition to reducing computational time, simplifying the random forest by limiting tree complexity and number also reduces the risk of overfitting, thereby improving the generalisability of the model to new data.
We trained and tested our NOx chemistry regression models on model grid points in the first 3 km above the surface – the region where changes to surface emissions were found to directly influence the atmospheric chemistry, see Fig. 2b. The regression model for the NO2 : NO ratio was predicted for each level in the troposphere, and trained on the subset of model data that coincides with the TROPOMI swath (11:30–15:30 LST overpass). The NO2 : NO ratio can be used to convert the concentration of NOx to NO2:
To evaluate model generalisability, we tested model performance using two complementary approaches. Primarily, we assessed predictions on unseen emission perturbation scenarios while holding meteorology fixed. Specifically, we focused on ±20 % emission perturbations similar to those used in ensemble Kalman filter applications (Feng et al., 2009, 2023). This isolates the model's responsiveness to emission changes under consistent atmospheric conditions and reflects its intended use in inversion frameworks, where emissions are perturbed while meteorology remains prescribed. In addition, we include in the appendix (Fig. A6) an evaluation on an entirely independent simulation run for the year 2021, representing unseen meteorological conditions due to its different temporal period. For both approaches, training and testing datasets were constructed via random sampling across all spatial locations and time steps. The training set comprised a random 10 % subset of the unperturbed data, while the test set comprised 0.25 % of the perturbed (or 2021) data, ensuring minimal overlap in specific spatiotemporal conditions. Combined, this dual testing strategy rigorously evaluates the models' ability to generalise across both emission changes and meteorological variability, providing confidence in their performance for atmospheric inversion applications.
2.3 NOx chemical lifetime
In an alternative formulation, we apply the assumption that the effective lifetime of atmospheric NOx remains constant under stable meteorological conditions. Hence, if a full chemistry model run is available for a baseline emission scenario, the chemistry rates for perturbed scenarios can be calculated by scaling the original rate according to the proportional change in NOx concentration. This approach serves as an alternative to using regression models for predicting the chemistry rates.
The effective atmospheric lifetime, τ of NOx is given by:
where NOx denotes the combined NO and NO2 species concentrations [molec. cm−3] and is the instantaneous chemical rate of net loss [molec. cm−3s−1], which accounts for the balance between its chemical production (e.g., from reactions involving NO or NO2 precursors) and its chemical loss processes (e.g., reactions forming reservoirs like HNO3 or NOy species). Note that when NOx experiences an instantaneous net chemical production, this effective atmospheric lifetime becomes negative. We advise the reader that this effective lifetime does not represent an intrinsic first-order decay timescale for NOx. Instead, it provides a practical framework to express net rates of change relative to the amount of NOx present, which we find to be an intrinsically stable metric. The benefit of looking at the effective chemical lifetime, rather than the net rate of change, is that the quantity is largely independent of species concentration. This independence allows for a more stable understanding of the NOx chemistry, irrespective of fluctuations in its concentration caused by emission changes.
We found that while the influence of ±20 % emission perturbations cause clear changes to the NOx chemical net rate of change, the resulting changes to atmospheric lifetime are considerably smaller (see Fig. A5). This result suggests that the chemical lifetime is driven by the meteorology and location in the model but is less sensitive to changing concentrations of NOx. The unperturbed model run provides NOx concentrations and rates of change at a 1-hour temporal resolution, allowing the chemical rate of change to be updated every hour under the assumption of an unchanged chemical lifetime. The new rate of change can be determined using the NOx lifetime, τ, and the local NOx concentration:
For this method, an initial unperturbed full-chemistry model run must be employed to determine the NOx chemical lifetime for each grid-point and time-point for the spatial and temporal region of interest. Then for any further perturbed model runs, the chemistry rates can be determined without the need of an integrated chemistry scheme, thereby saving considerable computational time. The updated chemistry rates are then simply scaled by the ratio of the new NOx concentration to the original NOx concentration; so, if the concentration doubles then we assume a doubling in the net chemical rate of change. This method for updating the NOx chemistry is referred to as the constant lifetime scaling-based method.
2.4 Regression-based atmospheric chemistry transport modelling
For this study, we added the NOx species to the GEOS-Chem tagged carbon model, CO2, CO, methane, and carbonyl sulphide, in which individual tagged tracers track contributions of these trace gases from geographical regions and/or natural and human-driven fluxes. This model does not include an integrated chemistry scheme and therefore the NOx species chemical rate of change is determined using the NOx chemistry regression model. Going forward, we refer to this model as the regression-based atmospheric chemistry transport model (shown in Fig. 1).
We performed a full-chemistry model run with emission perturbations to evaluate the impact of emission changes on NOx chemistry, and later to assess the performance of our regression model in predicting the effects of emission changes. An analysis of how the emission-driven changes in chemistry rate varied with the atmospheric altitude as well as the change in NOx concentration is shown in Fig. 2b. The net rate of change in NOx chemistry showed minimal variability at altitudes above 3 km, where the chemistry change was less than 9 × 103 molec. cm−3 s−1. Additionally, minimal variability in atmospheric chemistry was observed when the absolute change in NOx concentration was less than 5 × 104 molec. cm−3, which corresponds to a chemistry change of less than 2 × 103 molec. cm−3 s−1. Based on these findings, we set a condition to update the NOx net chemical rate of change using the unperturbed full-chemistry outputs for altitudes above 3 km and for regions where the change in NOx concentration is less than 5 × 104 molec. cm−3. For all other regions, the chemistry regression model is used to predict the new rate of change.
We also used the constant lifetime scaling method (see above) to predict the new rate of change. Looking to Fig. 1 we can see that this methodology provides an alternative approach to the regression-based atmospheric chemistry model for modelling NOx columns. Throughout this paper we will compare the results of the regression-based chemistry scheme and the constant lifetime scaling-based approach.
We ran the model for 10 d in January, April, July, and October which provided contrasting seasonal conditions to test the model. For each run, we use the ±20 % perturbed anthropogenic NOx emission sets. To evaluate the veracity of the NOx column model outputs for the regression-based chemistry model and for the constant lifetime scaling model, we compare them with the full-chemistry model outputs. We use our NO2 : NO ratio regression model to convert NOx results from our atmospheric chemistry regression model to NO2 columns, sampled at the time and location of TROPOMI data, so they can be compared with TROPOMI NO2 column data.
2.5 TROPOMI satellite column observations of NO2
We use TROPOMI NO2 tropospheric columns (S5P Level 2, product version 2.2.0, processing version 1.6.0.) (European Space Agency, 2021) to compare with the GEOS-Chem model output (see Fig. 1). TROPOMI was launched in 2017 in a Sun-synchronous orbit with a local equatorial overpass time of 13:30. It has a swath width of 2600 km and a ground pixel of 7 × 7 km2 in the nadir. Due to the width of the swath, the 13:30 overpass time corresponds to data captured with local solar time (LST) ranging from 11:30 and 15:30 LST in the highest latitude regions of the European domain. We only used data with a quality flag ≥ 0.75, filtering out data affected by elevated cloud cover, aerosol loading, and larger solar and viewing zenith angles. We analysed TROPOMI data for 10 d in January, April, July, and October 2019.
For our study, we regridded TROPOMI data to our 0.25° × 0.3125° GEOS-Chem model grid. To enable a comparison between TROPOMI and GEOS-Chem, we sampled the model at the location and time of each TROPOMI observation. We applied scene-dependent TROPOMI averaging kernels, describing the instrument sensitivity to changes in atmospheric NO2, to the corresponding model NO2 profiles.
Here, we report the model performance of our atmospheric chemistry prediction models for NOx and the accompanying regression model for the NO2 : NO ratio that enables us to convert NOx columns to NO2 columns observed by satellites. We assess the fidelity of our results from these models using the full-chemistry version of GEOS-Chem and evaluate our results using TROPOMI NO2 column data.
3.1 Performance of atmospheric chemistry regression models for NOx
3.1.1 NOx chemistry random forest
Figure 3a shows that the NOx chemistry random forest model has an impressive performance at reproducing results from the full-chemistry version of GEOS-Chem for the 4 months we study in 2019. The model performance R2 values are 0.97, 0.97, 0.96 and, 0.95 for January, April, July, and October 2019, respectively. The MAE values are largest in July (4 × 104 molec. cm−3 s−1) and smallest in January (2.3 × 104 molec. cm−3 s−1), reflecting the increase in magnitude of chemistry rates during summer months over Europe.
Figure 3Actual versus predicted scatter plots for models tested on simulations with unseen emission perturbations. (a) The random forest regression model for predicting the NOx chemistry rate, (b) the constant lifetime scaling for reconstructing the NOx chemistry rate using an unperturbed chemistry dataset, (c) the reconstruction of NO2 from NOx using the random forest regression model for predicting the NO2 : NO ratio.
We also tested our regression-based atmospheric chemistry model with model data from 2021 (Fig. A6). As expected, the regression model performance has less skill in reproducing data that has not been used for training. In this case, the MAE values are higher by a factor of 1.3–1.8 compared with the overall performance comparison shown in Fig. 4). Nevertheless, the model still shows substantial skill despite substantial differences in anthropogenic emissions between 2019 and 2021 due to COVID-19. Specifically, NOx emissions were found to decrease by 18 %–24 % during lockdown periods (Miyazaki et al., 2021) leading to a mean observed reduction in NO2 of 29 % (Cooper et al., 2022).
3.1.2 NOx chemistry prediction using constant lifetime scaling
Figure 3b shows results from using our alternative atmospheric chemistry regression NOx model that employs a constant atmospheric lifetime scaling approach (Eq. 4). The resulting model performance is a significant improvement above the other regression model for all 4 study months. Using our scaling approach, we found consistent values of R2 = 1.0 and MAE values that are approximately 2–3 times smaller than the other regression model. As with the other regression model, the size of the error is scaled by the seasonal changes in chemistry rates.
While this approach shows extremely encouraging abilities to determine NOx chemistry rates, its effectiveness relies on having a full-chemistry model run available for at least one set of emission inputs. Consequently, this approach is particularly useful for emission perturbation studies, for which numerous emission distribution scenarios might be needed for model inversion work. In this case, the full-chemistry model would only need to be run once for the given time period of interest. However, we cannot predict the NOx chemistry using this method for a previously unmodelled meterological period.
3.1.3 NO2 : NO ratio regression model
We find the random forest regression model to predict NO2 : NO ratios also demonstrates significant performance. The predicted ratio is used to convert NOx concentrations to NO2 concentrations (Eq. 2). Figure 3c shows that the regression model can reproduce “true” NO2 values from the full-chemistry of the GEOS-Chem model, with values of R2 of 1.0; the exception is January when R2 = 0.99.
Generally, the model performance is better during summer months and worse in winter months, with MAE values an order of magnitude smaller in July compared to January. This is partly due to NO2 concentrations increasing during colder months due to increased combustion and longer nights, and because we find that NO2 : NO ratios become increasingly hard to determine at higher solar zenith angles, typically experienced over Europe during daytime through winter months. We also examine the performance of this regression model using data from the unseen year 2021. As with the atmospheric chemistry regression model, described above, the performance was good but worse than for 2019 in which data was used to train the model. The MAE increased by a factor of 3.25, 3.52, 3.04, and 3.14 for January, April, July, and October respectively. We found the R2 performance reduced most for January from 0.99 to 0.92, During April and October R2 reduced from 1.0 to 0.99, while R2 = 1.0 was maintained in July.
Figure 5The modelled NOx columns sampled at 12:00 UTC after a 10 d model run with ±20 % emission perturbations. NOx columns are compared for the GEOS-Chem full-chemistry model and (a) NOx columns are simulated using the regression-based chemistry method and (b) using the constant lifetime scaling method.
3.2 NOx atmospheric modelling
Figure 5 shows the NOx column reconstruction for the two regression models used to describe the NOx chemistry rates from the full-chemistry version of the GEOS-Chem model. From a visual inspection, there are no obvious differences in the spatial distribution of the NOx columns reconstructed using both the regression-based chemistry model and the constant lifetime scaling model. However, when mapping the differences, there are areas of deviation from the full-chemistry model. Broadly, this deviation is significantly smaller when we use the scaling-based model compared to the regression-based. In addition, the error accumulation in January is notably smaller than in other months.
Figure 6Comparison of the temporal variation in NOx column reconstruction for the regression-based and scaling-based model. (a) The median (dashed line), IQR (light-shaded region) and range (dark-shaded region) of the NOx column reconstruction error over the 10 d runs. (b) The mean absolute percentage error over the 10 d runs. (c) Shows the reduction in computational time when modelling atmospheric NOx using each of our chemistry prediction methods compared to running with the full-chemistry model.
Figure 6 shows the temporal variation in the reconstruction error. The range, IQR, and median values are shown in Fig. 6a and the mean absolute percentage error (MAPE) is shown in Fig. 6b. For the regression-based chemistry method the range in deviation peaks at up to 3 × 1014 molec. cm−2 in January, 5 × 1014 molec. cm−2 in April and 6 × 1014 molec. cm−2 in July and October. This is reflected in maximum MAPE values of 2.8 %, 9.7 %, 8.9 %, and 9.3 % for the 4 months, respectively. On the whole, the MAPE reduces through time, with final deviation values of 1.7 %, 3.4 %, 2.0 %, and 4.8 % after the full 10 d run.
Reconstruction errors for the constant lifetime scaling model show much smaller errors, particularly in January, with MAPE < 0.2 % throughout the 10 d run. This is driven by the smaller impact that emission perturbations have on the NOx chemistry in January as shown by Fig. A5. In particular, the lifetime of NOx is relatively unchanged between the unperturbed and perturbed model runs. This reduced impact in January is likely due to the slower rate of photochemical reactions in the winter months and increased atmospheric stability at lower temperatures. The other months do see a more prominent deviation of up to a maximum of 4 × 1014 molec. cm−2, with peak MAPE values of 6.6 %, 5.7 %, and 4.5 %, for April, July, and October, respectively. As with the regression-based model outputs, here the MAPE also generally decreases through time with final deviation values of 0.1 %, 1.1 %, 0.2 %, and 0.3 % for each month, respectively. Interestingly, while the range and IQR are relatively stable throughout the run when using the regression-based reconstruction, these quantities decrease considerably with time when we use the scaling-based reconstruction.
The reconstruction error has a small diurnal cycle, peaking in the morning and to a lesser extent in the evening, reflecting the diurnal cycle of NOx chemistry (Fig. A1). Overall the absolute model error for both the regression-based and scaling-based methods peaks after the first day and then gradually reduce, plateauing by ≃day 6. This early peak in error followed by a reduction and eventual plateau is likely due to compensating errors, where the regression model's over- and under-predictions balance each other out over time, leading to a stabilisation of the overall error. It is encouraging that there is no accumulation of error through time, suggesting this approach would be suitable for studies longer than for 10 d. It is clear that the optimal reconstruction performance is found when using the scaling-based method, but as we already note there are limitations to this method. The regression-based approach still provides excellent reconstruction performance for our purposes.
Figure 7(a) The modelled NOx columns sampled at 12:00 UTC after a 10 d model run in 2021 using the regression models trained on 2019 compared with full-chemistry. (b) The mean absolute percentage error for the 10 d runs.
To evaluate the performance of the regression-based chemistry modelling approach with regression models trained on a different meteorological time period, the same models were applied to simulate atmospheric NOx over Europe for 2021. Figure 7a shows the reconstructed NOx columns after a 10 d model run. As expected, the reconstruction performance is clearly worse than when the regression-based chemistry is just applied in 2019 with emission perturbations (Fig. 5a). However, from a visual inspection, there are no obvious changes to the spatial distribution of the NOx columns reconstructed using regression-based chemistry in comparison to the full-chemistry model output. Additionally, the temporal variation in error is shown through plots of the MAPE (Fig. 7b). We see maximum MAPE values of 11.0 %, 10.0 %, 16.7 %, and for January, April, July, and October 2021 respectively. For all months this is an increase in the maximum deviation observed when applying this methodology to a perturbed 2019 run. Overall, this is reflective of the reduction in prediction power of the regression models when we apply to 2021, which has unseen meteorology. Overall, the same pattern of the absolute error gradually reducing and plateauing by ≃ day 6 is also observed here. However, the diurnal cycle of variation in the reconstruction error is more pronounced in the 2021 case, likely due to the fact that the regression model is worse performing during the night for unseen meteorology. The error tends to reduce dramatically towards the middle of the day, which is helpful if we consider the application of model comparison with satellite data such as a TROPOMI, which has a 13:30 overpass time.
Substantial computational time is saved when we employ these regression methods to model atmospheric NOx. Figure 6c shows the time taken for each model to perform a 1 d model run. This was calculated as the mean average for the model to run for a single day out of the 10 d run for each of the 4 months, repeated for 3 model runs. Clearly, the full-chemistry model takes the longest, with a mean of 52 min per day for our nested model over Europe. The regression-based chemistry model is significantly faster with a mean of 16 min (3.25 times improvement), while the constant lifetime scaling method is even faster, with a mean of 12 min (4.3 times improvement). It is important to note that the model run times reported here are subject to variability due to fluctuations in the relative loading experienced by the computer system used.
3.3 NO2 column reconstruction
Finally, we assess the capability of our NO2 : NO regression model, convolved with TROPOMI instrument averaging kernels, to reproduce observation column distributions of NO2 from TROPOMI. The absolute differences in NO2 columns between GEOS-Chem full-chemistry and the GEOS-Chem regression-based and scaling-based models are compared to the absolute difference in TROPOMI NO2 and GEOS-Chem full-chemistry, as well as to the magnitude of the TROPOMI NO2 column precision data. This is presented in Fig. 8a, compared for 8 d in January, April, July, and October. We apply the regression-based method to a 2019 perturbed model run, and to a 2021 model run.
Figure 8(a) The absolute difference in NO2 between GEOS-Chem full-chemistry and the constant lifetime scaling based model (blue); the regression-based chemistry model applied to a 2019 perturbed run (green) and applied to a 2021 run (purple); deviation from the observed NO2 TROPOMI columns (red); as well as the TROPOMI NO2 tropospheric column precision values (yellow). (b) The normalised NO2 differences are calculated by normalising the reconstructed model deviation by the absolute deviation between GEOS-Chem and TROPOMI, as well as by the TROPOMI column precision values. For the different model reconstructions, the NO2 deviation is consistently less than the corresponding TROPOMI precision value in more than 99.5 % datapoints.
We find comparable NO2 reconstruction errors for the 4 months we study. Earlier, with the NOx reconstruction, we found that the error was smaller for January than the other months (Fig. 3a and b), however, the higher error from the January NO2 : NO regression model (Fig. 3c) offsets this advantage, ultimately bringing the overall reconstruction error for all months to a comparable level. We observe comparable magnitudes of reconstruction error when we compare our NO2 reconstructions based on the scaling-based and regression-based methods applied to the 2019 model run. However, the reconstruction error tends to be consistently larger when we apply our regression-based method to the year 2021. This is particularly notable in January and July, which can be attributed to the greatest deterioration in NOx chemistry regression performance in July 2021, and the greatest deterioration in the NO2 prediction performance in January 2021 (see Fig. 4).
When we compare the difference between GEOS-Chem and TROPOMI NO2 columns, we find that the NO2 reconstruction errors are much smaller and much smaller than the estimated precision values for the data. This is the case for the scaling-based approach and the regression-based approach applied to both 2019 and 2021. This provides confidence that our model reconstruction performance is robust enough for use in inversion work, even in the case of using regression models that have been trained on unseen meteorological periods. See Appendix B for a more detailed analysis on the difference between modelled column NO2 and observed TROPOMI data.
Figure 8b, shows that the median NO2 column model reconstruction errors are 2.8 % of the actual deviation from TROPOMI in the scaling-based approach, compared to 6.5 % and 7.3 % in the regression-based approach for 2019 and 2021, respectively. Similarly, these construction errors represent a median value of 1.3 % of the TROPOMI precision value for the scaling-based approach, compared to 2.9 % and 3.2 % for the regression-based approach for 2019 and 2021, respectively. Across all reconstructed data points, we found that over 99.9 % of the data had reconstruction errors smaller than the corresponding TROPOMI column precision for both reconstruction methods in 2019. For the regression-based method applied in 2021, this was true for over 99.7 % of the data.
We have demonstrated that the NOx chemistry rates and NO2 : NO ratio described by a leading 3-D atmospheric chemistry model can be reproduced using random forest-based regression models using NOx concentrations, the spatial location, and meteorological variables as input parameters. The models perform successfully on perturbed testing data through all months of 2019 with R2>0.95 for predicting NOx chemistry rates and R2>0.99 for predicting the corresponding NO2 : NO concentration ratios. We also show that these models maintain their prediction capability when tested on model outputs from an unseen year (2021) with contrasting environment conditions.
We have also demonstrated that the atmospheric lifetime of NOx is stable against varying emissions, particularly in winter months. From this, we have demonstrated that it is also possible to predict updated NOx chemistry rates of change as a result of emission perturbations, with knowledge of NOx chemistry from an initial unperturbed model run. This scaling-based approach has impressive prediction performance with R2 = 1.0.
We have developed two viable methodologies to model atmospheric NOx in a more computationally efficient way than using the GEOS-Chem 3-D model. The regression-based chemistry method has the advantage of not requiring prior knowledge of the NOx lifetimes for a baseline model run, and reduces the computational time by a factor of 3.25. The lifetime scaling-based approach reduces the model run time slightly further by a factor of 4.3, but a baseline full-chemistry model run is required. This scaling-based approach has smaller model reconstruction errors, but generally both approaches have reconstruction errors smaller than the TROPOMI precision values for over 99.9 % of the reconstructed data (399 502 points).
Our study provides confidence in random forest models being used to describe NOx chemistry to a sufficient accuracy for them to play an important role in inversion methods. Previous work has already found that NO2 can be used to help constrain ffCO2 (Berezin et al., 2013; Lopez et al., 2013; Goldberg et al., 2019; Super et al., 2020), and this work develops a new methodology to more efficiently infer NO2 column enhancements from changes to NOx emission inputs. The methodologies developed here will be used within a joint NOx : CO2 model inversion to constrain geographically resolved ffCO2. This will be explored using an ensemble Kalman filter within the GEOS-Chem model framework, as well as within the Integrated Forecasting System (IFS) using an incremental 4D-Var algorithm (Inness et al., 2013). Results from our study are particularly timely with the launch in the next few years of the Copernicus Anthropogenic Carbon Dioxide Monitoring constellation (CO2M) that include column measurements of CO2 and NO2. Overall this work will support the development and employment of European CO2 measurement, reporting and verification systems.
Figure A1Diurnal cycle of NOx chemistry for 4 months of the year. Median and interquartile range net rates of change at the surface of the atmosphere averaged across the European domain.
Figure A2(a) Feature selection results for the rate prediction models, obtained using a forward selection wrapper method. Plotted are the coefficient of determination (R2) and mean absolute error (MAE) as functions of the number of features included, for each of the four seasonal models (January, April, July, October). (b) Same as (a), but for the partitioning ratio prediction models. (c) Feature importance distributions for each of the four monthly models, showing the relative contributions of each predictor variable to the rate prediction models (using nine features) and the partitioning ratio prediction models (using eight features). (d) Change in MAE resulting from the removal of each of the 14 features in turn, demonstrating the individual impact of each feature on model performance and highlighting the importance of specific predictors for accurate rate and ratio estimates.
Figure A3Individual relationships between the nine regression input parameters and the NOx net rate of change. A LOWESS fit (red line) illustrates smoothed trends in the data, with R2 values reported for each fit. Among the parameters, NOx concentration, altitude, and temperature exhibit noticeable trends with chemistry rates, while the remaining parameters show little to no clear trends individually.
Figure A4Impact of hyperparameter changes on random forest regression model performance for predicting NOx chemistry rates. Plots show the effect of varying the number of trees, maximum tree depth, maximum leaf nodes, and maximum features per decision on mean R2, MAE, and prediction time (shaded regions represent performance ranges across monthly models). Increased algorithm complexity improves R2 and reduces MAE but increases prediction time. Optimal hyperparameters – 40 trees, depth of 30, 300 000 leaf nodes, and 4 features per decision – achieve balanced performance with a prediction time of 6 ms.
Figure A5The spatial distribution of the impact of ±20 % emission perturbations on (a) the NOx net rate of change, and (b) the atmospheric lifetime of NOx. Overall, it is clear that the impact on the atmospheric lifetime is much smaller, due to its independence from the NOx species concentration. Note that a negative lifetime of NOx arises in areas where we have a net chemical production of NOx.
The NO2 columns modelled by GEOS-Chem was compared directly with the TROPOMI data for assessment of agreement. Scatter plots between the two are shown in Fig. B1, where we found significant Pearson correlations (p<0.001) in all months. In January we observe a general positive bias, where the model is overestimating NO2, while in July and October, a negative bias is seen.
The spatial distribution of the deviation between GEOS-Chem and TROPOMI is shown in Fig. B2. While there are clear areas of difference, it is notable that the general regions where we observe elevated levels of NO2 are in alignment. In general, the spatial distribution of high-emission regions throughout Europe is fairly well understood. However, there is likely some error on the magnitudes of the emissions in the inventories used. This is likely to explain the majority of the areas of large bias between the model and the observations. However, it must be noted that other sources of error are present, which include model errors in transport processes, potential inaccuracies in the model meteorology used, errors in parameterising deposition processes, and the limiting factor of the model spatial resolution. Furthermore, there is also error on the TROPOMI measurements (largely characterised by the TROPOMI column precision value) including from instrument noise, cloud and aerosol interference, and vertical profile and sensitivity assumptions. Looking to Fig. 8 it is clear that there are many regions where the error between the model and observations is significantly smaller than the satellite precision, and for such areas the contribution of NOx emissions is likely to be accurate.
On the whole, it is promising to the performance of the model that there is a general correlation of agreement between the model and satellite data. However, there is room for improvement in model agreement, and model inversions would be one approach to achieve this.
The analysis code, model output data, and random forest regression models (in .pkl format) are available upon request from the corresponding author (cschooli@ed.ac.uk).
CS performed the GEOS-Chem model runs and data analysis. CS, PP, AV, and NB were involved in discussions and contributed to the development of the methodology. CS and PP wrote the paper. AV and NB provided feedback and comments on the paper.
The contact author has declared that none of the authors has any competing interests.
Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
We also thank the GEOS-Chem community, especially the team at Harvard University for maintaining the GEOS-Chem model, and the NASA Global Modelling and Assimilation Office (GMAO) for providing the GEOS-FP data product.
This research has been supported by the CO2MVS Research on Supplementary Observations (CORSO) project funded by the Horizon Europe programme (grant no. 101082194) and the NERC National Centre for Earth Observation (grant no. NE/R016518/1).
This paper was edited by Beatriz Monge-Sanz and reviewed by two anonymous referees.
Andrew, R. M.: A comparison of estimates of global carbon dioxide emissions from fossil carbon sources, Earth Syst. Sci. Data, 12, 1437–1465, https://doi.org/10.5194/essd-12-1437-2020, 2020. a
Berezin, E. V., Konovalov, I. B., Ciais, P., Richter, A., Tao, S., Janssens-Maenhout, G., Beekmann, M., and Schulze, E.-D.: Multiannual changes of CO2 emissions in China: indirect estimates derived from satellite measurements of tropospheric NO2 columns, Atmos. Chem. Phys., 13, 9415–9438, https://doi.org/10.5194/acp-13-9415-2013, 2013. a, b
Bey, I., Jacob, D. J., Yantosca, R. M., Logan, J. A., Field, B. D., Fiore, A. M., Li, Q., Liu, H. Y., Mickley, L. J., and Schultz, M. G.: Global modeling of tropospheric chemistry with assimilated meteorology: Model description and evaluation, Journal of Geophysical Research: Atmospheres, 106, 23073–23095, 2001. a
Breiman, L.: Random Forests, Machine Learning, 45, 5–32, https://doi.org/10.1023/a:1010933404324, 2001. a
Cooper, M. J., Martin, R. V., Hammer, M. S., Levelt, P. F., Veefkind, P., Lamsal, L. N., Krotkov, N. A., Brook, J. R., and McLinden, C. A.: Global fine-scale changes in ambient NO2 during COVID-19 lockdowns, Nature, 601, 380–387, https://doi.org/10.1038/s41586-021-04229-0, 2022. a
European Space Agency: TROPOMI NO2 tropospheric column (S5P Level 2, product version 2.2.0, processing version 1.6.0), Copernicus Sentinel-5P, Copernicus Data Space Ecosystem [data set], https://doi.org/10.5270/S5P-9bnp8q8, 2021. a
Feng, L., Palmer, P. I., Bösch, H., and Dance, S.: Estimating surface CO2 fluxes from space-borne CO2 dry air mole fraction observations using an ensemble Kalman Filter, Atmos. Chem. Phys., 9, 2619–2633, https://doi.org/10.5194/acp-9-2619-2009, 2009. a, b
Feng, L., Palmer, P. I., Parker, R. J., Lunt, M. F., and Bösch, H.: Methane emissions are predominantly responsible for record-breaking atmospheric methane growth rates in 2020 and 2021, Atmos. Chem. Phys., 23, 4863–4880, https://doi.org/10.5194/acp-23-4863-2023, 2023. a
Goldberg, D. L., Lu, Z., Oda, T., Lamsal, L. N., Liu, F., Griffin, D., McLinden, C. A., Krotkov, N. A., Duncan, B. N., and Streets, D. G.: Exploiting OMI NO2 satellite observations to infer fossil-fuel CO2 emissions from U.S. megacities, The Science of the Total Environment, 695, 133805–133805, 2019. a, b
Hoesly, R. M., Smith, S. J., Feng, L., and Bond, T. C.: Community Emissions Data System (CEDS): Historical emissions (1750–2014), Zenodo [data set], https://doi.org/10.5281/zenodo.1188083, 2018a. a
Hoesly, R. M., Smith, S. J., Feng, L., Klimont, Z., Janssens-Maenhout, G., Pitkanen, T., Seibert, J. J., Vu, L., Andres, R. J., Bolt, R. M., Bond, T. C., Dawidowski, L., Kholod, N., Kurokawa, J.-I., Li, M., Liu, L., Lu, Z., Moura, M. C. P., O'Rourke, P. R., and Zhang, Q.: Historical (1750–2014) anthropogenic emissions of reactive gases and aerosols from the Community Emissions Data System (CEDS), Geosci. Model Dev., 11, 369–408, https://doi.org/10.5194/gmd-11-369-2018, 2018b. a, b
Inness, A., Baier, F., Benedetti, A., Bouarar, I., Chabrillat, S., Clark, H., Clerbaux, C., Coheur, P., Engelen, R. J., Errera, Q., Flemming, J., George, M., Granier, C., Hadji-Lazaro, J., Huijnen, V., Hurtmans, D., Jones, L., Kaiser, J. W., Kapsomenakis, J., Lefever, K., Leitão, J., Razinger, M., Richter, A., Schultz, M. G., Simmons, A. J., Suttie, M., Stein, O., Thépaut, J.-N., Thouret, V., Vrekoussis, M., Zerefos, C., and the MACC team: The MACC reanalysis: an 8 yr data set of atmospheric composition, Atmos. Chem. Phys., 13, 4073–4109, https://doi.org/10.5194/acp-13-4073-2013, 2013. a
Jacob, D.: Introduction to atmospheric chemistry, Princeton University Press, Princeton, NJ, ISBN 978-0-691-00185-2, 1999. a
Janssens-Maenhout, G., Crippa, M., Guizzardi, D., Muntean, M., Schaaf, E., Dentener, F., Bergamaschi, P., Pagliari, V., Olivier, J. G. J., Peters, J. A. H. W., van Aardenne, J. A., Monni, S., Doering, U., Petrescu, A. M. R., Solazzo, E., and Oreggioni, G. D.: EDGAR v4.3.2 Global Atlas of the three major greenhouse gas emissions for the period 1970–2012, Earth Syst. Sci. Data, 11, 959–1002, https://doi.org/10.5194/essd-11-959-2019, 2019. a
Jiang, X., Huang, X., Liu, J., and Han, X.: NOx emission of fine-and superfine-pulverized coal combustion in O2/CO2 atmosphere, Energy & Fuels, 24, 6307–6313, 2010. a
Kemball-Cook, S., Yarwood, G., Johnson, J., Dornblaser, B., and Estes, M.: Evaluating NOx emission inventories for regulatory air quality modeling using satellite and air quality model data, Atmospheric Environment (1994), 117, 1–8, 2015. a
Kuenen, J. J. P., Visschedijk, A. J. H., Jozwicka, M., and Denier van der Gon, H. A. C.: TNO-MACC_II emission inventory; a multi-year (2003–2009) consistent high-resolution European emission inventory for air quality modelling, Atmos. Chem. Phys., 14, 10963–10976, https://doi.org/10.5194/acp-14-10963-2014, 2014. a
Liu, X., Ou, J., Wang, S., Li, X., Yan, Y., Jiao, L., and Liu, Y.: Estimating spatiotemporal variations of city-level energy-related CO2 emissions: An improved disaggregating model based on vegetation adjusted nighttime light data, Journal of Cleaner Production, 177, 101–114, 2018. a
Lopez, M., Schmidt, M., Delmotte, M., Colomb, A., Gros, V., Janssen, C., Lehman, S. J., Mondelain, D., Perrussel, O., Ramonet, M., Xueref-Remy, I., and Bousquet, P.: CO, NOx and 13CO2 as tracers for fossil fuel CO2: results from a pilot study in Paris during winter 2010, Atmos. Chem. Phys., 13, 7343–7358, https://doi.org/10.5194/acp-13-7343-2013, 2013. a, b, c
Meijer, H., Smid, H., Perez, E., and Keizer, M.: Isotopic characterisation of anthropogenic CO2 emissions using isotopic and radiocarbon analysis, Physics and Chemistry of the Earth, 21, 483–487, 1996. a
Miyazaki, K., Bowman, K., Sekiya, T., Takigawa, M., Neu, J. L., Sudo, K., Osterman, G., and Eskes, H.: Global tropospheric ozone responses to reduced NOx emissions linked to the COVID-19 worldwide lockdowns, Science Advances, 7, https://doi.org/10.1126/sciadv.abf7460, 2021. a
Napelenok, S. L., Pinder, R. W., Gilliland, A. B., Marin, R. V., Miranda, A. I., Borrego, C., Miranda, A., and Borrego, C.: Developing a Method for Resolving NOx Emission Inventory Biases Using Discrete Kalman Filter Inversion, Direct Sensitivities, and Satellite-Based NO2 Columns, in: AIR POLLUTION MODELING AND ITS APPLICATION XIX, NATO Science for Peace and Security Series Series C: Environmental Security, Springer Netherlands, Dordrecht, 322–330, ISBN 9781402084522, 2008. a
Nayagam, L., Maksyutov, S., Oda, T., Janardanan, R., Trisolino, P., Zeng, J., Kaiser, J. W., and Matsunaga, T.: A top-down estimation of subnational CO2 budget using a global high-resolution inverse model with data from regional surface networks, Environmental Research Letters, 19, 014031, https://doi.org/10.1088/1748-9326/ad0f74, 2023. a
Nguyen, D.-H., Lin, C., Vu, C.-T., Cheruiyot, N. K., Nguyen, M. K., Le, T. H., Lukkhasorn, W., Vo, T.-D.-H., and Bui, X.-T.: Tropospheric ozone and NOx: A review of worldwide variation and meteorological influences, Environmental Technology & Innovation, 28, 102809, https://doi.org/10.1016/j.eti.2022.102809, 2022. a
Oda, T., Feng, L., Palmer, P. I., Baker, D. F., and Ott, L. E.: Assumptions about prior fossil fuel inventories impact our ability to estimate posterior net CO2 fluxes that are needed for verifying national inventories, Environmental Research Letters, 18, 124030, https://doi.org/10.1088/1748-9326/ad059b, 2023. a
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., and Duchesnay, E.: Scikit-learn: Machine Learning in Python, Journal of Machine Learning Research, 12, 2825–2830, 2011. a
Peylin, P., Law, R. M., Gurney, K. R., Chevallier, F., Jacobson, A. R., Maki, T., Niwa, Y., Patra, P. K., Peters, W., Rayner, P. J., Rödenbeck, C., van der Laan-Luijkx, I. T., and Zhang, X.: Global atmospheric carbon budget: results from an ensemble of atmospheric CO2 inversions, Biogeosciences, 10, 6699–6720, https://doi.org/10.5194/bg-10-6699-2013, 2013. a
Randerson, J., van Der Werf, G., Giglio, L., Collatz, G., and Kasibhatla, P.: Global Fire Emissions Database, Version 4.1 (GFEDv4), ORNL DAAC [data set], https://doi.org/10.3334/ORNLDAAC/1293, 2017. a
Shu, Y. and Lam, N. S.: Spatial disaggregation of carbon dioxide emissions from road traffic based on multiple linear regression model, Atmospheric Environment, 45, 634–640, 2011. a
Simone, N., Stettler, M., Eastham, S., and Barrett, S.: Aviation Emissions Inventory Code (AEIC), Zenodo [code], https://doi.org/10.5281/zenodo.6461767, 2013. a
Super, I., Denier van der Gon, H. A. C., van der Molen, M. K., Dellaert, S. N. C., and Peters, W.: Optimizing a dynamic fossil fuel CO2 emission model with CTDAS (CarbonTracker Data Assimilation Shell, v1.0) for an urban area using atmospheric observations of CO2, CO, NOx, and SO2, Geosci. Model Dev., 13, 2695–2721, https://doi.org/10.5194/gmd-13-2695-2020, 2020. a, b, c
Super, I., Scarpelli, T., Droste, A., and Palmer, P. I.: Improved definition of prior uncertainties in CO2 and CO fossil fuel fluxes and its impact on multi-species inversion with GEOS-Chem (v12.5), Geosci. Model Dev., 17, 7263–7284, https://doi.org/10.5194/gmd-17-7263-2024, 2024. a
Wang, S., Cohen, J. B., Guan, L., Lu, L., Tiwari, P., and Qin, K.: Observationally constrained global NOx and CO emissions variability reveals sources which contribute significantly to CO2 emissions, npj Climate and Atmospheric Science, 8, 87, https://doi.org/10.1038/s41612-025-00977-2, 2025. a, b
Wenger, A., Pugsley, K., O'Doherty, S., Rigby, M., Manning, A. J., Lunt, M. F., and White, E. D.: Atmospheric radiocarbon measurements to quantify CO2 emissions in the UK from 2014 to 2015, Atmos. Chem. Phys., 19, 14057–14070, https://doi.org/10.5194/acp-19-14057-2019, 2019. a
Wu, D., Laughner, J. L., Liu, J., Palmer, P. I., Lin, J. C., and Wennberg, P. O.: A simplified non-linear chemistry transport model for analyzing NO2 column observations: STILT–NOx, Geosci. Model Dev., 16, 6161–6185, https://doi.org/10.5194/gmd-16-6161-2023, 2023. a
Zhao, C. and Wang, Y.: Assimilated inversion of NOx emissions over east Asia using OMI NO2 column measurements, Geophysical research letters, 36, L06805, https://doi.org/10.1029/2008GL037123, 2009. a