Articles | Volume 26, issue 1
https://doi.org/10.5194/acp-26-809-2026
https://doi.org/10.5194/acp-26-809-2026
Research article
 | 
16 Jan 2026
Research article |  | 16 Jan 2026

Beyond binary maps from HCHO∕NO2: a deep neural network approach to global daily mapping of net ozone production rates and sensitivities constrained by satellite observations (2005–2023)

Amir H. Souri, Gonzalo González Abad, Bryan N. Duncan, and Luke D. Oman
Abstract

Previous studies on net ozone production rates (PO3) and their sensitivities to precursors relied on limited in-situ data, often coarse and uncertain chemical transport models (CTMs), and ozone indicators like the formaldehyde-to-nitrogen dioxide ratio (FNR). However, FNR fails to fully capture PO3's complex relationships with pollution, light, and water vapor. To address this, we refine the satellite-based PO3 product from Souri et al. (2025) with key advancements: (i) a deep neural network to parametrize high-dimensional non-linear ozone chemistry without the need for empirical linearization of atmospheric conditions, (ii) incorporation of water vapor, (iii) improved error characterization, and (iv) the application of a finer CTM to dynamically convert column retrievals into near-surface mixing ratios. Our PO3 sensitivity maps surpass traditional FNR-based assessments by quantifying sensitivity magnitudes – factoring in photolysis rates and water vapor – with greater spatial information. Our new product provides daily near-clear sky PO3 and sensitivity maps using bias-corrected OMI (2005–2019, 0.25° × 0.25°) and TROPOMI (2018–2023, 0.1° × 0.1°), with values aligning within 10 %. High PO3 rates (> 8 ppbv h−1) appear in urban and biomass-burning regions under strong photochemical activity, including during a heatwave in the northeastern U.S. Photolysis rates are the dominant factor dictating the seasonality of PO3 magnitudes and sensitivities. The stability and long-term records of OMI retrievals (2005–2019) enable us to provide the first global maps of PO3 linear trends showing a surge of > 30 % over China, the Middle East, and India, while a reduction in the eastern U.S., southern Europe, and several regions in Africa.

Share
1 Introduction

To mitigate tropospheric ozone pollution, a pervasive trace gas that impacts human health, climate, and crop productivity (Fleming et al., 2018; Mills et al., 2018; Gaudel et al., 2018), it is essential to quantify the spatiotemporal variations of two primary components: (i) the sensitivity of the chemical net production rates of ozone (PO3) to its two main precursors, nitrogen oxides (NOx= NO + NO2) and volatile organic compounds (VOCs), and (ii) the magnitude of PO3 itself. The first component provides insights into the positive and negative contributions of these precursors to PO3, which are typically categorized as NOx-sensitive (where PO3 is influenced mainly by NOx), VOC-sensitive (where PO3 is affected primarily by VOCs), and transitional regimes (where PO3 is responsive to both NOx and VOCs) (Kleinman et al., 2002; Sillman and He, 2002; Duncan et al., 2010). The latter component is crucial for understanding how locally produced ozone, in conjunction with advected or diffused ozone, can lead to high-ozone events (e.g., Kleinman et al., 2002, 2005; Sullivan et al., 2019).

Creating global maps of PO3 and its sensitivity at spatiotemporal scales relevant to air quality policies is a challenge. Unique instruments can directly measure PO3 by calculating the difference in ozone molecules from air samples drawn through two distinct tubes – one exposed to sunlight and the other shielded by an ultra-violet (UV) filter (Cazorla and Brune, 2010; Sadanaga et al., 2017; Sklaveniti et al., 2018). However, these instruments suffer from various interferences, such as heterogeneous chemistry or photo-enhanced loss of ozone within the tubes, and they are limited to sparse super sites that restrict spatial variability. Similarly, box-model simulations of PO3, which are observationally constrained by intensive atmospheric composition measurements, are also limited by sparse aircraft sampling (Cazorla et al., 2012; Ren et al., 2013; Mazzuca et al., 2016; Souri et al., 2020a; Schroeder et al., 2020; Brune et al., 2022; Wolfe et al., 2022; Souri et al., 2023a). Currently, our understanding of the global spatiotemporal variability of PO3 mainly relies on chemical transport models, which can possess significant uncertainties such as those associated with transport, emissions, and dry deposition. Moreover, they may lack the spatial resolution necessary to capture the non-linear dynamics associated with NOx and thus, ozone chemistry (Valin et al., 2011; Vinken et al., 2011; Yu et al., 2016).

The “gold standard” approach to determine three-dimensional PO3 within a process-based framework involves running a high-resolution chemical transport model, with prognostic inputs constrained by observations. This approach falls into the realm of an inversion/data assimilation framework (Bocquet et al., 2015). Numerous studies have aimed to constrain various model prognostic inputs, including NOx and VOCs emissions and/or concentrations, using aircraft and satellite remote sensing retrievals (e.g., Stavrakou et al., 2009, 2016; Souri et al., 2016; Bauwens et al., 2016; Miyazaki et al., 2020; Opacka et al., 2025). Notably, Souri et al. (2020b) developed a non-linear joint inversion of NOx and VOCs to better constrain PO3, thereby shedding light on the impact of recent emission regulations in East Asia on the different chemical pathways governing the formation and loss of surface ozone. However, these studies face a fundamental challenge: discrepancies between simulated fields and observations are often blamed solely on emissions. In fact, such discrepancies can also stem from various model components, including chemical mechanisms, dry deposition, photolysis rates, vertical diffusion, and transport. Given the limited observations available for constraining all of these uncertain parameters, the optimization problem becomes grossly under-determined. This means we lack sufficient information to uniquely determine the optimal values of these parameters altogether. Additionally, the underlying physics of these models is inherently uncertain, necessitating the explicit propagation of model physics errors into our final estimates or the execution of ensemble model realizations to stochastically vet the credibility of the top-down estimates across different realizations. Conducting these ensemble optimizations at fine-scale grid boxes around the globe is prohibitively computationally intensive.

At the expense of sacrificing the full capability of a physics-based model, we can take advantage of a statistical approach to predict PO3 using several observable variables with improved computational efficiency. Chatfield et al. (2010) made an early effort to parameterize the gross production of ozone via NO + HO2 through a multivariable power law function that depended on formaldehyde (HCHO), nitrogen dioxide (NO2), UV photolysis rates, and ambient temperature. Their model successfully reproduced over 60 % of the variance observed in the ozone gross production rates. Souri et al. (2023a) introduced a bilinear equation based on HCHO × NO2 and HCHO/NO2, which explained more than 80 % of the variance in simulated PO3. Building on these findings, Souri et al. (2025) developed a regularized piece-wise linear regression to parameterize PO3 using retrospective aircraft observations and a set of variables, including HCHO/NO2, HCHO, NO2, jO1D (photolysis frequency for O1D +hv), and jNO2 (photolysis frequency for NO2+hv). Their algorithm successfully reproduced over 90 % of the variance in observationally-constrained PO3 with minimal biases across moderately to extremely polluted regions.

These parameterizations present a unique opportunity to globally map PO3, as their primary inputs can be largely constrained by well-characterized satellite retrievals with extensive horizontal coverage (Gonzalez Abad et al., 2019). For this reason, Souri et al. (2025), compiled various satellite observations including TROPOspheric Monitoring Instrument (TROPOMI) surface albedo, HCHO, and NO2 columns in conjunction with pre-computed model fields to populate the inputs to their parametrization, allowing them to generate the first-ever maps of PO3 worldwide. Because their algorithm had an explicit mathematical form, they were also able to break down PO3 into HCHO and NO2 contributions, providing much more detailed spatial information about ozone sensitivity maps compared to binary information (i.e., NOx-sensitive or VOC-sensitive) made from HCHO to NO2 ratios (known as formaldehyde to nitrogen dioxide ratios – FNR) (Martin et al., 2004; Duncan et al., 2010; Choi et al., 2012; Choi and Souri, 2015a, b; Jin et al., 2017; Schroeder et al., 2017; Souri et al., 2017; Jeon et al., 2018; Tao et al., 2022; Johnson et al., 2024). However, FNR was a central component of their algorithm to transform the non-linear ozone chemistry into several linear segments (i.e., a piecewise regression).

The inclusion of FNR in Souri et al. (2025) introduces several complications, such as (i) the amplification of unresolved systematic and random errors in satellite retrievals associated with PO3 estimates, and (ii) discounting the dependency of PO3 sensitivity to HCHO and NO2 concentrations as function of available light and water vapor. In fact, FNR does not provide useful information about ozone chemistry in less photochemically active environments, such as early morning or late afternoon conditions (known as light-limited or radical-limited conditions). Although the parametrization of PO3 crafted in Souri et al. (2025) relied on photolysis rates, the sensitivity of PO3 to NO2 (a proxy for reactive nitrogen) and HCHO (a proxy for VOC reactivity) did not directly depend on photolysis rates.

The overarching goal of producing ozone chemistry sensitivity maps is to inform regulatory agencies about the impact of emission reductions on locally produced ozone. Unlike conventional FNR-based binary maps, these maps must quantify the magnitude of sensitivity rather than merely indicating its direction. This quantitative approach is essential because both the sign and magnitude of sensitivities are crucial for understanding the impact of emission changes. While detailed sensitivity maps can be derived from chemical transport models by perturbing underlying emissions, the lack of observational constraints on these models can introduce significant biases. Souri et al. (2025) attempted to address this limitation by providing magnitude-dependent sensitivity maps of PO3 to NO2 and HCHO using piecewise linear regression. However, their approach yielded derivatives of PO3 with respect to NO2 and HCHO that remained invariant with changes in light and humidity conditions. This limitation is problematic because reduced light conditions are known to substantially dampen the sensitivity of PO3 to NOx and VOCs, even under identical emission rates. The current work is therefore motivated by the need to capture the complex, multidimensional dependencies of PO3 on ozone precursors, light intensity, and humidity using a more flexible data-driven approach through a machine learning algorithm without the need for segregation or linearization. While these maps will not replace process-based chemical transport model experiments, they can efficiently provide first-order assessments to: (i) strategize improved modeling experiments, (ii) gauge the added value of satellites on predictions of PO3, and (iii) guide the design of sub-orbital missions in regions with poorly documented elevated PO3.

The new product of PO3 along with spatially varying ozone sensitivity maps using bias-corrected OMI and TROPOMI retrievals are generated globally for 2005–2023. We will document the advantages of this algorithm over the older one and how the new results can bring fresh insights into PO3 behavior across various seasons, locations, and global trends.

2 Data

2.1 Satellite Retrievals

2.1.1 TROPOMI HCHO and NO2

We use daily level-2 (L2) products of TROPOMI (v2.4–v2.5) tropospheric NO2 and total HCHO columns (v2.4–v2.6) obtained from UV-Vis radiances ( 328–496 nm) onboard the European Space Agency's (ESA's) Sentinel Precursor (S5P) spacecraft with an equatorial overpass time of  13:30 local standard time (LST) (Veefkind et al., 2012; van Geffen et al., 2022; De Smedt et al. 2021). These products offer near-daily global coverage of NO2 and HCHO columns at a horizontal resolution of 7.2 km (reduced to 5.6 km after August 2019) by 3.6 km at nadir, extending to approximately 14 km at the edges of the scanline, with a swath width of 2600 km. The data products used in this study span from May 2018 to the end of 2023. The retrieval process follows a two-step framework: first, a differential spectral fitting algorithm is used to determine the number of integrated molecules along the slant light path, and second, air mass factor calculations are done based on simulated gas absorber profiles and radiative transfer model calculations to convert slant columns into vertical ones.

Both products have been thoroughly vetted against ground-based remote sensing retrievals, including the multi-axis differential optical absorption spectrometer (MAX-DOAS) (De Smedt et al., 2021; Verhoelst et al., 2021; van Geffen et al. 2022; Souri et al., 2025) and Fourier transform infrared spectroscopy (FTIR) (Vigouroux et al., 2020; Souri et al., 2025), showing a general tendency towards underestimation in polluted regions. We include in our study only pixels with a quality flag (q_value) exceeding 0.5 and 0.75 for HCHO and NO2 products, respectively. The quality flag encapsulates errors coming from clouds, snow, surface refractivity, and algorithm performance. The selected values are based on the user manual recommendation. The daily HCHO and NO2 columns, along with the retrieval errors, are mapped onto a 0.1° × 0.1° global grid using a mass-conserved bilinear interpolation approach described in Souri et al. (2024).

2.1.2 OMI HCHO and NO2

We use the Quality Assurance for the Essential Climate Variables (QA4ECV) NO2 daily Level 2 product (Boersma et al., 2018) which is based on global radiances captured by the Ozone Monitoring Instrument (OMI) sensor aboard NASA's Aura spacecraft. This product is retrieved with a similar overpass time as TROPOMI. The horizontal resolution of the product ranges from 13 × 24 km2 at nadir to 165 × 13 km2 at the edge of the scanline. It relies on OMI Collection 3 radiance data. Since 2008, OMI has faced significant anomalies resulting in the loss of reliable data in areas of its detector, a situation referred to as the “row anomaly.” This has led to inconsistent spatial resolution and global coverage throughout its operational phase. However, the unaffected pixels have demonstrated a high level of stability over the past two decades, making this product suitable for long-term trend analysis. Detailed description of the retrieval algorithm, along with validation against ground remote sensing data, can be found in Boersma et al. (2018), Compernolle et al. (2020), and Pinardi et al. (2020). We include good quality pixels based on an effective cloud fraction below 50 %, a quality processing flag parameter equal to zero, and exclusion of snowy regions. Additionally, we discard the last two rows of the detector because of their poor horizontal resolution. We use the OMI NO2 product for the period from 2005 until the end of 2019.

We also use the OMI Smithsonian astrophysical observatory (SAO) daily HCHO Level 2 product from the same sensor, which is generated using a newly developed algorithm and Collection 4 OMI radiances (Ayazpour et al. 2025; Nowlan et al., 2023). This improved algorithm enhances the radiance information content used to retrieved HCHO columns, significantly reducing noise in the slant column fit. The stability of this product in extracting new information related to long-term global trends of HCHO has been well demonstrated in recent studies (Souri et al., 2024; Anderson et al., 2024). We include only good data following the quality flag provided with the dataset along with effective cloud fraction below 40 %. Both OMI products are mapped onto a global grid with a resolution of 0.25° × 0.25° using the same algorithm used for TROPOMI daily.

2.1.3 Bias correction using ground-based remote sensing data

In order to remove large biases in both TROPOMI and OMI products, we bias correct their columns using the offset (additive term) and slope (multiplicative term) determined from a linear fit to paired MAX-DOAS/FTIR and these datasets, as described by Souri et al. (2025). The rationale for defining retrieval biases as a function of magnitude is to enhance correction factor generalizability across seasons and locations. We take advantage of three studies characterizing the bias correction factors, listed in Table 1. The application of these correction factors yields consistency across OMI and TROPOMI NO2 and HCHO columns within 10 % (Sect. 4.4.4)

Table 1The slopes and offsets derived from various validation studies used to bias correct the satellite retrievals employed in the parameterization of PO3.

Download Print Version | Download XLSX

2.1.4 Surface albedo

To estimate near-surface photolysis rates of jO1D (O3+hv, < 350 nm) and jNO2 (NO2+hv,  400–500 nm) used in the parametrization of PO3, we are required to provide reasonable surface albedo estimates (Sect. 2.4). We use a monthly Directionally Dependent Lambertian-Equivalent reflectivity (DLER) climatology derived from TROPOMI radiances at the spatial resolution of 0.125° × 0.125°; the product is in good agreement with the MODIS BRDF product (Tilstra et al., 2024). This climatology has two sets of values for both shortwave (328 nm) and longwave UV (463 nm) that are used separately for calculating jO1D and jNO2, respectively. We use only the isotropic part of the DLER product (named minimum_LER), which is added to an offset coefficient provided with the dataset.

2.2 Aircraft Measurements

The use of aircraft observations is twofold: first, they provide a vast number of measured geophysical variables suitable to simulate our observationally-constrained PO3 training dataset (Sect. 3.1); second, they enable a rigorous validation of column-to-planetary boundary layer (PBL) conversion factors derived from a chemical transport model (Appendix B). We use the dataset compiled by Souri et al. (2025), who curated various aircraft campaigns measuring photolysis rates, meteorological variables, and atmospheric composition from varying atmospheric conditions, including urban/suburban settings (DISCOVER-AQs, and KORUS-AQ), high-vegetated regions (SENEX), and remote areas (INTEX-B and AToms). The sampling frequency varies from 10 to 30 s. More detailed information regarding the choice of instrument, gap filling, and data exclusion can be found in Souri et al. (2025).

2.3 MINDS simulations

We use a global chemical transport model simulation designed to support trace gas retrievals. The simulation, called Multi-Decadal Nitrogen Dioxide and Derived Products from Satellites (MINDS) (Fisher et al., 2024), was generated using the Goddard Earth Observing System (GEOS) Earth system model (Molod et al., 2015; Nielsen et al., 2017) equipped with the full chemistry Global Modeling Initiative (GMI) mechanism (Duncan et al., 2007; Strahan et al., 2007) and coupled with the Goddard Chemistry Aerosol Radiation and Transport (GOCART) aerosol module (Chin et al., 2002). The rapid radiative transfer model, which was designed for global climate models (GCMs) and is known as the Radiative Transfer Module for GCM (RRTMG), calculates the longwave and shortwave radiation influenced by aerosols simulated by GOCART, enabling the incorporation of the direct effects of aerosols on meteorological conditions (Nielsen et al., 2017). Meteorology is resolved using GEOS with several prognostic inputs, including water vapor, being constrained by MERRA-2 reanalysis using “replay” mode at 3-hourly basis (Orbe et al., 2017). The model is setup at c360 grid (0.25° × 0.25°) and covers the period of 1993 until the end of 2023. The model follows 72 hybrid sigma values ranging from the surface to 0.01 hPa.

Lightning production of NO is parametrized based on the simulated convection. The model uses the Monitoring Atmospheric Chemistry and Climate and CityZen (MACCity) inventory (Granier et al., 2011) of anthropogenic emissions downscaled to 0.1° × 0.1° using the Emissions Database for Global Atmospheric Research version 4.2 (EDGAR 4.2). These anthropogenic emissions change by year and month. Biomass burning emissions rely on the Fire Energetics and Emissions Research (FEER) dataset (Ichoku and Ellison, 2014). Biogenic emissions are modeled interactively by the Model of Emissions of Gases and Aerosols from Nature (MEGAN) v2.1 (Guenther et al., 2012). It is known that isoprene emissions in MEGANv2.1 are largely overestimated (Bauwens et al., 2016; Souri et al., 2020b), therefore they are scaled down by a factor of two.

2.4 TUV NCAR Photolysis Rates Look-up Table

To estimate jNO2 and jO1D, we refer to a detailed look-up table provided by the Framework for 0-D Atmospheric Modeling (F0AM) model (Wolfe et al. 2016). This table is developed for clear-sky conditions based on over 20 064 solar spectra calculations. The data encompasses a broad spectrum of solar zenith angles (SZA) from 0 to 90° in 5° increments, altitudes ranging from 0 to 15 km in 1 km steps, overhead total ozone columns from 100 to 600 DU in increments of 50 DU, and surface UV albedo values from 0 to 1 in 0.2 increments. These calculations were carried out using NCAR's Tropospheric Ultraviolet and Visible radiation model (TUV v5.2), along with cross sections and quantum yields from IUPAC and JPL (Wolfe et al., 2016). Information on SZA and surface elevation is obtained from the L2 TROPOMI/OMI granule data. Surface albedo is based on the TROPOMI DLER climatology (Sect. 2.1.4). The overhead total ozone columns are derived from MINDS simulations (Sect. 2.3). For any values that fall between the entries in the tables, we apply a linear interpolation method.

2.5 Empirical PO3 estimates using LASSO

We will compare our new product (Sect. 3.2) to an empirical method developed by Souri et al. (2025), who took advantage of simulated PO3 data constrained by aircraft measurements to parameterize PO3 using four geophysical variables: NO2, HCHO, jNO2, and jO1D. Their algorithm used a piecewise L1-regularized linear regression model known as Least Absolute Shrinkage and Selection Operator (LASSO). Since the algorithm was based on a linear model which was ill-suited for the non-linear ozone chemistry, it was necessary to linearize the parameterization using various thresholds for FNRs. Despite the method's simplicity, Souri et al. (2025) were able to reproduce approximately 88 % of the variance with low biases (less than 20 %) in observationally-constrained PO3. Using the empirical method, they generated the first maps of PO3 by combining bias-corrected TROPOMI HCHO and NO2 columns, simulated photolysis rates, and a global transport model designed for the conversion from column measurements to the PBL.

To isolate the performance of the PO3 estimator used in Souri et al. (2025) in comparison to the proposed algorithm in this study, we will ensure that the input variables, including the mixing ratios of HCHO and NO2 within the PBL as well as the photolysis rates, remain identical for both the empirical product and our new algorithm. Hereafter, we will refer to this empirical product as “PO3LASSO”.

3 Methodology

Figure 1 illustrates the three-stage process of our newly developed algorithm to operationally produce long-term maps of PO3 within the PBL along with the sensitivity and error maps. The product is called “PO3DNN”.

  • Stage I. This stage serves as the foundation for the product, focusing on parameterizing PO3 using a regularized Deep Neural Network (DNN). The training dataset, detailed in Sect. 3.1, is derived from an observationally-constrained F0AM box model that provides simulated PO3 along with various atmospheric quantities directly or indirectly constrained by aircraft measurements. The decision to make use of aircraft data is based on two main factors: (i) they capture real-world atmospheric conditions across diverse parts of the atmosphere and various geographic regions, and (ii) the significant fluctuations inherent in the data rigorously test the DNN's capability to generalize (i.e., to fit the model through the data rather than merely to the data). However, a notable limitation of aircraft data is its restriction to specific atmospheric conditions. To address this, we have expanded the training dataset by perturbing the inputs to the F0AM model (Sect. 3.1), resulting in a synthetic dataset. This expanded training dataset is then used for validation, testing, and calibration of the DNN algorithm.

  • Stage II. The objective of this stage is to prepare spatiotemporal geophysical variables necessary for the prediction of PO3 (done in Stage III). We need five parameters on a global scale with daily frequency: jNO2, jO1D, HCHO, NO2, and H2O(v). To generate global daily maps of near-surface photolysis rates, we use the NCAR's look-up table as detailed in Sect. 2.4; this table relies on SZA, which varies with time and location, as well as surface UV-Vis albedo, ozone overhead columns, and surface altitudes. Both SZA and surface altitude are provided as auxiliary fields in the satellite L2 products. Ozone overhead columns are from MINDS. For surface UV-Vis albedo, we use two different wavelengths based on TROPOMI's climatology (Sect. 2.1.4). These calculations assume clear sky conditions, which are somewhat achieved by the effective cloud fraction thresholds derived from both the OMI and TROPOMI products. Our algorithm uses HCHO and NO2 columns obtained from OMI or TROPOMI, which are bias-corrected against ground remote sensing data. These measurements are then transformed into the mixing ratios in the PBL region using the vertical distribution of HCHO and NO2 profiles simulated by MINDS. The final variable is the average number of water vapor (H2O(v)) molecules per cubic meters in the PBL region at the satellite overpass time, which is obtained directly from the MINDS simulation. It is important to note that the MINDS simulation is based on constraints from MERRA-2 reanalysis, underscoring that the H2O(v) simulations are constrained by many observations.

  • Stage III. In the final stage, we predict PO3, generate sensitivity maps, and provide both systematic and random errors associated with these estimates. To create PO3 maps, we input the five parameters from Stage II into the DNN model developed in Stage I. To generate the sensitivity maps of PO3 in relation to NO2 and HCHO, we apply perturbations to NO2 and HCHO based on the methodology described in Sect. 3.3. These perturbations also serve another purpose which is to propagate the errors associated with the retrievals of HCHO and NO2, as well as their corresponding conversion factors from MINDS into the final product. A comprehensive explanation of the error budget and characterization can be found in Sect. 3.4.

https://acp.copernicus.org/articles/26/809/2026/acp-26-809-2026-f01

Figure 1Processing stages developed to operationally generate PO3 and sensitivity maps along with daily frequency errors on a global scale. Stage I aims to establish a regularized DNN model based on synthetic and real-world aircraft measurements. Stage II prepares the necessary satellite-based input features used for PO3 prediction in Stage III. Stage III feeds the DNN model with Stage II values and some statistical error analysis to generate the final product.

Download

While we perform Stage I only once to establish a PO3 estimator, we need to run Stage II and III for any desired location/time or spatial resolution. The need to operationally run these two stages has motivated us to create an open-source and object-oriented Python package called ozonerates v1.0 (Souri and Gonzalez Abad, 2025), which is capable of running all steps while leveraging parallel computation.

3.1 Training dataset generation using F0AM box model

To establish a relationship between several geophysical variables related to PO3, we use F0AM version 4 box model (Wolfe et al., 2016). This model is capable of simulating detailed chemical kinetics based on user inputs regarding meteorological variables, atmospheric compositions, and photolysis rates. F0AM uses a solver for ordinary differential equations (ODEs) designed for stiff systems, which allows it to determine the chemical evolution of all species included in the selected chemical mechanism. We adhere to previous configurations that apply the Carbon Bond 6 (CB06, r2) chemical mechanism within F0AM (Souri et al., 2020a; Souri et al., 2023a; Souri et al., 2025). The model is constrained by data collected during aircraft campaigns, including meteorological data, photolysis rates, and various trace gas concentrations. Additional details regarding the selection of instruments, bias corrections for photolysis, choices of dilution factors, and other configurations can be found in Souri et al. (2025). We incorporate data from seven aircraft campaigns, including DISCOVER-AQ (Texas, Washington D.C., Colorado), KORUS-AQ, ATOMs, INTEX-B, and SENEX, to further constrain the model. Souri et al. (2025) demonstrated that this setup effectively reproduces several unconstrained yet measured compounds, such as HCHO, HO2, OH, and PAN; moreover, the performance of the model was on par with other studies (e.g., Brune et al., 2020; Brune et al., 2022; Miller and Brune, 2022), indicating that it is a suitable model setup for understanding local ozone chemistry. This model-derived dataset consists of  134 000 points.

A limitation to the training dataset prepared by Souri et al. (2025) originates from the fact that only a subset of atmospheric conditions could be observed by the suborbital missions. A remedy for this limitation is to synthetically regenerate data by systematically perturbing several of the inputs used in the F0AM model. As a result, we apply a scaling factor, ranging from 0.1 up to 10 in 12 evenly-spaced steps, separately to NOx, VOCs, H2O(v), and photolysis rates. This expands the dataset to  6.4 million datapoints, covering a much wider range of atmospheric states.

Once the simulations are done, we determine simulated PO3 by:

(1) PO 3 = FO 3 - LO 3

where LO3 is all possible chemical loss pathways of ozone (negative stoichiometric multiplier matrix) and FO3 is all possible chemical pathways producing ozone molecules (positive stoichiometric multiplier matrix). This equation is also known as ozone tendency. This definition simplifies intercomparison with estimates derived from different chemical mechanisms by eliminating the requirement to explicitly match individual production and loss terms, which often exhibit inconsistencies across mechanisms, especially in their treatment of peroxy radicals. The calculation of PO3 is under a steady-state assumption.

3.2 DNN architecture and configuration

The overall architecture of the DNN model is portrayed in Fig. 2. The design consists of three fully-connected hidden layers each having 32 neurons. The neurons are equipped with rectified linear unit (ReLU) activation functions. The training dataset ( 6.4 millions) is split into 20 % test, 24 % validation, and 56 % training. Training inputs to the parametrization consists of HCHO, NO2, jO1D, jNO2, and H2O(v). Prior to the training, we normalize them, such that each feature (x) is rescaled according to x=x-μσx, where μ and σ represent the mean and standard deviation of the feature, respectively, ensuring a mean of zero and a variance of one. The optimization (training) of the DNN follows the backpropagation rule armed with Adaptive Moment Estimation (ADAM) optimizer which is known to perform well with noisy data (Kingma and Ba, 2017). The initial learning rate is set to 10−5. We use 500 epochs. The loss function (L) of the optimalization problem is:

(2) L = 1 2 k = 1 N ( y k - o k ) 2 + λ i = 1 p w i 2

where the first term on the right side represents the mean squares error (MSE) of the prediction derived from difference between the target PO3 (y) and the predicted PO3 (o). N represents the number of training datapoints. The second term is L2-regularization with a factor of λ to reduce the squares of p number of neuron weights (w).

https://acp.copernicus.org/articles/26/809/2026/acp-26-809-2026-f02

Figure 2The architecture of the DNN model. The model contains three hidden layers with 32 neurons each.

Download

An important aspect of this optimization is the use of L2 regularization, which effectively helped us determine the optimal number of hidden layers and neurons. L2 regularization penalizes the cost function if an illusion of high prediction accuracy (the first term) is achieved with excessive variance in the solution (weights). Failing to balance the prediction error and the solution variance can lead to overfitting, which harms model performance in two ways: (i) it results in erroneous predictions for atmospheric conditions that fall outside the training dataset; (ii) it diminishes the physical interpretability of the statistical model because of large fluctuations in the weights, a common issue in regression models known as collinearity. When we used too many neurons or layers, the regularization penalized the weights, causing a substantial proportion to approach zero (not shown), indicating that those neurons were unnecessary. However, incorporating regularization does have some drawbacks: (i) it requires a smaller initial learning rate (set to 10−5) to avoid falling into local minima, which demands more computational resources; and (ii) the regularization factor also needs to be optimized. We find that a value of λ= 10−5 provides the best results among the set of values [10−4, 10−5, and 10−6], based on the symmetry in the statistical distributions of the test residuals, MSE, and the overall level of physical interpretability observed in the sensitivity tests.

The implementation of the DNN model is done using the open-source TensorFlow application programming interface (API) package in Python (Abadi et al., 2016). To thoroughly validate the performance of this model from various angles we (i) compare the DNN prediction with the test data using various standard metrics, (ii) investigate the evolution of the loss function derived from both the training set and the validation one over epochs, (iii) study the physical explanation of the response of PO3 to NO2 and HCHO, water vapor, and photolysis rates, and (iv) finally compare the DNN results to PO3LASSO. We will use a number of statistical metrics, including the coefficient of the determination (R2), mean bias, mean square error, mean absolute error, and root mean square error (RMSE), to carry out the quantitative assessment (Sect. 4.1).

3.3 Sensitivity calculations

To elucidate the response of PO3 to its inputs, we calculate the semi-normalized sensitivities through the finite difference method:

(3) SPO 3 i = [ PO 3 ] i 110 % - [ PO 3 ] i 90 % 0.2

where [PO3]i110% and [PO3]i90% are PO3 from perturbing input parameters (i= 1 for NO2, and i= 2 for HCHO) by 1.1 and 0.9 scaling factors. A mathematical proof showing that these sensitivity calculations are equivalent to the directional derivative is provided in Appendix A.

3.4 Error budget and characterization

Since the PO3DNN integrates atmospheric models, satellite trace gas retrievals, ground remote sensing, and a machine learning approach, it contains various sources of errors, some of which will be formulated in this section. Spatially and temporally averaging satellite-based products is a common practice to reduce noise and fill gaps; therefore, we attempt to separate systematic errors (irreducible by averaging) from random ones (reducible by averaging). We assign the total PO3 within PBL region error (etotal) based on the following equation:

(4) e total = e syst 2 + e rand 2

where esyst and erand are systematic and random errors associated with PO3 estimates. Systematic errors account for the errors associated with the bias correction of OMI and TROPOMI against ground remote sensing retrievals (eHCHO_bias_c and eNO2_bias_c), the model-based conversion of columns to the PBL mixing ratios (eHCHO_conversion,eNO2_conversion), and the DNN estimator error (eDDN), and are given by:

(5)esyst=eHCHO_bias_c2+eNO2_bias_c2+eHCHO_conversion2+eNO2_conversion2+eDNN2(6)eHCHO_bias_c2=PO3HCHOγebc-HCHO2(7)eNO2_bias_c2=PO3NO2γebc-NO22(8)eHCHO_conversion2=PO3HCHOVCDHCHOeconv-HCHO2(9)eNO2_conversion2=PO3NO2VCDNO2econv-NO22

where γ is the conversion factor of the satellite total to the PBL columns translation based on MINDS and the formulation by Souri et al. (2025); ebc-HCHO and ebc-NO2, in column units, are calculated following the formulation from Souri et al. (2025) who used the errors of slope and offset obtained from the comparison of satellite VCDs to ground remote sensing benchmarks; econv-HCHO and econv-NO2 are quantified by validating the simulated conversion factors compared to those of aircraft vertical spirals (Appendix B). The unit for these two errors is ppbv per the column unit; accordingly, we multiply these terms to satellite VCDs. The last term in Eq. (5) is a fixed systematic error associated with the DNN estimates which will be quantified based on the MSE of the DNN prediction. Both PO3HCHO and PO3NO2 are derived from the sensitivity calculations from Eq. (3) divided by the satellite columns. All error terms in Eqs. (6)–(9) are spatially and temporally invariant, but the derivatives vary from pixel to pixel resulting in spatiotemporally-varying systematic errors.

Random errors originate from the uncertainty estimates coming with the TROPOMI and OMI L2 products and are somewhat reducible by averaging, and are given by:

(10) e rand = PO 3 HCHO γ e rand-HCHO 2 + PO 3 NO 2 γ e rand-NO 2 2

where erand-HCHO and erand-NO2 are random retrieval errors. All terms in Eq. (10) vary by time and location.

Table 2 summarizes the numbers used in the above equations and their origin.

Table 2Values used in error calculations.

Download Print Version | Download XLSX

It is important to acknowledge that the defined total error budget here is only a good guess and optimistic. Some underlying sources of error, which are difficult to quantify, are not included. For example, errors related to the training dataset derived from the F0AM model are challenging to assess because of the lack of PO3 measurements. We assume other inputs to the PO3 parametrization, such as the monthly climatology TROPOMI surface albedo to be error-free. Additionally, all datasets used to estimate PO3 contain spatial representation errors (Souri et al., 2022), which are difficult to measure without knowing their true state of global spatial variability. Moreover, we do not consider correlated errors among HCHO and NO2 retrievals. It is worth noting that some of the inputs such as H2O(v) and the overhead ozone column have minimal biases because of MINDS simulations being observationally constrained (Fisher et al., 2024; Souri et al., 2024).

There are also assumptions regarding the equations mentioned earlier. For instance, it is assumed that the validation of conversion factors can account for all systematic issues related to the vertical distribution of NO2 and HCHO in MINDS. Furthermore, we presume that the reported retrieval errors are mostly random; however, this is not the case (Eskes and Boersma, 2003; Boersma et al., 2018) and distinguishing between these errors is not straightforward.

Another source of uncertainty arises from partially cloudy pixels and aerosols, which can introduce errors in calculated photolysis rates. While we successfully filtered out cloud cover and strong aerosol loadings (e.g., from wildfires) using effective cloud fraction thresholds, some aerosol or cloud-contaminated pixels may pass cloud screening due to low optical depth or height characteristics. Rigorously quantifying the errors coming from these effects would require running a radiative transfer model with detailed three-dimensional optical properties of clouds and aerosols on a global scale, particularly critical for aerosols, which can have complex effects on photolysis rates depending on their absorption and scattering properties and vertical distribution. Unfortunately, such comprehensive datasets are typically limited to the narrow swaths of spaceborne lidar observations, which themselves carry substantial uncertainties (Thorsen and Fu, 2015). While these complications cannot be entirely avoided, particularly for aerosol effects, users can apply additional quality control measures by filtering pixels using aerosol optical depth retrievals from TROPOMI, OMI, or other sensors to more rigorously identify contaminated observations.

In case of oversampling of the PO3 product both temporally and spatially, the total error will be given by:

(11) e total _ oversampled = 1 m e syst 2 + 1 m 2 e rand 2

where m is the total number of samples. Equation (11) suggests that the systematic errors are persistent across all samples and are not reducible by averaging, whereas the random errors become smaller by root square of samples. In this equation, the assumption is that the root-mean-square of the systematic errors is a good approximation of the systematic errors in the oversampled data because they are independent of each other.

4 Results and Discussion

In this section, we begin by validating and contrasting PO3DNN against PO3LASSO. Following that, we use OMI to investigate the spatiotemporal variability of PO3 and its sensitivity to photolysis rates, HCHO, and NO2 globally. We provide an application of data to understand the global long-term trends PO3 derived from OMI. Afterward, we offer a comprehensive global view of the PO3 estimates algorithm by integrating data from the TROPOMI compared with that one based on OMI. Finally, we document the total error budget of the products.

4.1 DNN performance

We investigate the predictive power of the DNN algorithm against both validation and test data for each air quality campaign or the entire aircraft dataset (Sect. 2.2). All training datasets described in Sect. 3.1 are used in this stage. Except for the early stages of training, both training and validation curves, explaining the evolution of the prediction against the number of epochs corresponding to the number of iterations of training the network for one cycle, closely follow each other, indicating that we possibly do not have overfitting issues (Fig. S11 in the Supplement). The curves are fairly smooth, resulting from using the ADAM optimizer with a strictly small learning rate initially. Both curves converge to RMSE below 0.88 ppbv h−1 which we use to assign the error of PO3DNN prediction in Eq. (5).

PO3DNN has promising skill at predicting PO3 across various atmospheric conditions. Figure 3 presents a comparison of the predicted PO3 values against observationally-constrained F0AM values for the test data for each suborbital mission. A similar comparison, which includes all data points measured during each mission, can be found in Fig. S12. The primary reason for highlighting the test data is that they have never been used to fine-tune the DNN parameters. There is a strong correlation between the predictions and the benchmarks across most campaigns for both the test data points (Fig. 3) and the complete set of aircraft measurements (Fig. S12). Notably, the slope for the “All” test dataset is close to the unity line. The DNN algorithm can reproduce over 96 % of the variance in the test data. Similar to the approach of Souri et al. (2025), we completely exclude each suborbital mission from the training dataset and use it as an independent benchmark to evaluate the model's performance. The resulting accuracy is comparable to that achieved when 56 % of the data are used for training, indicating that the PO3 parameterization has reached a high degree of generalization (Fig. S13).

https://acp.copernicus.org/articles/26/809/2026/acp-26-809-2026-f03

Figure 3Scatterplots comparing observationally-constrained F0AM model PO3 and the predictions that were based on the DNN for the test data from each air quality campaign. The test data have never been used for hyper tuning the algorithm. “All” denotes all test data.

Download

The model performs significantly better than PO3LASSO over INTEX-B compared to LASSO (as shown in Fig. 7 in Souri et al., 2025). While the DNN's performance over the ATom campaigns is less impressive than in other areas, it still represents a considerable improvement over LASSO, which was unable to reproduce PO3 in pristine regions (R2<0.05). One key factor contributing to this improvement is the inclusion of H2O(v) in the input. Various parameters, including HOx, are known to influence PO3 in remote regions, but these factors were not included in our parametrization. The method does not artificially inflate results by introducing non-physical relationships in remote regions; the inability of the DNN to fully explain PO3 during AToms suggests that it does not force unrealistic relationships between PO3 and the inputs to completely align with the F0AM results, leaving areas for future improvement in parametrization over remote regions.

4.2 Advantages of PO3DNN over PO3LASSO

There are primarily four major benefits of PO3DNN over PO3LASSO that make the former parameterization a superior algorithm. The discussion of these advantages is as follows:

  • Higher predictive power. PO3LASSO predicted PO3 for all datapoints collected from the suborbital missions with a R2= 0.88, RMSE = 1.2 ppbv h−1, and a slope of 0.87 (Souri et al., 2025), whereas PO3DNN reproduced the exact datapoints (Fig. S12) with a R2= 0.96, RMSE = 0.7 ppbv h−1, and a slope of 1.00. Furthermore, as shown in Fig. 3, PO3DNN has a great degree of generalization for datapoints outside of the training/validation data points. Consequently, these statistics suggest that DNN is a more powerful predictor.

  • Better representation of PO3 over remote regions. One notable limitation of PO3LASSO was its inadequate representation of PO3 in remote regions, such as during the ATOMs or INTEX-B campaigns. This led Souri et al. (2025) to entirely mask PO3 estimates below 1 ppbv h−1. In these remote areas, PO3 is typically influenced by the reactions between ozone and HOx in addition to jO1D and H2O. While Souri et al. (2025) attempted to incorporate H2O into the LASSO parametrization, the algorithm assigned a zero coefficient to this parameter because of the use of the L1-regularization term. This term typically assigns a zero coefficient for a geophysical variable that is either irrelevant to the target or shows strong non-linear relationship with the target. PO3LASSO did not factor in H2O(v) because H2O(v) exhibits a non-linear relationship with PO3 – although the reaction between O1D and H2O can suppress ozone formation through the removal of O1D, it produces two molecules of OH regenerating ozone in polluted places (Bates and Jacob, 2020). Consequently, the non-linear relationship between H2O and PO3 is one that LASSO was unable to capture. While we could have addressed this by dividing the training dataset into different humidity levels (i.e., dry and humid), such an approach would have resulted in more discretization in the parametrization. Conversely, PO3DNN can consider the non-linear relationship between H2O and PO3 without the need for empirical linearization. We observe a significant improvement in predicted PO3 for both AToms and INTEX-B campaigns compared to Souri et al. (2025).

  • Diminished satellite error effects. The reliance of PO3LASSO on FNR increases the contamination of PO3 predictions from satellite random noise. This primarily occurs because satellite errors associated with HCHO and NO2 adversely influence FNR (see Fig. 12 in Souri et al., 2023a), resulting in noise in the empirical linearization approach used in PO3LASSO. Even if we assume that all inputs to the PO3LASSO parameterization, except for FNR, are error-free, the inherent randomness from choosing among four different sets of equations segregated by the noisy FNR will still feed noise into the final estimate. Although PO3DNN is inevitably influenced by satellite errors because of its dependence on HCHO and NO2 columns, it does not exacerbate these errors because it operates independently of FNR. To demonstrate this tendency, Fig. 4 shows the global PO3 random error maps induced by OMI HCHO and NO2 retrieval random errors averaged in June 2006. We use identical inputs and errors for both algorithms. Figure 4 is evidence of the diminished contamination of satellite random errors in PO3DNN as compared to PO3LASSO. The error differences tend to be larger over clean areas, because FNR random errors are higher when both HCHO and NO2 levels are small.

  • Continuity. It is known that neural networks equipped with three hidden layers can well approximate almost any high-dimensional non-linear function (Shen et al., 2021). An important superiority of PO3DNN over PO3LASSO lies in the strength of the DNN algorithm at approximating high-dimensional non-linear relationships between PO3 and HCHO (a proxy for VOCR), NO2 (a proxy for reactive nitrogen), jNO2 and jO1D (a proxy for photochemistry), and H2O. While some of these non-linearities were reasonably approximated in PO3LASSO by empirically segregating the chemical conditions using FNR, the non-linear ozone photochemistry can go beyond the dependency on VOCs and NOx levels. In fact, the relationship between PO3 and VOCs and NOx can behave non-linearly depending on the available light and water vapor as discussed in Sect. 4.3. This indicates that traditional linear models, such as those using VOCR/NOx (or HCHO/NO2) ratios, often fall short in capturing this complexity because of the continuous and non-linear nature of these relationships.

https://acp.copernicus.org/articles/26/809/2026/acp-26-809-2026-f04

Figure 4The comparison of the effect of satellite random errors in HCHO and NO2 on PO3 predictions based on PO3LASSO and PO3DNN algorithms in June 2006. The data used for generating these maps are based on OMI retrievals.

4.3 PO3DNN can capture non-linear PO3 chemistry as a function of pollution, light, and humidity

To further elaborate on the capability of PO3DNN to reasonably respond to variations in its five major parameters in a mathematically continuous fashion, we create six isopleths, each specifically designed to represent a particular atmospheric condition listed in Table 3. These isopleths are based on perturbing HCHO and NO2 in PO3DNN and are shown in Fig. 5.

Table 3Six different atmospheric conditions defined to understand the response of PO3 to HCHO and NO2 changes.

Download Print Version | Download XLSX

https://acp.copernicus.org/articles/26/809/2026/acp-26-809-2026-f05

Figure 5The contour maps of PO3 isopleth generated by PO3DNN algorithm for six different atmospheric conditions defined in Table 3. In the first subplot, blue, green, and cyan lines indicate FNR = 1.5, 2.5, and 3.5, respectively. Numbers on isopleths are in ppbv h−1.

Download

It is immediately apparent that the hyperbolic shape of the PO3 curve relative to NO2 and HCHO can be recreated by our algorithm, displaying a positive response to both HCHO and NO2 on the right and left sides of the ridgelines. This observation underscores the effective parametrization of the non-linearities in ozone photochemistry achieved through the DNN algorithm. In the subplot representing normal conditions, we overlaid three lines indicating FNR values of 1.5 (blue), 2.5 (green), and 3.5 (cyan). Souri et al. (2025) used these lines to determine various coefficients in the PO3LASSO parameterization. For instance, the derivative of PO3 with respect to NO2 was determined to be 0.14 ppbv h−1 for FNR < 1.5 but increased to 6.54 ppbv h−1 for FNR > 3.5. However, in practice, the thickness and curvature of the PO3 isopleths vary based on the prevailing atmospheric conditions, implying that the derivatives cannot consistently retain the same values across the broad range of conditions.

In bright conditions, not only do we observe a significantly accelerated response of PO3 compared to the norm at identical NO2 and HCHO concentrations, but the responses of PO3 to these two compounds also become more pronounced. Conversely, in dim conditions, both the magnitudes and responses are weaker.

These results underscore the importance of including photolysis rates in ozone sensitivity analysis, rather than relying solely on FNR in former studies. For example, a lower FNR in the morning ( 09:30 LST) compared to the afternoon may wrongly suggest that PO3 would become more sensitive to VOCs earlier in the day. However, decreased light in the morning reduces the sensitivity of PO3 to VOCs, despite a lower FNR (Sect. S1).

The contrast between dry and humid isopleths suggests that the presence of H2O(v) enhances PO3 when abundant NO2 and HCHO are present. This trend is similarly observed in the F0AM model, as depicted in Figure S4, indicating that an increase in H2O(v) over polluted regions (arbitrarily defined as HCHO × NO2> 10) increases PO3. Nonetheless, more humidity suppresses PO3 especially where VOC is limited and NO2 is elevated possibly because the generated OH molecules from O1D + H2O(v) predominantly react with elevated NO2.

Lastly, we see the highest PO3 rates recorded among all scenarios under a hypothetical condition characterized by high humidity and photolysis rates. This condition is rare in nature because large amounts of H2O(v) (0.8 × 1018 molec. cm−3) are confined to marine regions where surface reflectivity is low; nonetheless, an intuitive tendency from PO3DNN suggests that the algorithm does not create non-physical extrapolation values.

4.4 PO3 Maps and Sensitivities using OMI and TROPOMI: A General View, Long-term analysis, Intercomparisons, and Error Analysis

4.4.1 Global PO3 and Seasonality using OMI in 2005–2007

Figure 6 shows the global distribution of PO3 rates averaged over a quarter-degree in 2005–2007, using OMI HCHO and NO2 retrievals. It also includes whisker-box plots highlighting seasonal variations in PO3 for selected regions and cities. We selected the 2005–2007 timeframe for this analysis because the OMI data were free from degradation issues, including the row anomaly. The map indicates accelerated PO3 rates across heavily polluted regions, such as cities in the Middle East, Asia, the U.S., Central Europe, and Africa, aligning with what we observed in Souri et al. (2025). While some areas exhibit significant seasonal fluctuations, others show little variability throughout the seasons. Notably, the east coast of the U.S., Central Europe, China, Tehran, and Johannesburg experience peak PO3 rates in summer. This pattern is primarily attributed to enhanced photochemistry and the elevated sensitivity of PO3 to NOx, driven by increases in VOCR/NOx (Souri et al., 2025).

https://acp.copernicus.org/articles/26/809/2026/acp-26-809-2026-f06

Figure 6(center) The averaged global PO3 map at 0.25° × 0.25° in 2005–2007 based on the new algorithm. OMI data are used to populate HCHO and NO2 abundance. (margins) the whisker-box plots of PO3 seasonality over various selected regions. In the box plot, the central red line shows the median, and the top and bottom edges of the box show the 25th (q1) and 75th (q3) percentiles. The dark solid lines at the very beginning and the end of each plot show the minimum and maximum values excluding the outliers. The outliers are removed based on by any value above q3+ 1.5 × (q3q1) or below q1 1.5 × (q3q1).

The seasonal variability of PO3 in two African regions, characterized by biomass burning, exhibits an anti-correlation. This occurs because biomass burning in the northern hemisphere of Africa occurs from November to March, while the southern hemisphere in Africa experiences it from June to September (Roberts et al., 2009). Maritime Southeast Asia also shows a peak in PO3 during the biomass burning season (August–September).

Places like Mexico City, several major Brazilian cities (including Sao Paulo and Rio de Janeiro), northern India, and the southwest coast of the U.S. show minimal seasonal variability in PO3. The lack of pronounced seasonal changes may be attributed to less pronounced fluctuations in photolysis rates or substantial spatial heterogeneity in the seasonal variabilities of HCHO and NO2, resulting in reduced seasonal variations but with greater variance. Nonetheless, certain weather conditions can influence these results; for instance, monsoon flows can disperse and scavenge pollution from the northern India around July-September (David and Nair, 2013), dampening PO3. Mexico City also experiences a monsoon season in summer causing pollution to subside temporarily. The attribution of the seasonality will be discussed in the next section.

4.4.2 The attribution of PO3 seasonality

Figure 7 illustrates the sensitivity of PO3 to NO2, HCHO, and combined J-values (jNO2 and jO1D) based on Eq. (3) across the same regions and months presented in Fig. 6. The absolute values of PBL HCHO, NO2, and jNO2 are shown in Fig. S14. As shown in Appendix A, these sensitivity values are influenced by both the magnitude of the precursor and the first derivative of PO3 with respect to that precursor. Thus, the sensitivity values should be interpreted as the result of these combined effects. Moreover, these sensitivities are calculated with respect to local HCHO and NO2 concentrations rather than local emissions (unlike typical modeling experiments). Local concentrations reflect the combined influence of both local and external emissions through various physicochemical processes. We exclude water vapor from sensitivity analysis because its impact is an order of magnitude smaller than the three other factors.

https://acp.copernicus.org/articles/26/809/2026/acp-26-809-2026-f07

Figure 7The bar plots of the sensitivity of PO3 to photolysis rates (yellow bar), NO2 (red bar), and HCHO (blue bar) concentrations within the PBL over the selected regions shown in Fig. 6. These sensitivities are influenced by both the magnitude of the precursors and the first-order derivative of PO3 to the precursor, detailed in Appendix A. jNO2 values are in 1 × 10−3 s−1 units.

Download

The amplitude of photolysis rates dictates the amplitude of the sensitivity of PO3 to NO2 and HCHO. For instance, over East Coast, Central Europe, and Tehran, the first derivative of PO3 to NO2 tends to be small during colder months, primarily because of reduced photochemistry. As a result, despite significantly higher NO2 concentrations in these months, the sensitivity of PO3 to NO2 is muted. Conversely, in warmer months, the larger positive derivative of PO3 relative to NO2, driven by increased HCHO levels (shifting away from VOC-sensitive regimes) and enhanced photolysis rates, markedly increases the contributions of low summer NO2 levels to PO3. Likewise, we observe substantially higher sensitivity of PO3 to HCHO concentrations during warmer seasons. This increase is attributed to both the elevated levels of HCHO and the growing derivative of PO3 with respect to HCHO, both of which are directly influenced by enhanced photochemistry. One might argue that summer conditions should lead to a shift towards extremely NOx-sensitive regimes, resulting in a reduced first-order derivative of PO3 to HCHO. However, most polluted regions chosen for this figure are in transitional regimes during the summer, which renders PO3 fairly responsive to HCHO concentrations.

The sensitivity of PO3 to photolysis rates is dependent on pollution levels, just as its sensitivity to HCHO and NO2 concentrations is influenced by photolysis rates. This is primary reason for seeing minimal seasonality of PO3 over Mexico City, various Brazilian cities, and northern India. These minimal changes in photolysis rate sensitivities are caused by the less pronounced seasonality in both photolysis rates and pollution levels compared to other areas (Fig. S14). Souri et al. (2025) found that photolysis rates significantly contribute to the production of PO3 when there is an adequate amount of ozone precursors. This was reflected in larger (smaller) coefficients associated with photolysis rates in PO3LASSO algorithm for polluted (pristine) regions. For example, high photolysis rates over the Sahara do not significantly contribute to PO3 because of the limited availability of ozone precursors needed to initiate the ROx–HOx cycle. A notable example can be observed in Africa, where photolysis rates tend to remain consistently high throughout the year under near cloud-free conditions (Fig. S14). However, there is a marked seasonality in the sensitivity of PO3 with respect to photolysis rates during polluted months suggesting that the ample precursors can leverage available lights to form more ozone molecules. This pattern underscores the algorithm's capability to understand the intertwined relationships between the photolysis rate sensitivities and pollution levels, as well as the pollution sensitivities and photolysis rates.

4.4.3 Global PO3 linear trends using OMI (2005–2019)

Using the linear trend calculation method outlined by Souri et al. (2024), we compute global long-term linear trends of PO3 from OMI data, shown in Fig. 8. High-latitude regions (> 65°) are excluded due to limited photochemical activity. We observe large variability in both the signs and magnitudes of the linear trends. Predominantly positive trends occur over the Middle East, India, and China, while negative trends are mostly found in the eastern U.S., southern parts of Europe, maritime Southeast Asia, and several areas in Africa. The largest upward trend in PO3 over the U.S. occurs in oil and gas producing regions, including the Permian Basin. While various physicochemical processes beyond near-surface PO3 influence tropospheric ozone trends, the strong agreement between predominantly upward PO3 trends in Asia and the Middle East suggested by satellite-based ozone observations (Gaudel et al., 2018; Boynard et al., 2025) is noteworthy.

https://acp.copernicus.org/articles/26/809/2026/acp-26-809-2026-f08

Figure 8The linear trend maps of PO3 within PBL derived from our new algorithm using OMI in 2005–2019. Dots indicate that the trend has passed a statistical test based on the Mann–Kendall test at 95 % confidence interval.

To gather a more relative perspective, Fig. 9 shows relative PO3 trends (as percentages relative to 2005 annual averages) for regions where PO3 exceeds 0.5 ppbv h−1. The largest relative changes (> 30 %) are evident over the Persian Gulf, Chile, India, and China. Large negative values dominate over the eastern U.S. and over the central Africa (> 20 %).

https://acp.copernicus.org/articles/26/809/2026/acp-26-809-2026-f09

Figure 9Similar to Fig. 8 but percentage changes are instead shown over PO3> 0.5 ppbv h−1.

Multiple factors in our parameterization can simultaneously influence these trends, including changes in HCHO VCDs, NO2 VCDs, dynamic changes in column-to-PBL conversion factors from MINDS, water vapor, and photolysis rates. However, photolysis rate trends should be negligible because long-term changes in total overhead ozone are insignificant at midlatitudes (Fig. S2 in Souri et al., 2024), and surface albedo is based on a monthly climatology dataset. While water vapor increases over time in response to global warming (Souri et al., 2024; Borger et al., 2022), these changes are insufficient to explain the large variability in PO3 linear trends over polluted regions. Accordingly, simultaneous changes in HCHO and NO2 boundary layer mixing ratios are the main drivers of PO3 trends.

The PO3 trends are generally explained by changes in ozone precursor concentrations which are mapped in Figs. S15 and S16. The attribution of trends in OMI HCHO and NO2 have been partly discussed in Souri et al. (2024) and the references therein. Increases in both HCHO and NO2 over the Middle East, India, and China drive rising PO3 over time. Conversely, reduced HCHO and NO2 concentrations over parts of Africa, the eastern U.S., and maritime Southeast Asia, have led to PO3 reductions. However, many localized areas exhibit strong non-linearity. For instance, Tehran (Iran) shows positive PO3 trends caused by NO2 increases in a predominantly VOC-sensitive regime, reducing ozone loss through NO2+ OH reactions. Los Angeles (USA) shows upward trends attributed to rapid NO2 reductions, resulting in the opposite effect (Sect. S2)

The quantitative characterization of these trends (similar to our analysis of PO3 seasonality in Sect. 4.4.2 or rapid PO3 changes during a heatwave in Sect. S3) presents significant challenges for several reasons: (i) the amplitudes of these trends are generally an order of magnitude smaller than seasonal changes, requiring more stringent attribution methods, (ii) the sensitivities of PO3 to input parameterization can behave non-linearly, making a linear trend analysis ill-suited for some localized areas, and (iii) changes in ozone precursors have effects on the sensitivity of PO3 to photolysis rates as described in Sect. 4.4.2, introducing a convoluted problem.

Since our PO3 parameterization encapsulates non-linear and interdependent relationships between pollution levels, light intensity, and water vapor, fully isolating individual effects on PO3 trends requires reproducing the product while holding either NO2 or HCHO constant individually and allowing others to evolve over time (an approach similar to modeling experiments in Souri et al., 2024). This approach comprehensively captures the non-linear dependencies between input variables and PO3, circumventing the need for crude linear approximations.

4.4.4 High resolution TROPOMI-based PO3 maps contrasted with OMI in 2019

Accelerated rates of PO3 at approximately 13:30 LST are observed consistently across polluted midlatitude regions characterized by high photolysis rates. This pattern is substantiated by the global PO3 maps derived from TROPOMI and OMI data for the year 2019 illustrated in Fig. 10. While the maps presented are averages for 2019, significant PO3 hotspots (exceeding 8 ppbv h−1) are identified over metropolitan/industrial areas including Mexico City (Mexico), Tehran (Iran), the Persian Gulf, and Hunan Province (China). There are less documented regions undergoing elevated locally-produced ozone such as Johannesburg (South Africa), Rio de Janeiro (Brazil), Sao Paulo (Brazil), and Santiago (Chile). In contrast, Europe emerges as a region with comparatively low PO3 levels despite its dense population. This tendency may be attributed to lower photolysis rates (characterized by high solar zenith angles and low surface reflectivity) as well as effective emissions mitigation strategies. A notable similarity exists between these identified hotspots and those reported by Souri et al. (2025), although the contrast between clean and polluted areas is more pronounced in the PO3DNN product because of an improved representation of PO3DNN in clean regions.

https://acp.copernicus.org/articles/26/809/2026/acp-26-809-2026-f10

Figure 10Global maps of PO3 derived from TROPOMI (top) and OMI (bottom) datasets based on the PO3DNN algorithm in 2019. These values are estimated within the PBL region at  13:30 LST. The data exclude cloudy pixels, strong smoke, sensor anomalies, and snow based on the recommended quality flags coming with TROPOMI and OMI products.

PO3 exhibits a slight negative value over oceanic and densely forested areas (such as the Amazon and Congo), primarily because of ozone sinks associated with water vapor (H2O(v)) and alkenes, which are implicitly included in our parametrization. However, a marked contrast is observed between the slightly negative and positive PO3 levels along marine vessel pathways. These ship paths are informed not only by remote sensing data (Georgoulias et al., 2020) but also by the conversion of column measurements to PBL mixing ratios thorough the MINDS simulation, which accounts for ship emissions. Given that the PBL is typically shallow over marine regions, the conversion factor is expected to be substantial for these pathways, resulting in a pronounced contrast in pollution levels.

The finer spatial resolution of the TROPOMI dataset enhances the detail of the PO3 maps compared to those derived from OMI, yielding less noise and fuller data. This reduction in gaps in TROPOMI-based PO3 is attributed to a lower likelihood of cloud contamination and the full coverage of all pixels in the detector, in contrast to OMI, which suffers from the row anomaly. Visual analysis of the two datasets indicates that TROPOMI often shows higher PO3 than OMI over polluted regions. Except for NO2 and HCHO VCDs, the inputs to the parametrization are identical across both products.

To further investigate these differences, we synchronize the TROPOMI datasets at the OMI-based spatial resolution and produced scatterplots, as displayed in Fig. 11. The correspondence between the two products is high (R2= 0.86). Nonetheless, TROPOMI-based PO3 levels are approximately 10 % greater than those derived from OMI. The fact that we observe this overestimation given that TROPOMI has been coarsened to match OMI's footprint suggests that the differing spatial resolutions (0.25° vs. 0.1°) are unlikely to account for the discrepancy. Moreover, we undertake a comparative analysis of NO2 and HCHO mixing ratios within the PBL region as obtained from MINDS alongside these two satellite datasets. Given that the conversion factor remains consistent between the two products, any observed differences can be attributed to variations in their respective VCDs. Our analysis reveals that both NO2 and HCHO mixing ratios are higher in TROPOMI relative to OMI (by 5 %–6 %), thereby providing a good explanation for higher TROPOMI-based PO3 in comparison to OMI.

https://acp.copernicus.org/articles/26/809/2026/acp-26-809-2026-f11

Figure 11Scatterplots of (left) OMI PO3 vs. TROPOMI PO3, (middle) OMI PBL NO2 vs. TROPOMI PBL NO2, and (right) OMI PBL HCHO vs. TROPOMI PBL HCHO based on 2019. We coarsen TROPOMI dataset to match OMI's spatial resolution to remove the effect of spatial footprint on these results.

Download

4.4.5 Error Analysis

Based on the formulation outlined in Sect. 3.4, we evaluate both the systematic and random error components of PO3 for July 2019, based on data from both OMI and TROPOMI retrievals. Figure 12 presents the average error values for the month. Total PO3 errors range from 25 % to 80 % in areas characterized by moderate to extreme pollution, while in more remote regions, errors can surpass 200 %.

https://acp.copernicus.org/articles/26/809/2026/acp-26-809-2026-f12

Figure 12The maps of total error, systematic, and random errors for (a) OMI, and (b) TROPOMI computed for July 2019.

On average, random errors constitute only a small fraction of the total error budget, with OMI showing consistently larger random errors than TROPOMI across the region. This is primarily a result of OMI's limited sampling caused by row anomaly issues. As mentioned in Sect. 4.2, these random errors are significantly lower when compared to the PO3LASSO random errors (Souri et al., 2025).

Systematic errors account for most of the total error, exceeding 90 %. These systematic errors are comprised of three components: biases arising from the correction of VCDs using ground-based remote sensing data, errors related to DNN predictions, and conversion factors derived from the MINDS framework. The first two components contribute minimally to the overall error (less than 5 %), making the MINDS conversion factors the dominant contributor to the total error budget. Therefore, any parametrization aimed at converting satellite-based VCDs to near-surface concentrations, including the one presented in this study, should always seek out a model that accurately reflects the shape of the profiles.

We also quantify the impact of inconsistent shape factors used in the retrievals and the MINDS profile on PO3 estimates and find them introducing systematic errors of 5 %–25 % over PO3>0.5 ppbv h−1 (Figs. S17–S20). Refining TROPOMI and OMI products with MINDS shape factors would require reproducing several large-scale validation efforts (e.g., Verhoelst et al., 2021; Vigouroux et al., 2020; Pinardi et al., 2020; Ayazpour et al., 2025), which is beyond the practical scope and resources of this study.

4.4.6 Beyond binary maps obtained from FNR: Ozone sensitivity maps using high-resolution TROPOMI data

We explore the spatially varying sensitivity of PO3 to HCHO and NO2 worldwide using TROPOMI. These maps provide finer information compared to binary maps obtained from FNRs. Figure 13 illustrates global maps of these sensitivities averaged for the year 2019. We observe negative sensitivity values of PO3 to NO2 in urban areas, which aligns with our understanding of non-linear ozone chemistry. These negative values are particularly pronounced in northern China, where VOCR/NOx ratios remain low throughout the year. Similar non-linear feedback patterns can be seen in the Benelux region and the United Kingdom, primarily driven by elevated NO2 levels. In contrast, NO2 significantly contributes to higher PO3 in southern China, India, Mexico, and several regions across Africa.

https://acp.copernicus.org/articles/26/809/2026/acp-26-809-2026-f13

Figure 13The sensitivity of PO3 to NO2 (top) and HCHO (bottom) based on our algorithm using TROPOMI data in 2019.

As indicated in Souri et al. (2025), the influence of HCHO on PO3 is largely governed by NOx emissions. This relationship explains why the sensitivity of PO3 to HCHO closely mirrors global NO2 levels, which dictates the locations of VOC-sensitive regimes. We observe slightly negative sensitivity of PO3 to HCHO in remote and densely vegetated regions, likely a result of the effects of alkenes on ozone. However, the implicit nature of DNN makes it challenging to identify the exact chemical reasons behind these patterns. Noteworthy examples of areas where PO3 is significantly influenced by HCHO include eastern China, Los Angeles (USA), Tehran (Iran), Mexico City (Mexico), and Johannesburg (South Africa).

Figure 14 presents the maps of PO3 along with sensitivities across four seasons in 2019 over Middle East, derived from TROPOMI data. Notably, PO3 values surge during the summer months in several densely populated and industrial regions of the Middle East. Furthermore, we observe considerable PO3 values in the fall, primarily caused by the influence of HCHO. This fall peak is consistent with the observations made by Souri et al. (2025), who reported a sharp rise in PO3 in late fall 2019 over Tehran (Iran). The overall seasonality of PO3 is well aligned with the discussions presented in Sect. 4.4.1. The sensitivity of PO3 to NO2 exhibits notable variation, shifting from low and negative values during the colder months to positive and high values in the warmer months. We identify HCHO as the predominant contributor to PO3 in these regions, as the majority of these cities fall in VOC-sensitive environments and emit significant amounts of anthropogenic HCHO, whether from primary or secondary sources.

https://acp.copernicus.org/articles/26/809/2026/acp-26-809-2026-f14

Figure 14The magnitude of PO3 and the corresponding sensitivity to NO2 and HCHO over Middle East grouped into four different seasons. DJF: December-January-February, MAM: March-April-May, JJA: June-July-August, and SON: September-October-November. Sens. means sensitivity.

These maps eliminate the need for binarization of chemical conditions, as they effectively illustrate the spatial variability in ozone response to HCHO and NO2 while accounting for light and humidity, two important dimensions missing in FNR-based ozone sensitivity diagnosis. A more detailed discussion about FNR's inability to fully describe ozone chemistry is documented in Sect. S1.

5 Summary

Early data-driven analyses of ozone chemistry sensitivity primarily relied on “ratio-based” indicators to partially linearize the non-linear aspects of urban ozone chemistry, which are influenced by pollution levels, light, and water vapor. With the development of more sophisticated algorithms, including machine learning techniques capable of fitting high-dimensional non-linear functions, we have shown that a highly effective parameterization of net ozone production rates (PO3) can be achieved. This approach not only eliminates the need for empirical linearization of ozone chemistry through various indicators, but it also allows for the primary inputs to be accurately constrained using satellite observations. This advancement allowed us to move beyond the previously employed formaldehyde-to-nitrogen dioxide ratio (FNR) and to generate more comprehensive sensitivity maps, which account for variations not only in HCHO and NO2 but also in light and water vapor.

We significantly enhanced the empirical parametrization of PO3 described in Souri et al. (2025), in several key ways: (i) we improved the representation of PO3 in both polluted and clean areas using a L2-regularized deep neural network (DNN) and eliminated the need for empirical linearization of atmospheric conditions with the FNR approach, resulting in reduced complexity and noise in the final estimates; (ii) we used a finer, up-to-date global transport model called MINDS to convert satellite-retrieved vertical column density (VCD) into planetary boundary layer (PBL) mixing ratios; (iii) we incorporated the error from these conversion factors, derived from comprehensive validation against aircraft spirals, into the total error budget; and (iv) we generated long-term records of PO3 magnitudes and sensitivities to nitrogen dioxide (NO2) and formaldehyde (HCHO) using bias-corrected data from the Ozone Monitoring Instrument (OMI) for the years 2005–2019 (at a resolution of 0.25° × 0.25°) and the TROPOspheric Monitoring Instrument (TROPOMI) for 2018–2023 (at a resolution of 0.1° × 0.1°). These datasets were collected under partially cloud-free conditions around 13:30 equatorial local standard time. The two products show strong agreement, with TROPOMI-based PO3 being approximately 10 % higher than OMI, which is attributed to higher NO2 and HCHO concentrations noted by TROPOMI.

The DNN algorithm (PO3DNN) accounted for more than 96 % of the variance in both the test and training datasets derived from observationally-constrained box simulations across various atmospheric composition campaigns, with a slope close to the unity line. The new algorithm improved the representation of PO3 in remote regions compared to the version developed in Souri et al. (2025), due to the inclusion of water vapor and the use of a more robust regression model. We found PO3DNN to be logically responsive to its inputs during various idealized experiments that involved changing light conditions, pollution levels, and water vapor.

Expectedly, our results indicate that PO3 magnitudes and sensitivity maps are primarily influenced by the levels of ozone precursors, non-linearity of ozone chemistry, and photolysis rates. We revisited the accelerated PO3 observed in Souri et al. (2025) across polluted areas, such as major cities and during biomass burning activities in photochemically active environments. Using sensitivity calculations derived from the new algorithm, we investigated the contributors to PO3 seasonality around the globe. We found that photolysis rates were the primary drivers of PO3 seasonality. During darker months, both the magnitude of PO3 and its sensitivity to NO2 and HCHO decrease due to limited light availability to initiate the ROx–HOx cycle. This critical trend is not represented by the pollution levels alone, highlighting the necessity of including photolysis rates in ozone sensitivity analyses. Fortunately, we can largely constrain these rates using satellite observations. In regions with minimal variability in photolysis rates (such as the tropics), pollution levels became the main driver of PO3 seasonality.

The long record of stable observations from OMI allowed us to generate the first-ever maps of PO3 linear trends from 2005 to 2019 globally. The global long-term trends revealed substantial spatial variability, with predominantly positive trends over Asia and the Middle East (> 30 % relative to 2005) and negative trends across the eastern U.S., Europe, and parts of Africa. Analysis indicated that simultaneous changes in HCHO and NO2 boundary layer concentrations were the primary drivers of these trends. Although increases in both precursors over Asia and the Middle East, rising PO3 and reduced concentrations elsewhere lead to PO3 decreases, localized non-linearities complicated this relationship, as demonstrated by contrasting chemical regimes in Tehran vs. Los Angeles. Quantitative attribution of these trends presents challenges because of their small amplitudes relative to seasonal variations and non-linear sensitivities in the parameterization, necessitating “hold-one-out” approaches that account for complex interdependencies between input variables.

We error characterized both systematic and random errors associated with PO3DNN for both OMI and TROPOMI-based products. We showed that total errors range from 25 % to over 200 %, with smaller errors in polluted areas. Random errors are minor on monthly-basis, with OMI exhibiting larger errors due to row anomaly issues. Systematic errors exceed 90 % of the total error, primarily driven by MINDS conversion factors. The total errors budget emphasizes on the role of model used for converting satellite-based VCDs to near-surface concentrations and its importance for precisely determining ozone precursors levels near to the surface. Furthermore, in future efforts, we also need to refine satellite retrievals using spatially higher-resolution AMFs derived from MINDS while simultaneously performing retrieval validation against ground-based remote sensing observations.

We developed a novel product aimed at enhancing our understanding of the variability in PO3 and its interactions with NOx and VOCs on a global scale. This advanced algorithm has undergone meticulous tuning and training using an extensive dataset derived from a reliable box model, which is further constrained by intensive atmospheric composition campaigns conducted by NASA and NOAA. The algorithm not only yields accurate estimates of PO3 with minimal bias in comparison to observationally-constrained values but also facilitates the derivation of PO3 in relation to HCHO and NO2. However, as indicated by Souri et al. (2025), there remain several opportunities for further improvement, including: (i) the incorporation of heterogeneous chemistry; (ii) consideration of the impact of partially cloudy regions and aerosols on photolysis rates; (iii) gauging the potential benefits of using more sophisticated chemical mechanisms for the generation of the training dataset; and (iv) enhanced representation of vertical profiles of NO2 and HCHO using observationally-constrained chemical transport models with more rigorous column to near-surface conversion methods (Cooper et al., 2020). Some of these enhancements present significant challenges, particularly the fine-resolution three-dimensional characterization of aerosol and cloud properties on a global scale, which is not obtainable with current reanalysis data. However, with the advent of newer satellite technologies such as PACE and MAIA, there may be opportunities to improve the representation of atmospheric models with respect to cloud and aerosol characteristics.

While the OMI- and TROPOMI-based PO3 products maintain algorithmic consistency in several key components, including photolysis rates and water vapor calculations, the underlying satellite retrievals of HCHO and NO2 VCDs remain unharmonized between the two instruments. To address the resulting inter-instrument biases, we implemented bias correction using ground-based remote sensing retrievals as reference standards. This approach achieved OMI and TROPOMI PO3 agreement within 10 % on average. However, this level of consistency may be insufficient for robust joint trend analysis of the combined OMI-TROPOMI PO3 record over areas with non-linear or minor trends, potentially requiring the implementation of trend harmonization algorithms (e.g., Hilboll et al., 2013) to ensure statistical reliability in long-term analyses.

The emergence of novel geosynchronous orbit (GEO) technologies is becoming increasingly important for monitoring the daylight hourly variability in ozone precursors. In particular, the finer spatial and temporal resolution offered by the Tropospheric Emissions: Monitoring of Pollution (TEMPO), Geostationary Environment Monitoring Spectrometer (GEMS), and Sentinel-4 instruments will aid in distinguishing exceptional events from typical atmospheric conditions. In light of the success of emission mitigation strategies over high income countries, the occurrences of elevated PO3 are becoming more infrequent, thereby necessitating a more detailed and rapid observational strategy for monitoring such events. This presents a timely opportunity to address ozone exceedance events using TEMPO in conjunction with our PO3 estimator, especially since the algorithm is designed to handle light-limited conditions – such as those encountered during early morning and late afternoon periods when TEMPO collects data – conditions that are not feasible to analyze via the FNR approach.

Appendix A: The sensitivity maps are the directional derivative

To demonstrate that the sensitivity calculation of PO3 to its inputs resembles (Eq. 3) a directional derivative output, we can approximate the perturbations in the PO3DNN (denoted as f(x), where x is the targeted sensitivity parameter) using the Taylor expansion:

(A1)f1.1xfx+1.1x-xfx=fx+0.1xfx(A2)f0.9xfx+0.9x-xfx=fx-0.1xfx

The sensitivity calculation presented in Eq. (3) can be rewritten in the following form:

(A3) S = ( f x + 0.1 f x ) - ( f x - 0.1 f x ) 0.2 = 0.2 x f x 0.2 = x f x

Therefore, the first-order approximation of the DNN prediction, when using the given sensitivity calculation, is xf(x) which represents the first-order Taylor expansion term that describes how the output changes with respect to both the gradient and the magnitude of x (i.e., directional derivative).

Appendix B: MINDS conversion factor validation

We validate the column conversion factors obtained from the MINDS simulations against corresponding values derived from aircraft spirals from several suborbital missions. The concentrations of HCHO and NO2 in both datasets are collocated in time and space and are resampled onto a common vertical grid, ranging from the near surface up to 450 hPa in 20 hPa increments. To determine the conversion factors, these resampled concentrations are averaged within the PBL and then divided by the vertically integrated partial columns from the surface to 450 hPa. The PBLH is based on the MINDS simulations. Figure B1 displays scatterplots of the paired conversion factor binned at 12:15 and 15:15 LST (±45 min around the TROPOMI/OMI local revisit time) for NO2 and HCHO, respectively. The unit for these conversion factors is ppbv col−1, where col represents 1 × 1015 molec. cm−2. The comparison shows a good level of agreement between the two datasets for both species (R2>0.7). The MINDS simulations perform slightly better for NO2 than for HCHO. This performance difference may arise from the fact that HCHO is mainly a secondary product, meaning various uncertain VOC emissions, along with uncertain chemical processes in the model, could pile up leading to discrepancies in the vertical distribution of simulated HCHO compared to observations. Furthermore, HCHO vertical profiles can be easily affected by local circulation patterns that are difficult to resolve in coarse models (Souri et al., 2023b). We observe consistent model performance across various campaigns, except for DISCOVER-AQ Colorado. This discrepancy may result from complex topography and wind conditions in that region that the model might not fully capture. The differences between the two datasets can also be attributed to sources of error beyond the model deficiencies. For instance, the MINDS simulations represent a quarter-degree averaged concentration, which differs from the localized air samples derived from aircraft, known as the spatial representation error (Souri et al., 2022).

To account for the systematic errors resulting from the MINDS simulation in our error budget, we assign econv-HCHO and econv-NO2 in Eq. (5) to RMSE values obtained from the comparison. The choice of RMSE is based on the fact that it contains information about the bias and the dispersion of MINDS with respect to the observations. We assume these errors to be invariant by time or location, mainly because of limited aircraft spirals (N= 57) we have from the suborbital missions.

https://acp.copernicus.org/articles/26/809/2026/acp-26-809-2026-f15

Figure B1The scatterplot of the column to the PBL conversion factor for (a) NO2 and (b) HCHO obtained from aircraft spirals (x axis) and MINDS simulation (y axis) at the same time and location from four different suborbital missions. These 57 spirals are limited to OMI/TROPOMI overpass ±45 min buffering time. “col” denotes 1 × 1015 molec. cm−2.

Download

Data availability

The PO3 products can be obtained from https://www.ozonerates.space (last access: 12 January 2026).

TROPOMI satellite data are derived from copernicus Sentinel-5P (https://doi.org/10.5270/S5P-9bnp8q8, Copernicus Sentinel-5P, 2021), and copernicus Sentinel-5P (https://doi.org/10.5270/S5P-vg1i7t0, Copernicus Sentinel-5P, 2020). The TROPOMI UV DLER can be obtained from https://www.temis.nl/surface/albedo/tropomi_ler.php (last access: 10 November 2024). OMI SAO HCHO at https://aura.gesdisc.eosdis.nasa.gov/data/Aura_OMI_Level2/OMHCHO.004/ (last access: 12 January 2026). MINDS simulations can be obtained from https://portal.nccs.nasa.gov/datashare/merra2_gmi/gmi-minds/ (last access: 10 April 2025). OMI NO2 (QA4ECV) can be downloaded from https://www.temis.nl/ (last access: 10 April 2025).

Supplement

The supplement related to this article is available online at https://doi.org/10.5194/acp-26-809-2026-supplement.

Author contributions

AHS designed and implemented the research idea, analyzed the data, made all figures except for Figs. 2 and 3, and wrote the manuscript. GG implemented, designed, and validated the DNN algorithm, and made Figs. 2 and 3. LDM provided the MINDS simulations. BND helped with the interpretation of the results and editing.

Competing interests

At least one of the (co-)authors is a member of the editorial board of Atmospheric Chemistry and Physics. The peer-review process was guided by an independent editor, and the authors also have no other competing interests to declare.

Disclaimer

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. The authors bear the ultimate responsibility for providing appropriate place names. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Acknowledgements

Resources supporting this work were provided by the NASA High-End Computing (HEC) Program through the NASA Center for Climate Simulation (NCCS) at Goddard Space Flight Center.

Financial support

This study is funded by NASA's ACMAP/Aura project (grant no. 80NSSC23K1250).

Review statement

This paper was edited by Michel Van Roozendael and reviewed by two anonymous referees.

References

Abadi, M., Barham, P., Chen, J., Chen, Z., Davis, A., Dean, J., Devin, M., Ghemawat, S., Irving, G., Isard, M., Kudlur, M., Levenberg, J., Monga, R., Moore, S., Murray, D. G., Steiner, B., Tucker, P., Vasudevan, V., Warden, P., Wicke, M., Yu, Y., and Zheng, X.: TensorFlow: A system for large-scale machine learning, arXiv [preprint], https://doi.org/10.48550/arXiv.1605.08695, 31 May 2016. 

Anderson, D. C., Duncan, B. N., Liu, J., Nicely, J. M., Strode, S. A., Follette-Cook, M. B., Souri, A. H., Ziemke, J. R., González-Abad, G., and Ayazpour, Z.: Trends and Interannual Variability of the Hydroxyl Radical in the Remote Tropics During Boreal Autumn Inferred From Satellite Proxy Data, Geophys. Res. Lett., 51, e2024GL108531, https://doi.org/10.1029/2024GL108531, 2024. 

Ayazpour, Z., González Abad, G., Nowlan, C. R., Sun, K., Kwon, H.-A., Chan Miller, C., Chong, H., Wang, H., Liu, X., Chance, K., O'Sullivan, E., Zhu, L., Vigouroux, C., De Smedt, I., Stremme, W., Hannigan, J. W., Notholt, J., Sun, X., Palm, M., Petri, C., Strong, K., Röhling, A. N., Mahieu, E., Smale, D., Té, Y., Morino, I., Murata, I., Nagahama, T., Kivi, R., Makarova, M., Jones, N., Sussmann, R., and Zhou, M.: Aura Ozone Monitoring Instrument (OMI) Collection 4 Formaldehyde Products, Earth Space Sci., 12, e2024EA003792, https://doi.org/10.1029/2024EA003792, 2025. 

Bates, K. H. and Jacob, D. J.: An Expanded Definition of the Odd Oxygen Family for Tropospheric Ozone Budgets: Implications for Ozone Lifetime and Stratospheric Influence, Geophys. Res. Lett., 47, e2019GL084486, https://doi.org/10.1029/2019GL084486, 2020. 

Bauwens, M., Stavrakou, T., Müller, J.-F., De Smedt, I., Van Roozendael, M., van der Werf, G. R., Wiedinmyer, C., Kaiser, J. W., Sindelarova, K., and Guenther, A.: Nine years of global hydrocarbon emissions based on source inversion of OMI formaldehyde observations, Atmos. Chem. Phys., 16, 10133–10158, https://doi.org/10.5194/acp-16-10133-2016, 2016. 

Bocquet, M., Elbern, H., Eskes, H., Hirtl, M., Žabkar, R., Carmichael, G. R., Flemming, J., Inness, A., Pagowski, M., Pérez Camaño, J. L., Saide, P. E., San Jose, R., Sofiev, M., Vira, J., Baklanov, A., Carnevale, C., Grell, G., and Seigneur, C.: Data assimilation in atmospheric chemistry models: current status and future prospects for coupled chemistry meteorology models, Atmos. Chem. Phys., 15, 5325–5358, https://doi.org/10.5194/acp-15-5325-2015, 2015. 

Boersma, K. F., Eskes, H. J., Richter, A., De Smedt, I., Lorente, A., Beirle, S., van Geffen, J. H. G. M., Zara, M., Peters, E., Van Roozendael, M., Wagner, T., Maasakkers, J. D., van der A, R. J., Nightingale, J., De Rudder, A., Irie, H., Pinardi, G., Lambert, J.-C., and Compernolle, S. C.: Improving algorithms and uncertainty estimates for satellite NO2 retrievals: results from the quality assurance for the essential climate variables (QA4ECV) project, Atmos. Meas. Tech., 11, 6651–6678, https://doi.org/10.5194/amt-11-6651-2018, 2018. 

Borger, C., Beirle, S., and Wagner, T.: Analysis of global trends of total column water vapour from multiple years of OMI observations, Atmos. Chem. Phys., 22, 10603–10621, https://doi.org/10.5194/acp-22-10603-2022, 2022. 

Boynard, A., Wespes, C., Hadji-Lazaro, J., Sinnathamby, S., Hurtmans, D., Coheur, P.-F., Doutriaux-Boucher, M., Onderwaater, J., Steinbrecht, W., Pennington, E. A., Bowman, K., and Clerbaux, C.: Assessment of 16-year tropospheric ozone trends from the IASI Climate Data Record, Atmos. Chem. Phys., 25, 11719–11755, https://doi.org/10.5194/acp-25-11719-2025, 2025. 

Brune, W. H., Miller, D. O., Thames, A. B., Allen, H. M., Apel, E. C., Blake, D. R., Bui, T. P., Commane, R., Crounse, J. D., Daube, B. C., Diskin, G. S., DiGangi, J. P., Elkins, J. W., Hall, S. R., Hanisco, T. F., Hannun, R. A., Hintsa, E. J., Hornbrook, R. S., Kim, M. J., McKain, K., Moore, F. L., Neuman, J. A., Nicely, J. M., Peischl, J., Ryerson, T. B., St. Clair, J. M., Sweeney, C., Teng, A. P., Thompson, C., Ullmann, K., Veres, P. R., Wennberg, P. O., and Wolfe, G. M.: Exploring Oxidation in the Remote Free Troposphere: Insights From Atmospheric Tomography (ATom), J. Geophys. Res.-Atmos., 125, e2019JD031685, https://doi.org/10.1029/2019JD031685, 2020. 

Brune, W. H., Miller, D. O., Thames, A. B., Brosius, A. L., Barletta, B., Blake, D. R., Blake, N. J., Chen, G., Choi, Y., Crawford, J. H., Digangi, J. P., Diskin, G., Fried, A., Hall, S. R., Hanisco, T. F., Huey, G. L., Hughes, S. C., Kim, M., Meinardi, S., Montzka, D. D., Pusede, S. E., Schroeder, J. R., Teng, A., Tanner, D. J., Ullmann, K., Walega, J., Weinheimer, A., Wisthaler, A., and Wennberg, P. O.: Observations of atmospheric oxidation and ozone production in South Korea, Atmos. Environ., 269, 118854, https://doi.org/10.1016/j.atmosenv.2021.118854, 2022. 

Cazorla, M. and Brune, W. H.: Measurement of Ozone Production Sensor, Atmos. Meas. Tech., 3, 545–555, https://doi.org/10.5194/amt-3-545-2010, 2010. 

Cazorla, M., Brune, W. H., Ren, X., and Lefer, B.: Direct measurement of ozone production rates in Houston in 2009 and comparison with two estimation methods, Atmos. Chem. Phys., 12, 1203–1212, https://doi.org/10.5194/acp-12-1203-2012, 2012. 

Chatfield, R. B., Ren, X., Brune, W., and Schwab, J.: Controls on urban ozone production rate as indicated by formaldehyde oxidation rate and nitric oxide, Atmos. Environ., 44, 5395–5406, https://doi.org/10.1016/j.atmosenv.2010.08.056, 2010. 

Chin, M., Ginoux, P., Kinne, S., Torres, O., Holben, B. N., Duncan, B. N., Martin, R. V., Logan, J. A., Higurashi, A., and Nakajima, T.: Tropospheric Aerosol Optical Thickness from the GOCART Model and Comparisons with Satellite and Sun Photometer Measurements, J. Atmos. Sci., 59, 461–483, https://doi.org/10.1175/1520-0469(2002)059<0461:TAOTFT>2.0.CO;2, 2002. 

Choi, Y. and Souri, A. H.: Chemical condition and surface ozone in large cities of Texas during the last decade: Observational evidence from OMI, CAMS, and model analysis, Remote Sens. Environ., 168, 90–101, https://doi.org/10.1016/j.rse.2015.06.026, 2015a. 

Choi, Y. and Souri, A. H.: Seasonal behavior and long-term trends of tropospheric ozone, its precursors and chemical conditions over Iran: A view from space, Atmos. Environ., 106, 232–240, https://doi.org/10.1016/j.atmosenv.2015.02.012, 2015b. 

Choi, Y., Kim, H., Tong, D., and Lee, P.: Summertime weekly cycles of observed and modeled NOx and O3 concentrations as a function of satellite-derived ozone production sensitivity and land use types over the Continental United States, Atmos. Chem. Phys., 12, 6291–6307, https://doi.org/10.5194/acp-12-6291-2012, 2012. 

Compernolle, S., Verhoelst, T., Pinardi, G., Granville, J., Hubert, D., Keppens, A., Niemeijer, S., Rino, B., Bais, A., Beirle, S., Boersma, F., Burrows, J. P., De Smedt, I., Eskes, H., Goutail, F., Hendrick, F., Lorente, A., Pazmino, A., Piters, A., Peters, E., Pommereau, J.-P., Remmers, J., Richter, A., van Geffen, J., Van Roozendael, M., Wagner, T., and Lambert, J.-C.: Validation of Aura-OMI QA4ECV NO2 climate data records with ground-based DOAS networks: the role of measurement and comparison uncertainties, Atmos. Chem. Phys., 20, 8017–8045, https://doi.org/10.5194/acp-20-8017-2020, 2020. 

Cooper, M. J., Martin, R. V., McLinden, C. A., and Brook, J. R.: Inferring ground-level nitrogen dioxide concentrations at fine spatial resolution applied to the TROPOMI satellite instrument, Environ. Res. Lett., 15, 104013, https://doi.org/10.1088/1748-9326/aba3a5, 2020. 

Copernicus Sentinel-5P: TROPOMI Level 2 Formaldehyde Total Column products, Version 02, European Space Agency, Copernicus Sentinel-5P [data set], https://doi.org/10.5270/S5P-vg1i7t0, 2020. 

Copernicus Sentinel-5P: TROPOMI Level 2 Nitrogen Dioxide total column products, Version 02, European Space Agency, Copernicus Sentinel-5P [data set], https://doi.org/10.5270/S5P-9bnp8q8, 2021. 

David, L. M. and Nair, P. R.: Tropospheric column O3 and NO2 over the Indian region observed by Ozone Monitoring Instrument (OMI): Seasonal changes and long-term trends, Atmos. Environ., 65, 25–39, https://doi.org/10.1016/j.atmosenv.2012.09.033, 2013. 

De Smedt, I., Pinardi, G., Vigouroux, C., Compernolle, S., Bais, A., Benavent, N., Boersma, F., Chan, K.-L., Donner, S., Eichmann, K.-U., Hedelt, P., Hendrick, F., Irie, H., Kumar, V., Lambert, J.-C., Langerock, B., Lerot, C., Liu, C., Loyola, D., Piters, A., Richter, A., Rivera Cárdenas, C., Romahn, F., Ryan, R. G., Sinha, V., Theys, N., Vlietinck, J., Wagner, T., Wang, T., Yu, H., and Van Roozendael, M.: Comparative assessment of TROPOMI and OMI formaldehyde observations and validation against MAX-DOAS network column measurements, Atmos. Chem. Phys., 21, 12561–12593, https://doi.org/10.5194/acp-21-12561-2021, 2021. 

Duncan, B., Yoshida, Y., Olson, J., Sillman, S., Retscher, C., Martin, R., Lamsal, L., Hu, Y., Pickering, K., Retscher, C., Allen, D., and Crawford, J.: Application of OMI observations to a space-based indicator of NOx and VOC controls on surface ozone formation, Atmos. Environ., 44, 2213–2223, https://doi.org/10.1016/j.atmosenv.2010.03.010, 2010. 

Duncan, B. N., Strahan, S. E., Yoshida, Y., Steenrod, S. D., and Livesey, N.: Model study of the cross-tropopause transport of biomass burning pollution, Atmos. Chem. Phys., 7, 3713–3736, https://doi.org/10.5194/acp-7-3713-2007, 2007. 

Eskes, H. J. and Boersma, K. F.: Averaging kernels for DOAS total-column satellite retrievals, Atmos. Chem. Phys., 3, 1285–1291, https://doi.org/10.5194/acp-3-1285-2003, 2003. 

Fisher, B. L., Lamsal, L. N., Fasnacht, Z., Oman, L. D., Joiner, J., Krotkov, N. A., Choi, S., Qin, W., and Yang, E.-S.: Revised estimates of NO2 reductions during the COVID-19 lockdowns using updated TROPOMI NO2 retrievals and model simulations, Atmos. Environ., 326, 120459, https://doi.org/10.1016/j.atmosenv.2024.120459, 2024. 

Fleming, Z. L., Doherty, R. M., von Schneidemesser, E., Malley, C. S., Cooper, O. R., Pinto, J. P., Colette, A., Xu, X., Simpson, D., Schultz, M. G., Lefohn, A. S., Hamad, S., Moolla, R., Solberg, S., and Feng, Z.: Tropospheric Ozone Assessment Report: Present-day ozone distribution and trends relevant to human health, Elem. Sci. Anth., 6, 12, https://doi.org/10.1525/elementa.273, 2018. 

Gaudel, A., Cooper, O. R., Ancellet, G., Barret, B., Boynard, A., Burrows, J. P., Clerbaux, C., Coheur, P.-F., Cuesta, J., Cuevas, E., Doniki, S., Dufour, G., Ebojie, F., Foret, G., Garcia, O., Granados-Muñoz, M. J., Hannigan, J. W., Hase, F., Hassler, B., Huang, G., Hurtmans, D., Jaffe, D., Jones, N., Kalabokas, P., Kerridge, B., Kulawik, S., Latter, B., Leblanc, T., Le Flochmoën, E., Lin, W., Liu, J., Liu, X., Mahieu, E., McClure-Begley, A., Neu, J. L., Osman, M., Palm, M., Petetin, H., Petropavlovskikh, I., Querel, R., Rahpoe, N., Rozanov, A., Schultz, M. G., Schwab, J., Siddans, R., Smale, D., Steinbacher, M., Tanimoto, H., Tarasick, D. W., Thouret, V., Thompson, A. M., Trickl, T., Weatherhead, E., Wespes, C., Worden, H. M., Vigouroux, C., Xu, X., Zeng, G., and Ziemke, J.: Tropospheric Ozone Assessment Report: Present-day distribution and trends of tropospheric ozone relevant to climate and global atmospheric chemistry model evaluation, Elem. Sci. Anth., 6, 39, https://doi.org/10.1525/elementa.291, 2018. 

Georgoulias, A. K., Boersma, K. F., van Vliet, J., Zhang, X., van der A, R., Zanis, P., and de Laat, J.: Detection of NO2 pollution plumes from individual ships with the TROPOMI/S5P satellite sensor, Environ. Res. Lett., 15, 124037, https://doi.org/10.1088/1748-9326/abc445, 2020. 

Gonzalez Abad, G., Souri, A. H., Bak, J., Chance, K., Flynn, L. E., Krotkov, N. A., Lamsal, L., Li, C., Liu, X., Miller, C. C., Nowlan, C. R., Suleiman, R., and Wang, H.: Five decades observing Earth's atmospheric trace gases using ultraviolet and visible backscatter solar radiation from space, J. Quant. Spectrosc. Ra., 238, 106478, https://doi.org/10.1016/j.jqsrt.2019.04.030, 2019. 

Granier, C., Bessagnet, B., Bond, T., D'Angiola, A., Denier van der Gon, H., Frost, G. J., Heil, A., Kaiser, J. W., Kinne, S., Klimont, Z., Kloster, S., Lamarque, J.-F., Liousse, C., Masui, T., Meleux, F., Mieville, A., Ohara, T., Raut, J.-C., Riahi, K., Schultz, M. G., Smith, S. J., Thompson, A., van Aardenne, J., van der Werf, G. R., and van Vuuren, D. P.: Evolution of anthropogenic and biomass burning emissions of air pollutants at global and regional scales during the 1980–2010 period, Clim. Change, 109, 163, https://doi.org/10.1007/s10584-011-0154-1, 2011. 

Guenther, A. B., Jiang, X., Heald, C. L., Sakulyanontvittaya, T., Duhl, T., Emmons, L. K., and Wang, X.: The Model of Emissions of Gases and Aerosols from Nature version 2.1 (MEGAN2.1): an extended and updated framework for modeling biogenic emissions, Geosci. Model Dev., 5, 1471–1492, https://doi.org/10.5194/gmd-5-1471-2012, 2012. 

Hilboll, A., Richter, A., and Burrows, J. P.: Long-term changes of tropospheric NO2 over megacities derived from multiple satellite instruments, Atmos. Chem. Phys., 13, 4145–4169, https://doi.org/10.5194/acp-13-4145-2013, 2013. 

Ichoku, C. and Ellison, L.: Global top-down smoke-aerosol emissions estimation using satellite fire radiative power measurements, Atmos. Chem. Phys., 14, 6643–6667, https://doi.org/10.5194/acp-14-6643-2014, 2014. 

Jeon, W., Choi, Y., Souri, A. H., Roy, A., Diao, L., Pan, S., Lee, H. W., and Lee, S.-H.: Identification of chemical fingerprints in long-range transport of burning induced upper tropospheric ozone from Colorado to the North Atlantic Ocean, Sci. Total Environ., 613–614, 820–828, https://doi.org/10.1016/j.scitotenv.2017.09.177, 2018. 

Jin, X., Fiore, A. M., Murray, L. T., Valin, L. C., Lamsal, L. N., Duncan, B., Folkert Boersma, K., De Smedt, I., Abad, G. G., Chance, K., and Tonnesen, G. S.: Evaluating a Space-Based Indicator of Surface Ozone-NOx-VOC Sensitivity Over Midlatitude Source Regions and Application to Decadal Trends, J. Geophys. Res.-Atmos., 122, 10439–10461, https://doi.org/10.1002/2017JD026720, 2017. 

Johnson, M. S., Philip, S., Meech, S., Kumar, R., Sorek-Hamer, M., Shiga, Y. P., and Jung, J.: Insights into the long-term (2005–2021) spatiotemporal evolution of summer ozone production sensitivity in the Northern Hemisphere derived with the Ozone Monitoring Instrument (OMI), Atmos. Chem. Phys., 24, 10363–10384, https://doi.org/10.5194/acp-24-10363-2024, 2024. 

Kingma, D. P. and Ba, J.: Adam: A Method for Stochastic Optimization, arXiv [preprint], https://doi.org/10.48550/arXiv.1412.6980, 30 January 2017. 

Kleinman, L. I., Daum, P. H., Imre, D., Lee, Y.-N., Nunnermacker, L. J., Springston, S. R., Weinstein-Lloyd, J., and Rudolph, J.: Ozone production rate and hydrocarbon reactivity in 5 urban areas: A cause of high ozone concentration in Houston, Geophys. Res. Lett., 29, 105-1–105-4, https://doi.org/10.1029/2001GL014569, 2002. 

Kleinman, L. I., Daum, P. H., Lee, Y.-N., Nunnermacker, L. J., Springston, S. R., Weinstein-Lloyd, J., and Rudolph, J.: A comparative study of ozone production in five U.S. metropolitan areas, J. Geophys. Res.-Atmos., 110, https://doi.org/10.1029/2004JD005096, 2005. 

Martin, R. V., Fiore, A. M., and Van Donkelaar, A.: Space-based diagnosis of surface ozone sensitivity to anthropogenic emissions, Geophys. Res. Lett., 31, https://doi.org/10.1029/2004GL019416, 2004. 

Mazzuca, G. M., Ren, X., Loughner, C. P., Estes, M., Crawford, J. H., Pickering, K. E., Weinheimer, A. J., and Dickerson, R. R.: Ozone production and its sensitivity to NOx and VOCs: results from the DISCOVER-AQ field experiment, Houston 2013, Atmos. Chem. Phys., 16, 14463–14474, https://doi.org/10.5194/acp-16-14463-2016, 2016. 

Miller, D. O. and Brune, W. H.: Investigating the Understanding of Oxidation Chemistry Using 20 Years of Airborne OH and HO2 Observations, J. Geophys. Res.-Atmos., 127, e2021JD035368, https://doi.org/10.1029/2021JD035368, 2022. 

Mills, G., Pleijel, H., Malley, C. S., Sinha, B., Cooper, O. R., Schultz, M. G., Neufeld, H. S., Simpson, D., Sharps, K., Feng, Z., Gerosa, G., Harmens, H., Kobayashi, K., Saxena, P., Paoletti, E., Sinha, V., and Xu, X.: Tropospheric Ozone Assessment Report: Present-day tropospheric ozone distribution and trends relevant to vegetation, Elem. Sci. Anth., 6, 47, https://doi.org/10.1525/elementa.302, 2018. 

Miyazaki, K., Bowman, K. W., Yumimoto, K., Walker, T., and Sudo, K.: Evaluation of a multi-model, multi-constituent assimilation framework for tropospheric chemical reanalysis, Atmos. Chem. Phys., 20, 931–967, https://doi.org/10.5194/acp-20-931-2020, 2020. 

Molod, A., Takacs, L., Suarez, M., and Bacmeister, J.: Development of the GEOS-5 atmospheric general circulation model: evolution from MERRA to MERRA2, Geosci. Model Dev., 8, 1339–1356, https://doi.org/10.5194/gmd-8-1339-2015, 2015. 

Nielsen, J. E., Pawson, S., Molod, A., Auer, B., da Silva, A. M., Douglass, A. R., Duncan, B., Liang, Q., Manyin, M., Oman, L. D., Putman, W., Strahan, S. E., and Wargan, K.: Chemical Mechanisms and Their Applications in the Goddard Earth Observing System (GEOS) Earth System Model, J. Adv. Model. Earth Syst., 9, 3019–3044, https://doi.org/10.1002/2017MS001011, 2017. 

Nowlan, C. R., González Abad, G., Kwon, H.-A., Ayazpour, Z., Chan Miller, C., Chance, K., Chong, H., Liu, X., O'Sullivan, E., Wang, H., Zhu, L., De Smedt, I., Jaross, G., Seftor, C., and Sun, K.: Global Formaldehyde Products From the Ozone Mapping and Profiler Suite (OMPS) Nadir Mappers on Suomi NPP and NOAA-20, Earth Space Sci., 10, e2022EA002643, https://doi.org/10.1029/2022EA002643, 2023. 

Opacka, B., Stavrakou, T., Müller, J.-F., De Smedt, I., van Geffen, J., Marais, E. A., Horner, R. P., Millet, D. B., Wells, K. C., and Guenther, A. B.: Natural emissions of VOC and NOx over Africa constrained by TROPOMI HCHO and NO2 data using the MAGRITTEv1.1 model, Atmos. Chem. Phys., 25, 2863–2894, https://doi.org/10.5194/acp-25-2863-2025, 2025. 

Orbe, C., Oman, L. D., Strahan, S. E., Waugh, D. W., Pawson, S., Takacs, L. L., and Molod, A. M.: Large-Scale Atmospheric Transport in GEOS Replay Simulations, J. Adv. Model. Earth Syst., 9, 2545–2560, https://doi.org/10.1002/2017MS001053, 2017. 

Pinardi, G., Van Roozendael, M., Hendrick, F., Theys, N., Abuhassan, N., Bais, A., Boersma, F., Cede, A., Chong, J., Donner, S., Drosoglou, T., Dzhola, A., Eskes, H., Frieß, U., Granville, J., Herman, J. R., Holla, R., Hovila, J., Irie, H., Kanaya, Y., Karagkiozidis, D., Kouremeti, N., Lambert, J.-C., Ma, J., Peters, E., Piters, A., Postylyakov, O., Richter, A., Remmers, J., Takashima, H., Tiefengraber, M., Valks, P., Vlemmix, T., Wagner, T., and Wittrock, F.: Validation of tropospheric NO2 column measurements of GOME-2A and OMI using MAX-DOAS and direct sun network observations, Atmos. Meas. Tech., 13, 6141–6174, https://doi.org/10.5194/amt-13-6141-2020, 2020. 

Ren, X., van Duin, D., Cazorla, M., Chen, S., Mao, J., Zhang, L., Brune, W. H., Flynn, J. H., Grossberg, N., Lefer, B. L., Rappenglück, B., Wong, K. W., Tsai, C., Stutz, J., Dibb, J. E., Thomas Jobson, B., Luke, W. T., and Kelley, P.: Atmospheric oxidation chemistry and ozone production: Results from SHARP 2009 in Houston, Texas, J. Geophys. Res.-Atmos., 118, 5770–5780, https://doi.org/10.1002/jgrd.50342, 2013. 

Roberts, G., Wooster, M. J., and Lagoudakis, E.: Annual and diurnal african biomass burning temporal dynamics, Biogeosciences, 6, 849–866, https://doi.org/10.5194/bg-6-849-2009, 2009. 

Sadanaga, Y., Kawasaki, S., Tanaka, Y., Kajii, Y., and Bandow, H.: New System for Measuring the Photochemical Ozone Production Rate in the Atmosphere, Environ. Sci. Technol., 51, 2871–2878, https://doi.org/10.1021/acs.est.6b04639, 2017. 

Schroeder, J. R., Crawford, J. H., Fried, A., Walega, J., Weinheimer, A., Wisthaler, A., Müller, M., Mikoviny, T., Chen, G., Shook, M., Blake, D. R., and Tonnesen, G. S.: New insights into the column CH2O/NO2 ratio as an indicator of near-surface ozone sensitivity, J. Geophys. Res.-Atmos., 122, 8885–8907, https://doi.org/10.1002/2017JD026781, 2017. 

Schroeder, J. R., Crawford, J. H., Ahn, J.-Y., Chang, L., Fried, A., Walega, J., Weinheimer, A., Montzka, D. D., Hall, S. R., Ullmann, K., Wisthaler, A., Mikoviny, T., Chen, G., Blake, D. R., Blake, N. J., Hughes, S. C., Meinardi, S., Diskin, G., Digangi, J. P., Choi, Y., Pusede, S. E., Huey, G. L., Tanner, D. J., Kim, M., and Wennberg, P.: Observation-based modeling of ozone chemistry in the Seoul metropolitan area during the Korea-United States Air Quality Study (KORUS-AQ), Elem. Sci. Anth., 8, 3, https://doi.org/10.1525/elementa.400, 2020. 

Shen, Z., Yang, H., and Zhang, S.: Neural network approximation: Three hidden layers are enough, Neural Netw., 141, 160–173, https://doi.org/10.1016/j.neunet.2021.04.011, 2021. 

Sillman, S. and He, D.: Some theoretical results concerning O3-NOx-VOC chemistry and NOx-VOC indicators, J. Geophys. Res.-Atmos., 107, ACH 26-1–ACH 26-15, https://doi.org/10.1029/2001JD001123, 2002. 

Sklaveniti, S., Locoge, N., Stevens, P. S., Wood, E., Kundu, S., and Dusanter, S.: Development of an instrument for direct ozone production rate measurements: measurement reliability and current limitations, Atmos. Meas. Tech., 11, 741–761, https://doi.org/10.5194/amt-11-741-2018, 2018. 

Souri, A. and Gonzalez Abad, G.: “ahsouri/ozonerates: Ozonerates v1.0”, Zenodo [code], https://doi.org/10.5281/zenodo.15076487, 2025. 

Souri, A. H., Choi, Y., Jeon, W., Li, X., Pan, S., Diao, L., and Westenbarger, D. A.: Constraining NOx emissions using satellite NO2 measurements during 2013 DISCOVER-AQ Texas campaign, Atmos. Environ., 131, 371–381, https://doi.org/10.1016/j.atmosenv.2016.02.020, 2016. 

Souri, A. H., Choi, Y., Jeon, W., Woo, J.-H., Zhang, Q., and Kurokawa, J.: Remote sensing evidence of decadal changes in major tropospheric ozone precursors over East Asia, J. Geophys. Res.-Atmos., 122, 2474–2492, https://doi.org/10.1002/2016JD025663, 2017. 

Souri, A. H., Nowlan, C. R., Wolfe, G. M., Lamsal, L. N., Chan Miller, C. E., Abad, G. G., Janz, S. J., Fried, A., Blake, D. R., Weinheimer, A. J., Diskin, G. S., Liu, X., and Chance, K.: Revisiting the effectiveness of HCHO/NO2 ratios for inferring ozone sensitivity to its precursors using high resolution airborne remote sensing observations in a high ozone episode during the KORUS-AQ campaign, Atmos. Environ., 224, 117341, https://doi.org/10.1016/j.atmosenv.2020.117341, 2020a. 

Souri, A. H., Nowlan, C. R., González Abad, G., Zhu, L., Blake, D. R., Fried, A., Weinheimer, A. J., Wisthaler, A., Woo, J.-H., Zhang, Q., Chan Miller, C. E., Liu, X., and Chance, K.: An inversion of NOx and non-methane volatile organic compound (NMVOC) emissions using satellite observations during the KORUS-AQ campaign and implications for surface ozone over East Asia, Atmos. Chem. Phys., 20, 9837–9854, https://doi.org/10.5194/acp-20-9837-2020, 2020b. 

Souri, A. H., Chance, K., Sun, K., Liu, X., and Johnson, M. S.: Dealing with spatial heterogeneity in pointwise-to-gridded- data comparisons, Atmos. Meas. Tech., 15, 41–59, https://doi.org/10.5194/amt-15-41-2022, 2022. 

Souri, A. H., Johnson, M. S., Wolfe, G. M., Crawford, J. H., Fried, A., Wisthaler, A., Brune, W. H., Blake, D. R., Weinheimer, A. J., Verhoelst, T., Compernolle, S., Pinardi, G., Vigouroux, C., Langerock, B., Choi, S., Lamsal, L., Zhu, L., Sun, S., Cohen, R. C., Min, K.-E., Cho, C., Philip, S., Liu, X., and Chance, K.: Characterization of errors in satellite-based HCHO/NO2 tropospheric column ratios with respect to chemistry, column-to-PBL translation, spatial representation, and retrieval uncertainties, Atmos. Chem. Phys., 23, 1963–1986, https://doi.org/10.5194/acp-23-1963-2023, 2023a. 

Souri, A. H., Kumar, R., Chong, H., Golbazi, M., Knowland, K. E., Geddes, J., and Johnson, M. S.: Decoupling in the vertical shape of HCHO during a sea breeze event: The effect on trace gas satellite retrievals and column-to-surface translation, Atmos. Environ., 309, 119929, https://doi.org/10.1016/j.atmosenv.2023.119929, 2023b. 

Souri, A. H., Duncan, B. N., Strode, S. A., Anderson, D. C., Manyin, M. E., Liu, J., Oman, L. D., Zhang, Z., and Weir, B.: Enhancing long-term trend simulation of the global tropospheric hydroxyl (TOH) and its drivers from 2005 to 2019: a synergistic integration of model simulations and satellite observations, Atmos. Chem. Phys., 24, 8677–8701, https://doi.org/10.5194/acp-24-8677-2024, 2024. 

Souri, A. H., González Abad, G., Wolfe, G. M., Verhoelst, T., Vigouroux, C., Pinardi, G., Compernolle, S., Langerock, B., Duncan, B. N., and Johnson, M. S.: Feasibility of robust estimates of ozone production rates using a synergy of satellite observations, ground-based remote sensing, and models, Atmos. Chem. Phys., 25, 2061–2086, https://doi.org/10.5194/acp-25-2061-2025, 2025. 

Stavrakou, T., Müller, J.-F., De Smedt, I., Van Roozendael, M., Kanakidou, M., Vrekoussis, M., Wittrock, F., Richter, A., and Burrows, J. P.: The continental source of glyoxal estimated by the synergistic use of spaceborne measurements and inverse modelling, Atmos. Chem. Phys., 9, 8431–8446, https://doi.org/10.5194/acp-9-8431-2009, 2009. 

Stavrakou, T., Müller, J.-F., Bauwens, M., De Smedt, I., Lerot, C., Van Roozendael, M., Coheur, P.-F., Clerbaux, C., Boersma, K. F., van der A, R., and Song, Y.: Substantial Underestimation of Post-Harvest Burning Emissions in the North China Plain Revealed by Multi-Species Space Observations, Sci. Rep., 6, 32307, https://doi.org/10.1038/srep32307, 2016. 

Strahan, S. E., Duncan, B. N., and Hoor, P.: Observationally derived transport diagnostics for the lowermost stratosphere and their application to the GMI chemistry and transport model, Atmos. Chem. Phys., 7, 2435–2445, https://doi.org/10.5194/acp-7-2435-2007, 2007. 

Sullivan, J. T., McGee, T. J., Stauffer, R. M., Thompson, A. M., Weinheimer, A., Knote, C., Janz, S., Wisthaler, A., Long, R., Szykman, J., Park, J., Lee, Y., Kim, S., Jeong, D., Sanchez, D., Twigg, L., Sumnicht, G., Knepp, T., and Schroeder, J. R.: Taehwa Research Forest: a receptor site for severe domestic pollution events in Korea during 2016, Atmos. Chem. Phys., 19, 5051–5067, https://doi.org/10.5194/acp-19-5051-2019, 2019. 

Tao, M., Fiore, A. M., Jin, X., Schiferl, L. D., Commane, R., Judd, L. M., Janz, S., Sullivan, J. T., Miller, P. J., Karambelas, A., Davis, S., Tzortziou, M., Valin, L., Whitehill, A., Civerolo, K., and Tian, Y.: Investigating Changes in Ozone Formation Chemistry during Summertime Pollution Events over the Northeastern United States, Environ. Sci. Technol., 56, 15312–15327, https://doi.org/10.1021/acs.est.2c02972, 2022. 

Thorsen, T. J. and Fu, Q.: CALIPSO-inferred aerosol direct radiative effects: Bias estimates using ground-based Raman lidars, J. Geophys. Res.-Atmos., 120, 12209–12220, https://doi.org/10.1002/2015JD024095, 2015. 

Tilstra, L. G., de Graaf, M., Trees, V. J. H., Litvinov, P., Dubovik, O., and Stammes, P.: A directional surface reflectance climatology determined from TROPOMI observations, Atmos. Meas. Tech., 17, 2235–2256, https://doi.org/10.5194/amt-17-2235-2024, 2024. 

Valin, L. C., Russell, A. R., Hudman, R. C., and Cohen, R. C.: Effects of model resolution on the interpretation of satellite NO2 observations, Atmos. Chem. Phys., 11, 11647–11655, https://doi.org/10.5194/acp-11-11647-2011, 2011. 

van Geffen, J., Eskes, H., Compernolle, S., Pinardi, G., Verhoelst, T., Lambert, J.-C., Sneep, M., ter Linden, M., Ludewig, A., Boersma, K. F., and Veefkind, J. P.: Sentinel-5P TROPOMI NO2 retrieval: impact of version v2.2 improvements and comparisons with OMI and ground-based data, Atmos. Meas. Tech., 15, 2037–2060, https://doi.org/10.5194/amt-15-2037-2022, 2022. 

Veefkind, J. P., Aben, I., McMullan, K., Förster, H., de Vries, J., Otter, G., Claas, J., Eskes, H. J., de Haan, J. F., Kleipool, Q., van Weele, M., Hasekamp, O., Hoogeveen, R., Landgraf, J., Snel, R., Tol, P., Ingmann, P., Voors, R., Kruizinga, B., Vink, R., Visser, H., and Levelt, P. F.: TROPOMI on the ESA Sentinel-5 Precursor: A GMES mission for global observations of the atmospheric composition for climate, air quality and ozone layer applications, Remote Sens. Environ., 120, 70–83, https://doi.org/10.1016/j.rse.2011.09.027, 2012. 

Verhoelst, T., Compernolle, S., Pinardi, G., Lambert, J.-C., Eskes, H. J., Eichmann, K.-U., Fjæraa, A. M., Granville, J., Niemeijer, S., Cede, A., Tiefengraber, M., Hendrick, F., Pazmiño, A., Bais, A., Bazureau, A., Boersma, K. F., Bognar, K., Dehn, A., Donner, S., Elokhov, A., Gebetsberger, M., Goutail, F., Grutter de la Mora, M., Gruzdev, A., Gratsea, M., Hansen, G. H., Irie, H., Jepsen, N., Kanaya, Y., Karagkiozidis, D., Kivi, R., Kreher, K., Levelt, P. F., Liu, C., Müller, M., Navarro Comas, M., Piters, A. J. M., Pommereau, J.-P., Portafaix, T., Prados-Roman, C., Puentedura, O., Querel, R., Remmers, J., Richter, A., Rimmer, J., Rivera Cárdenas, C., Saavedra de Miguel, L., Sinyakov, V. P., Stremme, W., Strong, K., Van Roozendael, M., Veefkind, J. P., Wagner, T., Wittrock, F., Yela González, M., and Zehner, C.: Ground-based validation of the Copernicus Sentinel-5P TROPOMI NO2 measurements with the NDACC ZSL-DOAS, MAX-DOAS and Pandonia global networks, Atmos. Meas. Tech., 14, 481–510, https://doi.org/10.5194/amt-14-481-2021, 2021. 

Vigouroux, C., Langerock, B., Bauer Aquino, C. A., Blumenstock, T., Cheng, Z., De Mazière, M., De Smedt, I., Grutter, M., Hannigan, J. W., Jones, N., Kivi, R., Loyola, D., Lutsch, E., Mahieu, E., Makarova, M., Metzger, J.-M., Morino, I., Murata, I., Nagahama, T., Notholt, J., Ortega, I., Palm, M., Pinardi, G., Röhling, A., Smale, D., Stremme, W., Strong, K., Sussmann, R., Té, Y., van Roozendael, M., Wang, P., and Winkler, H.: TROPOMI–Sentinel-5 Precursor formaldehyde validation using an extensive network of ground-based Fourier-transform infrared stations, Atmos. Meas. Tech., 13, 3751–3767, https://doi.org/10.5194/amt-13-3751-2020, 2020. 

Vinken, G. C. M., Boersma, K. F., Jacob, D. J., and Meijer, E. W.: Accounting for non-linear chemistry of ship plumes in the GEOS-Chem global chemistry transport model, Atmos. Chem. Phys., 11, 11707–11722, https://doi.org/10.5194/acp-11-11707-2011, 2011. 

Wolfe, G. M., Marvin, M. R., Roberts, S. J., Travis, K. R., and Liao, J.: The Framework for 0-D Atmospheric Modeling (F0AM) v3.1, Geosci. Model Dev., 9, 3309–3319, https://doi.org/10.5194/gmd-9-3309-2016, 2016. 

Wolfe, G. M., Hanisco, T. F., Arkinson, H. L., Blake, D. R., Wisthaler, A., Mikoviny, T., Ryerson, T. B., Pollack, I., Peischl, J., Wennberg, P. O., Crounse, J. D., St. Clair, J. M., Teng, A., Huey, L. G., Liu, X., Fried, A., Weibring, P., Richter, D., Walega, J., Hall, S. R., Ullmann, K., Jimenez, J. L., Campuzano-Jost, P., Bui, T. P., Diskin, G., Podolske, J. R., Sachse, G., and Cohen, R. C.: Photochemical evolution of the 2013 California Rim Fire: synergistic impacts of reactive hydrocarbons and enhanced oxidants, Atmos. Chem. Phys., 22, 4253–4275, https://doi.org/10.5194/acp-22-4253-2022, 2022. 

Yu, K., Jacob, D. J., Fisher, J. A., Kim, P. S., Marais, E. A., Miller, C. C., Travis, K. R., Zhu, L., Yantosca, R. M., Sulprizio, M. P., Cohen, R. C., Dibb, J. E., Fried, A., Mikoviny, T., Ryerson, T. B., Wennberg, P. O., and Wisthaler, A.: Sensitivity to grid resolution in the ability of a chemical transport model to simulate observed oxidant chemistry under high-isoprene conditions, Atmos. Chem. Phys., 16, 4369–4378, https://doi.org/10.5194/acp-16-4369-2016, 2016. 

Download
Short summary
We create long-term maps of PO3 magnitudes along with their corresponding sensitivity maps. This is achieved using a deep learning parameterization method that relies on satellite data, atmospheric models, and ground-based remote sensing. Our approach provides more quantitative information than commonly used methods that depend on ratio-based indicators (such as HCHO/NO2). Additionally, our method considers light and water vapor, making it suitable for applications with GEO satellites.
Share
Altmetrics
Final-revised paper
Preprint