Toward a learnable Artificial Intelligence Model for Aerosol Chemistry and Interactions (AIMACI) based on the Multi-Head Self-Attention algorithm

Xia, Zihan; Zhao, Chun; Yang, Zining; Du, Qiuyan; Feng, Jiawang; Jin, Chen; Shi, Jun; An, Hong

doi:https://doi.org/10.5194/acp-25-6197-2025

Articles | Volume 25, issue 12

https://doi.org/10.5194/acp-25-6197-2025

Articles | Volume 25, issue 12

Research article

25 Jun 2025

Research article |

| 25 Jun 2025

Toward a learnable Artificial Intelligence Model for Aerosol Chemistry and Interactions (AIMACI) based on the Multi-Head Self-Attention algorithm

Zihan Xia, Chun Zhao, Zining Yang, Qiuyan Du, Jiawang Feng, Chen Jin, Jun Shi, and Hong An

Abstract

Simulating aerosol chemistry and interactions (ACI) is crucial in climate and atmospheric modeling, yet conventional numerical schemes are computationally intensive due to the stiff differential equations and iterative methods involved. While artificial intelligence (AI) has demonstrated potential in accelerating photochemistry simulations, it has not been applied for simulating the full ACI processes, which encompass not only chemical reactions but also other processes, such as nucleation and coagulation. To bridge this gap, we develop a novel Artificial Intelligence Model for Aerosol Chemistry and Interactions (AIMACI), focusing initially on inorganic aerosols. Trained based on a conventional scheme, it has been validated in both offline and online modes (referring to whether it is coupled into a three-dimensional atmospheric model). Results demonstrate that AIMACI is not only comparable with conventional schemes in spatial distributions, temporal variations, and evolution of particle size distribution of main aerosol species, including water content in aerosols, but also exhibits robust generalization ability, reliably simulating one month under different environmental conditions across four seasons despite being trained on limited data from merely 16 d. Notably, it exhibits ∼5× speedup with a single CPU and ∼277× speedup with a single GPU, compared with conventional schemes. However, the stability of AIMACI for year-scale global simulations remains to be seen, requiring further testing. AIMACI's generalization capability and its modular design suggest potential for future coupling to global climate models, which are expected to enhance the precision and efficiency of ACI simulations in climate modeling that neglects or simplifies ACI processes.

Download & links

Article (PDF, 10038 KB)

Supplement (721 KB)

Download & links

How to cite.

Received: 12 Sep 2024 – Discussion started: 14 Oct 2024 – Revised: 27 Mar 2025 – Accepted: 21 Apr 2025 – Published: 25 Jun 2025

1 Introduction

Atmospheric aerosols, which consist of a suspension of solid and liquid particles in the air, exert a profound influence on Earth's climate system and air quality (Charlson et al., 1992). Their multifaceted impacts are evident in their capacity to alter the Earth's radiation balance through the scattering and absorption of solar and longwave radiation, as well as in their role as cloud condensation nuclei that influence the formation and characteristics of clouds (Twomey, 1974; Lohmann and Feichter, 2005; Bellouin et al., 2020; Li et al., 2022). The presence of atmospheric aerosols extends its reach to environmental well-being, with implications that span visibility, human health, and the integrity of ecological ecosystems (Pöschl, 2005; Arfin et al., 2023). Full aerosol chemistry and interactions (ACI) consist of multiple highly nonlinear subprocesses, including chemical reactions, phase equilibrium, gas-particle partitioning, particle size growth, coagulation, and nucleation, which have a significant impact on the concentration of atmospheric aerosols (Zaveri et al., 2008). Numerical models stand as indispensable analytical tools, pivotal for comprehending the aforementioned phenomena, and are instrumental in air quality management and the formulation of mitigation strategies for climate change. However, simulating full ACI processes within these models poses a significant computational challenge (Carmichael et al., 1999; Ebel et al., 2006). As quantified in Fig. S1, using the Model for Simulating Aerosol Interactions and Chemistry (MOSAIC) scheme with four bins for full ACI simulation in the Weather Research and Forecasting with Chemistry (WRF-Chem) model accounts for 31.4 % of the total computational time of the chemistry module. This is primarily due to the requirement to solve a complex set of stiff nonlinear differential equations governing aerosol processes, coupled with the use of implicit integration schemes to ensure numerical stability (Sandu et al., 1997a, b). Furthermore, to accommodate the diverse methodologies for describing the evolution of particle size distribution (PSD), some aerosol processes may require repeated calculations (J. L. Wang et al., 2022). For example, when employing a discrete model, the coagulation collision frequency functions need to be computed for each discrete size (Zhang et al., 2020). Consequently, the computational burden is significantly amplified. This computational intensity often creates a dilemma, as it competes with other priorities in numerical modeling, such as enhancing spatial resolution (Gu et al., 2022), recognized as helpful for minimizing uncertainties in numerical models. Numerous numerical models opt for simplified or even deactivated ACI schemes during long-term simulations, particularly in high-resolution atmospheric and climate models, introducing considerable uncertainties in the simulation results (Lee et al., 2016; Zhang et al., 2020). Consequently, there is a pressing need to achieve rapid, accurate, and stable simulation of ACI within numerical models.

Over the past few decades, extensive research efforts have been dedicated to striking a tradeoff between accuracy and computational efficiency in simulating ACI. Researchers have primarily approached this challenge from two distinct perspectives: one is the exploration of various methodologies for describing the evolution of PSD. For instance, in the discrete model, the PSD is divided into discrete sizes, with calculations performed for each individual size. This approach yields the most precise results but also demands the highest computational resources (Landgrebe and Pratsinis, 1990; Zhao et al., 2013b). The moment model, which tracks the lower-order moments of an unknown aerosol distribution, is particularly well-suited for scenarios where the size distribution is lognormal (Pratsinis, 1988). Concurrently, researchers have been engaged in employing diverse methodologies to solve the system of stiff differential equations. For example, the Multicomponent Taylor Expansion Method (MTEM) has been developed to compute activity coefficients in aqueous atmospheric aerosols (Zaveri et al., 2005). This method offers an efficient noniterative solution for systems rich in sulfate aerosols. The Adaptive Step Time-split Euler Method (ASTEM) leverages several key characteristics of the atmospheric gas-particle partitioning, systematically reducing stiffness while preserving the integrity of the numerical solution (Zaveri et al., 2008). Despite these advancements significantly improving the computational efficiency of simulating ACI, current progress remains far from sufficient.

An alternative approach is to utilize artificial intelligence (AI) schemes to replace conventional numerical schemes in atmospheric and climate models, which could potentially bring about a transformative impact. Recent studies by Liu et al. (2021) developed an AI scheme based on a Residual Neural Network (ResNet) algorithm for simulating atmospheric photochemistry, achieving a nearly 10.6× increase in computational efficiency. However, they adopted a hybrid approach, combining a numerical scheme for radicals and oxidants with an AI scheme for volatile organic compounds (VOCs). Kelp et al. (2022) employed an online training strategy to refine an AI scheme for a simplified super-fast chemistry scheme (12 species) in atmospheric models, achieving stable simulations over a year with a nearly 5× speedup. However, their approach required training four separate AI emulators for each season and simulation discontinuities (characterized by abrupt changes in simulation errors) were observed when switching between seasonal AI emulators. Sharma et al. (2023) developed a physics-informed AI approach to study isoprene epoxydiols in acidic aqueous aerosols over the Amazon rainforest, halving computational costs but requiring the training of separate AI models for each size bin. Xia et al. (2025) have taken a step further by developing an artificial intelligence photochemistry (AIPC) scheme, leveraging the Multi-Head Self-Attention (MHSA) algorithm to simulate a full complex atmospheric photochemistry (79 species) with a unified AI model. When coupled with three-dimensional (3D) numerical models, their approach not only reliably simulates the continuous spatiotemporal evolution of 79 chemical species over 15 consecutive days but also improves computational efficiency by 77 %.

While these studies have demonstrated AI schemes' capability to capture nonlinear chemical relationships and reproduce complex spatiotemporal distributions, no current AI scheme has achieved fast, accurate, and stable end-to-end simulation of full ACI within 3D atmospheric models. Unlike atmospheric photochemistry, which only involves chemical reactions between species, the full ACI encompasses multiple other intricate processes. Furthermore, since the PSD of an aerosol significantly influences aerosol behavior, an accurate depiction of the evolution of PSD is as critical as the precise simulation of aerosol species concentration. These requirements collectively present a heightened challenge for AI scheme development. Consequently, the fundamental question of whether an AI scheme can effectively replace traditional numerical schemes for full ACI simulation – while simultaneously achieving high fidelity in spatiotemporal distributions and substantial improvements in computational efficiency – remains unresolved in atmospheric modeling research.

To bridge this gap, in this study, we have developed a novel Artificial Intelligence Model for Aerosol Chemistry and Interactions, termed AIMACI, which is based on the Multi-Head Self-Attention algorithm and has been coupled online with a 3D numerical atmospheric model. As a first step, this study primarily focuses on inorganic aerosols because they constitute significant amounts of secondary aerosols globally and serve as the important driver of aerosol radiative forcing and cloud condensation nuclei activity in current climate models. Given that the production of secondary organic aerosols (SOAs) involves significantly more complex chemical pathways, encompassing a wider array of precursor species and heterogeneous reaction mechanisms, the current AIMACI does not include them. Their incorporation will be considered in the future development of AIMACI. To validate the accuracy, stability, and computational efficiency of the AIMACI scheme, we conducted a series of experiments for both offline simulations (where the AIMACI scheme was not coupled to a numerical model) and online simulations (where the AIMACI scheme was coupled to a numerical model). The structure of this paper is organized as follows: Sect. 2 provides a detailed description of the WRF-Chem model and the establishment of the AIMACI. Section 3 discusses the results, and Sect. 4 presents the conclusion, outlining the implications of our findings for the field.

2 Methods

2.1 WRF-Chem model and MOSAIC scheme

In this study, we utilize the updated version of WRF-Chem developed by the University of Science and Technology of China (USTC) for conducting all simulations. This USTC version of WRF-Chem boasts additional functionalities compared with the publicly released version, including the capability to diagnose radiative forcing of aerosol species, land-surface-coupled biogenic volatile organic compound emissions, and aerosol–snow interactions (Du et al., 2020; Hu et al., 2019; Zhang et al., 2021; Zhao et al., 2013a, b, 2014, 2016).

The conventional numerical scheme adopted in this study for full ACI simulation is the MOSAIC scheme (Zaveri et al., 2008), coupled with the CBM-Z (Carbon Bond Mechanism version Z) photochemistry scheme (Zaveri and Peters, 1999). The MOSAIC scheme stands out for its innovative approach to address the long-standing issues in solving the dynamic partitioning of semivolatile inorganic gases (HNO₃, HCl, and NH₃) to size-distributed atmospheric aerosol particles. It has been validated against a benchmark model version utilizing a rigorous solver for the integration of stiff differential equations, demonstrating both computational efficiency and high fidelity (Zaveri et al., 2008). The MOSAIC scheme used in this study features four discrete size bins (0.039–0.156, 0.156–0.625, 0.625–2.5, and 2.5–10.0 µm in diameter) and treats all the major aerosol species important at urban, regional, and global scales, including sulfate (SO $_{4}^{2 -}$ ), nitrate (NO $_{3}^{-}$ ), chloride (Cl⁻), carbonate (CO $_{3}^{2 -}$ ), ammonium (NH4⁺), sodium (Na⁺), calcium (Ca²⁺), black carbon (BC), organic carbon (OC), other inorganic mass (OIN), mineral dust, methanesulfonic acid (MSA), and liquid water content of aerosol (water). It also considers the impact of marine biogenic sources of dimethyl sulfide on atmospheric aerosols and some aqueous processes. The chemical reactions among various species are detailed in Zaveri et al. (2008). However, secondary organic aerosols (SOAs) and complex heterogeneous chemical processes (e.g., oxidation of dissolved S(IV) by H₂O₂, O₃, NO₂, and O₂ catalyzed by transition metal ions (TMIs) in aerosol water, Ruan et al., 2022) are not included in the chosen MOSAIC scheme in this study. These aspects will be considered in future development.

2.2 Learnable AIMACI scheme

2.2.1 Scheme construction

Previous attempts to substitute conventional numerical schemes with AI schemes have predominantly utilized simple AI algorithms, such as random forest regression (Keller and Evans, 2019). This preference stems from the challenge of coupling sophisticated AI algorithms, often written in Python, with numerical models coded in Fortran. While some studies have explored the use of advanced AI algorithms, such as fully connected neural networks (Sharma et al., 2023) and residual neural networks (Kelp et al., 2018, 2020, 2022; Liu, 2021; Z. Wang et al., 2022), these have occasionally encountered difficulties when dealing with high-dimensional input variables and have demonstrated limitations in accurately simulating highly nonlinear systems (Xia et al., 2025).

In this study, for the first time, we attempt to use an AI algorithm to achieve end-to-end simulation of the full ACI within a 3D atmospheric numerical model, replacing the chosen MOSAIC numerical scheme to improve computational efficiency. Given the complexity of multiple subprocesses involved in full ACI, there is a clear need for AI algorithms with superior representational capacity for nonlinear systems. The Multi-Head Self-Attention (MHSA) mechanism, a pivotal component of state-of-the-art transformer architectures, has demonstrated exceptional performance across diverse domains such as natural language processing (Vaswani et al., 2017), computer vision (Liu et al., 2021), and weather forecasting (Bi et al., 2023). Additionally, in the development of an artificial intelligence photochemistry scheme, Xia et al. (2025) have highlighted that the MHSA algorithm excels in capturing the intricate chemical relationships among different species through calculating attention weights. Not only does it offer high accuracy and computational efficiency but it is also less susceptible to the increase in the number of chemical species. Building upon these advancements, this study leverages the MHSA algorithm to develop an Artificial Intelligence Model for Aerosol Chemistry and Interactions (AIMACI).

Figure 1 illustrates the development strategy of the AIMACI scheme and its integration into our hybrid atmospheric model with physics and AI schemes (physics-AI hybrid model). The corresponding pseudocode schematic diagram for replacing the traditional numerical scheme with the AIMACI scheme in WRF-Chem is provided in Fig. S2. The AI model architecture shown in Fig. 1 is designed with three main components, each serving a distinct function in the simulation process:

Input embedding layer. This initial layer receives meteorological variables and chemical species as input features. The input embedding layer is designed as a fully connected layer, which maps the input data into a higher-dimensional space where interdependencies between variables can be more effectively captured.
Integrator. As the core of the AI model, it is composed of two identical blocks, each of which contains two sublayers: an attention layer and a feed-forward layer. We apply residual connections around each of these two sublayers, followed by layer normalization. This integrator is responsible for learning the complex and high nonlinear processes of ACI within the data and integrating them over time.
Output representation layer. Following the integrator, it is also implemented as a fully connected layer. This layer translates the processed information from the integrator into chemical concentrations, providing the output targets for the simulation.

Furthermore, the AI model is complemented by pre-processing and post-processing steps, such as min-max normalization, to constitute the comprehensive AIMACI scheme. The trained AIMACI scheme was packaged into a static (or dynamic) library and then coupled into WRF-Chem, utilizing TorchScript and LibTorch tools officially provided by PyTorch. Compared with coupling approaches relying on third-party libraries, such as CFFI (C Foreign Function Interface for Python), our method demonstrates two distinct technical advantages:

Enhanced computational efficiency. TorchScript and LibTorch, as core components of the PyTorch ecosystem, are specifically optimized for AI model deployment in C $+ +$ environments. This optimization reduces computational overhead compared with generic third-party libraries.
Streamlined implementation. PyTorch's official documentation provides standardized workflows for serializing trained AI models into static or dynamic libraries via TorchScript and LibTorch.

In contrast, third-party libraries lack native support for AI model deployment, necessitating manual reimplementation of low-level interfaces. Therefore, our coupling approach minimizes alterations to the original codebase and offers a lightweight, adaptable, and modular solution. It is capable of encapsulating a wide range of complex AI algorithms and coupling them with atmospheric and climate models.

https://acp.copernicus.org/articles/25/6197/2025/acp-25-6197-2025-f01

Figure 1The Artificial Intelligence Model for Aerosol Chemistry and Interactions (AIMACI) in Weather Research and Forecasting with Chemistry (WRF-Chem). The trained AIMACI is packaged into a static/dynamic library using TorchScript and LibTorch, and can be called by WRF-Chem through an interface to replace the numerical scheme for simulating aerosol chemistry and interactions, while the remaining processes maintain the original numerical scheme.

Download

2.2.2 Training and testing procedure

To generate the training, validation, and test datasets, we conducted the WRF-Chem simulations over East China, spanning the period from 1 March 2019 00:00 UTC to 19 March 2019 23:00 UTC. The simulation result was segmented as follows: the initial 16 d, from 2 March 2019 00:00 UTC, were designated as the training set; the penultimate day served as the validation set; and the final day constituted the test set. The simulation was configured with a 0.2° horizontal resolution, covering 140 × 105 grid cells within the geographical bounds of 107.1 to 127.9° E and 19.7 to 47.5° N, and featured 49 vertical layers extending up to 50 hPa. A dynamic time step of 2 min and a chemical time step of 1 h were employed. For emission and meteorological fields, we used the Multi-resolution Emission Inventory for China (MEIC) at 0.25°×0.25° resolution for 2019 (Li et al., 2017; Geng et al., 2024), and the NCEP final reanalysis (FNL) data (NCEP, 2000) with a 1°×1° resolution and 6 h temporal resolution within the simulation domain. Concentrations of aerosol and gas species pertinent to gas-particle partitioning were recorded hourly, along with key meteorological variables influencing chemistry: temperature, pressure, air density, and water vapor mixing ratio. A comprehensive list of variables used for training the AIMACI scheme is presented in Table 1. Due to computational cost considerations, the period of the training dataset is not very long; however, the volume of training samples is large due to the hourly chemical time step and the fine spatial resolution of our simulation. With 140 by 105 grid cells, 49 vertical layers, and 24 h in a day, the total number of training samples amounts to 276 595 200 (140 × 105 × 49 × 24 × 16), reaching the hundred million scale. This large dataset provides a rich and diverse set of samples for training, ensuring that the AI model does not suffer from a lack of convergence due to insufficient data.

In the training of the AI model we built from scratch, each training sample included 65 input features (4 meteorological variables, 5 gas species, and 14 aerosol species with 4 size bins) and 61 output targets (5 gas species and 14 aerosol species with 4 size bins). All features and targets underwent min-max normalization to standardize the data. We employed the PyTorch deep learning framework for model training, with a batch size of 2048, an initial learning rate of 0.001, and the Adam optimization algorithm. The mean squared error (MSE) was used as the loss function. To optimize the training process, we implemented a learning rate decay strategy using the ReduceLROnPlateau scheduler, along with an early stopping mechanism after 10 consecutive epochs without improvement in the validation loss. All other hyperparameters not mentioned are kept at their default values. For this study, we trained the model using three GPUs for approximately 1 d, and the model achieved optimal performance at epoch 32.

To comprehensively evaluate the performance of the trained AIMACI scheme, we conducted a series of additional experiments in both offline and online modes, as detailed in Table 2. These experiments included the following:

Offline simulations were performed on the test dataset without coupling the AIMACI scheme into WRF-Chem, treating AIMACI as a standalone box model to assess its single-step performance.
Online simulations were conducted for a 10 d continuous period outside the training phase, following the coupling of the AIMACI scheme into WRF-Chem, where it is invoked at each chemical time step and interacts with other model processes to create a dynamic and iterative aerosol simulation.
A month-long online continuous simulation was carried out under different environmental conditions across all four seasons to evaluate the AIMACI scheme's generalization ability.

In order to quantify the performance of the AIMACI scheme, we evaluated statistical indicators for all species and a focused examination of spatial distributions, temporal series, and the evolution of particle size distribution (PSD) for four representative aerosol species. The four selected species are sulfate, nitrate, liquid water content in aerosols, and aerosol number concentration. Sulfate, mainly from fossil fuel emissions, contributes to acid rain, aerosol formation, and aerosol–cloud interactions (Calvert et al., 1985; Fuzzi et al., 2015; Penkett et al., 1979). Nitrate, formed from nitrogen oxides, is a key aerosol component that affects air quality and ecosystems (Parrish et al., 2012; Saiz-Lopez et al., 2017). Liquid water content in aerosols is important for understanding how particles contribute to cloud formation and precipitation (Hodas et al., 2014; Liu et al., 2019; Nguyen et al., 2016; Wu et al., 2018). Aerosol number concentration serves as a key metric for assessing aerosol loading and its direct impact on visibility, radiation balance, and climate feedback mechanisms (Spracklen et al., 2010). Collectively, these four species offer a holistic perspective on the multifaceted role of aerosols in atmospheric processes.

Table 1Input and output variables of Artificial Intelligence Model for Aerosol Chemistry and Interactions (AIMACI).

Download Print Version | Download XLSX

Table 2Numerical experiments conducted in this study.

Download Print Version | Download XLSX

2.3 Evaluation metric

In this research, a comprehensive evaluation of the AIMACI scheme's effectiveness was conducted utilizing five recognized statistical measures. For every species examined, calculation of the Pearson correlation coefficient (R²), the root mean square error (RMSE), the normalized mean bias (NMB), the absolute error (AE), and relative error (RE) was performed:

\begin{array}{l} (1) & ↑ (R^{2}) & = \frac{{(\sum_{i = 1}^{N_{lat}} \sum_{j = 1}^{N_{long}} L (i) (c_{i, j} - \overline{c}) ({\hat{c}}_{i, j} - \overline{\hat{c}}))}^{2}}{\begin{matrix} \sum_{i = 1}^{N_{lat}} \sum_{j = 1}^{N_{long}} L (i) {(c_{i, j} - \overline{c})}^{2} \\ \times \sum_{i = 1}^{N_{lat}} \sum_{j = 1}^{N_{long}} L (i) ({\hat{c}}_{i, j} - \overline{\hat{c}})^{2} \end{matrix}}, \\ (2) & ↓ RMSE & = \sqrt{\frac{\sum_{i = 1}^{N_{lat}} \sum_{j = 1}^{N_{long}} L (i) {({\hat{c}}_{i, j} - c_{i, j})}^{2}}{N_{lat} \times N_{long}}}, \\ (3) & ↓ NMB & = \frac{\sum_{i = 1}^{N_{lat}} \sum_{j = 1}^{N_{long}} L (i) ({\hat{c}}_{i, j} - c_{i, j})}{\sum_{i = 1}^{N_{lat}} \sum_{j = 1}^{N_{long}} L (i) c_{i, j}}, \\ (4) & ↓ AE & = \hat{C} - C, \\ (5) & ↓ RE & = \frac{\hat{C} - C}{C} . \end{array}

In these equations, $L (i) = N_{lat} \times \cos φ_{i} / \sum_{i = 1}^{N_{lat}} \cos φ_{i}$ is the weight at latitude φ_i, N_lat is the number of grid points in the latitudinal direction, N_long is the number of grid points in the longitudinal direction, $\hat{C}$ denotes the concentration simulated by the AIMACI scheme, C denotes the concentration simulated by the MOSAIC scheme, ↑ denotes higher values are better, ↓ denotes lower values are better.

2.4 Computational configuration

A primary incentive for coupling AI schemes into atmospheric and climate models is the pursuit of substantial computational acceleration. However, such acceleration is not inherently guaranteed, as demonstrated by Keller and Evans (2019). Consequently, it is imperative to meticulously compare the temporal expenditure of AI schemes against those of traditional numerical schemes.

In this study, we undertook a comparative analysis of the computational time required by the numerical scheme and the AIMACI scheme for simulating ACI in 720 300 discrete grid cells, which roughly corresponds to a global simulation at 2.5° × 2.5° horizontal resolution with 72 vertical layers. To ensure a holistic and unbiased assessment of the speedup achieved, we measured the computational time by averaging the duration of 24 consecutive daily simulations. Both schemes were tested utilizing a single CPU core; we additionally evaluated the AIMACI scheme with a GPU-accelerated scenario using a single GPU. The computational hardware employed in our tests consisted of an Intel-Xeon-E5-2680 2.40 GHz CPU core and an NVIDIA A100-80G GPU.

3 Results

3.1 Offline single-step simulations with the AIMACI scheme

Before coupling the AIMACI scheme with the 3D numerical model WRF-Chem for continuous simulation, we first evaluated its performance on a test dataset that was separate from the training data. The test dataset, as detailed in previous sections, comprises a series of 3D spatial outcomes taken at 24 hourly intervals, on 19 March 2019. It provides representative samples that span a wide range of meteorological conditions and species concentrations. The evaluation on this dataset provides insight into the AIMACI scheme's performance in various atmospheric conditions. Table 3 presents the statistical metrics for all simulated species, offering a comprehensive assessment of the scheme's simulation capabilities.

Quantitative evaluation across 61 output targets demonstrates close alignment between the simulations using the AIMACI scheme and the MOSAIC scheme (hereinafter referred to as the numerical scheme), as evidenced by an average R² of 0.98 and an average NMB of 3.02 %. While 55 output targets, including major inorganic aerosols, such as sulfate, nitrate, and ammonium, achieve R²≥ 0.95, there are still a few species exhibiting relatively poorer statistical metrics, particularly carbonates (Table 3). To delve deeper into this observation, we have plotted the frequency histograms of the concentration distributions for all species in the test data (Fig. S3). Our analysis revealed that species with skewed concentration distributions, particularly those where more than 99 % of the values are close to zero, tend to exhibit poorer statistical indicators. However, this does not signify that the AIMACI scheme has entirely forfeited its predictive capability. As demonstrated in Fig. S4, which illustrates the simulated carbonate concentrations in the 0.625–2.5 µm particle size range (CO₃_a03), the AIMACI scheme continues to perform well in predicting concentration changes in high-value regions. The poorer statistical indicators are primarily attributed to the challenge of accurately forecasting the very low values that are close to zero.

Table 3Statistical metrics on the test dataset of Artificial Intelligence Model for Aerosol Chemistry and Interactions (AIMACI). (The RMSEs of different species have different units: aerosol (µg kg⁻¹), num (kg⁻¹), gas (ppmv).)

Download Print Version | Download XLSX

Figure 2 presents the data density and distribution for column concentration of the four key aerosol species selected in Sect. 2.2.2. The results from the numerical scheme simulations indicate that sulfate, nitrate, and liquid water content of aerosol exhibit higher column concentrations within 0.156 to 0.625 µm (size bin 2), whereas the number concentration is notably larger within 0.039 to 0.156 µm (size bin 1). This indicates that, despite the greater number of smaller particles in size bin 1, their overall contribution to the total mass is less significant due to their lower individual mass compared with the larger particles in size bin 2. The AIMACI scheme effectively captures these nuanced aerosol characteristics, as corroborated by the R² of 1.0 depicted in Fig. 2, which underscore the scheme's fidelity in modeling aerosol behavior across various particle sizes.

https://acp.copernicus.org/articles/25/6197/2025/acp-25-6197-2025-f02

Figure 2The density plot of column concentration of four key aerosol species (sulfate (SO $_{4}^{2 -}$ ), nitrate (NO $_{3}^{-}$ ), liquid water content of aerosol (water), and number concentration of aerosol (Num)) simulated by the AIMACI scheme on the test dataset. The results are calculated by covering the region spanning from 109.1 to 125.9° E and from 22.1 to 44.9° N, with the data being averaged across the time dimension. The black line is the identical line (y=x), and the red line is the fitted line.

Download

3.2 Online multi-step simulations with the AIMACI scheme

The integration of the AIMACI scheme into 3D atmospheric models to build a physics-AI hybrid model presents unique challenges beyond offline single-step simulation. Online continuous simulation involves not only iterative computation across multiple time steps but also interactions and feedback with numerous other processes in the model. Consequently, a thorough evaluation of the AIMACI scheme's online simulation performance is essential. We focus on its performance across three critical dimensions:

Stable and accurate simulation capability. The AIMACI scheme should accurately reproduce the spatiotemporal and size distribution of various aerosol species without rapid accumulation of errors during the simulation process.
Robust generalization ability. The AIMACI scheme should be applicable to scenarios beyond the training data, such as different seasons, demonstrating its robustness in a variety of environmental conditions.
High computational efficiency. Compared with the conventional numerical scheme, the AIMACI scheme should offer enhanced computational efficiency, which is vital for high-resolution, long-term simulations.

Although these requirements are often challenging to satisfy simultaneously, achieving these benchmarks is crucial for leveraging the full potential of the AIMACI scheme in advancing our understanding of aerosol interactions.

3.2.1 Stable and accurate simulation capability

Coupling AI schemes into numerical models for stable and accurate simulations across multiple time steps has long been a significant challenge. While the simulation errors for individual species at each time step may be minimal, they can accumulate over multiple time steps, and may even spread to other species and physical-chemical processes, leading to chaotic simulation outcomes at the end. Typically, simulations with sophisticated aerosol processes at high resolution, such as those in WRF-Chem, are limited to a few weeks due to computational costs. In this study, we conducted a 10 d continuous simulation from 20 March 2019 00:00 UTC to 30 March 2019 00:00 UTC to evaluate the performance of the AIMACI scheme in a coupled mode.

https://acp.copernicus.org/articles/25/6197/2025/acp-25-6197-2025-f03

Figure 3Sulfate column concentration simulations across different size bins. The first and second columns depict the spatial distribution at the 10 d continuous simulation's end (30 March 2019 00:00 UTC), as simulated online by both the numerical scheme and the AIMACI scheme. The third column is the absolute error between them. The fourth column shows the temporal evolution of the hourly RMSE over the 10 d period. The mean RMSE (unit: mg m⁻²) for all days and the slope (unit: mg m⁻² h⁻¹) for different simulation stages (2–4, 5–7, 8–10 d) are given.

https://acp.copernicus.org/articles/25/6197/2025/acp-25-6197-2025-f04

Figure 4Zonal mean total concentrations (summed across four size bins) of sulfate and nitrate between 109.1 and 125.9° E, as simulated online by both the numerical scheme and the AIMACI scheme. Results are averages over the entire 10 d simulation period.

Download

Figure 3 illustrates the spatial distribution of sulfate column concentrations across different size bins at the end of the 10 d continuous simulation (i.e., 30 March 2019 00:00 UTC), accompanied by the temporal evolution of RMSE during the simulation period. The results reveal that the high-value areas of sulfate column concentrations for different particle sizes exhibit a hook-like structure, extending northeastward from the Yangtze River Economic Belt to the northeastern regions of China. The distinct patterns may be attributed to the complex interplay of meteorological conditions, emission sources, and atmospheric transport processes. The sulfate column concentrations are predominantly concentrated in the range 0.156 to 0.625 µm (size bin 2), with relatively lower column concentrations in the range 2.5 to 10 µm (size bin 4), which is consistent with the findings in Fig. 2. AIMACI reproduces these spatial distribution patterns with strong fidelity, as evidenced by R² exceeding 0.88 across all size bins. However, a systematic bias pattern is observed that, for each particle size, AIMACI tends to underestimate the higher concentration regions and overestimate lower values. This aligns with the findings of Kelp et al. (2022), who report a similar bias pattern in MSE-optimized emulators for photochemistry simulation. We attribute this bias pattern to two main factors: (1) the imbalance in the training dataset, where high-value samples are underrepresented compared with low-value samples, leading to insufficient learning of high-value instances by the AI model; and (2) the use of the RMSE (or MSE) loss function, whose quadratic term penalizes large errors more heavily, thereby biasing the model toward predicting the mean of the target distribution to minimize the overall loss (Gneiting, 2011). While alternative loss functions (e.g., Huber loss) may reduce this bias, RMSE (or MSE) remains widely used in autoregressive simulation studies. One of the key rationales stems from its inherent capability to effectively prevent large errors, which is critical for ensuring the stability of long-term simulations. To alleviate this bias pattern, we plan to explore data transformation techniques to reduce the skewness of the training data distribution in future work. This approach may help alleviate the bias without compromising the stability of the AIMACI.

Additionally, RMSE temporal evolution reveals that, during the entire simulation period, rapid growth is not exhibited and oscillations are maintained within a constrained range. The analysis of the RMSE trend slopes across different simulation stages for four size bins demonstrates nonuniform error progression patterns (e.g., in Fig. 3l, the slope for the 2–4 d period is 0.000959 mg m⁻² h⁻¹, while the slope for the 8–10 d period is −0.000127 mg m⁻² h⁻¹) where, even within identical simulation phases, distinct size bins of the same species manifest divergent error trends (e.g., during the 8–10 d period, the slopes for different size bins are 0.00204, 0.00187, −0.000127, and −0.0000213 mg m⁻² h⁻¹, respectively). The emergence of this complex error variation is related to the dual influence governing each grid point error in online continuous simulations: (1) the inherent limitations of the AIMACI scheme, which achieves accurate simulations when encountering well-learned input feature combinations, while exhibiting degraded performance under insufficiently trained input patterns; and (2) the compound error propagation mechanisms during continuous simulations, where input biases of species concentrations at each time step are affected by three factors – local error accumulation from preceding steps, error propagation through transport processes in numerical models from neighboring grids, and perturbations induced by other processes in numerical models (e.g., dry deposition and wet scavenging). Consequently, input biases exhibit nonlinear variability, with AIMACI's simulation accuracy being inversely correlated to input error magnitudes. In operational implementations of physics-AI hybrid models for online simulations, the influences of these two factors are often interrelated rather than independent, thereby amplifying the complexity of error variation. Introducing an error-correcting operator in the continuous simulation process may enhance the stability of long-term simulations.

Figure 4 presents a comparison of the zonal average total concentrations (summed across all size bins) of sulfate and nitrate, simulated by both the numerical scheme and the AIMACI scheme, with results averaged over the entire 10 d simulation period. Figure 4a and e reveal that high-concentration zones for both sulfate and nitrate are predominantly located between 25 and 40° N, a latitudinal band encompassing the Yangtze River Economic Belt. This spatial distribution is consistent with the region's substantial anthropogenic emissions. Through turbulent mixing and convective transport processes, sulfate and nitrate are vertically transported from lower to higher altitudes, with concentrations exhibiting a characteristic decay profile as altitude increases. Figure 4b and f demonstrate strong agreement between the AIMACI scheme and the numerical scheme, with R² reaching 0.99 for both species. The RMSE further confirms this alignment, measuring 0.10 µg kg⁻¹ for sulfate and 0.48 µg kg⁻¹ for nitrate. The absolute error exhibits a vertical gradient, with larger errors near the surface and smaller errors at higher altitudes. Conversely, the relative error distribution shows an inverse pattern, with lower relative errors near the surface and higher relative errors at elevated altitudes. This behavior arises because species concentrations at higher altitudes are typically below 1.0 µg kg⁻¹, and the relative error calculation divides the absolute error by these low concentration values. Consequently, even small absolute perturbations at high altitudes can result in significantly large relative errors. This effect is particularly pronounced for nitrate, which exhibits concentrations at higher altitudes that are one to two orders of magnitude lower than sulfate, often approaching zero.

https://acp.copernicus.org/articles/25/6197/2025/acp-25-6197-2025-f05

Figure 5Time series of surface total concentrations (summed across 4 size bins) of four key aerosol species (sulfate, nitrate, liquid water content of aerosol, and number concentration of aerosol), as simulated online by both the numerical scheme and the AIMACI scheme. Results represent the calculated averages for the Yangtze River Delta region (119.1–121.9° E, 30.1–31.9° N). The gray vertical lines mark the time intervals from 0 to 24 h in Beijing Time for each corresponding day.

Download

https://acp.copernicus.org/articles/25/6197/2025/acp-25-6197-2025-f06

Figure 6Frequency distributions of different particle sizes for the surface concentrations of four key aerosol species (sulfate (SO4), nitrate (NO₃), liquid water content of aerosol (water), and number concentration of aerosol (Num)), as simulated online by both the numerical scheme and the AIMACI scheme. The last column showcases the particle size distributions of these key aerosol species surface concentrations. The results are calculated by covering the region spanning from 109.1 to 125.9° E and from 22.1 to 44.9° N, with the data being averaged over the entire simulation period.

Download

Figure 5 illustrates the ability of the AIMACI scheme to reproduce temporal variations of surface total concentrations of four key aerosol species. These results represent the calculated averages for the Yangtze River Delta region, a crucial urban agglomeration in China, spanning the coordinates 119.1 to 121.9° E and 30.1 to 31.9° N. Throughout the simulation period, sulfate concentrations primarily fluctuate within the range 0 to 6 µg kg⁻¹, while nitrate concentrations exhibit a broader variability, predominantly ranging from 0 to 20 µg kg⁻¹. Notably, all four key aerosol species experience several instances of abrupt concentration spikes and declines. For instance, between the 11th and 30th hours of the simulation, the liquid water content of aerosol experiences a dramatic increase from 32.86 to 263.47 µg kg⁻¹, followed by a sharp decrease to 28.72 µg kg⁻¹. The occurrence of these pronounced fluctuations may be related to the following factors:

Variations in meteorological conditions, such as changes in wind speed and direction, can significantly influence the dispersion and transport of aerosols. Increased wind speed or frequent shifts in wind direction may lead to more pronounced concentration fluctuations. Additionally, changes in humidity can affect the wet scavenging and hygroscopic growth of aerosols. Precipitation events, in particular, can enhance the wet scavenging effect, leading to a significant reduction in aerosol concentrations.
Variations in anthropogenic emissions, such as differences in human activities during weekdays and weekends, can also contribute to changes in aerosol concentrations.

A more detailed analysis would be required to identify the specific causes. Despite these pronounced fluctuations, the AIMACI scheme adeptly reproduces these features without introducing systematic bias, achieving R² values larger than 0.97.

The aforementioned analyses have examined the AIMACI scheme's capability in capturing the spatiotemporal distribution and variation trends of major aerosol species. Here, we further assess the scheme's performance in accurately reproducing the evolution of the aerosol particle size distribution (PSD), given that particle size critically governs aerosol radiative properties and cloud nucleation efficiencies – key determinants of atmospheric energy balance and hydrological cycles. Figure 6 presents the PSD and frequency distribution of the surface concentrations for four key aerosol species. The frequency distributions of sulfate and nitrate surface concentrations exhibit a relatively uniform pattern, whereas the liquid water content and number concentration of aerosols display extreme values, leading to pronounced skewness in their distributions. Additionally, there are significant differences in the PSD among the aerosol species. Sulfate and nitrate concentrations peak within the range 0.156 to 0.625 µm (size bin 2), whereas the liquid water content of aerosols is concentrated in the range 2.5 to 10.0 µm (size bin 4) and the number concentration of aerosols is predominantly found in the range 0.039 to 0.156 µm (size bin 1). These findings underscore the challenge of accurately modeling the evolution of the aerosol PSD. However, the AIMACI scheme accurately captures these distributions, highlighting its capability in characterizing complex nonlinear processes.

https://acp.copernicus.org/articles/25/6197/2025/acp-25-6197-2025-f07

Figure 7Monthly average surface total concentrations (summed across four size bins) of nitrate for different environmental conditions across seasons, as simulated online by both the numerical scheme and the AIMACI scheme.

https://acp.copernicus.org/articles/25/6197/2025/acp-25-6197-2025-f08

Figure 8Time series of surface total concentrations (summed across four size bins) of nitrate for different environmental conditions across seasons, as simulated online by both the numerical scheme and the AIMACI scheme. The gray bar is the difference between the two schemes. Results represent the calculated averages for the Yangtze River Delta region (119.1–121.9° E, 30.1–31.9° N).

Download

3.2.2 Robust generalization ability

Following validation of the AIMACI scheme's performance in simulating the 3D spatiotemporal distributions and PSD of major aerosol species concentrations, we assess its generalization ability under diverse environmental conditions, a critical aspect for its future integration into climate models to mitigate uncertainties stemming from oversimplified or absent aerosol processes. To evaluate this, we implemented a comparative analysis of 1 month simulations for each of the four seasons – spring, summer, autumn, and winter – between the numerical scheme and the AIMACI scheme. This comprehensive assessment helps to enhance our understanding of how the AIMACI scheme's performance varies under different meteorological conditions. It also provides insights for future model optimization to improve the stability of long-term simulations.

Figure 7 shows monthly average surface nitrate concentrations simulated by AIMACI across four seasons, with absolute and relative error plots. Nitrate concentrations exhibit seasonal variations, with higher values in January and lower values in July. This pattern corresponds to increased winter heating emissions and enhanced summer atmospheric dispersion from the East Asian monsoon. The spatial distribution of nitrate concentrations shows agreement with the numerical scheme (R² ≥ 0.93), despite AIMACI being trained on only 16 d of March data. Figure 8 displays hourly time series of nitrate concentrations in the Yangtze River Delta (119.1–121.9° E, 30.1–31.9° N). The results reveal that the simulation for January experiences more pronounced concentration fluctuations, with peaks exceeding 30 µg kg⁻¹, while other months' simulations exhibit more rapid concentration variations. Simulating hourly concentration changes poses a greater challenge than simulating multi-day average concentrations, as the latter allows for the offsetting of positive and negative errors. Nonetheless, the AIMACI scheme demonstrates small biases from the numerical scheme (RMSE ≤ 2.71 µg kg⁻¹) and maintains stable performance throughout continuous simulations, with absolute errors oscillating within ±5 µg kg⁻¹ and occasionally approaching zero.

Additionally, we note that the simulation errors in July are notably higher than in other months. Considering that July is a typhoon-prone season, while our training data only include partial data from March, we hypothesize that the lack of representation of such distinct seasonal weather events in our training data may contribute to the larger bias in July. To test this hypothesis, we plotted the minimum values of the mean sea level pressure (MSLP) in the training data and simulation results of different months, as shown in Fig. 9. The time series of the minimum MSLP for different months exhibit some differences from the training data, with July showing the most significant difference. This may be one of the contributing factors leading to the overall suboptimal simulation performance in July. Moreover, we observed the three lowest MSLPs in the time series for April, July, and October, which likely correspond to typhoon events, as indicated by the red points in Fig. 9. Since the lowest MSLP in October is at the start of the simulation, where the concentration of pollutants is almost zero, it may not be diagnostic of simulation bias. Therefore, we additionally selected a relatively later time point, combined with the moments corresponding to the three lowest values, and plotted the spatial distribution of nitrate surface concentrations at these four moments, along with corresponding MSLP and wind fields, as shown in Fig. 10. These results reveal that the lowest values in both July and October correspond to typhoons over the western Pacific, while the lowest value in April corresponds to a low-pressure system over land. The absolute error plot in Fig. 10 shows that, in the regions affected by typhoons, the simulation errors are significantly increased (Fig. 10h, p), whereas the accuracy of the AIMACI scheme in simulating the low-pressure system in April is not significantly impacted. This suggests that, while the AIMACI scheme demonstrates quantifiable generalization capability across diverse environmental conditions, its performance significantly degrades when facing significantly different seasonal weather events, such as typhoons. To surmount these limitations and bolster the precision of the AIMACI scheme, future iterations should consider integrating a more comprehensive and diverse training dataset that accounts for a broader spectrum of environmental conditions, notably incorporating additional seasonal meteorological phenomena, such as typhoons. The augmentation of the training dataset would facilitate the fine-tuning of the AIMACI scheme, thereby enhancing its reliability in delivering robust simulation outcomes.

https://acp.copernicus.org/articles/25/6197/2025/acp-25-6197-2025-f09

Figure 9Time series of the minimum of mean sea level pressure (MSLP) in the training data and simulation results of different environmental conditions across seasons. The red points represent the lowest MSLP in the corresponding time series and the date of its occurrence. Since the lowest MSLP in October is at the start of the simulation, where the concentration of pollutants is almost zero, it may not be diagnostic of simulation bias. Therefore, we additionally selected a relatively later time point, as shown by the green point.

Download

https://acp.copernicus.org/articles/25/6197/2025/acp-25-6197-2025-f10

Figure 10The spatial distribution of nitrate surface concentrations at four specified time points, along with corresponding mean sea level pressure (MSLP) and wind fields. The black box corresponds to the area with the lowest MSLP.

3.2.3 High computational efficiency

A primary motivation for the development of the AIMACI scheme is the potential for increased computational efficiency offered by AI schemes, compared with conventional numerical schemes. However, past research has indicated that such computational efficiency gains are not always guaranteed (Keller and Evans, 2019), necessitating a direct comparison of the computational speeds of the AIMACI scheme and the numerical scheme. Given that WRF-Chem, written in Fortran, is not conducive to GPU acceleration, we conducted offline tests of the AIMACI scheme's computational speed on a GPU and compared it with the numerical scheme on a CPU, where the AIMACI scheme was coupled to the WRF-Chem.

Figure 11 demonstrates that, when utilizing a single CPU core, the AIMACI scheme achieves a computational speedup of approximately 5×, with a time cost of 48.51 s compared with the numerical scheme's 229.74 s. This advancement is further amplified when employing a single GPU: the AIMACI scheme completes the computation in a mere 0.83 s, which is approximately 277× faster than the numerical scheme running on a single CPU core. While it is reasonable to anticipate that future implementation of heterogeneous computing platforms, integrating both CPUs (central processing units) and GPUs (graphics processing units), will yield significant enhancements in computational efficiency, further testing is necessary to account for the potential overhead associated with communication between CPUs and GPUs.

https://acp.copernicus.org/articles/25/6197/2025/acp-25-6197-2025-f11

Figure 11Comparison of computational speeds between the numerical scheme and the AIMACI scheme under different computational configurations. The time cost for the GPU is measured in a mode where the AIMACI scheme is not yet coupled to the model, while the time cost for the CPU is measured in a mode with the AIMACI scheme coupled to the model. The calculations are based on simulating the concentrations of 61 output targets across 720 300 grid cells.

Download

4 Conclusions

This study develops and evaluates a novel Artificial Intelligence Model for Aerosol Chemistry and Interactions, termed AIMACI, with a special focus on addressing the long-standing challenge of significant computational burden caused by using traditional numerical schemes to simulate aerosol chemistry and interactions (ACI) in atmospheric models. The differential equations governing ACI are notably stiff, coupled with a stringent time integration scheme required for numerical stability, resulting in limited breakthroughs in simulation speed with available numerical techniques. While previous studies have explored AI schemes as alternatives for conventional numerical schemes of photochemistry simulation, the application of AI schemes to achieve end-to-end simulation of full ACI within a 3D atmospheric model has not yet been studied. In contrast to atmospheric photochemistry, which only involves chemical reactions among species, full ACI consists of multiple highly nonlinear subprocesses, including chemical reactions and phase equilibrium, gas-particle partitioning, particle size growth, coagulation, and nucleation. Therefore, whether an AI scheme can effectively supplant the traditional numerical scheme for simulating full ACI, achieving both high fidelity in simulation accuracy and a significant improvement in computational efficiency, remains an open question.

To bridge this gap, the AIMACI scheme was established based on the Multi-Head Self-Attention algorithm, which is renowned for its powerful nonlinear representation capabilities and can efficiently capture complex reaction relationships between different chemical species. In an offline mode, where the AIMACI scheme was not yet integrated with a 3D atmospheric model, it demonstrated robust performance on a test dataset, with all 61 evaluated output targets exhibiting an average R² of 0.98, and an average NMB of 3.02 %. This strong consistency between the numerical and AIMACI schemes lays a solid foundation for further online continuous simulations.

To facilitate the coupling of the Python-written AIMACI scheme with Fortran-based numerical models, we utilized PyTorch's TorchScript and LibTorch tools to encapsulate the AIMACI scheme in a static library callable by the numerical model. This approach minimizes changes to the existing numerical model's codebase and offers a lightweight, adaptable, and modular solution for coupling AI algorithms of diverse complexities with a range of numerical models.

Employing a physics-AI hybrid model, we implemented additional experiments to evaluate the online simulation performance of the AIMACI scheme. The 10 d continuous simulation results indicate that the AIMACI scheme not only accurately captures the spatiotemporal distribution of various aerosol species but also effectively reproduces their size distributions, maintaining stability throughout the simulation period without rapid error growth. Furthermore, the AIMACI scheme exhibits robust generalization capabilities despite being trained on data from only 16 d in March. This is evidenced by the results that, during month-long continuous simulations across various environmental conditions in all four seasons, R² for monthly average surface nitrate concentrations is greater than 0.93, and the RMSE for hourly time series of nitrate concentrations in the Yangtze River Delta is smaller than 2.71 µg kg⁻¹. However, systematic biases were observed in the simulation results of the AIMACI scheme: underestimation of high concentrations and overestimation of low concentrations. This bias pattern could be attributed to two main factors. First, the limited number of high-value samples in the training set leads to insufficient learning of high-value instances by the AIMACI scheme. Second, the quadratic term in the RMSE loss function penalizes large errors more heavily, thereby biasing the AIMACI scheme toward predicting the mean of the target distribution to minimize the overall loss. Future improvements may involve exploring data transformation techniques to reduce the skewness of the training data distribution. Additionally, compared with other months, the performance of the AIMACI scheme in July was less accurate. Our analysis indicates that this is primarily due to the significant difference between the meteorological conditions in July and those in the training data. Seasonal weather events, such as typhoons, have a notable impact on the performance of AIMACI. Therefore, in future iterations of AIMACI, we plan to use a more diverse training dataset, particularly one that includes seasonal weather events, to further enhance its performance.

In terms of computational speed, the AIMACI scheme is approximately 5 times faster than the conventional numerical scheme when predicting 720 300 grid points with 61 output targets using a single CPU core. This speedup increases significantly to about 277 times faster when utilizing a GPU. Future simulations on heterogeneous platforms, integrating both CPUs and GPUs, are expected to significantly improve this speedup ratio. However, considering the potential communication overhead between CPUs and GPUs, further testing is still required.

An important outcome of this work is the first-time successful application of an AI scheme to achieve fast, accurate, and stable end-to-end simulation for full ACI within the numerical model, replacing the traditional numerical scheme. As the first step, we mainly focus on inorganic aerosol species and regional-scale monthly simulation tests. Future work will consider integrating complex secondary organic aerosols in our AIMACI scheme, performing global-scale annual simulation tests, and coupling the AIMACI scheme to climate models. These advancements will help to address current limitations in climate models where full ACI is often oversimplified or even omitted due to the large computational burden associated with numerical simulations of ACI, thereby improving the representation of aerosol–climate interactions.

Code availability

The hybrid version of WRF-Chem with both physics and AI schemes is publicly accessible. The source code for this hybrid model, along with the parameter file for the Multi-Head Self-Attention model used in the Artificial Intelligence Model for Aerosol Chemistry and Interactions (AIMACI) is available under an open license at https://doi.org/10.5281/zenodo.15702248 (Xia et al., 2024).

Data availability

The Multi-resolution Emission Inventory for China (MEIC) at 0.25°×0.25° resolution for 2019 is available at http://meicmodel.org.cn/?page_id=560 (Li et al., 2017; Geng et al., 2024). The NCEP final reanalysis (FNL) data with a 1°×1° resolution and 6 h temporal resolution are available at https://rda.ucar.edu/datasets/ds083.2/ (NCEP, 2000).

Supplement

The supplement related to this article is available online at https://doi.org/10.5194/acp-25-6197-2025-supplement.

Author contributions

ZX and CZ designed the experiments and conducted and analyzed the simulations. Manuscript preparation was done by ZX and CZ, with additional contributions from all co-authors. Model simulations were provided by ZX, ZY, and QD. Analysis was carried out by ZX, CZ, FJ, CJ, JS, and HA. All authors contributed to the discussion and final version of the paper.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher’s note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.

Acknowledgements

This research was supported by the National Key Scientific and Technological Infrastructure project “Earth System Numerical Simulation Facility” (EarthLab), and the CMA-USTC Laboratory of Fengyun Remote Sensing. The study used the computing resources from the Supercomputing Center of the University of Science and Technology of China (USTC) and the Qingdao Supercomputing and Big Data Center. The authors thank the editor and anonymous reviewers for their constructive comments and suggestions, which significantly improved the quality of this manuscript.

Financial support

This research has been supported by the National Key Research and Development Program of China (grant no. 2022YFC3700701), the Strategic Priority Research Program of the Chinese Academy of Sciences (grant no. XDB0500303), the National Natural Science Foundation of China (grant no. 41775146), the USTC Research Funds of the Double First-Class Initiative (grant nos. YD2080002007, KY2080000114), and the Science and Technology Innovation Project of Laoshan Laboratory (grant no. LSKJ202300305).

Review statement

This paper was edited by Minghuai Wang and reviewed by two anonymous referees.

References

Arfin, T., Pillai, A. M., Mathew, N., Tirpude, A., Bang, R., and Mondal, P.: An overview of atmospheric aerosol and their effects on human health, Environ. Sci. Pollut. Res., 30, 125347–125369, https://doi.org/10.1007/s11356-023-29652-w, 2023.

Bellouin, N., Quaas, J., Gryspeerdt, E., Kinne, S., Stier, P., Watson-Parris, D., Boucher, O., Carslaw, K. S., Christensen, M., Daniau, A.-L., Dufresne, J.-L., Feingold, G., Fiedler, S., Forster, P., Gettelman, A., Haywood, J. M., Lohmann, U., Malavelle, F., Mauritsen, T., McCoy, D. T., Myhre, G., Mülmenstädt, J., Neubauer, D., Possner, A., Rugenstein, M., Sato, Y., Schulz, M., Schwartz, S. E., Sourdeval, O., Storelvmo, T., Toll, V., Winker, D., and Stevens, B.: Bounding Global Aerosol Radiative Forcing of Climate Change, Rev. Geophys., 58, e2019RG000660, https://doi.org/10.1029/2019RG000660, 2020.

Bi, K., Xie, L., Zhang, H., Chen, X., Gu, X., and Tian, Q.: Accurate medium-range global weather forecasting with 3D neural networks, Nature, 619, 533–538, https://doi.org/10.1038/s41586-023-06185-3, 2023.

Calvert, J. G., Lazrus, A., Kok, G. L., Heikes, B. G., Walega, J. G., Lind, J., and Cantrell, C. A.: Chemical mechanisms of acid generation in the troposphere, Nature, 317, 27–35, https://doi.org/10.1038/317027a0, 1985.

Carmichael, G. R., Sandu, A., Song, C. H., He, S., Phandis, M. J., Daescu, D., Damian-Iordache, V., and Potra, F. A.: Computational Challenges of Modeling Interactions Between Aerosol and Gas Phase Processes in Large Scale Air Pollution Models, in: Large Scale Computations in Air Pollution Modelling, edited by: Zlatev, Z., Brandt, J., Builtjes, P. J. H., Carmichael, G., Dimov, I., Dongarra, J., van Dop, H., Georgiev, K., Hass, H., and Jose, R. S., Springer Netherlands, Dordrecht, 99–136, https://doi.org/10.1007/978-94-011-4570-1_10, 1999.

Charlson, R. J., Schwartz, S. E., Hales, J. M., Cess, R. D., Coakley, J. A., Hansen, J. E., and Hofmann, D. J.: Climate Forcing by Anthropogenic Aerosols, Science, 255, 423–430, https://doi.org/10.1126/science.255.5043.423, 1992.

Du, Q., Zhao, C., Zhang, M., Dong, X., Chen, Y., Liu, Z., Hu, Z., Zhang, Q., Li, Y., Yuan, R., and Miao, S.: Modeling diurnal variation of surface PM_2.5 concentrations over East China with WRF-Chem: impacts from boundary-layer mixing and anthropogenic emission, Atmos. Chem. Phys., 20, 2839–2863, https://doi.org/10.5194/acp-20-2839-2020, 2020.

Ebel, A., Memmesheimer, M., Friese, E., Jakobs, H. J., Feldmann, H., Kessler, C., and Piekorz, G.: Long-Term Atmospheric Aerosol Simulations – Computational and Theoretical Challenges, in: Large-Scale Scientific Computing, Berlin, Heidelberg, 490–497, https://doi.org/10.1007/11666806_56, 2006.

Fuzzi, S., Baltensperger, U., Carslaw, K., Decesari, S., Denier van der Gon, H., Facchini, M. C., Fowler, D., Koren, I., Langford, B., Lohmann, U., Nemitz, E., Pandis, S., Riipinen, I., Rudich, Y., Schaap, M., Slowik, J. G., Spracklen, D. V., Vignati, E., Wild, M., Williams, M., and Gilardoni, S.: Particulate matter, air quality and climate: lessons learned and future needs, Atmos. Chem. Phys., 15, 8217–8299, https://doi.org/10.5194/acp-15-8217-2015, 2015.

Geng, G., Liu, Y., Liu, Y., Liu, S., Cheng, J., Yan, L., Wu, N., Hu, H., Tong, D., Zheng, B., Yin, Z., He, K., and Zhang, Q.: Efficacy of China’s clean air actions to tackle PM_2.5 pollution between 2013 and 2020, Nat. Geosci., 17, 987–994, https://doi.org/10.1038/s41561-024-01540-z, 2024 (data available at http://meicmodel.org.cn/?page_id=560, last access: 20 June 2025).

Gneiting, T.: Making and Evaluating Point Forecasts, J. Am. Stat. Assoc., 106, 746–762, https://doi.org/10.1198/jasa.2011.r10138, 2011.

Gu, J., Feng, J., Hao, X., Fang, T., Zhao, C., An, H., Chen, J., Xu, M., Han, W., Yang, C., Li, F., and Chen, D.: Establishing a non-hydrostatic global atmospheric modeling system at 3-km horizontal resolution with aerosol feedbacks on the Sunway supercomputer of China, Sci. Bull., 67, 1170–1181, https://doi.org/10.1016/j.scib.2022.03.009, 2022.

Hodas, N., Sullivan, A. P., Skog, K., Keutsch, F. N., Collett, J. L. Jr., Decesari, S., Facchini, M. C., Carlton, A. G., Laaksonen, A., and Turpin, B. J.: Aerosol Liquid Water Driven by Anthropogenic Nitrate: Implications for Lifetimes of Water-Soluble Organic Gases and Potential for Secondary Organic Aerosol Formation, Environ. Sci. Technol., 48, 11127–11136, https://doi.org/10.1021/es5025096, 2014.

Hu, Z., Huang, J., Zhao, C., Bi, J., Jin, Q., Qian, Y., Leung, L. R., Feng, T., Chen, S., and Ma, J.: Modeling the contributions of Northern Hemisphere dust sources to dust outflow from East Asia, Atmos. Environ., 202, 234–243, https://doi.org/10.1016/j.atmosenv.2019.01.022, 2019.

Keller, C. A. and Evans, M. J.: Application of random forest regression to the calculation of gas-phase chemistry within the GEOS-Chem chemistry model v10, Geosci. Model Dev., 12, 1209–1225, https://doi.org/10.5194/gmd-12-1209-2019, 2019.

Kelp, M. M., Tessum, C. W., and Marshall, J. D.: Orders-of-magnitude speedup in atmospheric chemistry modeling through neural network-based emulation, arXiv, https://doi.org/10.48550/arXiv.1808.03874, 2018.

Kelp, M. M., Jacob, D. J., Kutz, J. N., Marshall, J. D., and Tessum, C. W.: Toward Stable, General Machine-Learned Models of the Atmospheric Chemical System, J. Geophys. Res.-Atmos., 125, e2020JD032759, https://doi.org/10.1029/2020JD032759, 2020.

Kelp, M. M., Jacob, D. J., Lin, H., and Sulprizio, M. P.: An Online-Learned Neural Network Chemical Solver for Stable Long-Term Global Simulations of Atmospheric Chemistry, J. Adv. Model. Earth Sy., 14, e2021MS002926, https://doi.org/10.1029/2021MS002926, 2022.

Landgrebe, J. D. and Pratsinis, S. E.: A discrete-sectional model for particulate production by gas-phase chemical reaction and aerosol coagulation in the free-molecular regime, J. Colloid Interf. Sci., 139, 63–86, https://doi.org/10.1016/0021-9797(90)90445-T, 1990.

Lee, L. A., Reddington, K., and Carslaw, S.: On the relationship between aerosol model uncertainty and radiative forcing uncertainty, P. Natl. Acad. Sci. USA, 113, 5820–5827, https://doi.org/10.1073/pnas.1507050113, 2016.

Li, J., Carlson, B. E., Yung, Y. L., Lv, D., Hansen, J., Penner, J. E., Liao, H., Ramaswamy, V., Kahn, R. A., Zhang, P., Dubovik, O., Ding, A., Lacis, A. A., Zhang, L., and Dong, Y.: Scattering and absorbing aerosols in the climate system, Nat. Rev. Earth Environ., 3, 363–379, https://doi.org/10.1038/s43017-022-00296-7, 2022.

Li, M., Liu, H., Geng, G., Hong, C., Liu, F., Song, Y., Tong, D., Zheng, B., Cui, H., Man, H., Zhang, Q., and He, K.: Anthropogenic emission inventories in China: a review, Nat. Sci. Rev., 4, 834–866, https://doi.org/10.1093/nsr/nwx150, 2017.

Liu, C., Zhang, H., Cheng, Z., Shen, J., Zhao, J., Wang, Y., Wang, S., and Cheng, Y.: Emulation of an atmospheric gas-phase chemistry solver through deep learning: Case study of Chinese Mainland, Atmos. Pollut. Res., 12, 101079, https://doi.org/10.1016/j.apr.2021.101079, 2021.

Liu, Y., Wu, Z., Huang, X., Shen, H., Bai, Y., Qiao, K., Meng, X., Hu, W., Tang, M., and He, L.: Aerosol Phase State and Its Link to Chemical Composition and Liquid Water Content in a Subtropical Coastal Megacity, Environ. Sci. Technol., 53, 5027–5033, https://doi.org/10.1021/acs.est.9b01196, 2019.

Liu, Z., Lin, Y., Cao, Y., Hu, H., Wei, Y., Zhang, Z., Lin, S., and Guo, B.: Swin Transformer: Hierarchical Vision Transformer Using Shifted Windows, Proceedings of the IEEE/CVF International Conference on Computer Vision, in: 2021 IEEE/CVF International Conference on Computer Vision (ICCV), 9992–10002, IEEE Computer Society, Los Alamitos, CA, USA, https://doi.org/10.1109/ICCV48922.2021.00986, 2021.

Lohmann, U. and Feichter, J.: Global indirect aerosol effects: a review, Atmos. Chem. Phys., 5, 715–737, https://doi.org/10.5194/acp-5-715-2005, 2005.

NCEP: NCEP FNL operational model global tropospheric analyses, continuing from July 1999, [Dataset], Research Data Archive at the National Center for Atmosp Ncep: NCEP FNL operational model global tropospheric analyses, continuing from July 1999, Research Data Archive at the National Center for Atmospheric Research, Computational and Information Systems Laboratory [data set], https://doi.org/10.5065/D6M043C6, 2000.

Nguyen, T. K. V., Zhang, Q., Jimenez, J. L., Pike, M., and Carlton, A. G.: Liquid Water: Ubiquitous Contributor to Aerosol Mass, Environ. Sci. Technol. Lett., 3, 257–263, https://doi.org/10.1021/acs.estlett.6b00167, 2016.

Parrish, D. D., Law, K. S., Staehelin, J., Derwent, R., Cooper, O. R., Tanimoto, H., Volz-Thomas, A., Gilge, S., Scheel, H.-E., Steinbacher, M., and Chan, E.: Long-term changes in lower tropospheric baseline ozone concentrations at northern mid-latitudes, Atmos. Chem. Phys., 12, 11485–11504, https://doi.org/10.5194/acp-12-11485-2012, 2012.

Penkett, S. A., Jones, B. M. R., Brich, K. A., and Eggleton, A. E. J.: The importance of atmospheric ozone and hydrogen peroxide in oxidising sulphur dioxide in cloud and rainwater, Atmos. Environ., 13, 123–137, https://doi.org/10.1016/0004-6981(79)90251-8, 1979.

Pöschl, U.: Atmospheric Aerosols: Composition, Transformation, Climate and Health Effects, Angew. Chem. Int. Edit., 44, 7520–7540, https://doi.org/10.1002/anie.200501122, 2005.

Pratsinis, S. E.: Simultaneous nucleation, condensation, and coagulation in aerosol reactors, J. Colloid Interf. Sci., 124, 416–427, https://doi.org/10.1016/0021-9797(88)90180-4, 1988.

Ruan, X., Zhao, C., Zaveri, R. A., He, P., Wang, X., Shao, J., and Geng, L.: Simulations of aerosol pH in China using WRF-Chem (v4.0): sensitivities of aerosol pH and its temporal variations during haze episodes, Geosci. Model Dev., 15, 6143–6164, https://doi.org/10.5194/gmd-15-6143-2022, 2022.

Saiz-Lopez, A., Borge, R., Notario, A., Adame, J. A., Paz, D. de la, Querol, X., Artíñano, B., Gómez-Moreno, F. J., and Cuevas, C. A.: Unexpected increase in the oxidation capacity of the urban atmosphere of Madrid, Spain, Sci. Rep., 7, 45956, https://doi.org/10.1038/srep45956, 2017.

Sandu, A., Verwer, J. G., Blom, J. G., Spee, E. J., Carmichael, G. R., and Potra, F. A.: Benchmarking stiff ode solvers for atmospheric chemistry problems II: Rosenbrock solvers, Atmos. Environ., 31, 3459–3472, https://doi.org/10.1016/S1352-2310(97)83212-8, 1997a.

Sandu, A., Verwer, J. G., Van Loon, M., Carmichael, G. R., Potra, F. A., Dabdub, D., and Seinfeld, J. H.: Benchmarking stiff ode solvers for atmospheric chemistry problems – I. implicit vs explicit, Atmos. Environ., 31, 3151–3166, https://doi.org/10.1016/S1352-2310(97)00059-9, 1997b.

Sharma, H., Shrivastava, M., and Singh, B.: Physics informed deep neural network embedded in a chemical transport model for the Amazon rainforest, npj Clim. Atmos. Sci., 6, 1–10, https://doi.org/10.1038/s41612-023-00353-y, 2023.

Spracklen, D. V., Carslaw, K. S., Merikanto, J., Mann, G. W., Reddington, C. L., Pickering, S., Ogren, J. A., Andrews, E., Baltensperger, U., Weingartner, E., Boy, M., Kulmala, M., Laakso, L., Lihavainen, H., Kivekäs, N., Komppula, M., Mihalopoulos, N., Kouvarakis, G., Jennings, S. G., O'Dowd, C., Birmili, W., Wiedensohler, A., Weller, R., Gras, J., Laj, P., Sellegri, K., Bonn, B., Krejci, R., Laaksonen, A., Hamed, A., Minikin, A., Harrison, R. M., Talbot, R., and Sun, J.: Explaining global surface aerosol number concentrations in terms of primary emissions and particle formation, Atmos. Chem. Phys., 10, 4775–4793, https://doi.org/10.5194/acp-10-4775-2010, 2010.

Twomey, S.: Pollution and the planetary albedo, Atmos. Environ., 8, 1251–1256, https://doi.org/10.1016/0004-6981(74)90004-3, 1974.

Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, L., and Polosukhin, I.: Attention is All you Need, in: Advances in Neural Information Processing Systems, in: Proceedings of the 31st International Conference on Neural Information Processing Systems, Curran Associates Inc., Red Hook, NY, USA, 6000–6010, ISBN: 978151086096, 2017.

Wang, J. L., Curtis, J. H., Riemer, N., and West, M.: Learning Coagulation Processes With Combinatorial Neural Networks, J. Adv. Model. Earth Sy., 14, e2022MS003252, https://doi.org/10.1029/2022MS003252, 2022.

Wang, Z., Li, J., Wu, L., Zhu, M., Zhang, Y., Ye, Z., and Wang, Z.: Deep learning-based gas-phase chemical kinetics kernel emulator: Application in a global air quality simulation case, Front. Environ. Sci., 10, 955980, https://doi.org/10.3389/fenvs.2022.955980, 2022.

Wu, Z., Wang, Y., Tan, T., Zhu, Y., Li, M., Shang, D., Wang, H., Lu, K., Guo, S., Zeng, L., and Zhang, Y.: Aerosol Liquid Water Driven by Anthropogenic Inorganic Salts: Implying Its Key Role in Haze Formation over the North China Plain, Environ. Sci. Technol. Lett., 5, 160–166, https://doi.org/10.1021/acs.estlett.8b00021, 2018.

Xia, Z., Zhao, C., Yang, Z., Du, Q., Jia, W., Jin, C., Shi, J., and An, H.: Source code for Hybrid WRFCHEM with Physics and AI Schemes (Aerosol Chemistry and Interactions) (v1.0), Zenodo [code], https://doi.org/10.5281/zenodo.15702248, 2024.

Xia, Z., Zhao, C., Du, Q., Yang, Z., Zhang, M., and Qiao, L.: Advancing Sophisticated Photochemistry Simulation in Atmospheric Numerical Models with Artificial Intelligence PhotoChemistry (AIPC) Scheme Using the Feature-Mapping Subspace Self-Attention algorithm, J. Adv. Model. Earth Sy., in process, 2025.

Zaveri, R. A. and Peters, L. K.: A new lumped structure photochemical mechanism for large-scale applications, J. Geophys. Res.-Atmos., 104, 30387–30415, https://doi.org/10.1029/1999JD900876, 1999.

Zaveri, R. A., Easter, R. C., and Wexler, A. S.: A new method for multicomponent activity coefficients of electrolytes in aqueous atmospheric aerosols, J. Geophys. Res.-Atmos., 110, D02201, https://doi.org/10.1029/2004JD004681, 2005.

Zaveri, R. A., Easter, R. C., Fast, J. D., and Peters, L. K.: Model for Simulating Aerosol Interactions and Chemistry (MOSAIC), J. Geophys. Res.-Atmos., 113, D13204, https://doi.org/10.1029/2007JD008782, 2008.

Zhang, H., Sharma, G., Dhawan, S., Dhanraj, D., Li, Z., and Biswas, P.: Comparison of discrete, discrete-sectional, modal and moment models for aerosol dynamics simulations, Aerosol Sci. Technol., 54, 739–760, https://doi.org/10.1080/02786826.2020.1723787, 2020.

Zhang, M., Zhao, C., Yang, Y., Du, Q., Shen, Y., Lin, S., Gu, D., Su, W., and Liu, C.: Modeling sensitivities of BVOCs to different versions of MEGAN emission schemes in WRF-Chem (v3.6) and its impacts over eastern China, Geosci. Model Dev., 14, 6155–6175, https://doi.org/10.5194/gmd-14-6155-2021, 2021.

Zhao, C., Ruby Leung, L., Easter, R., Hand, J., and Avise, J.: Characterization of speciated aerosol direct radiative forcing over California, J. Geophys. Res.-Atmos., 118, 2372–2388, https://doi.org/10.1029/2012JD018364, 2013a.

Zhao, C., Chen, S., Leung, L. R., Qian, Y., Kok, J. F., Zaveri, R. A., and Huang, J.: Uncertainty in modeling dust mass balance and radiative forcing from size parameterization, Atmos. Chem. Phys., 13, 10733–10753, https://doi.org/10.5194/acp-13-10733-2013, 2013b.

Zhao, C., Hu, Z., Qian, Y., Ruby Leung, L., Huang, J., Huang, M., Jin, J., Flanner, M. G., Zhang, R., Wang, H., Yan, H., Lu, Z., and Streets, D. G.: Simulating black carbon and dust and their radiative forcing in seasonal snow: a case study over North China with field campaign measurements, Atmos. Chem. Phys., 14, 11475–11491, https://doi.org/10.5194/acp-14-11475-2014, 2014.

Zhao, C., Huang, M., Fast, J. D., Berg, L. K., Qian, Y., Guenther, A., Gu, D., Shrivastava, M., Liu, Y., Walters, S., Pfister, G., Jin, J., Shilling, J. E., and Warneke, C.: Sensitivity of biogenic volatile organic compounds to land surface parameterizations and vegetation distributions in California, Geosci. Model Dev., 9, 1959–1976, https://doi.org/10.5194/gmd-9-1959-2016, 2016.

Articles

Download

Article (10038 KB)
Full-text XML

Short summary

Traditional numerical schemes of aerosol chemistry and interactions (ACI) in atmospheric models are computationally costly and are often simplified or omitted, introducing uncertainties. We use an AI scheme to achieve fast, accurate, and stable end-to-end simulation for full ACI within an atmospheric model, replacing numerical schemes. This innovation is expected to enhance the accuracy and efficiency of ACI simulations in climate models that would otherwise neglect or simplify ACI processes.