<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" xml:lang="en" dtd-version="3.0" article-type="research-article"><?xmltex \bartext{Research article}?>
  <front>
    <journal-meta><journal-id journal-id-type="publisher">ACP</journal-id><journal-title-group>
    <journal-title>Atmospheric Chemistry and Physics</journal-title>
    <abbrev-journal-title abbrev-type="publisher">ACP</abbrev-journal-title><abbrev-journal-title abbrev-type="nlm-ta">Atmos. Chem. Phys.</abbrev-journal-title>
  </journal-title-group><issn pub-type="epub">1680-7324</issn><publisher>
    <publisher-name>Copernicus Publications</publisher-name>
    <publisher-loc>Göttingen, Germany</publisher-loc>
  </publisher></journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.5194/acp-23-375-2023</article-id><title-group><article-title>Capturing synoptic-scale variations in surface aerosol pollution using deep learning with meteorological data</article-title><alt-title>Capturing synoptic-scale variations in surface aerosol pollution</alt-title>
      </title-group><?xmltex \runningtitle{Capturing synoptic-scale variations in surface aerosol pollution}?><?xmltex \runningauthor{J.~Feng et al.}?>
      <contrib-group>
        <contrib contrib-type="author" corresp="yes" rid="aff1">
          <name><surname>Feng</surname><given-names>Jin</given-names></name>
          <email>jfeng@ium.cn</email>
        <ext-link>https://orcid.org/0000-0003-4454-5785</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff2">
          <name><surname>Li</surname><given-names>Yanjie</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Qiu</surname><given-names>Yulu</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Zhu</surname><given-names>Fuxin</given-names></name>
          
        </contrib>
        <aff id="aff1"><label>1</label><institution>Institute of Urban Meteorology (IUM), China Meteorological
Administration (CMA), Beijing, 100089, China</institution>
        </aff>
        <aff id="aff2"><label>2</label><institution>State Key Laboratory of Numerical Modeling for Atmospheric Sciences
and Geophysical Fluid Dynamics, Institute of Atmospheric Physics, Chinese
Academy of Sciences, Beijing, 100029, China</institution>
        </aff>
      </contrib-group>
      <author-notes><corresp id="corr1">Jin Feng (jfeng@ium.cn)</corresp></author-notes><pub-date><day>10</day><month>January</month><year>2023</year></pub-date>
      
      <volume>23</volume>
      <issue>1</issue>
      <fpage>375</fpage><lpage>388</lpage>
      <history>
        <date date-type="received"><day>11</day><month>August</month><year>2022</year></date>
           <date date-type="rev-request"><day>19</day><month>August</month><year>2022</year></date>
           <date date-type="rev-recd"><day>6</day><month>December</month><year>2022</year></date>
           <date date-type="accepted"><day>19</day><month>December</month><year>2022</year></date>
      </history>
      <permissions>
        <copyright-statement>Copyright: © 2023 </copyright-statement>
        <copyright-year>2023</copyright-year>
      <license license-type="open-access"><license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p></license></permissions><self-uri xlink:href="https://acp.copernicus.org/articles/.html">This article is available from https://acp.copernicus.org/articles/.html</self-uri><self-uri xlink:href="https://acp.copernicus.org/articles/.pdf">The full text article is available as a PDF file from https://acp.copernicus.org/articles/.pdf</self-uri>
      <abstract><title>Abstract</title>

      <p id="d1e115">The estimation of daily variations in aerosol concentrations using meteorological data is meaningful and challenging, given the need for accurate air quality forecasts and assessments. In this study, a <inline-formula><mml:math id="M1" display="inline"><mml:mrow><mml:mn mathvariant="normal">3</mml:mn><mml:mo>×</mml:mo><mml:mn mathvariant="normal">50</mml:mn></mml:mrow></mml:math></inline-formula>-layer spatiotemporal deep learning (DL) model is proposed to link synoptic variations in aerosol concentrations and meteorology, thereby building a “deep” Weather Index for Aerosols (deepWIA). The model was trained and validated using 7 years of data and tested in January–April 2022. The index successfully reproduced the variation in daily PM<inline-formula><mml:math id="M2" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> observations in China. The coefficient of determination
between PM<inline-formula><mml:math id="M3" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations calculated from the index and observation
was 0.72, with a root mean square error (RMSE) of 16.5 <inline-formula><mml:math id="M4" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<inline-formula><mml:math id="M5" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>. The DeepWIA performed better than Weather Forecast and Research (WRF)-Chem simulations for eight aerosol-polluted cities in China. The simulating power of the model also outperformed commonly used PM<inline-formula><mml:math id="M6" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentration retrieval models based on random forest (RF), extreme gradient boost (XGB), and multilayer perceptron (MLP). The index and the DL model can be used as robust tools for estimating daily variations in aerosol concentrations.</p>
  </abstract>
    </article-meta>
  </front>
<body>
      

<sec id="Ch1.S1" sec-type="intro">
  <label>1</label><title>Introduction</title>
      <p id="d1e187">Meteorology and emissions drive variations in aerosol concentrations, with
the latter strongly modulating seasonality and long-term trends (Q. Zhang et
al., 2019; Wang et al., 2011) but remaining stable at synoptic scales,
excluding unexpected events such as volcanic activity and emergency lockdowns. Meteorology dominates synoptic scale (i.e., high-frequency)
variations in aerosol concentrations (Bei et al., 2016; Zheng et al., 2015;
Leung et al., 2018) and regulates aerosol physicochemical processes including their generation, diffusion, transport, and deposition (Feng et al., 2016), thus synchronizing periodic accumulation–removal of aerosol pollution with activities of synoptic systems (Chen et al., 2008; Guo et al., 2014).</p>
      <p id="d1e190">Air quality forecasts and emission-reduction evaluations require the estimation of aerosol concentrations and their variations from meteorological data. The strong impacts of meteorology on physicochemical processes make such estimation possible. Chemical transport models (CTMs) can be used as a tool for this purpose. Given an emission inventory, CTMs aim to detail the physicochemical processes and simulate variations in aerosol concentrations over all timescales. The CTM-based simulations provide information on intermediate processes, allowing convenient analysis of mechanisms of aerosol pollution. However, uncertainties in parameterization and emission inventories lead to significant estimation errors in aerosol concentrations (Zhong et al., 2016; Zhang et al., 2016, 2018). Taking the commonly used Weather Forecast and Research (WRF)-Chem model as an example, Sicard et al. (2021) reported a Pearson correlation coefficient of 0.44 (equivalent to a coefficient of determination (<inline-formula><mml:math id="M7" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>) of <inline-formula><mml:math id="M8" display="inline"><mml:mrow><mml:mo>∼</mml:mo><mml:mn mathvariant="normal">0.2</mml:mn></mml:mrow></mml:math></inline-formula>) between simulated and observed daily surface PM<inline-formula><mml:math id="M9" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> (particle matter of diameter <inline-formula><mml:math id="M10" display="inline"><mml:mo>&lt;</mml:mo></mml:math></inline-formula> 2.5 <inline-formula><mml:math id="M11" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>m) concentrations in China, based on a resolution simulation of 8 km in 2015. Another WRF-Chem simulation over 2014–2015 gave a better <inline-formula><mml:math id="M12" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> value of 0.44 for a smaller WRF-Chem
simulation domain over 131 cities in eastern China (Zhou et al., 2017). In
addition, the complexity of CTMs requires large computational resources.</p>
      <p id="d1e250">Data-based models provide another estimation tool, using historical datasets
to establish empirical or semiempirical models linking meteorology and aerosol concentrations without a description of intermediate processes. A
data-based model requires negligible computational resources compared with
CTMs. In China, two semiempirical meteorological indices are used for daily
variations in aerosol concentrations, the Parameter Linking Air quality to
Meteorological conditions (PLAM; Yang et al., 2016) and Air Stagnation
Index (ASI; Feng et al., 2018, 2020b). Both indices include an extra
“background factor” describing the effects of slowly changing emissions
and regional differences. However, the weak nonlinear fitting power of these
meteorological indices makes it difficult to beat CTMs for daily aerosol
concentration estimation. In addition, such simple meteorological indices
cannot be applied to a large region such as the whole of China (Sect. 4).</p>
      <p id="d1e253">As machine learning (ML) and deep learning (DL) are approaches to promoting
the nonlinear fitting power of data-based models, it is possible to establish an ML/DL model for variations in aerosol concentrations. The ML/DL-based observation retrieval for PM<inline-formula><mml:math id="M13" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentration has become very popular (Yuan et al., 2020). Estimations in such studies use satellite-based aerosol optical depth (AOD; Wei et al., 2019a, 2020; Geng et al., 2021; Li et al., 2020) or surface visibility observations (Zhong et al., 2021; Gui et al., 2020) as “primary” data and meteorological variables and other quasistatic data (e.g., topography, population, emissions) as “auxiliary” data, with these being fed into a generic ML/DL model to estimate PM<inline-formula><mml:math id="M14" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations. Commonly used models include random forest (RF; Wei et al., 2019a; Geng et al., 2021), extreme gradient boost (XGB; Gui et al., 2020), and multilayer perceptron (MLP; Li et al., 2020) methods, applied individually or together (Song et al., 2021). Compared with CTM simulations and meteorological indices, the injection of observation data improves the estimation of PM<inline-formula><mml:math id="M15" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations and its variations. In turn, the popularity of these studies indicates that using only meteorological data as primary data for aerosol concentrations is a challenging task, even with ML/DL.</p>
      <p id="d1e284">To address this issue, two key points should be considered in model design.
First, the model should focus only on the synoptic-scale variability of
aerosols, as meteorology is not a predominant factor in the low-frequency
variability of aerosol concentrations. Indeed, the direct fitting of aerosol
concentrations misinterprets the relationship between meteorology and
aerosols, possibly leading to an overfitting ML/DL model. Second, the model should include more spatiotemporal meteorological features and a more
powerful nonlinear capability to cover the complex characteristics of aerosol variations over large regions such as China than previous linear and DL/ML models.</p>
      <p id="d1e287">Therefore, here we propose a spatiotemporal deep neural network linking daily averaged meteorological fields and aerosol concentrations in China. Rather than fitting PM<inline-formula><mml:math id="M16" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations, the DL model focuses on capturing their synoptic variations. In the DL model, daily averaged meteorological variables over 3 d and quasistatic data (as the input variables) are fused to provide a daily deep Weather Index for Aerosols (as the model output), termed “deepWIA” (the model is named “deepWIA model”). Compared with CTM-based and other data-based estimations reported in previous studies, the model efficiently reduces the estimation error in PM<inline-formula><mml:math id="M17" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations over China with no significant overfitting, as often occurs in previous ML-based models.</p>
      <p id="d1e308">The rest of this paper is organized as follows. Section 2 describes the
deepWIA model, training data, methods of feature engineering (i.e.,
preprocessing to generate input variables), and results with training–validation datasets. Section 3 focuses on the performance of the
model using a test dataset, with a comparison with a WRF-Chem simulation in
eight heavily polluted cities. Section 4 gives a comparison with related
studies. We also undertook several ablation experiments to illustrate possible reasons for the strong performance of the deepWIA model. Section 5
provides the geographic distribution of synoptic variations in aerosol pollution over the test period. Section 6 concludes the study.</p>
</sec>
<sec id="Ch1.S2">
  <label>2</label><title>DeepWIA model</title>
<sec id="Ch1.S2.SS1">
  <label>2.1</label><title>Input variables</title>
      <p id="d1e326">Input variables of the deepWIA model includes daily averaged meteorological
variables from the fifth-generation European Centre for Medium-range Weather
Forecasts (ECMWF) reanalysis data (ERA5), with a horizontal resolution of
<inline-formula><mml:math id="M18" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.25</mml:mn><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup><mml:mo>×</mml:mo><mml:mn mathvariant="normal">0.25</mml:mn><mml:msup><mml:mi/><mml:mo>∘</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula>. Since a trained DL model can automatically select the input variables to compose the best model that fits
the target variable (PM<inline-formula><mml:math id="M19" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations) with activation functions,
the task for feature engineering is to feed the DL model with as many
variables as possible that are related to the day-to-day variation of
PM<inline-formula><mml:math id="M20" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations. These input variables (Table 1) can be classified
into four categories as follows.</p>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T1" specific-use="star"><?xmltex \currentcnt{1}?><label>Table 1</label><caption><p id="d1e370">The input variables and their corresponding categories, references
and importance ranking in the deepWIA model.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="4">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="left"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:thead>
       <oasis:row>

         <oasis:entry colname="col1">Variable name</oasis:entry>

         <oasis:entry colname="col2">Category</oasis:entry>

         <oasis:entry colname="col3">References</oasis:entry>

         <oasis:entry colname="col4">Importance</oasis:entry>

       </oasis:row>
       <oasis:row rowsep="1">

         <oasis:entry colname="col1"/>

         <oasis:entry colname="col2"/>

         <oasis:entry colname="col3"/>

         <oasis:entry colname="col4">ranking</oasis:entry>

       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>

         <oasis:entry colname="col1">10 m wind components</oasis:entry>

         <oasis:entry colname="col2">Surface, basic</oasis:entry>

         <oasis:entry colname="col3"/>

         <oasis:entry colname="col4">12</oasis:entry>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col1">2 m temperature</oasis:entry>

         <oasis:entry colname="col2">Surface, basic</oasis:entry>

         <oasis:entry colname="col3">Geng et al. (2021), Wei</oasis:entry>

         <oasis:entry colname="col4">6</oasis:entry>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col1">Surface pressure</oasis:entry>

         <oasis:entry colname="col2">Surface, basic</oasis:entry>

         <oasis:entry colname="col3">et al. (2020), Gui et al.</oasis:entry>

         <oasis:entry colname="col4">7</oasis:entry>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col1">2 m mixing ratio</oasis:entry>

         <oasis:entry colname="col2">Surface, basic</oasis:entry>

         <oasis:entry colname="col3">(2020); Li et al. (2020)</oasis:entry>

         <oasis:entry colname="col4">2</oasis:entry>

       </oasis:row>
       <oasis:row rowsep="1">

         <oasis:entry colname="col1">Precipitation</oasis:entry>

         <oasis:entry colname="col2">Surface, basic</oasis:entry>

         <oasis:entry colname="col3"/>

         <oasis:entry colname="col4">21</oasis:entry>

       </oasis:row>
       <oasis:row rowsep="1">

         <oasis:entry colname="col1">100 m wind components</oasis:entry>

         <oasis:entry colname="col2">Surface, basic</oasis:entry>

         <oasis:entry colname="col3">Newly introduced</oasis:entry>

         <oasis:entry colname="col4">9</oasis:entry>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col1">Downward short-wave radiation</oasis:entry>

         <oasis:entry colname="col2">Surface, basic</oasis:entry>

         <oasis:entry rowsep="1" colname="col3" morerows="1">Geng et al. (2021)</oasis:entry>

         <oasis:entry rowsep="1" colname="col4" morerows="1">17</oasis:entry>

       </oasis:row>
       <oasis:row rowsep="1">

         <oasis:entry colname="col1">Low cloud cover</oasis:entry>

         <oasis:entry colname="col2">Surface, basic</oasis:entry>

       </oasis:row>
       <oasis:row>

         <oasis:entry rowsep="1" colname="col1" morerows="1">Surface turbulence stress components</oasis:entry>

         <oasis:entry rowsep="1" colname="col2" morerows="1">Surface, basic</oasis:entry>

         <oasis:entry colname="col3">Jia and Zhang (2020)</oasis:entry>

         <oasis:entry rowsep="1" colname="col4" morerows="1">4</oasis:entry>

       </oasis:row>
       <oasis:row rowsep="1">

         <oasis:entry colname="col3">Yin et al. (2019)</oasis:entry>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col1">Geopotential height at 850 hPa</oasis:entry>

         <oasis:entry colname="col2">Upper air</oasis:entry>

         <oasis:entry colname="col3">Miao et al. (2020)</oasis:entry>

         <oasis:entry colname="col4">21</oasis:entry>

       </oasis:row>
       <oasis:row rowsep="1">

         <oasis:entry colname="col1">Temperature at 850 hPa</oasis:entry>

         <oasis:entry colname="col2">Upper air</oasis:entry>

         <oasis:entry colname="col3">Hou et al. (2018)</oasis:entry>

         <oasis:entry colname="col4">12</oasis:entry>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col1">2 m potential temperature</oasis:entry>

         <oasis:entry colname="col2">Derived</oasis:entry>

         <oasis:entry rowsep="1" colname="col3" morerows="1">Yang et al. (2016)</oasis:entry>

         <oasis:entry colname="col4">15</oasis:entry>

       </oasis:row>
       <oasis:row rowsep="1">

         <oasis:entry colname="col1">2 m wet-equivalent potential temperature</oasis:entry>

         <oasis:entry colname="col2">Derived</oasis:entry>

         <oasis:entry colname="col4">15</oasis:entry>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col1">Ventilation potency</oasis:entry>

         <oasis:entry colname="col2">Derived</oasis:entry>

         <oasis:entry rowsep="1" colname="col3" morerows="2">Feng et al. (2018, 2020b)</oasis:entry>

         <oasis:entry colname="col4">21</oasis:entry>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col1">Vertical diffusion potency</oasis:entry>

         <oasis:entry colname="col2">Derived</oasis:entry>

         <oasis:entry colname="col4">9</oasis:entry>

       </oasis:row>
       <oasis:row rowsep="1">

         <oasis:entry colname="col1">Wet deposition potency</oasis:entry>

         <oasis:entry colname="col2">Derived</oasis:entry>

         <oasis:entry colname="col4">24</oasis:entry>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col1">Max. 2 m temperature</oasis:entry>

         <oasis:entry colname="col2">Derived</oasis:entry>

         <oasis:entry rowsep="1" colname="col3" morerows="2">Porter et al. (2015)</oasis:entry>

         <oasis:entry colname="col4">4</oasis:entry>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col1">Max. 100 m wind speed</oasis:entry>

         <oasis:entry colname="col2">Derived</oasis:entry>

         <oasis:entry colname="col4">12</oasis:entry>

       </oasis:row>
       <oasis:row rowsep="1">

         <oasis:entry colname="col1">Max. and min. low troposphere stability</oasis:entry>

         <oasis:entry colname="col2">Derived</oasis:entry>

         <oasis:entry colname="col4">9</oasis:entry>

       </oasis:row>
       <oasis:row>

         <oasis:entry rowsep="1" colname="col1" morerows="2">Population density</oasis:entry>

         <oasis:entry rowsep="1" colname="col2" morerows="2">Quasistatic</oasis:entry>

         <oasis:entry colname="col3">Geng et al. (2021), Wei</oasis:entry>

         <oasis:entry rowsep="1" colname="col4" morerows="2">3</oasis:entry>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col3">et al. (2020), Li et al.</oasis:entry>

       </oasis:row>
       <oasis:row rowsep="1">

         <oasis:entry colname="col3">(2020)</oasis:entry>

       </oasis:row>
       <oasis:row>

         <oasis:entry rowsep="1" colname="col1" morerows="2">High vegetation cover</oasis:entry>

         <oasis:entry rowsep="1" colname="col2" morerows="2">Quasistatic</oasis:entry>

         <oasis:entry colname="col3">Wei et al. (2019a), Li et</oasis:entry>

         <oasis:entry rowsep="1" colname="col4" morerows="2">17</oasis:entry>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col3">al. (2020) (use</oasis:entry>

       </oasis:row>
       <oasis:row rowsep="1">

         <oasis:entry colname="col3">vegetation index)</oasis:entry>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col1">Surface altitude</oasis:entry>

         <oasis:entry colname="col2">Quasistatic</oasis:entry>

         <oasis:entry colname="col3">Geng et al. (2021)</oasis:entry>

         <oasis:entry colname="col4">7</oasis:entry>

       </oasis:row>
       <oasis:row rowsep="1">

         <oasis:entry colname="col1">Latitude and longitude</oasis:entry>

         <oasis:entry colname="col2">Spatiotemporal</oasis:entry>

         <oasis:entry colname="col3">Gui et al. (2020)</oasis:entry>

         <oasis:entry colname="col4">1</oasis:entry>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col1">Day of year</oasis:entry>

         <oasis:entry colname="col2">Spatiotemporal</oasis:entry>

         <oasis:entry colname="col3">Newly introduced</oasis:entry>

         <oasis:entry colname="col4">17</oasis:entry>

       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

      <p id="d1e791">Basic meteorological variables near the surface. We use 10 m altitude wind
components, 2 m temperature, surface pressure, surface downward short-wave
radiation, and total precipitation, which are frequently used as input
variables in ML/DL-based studies of PM<inline-formula><mml:math id="M21" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> retrieval (Geng et al., 2021;
Wei et al., 2020; Gui et al., 2020; Li et al., 2020). In addition, we
introduce 100 m wind components and surface turbulent stress, as they are
related to horizontal and vertical diffusion in the planetary boundary layer (PBL), respectively.</p>
      <p id="d1e804">Meteorological fields in the upper air include geopotential height and temperature at 850 hPa. We introduce these two variables for the deepWIA model in learning the effects of synoptic patterns on aerosol variations.</p>
      <p id="d1e807"><?xmltex \hack{\newpage}?>Derived input variables referring to previous studies of aerosol concentration–meteorology relationships. Our model contains potential
temperature and wet-equivalent potential temperature derived from PLAM, as
they can identify the types of aerosol-related air masses controlling the
local area (Yang et al., 2016). In addition, we introduce three kernel
parameters of ASI, including ventilation potency, vertical diffusion potency, and wet deposition potency of aerosols (Feng et al., 2018). The ventilation potency illustrates the effects of wind speed in local PBL, which are simply represented by the nonlinear function of the height-weighted average of wind speed over the PBL; vertical diffusion potency is represented by the inverse of PBL height, which roughly presents the vertical diffusion range of aerosols due to turbulence; and wet deposition potency illustrates a significant decrease in the aerosol concentrations due to precipitation. The values of 0 and <inline-formula><mml:math id="M22" display="inline"><mml:mi>e</mml:mi></mml:math></inline-formula> correspond to precipitation greater than or equal to and less than 3 mm d<inline-formula><mml:math id="M23" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>, respectively. All the formulae for these variables derived from ASI are given in the Supplement (Eqs. S1–S3l). Moreover, referring to Porter et al. (2015), we use the daily maxima and minima of low-troposphere stability (i.e., the potential temperature difference between 700 hPa and the surface) and daily maxima of 2 m temperature and of 100 m wind speed.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F1" specific-use="star"><?xmltex \currentcnt{1}?><?xmltex \def\figurename{Figure}?><label>Figure 1</label><caption><p id="d1e832">Probability density functions of <bold>(a)</bold> observed PM<inline-formula><mml:math id="M24" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations (<inline-formula><mml:math id="M25" display="inline"><mml:mi>C</mml:mi></mml:math></inline-formula>, orange line) and background concentrations (<inline-formula><mml:math id="M26" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula>,
blue line), <bold>(b)</bold> <inline-formula><mml:math id="M27" display="inline"><mml:mrow><mml:mi>r</mml:mi><mml:mfenced open="(" close=")"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mi>C</mml:mi><mml:mi>B</mml:mi></mml:mfrac></mml:mstyle></mml:mfenced></mml:mrow></mml:math></inline-formula>, and <bold>(c)</bold> <inline-formula><mml:math id="M28" display="inline"><mml:mover accent="true"><mml:mi>r</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover></mml:math></inline-formula> (deepWIA target variable).</p></caption>
          <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://acp.copernicus.org/articles/23/375/2023/acp-23-375-2023-f01.png"/>

        </fig>

      <p id="d1e899">Quasistatic and spatiotemporal variables (non-meteorological variables) include population density, surface altitude, and surface
high vegetation cover, which are also commonly used in PM<inline-formula><mml:math id="M29" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> observation retrieval. The population density is regridded from the Gridded Population of the World (GPW) version 4 dataset at an original resolution of 1 km. Surface altitude and surface high vegetation cover are from the ERA5 datasets. These variables as well as latitude and longitude (Gui et al., 2020; Zhong et al., 2021) aid learning of the local characteristics of aerosol concentration. In addition, the model is built uniformly using all observed
samples in China as the dataset (see Sect. 2.3). It is difficult for the model to obtain the correct seasonal information in the meteorological variables of these samples. Hence, we introduce seasonal information to the
deepWIA model through a variable of “day of the year”, which has rarely been considered in previous models.</p>
</sec>
<sec id="Ch1.S2.SS2">
  <label>2.2</label><title>Target</title>
      <p id="d1e919">The fitting target of the deepWIA model is not the PM<inline-formula><mml:math id="M30" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentration
per se but an index that tracks synoptic variations in PM<inline-formula><mml:math id="M31" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations. Motivated by the ASI and PLAM approaches, we use the predefined form
            <disp-formula id="Ch1.E1" content-type="numbered"><label>1</label><mml:math id="M32" display="block"><mml:mrow><mml:mi>r</mml:mi><mml:mo>=</mml:mo><mml:mi>C</mml:mi><mml:mo>/</mml:mo><mml:mi>B</mml:mi></mml:mrow></mml:math></disp-formula>
          to separate the long-term background aerosol concentration, <inline-formula><mml:math id="M33" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula>, and synoptic
variability, <inline-formula><mml:math id="M34" display="inline"><mml:mi>r</mml:mi></mml:math></inline-formula>, superimposed on <inline-formula><mml:math id="M35" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula>, where <inline-formula><mml:math id="M36" display="inline"><mml:mi>C</mml:mi></mml:math></inline-formula> is the daily averaged PM<inline-formula><mml:math id="M37" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentration. We term this process “timescale separation”. <inline-formula><mml:math id="M38" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> is calculated as a 31 d running average over the current year and the previous year, i.e.,
            <disp-formula id="Ch1.E2" content-type="numbered"><label>2</label><mml:math id="M39" display="block"><mml:mrow><mml:mi>B</mml:mi><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">62</mml:mn></mml:mfrac></mml:mstyle><mml:mfenced close=")" open="("><mml:mrow><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>y</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>y</mml:mi></mml:munderover><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>d</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">15</mml:mn></mml:mrow><mml:mrow><mml:mi>d</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">15</mml:mn></mml:mrow></mml:munderover><mml:mi>C</mml:mi></mml:mrow></mml:mfenced><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
          where <inline-formula><mml:math id="M40" display="inline"><mml:mi>d</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M41" display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula> denote the date and year of the PM<inline-formula><mml:math id="M42" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> sample, respectively. The seasonality, the long-term trend in emissions and local characteristics of each sample are contained in <inline-formula><mml:math id="M43" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula>, and <inline-formula><mml:math id="M44" display="inline"><mml:mi>r</mml:mi></mml:math></inline-formula>, estimated from meteorological data, indicates the effect of weather on high-frequency variations in PM<inline-formula><mml:math id="M45" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations. It should be noted that the timescale of the running average is not a sensitive parameter for the performance of the deepWIA model. When a new model with the same structure, input variables, and training method as the original deepWIA model, but with a 61 d running average for the current year and the previous year as the background is used, the model performance is close to the original one using the background value with 31 d running averaged (see Fig. S1 in the Supplement).</p>
      <p id="d1e1099">Target data imbalance is an issue of concern. Previous studies have shown
that PM<inline-formula><mml:math id="M46" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations have an extremely asymmetric long-tailed
probability distribution function (PDF; Lu, 2002; Feng et al., 2018). The
number of samples with low and medium values is much larger than that for high values (Fig. 1); <inline-formula><mml:math id="M47" display="inline"><mml:mi>r</mml:mi></mml:math></inline-formula> has a similar PDF, with values of 0–15, but
concentrated mainly between 0 and 2. Such a distribution would weaken the
performance of a data-based model, as it is difficult for such a model to
discern small differences among low-value samples. To mitigate such data imbalance, the fitting target (i.e., the deepWIA, labeled <inline-formula><mml:math id="M48" display="inline"><mml:mover accent="true"><mml:mi>r</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover></mml:math></inline-formula>) of our
model is defined as
            <disp-formula id="Ch1.E3" content-type="numbered"><label>3</label><mml:math id="M49" display="block"><mml:mrow><mml:mover accent="true"><mml:mi>r</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mo>=</mml:mo><mml:msub><mml:mi>log⁡</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mi>r</mml:mi><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>
          This label transformation maintains the value of the target between <inline-formula><mml:math id="M50" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">4</mml:mn></mml:mrow></mml:math></inline-formula> and 4 (Fig. 1c), giving a meaningful weather index for aerosol, with positive
and negative values denoting aerosol-pollution days and clean days, respectively. For example, <inline-formula><mml:math id="M51" display="inline"><mml:mrow><mml:mover accent="true"><mml:mi>r</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mo>=</mml:mo><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M52" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> mean that the PM<inline-formula><mml:math id="M53" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula>
concentrations will be 2 times (i.e., 2<inline-formula><mml:math id="M54" display="inline"><mml:msup><mml:mi/><mml:mn mathvariant="normal">1</mml:mn></mml:msup></mml:math></inline-formula>) and <inline-formula><mml:math id="M55" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:math></inline-formula> of (i.e., 2<inline-formula><mml:math id="M56" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>) the background concentration <inline-formula><mml:math id="M57" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula>, respectively.</p>
      <p id="d1e1238">National surface PM<inline-formula><mml:math id="M58" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> observations are from the real-time air quality
platform (<uri>https://air.cnemc.cn:18007</uri>, last access: 3 August 2022) of the China National Environmental Monitoring Center. This platform has published air quality data since 2013. We use data from 2015 because the number of observation sites since that year exceeds 1000, with a widespread distribution across the country, making the sample more representative. Furthermore, the number of PM<inline-formula><mml:math id="M59" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> observation sites within different ERA5 grid cells is uneven, which would also undermine the representativeness of the sampling. Therefore, we use gridded observations, with the PM<inline-formula><mml:math id="M60" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> observation in a grid cell being the mean of all observations within that cell.</p>
</sec>
<sec id="Ch1.S2.SS3">
  <label>2.3</label><title>Model description</title>
      <p id="d1e1279">Aerosol concentrations at specific times and locations depend on local and
surrounding meteorological fields over the current and past few days, as CTMs indicate. Therefore, we designed the deepWIA model as a spatiotemporal neural network (Fig. 2).</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F2"><?xmltex \currentcnt{2}?><?xmltex \def\figurename{Figure}?><label>Figure 2</label><caption><p id="d1e1284">Backbone architecture of the deepWIA model.</p></caption>
          <?xmltex \igopts{width=236.157874pt}?><graphic xlink:href="https://acp.copernicus.org/articles/23/375/2023/acp-23-375-2023-f02.png"/>

        </fig>

      <p id="d1e1293">The spatial module of the model is based on residual network (ResNet; He et al., 2016). At each time step (i.e., day), the module can extract the information of the input variable and its spatial pattern within <inline-formula><mml:math id="M61" display="inline"><mml:mrow><mml:mn mathvariant="normal">9</mml:mn><mml:mo>×</mml:mo><mml:mn mathvariant="normal">9</mml:mn></mml:mrow></mml:math></inline-formula> ERA5 grid cells (about <inline-formula><mml:math id="M62" display="inline"><mml:mrow><mml:mn mathvariant="normal">200</mml:mn><mml:mo>×</mml:mo><mml:mn mathvariant="normal">200</mml:mn></mml:mrow></mml:math></inline-formula> km in China) around each observation sample point. We chose such a <inline-formula><mml:math id="M63" display="inline"><mml:mrow><mml:mn mathvariant="normal">9</mml:mn><mml:mo>×</mml:mo><mml:mn mathvariant="normal">9</mml:mn></mml:mrow></mml:math></inline-formula> sampling grid cell with reference to Feng et al. (2020b) and the limitations of our computational resources. The ResNet has a structure similar to that of the classical ResNet-50 (He et al., 2016), but only 49 convolution layers and a maximum of 512 channels (i.e., variables in convolution layers). These convolution layers of the ResNet automatically reorganize the input variables into multiple features associated with the
target (i.e., PM<inline-formula><mml:math id="M64" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations). This ResNet does not have the final pooling (i.e., spatial average) layer of the original ResNet-50, because a sample over the <inline-formula><mml:math id="M65" display="inline"><mml:mrow><mml:mn mathvariant="normal">9</mml:mn><mml:mo>×</mml:mo><mml:mn mathvariant="normal">9</mml:mn></mml:mrow></mml:math></inline-formula> ERA5 grid cell has shrunk to a scalar spatially after 49 convolution layers. The number of channels is also less than the traditional ResNet-50 due to our computational resource limitation. And more channels do not provide better model performance. To be summarized, the ResNet module fuses meteorological and quasistatic variables around the sample points at each time step into multiple features.</p>
      <p id="d1e1354"><?xmltex \hack{\newpage}?>The ResNet-extracted features are fed into the temporal module based on a gated recurrent unit (GRU) (Cho et al., 2014). The GRU is a recurrent neural network (RNN) that links the multiple features in a day-by-day order, combines the features together, and provides the final estimation of PM<inline-formula><mml:math id="M66" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations. Here, we consider a short 3 d GRU structure, with the exclusion of impacts of weather more than 3 d earlier. Unlike other applications of GRU, we do not use the output in every time step, except for the final day (Fig. 2), as we fit the deepWIA only on the last day. The GRU has learnable “gate” parameters that determine the extent to which features in previous days affect current aerosol concentrations. In other words, they would help the model understand aerosol accumulation–removal processes caused by weather changes. There is only 1 hidden layer with 1024 channels, and it is therefore computationally efficient. To summarize, GRU quantifies the influences of meteorology over 3 consecutive days and maps these influences on the PM<inline-formula><mml:math id="M67" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations on the final day.</p>
      <p id="d1e1376">Model outputs on the final day fit the target <inline-formula><mml:math id="M68" display="inline"><mml:mover accent="true"><mml:mi>r</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover></mml:math></inline-formula> for observation samples, using the mean squared error as the loss function.</p>
</sec>
<sec id="Ch1.S2.SS4">
  <label>2.4</label><title>Training and validation</title>
      <p id="d1e1397">We used ERA5 data and PM<inline-formula><mml:math id="M69" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> observations for 2015–2021 for training and validation. The number of training–validation samples was about 1.6 million. We selected the model using traditional 10-fold cross validation (CV), dividing training–validation samples randomly into 10 approximately equal parts, 9 of which were used for training and the remaining 1 for validation. To avoid model overfitting, the training process stopped when the loss function in the validation dataset did not decrease for several training epochs. Using every part as a validation dataset, the training–validation process was then repeated 10 times, generating 10 models. The RMSE for all validation datasets was used to select optimal hyperparameters such as learning rate, number of convolution channels, and batch size. Finally, retraining the entire training–validation dataset using these hyperparameters determined the final deepWIA model.</p>
      <p id="d1e1409">Both the deepWIA and the PM<inline-formula><mml:math id="M70" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations from Eqs. (1) and (3) were
evaluated to illustrate model performance. We used five evaluation metrics in scatterplots, including the commonly used <inline-formula><mml:math id="M71" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>, RMSE, and mean absolute error (MAE). It is common for ML/DL-based models to underestimate high values and overestimate low values due to data imbalance (including in PM<inline-formula><mml:math id="M72" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> retrieval models). Therefore, we used biases in the ranges of <inline-formula><mml:math id="M73" display="inline"><mml:mrow><mml:mover accent="true"><mml:mi>r</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M74" display="inline"><mml:mrow><mml:mover accent="true"><mml:mi>r</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> to evaluate model performance for clean and polluted weather, respectively. For PM<inline-formula><mml:math id="M75" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations (<inline-formula><mml:math id="M76" display="inline"><mml:mi>C</mml:mi></mml:math></inline-formula>), we used the ranges of <inline-formula><mml:math id="M77" display="inline"><mml:mrow><mml:mi>C</mml:mi><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">35</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M78" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<inline-formula><mml:math id="M79" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> and <inline-formula><mml:math id="M80" display="inline"><mml:mrow><mml:mi>C</mml:mi><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">35</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M81" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<inline-formula><mml:math id="M82" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>, as 35 <inline-formula><mml:math id="M83" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<inline-formula><mml:math id="M84" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> is the PM<inline-formula><mml:math id="M85" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentration limit of the China
ambient air quality standard.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F3" specific-use="star"><?xmltex \currentcnt{3}?><?xmltex \def\figurename{Figure}?><label>Figure 3</label><caption><p id="d1e1584">Training density scatterplots of <bold>(a)</bold> deepWIA (<inline-formula><mml:math id="M86" display="inline"><mml:mover accent="true"><mml:mi>r</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover></mml:math></inline-formula>) and <bold>(b)</bold> PM<inline-formula><mml:math id="M87" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations using data for 2015–2021 as a training set.</p></caption>
          <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://acp.copernicus.org/articles/23/375/2023/acp-23-375-2023-f03.png"/>

        </fig>

      <?xmltex \floatpos{t}?><fig id="Ch1.F4" specific-use="star"><?xmltex \currentcnt{4}?><?xmltex \def\figurename{Figure}?><label>Figure 4</label><caption><p id="d1e1621">As for Fig. 3, but for the first validation dataset.</p></caption>
          <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://acp.copernicus.org/articles/23/375/2023/acp-23-375-2023-f04.png"/>

        </fig>

      <p id="d1e1630"><?xmltex \hack{\newpage}?>Figure 3 shows the fitting scatterplots of deepWIA and PM<inline-formula><mml:math id="M88" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations for the entire training–validation dataset. The <inline-formula><mml:math id="M89" display="inline"><mml:mover accent="true"><mml:mi>r</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover></mml:math></inline-formula> value had an RMSE of 0.45, an MAE of 0.34, and an <inline-formula><mml:math id="M90" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> value of 0.58. The PM<inline-formula><mml:math id="M91" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentration had an RMSE of 16.91 <inline-formula><mml:math id="M92" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<inline-formula><mml:math id="M93" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>, an MAE of 9.5 <inline-formula><mml:math id="M94" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<inline-formula><mml:math id="M95" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>, and an R<inline-formula><mml:math id="M96" display="inline"><mml:msup><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:math></inline-formula> value of 0.76. Additionally, The DL model still underestimated high values and overestimated low values, although label transformation and some other processes were performed.</p>
      <p id="d1e1723">Scatterplots for the first validation dataset (Fig. 4) show slightly lower
performance than that for the training set (RMSE <inline-formula><mml:math id="M97" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 0.49, MAE <inline-formula><mml:math id="M98" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 0.38, and <inline-formula><mml:math id="M99" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.49</mml:mn></mml:mrow></mml:math></inline-formula> for <inline-formula><mml:math id="M100" display="inline"><mml:mover accent="true"><mml:mi>r</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover></mml:math></inline-formula>; and 16.01 <inline-formula><mml:math id="M101" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<inline-formula><mml:math id="M102" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>, 9.67 <inline-formula><mml:math id="M103" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<inline-formula><mml:math id="M104" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> and 0.70, respectively, for PM<inline-formula><mml:math id="M105" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentration), partly because of the smaller set of training samples than that used in final training. Validations in the other nine validation datasets had similar performance, as summarized in Figs. S2 and S3. The RMSE and <inline-formula><mml:math id="M106" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> values for <inline-formula><mml:math id="M107" display="inline"><mml:mover accent="true"><mml:mi>r</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover></mml:math></inline-formula> for these validation datasets were in narrow ranges of 0.48–0.55 and 0.47–0.50, and the RMSE and <inline-formula><mml:math id="M108" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> values for PM<inline-formula><mml:math id="M109" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations were 0.67–0.77 and 15.54–21.68 <inline-formula><mml:math id="M110" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<inline-formula><mml:math id="M111" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>, respectively. These metrics for 10-fold CV indicate no significant overfitting by the final deepWIA model and prove the stability of the model generated by the ResNet–GRU structure.</p>
      <p id="d1e1877">Once the DL model is established after training, a question worth discussing is the relative importance of these input variables. A DL model cannot answer by voting as the RF model. Therefore, here we perform sensitivity experiments to solve the problem: (1) for every input variable shown in Table 1, we deactivate it by setting all related model parameters to zero in the first convolutional layer; (2) we apply the modified model (i.e., without the effect of the given variable) to the training dataset and compute the RMSE of deepWIA; (3) we compute the difference between the RMSE and that of the original model. The larger the RMSE increases, the more important the input variable is. We applied these steps to all input variables and showed their importance rankings in Table 1. The five most important variables are latitude and longitude, 2 m mixing ratio, population density, maximal 2 m temperature, and surface turbulence stress components. However, some variables take little effect on the model (with an RMSE increase of less than 0.001), including wet deposition potency, precipitation, geopotential height at 850 hPa, ventilation potency, downward short-wave radiation, low cloud cover, and high vegetation cover.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F5" specific-use="star"><?xmltex \currentcnt{5}?><?xmltex \def\figurename{Figure}?><label>Figure 5</label><caption><p id="d1e1882">As for Fig. 3, but for the test dataset for 3 January to 30 April 2022.</p></caption>
          <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://acp.copernicus.org/articles/23/375/2023/acp-23-375-2023-f05.png"/>

        </fig>

      <p id="d1e1892">Nevertheless, it would not be fair to compare the contribution of individual
input variables to the DL model because there are overlaps in the contribution of several variables, such as 100 and 10 m winds. Therefore, we grouped all variables into six groups, namely near-surface wind variables, near-surface temperature-humidity variables, near-surface vertical diffusion variables, spatiotemporal geographic variables, synoptic pattern and radiation variables, and precipitation variables (Table S1 in the Supplement). Using the same approach as the individual variable, we compute the importance of each group of variables. The most important group is the spatiotemporal geographic variable, followed by the vertical diffusion and near-surface wind variables. The least important one is precipitation (Fig. S4).</p>
</sec>
</sec>
<sec id="Ch1.S3">
  <label>3</label><title>Model performance on the test dataset</title>
      <p id="d1e1904">Data for 3 January to 30 April 2022 were used as the test dataset including about 85 000 samples to demonstrate model performance in the normal aerosol-pollution season in China. Feeding the input variables from the test dataset into the final deepWIA model yields the estimated <inline-formula><mml:math id="M112" display="inline"><mml:mover accent="true"><mml:mi>r</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover></mml:math></inline-formula>. A scatterplot of <inline-formula><mml:math id="M113" display="inline"><mml:mover accent="true"><mml:mi>r</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover></mml:math></inline-formula> and the corresponding PM<inline-formula><mml:math id="M114" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentration of the test dataset is shown in Fig. 5. The <inline-formula><mml:math id="M115" display="inline"><mml:mover accent="true"><mml:mi>r</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover></mml:math></inline-formula> value had an RMSE of 0.5, an MAE of 0.39, and <inline-formula><mml:math id="M116" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> of 0.53. The performance just decreased slightly relative to that with the training set, indicating that the deepWIA model is strongly robust with the test dataset. And the <inline-formula><mml:math id="M117" display="inline"><mml:mover accent="true"><mml:mi>r</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover></mml:math></inline-formula>-based PM<inline-formula><mml:math id="M118" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations had an RMSE of 16.54 <inline-formula><mml:math id="M119" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<inline-formula><mml:math id="M120" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>, an MAE of
10.25 <inline-formula><mml:math id="M121" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<inline-formula><mml:math id="M122" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>, and <inline-formula><mml:math id="M123" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> of 0.72. Note that some of the evaluation metrics were better than those of validation datasets because more samples were used to generate the final model than were used in validation. The stable performance using the training set, the 10-fold CV sets, and the test dataset indicates that our model can be safely used for quantifying weather conditions of PM<inline-formula><mml:math id="M124" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations, at least in aerosol-pollution seasons.</p>
      <p id="d1e2038">The geographic distribution of biases and RMSEs for <inline-formula><mml:math id="M125" display="inline"><mml:mover accent="true"><mml:mi>r</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover></mml:math></inline-formula> and PM<inline-formula><mml:math id="M126" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula>
concentrations estimated by the deepWIA model are shown in Fig. 6. There was
no significant estimation bias of <inline-formula><mml:math id="M127" display="inline"><mml:mover accent="true"><mml:mi>r</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover></mml:math></inline-formula> with observations in most grid
cells. Small overestimations (positive biases) of <inline-formula><mml:math id="M128" display="inline"><mml:mover accent="true"><mml:mi>r</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover></mml:math></inline-formula> occurred in
Northeast China, the North China Plain (NCP), Ningxia, and the Zhuhai–Hong
Kong–Macao Bay area (ZHM), whereas underestimations (negative biases)
mainly occurred in south-central China. The estimated PM<inline-formula><mml:math id="M129" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentration remained unbiased in some areas but was underestimated in some grid cells in the NCP, Northeast China, the Sichuan Basin, and south-central
China, with values of <inline-formula><mml:math id="M130" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">6</mml:mn></mml:mrow></mml:math></inline-formula> to <inline-formula><mml:math id="M131" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">8</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M132" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<inline-formula><mml:math id="M133" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>. The model also
significantly underestimated PM<inline-formula><mml:math id="M134" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations in the area around
the Taklamakan Desert by up to <inline-formula><mml:math id="M135" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M136" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<inline-formula><mml:math id="M137" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>. The <inline-formula><mml:math id="M138" display="inline"><mml:mover accent="true"><mml:mi>r</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover></mml:math></inline-formula> values had small RMSEs in the southern NCP, the Sichuan Basin, and the ZHM, with corresponding small RMSEs in estimated PM<inline-formula><mml:math id="M139" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations of 0–10 <inline-formula><mml:math id="M140" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<inline-formula><mml:math id="M141" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>. Larger RMSEs for PM<inline-formula><mml:math id="M142" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations occurred in some grid cells located in Northeast China, Xinjiang, Ningxia, and the western NCP, with values of <inline-formula><mml:math id="M143" display="inline"><mml:mrow><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">20</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M144" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<inline-formula><mml:math id="M145" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>. Large RMSEs and biases in Xinjiang and Ningxia may be attributed to the frequent occurrence of dust storms there (Wang et al., 2004). Due to the scarcity of samples, a meteorological data-based model cannot fully understand dust storm occurrence.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F6" specific-use="star"><?xmltex \currentcnt{6}?><?xmltex \def\figurename{Figure}?><label>Figure 6</label><caption><p id="d1e2251">Test biases <bold>(a, c)</bold> and RMSEs <bold>(b, d)</bold> in deepWIA
(<inline-formula><mml:math id="M146" display="inline"><mml:mover accent="true"><mml:mi>r</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover></mml:math></inline-formula>) <bold>(a, b)</bold> and PM<inline-formula><mml:math id="M147" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations <bold>(c, d)</bold> over China from 3 January to 30 April 2022.</p></caption>
        <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://acp.copernicus.org/articles/23/375/2023/acp-23-375-2023-f06.png"/>

      </fig>

      <p id="d1e2293">Eight cities were selected to illustrate the performance of the deepWIA model in time series, with analysis of daily variations in PM<inline-formula><mml:math id="M148" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations (Fig. 7). The cities (Fig. 6c) are in northern China (Beijing and Xi'an), eastern China (Shanghai and Hangzhou), southwest China (Chengdu and Chongqing), and south-central China (Wuhan and Changsha), all of which suffer from aerosol pollution.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F7" specific-use="star"><?xmltex \currentcnt{7}?><?xmltex \def\figurename{Figure}?><label>Figure 7</label><caption><p id="d1e2307">Day-to-day series of PM<inline-formula><mml:math id="M149" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations based on observations (blue curves), WRF-Chem (orange curves), and deepWIA model (green curves) in eight cities in China, 3 January to 30 April 2022.</p></caption>
        <?xmltex \igopts{width=398.338583pt}?><graphic xlink:href="https://acp.copernicus.org/articles/23/375/2023/acp-23-375-2023-f07.png"/>

      </fig>

      <p id="d1e2325">For comparison, the results of a WRF-Chem simulation are also presented (Fig. 7). Similar to deepWIA, we also use the ERA5 data to drive the WRF-Chem model. Hence, both WRF-Chem and deepWIA models are run in hindcast
mode. The simulation domain covered China, including the above eight cities,
with a high horizontal resolution of 9 km. The model used the Multi-resolution Emission Inventory for China (MEIC, <uri>http://meicmodel.org/</uri>, last access: 3 August 2022) (Li et al., 2017) as an emission inventory. To avoid weather-system drift due to long-term model integration (Feng et al., 2020a), the simulation restarted every day at 12:00 UTC, with the mean of 12–35 h (i.e., 00:00–23:00 UTC) simulated PM<inline-formula><mml:math id="M150" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations being used as the daily value.</p>
      <p id="d1e2340">Estimations using the deepWIA model captured day-to-day variations in PM<inline-formula><mml:math id="M151" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations, outperforming the WRF-Chem simulation in all eight cities with a significant reduction in RMSEs and improvement in <inline-formula><mml:math id="M152" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> (RMSEs <inline-formula><mml:math id="M153" display="inline"><mml:mo>≤</mml:mo></mml:math></inline-formula> 19 <inline-formula><mml:math id="M154" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<inline-formula><mml:math id="M155" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> and <inline-formula><mml:math id="M156" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>≥</mml:mo><mml:mn mathvariant="normal">0.65</mml:mn></mml:mrow></mml:math></inline-formula>). The simulation accuracy of WRF-Chem varied substantially in different regions of China. The four cities, including Beijing, Shanghai, Hangzhou, and Chengdu, yielded good performances, with RMSE <inline-formula><mml:math id="M157" display="inline"><mml:mo>≤</mml:mo></mml:math></inline-formula> 30 <inline-formula><mml:math id="M158" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<inline-formula><mml:math id="M159" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>. The WRF-Chem model largely failed to capture the day-to-day variations in aerosol concentrations in the other five cities. In comparison, the deepWIA model gave a robust performance in both northern and southern China, indicating a wide application potential for different regions. In conclusion, Fig. 5 shows that the main problem with the deepWIA model is underestimation in extreme values of PM<inline-formula><mml:math id="M160" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentration, leading to the omission of some heavy haze events.</p>
      <p id="d1e2442">To further present the good performance of the deepWIA model, two additional comparisons with WRF-Chem are given. The first is the comparison of synoptic variabilities that remove the variation longer than 31 d (Fig. S5), like the timescale focused by the deepWIA model. The second is a comparison with an operational system for air quality forecast based on WRF-Chem (Fig. S6). The simulation has the same spatial and temporal resolution as the ERA5-driven one above but is optimized for northern China. To reduce initial and boundary errors, the system used the real-time assimilated meteorological field and assimilated PM<inline-formula><mml:math id="M161" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula>, PM<inline-formula><mml:math id="M162" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">10</mml:mn></mml:msub></mml:math></inline-formula>, <inline-formula><mml:math id="M163" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">SO</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M164" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">NO</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M165" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">O</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>, and CO concentrations within the domain using the newly developed 3DVar module for WRF-Chem. In both comparisons, the deepWIA model significantly outperforms the corresponding WRF-Chem simulations for all eight cities.</p>
</sec>
<sec id="Ch1.S4">
  <label>4</label><title>Ablation experiments and related studies</title>
<sec id="Ch1.S4.SS1">
  <label>4.1</label><title>Comparison of ablation experiments</title>
      <p id="d1e2511">Although the deepWIA appears accurate and robust in capturing synoptic
variations in PM<inline-formula><mml:math id="M166" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations, it is of interest to investigate the reason for its strong performance. The model has three key points: (1) a
ResNet–GRU structure with more meteorological variables; (2) a timescale
separation approach making the model focus only on the effects of meteorology on synoptic variations in PM<inline-formula><mml:math id="M167" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations; and (3) a label transformation approach based on a logarithmic function to mitigate data imbalance. To investigate the relative importance of these processes for the final deepWIA model, two additional ablation experiments were performed for comparison:
<list list-type="bullet"><list-item>
      <p id="d1e2534">AbExp_1: with fitting of PM<inline-formula><mml:math id="M168" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations directly using the same ResNet–GRU structure, samples, and training strategy, but with no timescale separation or label transformation. This experiment was similar to studies of ML-based PM<inline-formula><mml:math id="M169" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentration retrieval but using meteorological variables as primary data. This experiment was intended to assess the basic fitting power due to the DL structure and input variables.</p></list-item><list-item>
      <p id="d1e2556">AbExp_2: with fitting of <inline-formula><mml:math id="M170" display="inline"><mml:mi>r</mml:mi></mml:math></inline-formula> (Sect. 2.2) using the same model structure, samples, training strategy, and timescale separation, but with no label transformation. A comparison of the results of AbExp_1 and AbExp_2 illustrates the importance of timescale separation. A comparison of the results of AbExp_2 and original deepWIA illustrates the impacts of label transform.</p></list-item></list></p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F8" specific-use="star"><?xmltex \currentcnt{8}?><?xmltex \def\figurename{Figure}?><label>Figure 8</label><caption><p id="d1e2568">Density scatterplots of PM<inline-formula><mml:math id="M171" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations for the test dataset from the ablation experiments <bold>(a)</bold> directly using the PM<inline-formula><mml:math id="M172" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula>concentration as the target, and <bold>(b)</bold> using <inline-formula><mml:math id="M173" display="inline"><mml:mi>r</mml:mi></mml:math></inline-formula> as the target (i.e., without label transform based on logarithmic function).</p></caption>
          <?xmltex \igopts{width=341.433071pt}?><graphic xlink:href="https://acp.copernicus.org/articles/23/375/2023/acp-23-375-2023-f08.png"/>

        </fig>

      <p id="d1e2608"><?xmltex \hack{\noindent}?>Scatterplots of PM<inline-formula><mml:math id="M174" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations for AbExp_1 and AbExp_2 using the same test dataset as that used for the deepWIA model are shown in Fig. 8. The AbExp_1 experiment had an RMSE of 19.18 <inline-formula><mml:math id="M175" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<inline-formula><mml:math id="M176" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>, an MAE of 12.9 <inline-formula><mml:math id="M177" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<inline-formula><mml:math id="M178" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>, and an <inline-formula><mml:math id="M179" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> value of 0.63, achieving the level of ML-based PM<inline-formula><mml:math id="M180" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentration retrieval (Sect. 4.2). The DL structure and the feature engineering for input variables thus builds a solid foundation for the fitting power of the deepWIA model. Compared with AbExp_1, AbExp_2 improved the <inline-formula><mml:math id="M181" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> value to 0.70, with the RMSE decreasing to 17.13 <inline-formula><mml:math id="M182" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<inline-formula><mml:math id="M183" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> and the MAE to 10.92 <inline-formula><mml:math id="M184" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<inline-formula><mml:math id="M185" display="inline"><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>, indicating the importance of timescale separation. Furthermore, the focus on synoptic variation also helped mitigate the overestimation of low values and underestimation of high values. The final deepWIA model further improved the general performance in estimating PM<inline-formula><mml:math id="M186" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations, with improved <inline-formula><mml:math id="M187" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>, MAE, and RMSE values. The logarithmic function-based label transformation mitigated the overestimation of low values while exacerbating the underestimation of high values, with this treatment increasing the distance between low values but decreasing the distance between high values of the samples. A scheme such as AbExp_2 may therefore be applicable to studies of extreme haze events. To summarize, model and feature engineering are most important in determining the final performance of the deepWIA model, with timescale separation and label transformation following in that order.</p>
</sec>
<sec id="Ch1.S4.SS2">
  <label>4.2</label><title>Comparison with models used in previous studies</title>
      <p id="d1e2762">Recent studies of PM<inline-formula><mml:math id="M188" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentration retrieval use ML/DL models such
as RF, XGB and MLP (Table 2). Unlike our model, these studies were not concerned with the role of meteorology but only with the accuracy of estimated PM<inline-formula><mml:math id="M189" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations. There are many differences between these methods and the deepWIA model in the model-building processes. For example, (1) the deepWIA model uses timescale separation to focus on synoptic variations in aerosol concentrations caused by meteorology. We do not use an
emission inventory as an input feature for the model because of its significant uncertainty. It is difficult for DL models, which rely heavily
on input data, to build robust relationships among emissions, meteorology,
and aerosol concentrations. (2) Except for the approach of Geng et al. (2021), the training sample size used in deepWIA is much larger than that
used in previous models, which often used data of 1 year for training (Geng
et al., 2021 also built the ML model year-by-year, starting from 2013). The
large sample size aids the building of a more robust model. (3) We introduce more derived meteorological variables than most studies by feature
engineering.</p>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T2" specific-use="star"><?xmltex \currentcnt{2}?><label>Table 2</label><caption><p id="d1e2786">Comparison of studies of observation retrieval of PM<inline-formula><mml:math id="M190" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula>
concentrations and deepWIA. “<inline-formula><mml:math id="M191" display="inline"><mml:mo>√</mml:mo></mml:math></inline-formula>” indicates data used as model input
features. ERT and GBDT denote extreme random trees and gradient boosting
decision trees, respectively.</p></caption><oasis:table frame="topbot"><?xmltex \begin{scaleboxenv}{.85}[.85]?><oasis:tgroup cols="9">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="left"/>
     <oasis:colspec colnum="4" colname="col4" align="left"/>
     <oasis:colspec colnum="5" colname="col5" align="left"/>
     <oasis:colspec colnum="6" colname="col6" align="left"/>
     <oasis:colspec colnum="7" colname="col7" align="left"/>
     <oasis:colspec colnum="8" colname="col8" align="left"/>
     <oasis:colspec colnum="9" colname="col9" align="left"/>
     <oasis:thead>
       <oasis:row>

         <oasis:entry colname="col1"/>

         <oasis:entry colname="col2"/>

         <oasis:entry colname="col3">Wei</oasis:entry>

         <oasis:entry colname="col4">Li</oasis:entry>

         <oasis:entry colname="col5">Gui</oasis:entry>

         <oasis:entry colname="col6">Wei</oasis:entry>

         <oasis:entry colname="col7">Geng</oasis:entry>

         <oasis:entry colname="col8">Song</oasis:entry>

         <oasis:entry colname="col9">deepWIA</oasis:entry>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col1"/>

         <oasis:entry colname="col2"/>

         <oasis:entry colname="col3">et al.</oasis:entry>

         <oasis:entry colname="col4">et al.</oasis:entry>

         <oasis:entry colname="col5">et al.</oasis:entry>

         <oasis:entry colname="col6">et al.</oasis:entry>

         <oasis:entry colname="col7">et al.</oasis:entry>

         <oasis:entry colname="col8">et al.</oasis:entry>

         <oasis:entry colname="col9"/>

       </oasis:row>
       <oasis:row rowsep="1">

         <oasis:entry colname="col1"/>

         <oasis:entry colname="col2"/>

         <oasis:entry colname="col3">(2019a)</oasis:entry>

         <oasis:entry colname="col4">(2020)</oasis:entry>

         <oasis:entry colname="col5">(2020)</oasis:entry>

         <oasis:entry colname="col6">(2020)</oasis:entry>

         <oasis:entry colname="col7">(2021)</oasis:entry>

         <oasis:entry colname="col8">(2021)</oasis:entry>

         <oasis:entry colname="col9"/>

       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>

         <oasis:entry rowsep="1" colname="col1" morerows="4">Data</oasis:entry>

         <oasis:entry colname="col2">Meteor.</oasis:entry>

         <oasis:entry colname="col3"><inline-formula><mml:math id="M192" display="inline"><mml:mo>√</mml:mo></mml:math></inline-formula></oasis:entry>

         <oasis:entry colname="col4"><inline-formula><mml:math id="M193" display="inline"><mml:mo>√</mml:mo></mml:math></inline-formula></oasis:entry>

         <oasis:entry colname="col5"><inline-formula><mml:math id="M194" display="inline"><mml:mo>√</mml:mo></mml:math></inline-formula></oasis:entry>

         <oasis:entry colname="col6"><inline-formula><mml:math id="M195" display="inline"><mml:mo>√</mml:mo></mml:math></inline-formula></oasis:entry>

         <oasis:entry colname="col7"><inline-formula><mml:math id="M196" display="inline"><mml:mo>√</mml:mo></mml:math></inline-formula></oasis:entry>

         <oasis:entry colname="col8"><inline-formula><mml:math id="M197" display="inline"><mml:mo>√</mml:mo></mml:math></inline-formula></oasis:entry>

         <oasis:entry colname="col9"><inline-formula><mml:math id="M198" display="inline"><mml:mo>√</mml:mo></mml:math></inline-formula></oasis:entry>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col2">Quasistatic</oasis:entry>

         <oasis:entry colname="col3"><inline-formula><mml:math id="M199" display="inline"><mml:mo>√</mml:mo></mml:math></inline-formula></oasis:entry>

         <oasis:entry colname="col4"><inline-formula><mml:math id="M200" display="inline"><mml:mo>√</mml:mo></mml:math></inline-formula></oasis:entry>

         <oasis:entry colname="col5"><inline-formula><mml:math id="M201" display="inline"><mml:mo>√</mml:mo></mml:math></inline-formula></oasis:entry>

         <oasis:entry colname="col6"><inline-formula><mml:math id="M202" display="inline"><mml:mo>√</mml:mo></mml:math></inline-formula></oasis:entry>

         <oasis:entry colname="col7"><inline-formula><mml:math id="M203" display="inline"><mml:mo>√</mml:mo></mml:math></inline-formula></oasis:entry>

         <oasis:entry colname="col8"><inline-formula><mml:math id="M204" display="inline"><mml:mo>√</mml:mo></mml:math></inline-formula></oasis:entry>

         <oasis:entry colname="col9"><inline-formula><mml:math id="M205" display="inline"><mml:mo>√</mml:mo></mml:math></inline-formula></oasis:entry>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col2">Satellite</oasis:entry>

         <oasis:entry colname="col3"><inline-formula><mml:math id="M206" display="inline"><mml:mo>√</mml:mo></mml:math></inline-formula></oasis:entry>

         <oasis:entry colname="col4"><inline-formula><mml:math id="M207" display="inline"><mml:mo>√</mml:mo></mml:math></inline-formula></oasis:entry>

         <oasis:entry colname="col5"/>

         <oasis:entry colname="col6"><inline-formula><mml:math id="M208" display="inline"><mml:mo>√</mml:mo></mml:math></inline-formula></oasis:entry>

         <oasis:entry colname="col7"><inline-formula><mml:math id="M209" display="inline"><mml:mo>√</mml:mo></mml:math></inline-formula></oasis:entry>

         <oasis:entry colname="col8"><inline-formula><mml:math id="M210" display="inline"><mml:mo>√</mml:mo></mml:math></inline-formula></oasis:entry>

         <oasis:entry colname="col9"/>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col2">Visibility</oasis:entry>

         <oasis:entry colname="col3"/>

         <oasis:entry colname="col4"/>

         <oasis:entry colname="col5"><inline-formula><mml:math id="M211" display="inline"><mml:mo>√</mml:mo></mml:math></inline-formula></oasis:entry>

         <oasis:entry colname="col6"/>

         <oasis:entry colname="col7"/>

         <oasis:entry colname="col8"/>

         <oasis:entry colname="col9"/>

       </oasis:row>
       <oasis:row rowsep="1">

         <oasis:entry colname="col2">CTM</oasis:entry>

         <oasis:entry colname="col3"/>

         <oasis:entry colname="col4"/>

         <oasis:entry colname="col5"/>

         <oasis:entry colname="col6"/>

         <oasis:entry colname="col7"><inline-formula><mml:math id="M212" display="inline"><mml:mo>√</mml:mo></mml:math></inline-formula></oasis:entry>

         <oasis:entry colname="col8"/>

         <oasis:entry colname="col9"/>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col1" morerows="2">Model key points</oasis:entry>

         <oasis:entry colname="col2">Backbone</oasis:entry>

         <oasis:entry colname="col3">RF</oasis:entry>

         <oasis:entry colname="col4">MLP</oasis:entry>

         <oasis:entry colname="col5">XGB</oasis:entry>

         <oasis:entry colname="col6">ERT</oasis:entry>

         <oasis:entry colname="col7">RF</oasis:entry>

         <oasis:entry colname="col8">RF, GBDT, MLP</oasis:entry>

         <oasis:entry colname="col9">ResNet–GRU</oasis:entry>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col2">Data size (million)</oasis:entry>

         <oasis:entry colname="col3">0.15</oasis:entry>

         <oasis:entry colname="col4">0.06</oasis:entry>

         <oasis:entry colname="col5">0.37</oasis:entry>

         <oasis:entry colname="col6">0.23</oasis:entry>

         <oasis:entry colname="col7"><inline-formula><mml:math id="M213" display="inline"><mml:mrow><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>

         <oasis:entry colname="col8">Not reported</oasis:entry>

         <oasis:entry colname="col9"><inline-formula><mml:math id="M214" display="inline"><mml:mrow><mml:mo>∼</mml:mo><mml:mn mathvariant="normal">1.7</mml:mn></mml:mrow></mml:math></inline-formula></oasis:entry>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col2">Spatiotemporal info.</oasis:entry>

         <oasis:entry colname="col3">Tempo. dist.</oasis:entry>

         <oasis:entry colname="col4">Tempo. dist.</oasis:entry>

         <oasis:entry colname="col5">Not used</oasis:entry>

         <oasis:entry colname="col6">Tempo. dist.</oasis:entry>

         <oasis:entry colname="col7">Not used</oasis:entry>

         <oasis:entry colname="col8">Not used</oasis:entry>

         <oasis:entry colname="col9">Convolution and gates</oasis:entry>

       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup><?xmltex \end{scaleboxenv}?></oasis:table></table-wrap>

      <p id="d1e3259">Therefore, to make a fair comparison of the model per se, we use six popular
ML/DL models, with the same periods, stations, and input parameters as the
deepWIA model, including two RF, two XGB, and two MLP models using the input
data over 3 d (i.e., the same as the deepWIA) and only 1 d that is fitted, named RF1, RF3, XGB1, XGB3, MLP1, and MLP3 respectively (Table 3). The MLP models have nine full connection layers with the maximal 512 neurons in the fifth layer. Following the previous studies, all the models fit the PM<inline-formula><mml:math id="M215" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations directly. It should be noted that these models are applied here for the role of meteorological variables and thereby do not introduce satellite or visibility data, so the RMSEs here are slightly higher than those reported in previous studies.</p>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T3"><?xmltex \currentcnt{3}?><label>Table 3</label><caption><p id="d1e3275">Comparison of ML/DL models performance using the same time periods,
stations, and input parameters as the deepWIA model.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="6">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:colspec colnum="3" colname="col3" align="center"/>
     <oasis:colspec colnum="4" colname="col4" align="left"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:colspec colnum="6" colname="col6" align="center"/>
     <oasis:thead>
       <oasis:row>
         <oasis:entry colname="col1">Models</oasis:entry>
         <oasis:entry rowsep="1" namest="col2" nameend="col3" align="center">Training set </oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry rowsep="1" namest="col5" nameend="col6" align="center">Test set </oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">RMSE</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M218" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5">RMSE</oasis:entry>
         <oasis:entry colname="col6"><inline-formula><mml:math id="M219" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">RF1</oasis:entry>
         <oasis:entry colname="col2">7.15</oasis:entry>
         <oasis:entry colname="col3">0.97</oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5">25.43</oasis:entry>
         <oasis:entry colname="col6">0.34</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">RF3</oasis:entry>
         <oasis:entry colname="col2">6.72</oasis:entry>
         <oasis:entry colname="col3">0.97</oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5">23.66</oasis:entry>
         <oasis:entry colname="col6">0.43</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">XGB1</oasis:entry>
         <oasis:entry colname="col2">22.40</oasis:entry>
         <oasis:entry colname="col3">0.60</oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5">24.59</oasis:entry>
         <oasis:entry colname="col6">0.38</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">XGB3</oasis:entry>
         <oasis:entry colname="col2">20.36</oasis:entry>
         <oasis:entry colname="col3">0.67</oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5">23.76</oasis:entry>
         <oasis:entry colname="col6">0.42</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">MLP1</oasis:entry>
         <oasis:entry colname="col2">23.98</oasis:entry>
         <oasis:entry colname="col3">0.54</oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5">26.22</oasis:entry>
         <oasis:entry colname="col6">0.30</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">MLP3</oasis:entry>
         <oasis:entry colname="col2">20.42</oasis:entry>
         <oasis:entry colname="col3">0.67</oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5">22.10</oasis:entry>
         <oasis:entry colname="col6">0.50</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">deepWIA</oasis:entry>
         <oasis:entry colname="col2">16.91</oasis:entry>
         <oasis:entry colname="col3">0.76</oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5">16.54<inline-formula><mml:math id="M220" display="inline"><mml:msup><mml:mi/><mml:mo>*</mml:mo></mml:msup></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col6">0.72</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table><table-wrap-foot><p id="d1e3278"><inline-formula><mml:math id="M216" display="inline"><mml:msup><mml:mi/><mml:mo>*</mml:mo></mml:msup></mml:math></inline-formula> Note that the RMSE of deepWIA on the test dataset is smaller than that on the training dataset because the model does not directly fit the PM<inline-formula><mml:math id="M217" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentration.</p></table-wrap-foot></table-wrap>

      <p id="d1e3530">These six models all have higher RMSEs and lower <inline-formula><mml:math id="M221" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> than the deepWIA
model in the test set (even than that of the AbExp_1, which also fits PM<inline-formula><mml:math id="M222" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations directly; Fig. 8a). The models with 3 d data always performed better than these with only 1 d data, indicating the importance of temporal information. Additionally, there is more severe overfitting for these models than the deepWIA model, as evidenced by the large performance difference between the training and test sets, especially those of the RF1 and RF3.</p>
      <p id="d1e3553">The advantages of deepWIA over traditional RF, XGB and MLP models should be
attributed to two points: (1) the deepWIA model is much deeper than the
commonly used RF, XGB and MLP models, which aids learning of the complex
nonlinear relationship between meteorology and aerosol concentration; and (2) previous models do not necessarily include temporal correlations of aerosol concentrations; rather, some use a predefined spatiotemporal distance for the injection of temporal information (Wei et al., 2019a, b, 2020; Li et al., 2020). The deepWIA model uses gate parameters to learn dynamic links of aerosol concentration among days.</p>
      <p id="d1e3556">We also compare the deepWIA and two semiempirical meteorological indices for aerosol pollution, namely PLAM and ASI. These indices are commonly used to assess meteorological effects on variations in aerosol concentrations (Wang et al., 2021; X. Zhang et al., 2019). PLAM was applied to the NCP (Yang et al., 2016), using visibility as the target variable, while ASI was applied to North and Northeast China, using PM<inline-formula><mml:math id="M223" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations as the target variable. Both indices only considered the meteorology on that day only. By comparison, as described in Section 2.1, deepWIA includes all the kernel variables of these two indices, as well as other spatiotemporal information. It will form the best DL model to take advantage of these variables. Hence, its applicability extends to the whole country. Additionally, PLAM and ASI cannot provide a uniform model for PM<inline-formula><mml:math id="M224" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations, unlike deepWIA. PLAM focused on the relationship between meteorology and visibility; ASI just illustrates the temporal relationship between meteorology and PM<inline-formula><mml:math id="M225" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations, which varies from location to location. Therefore, estimating PM<inline-formula><mml:math id="M226" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations also requires additional linear modeling at each grid cell. Due to these advantages, the deepWIA could be a better tool for assessing the impact of weather on aerosol concentrations.</p>
</sec>
</sec>
<sec id="Ch1.S5">
  <label>5</label><title>Spatial distribution of deepWIA and its application in quantifying the aerosol-related weather condition</title>
      <p id="d1e3605">This section is to show the geographic distribution of deepWIA (<inline-formula><mml:math id="M227" display="inline"><mml:mover accent="true"><mml:mi>r</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover></mml:math></inline-formula>) over the test period, which can also be used to quantify the aerosol-related
weather conditions over China. A positive or negative deepWIA indicates
weather-related enhancement or reduction of aerosol pollution, respectively,
relative to background concentrations (<inline-formula><mml:math id="M228" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula>). We prepared an animation of daily deepWIA from 3 January to 30 April 2022, to illustrate synoptic variations in aerosol-associated weather in China (see the data availability
statements). To assess weather conditions over the test period, we applied a
statistical metric, the ratio of good weather (RGW) days for aerosol pollution calculated as
          <disp-formula id="Ch1.E4" content-type="numbered"><label>4</label><mml:math id="M229" display="block"><mml:mrow><mml:mi mathvariant="normal">RGW</mml:mi><mml:mo>=</mml:mo><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mover accent="true"><mml:mi>r</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mo>≤</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:msub><mml:mo>/</mml:mo><mml:mi>N</mml:mi><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>
        where <inline-formula><mml:math id="M230" display="inline"><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mover accent="true"><mml:mi>r</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mo>≤</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M231" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula> denote the number of days with <inline-formula><mml:math id="M232" display="inline"><mml:mrow><mml:mover accent="true"><mml:mi>r</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mo>≤</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> values and total days over the test period, respectively.</p>
      <p id="d1e3696">The geographic distributions of the RGW indicate that most areas in China had good weather for higher air quality during January–April 2022 (Fig. 9). In
south-central China, almost all grid points had RGWs <inline-formula><mml:math id="M233" display="inline"><mml:mo>&gt;</mml:mo></mml:math></inline-formula> 0.5 and negative MVs, implying favorable weather conditions for higher air quality.
In Beijing, RGW was about 0.65, implying a 15 % increase in clean air days
relative to background concentrations. Unfavorable weather for aerosol pollution was found mainly in the south-central NCP and on the western
fringe of the Sichuan Basin, with RGWs of 0.4–0.5. Note that with Eqs. (1)
and (2), all synoptic-scale changes are relative to long-term background
concentrations for the same season of the last two years. A similar approach
can be used to compare the effects of weather on aerosols between two periods (e.g., 2 years), by replacing the background concentration with that calculated over the base period.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F9"><?xmltex \currentcnt{9}?><?xmltex \def\figurename{Figure}?><label>Figure 9</label><caption><p id="d1e3708">Geographic distributions of the ratio of good weather (RGW) days for PM<inline-formula><mml:math id="M234" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations, 3 January to 30 April 2020.</p></caption>
        <?xmltex \igopts{width=199.169291pt}?><graphic xlink:href="https://acp.copernicus.org/articles/23/375/2023/acp-23-375-2023-f09.png"/>

      </fig>

</sec>
<sec id="Ch1.S6" sec-type="conclusions">
  <label>6</label><title>Conclusions</title>
      <p id="d1e3734">We propose a spatiotemporal deep network architecture to link meteorology and aerosol concentrations. The network uses a 49-layer ResNet structure to extract meteorological information in the vicinity of observed grid points
and a GRU to dynamically fuse the information from the ResNet for 3
consecutive days. Many approaches were undertaken in improving its performance, including feature engineering, timescale separation, and logarithmic function-based label transformation. Based on the model, we
produced a meteorology index, deepWIA, to capture synoptic variations in aerosol concentrations.</p>
      <p id="d1e3737">The model was trained and 10-fold CV applied using ground-based PM<inline-formula><mml:math id="M235" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula>
observations in China and ERA5 meteorological fields for the period
2015–2021. Tests were performed using data for January–April 2022. The results indicate that the model well estimates synoptic variations in PM<inline-formula><mml:math id="M236" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations and corresponding weather changes. Performance using the test dataset does not decrease significantly relative to the training set, indicating very weak overfitting in the model. We also compared time series of PM<inline-formula><mml:math id="M237" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations between deepWIA and WRF-Chem in eight cities in China. The deepWIA performed better than WRF-Chem simulations with higher <inline-formula><mml:math id="M238" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> values and lower RMSEs in each city. In particular, the model yields consistent simulating power in both southern and northern China, whereas WRF-Chem failed to capture aerosol variations in four cities in southern China. The predictive power of the deepWIA model also outperformed the previously reported PM<inline-formula><mml:math id="M239" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentration retrieval scheme based on other ML/DL models.</p>
      <p id="d1e3787">The strong performance of deepWIA is due to the powerful ResNet–GRU architecture and the treatment of timescale separation. Meteorology and
emissions dominate different timescales in aerosol variations. Meteorological variables also vary on different timescales, ranging from hourly to interannually. Therefore, it is very difficult to accurately estimate aerosol concentrations directly using a single data-based model. The timescale separation used in this study is thus necessary in allowing the model, despite its complexity, to focus on day-to-day variations in aerosol concentrations and associated weather.</p>
      <p id="d1e3790">As the background aerosol concentration is currently computed from observations, the deepWIA model cannot directly provide the spatial distribution of aerosol concentrations. However, this can be obtained from a
CTM simulation, observation retrieval, or even another ML/DL learning model.
Owing to the strong performance of deepWIA, a study is planned for short-
and medium-range forecast schemes for PM<inline-formula><mml:math id="M240" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentrations based on the
spatiotemporal DL model and numerical weather prediction (NWP). In a real medium-range forecast system, a re-trained deepWIA model should be applied,
with the real-time NWP data (i.e., from ECMWF or WRF) as input meteorological data. Moreover, a short-range forecast DL model should be more complex as it is more sensitive to initial aerosol concentrations. Therefore, more variables such as pre-forecast observations should be injected into the DL model to provide better initial conditions.</p>
</sec>

      
      </body>
    <back><notes notes-type="codedataavailability"><title>Code and data availability</title>

      <p id="d1e3806">The deepWIA data in the test dataset can be downloaded from <ext-link xlink:href="https://doi.org/10.5281/zenodo.6982879" ext-link-type="DOI">10.5281/zenodo.6982879</ext-link> (Feng, 2022a). The animation of daily deepWIA from 3 January to 30 April 2022 can be downloaded from <ext-link xlink:href="https://doi.org/10.5281/zenodo.6982971" ext-link-type="DOI">10.5281/zenodo.6982971</ext-link> (Feng, 2022b).</p>
  </notes><app-group>
        <supplementary-material position="anchor"><p id="d1e3815">The supplement related to this article is available online at: <inline-supplementary-material xlink:href="https://doi.org/10.5194/acp-23-375-2023-supplement" xlink:title="pdf">https://doi.org/10.5194/acp-23-375-2023-supplement</inline-supplementary-material>.</p></supplementary-material>
        </app-group><notes notes-type="authorcontribution"><title>Author contributions</title>

      <p id="d1e3824">JF conceived the study, designed the model, performed data preprocessing and analyses of the results, and wrote the manuscript. YL helped edit the manuscript. YQ helped with data curation. FZ helped with visualization.</p>
  </notes><notes notes-type="competinginterests"><title>Competing interests</title>

      <p id="d1e3830">The contact author has declared that none of the authors has any competing interests.</p>
  </notes><notes notes-type="disclaimer"><title>Disclaimer</title>

      <p id="d1e3836">Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.</p>
  </notes><?xmltex \hack{\newpage}?><notes notes-type="financialsupport"><title>Financial support</title>

      <p id="d1e3843">The study is supported by the National Science Foundation of China (grant nos. 42275009 and 42175080) and the National Key R &amp; D Program of China (grant noo. 2019YFB2102901).</p>
  </notes><notes notes-type="reviewstatement"><title>Review statement</title>

      <p id="d1e3850">This paper was edited by Duncan Watson-Parris and reviewed by two anonymous referees.</p>
  </notes><ref-list>
    <title>References</title>

      <ref id="bib1.bib1"><label>1</label><?label 1?><mixed-citation>Bei, N., Li, G., Huang, R.-J., Cao, J., Meng, N., Feng, T., Liu, S., Zhang,
T., Zhang, Q., and Molina, L. T.: Typical synoptic situations and their impacts on the wintertime air pollution in the Guanzhong basin, China, Atmos. Chem. Phys., 16, 7373–7387, <ext-link xlink:href="https://doi.org/10.5194/acp-16-7373-2016" ext-link-type="DOI">10.5194/acp-16-7373-2016</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bib2"><label>2</label><?label 1?><mixed-citation>Chen, Z. H., Cheng, S. Y., Li, J. B., Guo, X. R., Wang, W. H., and Chen, D.
S.: Relationship between atmospheric pollution processes and synoptic pressure patterns in northern China, Atmos. Environ., 42, 6078–6087, <ext-link xlink:href="https://doi.org/10.1016/j.atmosenv.2008.03.043" ext-link-type="DOI">10.1016/j.atmosenv.2008.03.043</ext-link>, 2008.</mixed-citation></ref>
      <ref id="bib1.bib3"><label>3</label><?label 1?><mixed-citation>Cho, K., van Merrienboer, B., Bahdanau, D., and Bengio, Y.: On the Properties of Neural Machine Translation: Encoder-Decoder Approaches, arXiv [preprint], arXiv:1409.1259, <ext-link xlink:href="https://doi.org/10.48550/arXiv.1409.1259" ext-link-type="DOI">10.48550/arXiv.1409.1259</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bib4"><label>4</label><?label 1?><mixed-citation>Feng, J.: Data for “Capturing synoptic-scale variations in surface aerosol pollution using deep learning with meteorological data”, Zenodo [data set], <ext-link xlink:href="https://doi.org/10.5281/zenodo.6982879" ext-link-type="DOI">10.5281/zenodo.6982879</ext-link>, 2022a.</mixed-citation></ref>
      <ref id="bib1.bib5"><label>5</label><?label 1?><mixed-citation>Feng, J.: Animation for “Capturing synoptic-scale variations in surface aerosol pollution using deep learning with meteorological data”, Zenodo [video/audio], <ext-link xlink:href="https://doi.org/10.5281/zenodo.6982971" ext-link-type="DOI">10.5281/zenodo.6982971</ext-link>, 2022b.</mixed-citation></ref>
      <ref id="bib1.bib6"><label>6</label><?label 1?><mixed-citation>Feng, J., Liao, H., and Gu, Y.: A Comparison of Meteorology-Driven Interannual Variations of Surface Aerosol Concentrations in the Eastern United States, Eastern China, and Europe, SOLA, 12, 146–152, <ext-link xlink:href="https://doi.org/10.2151/sola.2016-031" ext-link-type="DOI">10.2151/sola.2016-031</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bib7"><label>7</label><?label 1?><mixed-citation>Feng, J., Quan, J., Liao, H., Li, Y., and Zhao, X.: An Air Stagnation Index to Qualify Extreme Haze Events in Northern China, J. Atmos. Sci., 75, 3489–3505, <ext-link xlink:href="https://doi.org/10.1175/JAS-D-17-0354.1" ext-link-type="DOI">10.1175/JAS-D-17-0354.1</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bib8"><label>8</label><?label 1?><mixed-citation>Feng, J., Sun, J., and Zhang, Y.: A Dynamic Blending Scheme to Mitigate
Large-Scale Bias in Regional Models, J. Adv. Model. Earth Syst., 12, e2019MS001754, <ext-link xlink:href="https://doi.org/10.1029/2019MS001754" ext-link-type="DOI">10.1029/2019MS001754</ext-link>, 2020a.</mixed-citation></ref>
      <ref id="bib1.bib9"><label>9</label><?label 1?><mixed-citation>Feng, J., Liao, H., Li, Y., Zhang, Z., and Tang, Y.: Long-term trends and
variations in haze-related weather conditions in north China during 1980–2018 based on emission-weighted stagnation intensity, Atmos. Environ., 240, 117830, <ext-link xlink:href="https://doi.org/10.1016/j.atmosenv.2020.117830" ext-link-type="DOI">10.1016/j.atmosenv.2020.117830</ext-link>, 2020b.</mixed-citation></ref>
      <ref id="bib1.bib10"><label>10</label><?label 1?><mixed-citation>Geng, G., Xiao, Q., Liu, S., Liu, X., Cheng, J., Zheng, Y., Xue, T., Tong,
D., Zheng, B., Peng, Y., Huang, X., He, K., and Zhang, Q.: Tracking Air
Pollution in China: Near Real-Time PM<inline-formula><mml:math id="M241" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> Retrievals from Multisource Data Fusion, Environ. Sci. Technol., 55, 12106–12115, <ext-link xlink:href="https://doi.org/10.1021/acs.est.1c01863" ext-link-type="DOI">10.1021/acs.est.1c01863</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bib11"><label>11</label><?label 1?><mixed-citation>Gui, K., Che, H., Zeng, Z., Wang, Y., Zhai, S., Wang, Z., Luo, M., Zhang, L., Liao, T., Zhao, H., Li, L., Zheng, Y., and Zhang, X.: Construction of a virtual PM<inline-formula><mml:math id="M242" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> observation network in China based on high-density surface
meteorological observations using the Extreme Gradient Boosting model, Environ. Int., 141, 105801, <ext-link xlink:href="https://doi.org/10.1016/j.envint.2020.105801" ext-link-type="DOI">10.1016/j.envint.2020.105801</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bib12"><label>12</label><?label 1?><mixed-citation>Guo, S., Hu, M., Zamora, M. L., Peng, J., Shang, D., Zheng, J., Du, Z., Wu,
Z., Shao, M., Zeng, L., Molina, M. J., and Zhang, R.: Elucidating severe
urban haze formation in China, P. Natl. Acad. Sci. USA, 111, 17373–17378, <ext-link xlink:href="https://doi.org/10.1073/pnas.1419604111" ext-link-type="DOI">10.1073/pnas.1419604111</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bib13"><label>13</label><?label 1?><mixed-citation>He, K., Zhang, X., Ren, S., and Sun, J.: Deep residual learning for image
recognition, in: Proceedings of the IEEE conference on computer vision and
pattern recognition, 27–30 June 2016, Las Vegas, NV, USA, 770–778, <ext-link xlink:href="https://doi.org/10.1109/CVPR.2016.90" ext-link-type="DOI">10.1109/CVPR.2016.90</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bib14"><label>14</label><?label 1?><mixed-citation>Hou, X., Fei, D., Kang, H., Zhang, Y., and Gao, J.: Seasonal statistical analysis of the impact of meteorological factors on fine particle pollution in China in 2013–2017, Nat. Hazards, 93, 677–698, <ext-link xlink:href="https://doi.org/10.1007/s11069-018-3315-y" ext-link-type="DOI">10.1007/s11069-018-3315-y</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bib15"><label>15</label><?label 1?><mixed-citation>Jia, W. and Zhang, X.: The role of the planetary boundary layer parameterization schemes on the meteorological and aerosol pollution simulations: A review, Atmos. Res., 239, 104890, <ext-link xlink:href="https://doi.org/10.1016/j.atmosres.2020.104890" ext-link-type="DOI">10.1016/j.atmosres.2020.104890</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bib16"><label>16</label><?label 1?><mixed-citation>Leung, D. M., Tai, A. P. K., Mickley, L. J., Moch, J. M., van Donkelaar, A.,
Shen, L., and Martin, R. V.: Synoptic meteorological modes of variability for fine particulate matter (PM<inline-formula><mml:math id="M243" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula>) air quality in major metropolitan regions of China, Atmos. Chem. Phys., 18, 6733–6748, <ext-link xlink:href="https://doi.org/10.5194/acp-18-6733-2018" ext-link-type="DOI">10.5194/acp-18-6733-2018</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bib17"><label>17</label><?label 1?><mixed-citation>Li, M., Liu, H., Geng, G., Hong, C., Liu, F., Song, Y., Tong, D., Zheng, B.,
Cui, H., Man, H., Zhang, Q., and He, K.: Anthropogenic emission inventories
in China: a review, Natl. Sci. Rev., 4, 834–866, <ext-link xlink:href="https://doi.org/10.1093/nsr/nwx150" ext-link-type="DOI">10.1093/nsr/nwx150</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bib18"><label>18</label><?label 1?><mixed-citation>Li, T., Shen, H., Yuan, Q., and Zhang, L.: Geographically and temporally
weighted neural networks for satellite-based mapping of ground-level PM<inline-formula><mml:math id="M244" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula>, ISPRS J. Photogram. Remote Sens., 167, 178–188,
<ext-link xlink:href="https://doi.org/10.1016/j.isprsjprs.2020.06.019" ext-link-type="DOI">10.1016/j.isprsjprs.2020.06.019</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bib19"><label>19</label><?label 1?><mixed-citation>Lu, H.-C.: The statistical characters of PM<inline-formula><mml:math id="M245" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">10</mml:mn></mml:msub></mml:math></inline-formula> concentration in Taiwan
area, Atmos. Environ., 36, 491–502, <ext-link xlink:href="https://doi.org/10.1016/S1352-2310(01)00245-X" ext-link-type="DOI">10.1016/S1352-2310(01)00245-X</ext-link>, 2002.</mixed-citation></ref>
      <ref id="bib1.bib20"><label>20</label><?label 1?><mixed-citation>Miao, Y., Che, H., Zhang, X., and Liu, S.: Integrated impacts of synoptic forcing and aerosol radiative effect on boundary layer and pollution in the Beijing–Tianjin–Hebei region, China, Atmos. Chem. Phys., 20, 5899–5909, <ext-link xlink:href="https://doi.org/10.5194/acp-20-5899-2020" ext-link-type="DOI">10.5194/acp-20-5899-2020</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bib21"><label>21</label><?label 1?><mixed-citation>Porter, W. C., Heald, C. L., Cooley, D., and Russell, B.: Investigating the
observed sensitivities of air-quality extremes to meteorological drivers via
quantile regression, Atmos. Chem. Phys., 15, 10349–10366,
<ext-link xlink:href="https://doi.org/10.5194/acp-15-10349-2015" ext-link-type="DOI">10.5194/acp-15-10349-2015</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bib22"><label>22</label><?label 1?><mixed-citation>Sicard, P., Crippa, P., De Marco, A., Castruccio, S., Giani, P., Cuesta, J.,
Paoletti, E., Feng, Z., and Anav, A.: High spatial resolution WRF-Chem model
over Asia: Physics and chemistry evaluation, Atmos. Environ., 244, 118004, <ext-link xlink:href="https://doi.org/10.1016/j.atmosenv.2020.118004" ext-link-type="DOI">10.1016/j.atmosenv.2020.118004</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bib23"><label>23</label><?label 1?><mixed-citation>Song, Z., Chen, B., Huang, Y., Dong, L., and Yang, T.: Estimation of PM<inline-formula><mml:math id="M246" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> concentration in China using linear hybrid machine learning model,
Atmos. Meas. Tech., 14, 5333–5347, <ext-link xlink:href="https://doi.org/10.5194/amt-14-5333-2021" ext-link-type="DOI">10.5194/amt-14-5333-2021</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bib24"><label>24</label><?label 1?><mixed-citation>Wang, X., Dong, Z., Zhang, J., and Liu, L.: Modern dust storms in China: an
overview, J. Arid Environ., 58, 559–574, <ext-link xlink:href="https://doi.org/10.1016/j.jaridenv.2003.11.009" ext-link-type="DOI">10.1016/j.jaridenv.2003.11.009</ext-link>, 2004.</mixed-citation></ref>
      <ref id="bib1.bib25"><label>25</label><?label 1?><mixed-citation>Wang, Y., Xin, J., Li, Z., Wang, S., Wang, P., Hao, W. M., Nordgren, B. L.,
Chen, H., Wang, L., and Sun, Y.: Seasonal variations in aerosol optical
properties over China, J. Geophys. Res., 116, D18209, <ext-link xlink:href="https://doi.org/10.1029/2010JD015376" ext-link-type="DOI">10.1029/2010JD015376</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bib26"><label>26</label><?label 1?><mixed-citation>Wang, Z., Feng, J., Diao, C., Li, Y., Lin, L., and Xu, Y.: Reduction in
European anthropogenic aerosols and the weather conditions conducive to PM<inline-formula><mml:math id="M247" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> pollution in North China: a potential global teleconnection pathway, Environ. Res. Lett., 16, 104054, <ext-link xlink:href="https://doi.org/10.1088/1748-9326/ac269d" ext-link-type="DOI">10.1088/1748-9326/ac269d</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bib27"><label>27</label><?label 1?><mixed-citation>Wei, J., Huang, W., Li, Z., Xue, W., Peng, Y., Sun, L., and Cribb, M.:
Estimating 1-km-resolution PM2.5 concentrations across China using the
space-time random forest approach, Remote Sens. Environ., 231, 111221, <ext-link xlink:href="https://doi.org/10.1016/j.rse.2019.111221" ext-link-type="DOI">10.1016/j.rse.2019.111221</ext-link>, 2019a.</mixed-citation></ref>
      <ref id="bib1.bib28"><label>28</label><?label 1?><mixed-citation>Wei, J., Li, Z., Guo, J., Sun, L., Huang, W., Xue, W., Fan, T., and Cribb,
M.: Satellite-Derived 1-km-Resolution PM<inline-formula><mml:math id="M248" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:math></inline-formula> Concentrations from 2014 to 2018 across China, Environ. Sci. Technol., 53, 13265–13274,
<ext-link xlink:href="https://doi.org/10.1021/acs.est.9b03258" ext-link-type="DOI">10.1021/acs.est.9b03258</ext-link>, 2019b.</mixed-citation></ref>
      <ref id="bib1.bib29"><label>29</label><?label 1?><mixed-citation>Wei, J., Li, Z., Cribb, M., Huang, W., Xue, W., Sun, L., Guo, J., Peng, Y.,
Li, J., Lyapustin, A., Liu, L., Wu, H., and Song, Y.: Improved 1 km
resolution PM<inline-formula><mml:math id="M249" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> estimates across China using enhanced space–time extremely randomized trees, Atmos. Chem. Phys., 20, 3273–3289,
<ext-link xlink:href="https://doi.org/10.5194/acp-20-3273-2020" ext-link-type="DOI">10.5194/acp-20-3273-2020</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bib30"><label>30</label><?label 1?><mixed-citation>Yang, Y. Q., Wang, J. Z., Gong, S. L., Zhang, X. Y., Wang, H., Wang, Y. Q.,
Wang, J., Li, D., and Guo, J. P.: PLAM – a meteorological pollution index
for air quality and its applications in fog-haze forecasts in North China,
Atmos. Chem. Phys., 16, 1353–1364, <ext-link xlink:href="https://doi.org/10.5194/acp-16-1353-2016" ext-link-type="DOI">10.5194/acp-16-1353-2016</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bib31"><label>31</label><?label 1?><mixed-citation>Yin, J., Gao, C. Y., Hong, J., Gao, Z., Li, Y., Li, X., Fan, S., and Zhu, B.: Surface Meteorological Conditions and Boundary Layer Height Variations During an Air Pollution Episode in Nanjing, China, J. Geophys. Res.-Atmos., 124, 3350–3364, <ext-link xlink:href="https://doi.org/10.1029/2018JD029848" ext-link-type="DOI">10.1029/2018JD029848</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib32"><label>32</label><?label 1?><mixed-citation>Yuan, Q., Shen, H., Li, T., Li, Z., Li, S., Jiang, Y., Xu, H., Tan, W., Yang, Q., Wang, J., Gao, J., and Zhang, L.: Deep learning in environmental remote sensing: Achievements and challenges, Remote Sens. Environ., 241, 111716, <ext-link xlink:href="https://doi.org/10.1016/j.rse.2020.111716" ext-link-type="DOI">10.1016/j.rse.2020.111716</ext-link>, 2020.
</mixed-citation></ref><?xmltex \hack{\newpage}?>
      <ref id="bib1.bib33"><label>33</label><?label 1?><mixed-citation>Zhang, L., Zhao, T., Gong, S., Kong, S., Tang, L., Liu, D., Wang, Y., Jin, L., Shan, Y., Tan, C., Zhang, Y., and Guo, X.: Updated emission inventories
of power plants in simulating air quality during haze periods over East China, Atmos. Chem. Phys., 18, 2065–2079, <ext-link xlink:href="https://doi.org/10.5194/acp-18-2065-2018" ext-link-type="DOI">10.5194/acp-18-2065-2018</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bib34"><label>34</label><?label 1?><mixed-citation>Zhang, Q., Zheng, Y., Tong, D., Shao, M., Wang, S., Zhang, Y., Xu, X., Wang,
J., He, H., Liu, W., Ding, Y., Lei, Y., Li, J., Wang, Z., Zhang, X., Wang,
Y., Cheng, J., Liu, Y., Shi, Q., Yan, L., Geng, G., Hong, C., Li, M., Liu,
F., Zheng, B., Cao, J., Ding, A., Gao, J., Fu, Q., Huo, J., Liu, B., Liu,
Z., Yang, F., He, K., and Hao, J.: Drivers of improved PM<inline-formula><mml:math id="M250" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> air quality in China from 2013 to 2017, P. Natl. Acad. Sci. USA, 116, 24463–24469, <ext-link xlink:href="https://doi.org/10.1073/pnas.1907956116" ext-link-type="DOI">10.1073/pnas.1907956116</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib35"><label>35</label><?label 1?><mixed-citation>Zhang, X., Xu, X., Ding, Y., Liu, Y., Zhang, H., Wang, Y., and Zhong, J.:
The impact of meteorological changes from 2013 to 2017 on PM<inline-formula><mml:math id="M251" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> mass
reduction in key regions in China, Sci. China Earth Sci., 62, 1885–1902, <ext-link xlink:href="https://doi.org/10.1007/s11430-019-9343-3" ext-link-type="DOI">10.1007/s11430-019-9343-3</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib36"><label>36</label><?label 1?><mixed-citation>Zhang, Y., Zhang, X., Wang, L., Zhang, Q., Duan, F., and He, K.: Application
of WRF/Chem over East Asia: Part I. Model evaluation and intercomparison with MM5/CMAQ, Atmos. Environ., 124, 285–300, <ext-link xlink:href="https://doi.org/10.1016/j.atmosenv.2015.07.022" ext-link-type="DOI">10.1016/j.atmosenv.2015.07.022</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bib37"><label>37</label><?label 1?><mixed-citation>Zheng, G. J., Duan, F. K., Su, H., Ma, Y. L., Cheng, Y., Zheng, B., Zhang, Q., Huang, T., Kimoto, T., Chang, D., Pöschl, U., Cheng, Y. F., and He, K. B.: Exploring the severe winter haze in Beijing: The impact of synoptic
weather, regional transport and heterogeneous reactions, Atmos. Chem. Phys., 15, 2969–2983, <ext-link xlink:href="https://doi.org/10.5194/acp-15-2969-2015" ext-link-type="DOI">10.5194/acp-15-2969-2015</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bib38"><label>38</label><?label 1?><mixed-citation>Zhong, J., Zhang, X., Gui, K., Wang, Y., Che, H., Shen, X., Zhang, L., Zhang, Y., Sun, J., and Zhang, W.: Robust prediction of hourly PM<inline-formula><mml:math id="M252" display="inline"><mml:msub><mml:mi/><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:math></inline-formula> from meteorological data using LightGBM, Natl. Sci. Rev., 8, nwaa307, <ext-link xlink:href="https://doi.org/10.1093/nsr/nwaa307" ext-link-type="DOI">10.1093/nsr/nwaa307</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bib39"><label>39</label><?label 1?><mixed-citation>Zhong, M., Saikawa, E., Liu, Y., Naik, V., Horowitz, L. W., Takigawa, M.,
Zhao, Y., Lin, N.-H., and Stone, E. A.: Air quality modeling with WRF-Chem v3.5 in East Asia: sensitivity to emissions and evaluation of simulated air quality, Geosci. Model Dev., 9, 1201–1218,
<ext-link xlink:href="https://doi.org/10.5194/gmd-9-1201-2016" ext-link-type="DOI">10.5194/gmd-9-1201-2016</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bib40"><label>40</label><?label 1?><mixed-citation>Zhou, G., Xu, J., Xie, Y., Chang, L., Gao, W., Gu, Y., and Zhou, J.: Numerical air quality forecasting over eastern China: An operational application of WRF-Chem, Atmos. Enviro., 153, 94–108, <ext-link xlink:href="https://doi.org/10.1016/j.atmosenv.2017.01.020" ext-link-type="DOI">10.1016/j.atmosenv.2017.01.020</ext-link>, 2017.</mixed-citation></ref>

  </ref-list></back>
    <!--<article-title-html>Capturing synoptic-scale variations in surface aerosol pollution using deep learning with meteorological data</article-title-html>
<abstract-html/>
<ref-html id="bib1.bib1"><label>1</label><mixed-citation>
Bei, N., Li, G., Huang, R.-J., Cao, J., Meng, N., Feng, T., Liu, S., Zhang,
T., Zhang, Q., and Molina, L. T.: Typical synoptic situations and their impacts on the wintertime air pollution in the Guanzhong basin, China, Atmos. Chem. Phys., 16, 7373–7387, <a href="https://doi.org/10.5194/acp-16-7373-2016" target="_blank">https://doi.org/10.5194/acp-16-7373-2016</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib2"><label>2</label><mixed-citation>
Chen, Z. H., Cheng, S. Y., Li, J. B., Guo, X. R., Wang, W. H., and Chen, D.
S.: Relationship between atmospheric pollution processes and synoptic pressure patterns in northern China, Atmos. Environ., 42, 6078–6087, <a href="https://doi.org/10.1016/j.atmosenv.2008.03.043" target="_blank">https://doi.org/10.1016/j.atmosenv.2008.03.043</a>, 2008.
</mixed-citation></ref-html>
<ref-html id="bib1.bib3"><label>3</label><mixed-citation>
Cho, K., van Merrienboer, B., Bahdanau, D., and Bengio, Y.: On the Properties of Neural Machine Translation: Encoder-Decoder Approaches, arXiv [preprint], arXiv:1409.1259, <a href="https://doi.org/10.48550/arXiv.1409.1259" target="_blank">https://doi.org/10.48550/arXiv.1409.1259</a>, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib4"><label>4</label><mixed-citation>
Feng, J.: Data for “Capturing synoptic-scale variations in surface aerosol pollution using deep learning with meteorological data”, Zenodo [data set], <a href="https://doi.org/10.5281/zenodo.6982879" target="_blank">https://doi.org/10.5281/zenodo.6982879</a>, 2022a.
</mixed-citation></ref-html>
<ref-html id="bib1.bib5"><label>5</label><mixed-citation>
Feng, J.: Animation for “Capturing synoptic-scale variations in surface aerosol pollution using deep learning with meteorological data”, Zenodo [video/audio], <a href="https://doi.org/10.5281/zenodo.6982971" target="_blank">https://doi.org/10.5281/zenodo.6982971</a>, 2022b.
</mixed-citation></ref-html>
<ref-html id="bib1.bib6"><label>6</label><mixed-citation>
Feng, J., Liao, H., and Gu, Y.: A Comparison of Meteorology-Driven Interannual Variations of Surface Aerosol Concentrations in the Eastern United States, Eastern China, and Europe, SOLA, 12, 146–152, <a href="https://doi.org/10.2151/sola.2016-031" target="_blank">https://doi.org/10.2151/sola.2016-031</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib7"><label>7</label><mixed-citation>
Feng, J., Quan, J., Liao, H., Li, Y., and Zhao, X.: An Air Stagnation Index to Qualify Extreme Haze Events in Northern China, J. Atmos. Sci., 75, 3489–3505, <a href="https://doi.org/10.1175/JAS-D-17-0354.1" target="_blank">https://doi.org/10.1175/JAS-D-17-0354.1</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib8"><label>8</label><mixed-citation>
Feng, J., Sun, J., and Zhang, Y.: A Dynamic Blending Scheme to Mitigate
Large-Scale Bias in Regional Models, J. Adv. Model. Earth Syst., 12, e2019MS001754, <a href="https://doi.org/10.1029/2019MS001754" target="_blank">https://doi.org/10.1029/2019MS001754</a>, 2020a.
</mixed-citation></ref-html>
<ref-html id="bib1.bib9"><label>9</label><mixed-citation>
Feng, J., Liao, H., Li, Y., Zhang, Z., and Tang, Y.: Long-term trends and
variations in haze-related weather conditions in north China during 1980–2018 based on emission-weighted stagnation intensity, Atmos. Environ., 240, 117830, <a href="https://doi.org/10.1016/j.atmosenv.2020.117830" target="_blank">https://doi.org/10.1016/j.atmosenv.2020.117830</a>, 2020b.
</mixed-citation></ref-html>
<ref-html id="bib1.bib10"><label>10</label><mixed-citation>
Geng, G., Xiao, Q., Liu, S., Liu, X., Cheng, J., Zheng, Y., Xue, T., Tong,
D., Zheng, B., Peng, Y., Huang, X., He, K., and Zhang, Q.: Tracking Air
Pollution in China: Near Real-Time PM<sub>2.5</sub> Retrievals from Multisource Data Fusion, Environ. Sci. Technol., 55, 12106–12115, <a href="https://doi.org/10.1021/acs.est.1c01863" target="_blank">https://doi.org/10.1021/acs.est.1c01863</a>, 2021.
</mixed-citation></ref-html>
<ref-html id="bib1.bib11"><label>11</label><mixed-citation>
Gui, K., Che, H., Zeng, Z., Wang, Y., Zhai, S., Wang, Z., Luo, M., Zhang, L., Liao, T., Zhao, H., Li, L., Zheng, Y., and Zhang, X.: Construction of a virtual PM<sub>2.5</sub> observation network in China based on high-density surface
meteorological observations using the Extreme Gradient Boosting model, Environ. Int., 141, 105801, <a href="https://doi.org/10.1016/j.envint.2020.105801" target="_blank">https://doi.org/10.1016/j.envint.2020.105801</a>, 2020.
</mixed-citation></ref-html>
<ref-html id="bib1.bib12"><label>12</label><mixed-citation>
Guo, S., Hu, M., Zamora, M. L., Peng, J., Shang, D., Zheng, J., Du, Z., Wu,
Z., Shao, M., Zeng, L., Molina, M. J., and Zhang, R.: Elucidating severe
urban haze formation in China, P. Natl. Acad. Sci. USA, 111, 17373–17378, <a href="https://doi.org/10.1073/pnas.1419604111" target="_blank">https://doi.org/10.1073/pnas.1419604111</a>, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib13"><label>13</label><mixed-citation>
He, K., Zhang, X., Ren, S., and Sun, J.: Deep residual learning for image
recognition, in: Proceedings of the IEEE conference on computer vision and
pattern recognition, 27–30 June 2016, Las Vegas, NV, USA, 770–778, <a href="https://doi.org/10.1109/CVPR.2016.90" target="_blank">https://doi.org/10.1109/CVPR.2016.90</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib14"><label>14</label><mixed-citation>
Hou, X., Fei, D., Kang, H., Zhang, Y., and Gao, J.: Seasonal statistical analysis of the impact of meteorological factors on fine particle pollution in China in 2013–2017, Nat. Hazards, 93, 677–698, <a href="https://doi.org/10.1007/s11069-018-3315-y" target="_blank">https://doi.org/10.1007/s11069-018-3315-y</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib15"><label>15</label><mixed-citation>
Jia, W. and Zhang, X.: The role of the planetary boundary layer parameterization schemes on the meteorological and aerosol pollution simulations: A review, Atmos. Res., 239, 104890, <a href="https://doi.org/10.1016/j.atmosres.2020.104890" target="_blank">https://doi.org/10.1016/j.atmosres.2020.104890</a>, 2020.
</mixed-citation></ref-html>
<ref-html id="bib1.bib16"><label>16</label><mixed-citation>
Leung, D. M., Tai, A. P. K., Mickley, L. J., Moch, J. M., van Donkelaar, A.,
Shen, L., and Martin, R. V.: Synoptic meteorological modes of variability for fine particulate matter (PM<sub>2.5</sub>) air quality in major metropolitan regions of China, Atmos. Chem. Phys., 18, 6733–6748, <a href="https://doi.org/10.5194/acp-18-6733-2018" target="_blank">https://doi.org/10.5194/acp-18-6733-2018</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib17"><label>17</label><mixed-citation>
Li, M., Liu, H., Geng, G., Hong, C., Liu, F., Song, Y., Tong, D., Zheng, B.,
Cui, H., Man, H., Zhang, Q., and He, K.: Anthropogenic emission inventories
in China: a review, Natl. Sci. Rev., 4, 834–866, <a href="https://doi.org/10.1093/nsr/nwx150" target="_blank">https://doi.org/10.1093/nsr/nwx150</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib18"><label>18</label><mixed-citation>
Li, T., Shen, H., Yuan, Q., and Zhang, L.: Geographically and temporally
weighted neural networks for satellite-based mapping of ground-level PM<sub>2.5</sub>, ISPRS J. Photogram. Remote Sens., 167, 178–188,
<a href="https://doi.org/10.1016/j.isprsjprs.2020.06.019" target="_blank">https://doi.org/10.1016/j.isprsjprs.2020.06.019</a>, 2020.
</mixed-citation></ref-html>
<ref-html id="bib1.bib19"><label>19</label><mixed-citation>
Lu, H.-C.: The statistical characters of PM<sub>10</sub> concentration in Taiwan
area, Atmos. Environ., 36, 491–502, <a href="https://doi.org/10.1016/S1352-2310(01)00245-X" target="_blank">https://doi.org/10.1016/S1352-2310(01)00245-X</a>, 2002.
</mixed-citation></ref-html>
<ref-html id="bib1.bib20"><label>20</label><mixed-citation>
Miao, Y., Che, H., Zhang, X., and Liu, S.: Integrated impacts of synoptic forcing and aerosol radiative effect on boundary layer and pollution in the Beijing–Tianjin–Hebei region, China, Atmos. Chem. Phys., 20, 5899–5909, <a href="https://doi.org/10.5194/acp-20-5899-2020" target="_blank">https://doi.org/10.5194/acp-20-5899-2020</a>, 2020.
</mixed-citation></ref-html>
<ref-html id="bib1.bib21"><label>21</label><mixed-citation>
Porter, W. C., Heald, C. L., Cooley, D., and Russell, B.: Investigating the
observed sensitivities of air-quality extremes to meteorological drivers via
quantile regression, Atmos. Chem. Phys., 15, 10349–10366,
<a href="https://doi.org/10.5194/acp-15-10349-2015" target="_blank">https://doi.org/10.5194/acp-15-10349-2015</a>, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib22"><label>22</label><mixed-citation>
Sicard, P., Crippa, P., De Marco, A., Castruccio, S., Giani, P., Cuesta, J.,
Paoletti, E., Feng, Z., and Anav, A.: High spatial resolution WRF-Chem model
over Asia: Physics and chemistry evaluation, Atmos. Environ., 244, 118004, <a href="https://doi.org/10.1016/j.atmosenv.2020.118004" target="_blank">https://doi.org/10.1016/j.atmosenv.2020.118004</a>, 2021.
</mixed-citation></ref-html>
<ref-html id="bib1.bib23"><label>23</label><mixed-citation>
Song, Z., Chen, B., Huang, Y., Dong, L., and Yang, T.: Estimation of PM<sub>2.5</sub> concentration in China using linear hybrid machine learning model,
Atmos. Meas. Tech., 14, 5333–5347, <a href="https://doi.org/10.5194/amt-14-5333-2021" target="_blank">https://doi.org/10.5194/amt-14-5333-2021</a>, 2021.
</mixed-citation></ref-html>
<ref-html id="bib1.bib24"><label>24</label><mixed-citation>
Wang, X., Dong, Z., Zhang, J., and Liu, L.: Modern dust storms in China: an
overview, J. Arid Environ., 58, 559–574, <a href="https://doi.org/10.1016/j.jaridenv.2003.11.009" target="_blank">https://doi.org/10.1016/j.jaridenv.2003.11.009</a>, 2004.
</mixed-citation></ref-html>
<ref-html id="bib1.bib25"><label>25</label><mixed-citation>
Wang, Y., Xin, J., Li, Z., Wang, S., Wang, P., Hao, W. M., Nordgren, B. L.,
Chen, H., Wang, L., and Sun, Y.: Seasonal variations in aerosol optical
properties over China, J. Geophys. Res., 116, D18209, <a href="https://doi.org/10.1029/2010JD015376" target="_blank">https://doi.org/10.1029/2010JD015376</a>, 2011.
</mixed-citation></ref-html>
<ref-html id="bib1.bib26"><label>26</label><mixed-citation>
Wang, Z., Feng, J., Diao, C., Li, Y., Lin, L., and Xu, Y.: Reduction in
European anthropogenic aerosols and the weather conditions conducive to PM<sub>2.5</sub> pollution in North China: a potential global teleconnection pathway, Environ. Res. Lett., 16, 104054, <a href="https://doi.org/10.1088/1748-9326/ac269d" target="_blank">https://doi.org/10.1088/1748-9326/ac269d</a>, 2021.
</mixed-citation></ref-html>
<ref-html id="bib1.bib27"><label>27</label><mixed-citation>
Wei, J., Huang, W., Li, Z., Xue, W., Peng, Y., Sun, L., and Cribb, M.:
Estimating 1-km-resolution PM2.5 concentrations across China using the
space-time random forest approach, Remote Sens. Environ., 231, 111221, <a href="https://doi.org/10.1016/j.rse.2019.111221" target="_blank">https://doi.org/10.1016/j.rse.2019.111221</a>, 2019a.
</mixed-citation></ref-html>
<ref-html id="bib1.bib28"><label>28</label><mixed-citation>
Wei, J., Li, Z., Guo, J., Sun, L., Huang, W., Xue, W., Fan, T., and Cribb,
M.: Satellite-Derived 1-km-Resolution PM<sub>1</sub> Concentrations from 2014 to 2018 across China, Environ. Sci. Technol., 53, 13265–13274,
<a href="https://doi.org/10.1021/acs.est.9b03258" target="_blank">https://doi.org/10.1021/acs.est.9b03258</a>, 2019b.
</mixed-citation></ref-html>
<ref-html id="bib1.bib29"><label>29</label><mixed-citation>
Wei, J., Li, Z., Cribb, M., Huang, W., Xue, W., Sun, L., Guo, J., Peng, Y.,
Li, J., Lyapustin, A., Liu, L., Wu, H., and Song, Y.: Improved 1&thinsp;km
resolution PM<sub>2.5</sub> estimates across China using enhanced space–time extremely randomized trees, Atmos. Chem. Phys., 20, 3273–3289,
<a href="https://doi.org/10.5194/acp-20-3273-2020" target="_blank">https://doi.org/10.5194/acp-20-3273-2020</a>, 2020.
</mixed-citation></ref-html>
<ref-html id="bib1.bib30"><label>30</label><mixed-citation>
Yang, Y. Q., Wang, J. Z., Gong, S. L., Zhang, X. Y., Wang, H., Wang, Y. Q.,
Wang, J., Li, D., and Guo, J. P.: PLAM – a meteorological pollution index
for air quality and its applications in fog-haze forecasts in North China,
Atmos. Chem. Phys., 16, 1353–1364, <a href="https://doi.org/10.5194/acp-16-1353-2016" target="_blank">https://doi.org/10.5194/acp-16-1353-2016</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib31"><label>31</label><mixed-citation>
Yin, J., Gao, C. Y., Hong, J., Gao, Z., Li, Y., Li, X., Fan, S., and Zhu, B.: Surface Meteorological Conditions and Boundary Layer Height Variations During an Air Pollution Episode in Nanjing, China, J. Geophys. Res.-Atmos., 124, 3350–3364, <a href="https://doi.org/10.1029/2018JD029848" target="_blank">https://doi.org/10.1029/2018JD029848</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib32"><label>32</label><mixed-citation>
Yuan, Q., Shen, H., Li, T., Li, Z., Li, S., Jiang, Y., Xu, H., Tan, W., Yang, Q., Wang, J., Gao, J., and Zhang, L.: Deep learning in environmental remote sensing: Achievements and challenges, Remote Sens. Environ., 241, 111716, <a href="https://doi.org/10.1016/j.rse.2020.111716" target="_blank">https://doi.org/10.1016/j.rse.2020.111716</a>, 2020.

</mixed-citation></ref-html>
<ref-html id="bib1.bib33"><label>33</label><mixed-citation>
Zhang, L., Zhao, T., Gong, S., Kong, S., Tang, L., Liu, D., Wang, Y., Jin, L., Shan, Y., Tan, C., Zhang, Y., and Guo, X.: Updated emission inventories
of power plants in simulating air quality during haze periods over East China, Atmos. Chem. Phys., 18, 2065–2079, <a href="https://doi.org/10.5194/acp-18-2065-2018" target="_blank">https://doi.org/10.5194/acp-18-2065-2018</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib34"><label>34</label><mixed-citation>
Zhang, Q., Zheng, Y., Tong, D., Shao, M., Wang, S., Zhang, Y., Xu, X., Wang,
J., He, H., Liu, W., Ding, Y., Lei, Y., Li, J., Wang, Z., Zhang, X., Wang,
Y., Cheng, J., Liu, Y., Shi, Q., Yan, L., Geng, G., Hong, C., Li, M., Liu,
F., Zheng, B., Cao, J., Ding, A., Gao, J., Fu, Q., Huo, J., Liu, B., Liu,
Z., Yang, F., He, K., and Hao, J.: Drivers of improved PM<sub>2.5</sub> air quality in China from 2013 to 2017, P. Natl. Acad. Sci. USA, 116, 24463–24469, <a href="https://doi.org/10.1073/pnas.1907956116" target="_blank">https://doi.org/10.1073/pnas.1907956116</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib35"><label>35</label><mixed-citation>
Zhang, X., Xu, X., Ding, Y., Liu, Y., Zhang, H., Wang, Y., and Zhong, J.:
The impact of meteorological changes from 2013 to 2017 on PM<sub>2.5</sub> mass
reduction in key regions in China, Sci. China Earth Sci., 62, 1885–1902, <a href="https://doi.org/10.1007/s11430-019-9343-3" target="_blank">https://doi.org/10.1007/s11430-019-9343-3</a>, 2019.
</mixed-citation></ref-html>
<ref-html id="bib1.bib36"><label>36</label><mixed-citation>
Zhang, Y., Zhang, X., Wang, L., Zhang, Q., Duan, F., and He, K.: Application
of WRF/Chem over East Asia: Part I. Model evaluation and intercomparison with MM5/CMAQ, Atmos. Environ., 124, 285–300, <a href="https://doi.org/10.1016/j.atmosenv.2015.07.022" target="_blank">https://doi.org/10.1016/j.atmosenv.2015.07.022</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib37"><label>37</label><mixed-citation>
Zheng, G. J., Duan, F. K., Su, H., Ma, Y. L., Cheng, Y., Zheng, B., Zhang, Q., Huang, T., Kimoto, T., Chang, D., Pöschl, U., Cheng, Y. F., and He, K. B.: Exploring the severe winter haze in Beijing: The impact of synoptic
weather, regional transport and heterogeneous reactions, Atmos. Chem. Phys., 15, 2969–2983, <a href="https://doi.org/10.5194/acp-15-2969-2015" target="_blank">https://doi.org/10.5194/acp-15-2969-2015</a>, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib38"><label>38</label><mixed-citation>
Zhong, J., Zhang, X., Gui, K., Wang, Y., Che, H., Shen, X., Zhang, L., Zhang, Y., Sun, J., and Zhang, W.: Robust prediction of hourly PM<sub>2.5</sub> from meteorological data using LightGBM, Natl. Sci. Rev., 8, nwaa307, <a href="https://doi.org/10.1093/nsr/nwaa307" target="_blank">https://doi.org/10.1093/nsr/nwaa307</a>, 2021.
</mixed-citation></ref-html>
<ref-html id="bib1.bib39"><label>39</label><mixed-citation>
Zhong, M., Saikawa, E., Liu, Y., Naik, V., Horowitz, L. W., Takigawa, M.,
Zhao, Y., Lin, N.-H., and Stone, E. A.: Air quality modeling with WRF-Chem v3.5 in East Asia: sensitivity to emissions and evaluation of simulated air quality, Geosci. Model Dev., 9, 1201–1218,
<a href="https://doi.org/10.5194/gmd-9-1201-2016" target="_blank">https://doi.org/10.5194/gmd-9-1201-2016</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib40"><label>40</label><mixed-citation>
Zhou, G., Xu, J., Xie, Y., Chang, L., Gao, W., Gu, Y., and Zhou, J.: Numerical air quality forecasting over eastern China: An operational application of WRF-Chem, Atmos. Enviro., 153, 94–108, <a href="https://doi.org/10.1016/j.atmosenv.2017.01.020" target="_blank">https://doi.org/10.1016/j.atmosenv.2017.01.020</a>, 2017.
</mixed-citation></ref-html>--></article>
