<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "https://jats.nlm.nih.gov/nlm-dtd/publishing/3.0/journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" xml:lang="en" dtd-version="3.0" article-type="methods-article">
  <front>
    <journal-meta><journal-id journal-id-type="publisher">ACP</journal-id><journal-title-group>
    <journal-title>Atmospheric Chemistry and Physics</journal-title>
    <abbrev-journal-title abbrev-type="publisher">ACP</abbrev-journal-title><abbrev-journal-title abbrev-type="nlm-ta">Atmos. Chem. Phys.</abbrev-journal-title>
  </journal-title-group><issn pub-type="epub">1680-7324</issn><publisher>
    <publisher-name>Copernicus Publications</publisher-name>
    <publisher-loc>Göttingen, Germany</publisher-loc>
  </publisher></journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.5194/acp-26-4771-2026</article-id><title-group><article-title>Technical note: Hybrid machine learning model for bias correction of UTLS relative humidity against IAGOS observations in ERA5 reanalysis</article-title><alt-title>Hybrid ML Bias Correction of UTLS Humidity in ERA5</alt-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author" corresp="no" rid="aff1 aff2">
          <name><surname>Antonopoulos</surname><given-names>Mathieu</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="yes" rid="aff1">
          <name><surname>Juvin-Quarroz</surname><given-names>Jérémie</given-names></name>
          <email>jeremie.juvin-quarroz@ipsl.fr</email>
        <ext-link>https://orcid.org/0000-0001-8852-771X</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Boucher</surname><given-names>Olivier</given-names></name>
          
        <ext-link>https://orcid.org/0000-0003-2328-5769</ext-link></contrib>
        <aff id="aff1"><label>1</label><institution>Institut Pierre-Simon Laplace, Sorbonne Université/CNRS, Paris, France</institution>
        </aff>
        <aff id="aff2"><label>2</label><institution>Institut Polytechnique de Paris, 91120 Palaiseau, France</institution>
        </aff>
      </contrib-group>
      <author-notes><corresp id="corr1">Jérémie Juvin-Quarroz (jeremie.juvin-quarroz@ipsl.fr)</corresp></author-notes><pub-date><day>10</day><month>April</month><year>2026</year></pub-date>
      
      <volume>26</volume>
      <issue>7</issue>
      <fpage>4771</fpage><lpage>4784</lpage>
      <history>
        <date date-type="received"><day>16</day><month>September</month><year>2025</year></date>
           <date date-type="rev-request"><day>10</day><month>December</month><year>2025</year></date>
           <date date-type="rev-recd"><day>22</day><month>January</month><year>2026</year></date>
           <date date-type="accepted"><day>25</day><month>March</month><year>2026</year></date>
      </history>
      <permissions>
        <copyright-statement>Copyright: © 2026 Mathieu Antonopoulos et al.</copyright-statement>
        <copyright-year>2026</copyright-year>
      <license license-type="open-access"><license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p></license></permissions><self-uri xlink:href="https://acp.copernicus.org/articles/26/4771/2026/acp-26-4771-2026.html">This article is available from https://acp.copernicus.org/articles/26/4771/2026/acp-26-4771-2026.html</self-uri><self-uri xlink:href="https://acp.copernicus.org/articles/26/4771/2026/acp-26-4771-2026.pdf">The full text article is available as a PDF file from https://acp.copernicus.org/articles/26/4771/2026/acp-26-4771-2026.pdf</self-uri>
      <abstract><title>Abstract</title>

      <p id="d2e105">Persistent contrail cirrus form in Ice-Supersaturated Regions (ISSRs) and are responsible for a large portion of aviation's non-<inline-formula><mml:math id="M1" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">CO</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> climate impact. Avoiding ISSRs through flight rerouting has been proposed as a short-term mitigation strategy. However, accurate prediction of the Relative Humidity with respect to ice, <inline-formula><mml:math id="M2" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, distribution within ISSRs at cruising altitude remains difficult. Observations are problematic: Satellite-based global measurements carry large uncertainties while in-situ measurements offer a limited spatial coverage. On the contrary, reanalysis offer a global estimate of <inline-formula><mml:math id="M3" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, but it suffers from a dry bias near the tropopause where ISSRs are located as well as significant random errors.  In this study, we develop a hybrid machine learning model to improve <inline-formula><mml:math id="M4" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> estimates in the upper troposphere and lower stratosphere using ERA5 and aircraft measurements from the In-service Aircraft for a Global Observing System. The model combines an XGBoost regressor for drier conditions (<inline-formula><mml:math id="M5" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M6" display="inline"><mml:mo>&lt;</mml:mo></mml:math></inline-formula> 85 %) and an Artificial Neural Network (ANN) for more humid cases (<inline-formula><mml:math id="M7" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M8" display="inline"><mml:mo>&gt;</mml:mo></mml:math></inline-formula> 85 %). This hybrid approach significantly outperforms raw ERA5 data, leveraging the ANN's ability to capture non-linear relationships and the XGBoost's robustness in handling drier conditions. The mean absolute error (MAE) is reduced from 13.7 % to 11.4 % and the Equitable Threat Score (ETS) for ISSR detection is improved from 0.36 to 0.44. The greatest improvement is observed in the lower stratosphere, where the ETS increases by 0.18 and the MAE drops to 10.71 %. These improvements mark a key step toward more reliable identification of ISSRs, helping reduce the uncertainties that currently limit effective flight-rerouting strategies.</p>
  </abstract>
    </article-meta>
  </front>
<body>
      

<sec id="Ch1.S1" sec-type="intro">
  <label>1</label><title>Introduction</title>
      <p id="d2e198">Aviation contributes to climate change through both carbon dioxide (<inline-formula><mml:math id="M9" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">CO</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>) emission and non-<inline-formula><mml:math id="M10" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">CO</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> effects. The latter may have accounted for 66 % of aviation's total net Effective Radiative Forcing (ERF) in 2018 <xref ref-type="bibr" rid="bib1.bibx26" id="paren.1"/>. The largest contributor to non-<inline-formula><mml:math id="M11" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">CO</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> ERF is the radiative forcing from condensation trails – contrails – which are linear ice clouds formed behind aircraft as hot exhaust gases mix with cold ambient air. The physical processes governing contrail formation are well understood <xref ref-type="bibr" rid="bib1.bibx34 bib1.bibx2 bib1.bibx35" id="paren.2"/>. While most contrails dissipate quickly and have a negligible climatic impact, those that form within Ice Supersaturated Regions (ISSRs) can persist for hours <xref ref-type="bibr" rid="bib1.bibx21 bib1.bibx22" id="paren.3"/>, eventually developing into contrail cirrus clouds with microphysical and optical properties similar to those of natural cirrus <xref ref-type="bibr" rid="bib1.bibx27 bib1.bibx42" id="paren.4"/>.</p>

      <fig id="F1" specific-use="star"><label>Figure 1</label><caption><p id="d2e249">Density map of IAGOS data points in the North Atlantic region for the year 2022. The color scale indicates the number of data points per 0.25° <inline-formula><mml:math id="M12" display="inline"><mml:mo>×</mml:mo></mml:math></inline-formula> 0.25° gridbox.</p></caption>
        <graphic xlink:href="https://acp.copernicus.org/articles/26/4771/2026/acp-26-4771-2026-f01.png"/>

      </fig>

      <fig id="F2" specific-use="star"><label>Figure 2</label><caption><p id="d2e267">Illustration of the collocation process between IAGOS and ERA5 datasets. Diagram <bold>(a)</bold> presents the datasets before collocation, while diagram <bold>(b)</bold> illustrates the data after preprocessing. IAGOS measurements are averaged within each ERA5 grid box, and the ERA5 variables are interpolated to match the mean altitude of the IAGOS point. The <inline-formula><mml:math id="M13" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is then recomputed based on the interpolated variables.</p></caption>
        <graphic xlink:href="https://acp.copernicus.org/articles/26/4771/2026/acp-26-4771-2026-f02.png"/>

      </fig>

      <p id="d2e294">ISSRs are regions of the atmosphere where the Relative Humidity with respect to ice is supersaturated, <inline-formula><mml:math id="M14" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M15" display="inline"><mml:mo>&gt;</mml:mo></mml:math></inline-formula> 100 %, which is a meta-stable thermodynamic state. The size of ISSRs has been estimated based on the distance flown within such regions by Measurement of Ozone and Water Vapor by Airbus In-Service Aircraft (MOZAIC) aircraft, showing that they can extend over <inline-formula><mml:math id="M16" display="inline"><mml:mrow><mml:mi mathvariant="script">O</mml:mi><mml:mo>(</mml:mo><mml:mn mathvariant="normal">10</mml:mn><mml:mtext>–</mml:mtext><mml:mn mathvariant="normal">100</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M17" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula>
<xref ref-type="bibr" rid="bib1.bibx12" id="paren.5"/>. ISSRs are typically located at the tropopause level and are rather frequent across the globe <xref ref-type="bibr" rid="bib1.bibx13 bib1.bibx37" id="paren.6"/>. Their formation is facilitated by the presence of upward winds and the meeting of cold and warm air streams <xref ref-type="bibr" rid="bib1.bibx11" id="paren.7"/>. The dynamics of ISSRs also present complex patterns as it has been determined that ISSRs' displacement may be slower than the surrounding wind field's <xref ref-type="bibr" rid="bib1.bibx20" id="paren.8"/>.</p>
      <p id="d2e354">On average, it is estimated that aircraft spends 13.5 % of their flight time in ISSRs <xref ref-type="bibr" rid="bib1.bibx13" id="paren.9"/> and that only a small proportion of flights might be responsible for the majority of persistent contrail formation. For instance, it has been estimated that only 2.7 % of total flights are responsible for 80 % of the total contrail RF in 2019 <xref ref-type="bibr" rid="bib1.bibx41" id="paren.10"/>. These findings have motivated flight rerouting as a short-term mitigation strategy, e.g. <xref ref-type="bibr" rid="bib1.bibx33" id="text.11"/>. Being able to meteorologically forecast ISSRs with enough accuracy would allow a slight deviation of the flight route to avoid the formation of persistent contrails, at the cost of a small amount of additional <inline-formula><mml:math id="M18" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">CO</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> emissions.</p>
      <p id="d2e377">Global ISSR forecasts are generally derived from Numerical Weather Prediction (NWP) models, informed by in-situ and satellite observations. However, detecting ISSRs remains challenging, particularly through satellite observations <xref ref-type="bibr" rid="bib1.bibx14 bib1.bibx10 bib1.bibx25 bib1.bibx16" id="paren.12"/> due to their relatively narrow vertical extent <xref ref-type="bibr" rid="bib1.bibx38 bib1.bibx44" id="paren.13"/>. The tropopause region, where ISSRs typically occur, also features steep vertical gradients in humidity <xref ref-type="bibr" rid="bib1.bibx39 bib1.bibx46" id="paren.14"/>, necessitating high vertical resolution for accurate characterization. Both remote measurements with lidar <xref ref-type="bibr" rid="bib1.bibx15 bib1.bibx24" id="paren.15"/> and in-situ observations, such as those made with balloon-borne instruments <xref ref-type="bibr" rid="bib1.bibx19" id="paren.16"/> and with on-flight sensors <xref ref-type="bibr" rid="bib1.bibx9 bib1.bibx29" id="paren.17"/>, provide accurate measurements of <inline-formula><mml:math id="M19" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, but are spatially and temporally limited.</p>
      <p id="d2e410">Hence we must rely on NWP models to obtain <inline-formula><mml:math id="M20" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> values across the global atmosphere. NWP models contribute in two ways: forecasts, which predict atmospheric conditions in real time and are required for operational contrail avoidance, and reanalysis, which retrospectively reconstruct past atmospheric states by assimilating all historical observations. While forecasts are necessary for live avoidance decisions, reanalysis data is well-suited for retrospective studies such as this one. We therefore use the ERA5 reanalysis dataset <xref ref-type="bibr" rid="bib1.bibx17" id="paren.18"/>, which is widely regarded as a highly trusted reconstruction of the global atmosphere. However, as other NWP products, ERA5 is known to suffer from a dry bias in the Upper Troposphere (UT) and Lower Stratosphere (LS) <xref ref-type="bibr" rid="bib1.bibx40 bib1.bibx32 bib1.bibx30" id="paren.19"/>, which means that the model often predicts lower relative humidity values compared to the observed values. This bias, as well as significant random errors, can lead to considerable errors in <inline-formula><mml:math id="M21" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> estimates, particularly in regions where contrails are likely to form. Previous work on the correction of humidity bias in ERA5 have shown potential for improvement of the ISSR detection in the UTLS. <xref ref-type="bibr" rid="bib1.bibx45" id="text.20"/> applied a bivariate Quantile Mapping (QM) correction to ERA5 temperature and relative humidity fields, significantly reducing the dry bias. They report a reduction of the <inline-formula><mml:math id="M22" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> bias from approximately <inline-formula><mml:math id="M23" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>4.5 % to <inline-formula><mml:math id="M24" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>1.2 % on average at pressure levels of 250, 225, and 200 <inline-formula><mml:math id="M25" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">hPa</mml:mi></mml:mrow></mml:math></inline-formula>. This bias reduction leads to an improved classification of different contrail categories. <xref ref-type="bibr" rid="bib1.bibx43" id="text.21"/> proposed a machine learning (ML) approach to improve ERA5's <inline-formula><mml:math id="M26" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> estimates in the UT using an Artificial Neural Network (ANN).</p>

      <fig id="F3"><label>Figure 3</label><caption><p id="d2e494">Correlation matrix showing the relationships between selected engineered features, the target variable <monospace>RHi_IAGOS</monospace>, and the initial ERA5 estimate. The engineered gradient features show a relatively strong correlations with the target <monospace>RHi_IAGOS</monospace>. The time embeddings (<monospace>cos_day</monospace>) exhibit low direct correlation with <monospace>RHi_IAGOS</monospace>.</p></caption>
        <graphic xlink:href="https://acp.copernicus.org/articles/26/4771/2026/acp-26-4771-2026-f03.png"/>

      </fig>

      <fig id="F4" specific-use="star"><label>Figure 4</label><caption><p id="d2e518">Comparison of <inline-formula><mml:math id="M27" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>  predictions between different models with respect to the target value (<monospace>RHi_IAGOS</monospace>). <bold>(a)</bold> presents the baseline ERA5 reanalysis, <bold>(b)</bold> ANN model, <bold>(c)</bold> XGBoost model, and <bold>(d)</bold> the hybrid ensemble model. All predictions are made on the test dataset between 400 and 200 <inline-formula><mml:math id="M28" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">hPa</mml:mi></mml:mrow></mml:math></inline-formula> over the North Atlantic region (cf. Fig. <xref ref-type="fig" rid="F1"/>) for the year 2022.</p></caption>
        <graphic xlink:href="https://acp.copernicus.org/articles/26/4771/2026/acp-26-4771-2026-f04.png"/>

      </fig>

      <p id="d2e564">In this study, we present a hybrid ensemble machine learning approach to correct the dry bias in ERA5's <inline-formula><mml:math id="M29" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> in the UTLS. We combine an XGBoost regressor <xref ref-type="bibr" rid="bib1.bibx5" id="paren.22"/> and an ANN, leveraging the complementary strengths of both models.  ANNs are well-suited for modeling complex, non-linear relationships in multivariate data <xref ref-type="bibr" rid="bib1.bibx47" id="paren.23"/>, while tree-based models like XGBoost excel in handling structured data and often deliver state-of-the-art performance <xref ref-type="bibr" rid="bib1.bibx5" id="paren.24"/>.  We dynamically chose the model based on the initial <inline-formula><mml:math id="M30" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> value provided by ERA5: the XGBoost is used for prediction in drier conditions (<inline-formula><mml:math id="M31" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M32" display="inline"><mml:mo>&lt;</mml:mo></mml:math></inline-formula> 85 %), while the ANN is used for more humid cases (<inline-formula><mml:math id="M33" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M34" display="inline"><mml:mo>&gt;</mml:mo></mml:math></inline-formula> 85 %).  ISSR formation involves more complex and non-linear atmospheric dynamics <xref ref-type="bibr" rid="bib1.bibx7 bib1.bibx8" id="paren.25"/>, which are better captured by ANNs due to their flexibility and non-linear representation capabilities. Effectively, the ANN captures subtle variations relevant to ISSR classification, while the XGBoost model is used to correct bias under simpler thermodynamic conditions.</p>
      <p id="d2e638">This paper is organized as follows. In Sect. <xref ref-type="sec" rid="Ch1.S2"/> we describe the IAGOS and ERA5 datasets, the collocation procedure, filtering, and feature engineering. Sect. <xref ref-type="sec" rid="Ch1.S3"/> details the ANN and XGBoost architectures, training procedures, and the hybrid model. Sect. <xref ref-type="sec" rid="Ch1.S4"/> presents our results, showing improvements in regression and classification scores under different meteorological conditions.</p>
</sec>
<sec id="Ch1.S2">
  <label>2</label><title>Data</title>
      <p id="d2e655">In this study, we will use two complementary datasets. The ERA5 reanalysis data from the European Centre for Medium-Range Weather Forecasts (ECMWF, <xref ref-type="bibr" rid="bib1.bibx6" id="altparen.26"/>), and the in-situ observations from the In-service Aircraft for a Global Observing System (IAGOS, <xref ref-type="bibr" rid="bib1.bibx29" id="altparen.27"/>) program. ERA5 provides a global and consistent estimate of atmospheric conditions through a wide range of variables, while IAGOS offers direct measurements of <inline-formula><mml:math id="M35" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> at cruise altitudes. Combining these datasets enables a supervised learning approach where the ERA5 features serve as model inputs, while the estimation of <inline-formula><mml:math id="M36" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> from IAGOS from temperature and relative humidity measurements serve as the target variable. The following subsections describe each dataset, along with our preprocessing and feature engineering methodology.</p>
      <p id="d2e686">Unlike <xref ref-type="bibr" rid="bib1.bibx43" id="text.28"/>, we focus on the North Atlantic region, spanning latitudes from 40 to 70° N and longitudes from 65° W to 5° E (Fig. <xref ref-type="fig" rid="F1"/>). This region shows a relatively uniform distribution of the IAGOS data and is known for frequent contrail formation due to high air traffic and favorable atmospheric conditions. For these reasons, it is thought to be a potential region for contrail avoidance trials. We selected the year 2022 as our study period as it offers recent atmospheric conditions and comes after the maximum COVID disruption of international flights in 2020 and 2021. Our data processing is implemented based on methodology from <xref ref-type="bibr" rid="bib1.bibx43" id="altparen.29"/>, with some modifications to ensure proper separation of the training and test datasets to avoid temporal leakage.</p>
<sec id="Ch1.S2.SS1">
  <label>2.1</label><title>ECMWF reanalysis data (ERA5)</title>
      <p id="d2e704">ERA5 data are provided on a regular latitude–longitude grid with a horizontal resolution of 0.25° <inline-formula><mml:math id="M37" display="inline"><mml:mo>×</mml:mo></mml:math></inline-formula> 0.25°, corresponding to an approximate grid spacing of 19.5 <inline-formula><mml:math id="M38" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> at 45° N latitude. Vertically, the dataset includes 37 pressure levels extending from the surface to the LS. We focus on pressure levels between 500 and 125 <inline-formula><mml:math id="M39" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">hPa</mml:mi></mml:mrow></mml:math></inline-formula>, specifically using the following levels: 500, 450, 400, 350, 300, 250, 225, 200, 175, 150 and 125 <inline-formula><mml:math id="M40" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">hPa</mml:mi></mml:mrow></mml:math></inline-formula>. The extracted ERA5 variables provide a comprehensive description of the atmospheric conditions (see Table <xref ref-type="table" rid="T1"/> for a summary of all extracted variables and their units). The ERA5 data serve as input for both statistical validation and the training of machine learning models. We use all variables as features for the model, and the <monospace>RHi_ERA5</monospace> measurement as a baseline to evaluate model performance.</p>

<table-wrap id="T1" specific-use="star"><label>Table 1</label><caption><p id="d2e747">Summary of all features used for training, grouped by category. ERA5 variables include values at current pressure level and context, <inline-formula><mml:math id="M41" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> 2 pressure levels, and 2 and 6 <inline-formula><mml:math id="M42" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">h</mml:mi></mml:mrow></mml:math></inline-formula> prior current timestep (i.e. 7 versions for each ERA5 variable). The target IAGOS value is indicated in bold. <sup>∗</sup> <monospace>RHi_ERA5</monospace> is recomputed after linear interpolation to maintain consistency and physical dependencies</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="4">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="left"/>
     <oasis:colspec colnum="4" colname="col4" align="left"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Category</oasis:entry>
         <oasis:entry colname="col2">Feature Name</oasis:entry>
         <oasis:entry colname="col3">Description</oasis:entry>
         <oasis:entry colname="col4">Unit</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">ERA5 variables</oasis:entry>
         <oasis:entry colname="col2"><monospace>T_ERA5</monospace></oasis:entry>
         <oasis:entry colname="col3">Air temperature</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M44" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">K</mml:mi></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><monospace>q_ERA5</monospace></oasis:entry>
         <oasis:entry colname="col3">Specific humidity</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M45" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">kg</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">kg</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><monospace>w_ERA5</monospace></oasis:entry>
         <oasis:entry colname="col3">Vertical wind speed</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M46" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">Pa</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">s</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><monospace>u_ERA5</monospace></oasis:entry>
         <oasis:entry colname="col3">Zonal (west-east) wind component</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M47" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">m</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">s</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><monospace>v_ERA5</monospace></oasis:entry>
         <oasis:entry colname="col3">Meridional (south-north) wind component</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M48" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">m</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">s</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><monospace>pv_ERA5</monospace></oasis:entry>
         <oasis:entry colname="col3">Potential vorticity</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M49" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">K</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">m</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">kg</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">s</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><monospace>ciwc_ERA5</monospace></oasis:entry>
         <oasis:entry colname="col3">Cloud ice water content</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M50" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">kg</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">kg</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><monospace>geopt_ERA5</monospace></oasis:entry>
         <oasis:entry colname="col3">Geopotential</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M51" display="inline"><mml:mrow class="unit"><mml:msup><mml:mi mathvariant="normal">m</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">s</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><monospace>vo_ERA5</monospace></oasis:entry>
         <oasis:entry colname="col3">Relative Vorticity</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M52" display="inline"><mml:mrow class="unit"><mml:msup><mml:mi mathvariant="normal">s</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><monospace>d_ERA5</monospace></oasis:entry>
         <oasis:entry colname="col3">Horizontal divergence</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M53" display="inline"><mml:mrow class="unit"><mml:msup><mml:mi mathvariant="normal">s</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><monospace>RHi_ERA5</monospace><sup>∗</sup></oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M55" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4">%</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Engineered</oasis:entry>
         <oasis:entry colname="col2"><monospace>T_grad_up</monospace></oasis:entry>
         <oasis:entry colname="col3">Temp. gradient: current/upper level</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M56" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">K</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">hPa</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">features</oasis:entry>
         <oasis:entry colname="col2"><monospace>T_grad_down</monospace></oasis:entry>
         <oasis:entry colname="col3">Temp. gradient: current/lower level</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M57" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">K</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">hPa</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><monospace>T_grad_centered</monospace></oasis:entry>
         <oasis:entry colname="col3">Temp. gradient: <inline-formula><mml:math id="M58" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> 25 <inline-formula><mml:math id="M59" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">hPa</mml:mi></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M60" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">K</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">hPa</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><monospace>T_grad_overall</monospace></oasis:entry>
         <oasis:entry colname="col3">Temp. gradient: <inline-formula><mml:math id="M61" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> 50 <inline-formula><mml:math id="M62" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">hPa</mml:mi></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M63" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">K</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">hPa</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><monospace>RHi_grad_up</monospace></oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M64" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>  gradient: current/upper level</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M65" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">hPa</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><monospace>RHi_grad_down</monospace></oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M66" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>  gradient: current/lower level</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M67" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">hPa</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><monospace>RHi_grad_centered</monospace></oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M68" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>  gradient: <inline-formula><mml:math id="M69" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> 25 <inline-formula><mml:math id="M70" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">hPa</mml:mi></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M71" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">hPa</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><monospace>RHi_grad_overall</monospace></oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M72" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>  gradient: <inline-formula><mml:math id="M73" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> 50 <inline-formula><mml:math id="M74" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">hPa</mml:mi></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M75" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">hPa</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><monospace>vo_grad_up</monospace></oasis:entry>
         <oasis:entry colname="col3">Vorticity gradient: current/upper level</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M76" display="inline"><mml:mrow class="unit"><mml:msup><mml:mi mathvariant="normal">s</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">hPa</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><monospace>vo_grad_down</monospace></oasis:entry>
         <oasis:entry colname="col3">Vorticity gradient: current/lower level</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M77" display="inline"><mml:mrow class="unit"><mml:msup><mml:mi mathvariant="normal">s</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">hPa</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><monospace>vo_grad_centered</monospace></oasis:entry>
         <oasis:entry colname="col3">Vorticity gradient: <inline-formula><mml:math id="M78" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> 25 <inline-formula><mml:math id="M79" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">hPa</mml:mi></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M80" display="inline"><mml:mrow class="unit"><mml:msup><mml:mi mathvariant="normal">s</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">hPa</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><monospace>vo_grad_overall</monospace></oasis:entry>
         <oasis:entry colname="col3">Vorticity gradient: <inline-formula><mml:math id="M81" display="inline"><mml:mo>±</mml:mo></mml:math></inline-formula> 50 <inline-formula><mml:math id="M82" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">hPa</mml:mi></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M83" display="inline"><mml:mrow class="unit"><mml:msup><mml:mi mathvariant="normal">s</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">hPa</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><monospace>PVU</monospace></oasis:entry>
         <oasis:entry colname="col3">Potential vorticity in PVU</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M84" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">PVU</mml:mi></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><monospace>cloudy</monospace></oasis:entry>
         <oasis:entry colname="col3">Ice-cloud flag</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M85" display="inline"><mml:mrow><mml:mo mathvariant="italic">{</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><monospace>cos_hour</monospace></oasis:entry>
         <oasis:entry colname="col3">Cosine of local time</oasis:entry>
         <oasis:entry colname="col4">–</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><monospace>sin_hour</monospace></oasis:entry>
         <oasis:entry colname="col3">Sine of local time</oasis:entry>
         <oasis:entry colname="col4">–</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><monospace>cos_day</monospace></oasis:entry>
         <oasis:entry colname="col3">Cosine of day in year</oasis:entry>
         <oasis:entry colname="col4">–</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><monospace>sin_day</monospace></oasis:entry>
         <oasis:entry colname="col3">Sine of day in year</oasis:entry>
         <oasis:entry colname="col4">–</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">IAGOS</oasis:entry>
         <oasis:entry colname="col2"><monospace>p_IAGOS</monospace></oasis:entry>
         <oasis:entry colname="col3">IAGOS pressure</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M86" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">hPa</mml:mi></mml:mrow></mml:math></inline-formula></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">variables</oasis:entry>
         <oasis:entry colname="col2"><bold><monospace>RHi_IAGOS</monospace></bold></oasis:entry>
         <oasis:entry colname="col3">IAGOS <inline-formula><mml:math id="M87" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4">%</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

</sec>
<sec id="Ch1.S2.SS2">
  <label>2.2</label><title>In-service Aircraft for a Global Observing System (IAGOS)</title>
      <p id="d2e1830">IAGOS is a European research initiative providing long-term, high-resolution in-situ atmospheric measurements to support the understanding of atmospheric composition, air quality, and climate processes. Data are collected by equipping commercial aircraft, mostly Airbus A330s, with the “P1 package” set of scientific instruments. Currently, the IAGOS fleet comprises ten aircraft operating worldwide. The <inline-formula><mml:math id="M88" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is derived from measured temperature, pressure, and water vapor mixing ratio, using the saturation vapor pressure formula by <xref ref-type="bibr" rid="bib1.bibx36" id="text.30"/>. The measurements are recorded at a temporal resolution of 4 <inline-formula><mml:math id="M89" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">s</mml:mi></mml:mrow></mml:math></inline-formula>, corresponding to a spatial resolution of approximately 1 <inline-formula><mml:math id="M90" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> at cruise speeds.  However the data points are averaged over a longer period of few minutes because of the time lags involved with the instrument <xref ref-type="bibr" rid="bib1.bibx3" id="paren.31"/>.</p>
      <p id="d2e1866">The spatial distribution of IAGOS measurements is inherently non-uniform as commercial flights tend to follow similar routes (cf. Fig. <xref ref-type="fig" rid="F1"/>). Additionally, the limited number of aircraft in the IAGOS fleet contributes to a concentration of the data in some regions. The highest data density is found along transatlantic flight corridors and Western Europe. The North Atlantic and European regions are the most represented, as most flights from IAGOS are issued by European companies, and transatlantic flights represent an important share of global aviation traffic. This area is of particular interest for contrail studies, given the frequent occurrence of ISSRs <xref ref-type="bibr" rid="bib1.bibx31" id="paren.32"/>. Most measurements are collected between 400 and 200 <inline-formula><mml:math id="M91" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">hPa</mml:mi></mml:mrow></mml:math></inline-formula>, matching the standard cruise altitude.</p>
      <p id="d2e1882">To ensure data quality, we retain only samples flagged as “good” by the IAGOS quality control system. We also restrict the dataset to measurements between 400 and 200 <inline-formula><mml:math id="M92" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">hPa</mml:mi></mml:mrow></mml:math></inline-formula>. According to <xref ref-type="bibr" rid="bib1.bibx32" id="text.33"/> and their analysis of the airborne measurements, the P1 sensors used in IAGOS show limited accuracy in dry conditions (<inline-formula><mml:math id="M93" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M94" display="inline"><mml:mo>&lt;</mml:mo></mml:math></inline-formula> 10 %). Hence, we exclude those samples from our dataset. After applying all these filtering steps for the year 2022 on the North-Atlantic region, we obtain a dataset comprising 1 535 724 data points from 678 flights.</p>
</sec>
<sec id="Ch1.S2.SS3">
  <label>2.3</label><title>Data collocation</title>
      <p id="d2e1923">The IAGOS and ERA5 data have different resolutions in both time and space, hence we must proceed to a collocation of the data. Once we collected and filtered the IAGOS data for each flight, we determine the closest point in ERA5 by matching the closest pressure level, longitude and latitude points (at a 0.25° resolution) and timestep (closest hour). Once we get these indices, we find the closest grid box in ERA5 to each IAGOS data point. The IAGOS points matching the same grid box are grouped. Given the four-second resolution of the data and the average grid box length of 19 <inline-formula><mml:math id="M95" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula>, each group contains up to 23 IAGOS points and 15 IAGOS points on average. Once grouped, the IAGOS data are averaged within one grid box to obtain a single IAGOS point per ERA5 grid box and timestep. Given that points from same flights are usually located at a similar altitude within a grid box, we effectively horizontally average the IAGOS data for longitudes and latitudes. We further proceed to a vertical linear interpolation of the ERA5 data to match the averaged altitude of the IAGOS point. Future work should test log-pressure linear interpolation to reflect the atmosphere's vertical structure. A visual representation of this process is given in Fig. <xref ref-type="fig" rid="F2"/>. In order to maintain physical relationships between variables, we recalculate <monospace>RHi_ERA5</monospace> with the interpolated variables <inline-formula><mml:math id="M96" display="inline"><mml:mi>T</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M97" display="inline"><mml:mi>q</mml:mi></mml:math></inline-formula>, which are, respectively the air temperature and the specific humidity at the relevant pressure value <inline-formula><mml:math id="M98" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula>. This is done using the Clausius-Clapeyron equation, with a Magnus form approximation for saturation vapor pressure over ice as described by <xref ref-type="bibr" rid="bib1.bibx1" id="text.34"/>. A comparison between observed <inline-formula><mml:math id="M99" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> from IAGOS and <inline-formula><mml:math id="M100" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> predictions from ERA5 reanalysis is shown in Fig. <xref ref-type="fig" rid="F4"/>a.</p>
      <p id="d2e1988">The dataset is then separated into training, validation and test datasets. Careful separation is essential, as atmospheric conditions can exhibit strong spatial and temporal correlations. Without proper splitting, the model might train on samples nearly identical to those in the test dataset, potentially leading to overfitting and artificially high performance on validation and test datasets. To avoid this issue while keeping a representative distribution across seasons, days, and hours, we implemented a by-day sampling strategy. Specifically, we sample 5 <inline-formula><mml:math id="M101" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">d</mml:mi></mml:mrow></mml:math></inline-formula> for training, 1 <inline-formula><mml:math id="M102" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">d</mml:mi></mml:mrow></mml:math></inline-formula> gap, 1 <inline-formula><mml:math id="M103" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">d</mml:mi></mml:mrow></mml:math></inline-formula> for validation, 1 <inline-formula><mml:math id="M104" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">d</mml:mi></mml:mrow></mml:math></inline-formula> gap, 5 <inline-formula><mml:math id="M105" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">d</mml:mi></mml:mrow></mml:math></inline-formula> for training, 1 <inline-formula><mml:math id="M106" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">d</mml:mi></mml:mrow></mml:math></inline-formula> gap, 1 <inline-formula><mml:math id="M107" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">d</mml:mi></mml:mrow></mml:math></inline-formula> for testing and 1 <inline-formula><mml:math id="M108" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">d</mml:mi></mml:mrow></mml:math></inline-formula> gap, uniformly throughout the year. This differs from <xref ref-type="bibr" rid="bib1.bibx43" id="text.35"/> in two ways: (i) we use 5 <inline-formula><mml:math id="M109" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">d</mml:mi></mml:mrow></mml:math></inline-formula> for training (4 <inline-formula><mml:math id="M110" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">d</mml:mi></mml:mrow></mml:math></inline-formula> in <xref ref-type="bibr" rid="bib1.bibx43" id="altparen.36"/>), (ii) an additional day gap is used after the validation or testing day to prevent correlations with the following training period. After collocation and split, we obtain 64 712 training samples, 7025 validation samples and 6085 test samples. To address the underrepresentation of high <inline-formula><mml:math id="M111" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> values and reduce potential bias, data augmentation is applied to the training dataset: samples with <inline-formula><mml:math id="M112" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> above 120 % are oversampled by a factor of 3, while those below 20 % <inline-formula><mml:math id="M113" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> are undersampled by a factor of 0.4. This approach results in a <inline-formula><mml:math id="M114" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> distribution that is more suitable for our needs.</p>
      <p id="d2e2123">Finally, we ensure that our processed data are representative of ISSRs characteristics by comparing it to the results in <xref ref-type="bibr" rid="bib1.bibx11" id="text.37"/>. We report the distribution of vorticity and divergence in Fig. <xref ref-type="fig" rid="FA1"/>.</p>

<table-wrap id="T2"><label>Table 2</label><caption><p id="d2e2135">Hyperparameter grid used for tuning the ANN model. Each row lists the set of values tested for a specific hyperparameter during the grid search process. The values in bold indicate the configuration that achieved the best performance on the validation dataset and was selected for final training.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="2">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Hyperparameter</oasis:entry>
         <oasis:entry colname="col2">Values</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1"><monospace>BATCH_SIZE</monospace></oasis:entry>
         <oasis:entry colname="col2">[512, <bold>1024</bold>, 2048]</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><monospace>DROPOUT_RATE</monospace></oasis:entry>
         <oasis:entry colname="col2">[0.0, <bold>0.1</bold>, 0.3]</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><monospace>EPOCHS</monospace></oasis:entry>
         <oasis:entry colname="col2">[100, <bold>150</bold>]</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><monospace>HIDDEN_DIM</monospace></oasis:entry>
         <oasis:entry colname="col2">[64, <bold>100</bold>, 128, 256, 512]</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><monospace>LEARNING_RATE</monospace></oasis:entry>
         <oasis:entry colname="col2">[0.1, 0.01, <bold>0.001</bold>, 0.0001]</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><monospace>NUM_LAYERS</monospace></oasis:entry>
         <oasis:entry colname="col2">[2, <bold>3</bold>, 4, 5]</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><monospace>WEIGHT_DECAY</monospace></oasis:entry>
         <oasis:entry colname="col2">[<bold>0.005</bold>, 0.0001, 0.00001]</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><monospace>MOMENTUM</monospace></oasis:entry>
         <oasis:entry colname="col2">[0.85, 0.95, <bold>0.98</bold>, 0.99]</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><monospace>PATIENCE</monospace></oasis:entry>
         <oasis:entry colname="col2">[10, <bold>20</bold>, 30]</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><monospace>MIN_DELTA</monospace></oasis:entry>
         <oasis:entry colname="col2">[<bold>0.0001</bold>]</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><monospace>WEIGHT_FACTOR</monospace></oasis:entry>
         <oasis:entry colname="col2">[3, 10, <bold>30</bold>]</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><monospace>OPTIMIZER</monospace></oasis:entry>
         <oasis:entry colname="col2">[<bold>adam</bold>, sgd]</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><monospace>SCHEDULER_FACTOR</monospace></oasis:entry>
         <oasis:entry colname="col2">[<bold>0.5</bold>]</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><monospace>SCHEDULER_PATIENCE</monospace></oasis:entry>
         <oasis:entry colname="col2">[10, <bold>20</bold>]</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

</sec>
<sec id="Ch1.S2.SS4">
  <label>2.4</label><title>Feature engineering</title>
      <p id="d2e2354">ERA5 provides an extensive set of variables, but to further improve the input data and capture new relation (Fig. <xref ref-type="fig" rid="F3"/>), additional derived variables are added as input features. Following the methodology proposed by <xref ref-type="bibr" rid="bib1.bibx43" id="text.38"/>, we add for each data point, the selected variables two pressure levels above and below the current pressure level (totaling 5 levels) as well as data from two hours and 6 <inline-formula><mml:math id="M115" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">h</mml:mi></mml:mrow></mml:math></inline-formula> prior to the current timestep. In addition we extracted several features from the original ERA5 and IAGOS data to enrich the model inputs and improve the correction of the <inline-formula><mml:math id="M116" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> estimates. These features aim to capture the vertical structure and dynamics of the atmosphere.</p>

<table-wrap id="T3" specific-use="star"><label>Table 3</label><caption><p id="d2e2384">Performance comparison of models for <inline-formula><mml:math id="M117" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> correction. Improvement relative to ERA5 baseline shown in parentheses. Best values are indicated in bold.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="4">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="left"/>
     <oasis:colspec colnum="4" colname="col4" align="left"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Model</oasis:entry>
         <oasis:entry colname="col2">ETS</oasis:entry>
         <oasis:entry colname="col3">MAE (%)</oasis:entry>
         <oasis:entry colname="col4">Description</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">ERA5 (Baseline)</oasis:entry>
         <oasis:entry colname="col2">0.36</oasis:entry>
         <oasis:entry colname="col3">13.70</oasis:entry>
         <oasis:entry colname="col4">Original reanalysis data (baseline)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">ANN</oasis:entry>
         <oasis:entry colname="col2">0.44 (+0.08)</oasis:entry>
         <oasis:entry colname="col3">12.09 (<inline-formula><mml:math id="M118" display="inline"><mml:mo lspace="0mm">-</mml:mo></mml:math></inline-formula>1.61)</oasis:entry>
         <oasis:entry colname="col4">Strong ISSR classification, moderate error reduction</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">XGBoost</oasis:entry>
         <oasis:entry colname="col2">0.35 (<inline-formula><mml:math id="M119" display="inline"><mml:mo lspace="0mm">-</mml:mo></mml:math></inline-formula>0.01)</oasis:entry>
         <oasis:entry colname="col3">11.56 (<inline-formula><mml:math id="M120" display="inline"><mml:mo lspace="0mm">-</mml:mo></mml:math></inline-formula>2.14)</oasis:entry>
         <oasis:entry colname="col4">Weak ISSR classification, strong error reduction</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Hybrid Model</oasis:entry>
         <oasis:entry colname="col2"><bold>0.44</bold> (+0.08)</oasis:entry>
         <oasis:entry colname="col3"><bold>11.37</bold> (<inline-formula><mml:math id="M121" display="inline"><mml:mo lspace="0mm">-</mml:mo></mml:math></inline-formula>2.33)</oasis:entry>
         <oasis:entry colname="col4">Strong ISSR classification, strong error reduction</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

      <p id="d2e2525">To capture the vertical temperature profile, we compute temperature gradients at different scales. These include the gradient between the current level and the two levels immediately above (<monospace>T_grad_up</monospace>), between the current level and the two levels immediately below (<monospace>T_grad_down</monospace>), a centered gradient between one level above and one below (<monospace>T_grad_centered</monospace>), and a lower-resolution gradient between two levels above and two below (<monospace>T_grad_overall</monospace>), in <inline-formula><mml:math id="M122" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">K</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">hPa</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>. Similarly, gradients of the vorticity (<monospace>vo_grad_up</monospace>, <monospace>vo_grad_down</monospace>, <monospace>vo_grad_centered</monospace>, <monospace>vo_grad_overall</monospace>), in <inline-formula><mml:math id="M123" display="inline"><mml:mrow class="unit"><mml:msup><mml:mi mathvariant="normal">s</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">hPa</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>, and the <inline-formula><mml:math id="M124" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> (<monospace>RHi_grad_up</monospace>, <monospace>RHi_grad_down</monospace>, <monospace>RHi_grad_centered</monospace>, <monospace>RHi_grad_overall</monospace>), in %, are computed following the same pattern.</p>

<table-wrap id="T4"><label>Table 4</label><caption><p id="d2e2621">Top five most important features for the ANN and XGBoost models. As expected, <monospace>RHi_ERA5</monospace> ranks highest in both models due to its strong correlation with the target variable. Several engineered features demonstrate strong importance, specifically the temperature gradient, the binary cloud presence indicator, and the time embeddings. Time embedding features show strong importance for the ANN model but not for XGBoost, indicating that each model captures different aspects of the underlying atmospheric patterns.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="4">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right" colsep="1"/>
     <oasis:colspec colnum="3" colname="col3" align="left"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry namest="col1" nameend="col2" align="center" colsep="1">ANN Model </oasis:entry>
         <oasis:entry namest="col3" nameend="col4" align="center">XGBoost Model </oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Feature</oasis:entry>
         <oasis:entry colname="col2">Importance</oasis:entry>
         <oasis:entry colname="col3">Feature</oasis:entry>
         <oasis:entry colname="col4">Importance</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><monospace>RHi_ERA5</monospace></oasis:entry>
         <oasis:entry colname="col2">0.14</oasis:entry>
         <oasis:entry colname="col3"><monospace>RHi_ERA5_prior_2h</monospace></oasis:entry>
         <oasis:entry colname="col4">0.31</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><monospace>cos_day</monospace></oasis:entry>
         <oasis:entry colname="col2">0.09</oasis:entry>
         <oasis:entry colname="col3"><monospace>RHi_ERA5</monospace></oasis:entry>
         <oasis:entry colname="col4">0.13</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><monospace>RHi_ERA5_up1</monospace></oasis:entry>
         <oasis:entry colname="col2">0.06</oasis:entry>
         <oasis:entry colname="col3"><monospace>Cloudy</monospace></oasis:entry>
         <oasis:entry colname="col4">0.12</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><monospace>cos_hour</monospace></oasis:entry>
         <oasis:entry colname="col2">0.05</oasis:entry>
         <oasis:entry colname="col3"><monospace>T_grad_down</monospace></oasis:entry>
         <oasis:entry colname="col4">0.09</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"><monospace>Cloudy</monospace></oasis:entry>
         <oasis:entry colname="col2">0.04</oasis:entry>
         <oasis:entry colname="col3"><monospace>RHi_ERA5_up1</monospace></oasis:entry>
         <oasis:entry colname="col4">0.06</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

      <p id="d2e2757">To account for the diurnal and seasonal variability in atmospheric conditions, we embedded time information using cosine functions. The local time at the observation point was computed from the UTC hour and longitude, then encoded using sine and cosine transforms to capture daily cycles (<monospace>cos_hour</monospace>, <monospace>sin_hour</monospace>). Similarly, the day of the year (<monospace>cos_day</monospace>, <monospace>sin_day</monospace>) is encoded to reflect seasonal effects. These four quantities are normalized by 2<inline-formula><mml:math id="M125" display="inline"><mml:mi mathvariant="italic">π</mml:mi></mml:math></inline-formula>.</p>
      <p id="d2e2779">These engineered features were incorporated into the training data to allow the model to better represent the complex atmospheric processes influencing the <inline-formula><mml:math id="M126" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> (cf. Table <xref ref-type="table" rid="T1"/>). Some of the engineered features proved to be statistically meaningful. In particular, the temperature gradient showed a stronger correlation with <inline-formula><mml:math id="M127" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> than the original temperature variable. Time embeddings exhibited almost null direct correlation with RHi. However, they were identified as important contributors in the feature importance analysis of the ANN, indicating that their effect may be captured in more complex, non-linear relationships within the model (cf. Fig. <xref ref-type="fig" rid="F3"/> and Table <xref ref-type="table" rid="T4"/>).</p>
      <p id="d2e2810">Data is then categorized according to distinct meteorological conditions following methodology from <xref ref-type="bibr" rid="bib1.bibx43" id="text.39"/>. To determine the cloud presence, we used the cloud ice water content (<monospace>ciwc_ERA5</monospace>) variable. If ciwc equals zero at the current pressure level and two levels above and below it (<inline-formula><mml:math id="M128" display="inline"><mml:mo lspace="0mm">±</mml:mo></mml:math></inline-formula> 2 pressure levels), the data point is classified as clear sky. If any of these five levels has a non-zero ciwc value, the point is classified as cloudy. Additionally, we separated the points from the UT and LS. Commercial flights cruise at an atmospheric level called the tropopause, which is the interface between the troposphere and the stratosphere. This separation helps to account for the differing thermodynamic properties and humidity distributions between the troposphere and stratosphere. The classification is based on potential vorticity (<monospace>pv_ERA5</monospace>). Data points are labeled as UT if potential vorticity is less than 2 <inline-formula><mml:math id="M129" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">PVU</mml:mi></mml:mrow></mml:math></inline-formula> (Potential Vorticity Units). Conversely, points with PV <inline-formula><mml:math id="M130" display="inline"><mml:mo>&gt;</mml:mo></mml:math></inline-formula> 2 <inline-formula><mml:math id="M131" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">PVU</mml:mi></mml:mrow></mml:math></inline-formula> are classified as LS. We use the typical conversion rate of 1 <inline-formula><mml:math id="M132" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">PVU</mml:mi></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M133" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 10<sup>6</sup> <inline-formula><mml:math id="M135" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">K</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">kg</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">m</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">s</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>. In the test dataset, 2641 points are classified as Cloudy-Sky, 3444 as Clear-Sky, 4566 as LS and 1519 as UT.</p>

<table-wrap id="T5" specific-use="star"><label>Table 5</label><caption><p id="d2e2912">Comparison of ERA5 and hybrid model (HYB) prediction performance against IAGOS observations across different meteorological conditions on the test dataset. We show the percentage of True Positive (TP), False Negative (FN), False Positive (FP) and True Negative(TN) per condition.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="7">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:colspec colnum="6" colname="col6" align="right"/>
     <oasis:colspec colnum="7" colname="col7" align="left"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Condition</oasis:entry>
         <oasis:entry colname="col2">Model</oasis:entry>
         <oasis:entry colname="col3">TP</oasis:entry>
         <oasis:entry colname="col4">FN</oasis:entry>
         <oasis:entry colname="col5">FP</oasis:entry>
         <oasis:entry colname="col6">TN</oasis:entry>
         <oasis:entry colname="col7">ETS</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">UT</oasis:entry>
         <oasis:entry colname="col2">ERA5</oasis:entry>
         <oasis:entry colname="col3">24.09</oasis:entry>
         <oasis:entry colname="col4">11.45</oasis:entry>
         <oasis:entry colname="col5">13.50</oasis:entry>
         <oasis:entry colname="col6">50.95</oasis:entry>
         <oasis:entry colname="col7">0.30</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">HYB</oasis:entry>
         <oasis:entry colname="col3">24.36</oasis:entry>
         <oasis:entry colname="col4">11.19</oasis:entry>
         <oasis:entry colname="col5">10.66</oasis:entry>
         <oasis:entry colname="col6">53.79</oasis:entry>
         <oasis:entry colname="col7">0.35 (+0.05)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">LS</oasis:entry>
         <oasis:entry colname="col2">ERA5</oasis:entry>
         <oasis:entry colname="col3">1.18</oasis:entry>
         <oasis:entry colname="col4">3.29</oasis:entry>
         <oasis:entry colname="col5">1.34</oasis:entry>
         <oasis:entry colname="col6">94.20</oasis:entry>
         <oasis:entry colname="col7">0.19</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">HYB</oasis:entry>
         <oasis:entry colname="col3">2.01</oasis:entry>
         <oasis:entry colname="col4">2.45</oasis:entry>
         <oasis:entry colname="col5">0.72</oasis:entry>
         <oasis:entry colname="col6">94.81</oasis:entry>
         <oasis:entry colname="col7">0.37 (+0.18)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Cloudy</oasis:entry>
         <oasis:entry colname="col2">ERA5</oasis:entry>
         <oasis:entry colname="col3">14.96</oasis:entry>
         <oasis:entry colname="col4">8.56</oasis:entry>
         <oasis:entry colname="col5">9.66</oasis:entry>
         <oasis:entry colname="col6">66.83</oasis:entry>
         <oasis:entry colname="col7">0.33</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">HYB</oasis:entry>
         <oasis:entry colname="col3">16.02</oasis:entry>
         <oasis:entry colname="col4">7.50</oasis:entry>
         <oasis:entry colname="col5">7.08</oasis:entry>
         <oasis:entry colname="col6">69.41</oasis:entry>
         <oasis:entry colname="col7">0.42 (+0.09)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Clear-sky</oasis:entry>
         <oasis:entry colname="col2">ERA5</oasis:entry>
         <oasis:entry colname="col3">0.73</oasis:entry>
         <oasis:entry colname="col4">2.85</oasis:entry>
         <oasis:entry colname="col5">0.32</oasis:entry>
         <oasis:entry colname="col6">96.11</oasis:entry>
         <oasis:entry colname="col7">0.18</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">HYB</oasis:entry>
         <oasis:entry colname="col3">1.13</oasis:entry>
         <oasis:entry colname="col4">2.44</oasis:entry>
         <oasis:entry colname="col5">0.23</oasis:entry>
         <oasis:entry colname="col6">96.20</oasis:entry>
         <oasis:entry colname="col7">0.29 (+0.11)</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

</sec>
<sec id="Ch1.S2.SS5">
  <label>2.5</label><title>Evaluation Metrics</title>
      <p id="d2e3171">The performance of the model is assessed using different metrics. Since we primarily solve a regression task, we use the Mean Absolute Error (MAE), expressed as <inline-formula><mml:math id="M136" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> percentage points. However, MAE alone does not fully capture the model's ability to correct the biases in ERA5. The aim of the model is to accurately identify ISSRs, which requires a classification perspective. Therefore, following common practices in ISSR prediction (such as in <xref ref-type="bibr" rid="bib1.bibx43" id="altparen.40"/> and <xref ref-type="bibr" rid="bib1.bibx45" id="altparen.41"/>), the Equitable Threat Score (ETS) is used as a complementary evaluation metric. The ETS is a skill score that quantifies the accuracy of a binary classification (e.g., detecting ISSRs where <inline-formula><mml:math id="M137" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M138" display="inline"><mml:mo>≥</mml:mo></mml:math></inline-formula> 100 %) while accounting for hits that could occur by random chance. The ETS is defined as

            <disp-formula id="Ch1.E1" content-type="numbered"><label>1</label><mml:math id="M139" display="block"><mml:mrow><mml:mtext>ETS</mml:mtext><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mtext>TP</mml:mtext><mml:mo>-</mml:mo><mml:mi>r</mml:mi></mml:mrow><mml:mrow><mml:mtext>TP</mml:mtext><mml:mo>+</mml:mo><mml:mtext>FP</mml:mtext><mml:mo>+</mml:mo><mml:mtext>FN</mml:mtext><mml:mo>-</mml:mo><mml:mi>r</mml:mi></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>

          where TP denotes true positives (correctly predicted ISSRs), FP denotes false positives (non-ISSRs incorrectly predicted as ISSRs), FN denotes false negatives (ISSRs missed by the model), TN denotes true negatives (correctly predicted non-ISSRs), and <inline-formula><mml:math id="M140" display="inline"><mml:mi>r</mml:mi></mml:math></inline-formula> is the number of hits expected by random chance, computed as

            <disp-formula id="Ch1.E2" content-type="numbered"><label>2</label><mml:math id="M141" display="block"><mml:mrow><mml:mi>r</mml:mi><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mo>(</mml:mo><mml:mtext>TP</mml:mtext><mml:mo>+</mml:mo><mml:mtext>FP</mml:mtext><mml:mo>)</mml:mo><mml:mo>(</mml:mo><mml:mtext>TP</mml:mtext><mml:mo>+</mml:mo><mml:mtext>FN</mml:mtext><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mtext>TP</mml:mtext><mml:mo>+</mml:mo><mml:mtext>FP</mml:mtext><mml:mo>+</mml:mo><mml:mtext>FN</mml:mtext><mml:mo>+</mml:mo><mml:mtext>TN</mml:mtext></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula></p>
      <p id="d2e3302">The ETS adjusts for random hits and is particularly suited for evaluating rare event prediction. It ranges from <inline-formula><mml:math id="M142" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:math></inline-formula> (worse than random chance) to 1 (perfect prediction), with 0 indicating no better results than random guessing. Our baseline for evaluating model improvement is the ERA5 <inline-formula><mml:math id="M143" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> prediction, which shows a 0.36 ETS and 13.7 % MAE (see Table <xref ref-type="table" rid="T3"/>).</p>
</sec>
</sec>
<sec id="Ch1.S3">
  <label>3</label><title>Hybrid Machine Learning Model development</title>
      <p id="d2e3341">Correcting the dry bias in ERA5's <inline-formula><mml:math id="M144" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> estimates in the UTLS presents a significant challenge due to the inherently non-linear nature of atmospheric processes. To address this, we propose a hybrid machine learning approach that combines two complementary models: an ANN and an XGBoost. The ANN is well-suited for capturing complex, non-linear dependencies, particularly under high humidity conditions, while XGBoost performs robustly in drier regimes.</p>
      <p id="d2e3355">The hybrid strategy dynamically selects the appropriate model for prediction based on the ERA5-estimated <inline-formula><mml:math id="M145" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> value. Specifically, for samples where ERA5 <inline-formula><mml:math id="M146" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is below 85 %, predictions are taken from the XGBoost model, which has demonstrated superior performance in lower-humidity environments. Conversely, for <inline-formula><mml:math id="M147" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> values exceeding 85 %, the ANN model is employed, leveraging its capacity to extract highly non-linear patterns, such as those associated with ISSRs.  The 85 % threshold was chosen based on two key considerations. First, ERA5 is known to exhibit a MAE of approximately 15 % in humid conditions. By subtracting this from the 100 % threshold typically used to identify ISSRs, we ensure that the ANN model is prioritized in critical high-humidity cases. Second, validation dataset performance confirmed that this hybrid architecture consistently outperforms either model used in isolation.</p>

      <fig id="F5" specific-use="star"><label>Figure 5</label><caption><p id="d2e3393">Comparison of ISSR prediction performance between ERA5 and the hybrid model across four meteorological conditions. These are the predictions made on the test dataset between 400 and 200 <inline-formula><mml:math id="M148" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">hPa</mml:mi></mml:mrow></mml:math></inline-formula> over the North Atlantic region (cf. Fig. <xref ref-type="fig" rid="F1"/>) for the year 2022.</p></caption>
        <graphic xlink:href="https://acp.copernicus.org/articles/26/4771/2026/acp-26-4771-2026-f05.png"/>

      </fig>

      <fig id="F6" specific-use="star"><label>Figure 6</label><caption><p id="d2e3415">Number of correctly predicted ISSRs by ERA5 and the hybrid model compared to IAGOS ground truth across different meteorological conditions on the 2022 test dataset. Blue bars indicate true positives from ERA5, red bars from the hybrid model, and the dashed curve represents the total number of actual ISSRs in each condition based on IAGOS observations. The largest improvement is observed in the LS. The far left bars show the overall comparison across all conditions.</p></caption>
        <graphic xlink:href="https://acp.copernicus.org/articles/26/4771/2026/acp-26-4771-2026-f06.png"/>

      </fig>

      <p id="d2e3424">Firstly, the ANN component is trained to predict <inline-formula><mml:math id="M149" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> using the input features summarized in Table 1. Data are normalized using min–max scaling, with the <inline-formula><mml:math id="M150" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> range extended to 200 % to reduce sensitivity to outliers and mitigate dry bias. The final network architecture includes three hidden layers with 100 neurons each, employing He initialization and ReLU activation functions. Batch normalization and dropout regularization are applied between layers to stabilize training and reduce overfitting. The output layer uses a linear activation function to predict <inline-formula><mml:math id="M151" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> , denoted as <monospace>RHi_ANN</monospace>.</p>
      <p id="d2e3463">Training is conducted using the Adam optimizer <xref ref-type="bibr" rid="bib1.bibx23" id="paren.42"/> with a learning rate of 0.001, decay rate of 5 <inline-formula><mml:math id="M152" display="inline"><mml:mo>×</mml:mo></mml:math></inline-formula> 10<sup>−3</sup>, and momentum of 0.98. The model is trained over 150 epochs with a batch size of 1024, and early stopping is employed to halt training if validation loss does not improve over 20 consecutive epochs. A custom loss function, based on the Mean Squared Error (MSE), is used to emphasize accuracy in high-<inline-formula><mml:math id="M154" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> conditions. The loss function is defined as:

          <disp-formula id="Ch1.E3" content-type="numbered"><label>3</label><mml:math id="M155" display="block"><mml:mrow><mml:mi mathvariant="script">L</mml:mi><mml:mo>(</mml:mo><mml:mi>y</mml:mi><mml:mo>,</mml:mo><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mn mathvariant="normal">1</mml:mn><mml:mi>n</mml:mi></mml:mfrac></mml:mstyle><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:munderover><mml:msub><mml:mi>w</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>⋅</mml:mo><mml:mo>(</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mi>i</mml:mi></mml:msub><mml:msup><mml:mo>)</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>,</mml:mo><mml:mtext> with </mml:mtext><mml:msub><mml:mi>w</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>+</mml:mo><mml:msubsup><mml:mi>y</mml:mi><mml:mi>i</mml:mi><mml:mi mathvariant="italic">α</mml:mi></mml:msubsup><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula></p>
      <p id="d2e3594">In Eq. (<xref ref-type="disp-formula" rid="Ch1.E3"/>), <inline-formula><mml:math id="M156" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the target value at index <inline-formula><mml:math id="M157" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M158" display="inline"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the predicted value at index <inline-formula><mml:math id="M159" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M160" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> is the number of samples, and <inline-formula><mml:math id="M161" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula> is a weight factor. The weight parameter (<inline-formula><mml:math id="M162" display="inline"><mml:mi mathvariant="italic">α</mml:mi></mml:math></inline-formula>) that defines the weight of the <inline-formula><mml:math id="M163" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> value is denoted <monospace>WEIGHT_FACTOR</monospace>. This weighted approach prioritizes performance in ISSRs, addressing the ERA5 dry bias. Hyperparameters were optimized via grid search on the train dataset as detailed in Table <xref ref-type="table" rid="T2"/>.</p>
      <p id="d2e3676">Secondly, the XGBoost model uses the same input feature set as the ANN, including engineered variables. XGBoost is an efficient implementation of gradient boosting known for its scalability, accuracy, and interpretability. The final model was trained with 100 estimators, a learning rate of 0.1, a maximum tree depth of 4, a subsampling ratio of 0.9, and a column sampling ratio per tree of 0.8. In Table <xref ref-type="table" rid="TA1"/>, we report the performance comparison for a set of ML models.</p>
</sec>
<sec id="Ch1.S4">
  <label>4</label><title>Results</title>
      <p id="d2e3689">The hybrid model achieved an ETS of 0.44 and a MAE of 11.37 %. Compared to ERA5, it showed an improvement of +0.08 in ETS and <inline-formula><mml:math id="M164" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>2.33 % in MAE.  Fig. <xref ref-type="fig" rid="F4"/> compares scatter plots of our different models predictions on the test dataset to the baseline ERA5 estimates, see Fig. <xref ref-type="fig" rid="F4"/>a. Figure <xref ref-type="fig" rid="F4"/>c shows that the XGBoost model was able to correct low <inline-formula><mml:math id="M165" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> values with great accuracy, consequently showing strong MAE improvement. However, it struggles with higher <inline-formula><mml:math id="M166" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> values, resulting in poor ETS. The ANN, Fig. <xref ref-type="fig" rid="F4"/>b, shows a strong ETS, indicating its ability to classify ISSRs, but it overestimates the lower <inline-formula><mml:math id="M167" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> values leading to a higher MAE. The hybrid model shown in Fig. <xref ref-type="fig" rid="F4"/>d combines the strengths of both models, achieving a good balance between ETS and MAE.</p>
      <p id="d2e3743">The <inline-formula><mml:math id="M168" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> estimates are effectively improved by combining the strengths of both models, leveraging the ANN's ability to capture non-linear relationships and the XGBoost's robustness in handling drier conditions. The hybrid model achieves a higher ETS comparable to the ANN's while maintaining a lower MAE than both models. This hybrid approach demonstrates the potential of combining different machine learning techniques to improve predictions in complex atmospheric conditions. Table <xref ref-type="table" rid="T3"/> summarizes the performance of all models in terms of ETS and MAE.</p>
      <p id="d2e3759">Table <xref ref-type="table" rid="T4"/> presents the most important features used by the ANN and XGBoost models. As expected we find <monospace>RHi_ERA5</monospace> as the most important feature. This table shows that the models relied on different features for predictions: the XGBoost model relies more heavily on features with stronger linear relationships to the target, such as the vertical temperature gradients, while the ANN is able to exploit more complex, non-linear patterns, like the time embeddings. This observation highlights the complementary nature of the two models, with the ANN capturing complex relationships (often associated with more humid regions) and the XGBoost focusing on more linear patterns.</p>
      <p id="d2e3767">The performance of the hybrid model relative to the baseline ERA5 are assessed for both UT and LS under clear- and cloudy-sky conditions. Results are shown in Fig. <xref ref-type="fig" rid="F5"/>. The best improvement in MAE is observable in the LS, from 13.19 % to 10.71 %. This is explained because the model performs a better correction on lower <inline-formula><mml:math id="M169" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> values and the LS counts significantly less ISSRs (see Fig. <xref ref-type="fig" rid="F6"/>). In Table <xref ref-type="table" rid="T5"/>, we report the predictions performance for each of the cases. In comparison, while QM successfully removes bias from the distribution, it does not improve point-wise predictive accuracy. <xref ref-type="bibr" rid="bib1.bibx45" id="text.43"/> explicitly report that after applying QM, the “RMSE, MAE, and MSE increase unnoticeably”. In contrast, our model significantly reduces the MAE from 13.70 % to 11.37 %.</p>
      <p id="d2e3791">Finally, Fig. <xref ref-type="fig" rid="F6"/> presents the number of correctly predicted ISSRs for the various cases. We observe that the hybrid model correctly predicts significantly more ISSRs than ERA5 in the LS, 92 out of 204, compared to 54 out of 204 for ERA5. Overall, the hybrid model improves ISSR detection by approximately 10 % across all conditions. In the UT, the improvement is less significant, with only 4 additional ISSRs correctly classified.</p>
</sec>
<sec id="Ch1.S5" sec-type="conclusions">
  <label>5</label><title>Conclusions</title>
      <p id="d2e3805">In this study, we used a hybrid ensemble model to improve <inline-formula><mml:math id="M170" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> from ERA5 reanalysis in the UTLS using in-situ measurement from IAGOS. The dry bias of ERA5 is considerably reduced and the number of correctly predicted ISSRs is increased, especially in the LS where it nearly doubles.</p>
      <p id="d2e3819">The hybrid model is built on two complementary models: an XGBoost regressor that excels in dry conditions (<inline-formula><mml:math id="M171" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M172" display="inline"><mml:mo>&lt;</mml:mo></mml:math></inline-formula> 85 %), and an ANN that performs better in more humid regions (<inline-formula><mml:math id="M173" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M174" display="inline"><mml:mo>&gt;</mml:mo></mml:math></inline-formula> 85 %). The hybrid model demonstrated significant improvements compared to the initial ERA5 estimates, both in terms of MAE and ETS. It achieved an MAE of 11.37 %, compared to 13.7 % for ERA5, and improved the ETS from 0.36 to 0.44. The best improvement was observed in the LS, where the hybrid model increased ETS by +0.18 and reduced MAE from 13.19 % to 10.71 %. The hybrid model consistently outperformed ERA5 across all defined meteorological conditions, detecting approximately 10 % more ISSRs overall.</p>
      <p id="d2e3858">However, the model presents limitations, especially in predicting very high <inline-formula><mml:math id="M175" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">RH</mml:mi><mml:mi mathvariant="normal">i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> values (above 120 %), likely due to the very limited availability of such samples in the dataset. Although ISSR detection has improved, reanalysis products still exhibit limitations for supporting accurate contrail impact estimates, and similar challenges are expected to apply to forecast products required for real-time contrail avoidance, particularly given the absence of clearly established performance requirements. Future work could explore advanced downsampling techniques to improve collocation between IAGOS and ERA5 datasets despite their different spatial and temporal resolutions.</p>
</sec>

      
      </body>
    <back><app-group>

<app id="App1.Ch1.S1">
  <label>Appendix A</label><title/>

      <fig id="FA1"><label>Figure A1</label><caption><p id="d2e3884">Histograms of vorticity <bold>(a)</bold> and divergence <bold>(b)</bold> for the North-Atlantic region in 2022 on the 250 <inline-formula><mml:math id="M176" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">hPa</mml:mi></mml:mrow></mml:math></inline-formula> level. This pressure level corresponds to the UT, where ISSRs are frequently observed. The blue bars refer to all points region while the red ones refer to ISSRs only. The data are normalized using the standard deviation of all grid boxes. We compared these distributions with studies of dynamical characteristics of ISSRs from <xref ref-type="bibr" rid="bib1.bibx11" id="text.44"/>. By ensuring that our processed data exhibit comparable statistical properties, we confirmed the reliability of our data processing pipeline.</p></caption>
        
        <graphic xlink:href="https://acp.copernicus.org/articles/26/4771/2026/acp-26-4771-2026-f07.png"/>

      </fig>

<table-wrap id="TA1"><label>Table A1</label><caption><p id="d2e3917">Model performance comparison across regression and classification metrics for a set of ML model available in Scikit-learn <xref ref-type="bibr" rid="bib1.bibx28" id="paren.45"/>. The train and test dataset are described in Sect. <xref ref-type="sec" rid="Ch1.S2.SS3"/>. Best values per metric are in bold.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="9">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:colspec colnum="6" colname="col6" align="right"/>
     <oasis:colspec colnum="7" colname="col7" align="right"/>
     <oasis:colspec colnum="8" colname="col8" align="right"/>
     <oasis:colspec colnum="9" colname="col9" align="right"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Model</oasis:entry>
         <oasis:entry colname="col2">Train <inline-formula><mml:math id="M177" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col3">Test <inline-formula><mml:math id="M178" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col4">Train MAE</oasis:entry>
         <oasis:entry colname="col5">Test MAE</oasis:entry>
         <oasis:entry colname="col6">Train MSE</oasis:entry>
         <oasis:entry colname="col7">Test MSE</oasis:entry>
         <oasis:entry colname="col8">Train ETS</oasis:entry>
         <oasis:entry colname="col9">Test ETS</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">Decision Tree</oasis:entry>
         <oasis:entry colname="col2">0.87</oasis:entry>
         <oasis:entry colname="col3">0.66</oasis:entry>
         <oasis:entry colname="col4">8.76</oasis:entry>
         <oasis:entry colname="col5">13.64</oasis:entry>
         <oasis:entry colname="col6">12.79</oasis:entry>
         <oasis:entry colname="col7">19.45</oasis:entry>
         <oasis:entry colname="col8">0.47</oasis:entry>
         <oasis:entry colname="col9"><bold> 0.40</bold></oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Random Forest</oasis:entry>
         <oasis:entry colname="col2"><bold>0.99</bold></oasis:entry>
         <oasis:entry colname="col3">0.74</oasis:entry>
         <oasis:entry colname="col4"><bold>1.37</bold></oasis:entry>
         <oasis:entry colname="col5">12.12</oasis:entry>
         <oasis:entry colname="col6"><bold> 2.54</bold></oasis:entry>
         <oasis:entry colname="col7">17.00</oasis:entry>
         <oasis:entry colname="col8"><bold>0.92</bold></oasis:entry>
         <oasis:entry colname="col9">0.29</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Gradient Boosting</oasis:entry>
         <oasis:entry colname="col2">0.87</oasis:entry>
         <oasis:entry colname="col3">0.75</oasis:entry>
         <oasis:entry colname="col4">9.28</oasis:entry>
         <oasis:entry colname="col5">11.84</oasis:entry>
         <oasis:entry colname="col6">12.90</oasis:entry>
         <oasis:entry colname="col7">16.56</oasis:entry>
         <oasis:entry colname="col8">0.46</oasis:entry>
         <oasis:entry colname="col9">0.31</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Hist GB</oasis:entry>
         <oasis:entry colname="col2">0.89</oasis:entry>
         <oasis:entry colname="col3">0.74</oasis:entry>
         <oasis:entry colname="col4">7.36</oasis:entry>
         <oasis:entry colname="col5">11.76</oasis:entry>
         <oasis:entry colname="col6">11.73</oasis:entry>
         <oasis:entry colname="col7">17.00</oasis:entry>
         <oasis:entry colname="col8">0.58</oasis:entry>
         <oasis:entry colname="col9">0.34</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">AdaBoost</oasis:entry>
         <oasis:entry colname="col2">0.69</oasis:entry>
         <oasis:entry colname="col3">0.61</oasis:entry>
         <oasis:entry colname="col4">17.0</oasis:entry>
         <oasis:entry colname="col5">17.79</oasis:entry>
         <oasis:entry colname="col6">19.88</oasis:entry>
         <oasis:entry colname="col7">20.84</oasis:entry>
         <oasis:entry colname="col8">0.06</oasis:entry>
         <oasis:entry colname="col9">0.02</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">XGBoost</oasis:entry>
         <oasis:entry colname="col2">0.87</oasis:entry>
         <oasis:entry colname="col3"><bold>0.76</bold></oasis:entry>
         <oasis:entry colname="col4">9.36</oasis:entry>
         <oasis:entry colname="col5">11.56</oasis:entry>
         <oasis:entry colname="col6">13.05</oasis:entry>
         <oasis:entry colname="col7"><bold>16.25</bold></oasis:entry>
         <oasis:entry colname="col8">0.48</oasis:entry>
         <oasis:entry colname="col9">0.35</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">LightGBM</oasis:entry>
         <oasis:entry colname="col2">0.87</oasis:entry>
         <oasis:entry colname="col3">0.76</oasis:entry>
         <oasis:entry colname="col4">9.19</oasis:entry>
         <oasis:entry colname="col5"><bold> 11.55</bold></oasis:entry>
         <oasis:entry colname="col6">12.82</oasis:entry>
         <oasis:entry colname="col7">16.30</oasis:entry>
         <oasis:entry colname="col8">0.48</oasis:entry>
         <oasis:entry colname="col9">0.32</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Linear Regression</oasis:entry>
         <oasis:entry colname="col2">0.77</oasis:entry>
         <oasis:entry colname="col3">0.73</oasis:entry>
         <oasis:entry colname="col4">12.30</oasis:entry>
         <oasis:entry colname="col5">12.74</oasis:entry>
         <oasis:entry colname="col6">16.98</oasis:entry>
         <oasis:entry colname="col7">17.41</oasis:entry>
         <oasis:entry colname="col8">0.29</oasis:entry>
         <oasis:entry colname="col9">0.35</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Ridge</oasis:entry>
         <oasis:entry colname="col2">0.77</oasis:entry>
         <oasis:entry colname="col3">0.73</oasis:entry>
         <oasis:entry colname="col4">12.25</oasis:entry>
         <oasis:entry colname="col5">12.62</oasis:entry>
         <oasis:entry colname="col6">16.94</oasis:entry>
         <oasis:entry colname="col7">17.34</oasis:entry>
         <oasis:entry colname="col8">0.29</oasis:entry>
         <oasis:entry colname="col9">0.37</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Lasso</oasis:entry>
         <oasis:entry colname="col2">0.75</oasis:entry>
         <oasis:entry colname="col3">0.72</oasis:entry>
         <oasis:entry colname="col4">13.53</oasis:entry>
         <oasis:entry colname="col5">13.71</oasis:entry>
         <oasis:entry colname="col6">17.95</oasis:entry>
         <oasis:entry colname="col7">17.69</oasis:entry>
         <oasis:entry colname="col8">0.08</oasis:entry>
         <oasis:entry colname="col9">0.11</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">ElasticNet</oasis:entry>
         <oasis:entry colname="col2">0.75</oasis:entry>
         <oasis:entry colname="col3">0.72</oasis:entry>
         <oasis:entry colname="col4">13.13</oasis:entry>
         <oasis:entry colname="col5">13.38</oasis:entry>
         <oasis:entry colname="col6">17.77</oasis:entry>
         <oasis:entry colname="col7">17.56</oasis:entry>
         <oasis:entry colname="col8">0.12</oasis:entry>
         <oasis:entry colname="col9">0.14</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Bayesian Ridge</oasis:entry>
         <oasis:entry colname="col2">0.77</oasis:entry>
         <oasis:entry colname="col3">0.73</oasis:entry>
         <oasis:entry colname="col4">12.25</oasis:entry>
         <oasis:entry colname="col5">12.62</oasis:entry>
         <oasis:entry colname="col6">16.95</oasis:entry>
         <oasis:entry colname="col7">17.37</oasis:entry>
         <oasis:entry colname="col8">0.29</oasis:entry>
         <oasis:entry colname="col9">0.37</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">SGD Regressor</oasis:entry>
         <oasis:entry colname="col2">0.77</oasis:entry>
         <oasis:entry colname="col3">0.72</oasis:entry>
         <oasis:entry colname="col4">12.23</oasis:entry>
         <oasis:entry colname="col5">12.62</oasis:entry>
         <oasis:entry colname="col6">17.07</oasis:entry>
         <oasis:entry colname="col7">17.41</oasis:entry>
         <oasis:entry colname="col8">0.32</oasis:entry>
         <oasis:entry colname="col9">0.39</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">KNN</oasis:entry>
         <oasis:entry colname="col2">0.99</oasis:entry>
         <oasis:entry colname="col3">0.60</oasis:entry>
         <oasis:entry colname="col4">1.76</oasis:entry>
         <oasis:entry colname="col5">15.18</oasis:entry>
         <oasis:entry colname="col6">3.54</oasis:entry>
         <oasis:entry colname="col7">21.09</oasis:entry>
         <oasis:entry colname="col8">0.88</oasis:entry>
         <oasis:entry colname="col9">0.32</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

</app>
  </app-group><notes notes-type="dataavailability"><title>Data availability</title>

      <p id="d2e4455">The IAGOS data are available on the IAGOS Data Portal at <ext-link xlink:href="https://doi.org/10.25326/06" ext-link-type="DOI">10.25326/06</ext-link> (<xref ref-type="bibr" rid="bib1.bibx4" id="altparen.46"/>). The ERA5 reanalysis data are available from the Copernicus Climate Change Service (C3S) Climate Data Store (CDS) (<ext-link xlink:href="https://doi.org/10.24381/cds.bd0915c6" ext-link-type="DOI">10.24381/cds.bd0915c6</ext-link>, <xref ref-type="bibr" rid="bib1.bibx18" id="altparen.47"/>).</p>
  </notes><app-group>
        <supplementary-material position="anchor"><p id="d2e4470">The supplement related to this article is available online at <inline-supplementary-material xlink:href="https://doi.org/10.5194/acp-26-4771-2026-supplement" xlink:title="zip">https://doi.org/10.5194/acp-26-4771-2026-supplement</inline-supplementary-material>.</p></supplementary-material>
        </app-group><notes notes-type="authorcontribution"><title>Author contributions</title>

      <p id="d2e4479">OB and JJQ conceived and designed the experiments; MA conducted the experiments and analyzed the data; MA and JJQ wrote the manuscript; OB provided critical review.</p>
  </notes><notes notes-type="competinginterests"><title>Competing interests</title>

      <p id="d2e4485">The contact author has declared that none of the authors has any competing interests.</p>
  </notes><notes notes-type="disclaimer"><title>Disclaimer</title>

      <p id="d2e4491">Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. The authors bear the ultimate responsibility for providing appropriate place names. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.</p>
  </notes><ack><title>Acknowledgements</title><p id="d2e4497">The authors thank Ziming Wang for useful discussions on her work which has initiated our study.</p></ack><notes notes-type="reviewstatement"><title>Review statement</title>

      <p id="d2e4502">This paper was edited by Franziska Aemisegger and reviewed by two anonymous referees.</p>
  </notes><ref-list>
    <title>References</title>

      <ref id="bib1.bibx1"><label>Alduchov and Eskridge(1996)</label><mixed-citation>Alduchov, O. A. and Eskridge, R. E.: Improved Magnus Form Approximation of Saturation Vapor Pressure, J. Appl. Meteorol., 35, 601–609, <ext-link xlink:href="https://doi.org/10.1175/1520-0450(1996)035&lt;0601:imfaos&gt;2.0.co;2" ext-link-type="DOI">10.1175/1520-0450(1996)035&lt;0601:imfaos&gt;2.0.co;2</ext-link>, 1996.</mixed-citation></ref>
      <ref id="bib1.bibx2"><label>Appleman(1953)</label><mixed-citation>Appleman, H.: The formation of exhaust condensation trails by jet aircraft, B. Am. Meteorol. Soc., 34, 14–20, <ext-link xlink:href="https://doi.org/10.1175/1520-0477-34.1.14" ext-link-type="DOI">10.1175/1520-0477-34.1.14</ext-link>, 1953.</mixed-citation></ref>
      <ref id="bib1.bibx3"><label>Borella et al.(2024)Borella, Vignon, Boucher, and Rohs</label><mixed-citation>Borella, A., Vignon, É., Boucher, O., and Rohs, S.: An Empirical Parameterization of the Subgrid-Scale Distribution of Water Vapor in the UTLS for Atmospheric General Circulation Models, J. Geophys. Res.-Atmos., 129, <ext-link xlink:href="https://doi.org/10.1029/2024JD040981" ext-link-type="DOI">10.1029/2024JD040981</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx4"><label>Boulanger et al.(2018)Boulanger, Thouret, and Petzold</label><mixed-citation>Boulanger, D., Thouret, V., and Petzold, A.: IAGOS final quality controlled Observational Data L2 – Time series, Aeris [data set], <ext-link xlink:href="https://doi.org/10.25326/06" ext-link-type="DOI">10.25326/06</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx5"><label>Chen and Guestrin(2016)</label><mixed-citation>Chen, T. and Guestrin, C.: XGBoost: A Scalable Tree Boosting System, in: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD '16, p. 785–794, ACM, <ext-link xlink:href="https://doi.org/10.1145/2939672.2939785" ext-link-type="DOI">10.1145/2939672.2939785</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx6"><label>Copernicus Climate Change Service(2025)</label><mixed-citation>Copernicus Climate Change Service: ERA5 hourly data on pressure levels from 1940 to present, Climate Data Store [data set], <ext-link xlink:href="https://doi.org/10.24381/cds.bd0915c6" ext-link-type="DOI">10.24381/cds.bd0915c6</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bibx7"><label>Diao et al.(2014)Diao, Zondlo, Heymsfield, Avallone, Paige, Beaton, Campos, and Rogers</label><mixed-citation>Diao, M., Zondlo, M. A., Heymsfield, A. J., Avallone, L. M., Paige, M. E., Beaton, S. P., Campos, T., and Rogers, D. C.: Cloud-scale ice-supersaturated regions spatially correlate with high water vapor heterogeneities, Atmos. Chem. Phys., 14, 2639–2656, <ext-link xlink:href="https://doi.org/10.5194/acp-14-2639-2014" ext-link-type="DOI">10.5194/acp-14-2639-2014</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx8"><label>Diao et al.(2015a)Diao, Jensen, Pan, Homeyer, Honomichl, Bresch, and Bansemer</label><mixed-citation>Diao, M., Jensen, J. B., Pan, L. L., Homeyer, C. R., Honomichl, S., Bresch, J. F., and Bansemer, A.: Distributions of ice supersaturation and ice crystals from airborne observations in relation to upper tropospheric dynamical boundaries, J. Geophys. Res.-Atmos., 120, 5101–5121, <ext-link xlink:href="https://doi.org/10.1002/2015JD023139" ext-link-type="DOI">10.1002/2015JD023139</ext-link>, 2015a.</mixed-citation></ref>
      <ref id="bib1.bibx9"><label>Diao et al.(2015b)Diao, Jensen, Pan, Homeyer, Honomichl, Bresch, and Bansemer</label><mixed-citation>Diao, M., Jensen, J. B., Pan, L. L., Homeyer, C. R., Honomichl, S., Bresch, J. F., and Bansemer, A.: Distributions of ice supersaturation and ice crystals from airborne observations in relation to upper tropospheric dynamical boundaries, J. Geophys. Res.-Atmos., 120, 5101–5121, <ext-link xlink:href="https://doi.org/10.1002/2015JD023139" ext-link-type="DOI">10.1002/2015JD023139</ext-link>, 2015b.</mixed-citation></ref>
      <ref id="bib1.bibx10"><label>Gettelman et al.(2006)Gettelman, Fetzer, Eldering, and Irion</label><mixed-citation>Gettelman, A., Fetzer, E. J., Eldering, A., and Irion, F. W.: The global distribution of supersaturation in the upper troposphere from the Atmospheric Infrared Sounder, J. Climate, 19, 6089–6103, <ext-link xlink:href="https://doi.org/10.1175/JCLI3955.1" ext-link-type="DOI">10.1175/JCLI3955.1</ext-link>, 2006.</mixed-citation></ref>
      <ref id="bib1.bibx11"><label>Gierens and Brinkop(2012)</label><mixed-citation>Gierens, K. and Brinkop, S.: Dynamical characteristics of ice supersaturated regions, Atmos. Chem. Phys., 12, 11933–11942, <ext-link xlink:href="https://doi.org/10.5194/acp-12-11933-2012" ext-link-type="DOI">10.5194/acp-12-11933-2012</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx12"><label>Gierens and Spichtinger(2000)</label><mixed-citation>Gierens, K. and Spichtinger, P.: On the size distribution of ice-supersaturated regions in the upper troposphere and lowermost stratosphere, Ann. Geophys., 18, 499–504, <ext-link xlink:href="https://doi.org/10.1007/s00585-000-0499-7" ext-link-type="DOI">10.1007/s00585-000-0499-7</ext-link>, 2000.</mixed-citation></ref>
      <ref id="bib1.bibx13"><label>Gierens et al.(1999)Gierens, Schumann, Helten, Smit, and Marenco</label><mixed-citation>Gierens, K., Schumann, U., Helten, M., Smit, H., and Marenco, A.: A distribution law for relative humidity in the upper troposphere and lower stratosphere derived from three years of MOZAIC measurements, Ann. Geophys., 17, 1218–1226, <ext-link xlink:href="https://doi.org/10.1007/s00585-999-1218-7" ext-link-type="DOI">10.1007/s00585-999-1218-7</ext-link>, 1999.</mixed-citation></ref>
      <ref id="bib1.bibx14"><label>Gierens et al.(2004)Gierens, Kohlhepp, Spichtinger, and Schroedter-Homscheidt</label><mixed-citation>Gierens, K., Kohlhepp, R., Spichtinger, P., and Schroedter-Homscheidt, M.: Ice supersaturation as seen from TOVS, Atmos. Chem. Phys., 4, 539–547, <ext-link xlink:href="https://doi.org/10.5194/acp-4-539-2004" ext-link-type="DOI">10.5194/acp-4-539-2004</ext-link>, 2004.</mixed-citation></ref>
      <ref id="bib1.bibx15"><label>Groß et al.(2014)Groß, Wirth, Schäfler, Fix, Kaufmann, and Voigt</label><mixed-citation>Groß, S., Wirth, M., Schäfler, A., Fix, A., Kaufmann, S., and Voigt, C.: Potential of airborne lidar measurements for cirrus cloud studies, Atmos. Meas. Tech., 7, 2745–2755, <ext-link xlink:href="https://doi.org/10.5194/amt-7-2745-2014" ext-link-type="DOI">10.5194/amt-7-2745-2014</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx16"><label>Hegglin et al.(2013)Hegglin, Tegtmeier, Anderson, Froidevaux, Fuller, Funke, Jones, Lingenfelser, Lumpe, Pendlebury, Remsberg, Rozanov, Toohey, Urban, von Clarmann, Walker, Wang, and Weigel</label><mixed-citation>Hegglin, M. I., Tegtmeier, S., Anderson, J., Froidevaux, L., Fuller, R., Funke, B., Jones, A., Lingenfelser, G., Lumpe, J., Pendlebury, D., Remsberg, E., Rozanov, A., Toohey, M., Urban, J., von Clarmann, T., Walker, K. A., Wang, R., and Weigel, K.: SPARC Data Initiative: Comparison of water vapor climatologies from international satellite limb sounders, J. Geophys. Res., 118, 11824–11846, <ext-link xlink:href="https://doi.org/10.1002/jgrd.50752" ext-link-type="DOI">10.1002/jgrd.50752</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx17"><label>Hersbach et al.(2020)Hersbach, Bell, Berrisford, Hirahara, Horányi, Muñoz-Sabater, Nicolas, Peubey, Radu, Schepers, Simmons, Soci, Abdalla, Abellan, Balsamo, Bechtold, Biavati, Bidlot, Bonavita, De Chiara, Dahlgren, Dee, Diamantakis, Dragani, Flemming, Forbes, Fuentes, Geer, Haimberger, Healy, Hogan, Hólm, Janisková, Keeley, Laloyaux, Lopez, Lupu, Radnoti, de Rosnay, Rozum, Vamborg, Villaume, and Thépaut</label><mixed-citation>Hersbach, H., Bell, B., Berrisford, P., Hirahara, S., Horányi, A., Muñoz-Sabater, J., Nicolas, J., Peubey, C., Radu, R., Schepers, D., Simmons, A., Soci, C., Abdalla, S., Abellan, X., Balsamo, G., Bechtold, P., Biavati, G., Bidlot, J., Bonavita, M., De Chiara, G., Dahlgren, P., Dee, D., Diamantakis, M., Dragani, R., Flemming, J., Forbes, R., Fuentes, M., Geer, A., Haimberger, L., Healy, S., Hogan, R. J., Hólm, E., Janisková, M., Keeley, S., Laloyaux, P., Lopez, P., Lupu, C., Radnoti, G., de Rosnay, P., Rozum, I., Vamborg, F., Villaume, S., and Thépaut, J.: The ERA5 global reanalysis, Q. J. Roy. Meteor. Soc., 146, 1999–2049, <ext-link xlink:href="https://doi.org/10.1002/qj.3803" ext-link-type="DOI">10.1002/qj.3803</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx18"><label>Hersbach et al.(2023)Hersbach, Bell, Berrisford, Biavati, Horányi, Muñoz Sabater, Nicolas, Peubey, Radu, Rozum, Schepers, Simmons, Soci, Dee, and Thépaut</label><mixed-citation>Hersbach, H., Bell, B., Berrisford, P., Biavati, G., Horányi, A., Muñoz Sabater, J., Nicolas, J., Peubey, C., Radu, R., Rozum, I., Schepers, D., Simmons, A., Soci, C., Dee, D., and Thépaut, J.-N.: ERA5 hourly data on pressure levels from 1940 to present, Copernicus Climate Change Service (C3S) Climate Data Store (CDS), <ext-link xlink:href="https://doi.org/10.24381/cds.bd0915c6" ext-link-type="DOI">10.24381/cds.bd0915c6</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx19"><label>Heymsfield et al.(1998)Heymsfield, Miloshevich, Twohy, Sachse, and Oltmans</label><mixed-citation>Heymsfield, A. J., Miloshevich, L. M., Twohy, C., Sachse, G., and Oltmans, S.: Upper-tropospheric relative humidity observations and implications for cirrus ice nucleation, Geophys. Res. Lett., 25, 1343–1346, <ext-link xlink:href="https://doi.org/10.1029/98GL01089" ext-link-type="DOI">10.1029/98GL01089</ext-link>, 1998.</mixed-citation></ref>
      <ref id="bib1.bibx20"><label>Hofer and Gierens(2025)</label><mixed-citation>Hofer, S. M. and Gierens, K. M.: Kinematic properties of regions that can involve persistent contrails over the North Atlantic and Europe during April and May 2024, Atmos. Chem. Phys., 25, 6843–6856, <ext-link xlink:href="https://doi.org/10.5194/acp-25-6843-2025" ext-link-type="DOI">10.5194/acp-25-6843-2025</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bibx21"><label>Kärcher(2018)</label><mixed-citation>Kärcher, B.: Formation and radiative forcing of contrail cirrus, Nat. Commun., 9, 1824, <ext-link xlink:href="https://doi.org/10.1038/s41467-018-04068-0" ext-link-type="DOI">10.1038/s41467-018-04068-0</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx22"><label>Kärcher and Corcos(2025)</label><mixed-citation> Kärcher, B. and Corcos, M.: On the lifetimes of persistent contrails and contrail cirrus, manuscript submitted to J. Geophys. Res-Atmos., 2025.</mixed-citation></ref>
      <ref id="bib1.bibx23"><label>Kingma and Ba(2014)</label><mixed-citation>Kingma, D. and Ba, J.: Adam: A Method for Stochastic Optimization, International Conference on Learning Representations, <uri>http://arxiv.org/abs/1412.6980</uri> (last access: 31 July 2025), 2014.</mixed-citation></ref>
      <ref id="bib1.bibx24"><label>Krüger et al.(2022)Krüger, Schäfler, Wirth, Weissmann, and Craig</label><mixed-citation>Krüger, K., Schäfler, A., Wirth, M., Weissmann, M., and Craig, G. C.: Vertical structure of the lower-stratospheric moist bias in the ERA5 reanalysis and its connection to mixing processes, Atmos. Chem. Phys., 22, 15559–15577, <ext-link xlink:href="https://doi.org/10.5194/acp-22-15559-2022" ext-link-type="DOI">10.5194/acp-22-15559-2022</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx25"><label>Lamquin et al.(2012)Lamquin, Stubenrauch, Gierens, Burkhardt, and Smit</label><mixed-citation>Lamquin, N., Stubenrauch, C. J., Gierens, K., Burkhardt, U., and Smit, H.: A global climatology of upper-tropospheric ice supersaturation occurrence inferred from the Atmospheric Infrared Sounder calibrated by MOZAIC, Atmos. Chem. Phys., 12, 381–405, <ext-link xlink:href="https://doi.org/10.5194/acp-12-381-2012" ext-link-type="DOI">10.5194/acp-12-381-2012</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx26"><label>Lee et al.(2021)Lee, Fahey, Skowron, Allen, Burkhardt, Chen, Doherty, Freeman, Forster, Fuglestvedt, Gettelman, De León, Lim, Lund, Millar, Owen, Penner, Pitari, Prather, Sausen, and Wilcox</label><mixed-citation>Lee, D., Fahey, D., Skowron, A., Allen, M., Burkhardt, U., Chen, Q., Doherty, S., Freeman, S., Forster, P., Fuglestvedt, J., Gettelman, A., De León, R., Lim, L., Lund, M., Millar, R., Owen, B., Penner, J., Pitari, G., Prather, M., Sausen, R., and Wilcox, L.: The contribution of global aviation to anthropogenic climate forcing for 2000 to 2018, Atmos. Environ., 244, 117834, <ext-link xlink:href="https://doi.org/10.1016/j.atmosenv.2020.117834" ext-link-type="DOI">10.1016/j.atmosenv.2020.117834</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx27"><label>Li et al.(2023)Li, Mahnke, Rohs, Bundke, Spelten, Dekoutsidis, Groß, Voigt, Schumann, Petzold, and Krämer</label><mixed-citation>Li, Y., Mahnke, C., Rohs, S., Bundke, U., Spelten, N., Dekoutsidis, G., Groß, S., Voigt, C., Schumann, U., Petzold, A., and Krämer, M.: Upper-tropospheric slightly ice-subsaturated regions: frequency of occurrence and statistical evidence for the appearance of contrail cirrus, Atmos. Chem. Phys., 23, 2251–2271, <ext-link xlink:href="https://doi.org/10.5194/acp-23-2251-2023" ext-link-type="DOI">10.5194/acp-23-2251-2023</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx28"><label>Pedregosa et al.(2011)Pedregosa, Varoquaux, Gramfort, Michel, Thirion, Grisel, Blondel, Prettenhofer, Weiss, Dubourg, Vanderplas, Passos, Cournapeau, Brucher, Perrot, and Duchesnay</label><mixed-citation> Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., and Duchesnay, E.: Scikit-learn: Machine Learning in Python, J. Mach. Learn. Res., 12, 2825–2830, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx29"><label>Petzold et al.(2015)Petzold, Thouret, Gerbig, Zahn, Brenninkmeijer, Gallagher, Hermann, Pontaud, Ziereis, Boulanger, Marshall, Nédélec, Smit, Friess, Flaud, Wahner, Cammas, Volz-Thomas, and Team</label><mixed-citation>Petzold, A., Thouret, V., Gerbig, C., Zahn, A., Brenninkmeijer, C. A. M., Gallagher, M., Hermann, M., Pontaud, M., Ziereis, H., Boulanger, D., Marshall, J., Nédélec, P., Smit, H. G. J., Friess, U., Flaud, J.-M., Wahner, A., Cammas, J.-P., Volz-Thomas, A., and Team, I.: Global-scale atmosphere monitoring by in-service aircraft – current achievements and future prospects of the European Research Infrastructure IAGOS, Tellus B, 67, 28452, <ext-link xlink:href="https://doi.org/10.3402/tellusb.v67.28452" ext-link-type="DOI">10.3402/tellusb.v67.28452</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx30"><label>Platt et al.(2024)Platt, Shapiro, Engberg, McCloskey, Geraedts, Sankar, Stettler, Teoh, Schumann, Rohs, Brand, and Van Arsdale</label><mixed-citation>Platt, J. C., Shapiro, M. L., Engberg, Z., McCloskey, K., Geraedts, S., Sankar, T., Stettler, M. E. J., Teoh, R., Schumann, U., Rohs, S., Brand, E., and Van Arsdale, C.: The effect of uncertainty in humidity and model parameters on the prediction of contrail energy forcing, Environmental Research Communications, 6, 095015, <ext-link xlink:href="https://doi.org/10.1088/2515-7620/ad6ee5" ext-link-type="DOI">10.1088/2515-7620/ad6ee5</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx31"><label>Reutter et al.(2020)Reutter, Neis, Rohs, and Sauvage</label><mixed-citation>Reutter, P., Neis, P., Rohs, S., and Sauvage, B.: Ice supersaturated regions: properties and validation of ERA-Interim reanalysis with IAGOS in situ water vapour measurements, Atmos. Chem. Phys., 20, 787–804, <ext-link xlink:href="https://doi.org/10.5194/acp-20-787-2020" ext-link-type="DOI">10.5194/acp-20-787-2020</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx32"><label>Rolf et al.(2023)Rolf, Rohs, Smit, Krämer, Bozóki, Hofmann, Franke, Maser, Hoor, and Petzold</label><mixed-citation>Rolf, C., Rohs, S., Smit, H., Krämer, M., Bozóki, Z., Hofmann, S., Franke, H., Maser, R., Hoor, P., and Petzold, A.: Evaluation of compact hygrometers for continuous airborne measurements, Meteorol. Z., 33, <ext-link xlink:href="https://doi.org/10.1127/metz/2023/1187" ext-link-type="DOI">10.1127/metz/2023/1187</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx33"><label>Rosenow et al.(2018)Rosenow, Fricke, Luchkova et al.</label><mixed-citation>Rosenow, J., Fricke, H., Luchkova, O., and Schultz, M.: Minimizing contrail formation by rerouting around dynamic ice-supersaturated regions, Aeronautics and Aerospace Open Access Journal, 2, 105–111, <ext-link xlink:href="https://doi.org/10.15406/aaoaj.2018.02.00039" ext-link-type="DOI">10.15406/aaoaj.2018.02.00039</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx34"><label>Schmidt(1941)</label><mixed-citation> Schmidt, E.: Die Entstehung von Eisnebel aus den Auspuffgasen von Flugmotoren, Eintrag von Ulrich Schumann, zur Sicherstellung des Zugangs zu diesem wissenschaftshistorisch wichtigen Dokument; mit Zustimmung des Rechteinhabers, Nachfolger des Oldenbourg Verlags, 1941.</mixed-citation></ref>
      <ref id="bib1.bibx35"><label>Schumann(1996)</label><mixed-citation>Schumann, U.: On conditions for contrail formation from aircraft exhausts, DLR Forschungsbericht DLR-FB 1996-10, Deutsches Zentrum für Luft- und Raumfahrt (DLR), <uri>https://elib.dlr.de/8302/</uri> (last access: 31 July 2025), 1996.</mixed-citation></ref>
      <ref id="bib1.bibx36"><label>Sonntag(1994)</label><mixed-citation>Sonntag, D.: Advancements in the field of hygrometry, Meteorol. Z., 3, 51–66, <ext-link xlink:href="https://doi.org/10.1127/metz/3/1994/51" ext-link-type="DOI">10.1127/metz/3/1994/51</ext-link>, 1994.</mixed-citation></ref>
      <ref id="bib1.bibx37"><label>Spichtinger et al.(2003a)Spichtinger, Gierens, Leiterer, and Dier</label><mixed-citation>Spichtinger, P., Gierens, K., Leiterer, U., and Dier, H.: Ice supersaturation in the tropopause region over Lindenberg, Germany, Meteorol. Z., 12, 143–156, <ext-link xlink:href="https://doi.org/10.1127/0941-2948/2003/0012-0143" ext-link-type="DOI">10.1127/0941-2948/2003/0012-0143</ext-link>, 2003a.</mixed-citation></ref>
      <ref id="bib1.bibx38"><label>Spichtinger et al.(2003b)Spichtinger, Gierens, and Read</label><mixed-citation>Spichtinger, P., Gierens, K., and Read, W.: The global distribution of ice-supersaturated regions as seen by the Microwave Limb Sounder, Q. J. Roy. Meteor. Soc., 129, 3391–3410, <ext-link xlink:href="https://doi.org/10.1256/qj.02.141" ext-link-type="DOI">10.1256/qj.02.141</ext-link>, 2003b.</mixed-citation></ref>
      <ref id="bib1.bibx39"><label>Stenke et al.(2008)Stenke, Grewe, and Ponater</label><mixed-citation>Stenke, A., Grewe, V., and Ponater, M.: Lagrangian transport of water vapor and cloud water in the ECHAM4 GCM and its impact on the cold bias, Clim. Dynam., 31, 491–506, <ext-link xlink:href="https://doi.org/10.1007/S00382-007-0347-5" ext-link-type="DOI">10.1007/S00382-007-0347-5</ext-link>, 2008.</mixed-citation></ref>
      <ref id="bib1.bibx40"><label>Teoh et al.(2022)Teoh, Schumann, Gryspeerdt, Shapiro, Molloy, Koudis, Voigt, and Stettler</label><mixed-citation>Teoh, R., Schumann, U., Gryspeerdt, E., Shapiro, M., Molloy, J., Koudis, G., Voigt, C., and Stettler, M. E. J.: Aviation contrail climate effects in the North Atlantic from 2016 to 2021, Atmos. Chem. Phys., 22, 10919–10935, <ext-link xlink:href="https://doi.org/10.5194/acp-22-10919-2022" ext-link-type="DOI">10.5194/acp-22-10919-2022</ext-link>, 2022. </mixed-citation></ref>
      <ref id="bib1.bibx41"><label>Teoh et al.(2024)Teoh, Engberg, Schumann, Voigt, Shapiro, Rohs, and Stettler</label><mixed-citation>Teoh, R., Engberg, Z., Schumann, U., Voigt, C., Shapiro, M., Rohs, S., and Stettler, M. E. J.: Global aviation contrail climate effects from 2019 to 2021, Atmos. Chem. Phys., 24, 6071–6093, <ext-link xlink:href="https://doi.org/10.5194/acp-24-6071-2024" ext-link-type="DOI">10.5194/acp-24-6071-2024</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx42"><label>Wang et al.(2024)Wang, Wolf, Boucher, and Bellouin</label><mixed-citation>Wang, X., Wolf, K., Boucher, O., and Bellouin, N.: Radiative Effect of Two Contrail Cirrus Outbreaks Over Western Europe Estimated Using Geostationary Satellite Observations and Radiative Transfer Calculations, Geophys. Res. Lett., 51, <ext-link xlink:href="https://doi.org/10.1029/2024gl108452" ext-link-type="DOI">10.1029/2024gl108452</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx43"><label>Wang et al.(2025)Wang, Bugliaro, Gierens, Hegglin, Rohs, Petzold, Kaufmann, and Voigt</label><mixed-citation>Wang, Z., Bugliaro, L., Gierens, K., Hegglin, M. I., Rohs, S., Petzold, A., Kaufmann, S., and Voigt, C.: Machine learning for improvement of upper-tropospheric relative humidity in ERA5 weather model data, Atmos. Chem. Phys., 25, 2845–2861, <ext-link xlink:href="https://doi.org/10.5194/acp-25-2845-2025" ext-link-type="DOI">10.5194/acp-25-2845-2025</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bibx44"><label>Wolf et al.(2023)Wolf, Bellouin, and Boucher</label><mixed-citation>Wolf, K., Bellouin, N., and Boucher, O.: Long-term upper-troposphere climatology of potential contrail occurrence over the Paris area derived from radiosonde observations, Atmos. Chem. Phys., 23, 287–309, <ext-link xlink:href="https://doi.org/10.5194/acp-23-287-2023" ext-link-type="DOI">10.5194/acp-23-287-2023</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx45"><label>Wolf et al.(2025)Wolf, Bellouin, Boucher, Rohs, and Li</label><mixed-citation>Wolf, K., Bellouin, N., Boucher, O., Rohs, S., and Li, Y.: Correction of ERA5 temperature and relative humidity biases by bivariate quantile mapping for contrail formation analysis, Atmos. Chem. Phys., 25, 157–181, <ext-link xlink:href="https://doi.org/10.5194/acp-25-157-2025" ext-link-type="DOI">10.5194/acp-25-157-2025</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bibx46"><label>Zahn et al.(2014)Zahn, Christner, van Velthoven, Rauthe-Schöch, and Brenninkmeijer</label><mixed-citation>Zahn, A., Christner, E., van Velthoven, P. F. J., Rauthe-Schöch, A., and Brenninkmeijer, C. A. M.: Processes controlling water vapor in the upper troposphere/lowermost stratosphere: An analysis of 8 years of monthly measurements by the IAGOS-CARIBIC observatory, J. Geophys. Res., 119, 11505–11525, <ext-link xlink:href="https://doi.org/10.1002/2014JD021687" ext-link-type="DOI">10.1002/2014JD021687</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx47"><label>Živković et al.(2009)</label><mixed-citation> Živković, Ž., Mihajlović, I., and Nikolić, D.: Artificial neural network method applied on the nonlinear multivariate problems, Serbian Journal of Management, 4, 143–155, 2009.</mixed-citation></ref>

  </ref-list></back>
    <!--<article-title-html>Technical note: Hybrid machine learning model for bias correction of UTLS relative humidity against IAGOS observations in ERA5 reanalysis</article-title-html>
<abstract-html/>
<ref-html id="bib1.bib1"><label>Alduchov and Eskridge(1996)</label><mixed-citation>
      
Alduchov, O. A. and Eskridge, R. E.:
Improved Magnus Form Approximation of Saturation Vapor Pressure, J. Appl. Meteorol., 35, 601–609, <a href="https://doi.org/10.1175/1520-0450(1996)035&lt;0601:imfaos&gt;2.0.co;2" target="_blank">https://doi.org/10.1175/1520-0450(1996)035&lt;0601:imfaos&gt;2.0.co;2</a>, 1996.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib2"><label>Appleman(1953)</label><mixed-citation>
      
Appleman, H.:
The formation of exhaust condensation trails by jet aircraft, B. Am. Meteorol. Soc., 34, 14–20, <a href="https://doi.org/10.1175/1520-0477-34.1.14" target="_blank">https://doi.org/10.1175/1520-0477-34.1.14</a>, 1953.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib3"><label>Borella et al.(2024)Borella, Vignon, Boucher, and Rohs</label><mixed-citation>
      
Borella, A., Vignon, É., Boucher, O., and Rohs, S.:
An Empirical Parameterization of the Subgrid-Scale Distribution of Water Vapor in the UTLS for Atmospheric General Circulation Models, J. Geophys. Res.-Atmos., 129, <a href="https://doi.org/10.1029/2024JD040981" target="_blank">https://doi.org/10.1029/2024JD040981</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib4"><label>Boulanger et al.(2018)Boulanger, Thouret, and Petzold</label><mixed-citation>
      
Boulanger, D., Thouret, V., and Petzold, A.:
IAGOS final quality controlled Observational Data L2 – Time series, Aeris [data set], <a href="https://doi.org/10.25326/06" target="_blank">https://doi.org/10.25326/06</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib5"><label>Chen and Guestrin(2016)</label><mixed-citation>
      
Chen, T. and Guestrin, C.:
XGBoost: A Scalable Tree Boosting System, in: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD '16, p. 785–794, ACM, <a href="https://doi.org/10.1145/2939672.2939785" target="_blank">https://doi.org/10.1145/2939672.2939785</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib6"><label>Copernicus Climate Change Service(2025)</label><mixed-citation>
      
Copernicus Climate Change Service:
ERA5 hourly data on pressure levels from 1940 to present, Climate Data Store [data set], <a href="https://doi.org/10.24381/cds.bd0915c6" target="_blank">https://doi.org/10.24381/cds.bd0915c6</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib7"><label>Diao et al.(2014)Diao, Zondlo, Heymsfield, Avallone, Paige, Beaton, Campos, and Rogers</label><mixed-citation>
      
Diao, M., Zondlo, M. A., Heymsfield, A. J., Avallone, L. M., Paige, M. E., Beaton, S. P., Campos, T., and Rogers, D. C.:
Cloud-scale ice-supersaturated regions spatially correlate with high water vapor heterogeneities, Atmos. Chem. Phys., 14, 2639–2656, <a href="https://doi.org/10.5194/acp-14-2639-2014" target="_blank">https://doi.org/10.5194/acp-14-2639-2014</a>, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib8"><label>Diao et al.(2015a)Diao, Jensen, Pan, Homeyer, Honomichl, Bresch, and Bansemer</label><mixed-citation>
      
Diao, M., Jensen, J. B., Pan, L. L., Homeyer, C. R., Honomichl, S., Bresch, J. F., and Bansemer, A.:
Distributions of ice supersaturation and ice crystals from airborne observations in relation to upper tropospheric dynamical boundaries, J. Geophys. Res.-Atmos., 120, 5101–5121, <a href="https://doi.org/10.1002/2015JD023139" target="_blank">https://doi.org/10.1002/2015JD023139</a>, 2015a.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib9"><label>Diao et al.(2015b)Diao, Jensen, Pan, Homeyer, Honomichl, Bresch, and Bansemer</label><mixed-citation>
      
Diao, M., Jensen, J. B., Pan, L. L., Homeyer, C. R., Honomichl, S., Bresch, J. F., and Bansemer, A.:
Distributions of ice supersaturation and ice crystals from airborne observations in relation to upper tropospheric dynamical boundaries, J. Geophys. Res.-Atmos., 120, 5101–5121, <a href="https://doi.org/10.1002/2015JD023139" target="_blank">https://doi.org/10.1002/2015JD023139</a>, 2015b.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib10"><label>Gettelman et al.(2006)Gettelman, Fetzer, Eldering, and Irion</label><mixed-citation>
      
Gettelman, A., Fetzer, E. J., Eldering, A., and Irion, F. W.:
The global distribution of supersaturation in the upper troposphere from the Atmospheric Infrared Sounder, J. Climate, 19, 6089–6103, <a href="https://doi.org/10.1175/JCLI3955.1" target="_blank">https://doi.org/10.1175/JCLI3955.1</a>, 2006.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib11"><label>Gierens and Brinkop(2012)</label><mixed-citation>
      
Gierens, K. and Brinkop, S.:
Dynamical characteristics of ice supersaturated regions, Atmos. Chem. Phys., 12, 11933–11942, <a href="https://doi.org/10.5194/acp-12-11933-2012" target="_blank">https://doi.org/10.5194/acp-12-11933-2012</a>, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib12"><label>Gierens and Spichtinger(2000)</label><mixed-citation>
      
Gierens, K. and Spichtinger, P.:
On the size distribution of ice-supersaturated regions in the upper troposphere and lowermost stratosphere, Ann. Geophys., 18, 499–504, <a href="https://doi.org/10.1007/s00585-000-0499-7" target="_blank">https://doi.org/10.1007/s00585-000-0499-7</a>, 2000.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib13"><label>Gierens et al.(1999)Gierens, Schumann, Helten, Smit, and Marenco</label><mixed-citation>
      
Gierens, K., Schumann, U., Helten, M., Smit, H., and Marenco, A.:
A distribution law for relative humidity in the upper troposphere and lower stratosphere derived from three years of MOZAIC measurements, Ann. Geophys., 17, 1218–1226, <a href="https://doi.org/10.1007/s00585-999-1218-7" target="_blank">https://doi.org/10.1007/s00585-999-1218-7</a>, 1999.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib14"><label>Gierens et al.(2004)Gierens, Kohlhepp, Spichtinger, and Schroedter-Homscheidt</label><mixed-citation>
      
Gierens, K., Kohlhepp, R., Spichtinger, P., and Schroedter-Homscheidt, M.:
Ice supersaturation as seen from TOVS, Atmos. Chem. Phys., 4, 539–547, <a href="https://doi.org/10.5194/acp-4-539-2004" target="_blank">https://doi.org/10.5194/acp-4-539-2004</a>, 2004.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib15"><label>Groß et al.(2014)Groß, Wirth, Schäfler, Fix, Kaufmann, and Voigt</label><mixed-citation>
      
Groß, S., Wirth, M., Schäfler, A., Fix, A., Kaufmann, S., and Voigt, C.:
Potential of airborne lidar measurements for cirrus cloud studies, Atmos. Meas. Tech., 7, 2745–2755, <a href="https://doi.org/10.5194/amt-7-2745-2014" target="_blank">https://doi.org/10.5194/amt-7-2745-2014</a>, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib16"><label>Hegglin et al.(2013)Hegglin, Tegtmeier, Anderson, Froidevaux, Fuller, Funke, Jones, Lingenfelser, Lumpe, Pendlebury, Remsberg, Rozanov, Toohey, Urban, von Clarmann, Walker, Wang, and Weigel</label><mixed-citation>
      
Hegglin, M. I., Tegtmeier, S., Anderson, J., Froidevaux, L., Fuller, R., Funke, B., Jones, A., Lingenfelser, G., Lumpe, J., Pendlebury, D., Remsberg, E., Rozanov, A., Toohey, M., Urban, J., von Clarmann, T., Walker, K. A., Wang, R., and Weigel, K.:
SPARC Data Initiative: Comparison of water vapor climatologies from international satellite limb sounders, J. Geophys. Res., 118, 11824–11846, <a href="https://doi.org/10.1002/jgrd.50752" target="_blank">https://doi.org/10.1002/jgrd.50752</a>, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib17"><label>Hersbach et al.(2020)Hersbach, Bell, Berrisford, Hirahara, Horányi, Muñoz-Sabater, Nicolas, Peubey, Radu, Schepers, Simmons, Soci, Abdalla, Abellan, Balsamo, Bechtold, Biavati, Bidlot, Bonavita, De Chiara, Dahlgren, Dee, Diamantakis, Dragani, Flemming, Forbes, Fuentes, Geer, Haimberger, Healy, Hogan, Hólm, Janisková, Keeley, Laloyaux, Lopez, Lupu, Radnoti, de Rosnay, Rozum, Vamborg, Villaume, and Thépaut</label><mixed-citation>
      
Hersbach, H., Bell, B., Berrisford, P., Hirahara, S., Horányi, A., Muñoz-Sabater, J., Nicolas, J., Peubey, C., Radu, R., Schepers, D., Simmons, A., Soci, C., Abdalla, S., Abellan, X., Balsamo, G., Bechtold, P., Biavati, G., Bidlot, J., Bonavita, M., De Chiara, G., Dahlgren, P., Dee, D., Diamantakis, M., Dragani, R., Flemming, J., Forbes, R., Fuentes, M., Geer, A., Haimberger, L., Healy, S., Hogan, R. J., Hólm, E., Janisková, M., Keeley, S., Laloyaux, P., Lopez, P., Lupu, C., Radnoti, G., de Rosnay, P., Rozum, I., Vamborg, F., Villaume, S., and Thépaut, J.:
The ERA5 global reanalysis, Q. J. Roy. Meteor. Soc., 146, 1999–2049, <a href="https://doi.org/10.1002/qj.3803" target="_blank">https://doi.org/10.1002/qj.3803</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib18"><label>Hersbach et al.(2023)Hersbach, Bell, Berrisford, Biavati, Horányi, Muñoz Sabater, Nicolas, Peubey, Radu, Rozum, Schepers, Simmons, Soci, Dee, and Thépaut</label><mixed-citation>
      
Hersbach, H., Bell, B., Berrisford, P., Biavati, G., Horányi, A., Muñoz Sabater, J., Nicolas, J., Peubey, C., Radu, R., Rozum, I., Schepers, D., Simmons, A., Soci, C., Dee, D., and Thépaut, J.-N.:
ERA5 hourly data on pressure levels from 1940 to present, Copernicus Climate Change Service (C3S) Climate Data Store (CDS), <a href="https://doi.org/10.24381/cds.bd0915c6" target="_blank">https://doi.org/10.24381/cds.bd0915c6</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib19"><label>Heymsfield et al.(1998)Heymsfield, Miloshevich, Twohy, Sachse, and Oltmans</label><mixed-citation>
      
Heymsfield, A. J., Miloshevich, L. M., Twohy, C., Sachse, G., and Oltmans, S.:
Upper-tropospheric relative humidity observations and implications for cirrus ice nucleation, Geophys. Res. Lett., 25, 1343–1346, <a href="https://doi.org/10.1029/98GL01089" target="_blank">https://doi.org/10.1029/98GL01089</a>, 1998.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib20"><label>Hofer and Gierens(2025)</label><mixed-citation>
      
Hofer, S. M. and Gierens, K. M.:
Kinematic properties of regions that can involve persistent contrails over the North Atlantic and Europe during April and May 2024, Atmos. Chem. Phys., 25, 6843–6856, <a href="https://doi.org/10.5194/acp-25-6843-2025" target="_blank">https://doi.org/10.5194/acp-25-6843-2025</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib21"><label>Kärcher(2018)</label><mixed-citation>
      
Kärcher, B.:
Formation and radiative forcing of contrail cirrus, Nat. Commun., 9, 1824, <a href="https://doi.org/10.1038/s41467-018-04068-0" target="_blank">https://doi.org/10.1038/s41467-018-04068-0</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib22"><label>Kärcher and Corcos(2025)</label><mixed-citation>
      
Kärcher, B. and Corcos, M.:
On the lifetimes of persistent contrails and contrail cirrus, manuscript submitted to J. Geophys. Res-Atmos., 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib23"><label>Kingma and Ba(2014)</label><mixed-citation>
      
Kingma, D. and Ba, J.:
Adam: A Method for Stochastic Optimization, International Conference on Learning Representations, <a href="http://arxiv.org/abs/1412.6980" target="_blank"/> (last access: 31 July 2025), 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib24"><label>Krüger et al.(2022)Krüger, Schäfler, Wirth, Weissmann, and Craig</label><mixed-citation>
      
Krüger, K., Schäfler, A., Wirth, M., Weissmann, M., and Craig, G. C.:
Vertical structure of the lower-stratospheric moist bias in the ERA5 reanalysis and its connection to mixing processes, Atmos. Chem. Phys., 22, 15559–15577, <a href="https://doi.org/10.5194/acp-22-15559-2022" target="_blank">https://doi.org/10.5194/acp-22-15559-2022</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib25"><label>Lamquin et al.(2012)Lamquin, Stubenrauch, Gierens, Burkhardt, and Smit</label><mixed-citation>
      
Lamquin, N., Stubenrauch, C. J., Gierens, K., Burkhardt, U., and Smit, H.:
A global climatology of upper-tropospheric ice supersaturation occurrence inferred from the Atmospheric Infrared Sounder calibrated by MOZAIC, Atmos. Chem. Phys., 12, 381–405, <a href="https://doi.org/10.5194/acp-12-381-2012" target="_blank">https://doi.org/10.5194/acp-12-381-2012</a>, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib26"><label>Lee et al.(2021)Lee, Fahey, Skowron, Allen, Burkhardt, Chen, Doherty, Freeman, Forster, Fuglestvedt, Gettelman, De León, Lim, Lund, Millar, Owen, Penner, Pitari, Prather, Sausen, and Wilcox</label><mixed-citation>
      
Lee, D., Fahey, D., Skowron, A., Allen, M., Burkhardt, U., Chen, Q., Doherty, S., Freeman, S., Forster, P., Fuglestvedt, J., Gettelman, A., De León, R., Lim, L., Lund, M., Millar, R., Owen, B., Penner, J., Pitari, G., Prather, M., Sausen, R., and Wilcox, L.:
The contribution of global aviation to anthropogenic climate forcing for 2000 to 2018, Atmos. Environ., 244, 117834, <a href="https://doi.org/10.1016/j.atmosenv.2020.117834" target="_blank">https://doi.org/10.1016/j.atmosenv.2020.117834</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib27"><label>Li et al.(2023)Li, Mahnke, Rohs, Bundke, Spelten, Dekoutsidis, Groß, Voigt, Schumann, Petzold, and Krämer</label><mixed-citation>
      
Li, Y., Mahnke, C., Rohs, S., Bundke, U., Spelten, N., Dekoutsidis, G., Groß, S., Voigt, C., Schumann, U., Petzold, A., and Krämer, M.:
Upper-tropospheric slightly ice-subsaturated regions: frequency of occurrence and statistical evidence for the appearance of contrail cirrus, Atmos. Chem. Phys., 23, 2251–2271, <a href="https://doi.org/10.5194/acp-23-2251-2023" target="_blank">https://doi.org/10.5194/acp-23-2251-2023</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib28"><label>Pedregosa et al.(2011)Pedregosa, Varoquaux, Gramfort, Michel, Thirion, Grisel, Blondel, Prettenhofer, Weiss, Dubourg, Vanderplas, Passos, Cournapeau, Brucher, Perrot, and Duchesnay</label><mixed-citation>
      
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., and Duchesnay, E.:
Scikit-learn: Machine Learning in Python, J. Mach. Learn. Res., 12, 2825–2830, 2011.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib29"><label>Petzold et al.(2015)Petzold, Thouret, Gerbig, Zahn, Brenninkmeijer, Gallagher, Hermann, Pontaud, Ziereis, Boulanger, Marshall, Nédélec, Smit, Friess, Flaud, Wahner, Cammas, Volz-Thomas, and Team</label><mixed-citation>
      
Petzold, A., Thouret, V., Gerbig, C., Zahn, A., Brenninkmeijer, C. A. M., Gallagher, M., Hermann, M., Pontaud, M., Ziereis, H., Boulanger, D., Marshall, J., Nédélec, P., Smit, H. G. J., Friess, U., Flaud, J.-M., Wahner, A., Cammas, J.-P., Volz-Thomas, A., and Team, I.:
Global-scale atmosphere monitoring by in-service aircraft – current achievements and future prospects of the European Research Infrastructure IAGOS, Tellus B, 67, 28452, <a href="https://doi.org/10.3402/tellusb.v67.28452" target="_blank">https://doi.org/10.3402/tellusb.v67.28452</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib30"><label>Platt et al.(2024)Platt, Shapiro, Engberg, McCloskey, Geraedts, Sankar, Stettler, Teoh, Schumann, Rohs, Brand, and Van Arsdale</label><mixed-citation>
      
Platt, J. C., Shapiro, M. L., Engberg, Z., McCloskey, K., Geraedts, S., Sankar, T., Stettler, M. E. J., Teoh, R., Schumann, U., Rohs, S., Brand, E., and Van Arsdale, C.:
The effect of uncertainty in humidity and model parameters on the prediction of contrail energy forcing, Environmental Research Communications, 6, 095015, <a href="https://doi.org/10.1088/2515-7620/ad6ee5" target="_blank">https://doi.org/10.1088/2515-7620/ad6ee5</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib31"><label>Reutter et al.(2020)Reutter, Neis, Rohs, and Sauvage</label><mixed-citation>
      
Reutter, P., Neis, P., Rohs, S., and Sauvage, B.:
Ice supersaturated regions: properties and validation of ERA-Interim reanalysis with IAGOS in situ water vapour measurements, Atmos. Chem. Phys., 20, 787–804, <a href="https://doi.org/10.5194/acp-20-787-2020" target="_blank">https://doi.org/10.5194/acp-20-787-2020</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib32"><label>Rolf et al.(2023)Rolf, Rohs, Smit, Krämer, Bozóki, Hofmann, Franke, Maser, Hoor, and Petzold</label><mixed-citation>
      
Rolf, C., Rohs, S., Smit, H., Krämer, M., Bozóki, Z., Hofmann, S., Franke, H., Maser, R., Hoor, P., and Petzold, A.:
Evaluation of compact hygrometers for continuous airborne measurements, Meteorol. Z., 33, <a href="https://doi.org/10.1127/metz/2023/1187" target="_blank">https://doi.org/10.1127/metz/2023/1187</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib33"><label>Rosenow et al.(2018)Rosenow, Fricke, Luchkova et al.</label><mixed-citation>
      
Rosenow, J., Fricke, H., Luchkova, O., and Schultz, M.: Minimizing contrail formation by rerouting around dynamic ice-supersaturated regions, Aeronautics and Aerospace Open Access Journal, 2, 105–111, <a href="https://doi.org/10.15406/aaoaj.2018.02.00039" target="_blank">https://doi.org/10.15406/aaoaj.2018.02.00039</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib34"><label>Schmidt(1941)</label><mixed-citation>
      
Schmidt, E.:
Die Entstehung von Eisnebel aus den Auspuffgasen von Flugmotoren, Eintrag von Ulrich Schumann, zur Sicherstellung des Zugangs zu diesem wissenschaftshistorisch wichtigen Dokument; mit Zustimmung des Rechteinhabers, Nachfolger des Oldenbourg Verlags, 1941.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib35"><label>Schumann(1996)</label><mixed-citation>
      
Schumann, U.:
On conditions for contrail formation from aircraft exhausts, DLR Forschungsbericht DLR-FB 1996-10, Deutsches Zentrum für Luft- und Raumfahrt (DLR), <a href="https://elib.dlr.de/8302/" target="_blank"/> (last access: 31 July 2025), 1996.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib36"><label>Sonntag(1994)</label><mixed-citation>
      
Sonntag, D.:
Advancements in the field of hygrometry, Meteorol. Z., 3, 51–66, <a href="https://doi.org/10.1127/metz/3/1994/51" target="_blank">https://doi.org/10.1127/metz/3/1994/51</a>, 1994.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib37"><label>Spichtinger et al.(2003a)Spichtinger, Gierens, Leiterer, and Dier</label><mixed-citation>
      
Spichtinger, P., Gierens, K., Leiterer, U., and Dier, H.:
Ice supersaturation in the tropopause region over Lindenberg, Germany, Meteorol. Z., 12, 143–156, <a href="https://doi.org/10.1127/0941-2948/2003/0012-0143" target="_blank">https://doi.org/10.1127/0941-2948/2003/0012-0143</a>, 2003a.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib38"><label>Spichtinger et al.(2003b)Spichtinger, Gierens, and Read</label><mixed-citation>
      
Spichtinger, P., Gierens, K., and Read, W.:
The global distribution of ice-supersaturated regions as seen by the Microwave Limb Sounder, Q. J. Roy. Meteor. Soc., 129, 3391–3410, <a href="https://doi.org/10.1256/qj.02.141" target="_blank">https://doi.org/10.1256/qj.02.141</a>, 2003b.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib39"><label>Stenke et al.(2008)Stenke, Grewe, and Ponater</label><mixed-citation>
      
Stenke, A., Grewe, V., and Ponater, M.:
Lagrangian transport of water vapor and cloud water in the ECHAM4 GCM and its impact on the cold bias, Clim. Dynam., 31, 491–506, <a href="https://doi.org/10.1007/S00382-007-0347-5" target="_blank">https://doi.org/10.1007/S00382-007-0347-5</a>, 2008.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib40"><label>Teoh et al.(2022)Teoh, Schumann, Gryspeerdt, Shapiro, Molloy, Koudis, Voigt, and Stettler</label><mixed-citation>
      
Teoh, R., Schumann, U., Gryspeerdt, E., Shapiro, M., Molloy, J., Koudis, G., Voigt, C., and Stettler, M. E. J.:
Aviation contrail climate effects in the North Atlantic from 2016 to 2021, Atmos. Chem. Phys., 22, 10919–10935, <a href="https://doi.org/10.5194/acp-22-10919-2022" target="_blank">https://doi.org/10.5194/acp-22-10919-2022</a>, 2022.


    </mixed-citation></ref-html>
<ref-html id="bib1.bib41"><label>Teoh et al.(2024)Teoh, Engberg, Schumann, Voigt, Shapiro, Rohs, and Stettler</label><mixed-citation>
      
Teoh, R., Engberg, Z., Schumann, U., Voigt, C., Shapiro, M., Rohs, S., and Stettler, M. E. J.:
Global aviation contrail climate effects from 2019 to 2021, Atmos. Chem. Phys., 24, 6071–6093, <a href="https://doi.org/10.5194/acp-24-6071-2024" target="_blank">https://doi.org/10.5194/acp-24-6071-2024</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib42"><label>Wang et al.(2024)Wang, Wolf, Boucher, and Bellouin</label><mixed-citation>
      
Wang, X., Wolf, K., Boucher, O., and Bellouin, N.:
Radiative Effect of Two Contrail Cirrus Outbreaks Over Western Europe Estimated Using Geostationary Satellite Observations and Radiative Transfer Calculations, Geophys. Res. Lett., 51, <a href="https://doi.org/10.1029/2024gl108452" target="_blank">https://doi.org/10.1029/2024gl108452</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib43"><label>Wang et al.(2025)Wang, Bugliaro, Gierens, Hegglin, Rohs, Petzold, Kaufmann, and Voigt</label><mixed-citation>
      
Wang, Z., Bugliaro, L., Gierens, K., Hegglin, M. I., Rohs, S., Petzold, A., Kaufmann, S., and Voigt, C.:
Machine learning for improvement of upper-tropospheric relative humidity in ERA5 weather model data, Atmos. Chem. Phys., 25, 2845–2861, <a href="https://doi.org/10.5194/acp-25-2845-2025" target="_blank">https://doi.org/10.5194/acp-25-2845-2025</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib44"><label>Wolf et al.(2023)Wolf, Bellouin, and Boucher</label><mixed-citation>
      
Wolf, K., Bellouin, N., and Boucher, O.:
Long-term upper-troposphere climatology of potential contrail occurrence over the Paris area derived from radiosonde observations, Atmos. Chem. Phys., 23, 287–309, <a href="https://doi.org/10.5194/acp-23-287-2023" target="_blank">https://doi.org/10.5194/acp-23-287-2023</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib45"><label>Wolf et al.(2025)Wolf, Bellouin, Boucher, Rohs, and Li</label><mixed-citation>
      
Wolf, K., Bellouin, N., Boucher, O., Rohs, S., and Li, Y.:
Correction of ERA5 temperature and relative humidity biases by bivariate quantile mapping for contrail formation analysis, Atmos. Chem. Phys., 25, 157–181, <a href="https://doi.org/10.5194/acp-25-157-2025" target="_blank">https://doi.org/10.5194/acp-25-157-2025</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib46"><label>Zahn et al.(2014)Zahn, Christner, van Velthoven, Rauthe-Schöch, and Brenninkmeijer</label><mixed-citation>
      
Zahn, A., Christner, E., van Velthoven, P. F. J., Rauthe-Schöch, A., and Brenninkmeijer, C. A. M.:
Processes controlling water vapor in the upper troposphere/lowermost stratosphere: An analysis of 8 years of monthly measurements by the IAGOS-CARIBIC observatory, J. Geophys. Res., 119, 11505–11525, <a href="https://doi.org/10.1002/2014JD021687" target="_blank">https://doi.org/10.1002/2014JD021687</a>, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib47"><label>Živković et al.(2009)</label><mixed-citation>
      
Živković, Ž., Mihajlović, I., and Nikolić, D.: Artificial neural network method applied on the nonlinear multivariate problems, Serbian Journal of Management, 4, 143–155, 2009.

    </mixed-citation></ref-html>--></article>
