Technical note: Hybrid machine learning model for bias correction of UTLS relative humidity against IAGOS observations in ERA5 reanalysis

Antonopoulos, Mathieu; Juvin-Quarroz, Jérémie; Boucher, Olivier

doi:10.5194/acp-26-4771-2026

Articles | Volume 26, issue 7

https://doi.org/10.5194/acp-26-4771-2026

© Author(s) 2026. This work is distributed under
the Creative Commons Attribution 4.0 License.


Technical note | 10 Apr 2026

Download

Final revised paper (published on 10 Apr 2026)
Supplement to the final revised paper
Preprint (discussion started on 10 Dec 2025)

Interactive discussion

Status: closed

RC1:
'Comment on egusphere-2025-4529', Anonymous Referee #1, 23 Dec 2025

## Overall
This is a nice technical note expanding on the work of Wang et al 2025. I think the manuscript deserves to be published after improving the clarity of the presentation and considering the major comments below. Ideally the manuscript would be accompanied by example training code in the authors' language of choice.
## Major Comments
- L3 & L46: Many publications on this topic often make some form of the statement "[There are] considerable errors in RH_i estimates" which makes "accurate forecast of ISSRs [difficult]." I'm interested to see more analysis on the type and distribution of errors to better understand how ISSR forecast errors will result in ineffective (or inefficient) avoidance measures. In our experience, RH_i (and ISSRs) have high pointwise error, but overall ISSR regions are (generally) spatially and temporally correlated with ISSR forecasts.

- L253: What are the requirements to support effective contrail avoidance strategies?

- L51: Wang et al 2025 published an ANN humidity correction methodology. This publication adds an XGBoost regression for RH_i < 85%, and a different training/validation data split. Given the similarities, this line deserves a whole paragraph describing the differences from Wang 2025, and how this methodology aims to improve on the previous work.

- L74: What kind of biases in the weather might this domain selection introduce? Have you tested how well your models apply outside this domain?

- L83: Did you consider model levels? It may be worth exploring if the higher vertical resolution would improve your results.

- L117-121: How did you interpolate the values for T and q? Linear interpolation in q introduces bias when working with coarse pressure levels.

- Table 1: Teoh et al 2024 introduced a latitude correction for the humidity correction. Should latitude be a feature?
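
The hybrid scheme discussed in the comments above (a tree-based regressor for RH_i < 85% and an ANN otherwise) can be illustrated with a minimal routing sketch. This is not the authors' code. It assumes that the regime is selected from the uncorrected ERA5 RH_i, which the review does not state, and the names `hybrid_predict`, `toy_dry_model` and `toy_humid_model` are hypothetical stand-ins for the trained XGBoost and ANN models.

```python
import numpy as np

RHI_THRESHOLD = 85.0  # percent; regime boundary quoted in the review


def hybrid_predict(features, rhi_era5, dry_model, humid_model,
                   threshold=RHI_THRESHOLD):
    """Route each sample to the dry or humid regressor and merge the outputs.

    Assumption: the regime is chosen from the uncorrected ERA5 RH_i.
    """
    features = np.asarray(features, dtype=float)
    rhi_era5 = np.asarray(rhi_era5, dtype=float)
    dry = rhi_era5 < threshold
    corrected = np.empty(rhi_era5.shape, dtype=float)
    if dry.any():
        corrected[dry] = dry_model(features[dry])
    if (~dry).any():
        corrected[~dry] = humid_model(features[~dry])
    return corrected


# Hypothetical stand-ins for the trained models (illustration only).
def toy_dry_model(x):
    return x[:, 0] + 2.0   # placeholder for the tree-based regressor


def toy_humid_model(x):
    return x[:, 0] * 1.1   # placeholder for the ANN


rhi = np.array([40.0, 84.9, 85.0, 110.0])
X = rhi.reshape(-1, 1)  # single toy feature: ERA5 RH_i itself
corrected = hybrid_predict(X, rhi, toy_dry_model, toy_humid_model)
```

Any pair of callables with a predict-like interface could be substituted, which is one way to test whether the results generalise beyond a particular gradient boosting library.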
## Minor Comments
- L31: "are spending" -> "spend"

- L33: Suggest using stats from more recent Teoh, R. et al. (2024) “Global aviation contrail climate effects from 2019 to 2021,” Atmospheric Chemistry and Physics, 24(10), pp. 6071–6093. Available at: https://doi.org/10.5194/acp-24-6071-2024.

- L37: It's worth motivating why we need to detect ISSRs. It's presumed that the reader knows "to meteorologically forecast ISSRs with enough accuracy" we need ISSR detections. May want to add context e.g. "Global ISSR forecasts are generally derived from numerical weather forecasting systems, or nowcast from in situ measurements or inferred from remote sensing. Both approaches rely on accurate detections of ISSRs, in the first case to validate models, or in the second through measurements"

- L44: Not just ERA5 - any numerical weather prediction system. I'd flip this around - numerical weather prediction models provide a comprehensive prediction across the global atmosphere. ERA5 is a highly trusted source of numerical weather prediction.

- L45: Define what a dry-bias means

- L48: Other publications with humidity correction: (constant) Schumann, 2012; Schumann et al., 2015; Teoh et al., 2020; Schumann et al., 2021; (piecewise function) Teoh et al 2022; Teoh et al 2024; (quantile mapping) Platt et al 2024

- L55: This sentence sounds like an LLM. I'd move L59-L61 up front, remove this sentence, and then have L57-58. Can you be more specific as to why you chose the hybrid model? From this description it sounds like you used XGBoost for compute performance reasons rather than accuracy.

- L94: How long is the "longer period"?

- L104: Just confirming that IAGOS accuracy is a function of RH_i or of absolute humidity. I had remembered that humidity sensor accuracy was a function of absolute humidity.

- L126: How does this compare to Wang 2025?

- L156: Is it possible the ANN is overfitting these engineered features? You acknowledge the proper data split, but could you use additional data outside the domain to gain confidence?

- L160: This criterion sounds more like "No existing cirrus" rather than "clear sky." Could also look at the IAGOS ice crystal measurements to judge pre-existing cirrus (Petzoldt 2025)

- L170: (Re)Introduce acronym MAE

- L182: Add citation? Where does this baseline come from?

- L186-188: It's not clear to me why "structured input data" ~ drier regimes. It's clearer to me that "high humidity conditions" ~ complex non-linear dependencies.

- L221-222: This is the first clear explanation of why XGBoost is preferable to ANN for the drier regimes. L230 - L233 is also great. Bring this language up front!

- L223: Repeats the previous line

- Table 3 is super helpful - It would be helpful to use this language up front when describing the benefits of the hybrid architecture.

- Table 3, Table 4: How do these results compare with Wolf et al 2025 or Platt et al 2024 (quantile mapping)?

Citation: https://doi.org/10.5194/egusphere-2025-4529-RC1
- AC1: 'Reply on RC1', Jérémie Juvin-Quarroz, 14 Jan 2026
  
  The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2025/egusphere-2025-4529/egusphere-2025-4529-AC1-supplement.pdf
  
  Citation: https://doi.org/10.5194/egusphere-2025-4529-AC1
RC2:
'Comment on egusphere-2025-4529', Anonymous Referee #2, 07 Jan 2026
In this study, the authors build a hybrid machine learning model to correct the bias of relative humidity over ice (RHi) in ERA5 in the UTLS over the North Atlantic. The model consists of an XGBoost regressor in dry conditions and an ANN in more humid regions, with a threshold of 85% RHi. On the test data, the model reduces the dry bias of ERA5 and increases the number of correctly predicted ISSRs.
The paper is well written, with a clear presentation of technical details. The figures and tables effectively illustrate the comparisons.
I have a few additional comments that should be clarified as part of a minor revision before the study is published.
In Sect 2.3 or Fig 2, the ERA5 variables are linearly interpolated to match the mean altitude of the IAGOS data points. This is a reasonable approach to reduce discrepancies between model level and pressure level input data. However, I wonder whether log-pressure linear interpolation (as used in tools like pycontrails) might be more appropriate, since it better represents the vertical structure of the atmosphere.
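
The difference between the two interpolation choices raised here can be sketched with NumPy. The example below is only an illustration: the helper names, the pressure levels (200 and 250 hPa) and the synthetic temperature profile, chosen to be exactly linear in ln(p), are assumptions and are not taken from the manuscript or from pycontrails.

```python
import numpy as np


def interp_linear_p(p_target, p_levels, values):
    """Interpolate linearly in pressure (p_levels must be increasing)."""
    return np.interp(p_target, p_levels, values)


def interp_log_p(p_target, p_levels, values):
    """Interpolate linearly in ln(pressure) (p_levels must be increasing)."""
    return np.interp(np.log(p_target), np.log(p_levels), values)


def synthetic_temperature(p_hpa):
    """Hypothetical profile that is exactly linear in ln(p), in kelvin."""
    return 220.0 + 30.0 * np.log(np.asarray(p_hpa) / 250.0)


p_levels = np.array([200.0, 250.0])          # hPa, two adjacent pressure levels
t_levels = synthetic_temperature(p_levels)   # values known on the levels
p_flight = 225.0                             # hPa, pressure of the observation

t_true = float(synthetic_temperature(p_flight))
t_linp = float(interp_linear_p(p_flight, p_levels, t_levels))
t_logp = float(interp_log_p(p_flight, p_levels, t_levels))

print(f"truth {t_true:.3f} K, linear-p {t_linp:.3f} K, log-p {t_logp:.3f} K")
```

For this idealised profile the log-pressure variant recovers the true value, while interpolation linear in pressure is off by a few tenths of a kelvin between levels 50 hPa apart. Real profiles are not exactly linear in ln(p), so the size of the effect on RHi would need to be checked against the data.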

In Table A1, in addition to XGBoost, LightGBM also performs well in predicting RHi, particularly in terms of MAE and ETS. It would be helpful to include a sentence or two explaining the choice of XGBoost over LightGBM, or whether the results could be generalized to gradient boosting decision trees as a class of models.
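
For readers comparing models on these two metrics, the sketch below shows how they could be computed. It assumes that ETS denotes the equitable threat score and that an ISSR event is defined as RHi >= 100%. Neither assumption is confirmed by the review text, and the sample values are invented for illustration.

```python
import numpy as np


def mean_absolute_error(pred, obs):
    """Mean absolute error between predicted and observed RHi (percent)."""
    return float(np.mean(np.abs(np.asarray(pred) - np.asarray(obs))))


def equitable_threat_score(pred, obs, threshold=100.0):
    """Equitable threat score for the event 'RHi >= threshold'.

    ETS = (hits - hits_random) / (hits + misses + false_alarms - hits_random),
    where hits_random is the number of hits expected by chance.
    """
    pred_event = np.asarray(pred) >= threshold
    obs_event = np.asarray(obs) >= threshold
    hits = np.sum(pred_event & obs_event)
    misses = np.sum(~pred_event & obs_event)
    false_alarms = np.sum(pred_event & ~obs_event)
    n = pred_event.size
    hits_random = (hits + misses) * (hits + false_alarms) / n
    denom = hits + misses + false_alarms - hits_random
    return float((hits - hits_random) / denom) if denom != 0 else float("nan")


# Invented sample values (percent RHi), for illustration only.
obs = np.array([90.0, 105.0, 110.0, 70.0, 101.0, 60.0])
pred = np.array([95.0, 102.0, 98.0, 75.0, 103.0, 101.0])

mae_value = mean_absolute_error(pred, obs)
ets_value = equitable_threat_score(pred, obs)
```

Because MAE rewards pointwise accuracy while ETS rewards correct event detection relative to chance, two gradient boosting libraries can rank differently on each, which is why reporting both is useful when justifying the model choice.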

Other comments:
The definition of contrail formation and persistence could be clarified. Contrails form when hot and humid jet engine exhaust mixes with the cold ambient atmosphere, resulting in local liquid saturation within the plume. Contrails persist when the surrounding air is ice supersaturated, i.e. within ISSRs, allowing the ice crystals to grow and spread into contrail-induced cirrus. This clarification relates to the first sentence of the abstract and Lines 35–36.

The description of Fig 4a as the baseline ERA5 reanalysis could be introduced earlier

Line 26: give the full name of MOZAIC

Line 70: the measurements of RHi are not direct; they are calculated from temperature and water vapor measurements
Citation: https://doi.org/10.5194/egusphere-2025-4529-RC2
- AC2: 'Reply on RC2', Jérémie Juvin-Quarroz, 14 Jan 2026
  
  The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2025/egusphere-2025-4529/egusphere-2025-4529-AC2-supplement.pdf
  
  Citation: https://doi.org/10.5194/egusphere-2025-4529-AC2

Peer review completion

AR – Author's response | RR – Referee report | ED – Editor decision | EF – Editorial file upload

AR by Jérémie Juvin-Quarroz on behalf of the Authors (22 Jan 2026) Author's response Author's tracked changes Manuscript

EF by Polina Shvedko (22 Jan 2026) Supplement

ED: Referee Nomination & Report Request started (24 Feb 2026) by Franziska Aemisegger

RR by Anonymous Referee #1 (07 Mar 2026)

ED: Publish subject to technical corrections (25 Mar 2026) by Franziska Aemisegger

AR by Jérémie Juvin-Quarroz on behalf of the Authors (30 Mar 2026) Manuscript

Short summary

Aviation impacts climate by forming contrails that trap heat and can persist for hours at cruising altitudes. Forecasting these humid regions is difficult, as satellites lack accuracy, aircraft data are limited, and ERA5 reanalysis has random errors. This study presents a hybrid machine learning method that corrects ERA5 with aircraft data, using decision trees in dry air and neural networks in humid air. It improves relative humidity predictions, especially in the lower stratosphere.
