the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Evaluation of upper-tropospheric lower-stratospheric properties over the Asian monsoon region in a storm-resolving model
Aiko Voigt
Edgardo Sepúlveda Araya
Silvia Bucci
Annette Miltenberger
Meredith K. Kupinski
Christian Rolf
Martina Krämer
The structure of the tropical upper troposphere-lower stratosphere (UTLS) affects radiative balance, stability, and regional dynamics in important ways. Lack of reliable observational baselines poses a challenge to evaluating model representation of UTLS properties. Here, we use in-situ data, primarily from the StratoClim field campaign over the Asian Monsoon area, to assess the UTLS temperature, moisture, and ice clouds in the Icosahedral Nonhydrostatic (ICON) model at storm-resolving grid spacing. We also employ superpressure balloon data and updrafts of the POSIDON and ATTREX campaigns to evaluate the UTLS convective updrafts and gravity wave activity in ICON. Our simulations show the upper troposphere is too cold, while the lower stratosphere is too warm and excessively dry relative to observations. These thermodynamic biases coincide with overestimated cloud ice in the upper troposphere and underestimated cloud ice in the lower stratosphere. The mean convective updraft is underestimated by 80 % in the model, and the power spectral density for temperature fluctuations of frequencies greater than 10−3 s−1 is underestimated by orders of magnitude. Too weak dynamics exacerbate a lack of ice cloud above 100 hPa. Too weak and too infrequent convective overshoots or too rapid dissipation of upper-tropospheric ice clouds in the model are two possible explanatory mechanisms for these biases.
- Article
(5226 KB) - Full-text XML
- BibTeX
- EndNote
In the tropics, atmospheric properties transition from those of the troposphere to those of the stratosphere over a few kilometers in the upper-troposphere lower-stratosphere (UTLS). Accurate representation of UTLS thermodynamics and cloud structure is critical for several reasons. First, warmer minimum temperatures in the UTLS allow more moisture to enter the stratosphere, since higher temperatures raise the minimum saturation vapor pressure of ascending air. Stratospheric moisture can, in turn, intensify the warming from carbon dioxide emissions by 0.1–0.3 because of the strong infrared absorption of water vapor (Banerjee et al., 2019; Dessler et al., 2013). This feedback can alter globally-averaged surface temperatures nonlinearly, for example with a 10 % decrease in stratospheric water vapor leading to a 25 % decrease in the surface warming rate (Solomon et al., 2010). Second, UTLS stability determines characteristics of convective overshoots and anvil cloud outflow from convection. Sharper gradients in potential temperature correspond to greater stability and suppress overshoot occurrence and strength, as well as anvil extent. Finally, the radiative properties of UTLS moisture and clouds affect dynamics. For example, increased stratospheric moisture can cause poleward shifts in the storm tracks due to differential radiative effects between the midlatitudes and tropics (Maycock et al., 2013; Charlesworth et al., 2023). Incorrect representation of UTLS ice clouds can also disrupt the description of feedbacks between these clouds and gravity wave-generated turbulence (Podglajen et al., 2017).
Existing evaluation studies find pervasive but inconsistent UTLS biases across models. For example, in regard to temperature biases, a comparison of 13 different chemistry-climate models found a 10 K spread in tropical tropopause temperatures (Gettelman et al., 2009). The CMIP5 models exhibited too-warm anomalies near the tropopause (Kim et al., 2013), while too-cold anomalies have been documented near the tropopause for the Icosahedral Nonhydrostatic (ICON) model used here (Crueger et al., 2018). More recently, Merlis et al. (2024) evaluated changes in the vertical distribution of temperature using both CMIP6 global climate models (GCMs) and the X-SHiELD GSRM under +4 K sea surface temperature or 4xCO2 perturbations. Already in the control climatology, the average tropopause temperature bias in X-SHiELD was ∼6 K, 4 K higher than that in the CMIP6 GCMs. In regard to moisture structure, Gettelman et al. (2010) reported too much moisture above the cold point across 18 chemistry-climate models. More recently, Ploeger et al. (2024) documented too much water vapor over the Pacific UTLS across many CMIP6 models. UTLS ice cloud biases also exist; for example, nudged GSRMs robustly overestimate upper-tropospheric cloud fraction and underestimate its altitude (Atlas et al., 2024).
Importantly, anomalies in UTLS moisture are often largest for the Asian monsoon area (AMA) (e.g., Schoeberl et al., 2013; Ploeger et al., 2024), and multiple factors complicate the realistic model representation of UTLS thermodynamic and cloud structure in this region. Cloud overlap complicates the vertical profiles of UTLS radiative fluxes and heating (Johansson et al., 2019). The fine-scale features of UTLS moisture and temperature are not well-captured by the limited vertical grid spacing of most models (e.g., Marécal et al., 2007; Wang et al., 2019), and validating measurements are often limited for this region (e.g., Gettelman et al., 2010). In regard to this last challenge, flying above 15 km poses a challenge for many aircraft, given the very low temperatures and pressures there. The water vapor mixing ratios of the tropical UTLS are very low, requiring high instrument accuracy (Singer et al., 2022). Reanalysis data do not reliably capture the UTLS structure, with different reanalyses predicting variations in tropopause height of 100–300 m and in tropopause temperature of ∼1.5 K, despite improvements over time (Hoffmann and Spang, 2022). Known biases existed in the representation of the UTLS region in the ERA-5 reanalysis, including underestimates of lower stratospheric temperature and overestimates of lower stratospheric moisture due to overactive mixing (e.g., Krüger et al., 2022; Simmons et al., 2020). Several challenges exist also for satellite measurements: Microwave sounders lack vertical resolution, while moisture retrieval from radio occultation requires additional constraints and infrared sounders are susceptible to cloud contamination.
In this study, we make use of a unique in-situ dataset of UTLS properties to evaluate their representation in the ICON model at storm-resolving grid spacing (Δx=2.5 km). Dauhut and Hohenegger (2022) have evaluated the UTLS moisture budget in a global storm-resolving model (SRM) for the first time, but most recent studies evaluating UTLS structure or chemistry still use coarse grid spacing (e.g., Smith et al., 2024; Cohen et al., 2025). Exceptions are studies of Nugent et al. (2022) and Turbeville et al. (2022) which assess tropical ice cloud properties across several SRMs that participated in the DYAMOND model intercomparison against satellite data. We build upon these studies but consider multiple aspects of UTLS structure – thermodynamic and dynamic fields, as well as cloud properties – in the ICON model. We also use in-situ measurements of the Stratospheric and upper tropospheric processes for better climate prediction (StratoClim) and Strateole-2 field campaigns, rather than satellite data, for our evaluation. StratoClim took place out of Kathmandu, Nepal in August 2017 and measured temperature, specific humidity, trace gas concentrations, and cloud and aerosol particle size distributions in the AMA UTLS, while the first campaign of Strateole-2 released several long-duration superpressure balloons from the Seychelles in February 2020. We finally use vertical velocity measurements of the POSIDON campaign in October 2016 and the ATTREX campaign in early 2014, both over the west Pacific and detailed in Sect. 2.1. With these data, we investigate how well (or not) ICON can represent temperature, moisture, vertical velocity, and ice cloud structures in the AMA UTLS.
2.1 Field Campaign Measurements
We compare our synthetic flight tracks to measurements from the M55 Geophysica aircraft during StratoClim Flight 7, which observed interesting UTLS moisture and ice cloud structure. StratoClim had the broad goal of better understanding dynamics, microphysics, and chemistry in the upper troposphere lower stratosphere (UTLS) of the Asian monsoon region (http://www.stratoclim.org, last access: 27 January 2026). Flight 7 took place from 04:30 to 06:50 UTC on 8 August 2017 between Kathmandu and west Bengal (red trace in Fig. 1a). Two instruments aboard the Geophysica measured water vapor concentrations: the Fluorescent Lyman-α Stratospheric Hygrometer for Aircraft (FLASH-A) and the Fast In-situ Stratospheric Hygrometer (FISH) (Khaykin et al., 2013; Meyer et al., 2015). Ice water content is then calculated as the difference of total water (both gas-phase and evaporated ice crystals) measured by FISH and gas-phase water measured by FLASH-A (Afchine et al., 2018). Measurement frequency was 1 Hz with an uncertainty of 6 % for FLASH-A measurements and 6 %–8 % for FISH measurements (Lee et al., 2019). To prevent measurement contamination by very moist tropospheric air, FLASH-A only measures at pressures below 250 hPa and FISH only measures below 350–400 hPa. When FISH and FLASH-A measurements were not simultaneously available, ice water content was inferred from particle size distribution measurements of the NIXE-CAPS cloud spectrometer (Krämer et al., 2020).
Figure 1Simulation domain over the Asian monsoon region (a) with StratoClim Flight 7 track shown in red. Gray: Microwave Limb Sounder swath; pink stars: radiosonde locations; orange box: subdomain around Flight 7, 80–90° E and 19–30° N. Illustration of online trajectories in ICON, 2.3.0 over our simulation domain colored by trajectory height (b). Yellow box: patch over which trajectories initiate.
Vertical velocities were not measured during StratoClim, so we compare our simulated ascent rates to those from the Pacific Oxidants, Sulfur, Ice, Dehydration, and cONvection (POSIDON) campaign in October 2016 and the Airborne Tropical TRopopause EXperiment (ATTREX) in January and February 2014, both based out of Guam (Jensen et al., 2017). POSIDON and ATTREX both studied the western Pacific tropopause moisture and chemical composition and measured vertical velocities with a Meteorological Measurement System (MMS), with POSIDON sampling more often near deep convection. Global statistics of vertical velocity are only beginning to emerge with novel retrievals and remote sensing techniques (e.g., Dolan et al., 2023; Poujol and Bony, 2024), so we rely on in-situ data as geographically close as possible for our evaluation. Finally, superpressure balloon measurements of altitude during the first field campaign of Strateole-2 are used to construct and evaluate power spectra of temperature and vertical velocity (Podglajen et al., 2016; Haase et al., 2018; Corcos et al., 2021). Strateole-2 balloons were launched from the Seychelles and floated around the Equator in a Lagrangian sense for ∼80 d with a goal of determining chemical and momentum fluxes in the tropical tropopause layer. We map fluctuations in altitude at a given time i to those in vertical velocity using the 30 s measurement frequency and to those in temperature assuming dry adiabaticity (). A Butterworth filter is used to filter the altitude fluctuations with periods greater than 15 min.
2.2 Other Observations and Reanalysis
We also assess the ICON model output against several other reanalysis and observational datasets. We use temperature, specific humidity, and cloud ice mass mixing ratio of the ERA-5 reanalysis of the European Centre for Medium-Range Weather Forecasting at 0.25° resolution and hourly frequency (Hersbach et al., 2020). Then we use Aura Level 2 Water Vapor (H2O) Mixing Ratio V005 (ML2H2O) and Temperature V005 (ML2T) products of the Microwave Limb Sounder (MLS) (Waters et al., 2006) with a single overpass transecting our subdomain of interest on 8 August 2017 (Fig. 1a, gray). MLS averaging kernels are used in comparing MLS data to in-situ measurements. We also download radiosonde data from the University of Wyoming Atmospheric Soundings database on 8 August 2017 at 00:00 UTC preceding Flight 7 (Sondes 42379, 42492, 42701, 42867, 42809, and 42361; Fig. 1a, pink stars). As for the synthetic flight tracks, we calculate vertical profiles of observational differences by binning the in-situ data into the pressure levels of ERA-5, MLS, or the sondes and taking statistics over those binned values.
2.3 ICON Simulations
We run six simulations using the ICON model, version 2.6.4, and various couplings of its cloud microphysics and optics (Table 1) (Giorgetta et al., 2018). Both the one- and two-moment microphysics of Doms et al. (2005) and Seifert and Beheng (2006), respectively, are coupled to a hierarchy of three optical schemes in the ecRad radiative transfer scheme (Sullivan, 2026a). The Fu (1996) and Fu et al. (1998) schemes are the simplest schemes and the default option in ICON. They assume all ice crystals are hexagonal with sizes derived from in-situ measurements. Then the Baran et al. (2016) scheme includes hexagonal ice columns with unit aspect ratio, along with “3-, 5-, 8- and 10-branched hexagonal ice aggregates” with a description of ice crystal surface roughness for shortwave radiative properties. Optical properties depend on temperature, as a proxy for ice crystal habit (which itself is highly temperature dependent). Lastly, Yi et al. (2013) include “droxtals, plates, solid and hollow columns, solid and hollow bullet rosettes, an aggregate of solid columns, and a small/large aggregate of plate” for a total of 9 different habits, again with a description of ice crystal surface roughness. Sepúlveda Araya et al. (2025) document optical parameters and behavior of these schemes in greater detail.
(Doms et al., 2005)(Seifert and Beheng, 2006)Fu (1996)Fu et al. (1998)Baran et al. (2016)Yi et al. (2013)For all possible combinations of microphysics and optics, ICON is run over 4 d from 5 August 2017 12:00 UTC to 9 August 2017 12:00 UTC for an Asian monsoon domain from 55 to 115° E and from 5° S to 40° N. We use a 24 s timestep, 75 vertical levels, and 2.5 km-equivalent resolution (R2B10 icosahedral grid), so that deep convection is explicit. Shallow convection remains parameterized with the Nordeng (1994) scheme. The Integrated Forecast System (IFS) analysis data of the European Center for Medium-Range Weather Forecast (ECMWF) provides initial and 3-hourly boundary conditions. All trace gas mixing ratios are set to the annual global means from the input4MIPS project. No output is generated until a restart file at 8 August 2017 03:00 UTC; thereafter, both 2D and 3D fields are generated at 10 min frequency until 8 August 2017 07:00 UTC for a subdomain extending from 19 to 30° N and from 80 to 90° E (Fig. 1a). This timing and subdomain correspond to Flight 7 of the StratoClim field campaign (Fig. 1a, red).
2.4 ICON Output in Lagrangian Formats
2.4.1 Synthetic Flight Tracks
An ensemble of synthetic flight tracks is generated from the 10 min frequency ICON output to perform model-measurement comparison. We define a domain of around the StratoClim flight position and randomly sample 20 values with replacement from within this domain for the time and pressure level closest to the aircraft. We continue this random sampling at every time point along the actual flight track to create 20 “flight tracks” from model output. This sampling strategy accounts for possible spatiotemporal offsets between simulated and observed features. We test how different the 20 synthetic flight tracks are relative to values/biases evaluated over a small subdomain around the Flight 7 track (Fig. 1a, orange). We calculate vertical profiles of model bias using the synthetic and actual flight tracks, binning the data into the pressure levels of the model and taking statistics over those binned values. Comparisons of vertical profiles focus on the times between 06:20 and 06:48 UTC on 8 August 2017 as in Lee et al. (2019), since the aircraft performs a continuous descent during this period, sampling the full altitudinal range.
2.4.2 Online Trajectories
A trajectory module was also implemented into ICON, version 2.3.0, as detailed in Miltenberger et al. (2020). These online trajectories use an implicit Euler scheme and the resolved wind fields of the simulation to calculate kinematic airmass trajectories. Calculations are performed on the native ICON grid at the physics timestep of the model and interpolation of wind field and trace variables to the trajectory position uses triangular horizontal and linear vertical interpolation. At every half level between 8 and 22 km and at 09:00 UTC on 6 August 2017, we initiate trajectories at all grid cell centers between 30 and 32° N and between 108.5 and 110.5° E in the east of our simulation domain (Fig. 1b, yellow). This region was the source of overshooting convection that moistened the lower stratosphere, according to the study of Lee et al. (2019). We allow the online trajectories to propagate over 51 h with the 24 s timestep of the model, as they flow westward toward the region where our observations are located (Fig. 1a). We use the temperature and vertical winds along these trajectories to compare to superpressure balloon measurements in a statistical sense. We show results only of a 2M0O setup, denoted 2M0Ot, for these online trajectories (Table 1). All scripts to reproduce figures from the ICON simulations and observational data are available from Sullivan (2026b).
Figure 2Temperature along the StratoClim Flight 7 track and the average temperature across the 20 synthetic flight tracks from the ICON 1M0O simulation (a). Temperature biases along the synthetic flight tracks relative to the in-situ measurements (b). The temperature bias of the “best trajectory” for which bias is minimized most often is overlaid in gray. Standard deviation in temperature over the 20 synthetic flight tracks (c). A period of continuous descent from 06:20–06:48 UTC is shaded in gray for all panels.
3.1 Evaluating UTLS Temperature
We first visualize the temperature over time of Flight 7 and our synthetic flight tracks (Fig. 2). This comparison helps to validate that the simulated values have been correctly extracted. Agreement is reasonable with 65 % of the temperature biases within ±2 K and 82 % within ±3 K; however, 10 % of the temperature biases remain larger than ±5 K. Temperature biases show a strong negative correlation with observed temperature (, ), with the strongest relationships occurring below 195 K (, ). All biases greater than ±5 K are associated with observed temperatures colder than 200 K. Along the flight track, these largest biases occur between 05:30 and 06:30 UTC, when the aircraft is flying at the highest altitudes.
To understand spatial persistence of these biases, we examine the “best” synthetic flight track, the one for which the bias is most often minimized (Fig. 2b, gray). Again, for this best track, simulated values are warm-biased 85 % of the time for temperatures colder than 200 K. As another metric of evaluation robustness, we plot the standard deviation in temperature (σ(T)) across the synthetic flight tracks (Fig. 2c). Seventy five percent of the time, σ(T) is less than 25 % as large as the bias. σ(T) also correlates only weakly with the aircraft descent or ascent, as quantified in (, ; not shown), and with temperature itself (r=0.13, ). Our temperature evaluation is thus robust across space and the synthetic flight tracks.
The vertical distribution of UTLS temperature, including tropopause position, is crucial for moisture transport into the stratosphere and overshoot occurrence, as mentioned earlier. Vertical profiles of T and θ have smaller, negative biases at pressures greater than 110 hPa and larger, positives ones at pressures less than 110 hPa (Fig. 3a and b). The maximum temperature bias at pressures greater than 110 hPa (+3.5 K) is 2.5 times larger in magnitude than that at pressures less than 110 hPa (−1.3 K). Profiles of temperature bias are qualitatively consistent if we average over a small domain around Flight 7, rather than along the synthetic flight tracks (Fig. 3c and d). The vertical differences in temperature are magnified for potential temperature (θ) at low pressures: The max bias in θ at pressures less than 110 hPa (+10.0 K) is 4 times larger in magnitude than that at pressures greater than 100 hPa (−2.5 K). Inter-simulation differences are modest with temperature biases varying by at most 0.8 K, and we discuss these more thoroughly in Sect. 3.2.
Figure 3ICON simulations are too cold in the upper troposphere and too warm in the lower stratosphere. Comparisons of in-situ temperature T and potential temperature θ profiles with those from 20 synthetic flight tracks in the 6 ICON simulations for the period of aircraft descent from 06:20–06:48 UTC (a). Profiles of T and θ biases relative to in-situ values (b). For these panels, T is shown in circles and solid lines, and θ is shown in triangles and dashed lines. Comparisons of in-situ T to those from the Microwave Limb Sounder (MLS) with separate swaths in thin red-orange lines and the closest swath to the Flight 7 track in bold red, the ERA-5 reanalysis in purple, and sondes launched nearby with individual stations in blue dashed lines and their average in solid blue-green (c). ICON simulation profiles in panel (c) are shown averaged over the entire small subdomain around Flight 7, rather than from synthetic trajectories. Profile of T differences relative to in-situ values for all data in panel (c) (d).
Lastly, we examine robustness of the temperature biases to other observational baselines (Fig. 3c and d). The ERA-5 reanalysis agrees well with the in-situ measured temperatures except at the highest vertical level in the stratosphere, where it predicts a warmer temperature in line with ICON. The sondes near Flight 7 also tend to register warmer temperatures than the in-situ measurements at pressures lower than 100 hPa; for pressures greater than 100 hPa, the sonde-in-situ differences are noisy. The Microwave Limb Sounder (MLS) records 8 swaths in the small subdomain around the Flight 7 track, and several of these show much colder upper tropospheric temperatures than any of the other datasets (Fig. 3c, thin red lines). Even the swath geographically closest to the flight track is consistently colder than the in-situ measurements after application of MLS averaging kernels. If we had used MLS as our baseline, we would then report a much larger stratospheric warm bias and no upper tropospheric cold bias. This discrepancy points to the difficulty of establishing an observational truth in evaluation of UTLS structure.
3.2 Evaluating UTLS Moisture
We next consider moisture structure with a time series of specific humidity (qv) from the synthetic and actual flight tracks (Fig. 4). The FLASH-A instrument only measures at pressures below 250 hPa for about 2 h from 04:35 to 06:30 UTC to avoid contamination by the abundant moisture at lower altitudes. Again, the synthetic flight tracks generally match the behavior of the in-situ data. A relative bias makes more sense than an absolute one in this case, as qv spans a huge range of values. Normalizing by the observation, 24 % of time points fall within ±10 % of the observed qv, 58 % fall within ±20 %, and 99 % fall within ±50 %, relative to a 6 % measurement uncertainty for FLASH-A (Fig. 4b).
Figure 4Specific humidity along the StratoClim Flight 7 track and averaged from the ICON 1M0O simulation synthetic flight tracks over time (a). Specific humidity biases along the synthetic flight tracks relative to the in-situ measurements (b). The specific humidity bias of the “best trajectory” for which bias is minimized most often is overlaid in gray and measurement uncertainty of 6 % is shown in red. Standard deviation in specific humidity over the 20 synthetic flight tracks (c). A period of continuous descent from 06:20–06:48 UTC is shaded in gray for all panels.
Dry biases are two times more frequent than moist ones. The dry biases correlate negatively with qv (, ) and the moist biases correlate positively (r=0.22, ), meaning that the model makes the drier locations too dry and the moister ones too moist. The worst dry biases consistently occur for qv values from 5 to 10 ppmv, reinforcing this point. The qv biases do not correlate strongly with observed temperature or aircraft ascent/descent () but are similar to those in temperature in that their general behavior is not track-specific. Examining the qv biases from the “best-performing track”, we find more inter-track differences for qv than for T but that these remain small relative to the magnitude of the bias itself (Fig. 4b). The standard deviation in qv across the trajectories reinforces this point (Fig. 4c): σ(qv) is almost always less than 10 %, while only a quarter of the qv biases are within ±10 %, as noted above.
In vertical profiles of qv, ICON captures the observed moisture fairly well at pressures lower than 160 hPa, given its limited vertical resolution (Fig. 5a and b); however, it does miss anomalously high qv around 80 hPa, a “hydration patch” traced back to convective overshoots over the Sichuan basin 36 h prior by Lee et al. (2019). qv increases almost twofold over 10 hPa in the observations (from 3.3 to 5.8 ppmv between 87 and 78 hPa), whereas it increases only by 20 % over 20 hPa in the model (from 4.1 to 5.2 ppmv between 94.5 and 75.2 hPa). In contrast, ICON overestimates moisture in two upper-tropospheric layers from 140–120 hPa and from 200–160 hPa. As for temperature, these qv biases remain if we average over the Flight 7 subdomain, rather than along the synthetic flight tracks (Fig. 5c and d). The ERA-5 reanalysis shows similar qv behavior to ICON, reflecting that the hydration patch aloft and dryness below are both quite localized vertically (Fig. 5c and d). The kernel-averaged MLS profiles somewhat ∼20 % more moisture below 140 hPa and 20 % less above. These inter-measurement differences again point to the difficulty not only in simulating UTLS structure accurately but also in observing it accurately.
Figure 5ICON struggles to capture localized hydration aloft and dryness below. Comparisons of in-situ specific humidity qv profiles with those from the ICON synthetic flight tracks for the period of aircraft descent from 06:20–06:48 UTC (a). Profiles of qv biases relative to in-situ values, measurement uncertainty in dashed red (b). Comparisons of in-situ qv to those from the Microwave Limb Sounder (MLS) with separate swaths in thin red-orange lines and the closest swath to the Flight 7 track in bold red, the ERA-5 reanalysis in purple, and sondes launched nearby with individual stations in blue dashed lines and their average in solid blue-green (c). ICON simulation profiles in panel (c) are shown averaged over the entire small subdomain around Flight 7, rather than from synthetic trajectories. Profile of qv differences relative to in-situ values for all data in panel (c), measurement uncertainty in dashed red (d).
The vertical profiles suggest covariance between the biases in temperature and specific humidity, and we visualize this more clearly in joint distributions, constructed for all of the flight track above 14 km, not only the descent period from 06:20–06:48 UTC (Fig. 6). UTLS biases are most often too warm and too dry and rarely too cold and too moist. We understand this result to mean that the behavior in ICON above 120 hPa – with overestimated T and underestimated qv – is the most widespread. The distributions also show that the two-moment simulations are more sensitive to the optics scheme used, as covariance of biases shifts more for these runs than the one-moment ones. We highlight this point again in Sect. 4.
Figure 6Too-warm and too-dry biases are most common for the upper troposphere-lower stratosphere in ICON. Joint distributions of the absolute bias in temperature (ΔT) and the relative bias in specific humidity (Δqv) for all of the in-situ and synthetic flight tracks above 14 km. The six panels correspond to the six simulations listed in Table 1. Four quadrants are denoted in panel (a).
3.3 Evaluating UTLS Vertical Velocities
To evaluate the UTLS ascent rates in ICON, we turn to observations of the ATTREX and POSIDON campaigns, as w was not measured during StratoClim (see Sect. 2.1). The evaluations can only be statistical then, with probability distributions of updraft and downdraft velocities between 200 and 70 hPa (Fig. 7a and b). The mean observed updraft speed is 0.40 m s−1 from POSIDON and 0.20 m s−1 from ATTREX with a standard deviation in these speeds of 0.31 and 0.17 m s−1, respectively. POSIDON sampled close to deep convection more often, yielding its stronger and more variable speeds. Simulated updraft velocities are slower with a mean value of 0.077 m s−1 and have less spread with a standard deviation of 0.11 m s−1. The same is true for downdrafts: While ATTREX and POSIDON record mean downdraft speeds of −0.20 and −0.40 m s−1 and standard deviations of 0.45 and 0.16 m s−1, downdrafts in the ICON simulations have an average speed of −0.073 m s−1 and spread of 0.078 m s−1. Inter-simulation differences in the updraft statistics are limited with the one-moment simulations predicting updraft speeds 12 % faster and downdraft speeds 6.5 % faster.
Figure 7Strong vertical motions and high-frequency temperature and vertical velocity fluctuations are underestimated in the ICON simulation. Probability distributions of updraft (a) and downdraft (b) velocities between 200 and 70 hPa from the six ICON simulations (colored traces), relative to those of the POSIDON (gray) and ATTREX (black) field campaigns in the west Pacific tropical tropopause. Power spectra for temperature for the ICON 2M0O trajectories, both with (magenta) and without (orange) temperature fluctuations, and relative to the Strateole-2 superpressure balloon measurements (panel c). Power spectra for updraft velocities in both ICON 2M0O and Strateole-2 data (d). The dashed gray lines in panels (c) and (d) indicate the upper bound in resolvable frequencies at 2.5 km grid spacing, assuming a gravity wave speed ∼𝒪(10 m s−1).
Along with convective ascent, upper-tropospheric gravity waves generate temperature and moisture fluctuations that influence ice formation (e.g., Potter and Holton, 1995; Podglajen and Plougonven, 2018). The Lagrangian format of the online ICON trajectories allows us to assess the simulated gravity wave spectrum in the model (Fig. 1b). As noted in Sect. 2.4, these online trajectories come from a Flight 7 simulation in ICON version 2.3.0, whereas other evaluations use an identical setup in ICON version 2.6.4. The same dynamical core, two-moment microphysics, Fu optics, and initial and boundary conditions are used across these two versions, and the online trajectories are not yet implemented in the more recent version. We expect that qualitative and approximate quantitative results from the version 2.3.0 trajectories would hold also in version 2.6.4.
With a horizontal resolution of ∼2 km, we only expect to resolve waves of 8 km wavelength or larger (Fig. 7, gray dashed line). Indeed, in power spectra of temperature and vertical velocity fluctuations along the ICON trajectories, we see dramatic underestimations of high-frequency power despite their 24 s output frequency (Fig. 7c and d). Relative to the power spectra of Strateole-2 measurements, ICON underestimates the power for frequencies on the order of 10−2 s−1 by almost four orders of magnitude in both T, reflective of lower-frequency gravity waves, and w, reflective of higher-frequency gravity waves. Similar magnitude underestimations have been found between spectra of kinetic and potential energy from reanalysis versus balloon measurements (Podglajen et al., 2020), as well as between spectra simulated in the COSMO-Model at 2 km resolution versus aircraft observations (Kienast-Sjögren et al., 2015). More recently, the review of Achatz et al. (2024) has also discussed the inability of even kilometer-scale simulations to fully resolve the gravity wave spectrum due to omission of individual convective cells and interactions of gravity waves with turbulence.
Given this large underestimate of high-frequency motions in the ICON simulations, we activate a parameterization of temperature fluctuations, derived from measurements of another superpressure balloon campaign, PreConcordiasi (Podglajen et al., 2016):
where the altitudinal displacement from one time step i to the next i+1 is determined by a white noise process W associated with vertical velocity fluctuations with an observationally derived variance of 0.16 m s−1. A high-pass filter removes any fluctuations with frequency lower than 2 d, and altitudinal perturbations are mapped to temperature ones assuming dry adiabaticity. Inclusion of this parameterized temperature variability lifts the high-frequency tail ( s−1) of the temperature power spectrum by about two orders of magnitude from 10−4 up to 10−2 K2 s−1, so that the decrease in power with frequency aligns with the −2-slope of the Strateole-2 measurements (Fig. 7c). From this last analysis, we conclude that balloon-informed parameterizations of gravity wave activity could improve the UTLS realism of even kilometer-scale simulations.
3.4 Evaluating UTLS Ice Clouds
We lastly consider ice cloud formation along the real and synthetic flight tracks (Fig. 8). We mask instances of negligible qi (defined as qi<0.001 ppmv), so that the qi time series are discontinuous but with a large amount of time spent sampling within cirrus nonetheless: 50.3 % in the simulation tracks and 50.1 % in the real one. Although this in-cloud duration agrees well, the timing and magnitude of in-cloud qi differ. Ice clouds tend to be predicted but not observed when the aircraft measures warmer temperatures and higher specific humidities and the model underestimates T and overestimates qv. The model cold bias is the strongest driver of these overestimates in ice cloud occurrence, as quantified by a t-statistic between distributions for the full qi series and masked for periods of qi overestimate (, ). When ice clouds are observed but not predicted, conditions are just the opposite – temperatures are colder, specific humidities are lower, and the model exhibits overestimates in T and underestimates in qv. The model warm bias is again the strongest driver of the underestimates in ice cloud occurrence (tΔT=37.5, ).
Figure 8Ice mass mixing ratio along the StratoClim Flight 7 track and averaged from the ICON 1M0O simulation synthetic flight tracks over time (a). Ice mass mixing ratio biases along the synthetic flight tracks relative to the in-situ measurements (b). The ice mass mixing ratio bias of the “best trajectory” for which bias is minimized most often is overlaid in gray. Standard deviation in ice mass mixing ratio over the 20 synthetic flight tracks (c). A period of continuous descent from 06:20–06:48 UTC is shaded in gray for all panels.
Vertical profiles of qi corroborate this picture in which ice cloud is underestimated for warm-dry-biased pressure levels above 100 hPa and overestimated for cold-moist-biased pressure levels below 160 hPa (Fig. 9). The too-warm and too-dry conditions in the lower stratosphere are consistent with the lack of ice clouds there and suggest that the overshooting convection at the root of moisture and ice aloft during Flight 7 could be underrepresented in ICON (Lee et al., 2019). The aircraft also samples almost no ice clouds at pressures greater than 160 hPa, while the model simulations predict qi from 10–100 ppmv there. High-frequency gravity waves generally act to amplify, rather than quench, ice crystal formation near the tropical tropopause (Jensen et al., 2016), so the underestimation of strong updrafts and gravity wave activity likely suppress cloud ice aloft.
Figure 9Ice cloud occurrence is overestimated in the upper troposphere and underestimated in the lower stratosphere. Comparisons of in-situ ice mass mixing ratio qi profiles with those from the ICON synthetic flight tracks for the period of aircraft descent from 06:20–06:48 UTC (a). Profiles of qi biases relative to in-situ values (b). Mean qi profiles for the 1M0O and 2M0O simulations in circles and solid lines and median qi profiles in triangles and dashed lines (c). Profiles taken over the whole subdomain around the Flight 7 track (Fig. 1a, orange) are shown in squares and solid lines.
The spread in qi for these altitudes is the largest inter-track difference. Qualitative behavior is the same across runs but a quantitative hierarchy emerges in which all the one-moment simulations predict larger qi than the two-moment ones. The simulations that employ the Yi et al. (2013) ice optical scheme with hexagonal ice aggregates (*M3O runs) bound the range of qi. Interestingly then, our synthetic flight tracks can be approximately interpreted as Lagrangian piggybacking in which thermodynamic inputs are held constant while cloud physics and optics change (Sullivan et al., 2022). Our findings echo those of Sullivan et al. (2022) in which altering ice microphysics for fixed (thermo)dynamic inputs generates a 5-fold difference in upper-tropospheric qi.
We close by illustrating the localization of ice clouds in a few ways. First, we get qi values about 5-fold smaller if we take the median across the synthetic flight tracks rather than the mean (Fig. 9c, triangles). Many grid cells sampled from the model output have much lower qi then and a few instances of strong ice formation pull the mean qi higher. Then, if we take the mean qi over a subdomain around the Flight 7 track rather than along our synthetic flight tracks, qi can be twice as small below 160 hPa (Fig. 9c, squares). In other words, the qi evaluation is more sensitive to the sampling method for evaluation than either T or qv. Finally, in the time series of qi, the standard deviation in ice mass mixing ratio (σqi) across the synthetic flight tracks can be on the same order as qi itself; in contrast, σT only reaches ∼0.5 % of temperature values.
We have evaluated tropical upper troposphere-lower stratosphere (UTLS) structure in the ICON storm-resolving model in several ways. We use unique high-altitude measurements of the StratoClim field campaign to assess simulated temperature, moisture, and ice cloud fields, both over time and binned vertically. We also use both flight campaign data of ATTREX and POSIDON and superpressure balloon data of Strateole-2 to assess simulated convective updrafts/downdrafts and gravity wave activity above 14 km in ICON. In this evaluation, a picture emerges in which the upper troposphere is slightly too cold and occasionally too moist, leading to overestimates of ice cloud formation there relative to in-situ observations. Meanwhile, the lower stratosphere is too warm, too dry, and too quiescent, suppressing ice cloud formation there relative to in-situ observations. Of these biases, the overestimates of lower stratospheric temperature are the most robust.
We have tested robustness of the evaluation in several ways. First, we account for spatial offset using synthetic flight tracks from model output, rather than simply averaging the output over a domain around the actual flight track. Ice mass mixing ratio can vary by a factor of 2–5 between these methods, but otherwise we do not gain much from the Lagrangian evaluation, as the biases behave similarly across these two methods. Second, we explore how different biases would be if we used satellite, sonde, or reanalysis data as our observational baseline. Temperature and especially moisture biases would be quite different when using these products. For example, the Microwave Limb Sounder would have reported much larger stratospheric warm biases and no stratospheric dry bias. These discrepancies point to another layer of complication in accurately capturing tropical UTLS structure, as well as the value of high-altitude in-situ measurements.
Finally, we look at different representations of ice cloud microphysics and optics. The stratospheric warm and dry biases are somewhat larger in the one-moment microphysics run, but the largest inter-scheme differences by far are in ice mass mixing ratio. We make two final points about model couplings. Joint distributions of temperature and moisture biases indicate that the two-moment runs are more sensitive to the coupled optics. With a freely evolving ice crystal number, rather than one evaluated from qi, the two-moment scheme “explores” more of the parameter space of optical properties. In addition, Lee et al. (2019) were able to more successfully capture the TTL temperatures and moisture from StratoClim Flight 7 with the same 2.5 km equivalent grid spacing as in our simulations but almost twice as many vertical levels. Their relative success emphasizes the importance of refining vertical grid spacing to reproduce UTLS properties. Microphysical and optical sensitivities may also emerge more strongly with refined vertical resolution, and as Achatz et al. (2024) point out, vertical resolution has a large influence on more reliable representation of gravity wave activity, which feeds back upon the representation of ice clouds.
One mechanism consistent with the reported biases is that convective overshoots are too weak or too infrequent. If the Sichuan Basin convection causing the elevated moisture and ice layers observed in Flight 7 were too weak (Lee et al., 2019), then the lower stratosphere would be left too dry, and lack of stratospheric moisture would cause localized warming. The absence of injected ice would also contribute to the negative biases in qi downstream. Given that our synthetic flight track methodology allows for some spatial offset, it is unlikely that incorrect westward advection of sufficient moisture and ice is responsible. A second mechanism for the biases could be overly rapid dissipation of upper-tropospheric ice clouds. Shorter-lived ice clouds would produce less longwave cooling aloft and less longwave heating below, in line with the temperature biases. Too-quick ice growth would also dehydrate too much aloft and hydrate too much below, as the crystals sedimented and then sublimated. The one- and two-moment differences align with this idea. Updraft speeds and ice mass mixing ratio below 160 hPa are systematically larger for the one-moment run. These conditions suggest more rapid ice formation and correspond to the worse warm and dry biases in the lower stratosphere.
Ice crystal complexity is determinative for this latter mechanism. How large, elongated, and oriented the ice crystals are will determine how long they stay aloft in the cloud. Ice cloud microproperties are underdetermined by current remote sensing platforms measuring radiance only. As a result, two-moment microphysics schemes cannot be fully constrained. Spaceborne polarimetry may be one avenue toward better constraint of upper-level cloud schemes and hence toward better representation of UTLS structure. For example, in the microwave band, polarization of ice crystal scattering is particularly useful to infer ice water path, crystal size, and diurnal cycles for the optically thin cirrus that would form in the UTLS (Gong and Wu, 2017; Gong et al., 2018). Recent launches from the Columbia Scientific Balloon Facility in New Mexico and the Esrange Space Center in Sweden prove viability of thermal polarimetry as another path toward additional observational constraints for ice clouds (Shanks et al., 2024; John et al., 2025). These exciting new observations may be one path toward more reliable representation of the complicated and influential UTLS region.
Code to reproduce analysis and figure generation is available in an archived git repository at https://doi.org/10.5281/zenodo.17252591 (Sullivan, 2026b). All postprocessed data to reproduce figures have been archived at https://zenodo.org/records/17211372 (Sullivan, 2026a). Microwave Limb Sounder data can be downloaded from NASA's Earthdata portal https://search.earthdata.nasa.gov/search/ (last access: 27 January 2026), and MLS averaging kernels are available at https://mls.jpl.nasa.gov/eos-aura-mls/data.php (last access: 27 January 2026). ERA-5 data is publicly available at https://cds.climate.copernicus.eu/datasets/reanalysis-era5-pressure-levels (last access: 27 January 2026). Sonde data can be downloaded from the University of Wyoming archive at https://weather.uwyo.edu/upperair/sounding.shtml (last access: 27 January 2026). A POSIDON overview is provided at https://espo.nasa.gov/posidon (last access: 27 January 2026).
SCS and AV conceptualized the study with support of CR and MK. AM facilitated implementation of online trajectories in ICON version 2.3.0. SCS completed ICON version 2.3.0 runs, and EISA completed ICON version 2.6.4 runs. CR and MK assisted in handling of in-situ and satellite datasets. AV and MKK supported funding acquisition. SCS wrote the manuscript and generated figures. All authors provided feedback on the manuscript.
At least one of the (co-)authors is a member of the editorial board of Atmospheric Chemistry and Physics.
Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. The authors bear the ultimate responsibility for providing appropriate place names. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
We acknowledge compute time from the German Climate Supercomputing Center (DKRZ) under allocation bb1430, as well as a Research Licence of the ECMWF permitting download of ERA-5 reanalysis fields. Thanks to Eric Jensen for preparing the vertical velocities from ATTREX and POSIDON and to Aurelien Podglajen for the Stratoeole-2 data and advice on power spectra to characterize gravity waves. We also thank two reviewers whose feedback has improved this manuscript.
This research has been supported by the Faculty of Earth Sciences, Geography, and Astronomy at the University of Vienna through an Ida Pfeiffer Professorship for SCS. SCS, EISA, and MKK received support from the National Aeronautics and Space Administration (grant no. 80NSSC25K7306): CHanneled Infrared Polarimeter (CHIRP). The European Commission, Seventh Framework Programme funded the StratoClim field campaign (grant no. 603557).
This paper was edited by Shaocheng Xie and reviewed by two anonymous referees.
Achatz, U., Alexander, M. J., Becker, E., Chun, H.-Y., Dörnbrack, A., Holt, L., Plougonven, R., Polichtchouk, I., Sato, K., Sheshadri, A., Stephan, C. C., van Niekerk, A., and Wright, C. J.: Atmospheric gravity waves: Processes and parameterization, J. Atmos. Sci., 81, 237–262, 2024. a, b
Afchine, A., Rolf, C., Costa, A., Spelten, N., Riese, M., Buchholz, B., Ebert, V., Heller, R., Kaufmann, S., Minikin, A., Voigt, C., Zöger, M., Smith, J., Lawson, P., Lykov, A., Khaykin, S., and Krämer, M.: Ice particle sampling from aircraft – influence of the probing position on the ice water content, Atmos. Meas. Tech., 11, 4015–4031, https://doi.org/10.5194/amt-11-4015-2018, 2018. a
Atlas, R., Bretherton, C., Sokol, A., Blossey, P., and Khairoutdinov, M.: Tropical Cirrus Are Highly Sensitive to Ice Microphysics Within a Nudged Global Storm-Resolving Model, Geophysical Research Letters, 51, e2023GL105868, https://doi.org/10.1029/2023GL105868, 2024. a
Banerjee, A., Chiodo, G., Previdi, M., Ponater, M., Conley, A. J., and Polvani, L. M.: Stratospheric water vapor: an important climate feedback, Climate Dynamics, 53, 1697–1710, 2019. a
Baran, A. J., Hilla, P., Walters, D., Hardman, S. C., Furtado, K., Field, P. R., and Manners, J.: The Impact of Two Coupled Cirrus Microphysics–Radiation Parameterizations on the Temperature and Specific Humidity Biases in the Tropical Tropopause Layer in a Climate Model, Journal of Climate, 29, 5299–5316, https://doi.org/10.1175/JCLI-D-15-0821.1, 2016. a, b
Charlesworth, E., Plöger, F., Birner, T., Baikhadzhaev, R., Abalos, M., Abraham, N. L., Akiyoshi, H., Bekki, S., Dennison, F., Jöckel, P., Keeble, J., Kinnison, D., Morgenstern, O., Plummer, D., Rozanov, E., Strode, S., Zeng, G., Egorova, T., Riese, M.: Stratospheric water vapor affecting atmospheric circulation, Nature Communications, 14, 3925, https://doi.org/10.1038/s41467-023-39559-2, 2023. a
Cohen, Y., Hauglustaine, D., Bellouin, N., Lund, M. T., Matthes, S., Skowron, A., Thor, R., Bundke, U., Petzold, A., Rohs, S., Thouret, V., Zahn, A., and Ziereis, H.: Evaluation of O3, H2O, CO, and NOy climatologies simulated by four global models in the upper troposphere–lower stratosphere with IAGOS measurements, Atmos. Chem. Phys., 25, 5793–5836, https://doi.org/10.5194/acp-25-5793-2025, 2025. a
Corcos, M., Hertzog, A., Plougonven, R., and Podglajen, A.: Observation of gravity waves at the tropical tropopause using superpressure balloons, Journal of Geophysical Research: Atmospheres, 126, e2021JD035165, https://doi.org/10.1029/2021JD035165, 2021. a
Crueger, T., Giorgetta, M. A., Brokopf, R., Esch, M., Fiedler, S., Hohenegger, C., Kornblueh, L., Mauritsen, T., Nam, C., Naumann, A. K., Peters, K., Rast, S., Roeckner, E., Sakradzija, M., Schmidt, H., Vial, J., Vogel, R., and Stevens, B.: ICON-A, The atmosphere component of the ICON Earth System Model: II. Model evaluation, J. Adv. Model Earth Sys., 10, 1638–1662, 2018. a
Dauhut, T. and Hohenegger, C.: The contribution of convection to the stratospheric water vapor: The first budget using a global storm-resolving model, Journal of Geophysical Research: Atmospheres, 127, e2021JD036295, https://doi.org/10.1029/2021JD036295, 2022. a
Dessler, A., Schoeberl, M., Wang, T., Davis, S., and Rosenlof, K.: Stratospheric water vapor feedback, Proceedings of the National Academy of Sciences, 110, 18087–18091, 2013. a
Dolan, B., Kollias, P., van den Heever, S. C., Rasmussen, K. L., Oue, M., Luke, E., Lamer, K., Treserras, B. P., Haddad, Z., Stephens, G., and Chandrasekar, V.: Time resolved reflectivity measurements of convective clouds, Geophysical Research Letters, 50, e2023GL105723, https://doi.org/10.1029/2023GL105723, 2023. a
Doms, G., Förstner, J., Heise, E., Herzog, H.-J., Raschendorfer, M., Schrodin, R., Reinhardt, T., and Vogel, G.: A description of the nonhydrostatic regional model LM, Tech. rep., Deutscher Wetterdienst, P.O. Box 100465, 63004 Offenbach, Germany, https://www.cosmo-model.org/content/model/cosmo/coreDocumentation/cosmo_physics_4.20.pdf (last access: 27 January 2026), 2005. a, b
Fu, Q.: An Accurate Parameterization of the Solar Radiative Properties of Cirrus Clouds for Climate Models, Journal of Climate, 9, 2058–2082, https://doi.org/10.1175/1520-0442(1996)009<2058:AAPOTS>2.0.CO;2, 1996. a, b
Fu, Q., Yang, P., and Sun, W. B.: An Accurate Parameterization of the Infrared Radiative Properties of Cirrus Clouds for Climate Models, Journal of Climate, 11, 2223–2237, https://doi.org/10.1175/1520-0442(1998)011<2223:AAPOTI>2.0.CO;2, 1998. a, b
Gettelman, A., Birner, T., Eyring, V., Akiyoshi, H., Bekki, S., Brühl, C., Dameris, M., Kinnison, D. E., Lefevre, F., Lott, F., Mancini, E., Pitari, G., Plummer, D. A., Rozanov, E., Shibata, K., Stenke, A., Struthers, H., and Tian, W.: The Tropical Tropopause Layer 1960–2100, Atmos. Chem. Phys., 9, 1621–1637, https://doi.org/10.5194/acp-9-1621-2009, 2009. a
Gettelman, A., Hegglin, M. I., Son, S.-W., Kim, J., Fujiwara, M., Birner, T., Kremser, S., Rex, M., Añel, J. A. A., Akiyoshi, H., Austin, J., Bekki, S., Kraesike, P., Brühl, C., Butchart, N., Chipperfield, M., Dameris, M., Dhomse, S., Garny, H., Hardiman, S. C., Jöckel, P., Kinnison, D. E., Lamarque, J. F., Mancini, E., Marchand, M., Michou, M., Morgenstern, O., Pawson, S., Pitari, G., Plummer, D., Pyle, J. A., Rozanov, E., Scinocca, J., Shepherd, T. G., Shibata, K., Smale, D., Teyssèdre, H., and Tian, W.: Multimodel assessment of the upper troposphere and lower stratosphere:Tropics and global trends, J. Geophys. Res., 115, https://doi.org/10.1029/2009JD013638, 2010. a, b
Giorgetta, M. A., Brokopf, R., Crueger, T., Esch, M., Fiedler, S., Helmert, J., Hohenegger, C., Kornblueh, L., Köhler, M., Manzini, E., Mauritsen, T., Nam, C., Raddatz, T., Rast, S., Reinert, D., Sakradzija, M., Schmidt, H., Schneck, R., Schnur, R., Silvers, L., Wan, H., Zängl, G., and Stevens, B.: ICON-A, the Atmosphere Component of the ICON Earth System Model: I. Model Description, J. Adv. Modeling Earth Sys., 10, 1613–1637, https://doi.org/10.1029/2017MS001242, 2018. a
Gong, J. and Wu, D. L.: Microphysical properties of frozen particles inferred from Global Precipitation Measurement (GPM) Microwave Imager (GMI) polarimetric measurements, Atmos. Chem. Phys., 17, 2741–2757, https://doi.org/10.5194/acp-17-2741-2017, 2017. a
Gong, J., Zeng, X., Wu, D. L., and Li, X.: Diurnal variation of tropical ice cloud microphysics: Evidence from Global Precipitation Measurement Microwave Imager polarimetric measurements, Geophysical Research Letters, 45, 1185–1193, 2018. a
Haase, J. S., Alexander, M. J., Hertzog, A., Kalnajs, L., Deshler, T., Davis, S. M., Plougonven, R., Cocquerez, P., and Venel, S.: Around the world in 84 days, Eos, 99, https://doi.org/10.1029/2018EO091907, 2018. a
Hersbach, H., Bell, B., Berrisford, P., Hirahara, S., Horányi, A., Muñoz-Sabater, J. M., Nicolas, J., Peubey, C., Radu, R., Schepers, D., Simmons, A., Soci, C., Abdalla, S., Abellan, X., Balsamo, G., Bechtold, P., Biavati, G., Bidlot, J., Bonavita, M., Chiara, G. D., Dahlgren, P., Dee, D., Diamantakis, M., Dragani, R., Flemming, J., Forbes, R., Fuentes, M., Geer, A., Haimberger, L., Healy, S., Hogan, R. J., Hólm, E., Janisková, M., Keeley, S., Laloyaux, P., Lopez, P., Lupu, C., Radnoti, G., de Rosnay, P., Rozum, I., Vamborg, F., Villaume, S., and Thépautet, J.: The ERA5 global reanalysis, Q. J. Roy. Meteorol. Soc., 146, 1999–2049, 2020. a
Hoffmann, L. and Spang, R.: An assessment of tropopause characteristics of the ERA5 and ERA-Interim meteorological reanalyses, Atmos. Chem. Phys., 22, 4019–4046, https://doi.org/10.5194/acp-22-4019-2022, 2022. a
Jensen, E. J., Ueyama, R., Pfister, L., Bui, T. V., Alexander, M. J., Podglajen, A., Hertzog, A., Woods, S., Lawson, R. P., Kim, J.-E., and Schoeberl, M. R.: High-frequency gravity waves and homogeneous ice nucleation in tropical tropopause layer cirrus, Geophys. Res. Lett., 43, 6629–6635, 2016. a
Jensen, E. J., Pfister, L., Jordan, D. E., Bui, T. V., Ueyama, R., Singh, H. B., Thornberry, T. D., Rollins, A. W., Gao, R.-S., Fahey, D. W., Pfeilsticker, K., Diskin, G. S., DiGangi, J. P., Lawson, R. P., Woods, S., Atlas, E. L., Bardeen, C. G., Toon, O. B., Kindel, B. C., Newman, P. A., McGill, M. J., Hlavka, D. L., Lait, L. R., Schoeberl, M. R., Bergman, J. W., Selkirk, H. B., Alexander, M. J., Kim, J.-E., Lim, B. H., and Stutz, J.: The NASA Airborne Tropical Tropopause Experiment: High-altitude aircraft measurements in the tropical western Pacific, Bulletin of the American Meteorological Society, 98, 129–143, 2017. a
Johansson, E., Devasthale, A., Ekman, A. M., Tjernström, M., and L'Ecuyer, T.: How does cloud overlap affect the radiative heating in the tropical upper troposphere/lower stratosphere?, Geophysical Research Letters, 46, 5623–5631, 2019. a
John, J. A., Gong, J., Parkinson, J. C., Sullivan, S. C., Wu, D. L., and Kupinski, M. K.: Cloud Top Observations of LWIR Spectro-Polarimetry, Wiley Online Library, https://doi.org/10.1117/12.3063972, 2025. a
Khaykin, S. M., Engel, I., Vömel, H., Formanyuk, I. M., Kivi, R., Korshunov, L. I., Krämer, M., Lykov, A. D., Meier, S., Naebert, T., Pitts, M. C., Santee, M. L., Spelten, N., Wienhold, F. G., Yushkov, V. A., and Peter, T.: Arctic stratospheric dehydration – Part 1: Unprecedented observation of vertical redistribution of water, Atmos. Chem. Phys., 13, 11503–11517, https://doi.org/10.5194/acp-13-11503-2013, 2013. a
Kienast-Sjögren, E., Miltenberger, A. K., Luo, B. P., and Peter, T.: Sensitivities of Lagrangian modelling of mid-latitude cirrus clouds to trajectory data quality, Atmos. Chem. Phys., 15, 7429–7447, https://doi.org/10.5194/acp-15-7429-2015, 2015. a
Kim, J., Grise, K. M., and Son, S.-W.: Thermal characteristics of the cold-point tropopause region in CMIP5 models, Journal of Geophysical Research: Atmospheres, 118, 8827–8841, 2013. a
Krämer, M., Rolf, C., Spelten, N., Afchine, A., Fahey, D., Jensen, E., Khaykin, S., Kuhn, T., Lawson, P., Lykov, A., Pan, L. L., Riese, M., Rollins, A., Stroh, F., Thornberry, T., Wolf, V., Woods, S., Spichtinger, P., Quaas, J., and Sourdeval, O.: A microphysics guide to cirrus – Part 2: Climatologies of clouds and humidity from observations, Atmos. Chem. Phys., 20, 12569–12608, https://doi.org/10.5194/acp-20-12569-2020, 2020. a
Krüger, K., Schäfler, A., Wirth, M., Weissmann, M., and Craig, G. C.: Vertical structure of the lower-stratospheric moist bias in the ERA5 reanalysis and its connection to mixing processes, Atmos. Chem. Phys., 22, 15559–15577, https://doi.org/10.5194/acp-22-15559-2022, 2022. a
Lee, K.-O., Dauhut, T., Chaboureau, J.-P., Khaykin, S., Krämer, M., and Rolf, C.: Convective hydration in the tropical tropopause layer during the StratoClim aircraft campaign: pathway of an observed hydration patch, Atmos. Chem. Phys., 19, 11803–11820, https://doi.org/10.5194/acp-19-11803-2019, 2019. a, b, c, d, e, f, g
Marécal, V., Durry, G., Longo, K., Freitas, S., Rivière, E. D., and Pirre, M.: Mesoscale modelling of water vapour in the tropical UTLS: two case studies from the HIBISCUS campaign, Atmos. Chem. Phys., 7, 1471–1489, https://doi.org/10.5194/acp-7-1471-2007, 2007. a
Maycock, A. C., Joshi, M. M., Shine, K. P., and Scaife, A. A.: The circulation response to idealized changes in stratospheric water vapor, Journal of Climate, 26, 545–561, 2013. a
Merlis, T. M., Guendelman, I., Cheng, K.-Y., Harris, L., Chen, Y.-T., Bretherton, C. S., Bolot, M., Zhou, L., Kaltenbaugh, A., Clark, S. K., and Fueglistaler, S.: The vertical structure of tropical temperature change in global storm-resolving model simulations of climate change, Geophysical Research Letters, 51, e2024GL111549, https://doi.org/10.1029/2024GL111549, 2024. a
Meyer, J., Rolf, C., Schiller, C., Rohs, S., Spelten, N., Afchine, A., Zöger, M., Sitnikov, N., Thornberry, T. D., Rollins, A. W., Bozóki, Z., Tátrai, D., Ebert, V., Kühnreich, B., Mackrodt, P., Möhler, O., Saathoff, H., Rosenlof, K. H., and Krämer, M.: Two decades of water vapor measurements with the FISH fluorescence hygrometer: a review, Atmos. Chem. Phys., 15, 8521–8538, https://doi.org/10.5194/acp-15-8521-2015, 2015. a
Miltenberger, A. K., Lüttmer, T., and Siewert, C.: Secondary Ice Formation in Idealised Deep Convection–Source of Primary Ice and Impact on Glaciation, Atmosphere, 11, https://doi.org/10.3390/atmos11050542, 2020. a
Nordeng, T. E.: Extended versions of the convective parametrization scheme at ECMWF and their impact on the mean and transient activity of the model in the tropics, Tech. rep., ECMWF Tech. Memo., Shinfield Park, Reading, https://doi.org/10.21957/e34xwhysw, 1994. a
Nugent, J., Turbeville, S., Bretherton, C., Blossey, P., and Ackerman, T.: Tropical Cirrus in Global Storm-Resolving Models: 1. Role of Deep Convection, Earth and Space Science, 9, https://doi.org/10.1029/2021EA001965, 2022. a
Ploeger, F., Birner, T., Charlesworth, E., Konopka, P., and Müller, R.: Moist bias in the Pacific upper troposphere and lower stratosphere (UTLS) in climate models affects regional circulation patterns, Atmos. Chem. Phys., 24, 2033–2043, https://doi.org/10.5194/acp-24-2033-2024, 2024. a, b
Podglajen, A., Hertzog, A., Plougonven, R., and Legras, B.: Lagrangian temperature and vertical velocity fluctuations due to gravity waves in the lower stratosphere, Geophys. Res. Lett., 43, 3543–3553, 2016. a, b
Podglajen, A., Bui, T. P., Dean-Day, J. M., Pfister, L., Jensen, E. J., Alexander, M. J., Hertzog, A., Kärcher, B., Plougonven, R., and Randel, W. J.: Small-scale wind fluctuations in the tropical tropopause layer from aircraft measurements: Occurrence, nature, and impact on vertical mixing, Journal of the Atmospheric Sciences, 74, 3847–3869, 2017. a
Podglajen, A., Plougonven, R., Hertzog, A., and Jensen, E.: Impact of gravity waves on the motion and distribution of atmospheric ice particles, Atmos. Chem. Phys., 18, 10799–10823, https://doi.org/10.5194/acp-18-10799-2018, 2018. a
Podglajen, A., Hertzog, A., Plougonven, R., and Legras, B.: Lagrangian gravity wave spectra in the lower stratosphere of current (re)analyses, Atmos. Chem. Phys., 20, 9331–9350, https://doi.org/10.5194/acp-20-9331-2020, 2020. a
Potter, B. E. and Holton, J. R.: The role of monsoon convection in the dehydration of the lower tropical stratosphere, J. Atm. Sci., 52, 1034–1050, 1995. a
Poujol, B. and Bony, S.: Measuring clear-air vertical motions from space, AGU Advances, 5, e2024AV001267, https://doi.org/10.1029/2024AV001267, 2024. a
Schoeberl, M. R., Dessler, A. E., and Wang, T.: Modeling upper tropospheric and lower stratospheric water vapor anomalies, Atmos. Chem. Phys., 13, 7783–7793, https://doi.org/10.5194/acp-13-7783-2013, 2013. a
Seifert, A. and Beheng, K. D.: A two-moment cloud microphysics parameterization for mixed-phase clouds. Part I: Model description, Meteorol. Atmos. Phys., 92, 45–66, https://doi.org/10.1002/2014JD021917, 2006. a, b
Sepulveda Araya, E. I., Sullivan, S. C., and Voigt, A.: Ice crystal complexity leads to weaker ice cloud radiative heating in idealized single-column simulations, Atmos. Chem. Phys., 25, 8943–8958, https://doi.org/10.5194/acp-25-8943-2025, 2025. a
Shanks, K. A., John, J. A., Parkinson, J. C., Wu, D. L., and Kupinski, M. K.: High-altitude demonstration of LWIR polarimeter using uncooled microbolometer, Journal of Quantitative Spectroscopy and Radiative Transfer (JQSRT), 315, 108872, https://doi.org/10.1016/j.jqsrt.2023.108872, 2024. a
Simmons, A., Poli, P., Dee, D., Berrisford, P., Dragani, R., Fuentes, M., Kobayashi, S., Radu, R., and Schepers, D.: Global stratospheric temperature bias and other stratospheric aspects of ERA5 and ERA5.1, ERA Report Series, 10, https://www.ecmwf.int/en/elibrary/81149-global-stratospheric-temperature-bias (last access: 27 January 2026), 2020. a
Singer, C. E., Clouser, B. W., Khaykin, S. M., Krämer, M., Cairo, F., Peter, T., Lykov, A., Rolf, C., Spelten, N., Afchine, A., Brunamonti, S., and Moyer, E. J.: Intercomparison of upper tropospheric and lower stratospheric water vapor measurements over the Asian Summer Monsoon during the StratoClim campaign, Atmos. Meas. Tech., 15, 4767–4783, https://doi.org/10.5194/amt-15-4767-2022, 2022. a
Smith, W. P., Pan, L. L., Kinnison, D., Atlas, E., Honomichl, S., Zhang, J., Tilmes, S., Fernandez, R. P., Saiz-Lopez, A., Treadaway, V., Adcock, K. E., Laube, J. C., von Hobe, M., Kloss, C., Viciani, S., D’Amato, F., Volk, C. M., and Ravegnani, F.: Evaluating the model representation of Asian summer monsoon upper troposphere and lower stratosphere transport and composition using airborne in situ observations, Journal of Geophysical Research: Atmospheres, 129, e2023JD039756, https://doi.org/10.1029/2023JD039756, 2024. a
Solomon, S., Rosenlof, K. H., Portmann, R. W., Daniel, J. S., Davis, S. M., Sanford, T. J., and Plattner, G.-K.: Contributions of stratospheric water vapor to decadal changes in the rate of global warming, Science, 327, 1219–1223, 2010. a
Sullivan, S. C.: Observational and Synthetic Flight Track Data for UTLS Evaluation in ICON, Zenodo [data set], https://zenodo.org/records/17211372 (last access: 28 January 2026), 2026a. a, b
Sullivan, S. C.: sylviasullivan/UTLS-profiles: Submission version 1 (submission-version1), Zenodo [code], https://doi.org/10.5281/zenodo.17252591, 2026b. a, b
Sullivan, S. C., Voigt, A., Miltenberger, A., Rolf, C., and Krämer, M.: A Lagrangian Perspective of Microphysical Impact on Ice Cloud Evolution and Radiative Heating, J. Adv. Model. Earth Sys., 14, https://doi.org/10.1029/2022MS003226, 2022. a, b
Turbeville, S., Nugent, J., Ackerman, T., Bretherton, C., and Blossey, P.: Tropical cirrus in global storm-resolving models: 2. Cirrus life cycle and top-of-atmosphere radiative fluxes, Earth and Space Science, 9, e2021EA001978, https://doi.org/10.1029/2021EA001978 2022. a
Wang, W., Shangguan, M., Tian, W., Schmidt, T., and Ding, A.: Large uncertainties in estimation of tropical tropopause temperature variabilities due to model vertical resolution, Geophysical Research Letters, 46, 10043–10052, 2019. a
Waters, J. W., Froidevaux, L., Harwood, R. S., Jarnot, R. F., Pickett, H. M., Read, W. G., Siegel, P. H., Cofield, R. E., Filipiak, M. J., Flower, D. A., Holden, J. R., Lau, G. K., Livesey, N. J., Manney, G. L., Pumphrey, H. C., Santee, M. L., Wu, D. L., Cuddy, D. T., Lay, R. R., Loo, M. S., Perun, V. S., Schwartz, M. J., Stek, P. C., Thurstans, R. P., Boyles, M. A., Chandra, K. M., Chavez, M. C., Chen, G.-S., Chudasama, B. V., Dodge, R., Fuller, R. A., Girard, M. A., Jiang, J. H., Jiang, Y., Knosp, B. W., LaBelle, R. C., Lam, J. C., Lee, K. A., Miller, D., Oswald, J. E., Patel, N. C., Pukala, D. M., Quintero, O., Scaff, D. M., Snyder, W. V., Tope, M. C., Wagner, P. A., and Walch, M. J.: The earth observing system microwave limb sounder (EOS MLS) on the Aura satellite, IEEE Transactions on Geoscience and Remote Sensing, 44, 1075–1092, 2006. a
Yi, B., Yang, P., Baum, B. A., L'Ecuyer, T., Oreopoulos, L., Mlawer, E. J., Heymsfield, A. J., and Liou, K. N.: Influence of Ice Particle Surface Roughening on the Global Cloud Radiative Effect, Journal of the Atmospheric Sciences, 70, 2794–2807, https://doi.org/10.1175/JAS-D-13-020.1, 2013. a, b, c