Thermodynamic and cloud evolution in a cold air outbreak during HALO-(AC) 3 : Quasi-Lagrangian observations compared to the ERA5 and CARRA reanalyses

. Intense air mass transformations take place when cold, dry Arctic air masses move southward from the closed sea ice onto the much warmer ice-free Arctic ocean during marine cold air outbreaks (MCAOs). In spite of intensive research on MCAOs during recent years, the temporal rates of diabatic heating and moisture uptake relevant also for cloud formation/dissipation have not been measured along MCAO flows. Instead, reanalyses have typically been used for climatological investigations of MCAOs or to supply higher resolution models with lateral boundary conditions and time-dependent forcings. 5 Meanwhile, the uncertainties connected to those datasets remain unclear. Here, we present height-resolved observations of diabatic heating rates, moisture uptake, and cloud evolution measured in a quasi-Lagrangian manner. The investigated specific MCAO was observed on 01 April 2022 during the HALO-(AC) 3 airborne campaign that was conducted in spring 2022. Shortly after passing the ice edge, maximum diabatic heating rates larger than 6 K h -1 and moisture uptake of more than 0.3 g kg -1 h -1 were measured close above the ocean surface. As the air 10 mass continued its drift southwards, clouds started to form and vertical mixing within the steadily deepening boundary layer was intensified. The quasi-Lagrange observations are compared with reanalysis data from the European Centre for Medium-Range Weather Forecasts


Introduction
Arctic air masses over closed sea ice are subject to a sustained radiative cooling.Therefore, they are characterized both by low air temperatures and low atmospheric moisture contents.Marine cold-air outbreaks (CAOs) manifest when such Arctic air masses depart the closed sea ice, traverse the marginal sea-ice zone (MIZ), and ultimately move southward onto considerably warmer ice-free oceans (Fletcher et al., 2016a;Dahlke et al., 2022).In the early stages of CAOs, significant air mass transformations occur.They are driven by strong surface energy fluxes of sensible and latent heat, as well as by additional entrainment fluxes through mixing with the overlying warmer air masses (Brümmer, 1996;Tet-B. Kirbus et al.: Thermodynamic and cloud evolution in a cold-air outbreak observed during HALO-(AC) 3 zlaff et al., 2015).The intense diabatic heating and moisture uptake initiates roll convection that leads to cloud evolution and a deepening of the atmospheric boundary layer (ABL; Fletcher et al., 2016a;Papritz and Spengler, 2017;Pithan et al., 2018).As a result, the near-surface air temperature can increase by more than 20 K in a matter of hours (Pithan et al., 2018;Wendisch et al., 2023).Characteristic cloud streets of up to 1000 km length are formed, which later break up due to processes such as ABL decoupling and precipitation formation (Fletcher et al., 2016a;Pithan et al., 2018;Lloyd et al., 2018;Tornow et al., 2021;Dahlke et al., 2022;Sanchez et al., 2022;Murray-Watson et al., 2023).This transition finally results in cellular cloud structures that have been reported to occur for boundary layer heights (BLHs) of over 1.4 km (Brümmer, 1999).Then, the heat release from water vapor condensation into cloud droplets can even exceed the surface heat fluxes (Brümmer, 1996).In the temperature range of −25 to 0 °C, typical CAO clouds are of mixed-phase type, where the upper portions of the clouds are dominated by supercooled liquid water and the lower parts by ice particles (Shupe et al., 2006;Morrison et al., 2012).The strongest CAO events occur in winter, when the horizontal surface temperature gradient between the cold sea ice and the adjacent ice-free ocean is the largest (Fletcher et al., 2016a;Papritz and Spengler, 2017;Dahlke et al., 2022).One of the primary gateways into and out of the central Arctic is the Fram Strait, located between Greenland and the Svalbard archipelago.CAOs are favored in this area because the North Atlantic Current transports significant heat northward, and consequently the MIZ and sea-ice edge are located far northward as well (Dahlke et al., 2022), which promotes intense CAOs in this region (Papritz and Spengler, 2017).
Several factors have sparked scientific interest in studying CAOs.The formation of cloud streets and their transition into open cells have important implications for the Arctic and the mid-latitude radiative energy budget, as the bright clouds over dark, ice-free ocean surfaces reflect a large fraction of incoming solar radiation, which causes a significant cooling at the surface (Li et al., 2011;Sanchez et al., 2022;Murray-Watson et al., 2023).Furthermore, large amounts of heat are transferred from the ocean into the atmosphere.Estimates show that about 60 %-80 % of oceanic heat loss in the Nordic Seas in winter is caused by CAOs, which has important implications for deep water formation (Papritz and Spengler, 2017;Svingen et al., 2023).CAOs have been linked to the evolution of short-lived polar lows and mesoscale cyclones (Shapiro et al., 1987;Stoll et al., 2018;Landgren et al., 2019;Meyer et al., 2021;Terpstra et al., 2021).Either with or without such low-pressure systems being present, CAOs can trigger extreme weather conditions, such as freezing sea spray, intense snowfall, or high near-surface winds.These phenomena pose significant hazards at affected coastlines (Kolstad, 2017;Landgren et al., 2019).The Arctic amplification observed in recent decades has caused a significant reduction in strong wintertime CAOs in the Fram Strait (Dahlke et al., 2022) and Barents Sea (Narizhnaya et al., 2020).Also, in the future, strong wintertime CAOs are expected to decrease (Landgren et al., 2019).On the contrary, springtime CAOs are observed to intensify (Dahlke et al., 2022).Not only are the CAO intensities expected to change, but the melting Arctic sea ice is also leading to a shift in spatial patterns (Landgren et al., 2019).
In addition, sophisticated atmospheric reanalyses have been developed.They assimilate a large amount of available measurements, such as atmospheric soundings and satellite data (Hersbach et al., 2020).Reanalyses deliver meteorological parameters on a continuous latitude-longitudeheight grid, as well as at high temporal resolution down to 1 h.The fifth-generation atmospheric reanalysis (ERA5) of the European Centre for Medium-Range Weather Forecasts (ECMWF) is frequently used for climatological studies (Papritz and Spengler, 2017;Papritz et al., 2019;Dahlke et al., 2022).Furthermore, dedicated Arctic reanalyses have been developed, such as the spatially much higher resolved Copernicus Arctic Regional Reanalysis (CARRA).Investigations into characteristic properties and trends of Arctic CAOs based on reanalyses have been created for classical Eulerian (Dahlke et al., 2022) and quasi-Lagrangian frameworks (Papritz and Spengler, 2017)."Quasi-Lagrangian" highlights the fact that an air mass is not truly physically followed, as it may be possible by meteorological balloons (Businger et al., 2006).Instead, wind fields as available from reanalyses are used to model the flow of air masses (Sprenger and Wernli, 2015).Such kinematic trajectories are oblivious to sub-gridscale turbulent motion leading to exchanges across neighboring air masses, which can be diagnosed as sources and sinks of, for example, moisture and heat.Yet they account for the mean drift along prevailing winds.Aircraft can be employed to trace the properties of specific air parcels along their trajectory (Boettcher et al., 2021;Sanchez et al., 2022).Finally, reanalysis output is used to supply the boundary conditions and time-dependent forcings to much higher resolved models (Seethala et al., 2021;Li et al., 2022).
However, microphysical properties and the processes governing the evolving clouds and their radiative properties re-main notoriously difficult to model (Pithan and Mauritsen, 2014;Pithan et al., 2018;Wendisch et al., 2021).This is especially true over sea ice and the MIZ, where the widely employed satellite-based remote sensing faces serious challenges.As a result, many satellite studies investigating CAOs focus solely on the evolution over the fully ice-free open ocean (Wu and Ovchinnikov, 2022;Murray-Watson et al., 2023;Mateling et al., 2023).Furthermore, the vertically nonuniform diabatic heating and moisture uptake by air masses along CAO trajectories are not sufficiently represented in models, which may cause issues in terms of atmospheric stability and the lapse-rate feedback (Linke et al., 2023).While the contributing processes are generally well understood, their relative importance and absolute magnitudes remain unspecified (Pithan et al., 2018;Wendisch et al., 2021;You et al., 2021b, a).As a result, the overall cloud effects on Arctic climate remain uncertain (Boucher et al., 2014;Wendisch et al., 2021Wendisch et al., , 2023)).
Here, we present airborne measurements of the heightdependent heating and moistening rates during a specific CAO event, based on quasi-Lagrangian airborne observations.The investigated flight of the High Altitude and LOng Range Aircraft (HALO) was conducted as part of the HALO-(AC) 3 airborne campaign, which took place in spring 2022.We compare the quasi-Lagrangian observations to the ERA5 and CARRA reanalyses.In our article, we address three specific research questions.(Q1) How do air temperature, specific humidity, and clouds evolve in the first 4 h of the developing CAO? (Q2) How do the ERA5 and CARRA reanalyses perform with respect to observations and compared to each other?(Q3) What are possible sources of errors which could explain deviations between reanalysis output and observations?
The study is structured as follows.Section 2 details the airborne observations which are the foundation of this study.The two ERA5 and CARRA reanalyses are introduced, and the trajectory analysis is described.In Sect.3, the airborne measurements are analyzed in a classical Eulerian framework.Subsequently, the quasi-Lagrangian analysis will be used to present and discuss novel observation-derived heating and moistening rates along the CAO flow, as well as correlated cloud properties, and to compare them between the two reanalyses.

Airborne observations
The CAO analyzed in this study was observed on 1 April 2022 during the HALO-(AC) 3 campaign, which was conducted in March and April 2022 as a dedicated quasi-Lagrangian Arctic airborne campaign (Wendisch et al., 2021(Wendisch et al., , 2024)).The meteorological conditions that prevailed during the campaign are described in Walbröl et al. (2023).HALO-(AC) 3 involved the HALO research aircraft operated by the German Aerospace Center (Krautstrunk and Giez, 2012;Stevens et al., 2019) for the long-range investigation of air mass transformations in combination with the lowerflying Polar 5 and Polar 6 research aircraft operated by the Alfred Wegener Institute, Helmholtz Centre for Polar and Marine Research (Wesche et al., 2016).After taking off from the base in Kiruna (Sweden) at 07:30 UTC, HALO headed north.It then sampled the CAO cloud streets west of Svalbard; see Fig. 1.The speed of HALO at its typical flight altitude of 10-12 km is around 800 km h −1 , which is much faster than the wind speed of 30-60 km h −1 measured by dropsondes on this day.Therefore, in order to facilitate a quasi-Lagrangian (i.e., air mass following) sampling of cloudy air masses, long horizontal cross-sections were flown across the off-ice flow.These flight legs not only covered the icefree ocean, but also parts of the adjacent Arctic sea ice; see Fig. 1.Several such flight legs were conducted, where the legs were stepwise shifted south roughly according to the forecast wind speed in the atmospheric boundary layer.Similar quasi-Lagrangian airborne sampling was performed previously, but taking place over the Atlantic (Methven et al., 2006) and for a warm conveyor belt over Europe (Boettcher et al., 2021).Similar to our case, Sanchez et al. (2022) investigated the aerosol and cloud evolution in CAOs.From their quasi-Lagrangian observations, they contrast the evolving particle mode distributions between within and outside CAO flow.However, they do not report, for example, on heating or moistening rates.
We analyze a set of 40 dropsondes which were released from HALO northwest of Svalbard.These RD94 dropsondes recorded air pressure p (accuracy 0.4 hPa), air temperature T (0.2 K), relative humidity RH (2 %), derived potential temperature θ , and specific humidity q, as well as horizontal wind components (0.2 m s −1 ; Vaisala, 2010; George et al., 2021).The data were assimilated by the ECMWF Integrated Forecasting System (IFS), also serving as input for the ERA5 and CARRA reanalyses.The profiles of θ are used to derive the atmospheric BLH from dropsonde measurements and reanalyses.The BLH is defined here as the altitude where the largest vertical gradient in θ is found (similar to Seidel et al., 2010;Dai et al., 2014;Bakas et al., 2020;Sinclair et al., 2022).For estimating the cloud top heights (CTHs), the 532 nm backscatter ratio from the water vapor differential absorption (WALES) lidar is used (Wirth et al., 2009).WALES has a vertical resolution of 15 m.We define the CTH as the maximum altitude above ground where the backscatter ratio exceeds that of cloud-free sections.Cloud radar data from the HALO Microwave Package (HAMP) are used to additionally evaluate cloud evolution (Mech et al., 2014).The radar data have a vertical resolution of 30 m. Furthermore, to better understand the heating and moistening rates, airborne observations from HALO are used to estimate the surface sensible and latent heat fluxes (SSHF, SLHF), similar to Li et al. (2022)  where dropsondes were released.White-blueish contours represent 1 km high-resolution sea-ice concentration retrieved from a merged MODIS-AMSR2 satellite product (Ludwig et al., 2020).
Over the ice-free ocean, yellow-brownish contours indicate the ERA5-derived CAO index M 850 hPa ; see Eq. (3).Finally, the colored lines show 24 h backwards and 24 h forward trajectories initialized at the location of each dropsonde at 10 hPa above ground.
The colors represent the evolving potential temperature (θ ) of these air masses as traced from ERA5 data.
iment (COARE) bulk air-sea flux algorithms and aerodynamic formulas (Fairall et al., 2003).COARE is widely used for the calculation of air-sea turbulent heat fluxes (Edson et al., 2013;Bharti et al., 2019;Lin et al., 2023) and has been found to perform the best among 12 examined bulk aerodynamic formulas (Brunke et al., 2003).The following basic equations were used to estimate SSHF and SLHF (Fairall et al., 2003): where ρ air denotes the air density (kg m −3 ), C H and C Q are the transfer coefficients for heat and humidity (dimensionless), c p is the specific heat capacity at constant pressure (c p = 1004.7 J kg −1 K −1 ), L v is the latent heat of evaporation (L v = 2.5008 J kg −1 ), U 10 m is the wind speed at 10 m height (m s −1 ), (T 10 m − T skin ) is the temperature difference between the 10 m air temperature and skin temperature (K), and (q 10 m − 0.98q sat,skin ) is the difference in specific humidity between the 10 m level and the specific saturation humidity taken at skin temperature ( kg kg −1 ).The factor of 0.98 accounts for the reduction in vapor pressure resulting from a typical seawater salinity of 3.4 % (Fairall et al., 2003).Dropsonde profiles are used to extract ρ air , U 10 m , T 10 m , and q 10 m via interpolation to the 10 m height level.The Video airbornE Longwave Observations within siX channels (VELOX) thermal infrared imager (Schäfer et al., 2022) is applied to obtain T skin (accuracy 0.5 K) and q sat,skin for the cloud-free sections.
The transfer coefficients of heat and humidity are directly calculated using the most recent COARE 3.5 bulk air-sea algorithm (Edson et al., 2013;Bariteau et al., 2021).At winds speeds up to 20 m s −1 , COARE has a reported uncertainty of around 10 % (Fairall et al., 2003;Edson et al., 2013).Together with the measurement uncertainties, a combined uncertainty on the bulk fluxes (SSHF, SLHF) of at least 12 % is assumed.For the MIZ with its many open leads, the calculated fluxes were multiplied with the open sea fraction of a merged MODIS-AMSR2 satellite product (Ludwig et al., 2020).However, it should be stressed that the real surface heat fluxes can be assumed to be highly heterogeneous in the MIZ and thus prone to much higher uncertainties (Tetzlaff et al., 2015).Finally, to collect in situ cloud measurements, the Polar 6 aircraft sampled concurrently with HALO (Fig. S2).Polar 6 was based in Longyearbyen on Svalbard and was equipped with a wide range of in situ probes (Moser et al., 2023), including a Nevzorov sonde from which the liquid and frozen cloud water contents were obtained.For liquid water contents of around 0.05 g kg −1 similar to that discussed here, the uncertainty on measurements is assumed to be at approximately 17 % of the observed values (Korolev et al., 1998;Lucke et al., 2022;Mech et al., 2022a).

Reanalysis products
The ERA5 global reanalysis features a sophisticated fourdimensional variational data assimilation scheme and is based on ECMWF's IFS cycle 41r2 (Hersbach et al., 2020).ERA5 data fields have a temporal resolution of 1 h, have a horizontal grid resolution of 31 km, and are available on 137 model levels.The model levels start 10 m above ground level (m a.g.l.) and are then situated approximately every 20 m, with an increasing spacing upwards.Several studies note the high performance of ERA5 in the Arctic region (Graham et al., 2019a;Wu et al., 2023), specifically in the Fram Strait region (Graham et al., 2019b).Thus, numerous authors performing trajectory analysis in the Arctic rely on wind and meteorological data fields from ERA5 (e.g., Papritz and Spengler, 2017;Papritz, 2020;Dahlke et al., 2022;You et al., 2021a;Kirbus et al., 2023a, b;Svensson et al., 2023).The CARRA regional reanalysis was specifically tailored towards the unique conditions in the Arctic environment, such as the prevailing cold surfaces on Arctic sea ice and ice sheets.Notably, it explicitly simulates a snow layer on sea ice.CARRA is based on the HARMONIE-AROME nonhydrostatic regional numerical weather prediction model, which is operational in the Nordic countries and several other European countries (Bengtsson et al., 2017;Yang et al., 2023).The reanalysis data can be retrieved for two distinct domains (CARRA-West covering Greenland and CARRA-East encompassing Svalbard and northern Scandinavia) that overlap in the vicinity of Svalbard (Yang et al., 2023).Boundary forcings are taken from ERA5.CARRA analysis fields have a temporal resolution of 3 h, a horizontal grid resolution of 2.5 km, and 65 vertical model levels.The model levels start 15 m a.g.l. and are then situated approximately every 30 m, with an increasing spacing upwards.
In this study, ERA5 and CARRA wind fields are used for trajectory calculations, thermodynamic profiles are extracted at the dropsonde locations, and several cloud-related parameters and turbulent energy fluxes are retrieved.Compared to ERA5, a larger amount of local observations is assimilated into CARRA's three-dimensional variational assimilation scheme, such as snow depths from satellite observations or actual measurements of glacier albedos.Satelliteborne sea-surface temperature and sea-ice data are assimilated at a higher spatial resolution compared to ERA5.Especially in areas with steep topography, the increased resolution of CARRA versus ERA5 is expected to better fit to observations (Yang et al., 2023).Isaksen et al. (2022) show that both reanalyses reproduce the key features of the observed exceptional warming over the Barents Sea.However, CARRA shows more spatial details and larger regional surface air temperature trends.Moore and Imrit (2022) investigate winds in the 40-100 km narrow Nares Strait northwest of Greenland.They find a significant underestimation of local wind speeds in ERA5, which on average reach 40 % of the observed values versus 80 % in CARRA.Box et al. (2023) evaluate five contemporary numerical prediction systems against in situ rainfall data from Greenland stations.CARRA shows the lowest average bias and the highest explained variance.Køltzow et al. (2022) systematically evaluate the representation of 10 m wind speed and 2 m air temperature against observations for the two CARRA domains.The largest differences between CARRA and ERA5 are found in regions with complex terrain and coastlines, as well as over the Arctic sea ice for 2 m air temperature in winter.Over flat terrain, the added value is especially obvious for the air temperature.With these reported advantages in mind, CARRA focuses solely on the European Arctic sector and starts only in 1991.The 3-hourly analysis fields must be combined with short-range forecasts to match the same 1-hourly resolution of ERA5 (Yang et al., 2023).
To classify the strength of the observed CAO, the marine cold-air-outbreak index M (Kolstad et al., 2009;Fletcher et al., 2016b) is calculated based on ERA5 data and an 850 hPa reference level (Papritz et al., 2015;Papritz and Spengler, 2017;Knudsen et al., 2018;Dahlke et al., 2022;Geerts et al., 2022;Mateling et al., 2023).Using the potential temperature θ , M 850 hPa is computed as follows: where θ skin,ocean denotes the potential skin temperature over ice-free ocean.A positive M 850 hPa over a large area indicates the presence of a CAO event.The daily M 850 hPa is averaged temporally from the hourly input data and spatially over a box surrounding Fram Strait.With an extent of 75-80°N and 10°W-10°E, this box is identical with previous studies (Papritz and Spengler, 2017;Dahlke et al., 2022).Consistent with the aforementioned works, CAO events can be classified as weak (M 850 hPa below 4 K), moderate (M 850 hPa between 4-8 K), or strong (M 850 hPa above 8 K).

Trajectory analysis
To evaluate whether the quasi-Lagrangian flight strategy on 1 April 2022 had been a success, both the ERA5 and CARRA three-dimensional wind fields are retrieved on model levels.Note that all 40 released dropsondes were assimilated by ECMWF, which greatly improves the reliability of trajectory calculations.As will be shown, no significant differences in calculated trajectories are found when using ERA5 or CARRA data.A comparison of the very similar wind profiles is given in Figs.S3 and S4.Køltzow et al. ( 2022) also reported only small differences between ERA5 and CARRA wind fields in areas with flat terrain, such as over the Arctic Ocean.
The Lagrangian Analysis Tool (LAGRANTO; Sprenger and Wernli, 2015) is then used to identify quasi-Lagrangian matches, where the same air masses were sampled within a 20 km radius below HALO twice, first at times t 1 and then again at t 2 .Air masses are initialized every 1 min along HALO's flight track, vertically every 5 hPa between 250 hPa and the surface, and horizontally evenly spaced every 7 km in a 20 km radius.In total, 2.1 million trajectories are calculated 6 h forward in time.Caused by the vertical shear of wind direction and wind speed, the sampled air masses start moving in different directions.Only for a certain fraction, due to successful flight planning and/or some luck, are some of the same air masses sampled again in a different location and for a second time.A match is registered if the same air mass is seen again in the column below HALO within the same 20 km radius.In the final step, observations from dropsondes are included.Only those matches in the lowest 2 km are kept where the time difference between the matching air mass below the aircraft and the dropsonde in its time during descent is below 90 s.At a flight speed of around 800 km h −1 , this again corresponds to a maximum distance of 20 km.More details on the quasi-Lagrangian flight strategy during HALO-(AC) 3 can be found in Wendisch et al. (2024).As matches are altitudedependent, from the closest dropsonde the vertically nearest potential air temperature and specific humidity measurements are retained.Potential temperature is chosen instead of regular air temperature to focus on diabatic processes (Papritz and Spengler, 2017; Dahlke et al., 2022).Applying all filters yields approx.24 200 quasi-Lagrangian matches.The net diabatic heating and moistening rates are calculated as The air mass transformations occurring in CAOs are primarily forced by the transition from closed sea ice to ice-free ocean (Pithan et al., 2018;Wendisch et al., 2023).Therefore, the quasi-Lagrangian matches are grouped by the time each air mass has spent over ice-free ocean.For all dropsonde locations, 12 h backward trajectories are calculated using ERA5 and for the air masses in the lowest 10 hPa (approx.100 m) above ground.The sea-ice concentration (SIC) is traced along each trajectory (Fig. S1 in the Supplement).For this purpose, the merged MODIS and ASI-AMSR2 data at 1 km grid resolution generated by the University of Bremen (Ludwig et al., 2020) are interpolated to a 0.05°× 0.05°l atitude-longitude grid.The duration over ocean is defined as the time the air mass spends over ice-free ocean (sea-ice concentration SIC ≤ 20 %) until it first reaches a SIC > 20 %.
While the flight leg of Polar 6 on 1 April 2022 was aligned in parallel with HALO's center leg, it still covered different regions at different times than HALO, not least due to the much lower speed of Polar 6 of around 300 km h −1 .To make data comparable, the same approach is taken as for HALO: every 1 min along the flight track, air masses are initialized.However, due to the in situ sampling method, the air masses are started at the actual flight level of Polar 6 and SIC traced (Fig. S2).As a result, the in situ observations are transformed into the same coordinate system of time over ice-free ocean as for HALO, which is the assumed primary driver of the observed air mass transformations.Due to its limited range, Polar 6 only sampled the first 3 h of the CAO.

Case overview
Figure 1 gives an overview of the conditions on 1 April 2022.The flight track of HALO and the dense grid of dropsondes released west of Svalbard are depicted.The daily averaged CAO index M 850 hPa in the Fram Strait box is found to be 7.7 K.This qualifies the CAO investigated here between a moderate and strong case, following the classification of Papritz and Spengler (2017) and Dahlke et al. (2022).According to the ERA5-based CAO climatology 1979-2020 by Dahlke et al. (2022), the median daily frequency of occurrence for CAOs in the Fram Strait is at around 50 %-70 % both in March and April.Furthermore, events of similar magnitude can be expected at around 40 % of all days (Dahlke et al., 2022).This means that on 1 April 2022, HALO sampled a quite typical event for this region and time of the year.Figure 1 also reveals a maximum M 850 hPa of above 12 K close to the marginal sea-ice zone.This highlights the strong temperature contrasts that the cool Arctic air masses experience when departing the closed Arctic sea ice.
To better comprehend the air mass flow, a set of 40 trajectories is initialized at the location of each dropsonde with 1 min temporal resolution.These trajectories are started at 10 hPa above ground and calculated both forwards and backwards in time over a 24 h period.The ERA5-derived potential temperature is then traced.As seen in Fig. 1, during their drift over closed Arctic sea ice, the near-surface air parcels do not undergo any significant diabatic temperature changes.However, once they cross the MIZ and reach the ice-free ocean, the air masses undergo a rapid diabatic heating of up to 20 K within 24 h.

Sea-ice and cloud structures
Figure 2 depicts a first comparison between observations and the reanalyses.Figure 2a shows the Terra/MODIS corrected reflectance from NASA Worldview for 1 April 2022 (NASA Worldview, 2023).From the satellite imagery, it becomes clear that the Arctic sea ice northwest of Svalbard features many leads on different length scales.However, the MIZ is rather sharp, and the transition from closed sea ice to ice-free ocean water typically occurs within less than 1 km distance.
Over the ice-free ocean, cloud streets due to roll convection are evident.The cloud streets form along the prevailing wind direction.Furthermore, a clear lee effect due to Svalbard's mountain ranges is seen to the west of the archipelago.
Figure 2b shows the corresponding fields as represented by ERA5 at 12:00 UTC noon.Due to its coarse spatial resolution, no leads are modeled in the SIC data fields, and the MIZ width is on a length scale of approximately 80 km.This is a typical MIZ width for ERA5 (Renfrew et al., 2021).Instead of cloud streets, a stratiform liquid and ice containing cloud deck is simulated, which thickens in off-ice direction.Clouds are partly also already formed over closed sea ice.In contrast, clouds in CARRA are exclusively formed over the ice-free ocean; see Fig. 2c.The high spatial resolution allows convection to be modeled.As a result, several distinct cloud streets are reproduced.In addition, CARRA better reproduces the sharp MIZ, which here is on the scale of around 10 km.The sharper MIZ of CARRA in comparison to ERA5 is not only a matter of spatial resolution (2.5 km for CARRA versus 30 km for ERA5).The sea-ice concentrations in ERA5 are derived from the Operational Sea Surface Temperature and Ice Analysis dataset, produced by the UK Met Office (OSTIA; Donlon et al., 2012).OSTIA outputs daily sea-surface temperature and sea-ice concentration fields based on satellite observations, with a native resolution of 0.05°× 0.05°(roughly 6 km).Yet the sea-ice data are based on the EUMETSAT OSI-SAF 401 dataset utilizing 19 GHz and 37 GHz microwave channels at along-track resolutions of coarse 69 and 37 km (Tonboe et al., 2017;Renfrew et al., 2021).On the contrary, CARRA strongly relies on the European Space Agency's Sea Ice Climate Change Initiative product (SICCI; Toudal Pedersen et al., 2017) with a native resolution of 15-25 km.These data are additionally filtered based on the high-resolution sea-surface temperature fields and then regridded to the CARRA grid (Yang et al., 2023).Several authors noted that improved sea-ice and MIZ representation crucially improve the performance of models in the lower-tropospheric layers (Liu et al., 2006;Gryschka et al., 2008;Chechin et al., 2013;Müller et al., 2017;Spensberger and Spengler, 2021).As will be shown later, the magnitude of turbulent heat fluxes is directly correlated to the distribution of sea-ice versus ice-free ocean, which is the primary driver of CAO transformations.Errors in MIZ width can have significant downstream effects over several hundreds of kilometers (Tomassini et al., 2017;Spensberger and Spengler, 2021).

Vertical thermodynamic profiles
Figure 3a shows the profiles of air temperature from observations.Over sea ice, clear temperature inversions are found.The coldest near-surface temperatures reach −27 °C, and the thickness of the inversions is around 0.6-0.9km.As air masses spend more time over ice-free waters, they become warmer near the surface, leading to stronger coupled ABLs and the development of a typical marine stratification.This is accompanied by a steady, linear increase in the calculated BLHs and closely correlated CTHs.
By linearly interpolating all data to 100 m vertical resolution, the temperature differences T between ERA5/CARRA and the observations are computed.The results are shown in Fig. 3b and c.Despite the reanalysis assimilating all the employed dropsondes, the ERA5 profiles show a distinct warm bias in near-surface air temperatures of mean 2 K over Arctic sea ice.Many authors reported on similar warm biases of skin and near-surface air temperatures in ERA5 (Batrak and Müller, 2019;Wang et al., 2019;Tjernström et al., 2021;McCusker et al., 2023).The skin temperatures are generally considered too warm as an insulating layer of snow is missing atop the floating ice, which can introduce surplus heat into the lower atmosphere (Batrak and Müller, 2019;Wang et al., 2019).The surface warm bias turns to a mean cold bias of −1 K at altitudes of 0.25-0.50km.Over the ocean, the mean temperature bias is much lower and reaches −0.5 K at around 1 km altitude.In the CARRA data, the near-surface temperature bias is reduced to an average of 1 K. Similar improvements over ERA5 have been reported by others (Køltzow et al., 2022).However, CARRA also faces challenges in accurately representing temperature inversions.This is reflected in the cold bias of around −1.5 K at altitudes of 0.20-0.40km.
The mean absolute errors (MAEs) of ERA5 and CARRA with regards to measurements are computed.Output from both reanalyses as well as dropsondes is interpolated to a common vertical coordinate of altitude above ground in 10 m steps.To evaluate especially the crucial ABL representation, MAEs are averaged vertically from the surface up to the observation-derived BLHs, plus an additional 200 m margin to capture the dipole pattern of errors.Table 1 summarizes the results separately for dropsondes released over sea ice and ice-free ocean.For air temperature over ice, CARRA shows a slightly smaller MAE of 1.00 K versus 1.14 K for ERA5.Over the ice-free waters of Fram Strait, these errors are significantly reduced in both products, yielding a MAE of 0.39 K in CARRA and 0.44 K in ERA5.
Next, the vertical profiles of specific humidity are examined.Figure 4a depicts the observed profiles, as extracted from the dropsonde measurements.Over sea ice, a uniform and dry ABL is found, where maximum values of around https://doi.org/10.5194/acp-24-3883-2024Atmos.Chem.Phys., 24, 3883-3904, 2024  0.069 g kg −1 0.6 g kg −1 are measured.Near-surface layers are the driest, at around 0.4 g kg −1 .The longer the air masses reside over the sea, the more water vapor is picked up by the lower air mass layers through evaporation from the ocean surface.Over sea ice, ERA5 shows a mean near-surface moist bias of 0.05 g kg −1 (Fig. 4b), as well as a slight dry bias close to the BLHs.Once air masses drift over the sea, a strong dry bias is found throughout the ABL.It increases over time and reaches down to −0.5 g kg −1 , which corresponds to about 30 % of the observed values.CARRA shows different patterns (Fig. 4c).Over the closed ice pack, the lowest 0.2 km shows a negligible humidity bias.However, in higher layers above 0.5 km, a slight moist bias is seen.During the off-ice drift, at first a slight moist and later dry bias becomes obvi-ous; however, this is much smaller compared to the ERA5 reanalysis.The same patterns are found in the quantified MAEs within the ABLs; see again Table 1.Notably, over the ice-free ocean, CARRA's MAE of 0.069 g kg −1 is significantly lower than ERA5's MAE of 0.112 g kg −1 .

Quasi-Lagrangian matches
Figure 5 gives an overview of the quasi-Lagrangian matches calculated with reference to the dropsondes.All matches are colored by the time air masses spent over ice-free ocean.
As described in the Methods (Sect.2), these approximately 24 200 matches are a function of height above ground because not only are the zonal and meridional winds heightdependent, but also the vertical velocity is used for the threedimensional trajectory calculations.This allows air masses to ascend or descend along their horizontal flow.The matches cover 150 km along the prevailing wind direction over the Arctic sea ice and about 200 km along the CAO evolution over ice-free ocean.Naturally, the question arises of how reliable the trajectory calculations presented here are.In previous studies, sometimes additional criteria were applied to prove the reliability of trajectories.These are similar hydrocarbon fingerprints between matches (Methven et al., 2006) or an inert perfluoromethylcyclopentane tracer being deployed (Boettcher et al., 2021).However, such a use of tracers is only possi-  ble in case of in situ sampling.In the CAO presented here, the assimilation of the high-density grid of dropsondes serves as crucial input for ERA5 and CARRA.This can also be seen in the comparison of wind profiles as shown in Figs.S3  and S4.With the exception of the nearest-surface layers, a close match is seen between dropsondes and both reanalyses.Also, as trajectories are only calculated over short spa-tiotemporal scales (on the order of 1-4 h, 50-200 km), small errors can not add up as much.For some research questions, it might be more valuable to investigate transformations over larger spatiotemporal scales, such as was done, for example, for aerosol and hydrocarbon species (Methven et al., 2006;Sanchez et al., 2022).However, as will be demonstrated, the highly important thermodynamic evolution occurs on timescales of a few hours and below.If too much time passes between two matching observations, the "net" rates, e.g. of ( θ/ t) net and ( q/ t) net introduced in Eqs. ( 4) and ( 5), would smooth out short-lived effects even more; models would be increasingly needed to disentangle the net rates calculated over longer time frames into sections of more or less intense transformations.Finally, the approach presented here of initializing and then registering matches for a large number of trajectories within a radius of 20 km around HALO's location is also essential to better assess the statistical significance of matches.Notably, all deviations seen between the aforementioned observed and modeled wind profiles result in an error of less than this 20 km radius over 1-3 h of drift.

Diabatic heating and moistening rates
The evolution of thermodynamic properties in the form of heating and moistening rates is analyzed in a quasi-Lagrangian framework, grouped by time air masses spent over ice-free ocean.
First, the evolution of diabatic heating rates is analyzed.Figure 6a shows the vertically resolved diabatic heating rates based on the quasi-Lagrangian dropsonde observations.No significant differences exist between using ERA5 or CARRA https://doi.org/10.5194/acp-24-3883-2024Atmos.Chem.Phys., 24, 3883-3904, 2024 winds as input.The MAE between ERA5 and CARRA heating rates in the lowest 1.5 km above ground and averaged for all times over ice-free ocean amounts to only 0.13 K h −1 .Note that air parcels of the first category (t ocean = 0-1 h) are drifting almost exclusively over sea ice and leads (Fig. 5).For these air masses, a maximum near-surface warming of around 1.8 K h −1 is found.This possibly stems from some of the leads crossed by the trajectories.However, the heat is contained within the very shallow ABL, and all heating rates above the BLHs of around 0.30 km are around 0 K h −1 .After crossing the MIZ and reaching the ice-free ocean (t ocean = 1-2 h), a very intense surface warming is seen, where values larger than 6 K h −1 are found.This heating is starting to be mixed upwards into the increasingly deep ABL, which reaches BLHs of around 0.70 km.After the initial rapid exposure of the cold and dry Arctic air masses to the much warmer ocean surface, the heating in the lowest layers declines rapidly and stays around 2 K h −1 .The vertical mixing is now dominating and leads to almost homogeneous mixing within the lowest 0.75 km of the troposphere.Interestingly, some layers above show regions with negative heating rates, i.e., a net cooling of air masses at altitudes around the BLHs.An analysis of ERA5 temperature tendencies indicates that this is not a sign of a net cloud-top radiative cooling effect but instead of the mixing of colder near-surface air with the original, overlying warmer air (Fig. S5).Previous airborne studies have shown that in the early stages of CAOs, the heat budget has several sources.Notably, the strong surface heat fluxes are reinforced by entrainment fluxes with the warmer inversion aloft.The relative contributions of the heat sources change with distance from the sea-ice edge.Over leads in the MIZ, entrainment heat fluxes exceeding 30 % of the surface heat fluxes have been observed in single cases (Tetzlaff et al., 2015).Shortly after passing the sea-ice edge, this ratio can initially increase to 80 %, which then again decreases to 30 % for fetches over 150 km (Brümmer, 1996).In the region of deep, cellular convection, condensation can even become the dominating contributor to air-mass heating (Brümmer, 1996).Overall, these previous studies indicate that the diabatic cooling near cloud tops corresponds to a warming below cloud tops, caused by the entrainment fluxes.The ERA5-derived heating rates reflect the general features of the observations (Fig. 6b).This can be attributed to the assimilation of all 40 dropsondes into ERA5.However, some important differences are found.Over the Arctic sea ice, ERA5 shows BLHs almost twice as high as that seen in observations.While the slight surface warming of around 1.6-1.8K h −1 is also seen in ERA5, it shows excess heat that it mixes upwards towards the BLH.The intense surface warming at t ocean = 1-2 h is not represented in ERA5.This is in agreement with the sea-ice distribution shown on the overview map in Fig. 2b, which revealed a wide MIZ on the order of 80 km -much wider than that shown in the observations.As a result, the initial stage of the CAO is delayed in ERA5.The later stages (t ocean > 2 h) of the CAO, however, are represented rather well, yet again with an exagger-ated vertical mixing.The essential feature of negative heating rates in higher altitudes is captured.
Figure 6c shows the heating rates extracted from the CARRA product.Generally, these settle in between the observations and ERA5.All BLHs are lower than in ERA5 and significantly closer to the observed values.For t ocean = 1-2 h, the observed intense warming rate higher than 6 K h −1 is also not represented fully, yet it is much better than in ERA5.A maximum value for the near-surface heating of around 4 K h −1 is found, which is homogeneously mixed upwards up to 0.40 km altitude.This might in part be caused by the much sharper MIZ, which is closer to reality (Fig. 2c).
In the early stage of CAOs, the primary source for turbulent heat fluxes is the warm ocean surface (Brümmer, 1996;Pithan et al., 2018).To study the evolution of the heating profiles along the CAO, it is thus essential to investigate the surface sensible heat fluxes (SSHFs).A comparison between the observation-derived and reanalysis-based SSHFs is given in Fig. 7a.All data are grouped by time spent over ice-free ocean.Over the Arctic sea ice and MIZ, the observationderived SSHFs show a mean of below 50 W m −2 , with the 95th percentile peaking above 150 W m −2 .However, as was outlined in Sect.2, the computation of turbulent heat fluxes over the sea ice and MIZ based on dropsondes is prone to high uncertainty.Both reanalyses show higher values, especially ERA5.For t ocean = 0-1 h, this corresponds to heating rates in both reanalyses being slightly too large and a mixing that is exaggerated.
After the air masses cross the MIZ, large values of SSHF of around 520 W m −2 are observed.Such values are typical in CAOs (Shapiro et al., 1987).ERA5 significantly underestimates the observed SSHFs by at least 130 W m −2 , while CARRA slightly exaggerates it.Later into the CAO, the observed SSHFs drop slightly yet are again best captured by CARRA.In general, the reduction of SSHFs over time is expected, as the temperature difference between the sea surface and the overlying air is reducing.This can potentially be counterbalanced, for example, by increasing underlying seasurface temperatures, increased winds, or decreased surface roughness (Papritz and Spengler, 2017).
The different parameters that are required for the calculations of SSHF as shown in Eq. ( 1) are investigated in Fig. S6.Notably, over ocean both reanalyses significantly underestimate U 10 m , with CARRA being always closer to the observations.The horizontal thermal gradient between sea ice and the ice-free water surface causes a marked off-ice breeze, an analogue to sea-land breezes.Similar to that reported by Brümmer (1996), in our case, U 10 m reached its maximum near the sea-ice edge, and the off-ice acceleration due to thermal contrasts is estimated to be around 2.6 m s −1 h −1 (calculations can be found in the Appendix).Therefore, the U 10 m in CARRA might be closer to observations than ERA5 as (i) the MIZ is narrower in CARRA and (ii) the discussed nearsurface warm bias over sea ice is weaker in CARRA.Previous studies have also found ERA5 underestimates the highest near-surface winds over the ocean next to the MIZ, as well as SSHFs and SLHFs over the MIZ (Renfrew et al., 2021).Feeding coarse-resolution sea-ice data (with a MIZ of around 80 km, such as in ERA5) into higher-resolution models was also found to smear out the rapid increases in air temperature, wind speed, and surface fluxes (Renfrew et al., 2021).
In order to evaluate whether the differences between CARRA and ERA5 discovered for 1 April 2022 are of a systematic nature, a climatological comparison of SSHFs from both reanalyses for 1991-2022 can be found in Fig. S8.It shows that during CAO conditions, CARRA SSHFs are systematically larger than ERA5 SSHFs, and this is consistent over several decades.It is especially pronounced over ocean and corroborates our results for the case study of 1 April 2022.Similar systematic differences in the output surface turbulent heat fluxes have been reported also for comparisons of other reanalyses (Zhang et al., 2016;Taylor et al., 2018).Underestimated fluxes result in too-low uptake rates for heat and moisture, particularly close to the ice edge (Tomassini et al., 2017;Spensberger and Spengler, 2021).However, similar studies like Slättberg et al. (2023) are required for a deeper systematic evaluation of ERA5 versus CARRA, for example, to disentangle the combined effects on U 10 m as caused by MIZ width, parameterized surface roughness, or synoptic patterns.
Figure 8a shows the vertically resolved moistening rates based on the quasi-Lagrangian dropsonde observations.The MAE on the observed moisture uptake rates based on ERA5 versus CARRA trajectories as input is very low at 0.01 g kg −1 .For the air masses mostly sampled over sea ice, only minimal moisture uptake is found.The highest uptake at t ocean = 1-2 h reaches around 0.4 g kg −1 h −1 at the surface.For longer times over the ice-free ocean, this moisture is then quickly mixed upwards.The magnitude of upward mixing partly exceeds the moisture uptake near the surface at later stages.
Figure 8b shows the corresponding rates as extracted from ERA5.ERA5 underestimates the near-surface moistening rates significantly; also layers further up show rates which are 2-3 times too low.CARRA performs better than ERA5 (Fig. 8c).Not only are the near-surface moistening rates closer to observations, but also the upward mixing is more realistic.An insufficient moistening rate within the lower troposphere during a CAO can be caused by (i) an insufficient supply of moisture from the surface, i.e., too-low SLHF, and/or (ii) an exaggerated removal of water vapor from the atmospheric column.Here, we check both factors separately.
Figure 7b compares the SLHFs between observations and the two reanalyses.Over sea ice, very low SLHFs with a median below 25 W m −2 are found.Over the ocean, both ERA5 and CARRA underestimate the SLHF, and they are close to each other.Surprisingly, ERA5 always predicts slightly higher SLHFs than CARRA, which at first seems not to agree with the much lower moistening rates.Also the climatological comparison under CAO conditions shows that ERA5 https://doi.org/10.5194/acp-24-3883-2024Atmos.Chem.Phys., 24, 3883-3904, 2024  SLHFs exhibit a constant bias towards larger values than in CARRA (Fig. S8).Overall, these findings hint towards mechanisms in ERA5 leading to an exaggerated removal of water vapor, namely cloud processes and precipitation.This is investigated in the next section.

Cloud properties
To help understand possible errors in the reanalyses connected to cloud physics, HAMP radar reflectivities from aboard HALO (Mech et al., 2014) as well as in situ Nevzorov measurements of cloud liquid water and ice contents by the Polar 6 aircraft are utilized (Lucke et al., 2022).A deeper investigation of cloud microphysical processes, such as riming, precipitation formation, or cloud street aspect ratios, is outside the scope of this article.However, details on these Figure 9a shows the measured HAMP radar reflectivity profiles averaged for 2 min around the time of each dropsonde release and up to 6 h into the CAO.The profiles are plotted as function of time air masses spent over icefree ocean.For the locations over sea ice, either very low (radar reflectivity below −25 dBz) or no radar signals were recorded.As the air moves onto open waters, the radar reflectivities increase, and cloud tops are seen to rise linearly.Radar reflectivities for the first time cross the threshold of −5 dBz only for t ocean > 1 h.This indicates the presence of precipitation (Schirmacher et al., 2023;Maahn et al., 2014).
As Polar 6 has a range much lower than HALO, it was able to only sample the first 3 h of the CAO. Figure 9b-d shows the height-resolved measured specific cloud liquid water contents q liq .Over the closed sea ice (Fig. 9b), in the sampled lowest 0.6 km above ground, no cloud liquid water was found, yet with low amounts at the top of the ABLs.After the drift across the MIZ, noticeable amounts of cloud water of up to 0.07 g kg −1 are seen up to around 1 km altitude, which corresponds to the altitude of moisture uptake.Surprisingly, the liquid water content decreases in the next time step.This might be correlated with an increase of the frozen hydrometeors (i.e., cloud ice and snow) depicted in Fig. S7a.Several in situ probes confirm the occurrence of riming during the flight of Polar 6 (Maherndl et al., 2023).With the air temperatures always in the range of −25 to 0 °C (see Fig. 3), mixed-phase clouds are possible, and also the typical pattern of a supercooled layer above the ice layer is reproduced.
Figure 9c shows the cloud structures as reproduced by ERA5.ERA5 tends to overestimate the amount of liquid water present in the clouds.A similar enhanced abundance of liquid-bearing clouds especially over sea ice has been reported for the IFS, the model behind ERA5 (Tjernström et al., 2021;McCusker et al., 2023).In the CAO case here, this is in contrast to CARRA (Fig. 9d).With the exception of missing the strong increase in liquid clouds at t ocean = 1-2 h, CARRA matches the observations well.The overabundance of liquid-bearing clouds in ERA5 might explain the exaggerated vertical mixing of heat in ERA5.Even though ERA5 featured lower SSHFs compared to CARRA, the more pronounced cloud tops might lead to stronger entrainment fluxes.Also, the condensation into cloud droplets releases additional heat to the surrounding air masses.
Furthermore, total precipitation at the surface is much higher in ERA5 than in CARRA, which creates an additional sink for atmospheric moisture already over sea ice (Fig. 10).Precipitation over sea ice is unlikely based on the presented HAMP radar reflectivities.The −5 dBz threshold is only exceeded at t ocean > 1 h.This threshold translates to a precipitation rate of around 0.02-0.09mm h −1 (Maahn et al., 2014;Schirmacher et al., 2023).The statistical comparison between ERA5 and CARRA presented in Fig. S8 substantiates the finding of this case study.ERA5 has a strong bias to form liquid-bearing clouds already over sea ice and the MIZ.Over the ocean, there also is a strong bias towards higher cloud liquid hydrometeor contents.The ERA5 clouds systematically precipitate more strongly over the MIZ and ocean than clouds in CARRA.The significance of our findings is reinforced by McCusker et al. (2023), who showed that issues such as an overabundance of low, liquid-bearing clouds https://doi.org/10.5194/acp-24-3883-2024Atmos.Chem.Phys., 24, 3883-3904, 2024 can propagate into higher-resolution models through largescale forcings.

Summary and conclusions
We have presented a combined Eulerian and quasi-Lagrangian analysis of an Arctic marine cold-air outbreak (CAO).The CAO was closely sampled as part of the HALO-(AC) 3 airborne campaign on 1 April 2022 in the Fram Strait, west of Svalbard.It was a representative CAO, which can be considered typical for this location and time of year as well as in its intensity.The performance of two state-of-theart atmospheric reanalyses, ERA5 and CARRA, was evaluated against the measurements, with a focus on thermodynamic (air temperature, humidity) and cloud (cloud liquid water content) properties.We furthermore apply the quasi-Lagrangian approach to convert observations from both HALO and Polar 6 into a common coordinate system (time over ocean).The spatiotemporally highly resolved airborne measurements allow for a thorough characterization of the state of the lower troposphere over Arctic sea ice, the marginal sea-ice zone (MIZ), and the ice-free ocean.To the best of our knowledge, this is the first report on heightresolved diabatic heating and moistening rates in a developing CAO directly derived from quasi-Lagrangian observations.Going back to the research questions posed at the beginning of this article, we can answer them as follows.
Q1. How do air temperature, specific humidity, and clouds evolve in the first 4 h of the developing CAO?Still over sea ice, some leads cause a weak heating and moisture uptake into the shallow atmospheric boundary layer of around 0.2 km height.Within the first hour of departing the closed sea ice, the strong contrast between the cold and dry Arctic air masses and the much warmer ocean causes a diabatic heating larger than 6 K h −1 at the surface, a moisture uptake of more than 0.3 g kg −1 h −1 , and the formation of mixed-phase clouds.As time progresses and clouds start forming, heat and moisture mix upwards vertically in the developing marine boundary layer.After 4 h, the atmospheric boundary-layer height exceeds 1.5 km.At around the boundary-layer heights, a slight net diabatic cooling and moisture loss are registered, which can be attributed to the upward mixing of air masses into the original, overlying warmer air.
Q2. How do the ERA5 and CARRA reanalyses perform with respect to observations and compared to each other?In the Eulerian (i.e., fixed in space) framework, the coarse-resolution ERA5 reproduces some well-known issues.The skin and near-surface air temperatures are exaggerated, atmospheric boundary-layer heights are too large, and too many clouds are present.
CARRA significantly improves all of these issues: for air temperature over sea ice, ERA5 features a mean absolute error (MAE) 14 % higher than CARRA (1.14 K versus 1.00 K), while for specific humidity over ice-free ocean the MAE is found to be 62 % higher in ERA5 compared to CARRA (0.112 g kg −1 versus 0.069 g kg −1 ).Taking the quasi-Lagrangian perspective, the heating rates are reasonably reproduced both in ERA5 and CARRA.However, the strong initial surface-based heating is not captured by ERA5.Even more pronounced are the differences in the moistening rates, where ERA5 estimates are up to 3 times too low and much better captured by CARRA.Overall, our height-resolved diabatic heating and moistening rates extend the quasi-Lagrangian, ERA5-based climatological CAO investigation of Papritz and Spengler (2017) to the vertical dimension.However, as the intense fluxes and transformations in the MIZ are not represented well by ERA5, the heating and especially moistening rates reported by them are likely biased towards lower values.
Q3.What are some possible sources of errors which could explain deviations between reanalysis output and observations?Generally, uncertainties in reanalyses can stem from insufficient spatiotemporal resolution, different measurement sets being assimilated, and also the underlying model physics.The observed discrepancies between the two reanalyses and the observations result from the complex interplay of several processes.Over sea ice, the missing snow on ice layer leads to skin and near-surface air temperatures being too high in ERA5, which might explain the exaggerated boundarylayer heights.Moreover, it is well established that the MIZ is too wide in ERA5.Thus, turbulent fluxes are underestimated significantly in the first 2 to 3 h of the CAO.The reduced 10 m wind speeds might be related to the too-wide MIZ as well.Especially for the surface sensible heat flux, CARRA greatly improves on this.ERA5 forms liquid-bearing clouds too early and too thick.This can be due to too-warm initial temperatures or due to issues with the parameterization of the mixed-phase clouds, as well as a combination of both.In all stages investigated, ERA5 clouds thus precipitate considerably more than in CARRA, and too much water vapor is lost to this sink.A similar propagation of errors in initial conditions has been previously reported to affect the atmospheric state hundreds of kilometers downstream.
Overall, we find CARRA fulfilling its intended goal of improving on the global ERA5 reanalysis with regard to the thermodynamic and cloud evolution, based on the parameters investigated in the critical first 4 h of the CAO.CARRA might thus be better suited for driving higher-resolution models, such as large eddy simulations.While some climatological comparisons of differences between ERA5 and CARRA were supplied, deeper investigations are required to further support the statistical significance of our findings and to determine which components of CARRA are primarily responsible for the improvements.Ideally, these analyses should include extended data rows of observations, such as from regular radiosonde launches.Finally, the unprecedented quasi-Lagrangian observations collected during HALO-(AC) 3 pose a rich database for future studies.For example, sensitivity studies could reveal the influence that initial aerosol concentrations (cloud-condensation nuclei, ice-nucleating particles) and different cloud schemes (one-moment or two-moment) have on the vertical mixing of heat and moisture, especially considering the intense surface forcings and additional entrainment fluxes.
Disclaimer.Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper.While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.
The results presented here contain modified C3S information.Neither the European Commission nor ECMWF is responsible for any use that may be made of the Copernicus information or data it contains.
Special issue statement.This article is part of the special issue "HALO-(AC) 3 -an airborne campaign to study air mass transformations during warm-air intrusions and cold-air outbreaks".It is not associated with a conference.
Acknowledgements.We gratefully acknowledge the funding and support of TRR 172 by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation), within the Transregional Collaborative Research Center's Arctic Amplification: Climate Relevant Atmospheric and Surface Processes, and Feedback Mechanisms (AC) 3 .We are furthermore grateful for funding and support by DFG within the framework of the priority program SPP 1294 to promote research with HALO.The publication of this article was supported by the Open Access Publication Funding program of the Publishing Fund of Leipzig University.We cordially thank the Alfred Wegener Institute, German Aerospace Center (DLR), as well as all aircraft crew and participants which made the HALO-(AC) 3 campaign possible.We also thank the Institute of Environmental Physics at the University of Bremen for providing the merged MODIS-AMSR2 sea-ice concentration data (https: //data.seaice.uni-bremen.de/modis_amsr2,last access: 20 October 2023).We acknowledge the use of imagery from the NASA Worldview application (https://worldview.earthdata.nasa.gov/,last access: 14 November 2023), part of the NASA Earth Observing System Data and Information System (EOSDIS).
Financial support.This research has been supported by the Deutsche Forschungsgemeinschaft (grants nos.268020496 and 316646266).
Review statement.This paper was edited by Franziska Aemisegger and reviewed by two anonymous referees.

Figure 1 .
Figure 1.Case overview.The gray line shows the flight track of HALO on 1 April 2022.Diamond shapes show the locationswhere dropsondes were released.White-blueish contours represent 1 km high-resolution sea-ice concentration retrieved from a merged MODIS-AMSR2 satellite product(Ludwig et al., 2020).Over the ice-free ocean, yellow-brownish contours indicate the ERA5-derived CAO index M 850 hPa ; see Eq. (3).Finally, the colored lines show 24 h backwards and 24 h forward trajectories initialized at the location of each dropsonde at 10 hPa above ground.The colors represent the evolving potential temperature (θ ) of these air masses as traced from ERA5 data.

Figure 2 .
Figure 2. Sea-ice and cloud structures to the northwest of Svalbard on 1 April 2022 based on observations and reanalyses.In each subplot, the red line shows the flight path of HALO, and diamond shapes show the locations of released dropsondes.The shapes are colored by time near-surface air masses spent over ice-free ocean (sea-ice concentration SIC below 20 %; Ludwig et al., 2020).(a) The Terra/MODIS corrected reflectance shows the formation of cloud streets shortly after the off-ice drift.The image is taken from NASA Worldview (2023).(b) ERA5 data at 12:00 UTC.The SIC (filled contours) and the total column cloud liquid and ice water (contour lines) are shown.(c) CARRA data at 12:00 UTC.SIC (filled contours) and the total combined column cloud liquid, ice, and graupel water (contour lines) are depicted.

Figure 3 .
Figure 3. Vertical profiles of air temperature (T ) in the lowest 2 km above ground taken from observations and reanalyses.In all panels, profiles are colored by the time air masses spent over open ocean.(a) Observed profiles of air temperature.Measurement-derived atmospheric BLHs and lidar-derived CTHs are indicated on the left-hand side.(b) Deviation of the ERA5 profiles from the observed profiles, and (c) deviation of the CARRA profiles from the observed profiles.

Figure 4 .
Figure 4. Same as Fig. 3 but for specific humidity in the lowest 2 km above ground.(a) Observed profiles of specific humidity.(b) Deviation of the ERA5 profiles from the observed profiles, and (c) deviation of the CARRA profiles from the observed profiles.

Figure 5 .
Figure 5. Spatial overview of the location of matching ERA5 trajectories, which were calculated with respect to a 20 km circle around the dropsondes.Matching lines are colored by the time air masses spent over ocean.The background Terra/MODIS satellite image is taken from NASA Worldview (2023).

Figure 6 .
Figure 6.Diabatic heating rates grouped by time air masses spent over the ice-free ocean.On the left side of each panel, the mean BLH in each class is plotted as a square.Lines depict the mean values and shading the 25th-75th percentiles.(a) Heating rates based on the quasi-Lagrangian dropsonde observations.Solid lines are from ERA5 trajectories, and the dashes lines are from CARRA trajectories.(b) Corresponding heating rates extracted from ERA5.(c) Corresponding heating rates extracted from CARRA.

Figure 7 .
Figure 7. (a) Surface sensible heat fluxes (SSHFs) and (b) surface latent heat fluxes (SLHFs) as derived from observations, ERA5, and CARRA.Box plots show the median as thick lines, the 25th-75th percentiles as boxes, and the 5th-95th percentiles indicated as whiskers.Data are grouped by time over ice-free ocean.

Figure 8 .
Figure 8. Moistening rates expressed as the change of specific humidity q per hour as a function of the time air masses spent over the ice-free ocean.Lines depict the mean values and shading the 25th-75th percentiles.(a) Moistening rates based on the quasi-Lagrangian dropsonde observations.Solid lines are from ERA5 trajectories, and the dashes lines are from CARRA trajectories.(b) Corresponding moistening rates extracted from ERA5.(c) Corresponding moistening rates extracted from CARRA.

Figure 9 .
Figure 9. Vertical profiles of cloud structures, sorted by time air masses spent over ice-free ocean.(a) Profiles of the HAMP radar reflectivities measured aboard HALO.(b) Profiles of cloud liquid water content q liq., based on a Nevzorov sonde aboard Polar 6, (c) q liq.taken from ERA5, with the observed values as dashed lines, and (d) q liq.taken from CARRA, with the observed values as dashed lines.

Figure 10 .
Figure 10.Total precipitation reaching surface level for ERA5 and CARRA.Box plots show the median as thick lines, the 25th-75th percentiles as boxes, and the 5th-95th percentiles as whiskers.Data are grouped by time over ice-free ocean.

Table 1 .
Mean absolute errors (MAEs) of ERA5 and CARRA profiles compared to observations.The MAEs are averaged vertically up to the observed boundary-layer heights, plus an additional 200 m margin.Results are shown for the variables air temperature (T ) and specific humidity (q), grouped by surface type.Profiles are classed as sea ice (open ocean) if the AMSR2 sea-ice concentrations is above (below) 50 %.