Radiative closure and cloud effects on the radiation budget based on satellite and shipborne observations during the Arctic summer research cruise , PS 106

For understanding Arctic climate change, it is critical to quantify and address uncertainties in climate data records on clouds and radiative fluxes derived from long-term passive satellite observations. A unique set of observations collected during the PS106 expedition of the research vessel Polarstern (28 May to 16 July 2017) by the OCEANET facility, is exploited here for this purpose and compared with the CERES SYN1deg ed. 4.1 satellite remote-sensing products. Mean cloud fraction (CF) of 86.7 % for CERES SYN1deg and 76.1 % for OCEANET were found for the entire cruise. The difference of CF between both data sets is due to different spatial resolution and momentary data gaps, which are a result of technical limitations of the set of shipborne instruments. A comparison of radiative fluxes during clear-sky (CS) conditions enables radiative closure (RC) for CERES SYN1deg products by means of independent radiative transfer simulations. Several challenges were encountered to accurately represent clouds in radiative transfer under cloudy conditions, especially for icecontaining clouds and low-level stratus (LLS) clouds. During LLS conditions, the OCEANET retrievals were particularly compromised by the altitude detection limit of 155 m of the cloud radar. Radiative fluxes from CERES SYN1deg show a good agreement with ship observations, having a bias (standard deviation) of −6.0 (14.6) and 23.1 (59.3) Wm−2 for the downward longwave (LWD) and shortwave (SWD) fluxes, respectively. Based on CERES SYN1deg products, mean values of the radiation budget and the cloud radiative effect (CRE) were determined for the PS106 cruise track and the central Arctic region (70–90 N). For the period of study, the results indicate a strong influence of the SW flux in the radiation budget, which is reduced by clouds leading to a net surface CRE of −8.8 and −9.3 Wm−2 along the PS106 cruise and for the entire Arctic, respectively. The similarity of local and regional CRE supports the consideration that the PS106 cloud observations can be representative of Arctic cloudiness during early summer.

bination of observations and modelling is needed to better understand Arctic clouds and climate (Kay et al., 2016).
A key advantage of using satellite instruments for investigating changes in the climate system, is the provision of consistent observations covering relatively long time periods with near-global spatial coverage (Stubenrauch et al., 2013;Christensen et al., 2016;Huang et al., 2017). For example, Hartmann and Ceppi (2014) find large changes in the Arctic radiation budget at the top of the atmosphere (TOA), based on CERES (Clouds and the Earth's Radiant Energy System) observations, with trends of −5 and 3 W m −2 per decade for the SW and LW net fluxes, respectively. In contrast, Devasthale et al. (2016) report large discrepancies and no significant trends in cloud data records based on the Advanced Very-High-Resolution Radiometer (AVHRR). Hence, these observations require critical evaluation to identify their shortcomings before they can be used to diagnose processes responsible for Arctic climate change.
Due to the limited availability of ground-based observations in the Arctic, only a few studies have compared ground-and satellite-based observations of clouds and radiative fluxes. Dong et al. (2016) present a radiative closure (RC) study comparing ground-based and satellite retrievals of microphysical and radiative properties of single-layer clouds at the Atmospheric Radiation Measurement North Slope of Alaska (ARM NSA) site at Utqiaġvik, Alaska. Considering overpasses of the Terra and Aqua satellites from 2000 to 2006, and the CERES Synoptic 1 • daily flux (SYN1deg, eds. 2 and 4) products (Loeb et al., 2009;Rutan et al., 2015;Minnis et al., 2020), this study reports good agreement of retrievals of liquid water path (Q L ) and cloud optical depth (τ ) under both snow-covered and snow-free surface conditions. Using ARM and CERES SYN1deg cloud retrievals as input for radiative transfer simulations, the modelled downward SW fluxes (SWD) agree with the corresponding ARM observations and CERES SYN1deg products within 10 W m −2 .
The investigation by Riihelä et al. (2017) presents an intercomparison between ground-based observations and several satellite products of surface (SFC) radiative fluxes. Downward and upward LW and SW radiative flux observations, from the Tara drifting ice camp and long-term observations on the Greenland Ice Sheet, are compared to the CERES SYN1deg ed.3A, FluxNet, and Satellite Application Facility on Climate Monitoring cLoud, Albedo and RAdiation (CLARA) data sets (Karlsson et al., 2017), and the Global Energy and Water Exchanges (GEWEX) Surface Radiation Budget (SRB; (Wu and Fu, 2011)). This study concludes that CERES SYN1deg has the smallest root-mean-square error (RMSE) compared to in situ fluxes. This study recommends further investigation of differences in the surface and cloud properties that lead to discrepancies in flux retrievals.
Alongside RC studies with satellite products, groundbased long-term observations can provide independent estimates of the cloud radiative effect (CRE), and thus show how clouds influence the radiation budget. For example, Ebell et al. (2020) simulated radiative fluxes based on more than 2 years (June 2016 to September 2018) of cloud properties retrieved from remote-sensing observations with the Cloudnet algorithms at the atmospheric observatory of the Arctic French-German research station -The German Alfred Wegener Institute for Polar and Marine Research and French Polar Institute Paul Emile Victor (AWIPEV) -at Ny-Ålesund, Norway . Simulated and observed surface broadband LW and SW fluxes were compared for allsky (AS), clear-sky (CS), and cloudy conditions. For AS conditions, a mean difference of 3.1 and 0.2 W m −2 was found for the SWD flux and the downward LW (LWD) flux, respectively, confirming RC for the study period. Based on the simulations, estimates of CRE at the SFC (CRE SFC ), TOA (CRE TOA ), and within the atmosphere (CRE ATM ) were derived. For the location of Ny-Ålesund, the annual averages revealed a surface warming by clouds of about 11.1 and −16.1 W m −2 at the TOA, with significant variability observed particularly during summer.
In contrast to large-scale observations, field campaigns provide a higher degree of detail to investigate relevant physical processes, and to study remote locations with previously poor observational coverage. The Surface Heat Budget of the Arctic Ocean (SHEBA) programme investigated the physical processes governing the surface energy budget and sea-ice mass balance, covering the Beaufort Sea and a complete annual cycle from October 1997 to October 1998 . Based on SHEBA data, Shupe and Intrieri (2004) determined the CRE SFC , considering the influence of surface albedo (α) and cloud properties in detail. This study found that low-level liquid clouds had the largest radiative effect. This study also recommended the use of polarizationsensitive lidars to improve the differentiation of ice crystals and cloud droplets, and to generally improve retrieval algorithms for estimating liquid water content (q L ) towards a better understanding of the radiative effect of Arctic clouds.
With the aim to better understand the physical mechanisms underlying Arctic Amplification, the project Arctic Amplification: Climate Relevant Atmospheric and SurfaCe Processes and Feedback Mechanisms (AC) 3 held two major field campaigns in the early summer of 2017, northwest of Svalbard (Wendisch et al., 2019). Both campaigns performed in situ and remote-sensing observations over the Arctic Ocean. While ACLOUD (Arctic Cloud Observations Using Airborne Measurements during Polar Day) was an aircraft campaign, the PS106 shipborne campaign took place aboard the German research vessel (R/V) Polarstern. The PS106 expedition consisted of two legs, PASCAL (Physical Feedbacks of Arctic Boundary Layer, Sea Ice, Cloud, and Aerosol; PS106/1) from 24 May to 21 June 2017, and with some observations continuing during SiPCA (Survival of Polar Cod in a Changing Arctic Ocean; PS106/2) from 23 June to 20 July (Macke and Flores, 2018). The mobile remotesensing platform OCEANET-Atmosphere (hereafter denoted as OCEANET) was set up aboard Polarstern (Macke, 2009;Kalisch and Macke, 2012;Hanschmann et al., 2012;Kanitz et al., 2013;Griesche et al., 2020). The atmospheric observations collected with OCEANET were subsequently utilized to derive macro-and microphysical properties of clouds based on the Cloudnet algorithm (Illingworth et al., 2007;Griesche et al., 2020).
The present paper aims to investigate the radiative effects of clouds and their influence on the radiation budget during the PS106 expedition. Radiative fluxes are simulated based on reanalysis profiles and observed cloud properties. They are compared to the CERES SYN1deg ed. 4.1 products (hereafter referred to as CERES SYN1deg), as well as shipborne flux observations as a basis for a radiative closure assessment. From these data, an estimate of CRE and the radiation budget during PS106 is given. Our investigation specifically attempts to answer the following questions: 1. How consistent are cloud properties derived from the ship-based observations and the CERES SYN1deg cloud products?
2. How closely do our radiative transfer simulations agree with the CERES SYN1deg SFC fluxes and Polarstern flux observations under CS conditions?
3. Under which circumstances is it possible to confirm RC between flux observations, our radiative transfer simulations, and the CERES SYN1deg products?
4. What was the radiation budget during PS106, how was it influenced by clouds as quantified by CRE, and how representative were the local conditions for the entire Arctic?
The paper is structured as follows: first, Sect. 2 presents a description of the observations and methods. Results and discussion follow this description in Sect. 3. Its first part summarizes the atmospheric conditions during PS106, followed by a sensitivity study to determine the uncertainties of the radiative transfer simulations. The differences between simulations and observations are quantified for specific case studies, and the entire PS106 expedition is discussed in Sect. 3.3. and 3.4, respectively. Based on these results, Sect. 3.5 presents the estimated radiation budget and CRE for PS106 and the entire Arctic. Finally, the paper closes with conclusions and an outlook to future research in Sect. 4.

Observations and methods
This section describes the shipborne observations and resulting cloud products, as well as the satellite-based cloud and radiation products used as basis for this study. Furthermore, a description of ancillary data, used for the radiative transfer simulations and to describe the surface and atmospheric conditions, are presented alongside the method used for the radiative transfer simulations. In addition, a classification of sky conditions based on the shipborne data sets and the setup used for the broadband radiative transfer simulations are introduced.

Shipborne instrumentation
The OCEANET facility was established to provide continuous atmospheric observations aboard research vessels such as the German icebreaker Polarstern. It has been operated since 2009 during transfer cruises between the hemispheres, crossing the Atlantic Ocean from Bremerhaven in Germany, to Punta Arena in Chile, and Stellenbosch or Cape Town in South Africa (Macke, 2009;Kalisch and Macke, 2012;Hanschmann et al., 2012;Kanitz et al., 2013). In 2017, OCEANET was operated for the first time in the Arctic Ocean during two legs of the PS106 expedition named PAS-CAL and SiPCA, respectively (see Fig. 1; Macke and Flores, 2018;Griesche et al., 2020).
The instrumentation of OCEANET consists of a multiwavelength Raman polarization lidar, Polly XT , a 14-channel microwave radiometer (MWR), a Humidity And Temperature PROfiler (HATPRO), and a fish-eye sky camera. For the first time in the standard OCEANET observations, a Doppler cloud radar of type Mira-35 was installed about 10 m above the container during the PS106 cruise, complementing the OCEANET observations. The cloud radar can profile optically thick clouds and measure Doppler spectra produced by the vertical cloud motion due to its polarimetric capabilities, and valuable information on the hydrometeor type can also be derived. These remote-sensing instruments are combined with observations by an optical disdrometer of type OM470, and Vaisala RS92-SGP radiosondes launched every 6 h as input to the Cloudnet processing (see Sect. 2.1.2;Griesche et al., 2020).
On the roof of the measurement container, a CMP21 broadband pyranometer (0.285-2.8 µm) and a CGR4 broadband pyrgeometer (4.5-42 µm), both manufactured by Kipp & Zonen, were installed. For surface radiation measurements in polar regions, the accuracy of these instruments is expected to be within ±10 and ±20 W m −2 for pyrgeometers and pyranometers, respectively (Lanconelli et al., 2011). In the case of the pyrgeometer, additional uncertainties due to variation in the vertically integrated water vapour (IWV) column need to be considered. At values below 10 mm of IWV, the atmospheric window produces spectral inhomogeneities, which distort the pyrgeometer measurements (Gröbner et al., 2014). Gröbner and Wacker (2015) compared the performance of a calibrated World Infrared Standard Group (WISG) and a CGR4 pyrgeometer, and found that for specific periods, deviations can reach up to ±3 W m −2 . Thus, in this study, the total instrumental uncertainty for the pyrgeometer is assumed to be ±13 W m −2 . Operating these instruments aboard Polarstern might introduce additional uncertainties due to harsh environmental conditions like the exhaust plume of the ship, and the superstructures of the ship interfering with the observations (see Fig. A1).
In addition, a meteorological station measuring atmospheric pressure, temperature, and relative humidity with a sensor of type Series EE33 from E+E Elektronik, was installed at about 10 m above sea level (m a.s.l.) next to the OCEANET container during PS106. The measured values of this near-surface temperature are presented in Fig. 2b.

Cloudnet products and description of sky conditions
The Cloudnet project targets a systematic evaluation of the representation of clouds in the weather forecast and climate models (Illingworth et al., 2007). Within its scope, a robust suite of algorithms has been developed for retrieving vertical profiles of macro-and microphysical properties of clouds from a synergistic combination of cloud radar, lidar, and MWR observations. A complete description is given in Illingworth et al. (2007), while details about its adaptation for PS106, including a new approach for the continuous determination of the ice effective radius (r E,I ), is presented in Griesche et al. (2020). This section provides a summary of the Cloudnet products used as the basis of the present study, i.e. the location of the cloud boundaries, as well as vertical profiles of q L , liquid effective radius (r E,L ), ice water content (q I ) and r E,I . As a first step, the measurements are averaged onto a common pixel grid with a vertical and temporal resolution of 31.18 m and 30 s, respectively, leaving a total of 595 vertical pixel grids and, in general, more than 2700 time steps . Then, each pixel is categorized into seven distinct classes, following a bitwise diagnosis described by Hogan and Connor (2004). The final target classification provides 11 different pixel categories: clear sky, cloud droplets only, drizzle or rain, drizzle or rain and cloud droplets, ice, ice and supercooled droplets, melting ice, melting ice and cloud droplets, aerosols, insects, and aerosols and insects.
The determination of the cloud thermodynamic phase is implemented following the methodology introduced in Griesche et al. (2020), which is based on the lidar and radar measurements (see Table 1). The liquid phase is assigned to pixels when the lidar backscatter signal exceeds a threshold of 2× −5 Mm −1 sr −1 , and is reduced by at least a factor of 10 within 250 m due to the strong attenuation by liquid clouds. When the Doppler radar signal indicates falling particles at a dew point temperature below 0 • C, the pixel is considered as ice. If both criteria are fulfilled simultaneously, the pixel is classified as a mixed-phase pixel.
Once a cloudy pixel is identified, the cloud water content and effective radius are determined, regardless of whether the cloud is detected as a single or mixed-phase cloud. The Q L is retrieved based on the HATPRO MWR measurements using the retrieval method developed in Löhnert and Crewell (2003). This method relies on a long-term radiosonde training data set, which in this case is based on Ny-Ålesund, NO (78.9 • N, 11.85 • E, WMO Code 6260). Once Q L is known, q L and r E,L are determined. The retrieval of q L is obtained by distributing the observed Q L from the HATPRO MWR adiabatically among the identified liquid and mixed-phase cloud pixels classified by the Cloudnet algorithm. This method assumes a log-normal cloud-droplet size distribution, which is constant with height. The uncertainties of q L are calculated by error propagation assuming a typical uncertainty of 20-25 g m −2 in Q L (Löhnert and Crewell, 2003). The retrieval of r E,L considers the radar reflectivity factor and the profile of Q L based on the methodology described in Frisch et al. (2002). The reflectivity used to derive the r E,L was likely not affected by the presence of ice crystals, since most of the mixed-phase cloud cases observed during PS106 had cloud top temperatures larger than −10 • C and a radar reflectivity around −20 dBZ (not shown). The study of Bühl et al. (2016) showed that at these temperatures and radar reflectivities, the signal of the mixed-phase cloud is dominated by the presence of liquid droplets rather than the ice crystals. In cases where the radar reflectivity is lower (e.g. −40 dBZ) the ice crystals dominate the signal of the mixed-phase cloud more, and other methods are necessary to derive the r E,L (Kalesse et al., 2019;Radenz et al., 2019).    Lidar and radar (Illingworth et al., 2007;Hogan et al., 2009) Base (hPa) or (m) CERES  Lidar and radar (Illingworth et al., 2007;Griesche et al., 2020) Top (hPa) or (m) MODIS and CERES  Lidar and radar (Illingworth et al., 2007) Q L and r E,L MODIS and CERES ) MWR and radar (Frisch et al., 1998 MODIS and CERES  MWR and radar (Hogan et al., 2006) r E,I (µm) MODIS  Cloud radar Delanoë et al., 2007) The q I is obtained based on the measurements from the cloud radar for pixels flagged as ice or mixed-phase cloud (Hogan et al., 2006). The ice water path (Q I ) is calculated by vertically integrating q I . These parameters depend on temperature (T ; • C) and cloud radar reflectivity (Z e ; dBZ). Based on the values of q L and q I , vertical profiles of cloud fraction (CF) are calculated. The r E,I is derived based on empirical relationships between the visible extinction coefficient, cloud radar reflectivity and model temperature, as it is further described in Griesche et al. (2020). Additionally, Cloudnet provides CF, averaged to hourly values for 67 equidistant height layers, with a thickness of 300 m, ranging from 150 m up to 19.95 km (see Fig. 4a).
The Cloudnet products also contain quality flags that indicate when the pixels have a reliable retrieval, contain a mixture of liquid droplets and ice crystals, and when large ice crystals, drizzle, or rain might bias the radar reflectivity. Precipitation (PPT) conditions compromise the retrieval accuracy of Q L and Q I from the MWR and cloud radar, respectively.
Three independent types of flagging categories are used to complement the analysis. These flags are based on Cloud-net target classification, and the identification of low-level stratus (LLS) clouds from Griesche et al. (2020). Quality flags determine the atmospheric conditions directly affecting the lidar beam or cloud radar signal. This characterization identifies "optimum conditions (OC)", moments with "LLS clouds", "PPT", and "PPT and LLS". A second classification describes structural flags by identifying "clear-sky", "singlelayer", and "multilayer" clouds. Moreover, the last flag classification focuses on the cloud phase. We identify "clear-sky" moments, clouds with "PPT", "ice", "liquid", "mixed-phase clouds type 1" in which ice and liquid droplets are separately identified, and "mixed-phase clouds type 2" in which Cloudnet distinguishes a mixed layer of ice and supercooled droplets in the same layer of cloud. Complex cloud systems like multilayer mixed-phase clouds can also be identified by the simultaneous use of structural and phase flags; however, they are outside the analysis of the time series classification (Fig. 5).
The three classification types only focus on clear or cloudy pixels. Thus, as a first step, any Cloudnet pixel of "aerosols", "insects", and "aerosols and insects" is removed by changing its assigned value to zero to discard them from the analysis.
Once this step is done, a set of iterative conditional assignments is applied independently to each vertical column. For instance, if no cloudy pixel is present in the atmospheric column, that column is flagged as "clear-sky". If cloudy pixels are found in a single column, and no clear-sky pixels are identified in between, then the algorithm classifies the column as "single-layer". If one or more clear pixels are identified between cloudy pixels, the flag of multiple levels is assigned. A similar method is followed, but with different conditions for quality and phase flags. Lastly, the three types of flags are linearly interpolated to match the temporal resolution of the simulations, and stored separately in the output files of the radiative transfer simulations. Section 3.1 provides a broader description of the atmospheric conditions and analysis of the flagging system during PS106.

Satellite data set
The CERES SYN1deg product provides global SW and LW radiative fluxes at the TOA, at four pressure levels, and at the SFC interpolated to an hourly resolution. While TOA fluxes are directly based on observations by the CERES SYN1deg instruments, in-atmosphere and SFC fluxes are calculated based on the Fu-Liou radiative transfer model (Fu and Liou, 1992), and are adjusted for consistency with the TOA observations (Gupta et al., 2010;Rose et al., 2013;Rutan et al., 2015;Kato et al., 2018;Minnis et al., 2020). The CERES SYN1deg uses cloud properties from geostationary and polar satellites observations, in particular from the Moderate Resolution Imaging Spectroradiometer (MODIS). As ancillary input, CERES SYN1deg relies on reanalysis data from the Global Modelling Assimilation Office's (GMAO) Global Earth Observing System version 5.4 (GEOS-5). In addition to radiative fluxes, various surface, cloud and aerosol parameters are included in the product to enable an exploration of the relationships among clouds, aerosols, and radiation . Specifically, CERES SYN1deg provides several relevant cloud properties. The parameters considered in this study are CF, Q L , Q I , r E,L , r E,I , cloud base (P B ) and top pressure (P T ). The cloud properties are reported for four atmospheric pressures intervals. Nevertheless, it has been decided to consider the total atmospheric values for the analysis. Note that the cloud properties mentioned are retrieved based on MODIS retrievals of cloud emissivity, cloud effective temperature, cloud particle effective radius, and cloud optical thickness. A description of the retrievals is presented in Minnis et al. (2020). A summary of the cloud parameters used in this study is presented in Table 1. The CERES SYN1deg flux products considered in this study provide fluxes based on an AS, cloudy without aerosols (NA), CS, and virtually pristine (P , neither clouds nor aerosols) scenario.
Amongst the surface parameters included in CERES SYN1deg, α, skin temperature (T s ), and snow/ice coverage are relevant for our analysis. In contrast to previous versions, α is determined considering the 1.24 µm channel instead of the 2.13 µm channel over snow surfaces. This change has the advantage of increasing the range of retrievable cloud optical depths over snow-cover areas . However, this modification also increases the uncertainty of α due to higher variability in snow albedo and bidirectional reflectance . All CERES SYN1deg parameters are provided on a spatial grid with a resolution of 1 • latitude by 1 • longitude and at a temporal resolution of 1 h.

Radiative transfer simulations
The TROPOS Cloud and Aerosol Radiative effect Simulator (henceforward T-CARS) is a Python-based framework to carry out radiative transfer simulations with a particular focus on the investigation of the radiative effects of aerosols and clouds. Parts of this framework have already been applied and described in Barlakas et al. (2020) and Witthuhn et al. (2021). The T-CARS enables the use of various sources for input data, such as atmospheric profiles of trace gases, temperature, humidity, properties of clouds, aerosols, and surface parameters. The present study employs the widely used rapid radiative transfer model (RRTM) for GCM applications (RRTMG; Mlawer et al., 1997;Barker et al., 2003;Clough et al., 2005).
In this study, the daily T-CARS output files have a standard grid that consists of 197 atmospheric levels ranging from the surface up to 20 km height and 1 min temporal resolution. The first 10 km of the atmosphere is divided into 160 levels with a geometric layer thickness of 62.5 m. The level thickness of each pixel for the first 10 km of the atmosphere corresponds to two vertical levels of Cloudnet's pixel which are averaged to the standard grid. The following 5 km of the atmosphere have a layer thickness of 250 m, while the last 5 km of the atmosphere have a layer thickness of 193.8 m.
Hourly pressure level profiles of temperature, pressure, ozone mass mixing ratio, and specific humidity from the European Centre for Medium-Range Weather Forecasts (ECMWF) Re-Analysis (ERA5) data set are used as input parameters for the simulations. The ERA5 uses 4D-Var assimilation using polar and geostationary satellites, surface, radiosonde, dropsonde, and aircraft measurements (Hersbach et al., 2020). In the case of PS106, the Vaisala Radiosonde RS92-SGP launched from Polarstern every 6 h were also assimilated.
The atmospheric temperature and pressure measured at 10 m a.s.l. aboard Polarstern are used in T-CARS as T s and surface pressure, respectively. We opted for this setup since these parameters are the closest to the surface and are the only measurements available for the entire cruise. For comparison purposes, the surface temperature from ERA5 and CERES SYN1deg are also considered.
The α for the radiative transfer simulations is based on CERES SYN1deg. The data set is interpolated in space and time to the position of Polarstern at 1 min resolution for the entire PS106 cruise.
All input data sets are linearly interpolated to the standard grid. For trace gases, the climatological values from the Air Force Geophysics Laboratory (AFGL) sub-Arctic summer atmosphere (Anderson et al., 1986) are used. The pressure level ERA5 data set is used in the model by interpolating atmospheric pressure, temperature, specific humidity, and ozone mass mixing ratio. Considering the Cloudnet cloud properties, r E,L and r E,I are linearly interpolated onto the standard grid. The values of q L and q I are converted to values of Q L and Q I by multiplying with the layer thickness.
For the LW and SW simulations, the RRTMG parameterizations for ice and liquid cloud optical properties are based on the radiative transfer model, Streamer, and Key (1996) and Hu and Stamnes (1993), respectively, have been selected. The parameterization for ice clouds assumes spherical ice crystals with r E,I values with an allowed range between 5.0 and 131.0 µm. Radiative fluxes are known to be sensitive to assumptions about the crystal habit, e.g. hexagonal shape (Wendisch et al., 2005). However, the decision was made based on the availability of parameterizations in RRTMG and to be consistent with the Cloudnet parameterization of ice crystals. In the case of r E,L , the model only allows values within the range from 2.5 to 60 µm. To maximize data coverage, the Cloudnet r E,L values below 2.5 µm have been clipped to this range. Note that this modification increases the original values by less than 0.05 % of the number of observations. Therefore, this choice modification does not significantly change the distribution or mean value of r E,L .
The surface input parameters, such as the ship measurement of pressure, temperature, and the albedo from CERES SYN1deg, are also interpolated to 1 min resolution. The surface emissivity is set to a constant value based on the fraction of sea ice in the vicinity of Polarstern, using CERES SYN1deg as source. When the sea-ice fraction exceeds 50 %, a constant surface emissivity of 0.9999 is used, while a value of 0.9907 is used below this threshold. These constant values are based on Wilber et al. (1999).
The T-CARS output provides vertical profiles of broadband upward and downward LW and SW fluxes, and heating rates for cloudy and CS conditions along the PS106 cruise track. Additionally, the geographic coordinates, the quality flags mentioned in Sect. 2.1.2, profiles of temperature and pressure levels, as well as the cloud top and cloud base height obtained from the Cloudnet data set, and the cloud boundaries from the analysis of LLS described in Griesche et al. (2020) are included as output variables.
The present study defines CRE as the difference between the AS and CS net fluxes in W m −2 , following, e.g. Mace et al. (2006). The net CRE is obtained as the sum of the CRE LW and CRE SW components, which are calculated using Eq. (1). In this equation, "x" stands for either LW or SW, and is computed both at the SFC and TOA. Given the net CRE SFC and CRE TOA , the net CRE throughout the atmo-sphere (CRE ATM ) is obtained by subtracting of the values at the TOA and SFC. (1)

Results and discussion
The main results of the present investigation are described and discussed in this section. First, the atmospheric and surface conditions during PS106 are described in Sect. 3.1. In Sect. 3.2, a sensitivity study of the radiative fluxes in CS conditions is given in order to quantify the expected uncertainty of the radiative transfer simulations, and to quantify the effect of atmospheric and surface variability on fluxes. Section 3.3 presents three case studies, comparing our own radiative transfer simulations based on T-CARS, the CERES SYN1deg-based flux products, and ship-based flux observations. An assessment of RC for the CERES SYN1deg fluxes considering the entire PS106 cruise is given in Sect. 3.4. The radiation budget and its modulation by clouds as quantified by CRE for the PS106 expedition is investigated in Sect. 3.5. Results obtained along the ship track and for the whole Arctic are compared in order to assess the representativeness of the observations conducted during the PS106 expedition.

Atmospheric and surface conditions
A general description of the meteorological and synoptic conditions during the PASCAL expedition (leg 1 of PS106) has already been given by Knudsen et al. (2018). Here, a complementary and more specific description is provided, focusing in particular on aspects influencing radiative fluxes, in situ observations, and satellite and ancillary data sets required as input for radiative transfer simulations. Time series of α, near-surface and T s along the PS106 cruise track are presented in Fig. 2. The track covered open ocean, the marginal ice zone, and ocean covered by dense sea ice. These regions can be differentiated by their α, which has been obtained here from the CERES SYN1deg data set (see Fig. 2a). The retrieval of α used by CERES SYN1deg is described in Minnis et al. (2020) and references therein. Based on the sea-ice concentration, the ERA5 broadband α is calculated as a linear combination of the sea ice and open water contributions. For the considered time period, the seaice concentration in ERA5 is based on the corresponding operational product provided by EUMETSAT's Ocean and Sea Ice Satellite Application Facility (OSI SAF; Eastwood et al., 2014) and the Operational Sea surface Temperature and Ice Analysis (OSTIA) data set (Hirahara et al., 2016). For PS106, there is, in general, a relatively good agreement between both α data sets along the cruise track, with a standard deviation of about 0.1. The largest differences occur during the second leg for the period from 25 June to 8 July 2017, when the CERES SYN1deg albedo is systematically higher than the ERA5-based values. Some of the ob-served differences might be attributed to the different spatial resolutions of the data sets (1 • for CERES SYN1deg versus 0.25 • for ERA5). Another potential cause for discrepancies is the omission of the effect of melt ponds in ERA5, which results in a systematic underestimation of the broadband α by ERA5 from June to mid-August, as discussed in Pohl et al. (2020).
The CERES SYN1deg α is used later as input for the radiative transfer simulations. It is based directly on solar reflectance observations, and due to the maturity of CERES SYN1deg products, it is expected to yield accurate results (Rutan et al., 2015). Its use also ensures consistency with the CERES SYN1deg flux products and allows us to better focus on the influence of other parameters. Nevertheless, the large influence of α on the solar radiation budget is investigated and discussed in more detail (see Sect. 3.5).
The T s and the near-surface air temperature are warmest at the beginning of the expedition when Polarstern was located over open ocean. For the rest of the expedition, the temperatures maintained a relatively steady value near 270 K, except for 8 June and 3 July 2017, when the temperatures dropped to around 266 K. In general, all temperatures presented in Fig. 2b are in good agreement, except for 29, 9-10, 22-23 June, 3-4, and 15 July 2017. Most of these differences might be due to local variability, as there is good agreement of onboard measurements. The largest difference in T s between CERES SYN1deg and ERA5 is found for 23 June 2017, which was the start of the second leg where Polarstern was located in Svalbard (see Fig. 1). This difference might be due to the challenges posed by a realistic representation of the marginal sea-ice zone.
The anomalies of the vertical profiles of atmospheric temperature and specific humidity based on ERA5 are shown in Fig. 3 for the PS106 track, together with the mean profiles and the sub-Arctic summer standard atmosphere (Anderson et al., 1986). The anomalies have been calculated with respect to the mean profiles of the cruise. Figure 3a and b, indicate a strong temperature and humidity inversion layers from 1-9 June 2017, followed by a generally warmer atmosphere from 10-14 June 2017. Moreover, relatively warm and humid conditions are observed at the end of leg 2 (11-16 July 2017), caused by water vapour transport as described in Knudsen et al. (2018) and Viceto et al. (2022). Based on the shipborne observations, the near-surface temperature varied more strongly during leg 1 than during leg 2 of PS106 (see Fig. 2b). Most of the humidity intrusions observed in Fig. 3b have southerly and westerly origins of the advected air masses, based on the wind direction obtained from the radiosondes (not shown). The mean vertical profiles of atmospheric temperature and specific humidity indicate good agreement of radiosondes and ERA5, which are colder and dryer than the climatological values of the sub-Arctic summer standard atmosphere.
A characterization of cloud conditions during the PS106 expedition is given next, considering CF, vertical layer struc-ture, and thermodynamic phase. The Cloudnet vertical profiles of CF are shown in Fig. 4a, while a comparison of daily mean CF is presented in Fig. 4b. For the latter panel, Cloudnet and LLS clouds are combined in the comparison to CF values of CERES SYN1deg. All CF values have been aggregated from hourly to daily means. To ensure consistent temporal sampling, hourly values with data gaps in the shipborne observations have been excluded from CERES SYN1deg. It is worth mentioning that the combination of LLS and Cloudnet CF improved the analysis by reducing the fraction of data gaps from 25.2 % to 6.6 % for the entire PS106 period.
The comparison aims to determine the consistency of the CERES SYN1deg and Cloudnet CF, despite their different instrumental origin, perspectives, spatial and temporal sampling, and retrieval algorithms. It is worth noting that CERES SYN1deg provides a spatially averaged CF for a 1 • latitude by 1 • longitude region, while Cloudnet yields vertically resolved information on cloud cover as time series for the location of Polarstern. Mean values of CF are 86.7 % and 76.1 % for CERES SYN1deg and Cloudnet plus LLS, respectively. The CERES SYN1deg CF, without exclusion of Cloudnet data gaps, is higher at 86.9 %, indicating that data gaps occur more frequently in cloudy conditions.
A description of the type of clouds observed during PS106 is shown in Fig. 5. The used classification is based on the Cloudnet target classification and supplemented by the analysis of LLS clouds. While the cloud type classification was already presented in Fig. 18c of Griesche et al. (2020), data quality and a more detailed description of mixed-phase clouds are considered here. Details of the classification methodology for the three flags are explained in Sect. 2.1.2.
During PS106, approximately 45.4 % and 35.6 % of the time, single-and multilayer clouds were observed by Cloudnet, respectively (Fig. 5a). The remaining 12.4 % of the time, CS conditions were detected, and data gaps occurred for 6.6 % of the time. While single-layer clouds were more frequent during the first leg, the frequency of multilayer clouds was higher during the second leg (see Figs. 5a and 18c in Griesche et al., 2020). The frequencies of single-and multilayer clouds are typical for early summer conditions, as previously reported by other studies (e.g. Shupe et al., 2011;. The cloud phase flag was included to analyse the thermodynamic phase of clouds. Even though it is available for the entire PS106 time series, the focus here is directed to the thermodynamic phase of single-layer clouds since these cases are the most frequent, and an analysis is less complex than for multilayer conditions. Figure 5b shows only single-layer cloud periods. This figure indicates an occurrence frequency of 36.7 % for single-layer mixed-phase clouds of type 2 (ice and supercooled droplets), 19.4 % for mixed-phase clouds of type 1 (well-separated ice and liquid phase), and 21.5 % for single-layer ice clouds. The remaining period is composed of single-layer clouds with PPT (17.9 %), and single-layer liquid clouds (4.5 %). We emphasize the need to consider  PPT periods since, during these events, there are larger uncertainties in observations (i.e. cloud radar, lidar, MWR, and radiometers).
The quality status flag shows that only 40.1 % of the time, optimum observation conditions were identified (see Fig. 5c). For about 39.0 % of the time, LLS with a cloud base below 155 m prevailed, implying that during these periods, the cloud base height lay below the altitude detection limit of 155 m of the cloud radar (see Fig. 18a in Griesche et al., 2020). The PPT alone, and moments with LLS and PPT occurred for about 5.1 % and 9.2 % of the time, respectively. Under LLS and PPT conditions, the observations of cloud macro-and microphysical parameters have a reduced accuracy. Thus, for only about 40.1 % of the time, observations are of sufficient quality, e.g. for radiative transfer simulations, while observations with a degraded quality occurred for 59.9 % of the period.

Sensitivity analysis of clear-sky (CS) radiation
This section presents a sensitivity analysis of the radiative fluxes at the surface for CS conditions. Its goal is to quantify the response of radiative fluxes to variability and uncertainty of various atmospheric and surface parameters during PS106, which are required as input to radiative transfer simulations. The accuracy of CS radiative fluxes is of particular interest, as it serves as a reference for the calculation of CRE, and thus places a limit on its accuracy. The propagation of the input uncertainties to radiative fluxes is thus used to establish uncertainty limits for the subsequent RC study.
A CS atmosphere has been created as a reference case, based on the conditions at a position of 81.9 • N and 32.51 • E on 3 July 2017. This day was selected because it is the day with the longest CS period during PS106 (see Figs. 1,4 and 5). For the sensitivity analysis, all atmospheric and surface parameters required as input were prescribed by the actual conditions, with the exception of α, which was kept constant and set to the daily mean value of 0.65. The intention is to avoid resulting variations in fluxes due to fluctuations of the α, which ranged from 0.58 to 0.78 on that day (see Fig. 2a). For the day, the CERES SYN1deg ice coverage lay above 0.5. Thus, the surface emissivity was set to a constant value of 0.9999. The solar zenith angle (SZA) ranged from 59 to 75 • , which does not cover the full range of SZA encountered during the PS106 expedition (47.6 to 80.1 • for the period from 31 May to 16 July 2017). Despite such minor discrepancies, we assume here that the conditions of this reference day are representative for the entire PS106 cruise for the purpose of this sensitivity analysis.
The response of radiative fluxes to perturbations of atmospheric parameters, including temperature, ozone, and humidity, has been quantified. In addition, the effects of vari- ations of T s , α, and surface emissivity are also considered. This analysis focuses on both the downward and upward fluxes at the SFC, as well as the upward fluxes at the TOA for both the LW and SW broadband radiation. A summary of the main sensitivities is presented in Fig. 6, while the full results are listed in the Appendix as Tables A1 to A4. Note that the values used later as uncertainty limits for the RC analysis are highlighted in bold.
The perturbation of atmospheric temperature is based on the instrumental uncertainty of ±0.5 K of the radiosonde temperature sensor, as well as the observed range of atmospheric temperature anomalies of about ±7 K during PS106 (see Fig. 3c). The variation of ±0.5 K has the strongest effect on the LWD flux at the SFC, resulting in a variation of ±1 W m −2 . Note that 3 July 2017 is a day with slightly colder than average temperatures for the first 8 km of the atmosphere, and warmer than average temperatures at about 10 km (see Fig. 3a). Considering the larger temperature perturbation of ±7 K results in a variation of up to ±14.9 W m −2 (see Table A1). At the TOA, the variations of ±0.5 and ±7 K yield daily mean differences of ±1.1 and ±15.4 W m −2 , respectively.
Ozone reduces the SW flux in the atmosphere by absorption at ultraviolet (λ 0.35 µm) and visible wavelengths (0.5 µm λ 0.7 µm). To quantify the sensitivity to ozone, the findings of Bahramvash Shams et al. (2019) are used as basis, who investigated the variation of the vertical profiles of ozone at four Arctic sites from 2005-2017. Perturbation of the ozone column amount by ±12.5 % and ±25 % are assumed. The former value is used to approximate the monthly variation of ozone during summer months (see Fig. 5 in Bahramvash Shams et al., 2019), which is similar to the uncertainty of ±10 % ozonesondes (Deshler et al., 2017). The smaller variation leads to a decrease of ±0.7 and ±0.3 W m −2 at the SFC and TOA for the SW flux, respectively, and ±0.3 W m −2 at both the SFC and TOA for the LW. Variations of ozone concentration are particularly important in the stratosphere, where reduced ozone causes colder temperatures, which enhances the decrease of ozone (Randel and Wu, 1999). It is also worth noting that ozone concentration is linked to synoptic mechanisms through interactions with atmospheric dynamics and photochemistry (Anstey and Shepherd, 2014).
Water vapour is the dominant absorber throughout most of the LW (Delamere et al., 2010) and has strong absorption bands in the SW (λ > 900 nm). The strength of SW absorption also depends on the SZA (Wyser et al., 2008). Our sensitivity analysis considers a variation of 5 %, which is used to represent the instrumental uncertainty of radiosondes, and 15 %, which approximates the range between the minimum and maximum column amounts. The variation of 5 % leads to differences of ±1 and ±0.9 W m −2 for the SWD and LWD, respectively, at the SFC. At the TOA, this perturbation yields differences of ±0.5 and ±0.7 W m −2 for the upward LW (LWU) and SW (SWU), respectively (see Table A1). The column amount of water vapour is largest in July, and a positive trend in water vapour since the 1990s has been reported, especially during summer (Di Biagio et al., 2012;Rinke et al., 2019). In addition, recurrent episodes of humidity intrusions can increase the water vapour column by about 8 times above the background level, and increase the LWD flux at the SFC by up to 16 W m −2 (Doyle et al., 2011).
During PS106, the temperature measurement aboard Polarstern closest to the surface was located at 10 m a.s.l. The accuracy of the temperature of ±0.3 • C is used for the sensitivity study, together with a perturbation of ±5 • C, which corresponds to the largest difference between ship measurements and T s obtained from ERA5 and CERES SYN1deg (see Fig. 2b). The variation of this parameter only influences the LW fluxes, causing changes of ±0.3 and ±1.4 W m −2 for the LWD and LWU, respectively, at the SFC. At the TOA, a difference of ±0.5 W m −2 is noted in the LWU. Naturally, more considerable variation in T s yields to larger flux differences as indicated in Tables A2 and A4, which is particularly relevant for days when these differences are more pronounced (e.g. 6-8 June, 22-23 June, 2-5 July 2017; see Fig. 2b).
The α is an extremely important parameter for the SW radiative fluxes Sedlar et al., 2011;Ebell et al., 2020;Stapf et al., 2020). The retrieval of this parameter from satellites in the Arctic is particularly challenging due to the difficulty of cloud detection over snow-and ice-covered surfaces and rapid temporal changes induced by melting. In contrast, ground-based observations often have limited spatial representativeness. Figure 2a shows the time series of hourly α from CERES SYN1deg. From 27 June to 8 July 2017, the difference is most noticeable. For the PS106 cruise, the mean difference of α between CERES SYN1deg and ERA5 is 0.08. For the sensitivity study, the daily mean value of 0.65 is used for α. Variations by ±0.08 are used to investigate the sensitivity of radiative fluxes. Additionally, the minimum (0.05), mean (0.53), and maximum (0.84) values of the CERES SYN1deg α for the entire cruise are used to quantify the sensitivity of SW fluxes to α during the PS106 cruise.
A variation of the α by ±0.08 yields a mean flux difference of ±2.1 and ±33.6 W m −2 at the SFC for SWD and SWU, respectively. This difference also depends strongly on the SZA. For instance, a SZA of 59 • leads to a flux difference of SWU at the SFC of ±46.6 W m −2 , whereas at a SZA of 75 • , this difference is ±20.8 W m −2 (not shown). The values presented in Tables A2 and A4 for the α also indicate the large contrast between the additional values studied (0.84, 0.53, 0.3, and 0.05) minus the constant α of 0.65. At the SFC, open ocean (e.g. α equal to 0.05) causes a large reduction of SWD and SWU by 13.9 and 241.1 W m −2 , respectively. On the other hand, the highest α observed during PS106 (0.84) results in an increase of SWD and SWU of 5.1 and 8.4 W m −2 , respectively. At the TOA, the daily mean values indicate a reduction by 217.6 and an increase by 72 W m −2 for α of 0.05 and 0.84, respectively. To the best of our knowledge, the importance of α under CS Arctic conditions has also been analysed in other studies (Wyser et al., 2008;Di Biagio et al., 2012;Sedlar and Devasthale, 2012). The values mentioned are consistent with the radiative kernel calculations presented by Bright and O'Halloran (2019) (see their Fig. 1 for the months of June-July-August). Due to the values mentioned, the SWU is the most relevant parameter, sensitive to large differences when estimating the radiation budget and CRE.
The last parameter considered in the analysis is the surface emissivity. A value of 0.9999 is used for surfaces covered by ice, while a value of 0.9907 is used for water surfaces based on Wilber et al. (1999). Additionally, the constant value of 0.988 used by Riihelä et al. (2017) is considered, who used this value for a wider Arctic area, based on Hori et al. (2006). The variation of this parameter only affects the LWU. Contrasting the surface emissivity over sea ice and open ocean, a mean flux difference of 0.8 and 0.7 W m −2 at the SFC and TOA, respectively, are found. The comparison with the value used in Riihelä et al. (2017) yields a smaller difference of 0.2 W m −2 at the SFC and 0.1 W m −2 at TOA.
A summary of the results of the sensitivity study is shown in Fig. 6. Considering the propagation of the uncertainty of input parameters to radiative fluxes, a total uncertainty of the LWD and SWD fluxes at the SFC of ±2.6 and ±3.7 W m −2 , respectively, is inferred. The largest part of this uncertainty is introduced by the amount of water vapour and the atmospheric temperature for LWD, and by the α for SWD. Combining these values with the instrumental uncertainties presented in Sect. 2.1.1, total uncertainty limits of ±16 and ±24 W m −2 for LWD and SWD, respectively, at the SFC are inferred as basis of the subsequent RC assessment for CS conditions.
It is worth clarifying that additional uncertainties come from neglecting the presence of aerosols, the assumption of near-surface temperature as T s , the extrapolation of hourly data to 1 min resolution, the assumption that the coarse spatial grid of 0.25 • latitude by 0.25 • longitude (i.e. ERA5 data set) or 1 • latitude by 1 • longitude (i.e. CERES SYN1deg products) can capture the atmospheric and surface conditions, and variability experienced during PS106. While it was attempted to quantify some of these uncertainties (e.g. the omission of aerosols in Fig. B1 and Sect. 3.4.1), a careful and more specific analysis will be made by carrying out several sensitivity analyses to quantify these uncertainties considering the observations from the Multidisciplinary drifting Observatory for the Study of Arctic Climate (MOSAiC) expedition (Shupe et al., 2022). Our primary focus will be on the spatiotemporal differences among shipborne, reanalysis, and satellite observations, which we believe are the largest source of uncertainties.

Case studies
The following subsection presents three case studies, which have been selected to cover typical sky conditions during PS106. Radiative fluxes and CRE at the SFC obtained from the T-CARS simulations, CERES SYN1deg products, and the shipborne observations are compared to illustrate periods with a good agreement and larger discrepancies. A day with a long CS period, single-layer, multilayer, and mixed-phase clouds are presented. For the cloud cases, the comparison of fluxes is accompanied by a comparison of cloud properties to investigate their role as a potential source for observed differences in fluxes.
For these cases, it is assessed whether RC can be reached between the shipborne observations on the one hand and the T-CARS simulations and CERES SYN1deg products on the other hand. For CS conditions, the uncertainty limits have been established in Sect. 3.2. In contrast to the RC assessment at the SFC, the lack of independent observations and inputs only allow an assessment of the consistency of fluxes simulated by the T-CARS setup and CERES SYN1deg at the TOA.
A common figure layout is used to give an overview of meteorological conditions and radiative fluxes to present the case days in Figs. 7, 8 and 10. Their first panels show the Cloudnet target classification overlaid with the cloud top and base heights obtained from CERES SYN1deg as time series. Panels (b) and (c) show the time series of downward (upward) fluxes at the SFC (TOA) for the SW and LW fluxes averaged to 10 min temporal resolution. Background shading is used for periods when environmental conditions or instrumental limitations compromised the observations. The pink background indicates periods when the shipborne flux observations were obstructed by Polarstern's superstructure, mainly affecting the SW flux. The pale yellow background is used for periods when the Cloudnet retrievals were unable to correct for attenuation, and light blue indicates the periods of CS conditions that are agreeable between CERES SYN1deg and Cloudnet. Panels (d) and (e) show the distribution of the flux difference between shipborne observations and simulations for SWD and LWD, respectively. Panel (f) shows the comparison between CERES SYN1deg products and T-CARS simulations for the SWU flux, panel (g) shows the same as (f), but for the LWU flux.
The comparison of the microphysical properties of clouds such as water path (Q) and effective radius (r E ) between Cloudnet and CERES SYN1deg are shown in panels (a) and (b), respectively, of Figs. 9 and 11. Additionally, the time series of CRE at the SFC and TOA based on T-CARS and CERES SYN1deg are presented in panel (c) of the figures mentioned.

CS case: 3 July 2017
As mentioned before, 3 July was the day with the longest cloud-free period during PS106 (see Figs. 4,5,and C1). An overview of meteorological conditions and radiative fluxes for this day is given in Fig. 7. Panel (a) shows that ice clouds were observed by Cloudnet from 00:00 to around 02:30 Z on that day. Later periods when CERES SYN1deg reported clouds while Cloudnet did not, were further analysed by means of the AS camera images. They showed a thin cloud at the horizon for the period between 21:30 to 23:59 Z, which is outside the field of view for the zenith-pointing active remote-sensing instruments.
Good agreement is found for the shipborne flux observations, the T-CARS simulations, and the CERES SYN1deg products for SWD at the SFC. The mean difference of the simulations minus the observations lies below 8 W m −2 , a value that is well within the uncertainty limit of ±24 W m −2 used for the RC assessment. For the LWD fluxes, there is a larger difference between the shipborne observations on the one hand, and the T-CARS simulations and CERES SYN1deg products on the other hand. However, the mean difference remains below 14 W m −2 , again smaller than the uncertainty limit of ±16 W m −2 chosen for the RC assessment. Therefore, our results confirm that RC is reached for both the SWD and LWD fluxes for this CS case.
To analyse the consistency of T-CARS simulations and CERES SYN1deg products at the TOA, the SWU and LWU fluxes are considered. Figure 7a shows a similar temporal behaviour of the SWU fluxes for the considered period. The mean difference between T-CARS and CERES SYN1deg is 10.0 W m −2 , with the largest instantaneous differences occurring after 18:40 Z. Since the T-CARS simulations use the same α as CERES SYN1deg for input, the reason for this difference is most likely due to the spatiotemporal interpolation of CERES SYN1deg (Young et al., 1998). For the LWU fluxes, the mean difference is 3.7 W m −2 , possibly due to differences in the CERES SYN1deg T s and the shipborne measurement of near-surface temperature used by T-CARS (see Fig. 2). The mean differences of the radiative fluxes are sufficiently small to confirm the consistency of CERES SYN1deg and T-CARS fluxes, in agreement with similar studies which compared CERES SYN1deg with other radiative transfer simulations (e.g. Dong et al., 2016;Dolinar et al., 2016).

Single-and multilayer ice cloud case: 2 July 2017
Single-and multilayer clouds were present for 45.4 % and 35.6 % of the time during PS106, respectively (see Fig. 5b). 2 July 2017 has been chosen as the case day with a dominant presence of this type of clouds. Based on the Cloudnet target classification, this day was characterized by well-defined single-and multilayer ice clouds (see Figs. 8a and C2). Middle and high-level ice clouds were observed for most of the day, with an average cloud base at or above 2.6 km. An exception is the period from 05:57 to 08:35 Z, when a relatively thin cloud layer consisting of ice and supercooled droplets was identified (see Fig. 8a). For most of the day, the cloud top height from CERES SYN1deg is significantly lower than that from Cloudnet. The cloud base height is relatively close to the base obtained from the Cloudnet target classification. It is well-known that the retrieval of cloud top height from passive satellite instruments is limited by large uncertainties for thin ice clouds and polar regions (e.g. Yost et al., 2020), so these discrepancies are not surprising.
A comparison of CERES SYN1deg and T-CARS fluxes against observations for the SWD and LWD reveals good agreement, with values below the CS uncertainty limits established in Sect. 3.2 (see Table 2). In general, this comparison suggests that RC is achieved for T-CARS and CERES SYN1deg considering the daily mean. At the TOA, however, there is a more significant difference of −15.4 and −15.0 W m −2 for LWU and SWU, respectively. This difference is likely linked to the differences in cloud properties retrieved by Cloudnet and CERES SYN1deg that are displayed in Fig. 9. Discrepancies in other parameters such as the T s , atmospheric profiles, cloud top height, and cloud geometrical thickness might also be relevant. Panels (a) and (b) of Fig. 9 show the time series of Q and r E obtained from Cloudnet and CERES SYN1deg, respectively. The comparison of Q shows the integrated values for the entire atmosphere, whereas panel (b) displays the mean values obtained from CERES SYN1deg and the maximum derived by Cloudnet. Despite the difference in retrieval methods, there is, in general, a good agreement between the values of Q I and r E,I from CERES SYN1deg and Cloudnet (see Fig. 9a and b). Based on CERES SYN1deg, the entire day was characterized by the presence of a mixed-phase cloud. The values shown for Q L are relatively large, especially during the period from 17:00 to 19:00 Z. It is possible that within the CERES SYN1deg footprint, a cloud with such a large Q L might have been present. Nevertheless, it is worth noting that the fluxes shown in Fig. 8b and c are calculated considering the CF, which remained below 15 % from 12:30 Z until the end of the day (not shown). The latter suggests that care must be taken when comparing cloud properties obtained from the shipborne active instruments and from CERES SYN1deg with its coarse spatial footprint.
For this case study, the net CRE SFC is similar between T-CARS and CERES SYN1deg, despite the noted discrepan-cies in cloud properties. The net CRE SFC has mean values of 1.3 and 2.7 W m −2 for CERES SYN1deg and T-CARS, respectively. At the TOA, the difference is more significant due to the mentioned differences in LW. The net CRE TOA derived by T-CARS is 8.9 W m −2 , whereas CERES SYN1deg suggests a cooling of −11.1 W m −2 . Such inconsistencies need to be clarified and further investigated in future studies.

Mixed-phase clouds: 26 June 2017
The last case study chosen is 26 June 2017. This day was selected due to the presence of well-defined cloud layers consisting of ice and liquid droplets, corresponding to a mixedphase cloud of type 1. A further reason is the long period of optimum observation conditions reported by Cloudnet (Fig. 5c). Moreover, this day is also of interest due to an underestimation of a high-level cloud amount by CERES SYN1deg in comparison to Cloudnet, as is corroborated in Fig. 10a. This day is also characterized by changing surface conditions, as the ship crossed through the sea-ice transit zone (see Figs. 1, 2a and C3). This day was characterized by a low-level cloud located within the first 2 km of the atmosphere, and by several periods with a relatively thin ice cloud layer located between 6 and 9 km height (Fig. 10a). According to Cloudnet, the wellseparated liquid-phase layer within the ice clouds is characterized by low-radar reflectivity values, upward-directed Doppler velocity, and high-lidar backscatter. There were two moments around 10:00Z and 23:00Z where, due to uncorrected attenuation, Q L could not be derived by Cloudnet ( Fig. 10a and b). These two moments, marked by the paleyellow shaded areas, are excluded from the T-CARS and CERES SYN1deg histogram analysis for a fair comparison ( Fig. 10d-g). Panels (b) and (c) of Fig. 10 indicate a good agreement between observations and CERES SYN1deg fluxes. The daily mean difference of fluxes is 16.1 and −0.6 W m −2 for SWD and LWD, respectively. For the T-CARS simulations, the mean flux difference is significantly larger, with 77.1 W m −2 for SWD and −12.0 W m −2 for LWD (see Table 2). Based on these results, RC can be confirmed for the LWD and SWD fluxes from CERES SYN1deg products and the LWD T-CARS simulations. The difference in the T-CARS SWD fluxes exceeds the expected uncertainty limits. To investigate the reasons for this, the cloud properties from both data sets are compared. Panels (a) and (b) of Fig. 11 show the time series of cloud properties from the Cloudnet and CERES SYN1deg data sets. In contrast to Cloudnet, CERES SYN1deg reports only three periods with the presence of a mixed-phase cloud. The largest difference in cloud properties occurs in the cloud water path products. For Cloudnet, the mean Q L is 56.2 g m −2 , and for Q I , the mean is 1.9 g m −2 . In the case of CERES SYN1deg, these values are 119.7 and 38.1 g m −2 for Q L and Q I , respectively. The variable and lower values of Q are   likely responsible for the rapid changes and the positive (negative) bias of the SWD (LWD) flux visible in Fig. 10a and b. The radiative effect of clouds on this day has a strong cooling influence both at the SFC and TOA, that is enhanced by α. An abrupt change of the CRE at the SFC and TOA is visible at 05:00 Z in Fig. 11c, due to a simultaneous rapid reduction of α from a value of 0.6 to 0.27 (see also Fig. 2a).
Based on CERES SYN1deg, a daily mean net CRE of −79.5 and −127.9 W m −2 is found at the SFC and TOA.
The T-CARS simulations also indicate radiative cooling at the SFC and TOA, but smaller in magnitude (see Table 2). As the SWD and LWD fluxes at the SFC from CERES SYN1deg are more consistent with observations, the CERES SYN1deg values are considered to be more accurate.

Radiative closure assessment based on PS106 observations
In this subsection, CERES SYN1deg fluxes and CS T-CARS simulations are compared to the shipborne observations of the broadband SWD and LWD fluxes for the PS106 expedition. This comparison enables an assessment of RC for the entire expedition, and to identify conditions with significant discrepancies. In Sect. 3.4.1 and 3.4.2, CS and AS conditions are considered separately.

CS radiative fluxes
For the CS comparison, simulated and observed fluxes have been analysed based on the atmospheric classification described in Sect. 3.1 (see Fig. A1). Furthermore, to improve the data quality, AS camera images were used to screen periods with larger differences. With this supplementary information, periods with broken cloud conditions and periods with external factors, which could potentially compromise the radiation observations, were excluded (e.g. the exhaust plume of Polarstern). The comparison of SWD and LWD fluxes from T-CARS and CERES SYN1deg with shipborne observations for CS conditions is presented in Fig. 12. Figure 12a shows a histogram of the differences of the radiative fluxes from T-CARS minus observations of LWD. This comparison indicates a skewed-left distribution with a negative mean bias of the T-CARS simulations of −24.9 W m −2 . After applying the improved quality screening described above, a mean flux difference of −14.2 W m −2 is found, together with a correlation coefficient (R 2 ) of 0.92, and a more symmetric distribution than without this quality screening. The mean flux difference below the uncertainty limit of ±16 W m −2 , and the good R 2 confirms that RC is achieved for the T-CARS simulations under CS conditions. In the case of CERES SYN1deg, the mean flux difference between simulations minus observations for the LWD flux is −4.6 W m −2 , with a standard deviation of 20.9 W m −2 , and a RMSE of 18.5 W m −2 . While the bias suggests that RC is found for CERES SYN1deg, the rather low R 2 of 0.476 indicates that the values do not reproduce variability as well as the T-CARS simulations, and might be affected by the presence of clouds within the CERES SYN1deg footprint (see Fig. 12c). To test this hypothesis, CS and pristine (P) CERES SYN1deg products were also considered. For both data sets, R 2 reached values above 0.765, which confirms that the AS CERES SYN1deg R 2 is reduced by the presence of clouds. With this change, the bias however increased to −19.8 and −20.2 W m −2 for the CS and P data sets, respectively (see Fig. 12c).
Several simulations were conducted using only the Vaisala RS92-SGP radiosondes, launched every 6 h from Polarstern (Schmithüsen, 2017a, b), and also sensitivity analyses were made by varying the atmospheric temperature and humidity to try to match the observations of the LWD flux (not shown). However, the negative bias found for both T-CARS and CERES SYN1deg fluxes might also be caused by a positive bias of the shipborne pyrgeometer observations, e.g. due to the influence of the exhaust plume of Polarstern, or other factors affecting the pyrgeometer. As there was only one pyrgeometer measurement aboard Polarstern, it is impossible to further investigate this bias. However, for future campaigns, it is recommended here to operate two pyrgeometers, installed in different locations of the research vessel, to exclude such influences.
The comparison for the SWD flux uses a stricter screening of data, which also excludes all periods when the pyranometer's field of view was obstructed by the superstructure of Polarstern. For T-CARS, a positive bias of 44.2 W m −2 and R 2 of 0.85 were found initially, without this screening. With screening, a bias of 9.5 W m −2 and R 2 of 0.95 were obtained. In the case of CERES SYN1deg, the biases for AS, CS, and P conditions were found to have values of −27.1, 3.6, and 12.0 W m −2 , respectively. These values confirm that the larger negative bias for AS conditions is due to the presence of clouds that were captured within the CERES SYN1deg footprint, but did not pass over the shipborne remote-sensing instrumentation. The second peak, centred around −50.0 W m −2 in Fig. 12d, is most likely due to momentary obstructions on the observations that were not captured by the initial screening that contains about 5 data points of CERES SYN1deg simulations. In general, the biases of T-CARS and CERES SYN1deg are both within the uncertainty limit of ±20 W m −2 , indicating that RC is determined for both data sets.
Previous studies reported a similar magnitude of differences between simulated and observed downward fluxes for CS conditions. For instance, the analysis by Ebell et al. (2020) focused on Ny-Ålesund and reported a mean (median) flux difference of −5.0 (−5.5) W m −2 and 12.6 (−2.6) W m −2 for LWD and SWD, accordingly (see their Fig. 3). The studies of Shupe et al. (2015) and Miller et al. (2015) found a median difference of simulations and observations for the LWD (SWD) flux of −6.9 (5.4) and −5.5 (15.6) W m −2 for the Barrow and Summit-Greenland sites, respectively.
The treatment of aerosol in the CERES SYN1deg products is based on the aerosol optical depth (AOD) obtained from the Model of Atmospheric Transport and Chemistry (MATCH; Collins et al., 2001). The comparison between the CERES SYN1deg CS and P fluxes yields a mean difference of 0.4 and −7.5 W m −2 for LWD and SWD, respectively. Considering the entire PS106 cruise, mean flux differences of 0.5 for LWD and −10.8 W m −2 for SWD are found (see Fig. B1a, b, e, and f).
While the LW value indicates that LW aerosol effects are negligible for most purposes, the SW values are relatively large in comparison to the direct aerosol effect of −0.44 to −2.6 W m −2 reported by Rastak et al. (2014).

AS radiative fluxes
In this subsection, the CERES SYN1deg products are compared to the shipborne observations for AS conditions. The T-CARS simulations analysed as a time series are not considered in the discussion due to the instrumental limitations that occurred during PPT, LLS conditions, or uncorrected attenuation instances, making it impossible to conduct a RC assessment including cloudy-sky conditions. The comparison between observations and CERES SYN1deg fluxes is shown in Figs. 13 and 14. Figure 13a shows the time series of the LWD observations and CERES SYN1deg AS simulations, indicating mostly good agreement. Some periods can be identified with a reduced agreement. These cases occur particularly during PPT periods, which might affect the pyrgeometer measurements (e.g. 12, 20, 28 to 29 June, and 11-14 July). Larger discrepancies are also observed in the presence of multilayer clouds (e.g. 20 June and 5-9 July), which feature a more challenging structure and pose challenges for passive remote sensing Yost et al., 2020).
The discrepancy shown for 13 June 2017 might also be attributed to PPT as it was also the case on 12 June. However, this cannot be confirmed by the observations due to missing data. This day was also brought to attention by Barrientos Velasco et al. (2020), since the near-surface temperature measured on the ice floe by several instruments reached a mean temperature of 281.1, about 4 K warmer than the temperature measured aboard Polarstern (Barrientos , CERES SYN1deg and ERA5 T s . This day was also characterized by a more humid than usual upper atmosphere, a cyclonic weather system, northerly winds (Barrientos Velasco et al., 2020), strong temperature inversions, and an intense persistent fog leading to a very low-horizontal visibility (see Fig. 18b in Griesche et al., 2020). This humidity in-trusion can lead to an additional energy flux to the surface, enhancing the fog to persist (Tjernström et al., 2019). The fluctuations of atmospheric temperature and relative humidity described might be different from the atmospheric profiles used by CERES SYN1deg, causing the difference of up to 20 W m −2 (Fig. 13a).
The mean flux difference of the LWD fluxes between CERES SYN1deg and observations for AS conditions is −6.0 W m −2 , a value that is within the instrumental uncertainty of ±13 W m −2 of the pyrgeometer (see Fig. 13c). Hence, RC for the LWD flux can be confirmed for CERES SYN1deg. Figure 14a shows a scatter plot comparing CERES SYN1deg SFC fluxes with the shipborne observations. The linear regression is calculated and illustrated in the same plot to determine the correlation of both data sets, which indicates R 2 of 0.69. Similar to LWD, there is relatively good agreement between the CERES SYN1deg simulations and observations for SWD for the entire PS106 (see Figs. 13b and 14b). Most of the discrepancies are caused by PPT (e.g. 13 to 14 July), broken cloud conditions, and instances when Polarstern's superstructures compromised the pyranometer observations (see Fig. A1). Figure 13d shows the distribution of the time series of the SWD flux. This panel shows a similar distribution between the observations and the SWD flux simulated by CERES SYN1deg. This comparison indicates that CERES SYN1deg SWD flux is positively biased by 23.1 W m −2 , with a standard deviation of 59.3 W m −2 . This value is considered acceptable since the instrumental uncertainty is ±20 W m −2 , and the moments with broken cloud conditions or obstruction of the observations are not excluded from the comparison. Nevertheless, it is important to stress that obtaining good quality shipborne observations of downward fluxes is challenging due to recurrent obstructions of the view of the radiometer's sensor.
The study of Riihelä et al. (2017) presents a comparison of radiative fluxes between the CERES SYN1deg ed. 3 products and in situ observations from the drifting Tara ice camp from April-September 2007. Their results are presented as daily means and indicate a RMSE of 24.5 and 17.1 W m −2 for SWD and LWD, respectively. In our study, results are relatively similar for LWD, with a RMSE of 12.3 W m −2 . In the case of SWD, the RMSE found for PS106 has a value of 46.5 W m −2 . It should be noted that our results are based on hourly means instead of the daily mean reported by Riihelä et al. (2017). Furthermore, the observations made during the Tara ice floe camp were unaffected by interference of the ship's superstructure with the observations.
The study by Dong et al. (2016) focuses on the RC of SWD flux for single-layer overcast liquid-phase Arctic stratus clouds over the snow-free and snow-covered surfaces. Their analysis considers CERES SYN1deg eds. 2 and 4 products, which are subsequently compared to radiation observations and cloud retrievals at the ARM NSA site at Utqiaġvik, Alaska. Edition 4 of CERES SYN1deg shows a  mean bias against observations below 10 W m −2 for SWD and for snow-free as well as snow conditions. Evidently, our flux difference of 23.1 W m −2 is larger than the value reported by Dong et al. (2016); however, the present study includes more complex cloud situations, which enlarges the differences of radiative fluxes.

Cloud radiative effect (CRE) and radiation budget during PS106
This section presents an analysis of the radiation budget and CRE for the summertime Arctic and for the period of the PS106 expedition, based on CERES SYN1deg products. While it was initially planned to also include the T-CARS simulations in this analysis, their limited temporal coverage precludes meaningful results. In particular, the exclusion of situations with the prevailing LLS clouds is expected to bias mean fluxes. The CERES SYN1deg data for the period of the first (from 16:45 Z on 28 May until 00:00 Z on 20 June 2017) and second leg of PS106 (22 June to 16 July 2017) are used for consistency with the temporal analyses given in the previous sections. The cruise track during this period lay entirely within the Arctic, defined here to cover the range from 70-90 • N, and consistent with the definition used by several previous studies (e.g. Walsh et al., 2009;Huang et al., 2017). In addition to the effects of clouds on the radiation budget, the relevance of the direct aerosol radiative effect for the radia- tion budget is briefly discussed based on CERES SYN1deg data sets. An overview of the most important components of the radiation budget is given in Fig. 15, showing mean values along the PS106 cruise track and for the entire Arctic as separate panels. A full list of the different flux components can be found in the Appendix in Table B1. Along the PS106 cruise track and under AS conditions, the radiation budget at the SFC indicates a strong warming influence by the SW net flux with 110.9 W m −2 , while the LW fluxes cool the SFC by −22.1 W m −2 . The presence of clouds enhances the LWD flux at the SFC by about 62.1 W m −2 , while reducing the SWD and SWU by 121.0 and 51.5 W m −2 , respectively. For the period of interest, the net radiation budget at the SFC has a value of 88.8 W m −2 for PS106, and a value of 94.4 W m −2 for the entire Arctic. The difference in the net radiation budget for the PS106 track and the entire Arctic is relatively small with a value of 5.6 W m −2 , and can be attributed to differences in the mean incoming SW radiation at the TOA, LWD radiation, the transmission of SW radiation through the atmosphere, and α.
At the TOA and for the PS106 cruise, the mean net radiation budget is 0.2 W m −2 , while for CS conditions, the net radiation budget would be48.6 W m −2 . At the TOA, the presence of clouds increases the reflected solar radiation by 62.9 W m −2 and reduces outgoing LW flux by 14.5 W m −2 . For the Arctic, the net radiation budget under AS conditions has a value of 7.8 W m −2 , versus a value of 47.9 W m −2 for CS conditions. This indicates that the cooling by CRE is smaller for the entire Arctic than for PS106. This difference can mainly be attributed to an increased reflection of solar radiation due to a larger CF and higher α along the PS106 track compared to the Arctic.
To consider the radiation budget of the atmosphere, the flux divergence has been calculated as the difference of the mean values at the TOA and SFC. The net values found for the entire Arctic (−86.6 W m −2 ) and PS106 (−88.6 W m −2 ) show a significant cooling, and are once more relatively similar in magnitude. Since the net radiation budget for PS106 and the Arctic are relatively similar at the SFC, TOA, and within the atmosphere, the PS106 expedition can be considered representative for the entire Arctic (see Table B1).
Additionally, the direct radiative effect of aerosols on the radiative fluxes and the radiation budget has been examined for CS and AS conditions. By considering the different CERES SYN1deg flux products, specifically the AS, cloudy without aerosols (NA), CS, and P fluxes, an estimate of the direct radiative effect of aerosols can be obtained. A summarized version of Table B1 is given in Table B2 in the Appendix, showing the mean perturbation in radiative fluxes arising from aerosols. To be expected, aerosol effects are mainly limited to the SW radiation and are the largest under CS conditions. A decrease of SWD (SWU) of 10.8 W m −2 (6.1 W m −2 ) is found at the SFC for the PS106 track. Slightly larger values are calculated for the entire Arctic, with a reduction of the SWD (SWU) by 13.9 W m −2 (7.4 W m −2 ). Aerosols have a small warming effect in LW, leading to an increase no larger than 0.7 and 0.1 W m −2 for LWD and LWU, for both the PS106 track and the entire Arctic, respectively. At the TOA, the effect of aerosols does not surpass more than 0.2 W m −2 for the SWU and LWU fluxes.
Considering net fluxes for P and CS conditions, the direct aerosol radiative effect at the SFC along the PS106 track and the entire Arctic are −4.2 and −5.9 W m −2 , respectively. At the TOA, the radiative effect of aerosols causes a minor cooling by −0.1 W m −2 for both the PS106 track and the Arctic for CS conditions, which changes to a small warming effect of 2.2 and 1.9 W m −2 for PS106 and the Arctic, respectively, in the presence of clouds. These values are consistent with the study of Markowicz et al. (2021), who determined the aerosol radiative effect based on radiative transfer simulations and the long-term aerosol reanalysis, provided by the Navy Aerosol Analysis and Prediction System (NAAPS) from 2003-2015. Focusing on the Arctic, they report an annual mean net direct (indirect) aerosol radiative effect of −3.01 (−1.88) and −0.73 (0.31) W m −2 , at the SFC and TOA, respectively (see their Table 5). While the values calculated in our study are relatively larger than their annual mean values, our values are consistent with the early summer values reported for the months from May-July (see their Figs. 12a and 13a).
It is worth pointing out that the values of aerosol radiative effects reported here critically depend on an accurate representation of aerosol properties in the radiative transfer calculations underlying the CERES SYN1deg products. As aerosol properties are represented based on the assimilation of MODIS products into the MATCH aerosol transport model, its accuracy in the Arctic determines the accuracy of our findings, and the use of reanalysis properties can have significant biases (e.g. Witthuhn et al., 2021).
Given that the main interest of this study is the radiative effect of clouds in the Arctic, the CRE along the cruise track of the PS106 expedition is shown as a time series with hourly resolution in Fig. 16 for the SW, LW, and net components at the SFC and the TOA. The LW CRE at the SFC and TOA has mean values of 60.6 and 14.4 W m −2 , respectively. These values imply a significant warming of the surface and cooling of the atmosphere caused by clouds. Moments when the LW CRE at the SFC surpassed 80 W m −2 occurred mostly for persistent mixed-phase, single-layer low-level clouds (e.g. 3-6, 16-17, 25 June, and 1 July 2017; see Fig. 5a and c). The persistent cloud structure observed from 5-7 June 2017 is studied in more detail in Egerer et al. (2021), who suggest that humidity inversions supply moisture to the cloud layer, increasing the persistence of the cloud. Time periods with the largest LW CRE and cloud conditions described occurred mostly over sea ice or the marginal zone.
The SW CRE shows a clear dependency on the α and SZA, as shown by the oscillations visible at the SFC and TOA in Fig. 16b. The colour band shown at the bottom of Fig. 16 indicates whether Polarstern was located in the open sea, the sea-ice marginal zone, or within the sea ice. The surface conditions and SZA strongly modulate the magnitude of SW CRE. Thus, the highest values of SW CRE are found for open ocean rather than for sea ice, due to the high α of ice, which increases the amount of reflected SW radiation and reduces SW CRE.
For the cruise track of the PS106 expedition, SW cooling by clouds dominates over LW warming, leading to a net cooling of −8.8 W m −2 at the SFC (see Table 3). For a similar time period, the net CRE at the SFC was also investigated by two other studies, both reporting stronger cooling. For Ny-Ålesund, NO, Ebell et al. (2020) find values within a range of about −20 to −40 W m −2 (see Fig. 6c in Ebell et al., 2020). Considering SW CRE, the study of Stapf et al. (2020) focused on the ACLOUD airborne campaign, calculated a mean SW CRE at the SFC of −32 W m −2 . However, by considering a proposed cloud-free retrieval of α, taking spectral effects into account, this value increases to −62 W m −2 , which is similar to our result of −68.8 W m −2 (see Table 3). It is worth noting that Ebell et al. (2020) explicitly take the effects of white-sky and black-sky albedo, and values obtained by CERES SYN1deg represent a cloud-free albedo  into account. Due to the strong sensitivity of SW CRE to α, a critical assessment of the accuracy of α used by CERES SYN1deg, including spectral and directionality effects, e.g. with in situ observations, is recommended.
A similar analysis has been applied to the entire Arctic to embed the results obtained for PS106 in a wider context. Panels (f) and (i) of Fig. 17 show the mean CRE at the SFC and TOA for the entire time period of PS106, respectively. Additionally, the mean α, the TOA albedo, and the CF are shown in panels (a), (b), and (c), respectively.
For the considered time period and the Arctic region, a larger CF is found over open ocean than over land. This might be attributed to an increase of lower-tropospheric stability over land, which inhibits the mixing between the free troposphere and cloud layer, thus reducing CF (Morrison et al., 2012). The latter effect is most evident over the Barents Sea, central Arctic, Baffin Bay, and Siberian Sea (Fig. 17c). Over sea ice, CF is also large, with a particularly high occurrence of low and mid-low level clouds (not shown).
Based on CERES SYN1deg, low-level clouds are frequently found over the entire Arctic Ocean, with enhanced occurrence frequencies over the Barents Sea, Kara Sea, Laptev Sea, and the central Arctic. Mid-level clouds are mainly present over Greenland, East Arctic-Russia, Ellesmere Island and the central Arctic Ocean. High-level clouds occur more frequently over land, specifically over the Arctic-Russia, near Iqaluit in Canada, north of Sweden, Finland, and Norway (not shown). The spatial distribution of CF as visible in Fig. 17c, is consistent with the results of Palm et al. (2010) who report about an anticorrelation between sea-ice extent and CF based on satellite-based lidar measurements from the CALIOP mission for a 5-year period.
Differences in the spatial distribution of CF yield differences in CRE values both at the SFC and TOA. As indicated by Fig. 17f, the CRE SFC is highly dependent on α, suggesting a warming effect of clouds by up to 20.5 W m −2 over highly reflective regions covered by snow/sea ice (Fig. 17a). At the TOA, the CRE shows a similar dependence on α, but with a stronger cooling effect. The mean CRE for the considered period is −9.3 W m −2 at the SFC, while it is −40.1 W m −2 at the TOA (see Table 3). The TOA albedo depends on CF and α. Low values are only expected for CS conditions over ocean, while high values correspond to either snow-covered or opaque clouds. This effect is, in particular, visible for the Barents Sea area, which corresponds to open ocean with a low α (Fig. 17a). Thus, the relatively high values found in Fig. 17b are caused by the frequent occurrence of clouds, as indicated in Fig. 17c. This region is of particular interest because mean values of CRE at the TOA and SFC are larger than for the rest of the Arctic.
Previous studies have also investigated the CRE SFC in the Arctic; see Table 4 for a list. The SHEBA expedition carried out observations in the Beaufort and Chukchi seas, and its observations were used to estimate the CRE SFC for an entire annual cycle. Based on SHEBA, Intrieri et al. (2002) concluded that clouds warm the surface for most of the year. During early July, however, a cooling effect of the surface by  about −4 W m −2 was found. Based on the Arctic Summer Cloud Ocean Study (ASCOS) shipborne campaign, CRE SFC was also calculated, finding a warming effect, with values ranging from 5 to about 75 W m −2 (see Fig. 8 in Sedlar et al., 2011). Long-term observations can be used to resolve details of the annual cycle. Based on 10 year observations at NSA and the National Oceanic and Atmospheric Administration's (NOAA) Barrow Observatory (BRW) sites, the analysis of Dong et al. (2010) found a CRE SFC ranging from −15 to −35 W m −2 for the time period from end of May to mid-July (see Fig. 3 in Dong et al., 2010). In contrast, the mixed-phase Arctic Clouds Experiment (M-PACE) conducted at the NSA site found a CRE SFC ranging from 40 to 80 W m −2 for the months from September-December (de Boer et al., 2011), which is significantly larger than values reported by Dong et al. (2010). In contrast to the remainder of the Arctic, Greenland is covered by ice and snow during the entire year. Clouds have a warming effect during the entire annual cycle, with the highest values of CRE found from July-August (see Fig. 5 in Miller et al., 2015). Miller et al. (2015) report values within a range of about 25 to 45 W m −2 , which is larger than the mean values reported here, reaching a maximum of 20.5 W m −2 over Greenland (see Fig. 17).
Kay and L'Ecuyer (2013) derived the CRE based on a combination of active and passive satellite-based remotesensing observations over the Arctic Ocean, and report a mean CRE SFC of about −30 to −40 W m −2 at the surface for the period from the end of May to mid-July. These val- ues are evidently larger than the mean value of −9.3 W m −2 found in the present study. However, the results of Kay and L'Ecuyer (2013) focus mostly on the Arctic Ocean and exclude Greenland.
The comparison and discussion of our results with previous studies provide valuable context on the CRE across the Arctic for different seasons and locations. Nevertheless, it is worth noting that such a comparison cannot fully account for all factors affecting the results, including the consideration of particular sites, regions, cloud conditions, seasons, data sets, methods to obtain α, and different temporal averaging.

Conclusions and Outlook
This study investigated the characteristics of Arctic clouds, and their effect on radiative fluxes and the radiation budget during Polarstern's PS106 cruise held in the early summer of 2017. An intercomparison of cloud properties, derived from the shipborne cloud remote-sensing observations using the CERES SYN1deg ed. 4.1 satellite products, was first conducted, followed by an intercomparison of radiative fluxes from shipborne observations, radiative transfer simulations, and CERES. For this purpose, a radiative transfer setup for the simulation of SW and LW fluxes, denoted as T-CARS, was implemented using atmospheric profiles from the ERA5 reanalysis, Cloudnet cloud products, and other ancillary data as input. This setup has also been used to quantify the sensitivity of CS radiative fluxes to various input parameters.
Considering the different perspectives and sensitivities of the satellite and shipborne remote-sensing observations, the horizontally resolved field of view of CERES SYN1deg, and the vertically resolved view of the active remote-sensing instruments aboard Polarstern are found to offer complementary information on Arctic clouds.
A list of the main conclusions of this paper is given here: 1. Based on the CERES SYN1deg (Cloudnet) products, clouds occurred for about 86.7 % (76.1 %) of the time during PS106. Differences between the CERES SYN1deg and Cloudnet are mainly due to the different spatial resolution, data gaps, and moments of misidentifying clouds by Cloudnet. In the case studies, situations were identified where CERES SYN1deg underestimates high-level clouds, likely due to previously reported limitations of CERES SYN1deg products in polar regions Sun-Mack et al., 2018).

2.
A case study comparing the CERES SYN1deg products (T-CARS simulations) with ship-based observations of downward fluxes in CS conditions yields satisfactory agreement, with flux mean differences of −13 W m −2 (−13.6 W m −2 ) for LWD, and −2.1 W m −2 (−7.9 W m −2 ) for SWD. This finding holds, despite the harsh environmental conditions and shipborne operation, which likely increase the instrumental uncertainties (estimated to be ±20 W m −2 for pyranometer/SWD and ±13 W m −2 for pyrgeometers/LWD). While the T-CARS simulations currently neglect aerosol effects, the CERES SYN1deg products suggest that on average, aerosols increase the LWD flux by 0.5 W m −2 , and decrease SWD by 10.8 W m −2 (Fig. B1), confirming that aerosol effects are minor for LW and only relevant for SW.
3. For AS conditions, CERES SYN1deg surface radiative fluxes and observations agree well during PS106, with a mean difference of 6.0 and 23.1 W m −2 for SWD and LWD, respectively. The comparison yields a R 2 of 0.69 (0.77) and RMSE of 12.2 W m −2 (46.5 W m −2 ) for LWD (SWD). These results are consistent with the find-ings by Dong et al. (2016), and Riihelä et al. (2017), who also performed comparisons between in situ radiative fluxes and CERES SYN1deg products at the ARM NSA site, as well as the Tara ice camp, and on the Greenland Ice Sheet, respectively. Instrumental limitations arise from the ship-based operation of instruments and the harsh environmental conditions, which cause a reduced accuracy of the observed radiative fluxes, and limits their accuracy for radiative closure studies.  (Table 3). For the entire Arctic, the net CRE is similar to PS106, with values of −9.3, and −40.1 W m −2 , at the SFC and TOA, respectively (Fig. 17). The similarity of the local and regional CRE suggests that the PS106 cloud observations along the cruise track are representative of the cloud conditions found over the Arctic during this time of the year. Our results are also consistent with the summer results obtained from Ebell et al. (2020), who calculated the CRE during 2017 at Ny-Ålesund -NO (see their Fig. 6).
In the future, we plan to carry out a similar analysis based on observations from the MOSAiC expedition. This will extend the temporal coverage of observations to a full annual cycle in the central Arctic, and thus increase the climatological relevance of findings.
The radiative closure assessment using shipborne remotesensing observations as input for the T-CARS simulations has been limited to specific cases in the present study. An extension to the full period of the cruise would allow a more in-depth investigation of the accuracy of the Cloudnet cloud products for specific cloud conditions. Particular attention should be given to the accuracy of the Cloudnet retrievals for relatively low values of Q L and Q I . For MOSAiC, cloud products based on the Shupe-Turner retrievals  and distributed by ARM will also be available, allowing for a comparison of the Cloudnet and ARM products with respect to their ability to accurately represent the optical properties of Arctic clouds. Particular attention will be given to periods when the LW radiative properties are sensitive to small changes in Q L (e.g. < 50 g m −2 ; Turner, 2007;Tjernström et al., 2015;Achtert et al., 2020), which have been found to be poorly captured in our study. Moreover, given the importance of α and T s to the interpretation of the radiation budget, it is planned to evaluate the local values observed during MOSAiC to the values used in this study.
Satellite observations and products are necessary to extend the analysis of CREs to a wider regional and decadal perspective. This would allow the investigation of long-term changes of CRE and the radiation budget. To contribute to the latter, we acknowledge the particular importance of the LW radiation budget across the entire annual cycle in the Arctic region (Sedlar and Tjernström, 2017). Therefore, we will expand our analysis by implementing radiative kernel techniques to diagnose climate feedback based on ground-based and satellite remote-sensing observations (Soden et al., 2008;Tan and Storelvmo, 2019).      The CERES SYN1deg products were obtained from the NASA Langley Research Center Atmospheric Science Data Center, which is available at NASA/LARC/SD/ASDC (2017) (https://doi.org/10.5067/TERRA+AQUA/CERES/SYN1DEG-1HOUR_L3.004A).
Author contributions. CBV, HD and AH conceptualized the manuscript. PS and HJG were responsible for the data curation of the observations. CBV performed the formal analysis, methodology, visualization and writing of the original draft preparation. AM was responsible for the funding acquisition and the project administration. All authors contributed to the subsequent improvement of the analysis and editing of the manuscript.
Competing interests. The contact author has declared that neither they nor their co-authors have any competing interests.
Disclaimer. Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Special issue statement.
This article is part of the special issue "Arctic mixed-phase clouds as studied during the ACLOUD/-PASCAL campaigns in the framework of (AC) 3 (ACP/AMT/ESSD inter-journal SI)". It is not associated with a conference.