Articles | Volume 22, issue 22
Atmos. Chem. Phys., 22, 14603–14630, 2022
Atmos. Chem. Phys., 22, 14603–14630, 2022
Research article
17 Nov 2022
Research article | 17 Nov 2022

Southern Ocean cloud and shortwave radiation biases in a nudged climate model simulation: does the model ever get it right?

Southern Ocean cloud and shortwave radiation biases in a nudged climate model simulation: does the model ever get it right?
Sonya L. Fiddes1,2, Alain Protat3,1, Marc D. Mallet1, Simon P. Alexander4,1, and Matthew T. Woodhouse2,1 Sonya L. Fiddes et al.
  • 1Australian Antarctic Program Partnership, Institute for Marine and Antarctic Studies, University of Tasmania, Hobart, Australia
  • 2Climate Science Centre, Oceans and Atmosphere, Commonwealth Scientific and Industrial Research Organisation, Aspendale, Australia
  • 3Bureau of Meteorology, Melbourne, Australia
  • 4Australian Antarctic Division, Hobart, Australia

Correspondence: Sonya L. Fiddes (


The Southern Ocean radiative bias continues to impact climate and weather models, including the Australian Community Climate and Earth System Simulator (ACCESS). The radiative bias, characterised by too much shortwave radiation reaching the surface, is attributed to the incorrect simulation of cloud properties, including frequency and phase. To identify cloud regimes important to the Southern Ocean, we use k-means cloud histogram clustering, applied to a satellite product and then fitted to nudged simulations of the latest-generation ACCESS atmosphere model. We identify instances when the model correctly or incorrectly simulates the same cloud type as the satellite product for any point in time or space. We then evaluate the cloud and radiation biases in these instances.

We find that when the ACCESS model correctly simulates the cloud type, cloud property and radiation biases of equivalent, or in some cases greater, magnitude remain compared to when cloud types are incorrectly simulated. Furthermore, we find that even when radiative biases appear small on average, cloud property biases, such as liquid or ice water paths or cloud fractions, remain large. Our results suggest that simply getting the right cloud type (or the cloud macrophysics) is not enough to reduce the Southern Ocean radiative bias. Furthermore, in instances where the radiative bias is small, it may be so for the wrong reasons. Considerable effort is still required to improve cloud microphysics, with a particular focus on cloud phase.

1 Introduction

Global climate models, including coupled, Earth system and atmosphere-only models, have presented a significant radiative bias over the Southern Ocean (SO) for a number of generations, as documented by the Coupled Model Intercomparison Project (CMIP) phases 3, 5 and 6 (Bodas-Salcedo et al.2014; Schuddeboom and McDonald2021). This bias has also been found in numerical weather prediction (NWP) models (Protat et al.2017; McFarquhar et al.2021). The radiative bias has largely been found to be a result of poor simulation of cloud properties within models, in particular, the cloud phase in the cold sector of extra-tropical cyclones (Haynes et al.2011; Bodas-Salcedo et al.2016b). Over the SO, clouds are ubiquitously made up of supercooled liquid water droplets, setting them apart from clouds over other parts of the Earth (Huang et al.2012; Chubb et al.2013; Mace and Protat2018; Listowski et al.2019). This distinction creates a key problem for the simulation of SO clouds given that the majority of cloud and aerosol microphysical parameterisations have been developed from observations collected in regions of the world where supercooled liquid water droplets do not occur frequently. The resulting positive surface radiative bias caused by the poor simulation of cloud characteristics has implications for numerous aspects of the global and regional climate system, including for the global energy budget (Haynes et al.2011; Frey et al.2017; Schuddeboom and McDonald2021), the strength and positioning of the Intertropical Convergence Zone (ITCZ) (e.g. Hwang and Frierson2013, debated in Hawcroft et al.2017), and the jet streams (Kay et al.2016; Hawcroft et al.2017) as well as SO sea surface temperatures (SSTs) (Hyder et al.2018).

Within the cloud radiation bias literature, model evaluation techniques now commonly involve both the use of an observational simulator (satellite or ground based) to ensure that model and observed products are comparable (e.g. Bodas-Salcedo et al.2011; Kuma et al.2020) and a method to characterise cloud types and/or the associated synoptic conditions (e.g. Williams and Webb2009; Field and Wood2007). Many of these cloud characterisation techniques have focused on a clustering-based approach using cloud histogram products from the Cloud Feedback Model Intercomparison Project (CFMIP) Observation Simulator Package (COSP), utilising various satellites and models. Examples of both k-means clustering (Jakob and Tselioudis2003; Williams and Tselioudis2007; Williams and Webb2009; Tselioudis et al.2013; Oreopoulos et al.2014; Mason et al.2015; Oreopoulos et al.2016; Leinonen et al.2016; Tselioudis et al.2021; Cho et al.2021) and self-organising maps (SOMs) (McDonald et al.2016; Schuddeboom et al.2018) are found in the literature, often presenting comparable cloud regime structures.

For example, Tselioudis et al. (2021), in analysing the CMIP6 model suite, find an improvement on CMIP5 in simulating global mean “weather state” distributions (i.e. cloud regimes derived via k-means clustering from the updated International Satellite Cloud Climatology Project – ISCCP – cloud histograms). Shallow cumulus clouds are found to be consistently underestimated by a selection of CMIP6 models, while for the other weather states described, a larger model spread is found (Tselioudis et al.2021). However, they note, as do Schuddeboom et al. (2018), that averaging masks important regional biases, such as those over the SO. Schuddeboom and McDonald (2021) in fact find important differences in cluster frequency biases for the SO compared to the global values in CMIP6 models. Of particular interest, they find that stratocumulus clouds are occurring too frequently over the SO in the CMIP6 models but compensate for the associated positive radiative bias by not being bright enough, which is opposite to CMIP5 findings of too few and too bright clouds. Schuddeboom and McDonald (2021) also note that the models with the largest compensating errors tend to have the lowest radiation biases over the SO, indicating that updated parameterisations and tuning are causing the “right” answer for the wrong reasons. This finding is particularly concerning, given the effort to reduce the SO radiative bias so far. Similar conclusions have been made by Gettelman et al. (2020). Additionally, Zelinka et al. (2020) find that CMIP6 generation models have a higher climate sensitivity, in part due to changes in how SO cloud microphysics are now treated.

Other studies have developed regimes dependent on cloud dynamic or thermodynamic properties, such as cyclone compositing techniques (Field and Wood2007) or mid-tropospheric large-scale vertical motion (Bony and Dufresne2005). Importantly, for many subsequent studies, the evaluation methods described in such papers as listed above are used to understand biases beyond those of the cloud histograms (e.g. radiative biases or supercooled liquid water biases). For example, the Bodas-Salcedo et al. (2014) work relies on the methods presented by both Williams and Webb (2009) and Field and Wood (2007).

The majority of the aforementioned cloud regime studies have compared free-running simulations, such as those performed for the CMIP experiments, including the Atmospheric Model Intercomparison Project (AMIP), where sea surface temperatures and sea ice concentrations are prescribed. However, with free-running simulations, the depth of analysis is limited as the synoptic-scale meteorology cannot be considered the same. Some studies have used synoptic compositing to alleviate this issue, where certain synoptic situations can be compared like-for-like, and location and timing are then considered irrelevant. However, these studies are often limited to one synoptic type, such as cyclone centres, ignoring a number of other synoptic situations relevant for the SO, as well as any compensating errors that may exist. Additionally, focusing on just one synoptic condition follows a pre-conceived idea of the error, which may or may not hold true for newer model generations. Alternatively, many studies use a decomposition of the radiative bias into three components: a term that quantifies the bias in the frequency of occurrence of cloud clusters, a term for the bias in the radiative balance itself and a second-order co-variation (Williams and Tselioudis2007; Williams and Webb2009; Mason et al.2015; Schuddeboom et al.2018; Schuddeboom and McDonald2021). This decomposition only considers the overall means or frequencies of occurrence, and so day-for-day comparison is not required. With this decomposition, both Mason et al. (2015) and Schuddeboom et al. (2018) found that the largest proportion of the radiative bias was explainable by biases in the cloud frequency of occurrence. This has led Schuddeboom et al. (2018) to further speculate that clouds themselves may be “simulated well” but their distributions are wrong, leading to large errors in radiative biases. However this has not been explored further.

In this paper, we present an in-depth evaluation of the SO radiative bias of the Australian Community Climate and Earth System Simulator (ACCESS) – atmosphere-only model version 2 (AM2). Importantly, we run a nudged simulation, so the model can be compared to satellite products directly in time and space. We then use cloud regime clustering to understand the underlying reasons for the observed cloud and radiative biases. By using a nudged simulation, we are able to composite and evaluate days and locations where cloud regimes are correctly identified by the model (i.e. are the same as what was seen by the satellite), as well as instances when the model incorrectly simulates the cloud regime. We aim to answer the following question: if the model simulates the correct cloud structure for the time and place, is the radiative bias improved? This work provides the essential first step towards a long-term goal of improving the ACCESS model (and possibly the Unified Model family) and the representation of clouds over the SO. The methods outlined in this work will be used to inform and further evaluate any developments made to the model to ensure that any resulting changes in the model are understood.

2 Data and methods

2.1 ACCESS-AM2 model set-up

The ACCESS-AM2 model is an atmosphere-only configuration of the ACCESS-CM2 coupled climate model and is described in Bodman et al. (2020). The atmospheric component of ACCESS is the Unified Model (UM) at vn10.6, GA7.1, which is fully described in Walters et al. (2019). In brief, the radiation scheme used in ACCESS-AM2 is the Suite Of Community RAdiative Transfer codes based on Edwards and Slingo (SOCRATES) (Edwards and Slingo1996) and is called hourly. The prognostic cloud fraction and condensate (PC2) cloud scheme is used (Wilson et al.2008), which includes large-scale and convective clouds. The convective scheme, including downdraughts and momentum transport, is based on Gregory and Rowntree (1990). Further details on ACCESS-CM2 and GA7.1 configurations can be found in Bi et al. (2020) and Walters et al. (2019). In the atmosphere-only configuration, we prescribe sea surface temperature (SST) and sea ice concentration (SIC) following the CMIP6 AMIP protocol (Eyring et al.2016). The input fields can be found at the input4MIPs web page (, last access: 4 March 2021), and the data method is described in Hurrell et al. (2008). Solar forcing, greenhouse gases, volcanic aerosol optical depth, aerosol chemistry emissions and ozone are also prescribed according to CMIP6 (Eyring et al.2016) and are available at the input4MIPs site.

ACCESS-AM2 includes the GLOMAP-mode (Global Model of Aerosol Processes modal version) aerosol microphysical scheme (Mann et al.2010, 2012), including parameterised sulfur chemistry driven by prescribed oxidants. GLOMAP-mode is a two-moment, pseudo-modal microphysical aerosol scheme, representing four soluble aerosol modes (nucleation, Aitken, accumulation and coarse) and one insoluble aerosol mode (Aitken). Each mode is internally mixed. GLOMAP uses a scaled (1.7×, as recommended in Mulcahy et al.2018) dimethyl sulfide surface water monthly climatology (Lana et al.2011) combined with the Liss and Merlivat (1986) flux parameterisation (see also Fiddes et al.2018). Sea salt emissions are calculated online and occur following the Gong (2003) parameterisation. Dust is treated separately according to Woodward (2001), using a six-bin scheme.

Historical (pre-2014) anthropogenic aerosol emissions are provided by the Community Emissions Data System (Hoesly et al.2018), and biomass burning emissions are provided by the Global Fire Emissions Database with small fires (GFED4s) (van Marle et al.2017). Post-2014, the shared socioeconomic pathway (SSP) 2-4.5, a “middle of the road emissions pathway” (Fricko et al.2017), is used, with emissions developed by the Integrated Assessment Modeling Consortium and which is described in Feng et al. (2020).

The land surface scheme in ACCESS-AM2 is the Community Atmosphere Biosphere Land Exchange (CABLE) version 2.5 land surface model (also described in Bi et al.2020). ACCESS-AM2 runs at a 1.25 × 1.875 horizontal resolution with 85 vertical levels and uses the “ENDGame” dynamical core (Wood et al.2014). Further information on the general model set-up and preliminary model evaluation against standard climate fields, such as surface air temperatures, sea surface temperature, rainfall, mean sea level pressure (MSLP) and precipitation, can be found in Bi et al. (2020) and Bodman et al. (2020).

The simulation presented in this study has been nudged to the European Centre for Medium-Range Weather Forecasts (ECMWF) Reanalysis 5 (ERA5) product (Hersbach et al.2020). Nudging occurs at every dynamical time step from reanalysis fields that are updated every 3 h for the horizontal winds and temperature in the free troposphere and stratosphere. In this study, output is created at monthly and daily mean resolution for the years 2015–2019 (chosen to overlap with recent Southern Ocean field campaigns described in McFarquhar et al.2021).

2.2 Observational products

Simulated cloud fields are evaluated against the Moderate Resolution Imaging Spectroradiometer (MODIS) combined Aqua and Terra, Level 3 daily, 1 × 1 grid, Collection 6.1 COSP product (MCD06COSP_D3_MODIS) (Platnick et al.2017). This product has been derived specifically for use in model evaluation using the COSP outputs for CMIP6 and is described in Hubanks et al. (2020). Available properties include the cloud optical depth (τ) for total, ice and liquid clouds; cloud top pressure (CTP); cloud mask fraction (derived from pixel-level cloudiness assessments) for total, low, middle and high clouds; cloud retrieval fraction (derived from the successful retrieval of τ) for total, ice and liquid clouds; cloud effective particle radius (Reff) for ice and liquid clouds; liquid water path (LWP); ice water path (IWP); and joint histogram for CTP and τ for cloudy and partly cloudy pixels. All cloud properties in the MCD06COSP_D3_MODIS product are for daytime-only scenes. Full descriptions of these cloud properties can be found in Pincus et al. (2012), Platnick et al. (2017) and Hubanks et al. (2020).

Collection 6 now includes partly cloudy scenes that represent heterogeneous broken cloudy or cloud edge pixels (Platnick et al.2017). Successfully retrieved cloudy pixels are considered to be high quality, whereas partly cloudy scenes have a higher rate of retrieval failure of 34 % and are slightly less robust (Platnick et al.2017). Collection 6 has shown a marked improvement upon Collection 5 in part due to the re-writing of the cloud optical property retrievals, resulting in an increase in cloud-phase classification of 10 % and a 90 % agreement in total cloud phases between MODIS and the CALIOP (Cloud-Aerosol Lidar With Orthogonal Polarization) retrievals for cloudy pixels (Platnick et al.2017; Marchant et al.2016). The largest improvement in cloud-phase detection was found for opaque clouds over ice- or snow-covered areas, whilst detection for thin cloud retrievals over warm or bright surfaces remains an issue. Improved optical retrievals have also reduced the biases in Reff; however, it has been noted that much of the evaluation performed has been for single-layered clouds, with multi-layered clouds remaining un-assessed (Marchant et al.2016).

Comparisons of MODIS retrievals to aircraft field campaigns over the north-eastern and south-eastern Pacific, as well as ground-based observations in Finland, have indicated that liquid Reff is overestimated, which impacts the LWP retrievals (Painemal and Zuidema2011; King et al.2003; Sporre et al.2016; Noble and Hudson2015; Min et al.2012). However, a number of studies have also noted that there is good agreement in variability between liquid Reff and the LWP (Min et al.2012; Noble and Hudson2015). The cloud retrieval fraction and cloud optical depth have been found to perform better and be strongly correlated with field observations (Sporre et al.2016; Noble and Hudson2015). Ice cloud retrievals have been known to be more difficult for a number of reasons relating to crystal shape and scattering properties as well as lifetime and expanse. Evaluation of the MODIS Collection 5 ice retrievals found an overestimation compared to an infrared radiative closure method for determining ice τ (Holz et al.2016). The updates to the optical retrieval method in Collection 6 have reduced the bias to a more satisfactory level. We note that the evaluations discussed above have not been performed for the specific COSP product we are using in this work but give us an idea of the MODIS satellite retrievals' overall performance.

In this work, we are using the cloud retrieval fraction, which we subsequently refer to as the cloud fraction (CF), for ice (CFI) and liquid (CFL) clouds and the joint histogram for CTP and τ for cloudy and partly cloudy pixels. Following the methods of Oreopoulos et al. (2016), Schuddeboom et al. (2018) and Saponaro et al. (2020), we use the combined joint histogram product (i.e. the sum of the cloudy and partly cloudy products). Very large biases that we considered unrealistic were found for the modelled Reff, and hence these fields have not been used for this work. Similar biases were found for the LWP and IWP, and hence, the COSP-derived products (described in the next section) for these fields were not used but were replaced with the raw model output. While this adds a degree of uncertainty to this work, we believe such an analysis with the derived COSP fields would not have been useful.

Furthermore, care has been taken to ensure that we are comparing only grid-box mean values of the LWP and IWP. Model outputs (both COSP and raw) are provided directly as grid-box means, while the MODIS products are provided as in-cloud mean values. We have performed the appropriate conversions where the grid-box mean is equal to the in-cloud mean multiplied by the cloud fraction. After comparison of both grid-box mean and in-cloud mean values, we have chosen to use the grid-box mean values. This choice does effect some of the results of this study, as would choosing to use the in-cloud values instead. However, we believe this choice is robust for two reasons: firstly, grid-box means are the native model output and this is a model evaluation study and, secondly, the grid-box means showed a better model performance than the in-cloud mean values, likely due to the weighting of the cloud fraction.

For this analysis we have removed all instances of clear sky from both the model and the observed histograms. We have done this by finding the instances where the CTP–τ histograms summed to zero. While this allows us to focus on instances when the model is simulating cloud, either correctly or incorrectly, it also means we are not considering instances when either the model or the satellite simulated cloud while the other simulated clear sky. In this work, we are considering grid boxes of 1.25 × 1.875 and daily means, meaning that there are very few instances of clear sky occurring for a full day over a large domain. On average, there are fewer than 1.5 % of grid boxes simulated as clear sky in ACCESS-AM2 over the Southern Ocean and far fewer in the satellite (in part due to the addition of the partly cloudy optical depth retrievals as well). For these reasons, we believe this choice is robust.

To evaluate the radiative bias in ACCESS-AM2, we use the Clouds and the Earth’s Radiant Energy System (CERES) SYN1deg product (Doelling et al.2013, 2016) for evaluation at daily timescales. The bias in the outgoing top-of-atmosphere (TOA) shortwave (SW) cloud radiative effect (CRE) (SWCRETOA) has been chosen for analysis in this work based on previous findings from the literature presenting the SWCRE as the most problematic aspect of the SO energy balance. The CRE presented in this work is the difference between the clear-sky radiation and the all-sky radiation fields (for both the model and the satellite products). Throughout this paper, a predominantly positive SWCRETOA is present; e.g. the ACCESS-AM2 model is allowing too much shortwave radiation to pass through the clouds and not reflecting enough shortwave radiation out to space via clouds, corresponding to too much shortwave radiation reaching the surface.

2.3 COSP

To directly compare the ACCESS-AM2 cloud properties to those of satellite products, we use the COSP, described by (Bodas-Salcedo et al.2011), as prescribed for simulations within the CFMIP activity of CMIP6 (Webb et al.2017). The MODIS products provided the best coverage for the time period of interest to this study. For this reason, no other satellite products are considered in this work. Simulated fields include the joint histograms of CTP and τ, as well as liquid and ice cloud fractions, water paths, and τ. Using COSP output allows an appropriate comparison of the model to satellite products. This method applies the assumptions and limitations of the satellite algorithms to the model output, limiting the possibility that biases are due to differences in processing. For the LWP and IWP, we are considering the grid-box mean (as opposed to the in-cloud mean). We note that in the IWP and LWP fields, significantly large and seemingly unrealistic biases between the model COSP product and the MODIS product were found. The IWP and LWP are reliant on Reff retrievals, which as discussed above, are less well captured in satellite products. For this reason, as stated above, the actual simulated (i.e. raw model output) IWP and LWP are used for this analysis instead of the COSP-derived product.

2.4 Cloud regime clustering

In this study, we use k-means clustering to derive cloud regimes (Anderberg1973). k-means clustering is a form of unsupervised machine learning that separates N points into k clusters by minimising the sum of squared distances within each cluster. In this case, the Euclidean distance is considered, which is equivalent to the minimisation of variance (or inertia) within each cluster. To perform this analysis, the scikit-learn (Pedregosa et al.2011) and Dask-ML Python packages were used, where k-means clustering has been implemented for distributed computing.

We apply k-means clustering to 4 years (2015–2019) of daily MODIS histogram data (CTP–τ), over the entire globe. We have normalised the CTP–τ histograms to 1 (as opposed to the cloud fraction) to limit the impact of cloud fraction biases within the ACCESS-AM2 model on the clustering results. Whilst this impacts our ability to compare our results to other studies, it allows the clustering to target cloud vertical extent and thickness regardless of the total fraction and how well it is captured by the ACCESS-AM2 model.

A complication of clustering methods, including k-means clustering, is the choice in the number of clusters. While there is no right or wrong answer to the number of clusters to select, there are practical considerations and statistical metrics that can help guide this choice. Three statistical metrics were applied to this work in an attempt to aid the decision on the number of clusters to choose, including the “elbow” method (Wilks2011), the Calinski–Harabasz (CH) index (Calinski and Harabasz1974) and the Davies–Bouldin (DB) index (Davies and Bouldin1979). The scikit-learn application programming interface provides a detailed explanation of each metric in addition to their advantages and disadvantages. Unfortunately, the statistical guidance provided by these metrics was not useful in cluster number selection (suggesting 2, 4 and 17 clusters respectively). After consideration of a range of choices, we selected 12 clusters.

We then used the MODIS cluster regimes to predict the cloud regimes of ACCESS-AM2, by fitting each ACCESS-AM2 data point to 1 of the 12 cluster centres. We chose this method, as opposed to a hybrid approach as taken by Mason et al. (2015), so that we could apply the same cluster centres to multiple model simulations, allowing a direct comparison over a number of simulations. A similar method is described in Williams and Webb (2009), where they also point out that this method eliminates further subjective choices.

We have chosen to perform clustering over the entire globe, though only results for the SO (defined for this work as the broad region from 30–69 S) are examined in this current study. This choice will allow us to assess if any changes applied to the model focused on improving Southern Ocean clouds have unintended effects outside this region. For this reason, while the cloud regime histograms are defined globally, only those important to the SO will be examined in detail.

2.5 Bias decomposition

To explore cloud property biases, we follow the decomposed bias metrics described in Williams and Webb (2009), Mason et al. (2015), and Schuddeboom et al. (2018), where the bias (Δ: ACCESS-AM2  satellite) of a simulated field of interest F for each cluster r can be summarised as the errors due to the relative frequency of occurrence (RFO; FrsatΔRFOr, referred to as RFO errors) plus the errors in the simulated field (ΔFrRFOrsat, referred to as field errors) plus a second-order co-variation term (ΔFr⋅ΔRFOr) otherwise referred to as the cross term.

(1) δ F r = F r sat Δ RFO r + Δ F r RFO r sat + Δ F r Δ RFO r
3 The ACCESS-AM2 radiation and cloud biases

The annual and seasonal ACCESS-AM2 SWCRETOA and cloud biases compared to the CERES-SYN1deg or MODIS products are shown in Fig. 1. The boundaries of our analysis, shown by the dashed lines, represent three regions of interest: the mid-latitudes defined as 30–43 S, the sub-polar region defined as 43–58 S and the polar region defined as 58–69 S of the SO. While seasonal expansion and contraction of the Ferrel and polar atmospheric cells mean these stationary boundaries may not capture the seasonal bias boundaries perfectly, for the purpose of this work they are satisfactory. These regions will be used throughout the rest of this work to explore how differently cloud types are contributing to the SWCRETOA bias in each region.

Figure 1Column (a) shows the annual mean satellite (CERES-SYN1deg or MODIS)-derived field for row 1, SWCRETOA (W m−2); row 2, CFL; row 3, CFI; row 4, LWP (g m−2); and row 5, IWP (g m−2); column (b) shows annual biases (ACCESS-AM2  satellite) in the same respective fields; columns (c)(f) show seasonal biases (DJF, JJA, MAM, SON). The yellow line in columns (c)(f) shows the mean seasonal extent of sea ice (calculated at 15 % coverage), taken from the Hurrell et al. (2008) AMIP sea ice concentration data set. Dashed lines are located at 30, 43, 58 and 69 S.

Considering firstly the SWCRETOA (row 1, Fig. 1), a persistent positive bias in the polar region of the SO is found across all seasons. This bias has not been improved upon in this latest-generation model, with a similar total SWTOA radiative bias (not shown) to that reported in a previous version of ACCESS presented in Fiddes et al. (2018). The summer (DJF), shown in Fig. 1c1, continues to have the largest polar bias, while the winter (JJA) season has the smallest bias overall, in agreement with previous work (Bodas-Salcedo et al.2012; Kuma et al.2020). A positive–negative–positive “tripole” can broadly be seen in the annual mean biases (Fig. 1b1), where the SWCRETOA is on average overestimated in the mid-latitudes, weakly underestimated in some parts of the sub-polar SO and largely overestimated in the polar SO. MAM and JJA (Fig. 1d1, e1) show very weak positive biases in the polar region, though with well-defined transitions from negative to positive in the sub-polar to mid-latitude regions. In SON and DJF, the positive biases in the polar and mid-latitude regions are well defined, while the negative sub-polar biases become far less zonally coherent. Some strongly positive regions in spring are also found along the edge of the sea ice zone (yellow line). This feature could be due to some discrepancy between the sea ice concentrations prescribed to the model and what was seen by the CERES satellite product, or there could be issues with the satellite retrieving observations over ice-covered areas. Kuma et al. (2020) also noted a strong latitudinal dependence of radiative biases in a similar version of the UM and a reanalysis product. While much of the recent literature has focused on the strongly positive biases in the polar region of the SO in DJF, we show here that the radiative biases for the Southern Ocean are complex and not always zonal. The summertime position of the positive and negative biases agrees broadly with recent observational work analysing air mass characteristics (e.g. aerosol/air mass origin and impact on cloud condensation nuclei and cloud properties; Humphries et al.2021; Simmons et al.2021; Mace et al.2021). This agreement highlights the importance of understanding not just the region where the biases are the most problematic but also the Southern Ocean system as a whole.

Rows 2–3 in Fig. 1 show the respective biases for the CFLs and CFIs compared to MODIS. Over all SO regions and seasons, negative CFL biases are found, in particular for the sub-polar and polar regions. The strongest bias is found in DJF for the polar region, though this extends into the sub-polar region, while the JJA appears to have the weakest biases. Interestingly, this bias in the CFL is not compensated for by a consistent overestimation in the CFI for all regions (Fig. 1, row 3). Annually, a too high CFI is found for the polar and sub-polar regions, while this transitions to a too small CFI in the mid-latitude region. In DJF, the positive sub-polar bias is very well defined and, as with the polar bias, at its largest. Over MAM, the sub-polar bias remains consistently constrained by the mid-latitude–sub-polar boundary, while the polar-region bias begins to weaken and by JJA has become weakly negative. The clear distinction between the mid-latitude and sub-polar regions has also weakened in JJA, with the biases becoming less zonal. SON returns to predominantly positive polar and sub-polar regions, though with a less well defined boundary between the sub-polar and mid-latitude regions than what is seen for MAM or DJF.

Of note is the fact that the positive biases in the CFI are much weaker in magnitude than the negative biases in the CFL. Further, the spatial patterns of the biases are not the same. On average annually, the positive biases in the CFI over the polar and sub-polar regions only partially compensate for the negative biases in the CFL. In the mid-latitude region however, both the cloud fraction biases (liquid and ice) are weakly negative. This indicates too few clouds overall in this region, which can explain the positive SWCRETOA bias. Too few liquid clouds, which are instead simulated as ice clouds, will result in clouds that are more optically thin causing too little shortwave radiation to be reflected out to space. This may help explain the average positive SWCRETOA bias in the polar region; however it does not explain the more negative bias in the sub-polar region. The lack of spatial correlation for the cloud fractions indicates that although the inaccurate partitioning of liquid and ice cloud is an important contributor to the radiative bias, it is not the whole story.

Rows 4–5 in Fig. 1 show the respective biases for the LWPs and IWPs compared to MODIS. Broadly, the LWPs and IWPs tend to be of opposite signs to each other and similar in magnitude, with positive LWP biases in the mid-latitudes, a transition to negative in the sub-polar region and negative LWPs in the polar region. However, the change from positive to negative (or vice versa for IWPs) appears to be more southwards for the LWPs and more northwards for the IWPs. Too much ice, in place of water, potentially supercooled liquid water, again would produce an optically thinner cloud, which would cause too much sunlight to reach the surface and a positive SWCRETOA bias. In the polar region, this process is likely contributing to the positive SWCRETOA bias. In the sub-polar region, in DJF and to a lesser degree the other seasons, both water paths are in places positive, increasing the optical thickness of the cloud overall, which may contribute to a negative SWCRETOA bias. However, the regions of negative SWCRETOA bias are not easily reconciled with the biases in the LWPs and IWPs, indicating that this is a complex system. Finally, in the mid-latitudes region, a positive LWP bias is found over all seasons, a positive IWP is found in winter and spring, and a weakly negative IWP is found in summer and autumn. Positive water paths may have a compensating effect on the too little cloud fractions found in the region, resulting in the much weaker radiative bias. One reason for the different biases in the warmer mid-latitude regions compared to the other regions is likely the temperature dependence with respect to supercooled liquid water formation.

To summarise, in the polar region, the frequency and LWP of liquid clouds are largely underestimated, likely resulting in a very strong positive radiative bias. This is compounded by a slight overestimation of ice clouds containing too much ice, which are optically thinner, adding to the positive radiative bias. This finding agrees well with the literature in that not enough liquid water exists below 0 C, instead being simulated as ice (Bodas-Salcedo et al.2016b). In the sub-polar region, the frequency of liquid clouds is weakly underestimated but is compensated for by a too high LWP, creating too few optically thicker clouds. Combining these biases with too much ice cloud and a too high IWP results in a weak radiative bias that fluctuates between positive and negative. Finally, in the mid-latitudes, negative CFLs are again combined with positive LWPs, while the CFI and IWPs are both weakly negative, resulting in a weak positive radiation bias. These patterns are found to be generally consistent across the seasons, with some degree of variability in the strength of the biases.

These results show that in the polar region, the biases in cloud fraction and water paths can satisfactorily explain the SWCRETOA bias. For the other two regions however, the influence of the cloud biases is not as clear-cut. The role of the cloud phase, in particular that of supercooled liquid water, appears to have significant latitudinal dependence, likely influenced by a range of factors, including temperatures and ice-nucleating particle (INP) availability. This is a significant issue for model development, as has been shown previously, where fixing one of these issues for the SO region (e.g. the ratio of liquid to ice clouds or the number of INPs) may have detrimental effects on other parts of the system. For this reason, we suggest that we need a more in-depth analysis of the problem with respect to the ACCESS-AM2 model before we can attempt to reduce this SWCRETOA bias. To do this, the next section presents results from the cloud regime clustering, which will then be used to understand the radiative and cloud biases in more detail.

4 Cloud regimes


The k-means clustering technique was applied to 5 years of MODIS daily-mean joint histograms over the entire globe. The 12 resulting cluster centres are shown in Fig. 2. We have not shown the global RFO but have chosen to show the RFO for the Southern Hemisphere only in Fig. 3 to focus on the Southern Ocean domain (30–69 S). The clusters have been approximately arranged from low to high CTP along the vertical and from thin to thick τ in the horizontal.

Three of the resulting clusters are limited to the Antarctic region and have similar (yet accordingly distinct) cloud characteristics. These clusters have come about in part as a result of our normalisation by 1, instead of the cloud fraction, and make up only a small fraction of overall cloud occurrence. Furthermore, as noted by Williams and Webb (2009), cloud retrievals over ice-covered regions with high albedo can be problematic. For these reasons, the three Antarctic clusters are merged into one for the remainder of this analysis. We have labelled the remaining cloud regimes according to the cloud types that best reflect these profiles; however we acknowledge that CTP–τ is often not enough to truly distinguish one cloud type from another (and that some quite different clouds may have similar CTP–τ profiles). Hence, these labels should be considered just that: a way of easily differentiating the cloud regimes in this study.

Figure 2The mean CTP (y axis, hPa)–τ (x axis, optical depth unitless) histograms for each cloud regime found by k-means clustering on the MODIS product (daily means, 2015–2019). Each histogram has been assigned a “cloud type”. The histograms have been loosely arranged by pressure and depth.


Figure 3Spatial maps of the relative frequency of occurrence for each MODIS cloud cluster found by k-means clustering. Note that the three Antarctic clusters have been merged into one. The dashed lines represent the boundaries of the mid-latitude, sub-polar and polar regions as defined in Sect. 3. The numbers in each title represent the mean frequency of occurrence for each cloud regime (in time and space) over the SO region.

Along the top row of Figs. 2 and 3, high-level clouds are shown. The thin cirrus cluster is restricted to tropical regions, particularly over the western Pacific and eastern Indian oceans, and is characterised by clouds that are optically thin with very low CTP. Similarly, the cirrus cluster is characterised by very optically thin clouds, at a slightly lower altitude, and is found predominantly in the tropical and mid-latitude regions. Both the cirrus and thin cirrus show some thin, low-level clouds too, likely associated with shallow convective clouds. At a similar height, the convective cluster is more optically thick and more strongly associated with the ITCZ and South Pacific Convergence Zone (SPCZ), although this is not shown by this map projection. Finally, along the top row, the frontal cluster, while similar in altitude to the convective cluster, is more optically thick and is found more predominantly in the mid-latitudes, in particular around westerly storm tracks. We note that convective and frontal clouds can have similar CTP–τ profiles (Williams and Webb2009), and hence their location is also important to consider.

The mid-level cluster shows mid-range optical properties as well as mid-level CTPs and is without one or two clearly dominant bins as found in the other clusters (Fig. 2). This mid-level cluster is most common over the SO, marine equatorial regions and terrestrial regions (in particular on the west coasts of a number of continents). These varied geographic locations represent some significantly different cloud formation processes, raising an important point for this work: while the optical and height characteristics within a cluster may be similar, each cloud's trigger mechanism is not necessarily the same. While further analysis to understand the formation processes for the cloud regimes is possible via meteorological fields such as proximity to cyclone centres or vertical motion or via physical fields such as proximity to topography and would be of interest, such analysis is out of scope for this work.

Along the bottom row, the shallow cumulus cluster is found to have low, optically thin clouds that occur mostly over marine regions in the tropical latitudes and mid-latitudes, away from any large-scale cloud formations such as in the ITCZ/SPCZ (not shown) or stratiform cloud decks. This cluster has little vertical structure. By comparison, the cloud deck cluster is optically thicker and dominant at the eastern boundary of ocean basins, regions of extended stratiform cloud, again with little vertical structure. The marine stratiform cluster is the most optically thick cloud regime, with high CTPs and little vertical structure, and is found predominantly in higher-latitude marine regions, as well as of the very eastern boundaries of the cloud deck regions. Finally, the stratocumulus cluster is characterised by mid-range optical thickness but has lower CTPs than those of the other low-cloud clusters, suggesting a greater vertical extent associated with stratocumulus clouds. This cluster is geographically widespread, although dominant in the polar region of the SO and absent in regions of deep tropical convection.


Using the cluster centres defined by the MODIS joint histograms described above, we fit the ACCESS-AM2 joint histograms to the same 12 cluster definitions (noting that the 3 Antarctic clusters were subsequently merged). Figure 4 shows the difference in frequency of occurrence between ACCESS-AM2 and MODIS. It is immediately clear that ACCESS-AM2 has significant problems with the mid-level and stratocumulus clouds, simulating them too frequently (by 13.4 % and 19.8 %) across all regions. The low-level cloud fields by contrast are all underestimated, including shallow cumulus (−9.9 %), cloud decks (−9.5 %) and marine stratiform clouds (−10.0 %). The higher clouds (thin cirrus, cirrus, convective and frontal) are simulated comparatively well, although also with slight underestimation of frequency. Interestingly, for the SO, most of the RFO biases are spatially consistent in sign. This result could be interpreted as a consistent bias, with consistent causes, across the latitudes. We will explore this in more detail in the subsequent sections.

Figure 4Spatial maps of the difference in the relative frequency of occurrence for ACCESS-AM2 – MODIS for each cloud cluster. The dashed lines represent the boundaries of the mid-latitude, sub-polar and polar regions as defined in Sect. 3. The numbers in each title represent the mean frequency of occurrence for each cloud regime (in time and space) over the SO region.

5 Understanding the biases

5.1 Bias decomposition

We can now use the biases in the frequency of occurrence of the cloud regimes to help gain a better understanding of the biases in radiation using the traditional decomposition method described in Sect. 2.5. Note that from this point on, we only consider the broad SO region and the three sub-regions defined within. Figure 5 shows the cumulative components of the SWCRETOA biases over each region, for each cloud regime. The horizontal bar represents the total bias. Similarly to previous studies, including Bodas-Salcedo et al. (2014), Mason et al. (2015) and Schuddeboom et al. (2018), the largest component of the SWCRETOA bias is due to the errors in the RFO (blue bars). As previously discussed, DJF has the largest SWCRETOA biases, though the dominance of the RFO bias is true for all seasons. For DJF, clear negative RFO errors are found for the mid-level clouds and the stratocumulus clouds, which are both overestimated in frequency. This negative RFO error has a compensating effect, as the mid-level clouds are simulated in the place of other lower-cloud regimes (e.g. marine stratiform, cloud decks and shallow cumulus), which are underestimated in frequency and hence have positive RFO errors. A much smaller component of the bias is made up by the error in the simulated field (pink bars), primarily in DJF, while this is insignificant in the other seasons. Schuddeboom et al. (2018) hypothesised that the dominance of the RFO errors and comparatively smaller field errors, also found here, indicate that the clouds themselves, when simulated with the correct frequency, could in fact be simulated well. This hypothesis will be tested in the next section. The cross term (green bars) appears to contribute to the radiative biases again only in summer and predominantly in the polar region. What this cross term represents is not easy to evaluate; however it is clear that the field biases and RFO biases for this time and season can only explain part of the story.

Figure 5The decomposed mean biases in the SWCRETOA (W m−2) for each cloud regime (thin cirrus, TC; cirrus, Ci; convective, Cv; frontal, Fr; mid-level, ML; stratocumulus, StC; marine stratiform, MS; cloud decks, CD; shallow cumulus, ShC; Antarctic, Ant) over the three regions from left to right mid-latitudes, sub-polar and polar, for each season (DJF, a; MAM, b; JJA, c; SON, d). The sum of the decomposed biases is shown by the horizontal bar, while the terms of the bias decomposition (see Eq. 1) are shown in blue (FrsatΔRFOr), pink (ΔFrRFOrsat) and green (ΔFr⋅ΔRFOr). On the far right of each plot is the mean radiative bias for each region/season as a reference.


We have also applied this bias decomposition technique to other cloud properties, including biases in the LWP, IWP, and CFI and CFL (see Appendix). The results for the LWP, IWP and CFL, for all regions, show large biases across all cloud clusters and decomposed terms. Interestingly, the field error and RFO error are of opposite signs and approximately cancel out, leaving the cross term to make up the total radiative bias. For the CFI, the RFO error was dominant, similarly to the SWCRETOA, though with the field error and cross term being non-negligible, unlike for the SWCRETOA. The importance of the cross term in these results makes the respective biases much more difficult to understand, though it is clear that the decomposed errors in RFO and the fields themselves are playing a large role in the total bias.

The strength in this method is that it is time and space agnostic; i.e. it does not require the two “climates” to be directly comparable as it is using the mean RFOs and field values to make judgements about how each of them contribute to a particular bias. For this reason, this method has been popular in the literature, whose authors have often used CMIP simulations as the basis of their work. However, a limitation of this method is that it cannot compare like-for-like instances in cloud properties to gain a better understanding of what particular conditions are leading to large SWCRETOA biases. This work suggests that for all seasons and all regions, a negative bias occurs because the frequency of mid-level and stratocumulus clouds are overestimated, compensating for the underestimation of the lower-level clouds. We ask at what cost, specifically, were the mid-level and stratocumulus clouds overestimated? Is the compensating error due to the model predicting clouds that are too frequent, optically thick, high or with too much liquid? Applying the same technique (the bias decomposition) to such cloud fields was unable to satisfactorily give us these answers. Additionally, we would like to know whether the model ever actually does a good job, with the correct cloud type and a small (or wishfully no) radiative bias.

5.2 Biases when clusters are correctly or incorrectly simulated

One strength of comparing a daily, nudged simulation to daily MODIS fields is the ability to make direct comparisons in time and space. The synoptic meteorology is considered to be the same in the model and the observed conditions, due to the nudging of the model. We therefore expect that the model microphysics, if accurate, would generate the same cloud types that the large-scale dynamics prescribes. With this assumption, we are able to isolate instances (exact points in time and space) where the model simulates the same cloud type as MODIS, which we define as “correctly” simulating the cloud type. Similarly, we can also define the instances where the model simulates a different cloud type, which we consider an “incorrect” cloud type assignment by the model. We can then use these instances to generate a statistical understanding of the conditions during these times and places. We demonstrate these definitions in Fig. 6a and b.

Figure 6For each grid box (a) and for every point in time (b), the cloud type has been determined via k-means clustering for the MODIS product and subsequently fitted to the ACCESS-AM2 output. The assignment of cloud type is shown for a 1-month period in (b), where we can see where ACCESS-AM2 correctly (shown by a tick) or incorrectly (shown by a cross) simulates the same cloud type as the MODIS product. Using this information, we calculate the seasonal mean radiation or cloud property bias (c) over the instances where ACCESS correctly (filled bars) or incorrectly (outlined bars) simulated the cloud type for the three regions of the Southern Ocean. As an example, (c) shows the mean radiation biases for polar mid-level clouds.

In the previous section, we saw that the model generally tends to simulate the cloud type RFOs incorrectly and that this bias in RFOs dominates the radiative bias. This result demonstrates the limit of the decomposition method as we are not able to gain any further information about how the microphysical biases in cloud types affect the radiative properties or about the nuances of which cloud types are being simulated in place of others. In this section, we take advantage of the model nudging and explore in detail the instances when ACCESS-AM2 correctly and incorrectly simulates the MODIS-identified cloud regimes and begin to understand the associated radiative and cloud biases in much greater detail than has previously been achieved.

Each panel of Fig. 7 shows, for each MODIS cloud type (y axis), the percent of time that each cloud type is assigned by ACCESS-AM2 (x axis). The total number of instances that cloud type is observed by MODIS for each region is shown on the right of each panel. If ACCESS-AM2 simulated every cloud type to be the same as what the MODIS product did, we would expect a diagonal line through each panel in Fig. 7 of 100 %, with zeros elsewhere. What is shown, however, reflects clearly the biases of the RFOs, where ACCESS incorrectly assigns the stratocumulus and mid-level clusters over most other clusters. For the low-level cloud regimes (shallow cumulus, cloud decks and marine stratiform), we see that a large number (between 48 % and 59 %) of points have been wrongly assigned to the stratocumulus cluster, while the high-level clouds (convective and frontal, as well as the less important cirrus and thin cirrus) tend to be assigned as the mid-level cluster (between 38 % and 67 %). Only 12 %, 11 % and 5 % of the marine stratiform points have been correctly assigned by ACCESS-AM2 for each region (mid-latitudes, sub-polar and polar). This statistic is worse for some other less important SO cloud types. In contrast, 63 %, 68 % and 68 % of mid-level clouds and 60 %, 66 % and 56 % of stratocumulus clouds are simulated in the correct time and place for the three SO regions (mid-latitudes, sub-polar and polar).

Figure 7Confusion matrices for each region, (a) mid-latitudes, (b) sub-polar and (c) polar, showing the MODIS cluster assignment on the y axis and the ACCESS-AM2 cluster assignment on the x axis (thin cirrus, TC; cirrus, Ci; convective, Cv; frontal, Fr; mid-level, ML; stratocumulus, StC; marine stratiform, MS; cloud decks, CD; shallow cumulus, ShC; Antarctic, Ant). The colours (and inset text) represent the proportion of points assigned to each MODIS cluster (e.g. sum to 100 along the x axis). The text on the right hand side indicates the number of points (in time and space) that make up these statistics for each MODIS cluster.


Having identified exactly when ACCESS-AM2 correctly or incorrectly simulates cloud regimes, we can now begin to evaluate what the biases in radiation and other cloud fields are in these instances. Figures 8 (mid-latitude), 9 (sub-polar) and 10 (polar) show seasonal mean biases when the ACCESS-AM2 correctly simulates each cloud cluster (coloured bars) or incorrectly simulates each cloud cluster (transparent bars with black outlines). An example of how one of these subsets (polar mid-level clouds) is derived is shown in Fig. 6. We note that these results are not weighted by the frequency of occurrence, unlike in the previous figure, as we are interested in understanding the microphysical properties of each cloud type, even if they occur infrequently. Figure 7 can provide an indication of the frequency of each cloud type and how often it is correctly or incorrectly identified. Furthermore, we emphasise that these plots show the impact on the radiation and cloud fields from the perspective of the right cluster being assigned wrongly, unlike in Fig. 5, which shows the radiative biases from the perspective of the accumulative wrong clusters. For this reason, some of the values between the figures have opposite signs but support the same overall finding.

Figure 8(a) Mid-latitude SWCRETOA biases (W m−2) averaged over each cloud cluster (thin cirrus, TC; cirrus, Ci; convective, Cv; frontal, Fr; mid-level, ML; stratocumulus, StC; marine stratiform, MS; cloud decks, CD; shallow cumulus, ShC; Antarctic, Ant), seasonally (DJF, dark blue; MAM, green; JJA, pink; SON, light blue), for instances when the MODIS clusters are correctly assigned (coloured bars) and incorrectly assigned (black outline) by ACCESS-AM2 for the mid-latitude region. Panels (b)(e) show the same as plot (a) but for the CFL (fraction), CFI (fraction), LWP (g m−2) and IWP (g m−2) biases respectively.


5.2.1 The mid-latitude region

Figure 8a shows the mid-latitude region's mean SWCRETOA biases when ACCESS-AM2 correctly and incorrectly simulates the cloud types. For the mid-level and marine stratiform clouds, the SWCRETOA biases are larger when the cloud types are assigned incorrectly (outlined bars) than when they have been correctly simulated by ACCESS-AM2 (coloured bars). However, when these cloud types are correctly assigned (which happens 63 % and 12 % of the time respectively), the SWCRETOA biases are also non-negligible in most seasons. For the mid-level clouds, the SWCRETOA biases are smaller in magnitude when the clusters are correctly identified by the model. Interestingly, this is not the case for the CFL and LWPs, which both have larger biases when the clusters agree, suggesting that the radiative effects associated with too few liquid water clouds are partially compensated for by them being too optically thick. This indicates that the lower SWCRETOA bias when the mid-level clouds are correctly simulated may be occurring for the wrong reasons. For the marine stratiform clouds, the CFL is strongly underestimated when it is incorrectly simulated, which is expected to produce positive SWCRETOA biases. The CFI is comparatively well simulated. Interestingly, the LWPs and IWPs seem to be relatively well captured for marine stratiform clouds. This suggests that the SWCRETOA bias may be predominantly driven by an underestimated CFL, while the amount of water in them is somewhat correct.

The stratocumulus cluster shows almost uniformly equal biases in all fields when the cluster assignments are correct and incorrect. The positive radiative biases are predominantly associated with too few liquid water clouds, similarly to those of the mid-level clouds. Considering that stratocumulus clouds are simulated correctly 60 % of the time and are otherwise simulated as mid-level clouds 28 % of the time, these biases make sense. These results suggest that we cannot focus solely on improving the lowest-level clouds but must also improve the mid-level cloud representation.

Cloud decks present a notable exception of where the radiative biases are larger and of opposite sign when the cluster is correctly identified. Cloud decks are not simulated frequently enough in ACCESS-AM2 (correct only 14 % of the time) and are instead assigned as stratocumulus clouds 54 % of the time. Figure 8 shows a weak negative SWCRETOA bias for the MAM and JJA seasons when the ACCESS-AM2 clusters are incorrect, while the bias in the other seasons is negligible. When cloud decks are correctly assigned, the SWCRETOA bias is positive in all seasons, indicating that for these low-lying clouds, even if we are able to simulate the height and optical thickness properties of the cloud correctly, issues remain. Looking at the cloud fields, most show small biases (compared to the other cloud regimes), with the CFL having the largest negative bias. We suggest that a too low cloud fraction would result in a positive SWCRETOA bias, though how this seemingly minor cloud fraction bias is resulting in the more significant radiative bias is difficult to say. Regardless, this result demonstrates the complexity of this issue, where we need to not only get the right clouds for the large-scale conditions but also ensure the underlying microphysical properties are correctly simulated.

Shallow cumulus clouds show a small positive SWCRETOA bias when the ACCESS-AM2 correctly simulates the cluster, which is also reflected in the cloud fraction and water paths, indicating that the model performs well in these instances. However, Fig. 7a shows that the model infrequently gets this cloud type right (only 2 % of the time), instead simulating it as stratocumulus 48 % of the time in the mid-latitudes. The negative SWCRETOA bias found in all seasons when shallow cumulus is incorrectly simulated is likely a reflection of the higher and more optically thick clouds found in the stratocumulus cloud type, indicated by overestimated LWPs and to a lesser degree, overestimated IWPs. This result also agrees well with our previous plot (Fig. 5), where too many stratocumulus clouds result in a compensating negative bias. A similar result can be observed for the less frequent cirrus clouds, which are too often simulated as lower, more optically thick stratocumulus or mid-level clouds (22 % and 34 % of the time) and cloud decks in winter and autumn.

The frontal cloud type has a large positive SWCRETOA bias when the model does not correctly simulate the MODIS cluster, which occurs 64 % of the time in the mid-latitude region. The frontal clouds are a relatively high type of cloud (see Fig. 2) and in these lower latitudes have a greater likelihood of being ice clouds. This bias, found across all seasons, could be due to a too high LWP, though the CFL is relatively well captured. For the frontal ice clouds, too few clouds are simulated with a negative IWP bias. In the case of correctly assigned frontal clouds, we can see that the SWCRETOA bias is small, despite a too low CFI and too high LWP. This result may again indicate that while the ACCESS-AM2 model is simulating correct SWCRETOA, it may be doing so for the wrong reasons. A similar result, i.e. a strongly underestimated CFI causing a positive radiative bias, is found for the convective clouds, in instances when they both disagree and, to a greater extent, agree. This result may indicate that for these higher clouds, getting the cloud fractions right, with the right phase partitioning, may go a long way to reducing the radiative biases. The thin cirrus clouds are shown to have large biases in the spring (SON) when correctly simulated for all fields. However, we note that these cloud types occur infrequently (see Fig. 7a) and are only correctly simulated 1 % of the time, and hence, we do not draw any conclusions from this result.

Figure 9The same as Fig. 8 but for the sub-polar region.


5.2.2 The sub-polar region

The sub-polar total radiative bias, shown in the far right column of Fig. 5, is in general quite small. If we consider just the zonal means, the model appears to perform well. However, after analysing Fig. 1, we know that large spatial variability exists within this region that leads to a small overall bias. In this section we explore if some of that variability can be attributed to particular cloud types.

The mid-level and stratocumulus clouds are the two most accurately simulated cloud types (68 % and 66 % of the time). The cloud radiative biases in these cases are relatively low, implying that the model may be doing a good job. We note that the biases are fairly similar for when they disagree too, and that is likely due to each one being predominately assigned as the other when they are incorrectly simulated. When the stratocumulus clouds are correctly simulated, examination of the cloud fields indicates that despite the small radiative biases, the apparent model skill may be misleading. In these cases, the CFL and LWP are underestimated and the CFI and IWP are overestimated across most seasons. We suggest that too few optically thinner (derived from a too little LWP) liquid clouds, causing a strong positive SWCRETOA bias, may be partially compensated for by too many optically thicker (a too high IWP) ice clouds. A similar result is found for the mid-level clouds, though with smaller biases in the CFL and IWP and negligible biases in the CFI and LWP.

We can see that the majority of the negative bias in the sub-polar region is driven by shallow cumulus clouds that are incorrectly simulated. Shallow cumulus clouds are optically thin and low lying (see Fig. 2). They are only correctly simulated in this region 1 % of the time. Instead they are simulated as stratocumulus clouds 57 % of the time; these clouds are higher and more optically thick, both of which would lead to more sunlight being reflected back out to space. Again, this agrees well with the findings of Fig. 5, where negative radiative biases are found to be due to too many stratocumulus and mid-level clouds. Looking at the shallow cumulus cloud properties, we can see that when these clouds are incorrectly defined, we have too few liquid clouds, too many ice clouds, and too high LWPs and IWPs. When shallow cumulus clouds are correctly simulated, the majority of these biases are much smaller and the SWCRETOA bias is weakly positive, indicating again that the model is doing a better job.

The cloud deck clusters, when incorrectly assigned, also appear to contribute towards a negative SWCRETOA bias. Cloud decks are only correctly simulated 9 % of the time and are also considered by the ACCESS-AM2 model to be stratocumulus clouds 57 % of the time instead. Examining the biases in the cloud fields for these instances indicates a similar process to that for the shallow cumulus clouds for the SWCRETOA bias to occur. For the sake of brevity, we will not discuss the remaining cloud types for this region. The majority of them (bar some of the less frequently occurring types) are generally contributing to a positive SWCRETOA bias regardless of whether or not the clusters are correctly identified or not.

5.2.3 The polar region

Finally, we now consider the polar region in Fig. 10. Here it is clear that the mid-level, stratocumulus and marine stratiform clouds are contributing the most to the positive SWCRETOA bias, whether the clusters are correctly or incorrectly assigned. Few compensating errors are found in cloud types with a meaningful impact on the region (e.g. that occur frequently), adding to the larger magnitude of the SWCRETOA bias in this region. In each of the three aforementioned cloud types, the CFL is strongly underestimated (by over 50 % in most seasons) by the model. The LWP is predominantly underestimated (though, interestingly, less so for summer), and the IWP is overestimated. Each of these cloud biases are expected to produce a positive SWCRETOA bias. The CFI, for the marine stratiform and stratocumulus clouds, is overestimated, while for the mid-level clouds, it is in most cases underestimated. This difference may be due to (in part) the higher CTPs found in these mid-level clouds, meaning that ice formed in the model is done correctly. The cloud biases found in these plots are of similar magnitude throughout the year, while the radiative bias is large in summertime and almost non-existent in winter. This is due to the fact that despite consistently poorly simulated cloud properties, they have much less impact on the radiative balance in winter due to the much smaller amount of solar radiation compared to in the summer. Nevertheless, the fact that even when the cloud types are correctly simulated by the ACCESS-AM2 model, the SWCRETOA bias is similarly positive to, if not more positive than, when the clusters are incorrectly assigned is a cause for concern. These results strongly highlight the issues of cloud phase within the model that exist even if the height and optical depth of the cloud are correctly simulated. Furthermore, these results show that the cloud property biases are not limited to the summer season, when they are able to have a large impact on the SWCRETOA.

Figure 10The same as Fig. 8 but for the polar region.


5.2.4 Summary of findings

There are two broad groups of cloud types driving the SWCRETOA bias apparent from this analysis: the high clouds, including thin cirrus, cirrus, convective and frontal clouds, and middle- to low-level clouds, including the mid-level, stratocumulus, marine stratiform, cloud deck and shallow cumulus clouds. The high clouds consistently, across regions and seasons, contain too few ice clouds (negative CFI) with a too high IWP. The IWP biases become larger at higher latitudes. While these cloud types do not make up a large fraction of clouds over the SO (see Fig. 3) and hence have not been widely studied, it is clear that the different microphysical processes are also contributing to a positive SO SWCRETOA bias.

For the middle- to low-level clouds, too few liquid clouds are found consistently across the regions and seasons, which again, becomes worse at higher latitudes. This contribution to the positive SWCRETOA bias is compounded in the optically thicker cloud types (mid-level, stratocumulus and marine stratiform) by negative biases in the LWP, also increasing in magnitude with higher latitudes. Positive biases in the CFI and IWP are also found for these cloud types, which again are largest in magnitude in the polar region. These positive biases in ice clouds may have a compensating negative effect on the SWCRETOA bias if they are increasing the overall cloud and water path fractions; however, if they are simulated instead of supercooled liquid clouds, they may indeed have the opposite effect. The optically thinner shallow cumulus clouds are found to have a compensating negative effect when the optically thicker clouds are simulated in their place for all regions.

6 Conclusions

Recent evaluation of CMIP6 models has found continuing problems in the SO shortwave cloud radiation effect. While updates to model parameterisations and tuning have occurred since CMIP5, these have resulted in some models having too high climate sensitivity (Zelinka et al.2020), particularly driven by the SO, that may be producing lower SO SWCRETOA biases for the wrong reasons (Schuddeboom and McDonald2021; Gettelman et al.2020). In this work we take a detailed look at the SWCRETOA bias in one model, the ACCESS-AM2 model. By running ACCESS-AM2 with nudging, we are able to make day-for-day comparisons with satellite products (in this case CERES-SYN1deg and MODIS COSP products) for the first time, allowing more in-depth analysis of this problem than what has previously been completed.

In this analysis, we use unsupervised k-means clustering on CTP–τ histograms, a method that has proven useful in numerous previous studies. We use 12 clusters for our analysis, though later merge 3 into 1 “Antarctic” cluster, leaving 10 cloud regimes from which we can begin to understand under what conditions the SWCRETOA bias occurs and the associated cloud properties. In particular, we are able to analyse the SWCRETOA and cloud property biases in instances when the model correctly simulates (e.g. hits) a cloud regime and compare them against instances when the model incorrectly simulates a cloud regime (e.g. misses – assigned as something else) with respect to the MODIS product.

We find that ACCESS-AM2 strongly overestimates the occurrence of stratocumulus and mid-level clouds over the entire SO, in general agreement with what is found for CMIP6 models in Schuddeboom and McDonald (2021). The over-prediction of these two cloud types comes predominantly at the cost of marine stratiform, cloud deck or shallow cumulus clouds (and others), while stratocumulus and mid-level clouds themselves are accurately predicted 60 % and 67 % of the time respectively by the model. In particular, we find shallow cumulus clouds are simulated as the higher and thicker stratocumulus clouds by ACCESS-AM2 50 % of the time, leading to a compensating negative SWCRETOA bias over the entire SO for all seasons. A similar result is found in the sub-polar region for cloud decks in all seasons. Interestingly, when shallow cumulus clouds are correctly simulated by ACCESS-AM2, the radiative biases are much smaller, as are the biases in cloud properties. This result implies that correctly simulating the conditions for shallow cumulus to form may have a beneficial impact on the simulated radiative balance. The same cannot be said for the cloud deck clouds however, where errors in both the SWCRETOA and the cloud properties remain even if they are correctly simulated.

We find that when stratocumulus clouds are correctly simulated by the ACCESS-AM2 model, the SWCRETOA bias in the mid-latitude and sub-polar parts of the SO is relatively small. However, examination of the respective biases in cloud properties indicates that the model is likely producing these smaller radiative biases, which differ by latitude, for the wrong reasons. Concerningly, in the polar-region summertime, stratocumulus clouds have a worse SWCRETOA bias when they are correctly simulated by the ACCESS-AM2 model than when they are incorrectly assigned, indicating that significant issues remain within cloud microphysical properties, in particular the cloud phase, even if the macrophysics are somewhat correct. These findings provide important knowledge to help guide future model development, which must target simulating not only the correct cloud type but also the correct cloud microphysics.

Marine stratiform, stratocumulus and mid-level clouds contribute the most to the polar region's positive SWCRETOA bias. Each of these cloud types is associated with large biases in cloud properties, including too few liquid clouds, a too little LWP and a too high IWP. This is the case for when these cloud types are both correctly and incorrectly assigned. In the case of the marine stratiform and stratocumulus clouds, the CFI is overestimated, while for the mid-level clouds, it is underestimated in all seasons except summer, where the bias is small. These results indicate that the issue of cloud phase within the ACCESS-AM2 model is still causing significant problems for the polar radiative balance. This cloud-phase problem in these three cloud regimes does not appear to be as large an issue for the same clouds in the mid-latitudes or even the sub-polar region. This finding implies that any changes to the model parameterisations must be made with caution, taking into account latitudinal dependencies on things such as temperature, boundary layer coupling or ice-nucleating particle presence.

While we have in this work explored the SWCRETOA biases between ACCESS-AM2 and the CERES-SYN1deg product and the cloud feature and type biases between ACCESS-AM2 and MODIS products, some key caveats must be raised. Firstly, we must note that comparing across two different satellite products may not be ideal given different assumptions and requirements. While this is standard practice in the field, the two products are derived from different platforms with different limitations, pointing towards a greater need for combined “Earth system” satellite products. We note that other satellite products have not been used in this work due to inconsistencies with the time period of focus. Secondly, we must recognise that both satellite products have their own biases. While in this work we regard them as “truth”, this is not necessarily the case. Hinkelman and Marchand (2020), for example, compared CERES and CloudSat radiation fields to in situ observations at Macquarie Island and found a +10 W m−2 bias, which they expect would exacerbate the biases found in model evaluation. As discussed in Sect. 2.2, significant issues remain with the retrievals of Reff and subsequently the water paths. The propagation of these errors through the COSP framework meant that comparisons between the COSP product and MODIS product were found to be very unrealistic, and hence these fields were not used and the raw model field LWP and IWP were used instead, adding uncertainty to these results. Similarly, having to convert between in-cloud and grid-box mean water paths has added another layer of uncertainty to this work. We chose to use the grid-box mean values, but the use of in-cloud values would have changed our results and their interpretation. Whilst the COSP framework and specifically derived MODIS product go a long way to helping model evaluation, more needs to be done to ensure that modellers can use observational products with confidence and minimal transformation. Further work evaluating cloud properties derived from satellite products would be of upmost help to studies such as this.

Gettelman et al. (2020) also noted that the nudging choices made can have large, in some cases detrimental, effects on the ability to simulate cloud fields, in particular cloud phase partitioning and water content, due to the control on temperatures (Zhang et al.2014). While a study such as this could not be performed without the use of nudging, it is possible that the effects found in Gettelman et al. (2020) may be present here. We must also consider the impact of a single-moment cloud scheme used in ACCESS-AM2 (the PC2 scheme). A double-moment cloud scheme that includes prognostic equations for both the cloud droplet size and the number is preferable for a study such as this. Even better would be a cloud scheme that is fully coupled to the aerosol scheme. While in ACCESS-AM2 the cloud droplet number concentration is resolved, ice-nucleating particles are not explicitly resolved. We hope to be able to address both of these issues in the future within ACCESS.

The work presented in this study has provided a relatively qualitative view of the cloud and radiative biases associated with cloud types. Studies such as Bender et al. (2017) have used more quantitative methods to evaluate the role cloud fractions play in determining the radiative balance. Specifically, Bender et al. (2017) examine the distributions of cloud albedo and the associated cloud fraction in CMIP5 models and their linear relationships. In further exploration of the data used in this work, we note that most of the relationships between cloud properties and radiative bias are in fact non-linear, which is also highlighted in this work by the differing relationships observed by latitude/cloud type. Similar results have been noted in Bodas-Salcedo et al. (2016a), where the LWP and radiative biases were not found to be as tightly coupled as expected. We will present findings of a quantitative analysis employing machine learning to understand the role cloud properties play in determining the radiative bias in an upcoming paper.

Furthermore, in this work we have only considered a few cloud properties that impact the radiation budget. Other important factors, such as the number and size of cloud droplets, precipitation phase and amount, or other thermodynamical properties, are likely to impact the absorption and scattering properties of clouds and their lifetime (for example, Mülmenstädt et al.2021). Additionally, we have only considered these properties in isolation, i.e. one field at a time, and not how they are operating together. This will also be addressed in our upcoming machine learning publication.

To summarise, in this work we have found that considerable radiative biases continue to exist within the ACCESS-AM2 model. By analysing this bias with respect to cloud types and their properties, we find that significant issues with respect to the cloud phase remain in the middle- to low-level cloud regimes for the polar region of the SO, regardless of season, and that even if these cloud types are correctly simulated by the model, the large microphysical biases still persist. We also find compensating errors due to the underestimation of shallow cumulus clouds in favour of stratocumulus clouds. Our results show that significant effort must be continued to reduce these SO cloud biases within models.

Appendix A: Additional figures

Figure A1The decomposed mean biases in the CFL (fraction) for each cloud regime (thin cirrus, TC; cirrus, Ci; convective, Cv; frontal, Fr; mid-level, ML; stratocumulus, StC; marine stratiform, MS; cloud decks, CD; shallow cumulus, ShC; Antarctic, Ant) over the three regions from left to right mid-latitudes, sub-polar and polar, for each season (DJF, a; MAM, b; JJA, c; SON, d). The sum of the decomposed biases is shown by the horizontal bar, while the terms of the bias decomposition (see Eq. 1) are shown in blue (FrsatΔRFOr), pink (ΔFrRFOrsat) and green (ΔFr⋅ΔRFOr). On the far right of each plot is the mean radiative bias for each region/season as a reference.


Figure A2The decomposed mean biases in the CFI (fraction) for each cloud regime (thin cirrus, TC; cirrus, Ci; convective, Cv; frontal, Fr; mid-level, ML; stratocumulus, StC; marine stratiform, MS; cloud decks, CD; shallow cumulus, ShC; Antarctic, Ant) over the three regions from left to right mid-latitudes, sub-polar and polar, for each season (DJF, a; MAM, b; JJA, c; SON, d). The sum of the decomposed biases is shown by the horizontal bar, while the terms of the bias decomposition (see Eq. 1) are shown in blue (FrsatΔRFOr), pink (ΔFrRFOrsat) and green (ΔFr⋅ΔRFOr). On the far right of each plot is the mean radiative bias for each region/season as a reference.


Figure A3The decomposed mean biases in the LWP (g m−2) for each cloud regime (thin cirrus, TC; cirrus, Ci; convective, Cv; frontal, Fr; mid-level, ML; stratocumulus, StC; marine stratiform, MS; cloud decks, CD; shallow cumulus, ShC; Antarctic, Ant) over the three regions from left to right mid-latitudes, sub-polar and polar, for each season (DJF, a; MAM, b; JJA, c; SON, d). The sum of the decomposed biases is shown by the horizontal bar, while the terms of the bias decomposition (see Eq. 1) are shown in blue (FrsatΔRFOr), pink (ΔFrRFOrsat) and green (ΔFr⋅ΔRFOr). On the far right of each plot is the mean radiative bias for each region/season as a reference.


Figure A4The decomposed mean biases in the IWP (g m−2) for each cloud regime (thin cirrus, TC; cirrus, Ci; convective, Cv; frontal, Fr; mid-level, ML; stratocumulus, StC; marine stratiform, MS; cloud decks, CD; shallow cumulus, ShC; Antarctic, Ant) over the three regions from left to right mid-latitudes, sub-polar and polar, for each season (DJF, a; MAM, b; JJA, c; SON, d). The sum of the decomposed biases is shown by the horizontal bar, while the terms of the bias decomposition (see Eq. 1) are shown in blue (FrsatΔRFOr), pink (ΔFrRFOrsat) and green (ΔFr⋅ΔRFOr). On the far right of each plot is the mean radiative bias for each region/season as a reference.


Code and data availability

Limited model data and code for this project are available at (Fiddes2022). The model set-up (under licence) can be found by UM users under suite bx400. CERES data can be downloaded from (last access: 25 March 2022) (NASA2022a). MODIS data can be downloaded from (last access: 25 March 2022) (NASA2022b).

Author contributions

SLF completed the model simulations, analysis and writing of this work. AP, MDM, SPA and MTW contributed to the planning, ideas and revisions of this paper. MTW provided guidance on the model set-up.

Competing interests

The contact author has declared that none of the authors has any competing interests.


Publisher’s note: Copernicus Publications remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.


This project received grant funding from the Australian Government as part of the Antarctic Science Collaboration Initiative programme, under the Australian Antarctic Program Partnership, ASCI000002. This research was undertaken with the assistance of resources and services from the National Computational Infrastructure (project jk72), which is supported by the Australian Government. Sonya L. Fiddes would like to thank Scott Wales and the ARC Centre of Excellence for Climate Extremes for their maintenance of virtual environments and code/model support. Sonya L. Fiddes would also like to thank Robert Pincus and Alejandro Bodas-Salcedo for early advice on the use of MODIS COSP products. The authors would like to acknowledge the teams at NASA CERES, NASA Earthdata and the CFMIP project for making the tools and data used in this work available. The contribution of Simon P. Alexander was supported by the Australian Antarctic Science project 4292.

Financial support

This research has been supported by the Australian Government (grant no. ASCI000002).

Review statement

This paper was edited by Susannah Burrows and reviewed by Alex Schuddeboom and one anonymous referee.


Anderberg, M. R.: Cluster Analysis for Applications. A volume in Probability and Mathematical Statistics: A Series of Monographs and Textbooks, vol. 19, Academic Press, New York,, 1973. a

Bender, F. A., Engström, A., Wood, R., and Charlson, R. J.: Evaluation of hemispheric asymmetries in marine cloud radiative properties, J. Climate, 30, 4131–4147,, 2017. a, b

Bi, D., Dix, M., Marsland, S., O'Farrell, S., Sullivan, A., Bodman, R., Law, R., Harman, I., Srbinovsky, J., Rashid, H. A., Dobrohotoff, P., Mackallah, C., Yan, H., Hirst, A., Savita, A., Dias, F. B., Woodhouse, M., Fiedler, R., and Heerdegen, A.: Configuration and spin-up of ACCESS-CM2, the new generation Australian Community Climate and Earth System Simulator Coupled Model, Journal of Southern Hemisphere Earth Systems Science, 70, 225–251,, 2020. a, b, c

Bodas-Salcedo, A., Webb, M. J., Bony, S., Chepfer, H., Dufresne, J. L., Klein, S. A., Zhang, Y., Marchand, R., Haynes, J. M., Pincus, R., and John, V. O.: COSP: Satellite simulation software for model assessment, B. Am. Meteorol. Soc., 92, 1023–1043,, 2011. a, b

Bodas-Salcedo, A., Williams, K. D., Field, P. R., and Lock, A. P.: The surface downwelling solar radiation surplus over the southern ocean in the met office model: The role of midlatitude cyclone clouds, J. Climate, 25, 7467–7486,, 2012. a

Bodas-Salcedo, A., Williams, K. D., Ringer, M. A., Beau, I., Cole, J. N. S., Dufresne, J. L., Koshiro, T., Stevens, B., Wang, Z., and Yokohata, T.: Origins of the solar radiation biases over the Southern Ocean in CFMIP2 models, J. Climate, 27, 41–56,, 2014. a, b, c

Bodas-Salcedo, A., Andrews, T., Karmalkar, A. V., and Ringer, M. A.: Cloud liquid water path and radiative feedbacks over the Southern Ocean, Geophys. Res. Lett., 43, 938–10,, 2016a. a

Bodas-Salcedo, A., Hill, P. G., Furtado, K., Williams, K. D., Field, P. R., Manners, J. C., Hyder, P., and Kato, S.: Large contribution of supercooled liquid clouds to the solar radiation budget of the Southern Ocean, J. Climate, 29, 4213–4228,, 2016b. a, b

Bodman, R. W., Karoly, D. J., Dix, M. R., Harman, I. N., Srbinovsky, J., Dobrohotoff, P. B., and Mackallah, C.: Evaluation of CMIP6 AMIP climate simulations with the ACCESS-AM2 model, Journal of Southern Hemisphere Earth Systems Science, 70, 166–179,, 2020. a, b

Bony, S. and Dufresne, J. L.: Marine boundary layer clouds at the heart of tropical cloud feedback uncertainties in climate models, Geophys. Res. Lett., 32, L20806,, 2005. a

Calinski, T. and Harabasz, J.: A dendrite method for cluster analysis, Communications in Statistics, 3, 1–27,, 1974. a

Cho, N., Tan, J., and Oreopoulos, L.: Classifying planetary cloudiness with an updated set of modis cloud regimes, J. Appl. Meteorol. Clim., 60, 981–997,, 2021. a

Chubb, T. H., Jensen, J. B., Siems, S. T., and Manton, M. J.: In situ observations of supercooled liquid clouds over the Southern Ocean during the HIAPER Pole-to-Pole Observation campaigns, Geophys. Res. Lett., 40, 5280–5285,, 2013. a

Davies, D. L. and Bouldin, D. W.: A Cluster Separation Measure, IEEE T. Pattern Anal., PAMI-1, 224–227,, 1979. a

Doelling, D. R., Loeb, N. G., Keyes, D. F., Nordeen, M. L., Morstad, D., Nguyen, C., Wielicki, B. A., Young, D. F., and Sun, M.: Geostationary enhanced temporal interpolation for ceres flux products, J. Atmos. Ocean. Tech., 30, 1072–1090,, 2013. a

Doelling, D. R., Sun, M., Nguyen, L. T., Nordeen, M. L., Haney, C. O., Keyes, D. F., and Mlynczak, P. E.: Advances in geostationary-derived longwave fluxes for the CERES synoptic (SYN1deg) product, J. Atmos. Ocean. Tech., 33, 503–521,, 2016. a

Edwards, J. M. and Slingo, A.: Studies with a flexible new radiation code. I: Choosing a configuration for a large-scale model, Q. J. Roy. Meteor. Soc., 122, 689–719,, 1996. a

Eyring, V., Bony, S., Meehl, G. A., Senior, C. A., Stevens, B., Stouffer, R. J., and Taylor, K. E.: Overview of the Coupled Model Intercomparison Project Phase 6 (CMIP6) experimental design and organization, Geosci. Model Dev., 9, 1937–1958,, 2016. a, b

Feng, L., Smith, S. J., Braun, C., Crippa, M., Gidden, M. J., Hoesly, R., Klimont, Z., van Marle, M., van den Berg, M., and van der Werf, G. R.: The generation of gridded emissions data for CMIP6, Geosci. Model Dev., 13, 461–482,, 2020. a

Fiddes, S. L.: ACCESS-AM2 Southern Ocean cloud and radiation data for k-means clustering and analysis, Zenodo [data set],, 2022. a

Fiddes, S. L., Woodhouse, M. T., Nicholls, Z., Lane, T. P., and Schofield, R.: Cloud, precipitation and radiation responses to large perturbations in global dimethyl sulfide, Atmos. Chem. Phys., 18, 10177–10198,, 2018. a, b

Field, P. R. and Wood, R.: Precipitation and cloud structure in midlatitude cyclones, J. Climate, 20, 233–254,, 2007. a, b, c

Frey, W. R., Maroon, E. A., Pendergrass, A. G., and Kay, J. E.: Do Southern Ocean Cloud Feedbacks Matter for 21st Century Warming?, Geophys. Res. Lett., 44, 447–12,, 2017. a

Fricko, O., Havlik, P., Rogelj, J., Klimont, Z., Gusti, M., Johnson, N., Kolp, P., Strubegger, M., Valin, H., Amann, M., Ermolieva, T., Forsell, N., Herrero, M., Heyes, C., Kindermann, G., Krey, V., McCollum, D. L., Obersteiner, M., Pachauri, S., Rao, S., Schmid, E., Schoepp, W., and Riahi, K.: The marker quantification of the Shared Socioeconomic Pathway 2: A middle-of-the-road scenario for the 21st century, Global Environmental Change, 42, 251–267,, 2017. a

Gettelman, A., Bardeen, C. G., McCluskey, C. S., Järvinen, E., Stith, J., Bretherton, C., McFarquhar, G., Twohy, C., D'Alessandro, J., and Wu, W.: Simulating Observations of Southern Ocean Clouds and Implications for Climate, J. Geophys. Res.-Atmos., 125, e2020JD032619,, 2020. a, b, c, d

Gong, S. L.: A parameterization of sea-salt aerosol source function for sub- and super-micron particles, Global Biogeochem. Cy., 17, 1097,, 2003. a

Gregory, D. and Rowntree, P. R.: A Mass Flux Convection Scheme with Representation of Cloud Ensemble Characteristics and Stability-Dependent Closure, Mon. Weather Rev., 118, 1483–1506,<1483:AMFCSW>2.0.CO;2, 1990. a

Hawcroft, M., Haywood, J. M., Collins, M., Jones, A., Jones, A. C., and Stephens, G.: Southern Ocean albedo, inter-hemispheric energy transports and the double ITCZ: global impacts of biases in a coupled model, Clim. Dynam., 48, 2279–2295,, 2017. a, b

Haynes, J. M., Jakob, C., Rossow, W. B., Tselioudis, G., and Brown, J. B.: Major characteristics of Southern Ocean cloud regimes and their effects on the energy budget, J. Climate, 24, 5061–5080,, 2011. a, b

Hersbach, H., Bell, B., Berrisford, P., Hirahara, S., Horányi, A., Muñoz‐Sabater, J., Nicolas, J., Peubey, C., Radu, R., Schepers, D., Simmons, A., Soci, C., Abdalla, S., Abellan, X., Balsamo, G., Bechtold, P., Biavati, G., Bidlot, J., Bonavita, M., Chiara, G., Dahlgren, P., Dee, D., Diamantakis, M., Dragani, R., Flemming, J., Forbes, R., Fuentes, M., Geer, A., Haimberger, L., Healy, S., Hogan, R. J., Hólm, E., Janisková, M., Keeley, S., Laloyaux, P., Lopez, P., Lupu, C., Radnoti, G., Rosnay, P., Rozum, I., Vamborg, F., Villaume, S., and Thépaut, J.: The ERA5 global reanalysis, Q. J. Roy. Meteor. Soc., 146, 1999–2049,, 2020. a

Hinkelman, L. M. and Marchand, R.: Evaluation of CERES and CloudSat Surface Radiative Fluxes Over Macquarie Island, the Southern Ocean, Earth and Space Science, 7, e2020EA001224,, 2020. a

Hoesly, R. M., Smith, S. J., Feng, L., Klimont, Z., Janssens-Maenhout, G., Pitkanen, T., Seibert, J. J., Vu, L., Andres, R. J., Bolt, R. M., Bond, T. C., Dawidowski, L., Kholod, N., Kurokawa, J.-I., Li, M., Liu, L., Lu, Z., Moura, M. C. P., O'Rourke, P. R., and Zhang, Q.: Historical (1750–2014) anthropogenic emissions of reactive gases and aerosols from the Community Emissions Data System (CEDS), Geosci. Model Dev., 11, 369–408,, 2018. a

Holz, R. E., Platnick, S., Meyer, K., Vaughan, M., Heidinger, A., Yang, P., Wind, G., Dutcher, S., Ackerman, S., Amarasinghe, N., Nagle, F., and Wang, C.: Resolving ice cloud optical thickness biases between CALIOP and MODIS using infrared retrievals, Atmos. Chem. Phys., 16, 5075–5090,, 2016. a

Huang, Y., Siems, S. T., Manton, M. J., Protat, A., and Delanoë, J.: A study on the low-altitude clouds over the Southern Ocean using the DARDAR-MASK, J. Geophys. Res.-Atmos., 117, D18204,, 2012. a

Hubanks, P., Pincus, R., Platnick, S., and Meyer, K.: Level-3 COSP Cloud Properties (MCD06COSP_L3) Combined Terra & Aqua MODIS Global Gridded Product for Climate Studies User Guide, Tech. Rep., NASA, (last access: 20 July 2021), 2020. a, b

Humphries, R. S., Keywood, M. D., Gribben, S., McRobert, I. M., Ward, J. P., Selleck, P., Taylor, S., Harnwell, J., Flynn, C., Kulkarni, G. R., Mace, G. G., Protat, A., Alexander, S. P., and McFarquhar, G.: Southern Ocean latitudinal gradients of cloud condensation nuclei, Atmos. Chem. Phys., 21, 12757–12782,, 2021. a

Hurrell, J. W., Hack, J. J., Shea, D., Caron, J. M., and Rosinski, J.: A new sea surface temperature and sea ice boundary dataset for the community atmosphere model, J. Climate, 21, 5145–5153,, 2008. a, b

Hwang, Y. T. and Frierson, D. M.: Link between the double-intertropical convergence zone problem and cloud biases over the southern ocean, P. Natl. Acad. Sci. USA, 110, 4935–4940,, 2013. a

Hyder, P., Edwards, J. M., Allan, R. P., Hewitt, H. T., Bracegirdle, T. J., Gregory, J. M., Wood, R. A., Meijers, A. J., Mulcahy, J., Field, P., Furtado, K., Bodas-Salcedo, A., Williams, K. D., Copsey, D., Josey, S. A., Liu, C., Roberts, C. D., Sanchez, C., Ridley, J., Thorpe, L., Hardiman, S. C., Mayer, M., Berry, D. I., and Belcher, S. E.: Critical Southern Ocean climate model biases traced to atmospheric model cloud errors, Nat. Commun., 9, 3625,, 2018. a

Jakob, C. and Tselioudis, G.: Objective identification of cloud regimes in the Tropical Western Pacific, Geophys. Res. Lett., 30, 2082,, 2003. a

Kay, J. E., Wall, C., Yettella, V., Medeiros, B., Hannay, C., Caldwell, P., and Bitz, C.: Global climate impacts of fixing the Southern Ocean shortwave radiation bias in the Community Earth System Model (CESM), J. Climate, 29, 4617–4636,, 2016. a

King, M. D., Menzel, W. P., Kaufman, Y. J., Tanré, D., Gao, B. C., Platnick, S., Ackerman, S. A., Remer, L. A., Pincus, R., and Hubanks, P. A.: Cloud and aerosol properties, precipitable water, and profiles of temperature and water vapor from MODIS, IEEE T. Geosci. Remote, 41, 442–456,, 2003. a

Kuma, P., McDonald, A. J., Morgenstern, O., Alexander, S. P., Cassano, J. J., Garrett, S., Halla, J., Hartery, S., Harvey, M. J., Parsons, S., Plank, G., Varma, V., and Williams, J.: Evaluation of Southern Ocean cloud in the HadGEM3 general circulation model and MERRA-2 reanalysis using ship-based observations, Atmos. Chem. Phys., 20, 6607–6630,, 2020. a, b, c

Lana, A., Bell, T. G., Simó, R., Vallina, S. M., Ballabrera-Poy, J., Kettle, A. J., Dachs, J., Bopp, L., Saltzman, E. S., Stefels, J., Johnson, J. E., and Liss, P. S.: An updated climatology of surface dimethlysulfide concentrations and emission fluxes in the global ocean, Global Biogeochem. Cy., 25, GB1004,, 2011. a

Leinonen, J., Lebsock, M. D., Oreopoulos, L., and Cho, N.: Interregional differences in MODIS-derived cloud regimes, J. Geophys. Res., 121, 11648–11665,, 2016. a

Liss, P. S. and Merlivat, L.: Air-Sea Gas Exchange Rates: Introduction and Synthesis, in: The Role of Air-Sea Exchange in Geochemical Cycling, edited by: Buat-Ménard, P., Springer Netherlands, Dordrecht, 113–127,, 1986. a

Listowski, C., Delanoë, J., Kirchgaessner, A., Lachlan-Cope, T., and King, J.: Antarctic clouds, supercooled liquid water and mixed phase, investigated with DARDAR: geographical and seasonal variations, Atmos. Chem. Phys., 19, 6771–6808,, 2019. a

Mace, G. G. and Protat, A.: Clouds over the Southern Ocean as observed from the R/V Investigator during CAPRICORN. Part I: Cloud occurrence and phase partitioning, J. Appl. Meteorol. Clim., 57, 1783–1803,, 2018. a

Mace, G. G., Protat, A., Humphries, R. S., Alexander, S. P., McRobert, I. M., Ward, J., Selleck, P., Keywood, M., and McFarquhar, G. M.: Southern Ocean Cloud Properties Derived From CAPRICORN and MARCUS Data, J. Geophys. Res.-Atmos., 126, e2020JD033368,, 2021. a

Mann, G. W., Carslaw, K. S., Spracklen, D. V., Ridley, D. A., Manktelow, P. T., Chipperfield, M. P., Pickering, S. J., and Johnson, C. E.: Description and evaluation of GLOMAP-mode: a modal global aerosol microphysics model for the UKCA composition-climate model, Geosci. Model Dev., 3, 519–551,, 2010. a

Mann, G. W., Carslaw, K. S., Ridley, D. A., Spracklen, D. V., Pringle, K. J., Merikanto, J., Korhonen, H., Schwarz, J. P., Lee, L. A., Manktelow, P. T., Woodhouse, M. T., Schmidt, A., Breider, T. J., Emmerson, K. M., Reddington, C. L., Chipperfield, M. P., and Pickering, S. J.: Intercomparison of modal and sectional aerosol microphysics representations within the same 3-D global chemical transport model, Atmos. Chem. Phys., 12, 4449–4476,, 2012. a

Marchant, B., Platnick, S., Meyer, K., Arnold, G. T., and Riedi, J.: MODIS Collection 6 shortwave-derived cloud phase classification algorithm and comparisons with CALIOP, Atmos. Meas. Tech., 9, 1587–1599,, 2016. a, b

Mason, S., Fletcher, J. K., Haynes, J. M., Franklin, C., Protat, A., and Jakob, C.: A hybrid cloud regime methodology used to evaluate Southern Ocean cloud and shortwave radiation errors in ACCESS, J. Climate, 28, 6001–6018,, 2015. a, b, c, d, e, f

McDonald, A. J., Cassano, J. J., Jolly, B., Parsons, S., and Schuddeboom, A.: An automated satellite cloud classification scheme using self-organizing maps: Alternative ISCCP weather states, J. Geophys. Res., 121, 13009–13030,, 2016. a

McFarquhar, G. M., Bretherton, C. S., Marchand, R., Protat, A., DeMott, P. J., Alexander, S. P., Roberts, G. C., Twohy, C. H., Toohey, D., Siems, S., Huang, Y., Wood, R., Rauber, R. M., Lasher-Trapp, S., Jensen, J., Stith, J. L., Mace, J., Um, J., Järvinen, E., Schnaiter, M., Gettelman, A., Sanchez, K. J., McCluskey, C. S., Russell, L. M., McCoy, I. L., Atlas, R. L., Bardeen, C. G., Moore, K. A., Hill, T. C. J., Humphries, R. S., Keywood, M. D., Ristovski, Z., Cravigan, L., Schofield, R., Fairall, C., Mallet, M. D., Kreidenweis, S. M., Rainwater, B., D’Alessandro, J., Wang, Y., Wu, W., Saliba, G., Levin, E. J. T., Ding, S., Lang, F., Truong, S. C. H., Wolff, C., Haggerty, J., Harvey, M. J., Klekociuk, A. R., and McDonald, A.: Observations of Clouds, Aerosols, Precipitation, and Surface Radiation over the Southern Ocean: An Overview of CAPRICORN, MARCUS, MICRE, and SOCRATES, B. Am. Meteorol. Soc., 102, E894–E928,, 2021. a, b

Min, Q., Joseph, E., Lin, Y., Min, L., Yin, B., Daum, P. H., Kleinman, L. I., Wang, J., and Lee, Y.-N.: Comparison of MODIS cloud microphysical properties with in-situ measurements over the Southeast Pacific, Atmos. Chem. Phys., 12, 11261–11273,, 2012. a, b

Mulcahy, J. P., Jones, C., Sellar, A., Johnson, B., Boutle, I. A., Jones, A., Andrews, T., Rumbold, S. T., Mollard, J., Bellouin, N., Johnson, C. E., Williams, K. D., Grosvenor, D. P., and McCoy, D. T.: Improved Aerosol Processes and Effective Radiative Forcing in HadGEM3 and UKESM1, J. Adv. Model. Earth Sy., 10, 2786–2805,, 2018. a

Mülmenstädt, J., Salzmann, M., Kay, J. E., Zelinka, M. D., Ma, P.-L., Nam, C., Kretzschmar, J., Hörnig, S., and Quaas, J.: An underestimated negative cloud feedback from cloud lifetime changes, Nat. Clim. Change, 11, 508–513,, 2021. a

NASA: CERES Data Products, NASA [data set],, last access: 25 March 2022a. a

NASA: LAADS DAAC, NASA [data set],, last access: 25 March 2022b. a

Noble, S. R. and Hudson, J. G.: MODIS comparisons with northeastern Pacific in situ stratocumulus microphysics, J. Geophys. Res.-Atmos., 120, 8332–8344,, 2015. a, b, c

Oreopoulos, L., Cho, N., Lee, D., Kato, S., and Huffman, G. J.: An examination of the nature of global MODIS cloud regimes, J. Geophys. Res.-Atmos., 119, 8362–8383,, 2014. a

Oreopoulos, L., Cho, N., Lee, D., and Kato, S.: Radiative effects of global MODIS cloud regimes, J. Geophys. Res.-Atmos., 121, 2299–2317,, 2016. a, b

Painemal, D. and Zuidema, P.: Assessment of MODIS cloud effective radius and optical thickness retrievals over the Southeast Pacific with VOCALS-REx in situ measurements, J. Geophys. Res.-Atmos., 116, D24206,, 2011. a

Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., and Duchesnay, E.: Scikit-learn: Machine Learning in Python, J. Mach. Learn. Res., 12, 12, 2825–2830, 2011. a

Pincus, R., Platnick, S., Ackerman, S. A., Hemler, R. S., and Patrick Hofmann, R. J.: Reconciling simulated and observed views of clouds: MODIS, ISCCP, and the limits of instrument simulators, J. Climate, 25, 4699–4720,, 2012. a

Platnick, S., Meyer, K. G., King, M. D., Wind, G., Amarasinghe, N., Marchant, B., Arnold, G. T., Zhang, Z., Hubanks, P. A., Holz, R. E., Yang, P., Ridgway, W. L., and Riedi, J.: The MODIS Cloud Optical and Microphysical Products: Collection 6 Updates and Examples From Terra and Aqua, IEEE T. Geosci. Remote, 55, 502–525,, 2017. a, b, c, d, e

Protat, A., Schulz, E., Rikus, L., Sun, Z., Xiao, Y., and Keywood, M. D.: Shipborne observations of the radiative effect of Southern Ocean clouds, J. Geophys. Res.-Atmos., 122, 318–328,, 2017. a

Saponaro, G., Sporre, M. K., Neubauer, D., Kokkola, H., Kolmonen, P., Sogacheva, L., Arola, A., de Leeuw, G., Karset, I. H. H., Laaksonen, A., and Lohmann, U.: Evaluation of aerosol and cloud properties in three climate models using MODIS observations and its corresponding COSP simulator, as well as their application in aerosol–cloud interactions, Atmos. Chem. Phys., 20, 1607–1626,, 2020. a

Schuddeboom, A., McDonald, A. J., Morgenstern, O., Harvey, M., and Parsons, S.: Regional Regime-Based Evaluation of Present-Day General Circulation Model Cloud Simulations Using Self-Organizing Maps, J. Geophys. Res.-Atmos., 123, 4259–4272,, 2018. a, b, c, d, e, f, g, h, i

Schuddeboom, A. J. and McDonald, A. J.: The Southern Ocean Radiative Bias, Cloud Compensating Errors, and Equilibrium Climate Sensitivity in CMIP6 Models, J. Geophys. Res.-Atmos., 126, e2021JD035310,, 2021. a, b, c, d, e, f, g

Simmons, J. B., Humphries, R. S., Wilson, S. R., Chambers, S. D., Williams, A. G., Griffiths, A. D., McRobert, I. M., Ward, J. P., Keywood, M. D., and Gribben, S.: Summer aerosol measurements over the East Antarctic seasonal ice zone, Atmos. Chem. Phys., 21, 9497–9513,, 2021. a

Sporre, M. K., O'Connor, E. J., Håkansson, N., Thoss, A., Swietlicki, E., and Petäjä, T.: Comparison of MODIS and VIIRS cloud properties with ARM ground-based observations over Finland, Atmos. Meas. Tech., 9, 3193–3203,, 2016. a, b

Tselioudis, G., Rossow, W., Zhang, Y., and Konsta, D.: Global weather states and their properties from passive and active satellite cloud retrievals, J. Climate, 26, 7734–7746,, 2013. a

Tselioudis, G., Rossow, W. B., Jakob, C., Remillard, J., Tropf, D., and Zhang, Y.: Evaluation of Clouds, Radiation, and Precipitation in CMIP6 Models Using Global Weather States Derived from ISCCP-H Cloud Property Data, J. Climate, 34, 7311–7324,, 2021. a, b, c

van Marle, M. J. E., Kloster, S., Magi, B. I., Marlon, J. R., Daniau, A.-L., Field, R. D., Arneth, A., Forrest, M., Hantson, S., Kehrwald, N. M., Knorr, W., Lasslop, G., Li, F., Mangeon, S., Yue, C., Kaiser, J. W., and van der Werf, G. R.: Historic global biomass burning emissions for CMIP6 (BB4CMIP) based on merging satellite observations with proxies and fire models (1750–2015), Geosci. Model Dev., 10, 3329–3357,, 2017. a

Walters, D., Baran, A. J., Boutle, I., Brooks, M., Earnshaw, P., Edwards, J., Furtado, K., Hill, P., Lock, A., Manners, J., Morcrette, C., Mulcahy, J., Sanchez, C., Smith, C., Stratton, R., Tennant, W., Tomassini, L., Van Weverberg, K., Vosper, S., Willett, M., Browse, J., Bushell, A., Carslaw, K., Dalvi, M., Essery, R., Gedney, N., Hardiman, S., Johnson, B., Johnson, C., Jones, A., Jones, C., Mann, G., Milton, S., Rumbold, H., Sellar, A., Ujiie, M., Whitall, M., Williams, K., and Zerroukat, M.: The Met Office Unified Model Global Atmosphere 7.0/7.1 and JULES Global Land 7.0 configurations, Geosci. Model Dev., 12, 1909–1963,, 2019. a, b

Webb, M. J., Andrews, T., Bodas-Salcedo, A., Bony, S., Bretherton, C. S., Chadwick, R., Chepfer, H., Douville, H., Good, P., Kay, J. E., Klein, S. A., Marchand, R., Medeiros, B., Siebesma, A. P., Skinner, C. B., Stevens, B., Tselioudis, G., Tsushima, Y., and Watanabe, M.: The Cloud Feedback Model Intercomparison Project (CFMIP) contribution to CMIP6, Geosci. Model Dev., 10, 359–384,, 2017. a

Wilks, D. S.: Statistical Methods in the Atmospheric Sciences, Elsevier, 3rd edn., ISBN 978-0-12-815823-4,, 2011. a

Williams, K. D. and Tselioudis, G.: GCM intercomparison of global cloud regimes: Present-day evaluation and climate change response, Clim. Dynam., 29, 231–250,, 2007. a, b

Williams, K. D. and Webb, M. J.: A quantitative performance assessment of cloud regimes in climate models, Clim. Dynam., 33, 141–157,, 2009. a, b, c, d, e, f, g, h

Wilson, D. R., Bushell, A. C., Kerr-Munslow, A. M., Price, J. D., and Morcrette, C. J.: PC2: A prognostic cloud fraction and condensation scheme. I: Scheme description, Q. J. Roy. Meteor. Soc., 134, 2093–2107,, 2008. a

Wood, N., Staniforth, A., White, A., Allen, T., Diamantakis, M., Gross, M., Melvin, T., Smith, C., Vosper, S., Zerroukat, M., and Thuburn, J.: An inherently mass-conserving semi-implicit semi-Lagrangian discretization of the deep-atmosphere global non-hydrostatic equations, Q. J. Roy. Meteor. Soc., 140, 1505–1520,, 2014. a

Woodward, S.: Modeling the atmospheric life cycle and radiative impact of mineral dust in the Hadley Centre climate model, J. Geophys. Res.-Atmos., 106, 18155–18166,, 2001. a

Zelinka, M. D., Myers, T. A., McCoy, D. T., Po-Chedley, S., Caldwell, P. M., Ceppi, P., Klein, S. A., and Taylor, K. E.: Causes of Higher Climate Sensitivity in CMIP6 Models, Geophys. Res. Lett., 47, e2019GL085782,, 2020.  a, b

Zhang, K., Wan, H., Liu, X., Ghan, S. J., Kooperman, G. J., Ma, P.-L., Rasch, P. J., Neubauer, D., and Lohmann, U.: Technical Note: On the use of nudging for aerosol–climate model intercomparison studies, Atmos. Chem. Phys., 14, 8631–8645,, 2014. a

Short summary
Climate models have difficulty simulating Southern Ocean clouds, impacting how much sunlight reaches the surface. We use machine learning to group different cloud types observed from satellites and simulated in a climate model. We find the model does a poor job of simulating the same cloud type as what the satellite shows and, even when it does, the cloud properties and amount of reflected sunlight are incorrect. We have a lot of work to do to model clouds correctly over the Southern Ocean.
Final-revised paper