Articles | Volume 26, issue 2
https://doi.org/10.5194/acp-26-1459-2026
https://doi.org/10.5194/acp-26-1459-2026
Research article
 | 
28 Jan 2026
Research article |  | 28 Jan 2026

Inferring the controlling factors of ice aggregation from targeted cloud seeding experiments

Huiying Zhang, Fabiola Ramelli, Christopher Fuchs, Nadja Omanovic, Anna J. Miller, Robert Spirig, Zhaolong Wu, Yunpei Chu, Xia Li, Ulrike Lohmann, and Jan Henneberger
Abstract

Ice aggregation in clouds plays a crucial role in cloud development and precipitation formation. Despite the significance of ice aggregation, direct in situ quantification of aggregation rates in natural clouds has been challenging due to the difficulty of tracking ice crystals. Here, we present in situ measurements of ice aggregation rates in persistent supercooled stratiform clouds. Using novel glaciogenic seeding experiments (CLOUDLAB), ice crystals are nucleated upwind and subsequently measured downwind after a known advection time in cloud, allowing us to estimate their age. A deep-learning-based detection algorithm (IceDetectNet) counts the individual monomers of aggregates to derive the initial ice crystal number concentration (ICNCt0). We considered several factors that may influence ice aggregation, including ICNCt0, temperature, ice crystal size, aspect ratio, and turbulence. Among these, ICNCt0 was found to be the dominant factor controlling aggregation rates by three independent approaches: causal inference, a physical equation, and machine learning models. We report, however, a subquadratic dependence of the aggregation rate on ICNCt0 (mean exponent  0.92; 95 % CI: 0.88–0.97), in contrast to theoretical expectations (quadratic dependence). One possible explanation is that aggregation may also involve smaller ice crystals, but this remains hypothetical. To predict aggregation rates, we evaluated 11 machine learning models and a physically based formulation. CatBoost achieved the best statistical performance, while the physical model proved more robust in sensitivity tests. These findings provide new insights into the microphysical and environmental controls of ice aggregation and establish a robust methodological foundation for studying aggregation processes in natural clouds.

Share
1 Introduction

Ice aggregation is a key microphysical process that influences cloud development and precipitation formation. This process has important implications for weather prediction and climate modeling. During the early stage of ice growth, diffusional processes such as the Wegener–Bergeron–Findeisen mechanism dominate (Korolev2007) and vapor deposition in fully glaciated clouds such as cirrus(Gierens et al.2003). As clouds mature, collisional processes including aggregation and riming become increasingly important (Connolly et al.2012; Heymsfield1986; Sölch and Kärcher2011; Hosler et al.1957). These processes enable ice crystals to grow into larger particles more rapidly than by vapor deposition alone and are fundamental for the formation of snowflakes and graupel, eventually leading to precipitation-sized hydrometeors (Heymsfield1986; Sölch and Kärcher2011).

Numerous in situ observations have confirmed that ice–ice aggregation occurs in clouds over a wide temperature range, from just below 0 down to 60 °C (e.g., Connolly et al.2005; Crosier et al.2011; Field and Heymsfield2003). Aggregated ice crystals often constitute a substantial fraction of total ice. For example, irregular ice crystals – which include both aggregated and aged ice crystals – have been reported to account for 84 % of total ice crystals in stratiform clouds (Korolev et al.2000), 88 % in ground-based snow particle measurements (Zamorsky1955), and 94 % in Arctic clouds (Korolev et al.1999). More specifically, aggregated ice alone has been observed to comprise 38 % of ice crystals in Arctic mixed-phase clouds (Zhang et al.2024), 45 % in thunderstorm clouds (Jaffeux et al.2022), and 52 % on average across multiple cloud types during aircraft campaigns (Moss and Johnson1994). Despite their apparent ubiquity, quantitative understanding of aggregation rates remains limited.

Several factors have been hypothesized to influence ice aggregation, such as temperature, ice crystal shape, size, and turbulence. Temperature dependence has long been debated, with laboratory studies conflicting trends: Hosler and Hallgren (1960) observed a maximum aggregation efficiency near 15 °C, possibly due to the prevalence of dendritic growth forms that interlock upon collision. In contrast, earlier findings by Hosler et al. (1957) suggested peak aggregation rates around 0 to 5 °C, where quasi-liquid layers on ice surfaces may enhance adhesion (Lamb and Verlinde2011). These competing mechanisms – the habit-based, dendritic ice crystal interlocking mechanism at colder temperatures versus the enhanced surface stickiness near freezing, which happens at the same time – highlight the complexity of aggregation processes. Turbulence may further enhance aggregation by introducing small-scale velocity fluctuations that promote collisions (Chellini and Kneifel2024; Sheikh et al.2022). They also suggest that multiple other pathways may operate, depending on the ambient conditions. Ice crystal size and number concentration also play important roles. Larger crystals exhibit greater fall-speed differences and larger geometric cross-sections, while higher ice crystal number concentrations (ICNC) increase collision frequency; both factors enhance the probability of collision and sticking (Hobbs et al.1974; Field and Heymsfield2003; Field et al.2006; Connolly et al.2012; Karrer et al.2021). However, disentangling these effects in natural clouds remains challenging due to observational constraints and the interplay of multiple factors.

Several experimental approaches have been developed to estimate aggregation, yet each carries inherent limitations. Aircraft measurements, often using a “Lagrangian spiral descent” strategy, infer aggregation efficiency from changes in ice crystal size distributions as the aircraft descends at approximately the terminal fall speed of the crystals (Field and Heymsfield2003; Field et al.2006). These measurements rely on 2-D imaging probes and typically assume that decreases in ice crystal number concentration result from aggregation. However, uncertainties arise from experimental artifacts (e.g., ice crystal breaking up on the inlets of the probe) and misattribution of number loss to aggregation alone (McFarquhar et al.2007; Lawson2011). Laboratory studies using ice cloud chambers offer better control over experimental conditions and avoid shattering artifacts, but chamber dimensions are generally too small to accommodate the timescales required for aggregation to occur naturally (Shaw et al.2020). Moreover, both aircraft and laboratory studies typically rely on indirect estimates – based on changes in ICNC – to infer aggregation efficiency, rather than direct observations (Connolly et al.2012; Field and Heymsfield2003).

These challenges make direct in situ measurements of aggregation rates in natural cloud systems challenging, yet critical for constraining microphysical parameterizations in models. In this study, we use a novel combination of UAV-based glaciogenic cloud seeding experiments (Henneberger et al.2023) within CLOUDLAB project and an advanced deep-learning detection algorithm (Zhang et al.2024) to address these gaps. This approach establishes a well-defined initial condition for ice crystal formation and subsequently captures a detailed snapshot of the ice crystal population after their residence in cloud, thereby allowing aggregation to be quantified over a precisely constrained time interval. The deep-learning algorithm (IceDetectNet) robustly identifies individual ice monomers within aggregated ice crystals, providing a monomer-resolved level of detail for quantifying aggregation.

The specific research questions that we address are:

  1. Which microphysical and meteorological factors control the rate of ice aggregation?

  2. How rapidly does aggregation occur following initial ice formation, and to what extent can the aggregation rate be inferred from the controlling parameters?

To answer these questions, we investigate ice aggregation rates in stratiform clouds using both data-driven and physically derived approaches. Specifically, we (i) identify the microphysical and meteorological controlling controls on aggregation (Sect. 3.1); (ii) disentangle the direct and indirect effects of each factor using causal inference (Sect. 3.2); (iii) evaluate predictive models trained on these factors (Sect. 4.2 and Sect. 4.3); and (iv) test the sensitivity of the predictions to temperature and the initial ice crystal number concentration (ICNCt0) (Sect. 4.4).

2 The CLOUDLAB campaign and observational data

The CLOUDLAB experiments were conducted in persistent wintertime stratus clouds over the Swiss Plateau, with measurements centered at a site near Eriswil (47°0414′′ N, 7°5222′′ E; 920 m a.s.l.). These clouds were typically supercooled, liquid-dominated, and quasi-stationary, with bases below 1000 m a.g.l. and thicknesses of several hundred meters (Scherrer and Appenzeller2014). We performed glaciogenic seeding upwind of the measurement site using uncrewed aerial vehicles (UAVs) and sampled the resulting microphysical changes with a suite of in situ and remote sensing instruments. Methods specific to this study are detailed below, including the seeding operations and instrumentation (Sect. 2.1) and observed ice properties (Sect. 2.4).

2.1 Seeding Operations and Instrumentation

Glaciogenic seeding was performed upwind of the measurement site using a customized uncrewed aerial vehicle (UAV; Meteodrone MM-670, Meteomatics AG, Switzerland) equipped with burn-in-place flares (Zeus MK2, Cloud Seeding Technologies, Germany). Each flare contained approximately 200 g of seeding material, including about 20 g of silver iodide and other ice-active compounds effective at temperatures below 5 °C (Chen et al.2024; Miller et al.2024). The UAV was operated by flying multiple crosswind legs (200–400 m) while releasing seeding particles for 5–6 min at distances of 2–3 km upwind of the measurement site at seeding time t0 (In-cloud seeding and ice nucleation processes in Fig. 1). Further details on the seeding operations and ice nucleation mechanisms are provided in Miller et al. (2024, 2025).

https://acp.copernicus.org/articles/26/1459/2026/acp-26-1459-2026-f01

Figure 1Scheme for quantifying in situ ice aggregation after cloud seeding. Seeding particles are released by a UAV, initiating ice crystal formation at time t0. The ice crystals grow and aggregate during the advection time t1t0, and at t1 an image of them is captured by a holographic imager on a tethered balloon system. The ice habit and the number of monomers per aggregate are quantified based on the IceDetectNet (Zhang et al.2024). For example, the right panel shows an aggregate consisting of two monomers classified as a frozen droplets and a column-plate. The aggregation rate (Ragg) is defined as the total number of aggregation events, (ni-1), divided by the advection time t and the sampled volume Vt1 at t1.

The plume of ice crystals generated by seeding arrived at the measurement site after 5–10 min depending on the wind speed. Because the plume was typically several hundred meters wide and persisted over the site for several minutes, it could be sampled continuously as it passed, yielding a time series of microphysical observations. The advection time t – the time between ice nucleation and observation – was calculated by comparing the times of seeding particles release and first ice detection, and using the observed wind speed across the plume to estimate advection time. To avoid underestimating travel time, the maximum wind speed among the measured instruments was used in the calculation (see Fuchs et al.2025 for more details regarding the method). Throughout this growth period, the temperature and wind speed were assumed to remain constant.

The microphysical properties of the plume were measured using a suite of in situ and remote sensing instruments at measurement time t1 (see Fig. 1, Ice detection). Remote sensing measurements included ground-based Ka- and W-band Doppler cloud radars. These radars were used to observe the general cloud structure (e.g., cloud top height) before and after seeding.They were also used to retrieve the turbulence intensity, which is expressed as the eddy dissipation rate (EDR). EDR was retrieved from the Ka- and W-band radars following the spectral-width method (Appendix E in Wu et al.2025). Primary estimates were obtained from the Mira35 MBR7 Ka-band radar. In addition, the RPG94 W-band radar was used to supplement data gaps. The retrieval was temporally averaged over 30 s, and the resulting EDR fields have a spatial resolution of approximately 30 m.

In situ observations were provided by a tethered balloon system (TBS) carrying the HOLographic Imager for Microscopic Observations (HOLIMO; Ramelli et al.2020). The TBS consisted of a 200 m3 helium-filled kytoon capable of reaching altitudes up to 1 km above ground. Suspended 30 m below the kytoon, the instrument platform carrying HOLIMO, a digital in-line holography system with resolving sizes of 6 µm for cloud droplets and 25 µm for ice crystals. HOLIMO operated continuously before, during, and after each seeding experiment, recording cloud droplets throughout and ice crystals over the duration of the plume encounter. Holograms were reconstructed using HoloSuite (Fugal and Shaw2009) at 1 s resolution, and particles were classified as artifacts, cloud droplets, or ice crystals using a convolutional neural network (Touloupas et al.2020). Manual verification was performed for all cloud droplets and ice crystals larger than 35 µm to ensure classification accuracy; no ice crystals smaller than this threshold were observed.

Classified ice crystals were then further analyzed using a fine-tuned version of IceDetectNet (Zhang et al.2024), named IceDetectNet-CLOUDLAB (see Sect. 2.3 for details). IceDetectNet-CLOUDLAB is a rotated object detection model trained to assign each monomer a shape label comprising its basic habit and microphysical process, and to estimate the number of monomers per ice crystal. The classification scheme includes three basic ice habits – column, plate, and irregular – and one microphysical process, riming. If an ice crystal contains more than one monomer, it is additionally classified as aggregated. This results in two independent microphysical processes (riming and aggregation) and four possible states for each ice crystal: pristine, rimed, aggregated, and rimed + aggregated. Combining the three habits with these four states yields a total of 12 distinct ice crystal classes. The uncertainty in cloud droplet number concentration is approximately ±5 %, while that for ice crystal number concentration ranges from 5 %–10 % for crystals larger than 100 µm and about 15 % for smaller ice crystals. Uncertainty quantification for habit/process classification and monomer counts is detailed in Appendix A.

2.2 Estimating Aggregation Rates from Experimental Data

Ice aggregation was quantified by estimating the number of monomer-level collisions that formed each observed aggregate. Under the definition that all ice crystals are monomers at the initial time t0 and that aggregation proceeds via binary collisions, an observed aggregate containing ni monomers has undergone (ni−1) aggregation events to reach its observed state at time t1. The aggregation rate Ragg (s−1 L−1) was thus defined as the mean number of aggregation events per unit time per unit cloud volume, averaged over the advection time t:

(1) R agg = i = 1 N ( n i - 1 ) t V t 1

where N is the total number of detected aggregates, ni is the monomer count of the ith aggregate, t is the advection time estimated based on the method in (Fuchs et al.2025), and Vt1 is the sampled cloud volume at time t1. The approach relies on the following assumptions:

  1. Steady background conditions. Environmental and microphysical properties of the background cloud (e.g., temperature, wind speed) are assumed to remain constant throughout each experiment.

  2. Instantaneous nucleation. Ice nucleation is assumed to occur immediately upon release of seeding particles, providing a well-defined initial time to ice formation. Our observations have revealed that ice formation is likely initiated very quickly after the release of the highly hygroscopic and ice-active seeding material (Miller et al.2025).

  3. Conservation of monomer number. The total number of ice crystal monomers is assumed to be conserved between the seeding and observation locations. This implies that (i) sedimentation losses within the plume are approximately balanced by ice crystals falling from above. This assumption is supported by radar observations, which do not show a systematic downward displacement of the enhanced reflectivity associated with the seeded plume during the observation period (see Fig. 2 in Fuchs et al.2025). (ii) no secondary ice production (SIP) occurs. The latter is supported by observations: no large droplets or graupel were detected, and no evidence of frozen droplet breakup was observed, consistent with conditions unfavorable for SIP.

  4. Complete detection of aggregate monomers. All monomers within an aggregate are assumed to be accurately identified by IceDetectNet. However, some early aggregates may become unresolved after diffusional growth and be classified as irregular, but this fraction is very small (see Fig. 2h). Other potential detection errors and classification errors are discussed in Appendix A and are considered negligible.

2.3 Training IceDetectNet-CLOUDLAB for Aggregated Ice Monomer Identification

To retrieve the number of monomers in each aggregate, we fine-tuned the original IceDetectNet model (Zhang et al.2024) to create IceDetectNet-CLOUDLAB, specifically adapted for holographic images collected during CLOUDLAB seeding experiments. The original model was trained to classify ten ice crystal habits; for this application, the output layer was reconfigured to distinguish four categories: column, plate, column-rimed, and plate-rimed. The final classification layer was randomly initialized, while all other parameters were taken from the IceDetectNet. A total of 2380 manually labeled images were used, randomly selected to ensure an approximate balance across all seeding experiments. Of these, 2134 images were used for training and 246 for testing. The model architecture followed IceDetectNet-CLOUDLAB (Zhang et al.2024), based on S2ANet with a ResNet-50 backbone. To improve training stability on the smaller CLOUDLAB dataset, which contains fewer ice crystals and fewer classes than previous applications, the initial learning rate was reduced to 0.0001. The number of training epochs was extended to 200, with learning rate decay scheduled earlier at epochs 32 and 48 to encourage earlier convergence and to prevent overfitting. A linear warmup phase of 1000 iterations was applied to avoid early gradient instability. During inference, up to six predictions per image were retained after non-maximum suppression (IoU threshold = 0.5).

2.4 Statistical Characterization of Ice Crystal Properties

We analyzed 21 seeding experiments (Table F1) conducted at temperatures between 4.7 and 7.8 °C. The ice crystal habit distribution differed systematically with temperature: warmer experiments (T>7 °C) contained exclusively columnar crystals, whereas the colder experiments (T7 °C) contained both plates and short columns (Fig. E1). Plates and columns have been shown to exhibit different diffusional growth rates along their major axes (Fuchs et al.2025), which can influence subsequent aggregation. Based on this habit difference, the experiments were classified into “warmer” (T>7 °C; 14 experiments, 2508 data points) and “colder” (T7 °C; 7 experiments, 797 data points) regimes (Fig. 2a). Advection times were generally shorter in the colder experiments, ranging from 294–532 s, compared to 371–629 s in the warmer experiments (Fig. 2b).

ICNCt0, the initial ice crystal number concentration, was estimated by assuming that each aggregated monomer corresponds to one ice crystal present at t0. This allows estimation of the ICNCt0 at the initial state t0. ICNCt0 was higher in the warmer regime than in the colder, with mean ± standard deviation of approximately 815 L−1± 394 L−1 and 566 L−1± 291 L−1, respectively (Fig. 2c). While these concentrations are higher than those typically found in unseeded clouds (Heymsfield and Willis2014), the aggregation process is governed by the same underlying collisional physics.Aggregates were more frequent in the warmer regime, with fractions reaching up to 43 % of total observed ice crystals, while remaining around 15 % in the colder regime (Fig. 2d). Ice crystals in the warmer regime reach mean major size (defined as the length of the crystal’s major axis) per experiment of up to  312 µm (minimum  101 µm), compared to a range of  66–129 µm in the colder regime; the warmer regime also exhibited a broader, long-tailed size distribution (Fig. 2e). Aspect ratio (defined as the ratio of the major to minor axis lengths) distributions differed as well: the colder regime was narrowly peaked around 1.4–1.6, while the warmer regime showed a broader and flatter distribution around 1.73–4.13 (Fig. 2f). The riming ratio (RR) is defined as the fraction of rimed crystals. Rimed crystals were abundant in both regimes, with fractions exceeding 38 % in all experiments and reaching up to 94 % in one single experiment, indicating that riming was nearly ubiquitous (Fig. 2g). Finally, irregular ice crystals, which have no identifiable habit, were uncommon overall (2 %–5 %; Fig. 2h). Some early aggregates may have grown sufficiently by diffusion to obscure their monomer structure, thus being classified as irregular. These crystals were slightly more frequent in the colder regime, suggesting that any resulting underestimation of aggregation is likely negligible. EDR values measured during the seeding flights were modest (on the order of 10−410-3m2s-3, Fig. B1), comparable to turbulence levels reported for other boundary-layer cloud environments (Chu et al.2025). This indicates that aggregation developed under a weak-turbulence conditions characteristic of stratiform mixed-phase clouds.

https://acp.copernicus.org/articles/26/1459/2026/acp-26-1459-2026-f02

Figure 2Distributions of ice crystal properties during seeded periods, grouped by temperature: colder (T7 °C, blue) and warmer (T>7 °C, red). (a) Temperature; (b) Advection time; (c) ICNCt0; (d) Aggregation ratio; (e) Major size of ice crystals; (f) Aspect ratio; (g) Riming ratio; (h) Irregular ratio. For (a) and (b), the solid lines show the per-experiment distributions. For (c) through (h), the solid lines represent one-second resolution distributions within each regime, and the scatter points indicate the means of the experiments (The scatter positions along the y axis are offset solely to reduce overlap. Only the values along the x axis carry meaning).

Download

3 Microphysical and Environmental Controls on Ice Aggregation

We examined the microphysical and environmental controls on ice aggregation through a structured sequence of analyses. We first examined how microphysical and environmental factors influence the observed aggregation rates (Sect. 3.1). We then applied a causal graph to disentangle the direct and indirect effects of these factors on aggregation (Sect. 3.2).

3.1 Correlations Between Aggregation Rate and Microphysical and Environmental Factors

To better understand the mechanisms underlying aggregation, we first examine the Pearson correlations between the aggregation rate and four factors: ICNCt0, advection time, temperature, and EDR. No clear correlation was observed between aggregation rate and EDR, which characterizes the turbulence intensity within the cloud (see Appendix  B for the EDR correlation analysis). This suggests that, under the present experimental conditions, turbulence at the resolved scales (30 m × 30 m) did not significantly influence aggregation. The correlations with ICNCt0, advection time, and temperature are all presented in the following.

3.1.1 Strong positive correlation between ICNCt0 and aggregation rate

ICNCt0 exhibited a strong positive correlation with the aggregation rate in all experiments (Fig. D1). This is consistent with theoretical expectations, as higher crystal concentrations increase the probability of collisions leading to aggregation (Hobbs et al.1974).

To illustrate this relationship more clearly, we present two experiments for reference: a warmer case (SM054, T=5 °C) and a colder case (SM069, T=7.6 °C) (Fig. 3a, b). Under the condition that the ice crystals are homogeneously distributed, the instantaneous aggregation rate (Seifert and Beheng2006, Eq. 62), which is the number of aggregation events per unit time per unit volume of cloud, can be expressed as:

Ragg=1/2K(D1,D2,T)N1N2,

where N1 and N2 are the number concentrations of two ice crystal populations of sizes D1 and D2, T is temperature, and K(D1,D2,T) is the collision kernel, which depends on ice crystal size, shape, fall speed, and temperature (Connolly et al.2012). In the simplest case, if the shape of the size distribution is approximately preserved and only its overall magnitude changes with the total ice number concentration, then the concentrations of the relevant size classes, N1 and N2, both scale linearly with the initial total ice concentration, ICNCt0. Under this approximation, both the instantaneous and time-averaged aggregation rates are expected to scale as Ragg=1/2K(D1,D2,T)(ICNCt0)2.

https://acp.copernicus.org/articles/26/1459/2026/acp-26-1459-2026-f03

Figure 3Correlation between aggregation rate and ICNCt0 in two temperature regimes. (a) warmer case (SM054, T=5.0 °C); (b) colder case (SM069, T=7.6 °C). One-second data points are shown as red circles (warmer) and blue squares (colder). Solid lines with shaded areas show free power-law fits Ragg=aICNCt0n with 95 % confidence intervals (red for warmer, blue for colder); dashed lines with grey shading show quadratic fits (n=2). The prefactor a was fitted independently for each experiment and reflects the effective collision kernel.

Download

Motivated by this scaling, we first evaluated the data against this fixed quadratic relationship (Ragg =a ICNCt02), where the prefactor a effectively represents 1/2K(D1,D2,T). To account for differences between experiments, a was fitted independently in each case. While this model gets correlation coefficients of r=0 and r=0 for the warmer and colder cases, respectively, it systematically underestimated aggregation at low ICNCt0 (Fig. 3a, b, dashed grey lines).

A better fit was achieved using a free power-law relationship (Ragg =a ICNCt0n), where both a and n were estimated separately for each experiment. This empirical model closely matched the observations across the full range of concentrations (Fig. 3a, b, solid lines). In the warmer-case experiment (SM054), the aggregation rate scaled with an exponent of 1, improving the correlation to r=0 (Fig. 3a). In the colder-case experiment (SM069), the best-fit exponent was 0, with a corresponding correlation of r=0 (Fig. 3b). This pattern was consistent across all 21 experiments, where the free-fit power-law systematically performed better than the fixed-quadratic form (see Fig. D1). The average exponent across all experiments was 0 ± 0, smaller than the quadratic dependence expected from the collision kernel formulation (Melzak1957; Connolly et al.2012; Seifert and Beheng2006). Bulk microphysics schemes such as Lin et al. (1983) and Morrison and Milbrandt (2015) have even adopted a linear dependence on ICNC, which is closer to our observations, often with an additional size threshold so that aggregation only occurs once ice crystals exceed a certain size. However, our observations hint that aggregation may also occur among smaller ice crystals. This could result in effective dependence with an exponent smaller than one.

3.1.2 No significant correlation between aggregation rate and advection time

Advection time in our experiments ranged from 294 to 650 s (Table F1, Fig. 4) and showed no significant correlation with aggregation rate (r=0.09, p=0). This parameter is specific to our experimental design, as the ice crystals were artificially generated by cloud seeding, enabling an explicit measurement of the total growth period. In natural-cloud studies, advection time is rarely quantified, and direct comparisons are therefore not straightforward. Theoretically, a longer advection time would be expected to yield a smaller time-averaged aggregation rate, because ICNC decreases over time. We find only a very weak tendency for Ragg to decrease with advection time (Fig. 4, R2=0), suggesting that our results remain broadly comparable to previous observational and laboratory studies despite this experimental uniqueness (Connolly et al.2012; Field and Heymsfield2003; Field et al.2006). This weak dependence between aggregation rate and advection time may reflect the fact that ice crystals must first grow to sufficiently large sizes before collisional aggregation becomes efficient. During the early growth phase, ICNC decreases rapidly due to plume dilution (as seeded particles are initially concentrated and then disperse; see Fig. 2 in Ramelli et al.2024), but this shows little impact on the aggregation rate, making it appear largely insensitive to advection time.

https://acp.copernicus.org/articles/26/1459/2026/acp-26-1459-2026-f04

Figure 4Correlation between aggregation rate and advection time. Orange markers with error bars show the mean ± standard deviation for each experiment, and the solid line represents the linear fit (r=0, p=0) across all 21 experiments.

Download

3.1.3 Temperature dependence of aggregation rate associated with ice crystal size and aspect ratio

To investigate the role of temperature in shaping aggregation, motivated by the higher aggregation rate observed in the warmer case (SM054) compared to the colder case (SM069), we first examined how ice crystal size and geometry differ between warmer and colder experiments. Using the same representative cases shown in Fig. 3, we compared the warmer case SM054 (5 °C) with the colder case SM069 (7.6 °C). Consistent with known temperature-dependent ice habits, the warmer group (above 7 °C) consisted almost exclusively of columnar ice crystals (Fig. E1a), whereas the colder group (below 7 °C) included both columnar and plate-like crystals (Fig. E1b). Correspondingly, the warmer case exhibited consistently broader distributions of both major axis length and aspect ratio (Fig. 6). This behavior was not unique to these two cases: across the full dataset, warmer experiments showed broader distributions of major size (Fig. 2e) and aspect ratio (Fig. 2f) than colder experiments. Increased variability in crystal size and shape broadens the range of fall velocities among ice crystals (Heymsfield1972; Mitchell1996), which is expected to enhance collision frequency and thereby promote aggregation.

Consistent with this interpretation, aggregation rate shows a very weak positive association with temperature across all experiments (Pearson r=0, p=0; Fig. 5). While this relationship does not reach conventional significance levels and may therefore be difficult to distinguish from noise over the limited temperature range sampled, it is qualitatively consistent with the observed structural differences in the ice population and with previous laboratory studies suggesting enhanced aggregation under warmer conditions (Hosler et al.1957).

https://acp.copernicus.org/articles/26/1459/2026/acp-26-1459-2026-f05

Figure 5Correlation between aggregation rate and temperature. Same as Fig. 3, but for the Temperature.

Download

https://acp.copernicus.org/articles/26/1459/2026/acp-26-1459-2026-f06

Figure 6Distributions of ice crystal major axis length and aspect ratio in warmer and colder cases. (a) Major axis length and (b) aspect ratio for SM054 (warmer, red) and SM069 (colder, blue).

Download

3.2 Causal Pathways Among Aggregation Drivers

Previous analyses showed that ICNCt0, temperature, major axis length, and aspect ratio each correlate with aggregation rate to varying degrees. However, these factors are not independent, and their relative contributions and interactions remain unclear. To disentangle direct and indirect effects among these variables, we constructed a causal graph in the form of a directed acyclic graph (DAG; Fig. 7), incorporating these 4 variables as well as riming ratio (RR), which is the percentage of rimed ice crystals to all the observed ice crystals. Because riming influences ice crystal geometry – particularly major size and aspect ratio – and it may also influence surface stickiness, especially shortly after the riming event, it can, in principle, affect aggregation indirectly.

DAGs represent hypothesized cause–effect relationships, with nodes denoting variables and arrows indicating the direction of influence (Pearl2009; Peters et al.2016). Unlike correlation-based analyses, DAGs explicitly separate direct effects from mediated pathways, enabling quantitative decomposition of total effects. For example, temperature may influence aggregation rate both directly and indirectly by altering ice crystal size and aspect ratio.

The graph structure was specified by combining prior physical knowledge with statistical dependencies inferred from the data. Specifically, candidate causal links were first proposed based on established microphysical understanding of ice aggregation (e.g., ICNC, size, and habit influencing collision and sticking probability), while arrows implying physically implausible causality (e.g., aggregation rate influencing temperature) were excluded a priori. Statistical associations were then used to assess whether the proposed links were supported by the data. The final DAG therefore reflects a physically constrained causal hypothesis rather than a data-driven structure learned automatically. Path coefficients were estimated using structural equation modeling (SEM), which fits a system of regression equations to quantify the strength of each link. All variables were standardized prior to fitting, so that each coefficient represents the change in aggregation rate (in standard deviations) associated with a one-standard-deviation increase in the predictor, reflecting their relative importance.

The fitted DAG (Fig. 7) shows that ICNCt0 (+0.81), temperature (+0.24), and major size (+0.08) have positive direct effects on aggregation rate, with ICNCt0 clearly dominating. Aspect ratio exhibits a weak negative direct effect (0.16), likely because stronger aggregation tends to increase the number of monomers per particle, producing more compact and rounded aggregates and thus reducing the aspect ratio. Beyond its direct effect, temperature also strongly influences aspect ratio (+0.61) and major size (+0.40), which in turn affect aggregation. Since aspect ratio and major size have opposing effects (negative and positive, respectively), these indirect contributions partially cancel, and temperature maintains an overall positive influence.

https://acp.copernicus.org/articles/26/1459/2026/acp-26-1459-2026-f07

Figure 7Causal graph of factors influencing aggregation rate. The graph depicts the inferred direct and indirect effects on aggregation rate (Ragg) among five variables: ICNCt0, temperature (T), major axis length (Majsiz), aspect ratio (AR), and riming ratio (RR). Nodes represent variables, and directed arrows indicate causal relationships. Arrow thickness, color, and the numerical labels on the arrows denote the magnitude and direction of the standardized effects (change in aggregation rate in standard deviations per standard deviation change in the predictor).

Download

RR has a negligible direct effect on aggregation (0.01). RR indirectly increases the major axis length (+0.31), which tends to promote aggregation; however, the positive contribution of major size to the aggregation rate (+0.08) is itself very small. Moreover, most rimed ice crystals in our measurements exhibit only light riming (Fig. E1), limiting the potential for riming-induced enhancements in fall speed or sticking efficiency. Under these specific experimental conditions, the overall influence of RR on aggregation appears minimal, suggesting that any coupling between riming and aggregation is weak or not detectable within these experiments.

4 Aggregation Rate Predictive Models

Building on the preceding analysis, we next evaluated whether aggregation rates can be quantitatively predicted from the identified microphysical and meteorological factors. We first describe the training procedures and evaluation metrics (Sect. 4.1). Then, we implemented and compared eleven machine learning models (Sect. 4.2) and one physically based equation (Sect. 4.3), using temperature, major axis length, aspect ratio, and ICNCt0 as input features and the aggregation rate as the target variable. RR was excluded from the predictive modeling as its direct effect on aggregation was found to be negligible in our causal inference analysis (Sect.3.2). Finally, to evaluate the robustness of these models under varying atmospheric and microphysical conditions, we performed a sensitivity analysis with respect to temperature and ICNCt0 (Sect. 4.4).

4.1 Predictive Model Training and Evaluation

We trained the models using temperature, major axis length, aspect ratio, and ICNCt0 as predictors of the aggregation rate. All variables except temperature were available at 1 s resolution; temperature was constant for each experiment. We fitted the analytical model (Eq. 2) by minimizing the mean absolute error (L1 norm) between predicted and observed aggregation rates, with L2 regularization applied to all parameters (penalty coefficient =10-4). All coefficients were initialized to one and optimized using gradient descent with a learning rate of 10−2 over 1500 iterations. All machine learning models were trained with default hyperparameters, except for a fixed random seed (42) to ensure reproducibility. No early stopping, feature engineering, or categorical coding was applied, as all inputs were continuous. Model performance was assessed using five-fold cross-validation with group-based splits defined by seeding experiment identifiers. This ensured that data from a given experiment were restricted to either training or test sets. For each model and fold, we computed the root mean square error (RMSE), which quantifies the typical magnitude of prediction error, and the coefficient of determination (R2), which reflects the proportion of variance explained. We reported the mean and standard deviation of both metrics across the five folds. After evaluation, each model was retrained on the full dataset and archived for reproducibility. Model interpretability was assessed using SHapley Additive exPlanations (SHAP; Lundberg and Lee2017). SHAP values quantify how much each feature shifts a prediction away from the dataset’s average prediction, with positive values indicating an increase and negative values a decrease. Conceptually, each feature is treated as a “player” in a cooperative game, and its SHAP value represents the average change in the prediction when the feature is added to all possible subsets of other features. This provides a consistent measure of each feature’s contribution to the prediction, accounting for interactions with other features. Global feature importance was obtained by averaging the absolute SHAP values across all samples, and the results were visualized using summary plots.

4.2 Machine Learning Models for Aggregation Rate Prediction

4.2.1 Overview of Machine Learning Algorithms

We implemented eleven supervised regression algorithms to model the relationship between environmental and microphysical predictors and aggregation rate. These included linear methods (Linear, Ridge, and Bayesian Ridge Regression), instance-based learning (K-Nearest Neighbors), decision tree–based models (Decision Tree, Extremely Randomized Trees), and ensemble boosting techniques (Adaptive Boosting, Gradient Boosting, Light Gradient Boosting Machine, Extreme Gradient Boosting, and gradient boosting with categorical features support (CatBoost)). Linear models assume independent, additive effects and are limited in representing nonlinear interactions. Tree-based models capture nonlinear and interaction effects through hierarchical partitioning of the feature space, while boosting methods iteratively refine predictions by combining multiple weak learners. Detailed descriptions of all models are provided in Appendix C. Among these, CatBoost is particularly well-suited for the current problem because of its ability to handle nonlinear interactions and its robustness to overfitting. CatBoost is a gradient boosting method that uses symmetric (oblivious) decision trees, where all nodes at the same depth split on the same feature and threshold. This symmetry simplifies optimization and improves generalization. It also introduces “ordered boosting,” which builds trees in a way that avoids using future information during training, thereby reducing overfitting. Additionally, CatBoost efficiently handles categorical features and typically requires minimal hyperparameter tuning.

4.2.2 Predictive Performance and Feature Contributions

Among all machine learning models, CatBoost and Gradient Boosting achieved the highest performance, with the lowest RMSE (1×10-1 s−1 L−1) and high R2 (0) for CatBoost, and a slightly higher RMSE but marginally better R2 (0) for Gradient Boosting. Their comparable skill likely reflects their shared ensemble structure and residual learning strategy. Extra Trees and Light Gradient Boosting Machine followed closely in performance (Table 1). For subsequent analyses, CatBoost was selected as the representative data-driven model owing to its competitive accuracy and robust generalization. The physically derived equation also performed comparably to the tree-based models, achieving an RMSE of 2×10-1 s−1 L−1 and R2 of 0, suggesting that it captures the dominant aggregation dependencies despite its reduced complexity and lack of data-driven tuning. In contrast, linear models (Linear Regression, Ridge, Bayesian Ridge) performed poorly, with R2 values around 0 and RMSE around 2×10-1 s−1 L−1, highlighting the importance of nonlinear interactions and feature dependencies that linear approaches cannot represent. In each experiment (Fig. 8), the agreement between observed aggregation rates and CatBoost predictions is evident, with residuals having means near zero and relatively small standard deviations. These results demonstrate that the model effectively captures the relevant physical mechanisms.

Table 1Model performance comparison. Mean RMSE (10−1 s−1 L−1) and R2 values along with their standard deviations, sorted by RMSE in ascending order.

Download Print Version | Download XLSX

https://acp.copernicus.org/articles/26/1459/2026/acp-26-1459-2026-f08

Figure 8Observed, predicted, and residual aggregation rates per experiment. For each experiment, observed aggregation rates (dark blue circles), CatBoost-predicted aggregation rates (yellow triangles), and residuals (pink squares, observed minus predicted) are shown side by side, with observed values on the left, predicted in the middle and residuals on the right. Markers indicate mean values, and error bars represent the standard deviation within each experiment. Rates are expressed in L−1 s−1.

Download

To interpret the predictions, we computed SHAP, which quantifies the relative contribution of each feature factor. ICNCt0 was the dominant predictor, explaining 69.9 % of the variance (Fig. 9), consistent with the earlier correlation analysis (Figs. 3, D1) and the causal graph (Fig. 7), where ICNCt0 also showed the strongest standardized effect. However, a simple linear regression model using ICNCt0 alone performed poorly (R2=0±0, RMSE =2×10-4±0×10-1 s−1 L−1; Table 1), underscoring the importance of nonlinear interactions among multiple variables.

https://acp.copernicus.org/articles/26/1459/2026/acp-26-1459-2026-f09

Figure 9SHAP analysis of feature contributions to CatBoost model predictions. Each dot represents one prediction at 1 s resolution from the test data. SHAP values (horizontal axis) indicate the impact of each input feature on the predicted aggregation rate. Features (i.e. ICNCt0, Major size, Aspect ratio, and Temperature) are ranked by their overall importance (right), with percentages representing the mean absolute SHAP value normalized across all features. Dot colors reflect the corresponding feature values, with red denoting higher and blue denoting lower values of the feature.

Download

Major axis length (14.9 %) also contributed meaningfully to the predictions. Larger major sizes were associated with higher aggregation rates, consistent with the expectation that larger crystals, with greater cross-sectional area and fall-speed variability, enhance collision likelihood (Heymsfield and Miloshevich2003) – in agreement with the positive direct effect identified in the causal graph (+0.08, Fig. 7). Aspect ratio (8.3 %) had a weaker and more complex effect: SHAP values clustered near zero, with a slight tendency for lower aspect ratios to favor aggregation and higher aspect ratios to suppress it, consistent with its negative effect in the causal graph (0.16).

Temperature accounted for 6.9 % of the model output. Higher temperatures generally promoted aggregation, consistent with laboratory findings (Hosler et al.1957) and the causal graph (+0.24, Fig. 7). Also, unlike the other variables, temperature was measured as a single value per experiment rather than at a 1 s resolution, which may have limited its explanatory power in the model. Nevertheless, its limited contribution suggests that its influence is largely indirect and mediated by changes in crystal habit and size, as reflected by aspect ratio and major axis length. Therefore, temperature likely acts as a secondary driver embedded within other structural parameters.

4.3 Physical Model

4.3.1 Model Formulation

We constructed an empirical formulation for the ice aggregation rate based on established microphysical principles. The basic structure follows the collision kernel framework (Melzak1957), in which the aggregation rate is proportional to the product of the number concentrations of the interacting particles. Under the simplest assumption of equal-sized populations, this scaling becomes quadratic in ICNCt0. However, several bulk microphysics schemes adopt a linear dependence. For example, in Lin et al. (1983), the aggregation rate is parameterized as being proportional to the ice mixing ratio (Lin et al.1983, Eq. 21). Because the mixing ratio is given by the product of number concentration and mean ice crystal mass, this formulation effectively yields a linear dependence on number concentration when the ice-crystal-mass–number relationship (i.e., the assumed PSD shape) is held fixed within the scheme. Similar linear forms also appear in two-moment parameterizations such as Morrison and Milbrandt (2015). To allow flexibility between these theoretical and parameterization-based scalings, the exponent on ICNCt0 was treated as a free parameter. The temperature dependence is represented as an exponential term, consistent with parameterizations of collision efficiency in two-moment schemes (e.g. Seifert and Beheng2006), where temperature modulates the quasi-liquid layer and hence the sticking probability upon collision. The dependence on major axis length is formulated as a power law to account for its role in determining both geometric cross-section and differential fall speed, which are classical components of aggregation kernels. Thus, the proposed equation is:

(2) R agg = α ICNC t 0 β 0 exp β 1 T MajSiz β 2

where: Ragg is the observed aggregation rate (s−1 L−1), α is a dimensioned scaling constant, β0, β1, and β2 are empirically fitted exponents, ICNCt0 is the initial ice crystal number concentration (L−1), T is the ambient temperature (°C), MajSiz is the average major axis length of ice crystals (m). The term ICNCt0β0 accounts for the increased probability of collisions as a function of ice crystal concentration. Collection theory suggests a near-quadratic dependence under the assumptions of random motion and homogeneous mixing (Connolly et al.2012). We retain a flexible exponent β0 to account for inhomogeneities and dispersion effects in real clouds. The temperature-dependent exponential term, eβ1T, reflects the nonlinear role of temperature in aggregation-relevant processes. Temperature modulates diffusional growth, growing dimension and the thickness of the quasi-liquid layer, which shapes ice size and habit. Many of these processes exhibit exponential or Arrhenius-like behavior with temperature. The exponential formulation is consistent with aggregation parameterizations in operational two-moment microphysics schemes (e.g. Seifert and Beheng2006; Lin et al.1983). The power-law dependence on ice crystal size, MajSizβ2, represents the combined influence of fall speed variability and geometric collision cross section. Larger crystals sediment faster and offer larger interaction surfaces, increasing the likelihood of collision and adhesion. This formulation follows classical approaches to the collection kernel under differential sedimentation. Aspect ratio was excluded from the final model, as it is not explicitly represented in two-moment bulk schemes (Lohmann and Roeckner1996; Seifert and Beheng2006), and its fitted coefficient in our analysis was negligible compared to the other factors.

The final fitted form is:

(3) R agg = 0 ICNC t 0 0 exp ( 0 T ) MajSiz 0 ,

When optimized on the same dataset, this physical model achieved an RMSE of 2×10-4 s−1 L−1 and an R2 of 0.78 (Table 1). Although lower than the CatBoost model (R2=0), its performance remains strong, suggesting that the dominant dependencies of aggregation are well captured.

The fitted ICNCt0 exponent (β0=0) is below 1, which is consistent with the above free power-law fitting, which includes only one controlling factor, ICNCt0. This strong, but nonlinear, positive contribution aligns with both the causal graph (Fig. 7) and SHAP analysis (Fig. 9), which also identified ICNCt0 as the dominant predictor. The temperature coefficient (β1=0) indicates a positive sensitivity to temperature, consistent with laboratory evidence (Hosler et al.1957) as well as the causal graph and SHAP findings, though the effect is modest. However, this coefficient reflects only the behavior within our observational temperature range (between 4.7 and 7.8 °C). Finally, the major size exponent (β3=0) is relatively small, suggesting a weaker dependence on crystal size compared to ICNCt0 or temperature. This may reflect either the limited variation in crystal size across experiments, especially the colder cases (Fig. 2e) or the dominant influence of concentration effects under high-ICNCt0 seeded conditions.

4.4 Sensitivity to temperature and ICNCt0

We evaluated the sensitivity of predicted aggregation rates to temperature and ICNCt0 by comparing predictions from CatBoost (CatB) and the physical equation against observations (Fig. 10). Three prescribed ICNCt0 levels – 101, 102, and 103 L−1 – were selected to represent conditions characteristic of stratus clouds (Gultepe et al.2001), deep convection (Heymsfield and Willis2014), and secondary ice production in convective cloud systems (Korolev et al.2020). To enable a meaningful comparison between predictions and observations, we stratified the measurements into three corresponding ICNCt1 intervals: (100, 101), (101, 102), and (102, 103) L−1. These intervals approximately align with the predicted ICNCt0 levels and allow an assessment of the models’ ability to capture the observed magnitudes and trends across representative regimes.

Both models reproduce the observed positive temperature dependence within the measurement domain (7.8 to 4.7 °C; shaded region in Fig. 10) and exhibit physically plausible trends beyond. At all fixed ICNCt0, predicted aggregation rates increased systematically with temperature, broadly consistent with the observations. Observed rates generally clustered around the predicted values at their respective ICNC levels, supporting the proposed scaling.

CatBoost predictions were smooth and closely aligned with observations at high and intermediate ICNC levels, but at low ICNC they exhibited pronounced fluctuations at warmer temperatures (T>6 °C) and a nearly flat response at colder temperatures (T<6 °C; solid dark blue line in Fig. 10). This behavior reflects the high variability of observations in the low-ICNC regime. We quantify this variability using the coefficient of variation (CV), defined as CV=σ/μ, where σ and μ are the standard deviation and mean of Ragg within a given bin. Across the observed temperature range, CV values at low ICNC are typically 1.2–2.1, compared to 0.7–1.0 at intermediate ICNC and  0.6 at high ICNC. Such large CV values indicate that aggregation rates in this regime are dominated by strong fluctuations rather than a clear temperature signal. In combination with the imposed lower bound on the target variable during training, this lack of a resolvable signal causes the tree-based model to revert to the lower-bound baseline, resulting in an approximately constant prediction at low temperatures. The flat behavior therefore reflects the absence of a learnable temperature dependence in this regime, rather than a physically meaningful relationship. The fact that a localized dip near 5.3 °C appears consistently across all ICNC regimes, while being most pronounced at low ICNC and progressively weaker at higher ICNC, further supports this interpretation. This feature is more likely to reflect experiment-to-experiment variability and condition mixing near this temperature, together with the non-smooth, threshold-based nature of tree models, rather than a robust, generalizable physical transition.

In contrast, predictions from the physical equation were stable and monotonic across the full ICNC and temperature ranges, reproducing the general trends well, though slightly overestimating aggregation rates at the lowest ICNC and T<7 °C. The increased variability in low-ICNC observations suggests that other factors, such as habit composition, size distribution, or turbulence, may exert a proportionally stronger influence when ICNC is low. From a modeling perspective, this implies that physically based parameterizations provide a more reliable baseline representation of aggregation under such conditions, with observational or machine-learning-based approaches best suited for diagnosing variability or informing uncertainty estimates rather than defining deterministic temperature dependence.

Although evaluation based on RMSE and R2 over the full test set indicated slightly better performance of CatBoost compared to the physical equation, Fig. 10 suggests that the two approaches perform similarly in most regimes, with the physical equation outperforming CatBoost in some cases (e.g., low ICNC at T<7 °C). This apparent discrepancy can be partly attributed to how the performance metrics are computed: RMSE and R2 are dominated by data-rich regions, where CatBoost tends to follow the observed scatter more closely – especially when the observations themselves are subject to variability from binning into three ICNCt1 intervals [(100, 101), (101, 102), and (102, 103) L−1]. In such regions, CatBoost can achieve lower residuals by adapting flexibly to local variations, whereas the physical equation prioritizes smooth, process-consistent behavior. In contrast, in sparsely sampled or more physically complex regimes, the physical equation yields more stable and monotonic trends, while CatBoost shows larger deviations. These complementary behaviors underscore the value of combining data-driven flexibility with physically constrained formulations.

https://acp.copernicus.org/articles/26/1459/2026/acp-26-1459-2026-f10

Figure 10Sensitivity of predicted and observed aggregation rates to temperature across ICNC levels. Predictions from the CatBoost model (CatB) and a physical equation are shown for three prescribed initial ice crystal number concentrations (ICNCt0) of 101 (solid), 102 (dashed), and 103 L−1 (dash-dotted); blue and red curves correspond to CatB and the physical equation, respectively. Observations are divided into three ICNCt1 intervals: (100, 101), (101, 102), and (102, 103) L−1, and are plotted as crosses in cyan, orange, and red, respectively. The shaded region indicates the observed temperature range (7.8 to 4.7 °C).

Download

5 Conclusions

In this study, we present the in-situ quantification of ice aggregation rates in persistent supercooled stratiform clouds. This was made possible by a novel experimental design combining glaciogenic cloud seeding, which provided a controlled initial state and allowed us to determine the age of ice crystals from nucleation to observation, and IceDetectNet, a deep-learning algorithm capable of counting the number of monomers in individual ice crystals. This approach offered new insights into the microphysical and meteorological conditions that govern aggregation. The ICNCs observed during the experiments exceeded those typically found in naturally occurring stratiform clouds but were comparable to levels observed during secondary ice production events in convective clouds (Korolev et al.2020). The combination of high ICNC and the absence of secondary ice production provided a rare opportunity to isolate the contribution of aggregation without the confounding influence of ice multiplication processes and to generate observational constraints relevant to natural clouds.

We found ICNCt0, temperature, major axis length, and aspect ratio as the primary controls on aggregation rate among all the factors we investigated, with ICNCt0 emerging as the dominant factor. Even though ICNCt0 has the strongest influence, a linear model based solely on ICNCt0 failed to reproduce the observed rates, showing the importance of other factors and the nonlinearity of their interactions. While ICNCt0, temperature, and major axis length contribute positively to aggregation, aspect ratio acts as a negative factor, and riming showed no detectable effect, showing that riming and aggregation are largely independent processes. The influence of temperature operates both directly and indirectly by modifying ice crystal shape, consistent with current microphysical schemes (e.g. Seifert and Beheng2006). EDR showed no significant correlation with the aggregation rate. This finding likely reflects the combined effects of narrow size distributions and the dominant roles of ICNCt0 and temperature in these seeded clouds. Within the EDR range sampled, aggregation appears to be at most weakly sensitive to EDR, and any residual dependence is small compared with the primary controlling factors.

We demonstrated that both machine learning models and the physically derived equation successfully reproduced the observed aggregation rates. CatBoost achieved the best performance in terms of RMSE and R2 on the test dataset by capturing nonlinear interactions, whereas the physically based model proved more robust and stable in sensitivity tests, particularly in regimes with low ICNCt0, higher observational variability, and outside the observed temperature range. These differences highlight the complementary nature of data-driven and physically constrained approaches.

This study advances our understanding of the microphysical and environmental controls on ice aggregation. It emphasizes the central role of ICNCt0 and the influence of temperature-dependent ice crystal properties. The study also showed that the aggregation occurs within 5–10 min after the ice formed. However, the mechanism behind the observed sub-quadratic relationship between ICNCt0 and aggregation rate remains uncertain. One possible explanation that is consistent with our observations is that aggregation may involve smaller ice crystals. However, this remains hypothetical. Addressing these uncertainties requires experimental designs that can capture the intermediate stages of ice crystal growth and interaction. The single-point experimental setup limits our ability to capture the full evolution of microphysical processes and may obscure intermediate stages of aggregation. Future experiments would benefit from a Lagrangian observational setup to better resolve the complete aggregation pathway. This would allow us to refine the representation of aggregation in weather and climate models, ultimately improving predictions of precipitation and cloud radiative effects.

Appendix A: IceDetectNet-CLOUDLAB Uncertainties

To assess the uncertainty of IceDetectNet-CLOUDLAB, we compared its predicted monomer counts with manual annotations on a test set comprising 246 images of aggregates, totaling 359 labeled monomers. Discrepancies were calculated as the difference between predicted and annotated monomer counts. For example, if the hand label (HL) assigned two monomers while the prediction assigned one, the discrepancy was recorded as 1; if the prediction assigned three, the discrepancy was +1. Across all test images, the total discrepancy summed to 14, indicating a slight net overestimation of 14 monomers. To investigate the model's behavior across different aggregation levels, we grouped ice crystals by their hand-labelled monomer count (1 to 4) and computed the mean and standard deviation of the predicted counts within each group. Ideally, a perfect model would yield a mean equal to the true monomer count and a standard deviation of zero. The largest relative deviation occurred in the 2-monomer category, where the model overestimated the monomer count by 21.9 % (Table A1). Predictions for 3-monomer aggregates showed a slight underestimation. For the most frequent class 1-monomer ice, ice crystals – IceDetectNet-CLOUDLAB performed reliably, with a mean prediction of 1.10 and a low standard deviation of 0.35 (Table F1). These results indicate that the model is robust for monomer number counting but tends to slightly overestimate complexity in multi-monomer aggregates.

Table A1Prediction uncertainty of IceDetectNet-CLOUDLAB. Comparison between predicted (Pred) and hand-labeled (HL) monomer counts on the test set. Discrepancy (Discr.) is defined as prediction minus ground truth. Discr. (%) is normalized by HL Count.

Download Print Version | Download XLSX

Appendix B: Correlation Between Aggregation Rate and EDR

We evaluated whether turbulence intensity, represented by EDR, influenced the aggregation rate across the 21 seeding experiments. No significant correlation was found in either the warmer (r=0, p=0) or colder (r=-0, p=0) temperature regimes. Mean aggregation rates remained largely invariant across EDR bins, with substantial within-bin variability. HOLIMO imaged cloud particles within a three-dimensional volume of 11.76 cm3 at 20 Hz during seeding conditions (Fuchs et al.2025; Ramelli et al.2020, 2024), yielding an effective spatial resolution of approximately 1 m along the flight path.

Several factors may contribute to the absence of a detectable EDR signal: (1) the ice crystal size distribution in seeding experiments was relatively narrow and often dominated by specific ice habits, unlike the broader and more complex size distributions typical of natural mixed-phase clouds. This simpler distribution composition may reduce the sensitivity of aggregation processes to turbulence; and (2) the dominant influence of ICNC and temperature on aggregation rates could obscure turbulence effects, particularly in short-lived seeded clouds. A more straightforward interpretation is that aggregation is only weakly sensitive to EDR within the range of dissipation rates sampled here, and any remaining dependence is small relative to the dominant effects of ICNCt0 and temperature.

https://acp.copernicus.org/articles/26/1459/2026/acp-26-1459-2026-f11

Figure B1Correlations between aggregation rate and EDR grouped by temperature. Same as Fig. 2c, but with EDR as the x axis.

Download

https://acp.copernicus.org/articles/26/1459/2026/acp-26-1459-2026-f12

Figure B2Distribution of EDR grouped by temperature group. Top: Probability density functions of EDR for all seeding experiments, grouped by temperature: colder (T7 °C, blue) and warmer (T>7 °C, red). Bottom: Mean EDR values for individual experiments, shown as red circles (warmer) and blue squares (colder).

Download

Appendix C: Machine Learning Aggregation prediction Models

We used a total of 10 supervised regression algorithms. A general introduction to these machine learning methods can be found in Géron (2022). Below we summarize the basic principles of each model:

  • Linear Regression fits a linear function by minimizing the squared difference between predicted and observed values. It assumes independent, additive relationships among input features and cannot capture nonlinear interactions.

  • Ridge Regression extends linear regression by adding L2 regularization, which penalizes large coefficients and reduces overfitting, particularly in the presence of small sample sizes.

  • Bayesian Ridge Regression introduces a probabilistic prior over the regression coefficients and estimates them using Bayesian inference. This yields both regularized predictions and uncertainty estimates, while still assuming linearity.

  • K-Nearest Neighbors is a nonparametric method that predicts output values by averaging the labels of the k training samples closest to the test point. Similarity is typically measured by Euclidean distance in the standardized feature space. Although simple and interpretable, the method becomes less effective in high-dimensional spaces.

  • Decision Tree Regression recursively partitions the input space by choosing feature-value thresholds that minimize the prediction error (typically mean squared error) at each split. The resulting model is a set of hierarchical “if–then” rules – for example, “if ICNCt0> 800 L−1 and temperature <6 °C, then…”. At each node, the best split is selected without looking ahead. While decision trees can capture nonlinear relationships and feature interactions, they tend to overfit if not regularized.

  • Adaptive Boosting constructs an ensemble of weak learners – models that perform only slightly better than random guessing – by sequentially reweighting the training samples. After each iteration, higher weights are assigned to mispredicted samples, forcing the next learner to focus more on difficult cases. The final output is a weighted sum of all learners. While this can substantially reduce bias, AdaBoost can be sensitive to noise and outliers.

  • Gradient Boosting also builds an ensemble of decision trees, but instead of adjusting sample weights, it fits each new tree to the residuals (i.e., errors) of the combined ensemble so far. This stage-wise additive approach allows the model to incrementally improve predictions and capture complex nonlinear dependencies.

  • Light Gradient Boosting Machine (Ke et al.2017) is an optimized gradient boosting implementation. It uses histogram-based feature binning, where continuous input features are discretized into fixed-width bins to accelerate training and reduce memory usage. Unlike traditional level-wise tree growth, LightGBM grows trees leaf-wise by expanding the leaf with the highest loss, leading to deeper, more specialized trees. These optimizations make LightGBM highly efficient on large datasets.

  • Extreme Gradient Boosting enhances standard gradient boosting with L1 and L2 regularization to penalize model complexity and prevent overfitting. It also supports parallelized training and handles missing values natively during tree construction. These improvements make it robust and scalable for structured data tasks.

  • Extremely Randomized Trees is an ensemble of decision trees where both the features and split thresholds are selected randomly at each node, rather than chosen based on an optimal impurity measure. This high degree of randomness reduces variance and helps avoid overfitting, especially when the data contain noise.

Appendix D: Correlation Between Aggregation Rate and ICNC
https://acp.copernicus.org/articles/26/1459/2026/acp-26-1459-2026-f13

Figure D1ICNCt0 – aggregation rate relationships across all experiments. Same as Fig. 3, but shown individually for all 21 experiments. Red and blue markers represent warmer (T>7 °C) and colder (T7 °C) cases, respectively.

Download

Appendix E: Examples of ice crystals
https://acp.copernicus.org/articles/26/1459/2026/acp-26-1459-2026-f14

Figure E1A randomly selected sample of ice crystal images observed by HOLIMO during (a) SM054 and (b) SM069.

Download

Appendix F: Experiments overview

Table F1Summary of all seeding experiments. For each experiment (Exp.) (where “SM” denotes seeding mission, consistent with Fuchs et al.2025; Miller et al.2025), the table lists the number of 1 s data points (#Data), advection time between seeding and sampling (Time, in s), wind speed at seeding level (Wind, in m s−1), temperature (Temp, in (°C)), mean ice crystal number concentration (ICNC, in cm−3), background liquid water content (LWC, in mg m−3), cloud droplet number concentration (CDNC, in cm−3), and mean eddy dissipation rate (EDR, in 10−3 m2 s−3). ICNC, LWC, CDNC, and EDR reported values are means ± standard deviations.

Download Print Version | Download XLSX

Code and data availability

The data (https://doi.org/10.5281/zenodo.18374164, Zhang et al.2026a), code and models (https://doi.org/10.5281/zenodo.18348241, Zhang et al.2026b), are available via the links provided in the references.

Author contributions

HZ performed the scientific analysis, prepared the figures, and wrote the manuscript. HZ (SM048–SM062, except SM055 and SM056), CF (SM063–SM073), and FR (SM055, SM056, SM074, SM075) analyzed the data from the holographic imager. HZ, CF, FR, AJM, NO, RS, and JH performed the seeding experiments and in situ measurements. HZ and YC manually labeled the ice crystals used for training IceDetectNet-CLOUDLAB. HZ and XL developed the original IceDetectNet model and designed the training strategy for IceDetectNet-CLOUDLAB. HZ trained 11 machine learning models, with some hyperparameter suggestions from XL. HZ, JH, and UL designed the physical model, and HZ performed the training. ZW calculated the eddy dissipation rate. UL, JH, and FR supervised the analysis and provided scientific guidance. UL, JH, and FR conceived CLOUDLAB and secured project funding. All authors contributed to revising the manuscript and approved the final version.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. The authors bear the ultimate responsibility for providing appropriate place names. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Acknowledgements

We acknowledge financial support from the European Research Council (ERC) under the European Union's Horizon 2020 research and innovation program (grant no. 101021272). We would also like to extend our sincere gratitude to the following colleagues and collaborators: From the ETH domain, Benjamin Krummenacher for developing the deep learning algorithm to classify ice crystals as aggregated or not as part of his Bachelor thesis with HZ. From TROPOS, we thank Patric Seifert for providing the vertically pointing and scanning Doppler cloud radars and Kevin Ohneiser for maintaining the instrumentation. We are also grateful to Robert Oscar David (UiO) for many insightful scientific discussions.

Financial support

This research has been supported by the EU H2020 European Research Council (grant no. 101021272).

Review statement

This paper was edited by Greg McFarquhar and reviewed by Christopher Westbrook and one anonymous referee.

References

Chellini, G. and Kneifel, S.: Turbulence as a key driver of ice aggregation and riming in Arctic low-level mixed-phase clouds, revealed by long-term cloud radar observations, Geophysical Research Letters, 51, e2023GL106599, https://doi.org/10.1029/2023GL106599, 2024. a

Chen, J., Rösch, C., Rösch, M., Shilin, A., and Kanji, Z. A.: Critical size of silver iodide containing glaciogenic cloud seeding particles, Geophysical Research Letters, 51, e2023GL106680, https://doi.org/10.1029/2023GL106680, 2024. a

Chu, Y., Lin, G., Deng, M., and Wang, Z.: Characteristics of Eddy Dissipation Rates in Atmosphere Boundary Layer Using Doppler Lidar, Remote Sensing, 17, 1652, https://doi.org/10.3390/rs17091652, 2025. a

Connolly, P., Saunders, C., Gallagher, M., Bower, K., Flynn, M., Choularton, T., Whiteway, J., and Lawson, R.: Aircraft observations of the influence of electric fields on the aggregation of ice crystals, Quarterly Journal of the Royal Meteorological Society: A Journal of the Atmospheric Sciences, Applied Meteorology and Physical Oceanography, 131, 1695–1712, 2005. a

Connolly, P. J., Emersic, C., and Field, P. R.: A laboratory investigation into the aggregation efficiency of small ice crystals, Atmospheric Chemistry and Physics, 12, 2055–2076, https://doi.org/10.5194/acp-12-2055-2012, 2012. a, b, c, d, e, f, g

Crosier, J., Bower, K. N., Choularton, T. W., Westbrook, C. D., Connolly, P. J., Cui, Z. Q., Crawford, I. P., Capes, G. L., Coe, H., Dorsey, J. R., Williams, P. I., Illingworth, A. J., Gallagher, M. W., and Blyth, A. M.: Observations of ice multiplication in a weakly convective cell embedded in supercooled mid-level stratus, Atmospheric Chemistry and Physics, 11, 257–273, https://doi.org/10.5194/acp-11-257-2011, 2011. a

Field, P., Heymsfield, A., and Bansemer, A.: A test of ice self-collection kernels using aircraft data, Journal of the Atmospheric Sciences, 63, 651–666, 2006. a, b, c

Field, P. R. and Heymsfield, A. J.: Aggregation and scaling of ice crystal size distributions, Journal of the Atmospheric Sciences, 60, 544–560, 2003. a, b, c, d, e

Fuchs, C., Ramelli, F., Miller, A. J., Omanovic, N., Spirig, R., Zhang, H., Seifert, P., Ohneiser, K., Lohmann, U., and Henneberger, J.: Quantifying ice crystal growth rates in natural clouds from glaciogenic cloud seeding experiments, Atmospheric Chemistry and Physics, 25, 12177–12196, https://doi.org/10.5194/acp-25-12177-2025, 2025. a, b, c, d, e, f

Fugal, J. P. and Shaw, R. A.: Cloud particle size distributions measured with an airborne digital in-line holographic instrument, Atmospheric Measurement Techniques, 2, 259–271, https://doi.org/10.5194/amt-2-259-2009, 2009. a

Géron, A.: Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems, O'Reilly Media, Inc., https://www.oreilly.com/library/view/hands-on-machine-learning/9781098125967/ (last access: 27 January 2026), 2022. a

Gierens, K. M., Monier, M., and Gayet, J.-F.: The deposition coefficient and its role for cirrus clouds, Journal of Geophysical Research: Atmospheres, 108, https://doi.org/10.1029/2001JD001558, 2003. a

Gultepe, I., Isaac, G., and Cober, S.: Ice crystal number concentration versus temperature for climate studies, International Journal of Climatology: A Journal of the Royal Meteorological Society, 21, 1281–1302, 2001. a

Henneberger, J., Ramelli, F., Spirig, R., Omanovic, N., Miller, A. J., Fuchs, C., Zhang, H., Bühl, J., Hervo, M., Kanji, Z. A., Karrer, M., Seifert, A., Ori, D., and Kneifel, S.: Seeding of supercooled low stratus clouds with a UAV to study microphysical ice processes: an introduction to the CLOUDLAB project, Bulletin of the American Meteorological Society, 104, E1962–E1979, 2023. a

Heymsfield, A.: Ice crystal terminal velocities, Journal of Atmospheric Sciences, 29, 1348–1357, 1972. a

Heymsfield, A. and Willis, P.: Cloud conditions favoring secondary ice particle production in tropical maritime convection, Journal of the Atmospheric Sciences, 71, 4500–4526, 2014. a, b

Heymsfield, A. J.: Ice particle evolution in the anvil of a severe thunderstorm during CCOPE, Journal of Atmospheric Sciences, 43, 2463–2478, 1986. a, b

Heymsfield, A. J. and Miloshevich, L. M.: Parameterizations for the cross-sectional area and extinction of cirrus and stratiform ice cloud particles, Journal of the Atmospheric Sciences, 60, 936–956, 2003. a

Hobbs, P. V., Chang, S., and Locatelli, J. D.: The dimensions and aggregation of ice crystals in natural clouds, Journal of Geophysical Research, 79, 2199–2206, 1974. a, b

Hosler, C. and Hallgren, R.: The aggregation of small ice crystals, Discussions of the Faraday Society, 30, 200–207, 1960. a

Hosler, C. L., Jensen, D., and Goldshlak, L.: On the aggregation of ice crystals to form snow, Journal of Atmospheric Sciences, 14, 415–420, 1957. a, b, c, d, e

Jaffeux, L., Schwarzenböck, A., Coutris, P., and Duroure, C.: Ice crystal images from optical array probes: classification with convolutional neural networks, Atmospheric Measurement Techniques, 15, 5141–5157, https://doi.org/10.5194/amt-15-5141-2022, 2022. a

Karrer, M., Seifert, A., Ori, D., and Kneifel, S.: Improving the representation of aggregation in a two-moment microphysical scheme with statistics of multi-frequency Doppler radar observations, Atmospheric Chemistry and Physics, 21, 17133–17166, https://doi.org/10.5194/acp-21-17133-2021, 2021. a

Ke, G., Meng, Q., Finley, T., Wang, T., Chen, W., Ma, W., Ye, Q., and Liu, T.-Y.: Lightgbm: A highly efficient gradient boosting decision tree, Advances in Neural Information Processing Systems, 30, 3149–3157, 2017. a

Korolev, A.: Limitations of the Wegener–Bergeron–Findeisen mechanism in the evolution of mixed-phase clouds, Journal of the Atmospheric Sciences, 64, 3372–3375, 2007. a

Korolev, A., Isaac, G., and Hallett, J.: Ice particle habits in Arctic clouds, Geophysical Research Letters, 26, 1299–1302, 1999. a

Korolev, A., Isaac, G., and Hallett, J.: Ice particle habits in stratiform clouds, Quarterly Journal of the Royal Meteorological Society, 126, 2873–2902, 2000. a

Korolev, A., Heckman, I., Wolde, M., Ackerman, A. S., Fridlind, A. M., Ladino, L. A., Lawson, R. P., Milbrandt, J., and Williams, E.: A new look at the environmental conditions favorable to secondary ice production, Atmos. Chem. Phys., 20, 1391–1429, https://doi.org/10.5194/acp-20-1391-2020, 2020. a, b

Lamb, D. and Verlinde, J.: Physics and chemistry of clouds, Cambridge University Press, https://doi.org/10.1017/CBO9780511976377, 2011. a

Lawson, R. P.: Effects of ice particles shattering on the 2D-S probe, Atmospheric Measurement Techniques, 4, 1361–1381, https://doi.org/10.5194/amt-4-1361-2011, 2011. a

Lin, Y.-L., Farley, R. D., and Orville, H. D.: Bulk parameterization of the snow field in a cloud model, Journal of Applied Meteorology and Climatology, 22, 1065–1092, 1983. a, b, c, d

Lohmann, U. and Roeckner, E.: Design and performance of a new cloud microphysics scheme developed for the ECHAM general circulation model, Climate Dynamics, 12, 557–572, 1996. a

Lundberg, S. M. and Lee, S.-I.: A unified approach to interpreting model predictions, Advances in Neural Information Processing Systems, 30, 4768–4777, 2017. a

McFarquhar, G. M., Um, J., Freer, M., Baumgardner, D., Kok, G. L., and Mace, G.: Importance of small ice crystals to cirrus properties: Observations from the Tropical Warm Pool International Cloud Experiment (TWP-ICE), Geophysical Research Letters, 34, https://doi.org/10.1029/2007GL029865, 2007. a

Melzak, Z.: A scalar transport equation, Transactions of the American Mathematical Society, 85, 547–560, 1957. a, b

Miller, A. J., Ramelli, F., Fuchs, C., Omanovic, N., Spirig, R., Zhang, H., Lohmann, U., Kanji, Z. A., and Henneberger, J.: Two new multirotor uncrewed aerial vehicles (UAVs) for glaciogenic cloud seeding and aerosol measurements within the CLOUDLAB project, Atmospheric Measurement Techniques, 17, 601–625, https://doi.org/10.5194/amt-17-601-2024, 2024 a, b

Miller, A. J., Fuchs, C., Ramelli, F., Zhang, H., Omanovic, N., Spirig, R., Marcolli, C., Kanji, Z. A., Lohmann, U., and Henneberger, J.: Quantified ice-nucleating ability of AgI-containing seeding particles in natural clouds, Atmospheric Chemistry and Physics, 25, 5387–5407, https://doi.org/10.5194/acp-25-5387-2025, 2025. a, b, c

Mitchell, D. L.: Use of Mass- and Area-Dimensional Power Laws for Determining Precipitation Particle Terminal Velocities, Journal of Atmospheric Sciences, 53, 1710–1723, https://doi.org/10.1175/1520-0469(1996)053<1710:UOMAAD>2.0.CO;2, 1996. a

Morrison, H. and Milbrandt, J. A.: Parameterization of cloud microphysics based on the prediction of bulk ice particle properties. Part I: Scheme description and idealized tests, Journal of the Atmospheric Sciences, 72, 287–311, 2015. a, b

Moss, S. and Johnson, D.: Aircraft measurements to validate and improve numerical model parametrisations of ice to water ratios in clouds, Atmospheric Research, 34, 1–25, 1994. a

Pearl, J.: Causality, Cambridge University Press, https://doi.org/10.1017/CBO9780511803161, 2009. a

Peters, J., Bühlmann, P., and Meinshausen, N.: Causal inference by using invariant prediction: identification and confidence intervals, Journal of the Royal Statistical Society Series B: Statistical Methodology, 78, 947–1012, 2016. a

Ramelli, F., Beck, A., Henneberger, J., and Lohmann, U.: Using a holographic imager on a tethered balloon system for microphysical observations of boundary layer clouds, Atmospheric Measurement Techniques, 13, 925–939, https://doi.org/10.5194/amt-13-925-2020, 2020. a, b

Ramelli, F., Henneberger, J., Fuchs, C., Miller, A. J., Omanovic, N., Spirig, R., Zhang, H., David, R. O., Ohneiser, K., Seifert, P., and Lohmann, U.: Repurposing weather modification for cloud research showcased by ice crystal growth, PNAS Nexus, 3, pgae402, https://doi.org/10.1093/pnasnexus/pgae402, 2024. a, b

Scherrer, S. C. and Appenzeller, C.: Fog and low stratus over the Swiss Plateau-a climatological study, International Journal of Climatology, 34, 678–686, 2014. a

Seifert, A. and Beheng, K. D.: A two-moment cloud microphysics parameterization for mixed-phase clouds. Part 1: Model description, Meteorology and Atmospheric Physics, 92, 45–66, 2006. a, b, c, d, e, f

Shaw, R. A., Cantrell, W., Chen, S., and Chuang, P.: Cloud–aerosol–turbulence interactions: Science priorities and concepts for a large-scale laboratory facility, Bulletin of the American Meteorological Society, 101, E1026–E1035, 2020. a

Sheikh, M., Gustavsson, K., Lévêque, E., Mehlig, B., Pumir, A., and Naso, A.: Colliding ice crystals in turbulent clouds, Journal of the Atmospheric Sciences, 79, 2205–2218, 2022. a

Sölch, I. and Kärcher, B.: Process-oriented large-eddy simulations of a midlatitude cirrus cloud system based on observations, Quarterly Journal of the Royal Meteorological Society, 137, 374–393, 2011. a, b

Touloupas, G., Lauber, A., Henneberger, J., Beck, A., and Lucchi, A.: A convolutional neural network for classifying cloud particles recorded by imaging probes, Atmospheric Measurement Techniques, 13, 2219–2239, https://doi.org/10.5194/amt-13-2219-2020, 2020. a

Wu, Z., Seifert, P., He, Y., Baars, H., Li, H., Jimenez, C., Li, C., and Ansmann, A.: Assessment of horizontally oriented ice crystals with a combination of multiangle polarization lidar and cloud Doppler radar, Atmospheric Measurement Techniques, 18, 3611–3634, https://doi.org/10.5194/amt-18-3611-2025, 2025. a

Zamorsky, A.: Ice in the Atmosphere, Akademia Nauk SSSR, 1955. a

Zhang, H., Li, X., Ramelli, F., David, R. O., Pasquier, J., and Henneberger, J.: IceDetectNet: a rotated object detection algorithm for classifying components of aggregated ice crystals with a multi-label classification scheme, Atmospheric Measurement Techniques, 17, 7109–7128, https://doi.org/10.5194/amt-17-7109-2024, 2024. a, b, c, d, e, f

Zhang, H., Ramelli, F., Fuchs, C., Omanovic, N., Miller, A. J., Spirig, R., Wu, Z., Yunpei, C., Li, X., Lohmann, U., and Henneberger, J.: Data for publication “Inferring the Controlling Factors of Ice Aggregation from Targeted Cloud Seeding Experiments”, Zenodo [data set], https://doi.org/10.5281/zenodo.18374164, 2026a. a

Zhang, H., Ramelli, F., Fuchs, C., Omanovic, N., Miller, A. J., Spirig, R., Wu, Z., Yunpei, C., Li, X., Lohmann, U., and Henneberger, J.: Model and Code for publication “Inferring the Controlling Factors of Ice Aggregation from Targeted Cloud Seeding Experiments”, Zenodo [code], https://doi.org/10.5281/zenodo.18348241, 2026b. a

Download
Short summary
Ice crystals in clouds aggregate, shaping snow and rain, yet rates are hard to measure. Using cloud seeding, we sampled crystals downwind after known times. A deep-learning algorithm quantified aggregation by counting crystal components. Initial ice concentration was the main driver, confirmed by causal analysis, physics, and machine learning, though weaker than theory predicts. Temperature, size, and shape also mattered, while turbulence was negligible.
Share
Altmetrics
Final-revised paper
Preprint