Observations of highly oxidized molecules and particle nucleation in the atmosphere of Beijing

. Particle nucleation is one of the main sources of atmospheric particulate matter by number, with new particles having great relevance for human health and climate. Highly oxidized multifunctional organic molecules (HOMs) have been recently identiﬁed as key constituents in the growth and, sometimes, in initial formation of new particles. While there have been many studies of HOMs in atmospheric chambers, ﬂow tubes, and clean environments, analyses of data from polluted environments are scarce. Here, measurements of HOMs and particle size distributions down to small molecular clusters are presented alongside volatile organic compounds (VOCs) and trace-gas data from a campaign in June 2017, in Beijing. Many gas-phase HOMs have been characterized and their temporal trends and behaviours analysed in the context of new particle formation. The HOMs identiﬁed have a degree of oxidation comparable to that seen in other, cleaner, environments, likely due to an interplay between the higher temperatures facilitating rapid hydrogen abstractions and the higher concentrations of NO x and other RO q 2 termi-nators ending the autoxidation sequence more rapidly. Our data indicate that alkylbenzenes, monoterpenes, and isoprene are important precursor VOCs for HOMs in Beijing. Many of the C 5 and C 10 compounds derived from isoprene and monoterpenes have a slightly greater degree of average oxidation state of carbon compared to those from other precursors. Most HOMs except for large dimers have daytime peak of and coincides acid vapours, suggesting that the nucleation process is sulfuric-acid-dependent, with HOMs contributing to subsequent particle growth.

Abstract. Particle nucleation is one of the main sources of atmospheric particulate matter by number, with new particles having great relevance for human health and climate. Highly oxidized multifunctional organic molecules (HOMs) have been recently identified as key constituents in the growth and, sometimes, in initial formation of new particles. While there have been many studies of HOMs in atmospheric chambers, flow tubes, and clean environments, analyses of data from polluted environments are scarce. Here, measurements of HOMs and particle size distributions down to small molecular clusters are presented alongside volatile organic compounds (VOCs) and trace-gas data from a campaign in June 2017, in Beijing. Many gas-phase HOMs have been characterized and their temporal trends and behaviours analysed in the context of new particle formation. The HOMs identified have a degree of oxidation comparable to that seen in other, cleaner, environments, likely due to an interplay between the higher temperatures facilitating rapid hydrogen abstractions and the higher concentrations of NO x and other RO q 2 terminators ending the autoxidation sequence more rapidly. Our data indicate that alkylbenzenes, monoterpenes, and isoprene are important precursor VOCs for HOMs in Beijing. Many of the C 5 and C 10 compounds derived from isoprene and monoterpenes have a slightly greater degree of average oxidation state of carbon compared to those from other precursors. Most HOMs except for large dimers have daytime peak concentrations, indicating the importance of OH q chemistry in the formation of HOMs, as O 3 tends to be lower on days with higher HOM concentrations; similarly, VOC concentrations are lower on the days with higher HOM concentrations. The daytime peaks of HOMs coincide with the growth of freshly formed new particles, and their initial formation coincides with the peak in sulfuric acid vapours, suggesting that the nucleation process is sulfuric-acid-dependent, with HOMs contributing to subsequent particle growth.

Introduction
Atmospheric particle nucleation, or the formation of solid or liquid particles from vapour-phase precursors, is one of the dominant sources of global aerosol by number, with primary emissions typically dominating the mass loadings (Tomasi et al., 2017). New particle formation (NPF) or the secondary formation of fresh particles is a two-step process comprising initial homogeneous nucleation of thermodynamically stable clusters and their subsequent growth. The rate of growth needs be fast enough to outcompete the loss of these particles by coagulation and condensation processes in order for the new particles to grow, and hence NPF is a function of the competition between source and sink (Gong et al., 2010). New particle formation has been shown to occur across a J. Brean et al.: Observations of highly oxidized molecules and particle nucleation wide range of environments (Kulmala et al., 2005). The high particle load in urban environments was thought to suppress new particle formation until measurements in the early 2000s (McMurry et al., 2000;Shi et al., 2001;Alam et al., 2003), with frequent occurrences observed even in the most polluted urban centres. NPF events in Beijing occur on about 40 % of days annually, with the highest rates in the spring (Wu et al., 2007(Wu et al., , 2008Wang et al., 2016). Chu et al. (2019) review many studies of NPF which have taken place in China and highlight the need for long-term observations and mechanistic studies.
NPF can lead to the production of cloud condensation nuclei (CCN) (Wiedensohler et al., 2009;Yu and Luo, 2009;Yue et al., 2011;Kerminen et al., 2012), which influences the radiative atmospheric forcing (Penner et al., 2011). A high particle count, such as that caused by nucleation events, has been shown to precede haze events in environments such as Beijing (Guo et al., 2014). These events are detrimental to health and quality of life. The sub-100 nm fraction of particles to which new particle formation contributes is often referred to as the ultrafine fraction. Ultrafine particles (UFPs) pose risks to human health due to their high number concentration. UFPs exhibit gas-like behaviour and enter all parts of the lung before penetrating the bloodstream (Miller et al., 2017). They can initiate inflammation via oxidative stress responses, progressing conditions such as atherosclerosis and initiating cardiovascular responses such as hypertension and myocardial infarction (Delfino et al., 2005;Brook et al., 2010).
Highly oxidized multifunctional molecules (HOMs), organic molecules with O : C ratios > 0.6, are the result of atmospheric autoxidation and have recently been subject to much investigation, in part because the extremely low volatilities arising from their high O : C ratios favour their condensation into the particulate phase. HOMs are most well characterized as the product of oxidation of the biogenic monoterpenoid compound α-pinene (Riccobono et al., 2014;Tröstl et al., 2016;Bianchi et al., 2017). Although globally biogenic volatile organic compound (BVOC) concentrations far exceed anthropogenic volatile organic compound (AVOC) concentrations, in the urban environment the anthropogenic fraction is far more significant. Formation of HOMs from aromatic compounds has been demonstrated in laboratory studies and these have been hypothesized to be large drivers of NPF in urban environments Molteni et al., 2018;Qi et al., 2018). The formation of HOMs through autoxidation processes begins with the reaction of VOCs with OH q , O 3 , or NO q 3 ; formation of a peroxy radical (RO q 2 ) is followed by rapid O 2 additions and intramolecular hydrogen abstractions Rissanen et al., 2014;Kurtén et al., 2015). Furthermore, generation of oligomers from stabilized Criegee intermediates arising from short-chain alkenes has been hypothesized as a contributor of extremely low-volatility organic compounds (ELVOCs) and low-volatility organic compounds (LVOCs) (Zhao et al., 2015). The low volatilities of these molecules arise from their numerous oxygen-containing functionalities, and this allows them to make a significant contribution to early stage particle growth where other species cannot due to the Kelvin effect (Tröstl et al., 2016), although the contribution of HOMs to the initial molecular clusters is still debated Elm et al., 2017;Myllys et al., 2017).
Recent technological advances have facilitated insights into the very first steps of nucleation, which were previously unseen, with mass spectrometric techniques such as the atmospheric-pressure-interface time-of-flight mass spectrometer (APi-ToF-MS) and its chemical ionization counterpart (CI-APi-ToF-MS) allowing for high-mass and hightime-resolution measurements of low-volatility compounds and molecular clusters. Diethylene glycol-based particle counters, such as the particle size magnifier (PSM), allow for measurements of particle size distributions down to the smallest molecular clusters nearing 1 nm. Recent chamber studies have elucidated the contribution of individual species to particle nucleation, ammonia, and amines, greatly enhancing the rate of sulfuric acid nucleation (Kirkby et al., 2011;Almeida et al., 2013). In these studies, HOMs have been identified, formed through autoxidation mechanisms Riccobono et al., 2014;. These are key to early particle growth (Tröstl et al., 2016) and can nucleate even in the absence of sulfuric acid in chambers  and in the free troposphere (Rose et al., 2018). In this paper, we report the results of HOM and particle size measurements during a summer campaign in Beijing, China.

Sampling site
Sampling was performed as part of the Air Pollution and Human Health in a Developing Megacity (APHH-Beijing) campaign, a large international collaborative project examining emissions, processes, and health effects of air pollution. For a comprehensive overview of the programme, see Shi et al. (2019). All sampling was conducted across a 1-month period at the Institute for Atmospheric Physics (IAP), Chinese Academy of Sciences, Beijing (39 • 58.53 ′ N, 116 • 22.69 ′ E). The sampling was conducted from a shipping container, with sampling inlets 1-2 m above ground level, the nearest road being 30 m away. Meteorological parameters (wind speed, wind direction, relative humidity (RH), and temperature) were measured at the IAP meteorological tower, 20 m away from the sampling site, 30 m from the nearest road at a height of 120 m. Data were continuously taken from the CI-APi-ToF-MS during a 2-week period, but due to data losses only 5 d of data are presented here. Particle size distribution measurements were taken during a 33 d period from 24 May to 26 June 2017.

Chemical ionization atmospheric-pressure-interface time-of-flight mass spectrometry
The Aerodyne nitrate chemical ionization atmosphericpressure-interface time-of-flight mass spectrometer (CI-APi-ToF-MS) was used to make measurements of neutral oxidized organic compounds, sulfuric acid, and their molecular clusters at high time resolution with high resolving power.
The ionization system charges molecules by adduct formation, such as in the case of organic compounds with two or more hydrogen bond donor groups , or proton transfer in the case of strong acids like sulfuric acid. Hydroxyl or hydroperoxyl functionalities are both common hydrogen-bond-donating groups, with hydroperoxyl being the more efficient hydrogen bond donor (Møller et al., 2017). This instrument has been explained in great detail elsewhere (Junninen et al., 2010;Jokinen et al., 2012), but briefly the front end consists of a chemical ionization system where a 10 L min −1 sample flow is drawn in through the 1 m long 1 in. OD stainless-steel tubing opening. A secondary flow was run parallel and concentric to this sample flow, rendering the reaction chamber effectively wall-less. A 3 cm 3 min −1 flow of a carrier gas (N 2 ) is passed over a reservoir of liquid HNO 3 , entraining vapour, which is subsequently ionized to NO − 3 via an X-ray source. This flow is then guided into the sample flow. The nitrate ions will then charge molecules by either clustering or proton transfer. The mixed flows travelling at 10 L min −1 enter the critical orifice at the front end of the instrument at 0.8 L min −1 and are guided through a series of differentially pumped chambers before reaching the ToF analyser. Two of these chambers contain quadrupoles, which can be used to select greater sensitivity for certain mass ranges, and the voltages across each individual chamber can be tuned to maximize sensitivity and resolution for ions of interest. Mass spectra are taken at a frequency of 20 kHz but are recorded at a rate of 1 Hz. All data analysis was carried out in the Tofware package in Igor Pro 6 (Tofwerk AG, Switzerland). A seven-point mass calibration was performed for every minute of data, and all data were normalized to signal at 62, 80, and 125 m/Q to account for fluctuations in ion signal, these masses representing NO − 3 , H 2 ONO − 3 , and HNO 3 NO − 3 respectively. Typical values for calibration coefficients range from 10 9 to 10 10 molec. cm −3 from these normalized data (Kürten et al., 2012), producing peak sulfuric acid concentrations in the range of 10 6 molec. cm −3 . From the very limited periods with simultaneous data for SO 2 , OH radical, and condensation sink, it was possible to calculate H 2 SO 4 concentrations of 10 3 to 10 5 molec. cm −3 , in which range the calibration constant was 7.0±1.6×10 8 cm −3 , which fits well with that expected for this concentration range (Kürten et al., 2012). The nitrate-water cluster is included as the presence of many nitrate-water clusters of the general formula (H 2 O) x (HNO 3 ) y NO − 3 were found, where x = (1, 2, 3, . . .20) and y = (0, 1). No sensitivity calibration was performed for these measurements, and so all values are reported in normalized signal intensity. Due to the high resolving power of the CI-APi-ToF-MS system (mass resolving power of 3500 m/ m and mass accuracy of 20 ppm at 288 m/Q; resolving power is measured as the mass/charge, termed m divided by the peak width at its half maximum, dubbed m), multiple peaks can be fit at the same unit mass and their molecular formulae assigned. These peaks follow the general formula C x H y O z N w , where x = 2-20, y = 2-32, z = 4-16, and w = 0-2, spanning from small organic acids like oxalic and malonic acid through to large dimers of oxidized monoterpene RO q 2 radicals such as C 20 H 31 O 9 N. Beyond 500 m/Q, peak fitting and assignment of compositions becomes problematic as signal decreases, mass accuracy decreases, and the total number of chemical compositions increases, so peaks above the C 20 region have not been assigned, and a number of peaks have been unassigned due to this uncertainty (Cubison and Jimenez, 2015). As proton transfer mostly happens with acids, and nearly all HOM molecules will be charged by adduct formation, it is possible to infer the uncharged formula; therefore all HOMs from here onwards will be listed as their uncharged form.

Size distribution measurements
Two scanning mobility particle sizer (SMPS) instruments measured particle size distributions at 15 min time resolution, with one long SMPS (TSI 3080 EC, 3082 long DMA, 3775 CPC, TSI, USA) and one nano SMPS (3082 EC, 3082 nano DMA, 3776 CPC, TSI, USA) measuring the ranges 14-615 and 4-65 nm respectively. A particle size magnifier (A10, Airmodus, FN) linked to a CPC (3775, TSI, USA) measured the sub-3 nm size fraction. The PSM was run in stepping mode, operating at four different saturator flows to vary the lowest size cut-off of particles that it will grow (this cut-off is technically a point of 50 % detection efficiency) of < 1.30, 1.36, 1.67, and 2.01 nm. The instrument switched between saturator flows per 2.5 min, giving a sub-2.01 nm size distribution every 10 min. The data were treated with a moving-average filter to account for jumps in total particle count, and due to the similar behaviour of the two upper and two lower size cuts, these have been averaged to two size cuts at 1.30 and 1.84 nm.

Calculations
The condensation sink (CS) was calculated from the size distribution data as follows: where D is the diffusion coefficient of the diffusing vapour (assumed sulfuric acid), β m is a transition regime correction , d ′ p is particle diameter, and N d ′ p is the number of particles at diameter d ′ p .

Other measurements
Measurements of the classical air pollutants were measured at the same site and have been reported in the campaign overview paper (Shi et al., 2019). SO 2 was measured using a 43i SO 2 analyser (Thermo Fisher Scientific, USA), O 3 with a 49i O 3 analyser (Thermo Fisher Scientific, USA), and NO x with a 42i-TL trace NO x analyser (Thermo Fisher Scientific, USA) and a T500U CAPS NO 2 analyser (Teledyne API, USA). VOC mixing ratios were measured using a proton-transfer-reaction time-of-flight mass spectrometer (PTR-ToF-MS 2000, Ionicon, Austria).
3 Results and discussion

Characteristics of sampling period
A total of 5 d of CI-API-ToF-MS data were collected successfully, from 21 June 2017 midday through 26 June 2017 midday. New particle formation events were observed on 24 June in the late afternoon and 25 June at midday. Some nighttime formation of molecular clusters was seen earlier in the campaign, as were several peaks in the 1.5-100 nm size range, likely from pollutant plumes containing freshly nucleating condensable materials. The trace gases, O 3 , SO 2 , NO, and NO 2 , are plotted in Fig. S1 in the Supplement. O 3 shows mid-afternoon peaks, around ∼ 120 ppb on the first 2 d of the campaign and 50-70 ppb for the later days. SO 2 shows a large peak, reaching 4 ppb on 22 June but < 1 ppb for the rest of campaign. NO shows strong mid-morning rush-hourrelated peaks, declining towards midday due to being rapidly consumed by O 3 . NO 2 shows large traffic-related peaks. The sulfuric acid signal across this period as measured by NO − 3 CI-APi-ToF-MS showed strong midday peaks, with the highest signal on 24 and 25 June 2017. The meteorological data are shown in Fig. S2 alongside condensation sink (CS). The conditions were generally warm and humid, with temperature reaching its maximum on 25 June 2017, with a peak hourly temperature of 31 • C. High temperatures were also seen on 21 and 24 June, 30 and 26 • C respectively.
3.2 Gas-phase HOM chemistry

Bulk chemical properties
For the peaks that have had chemical formulae assigned, oxidation state of carbon, or OS c , can be used to describe their bulk oxidation chemistry. OS c is defined as (Kroll et al., 2011) This does not account for the presence of nitrate ester groups, which has been accounted for previously by subtracting 5 times the N : C ratio , under the assumption that all nitrogen-containing functionality is in the form of nitrate ester (RONO 2 ) groups. In Beijing, multiple sources of nitrate-containing organic compounds are seen, in the forms of amines, nitriles, and heterocycles. The variation in oxidation state with carbon number (C n ) without correction for nitrate esters is plotted in Fig. 1. The average oxidation state of carbon in this dataset tends to decrease with an increase in C n , highest where C n = 5, attributable to both high O : C and peak area, for the peak assigned to C 5 H 10 N 2 O 8 at m/Q 288. C n = 5 also shows the greatest distribution of oxidation states, likely due to the high ambient concentration of isoprene and therefore its many oxidation products being of high enough signal for many well-resolved peaks to be seen in this dataset. It is worth noting that some of the ions plotted here may not form through peroxy radical autoxidation, such as C 5 H 10 N 2 O 8 , which may be a second-generation oxidation product of isoprene under high NO x (Lee et al., 2016). C n = 10 and 15 also see a small increase in average oxidation number compared to their neighbours. The lower oxidation state of the larger products is likely a function of two things. First and foremost, any autoxidation mechanism must undergo more steps in order for a larger molecule to reach an O : C ratio equivalent to that of a smaller one, and the equivalent O : C ratio is ultimately less likely to be reached before the radical is terminated . Secondly, the lower vapour pressures of these larger products will lead to their partitioning into the condensed phase more readily than the smaller; thus they are more rapidly lost (Mutzel et al., 2015). The degrees of OSc observed here are similar to those seen in other environments such as during the SOAS campaign in 2013 in the southern United States, characterized by low NO/NO 2 and high temperatures, where campaign averages of 0.3 ppb, 0.4-0.5 ppb, and 25 • C respectively were measured, although an additional parameter to account for nitrogen-containing VOCs is included in the calculation . The OS c observed in Beijing is also higher than that seen in the boreal forest environment of Hyytiälä, despite extremely low NO x concentrations, likely due to low temperature conditions dominating in those conditions . These degrees of oxidation relatively similar to those seen in other, cleaner environments are likely due to an interplay between the higher temperatures facilitating rapid hydrogen abstractions (Crounse et al., 2013;Quéléver et al., 2019) and the higher concentrations of NO x , HO q 2 , and other RO q 2 molecules terminating the autoxidation sequence more efficiently (Praske et al., 2018;Rissanen, 2018;Garmash et al., 2019).
A mass defect plot is shown in Fig. 2, which shows nominal mass plotted against mass defect for all peaks in this dataset. Mass defect is defined as the ion mass minus integer mass. This is shown for two separate daytime periods, one where nucleation was not occurring and HOM concentrations are lower (10:30-12:00 CST 23 June 2017) and one where nucleation was occurring under high HOM concentrations (10:30-12:00 CST 25 June 2017). The band of Figure 1. Oxidation state of carbon calculated as 2 times the oxygen-to-carbon ratio minus the hydrogen-to-carbon ratio against carbon number for (coloured) individual ions and (blue circles) signal-weighted average for each carbon number. Area and colour are both proportional to the peak area for each ion. lower mass defect is characterized by a number of large peaks with high signal, for example, at m/Q 436 the ion (C 2 H 7 N) 2 (H 2 SO 4 ) 2 HSO − 4 . The upper component of the mass defect is dominated by organic compounds, and the upper end of the more positive mass defect is occupied by molecules with more 1 H (mass defect 7.825 mDa) and 14 N (mass defect 3.074 mDa). The end of the less positive mass defect has lower 1 H and more 16 O (mass defect −5.085 mDa); alternatively put, the mass defect reflects the variation in OS c . The organic components with more positive mass defects will be more volatile than their lower mass defect counterparts as they will contain fewer oxygen functionalities (Tröstl et al., 2016;Stolzenburg et al., 2018). These higher-volatility products may still contribute to larger size particle growth. The more negative mass defect components will be those of greater O : C and therefore lower volatility, LVOCs, and the yet larger and more oxidized components, ELVOCs (Tröstl et al., 2016). During the nucleation period, the signal intensity for the species in the upper band of more negative mass defect have the most marked increase in concentration, with significantly less difference > 500 m/Q. This region 200-400 m/Q will contain most of the > C 5 monomer HOMs seen in this dataset.

Diurnal trends of HOMs
Temporal trends of HOMs in the urban atmosphere can reveal their sources and behaviour in the atmosphere. Most of the HOM species peak in the daytime. These species all follow a similar diurnal trend, as shown in Fig. 3. The concentrations of both O 3 and OH q are high during the summer period in Beijing (although the nitrate chemical ionization technique is not sensitive to all OH q oxidation products; Berndt et al., 2015). Figure S1 shows the time series of concentrations of NO, which is considered a dominant peroxy radical terminator of particular importance in the polluted urban environment (Khan et al., 2015). Radicals such as HO q 2 and RO q 2 also typically peak during daytime. The HOM components peaking in the daytime are presumed to be the oxidation products of a mixture of anthropogenic and biogenic components, such as alkylbenzenes, monoterpenes, and isoprene. The oxidation of monoterpenes, specifically the monoterpene α-pinene, has been the subject of extensive study recently, with the O 3 -initiated autoxidation sequence being the best characterized Jokinen et al., 2014;Kurtén et al., 2015;Kirkby et al., 2016); ozonolysis of α-pinene opens the ring structure and produces a RO q 2 radical . In the case of aromatics, OH q addition to the ring and the subsequently formed bicyclic peroxy radical are the basis for the autoxidation of compounds such as xylenes and trimethylbenzenes .
The identified compounds have been roughly separated into several categories, each of these plotted in Fig. 3. Figure 3a shows the separation of components into nonnitrogen-containing HOMs and nitrogen-containing HOMs, or organonitrates (ONs). The ON signal is much higher than that of the HOM, attributable in part to a few ions of high signal, such as the isoprene organonitrate C 5 H 10 N 2 O 8 . A few similar structural formulae are seen (C 5 H 10 N 2 O 6 , C 5 H 11 NO 6 , C 5 H 11 NO 7 , etc.), some of which have been identified as important gas-phase oxidation products of isoprene under high-NO x conditions (Xiong et al., 2015), and their contribution to secondary organic aerosol (SOA) has been explored previously (Lee et al., 2016). A high nitrophenol signal is also seen, C 6 H 5 NO 3 . The signal for HOM compounds is less dominated by a few large ions. The prevalence of ON compounds points towards the important role of NO x as a peroxy radical terminator, with the probability of the RO q 2 + NO x reaction producing nitrate ester compounds increasing with the size of the RO q 2 molecule (Atkinson et al., 1982). The NO x concentrations in urban Beijing are approximately a factor of 10 higher than seen at the Hyytiälä station in Finland as reported by Yan et al. (2016), and hence it is expected to be a more significant peroxy radical terminator.
Despite the very large fluxes of anthropogenic organic pollutants in Beijing, biogenic emissions are still an important source of reactive VOCs in the city, with abundant isoprene oxidation products observed (see above), as well as monoter- pene monomers (C 10 H 16 O 9 , C 10 H 15 O 9 N) and some dimer products (C 20 H 30 O 11 , C 20 H 31 O 11 N). The time series of the signals of all C 5 , C 10 , and C 20 molecules is plotted in Fig. 3b, with C 5 species assumed to be isoprene-dominated and C 10 and C 20 assumed to be monoterpene-dominated. Signals for isoprene oxidation products are higher, with abundant isoprene nitrate and dinitrate products. C 10 products show similar behaviour, with, for example, several C 10 H 15 O x N x = 5-9 compounds seen. The C 20 signal intensities are low and follow the general formula C 20 H x O y N z , where x = 26-32, y = 7-11, and z = 0-2; in Fig. 3 the signal for C 20 compounds has been multiplied by a factor of 50 for visibility. The low signals reflect the lack of RO q 2 cross reactions necessary for the production of these accretion products.
Other identified peaks are plotted in Fig. 3c. The C 2 -C 4 components are summed together, these being small organic acids such as malonic acid and oxalic acid, as well as products such as C 4 H 7 O 6 N. Malonic acid is the most prominent here, seen as both an NO − 3 adduct (C 3 H 4 O 4 NO − 3 ) and a proton transfer product (C 3 H 3 O − 4 ) at a ratio of around 2 : 3. Measurements of particle-phase dicarboxylic acids in cities typically show greater concentrations of oxalic acid than malonic acid (Ho et al., 2010), and these acids are primarily produced in the aqueous phase (Bikkina et al., 2014). Primary sources of dicarboxylic acid include fossil fuel combustion (Kawamura and Kaplan, 1987) and biomass burning (Narukawa et al., 1999), which are both plentiful in urban Beijing. The C 6 -C 9 components are assumed to be dominated by oxidation products of alkylbenzenes such as C 8 H 12 O 5 , although fragments of other compounds, i.e. monoterpenes, can also oc-cupy this region (Isaacman-Vanwertz et al., 2018). It is assumed the majority of the signal for these peaks come from alkylbenzenes. This assumption is supported by the relative signal intensity ratios of the oxygen numbers of monomer C 8 H 12 O n compounds being similar to those seen for xylene oxidation products in previous work . The largest fraction, C 11 through C 18 , includes the larger compounds, oxidation products of larger aromatics, or products of the cross reaction of smaller RO q 2 radicals. Here they are grouped without more sophisticated disaggregation as they all follow much the same time series, with species such as C 11 H 11 O 8 N following the same temporal trends as C 15 H 16 O 9 and C 16 H 24 O 12 .
Nearly all ions with the exception of the larger compounds attributed to the cross reaction of C 10 monomers follow similar temporal patterns, with the majority of peaks occurring in the daytime. This reflects the importance of the concentration of atmospheric oxidants. Some selected oxidation products are plotted against their precursor VOCs in Fig. 4. The concentration of isoprene is plotted against the signal of a nitrate HOM product, C 5 H 9 NO 6 (Xiong et al., 2015;Lee et al., 2016), while monoterpenes are plotted against C 10 H 16 O 9 Berndt et al., 2016;Yan et al., 2016;Kirkby et al., 2016;Massoli et al., 2018) and C 2 benzenes against C 8 H 12 O 6 Wang et al., 2017). The first half of the time series shows little correlation between the VOC species and the resultant oxidation products, while isoprene, monoterpenes, and C 2 benzenes follow their usual diurnal cycles, with isoprene having the most distinct cycle with a strong midday peak. The last 2 d, however, show sim- Figure 3. Summed time series of the normalized signals of (a) all non-nitrogen-containing HOMs and all organonitrates identified; (b) C 5 , C 10 , and C 20 components, assumed to be dominated by isoprene, monoterpene monomer, and monoterpene dimers, and signal for C 20 multiplied 50 times to fit scale; and (c) summed C 6 -C 9 components and summed C 11 -C 18 components, assumed to be dominated by alkylbenzenes and other larger components respectively. ilar and coinciding peaks in both the VOCs and HOMs -HOMs show afternoon peaks on both days and an initial shelf on the final half day. The C 5 H 9 NO 6 peak follows some of the peaks of the isoprene, but not all (e.g. morning shelf of isoprene on 24 June). Concentrations of isoprene do not seem to determine directly the signal of HOM, as the day with the lowest isoprene of all is the day with the highest C 5 H 9 NO 6 . The C 10 H 16 O 9 trace also has coincidental peaks with the monoterpene trace, including two 4 h separated simultaneous peaks on 25 June. The peaks in the concentrations of C 2 benzenes are nearly synchronous with the peaks in C 8 H 12 O 6 , for which the data exhibit a strong mid-afternoon peak likely due to the lack of an efficient ozonolysis reaction pathway; the main oxidant of C 2 benzenes is the OH q radical. Trends of both C 3 benzenes and their HOMs are much the same as C 2 benzenes as discussed above, pointing to similar sources and oxidation chemistries. The concentration of precursor VOC is likely a driving force in the identity and quantity of various HOM products, but not the sole determinant, as while there are simultaneous peaks of VOCs and HOMs, both the condensation sink and oxidant concentrations also influence HOM product signals.
The first half of campaign measurements are marked by an episode of low HOM signals. A diurnal cycle still exists but it is weak. The radiation intensity was significantly lower on these prior days than it was on 24 June. No data are available for the final period of measurement. Ozone is higher on the prior measurement days with lower HOM signals (see Fig. S1). Little agreement is seen between VOC concentration and HOM signals on these days. The condensational sinks are roughly similar to those on days of higher HOM concentrations, but temperature and solar radi- ation are much lower. HOM formation is largely dependent upon VOC concentration, oxidant concentration (which will be lower if solar radiation is lower, especially in the case of OH q , the main oxidant of aromatic species especially), and temperature (as H shifts are highly temperature-dependent) (Quéléver et al., 2019), as well as losses by RO q 2 termination before a molecule can become HOM and losses to condensational sink. The low HOM concentration is likely due to these lower temperatures and weaker solar radiation not facilitating HOM formation.
The C 20 compounds plotted in Fig. 3b show no strong diurnal sequence, contrasting with other HOMs. We can presume that all C 20 compounds identified are the result of the reaction of two monoterpenoid C 10 RO q 2 radicals, a reasonable assumption as all identified C 20 species follow the general formula outlined for these reactions (C 20 H 28-32 O 6-16 ).
The formation of C 20 dimers is dependent upon two processes, initial oxidation of monoterpenes and RO 2 -RO 2 termination. Initial oxidation is contingent upon oxidant concentration, which is highest in the daytime, and RO q 2 -RO q 2 termination is contingent upon the probability of the molecular collision between the RO q 2 molecules occurring before other radical termination (i.e. RO q 2 -NO x or RO q 2 -HO q 2 ). There is likely a strong diurnal sequence in the dominant RO q 2 termination mechanisms across the daytime period, and the combination of the two factors discussed above results in there being no strong diurnal trend in these molecules. A lower oxidant concentration at night results in fewer RO q 2 molecules, but less NO and HO q 2 results in a greater chance for those RO q 2 molecules to dimerise (Rissanen, 2018;Garmash et al., 2019). As the levels of NO x in Beijing fall, the peroxy radical termination reactions will be less probable compared to continued autoxidation (Praske et al., 2018), and it is expected that more oxidized HOM products will be seen with lower volatilities and therefore a greater potential contribution to earlier stage particle formation and growth.

New particle formation
Nearly all the signal intensity in the CI-APi-ToF-MS instrument arises from molecules charged by NO − 3 ; therefore plotting the unit mass resolution data (the data gained by integrating over the entire area at each m/Q integer) against time simply describes the evolution of oxidized organic molecules, acids, and their molecular clusters with both each other and stabilizing amine species. This is done in Fig. 5. As the signal intensity varies by factors of 10 from mass to mass, each value has been normalized so they have maxima at 1. This has been done separately for 2 d for clarity, as the signal intensity also varies from day to day. PSM data for these 2 d is also plotted in Fig. 5, with both total particle count > 1.30 nm in black and the number difference between the lower and upper size cuts (1.30 and 1.84 nm) in blue, which shows the number of particles between these sizes. The relationship between mass and electrical mobility diameter can be defined thus (Tammet, 1995) where d e is the electrical mobility diameter of the cluster or particle, m is the mass of the cluster or particle expressed in kilogrammes, ρ is the density, and d g is the effective gas diameter, determined to be 0.3 nm for smaller particles (Larriba et al., 2011). We can use this to draw a comparison between the PSM and CI-APi-ToF-MS measurements. If a density of 1.2 g cm −3 is assumed, then once molecular clusters reach the > 400 m/Q range, they will be seen in the lowest size cut of the PSM, or > 700 m/Q if a density of 2.0 g cm −3 is assumed. A full table of densities is provided in the Supplement. A burst in the signal seen by the CI-APi-ToF-MS occurs first in the late morning in Fig. 5a, and this is at the same time as peaks begin to rise in the identified HOMs (see Fig. 3). Here, the PSM is not available due to an instrumental fault until 16:00 CST; however, at that point, an elevation to particle count and a large elevation to cluster count can be seen. Moving into the evening period, the mass contour shows peaks in larger masses > 400 m/Q. These are likely dimerized compounds and products of NO q 3 chemistry with little contribution to newly forming particles but still sensitive to chemical ionization by NO − 3 . Many of these peaks cannot be assigned due to uncertainties in the structural formula assignment for higher mass peaks, as the number of possible dimerized compounds is many, being the combination of most possible RO 2 radicals. Graphically, these are over-represented in Fig. 5 due to the normalization, and their signals (especially > 500 m/Q) are much lower than the signals < 400 m/Q. . Each individual unit mass was normalized to a maximum of 1. Each period is normalized separately so the individual signal maxima on each day are visible. The graph is plotted between 200 and 600 mass units, with every 10 mass units averaged for simplicity. On the secondary axis PSM data are plotted, both total particle count > 1.30 nm (black trace) and total clusters between 1.30 and 1.84 nm (blue trace). Data are plotted at 1 h time resolution.
The second day plotted in Fig. 5b (25 June 2017) shows a strong afternoon peak to the HOMs (for most HOMs, stronger than that on the day prior). Particle formation is shown in the PSM data. A strong midday peak to particle number is seen with two distinct peaks in cluster count. These two peaks are not coincidental with the two peaks in HOM signal (i.e. nitrogen-containing HOMs in Fig. 3a peaking at 11:00 and 16:00 CST). Sulfuric acid, however, does peak synchronously with the particle number count. Sulfuric acid is plotted across the contour plot in Fig. 6, where PSM data are also shown in the bottom panel. The peak in CI-APi-ToF-MS mass signal, visible in Fig. 5, occurs at around 12:00-13:00 CST; peaks in the PSM cluster count occur at 10:00 and 13:00 CST. Peaks in mass up to 550 m/Q are seen in the CI-APi-ToF-MS at 13:00 CST. Assuming the density of these species is ≤ 1.6 g cm −3 , then these will be suitably sized to be grown in the PSM saturator. These newly formed particles then go on to grow and contribute significantly to the larger particle count (Fig. S3). As initial particle formation coincides with sulfuric acid signal peaks and before HOM signals peak, it can be assumed on these days that the HOM contribution to the initial particle formation is modest.
There is recent strong evidence to suggest that the driving force of the earliest stages of particle formation in urban Shanghai is sulfuric acid and C 2 amines (Yao et al., 2018), and the coincidental peaks of sulfuric acid with new particles as seen in Fig. 6 suggest a similar behaviour. Dimethylamine (DMA) can efficiently stabilize the sulfuric acid clusters . Here, few larger sulfuric acid-DMA clusters were visible in the dataset, as seen in the work by Yao et al. (2018). Although five sulfuric aciddimethylamine (SA-DMA) ions were observed, the others were likely too low in signal to be confidently resolved from their neighbouring peaks; however, clusters of up to four sulfuric acid molecules and three dimethylamine molecules were seen, with similar diurnal trends in sulfuric acid. The scarcity of SA-DMA clusters is likely due to instrumental conditions, rather than their absence in the atmosphere. The nitrate chemical ionization system tends to evaporate amine compounds upon charging, and as specific voltage-tuning setups can lend themselves towards preservation or breakage of molecular clusters, the signal for larger sulfuric acid clusters was also very weak. The formation of HOM-sulfuric acid clusters is unlikely under atmospheric conditions  and few of these were observed. Signals of HOMs seem to coincide with later particle growth; it can be expected that HOM molecules make a more significant contribution to particle growth than to early particle formation, with the largest and most oxidized being involved in early growth and the smaller and less oxidized contributing to later growth as the necessary vapour pressure properties become less demanding.

Conclusions
The average degree of HOM oxidation in Beijing is comparable with that seen in other environments. Rapid intramolecular hydrogen shifts during autoxidation due to the higher temperatures are probably offset by the frequent termination reactions due to high NO x concentrations. OS c values seem to be marginally higher for biogenic species.
The temporal trend of nearly every HOM shows afternoon or evening maxima. Both O 3 and OH q have high daytime concentrations, and these likely drive the initial oxidation steps. The species arising from alkylbenzene precursors show sharper afternoon peaks, probably since their oxidation is OH q -dominated. Many of the rest of the peaks, coming from largely BVOC precursors, show broader daytime peaks, being influenced by O 3 also. There seems to be no direct link between VOC concentrations and HOM signals, with days of lower precursor VOC sometimes having higher HOM signals and vice versa.
Initial particle formation coincides with peak sulfuric acid signals, while the growth of the particles correlates more closely with the signals of HOMs. This is very similar to behaviour observed in a study of NPF in Shanghai which was attributed to sulfuric acid-dimethylamine-water nucleation with condensing organic species contributing to particle growth (Yao et al., 2018), and this is further backed up by numerous SA-DMA clusters present in this dataset. The freshly formed particles grow and contribute significantly to total particle loading. This is visible when the unit mass CI-APi-ToF-MS data are plotted as a contour plot, and further this is visible in the PSM data, with bursts in both total number count > 1.30 nm and the number of molecular clusters between 1.30 and 1.84 nm. As NO x levels fall in Beijing due to traffic emission control measures being enforced, it is likely that autoxidation will become increasingly significant in the new particle formation processes. The number of molecules detected by the NO − 3 CI-APi-ToF-MS is undoubtedly many more than have had formulae assigned here, but to identify more requires a more sophisticated data deconvolution.
Author contributions. The study was conceived and planned by RMH and ZS. DCSB and JB set up and operated the main instrumental measurements, and JB prepared the first draft of the paper and responded to comments from RMH and ZS. CNH and WJFA contributed the hydrocarbon data and provided comments on the draft paper, and FAS and JL contributed the gas-phase pollutant data.
Competing interests. The authors declare that they have no conflict of interest.
Special issue statement. This article is part of the special issue "In-depth study of air pollution sources and processes within Beijing and its surrounding region (APHH-Beijing) (ACP/AMT interjournal SI)". It is not associated with a conference.
Acknowledgements. This was part of the APHH-Beijing programme funded by the UK Natural Environmental Research Council, the National Centre for Atmospheric Science, and the Natural Sciences Funding Council of China. We thank Xinming Wang from the Guangzhou Institute of Geochemistry, Chinese Academy of Sciences; Brian Davison from Lancaster University; and Ben Langford, Eiko Nemitz, Neil Mullinger, and other staff from the Centre for Ecology and Hydrology, Edinburgh for assistance with the VOC measurements and associated infrastructure.
Financial support. This research has been supported by the Natural Environmental Research Council (grant no. NE/N007190/1) and the Natural Sciences Funding Council of China. It was additionally facilitated by the National Centre for Atmospheric Science ODA national capability programme ACREW (NE/R000034/1), which is supported by NERC and the GCRF.
Review statement. This paper was edited by Kimitaka Kawamura and reviewed by three anonymous referees.