Insight into the composition of organic compounds (≥C6) in PM2.5 in wintertime in Beijing, China

Organic matter is a major component of PM2.5 in megacities. To characterize organic compounds (≥C6) at the molecular level on non-haze and haze days, we determined more than 300 organic compounds in PM2.5 collected in an urban area of Beijing in November–December 2016 using two-dimensional gas chromatography coupled to time-of-flight mass spectrometry (GC×GC-ToF-MS). The identified organic compounds were classified into groups, and quantitative methods were used to calculate their concentrations. Primary emission sources make significant contributions to the atmospheric organic compounds, and six groups (n-alkanes, polycyclic aromatic hydrocarbons – PAHs, levoglucosan, branched alkanes, n-alkenes and alkyl-benzenes) account for 66 % of the total identified organic compound mass. In addition, PAHs and oxygenated PAHs (O-PAHs) were abundant amongst the atmospheric organic compounds on both haze and non-haze days. The most abundant hydrocarbon groups were observed in the carbon number range C19–C28. The total concentration of unidentified compounds present in the chromatogram was also estimated. The identified compounds account for approximately 47 % of total organic compounds (≥C6) in the chromatogram on both non-haze and haze days. The total mass concentrations of organic compounds (≥C6) in the chromatogram were 4.0 and 7.4 μg m−3 on non-haze and haze days, respectively, accounting for 26.4 % and 18.5 % of the organic matter estimated from the total organic carbon concentration on those days. Ratios of individual compound concentrations between haze and non-haze days do not give a clear indication of the degree of oxidation, but the overall distribution of organic compounds in the chromatogram provides strong evidence that the organic aerosol is less GC volatile and hence more highly oxidized on haze days.


Introduction
China is suffering from severe PM2.5 pollution, especially in its capital: the annual average PM2.5 concentration in Beijing ranged from 69.7 to 122 µg m−3 between 2000 and 2015 (Lang et al., 2017), 2.0–3.5 times the national standard (35 µg m−3). A recent study showed that the average PM2.5 concentration on haze days was 256 µg m−3 during the winter period from 1 to 31 December 2015 in Beijing, much higher than that on non-haze days (24.7 µg m−3; Li et al., 2019) and 25 times the World Health Organization (WHO) guideline of 10 µg m−3.
Published by Copernicus Publications on behalf of the European Geosciences Union.
Organic matter is a large and important fraction of atmospheric fine particles. A substantial number of organic compounds can be found in the atmospheric particulate phase, originating either from primary emissions or from secondary formation processes (Wu et al., 2018). Primary emission tracers and precursor compounds have been studied extensively in the Beijing aerosol and show significant contributions from coal combustion, biomass burning and traffic emissions (Ren et al., 2016; Yao et al., 2016). These studies concentrated on identifying individual organic compounds in the organic aerosol, such as n-alkanes, n-alkenes, polycyclic aromatic hydrocarbons (PAHs) and hopanes, but the structurally specific identification of the chemical composition of the organic aerosol is far from complete. Because of its great complexity, particulate organic matter remains inadequately characterized. Hence, the identification of organic compounds in generic groups may be more informative in elucidating the molecular distribution of atmospheric organic compounds and bulk aerosol characteristics (Alam et al., 2018). Previous studies have shown that organic compounds were highly oxidized on haze days and that secondary formation made a significant contribution to the particulate matter (PM) (Li et al., 2019). However, these studies focused only on specific individual oxidized organic compounds or on the ratios of C, N and O to assess the entire aerosol ageing process (Li et al., 2019), and the relationship between the molecular distribution and oxidizing processes during haze formation is still not clear.
Two-dimensional gas chromatography (GC × GC) coupled with time-of-flight mass spectrometry (ToF-MS) offers much enhanced resolution of complex mixtures, and the technique has been extended in the last 10 years to encompass atmospheric analysis. The two independent analytical dimensions in GC × GC-ToF-MS make this technique potentially ideal for measuring the organic components within a complex matrix such as ambient particulate matter (Hamilton et al., 2004; Welthagen et al., 2003), and its ability to separate complex mixtures of organics at low concentrations makes it well suited to measuring partially oxidized, isomeric and homologous series compounds, and even groups of compounds (Alam et al., 2016a; Alam and Harrison, 2016; Hamilton et al., 2004). In an earlier study of organic compounds in the Beijing atmosphere, Zhou et al. (2009) reported that 68.4 % of particulate organic matter was in the previously "unresolved complex mixture" found in conventional GC separations. The GC × GC technique is able to resolve and identify the components contributing to the unresolved mixture, and the molecular distribution of atmospheric organic compounds can be clearly identified in the chromatogram.
In order to establish relationships between organic compounds in fine particles and their characteristics on non-haze and haze days, as well as to identify the relative importance of their emission sources, further investigation of particulate organic matter composition was conducted. The objective of this study was to investigate the organic compounds with a carbon number higher than C6 in PM2.5 samples collected in central Beijing during wintertime 2016. Particle samples were analysed by the GC × GC-ToF-MS technique after solvent extraction, and the detailed organic composition was observed for polar and non-polar organic compound groups. Here, we report a large number of organic compounds and their concentrations and molecular distributions sampled on non-haze and haze days. The characteristics of the molecular distribution of atmospheric organic compounds on non-haze days were analysed and compared with haze days during aerosol ageing. In addition, we report their possible sources and formation processes and assess their pollution characteristics during non-haze and haze periods. Finally, the mass of unidentified organic compounds (> C6) is estimated and compared between non-haze and haze days.

Sampling method and site characteristics
This work was part of the APHH-Beijing study; an overview is provided by Shi et al. (2019). PM2.5 samples were collected at the Institute of Atmospheric Physics (IAP), Chinese Academy of Sciences, in Beijing, China. The sampling site (39°58′ N, 116°22′ E) was located between North 3rd Ring Road and North 4th Ring Road. The site is approximately 1 km from 3rd Ring Road, 200 m west of the G6 Expressway (which runs north-south) and 50 m south of Beitucheng West Road (which runs east-west). The annual average vehicular speeds in the morning and evening traffic peaks were 27.4 and 24.3 km h−1, respectively. No industrial sources were located in the vicinity of the sampling site. The experimental campaign took place from 9 November to 11 December 2016. The samples were collected onto pre-baked quartz fibre filters (Pallflex) by a gravimetric high-volume sampler (Tisch, USA) with a PM2.5 inlet at a flow rate of 1.0 m3 min−1. The collection time was 24 h per sample, and three blank samples were collected during this period. Before sampling, the filters were wrapped in aluminium foil and baked at 450 °C for 6 h. After sampling, each filter was packed separately and stored in a refrigerator below −20 °C until analysis.
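The sampling description above implies a fixed air volume per filter, which is what converts a mass found on a filter into an ambient concentration. The following is a minimal sketch of that arithmetic, assuming the flow stayed at its nominal 1.0 m3 min−1 for the full 24 h; the function names are hypothetical and are not taken from the study's data processing.

```python
def sampled_volume_m3(flow_m3_per_min: float, hours: float) -> float:
    """Air volume drawn through one filter at a constant flow rate."""
    return flow_m3_per_min * hours * 60.0


def ambient_concentration_ng_m3(mass_on_filter_ng: float, volume_m3: float) -> float:
    """Convert an analyte mass on the whole filter to an air concentration."""
    if volume_m3 <= 0:
        raise ValueError("sampled volume must be positive")
    return mass_on_filter_ng / volume_m3


# Nominal values from the text: 1.0 m3 min-1 for 24 h per sample.
volume = sampled_volume_m3(1.0, 24.0)
print(f"Nominal volume per filter: {volume:.0f} m3")
```

In practice only a portion of a high-volume filter is usually extracted, so the mass would first be scaled to the whole filter; that step is omitted here because the text does not give the extracted fraction.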

Analytical instrumentation
The sample extracts were analysed using a 2-D gas chromatograph (GC, 7890A, Agilent Technologies, Wilmington, DE, USA) equipped with a Zoex ZX2 cryogenic modulator (Houston, TX, USA). The first-dimension separation used an SGE DBX5 non-polar capillary column (30.0 m; 0.25 mm inner diameter; 0.25 mm – 5.00 % phenyl polysilphenylene-siloxane), and the second-dimension column was an SGE DBX50 (4.0 m; 0.10 mm inner diameter; 0.10 mm – 50.0 % phenyl polysilphenylene-siloxane). The GC × GC was interfaced with a BenchTOF-Select time-of-flight mass spectrometer (Markes International, Llantrisant, UK). The acquisition speed was 50.0 Hz, with a mass resolution of > 1200 FWHM (full width at half maximum) at 70.0 eV, and the mass range was m/z 35.0 to m/z 600. All data were processed using GC Image version 2.5 (Zoex Corporation, Houston, TX, USA).

Extraction and analysis methods of filters
The filters were spiked with 30.0 µL of 30.0 µg mL−1 deuterated internal standards (pentadecane-d32, eicosane-d42, pentacosane-d52, triacontane-d62, butylbenzene-d14, nonylbenzene-2,3,4,5,6-d5, biphenyl-d10 and p-terphenyl-d14; Sigma-Aldrich, UK) for quantification and then immersed in methanol/dichloromethane (DCM) (1 : 1, v/v) and ultra-sonicated for 20 min at 20 °C. The extract was filtered using a clean glass pipette column packed with glass wool and anhydrous Na2SO4 and concentrated to 100 µL under a gentle flow of nitrogen for analysis using GC × GC-ToF-MS; 1 µL of the extracted sample was injected at a split ratio of 50 : 1 at 300 °C. The initial temperature of the primary oven (80 °C) was held for 2 min and then increased at 2 °C min−1 to 210 °C, followed by 1.5 °C min−1 to 325 °C. The initial temperature of the secondary oven (120 °C) was held for 2 min and then increased at 3 °C min−1 to 200 °C, followed by 2 °C min−1 to 300 °C and a final increase of 1 °C min−1 to 330 °C to ensure that all species passed through the column. The transfer line temperature was 330 °C, and the ion source temperature was 280 °C. Helium (99.999 %) was used as the carrier gas at a constant flow rate of 1 mL min−1. Further details of the instrumentation and data-processing methods are given by Alam and Harrison (2016) and Alam et al. (2016a).
Compound identification was based on the GC × GC-ToF-MS spectral library, the NIST mass spectral library and co-injection with authentic standards. Compounds within a homologous series for which standards were not available were identified by comparing the retention time interval between homologues, by comparison of mass spectra with the standards for similar compounds within the series, by comparison to the NIST mass spectral library and by the analysis of fragmentation patterns. Identified compounds were quantified by linear regression using seven-point calibration curves (0.05, 0.10, 0.25, 0.50, 1.00, 2.00 and 3.00 ng µL−1) established between the concentration ratios of authentic standards to internal standards and the corresponding peak area ratios. The calibration curves for all target compounds were highly linear (r2 > 0.98; from 0.978 to 0.998), demonstrating the consistency and reproducibility of this method. Limits of detection for individual compounds were typically in the range 0.001–0.08 ng m−3. Identified compounds with no commercial authentic standards were quantified using the calibration curves for compounds of similar structure or for isomeric compounds. The applicability of quantifying individual compounds using isomers of the same compound functionality (which have authentic standards) has been discussed elsewhere and has a reported uncertainty of 24 % (Alam et al., 2018).
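The internal-standard calibration described above can be sketched as an ordinary least-squares fit of peak area ratio against concentration ratio, followed by inversion of the fitted line for a sample. This is an illustration under stated assumptions rather than the processing actually used: the seven calibration levels come from the text, whereas the internal standard concentration of 1.0 ng µL−1, the response factor of 0.9 and all function names are hypothetical.

```python
from typing import Sequence, Tuple


def fit_line(x: Sequence[float], y: Sequence[float]) -> Tuple[float, float]:
    """Ordinary least-squares fit y = slope * x + intercept."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def quantify(area_ratio: float, slope: float, intercept: float,
             is_conc_ng_per_ul: float) -> float:
    """Invert the calibration line to get the analyte concentration in the extract."""
    conc_ratio = (area_ratio - intercept) / slope
    return conc_ratio * is_conc_ng_per_ul


# Seven calibration levels from the text (ng per microlitre).
levels = [0.05, 0.10, 0.25, 0.50, 1.00, 2.00, 3.00]
IS_CONC = 1.0  # hypothetical internal standard concentration, ng per microlitre
conc_ratios = [c / IS_CONC for c in levels]
# Hypothetical, perfectly linear detector response used only for illustration.
area_ratios = [0.9 * r for r in conc_ratios]

slope, intercept = fit_line(conc_ratios, area_ratios)
sample_conc = quantify(0.45, slope, intercept, IS_CONC)
print(f"slope={slope:.3f}, intercept={intercept:.3f}, sample={sample_conc:.3f} ng/uL")
```

Regressing area ratios rather than raw areas means that losses during extraction and injection affect analyte and internal standard alike and largely cancel, which is the usual reason for spiking deuterated standards before extraction.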
The branched alkanes, alkyl-benzenes, alkyl-decalins, alkyl-phenanthrene and anthracene (alkyl-Phe and Ant), alkyl-naphthalene (alkyl-Nap), and alkyl-benzaldehyde were identified in the samples with the graphics method of GC Image version 2.5 (Zoex Corporation, Houston, TX, USA); detailed descriptions are given elsewhere (Alam et al., 2018). Briefly, structurally similar compounds (with similar physico-chemical properties) were identified as a group by drawing a polygon around a section of the chromatogram with the polygon selection tool. All compounds included in the polygon belong to a specific compound class, and the total concentrations were calculated via a calibration curve of the adjacent compounds and internal standards (ISs).
Field and laboratory blanks were routinely analysed to evaluate analytical bias and precision. Blank levels of individual analytes were normally very low and, in most cases, not detectable. The major contaminants observed were very minor amounts of n-alkanes ranging from C11 to C21, with no carbon number predominance and a maximum at C18; PAHs were not detectable. The major proportion of the contaminants could be distinguished by their low concentrations and distribution fingerprints (especially the n-alkanes). These contaminants did not interfere with the recognition or quantification of the compounds of interest. Recovery efficiencies were determined by analysing blank samples spiked with standard compounds. Mean recoveries ranged between 82 % and 98 %. All quantities reported here have been corrected according to their recovery efficiencies. Analytical data from the GC × GC analysis were compared with a conventional GC-MS analysis for levoglucosan and 13 PAHs. The correlations (r2) between the two instruments were in the range of 0.5 to 0.8. For 10 compounds the mean concentrations from the two techniques were within 20 % of one another, for 2 within 20 %–30 % and for the remaining 2 within 30 %–40 %. The largest outlier was levoglucosan, which was underestimated, probably because it decomposed in the absence of the usual derivatization.

General aerosol characteristics
Thirty-three samples were separated into non-haze (13) and haze (20) days (the latter with a 24 h average PM2.5 exceeding 75 µg m−3) according to the National Ambient Air Quality Standards of China (NAAQS) released in 2012 by the Ministry of Environmental Protection (MEP) of the People's Republic of China. The concentrations of PM2.5, black carbon (BC), organic carbon (OC), elemental carbon (EC) and gaseous pollutants (SO2, NO, NO2, NOx and CO), together with meteorological parameters (wind speed, WS; wind direction, WD; and relative humidity, RH), were determined simultaneously during the field campaign and appear in Table S1 in the Supplement.
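The haze classification rule above reduces to a single threshold test on the 24 h average PM2.5. The sketch below illustrates it; the function name and the example values are illustrative (the two campaign means are taken from the text, but the list is not the study's daily data), and treating a day exactly at 75 µg m−3 as non-haze follows the word "exceeding" and is otherwise an assumption.

```python
HAZE_THRESHOLD_UG_M3 = 75.0  # 24 h average PM2.5 threshold quoted in the text


def classify_day(pm25_24h_ug_m3: float) -> str:
    """Label a sampling day as 'haze' when the 24 h PM2.5 exceeds the threshold."""
    return "haze" if pm25_24h_ug_m3 > HAZE_THRESHOLD_UG_M3 else "non-haze"


# Illustrative values only: the haze and non-haze means, plus a boundary case.
for value in (141.0, 35.3, 75.0):
    print(f"{value:6.1f} ug m-3 -> {classify_day(value)}")
```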
The average daily PM2.5 mass was 99 µg m−3, and the average on haze days (141 µg m−3) was 4 times that on non-haze days (35.3 µg m−3). The average wind speed and temperature were 0.94 m s−1 and 6.1 °C on haze days and 1.44 m s−1 and 4.0 °C on non-haze days. The relative humidity, however, was slightly higher during haze episodes (56.3 %) than during non-haze periods (39.8 %). The concentrations of the gaseous pollutants SOx, NOx and CO rose together with PM2.5 concentrations, whereas the O3 concentration showed the opposite trend (Shi et al., 2019). The average concentration of organic matter (OM) was estimated as 30.2 µg m−3 using the OC concentration (18.9 µg m−3) and a multiplying factor of 1.6 for aged aerosols (Turpin and Lim, 2001). The OM concentration was 40.0 and 15.0 µg m−3 on haze and non-haze days, respectively.
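The OM estimate above is a one-line conversion from OC, and the campaign mean can be cross-checked against the haze and non-haze means weighted by the number of days. The sketch below reproduces that arithmetic using only values quoted in the text; the function names are hypothetical.

```python
OM_TO_OC_FACTOR = 1.6  # multiplying factor for aged aerosols (Turpin and Lim, 2001)


def organic_matter(oc_ug_m3: float, factor: float = OM_TO_OC_FACTOR) -> float:
    """Estimate organic matter mass from organic carbon mass."""
    return factor * oc_ug_m3


def day_weighted_mean(means: list, day_counts: list) -> float:
    """Combine period means into a campaign mean, weighting by number of days."""
    total_days = sum(day_counts)
    return sum(m * n for m, n in zip(means, day_counts)) / total_days


om_campaign = organic_matter(18.9)
# Non-haze (13 days) and haze (20 days) OM means from the text.
om_check = day_weighted_mean([15.0, 40.0], [13, 20])
print(f"OM from OC: {om_campaign:.1f} ug m-3; day-weighted check: {om_check:.1f} ug m-3")
```

Both routes give roughly 30.2 µg m−3, so the period means and the campaign mean quoted in the text are mutually consistent.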

The characteristics of organic compound groups on non-haze and haze days
The average total concentration of identified groups was calculated for the non-haze (13 d) and haze (20 d) periods. The two periods are compared in Fig. 2, and the detailed concentrations of each group are shown in Table S3. The concentrations of most organic compound groups were higher on haze days than on non-haze days, especially for the n-alkanols and n-Cn-cyclohexanes. The alkyl-benzenes, alkyl-benzaldehydes, monoaromatic compounds and quinoline had approximately similar concentrations on non-haze and haze days. As many compound groups have not been reported in previous studies, and complete data on the relative abundance of these compounds in various source emissions are not available at present, it is not yet possible to calculate source contributions to ambient organic compound concentrations via molecular marker or mathematical modelling methods. However, several important consistency checks on the potential sources can be performed. In the sections that follow, the literature on the origin of each of these compound classes is reviewed briefly and the measured compound concentrations are described. Table 1 compares the identified organic compounds between the present and previous studies in Beijing. In many but not all cases, concentrations are comparable.

n-Alkanoic acids, n-alkanols and carbonyl compounds
The n-alkanoic acids with carbon numbers from C6 to C10 were identified in the PM2.5. Higher-molecular-weight (HMW) alkanoic acids generated from biomass burning (Simoneit and Mazurek, 1982) were not identified in the samples, probably due to low volatility in the GC. The n-alkanoic acids were observed at a similar magnitude to a previous study in Beijing (Zhou et al., 2009; Table 1). Previous studies have found that the n-alkanoic acid homologues were significantly impacted by cooking emissions in Beijing and showed higher concentrations on non-haze days and a similar distribution pattern in all seasons (Huang et al., 2006; He et al., 2006; Sun et al., 2013). Consistent results for acids were observed in this study: the n-alkanoic acids had an average concentration of 36.4 ng m−3 on non-haze days, higher than the 24.6 ng m−3 on haze days, strongly implying a dominant contribution from cooking emissions as opposed to secondary formation.
greater than 8.39 ng m−3 on non-haze days. The above results suggest that n-alkanol formation is more efficient on haze days, even though vehicular emissions appear to be another important source. Aliphatic carbonyl compounds, including n-alkanals, n-alkan-2-ones and n-alkan-3-ones, were described in detail by Lyu et al. (2019a). Briefly, the daily sum of aliphatic carbonyls (ΣAC) ranged from 8.87 to 164 ng m−3, accounting for 0.02 %–0.46 % of OM. The average ΣAC was 75.8 ng m−3 on haze days, approximately double the 39.5 ng m−3 of the non-haze period. Lyu et al. (2019a) showed that the n-alkanals originated mainly from vehicle exhaust or were formed by OH oxidation of n-alkanes, while the n-alkanones were probably emitted mainly by coal combustion.

Nitrogen-containing organic compounds
Nitrogen-containing (N-containing) organic compounds have been reported in many previous studies, and the important sources of N-containing compounds are coal combustion, biomass burning, vehicular exhaust and atmospheric photochemical reactions (Rogge et al., 1994 1993b; Schauer et al., 1996; Zhang et al., 2002; Fan et al., 2018). N-containing compounds were identified in the samples, including heterocyclic compounds (alkyl-pyridines and alkyl-quinolines) and other N-containing compounds (nitro and amine compounds). The average concentrations of alkyl-pyridines, alkyl-quinolines and other N-containing compounds were 17.4 ± 7.58, 16.6 ± 15.0 and 30.0 ± 23.1 ng m−3, respectively, and the average total concentration of N-containing compounds was 64.0 ng m−3, accounting for approximately 0.2 % of the OM.
The quinolines have been proposed for use as tracers of vehicular exhaust (Rogge et al., 1993a) and of crude oil and shale oil combustion (Schmitter et al., 1983; Simoneit et al., 1971), while the straight-chain alkyl-pyridines (n-Cn-pyridine) are related to petrochemical industries (Botalova et al., 2009) and to secondary formation from the pyrolysis of proteins and amino acids at high temperature (Chiavari and Galletti, 1992; Hendricker and Voorhees, 1998; Kögel-Knabner, 1997). This study found that both groups showed similar concentrations on non-haze and haze days: quinolines at 16.8 ± 16.5 ng m−3 (non-haze) and 16.5 ± 14.4 ng m−3 (haze), and alkyl-pyridines at 12.0 ± 6.02 ng m−3 (non-haze) and 15.3 ± 8.36 ng m−3 (haze). Amino compounds can originate from biomass burning and coal combustion and are more abundant in winter fine particulate-matter samples than in summer (Zhang et al., 2002; Akyüz, 2008). In the present study, the average concentration of other N-containing compounds was 34.2 ± 24.6 ng m−3 on haze days, somewhat higher than the 22.6 ± 19.4 ng m−3 on non-haze days.
The similar concentrations on the non-haze and haze days suggest that N-containing organic compounds mainly originated from primary sources and are subject to degradation during the haze formation process.
Benzoquinoline and isoquinoline, tracers of tobacco smoke, have previously been determined in PM collected in Beijing, with concentrations of 3.10 and 0.22 ng m−3, respectively (Zhou et al., 2009). These two compounds were also identified in the present study, at 4.40 and 0.80 ng m−3, respectively. Phthalimide was identified in the PM at 0.91 ng m−3 and was considered by Zhao et al. (2009) to be derived from cyclization and aromatization reactions of proteins or from intermediates in the transformation of carboxyl ammonium salts to nitriles.

Esters
Phthalate esters are organic chemicals that are commonly used in a variety of consumer products and in various industrial and medical applications and are predominantly used as plasticizers to improve the flexibility of polyvinyl chloride (PVC) resins and other polymers. Table 1 compares phthalate esters (DBP, DEP and DEHP) between the present and previous studies in winter in Beijing; it seems that the concentrations of some phthalate esters have significantly decreased from earlier studies (Wang et al., 2006; Zhou et al., 2009). The present study found that diisodecyl phthalates, DBP and DEHP were abundant compounds in the ester group, at 49.7 ± 43.2, 16.9 ± 15.5 and 16.0 ± 12.6 ng m−3, respectively. The DBP, DEP and DEHP concentrations in Beijing were far lower than those in winter in Tianjin (Kong et al., 2013) and in another 15 cities around China (Li and Wang, 2015; Wang and Kawamura, 2005; Wang et al., 2006). In addition, the average ester concentration was 117 ± 82.1 ng m−3, with 132 ± 87.1 and 89.4 ± 70.0 ng m−3 on haze and non-haze days, respectively. Since phthalates are not chemically bound to the polymeric matrix, they can enter the environment by escaping from manufacturing processes and by leaching or vaporizing from final products (Staples et al., 1997).
The distribution of PAHs is shown in Fig. 3; the most abundant PAH was BbF, followed by CHR, FLT, BaA and PYR. Across all samples, the PAH concentration ranged from 46.7 to 727 ng m−3, with an average of 281 ± 176 ng m−3, accounting for 0.93 % of OM. The average PAH concentration was 364 ng m−3 on haze days but only 159 ng m−3 on non-haze days. It should be noted that retene was detected in most samples, with an average concentration of 14.4 ± 17.5 ng m−3. It has been suggested that retene predominantly originates from the combustion of conifer wood (Simoneit et al., 1991).
Nineteen oxygenated PAHs (O-PAHs) make up a class of PAH derivatives that are present in the atmosphere as a result of direct emission during combustion and secondary formation by homogeneous and heterogeneous photo-oxidation processes (Keyte et al., 2013; Ringuet et al., 2012). They are also of scientific interest because they are typically found in the secondary organic aerosol (SOA) formed by photo-oxidation of PAHs (Shakya and Griffin, 2010). In urban samples, polycyclic aromatic ketones (PAKs), polycyclic aromatic quinones (PAQs) and polycyclic aromatic furanones (PAFs) are typical groups of compounds (Lin et al., 2015). The average total concentration of O-PAHs measured in this study (Fig. 4) was 67.9 ng m−3. The polycyclic aromatic ketones 4,5-pyrenequinone (4,5-PyrQ; 8.75 ng m−3) and 1,6-pyrenequinone (1,6-PyrQ; 7.38 ng m−3) were the most abundant compounds during the sampling campaign. Four O-PAHs were identified previously at the Peking University (PKU) site in the 2012 heating season in Beijing (Table 1); it is notable that the concentration of AQ was up to 108 ng m−3, approximately 20 times that in the present study (5.12 ng m−3). As O-PAHs can be formed during sampling, it is necessary to be very careful in reconciling their presence with specific sources (Pitts et al., 1980). The average O-PAH concentration was 86.5 ng m−3 on haze days but 39.7 ng m−3 on non-haze days. The ratio of quinone to parent PAH has been used to assess air mass age (Alam et al., 2014; Harrison et al., 2016). The average ratios of phenanthraquinone to phenanthrene (PQ : PHE), anthraquinone to anthracene (AQ : ANT) and benzo(a)anthracene-7,12-quinone to benzo(a)anthracene (BaAQ : BaA) were 0.37, 1.27 and 0.32, respectively. On haze days the PQ : PHE, AQ : ANT and BaAQ : BaA ratios were 0.25, 0.88 and 0.26, lower than the 0.55, 1.92 and 0.40 on non-haze days. The BaAQ : BaA ratios were lower than earlier published data of 1.28 measured in Beijing (Li et al., 2019), 1.40 in Xi'an (Wang et al., 2016) and 0.54 in Beijing-Tianjin (Wang, 2010) but higher than the 0.08 measured in Guangzhou (Wei et al., 2012) and 0.09 in Zhuanghu (Ding et al., 2012). Shen et al. (2011) reported that the BaAQ : BaA ratio was 0.03 for coal combustion, 0.16 for crop residue burning (Shen et al., 2012a) and 6.6 for biomass pellet burning (Shen et al., 2012b). The low ratios of O-PAHs to PAHs in our data probably indicate that the particulate matter mainly originated from coal combustion and biomass burning. However, the lower ratios on haze days than on non-haze days may imply continued oxidation of the O-PAHs to products which were not analysed. Li et al. (2019) also reported that ratios of O-PAH to PAH were very similar during haze and clean-air periods, which provides support for this conclusion.
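The quinone-to-parent comparison above can be made explicit by expressing each haze-day ratio as a fraction of its non-haze counterpart. The sketch below uses only the ratios quoted in the text; the dictionary layout and function names are illustrative, and the individual quinone and parent concentrations behind the ratios are not reproduced because the text does not give them.

```python
# Reported quinone : parent PAH ratios from the text, as (haze, non-haze).
REPORTED_RATIOS = {
    "PQ:PHE": (0.25, 0.55),
    "AQ:ANT": (0.88, 1.92),
    "BaAQ:BaA": (0.26, 0.40),
}


def quinone_to_parent_ratio(quinone_ng_m3: float, parent_ng_m3: float) -> float:
    """Ratio of an O-PAH quinone to its parent PAH, both in ng m-3."""
    if parent_ng_m3 <= 0:
        raise ValueError("parent PAH concentration must be positive")
    return quinone_ng_m3 / parent_ng_m3


def haze_relative_to_nonhaze(ratios: dict) -> dict:
    """Express each haze-day ratio as a fraction of the non-haze ratio."""
    return {pair: haze / non_haze for pair, (haze, non_haze) in ratios.items()}


relative = haze_relative_to_nonhaze(REPORTED_RATIOS)
for pair, value in relative.items():
    print(f"{pair}: haze ratio is {value:.2f} of the non-haze ratio")
```

All three fractions fall below 1, which is the pattern the text interprets as possible further oxidation of O-PAHs on haze days rather than as younger air masses.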

Molecular markers
The hopanes are compounds present in crude oil as a result of the decomposition of sterols and other biomass and are not by-products of combustion (Simoneit, 1985). They are very stable and have been proposed for use as tracers for atmospheric particles from fossil fuel combustion, such as motor vehicle exhaust (Simoneit, 1985) and coal combustion (Oros and Simoneit, 2000). The hopanes are widely used as tracers of traffic emissions because vehicle emissions have high loadings of hopanes (Cass, 1998). Hopanes in this study and in previous studies in the winter or heating season of Beijing are compared in Table 1. Hopanes were extensively present in Beijing PM2.5 samples, and their carbon numbers ranged from C27 to C32 but not C28 (Table 2). The average concentration of hopanes in Beijing was 32.7 ± 24.7 ng m−3, with 15.2 ± 10.7 ng m−3 and 44.6 ± 24.6 ng m−3 on non-haze and haze days, respectively. Previous studies have found that C29 (17α(H),21β(H)-norhopane) was dominant in the hopane series and consistent with that from coal combustion (He et al., 2006), while C30 (17β(H),21α(H)-hopane and 17α(H),21β(H)-hopane) was similar to C29 in wintertime in Beijing and attributed to gasoline and diesel exhaust (Simoneit, 1985).
Levoglucosan and methoxyphenols from the pyrolysis of cellulose and lignin are usually used as unique tracers for biomass burning in source apportionment models (Schauer and Cass, 2000). Levoglucosan (1,6-anhydro-β-D-glucopyranose) has long been employed as the specific molecular marker for long-range transport of biomass burning aerosol, based on its high emission factors and assumed chemical stability (Fraser and Lakshmanan, 2000; Simoneit et al., 2000). It is a highly abundant compound, and its concentrations in winter in Beijing fluctuate significantly (Table 1). The average levoglucosan concentration was 355 ± 232 ng m−3 during the entire sampling period and 417 ± 223 ng m−3 in haze episodes, approximately 2 times that on non-haze days (238 ± 193 ng m−3), indicating a significant impact of biomass burning on wintertime aerosols in Beijing. Methoxyphenols are usually also considered to be tracers for wood burning (Simpson et al., 2005; Yee et al., 2013); their average concentration was 7.29 ± 7.11 ng m−3, and the haze-day value (9.03 ± 7.93 ng m−3) was about 2 times the non-haze value (4.74 ± 4.95 ng m−3). In Beijing and its surrounding areas, harvest occurs in late September to October for corn, and biomass fuels are used for cooking and heating purposes in the winter. However, the methoxyphenols are abundant components in the smoke from broadleaf tree and shrub burning (Wang et al., 2009) and have been identified in all coal smoke (Simoneit, 2002); thus they cannot be used as source-specific markers for biomass burning.
Phenolic compounds from the thermal degradation of lignin have been proposed as potentially useful tracers for wood smoke, and many of them are emitted in relatively high quantities and are specific to wood combustion sources (Simoneit, 2002; Simoneit et al., 2004). Another important source of phenolic compounds is the oxidation of monoaromatic compounds and PAHs (Pan and Wang, 2014). Phenols and naphthalenol were identified in the PM2.5, with an average phenolic compound concentration of 21.6 ± 17.0 ng m−3, being 14.0 ± 13.2 and 25.9 ± 17.9 ng m−3 on non-haze and haze days, respectively. However, it is notable that the concentrations of naphthalenol identified in the present study were far lower than those of previous studies (Table 1).
Pristane (Pr) and phytane (Ph) have been found in the exhaust of petrol and diesel engines and in lubricating oil, indicating their origin from petroleum (Simoneit, 1984). Since their presence is ubiquitous in vehicle exhausts and negligible in contemporary biogenic sources in urban environments, they can be used as petroleum tracers for airborne particulate matter. The mean values of Pr and Ph in our samples are 2.24 and 1.94 ng m−3, respectively. Biogenic inputs are often characterized by a predominance of the odd carbon alkanes and Pr. Since Ph is rarely found in biological material, most biological hydrocarbons have a Pr/Ph ratio far higher than 1.0 (Oliveira et al., 2007), but values approaching unity indicate a hydrocarbon signature derived from petrochemical use. The average Pr/Ph ratio was 1.15 for PM2.5 samples, quite similar to the results from the southern Chinese city of Guangzhou, 1.1–1.8 (Bi et al., 2002), but almost 4 times greater than in Beijing summer samples (0.3; Simoneit et al., 1991). The Pr/Ph ratio near unity indicated that the hydrocarbons in urban aerosol derive mainly from petroleum residues, probably coming from vehicular emissions in Beijing.
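The Pr/Ph diagnostic above is a simple quotient, and the reported campaign means reproduce the quoted ratio to within rounding. The sketch below shows the check; the function name is hypothetical, and it takes the ratio of the two mean concentrations, which need not equal the mean of the per-sample ratios that the study may have averaged.

```python
def pr_ph_ratio(pristane_ng_m3: float, phytane_ng_m3: float) -> float:
    """Pristane-to-phytane ratio; values near 1 suggest a petroleum signature."""
    if phytane_ng_m3 <= 0:
        raise ValueError("phytane concentration must be positive")
    return pristane_ng_m3 / phytane_ng_m3


# Mean concentrations from the text (ng m-3).
ratio = pr_ph_ratio(2.24, 1.94)
print(f"Pr/Ph from campaign means: {ratio:.2f}")
```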

The molecular distributions of aliphatic hydrocarbons
Figure 4 shows the molecular distributions of aliphatic hydrocarbons on non-haze and haze days. Details on the n-alkanes are given by Lyu et al. (2019a). Briefly, the n-alkanes (C10–C36) ranged from 42.4 to 1241 ng m−3, with an average of 450 ± 316 ng m−3, and the average n-alkane concentration was 577 ng m−3 during haze episodes, more than twice that of the non-haze period (264 ng m−3). The n-alkanes (C20–C31) were the most abundant homologues (Fig. 4), accounting for approximately 83 % of the n-alkanes.
The total concentrations of branched alkanes (C12–C36) ranged from 125 to 647 ng m−3, with an average of 356 ± 173 ng m−3, during the sampling period. The average branched alkane concentration was 440 ± 144 ng m−3 during haze episodes, higher than the 234 ± 138 ng m−3 on non-haze days. The most abundant branched alkanes were observed at C22, with an average concentration of 29.2 ng m−3, and the greatest abundance of branched alkane groups was observed within the carbon number range C20–C30, accounting for 67.7 % of branched alkanes. On both haze and non-haze days, the branched alkanes have lower concentrations than n-alkanes when the carbon number is > C20 but higher concentrations than n-alkanes when the carbon number is lower than C19.
It is difficult to identify the potential sources of branched alkanes from the literature, although Alam et al. (2016b) reported that branched alkanes (C 11 -C 33 ) were an abundant compound group in diesel exhaust.The increase in HMW branched alkanes (C 20 -C 30 ) from non-haze days to haze days is consistent with a primary emission source, probably linked to coal combustion or vehicular emissions.The fact that both n-alkanes and branched alkanes increase quite similarly between non-haze and haze conditions is consistent with them arising from the same sources or sources with highly correlated emissions.
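The similarity of the haze-related increase for the two alkane groups can be illustrated with a short Python sketch using only the period averages quoted above. The dictionary layout and function names are hypothetical. The day-weighted mean is an assumption: it approximates the campaign average by weighting each period by its number of sampling days (13 non-haze and 20 haze days, as given in the Fig. 2 caption), which only matches the reported average if every day contributes one sample.

```python
# Number of sampling days per period (from the Fig. 2 caption).
N_NON_HAZE, N_HAZE = 13, 20

# Period averages and campaign averages quoted in the text (ng m^-3).
groups = {
    "n-alkanes": {"non_haze": 264.0, "haze": 577.0, "reported_mean": 450.0},
    "branched alkanes": {"non_haze": 234.0, "haze": 440.0, "reported_mean": 356.0},
}

def haze_ratio(group):
    """Haze to non-haze ratio of the period averages."""
    return group["haze"] / group["non_haze"]

def day_weighted_mean(group):
    """ASSUMPTION: campaign mean approximated by day-weighting the period means."""
    total_days = N_NON_HAZE + N_HAZE
    return (N_NON_HAZE * group["non_haze"] + N_HAZE * group["haze"]) / total_days

for name, group in groups.items():
    print(f"{name}: haze/non-haze = {haze_ratio(group):.2f}, "
          f"day-weighted mean = {day_weighted_mean(group):.0f} ng m^-3 "
          f"(reported {group['reported_mean']:.0f})")
```

Both groups roughly double between non-haze and haze conditions, and the day-weighted means fall within about 1 % of the reported campaign averages.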
Other groups of aliphatic and alicyclic compounds identified in the PM2.5 include alkyl-decalins, alkyl-pyridines, alkyl-furanones, alkyl-cyclohexanes and alkyl-benzenes. Figure 5 shows the molecular distributions of these series of compounds. Engine studies (Alam et al., 2016b) have also found that compounds observed in vehicle exhaust besides n-alkanes and PAHs include straight and branched cyclohexanes (C11-C25), various cyclic aromatics, alkyl-decalins, and alkyl-benzenes. The particle-bound n-Cn-cyclohexanes with carbon numbers from C12 to C26 were identified in diesel exhaust (Alam et al., 2016b), with a dominant range of C18-C25, and the total (particle plus gas) concentration of n-Cn-cyclohexanes was 2.05 µg m−3. The n-Cn-cyclohexanes (C20-C30) were identified at the IAP site with an average concentration of 39.4 ± 37.1 ng m−3. The most abundant range was C22-C27, consistent with the engine study, implying a significant contribution from vehicle emissions. In addition, the average n-Cn-cyclohexane (C20-C30) concentration was 53.3 ± 39.3 ng m−3 during haze episodes, approximately 5 times higher than the 10.8 ± 8.22 ng m−3 in the non-haze period, a larger ratio than for other primary emissions. The alkyl-decalins and tetralin are products obtained by hydrogenation of naphthalene and its derivatives during the refining process and have been identified in vehicle exhaust (Afzal et al., 2008; Alam et al., 2016b; Ogawa et al., 2007). The average alkyl-decalin concentration was 110 ng m−3, with 85.4 ± 65.5 and 126 ± 110 ng m−3 on non-haze and haze days, respectively. The n-Cn-benzenes (C16-C25) identified in the samples ranged from 7.71 to 410 ng m−3, with an average of 56.6 ± 73.0 ng m−3. The average n-Cn-benzene (C16-C25) concentration was 77.2 ± 88.2 ng m−3 during haze episodes, more than 3 times the 23.3 ± 15.1 ng m−3 of the non-haze period. Other alkyl-benzenes (C9-C25) were also identified and have higher concentrations at C12, especially for the non-haze days.

Distribution of compounds with respect to volatility and polarity, and the estimation of unidentified mass
The method for characterizing the volatility/polarity distribution of compounds is detailed in the Supplement. Briefly, the chromatography image was separated into seven parts according to the main chemical and physical properties of the organic compounds and the distribution of ISs, and the detailed protocol is shown in Table S4. The diagram of the separated image with seven parts is shown in Fig. 6a, and the concentrations measured in each part are shown in Fig. 6 and Table 3. In the chromatogram (Fig. 6), volatility decreases from left to right, and polarity increases from bottom to top.
Table 3 shows the estimated mass concentration of all components of the chromatogram alongside the amount of mass not accounted for by the specific compounds reported in this paper.
For the non-haze days, the sum of identified organic compounds (IOCs) with carbon numbers higher than C6 was 1.84 µg m−3, accounting for 46.5 % of total organic compounds. The IOC of the haze days was almost 2 times that of non-haze periods, with an average of 3.42 µg m−3, accounting for 46.3 % of total measured organic matter. In addition, the sum of unidentified compounds increased from 2.12 µg m−3 on non-haze days to 3.96 µg m−3 on haze days, accounting for 53.5 % and 53.7 % of total measured organic matter, respectively. Hence there is no marked difference in the proportions of identified and unidentified compounds between haze and non-haze conditions.
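The identified and unidentified proportions quoted above follow directly from the four concentrations. The minimal Python sketch below makes the arithmetic explicit. The names are illustrative, and the total is taken here simply as the sum of the identified and unidentified masses, so it can differ from a separately reported total in the last digit because of rounding.

```python
# Concentrations quoted in the text (ug m^-3).
identified = {"non_haze": 1.84, "haze": 3.42}
unidentified = {"non_haze": 2.12, "haze": 3.96}

def identified_share(period):
    """Percentage of the chromatogram mass that was identified in a period."""
    total = identified[period] + unidentified[period]
    return 100.0 * identified[period] / total

for period in ("non_haze", "haze"):
    share = identified_share(period)
    print(f"{period}: identified {share:.1f} %, unidentified {100.0 - share:.1f} %")

# Haze to non-haze ratio of the identified mass ("almost 2 times").
ioc_ratio = identified["haze"] / identified["non_haze"]
print(f"IOC haze/non-haze = {ioc_ratio:.2f}")
```

The identified share is essentially the same in both periods, although the absolute identified mass nearly doubles.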
For the non-haze days, Section 1 of the chromatogram has the highest concentration (802 ng m−3), followed by Section 7 (792 ng m−3), accounting for 20.3 % and 20.0 % of the total organic compounds, respectively. This implies that low-molecular-weight (LMW) hydrocarbons (Section 1), HMW PAHs (Section 7; three to six rings) and compounds of similar volatility and polarity were the main organic components of atmospheric particulate matter measurable by the GC × GC separation technique. The PAHs are important organic compounds appearing in Sections 6 and 7, accounting for 32.3 % of total measured organic compounds during the non-haze days. Sections 2, 3 and 4 showed relatively low concentrations, and medium-molecular-weight hydrocarbons in the range of C23-C27 (Section 3) were more abundant than the aliphatic hydrocarbons in Section 2 (C17-C23) and Section 4 (> C27), probably because of primary emissions from vehicles and coal combustion (Cao et al., 2018). Section 5 contains oxidized monoaromatic compounds, and its concentrations were higher than those of Section 6 (mainly containing naphthalene derivatives) and lower than those of Section 1; these compounds probably arise mainly from vehicular emissions or from oxidation of the monoaromatic precursors (Section 1; Schwantes et al., 2017).
The polarity distribution characteristics of atmospheric organic compounds on the non-haze days were also studied. For the volatile areas, low-polarity compounds (Section 1) have a lower concentration than polar compounds (Sections 5 and 6) during the non-haze days. In contrast, for the semi- and non-volatile area, the sum of low-polarity compounds (Sections 2, 3 and 4) has a higher concentration than the polar organic compounds (Section 7).
The concentrations in all sections increased from non-haze to haze days, and the main difference between haze and non-haze days lies in Sections 5, 6 and 7 (Fig. 6b), indicating a more polar aerosol during periods of haze. Section 6 has the highest concentration on the haze days (1556 ng m−3), more than 3 times that on non-haze days (485 ng m−3), followed by Section 7 (1337 ng m−3) and Section 5 (1309 ng m−3), indicating that the oxidized monoaromatics, naphthalene derivatives and oxidized HMW PAHs were the main identified components of the atmospheric particulate matter during the haze days. Comparing the seven sections, the high concentrations in Section 6 probably include a contribution from the degradation of HMW PAHs (from Section 7). For the oxidized monoaromatic compounds (Section 5), the degradation of naphthalene derivatives, rather than the oxidation of compounds from Section 1, was probably a major contributor. The concentrations of Section 3 were also observed to increase from non-haze days (573 ng m−3) to haze days (1060 ng m−3), indicating that accumulation under low wind speed (Table S1) has an obvious effect on the stable compounds with carbon numbers between C23 and C27 during haze formation.
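The contrast between the polar and low-polarity sections can be made concrete with a small Python sketch that computes haze to non-haze ratios for the sections where the text quotes both values (Sections 3, 6 and 7). The dictionary names are hypothetical, and the remaining sections are omitted because only one of their two values is given in the text; the full set is in Table 3.

```python
# Section concentrations quoted in the text (ng m^-3).
# Only sections with both a non-haze and a haze value in the text are listed.
non_haze = {3: 573.0, 6: 485.0, 7: 792.0}
haze = {3: 1060.0, 6: 1556.0, 7: 1337.0}

# Haze to non-haze enhancement ratio for each section.
enhancement = {section: haze[section] / non_haze[section] for section in non_haze}

for section, ratio in sorted(enhancement.items()):
    print(f"Section {section}: haze/non-haze = {ratio:.2f}")

# Section with the largest relative increase among those listed.
most_enhanced = max(enhancement, key=enhancement.get)
print(f"largest enhancement: Section {most_enhanced}")
```

Among these three sections, the polar Section 6 shows the largest relative increase, whereas the low-polarity Section 3 increases by less than a factor of 2.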

Elevation of primary and secondary constituents during haze events
By definition, concentrations of PM2.5 are elevated during haze events, but the question arises as to whether primary or secondary organic compounds make a larger contribution to the rise in concentrations. Constituents that are expected to be primary are typically elevated in mean concentration by a factor of around 2 (Table S3). Examples are n-alkanes (ratio of haze to non-haze of 2.2), levoglucosan (1.8) and hopanes (2.9). This is consistent with the ratios for primary gaseous emissions, including SO2 (ratio of 2.6), CO (2.5) and NOx (2.2; Table S1). Surprisingly, however, both BC (ratio of 3.8) and EC (3.4; Table S1) are primary constituents with a large haze : non-haze ratio, comparable to that of PM2.5 mass (4.0). Consequently the factors leading to an elevation of concentrations during the haze appear complex and are likely to be resolved fully only by chemistry-transport models. OC/EC ratios are used to estimate the relative contribution of primary and secondary sources; high OC/EC ratios (> 2.0) have been observed for aerosols with significant SOA contributions in Beijing (Lv et al., 2019; Ji et al., 2018). The OC/EC ratio in this study was 3.88 on average, suggesting a significant contribution of SOA in Beijing aerosols, which is consistent with the results of Section 3.5. The aliphatic carbonyls, which have both primary and secondary sources (Lyu et al., 2019a, b), have ratios ranging from 1.6 (n-alkanals) to 2.8 (n-alkan-2-ones). This result is consistent with Section 3.5, in which Sections 2 and 3 of the chromatogram, which contain alkanals (C15 ≤ Cn ≤ C25) and alkanones (C15 ≤ Cn ≤ C25), were found to have slightly higher concentrations on haze days than on non-haze days. However, the low-ratio alkanal and alkanone compounds are quite readily oxidized (Chacon-Madrid et al., 2010; Chacon-Madrid and Donahue, 2011), and a low ratio may reflect a high degree of further processing to form more oxidized species on the haze days, compensating for enhanced formation.
There are no compounds in Table S3 certain to be exclusively secondary. However, the results in Fig. 6 show an appreciable elevation in more polar compounds (upper part of the chromatogram) on haze days, suggestive of a greater relative abundance of more oxidized, possibly secondary compounds in the haze. The ratio of average PM2.5 mass between haze and non-haze days was 4.0, and it was 2.7 for organic carbon (Table S1). The ratio for organic matter would be greater than 2.7 due to a higher OM/OC ratio in secondary compounds. This strongly suggests that the elevation in secondary species concentrations contributes more than that in primary species during the haze events and that much of the mass lies outside of the chromatogram due to the low volatility of the secondary species.

Conclusions
Over 300 polar and non-polar organic compounds were determined in the fine-particle samples from Beijing, and these compounds have been grouped into more than 20 classes, including normal and branched alkanes, n-alkenes, aliphatic carbonyl compounds (1-alkanals, n-alkan-2-ones and n-alkan-3-ones), n-alkanoic acids, n-alkanols, PAHs, oxygenated PAHs (O-PAHs), alkylated-(PAHs and O-PAHs), hopanes, n-Cn-benzene, alkyl-benzenes, n-Cn-cyclohexane, pyridines, quinolines, furanones and biomarkers (levoglucosan, cedrol, phytane, pristane, supraene and phytone). The total concentrations of identified organic compounds ranged from 0.94 to 5.14 µg m−3, with an average of 2.84 ± 1.19 µg m−3, accounting for 9.40 % of OM mass. The six groups which accounted for 66 % of total identified organic compound mass included n-alkanes, levoglucosan, branched alkanes, PAHs, n-alkenes and alkyl-benzenes, and these were significantly impacted by primary emission sources. In addition, the average of total polycyclic aromatic compounds (the sum of PAHs, O-PAHs, alkylated-(PAHs and O-PAHs), alkyl-PHE, and ANT and alkyl-NAP) was 560 ng m−3, accounting for 1.88 % of OM. The comparisons of identified groups between non-haze and haze periods showed that most organic compound groups have a higher concentration on the haze days relative to the non-haze days. The average sum of the identified compounds increased from 1.84 to 3.42 µg m−3 from non-haze days to haze days. A unimodal molecular distribution of alkanes was observed in the range from C8 to C36, and these compounds make significant contributions to atmospheric organic compounds in the range of C19-C28, especially on the haze days. The unidentified compounds in the chromatogram were estimated, and the results show that the average sum of unidentified compounds increased from 2.12 µg m−3 on non-haze days to 3.96 µg m−3 on haze days, accounting for approximately 53.5 % and 53.7 % of total organic compounds, respectively. Finally, the total mass concentrations of measured organic compounds (≥ C6) were 3.96 and 7.39 µg m−3 on the non-haze and haze days, accounting for 26.4 % and 18.5 % of OM mass, respectively, on these days. The remaining mass is that which is not volatile under the conditions of the gas chromatography. The higher percentage of non-GC-volatile organic matter on haze days is indicative of a greater degree of oxidation of the organic aerosol, consistent with the difference in the chromatogram between haze and non-haze days. The greater contribution of secondary constituents during haze events was reported previously by Huang et al. (2014) and Ma et al. (2017) but not the greater extent of oxidation of organic matter. In a modelling study, Li et al. (2017)  secondary PM2.5 had formed 1 or more days prior to arrival, hence explaining its highly oxidized condition.
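The mass balance stated in the conclusions can be restated numerically. The Python sketch below derives the non-GC-volatile percentage of OM and the OM concentration implied by the quoted figures. The variable names are illustrative, and the implied OM values are back-calculated here from rounded numbers rather than taken from the original data, so they are approximate.

```python
# Measured organic compounds (>= C6) in the chromatogram (ug m^-3)
# and their reported share of OM mass (%), as quoted in the text.
measured = {"non_haze": 3.96, "haze": 7.39}
share_of_om = {"non_haze": 26.4, "haze": 18.5}

# Percentage of OM that is not volatile under the GC conditions.
non_gc_volatile = {p: 100.0 - share_of_om[p] for p in measured}

# OM concentration implied by the two quoted figures (back-calculated).
implied_om = {p: measured[p] / (share_of_om[p] / 100.0) for p in measured}

for period in ("non_haze", "haze"):
    print(f"{period}: non-GC-volatile = {non_gc_volatile[period]:.1f} % of OM, "
          f"implied OM = {implied_om[period]:.1f} ug m^-3")
```

On these figures, roughly 74 % of OM on non-haze days and 82 % on haze days lies outside the chromatogram.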
Author contributions. The study was conceived by RMH and ZS, and the fieldwork was organized and supervised by ZS and PF. TVV and DL undertook air sampling work and general data analyses for the campaign, while RL carried out analytical work on the Beijing samples under the guidance of MSA and CS. XW contributed analyses of data. RL produced the first draft of the paper, with guidance from YF and RMH, and all authors contributed to the refinement of the submitted paper.
Competing interests. The authors declare that they have no conflict of interest.
Special issue statement. This article is part of the special issue "In-depth study of air pollution sources and processes within Beijing and its surrounding region (APHH-Beijing) (ACP/AMT interjournal SI)". It is not associated with a conference.

Figure 1. The percentages of the organic compound groups in the total identified organic compounds.

Figure 2. A comparison of organic compound groups between non-haze and haze days. The average total concentration of the identified group was calculated in the non-haze (13 d) and haze periods (20 d).

Figure 6. The concentration (ng m−3) sum of identified and unknown organic compounds in each chromatogram image section during (a) non-haze and (b) haze days.

Table 1. Comparison of identified organic compounds with earlier studies in Beijing. Data from the present study are mean ± standard deviation for n = 33 samples.
a Beijing, PKU, heating seasons

Table 2. Molecular formula, diagnostic ions and average concentrations of hopanes identified in PM2.5.

Table 3. Estimated average concentrations of unknown compounds (ng m−3) in each section of the chromatogram for haze and non-haze conditions.