Molecular composition and photochemical evolution of water-soluble organic carbon (WSOC) extracted from field biomass burning aerosols using high-resolution mass spectrometry

Abstract. Photochemistry plays an important role in the evolution of atmospheric water-soluble organic carbon (WSOC), which dissolves into clouds, fogs, and aerosol liquid water. In this study, we tentatively examined the molecular composition and evolution of a WSOC mixture extracted from field-collected wheat straw burning aerosol (WSBA) samples upon photolysis, using direct infusion electrospray ionisation (ESI) coupled to high-resolution mass spectrometry (HRMS) and liquid chromatography (LC) coupled with HRMS. For comparison, two typical phenolic compounds (i.e. phenol and guaiacol) emitted from lignin pyrolysis, in combination with hydrogen peroxide (H2O2) as a typical OH radical precursor, were simultaneously exposed to simulated sunlight irradiation. Their photochemical products, such as phenolic dimers (e.g. m/z 185.0608 for phenol dimer and m/z 245.0823 for guaiacol dimer) or their isomers, were also observed in field-collected WSBA samples, suggesting that aqueous-phase reactions might contribute to the formation of emitted biomass burning aerosols. The aqueous photochemistry of both the phenols (photooxidation) and WSBA extracts (direct photolysis) could produce a series of highly oxygenated compounds, which in turn increase the oxidation degree of the organic composition and the acidity of the bulk solution. In particular, the LC/ESI-HRMS technique revealed significant photochemical evolution of the WSOC composition in WSBA samples, e.g. the photodegradation of low oxygenated species and the formation of highly oxygenated products. We also tentatively compared the mass spectra of photolytic time-profile WSBA extracts with each other for a more comprehensive description of the photolytic evolution. The calculated average oxygen-to-carbon ratio (O/C) of oxygenated compounds in the bulk extract increases from 0.38 ± 0.02 to 0.44 ± 0.02 (mean ± standard deviation), while the intensity (S/N)-weighted average O/C (O/Cw) increases from 0.45 ± 0.03 to 0.53 ± 0.06 as the time of irradiation extends from 0 to 12 h. These findings indicate that the water-soluble organic fraction of combustion-derived aerosols has the potential to form more oxidised organic matter, contributing to the highly oxygenated nature of atmospheric organic aerosols.
J. Cai et al.: Molecular composition and photochemical evolution of WSOC

Despite its significance, little is known about the chemical composition and sources of WSOC, with less than 10 %-20 % of the organic mass being structurally identified (Cappiello et al., 2003; Fu et al., 2015). Biomass burning is a well-known emission source of WSOC (Anastasio et al., 1997; Fine et al., 2001; Graham et al., 2002; Gilardoni et al., 2016). Although the composition varies with fuel type and combustion conditions (Simoneit, 2002; Smith et al., 2009), the WSOC mixture often covers a common range of polar and oxygenated aromatic compounds (Mayol-Bracero et al., 2002; Duarte et al., 2007; Chang and Thompson, 2010; Yee et al., 2013; Gilardoni et al., 2016), with molecules incorporating different numbers of functional groups such as hydroxyl, carboxyl, aldehyde, ketone, ester, amino, and/or other nitrogen-containing groups. In particular, lignin pyrolysis often yields a large amount of aromatic alcohols, carbonyls, and acids (Chang and Thompson, 2010; Gilardoni et al., 2016). Once dissolved in cloud, fog, and even aerosol liquid water, these substances can undergo aqueous-phase reactions under sunlight irradiation to generate low-volatility species, which have the potential to form secondary organic aerosol (SOA) after water evaporation (Cappiello et al., 2003; Duarte et al., 2007; Sun et al., 2010; Yu et al., 2014).
Field and laboratory studies have demonstrated that aqueous photochemical processes contribute significantly to aqueous SOA formation from biomass burning precursors and to the evolution of smoke particles (Sun et al., 2010; Lee et al., 2011; Kitanovski et al., 2014; Yu et al., 2014; McNeill, 2015; Gilardoni et al., 2016). Gilardoni et al. (2016) observed aqueous SOA formation in both fog water and wet aerosols, which enhanced the oxidised organic aerosol; following atmospheric ageing, the overall oxidation degree of the aerosols also increased. In laboratory studies, phenols and methoxyphenols (important biomass burning intermediates) are often used as SOA precursors to examine photochemical evolution in aqueous environments and aerosol-forming potential under relevant atmospheric conditions (Chang and Thompson, 2010; Sun et al., 2010; Yu et al., 2014; Vione et al., 2019). The corresponding photochemical products formed through hydroxylation, oligomerisation, and fragmentation typically cover a series of low-volatility and highly oxygenated species. For instance, methoxyphenol-derived SOA has been proposed as a proxy for atmospheric humic-like substances (HULIS) (Ofner et al., 2011; Yee et al., 2013). Other compounds emitted from lignin pyrolysis, e.g. aromatic aldehydes, ketones, and polycyclic aromatic hydrocarbons (PAHs), have also been found to produce coloured products via aqueous photooxidation, which may become a part of HULIS (Anastasio et al., 1997; Chang and Thompson, 2010; Haynes et al., 2019). In addition, photochemical processing of common water-soluble aliphatic compounds such as aldehydes (Lim and Turpin, 2015), polyols (Daumit et al., 2014), and organic acids (Griffith et al., 2013) in aqueous solution can also lead to the formation of oligomers and of highly oxygenated, multifunctional organic matter (McNeill, 2015).
In recent years, high-resolution mass spectrometry (HRMS) has been commonly applied to study the organic molecular composition of cloud water (Zhao et al., 2013; Boone et al., 2015), fog water (Cappiello et al., 2003), rainwater (Altieri et al., 2009a, b), laboratory-generated SOA (Bateman et al., 2011; Romonosky et al., 2015; Lavi et al., 2017), and field-collected aerosol samples (Lin et al., 2012a, b; Kourtchev et al., 2013; Tong et al., 2016; Wang et al., 2017). It has also been used in time-profile observations of the photochemical evolution of aqueous extracts from laboratory-generated SOA (Bateman et al., 2011; Romonosky et al., 2015). However, direct infusion mass spectrometry (MS) methods are prone to ion suppression caused by other organic species, inorganic salts, and adduct formation (Kourtchev et al., 2013). Liquid chromatography (LC) coupled with HRMS can therefore be a powerful complementary tool for relieving ion suppression, owing to its ability to separate different kinds of compounds by LC retention time before they are analysed (Kourtchev et al., 2013; Wang et al., 2016). It can also provide additional information for identifying possible isomers among ions with the same mass-to-charge ratio (m/z).
To our knowledge, the aqueous photochemical evolution of WSOC extracted from real ambient aerosols has not been studied in detail at the molecular level. Our previous study revealed that the ultraviolet-visible (UV-vis) absorption spectra of aqueous extracts from field biomass burning aerosols were modified under simulated sunlight illumination (Cai et al., 2018). Building on those previously studied field-collected samples, the present study investigates the molecular characteristics of the water-soluble organic molecules and their photochemical evolution, using electrospray ionisation (ESI)-HRMS and LC/ESI-HRMS performed in negative ionisation mode. For comparison, we also evaluated the photochemistry of phenol and guaiacol (representing the basic structures of phenols emitted from lignin pyrolysis) under laboratory conditions, and tentatively traced some of their photochemical products (e.g. dimers) in the field-collected samples under study.

The wheat straw burning aerosol (WSBA) samples were collected during the summer harvest season of 2013 at rural fields on the North China Plain, where wheat was the main agricultural crop (Cai et al., 2018). To facilitate subsequent planting and management, a large amount of fresh wheat straw was burned directly in the field during the harvest season, and the water released from the burning plant material could provide a suitable environment for aqueous photochemistry of dissolved compounds. The WSBA samples selected for HRMS analysis were collected from two sampling sites, located at rural fields in Wenxian in Henan Province (denoted HNWX) and Daming in Hebei Province (denoted HBDM). As described in Cai et al. (2018), the selected sampling sites were mainly affected by heavy smog from wheat straw burning (Fig. 1).
The emitted fine particulate matter with aerodynamic diameter ≤ 2.5 µm (PM2.5) was collected at a flow rate of 5 L min⁻¹ by a portable particulate sampler (MiniVol TAS, AirMetrics, USA), with quartz fibre filters (47 mm in diameter, QMA, Whatman, UK) baked at 600 °C for 6 h before sampling. The sampling flow rate was calibrated with a standard flow meter (Bios Defender 520), and the sampling time of each filter was restricted to 30-60 min depending on the ambient biomass burning aerosol concentration and expected filter loading (Cai et al., 2018). After collection, the filter samples were kept in the dark during transport to the laboratory and then stored at −20 °C under light-proof conditions. The preparation of WSOC extracts and the measurements of carbon content, including organic carbon (OC), elemental carbon (EC) (Zhi et al., 2014), and WSOC, were described in detail in Cai et al. (2018). Briefly, a part of each quartz fibre filter (1.6-3.2 cm²) was placed in a brown vial and extracted twice with ultrapure water (Milli-Q, Millipore), each time using 5 mL of ultrapure water with 30 min of ultrasonic agitation. The two extracts were combined and filtered through a PTFE syringe filter (0.2 µm pore size, Thermo Scientific), followed by a pH measurement with a pH meter (Mettler Toledo SevenEasy™ S20) that had been regularly calibrated at pH 4.00 and 6.86. Prior to analysis, the extracts were stored at −20 °C in the dark. To reduce WSOC mass loss, no desalting treatment (e.g. solid phase extraction, SPE) was performed on these samples.

Direct photolysis of WSOC extracts
A 12 h direct photolysis of WSOC extracts obtained from WSBA samples was performed in a photo-reactor (BL-GHX-V, Bilon Instruments Co. Ltd., China; see Fig. S1 in the Supplement) equipped with a solar simulator (Xe lamp, 1000 W) placed in a double-deck quartz condenser (Cai et al., 2018). Cooling water (18 °C) circulated in the outer tube of the condenser to avoid heating of the samples. In the wavelength range of 310-400 nm relevant to the boundary layer of the atmosphere, the actinic flux of the lamp is about 5 times stronger than the solar actinic flux, meaning that the spectral evolution during the 12 h simulated solar irradiation might be equivalent to the effect of at least 60 h of actual sunlight irradiation (Cai et al., 2018). Airtight quartz tubes (1.5 cm in diameter, 3 mL solution per tube) loaded with extracts were arranged equidistantly around the lamp. Each extract was distributed into three tubes corresponding to three different irradiation times, i.e. 0, 4, and 12 h, with no oxidants added externally throughout the whole photolytic process. At each irradiation time point (e.g. 0 and 4 h), the related tubes were wrapped with aluminium foil and left in their initial location until the end of the 12 h photolysis (Cai et al., 2018).
As described in Cai et al. (2018), the water extraction resulted in a dilution of the collected organic compounds; however, the ratio of the water mass to PM2.5 mass for extract samples (ranging from 1.8 × 10³ to 3.4 × 10⁴) was compatible with the ratio of water mass to WSOC content in cloud water (in a wide range from 1.4 × 10² to 1.6 × 10⁴) (Li et al., 2017), indicating that the present aqueous extracts are relevant to atmospheric cloud water conditions.

Photooxidation of phenolic compounds under laboratory conditions
Initial solutions of 0.1 mM phenol (C6H6O) and 0.1 mM guaiacol (C7H8O2), each in combination with an OH radical precursor (0.1 mM H2O2), were prepared in ultrapure water (Milli-Q, Millipore). The pH of each solution was adjusted to 5 with 0.1 M sulfuric acid (H2SO4), a value relevant to the acidity of fog and cloud waters (Collet et al., 1998; Fahey et al., 2005). The prepared solutions and a reference blank were exposed to simulated sunlight for 4 h. Here, we mainly focus on characterising the aqueous products of the phenols and on tentatively identifying whether certain tracer compounds (e.g. phenolic dimers) exist in the present biomass burning particulate samples.

Sample analysis
The direct infusion MS analysis was conducted using a Thermo Scientific Orbitrap Fusion Tribrid mass spectrometer equipped with quadrupole, Orbitrap, and linear ion trap mass analysers and a heated ESI source. To assist ionisation and desolvation, each sample was mixed 1 : 1 (by volume) with acetonitrile. Full-scan mass spectra were acquired in negative ionisation mode, with a resolution of 120 000 at m/z 200 for the Orbitrap analyser and a mass scan range of m/z 50-750. Before the measurements, the Orbitrap analyser was externally calibrated for mass accuracy using a Thermo Scientific Pierce LTQ Velos ESI calibration solution. The direct infusion parameters were as follows: sample flow rate 5 µL min⁻¹; capillary temperature 300 °C; S-lens RF 65 %; spray voltage −3.5 kV; sheath gas, auxiliary gas, and sweep gas flows of 10, 3, and 0 arbitrary units, respectively. Data were collected once the intensity of the total ion current (TIC) remained constant, with a relative standard deviation (RSD) under 5 %. At least 100 data points (mass spectral scans) were collected for each test sample, and each exported mass spectrum for analysis was the average of 100 spectra.

The LC/ESI-HRMS analysis, also operated in negative ionisation mode, was performed using a U3000 system coupled with a T3 Atlantis C18 column (3 µm; 2.1 × 150 mm; Waters, Milford, USA) and an Orbitrap Fusion MS. A 10 µL sample was injected, with a flow rate of 0.2 mL min⁻¹ for the mobile phase, which consisted of H2O (A) and acetonitrile (B). The gradient applied was 0-5 min at 3 % B and 5-20 min from 3 % to 95 % B (linear); it was then held for 25 min at 95 % B, returned from 95 % to 3 % B over 45-50 min, and finally held for 10 min at 3 % B (total run time 60 min).
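The gradient programme can be written down as a small piecewise-linear function, which is convenient for checking the mobile-phase composition at a given retention time. This is only an illustrative sketch: the function name `percent_b` is hypothetical, and the 45-50 min return to 3 % B is assumed to be linear, which the text does not state explicitly.

```python
def percent_b(t_min: float) -> float:
    """Return the acetonitrile fraction (% B) at time t_min (minutes)."""
    # (time in min, % B) breakpoints taken from the gradient described in the text.
    # Assumption: every ramp between breakpoints is linear.
    program = [(0, 3), (5, 3), (20, 95), (45, 95), (50, 3), (60, 3)]
    if not 0 <= t_min <= 60:
        raise ValueError("time must lie within the 60 min run")
    for (t0, b0), (t1, b1) in zip(program, program[1:]):
        if t0 <= t_min <= t1:
            # Linear interpolation inside the current segment
            return b0 + (b1 - b0) * (t_min - t0) / (t1 - t0)
    raise AssertionError("unreachable: the program covers 0-60 min")


if __name__ == "__main__":
    for t in (0, 12.5, 18.3, 30, 47.5, 60):
        print(f"{t:5.1f} min -> {percent_b(t):5.1f} % B")
```

Under this assumption, the peaks discussed later at RT 16-20 min elute on the rising part of the gradient.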

Data processing
Mass spectral peaks with a signal-to-noise ratio (S/N) above 3 were extracted from the raw files. Peaks present in both sample and blank spectra were retained only if their intensity in the sample was 5 times larger than in the blank. Molecular formulas were assigned from the accurate mass using Xcalibur software (V3.0, Thermo Scientific) with the following constraints: ¹²C ≤ 50, ¹³C ≤ 1, ¹H ≤ 100, ¹⁶O ≤ 50, ¹⁴N ≤ 4, ³²S ≤ 1, and ³⁴S ≤ 1. All mathematically possible elemental formulas within a mass tolerance of ±3 ppm were calculated. Elemental formulas containing ¹³C or ³⁴S were checked for the presence of their ¹²C or ³²S counterparts, respectively. If no corresponding mono-isotopic formula was found, the assignment with the next larger mass error was considered. Isotopic and unassigned peaks were excluded from further analysis.
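The ±3 ppm criterion can be illustrated with a short calculation of the theoretical m/z of a deprotonated ion and its deviation from a measured value. The sketch below is not the Xcalibur procedure; the helper names are hypothetical, the isotope masses are standard monoisotopic values, and singly charged [M−H]− ions are assumed throughout.

```python
import re

# Monoisotopic masses (u) of the most abundant isotopes, plus the electron mass.
MASS = {"C": 12.0, "H": 1.00782503, "N": 14.00307401,
        "O": 15.99491462, "S": 31.97207117}
ELECTRON = 0.00054858


def parse_formula(formula: str) -> dict:
    """Turn a neutral formula such as 'C12H10O2' into {'C': 12, 'H': 10, 'O': 2}."""
    counts = {}
    for element, number in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        counts[element] = counts.get(element, 0) + int(number or 1)
    return counts


def deprotonated_mz(formula: str) -> float:
    """Theoretical m/z of the singly charged [M-H]- ion of a neutral formula."""
    neutral = sum(MASS[el] * n for el, n in parse_formula(formula).items())
    # Removing a proton = removing one H atom and giving back its electron
    return neutral - MASS["H"] + ELECTRON


def ppm_error(measured_mz: float, formula: str) -> float:
    """Signed mass error (ppm) of a measured m/z against a candidate formula."""
    theoretical = deprotonated_mz(formula)
    return (measured_mz - theoretical) / theoretical * 1e6


if __name__ == "__main__":
    # Dimer ions quoted in the text, checked against their proposed formulas
    for mz, formula in ((185.0608, "C12H10O2"), (245.0823, "C14H14O4")):
        print(formula, round(deprotonated_mz(formula), 4),
              round(ppm_error(mz, formula), 2), "ppm")
```

Both dimer ions quoted in the text fall within the ±3 ppm window of their proposed formulas under these assumptions.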
Ions were also characterised by the number of rings plus double bonds (i.e. double-bond equivalents, DBEs), which were calculated as DBE = c − h/2 + n/2 + 1 for an elemental composition of CcHhOoNnSs. The assigned formulas were additionally checked against the nitrogen rule. For ambient samples, based on the elements present in a molecule, the identified elemental formulas were classified into several main compound classes: CHO (i.e. molecules containing only C, H, and O atoms), CHOS, CHON, and CHONS, and others including CHN and CHS. In the present study, because almost all detected water-soluble ions were below m/z 400, we focused our molecular analysis on m/z 50-400.
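As a worked illustration of the DBE formula and the nitrogen rule, the following sketch evaluates both for a few neutral formulas. The function names are hypothetical, and applying the checks to the neutral molecule rather than to the detected ion is an assumption made here for simplicity.

```python
def dbe(c: int, h: int, n: int = 0) -> float:
    """Double-bond equivalents, DBE = c - h/2 + n/2 + 1 (O and S do not contribute)."""
    return c - h / 2 + n / 2 + 1


def passes_nitrogen_rule(c: int, h: int, o: int = 0, n: int = 0, s: int = 0) -> bool:
    """Nitrogen rule for a neutral molecule: the nominal mass is odd
    exactly when the number of N atoms is odd."""
    nominal_mass = 12 * c + h + 16 * o + 14 * n + 32 * s
    return nominal_mass % 2 == n % 2


if __name__ == "__main__":
    print(dbe(6, 6))     # phenol, C6H6O
    print(dbe(7, 8))     # guaiacol, C7H8O2
    print(dbe(12, 10))   # phenol dimer, C12H10O2
    print(passes_nitrogen_rule(6, 5, o=3, n=1))  # a C6H5NO3 formula
```

A formula that fails the parity check also yields a half-integer DBE, so the two tests are closely related for neutral CHONS molecules.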
3 Results and discussion

3.1 Mass spectral characteristics of WSOC extracts from WSBA samples
The preliminary analysis showed that the PM2.5 concentration in ambient air near the burning sites ranged from 6.46 to 28.03 mg m⁻³ (Table S1 in the Supplement). OC was the major component of the collected PM2.5 with a proportion of 50.9 ± 7.6 % (mean ± standard deviation), whereas EC represented a negligible fraction (average 1.3 ± 0.4 %). Meanwhile, WSOC accounted for 35.5 ± 7.5 % of OC in the tested samples.
Although this batch of aerosol samples was collected from different sites, their water extracts showed similar light-absorbing characteristics in UV-vis absorption spectra (Cai et al., 2018). Here, four extract samples (HNWX-1, HNWX-2, HBDM-1, and HBDM-2) (Table S1) were chosen for further analysis using high-resolution mass spectrometry. These samples also exhibited similar patterns in the mass distribution of water-soluble molecular species, which mainly range from 50 to 400 Da, indicating a similar burning source for these samples. A reconstructed (blank-subtracted) mass spectrum for one representative sample, HNWX-1, is shown in Fig. 2a (others are shown in Fig. S2). In the mass range of 50-400 Da, 827 ± 44 molecular formulas were identified across all samples, and most of the formulas (above 75 %) overlapped between the analysed samples. The classification of assigned compounds for the analysed extracts is shown in Table S2. By number of assigned formulas, CHO compounds were the most abundant group, accounting for 59.2 ± 2.2 % of the total assignments, followed by CHON (35.0 ± 2.2 %). These results are consistent with previous observations of laboratory-generated biomass burning aerosol and field particulate samples influenced by biomass combustion, in spite of differences in biomass varieties, extraction solvents, and HRMS techniques between the present and previous studies.
On the other hand, CHOS and CHONS compounds contributed less than 5 % to the total assignments. A number of studies have shown the wide presence of organosulfates and nitrooxy-organosulfates in urban (Lin et al., 2012b; Wang et al., 2016), rural (Lin et al., 2012a), and forest aerosols (Kourtchev et al., 2013) and even in cloud water (Boone et al., 2015); however, most of these compounds were not observed in our negative mass spectra. This could be accounted for by the low extent of aerosol evolution, due to the limited oxidation conditions available for the formation of organosulfates and nitrooxy-organosulfates in fresh smoke aerosols. For example, laboratory studies have observed significant formation of organosulfates via photooxidation in the presence of acidic sulfate aerosol (with a significant level of SO2) (Surratt et al., 2007, 2008). All detected ion species with formula assignments in the present samples are listed in Table S3. In general, CHN and CHS compounds are not ionised well in negative ESI mode, which could be a reason why these species were not the most prevalent compounds in this study. It should also be noted that the negative ionisation mode selectively detects molecules containing polar functional groups (e.g. -OH and -COOH) that can be readily deprotonated. There are a number of compounds that are not easily deprotonated and might show up preferentially in positive ionisation mode (e.g. amines). Furthermore, the formulas detected by HRMS potentially contain multiple structural isomers; therefore, the actual number of water-soluble organic species is expected to be underestimated. The additional LC/ESI-HRMS analysis operated in negative mode confirmed that a substantial number of ion masses (e.g. assigned CHO and CHON compounds) contain more than one structural isomer, which could be observed at different retention times (RTs) in chromatograms.
Two representative groups of extracted chromatograms for CHO ([C7H5On]−, n = 2-4) and CHON ([C7H5OnN]−, n = 1-3) compounds are shown in Figs. S3 and S4, respectively, where increasing the O or N atom number in a molecule might lead to more isomer peaks. However, it should be noted that these LC-separated peaks might also include other unidentified compounds that were outside of the elemental assignment considered in this study. Additionally, low mass loading and potential decomposition during ionisation can also limit the detection of some high-molecular-weight species.
The interpretation of the complex organic mass spectra generated by high-resolution mass spectrometry can be simplified by plotting the hydrogen-to-carbon ratio (H/C) against the oxygen-to-carbon ratio (O/C) for individual assigned formulas in the form of a Van Krevelen (VK) diagram (e.g. Lin et al., 2012a; Kourtchev et al., 2013). (Mazzoleni et al., 2012; Kourtchev et al., 2013). The average DBE showed relatively high values of 5.5 for CHO compounds and 6.1 for CHON compounds (Table S2), suggesting that unsaturated organic species were abundant in the present samples, and their presence could partially account for the strong light-absorbing feature in the near-UV region observed in our previous study (Cai et al., 2018).
Throughout the extract samples, the average H/C and O/C values ranged from 1.26 ± 0.38 to 1.31 ± 0.40 and from 0.34 ± 0.24 to 0.42 ± 0.29 for CHO compounds and from 1.19 ± 0.32 to 1.23 ± 0.35 and from 0.28 ± 0.17 to 0.29 ± 0.15 for CHON compounds (Table S2), respectively. Although the ESI analysis was performed in the negative ionisation mode, the measured O/C exhibits rather low values, which fall in the range of O/C ratios typical for biomass burning organic aerosol derived from positive ionisation mode (Aiken et al., 2008; Kourtchev et al., 2016). Due to fresh emission and a smaller ageing effect, the present O/C was clearly lower than the O/C of long-range-transported biomass burning aerosols. The carbon oxidation state (OSC) has been observed to increase with oxidation for atmospheric organic aerosol and is strongly linked to aerosol volatility (Kroll et al., 2011). OSC for each molecular formula can be calculated using the following equation:

$$\mathrm{OS_C} = -\sum_i \mathrm{OS}_i \, \frac{n_i}{n_\mathrm{C}},$$

where $\mathrm{OS}_i$ is the oxidation state associated with the non-carbon element $i$ and $n_i/n_\mathrm{C}$ is the molar ratio of element $i$ to carbon within the molecule (Kroll et al., 2011; Kourtchev et al., 2013). Considering that nitrogen and sulfur atoms can present multiple oxidation states, OSC was calculated and analysed only for CHO compounds in this study. A similar pattern of OSC values versus the number of carbon atoms (nC) was observed for CHO compounds detected in the present WSBA samples (Figs. 3 and S5). From Figs. 3 and S5, it can be seen that the OSC of each sample ranges mainly from −1.5 to +1, with the average ranging from −0.6 to −0.4. Consistent with previous studies (Kroll et al., 2011; Kourtchev et al., 2016), the majority of molecules with OSC < 0 (less oxidised organics) and fewer than 20 carbon atoms are suggested to be associated with the primary organic aerosols emitted from biomass burning. A minor fraction of molecular formulas with OSC ≥ 0 might be associated with semivolatile and low-volatility oxidised organic aerosols (Kroll et al., 2011). Figure 3 also shows the plot of OSC versus nC for products obtained from the photooxidation of phenol and guaiacol, respectively, and their comparison with WSBA samples will be discussed in Sect. 3.3.
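For CHO compounds the carbon oxidation state reduces to a simple expression, because hydrogen is taken as +1 and oxygen as −2 in the approach of Kroll et al. (2011), giving OSC = 2 O/C − H/C. The sketch below applies this to a few formulas mentioned in the text; the function name `os_c` is hypothetical and the code is only an illustration of the arithmetic.

```python
def os_c(c: int, h: int, o: int) -> float:
    """Carbon oxidation state of a CHO formula: OS_C = 2*O/C - H/C.

    Follows from OS_C = -sum_i OS_i * n_i / n_C with OS_H = +1 and OS_O = -2.
    """
    if c <= 0:
        raise ValueError("formula must contain carbon")
    return 2 * o / c - h / c


if __name__ == "__main__":
    for name, (c, h, o) in {
        "phenol C6H6O": (6, 6, 1),
        "guaiacol C7H8O2": (7, 8, 2),
        "phenol dimer C12H10O2": (12, 10, 2),
        "oxalic acid C2H2O4": (2, 2, 4),
    }.items():
        print(f"{name}: OS_C = {os_c(c, h, o):+.2f}")
```

The two precursors fall at about −0.7 and −0.6, whereas a small, highly oxygenated acid such as oxalic acid lies at the opposite end of the scale.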

Mass spectral characteristics of the products from photooxidation of phenolic compounds in the aqueous phase
Phenol and guaiacol were chosen as two representative model compounds derived from biomass combustion. Two high-resolution mass spectra of aqueous phenol and guaiacol exposed to OH radicals for 4 h are shown in Fig. S6, where 435 CxHyOz molecular formulas (m/z 90-500) were assigned for product ions of phenol (with C3-C24) and 624 CxHyOz formulas (m/z 90-600) were assigned for product ions of guaiacol (with C3-C27). The average H/C and O/C values were 0.79 ± 0.28 and 0.52 ± 0.23 for phenol, and 0.88 ± 0.24 and 0.59 ± 0.24 for guaiacol, respectively. Clearly, the photochemical processing induced by OH oxidation resulted in an increase in the average O/C of product molecules relative to their precursors (O/C = 0.17 for phenol and O/C = 0.29 for guaiacol). The formation mechanisms of a series of oxygenated products, e.g. phenolic oligomers, hydroxylated phenolic species, and ring-opening and highly oxygenated compounds, are proposed in the literature (e.g. Sun et al., 2010; Chang and Thompson, 2010; Yu et al., 2014). The OH-initiated reactions would result in enhanced hydroxylation of the aromatic ring as well as in increased yields of carboxylic acids and toxic dicarbonyl compounds (Sun et al., 2010; Yu et al., 2014; Prasse et al., 2018). For example, some highly oxygenated C2-C5 aliphatic compounds (e.g. C2H2O4, C3H4O4, C4H6O4, and C5H6O5) corresponding to carboxylic acids were clearly observed in the mass spectra of the present photochemical products. The occurrence of these oxygenated products not only directly increased the degree of oxygenation in the bulk solution composition but also contributed to the variation in solution acidity. After the 4 h photochemical process, the pH values of the irradiated solution were significantly lower than the pH values of the solution prior to irradiation (t test, p < 0.05), and the calculated acidities ([H+]) of the bulk solution increased by (2.96 ± 0.15) × 10⁻⁵ M and (4.26 ± 0.16) × 10⁻⁵ M for phenol and guaiacol, respectively.
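The relation between a measured pH and the acidity [H+] used above is [H+] = 10^(−pH). As a rough illustration only, the sketch below converts the reported increases in [H+] into a final pH, assuming an initial pH of exactly 5.00 (the text states only that the solutions were adjusted to pH 5, so the resulting values are illustrative, not reported results). The function names are hypothetical.

```python
import math


def h_concentration(ph: float) -> float:
    """Hydrogen ion concentration (mol/L) corresponding to a pH value."""
    return 10 ** (-ph)


def ph_after_increase(initial_ph: float, delta_h: float) -> float:
    """pH after [H+] rises by delta_h (mol/L) from the level set by initial_ph."""
    return -math.log10(h_concentration(initial_ph) + delta_h)


if __name__ == "__main__":
    # Assumption: initial pH is exactly 5.00; delta values are those in the text
    for name, delta in (("phenol", 2.96e-5), ("guaiacol", 4.26e-5)):
        print(name, round(ph_after_increase(5.00, delta), 2))
```

Under this assumption, the reported acidity increases correspond to a drop of roughly 0.6 to 0.7 pH units.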
The oligomerisation induced by photochemical transformation of phenolic substances is an important formation pathway for low-volatility, light-absorbing compounds. Here, phenolic dimers (i.e. C12H10O2 for phenol dimer and C14H14O4 for guaiacol dimer) and higher oligomers (e.g. C18H14O3 and C24H18O4 for phenol trimer and tetramer, C21H20O6 for guaiacol trimer), as well as their hydroxylated species, were observed. The formation mechanism can be ascribed to C-O or C-C coupling of phenoxy radicals that were formed via H abstraction from the phenols or OH addition to the aromatic ring (Net et al., 2009; Sun et al., 2010). Reaction at the para position, or para-para coupling, was more likely to occur due to a higher probability of the free electron residing at this position (Lavi et al., 2017) or to weaker steric hindrance at the para position.
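The oligomer formulas listed above follow a simple bookkeeping rule: each new C-C or C-O bond formed by radical coupling removes two hydrogen atoms from the summed monomer formulas. A minimal sketch of that rule is given below; the function name `oligomer_formula` is hypothetical and hydroxylated variants are not covered.

```python
def oligomer_formula(c: int, h: int, o: int, n_units: int) -> tuple:
    """(C, H, O) counts of an n-unit oligomer built by radical coupling.

    Each of the (n_units - 1) new bonds costs two H atoms.
    """
    if n_units < 1:
        raise ValueError("n_units must be at least 1")
    return (c * n_units, h * n_units - 2 * (n_units - 1), o * n_units)


if __name__ == "__main__":
    phenol = (6, 6, 1)     # C6H6O
    guaiacol = (7, 8, 2)   # C7H8O2
    for n in (2, 3, 4):
        print("phenol", n, oligomer_formula(*phenol, n))
    for n in (2, 3):
        print("guaiacol", n, oligomer_formula(*guaiacol, n))
```

The rule reproduces the dimer, trimer, and tetramer formulas quoted in the text for both precursors.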

Comparison of the photochemical products of phenolic compounds and the CHO composition in WSOC extracts from WSBA samples
Compared to the CHO compounds detected in WSOC extracts, the photochemical products of the two phenols under study showed a higher O/C and a lower H/C value. After 4 h of photooxidation, the average OSC of the photochemical products rose to +0.2 for phenol (precursor OSC = −0.7) and +0.3 for guaiacol (precursor OSC = −0.6), showing a distinctly higher degree of oxidation than the present WSBA samples. In Fig. 3, more species with OSC < 0 (especially OSC < −0.5) are present in the field sample (HBDM-1), while species with OSC ≥ 0 are prevalent in the photochemical products of phenol and guaiacol. The single-precursor systems in the laboratory did not completely reflect the CHO composition of water-soluble extracts from real straw-burning samples, which contained a myriad of precursors and unknown substances from the atmospheric background, soil, and other sources. Considering that a large number of phenols and methoxyphenols exist in straw-burning smoke and that they have the potential to undergo photochemical ageing, the nature of emitted primary organic aerosols is reasonably more complicated than that of simulated products derived from single-precursor systems. The extracted LC chromatograms of m/z 185.0608 and 245.0823 are shown in Fig. 4a and b, respectively, where both ions involve dimers of phenol and guaiacol with several structures and/or other isomers. The presence of guaiacol dimer and syringol dimer was previously observed in aerosol samples largely affected by wood combustion. Based on aerosol mass spectrometer (AMS) analysis, these two dimers were suggested as markers of biomass burning aerosols (Sun et al., 2010; Yu et al., 2014). The phenolic dimer ions (m/z 185.0608 and 245.0823) were also observed in the mass spectra of the present biomass burning aerosols, but the extracted LC chromatograms shown in Fig. 4 indicate that these ions contain multiple RT peaks. The peaks at RT 18.3 and 19.2 min, which are assumed to be phenol dimers, were observed both during the photochemical transformation of phenol (Fig. 4a) and in the WSBA samples. Meanwhile, the present particle extracts may also contain guaiacol dimer, since m/z 245.0823 has two LC peaks emerging at RT 17.7 and 19.5 min (Fig. 4b), the same as the peaks identified during the photochemical transformation of guaiacol. Considering that a substantial amount of moisture in the plant body (Bi et al., 2009) was discharged during straw combustion, the occurrence of phenolic dimers might indicate that aqueous-phase reactions played an important role in the formation and evolution of the emitted aerosol organic composition. Typical hydroxylated species, such as C2H2O4, C6H6O2, C7H6O3, and C7H8O3, were also found both in the samples from photooxidation of the phenols and in the WSBA samples. The comparison of the photochemical products from phenols with the WSBA samples revealed significant differences between them, pointing to the importance of studying real aerosol samples alongside laboratory model compounds. Nevertheless, model compounds remain helpful as a reference proxy for real aerosol samples. To this end, it is worth noting that other phenols and methoxyphenols (e.g. acetosyringone, vanillin) that dissolve into cloud, fog droplets, or aerosol liquid water can potentially be photochemically transformed and contribute to SOA formation (Zhou et al., 2019).
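The retention-time comparison between the laboratory products and the field extracts amounts to matching peaks of the same m/z within an RT tolerance. The sketch below shows one way this could be done; the function name, the 0.1 min tolerance, and the example sample RTs are all hypothetical and are not taken from the study, which compared chromatograms directly.

```python
def match_peaks(reference_rts, sample_rts, tol=0.1):
    """Pair each reference RT with every sample RT within `tol` minutes."""
    return [(ref, rt) for ref in reference_rts for rt in sample_rts
            if abs(ref - rt) <= tol]


if __name__ == "__main__":
    # Reference RTs (min) of the m/z 185.0608 peaks from phenol photooxidation (text)
    phenol_dimer_rts = [18.3, 19.2]
    # Hypothetical RTs (min) of m/z 185.0608 peaks in a field extract
    sample_rts = [16.9, 18.32, 19.17]
    print(match_peaks(phenol_dimer_rts, sample_rts))
```

Matching on both accurate mass and retention time supports, but does not prove, a shared structure, which is why the dimer assignments are described as tentative.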

Photolysis of WSOC extracts from WSBA samples
Although the direct photolysis of the WSOC extracts from the WSBA samples was performed under simulated sunlight irradiation without adding any oxidants, photooxidation still occurred, since the particle extracts were very likely to include various oxidants, e.g. singlet molecular oxygen (1O2), peroxides, hydroxyl radical (OH), or excited triplet states of organics produced under light excitation (Anastasio et al., 1997; Vione et al., 2006; Net et al., 2009, 2010a; Bateman et al., 2011; Rossignol et al., 2014; Gómez Alvarez et al., 2012). In particular, the excited triplet state of aromatic carbonyls (e.g. 3,4-dimethoxybenzaldehyde) (Net et al., 2010b) was found to be more efficient than OH radicals in oxidising phenols and producing hydroxylated species (Yu et al., 2014). This photosensitised reaction is likely to play an important role in the WSOC evolution, owing to the high quantities of aromatic carbonyls present in the extracts of biomass burning aerosols.
The variation in peak abundance at specific retention times in the chromatogram can reflect the extent of the evolution of WSOC molecules with accurately determined molecular masses, although no standards were available for absolute quantification. The LC/ESI-HRMS analysis revealed clear changes in the molecular features of some CHO species, i.e. the photodegradation of low oxygenated compounds and the formation of highly oxygenated compounds. Table 1 lists the CHO compounds whose LC peak intensities significantly increased or decreased after the 12 h photolysis.

Photodegradation of low oxygenated compounds and formation of highly oxygenated compounds
As shown in Table 1, ion masses assigned to highly unsaturated and low oxygenated species (O/C < 0.5) are prone to photodegradation, especially C7-C9 compounds (possibly aromatic species), the intensity of which decreased by nearly 1 order of magnitude. For example, for m/z 123.0450 ([C7H7O2]−), as shown in Fig. 5a, the peaks at RT 16.2 and 16.7 min in the LC chromatogram decreased in area by 95 % after the 12 h irradiation. Comparison with a standard verified that neither peak belonged to guaiacol (peak at RT 17.3 min), but both were also found among the products of guaiacol photooxidation, suggesting that they might be isomers of guaiacol or aromatic dihydric alcohols. The phenolic dimers (C12H10O2 and C14H14O4) described above also showed a decreasing tendency, almost completely disappearing after the 12 h direct photolysis. The photochemical processing led to an increased formation of low MW compounds (e.g. C2-C5 species) with a relatively high O/C. For example, the C2 compounds (Fig. S8), which may correspond to glyoxylic acid, glycolic acid, acetic acid, and oxalic acid, were likely formed via the oxidation of several photochemically reactive water-soluble molecules, e.g. glyoxal (Carlton et al., 2007; Lim et al., 2010), methylglyoxal (Altieri et al., 2008; Lim et al., 2010), pyruvic acid (e.g. Grgic et al., 2010; Griffith et al., 2013; Reed Harris et al., 2014; Rapf et al., 2017; Eugene and Guzman, 2017; Mekic et al., 2018, 2019), and phenols (Sun et al., 2010). The presence of these highly oxygenated compounds, which possibly contain acidic groups (e.g. -COOH and -OH), likely contributed to the increase in the solution acidity. Higher levels of other highly oxygenated species such as [C5H7O5]− were also observed (Fig. S9).
To identify the impact of photolysis on the evolution of specific WSOC species, the [C7H7On]− ions in the HBDM-1 sample that showed significant variation were chosen as representative cases for description. Although we could not verify this hypothesis, the oxidised species formed undoubtedly have a high O/C, which highlights the possibility of this reaction pathway.
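The ion assignments discussed in this section can be sanity-checked by comparing the reported m/z values with the theoretical monoisotopic m/z of the proposed ions. The sketch below is a minimal illustration and not the assignment procedure used in the study. The function names are hypothetical, and the atomic masses are standard monoisotopic values. The ion formulas of the two dimers are inferred here by removing one proton from the neutral formulas given in the text (C12H10O2 and C14H14O4), which is an assumption about the negative-mode ions.

```python
import re

# Standard monoisotopic atomic masses (u) and the electron mass (u).
MONOISOTOPIC = {"C": 12.0, "H": 1.00782503, "N": 14.00307400, "O": 15.99491462}
ELECTRON = 0.00054858

def anion_mz(ion_formula):
    """Theoretical m/z of a singly charged anion given its ion formula,
    e.g. 'C7H7O2' for [C7H7O2]-."""
    mass = 0.0
    for element, number in re.findall(r"([A-Z][a-z]?)(\d*)", ion_formula):
        mass += MONOISOTOPIC[element] * (int(number) if number else 1)
    return mass + ELECTRON  # one extra electron carries the negative charge

def ppm_error(measured, theoretical):
    """Relative mass error in parts per million."""
    return (measured - theoretical) / theoretical * 1e6

# m/z values quoted in the text with their (partly inferred) ion formulas.
reported = {
    "C12H9O2": 185.0608,   # phenol dimer, ion formula inferred
    "C14H13O4": 245.0823,  # guaiacol dimer, ion formula inferred
    "C7H7O2": 123.0450,    # ion formula given in the text
}
for ion, mz in reported.items():
    theo = anion_mz(ion)
    print(f"[{ion}]-  theoretical {theo:.4f}  reported {mz:.4f}  "
          f"error {ppm_error(mz, theo):+.1f} ppm")
```

Under these assumptions, the three reported values agree with the theoretical ones to within a few ppm, which is consistent with the proposed formulas but does not distinguish between structural isomers.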

Presentation of photochemically stable organic species
Some of the detected organic species appeared to be photochemically stable, as their relative intensities decreased only slightly (< 10 %) after 12 h of light irradiation. The m/z 161.0454 ion ([C6H9O5]−) presented two prominent peaks at RT 1.9 and 2.4 min (Fig. S10). The peak at RT 2.4 min was confirmed with a standard compound to be levoglucosan, a typical tracer of biomass burning aerosols with a high photochemical stability in atmospheric aerosols (Hu et al., 2013). Relatively good photochemical stability was also observed for some C6 homologue compounds, including a nitrogen-containing ion eluting at RT 17.9 min. The photochemical stability of some compounds may be ascribed to their low concentrations or to the light-shielding effect of other light-absorbing species.
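The degradation, formation, and stability trends discussed above all rest on the relative change in LC peak area between the non-irradiated and the 12 h irradiated extract. A minimal sketch of that bookkeeping follows. The peak areas are hypothetical placeholders rather than data from this study, the function names are illustrative, and treating any change smaller than 10 % in either direction as stable is an assumption that extends the "< 10 % decrease" criterion quoted in the text.

```python
def percent_change(area_0h, area_12h):
    """Relative change in LC peak area after irradiation, in percent."""
    return (area_12h - area_0h) / area_0h * 100.0

def classify(area_0h, area_12h, stable_threshold=10.0):
    """Label a peak as 'stable', 'degraded' or 'formed'.
    stable_threshold (%) is ASSUMED from the < 10 % criterion in the text."""
    change = percent_change(area_0h, area_12h)
    if abs(change) < stable_threshold:
        return "stable"
    return "degraded" if change < 0 else "formed"

# Hypothetical peak areas (arbitrary units) at 0 h and 12 h.
peaks = {
    "peak A": (1.00e6, 5.0e4),  # 95 % loss, like the m/z 123.0450 example
    "peak B": (2.00e5, 1.9e5),  # 5 % loss
    "peak C": (1.00e4, 8.0e4),  # eightfold increase
}
for name, (a0, a12) in peaks.items():
    print(name, f"{percent_change(a0, a12):+.0f} %", classify(a0, a12))
```

Because no standards were used for absolute quantification, such percentages describe changes in signal for a given ion and retention time only, not changes in concentration.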
Another intriguing finding was that structural isomers with the same molecular mass may follow different fates upon prolonged light irradiation of the samples. For example, for m/z 165.0405 ([C5H9O6]−), the intensity of the peak eluting at RT 4.9 min decreased with the irradiation time, whereas that of the peak at RT 1.8 min increased (Fig. S11). As for the CHO compounds, simultaneous degradation and formation among isomers was also observed for some CHON ion masses upon prolonged light irradiation. For example, m/z 108.0453, assigned to [C6H6ON]−, might carry hydroxy and amino groups on the phenyl ring, giving three possible isomers (Fig. S12). During photolytic processing, the intensity of the peak at RT 3.2 min increased dramatically, while the peaks at RT 5.5 and 12.5 min clearly decreased, which is suggestive of possible isomerisation among these isomers. Other ion masses that exhibited possible isomerisation included m/z 122.0610 (

Comparison of time-profile mass spectra of CHO composition in WSOC extracts from WSBA samples
Since the LC method separated only a fraction of the polar compounds, we tentatively used the changes in the HRMS mass spectra to gain more comprehensive information about the WSOC evolution. We compared the time-profile (0, 4, and 12 h) mass spectra with each other, assuming the same interference from inorganic species and relying on the good reproducibility and stability of the Orbitrap MS operated under the same instrumental parameters (RSD of the TIC intensity within 5 %). It is well known that ESI mass spectral abundances are influenced by the solution composition, the concentration of analytes, and instrumental factors (Bateman et al., 2011); hence, it is quite challenging to directly quantify the absolute concentrations in such complex mixtures. Despite this, the photochemical degradation of WSOC compounds and the corresponding formation of organic compounds can still be described by the variation in mass spectral signal intensity. The average O/C and H/C of the CHO compounds increased from 0.38 ± 0.02 to 0.44 ± 0.02 and from 1.24 ± 0.03 to 1.26 ± 0.01, respectively, as the irradiation time extended from 0 to 12 h. The comparison of these time-profile mass spectra indicates that the 12 h photolysis resulted in a significant reduction of 28 ± 11 % in the total ion abundance (S/N). Since the photolysis changed the abundance of most of the CHO compounds, we also calculated the intensity (S/N)-weighted average O/C (O/Cw) and H/C (H/Cw) (Bateman et al., 2011; Romonosky et al., 2015), which increased from 0.45 ± 0.03 to 0.53 ± 0.06 and from 1.32 ± 0.09 to 1.40 ± 0.11, respectively. After the 12 h photolysis, both the average H/C and H/Cw increased slightly compared to the samples prior to irradiation, whereas the average O/C and O/Cw increased more distinctly, indicating an elevated oxidation degree of the bulk extract composition. This phenomenon is partly reflected in the LC-HRMS observations, i.e. the formation of highly oxygenated species and the consumption of low oxygenated compounds. In our previous study, UV-vis measurements revealed that the 12 h photochemical evolution modified the absorptive properties of the WSBA extracts (e.g. photo-bleaching at wavelengths below 380 nm and photo-enhancement above 380 nm) (Cai et al., 2018), which might be partially linked to the present findings on molecular functionalisation, e.g. hydroxylation facilitating a red shift of the light-absorbing wavelengths.
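The difference between the arithmetic average O/C and the intensity-weighted O/Cw can be illustrated with a short Python sketch. This section does not spell out the weighting formula, so the sketch shows two conventions that are both found in the literature: an S/N-weighted mean of the per-formula ratios, and a ratio of S/N-weighted atom sums. Which one corresponds to the values reported here is an assumption left open. The peak list and the `Peak` class are hypothetical and are not data from this study.

```python
from dataclasses import dataclass

@dataclass
class Peak:
    c: int     # number of carbon atoms in the assigned formula
    h: int     # number of hydrogen atoms
    o: int     # number of oxygen atoms
    sn: float  # signal-to-noise ratio, used as the weight

# Hypothetical assigned peaks (formulas C7H8O2, C6H10O5, C2H2O4; S/N invented).
peaks = [Peak(7, 8, 2, 120.0), Peak(6, 10, 5, 300.0), Peak(2, 2, 4, 80.0)]

def average_oc(peaks):
    """Unweighted average of the per-formula O/C ratios."""
    return sum(p.o / p.c for p in peaks) / len(peaks)

def weighted_oc_mean_of_ratios(peaks):
    """Convention 1: sum(w_i * O_i/C_i) / sum(w_i)."""
    return sum(p.sn * p.o / p.c for p in peaks) / sum(p.sn for p in peaks)

def weighted_oc_ratio_of_sums(peaks):
    """Convention 2: sum(w_i * O_i) / sum(w_i * C_i)."""
    return sum(p.sn * p.o for p in peaks) / sum(p.sn * p.c for p in peaks)

print(f"average O/C           = {average_oc(peaks):.2f}")
print(f"O/Cw (mean of ratios) = {weighted_oc_mean_of_ratios(peaks):.2f}")
print(f"O/Cw (ratio of sums)  = {weighted_oc_ratio_of_sums(peaks):.2f}")
```

The same functions apply to H/C by swapping the oxygen count for the hydrogen count. Because abundant ions dominate either weighted value, a loss of intense low-oxygenated ions raises O/Cw even when the unweighted average moves little.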

Conclusions
This study focused on the effect of direct photolysis on the molecular composition of actual WSOC extracted from field straw-burning aerosol. The phenol dimer (m/z 185.0608) and guaiacol dimer (m/z 245.0823), or their isomers, generated from the laboratory aqueous-phase photooxidation of phenol and guaiacol, were also observed in the present field WSBA samples, suggesting that aqueous-phase reactions might contribute to the formation of emitted biomass burning aerosols. The laboratory observations on the aqueous photochemistry of phenols indicate that the phenolic compounds in real biomass burning aerosols likely have the potential to undergo a similar evolution and form various oxygenated compounds under atmospherically relevant aqueous conditions. The direct photolysis of the WSOC extracts from WSBA samples was performed to gain more insight into the evolution of aerosol composition. Because the extract composition was very complex, the techniques used in this study (ESI-HRMS and LC/ESI-HRMS), although advanced, still had limitations in monitoring the changes in molecular composition, especially in detecting the potential formation of compounds present at low concentrations or compounds that were poorly ionised. Nevertheless, a series of polar molecules that were transformed by photochemical ageing were identified. In particular, the degradation of low oxygenated compounds with strong photochemical reactivity and the formation of highly oxygenated compounds might directly result in an increasing O/C of the WSOC composition, which was likely linked to the modification of the light-absorbing characteristics of the extracts reported in our previous study. This finding indicates that the water-soluble organic fraction of field combustion-derived aerosols has the potential to form more oxidised organic matter, which might contribute to the highly oxygenated nature of atmospheric organic aerosols.
Further studies on the photochemical evolution of the WSOC composition will be performed, including extending the measurements to more compound species (e.g. by applying positive ESI-HRMS), identifying biomarkers, and evaluating their role in photochemical processes.
Data availability. The data used in this study are available from the corresponding author on request.
Author contributions. JC and ZY designed the experiments, and JC and XZ carried them out. GZ provided the straw-burning aerosol samples; ZY and SG helped to perform the analysis of light irradiation and to edit the paper. GS, XW, and PP provided some technical consultations about organic chemistry. JC prepared the paper with contributions from all co-authors.
Competing interests. The authors declare that they have no conflict of interest.