Volatile organic compounds and ozone air pollution in an oil production region in northern China

Oil and natural gas (O&NG) exploration presents a significant source of atmospheric volatile organic compounds (VOCs), which are central players of tropospheric chemistry and contribute to formations of ozone (O3) and secondary organic aerosols. The impacts of O&NG extraction on regional air quality have been investigated in recent years in North America, but have long been overlooked in China. To assess the impacts of O&NG exploration on tropospheric O3 and regional air quality in China, intensive field observations were conducted during February–March and June–July 2017 in the Yellow River delta, an oil extraction region in northern China. Very high concentrations of ambient VOCs were observed at a rural site, with the highest alkane mixing ratios reaching 2498 ppbv. High-O3 episodes were not encountered during wintertime but were frequently observed in summer. The emission profiles of VOCs from the oil fields were directly measured for the first time in China. The chemical budgets of ROx radicals and O3 were dissected with a detailed chemical box model constrained by in situ observations. The highly abundant VOCs facilitated strong atmospheric oxidation capacity and O3 formation in the region. Oxygenated VOCs (OVOCs) played an essential role in the ROx primary production, OH loss, and radical recycling. Photolysis of OVOCs, O3, and HONO as well as ozonolysis reactions of unsaturated VOCs were major primary sources of ROx . NOx was the limiting factor of radical recycling and O3 formation. This study underlines the important impacts of O&NG extraction on atmospheric chemistry and regional air quality in China.


Introduction
Oil and natural gas (O&NG) make up the largest share of global energy consumption and play an essential role in industry, the economy, and social development. By the end of 2017, O&NG consumption accounted for approximately 58 % of global primary energy consumption (British Petroleum Company plc., 2018). In recent years, with the breakthroughs in exploration and extraction technologies for tight oil and shale gas such as horizontal drilling and hydraulic fracturing (EIA, 2014), unconventional O&NG production has experienced explosive growth in the United States, resulting in an upward trend of O&NG production since the 1980s (EIA, 2018). Increases in O&NG production are also projected in the near future for other countries with abundant reservoirs of shale oil and gas (EIA, 2014). O&NG production emits large amounts of air pollutants to the atmosphere, causing air pollution problems of varying severity in the O&NG extraction region and its surrounding areas (Schnell et al., 2009; Edwards et al., 2013). The growth in O&NG production has raised increasing concerns about deteriorating air quality, public health, and climate in North America (Alvarez et al., 2012; McKenzie et al., 2012; Adgate et al., 2014; Colborn et al., 2014). Potential air pollutant emission sources during O&NG production include deliberate venting and flaring, fugitive emissions, diesel engines for power supply, and leakage from infrastructure and transport (Adgate et al., 2014). Such activities have been shown to increase volatile organic compounds (VOCs) and nitrogen oxides (NO x ) in the ambient air (Allen et al., 2013; Helmig et al., 2014; Warneke et al., 2014). Photochemical oxidation of VOCs in the presence of NO x produces ozone (O 3 ), a secondary pollutant with adverse effects on human health, vegetation, materials, and climate (National Research Council, 1992).
Several field campaigns have observed unusually high levels of wintertime O 3 in oil and gas field basins in the US, including the Uintah Basin (Edwards et al., 2013; Lee et al., 2014) and Upper Green River basin (Schnell et al., 2009; Carter and Seinfeld, 2012). Such high wintertime O 3 episodes occur under the combined action of specific meteorological conditions and chemical processes. The favorable meteorological conditions include a shallow boundary layer, calm winds, and increased photolysis flux induced by the snow deposition (Schnell et al., 2009; Carter and Seinfeld, 2012; Ahmadov et al., 2015). In terms of atmospheric chemistry processes, the accumulated high concentrations of VOCs lead to a significant increase in O 3 production efficiency, and radicals generated by photolysis of oxygenated VOCs (OVOCs) also play an important role (Edwards et al., 2013). In addition, O&NG production also affects O 3 formation and air quality during other seasons, especially in summer. Rodriguez et al. (2009) used a regional chemical transport model (CAMx) to assess the impacts of O&NG operation on O 3 pollution in the western US, and they found that including O&NG emissions enhanced the maximum daily 8 h average O 3 (MDA8 O 3 ) by up to 9.6 ppbv in southwestern Colorado and northwestern New Mexico. Using the same model, Kemball-Cook et al. (2010) indicated that emissions from the Haynesville Shale can explain up to 5 ppbv of MDA8 O 3 enhancement within northeast Texas and northwest Louisiana. Other works also found that O&NG extraction activities have important effects on regional O 3 levels in summertime (Olaguer, 2012; Rutter et al., 2015; Vinciguerra et al., 2015; McDuffie et al., 2016).
O&NG exploration is very active in China, with crude oil and natural gas production both ranking sixth in the world (EIA, 2017; Statista, 2018). China is also rich in shale resources, with the reserves of shale gas and shale oil ranking first and third in the world, respectively (EIA, 2014). It is expected that China's future O&NG exploration will further increase and may have increasingly important effects on atmospheric and environmental issues. Currently, O 3 pollution has become a major air quality concern in China, with monitored O 3 concentrations frequently exceeding the national ambient air quality standard in metropolitan areas nationwide (L. K. Wang et al., 2017). Available long-term observations also demonstrated significant upward trends in surface O 3 levels in the last 2 decades over China (Ding et al., 2008; Wang et al., 2009; Xu et al., 2008, 2018; Sun et al., 2016; Ma et al., 2016). A large number of studies have been dedicated to understanding the formation mechanisms of O 3 pollution and have identified the major sources of O 3 precursors (particularly VOCs) in China (e.g., Zhang et al., 2008; Yuan et al., 2012; Dang et al., 2015; Shao et al., 2016; Zhao et al., 2016; Wang et al., 2017). However, O&NG extraction has long been overlooked as an important source of VOCs, compared to other anthropogenic activities such as industry, power plants, transportation, and biomass burning. To the best of our knowledge, no report to date has assessed the impacts of O&NG exploration on VOCs and O 3 pollution in China.
To fill this gap, two intensive measurement campaigns were conducted at a rural site surrounded by open oil fields in the Yellow River delta (YelRD) region, an important oil extraction area in China, during February-March and June-July of 2017. A large suite of parameters including O 3 , CO, NO, NO 2 , NO y , SO 2 , HONO, C 1 -C 10 hydrocarbons, C 1 -C 8 carbonyls, aerosol properties, and meteorological parameters were measured in situ. Air samples were also collected from oil wells to characterize the source profiles of VOCs in the oil field. A detailed chemical box model was then constrained with the abovementioned in situ observations to dissect the chemistry of O 3 formation, atmospheric oxidation capacity, and radical budgets. Overall, this study provides some new insights into the emission characteristics of VOCs from oil fields and their effects on atmospheric oxidation processes and regional O 3 pollution in China.

Site description
We target the YelRD region for assessing the impacts of oil field emissions on VOCs and O 3 pollution. The YelRD is located to the south of the Bohai Sea and in the northern part of Shandong Province. It includes Dongying, Binzhou, and parts of Weifang, Dezhou, Zibo, and Yantai cities, with a total area of 26 500 km 2 and a population of 9.85 million (Fig. 1). It is abundant in natural resources and hosts the third largest oil field in China (i.e., Shengli oil field). Active O&NG exploration has made it one of China's largest petrochemical industry bases. In addition, the YelRD estuary is a typical estuarine wetland ecosystem and is rich in ecological resources. Furthermore, it is located at the junction of the Beijing-Tianjin-Hebei region and Shandong Peninsula, the most polluted regions in north China, approximately 300, 200, and 190 km from Beijing, Tianjin, and Jinan, respectively. Therefore, it may also suffer from regional transport of aged continental air masses from these metropolitan areas under the influence of winter monsoons. Two phases of field campaigns were carried out in winter-spring (from 9 February to 1 April) and summer (from 1 June to 10 July) 2017 at the YelRD Ecological Research Station of Coastal Wetland (37.75° N, 118.97° E; 1 m above sea level), Chinese Academy of Sciences. This site lies roughly 32 km to the northeast of the Dongying urban area and 10 km to the west of the Bohai Sea (Fig. 1). It is a typical rural site surrounded by open oil fields and without any other anthropogenic emission sources nearby. There are two intensive oil production areas near the site. One is mainly distributed in the coastal area (about 10 km to the northeast), while the other is in the urban area (about 30 km to the southwest). On the regional scale, the observation site is influenced by both aged continental air masses transported from the Beijing-Tianjin-Hebei region and marine air from the Bohai Sea. 
Details of the sampling site can be found elsewhere (Zhang et al., 2019). Source samples were also collected from the nearby oil and gas wells to obtain the source profiles of VOCs from the oil field.

Measurement techniques
All in situ measurement instruments were housed in a temperature-controlled container, and the sampling inlets were mounted on top of the container at a height of about 5 m above the ground. A large suite of chemical species and meteorological parameters were measured. Briefly, O 3 was monitored by an ultraviolet photometric analyzer (Thermo Environmental Instruments (TEI) model 49C). NO and NO y were measured by a chemiluminescence instrument (Advanced Pollution Instrumentation (API) model T200U) equipped with an externally placed molybdenum oxide (MoO) catalytic converter. NO 2 was observed with a cavity attenuated phase shift (CAPS) analyzer that is highly selective for true NO 2 (API, model T500U). SO 2 was observed using a pulsed ultraviolet fluorescence analyzer (TEI, model 43C). CO was detected using a gas filter correlation nondispersive infrared analyzer (API model 300U). These trace gas analyzers were calibrated manually every 3 d during the measurement campaigns, including zero and span checks as well as conversion efficiency calibration of the MoO catalytic converter, with additional zero calibration automatically done every 4 h for the CO instrument. The particle number size distributions between 5 and 350 nm were measured by a wide-range particle spectrometer (WPS, model 1000XP, MSP Corporation, USA), while those in the range of 300 nm to 10 µm were monitored by a handheld particle counter (model 9306, TSI, USA). PM 2.5 mass concentrations were measured using a synchronized hybrid ambient real-time particulate monitor (SHARP; Thermo Scientific model 5030). HONO was detected by a long-path absorption photometer (LOPAP; QUMA GmbH, Germany). Meteorological parameters including wind direction, wind speed, temperature, and relative humidity (RH) were continuously observed by a weather station (PC-3, Jinzhou Sunshine). 
Photolysis frequencies of H 2 O 2 , HCHO, HONO, O 3 , NO 3 , and NO 2 were observed by a CCD-detector spectrometer (Metcon GmbH, Germany). The time resolution was 1 min averaged for trace gases and photolysis frequency, 5 min averaged for meteorological parameters, and 30 min averaged for PM 2.5 .
Whole-air samples were collected with clean and evacuated 2 L stainless-steel canisters for quantification of methane and C 2 -C 10 non-methane hydrocarbons (NMHCs). The samples were mainly collected on sunny days (with a small part on cloudy days) during selected pollution episodes, with one 30 s sample taken every 2-3 h from 07:00 to 19:00 local time (LT) in June-July and from 06:00 to 21:00 LT in February-March. In addition, seven samples were taken at 00:00 LT during the winter-spring campaign. The purpose of this sampling strategy is to better characterize the VOC pollution in this area and to facilitate detailed modeling analysis of O 3 pollution events. Whole-air samples were also collected in the immediate surroundings of oil wells and petrochemical industrial areas using the same method. A total of 111 ambient samples (including 58 samples in winter-spring and 53 samples in summer 2017) as well as 21 source samples (including 18 oil field samples and 3 petrochemical plant samples) were taken in this study. After sampling, concentrations of methane and C 2 -C 10 NMHCs were quantified by gas chromatography (GC) separation followed by flame ionization detection (FID), mass spectrometry detection (MSD), and electron capture detection (ECD) at the laboratory of the University of California at Irvine (UCI) (Simpson et al., 2010; Xue et al., 2013). The detection limit is 0.01 ppmv for methane and 3 pptv for C 2 -C 10 NMHCs (Simpson et al., 2010). Note that O 3 scrubbers were not used ahead of the canisters during the sampling, and the sampled canisters were shipped to the UCI for analysis immediately after each field campaign. Some reactive VOC compounds (such as alkenes) may have partly decayed between sampling and laboratory analysis. Thus, one should keep in mind that the VOC observations in this study may be subject to some uncertainty and the reactive compounds may be underestimated to some extent.
Carbonyl samples were collected by adsorption of ambient air in a 2,4-dinitrophenylhydrazine-coated sorbent cartridge (Waters Sep-Pak DNPH-silica) at a flow rate of 0.5 L min −1 . An O 3 scrubber was attached to the front of the cartridge to avoid O 3 interference. The sampling strategy was similar to that of VOC canister samples. Specifically, the carbonyl samples were taken during selected episodes every 3 h from 06:00 to 21:00 LT in winter-spring and every 2 h from 07:00 to 19:00 LT in summer (the sampling time for each sample in winter-spring and summer was 3 and 2 h, respectively). A total of 128 ambient samples (including 58 samples in winter-spring and 70 samples in summer) and 10 source samples were taken at the rural site and in the oil fields, respectively. After the campaign, the samples were analyzed by high-performance liquid chromatography (HPLC) for quantification of 14 C 1 -C 8 carbonyl species.
All of the above measurement techniques have been successfully applied in many previous studies, and the detailed measurement principles, detection limits, quality assurance, and quality control procedures can be found elsewhere (Simpson et al., 2010; Yang et al., 2018; Li et al., 2018).

Observation-based chemical box model
The Observation-Based Model for investigating the Atmospheric Oxidation Capacity and Photochemistry (OBM-AOCP) was used to simulate the in situ atmospheric photochemical processes and to quantify the O 3 production rate, OH reactivity, and radical budgets (RO x : OH, HO 2 , and RO 2 ). This model has been successfully adopted in many previous studies (e.g., L. K. Xue et al., 2016;Yang et al., 2018;Li et al., 2018;Sun et al., 2018). In short, it is based on the latest version of the Master Chemical Mechanism (MCM v3.3.1), a nearly explicit mechanism describing the gas-phase chemical reactions that involve 143 primary VOC species (Saunders et al., 2003). In addition to the existing reactions in MCM v3.3.1, OBM-AOCP also incorporates over 200 reactions which represent the oxidation of VOCs by chlorine radical (Xue et al., 2015) and heterogeneous processes involving reactive nitrogen oxides (L. K. . Physical processes such as dry deposition and dilution mixing in the boundary layer are also taken into account, and details can be found elsewhere (L. K. . OBM-AOCP is able to simultaneously quantify the O 3 production rate, atmospheric oxidation capacity (AOC), OH reactivity, and the primary production, recycling, and termination rates of RO x radicals. It tracks and calculates the individual reaction rate of almost all the reactions in the MCM, including the free radical chemistry. Among them, the sum of oxidation rates of various pollutants (CO, VOCs, NO x , SO 2 , etc.) by the major oxidants (i.e., OH, O 3 , NO 3 , and Cl) is regarded as the AOC . The reaction rates of OH with CO, VOCs, NO x , SO 2 , HONO, HNO 3 , and HO 2 NO 2 are computed as the OH reactivity. Primary sources of OH, HO 2 , and RO 2 include the photolysis reactions of O 3 , HONO, formaldehyde, and other OVOCs as well as reactions of VOCs with O 3 and NO 3 radicals . Related reactions were grouped into a dozen major routes of production, recycling, and loss for quantifying the RO x chemical budget . 
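As a rough illustration of how an OH reactivity of the kind described above is assembled, the sketch below sums k_i × [X_i] over a handful of reactants. The mixing ratios are invented, the rate constants are approximate room-temperature values chosen only for illustration, and none of the names correspond to the OBM-AOCP code or to MCM v3.3.1 output.

```python
# Illustrative OH reactivity: k_OH = sum_i k_i * [X_i]
# All numbers below are assumptions for demonstration, not values from the study.

PPBV_TO_MOLEC_CM3 = 2.46e10  # approx. molecules cm^-3 per ppbv at 298 K and 1 atm

# species: (assumed mixing ratio in ppbv, approximate k_OH in cm^3 molecule^-1 s^-1)
species = {
    "CO": (300.0, 2.4e-13),
    "CH4": (2000.0, 6.4e-15),
    "propane": (10.0, 1.1e-12),
    "NO2": (5.0, 1.1e-11),
}


def oh_reactivity(inputs):
    """Return (per-species contributions, total) OH reactivity in s^-1."""
    parts = {
        name: k * ppbv * PPBV_TO_MOLEC_CM3
        for name, (ppbv, k) in inputs.items()
    }
    return parts, sum(parts.values())


parts, total = oh_reactivity(species)
for name, value in parts.items():
    print(f"{name}: {value:.2f} s^-1")
print(f"total: {total:.2f} s^-1")
```

In a box model the same sum runs over every OH reactant in the mechanism, with temperature-dependent rate constants rather than fixed ones.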
The O 3 chemical budget was also quantified by the model. O 3 production rate (P (O 3 )) was calculated as the sum of reaction rates for HO 2 + NO and RO 2 + NO reactions (Eq. 1), and O 3 loss rate (L(O 3 )) was computed as the sum of reaction rates for O 3 photolysis, O 3 +OH, O 3 +HO 2 , O 3 +VOCs, NO 2 +OH, NO 2 +RO 2 (minus the decomposition rate of organic nitrates), NO 3 +VOCs, and loss of N 2 O 5 (Eq. 2). The net O 3 production rate can be calculated as the difference between P (O 3 ) and L(O 3 ) (Eq. 3). Here, k i is the corresponding reaction constant. Details of the above chemistry calculation can be found elsewhere (L. K. Xue et al., 2016).
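Equations (1)-(3) themselves are not reproduced in this text. The block below is a schematic reconstruction from the verbal description only: the numbering of the rate constants k_i is an assumption, and terms that the text names without giving a rate expression (O 3 photolysis, organic nitrate decomposition, N 2 O 5 loss) are written as lumped rates R.

```latex
\begin{align}
P(\mathrm{O_3}) &= k_1[\mathrm{HO_2}][\mathrm{NO}]
  + \sum_i k_{2,i}[\mathrm{RO}_{2,i}][\mathrm{NO}] \tag{1} \\
L(\mathrm{O_3}) &= R_{\mathrm{O_3\,photolysis}}
  + k_3[\mathrm{O_3}][\mathrm{OH}]
  + k_4[\mathrm{O_3}][\mathrm{HO_2}]
  + \sum_i k_{5,i}[\mathrm{O_3}][\mathrm{VOC}_i]
  + k_6[\mathrm{NO_2}][\mathrm{OH}] \nonumber \\
 &\quad + \Bigl(\sum_i k_{7,i}[\mathrm{NO_2}][\mathrm{RO}_{2,i}]
  - R_{\mathrm{nitrate\,decomposition}}\Bigr)
  + \sum_i k_{8,i}[\mathrm{NO_3}][\mathrm{VOC}_i]
  + R_{\mathrm{N_2O_5\,loss}} \tag{2} \\
\mathrm{Net}\,P(\mathrm{O_3}) &= P(\mathrm{O_3}) - L(\mathrm{O_3}) \tag{3}
\end{align}
```

The exact form of each term should be taken from the cited reference (L. K. Xue et al., 2016).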
Measured data of O 3 , SO 2 , CO, NO, NO 2 , HONO, J values, temperature, and RH were averaged to a time resolution of 5 min to constrain the model. In addition, measured concentrations of CH 4 , C 2 -C 10 NMHCs, and C 1 -C 8 carbonyl compounds were interpolated to a time resolution of 30 min for model inputs. For the nighttime data, when direct observations were generally unavailable, CH 4 and C 2 -C 10 NMHC (except isoprene) concentrations were estimated from their linear regressions with CO, and concentrations of isoprene were estimated from the linear relationship with temperature. The nighttime OVOC data were estimated from multiple linear regressions with CO and O 3 . This approximation mainly serves to facilitate the pre-run of the model and should not affect the formal daytime modeling results. Unmeasured photolysis frequencies within the model were calculated as a function of the solar zenith angle (Saunders et al., 2003) and were then scaled with the measured J (NO 2 ). The model starts at 00:00 LT and pre-runs for 4 d under constraints of input data to stabilize the species which were not measured in the field campaign, and the daytime modeling results of the last day were subject to further analyses.
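Two of the input-preparation steps above lend themselves to a short sketch: filling nighttime hydrocarbon values from a linear regression against CO, and scaling a calculated photolysis frequency by measured J(NO 2 ). The function names are hypothetical, and scaling by the ratio of measured to calculated J(NO 2 ) is an assumed reading of "scaled with the measured J(NO 2 )", not a documented detail of the model.

```python
import numpy as np


def fill_from_co(co_obs, voc_obs, co_night):
    """Estimate nighttime VOC values from a linear fit of VOC against CO.

    co_obs, voc_obs: paired observations used to fit voc = slope * co + intercept.
    co_night: CO values at the times where VOC data are missing.
    """
    slope, intercept = np.polyfit(co_obs, voc_obs, 1)
    return slope * np.asarray(co_night, dtype=float) + intercept


def scale_j(j_calc, jno2_calc, jno2_meas):
    """Scale a calculated photolysis frequency by measured/calculated J(NO2).

    The ratio form is an assumption made for this sketch.
    """
    return j_calc * (jno2_meas / jno2_calc)


# Synthetic example: propane (ppbv) perfectly linear in CO (ppbv)
co_day = np.array([200.0, 300.0, 400.0])
propane_day = 0.01 * co_day + 1.0
propane_night = fill_from_co(co_day, propane_day, [500.0])
j_scaled = scale_j(2.0e-5, 8.0e-3, 6.0e-3)
print(propane_night, j_scaled)
```

Real data would scatter around the fit, so the regression quality would need checking for each species before relying on the filled values.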

Overview of O3 and VOC pollution
The overall air quality and meteorological conditions measured during the two-phase campaign are presented in Fig. 2. Descriptive statistics of major trace gases, aerosols, and meteorological parameters are summarized in Table 1. Seasonal variability of air pollution and weather is clearly illustrated. The winter and early spring (i.e., February-March) is featured by cold weather and higher levels of primary air pollutants. All the trace gases (except for O 3 ) and PM 2.5 showed significantly higher concentrations in February and March than in summer (June-July). This can be explained by the shallow boundary layer, less active photochemistry, and additional emissions from residential heating in winter-spring. In contrast, O 3 exhibited much higher levels in summer, mainly corresponding to the more intense photochemical formation as a result of the hot weather and strong solar radiation. Elevated O 3 concentrations were frequently observed during the summer campaign, with 22 non-attainment days (defined as the day when the maximum hourly O 3 concentration exceeds China's National Ambient Air Quality Standard, Grade II, 93 ppbv) throughout the 40 d measurement period. The maximum hourly O 3 value was recorded at 177 ppbv in summer. These observations demonstrate the severity of photochemical air pollution in the YelRD region. O 3 pollution was also encountered in early spring. In March, 2 O 3 non-attainment days were identified with a maximum hourly O 3 mixing ratio of 106 ppbv. When looking at the MDA8 O 3 , the number of non-attainment days (with MDA8 O 3 exceeding 75 ppbv) increased to 5 in March 2017. However, no O 3 episodes occurred in February. This is quite different from the recent observations in the US that have found very high levels of O 3 in winter in the oil basin (Schnell et al., 2009;Edwards et al., 2014). We examined the observed chemical environments and weather conditions in the YelRD region. 
As detailed below, there were abundant O 3 precursors, especially VOCs, in this study region, which could sustain substantial photochemical O 3 formation. The major difference between this study and the US efforts lies in the weather conditions. As proposed by Ahmadov et al. (2015), snow cover is a prerequisite for the occurrence of wintertime O 3 episodes in the US oil basins. During the wintertime observation period, the weather was quite dry and only a small amount of snow fell during the nighttime of 21 February. The snow cover was very thin and quickly disappeared as temperature increased under the influence of a subsequent high-pressure system. Furthermore, the YelRD region is usually affected by strong winds in winter (Fig. 2) due to its flat and coastal topography. Thus, the meteorological conditions encountered in the present study were unfavorable for the occurrence of winter O 3 episodes. Similarly, O 3 episodes were also not observed in the Uintah Basin in the snow-free winter of 2012. More observations are still needed to examine the wintertime O 3 issues in the oil extraction areas of China. Table 2 documents the statistics of individual VOC species observed in the present study. The ambient air in the YelRD region is very rich in VOCs, in particular alkanes, which accounted for the majority (i.e., 84.3 % for winter-spring and 70.6 % for summer) of the measured NMHCs. Extremely high levels of VOCs were frequently observed at the study site, although it is located in a remote coastal area. The maximum concentrations of total NMHCs were 2823 and 176 ppbv in winter-spring and summer, respectively. These samples were heavily affected by the gas leakage from the surrounding oil fields and will be discussed further in Sect. 4. In addition, elevated concentrations of light olefins such as ethene, propene, and butenes were also detected, especially during the winter and early spring when the photochemical oxidation was less active. 
This was mainly attributed to the emissions from refining industry in the YelRD region, which is well known as an important base for petrochemical industry in north China. A number of refining plants are indeed located to the southwest and north of the sampling site. Such a VOC-rich atmosphere is expected to efficiently facilitate O 3 production with a certain amount of NO x . Furthermore, similar to other primary pollutants, all of the VOC compounds (except for cyclopentane and isoprene) showed a typical seasonal variation with higher concentrations in winter-spring and lower levels in summer.
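The two non-attainment criteria used in this section (maximum hourly O 3 above 93 ppbv, MDA8 O 3 above 75 ppbv) can be sketched for a single day of hourly data as follows. The function names are hypothetical, and the choice of 8 h windows lying entirely within the calendar day is an assumption; window conventions differ between regulatory definitions.

```python
HOURLY_LIMIT_PPBV = 93.0  # hourly threshold quoted in the text
MDA8_LIMIT_PPBV = 75.0    # MDA8 threshold quoted in the text


def mda8(hourly):
    """Maximum daily 8 h running mean from 24 hourly values (ppbv).

    Uses the 17 windows that lie entirely within the day (an assumption).
    """
    if len(hourly) != 24:
        raise ValueError("expected 24 hourly values")
    return max(sum(hourly[i:i + 8]) / 8.0 for i in range(17))


def non_attainment(hourly):
    """Flag the day against the hourly and the MDA8 criteria separately."""
    return {
        "hourly": max(hourly) > HOURLY_LIMIT_PPBV,
        "mda8": mda8(hourly) > MDA8_LIMIT_PPBV,
    }


# Synthetic day: 8 h afternoon plateau at 100 ppbv on a 30 ppbv baseline
polluted_day = [30.0] * 10 + [100.0] * 8 + [30.0] * 6
clean_day = [50.0] * 24
print(mda8(polluted_day), non_attainment(polluted_day))
print(mda8(clean_day), non_attainment(clean_day))
```

Because the two criteria are evaluated independently, a day can exceed one but not the other, which is why the number of non-attainment days in March differs between the hourly and MDA8 counts.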
Figures 3-4 present the average diurnal variation patterns of major trace gases (including VOCs), PM 2.5 , and meteorological parameters during the two campaigns. All the pollutants showed well-defined diurnal profiles that can be explained by the evolution of the planetary boundary layer, local emissions, and atmospheric photochemistry. Specifically, O 3 showed a broad afternoon concentration peak with a trough in the early morning in both seasons. The primary pollutants (e.g., CO, SO 2 , and NO x ) and PM 2.5 exhibited higher concentrations in the morning and the lowest levels in the afternoon. VOCs generally showed higher levels during the nighttime or the early morning and lower mixing ratios during the day, with long-chain alkenes (comprising isoprene, 3-methyl-1-butene, 2-methyl-1-butene, alpha-pinene, and beta-pinene) as an exception that shows an opposite diurnal pattern in summer (Fig. 4). A noteworthy result is the fast accumulation of O 3 during the morning period. For example, the average increases in O 3 concentrations in the morning (06:00-12:00 LT) were 49.2 and 30.2 ppbv in summer and winter-spring, respectively. The early morning (i.e., 05:00-07:00 LT) O 3 increase may be attributed to the downward intrusion of O 3 -laden residual layer air (see Fig. S1 in the Supplement), while the rapid O 3 increase throughout the morning period suggests strong in situ photochemical formation in this VOC-rich area. This will be further quantified with the model in Sect. 6.

Emission profiles of VOCs from oil fields
To characterize the VOC emissions from the oil fields in China, 18 whole-air samples were taken in the immediate vicinity of the oil extraction machines in the open oil fields. The data can provide direct insights into the composition profile of VOCs from Chinese oil field emissions. The regional background of VOC species was calculated as the average of the lowest 10th percentile of measurement data at the study site and was subtracted from the oil field source data to derive the VOC emission profiles. Figure 5 shows the measured oil field emission profiles of VOCs in the YelRD region. Oil field emissions are clearly dominated by alkanes. On a concentration basis, light alkanes (C 2 -C 5 ), long-chain alkanes (C 6 -C 10 ), alkenes, and aromatics account for 83.7 %, 8.7 %, 3.1 %, and 2.9 % of the total measured NMHCs, respectively. The top 10 abundant species (in proportion) are propane (25.3 %), ethane (22.1 %), n-butane (13.6 %), i-butane (8.3 %), i-pentane (7.8 %), n-pentane (6.0 %), ethene (1.9 %), n-hexane (1.8 %), ethyne (1.6 %), and 2-methylpentane (1.3 %). Note that all the aforementioned calculations are based on the median VOC emission profile shown in Fig. 5. Since alkanes are major components of crude oil and natural gas, measured oil field emissions in this study are believed to be due to the leakage of oil and natural gas in this oil field region. To our knowledge, this is the first direct measurement of oil field VOC emission profiles in China, which is valuable for better understanding the emissions of O&NG production and can be used for future air quality modeling studies. Figure 6 compares the oil field emission profile in the YelRD region with those obtained from measurements adjacent to or surrounded by US oil fields. Overall, the measured VOC speciation patterns agree well with each other, although the absolute VOC concentrations vary case by case. 
For example, the VOC concentrations in the oil field in this study are generally higher than or comparable to those in the Fort Worth Basin, Denver-Julesburg Basin, and Upper Green River basin, but they are much lower than those measured in the Uintah Basin during wintertime O 3 episodes. Such differences are likely caused mainly by different atmospheric dilution conditions during the sampling campaigns. The extremely high VOC levels in the Uintah Basin can be ascribed to the strong inversion under unfavorable weather conditions (Neemann et al., 2015). There are also some differences in the detailed VOC speciation between the YelRD oil field emissions and those in the US. The fraction of C 2 -C 5 light alkanes in the YelRD oil fields was lower than those in the Uintah Basin (93.9 %), Fort Worth Basin (90.4 %), and Denver-Julesburg Basin (92.9 %) (ERG, 2011; Gilman et al., 2013; Koss et al., 2015). In comparison, the loadings of long-chain alkanes (8.7 %) and aromatics (2.9 %) were higher in the YelRD oil field than in the US oil basins (4.2 %-6.9 % for long-chain alkanes, < 1.6 % for aromatics). Such VOC speciation was attributed to the fact that oil extraction, rather than natural gas production, dominates in this study area.
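The background subtraction and composition calculation described in this section can be sketched as follows. The regional background is taken as the mean of the ambient values at or below the 10th percentile, following the text; the function names, the use of the median across source samples, and the floor at zero for negative enhancements are assumptions of this sketch, and the input numbers are synthetic.

```python
import numpy as np


def regional_background(ambient):
    """Mean of the ambient values at or below the 10th percentile."""
    ambient = np.asarray(ambient, dtype=float)
    cutoff = np.percentile(ambient, 10)
    return float(ambient[ambient <= cutoff].mean())


def emission_profile(source, ambient):
    """Percent composition of background-subtracted median source samples.

    source, ambient: dicts mapping species name to a sequence of mixing
    ratios (ppbv). Negative enhancements are floored at zero (assumption).
    """
    enhanced = {}
    for name, values in source.items():
        bg = regional_background(ambient[name])
        median_enh = float(np.median(np.asarray(values, dtype=float) - bg))
        enhanced[name] = max(median_enh, 0.0)
    total = sum(enhanced.values())
    return {name: 100.0 * value / total for name, value in enhanced.items()}


# Synthetic data: 20 ambient values per species, 3 source samples per species
ambient = {"propane": np.arange(1, 21), "ethane": np.arange(1, 21)}
source = {"propane": [11.5, 21.5, 31.5], "ethane": [6.5, 11.5, 16.5]}
profile = emission_profile(source, ambient)
print(profile)
```

With real canister data the same profile would be computed over all measured NMHC species, and the fractions for compound groups (light alkanes, long-chain alkanes, alkenes, aromatics) would follow by summing the species within each group.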
The box plot provides the 10th, 25th, 50th, 75th, and 90th percentiles of the source sample data, and the red dot gives the average of the data. Note that the regional background has been subtracted from the source data.
Figure 6. Comparison of the VOC composition of oil field samples (grey area) with three types of ambient samples in this study and in four US oil fields (ERG, 2011; Field et al., 2015; Gilman et al., 2013; Koss et al., 2015).
Figure 7. Scatter plot and regression lines of i-pentane versus n-pentane for the three types of ambient samples and oil field samples (grey: Type 1; red: Type 2; green: Type 3; blue: source; refer to the main text for the description of different types of data).
As mentioned above, the ambient air at the sampling site may be significantly influenced by the oil field emissions. To verify this, all the ambient VOC data were subjected to the Tukey test (Seo, 2006), and 11 samples were identified as "abnormal" samples. According to the VOC concentrations and speciation, the ambient VOC samples can be classified into three categories. Type 1 contains four abnormal samples, which have the highest concentrations for most species, especially alkanes, butenes, and aromatics (Fig. 6). Type 2 includes seven abnormal samples, which have almost the same chemical speciation and absolute concentrations (only with slightly lower levels of light alkanes) as the oil field emission profiles (Fig. 6). The remaining 100 "normal" samples are classified as Type 3. Compared with the oil field emission profile, they have similar chemical speciation but lower concentrations. In terms of the sampling time, Type 1 and Type 2 samples were mainly collected in the early morning or at midnight, whilst most of the Type 3 samples were taken during the daytime. Figure 7 shows the scatter plots of i-pentane versus n-pentane for the three identified ambient VOC types as well as the oil field source data. Because i-pentane is generally recognized as a tracer of gasoline, the ratio of i-pentane / n-pentane can be adopted to diagnose the potential impact of O&NG operations on the VOC measurements in the O&NG extraction region. As shown in Fig. 7, Type 2 (1.2) and Type 3 (1.3) samples have i-pentane / n-pentane ratios comparable to that of the oil field source data (1.0). Meanwhile, Type 1 samples have a much higher ratio of 4.5, which is similar to the signature of gasoline emissions (4.87) (Lu and Zhang, 2003). In view of the above analyses, we propose that Type 1 samples were affected by short-term leakage from the surrounding refinery and oil storage areas, Type 2 samples were heavily influenced by the O&NG extraction activities in the oil fields, and the normal Type 3 samples were also affected by the O&NG extraction in this region. This indicates that the VOC-rich environment in the YelRD region is mainly influenced by the O&NG extraction activities.
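The outlier screening described above can be illustrated with a minimal sketch of Tukey's fences. The exact variant of the test used in the study is not stated here, so the fence multiplier of 1.5, the inclusive quartile method, the function name `tukey_outliers`, and the sample values are all assumptions made for illustration.

```python
from statistics import quantiles


def tukey_outliers(values, k=1.5):
    """Return indices of values outside Tukey's fences [Q1 - k*IQR, Q3 + k*IQR].

    k = 1.5 is the conventional multiplier and is assumed here.
    """
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    return [i for i, v in enumerate(values) if v < lower or v > upper]


# Hypothetical total-alkane mixing ratios (ppbv) for ten ambient samples;
# these numbers are invented and are not measurements from the campaign.
alkanes_ppbv = [12.0, 15.5, 9.8, 20.1, 18.3, 14.2, 11.7, 16.9, 13.4, 410.0]
abnormal = tukey_outliers(alkanes_ppbv)
print("abnormal sample indices:", abnormal)
```

Samples flagged in this way would correspond to the "abnormal" group; separating them further into Type 1 and Type 2 relies on the speciation and the i-pentane / n-pentane ratio rather than on the fences alone.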

Atmospheric oxidation capacity and radical chemistry
In the following sections, we examine the detailed photochemical processes that occurred during the O3 pollution episodes. As O3 episodes were mainly encountered during the summer campaign, here we focus on the summertime pollution events (with the modeling results for winter-spring provided in the Supplement). Nine severe O3 episodes (i.e., 8, 9, 14, 15, 16, 18, 29, and 30 June and 9 July 2017) with maximum hourly O3 concentrations exceeding 100 ppbv and with concurrent comprehensive observation data were selected for chemical box modeling analyses. Detailed chemical budgets of ROx radicals and O3 were quantified by the OBM-AOCP. Simulation results for different cases were generally similar. Below we present the results averaged across all selected episodes. Figure 8 shows the average diurnal variations in OH and HO2 during the O3 episode days. High levels of HOx radicals were simulated by the model. The daily maxima of OH and HO2 concentrations were 4.7-7.0 × 10⁶ and 10.3-14.1 × 10⁸ molec. cm⁻³, with mean values of 5.9 × 10⁶ and 12.5 × 10⁸ molec. cm⁻³, respectively. Model-predicted concentrations of HOx radicals in the rural area of YelRD are higher than those at Heshan (a rural site in the Pearl River Delta, southern China) and Mace Head (a coastal site in Ireland) (Smith et al., 2006; Tan et al., 2019a). Comparable noontime maximum HOx concentrations were observed at a rural site in the North China Plain (NCP) region (Wangdu; Tan et al., 2017) and in some polluted urban areas, such as Tokyo and Houston (Kanaya et al., 2007; Mao et al., 2010). This demonstrates the strong potential of atmospheric oxidation in the YelRD region. A noteworthy result is that the OH concentration peaks in the morning (at around 10:00 LT), in contrast to the more common noontime OH peak that coincides with the most intense solar radiation (Rohrer and Berresheim, 2006). To a large extent, the diurnal pattern of OH follows that of NO (see Fig. 3), suggesting the important role of NO in OH chemistry at the sampling site. Considering the VOC-rich conditions and relatively low levels of NOx (e.g., observed average concentrations of NO are 0.43 and 0.23 ppb during 09:00-12:00 and 12:00-16:00 LT, respectively), efficient radical propagation of OH → RO2 → HO2 is expected and the abundance of NO should be the limiting factor in the recycling of HO2 to OH. The high HO2/OH ratio (~257) in this study also indicates that the HO2 + NO → NO2 + OH reaction is the rate-determining step of the radical recycling. A similar phenomenon was also found at Backgarden (a VOC-saturated and NOx-limited environment) in the Pearl River Delta (PRD) region.
The strong atmospheric oxidation capacity (AOC, defined as the sum of the oxidation rates of all reduced substances by the major oxidants) was confirmed by the model calculation and is shown in Fig. 9. The daily maxima and daily mean values of AOC during the selected episodes were in the range of 0.7-1.8 × 10⁸ and 2.6-4.8 × 10⁷ molec. cm⁻³ s⁻¹, respectively. AOC levels in the YelRD region are comparable to those obtained in some urban areas (Elshorbany et al., 2009; Xue et al., 2016) but are higher than those derived from rural areas (Geyer et al., 2001; Li et al., 2018). As expected, OH is the predominant oxidant during the daytime, accounting for 85.3 ± 16.4 % of AOC. NO3 is the major oxidant at nighttime (18:00-06:00 LT), contributing 46.8 ± 17.1 % of nocturnal AOC, followed by O3 (27.0 ± 7.9 %) and OH (26.2 ± 17.8 %). Figure 10 elucidates the 24 h evolution and partitioning of the chemical loss of the OH radical (also known as the OH reactivity or K_OH). K_OH in this study (23.3 ± 5.6 s⁻¹) is significantly higher than those determined at some rural sites such as Hok Tsui (9.2 ± 3.7 s⁻¹; Li et al., 2018), Nashville (11.3 ± 4.8 s⁻¹; Martinez et al., 2003), and Whiteface Mountain (5.6 s⁻¹; Ren et al., 2006a) and is comparable to those measured in some polluted areas like Beijing (10-30 s⁻¹; Lu et al., 2013; Williams et al., 2016; Yang et al., 2017) and Guangzhou (20-50 s⁻¹; Lou et al., 2010). OVOCs (including the measured carbonyls and model-simulated OVOCs) were the dominant contributor (69.1 ± 7.2 %) to K_OH. CO, NOx, alkenes, alkanes, and aromatics are the other important reactants, explaining 13.2 ± 2.5 %, 5.6 ± 4.1 %, 4.4 ± 1.5 %, 3.6 ± 1.2 %, and 1.6 ± 0.5 % of K_OH, respectively. The relatively high fraction of alkanes is probably due to the highly abundant alkanes in the YelRD region as a result of influences from the oil field emissions.
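As a minimal sketch of how an OH reactivity and its partitioning can be computed, the example below sums the products of rate constants and reactant concentrations for a few species. The rate constants are approximate room-temperature values assumed for illustration, the mixing ratios are hypothetical, and the function names are not from the OBM-AOCP; none of these inputs are taken from the study.

```python
BOLTZMANN = 1.380649e-23  # J K^-1


def ppbv_to_number_density(ppbv, temp_k=298.15, pressure_pa=101325.0):
    """Convert a mixing ratio in ppbv to molec. cm^-3 using the ideal gas law."""
    air_per_cm3 = pressure_pa / (BOLTZMANN * temp_k) * 1e-6
    return ppbv * 1e-9 * air_per_cm3


def oh_reactivity(mixing_ratios_ppbv, rate_constants):
    """Return the OH loss frequency (s^-1) per species: k_i * [X_i]."""
    return {
        name: rate_constants[name] * ppbv_to_number_density(ppbv)
        for name, ppbv in mixing_ratios_ppbv.items()
    }


# Assumed approximate rate constants with OH (cm^3 molec.^-1 s^-1) and
# hypothetical mixing ratios (ppbv); illustrative only.
k_oh = {"CO": 2.4e-13, "propane": 1.1e-12, "HCHO": 8.5e-12}
ambient = {"CO": 200.0, "propane": 30.0, "HCHO": 5.0}

terms = oh_reactivity(ambient, k_oh)
total = sum(terms.values())
shares = {name: value / total for name, value in terms.items()}
for name in terms:
    print(f"{name}: {terms[name]:.2f} s^-1 ({100 * shares[name]:.1f} %)")
```

The same bookkeeping, extended to all measured and simulated reactants, gives the kind of partitioning by reactant class reported above.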
Figure 11 presents the major primary sources of OH, HO2, and RO2 radicals quantified in the YelRD region, and the detailed ROx radical budget is summarized in Fig. 12. Photolysis of OVOCs is identified as the dominant primary ROx radical source, with daytime (06:00-18:00 LT) average production rates of 2.15 ± 1.40 ppbv h⁻¹ for HO2 (of which 1.10 ± 0.79 ppbv h⁻¹ is from formaldehyde alone) and 0.86 ± 0.53 ppbv h⁻¹ for RO2. O3 photolysis is the second largest source of ROx and the predominant primary source of OH (1.22 ± 1.10 ppbv h⁻¹). HONO photolysis is the third largest source and supplies OH at an average rate of 0.49 ± 0.48 ppbv h⁻¹ during the daytime. The contribution of HONO photolysis is higher than that of O3 photolysis in the early morning (e.g., before 09:00 LT) but then becomes significantly lower with the decrease in HONO concentrations and the photochemical formation of O3. Note that the model was constrained by the observed HONO data. Ozonolysis reactions of unsaturated VOCs are also important radical sources, accounting for 0.26 ± 0.11, 0.17 ± 0.07, and 0.14 ± 0.07 ppbv h⁻¹ of OH, HO2, and RO2, respectively, on a daytime average basis. NO3 + VOC reactions are only a minor radical source (for RO2 only). The above analysis illustrates the significant role of OVOCs (both primary carbonyls and secondary compounds formed from oxidation of abundant VOCs) in the primary production of radicals and thus the initiation of atmospheric oxidation processes. The dominance of OVOC photolysis in the atmospheric photochemistry was also found during the wintertime O3 episodes in the Uintah Basin. In comparison, a recent study illustrated the importance of HONO and formaldehyde photolysis in four polluted Chinese megacities (Beijing, Shanghai, Guangzhou, and Chongqing), which accounted for ~50 % of the total primary ROx source (Tan et al., 2019b).
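To make the relative magnitudes easier to see, the daytime mean rates quoted above can be tallied per radical. This is only back-of-the-envelope arithmetic on the central values: it ignores the quoted standard deviations and the minor NO3 + VOC term, and the dictionary layout and function name are illustrative assumptions.

```python
# Daytime-average primary production rates quoted in the text (ppbv h^-1).
primary_sources = {
    "OVOC photolysis": {"HO2": 2.15, "RO2": 0.86},
    "O3 photolysis": {"OH": 1.22},
    "HONO photolysis": {"OH": 0.49},
    "O3 + unsaturated VOCs": {"OH": 0.26, "HO2": 0.17, "RO2": 0.14},
}


def totals_by_radical(sources):
    """Sum the production rates of each radical over all source pathways."""
    totals = {}
    for rates in sources.values():
        for radical, rate in rates.items():
            totals[radical] = totals.get(radical, 0.0) + rate
    return totals


by_radical = totals_by_radical(primary_sources)
total_rox = sum(by_radical.values())
ovoc_share = sum(primary_sources["OVOC photolysis"].values()) / total_rox

for radical, rate in by_radical.items():
    print(f"{radical}: {rate:.2f} ppbv h^-1")
print(f"total primary ROx: {total_rox:.2f} ppbv h^-1")
print(f"OVOC photolysis share: {100 * ovoc_share:.0f} %")
```

On these central values, OVOC photolysis supplies somewhat more than half of the listed primary ROx production, which is consistent with its description as the dominant source.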
As shown in Fig. 12, the radical recycling processes were generally efficient and approximately 4-6 times faster than the primary radical production. This is ascribed to the high abundances of VOCs in the study region, despite the restriction from the relatively low NOx concentrations. In terms of radical termination, the cross reactions of radicals such as HO2 + HO2 and HO2 + RO2 were the most important processes, with daytime average contributions of 0.55 ± 0.48 and 1.12 ± 0.94 ppbv h⁻¹, respectively. In comparison, the reactions of ROx with NOx (i.e., OH + NO2 and RO2 + NO) contributed 1.19 ± 1.62 ppbv h⁻¹ to the radical sink. Such results are not surprising given the VOC-rich and low-NOx chemical environment at our study site. This is quite different from the results for polluted urban areas, where the ROx + NOx reactions generally dominate the radical termination processes (Tan et al., 2019b). Overall, the radical budget analysis elucidates the strong atmospheric oxidation capacity, the importance of OVOCs, and the limiting role of NOx in the VOC-rich atmosphere of the YelRD region.
Figure 11. Simulated average primary production rates of (a) OH, (b) HO2, and (c) RO2 during the summertime O3 pollution episodes. The error bars indicate the standard deviations of the mean.
We also examined the atmospheric oxidation capacity, ROx radical budget, and O3 formation for eight winter-spring cases, and the modeling results are documented in Figs. S2-S7. Note that few O3 episodes were encountered during the winter-spring campaign, and the cases were selected mainly because of the availability of multiple NMHC and carbonyl sampling data. The daily maximum hourly O3 concentrations during these cases were in the range of 40-98 ppbv. Several aspects are noteworthy from the modeling results for winter-spring. First, the model-simulated HOx levels, AOC, ROx production and propagation rates, and O3 formation rate were much lower than those determined for the summertime episodes. This is expected given the weaker solar radiation and less active photochemistry in winter-spring than in summer. Second, OH showed a normal noontime concentration peak in winter-spring, which is different from the morning peak (~10:00 LT) found in summer (see Figs. 8 and S2). This is ascribed to the higher levels of NOx at the study site in winter-spring (Fig. 3), which were high enough to maintain the radical recycling from HO2 to OH. Third, the partitioning of the primary ROx sources was generally similar in both seasons, despite the relatively lower contributions from the O3-involved sources (i.e., O3 photolysis and O3 + VOC reactions) in winter-spring. Photolysis of OVOCs other than formaldehyde was the dominant primary ROx source, followed by HONO and formaldehyde photolysis. Fourth, the radical termination processes were different between winter-spring and summer. The dominant radical sinks in winter-spring were the reactions between NOx and ROx, as a result of the relatively abundant ambient NOx.

Ozone formation mechanism
We also examined the ozone formation mechanisms for the summertime episode days. Figure 13 shows the average detailed O3 chemical budgets during the nine cases. Strong photochemical formation of O3 was clearly illustrated, with daily maximum net O3 production rates of 14.5-38.7 ppbv h⁻¹ and daytime average rates (06:00-18:00 LT) of 9.8-19.6 ppbv h⁻¹. The O3 production intensity in the rural area of the YelRD is higher than that derived from a rural site downwind of Beijing (Changping) and comparable to that in polluted suburban areas downwind of Shanghai and Lanzhou. Interestingly, the O3 production rate reaches its maximum in the morning (at around 10:00 LT) and then decreases significantly in the afternoon, which differs from the noontime or afternoon peaks generally reported in previous studies. This pattern is similar to that of OH and NO (Figs. 3 and 8) and should be due to the lower concentrations of NO in the afternoon. In the VOC-rich YelRD region, a certain amount of NO in the morning is enough to sustain efficient O3 production. In the afternoon, NOx has been photochemically consumed due to its short lifetime and thus becomes the limiting factor in O3 formation (note that the O3 production rate is defined as the sum of the reaction rates of HO2 + NO and RO2 + NO). This also explains the observed unusual diurnal variation in O3 (Fig. 3), with a significant increase during the morning period and constant or reduced levels in the afternoon.
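The dependence of the O3 production rate on NO can be seen from a minimal sketch of the definition given above (the summed rates of HO2 + NO and RO2 + NO). The rate constant for HO2 + NO is an approximate 298 K value assumed for illustration, the air number density assumes 298 K and 1 atm, and the function names are hypothetical; the HO2 and NO inputs reuse the mean daily-maximum HO2 and the 09:00-12:00 LT mean NO quoted earlier purely as an order-of-magnitude example.

```python
AIR_NUMBER_DENSITY = 2.46e19  # molec. cm^-3, assumed for 298 K and 1 atm


def gross_o3_production(no, ho2, k_ho2_no, ro2_terms=()):
    """Gross O3 production rate in molec. cm^-3 s^-1.

    no, ho2   : number densities (molec. cm^-3)
    k_ho2_no  : rate constant of HO2 + NO (cm^3 molec.^-1 s^-1)
    ro2_terms : iterable of (rate constant, RO2 number density) pairs
    """
    rate = k_ho2_no * ho2 * no
    rate += sum(k * ro2 * no for k, ro2 in ro2_terms)
    return rate


def to_ppbv_per_hour(rate):
    """Convert molec. cm^-3 s^-1 to ppbv h^-1."""
    return rate / AIR_NUMBER_DENSITY * 1e9 * 3600.0


K_HO2_NO = 8.5e-12                    # assumed approximate value at 298 K
no_morning = 0.43e-9 * AIR_NUMBER_DENSITY   # 0.43 ppbv of NO
ho2_peak = 12.5e8                     # molec. cm^-3

p_morning = to_ppbv_per_hour(gross_o3_production(no_morning, ho2_peak, K_HO2_NO))
p_half_no = to_ppbv_per_hour(gross_o3_production(0.5 * no_morning, ho2_peak, K_HO2_NO))
print(f"HO2 + NO term: {p_morning:.1f} ppbv h^-1; with half the NO: {p_half_no:.1f}")
```

With these illustrative inputs the HO2 + NO term alone is on the order of 16 ppbv h⁻¹, and it scales linearly with NO at fixed radical levels, which is why the afternoon drop in NO translates directly into slower O3 production.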
The relationships between O3 and its precursors were further diagnosed by the relative incremental reactivity (RIR) calculation using the OBM-AOCP model. RIR is defined as the ratio of the relative change in the O3 production rate to the relative change in precursor concentrations, and it can be used as an indicator for assessing the effect of precursor reduction on O3 formation (Cardelino and Chameides, 1995). A number of sensitivity modeling runs were conducted for individual episode days with a 20 % reduction in the input concentrations of each target O3 precursor group. As presented in Fig. 14, simulation results for most cases are similar. O3 production was most sensitive to NOx concentrations, as indicated by the highest positive RIR values. This is expected, as the aforementioned analyses suggest the limiting role of NOx in radical recycling and O3 production. Alkenes, especially long-chain alkenes, showed moderate positive RIR values, indicating that they also controlled O3 formation to some extent. Alkanes and aromatics, which are usually highly abundant owing to the extensive oil extraction in the YelRD region, showed minor RIR values and were not the limiting factors for O3 formation. Overall, reducing NOx emissions would be the most effective strategy for mitigating photochemical air pollution in the YelRD region.
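The RIR calculation can be sketched as follows, assuming that it is evaluated as the fractional change in the O3 production rate divided by the fractional reduction (20 %) of one precursor group. The function `toy_production` is a hypothetical linear stand-in for the box model, with coefficients and inputs invented for illustration; it does not represent the OBM-AOCP chemistry.

```python
def relative_incremental_reactivity(production, inputs, target, reduction=0.20):
    """RIR of one precursor group.

    production : callable mapping a dict of precursor levels to an O3
                 production rate (a stand-in for a box-model run)
    inputs     : dict of base-case precursor levels
    target     : key of the precursor group to reduce
    reduction  : fractional reduction applied to the target (20 % here)
    """
    base = production(inputs)
    perturbed = dict(inputs)
    perturbed[target] = inputs[target] * (1.0 - reduction)
    return ((base - production(perturbed)) / base) / reduction


def toy_production(levels):
    # Hypothetical response surface: strongly NOx-dependent, weakly
    # dependent on alkenes, and almost insensitive to alkanes.
    return 3.0 * levels["NOx"] + 1.0 * levels["alkenes"] + 0.01 * levels["alkanes"]


base_case = {"NOx": 1.0, "alkenes": 1.0, "alkanes": 20.0}
rir = {
    group: relative_incremental_reactivity(toy_production, base_case, group)
    for group in base_case
}
for group, value in rir.items():
    print(f"RIR({group}) = {value:.2f}")
```

A large positive RIR for a group means that cutting it lowers O3 production efficiently; in this toy case the ordering NOx > alkenes > alkanes is built in by construction and merely mimics the qualitative pattern described above.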
Nonetheless, the oil field emissions of VOCs may have high potential to affect the regional air quality in the polluted YelRD and even the surrounding NCP regions, where ambient NOx is usually abundant. The oil-field-emitted VOCs may significantly contribute to the formation of O3 and secondary organic aerosol on a regional scale. To address this issue, an oil field emission inventory of VOCs and NOx as well as three-dimensional chemical transport model simulations are needed. So far, oil field emissions have not been included in the emission inventories in China. More efforts are urgently needed to develop an accurate oil field emission inventory and to evaluate the impacts of these emissions on regional air quality and climate.

Conclusions
We combined intensive field observations with chemical box modeling to understand the characteristics of VOC emissions from oil fields and their impacts on atmospheric chemistry and O3 pollution in the YelRD region, northern China. Influenced by the O&NG extraction and petrochemical industry, this area is characterized by a VOC-rich atmosphere with extremely high levels of alkanes. O3 pollution episodes occurred frequently in summertime. Meanwhile, no events were encountered in winter-spring because of the unfavorable weather conditions for O3 formation. The VOC chemical speciation of the oil field emissions was measured for the first time in China in this study. Driven by the high abundances of VOCs on a regional scale, strong atmospheric oxidation capacity and intense O3 formation were confirmed by observation-based modeling analyses. OVOCs played a dominant role in OH reactivity and hence radical recycling and were the major primary source of ROx radicals. Photolysis of O3 and HONO was also found to be an important radical source. The radical termination processes were governed by radical cross reactions under the high-VOC and low-NOx conditions. RIR analysis indicated that O3 formation was mainly in a NOx-controlled regime, and reducing NOx emissions would be an effective way to control O3 pollution in the YelRD region. In summary, this study emphasizes the key role of O&NG extraction in the photochemical air pollution and regional atmospheric chemistry in the oil extraction regions of China, and the results are helpful for formulating anti-pollution strategies in the YelRD and other similar oil-extracting regions.
Data availability. The data that support the results are available from the corresponding author upon request.
Author contributions. LX designed the study. TC, PZ, YL, JS, and HYL conducted the field campaigns. GH provided logistics for the field campaigns. HL, XZ, and YL analyzed the OVOC samples. TC analyzed the measurement data. TC and YZ conducted the chemical