Background heterogeneity and other uncertainties in estimating urban methane flux: results from the Indianapolis Flux Experiment (INFLUX)

As natural gas extraction and use continues to increase, the need to quantify emissions of methane (CH4), a powerful greenhouse gas, has grown. Large discrepancies in Indianapolis CH4 emissions have been observed when comparing inventory, aircraft mass balance, and tower inverse modeling estimates. Four years of continuous CH4 mole fraction observations from a network of nine towers as a part of the Indianapolis Flux Experiment (INFLUX) are utilized to investigate four possible reasons for the abovementioned inconsistencies: (1) differences in definition of the city domain, (2) a highly temporally variable and spatially non-uniform CH4 background, (3) temporal variability in CH4 emissions, and (4) CH4 sources that are not accounted for in the inventory. Reducing the Indianapolis urban domain size to be consistent with the inventory domain size decreases the CH4 emission estimation of the inverse modeling methodology by about 35 %, thereby lessening the discrepancy and bringing total city flux within the error range of one of the two inventories. Nevertheless, the inverse modeling estimate still remains about 91 % higher than inventory estimates. Hourly urban background CH4 mole fractions are shown to be spatially heterogeneous and temporally variable. Variability in background mole fractions observed at any given moment and a single location could be up to about 50 ppb depending on a wind direction but decreases substantially when averaged over multiple days. Statistically significant, longterm biases in background mole fractions of 2–5 ppb are found from single-point observations for most wind directions. Boundary layer budget estimates suggest that Indianapolis CH4 emissions did not change significantly when comparing 2014 to 2016. However, it appears that CH4 emissions may follow a diurnal cycle, with daytime emissions (12:00–16:00 LST) approximately twice as large as nighttime emissions (20:00–05:00 LST). We found no evidence for large CH4 point sources that are otherwise missing from the inventories. The data from the towers confirm that the strongest CH4 source in Indianapolis is South Side landfill. Leaks from the natural gas distribution system that were detected with the tower network appeared localized and nonpermanent. Our simple atmospheric budget analyses estimate the magnitude of the diffuse natural gas source to be 70 % higher than inventory estimates, but more comprehensive analyses are needed. Long-term averaging, spatially extensive upwind mole fraction observations, mesoscale atmoPublished by Copernicus Publications on behalf of the European Geosciences Union. 4546 N. V. Balashov et al.: Background heterogeneity and other uncertainties spheric modeling of the regional emissions environment, and careful treatment of the times of day are recommended for precise and accurate quantification of urban CH4 emissions.

Abstract. As natural gas extraction and use continues to increase, the need to quantify emissions of methane (CH 4 ), a powerful greenhouse gas, has grown. Large discrepancies in Indianapolis CH 4 emissions have been observed when comparing inventory, aircraft mass balance, and tower inverse modeling estimates. Four years of continuous CH 4 mole fraction observations from a network of nine towers as a part of the Indianapolis Flux Experiment (INFLUX) are utilized to investigate four possible reasons for the abovementioned inconsistencies: (1) differences in definition of the city domain, (2) a highly temporally variable and spatially non-uniform CH 4 background, (3) temporal variability in CH 4 emissions, and (4) CH 4 sources that are not accounted for in the inventory. Reducing the Indianapolis urban domain size to be consistent with the inventory domain size decreases the CH 4 emission estimation of the inverse modeling methodology by about 35 %, thereby lessening the discrepancy and bringing total city flux within the error range of one of the two inventories. Nevertheless, the inverse modeling estimate still remains about 91 % higher than inventory estimates. Hourly urban background CH 4 mole fractions are shown to be spatially heterogeneous and temporally variable. Variability in background mole fractions observed at any given moment and a single location could be up to about 50 ppb depending on a wind direction but decreases substantially when averaged over multiple days. Statistically significant, longterm biases in background mole fractions of 2-5 ppb are found from single-point observations for most wind directions. Boundary layer budget estimates suggest that Indianapolis CH 4 emissions did not change significantly when comparing 2014 to 2016. However, it appears that CH 4 emissions may follow a diurnal cycle, with daytime emissions (12:00-16:00 LST) approximately twice as large as nighttime emissions (20:00-05:00 LST). We found no evidence for large CH 4 point sources that are otherwise missing from the inventories. The data from the towers confirm that the strongest CH 4 source in Indianapolis is South Side landfill. Leaks from the natural gas distribution system that were detected with the tower network appeared localized and nonpermanent. Our simple atmospheric budget analyses estimate the magnitude of the diffuse natural gas source to be 70 % higher than inventory estimates, but more comprehensive analyses are needed. Long-term averaging, spatially extensive upwind mole fraction observations, mesoscale atmo-

Introduction
From the beginning of the Industrial Revolution to 2011, atmospheric methane (CH 4 ) mole fractions increased by a factor of 2.5 due to anthropogenic processes such as fossil fuel production, waste management, and agricultural activities (Ciais et al., 2013;Hmiel et al., 2020). The increase in CH 4 is a concern as it is a potent greenhouse gas (GHG) with a global warming potential 28-34 times greater than that of CO 2 over a period of 100 years (Myhre et al., 2013). The magnitudes of component CH 4 sources responsible for the recent increase in the global CH 4 budget are not well understood, with theories attributing these changes to biogenic, thermogenic, and pyrogenic emissions or a decline in the atmospheric CH 4 sink (Nisbet et al., 2016(Nisbet et al., , 2019Saunois et al., 2016;Hmiel et al., 2020). Improved understanding of CH 4 emissions is needed (National Academies of Sciences and Medicine, 2018).
In particular, the estimates of continental US anthropogenic CH 4 emissions disagree. Inventories from the Environment Protection Agency (EPA) and Emissions Database for Global Atmospheric Research (EDGAR) in 2008 reported emission values of 19.6 and 22.1 Tg C yr −1 (U.S. EPA, 2013;European Commission Joint Research Centre and Netherlands Environmental Assessment Agency, 2010). However, top-down methodologies using aircraft and inverse modeling frameworks found emission values of 32.4 ± 4.5 Tg C yr −1 for 2004 and 33.4 ± 1.4 Tg C yr −1 for 2007-2008 respectively (Kort et al., 2008;Miller et al., 2013). Underestimation of natural gas (NG) production and agricultural sources are possible reasons for this disagreement (Miller et al., 2013;Brandt et al., 2014;Jeong et al., 2014). Efforts to reconcile GHG emissions estimates using atmospheric methods and inventory assessment have sometimes succeeded (Schuh et al., 2013;Zavala-Araiza et al., 2015;Turnbull et al., 2019) when careful attention is given to the details of each method, and targeted atmospheric data are available. A recent synthesis of emissions from the US NG supply chain demonstrated similar success and concluded that current inventory estimates of emissions from US NG production are too low and that emission from NG distribution is one of the greatest remaining sources of uncertainty in the NG supply chain (Alvarez et al., 2018).
Due to the uncertainties in CH 4 emissions from NG distribution it is natural that urban emissions are of interest as well. For example, two studies (McKain et al., 2015;Hendrick et al., 2016) indicate that ∼ 60 %-100 % of Boston CH 4 emissions are attributable to the NG distribution system. Recent studies of urban CH 4 emissions in California indicate that the California Air Resources Board (CARB) inventory tends to underestimate the actual CH 4 urban fluxes, possibly due to fugitive emissions from NG infrastructures in urban environments (Wunch et al., 2009;Jeong et al., 2016Jeong et al., , 2017. The accuracy and precision of atmospheric estimates of urban CH 4 emissions are limited by available atmospheric observations (Townsend-Small et al., 2012), potential source magnitude variability with time (Jackson et al., 2014;Lamb et al., 2016), errors in atmospheric transport modeling (Hendrick et al., 2016;Deng et al., 2017;Sarmiento et al., 2017), and complexity in atmospheric background conditions (Cambaliza et al., 2014;Karion et al., 2015;Heimburger et al., 2017). In this work, detailed analysis of urban CH 4 mole fractions is performed for the city of Indianapolis to better understand the aforementioned uncertainties of urban CH 4 emissions.
The Indianapolis Flux Experiment (INFLUX; Davis et al., 2017) is a testbed for improving the quantification of urban GHGs emissions and their variability in space and time. INFLUX (http://influx.psu.edu, last access: 8 April 2020) is located in Indianapolis partly because of its isolation from other urban centers and the flat Midwestern terrain. It includes a very dense GHG monitoring network, comprised of irregular in situ aircraft measurements (Heimburger et al., 2017;Cambaliza et al., 2014), continuous in situ observations from communications towers using cavity ring-down spectroscopy Miles et al., 2017a), and automated flask sampling systems for the quantification of a wide variety of trace gases (Turnbull et al., 2015). Meteorological sensors include a Doppler lidar providing continuous boundary layer depth and wind profiles and tower-based eddy covariance measurements of the fluxes of momentum and sensible and latent heat . The network is designed for emissions quantification using top-down methods such as tower-based inverse modeling  and aircraft mass balance estimates . Lamb et al. (2016) compared Indianapolis CH 4 emissions estimates from a variety of approaches, specifically inventory, aircraft mass balances, and inverse modeling. The results revealed large mean differences among the city fluxes estimated from these methods (Fig. 1). In general, the inventory methods arrived at lower estimates of emissions compared to the atmospheric or top-down approaches. CH 4 fluxes calculated using the aircraft mass balance technique varied considerably between flights, more than would be expected from propagation of errors of the component measurements (Cambaliza et al., 2014;Lamb et al., 2016). The atmospheric inverse estimate was significantly higher than the inventory and some of the aircraft-derived values.
Biogenic emissions from the city are dominated by a landfill close to downtown, and these emissions are thought to be fairly well known (GHG reporting program), although evidence of possible variability in landfill emissions exists from Cambaliza et al. (2015), which used aircraft mass balance on five different occasions to calculate CH 4 flux from this  -December, 2014, and (g) contains the same five flights over April-July of 2011 as in (e) but uses different methodology. Methodologies for (c-f) are described in Lamb et al. (2016) and the methodology for (g) is described in Cambaliza et al. (2015). Error bars show 95 % confidence intervals (for more details see abovementioned articles). landfill. Uncertainty in total city emissions is mainly driven by the uncertainty in thermogenic emissions, which are hypothesized to emerge largely from the NG distribution system (Mays et al., 2009;Cambaliza et al., 2015;Lamb et al., 2016). In this study, we explore potential explanations for the discrepancies in CH 4 emissions estimates from Indianapolis and posit methods and recommendations for the study of CH 4 emissions from other urban centers.
We examine four different potential explanations for the CH 4 flux discrepancies reported in Lamb et al. (2016): (1) inconsistent geographic boundaries between top-down and bottom-up studies, (2) heterogeneity in the urban-scale CH 4 background and (3) temporal variability in urban emissions, which is not captured by the existing top-down studies, and (4) CH 4 sources that are not accounted for in the inventories. Well-calibrated CH 4 sensors on the INFLUX tower network (Miles et al., 2017a) collected continuous CH 4 ob-servations from 2013 to 2016 and provide a unique opportunity to explore these issues.

Experimental site
This study uses data from a tower-based GHG observational network located in the city and surrounding suburbs of Indianapolis, Indiana, in the Midwestern US. Prior studies have used varying definitions for the region of Indianapolis Lamb et al., 2016). In this work, we follow Gurney et al. (2012) and define Indianapolis as the area of Marion County. The flat terrain of the region simplifies interpretation of the atmospheric transport. The land-surface heterogeneity inherent in the urban environment (building roughness, spatial variations in the surface energy balance) does have a modest influence on the wind and boundary layer depth within the city compared to nearby rural areas . Figure 2 shows two domains that have been used for the evaluation of Indianapolis CH 4 emissions (Lamb et al., 2016;Lauvaux et al., 2016). The first domain is the whole area shown in the figure enclosing both Indianapolis and places that lie outside of its boundaries. This domain was used for the inversion performed in Lamb et al. (2016). The second domain is Marion County, outlined with a green dashed line. It is assumed here that this domain is much more representative of the actual Indianapolis municipal boundaries as this area encompasses the majority of the urban development associated with the city of Indianapolis (Gurney et al., 2012). The larger domain has three additional landfills that, based on the EPA gridded inventory (Maasakkers et al., 2016), increase Indianapolis CH 4 emissions by about 50 % when compared to the smaller domain. The inversion explained in Lamb et al. (2016) has been rerun for two of the domains mentioned above and the results ( Fig. 1) have been reexamined.

INFLUX tower network
The continuous GHG measurements from INFLUX are described in detail in Richardson et al. (2017). The measurements were made using wavelength-scanned cavity ring down spectrometers (CRDSs, Picarro, Inc., models G2301, G2302, G2401, and G1301), installed at the base of existing communications towers, with sampling tubes secured as high as possible on each tower (39-136 m above ground level (a.g.l.); Miles et al., 2017a). A few towers also included measurements at 10 m a.g.l. and one or two intermediate levels. While INFLUX tower in situ measurements began in September 2010, here we focus on the CH 4 measurements from 2013-2016. From June through December 2012, there were two or three towers with operational CH 4 measurements. By July 2013, five towers included measurements of  CH 4 , and throughout the majority of the years 2015-2016 there were eight INFLUX towers with CH 4 measurements ( Fig. 3). Comparisons between flask and in situ measurements and round-robin-style testing indicated compatibility across the tower network of 0.6 ppb CH 4 . In this study we use hourly means of CH 4 .

Meteorological data
Wind speed and direction were measured at the Indianapolis International Airport (KIND), Eagle Creek Airpark (KEYE), and Shelbyville Municipal Airport (KGEZ). The data used are hourly values from the Integrated Surface Dataset (ISD) (https://www.ncdc.noaa.gov/isd, last access: 8 April 2020) and 5 min values directly from the Automated Surface Observing System (ASOS). A complete description of ASOS stations is available at https://www.weather.gov/media/asos/ aum-toc.pdf (last access: 8 April 2020). The accuracy of the wind speed measurements are ±1 m s −1 or 5 % (whichever is greater) and the accuracy of the wind direction is 5 • when the wind speed is ≥ 2.6 m s −1 . The anemometers are located at about 10 m a.g.l. The wind data reported in ISD are given for a single point in time recorded within the last 10 min of an hour and are closest to the value at the top of the hour.
The planetary boundary layer height (BLH) was determined from a Doppler lidar deployed in Lawrence, Indiana, about 15 km to the northeast of downtown. The lidar is a Halo Streamline unit, which was upgraded to have extended range capabilities in January 2016. The lidar continuously performs a sequence of conical, vertical-slice, and staring scans to measure profiles of the mean wind, turbulence, and relative aerosol backscatter. All of these measurements are combined using a fuzzy-logic technique to automatically determine the BLH continuously every 20 min (Bonin et al., 2018). The BLH is primarily determined from the turbulence measurements, but the wind and aerosol profiles are also used to refine the BLH estimate. The BLHs are assigned quality-control flags that can be used to identify times when the determined BLH is unreliable, such as when the air is exceptionally clean, the BLH is below a minimum detectable height, or clouds and fog that attenuate the lidar signal exist. Additional details about the algorithm and the lidar operation for INFLUX are provided in Bonin et al. (2018). Doppler lidar measurements are available at https://www.esrl.noaa.gov/csd/projects/influx/ (last access: 8 April 2020).

Urban methane background
Both the aircraft mass balance and inverse modeling methodologies rely on an accurate estimation of the urban CH 4 enhancement relative to the urban CH 4 background in order to produce a reliable flux estimate (Cambaliza et al., 2014;Lamb et al., 2016). The CH 4 mole fraction enhancement is defined as where C downwind is the CH 4 mole fraction measured downwind of a source and C bg is the CH 4 background mole fraction, which can be measured upwind of the source, but this is not necessary. Background, as defined in this body of literature, is a mole fraction measurement that does not con-tain the influence of the source of interest, but which is assumed to accurately represent mole fractions that are upwind of the source of interest and measured simultaneously with the downwind mole fractions. The aircraft mass balance studies of Indianapolis mentioned used two main methods to determine a background value. The first method calculates an average of the aircraft transect edges that lie outside of the city domain (Cambaliza et al., 2014). In the second approach, a horizontally varying background is introduced by linearly interpolating median background values of each of the transect edges (Heimburger et al., 2017). In theory there is also a third method that uses an upwind transect as a background field, but in the studies above it was assumed that the edges are representative of an upwind flow. In the case of an inversion, it is common to pick a tower that is located away from urban sources and has on average the smallest overall enhancement . Because choosing the background involves a degree of subjectivity (Heimburger et al., 2017) we consider how these choices may influence emission estimates and introduce error, both random and systematic, using data from the INFLUX tower network.
Using tower network data from November 2014 through the end of 2016, two CH 4 background fields are generated for the city of Indianapolis based on two different sets of criteria. The notion is based on the fact that a choice of background is currently rather arbitrary in the literature (Heimburger et al., 2017), and at every point in time it is possible to choose multiple background values that are equally acceptable for the flux estimation. In our case both approaches identify a tower suitable to serve as a background for each of the eight wind directions (N, NE, E, SE, S, SW, W, NW), where an arc of 45 • represents a direction (e.g., winds from N are between 337.5 and 22.5 • ). Estimating background for different wind directions is implemented to more accurately represent upwind flow that is hopefully not contaminated by local sources.
Criterion 1 corresponds to a typical choice of a background in a case of tower inversion and is based on the concept that the lowest CH 4 mole fraction measured at any given time is not affected by the city sources and therefore is a viable approximation of the background CH 4 mole fractions outside of the city (Miles et al., 2017a;Lauvaux et al., 2016). Given this assumption, the tower with the lowest median of the CH 4 enhancement distribution (calculated by assuming the lowest measurement among all towers at a given hour as a background) for each of the wind directions over the November 2014 through December 2016 time period is chosen as a background site (Miles et al., 2017a). Criterion 2 requires that the tower is outside of Marion County (outside of the city boundaries) and is not downwind of any known regional CH 4 source (Fig. 2). For some wind directions, there are multiple towers that could qualify as a background; we pick towers in such a manner that they are different for each criterion given a wind direction in order to calculate the er- ror associated with the use of different but acceptable backgrounds. The towers used for both criteria and for each of the eight wind directions are displayed in Table 1. Quantifying differences between these two backgrounds allows for an opportunity to better understand the degree of uncertainty that exists in the atmospheric CH 4 background at Indianapolis.
To make the comparison as uniform as possible only data from 12:00-16:00 LST are utilized (all hours are inclusive) when the boundary layer is typically well mixed (Bakwin et al., 1998). A lag 1 autocorrelation is found between 12:00 and 16:00 LST; i.e., the hourly afternoon data are correlated to the next hour, but the correlation is not significant for samples separated by 2 h or more. Therefore, hours 13:00 and 15:00 LST are eliminated to satisfy the independence assumption for hourly samples. Furthermore, we make an assumption that the data satisfy steady-state conditions. If the difference between consecutive hourly wind directions exceeds 30 • or the difference between hours 16:00 and 12:00 LST exceeds 40 • , the day is eliminated. Days with average wind speeds below 2 m s −1 are also eliminated due to slow transport across the city (the transit time from tower 1 to tower 8 is about 7 h at a wind speed of 2 m s −1 ).

Frequency and bivariate polar plots
Frequency and bivariate polar plots are used in this work to gain more knowledge regarding CH 4 background variability based on criteria 1 and 2, and to identify sources located within the city. To generate these polar plots, we use the openair package (from R programming language) created specifically for air quality data analysis (Carslaw and Ropkins, 2012). Bivariate and frequency polar plots indicate the variability of a pollutant concentration at a receptor (such as an observational tower) as a function of wind speed and wind direction, preferably measured at the location of the receptor or within several kilometers of the receptor. The frequency polar plot is generated by partitioning the CH 4 hourly data into the wind speed and direction bins of 1 m s −1 and 10 • respectively. To generate bivariate polar plots, wind components u and v are calculated for hourly CH 4 mole fraction values, which are fitted to a surface using a generalized additive model (GAM) framework in the following way: where C is the CH 4 mole fraction transformed by a square root to improve model diagnostics such as a distribution of residuals, β is mean of the response, s is the isotropic smoothing function of the wind components u and v, and is the residual. For more details on the model see Carslaw and Beevers (2013).

Temporal variability and approximate flux estimation
Temporal variability in urban CH 4 emissions may play an important role in the corresponding emissions quantification procedures. Lamb et al. (2016) suggested that such temporal variability might partially explain the differences among CH 4 flux estimates shown in Fig. 1. If temporal variability of CH 4 emissions exists within the city, disagreements in the CH 4 flux between studies could be attributed to differences in their sampling period. Because the INFLUX tower data at Indianapolis contain measurements at all hours of the day over multiple years, we can utilize this dataset to better understand the temporal variability in methane emissions in the city. We apply a simplified atmospheric boundary layer budget, not to estimate precisely the actual city emissions, but rather to evaluate temporal variability of the emissions. We begin by assuming CH 4 emissions Q a (mass per unit time per unit area) are not chemically active and are constant over a distance x spanning a significant portion of the city. The next assumption is that a CH 4 plume measured downwind of the city is well mixed within a layer of depth H (which is the same as BLH). We treat wind speed u as constant within the layer for every hour considered. Given the abovementioned assumptions we can write a continuity equation describing mass conservation of CH 4 concentration C within a box in the following fashion: where C b is the CH 4 concentration upwind of the city (or background), and C a is the CH 4 concentration above the mixed layer (Hanna et al., 1982;Arya, 1999;Hiller et al., 2014). The left-hand side of the equation represents the change in CH 4 concentration with time, xQ a denotes a constant CH 4 source over the distance x, uH (C b − C) indicates a change of CH 4 concentration due to horizontal advection, and finally the x ∂H ∂t (C a − C) term accounts for the vertical advection and encroachment processes that result from changing BLH. By assuming steady-state conditions ( ∂C ∂t = 0 and ∂H ∂t = 0), the equation can be simplified to We use Eq. (4) to estimate hourly CH 4 emissions (Q a ) from Indianapolis (see assumptions in the paragraph below) given hourly averaged data of H from the lidar positioned in the city, wind speed (u) from the local weather stations, and upwind (C b ) and downwind (C) CH 4 mole fractions measured (and then converted to concentrations) at towers 1, 8, and 13 (depending on a wind direction) using data from heights of 40, 41, and 87 m respectively (see Fig. 2).
The CH 4 concentrations are derived from CH 4 mole fractions by approximating average molar density of dry air (in mol m −3 ) within the boundary layer for every hour of the day, where variability of pressure with altitude is calculated using the barometric formula, and it is assumed that temperature decreases with altitude by 6.5 K per kilometer. The hourly surface data for pressure and temperature are taken from KIND weather station. The difference between concentrations C and C b is instantaneous and not lagged, where C b represents an air parcel entering the city and C represents the same air parcel exiting the city (Turnbull et al., 2015). The CH 4 enhancements (C − C b ) are estimated for daytime by averaging observations spanning 12:00-16:00 LST and for nighttime by averaging observations spanning 20:00-05:00 LST. These time periods are based on lidar estimations of when on average H varies the least. Each daytime and nighttime were required to contain at least 3 and 9 h of CH 4 values respectively for averaging to occur, otherwise they were eliminated from the computation process. Observations when H is below 100 m are not used to avoid the cases when measurements from towers may be above the boundary layer. In order to better achieve the assumption that the boundary layer is fully mixed (especially at night), all hours with wind speeds below 4 m s −1 are eliminated (Van De Wiel et al., 2012). To approximate the emissions of the whole city we need to know the approximate area of the city and the distance over which the plume is affected by the city CH 4 sources. The area of the city is about 1024 km 2 (the area of Marion County) and the length that plume traverses when it is over the city ranges from 32 to 35 km depending on which downwind tower is used. We assume that CH 4 measurements at towers 8 and 13 are representative of a vertically wellmixed city plume as the towers are located outside of the city boundaries and allow for sufficient vertical mixing to occur. For S and SW wind directions tower 8 observations are used to represent downwind conditions, with background observations coming from towers 1 and 13, respectively (based on criterion 1 shown in Table 1). For W wind direction, tower 13 observations represent the downwind with background obtained from tower 1. The wind direction is required to be sustained for at least 2 h, otherwise the data point is eliminated.

Indianapolis CH 4 sources
Only a few known CH 4 point sources exist within Indianapolis Lamb et al., 2016). The South Side landfill (SSLF), located near the center of the city, is thought to be the largest point source in the city, with emissions ranging between about 28 mol s −1 (inventory from Maasakkers et al., 2016, GHG reporting program, and inverse estimates from ground-based mobile sampling employed in Lamb et al., 2016) and 45 mol s −1 (aircraft; Cambaliza et al., 2015) depending on an emission estimation methodology. However, using Cambaliza et al. (2015) aircraft data and applying a different background formulation, Lamb et al. (2016) found emission values of SSLF closely agreeing with the 28 mol s −1 estimate. SSLF could account for as little as 33 % (top-down from Cambaliza et al., 2015) or as much as 63 % (inventory from Maasakkers et al., 2016) of total Marion County CH 4 emissions. Other city point sources are comparatively small; the wastewater treatment facility located near SSLF contributes about 3-7 mol s −1 (inventory from Lamb et al., 2016), and the transmissiondistribution transfer station at the Panhandle Eastern Pipeline (also known as a city gate and in this study abbreviated to PEP) is estimated to be about 1 mol s −1 (inventory from Lamb et al., 2016). The remaining CH 4 sources, mainly from NG infrastructure leaks and livestock, are considered to be diffuse sources and are not well known. Potential sources of emissions related to NG activities include gas regulation meters, transmission and storage, distribution leaks, and compressed natural gas (CNG) fleets. These diffuse NG sources account for 21 %-67 % of the city emissions or 20 mol s −1 (inventory from Maasakkers et al., 2016) to 64 mol s −1 (topdown from Cambaliza et al., 2015). Livestock emissions for Marion County are estimated to be around 1.5 mol s −1 (inventory from Maasakkers et al., 2016). These prior studies present conflicting conclusions regarding the magnitude of the diffuse NG CH 4 source in Indianapolis.

Inversion and city boundaries
A significant portion of CH 4 emissions across the US can be characterized by numerous relatively large point sources scattered throughout the country rather than by broad areas of smaller enhancements (Maasakkers et al., 2016). Because of this, the total emissions for a given domain can be very sensitive to how that domain is defined. A small increase or decrease in the domain area could add or remove a large point source and significantly impact the total emissions defined within the domain.
In the case of Indianapolis, this issue became apparent when the emissions were calculated using an atmospheric inversion model (Lamb et al., 2016;Lauvaux et al., 2016). The atmospheric inversion solved for fluxes in domain 1 (Fig. 2), which significantly increased the estimated emissions in comparison with the inventory values that were gathered mainly within Marion County (domain 2). When reduced to domain 2, the inverse modeling emission estimate decreases to 107 mol s −1 (from about 160 mol s −1 ), which falls within an error bar of the Lamb et al. (2016) inventory estimate. This difference is significant and could at least partially explain the discrepancy shown in Fig. 1 between the emission values from the inventories and emission results from the inverse modeling. However, even the decreased inverse modeling estimate is about 91 % higher than the inventories.
Additionally, the subject of the domain is relevant for airborne mass balance flights because a priori the magnitude and variability of background plume is unknown and could be easily influenced by upwind sources. The issue of background is discussed further in the next section.

Variability in CH 4 background
Comparisons between criterion 1 and criterion 2 CH 4 background mole fractions as a function of wind speed and direction are visualized using frequency and bivariate polar plots (Fig. 4). Both backgrounds generally agree on the higher CH 4 originating from the SW, SE, and E wind directions (Fig. 4c-f); however, the values themselves differ, especially when winds are from NW, SW, and SE. As background difference plots (Fig. 4g-h) indicate, there is a noticeable variability between the magnitudes of the CH 4 backgrounds, where criterion 2, by design, typically has higher background mole fractions. The background differences, at a given hour, suggest that the CH 4 field flowing into the city is heterogeneous, with differences between towers ranging from 0 to over 45 ppb (Fig. 4g). Because large gradients in CH 4 background over the city could pose challenges for flux estimations using top-down methods such as inverse modeling and aircraft mass balance, it is imperative to establish whether the background differences vary randomly or systematically and how to choose a background to minimize these errors.
To further understand the nature of background variability we calculate the mean, standard deviation, and standard error of background hourly differences between criterion 2 and criterion 1 from November 2014 to December 2016 for each of the eight wind directions mentioned in Table 1. The results are shown in Fig. 5. Systematic bias is evident for the SE, S, SW, W, and NW wind sectors, whereas random error dominates the N, NE, and E wind directions. Wind directions showing statistically significant bias have mean biases ranging from 2 to 5 ppb, with values as large as 8 ppb falling within the range of 2 times the standard error. The standard deviation plot indicates a potential background discrepancy that can occur on any given day, where the W wind direction is the least variable, with 2 times the standard deviation close to 20 ppb, while SE wind direction is the most variable, with 2 times the standard deviation falling at about 50 ppb.
Random errors in the mole fractions of background differences (biases) are also important and are a function of the length of the data record. We quantify the random error in the CH 4 background mole fraction differences using the bootstrap method by randomly sampling 2 to 150 h (small and large sample size) of the background CH 4 differences for each of the wind directions with replacements (we make the assumption that our differences are independent since we eliminated lag 1 autocorrelation from the data). This subsampling experiment is repeated 5000 times (Efron and Tibshirani, 1986). The standard deviations of the mean (stan-dard error) of the 5000 simulated differences are calculated for each wind direction. The resulting standard errors of the city CH 4 background differences, multiplied by 2 to represent the 95 % confidence intervals, are shown as a function of the length of the data record in Fig. 6. Because random error falls as sample size grows it makes sense to assign a threshold indicating a minimum number of samples needed to achieve a theoretical precision for each wind direction.
One way to assign a required precision would be to make sure that the standard error (random error) reaches a point where it is less than the Indianapolis enhancement of about 12 ppb (a higher estimate of the Indianapolis enhancement from Sect. 3.3) by a factor of 2 when combined with a bias (Table 2), meaning that the sum of bias and standard error must be at most 6 ppb. In this approach each wind direction Figure 5. Average of the differences between criteria 2 and 1 CH 4 backgrounds at Indianapolis as a function of wind direction. These averages are generated from the same data that are used in Fig. 4 and reflect results shown in Fig. 4g. Error bars indicate in (a) 2 times the standard error and in (b) 2 times the standard deviation. would have a different threshold because of the differences in biases. For instance, given this requirement the NW direction would need a random error of 1 ppb since its bias is 5 ppb. For the NW direction, this threshold would require more than 150 samples. For the N direction on the other hand, where the bias is 1 ppb, the requirement is fulfilled when random error crosses 5 ppb at 74 samples. Now we consider these random and systematic errors in CH 4 background differences in the context of Indianapolis urban CH 4 emissions. For Indianapolis, using the INFLUX network, we estimated that depending on sample size (number of hours sampled) and wind direction, background gradient across the city Figure 6. Bootstrap simulation of two times the standard error in Indianapolis CH 4 background mole fraction differences (between criteria 2 and 1) as a function of sample size and wind direction (see text for details). Thresholds for each of the wind directions indicate a random error threshold needed for the background uncertainty to be within 50 % of Indianapolis CH 4 enhancement of 12 ppb. over 12:00-16:00 LST could vary from 0 to about 50 ppb (Fig. 5b). Given that the average afternoon CH 4 enhancement of the city is around 8-12 ppb (Sect. 3.3; Fig. 7; Cambaliza et al., 2015;Miles et al., 2017a), the error on the estimated emissions could easily be over 100 % if the analysis does not approach the issue of background with enough sampling. A sample size of about 50 independent hours significantly decreases background uncertainty for N, NE, E, S, and W wind directions and allows for a more accurate assessment of the CH 4 emissions at Indianapolis. For CH 4 sources with a significantly larger signal than their regional background, the mentioned background variability becomes less impactful on results, but because Indianapolis is a relatively small emitter of CH 4 , and because there are relatively large sources outside of the city, uncertainties due to background estimation are comparatively large. Our uncertainty assessment suggests that the highly variable CH 4 emission values of Indianapolis from aircraft mass balance calculations shown in Fig. 1 are at least partially due to the variability in the urban CH 4 background of Indianapolis.  temporal comparison because they do not contain major BLH data gaps. The error bars in the figure show the standard error multiplied by 2 indicating 95 % confidence interval of each average. One of the more interesting features in Fig. 7 is a daynight variability of CH 4 emissions at Indianapolis. The most prominent example of this feature is found in Fig. 7c, where the estimates for both years suggest that daytime emissions are approximately twice as large as the emissions at night. The decrease in the CH 4 emissions at night also appears in tower 13, but the errors are too high in those estimates to make any definitive conclusions. A similar urban CH 4 emissions diurnal variability is reported by Helfter et al. (2016) in their study of GHGs for London, UK, where they attribute diurnal variation of CH 4 emissions to the NG distribution network activities, fugitive emissions from NG appliances, and temperature-sensitive CH 4 emission sources of biogenic origin (such as a landfill). Taylor et al. (2018) suggest that CH 4 emissions from landfills exhibit a diurnal cycle with higher emissions in the early afternoon and 30 %-40 % lower emissions at night.

Temporal variability of methane enhancements and fluxes in Indianapolis
With regard to yearly temporal variability we are only able to compare years 2014 and 2016 due to limited BLH data for other years. Results from both towers suggest that Indianapolis overall CH 4 emissions did not change significantly between 2014 and 2016. Although it is important to be cautious about interpreting actual flux estimations given the as-sumptions mentioned in Sect. 2.6, it is interesting to note that the flux values from both towers average to about 70 mol s −1 , which puts our value right in between inventory and inversion estimates shown in Fig. 1. If we assume that SSLF emissions are generally known (GHG reporting program) that would indicate that emissions from NG distribution are likely to be about 14 mol s −1 (70 %) higher than what both of the inventories currently estimate but within the error bars of the Lamb et al. (2016) inventory calculation. Another possible scenario is that SSLF emissions are higher than what is currently assumed. Given these complexities, uncertainty regarding the exact emissions from NG distribution at Indianapolis still remains.

Methane sources in Indianapolis
Bottom-up emission inventories have difficulty tracking changes in sources over time. Our continuous tower network observations can monitor temporal and spatial variability in sources of CH 4 in Indianapolis. To do so we employ the aforementioned bivariate polar plots to verify known sources and potentially identify unknown sources across the city. We compare two time periods, 2014-2015 (two full years) and 2016. Figure 8 displays bivariate polar plots of CH 4 enhancements using criterion 1 background at 9 INFLUX towers in Indianapolis over the two years of 2014 and 2015. Figure 9 shows the same plot, but for the year 2016. Here we have separated 2016 from 2014-2015 because of different results noted during these times.
The images reveal that the most consistent and strongest source in the city is the SSLF. This is most evident from the 40+ ppb CH 4 enhancements detected at towers 7, 10, and 11 coming from the location of the SSLF (by triangulation). Enhancements from the landfill appear to also be detectable at towers 2, 4, 5, and 13. Based on these observations it can be concluded that there are no other point sources in Marion County comparable in size to the SSLF. A small fraction of the SSLF plume is likely due to the co-located wastewater facility, but the inventory estimates suggest that the wastewater treatment facility is responsible for no more than 7 % of this plume Maasakkers et al., 2016). The PEP, located in the northwestern section of the city, may be partially responsible for a plume of 5-10 ppb at towers 5 and 11. However, the plume is less detectable using the criterion 2 background value that has higher background (using tower 8 as a background) from the NW wind direction (not shown), adding uncertainty to the true magnitude of the enhancement from this source. The same is true for towers 2 and 13, which have pronounced plumes when winds are from the NW with the criterion 1 background, but when background 2 is used these plumes vanish (not shown). Such inconsistency makes it difficult to attribute these plumes to a specific source.
Another important point is the cluster of large enhancements surrounding tower 10 in 2014-2015. Because no other tower sees these enhancements (at least at comparable mag- nitudes), we believe that they are the result of nearby NG leaks. These plumes are not consistent temporally or spatially as they mostly disappear in 2016, potentially indicating that they are transient and localized NG distribution leaks. It is difficult to ascertain the exact combined magnitude of these leaks since they mix together with SSLF into an aggregated city plume when observed from downwind towers such as 8 and 13. None of the individual leaks appears to be similar in magnitude to the emissions that originate from SSLF. Diffuse NG emissions comparable to the SSLF source (Lamb et al., 2016) may exist. Our flux estimations at towers 8 and 13, however, imply that the magnitude of the NG diffuse source suggested by the top-down analyses in  and Lamb et al. (2016) are probably overestimates (see Sect. 3.3). We hypothesize that the relatively high Indianapolis CH 4 emissions (see Fig. 1) reported by Cambaliza et al. (2015) could be a result of random errors in upwind conditions (see Sect. 3.2) influencing the small number of airborne flux estimates.

Conclusions
We have examined four potential contributions to discrepancies between urban top-down and bottom-up estimates of CH 4 emissions from Indianapolis: domain definition, heterogeneous background mole fractions, temporal variability in emissions, and sources missing from inventories. Results indicate that the urban domain definition is crucial for the comparison of the emission estimates among various methods. Our atmospheric inverse flux estimates for Marion County, which is similar to the domain that is analyzed by inventory and airborne mass balance methodologies (Mays et al., 2009;Cambaliza et al., 2014;Lamb et al., 2016), is 107 mol s −1 compared to the 160 mol s −1 that is estimated for the larger domain (Hestia inventory domain; Gurney et al., 2012). This partially explains higher emissions in inverse modeling estimates shown by Lamb et al. (2016); however, 107 mol s −1 is still 91 % higher than what EPA and Lamb et al. (2016) find in their inventories (Fig. 1).
To better understand background variability at Indianapolis two different but acceptable background estimates, based on specific criteria for each wind direction, and their differences are used to assess the heterogeneity of the CH 4 background at Indianapolis. Background criterion 1 looks for a tower that is consistently lower than other towers, while background criterion 2 picks a tower that is outside of Marion N. V. Balashov et al.: Background heterogeneity and other uncertainties County domain and is not downwind of any nearby sources as determined by the EPA 2012 inventory. We focus on midday atmospheric conditions to avoid the complexities of vertical stratification in the stable boundary layer. The midday Indianapolis atmospheric CH 4 mole fraction background is shown to be heterogeneous, with 2-5 ppb statistically significant biases for the NW, W, SW, S, and SE wind directions. Random errors of background differences are a function of sample size and decrease as a number of independent samples increase. Small sample sizes, such as a few hours of data from a single point, are prone to random errors on the order of 10-30 ppb in the CH 4 background, similar to the magnitude of the total enhancement from the city of Indianapolis, which is estimated to be on average around 10-12 ppb. Longer-term sampling and/or more extensive background sampling are necessary to reduce the random errors. Sample size required to reduce random errors of background differences to an acceptable value for flux calculation is largely dependent on a wind direction. Both bias (long-term average of background differences) and its random error are important when estimating total background uncertainty. The results indicate that the N, NE, E, S, and W wind directions are more favorable for flux estimation and would require multiple days of measurements (e.g., about 50 independent hours of measurements) to reduce background uncertainty to about 6 ppb, which is half the magnitude of the typical CH 4 enhancement from Indianapolis. The remaining wind directions would require over 150 independent hourly measurements to achieve similar precision. We also estimate that depending on a wind direction for any given hour the spatial variability in background can be anywhere from 0 to 50 ppb. This uncertainty in the CH 4 background may partially explain the Heimburger et al. (2017) finding of large variability in airborne estimates of Indianapolis CH 4 emissions. Given many samples, the airborne studies converge to an average value of CH 4 flux that is noticeably closer to the inventory estimates for Indianapolis than several of the individual estimates presented in Fig. 1.
Measurement and analysis strategies can minimize the impacts of these sources of error. Spatially extensive measurement of upwind CH 4 mole fractions are recommended. For towers or other point-based measurements, multiple upwind measurement locations are clearly beneficial. For the aircraft mass balance approach, we recommend an upwind transect to be measured, lagged in time if possible, to provide a more complete understanding of the urban background conditions. Complex background conditions might suggest that data from certain days or wind directions should not be used for flux calculation. Finally, a mesoscale atmospheric modeling system informed with the locations of important upwind CH 4 sources can serve as a powerful complement to the atmospheric data (Barkley et al., 2017). Such simulations can guide sampling strategies and aid in interpretation of data collected with moderately complex background conditions.
With regard to temporal variability, no statistically detectable changes in the emission rates were observed when comparing 2014 and 2016 CH 4 emissions. However, a large difference between day and night CH 4 emissions was implied from a simple budget estimate. Night (20:00-05:00 LST) emissions may be 2 times lower than the emissions during the afternoon (12:00-16:00 LST) hours. Because prior estimates of top-down citywide emissions are derived using afternoon-only measurements, overall emissions of Indianapolis may be lower than these studies suggest. This bias may be present in studies performed in other cities as well. Our study suggests that day-night differences in CH 4 emissions must be understood if regional emission estimates are to be calculated correctly. Long-term, tower-based observations are an effective tool for understanding and quantifying multi-year variability in urban emissions.
One final point addressed in this study is the location of major CH 4 sources in Indianapolis. Analysis of the INFLUX tower observations suggest a diffuse NG source that exceeds both of the inventory estimates by 70 %, but additionally our analysis shows that the discrepancy is less than that proposed by the highest values reported in Lamb et al. (2016) (see Fig. 1). Uncertainty remains regarding the magnitude of the diffuse NG source of CH 4 . The only major point source in the city is SSLF and it is observed at multiple towers. There is evidence for occasional point-source NG leaks, but they appear to be transient in time and limited in their strength.
Overall, assessment of the CH 4 emissions at Indianapolis highlights a number of uncertainties that need to be considered in any serious evaluation of urban CH 4 emissions. These uncertainties are amplified for Indianapolis since the enhancement signal from its CH 4 emissions is comparable in magnitude to variability in the regional background flow and as our results show it may be difficult at times to distinguish noise in the background from the actual city emissions signal. The evaluation of larger CH 4 sources may be easier with respect to separating signal from background. However, all of the points raised in this work will be nonetheless relevant and need to be addressed for our understanding of urban CH 4 emissions to significantly improve.
Data availability. All of the data from the INFLUX tower network used in this article are available at https://doi.org/10.18113/D37G6P (Miles at al., 2017b).
Author contributions. NVB, KJD, and NLM developed the study and worked together on generating the main hypothesis of this work. They also wrote most of the paper. NVB wrote all of the codes and performed the analyses presented in this work as well as generating all of the figures. NLM and SJR helped with maintenance and gathering of the INFLUX tower data. They also wrote Sect. 2.2 of the paper. TL helped with the analysis presented in Fig. 1 and Sect. 3.1 concerning interpretation of the inversion modeling results from Lamb et al. (2016). Thomas Lauvaux also helped with repeating the inversion experiment for two different Indianapolis domains (Fig. 1). ZRB significantly contributed to discussions regarding the hypothesis and careful presentation of Sects. 2.6 and 3.3. TAB provided all of the lidar data and wrote the second part of Sect. 2.3 regarding the lidar and the methodology used to determine planetary boundary layer heights. He also contributed to Sects. 2.6 and 3.3.