An evaluation of the efficacy of very high resolution air-quality modelling over the Athabasca oil sands region, Alberta, Canada

Russell, Matthew; Hakami, Amir; Makar, Paul A.; Akingunola, Ayodeji; Zhang, Junhua; Moran, Michael D.; Zheng, Qiong

doi:https://doi.org/10.5194/acp-19-4393-2019

Articles | Volume 19, issue 7

Research article

04 Apr 2019

Abstract

We examine the potential benefits of very high resolution for air-quality forecast simulations using a nested system of the Global Environmental Multiscale – Modelling Air-quality and Chemistry chemical transport model. We focus on simulations at 1 and 2.5 km grid-cell spacing for the same time period and domain (the industrial emissions region of the Athabasca oil sands). Standard grid cell to observation station pair analyses show no benefit to the higher-resolution simulation (and a degradation of performance for most metrics using this standard form of evaluation). However, when the evaluation methodology is modified, to include a search over equivalent representative regions surrounding the observation locations for the closest fit to the observations, the model simulation with the smaller grid-cell size had the better performance. While other sources of model error thus dominate net performance at these two resolutions, obscuring the potential benefits of higher-resolution modelling for forecasting purposes, the higher-resolution simulation shows promise in terms of better aiding localized chemical analysis of pollutant plumes, through better representation of plume maxima.

Received: 11 Sep 2018 – Discussion started: 19 Oct 2018 – Revised: 06 Mar 2019 – Accepted: 19 Mar 2019 – Published: 04 Apr 2019

1 Introduction

Numerical modelling of the atmosphere in an Eulerian framework relies on discretization of the computational domain into a numerical grid. The horizontal grid-cell size of atmospheric simulations can range from hundreds of kilometres to the metre-scale of large eddy simulation models. Air-quality model grid-cell size typically follows the grid-cell sizes used in weather forecasting models, which in turn have followed a gradual progression towards finer discretization where more explicit representation of cloud formation and local radiative transfer effects may be represented. The most recent weather forecasting applications (e.g. Leroyer et al., 2014) have reached grid-cell sizes as small as 250 m over limited domains such as individual cities, and have shown promising results in terms of being able to resolve some aspects of local circulation. In addition, as grid resolution reaches the 3 to 4 km scale, explicit cloud microphysics packages may be used, allowing potentially better performance, particularly with regards to feedbacks between meteorology and chemistry (Yu et al., 2014; Gong et al., 2015). However, while these models promise better physical representation of local chemistry, their performance may be limited by the quantity and availability of initialization and boundary condition meteorological data; these data may be used in a data assimilation context to improve their initial state. The accuracy of broader-scale meteorological predictions may thus influence local model accuracy, despite the ongoing decrease in meteorological model (and consequently air-quality model) grid-cell size. Some recent air-quality model simulation studies with grid-cell sizes on the order of 1 to 4 km include Thompson and Selin (2012), Li et al. (2014), Joe et al. (2014), Kheirbek et al. (2014, 2016), and Pan et al. (2017).

For the purposes of this study, very high resolution (VHR) modelling refers to the current higher-resolution limits of chemical transport models (CTMs), employing a horizontal grid-cell spacing of 1 km or less. It is in this regime that the photochemical processes may be forecast with resolved microphysics (e.g. Milbrandt and Yau, 2005a, b) and detailed particle and gas-phase chemistry, using currently available computer technology. VHR modelling is very computationally expensive, and also introduces its own set of challenges, such as the availability of surface boundary condition fields as the model grid-cell size decreases. Moreover, it is not currently clear whether decreases in model grid-cell size lead to more accurate results when compared to observations. The motivation behind VHR modelling in CTMs is to reduce the impact of diluting chemical concentrations – especially from averaging emission plumes into large grid cells – in order to better capture inhomogeneities in emission profiles, to better simulate local transport processes associated with terrain that would otherwise be smoothed by the use of a coarse grid, and to reduce truncation errors and hence achieve better numerical accuracy (Jacobson, 1999).

We note here that while the terms “grid-cell size” and “resolution” tend to be used interchangeably in the literature, this is not true in a precise mathematical sense; more formally, the ability to resolve features of size 2Δx requires a grid-cell spacing of size Δx, and the highest spatial frequency which can be reconstructed from a discrete sampling of the latter grid-cell spacing will be $\frac{1}{2\Delta x}$, the Nyquist wavenumber of the grid-cell size discretization. Furthermore, atmospheric models may make use of energy dissipation techniques that broaden the size of resolvable wavelengths to 3 to 4 Δx (Grasso, 2000; Pielke, 2001). Model resolution is thus a function of, but not equivalent to, grid-cell size. Here, we define “resolution” as the ability of a model to clearly distinguish components of a predicted atmospheric variable, as a function of grid-cell size.
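
As a hedged numerical illustration of the distinction drawn above, the following Python sketch evaluates the Nyquist wavenumber and the smallest resolvable wavelength for the two grid-cell sizes compared in this study. The function names are hypothetical, and the factor of 4 is simply the upper end of the 3 to 4 Δx range quoted above, not a property of any particular model.

```python
def nyquist_wavenumber(dx_km):
    """Highest spatial frequency (cycles per km) representable at spacing dx_km."""
    return 1.0 / (2.0 * dx_km)


def smallest_resolved_wavelength(dx_km, factor=2.0):
    """Smallest resolvable wavelength in km.

    factor=2 is the Nyquist limit; 3 to 4 reflects the broadening caused by
    energy dissipation techniques mentioned in the text (an assumed range).
    """
    return factor * dx_km


for dx in (1.0, 2.5):
    print(
        f"dx = {dx} km: Nyquist wavenumber = {nyquist_wavenumber(dx)} per km, "
        f"2dx = {smallest_resolved_wavelength(dx)} km, "
        f"4dx = {smallest_resolved_wavelength(dx, 4.0)} km"
    )
```

Under these assumptions, a 2.5 km grid may only distinguish features of roughly 5 to 10 km, while a 1 km grid may distinguish features of roughly 2 to 4 km.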

The ability of a model to distinguish these features is also affected by uncertainties in model inputs. For example, in a large rural setting, a large model grid cell will represent an area containing many roads, whose emissions will be averaged into one value per species per time. As the grid-cell size decreases, however, this averaging effect will be reduced, giving each road's emissions more impact on the resulting concentrations in the grid cell containing it. However, the smaller grid-cell size will also result in steeper concentration gradients in the model between adjacent grid cells, which can in turn result in numerical instabilities that contaminate predictions (Salvador et al., 1999). At the same time, a reduction in grid-cell size can be shown formally to reduce inaccuracies in the discretization of the governing equations for atmospheric motion (Coiffier, 2011). Previous efforts to address these issues through variable grid size or structure in air quality modelling have not received sustained attention, and therefore most current air quality models use a uniform (albeit nested) grid-cell size in applications (Garcia-Menendez et al., 2010; Kumar et al., 1997).
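
The averaging effect described above can be sketched with simple arithmetic. The Python example below is illustrative only: the emission rate is a hypothetical value, the function name is an assumption, and the source is assumed to lie entirely within a single grid cell.

```python
def cell_mean_flux(emission_rate_g_s, dx_km):
    """Emission flux (g s^-1 km^-2) after averaging one source over a dx-by-dx cell."""
    return emission_rate_g_s / (dx_km * dx_km)


road_emission = 100.0  # hypothetical emission rate in g s^-1

for dx in (10.0, 2.5, 1.0):
    print(f"dx = {dx} km: cell-mean flux = {cell_mean_flux(road_emission, dx)}")

# Ratio of the flux seen by a 1 km cell to that seen by a 2.5 km cell
sharpening = cell_mean_flux(road_emission, 1.0) / cell_mean_flux(road_emission, 2.5)
```

In this idealized case the same source yields a cell-mean flux 6.25 times larger at 1 km than at 2.5 km, which is also why any error in the location or magnitude of that source carries more weight at the smaller grid-cell size.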

As resolution increases further, the presence of local topographical features (e.g. buildings and street canyons) becomes more important. Both the increased topographic complexity and potential numerical instabilities can lead to differences in meteorological forcing as resolution increases (Wolke et al., 2012; Gego et al., 2005). The contribution of meteorological uncertainties due to resolution becomes more significant, especially for secondary pollutants such as ozone (Valari and Menut, 2008) or secondary particulate matter (PM). For example, Markakis et al. (2015) in their analysis of 4 km CHIMERE simulations for the relatively flat terrain of Paris, France, suggested that model meteorological grid-cell size does not significantly impact forecast accuracy. That may not have been the case, had their terrain been more complex. In contrast, Queen and Zhang (2008) observed considerable meteorological sensitivity to the more complex terrain in their 4 km resolution Community Multiscale Air Quality (CMAQ; EPA, 1999) model simulations over the Appalachian Mountains in the eastern United States, as did Salvador et al. (1999) for meteorological model simulations.

A number of studies have tried to evaluate the benefits of higher-resolution simulations and to quantify the impact of sub-grid variability by using different model grid-cell sizes (Vardoulakis et al., 2003; Ching et al., 2006; Pepe et al., 2016). These studies have often demonstrated that failure to account for higher-resolution features may result in mischaracterization of concentrations or health impacts (Isakov et al., 2007), although the capability of current models to provide this information with sufficient accuracy is unclear. One study found that increasing resolution did not change predicted health outcomes and concluded that “resolution requirements should be assessed on a case-by-case basis” (Thompson and Selin, 2012), while others (e.g. Kheirbek et al., 2014, 2016) have employed 1 km resolution without discussing the impacts of resolution on predicted health outcomes. Population exposure studies using air pollution models may be affected by resolution in a more complex fashion, given that the predicted field (a pollutant with a known health impact) and the data to which the predicted field is to be linked (the human population) both have resolution dependencies. The health studies carried out to date highlight the need for better understanding of the underlying controlling factors for model accuracy with decreasing grid-cell size.

Terrain and meteorology are not the only factors that contribute to greater uncertainties as horizontal grid-cell size is reduced – for example, the ability of the model to locally resolve emission fluxes may also become a factor. This may result in improved or deteriorated model performance as the size of the grid cells decreases. Gridded model emissions may have an intrinsic resolution dependence due to the underlying spatial disaggregation fields, and this can contribute to uncertainties and errors in emissions as grid-cell size is decreased. For instance, Valari and Menut (2008) found that the discrepancy between their modelled and observed concentrations grew rather than shrank, in response to decreases in grid-cell size from 48 to 6 km, and they associated these results with changes in the resulting local emission fluxes. They showed that in their model setup, with regard to ozone, a grid-cell size was reached (12 km × 12 km) where errors in inputs (errors in the emission inventory, wind direction, etc.) outweighed the importance of other sources of model error such as grid-cell size. The authors, however, noted that Paris' ozone photochemistry very often resides on the transition between a NO_x-sensitive and a VOC-sensitive regime (Sillman et al., 2003). These are chemical conditions which can alternatively produce or titrate ozone and hence have a degree of sensitivity to precursor emissions, and therefore, also, to any errors in those emissions. Conversely, in a 3-level nested MM5–CMAQ simulation with grid-cell sizes going from 9 to 3 to 1 km over Osaka, Japan, Shrestha et al. (2009) found that ozone comparisons to observations improved as the grid resolution increased. 
This was also the case for a 3-level nested MM5–CMAQ simulation going from 36 to 12 to 4 km over Houston, USA (Ching et al., 2006), where the ozone forecast improvement associated with higher resolution was attributed to the ability of the finer grid-cell size model nests to adequately resolve high concentrations of freshly emitted NO_x and hence allow for more local ozone titration. The latter process might not take effect until the grid-cell size is sufficiently fine to resolve the NO_x source patterns (i.e. a level where traffic and industrial sources can be identified). This titration was not seen until they decreased their grid-cell sizes to 2 km and smaller. Stroud et al. (2011) noted a similar grid-cell-size-dependent chemical impact on model performance, where secondary organic aerosol formation maxima were better simulated with a 2.5 km grid-cell size model than a 10 km grid-cell size model. In general, the impact of resolution on model performance appears to depend on a number of factors, such as the terrain, spatial distribution of sources, pollutant of concern, season, etc. (Arunachalam et al., 2006; Queen and Zhang, 2008; Dore et al., 2012).

Salvador et al. (1999) studied the prediction accuracy impacts of meteorological model grid-cell size in a region with a complex domain and found that 2 km or smaller grid-cell sizes were required to resolve local-scale complex terrain flow features, and that daytime vertical advection and predictions of turbulent kinetic energy and potential temperature were influenced by grid-cell size. Dore et al. (2012) evaluated air quality model NO₂ simulations employing 1, 5, and 50 km grid-cell sizes against observations and found the best performance for the 1 km simulation, with more physically realistic distributions of reactive nitrogen, attributing this performance gain to more realistic precipitation simulations and emissions inputs for the smallest grid-cell size. The availability of high-resolution emissions information may be a limiting factor in improved simulations as grid-cell size decreases. Valari and Menut (2008) noted that emissions inaccuracy was the principal cause of noise in small grid-cell size simulations conducted for the Paris area, and proposed the use of statistical downscaling in favour of predictive modelling at scales at or below 1 km grid-cell size. The current state of model science is typically evaluated through multi-model intercomparisons (e.g. Im et al., 2015), and the meta-analysis of these studies can be used to provide useful benchmarks to assess current model performance for specific model species and observations (Emery et al., 2017). However, such studies do not identify the causes for good or poor performance relative to the benchmarks – diagnostic studies “in which chemical and physical processes within the model are analyzed individually and collectively” (Emery et al., 2017) are required for this purpose. Examinations of the impact of model grid-cell size on performance are an example of such a diagnostic evaluation.

The benefits for model performance with increased spatial resolution are unclear, based on the above literature. However, most papers converge towards the following qualitative conclusions:

The impact of terrain topology on meteorological forcing as grid-cell size decreases can dwarf the impact of a more accurate spatial apportionment of the corresponding emissions.
Decreases in grid-cell size result in a more realistic spatial distribution of chemical species, whether or not model performance is improved.
Uncertainties in spatial and temporal emissions allocation have an increasing influence on overall model uncertainty as model grid-cell size decreases.

The 1980s saw several studies in which the potential impacts of wind direction errors on dispersion model performance were examined. Fox (1981) noted that pairing of model output at observation station locations could be done as a function of both time and space: as a function of time (by combining the data across all stations), as a function of space (by combining all times, at each station location), or without any pairing (observations and data were compared as cumulative frequency distributions). The accuracy of regulatory dispersion models in the early 1980s was such that Fox (1984) concluded that model and observation values paired in time and space exhibited “little to no correlation” and discussed potential errors associated with transport. Poor correlations were also noted in the report on the first generation of reactive-transport models by Hanna (1988), who stated “wind direction errors are the major cause of the poor agreement in hourly predictions of concentrations at short distances downwind of point sources,” and described metrics for air-quality model evaluation. Hanna (1988) also noted that model predictions could be offset in space and time relative to observations, leading to poor performance statistics, despite a greater degree of similarity of behaviour if the offsets are taken into account. Errors in wind-field modelling were described as the main source of error in simulations of plumes by Carhart et al. (1989), who again showed how better agreement resulted when model and observations were unpaired in time and/or space and noted that other metrics such as maximum plume width might better represent model performance. Lee (1987) found that small perturbations in space and time could result in poor correlations, despite similar histogram distributions of both model and observations.
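
The effect of a spatial offset on paired statistics can be illustrated with a minimal Python sketch. The concentration values below are hypothetical: the modelled plume is identical to the observed one but displaced by two stations, as might result from a wind direction error. The comparison of sorted values stands in for an unpaired comparison of frequency distributions and is an illustrative choice, not the method of any of the studies cited above.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical concentrations along a line of eight stations
obs = [0.0, 0.0, 1.0, 5.0, 1.0, 0.0, 0.0, 0.0]
mod = [0.0, 0.0, 0.0, 0.0, 1.0, 5.0, 1.0, 0.0]  # same plume, shifted two stations

paired_r = pearson(obs, mod)                    # station-by-station pairing
unpaired_r = pearson(sorted(obs), sorted(mod))  # distributions only, no pairing

print(f"paired r = {paired_r:.3f}, unpaired r = {unpaired_r:.3f}")
```

The paired correlation is negative even though the two distributions are identical, which is the behaviour the unpaired metrics discussed above were intended to expose.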

More recently, Kang et al. (2007) examined the concept of using the area of the limiting resolution of the model (2 to 3Δx, where Δx is the horizontal grid-cell size) to weight or spatially average model evaluation metrics for a single grid-cell size, noting how the model's rated ability to capture high-concentration events (“hits”) was increased when the limiting resolution of the model was incorporated into the performance metrics. However, the use of averaging may mask the potential for a model with a small grid-cell size to contain both the desired plume magnitude and much lower concentrations, within the same larger representative area, in turn masking the potential impact of the reduction in grid-cell size.

We expand on this concept to evaluate the impact of model grid-cell size in the context of an equivalent area about a given observation location. We examine area-weighted metrics in the form of averages over roughly equivalent areas for different model grid-cell sizes, and also use the a priori knowledge of the observations to determine whether the closest match to observations may be found within an equivalent area. We show that the latter metric demonstrates a positive impact of model grid-cell size on simulation results, while more simple paired comparisons, and averages over similar areas, mask these benefits.
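
The three kinds of comparison discussed above (paired grid cell, equivalent-area average, and closest match within an equivalent area) can be sketched as follows. This is a minimal illustration, not the evaluation code used in this study: the function names, the concentration fields, the observed value, and the window half-widths (3 cells at 1 km and 1 cell at 2.5 km, giving roughly 7 km and 7.5 km squares) are all assumptions.

```python
def window(field, i, j, half):
    """Values in the square block of half-width `half` centred on (i, j), clipped at edges."""
    rows = range(max(0, i - half), min(len(field), i + half + 1))
    cols = range(max(0, j - half), min(len(field[0]), j + half + 1))
    return [field[r][c] for r in rows for c in cols]


def evaluate(field, i, j, half, obs):
    """Return (paired value, equivalent-area mean, closest match to obs in the area)."""
    vals = window(field, i, j, half)
    paired = field[i][j]
    area_mean = sum(vals) / len(vals)
    best = min(vals, key=lambda v: abs(v - obs))
    return paired, area_mean, best


obs = 38.0  # hypothetical observed plume concentration at the station

# Hypothetical 1 km field: background of 2 with a narrow plume maximum off-station
fine = [[2.0] * 7 for _ in range(7)]
fine[1][5] = 40.0

# Hypothetical 2.5 km field: the same plume diluted into a larger cell
coarse = [[2.0, 2.0, 8.0], [2.0, 2.0, 2.0], [2.0, 2.0, 2.0]]

paired_f, mean_f, best_f = evaluate(fine, 3, 3, 3, obs)
paired_c, mean_c, best_c = evaluate(coarse, 1, 1, 1, obs)
print(paired_f, round(mean_f, 2), best_f)
print(paired_c, round(mean_c, 2), best_c)
```

In this constructed case the paired values and the area means are nearly indistinguishable between the two grids, and only the closest-match search reveals that the smaller grid-cell size retains a plume maximum close to the observed value.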

We examine the impact of grid-cell size on model performance in a region of intense petrochemical extraction and upgrading, the Athabasca oil sands region (AOSR). The AOSR refers to the northernmost of three large bitumen deposits located in the northern part of the province of Alberta in Canada: the Athabasca, Peace River, and Cold Lake areas. Together these areas cover 142 200 km² in total and constitute the third largest oil reserves in the world (Government of Alberta, 2016), as shown in Fig. 1. The oil sands sector is the second largest source of SO₂ and the third largest source of industrial NO_x in the province of Alberta. This sector is also a significant source of industrial PM, CO, and volatile organic compound (VOC) emissions (Zhang et al., 2018), from a variety of source types and industrial processes (e.g. open pit mine tailings ponds, large diesel fleets, bitumen upgrading facilities). As is described below, very high resolution emissions data are available for these sources, and emissions take place in a region with significant topography, hence the region provides a good test case for the relative impact of grid-cell size on air-quality model prediction results.

Next we describe our model, the simulation domains and forecasting setup, the emissions data, our evaluation methodology, and the results of our analysis.

https://www.atmos-chem-phys.net/19/4393/2019/acp-19-4393-2019-f01

Figure 1. Map showing the oil sands regions (based on an image from Government of Alberta, 2016).

2 Methodology

2.1 GEM-MACH

The air-quality model used in this work is Environment and Climate Change Canada's (ECCC) Global Environmental Multiscale – Modelling Air-quality and Chemistry (GEM-MACH) model, which has been in use as Canada's operational air-quality forecast model since 2009 (Moran et al., 2010). GEM-MACH is an on-line model; that is, both meteorological and chemistry processes are handled within a single model. The chemical processes reside within the physics module of the Global Environmental Multiscale meteorological forecast model (Côté et al., 1998a, b), originate with Environment Canada's earlier off-line model (A Unified Regional Air-quality Modelling System, AURAMS; Gong et al., 2006), and include process representation for particle microphysics (Gong et al., 2003a, b), inorganic heterogeneous chemistry (Makar et al., 2003), aqueous phase chemistry, in-cloud and below-cloud scavenging (Gong et al., 2006), and secondary organic aerosol formation (Stroud et al., 2011). GEM-MACH employs a sectional approach to represent the size distribution of atmospheric particles, with 12-bin (Makar et al., 2015a, b; Gong et al., 2015) or 2-bin configurations (Moran et al., 2010). The latter configuration is designed for maximum computational efficiency, with re-binning to the 12-bin distribution for key particle microphysics processes, in order to improve accuracy. Here, the 2-bin version of the model has been used, the main focus of the work being the impact of horizontal grid-cell size on model results. Eight aerosol chemical components are resolved in GEM-MACH (sulfate, nitrate, ammonium, elemental carbon, primary organic aerosol, secondary organic aerosol, sea salt, and crustal material). In the present study, we make use of GEM-MACH v.1.5.1, described in more detail in Makar et al. (2015a, b), employing 80 levels in a hybrid vertical coordinate system extending up to 0.1 hPa (∼30 km). 
Both model grid-cell size simulations compared here (2.5 and 1 km grid-cell sizes; see below) make use of the Milbrandt–Yau double-moment explicit microphysics scheme; that is, cloud processes are resolved explicitly at these scales (Milbrandt and Yau, 2005a, b).

2.2 Model setup

2.2.1 Grid nesting

Four levels of nesting have been employed in our simulations, shown in Fig. 2a. This version of GEM-MACH operates on a rotated latitude–longitude coordinate system wherein the position of the coordinate system poles is set by the user, allowing rotations of the grid with decreasing grid-cell size during nesting. The outermost nested grid corresponds to the westernmost two-thirds of the operational GEM-MACH forecasting domain, with a 10 km grid-cell size, and employs a combination of the Kain–Fritsch sub-gridscale convective cloud scheme (Kain and Fritsch, 1990; Kain, 2004) and a Sundqvist (1988) scheme for cloud parameterizations. Within that outer grid is nested a 10 km grid-cell size western Canada domain (yellow region, Fig. 2a) which has been rotated to match the horizontal orientation of the Rocky Mountains, and which makes use of a double-moment microphysics scheme (Milbrandt and Yau, 2005a, b) in place of the Sundqvist (1988) parameterization. The intention of this intermediate local 10 km simulation domain was to provide initial hydrometeors for the two innermost domains, to reduce the “spin-up” time required for the inner domains' meteorology to reach an equilibrium with respect to cloud formation. The latter two domains (2.5 and 1 km grid-cell sizes) resolve the cloud microphysics explicitly using the double-moment scheme alone and no convective parameterization (Milbrandt and Yau, 2005a, b). The third nested grid inset (green region, Fig. 2a) is the 2.5 km grid-cell size domain, which covers most of the Canadian provinces of Alberta and Saskatchewan. This grid will hereafter be referred to as the OS2.5km domain. The fourth and final nested grid (blue square, Fig. 2a) is a 1 km grid-cell size domain, roughly centered over and covering the immediate environs of the Athabasca oil sands, and is referred to hereafter as the OS1km model. 
This last nest also shows the region within which 22 instrumented aircraft flights were conducted during August and September of 2013, providing a unique measurement data set for our evaluation of the OS2.5 and OS1km model output for the same time period. Table 1 provides details on the horizontal dimensions of each of these nested domains and the duration of the simulations on each grid. All four model nests make use of the same vertical coordinate and levels. Figure 2b shows the topography of the 1 km domain in detail; the region to be modelled is situated in a broad river valley, with a local vertical relief of 750 m. Significant wind shears and frequent inversions are observed in the region, and part of our interest in 1 km grid-cell size simulations is to determine the extent to which these local features may influence model prediction accuracy.

2.2.2 Simulation cycling strategy

Model simulations mimic an operational forecasting system, starting from the use of archived, data-assimilated meteorological analyses as meteorological input and boundary conditions every 36 h. The use of analysis fields is a standard meteorological forecasting practice to prevent the chaotic drift of the model results from observed meteorology over time. The outermost 10 km domain uses initial and boundary conditions from the output of a meteorological simulation that is itself driven by an analysis field. The outermost domain model then carries out a 36 h forecast, of which the first 6 h is discarded as spin-up; the final 30 h is used as initial and boundary conditions for the rotated 10 km grid-cell size domain (the OS10 km domain). An OS10 km simulation of 30 h is then carried out, with the first 6 h being discarded as spin-up, and the latter 24 h forming the initial and boundary conditions for the 2.5 km grid-cell size OS2.5km simulation. The OS2.5km simulation is of 24 h duration. The OS1km simulation covers the same 24 h (and hence both 2.5 and 1 km simulations start from the same OS10 km initial conditions for every 24 h forecast), with the 2.5 km simulation providing boundary conditions thereafter to the OS1km model. Continuity between 24 h forecasts is thus maintained at the level of the outermost nest. The outermost domain is cycled every 12 h starting at 00:00 and 12:00 UT; however, we have selected the set of contiguous OS2.5 and OS1km 24 h simulations starting from the 12:00 UT continental domain for our comparison.
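
The timing of one forecast cycle described above can be sketched in Python as follows. This is a hedged illustration of the stated durations only: the function name, the dictionary keys, and the start dates are hypothetical, and the sketch assumes the 24 h inner-domain runs begin immediately after the two 6 h spin-up periods.

```python
from datetime import datetime, timedelta

SPINUP_H = 6  # hours discarded as spin-up at the start of each 10 km run


def cycle_windows(cont_start):
    """Map each nest to the (start, end) time of its simulation in one cycle."""
    def after(hours):
        return cont_start + timedelta(hours=hours)

    end = after(36)
    return {
        "CONT10km": (cont_start, end),          # 36 h continental forecast
        "OS10km": (after(SPINUP_H), end),       # 30 h run after 6 h is discarded
        "OS2.5km": (after(2 * SPINUP_H), end),  # 24 h run after a further 6 h
        "OS1km": (after(2 * SPINUP_H), end),    # same 24 h as the OS2.5km run
    }


# Two consecutive cycles starting from the 12:00 UT continental runs
# (dates chosen for illustration only)
first = cycle_windows(datetime(2013, 8, 13, 12))
second = cycle_windows(datetime(2013, 8, 14, 12))

for name, (start, stop) in first.items():
    print(f"{name}: {start:%Y-%m-%d %H:%M} to {stop:%Y-%m-%d %H:%M} UT")
```

With 12:00 UT starts, the inner-domain runs in this sketch span 00:00 to 00:00 UT, so that successive 24 h simulations are contiguous.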

Meteorological boundary conditions for the lowest-resolution GEM-MACH simulations are taken from operational GEM forecasts, in turn driven by data assimilation analyses performed at the Canadian Meteorological Centre.

https://www.atmos-chem-phys.net/19/4393/2019/acp-19-4393-2019-f02

Figure 2. (a) The four nested domains of the GEM-MACH simulations. From outermost to innermost domains, these are CONT10 km (outermost, red dots), OS10 km (yellow), OS2.5km (green), and OS1km (blue). The model simulations from the two innermost domains are the focus of the present study. (b) Topography in the OS1km domain centered on Fort McMurray, Alberta (m a.g.l.). The coloured area corresponds to the central blue domain in (a).

Table 1. Nested domain specifications.

* Note that both OS2.5 and OS1km output frequency was hourly.
