Articles | Volume 25, issue 9
https://doi.org/10.5194/acp-25-4639-2025
https://doi.org/10.5194/acp-25-4639-2025
Research article
 | 
05 May 2025
Research article |  | 05 May 2025

High-resolution air quality maps for Bucharest using a mixed-effects modeling framework

Camelia Talianu, Jeni Vasilescu, Doina Nicolae, Alexandru Ilie, Andrei Dandocsi, Anca Nemuc, and Livio Belegante
Abstract

High-resolution mapping of pollutants based on mobile observation facilitates deep understanding of air pollutant distributions within a city. This approach fosters science-based decisions to improve air quality, by adding to the existing but not optimally distributed permanent monitoring stations. In this study, we developed high-resolution concentration maps of nitrogen dioxide (NO2), particulate matter (PM10) and ultrafine particles (UFP) for Bucharest, Romania, to evaluate the spatial variation of pollutants across the city during the warm and cold seasons. Maps were generated using a mixed-effects method applied to a land-use regression (LUR) model. The approach relies on multiple land-use and traffic predictor variables and assimilation of data collected by mobile measurements over 30 d in the periods May–July 2022 and January–February 2023. Cross-validation was done against in situ data extracted from the same collection, while validation was organized by comparison with standard measurements at fixed reference sites. Our study shows that this combined method has a good performance for all pollutants (R2>0.65), the highest performance being observed for the cold season. PM10 concentration maps indicate multiple sources of particles during the warm season, the most important source being traffic. During the cold season, PM10 concentration maps show a more uniform distribution of sources in Bucharest. The city's principal roads, particularly the Bucharest ring road, are also highlighted in the NO2 maps, with a higher gradient during the warm period.

Share
1 Introduction

The atmosphere is an essential element for the environment and life-forms on Earth. Therefore, any change in natural composition of the atmosphere due to the presence of one or more pollutants in the atmosphere, such as gases or aerosols released directly into the atmosphere from natural or anthropogenic sources, can dramatically influence the Earth's climate and biosphere, human life and health, and economic activities (Nemuc et al.2013; Kokkalis et al.2016; Ilie et al.2023). Short-term exposure to very high concentrations can also be a significant risk factor to human health in addition to prolonged exposure at lower concentrations. One major concern nowadays is the air quality in urban areas due to its significant health risks determined by prolonged population exposure to gaseous pollutants like nitrogen dioxide (NO2), as well as particulate matter (PM2.5 and PM10) (Brunekreef and Holgate2002; Bernstein et al.2004; Almetwally et al.2020). Numerous epidemiological studies related short- and long-term PM10 and NO2 exposure with mortality and morbidity. Short-term exposure to high concentrations of pollutants can be related to both minor discomfort, such as irritation of the eyes, respiratory tract, or skin, and serious conditions, such as asthma, pneumonia, bronchitis, chronic obstructive pulmonary disease and heart problems (Rongqi Abbie et al.2022; Hasegawa et al.2023). Furthermore, years of continuous exposure to PM was shown to be associated with both newborn mortality and cardiovascular disorders. A PM2.5 concentration increase of 10 µg m−3 was associated with an increase of 0.67 %–1.04 % (Hamanaka and Mutlu2018) in all-cause mortality, 0.52 % in cardiovascular hospital admissions and 1.74 % increase in respiratory admissions (Hasegawa et al.2023). A PM10 concentration increase of 10 µg m−3 was associated with a 43 % increase in fatal coronary heart disease (Hamanaka and Mutlu2018) and 39.31 % increase in deaths from cardiovascular diseases from short-term exposure (Seihei et al.2024). A smaller impact is foreseen in the case of short-term exposure to NO2 concentration, where a 10 ppb increase in concentration was associated with a 0.19 % increase in all-cause mortality in the USA (Hamanaka and Mutlu2018). Despite its critical impact (West et al.2016), air pollution information in urban areas is not always available or not at an appropriate spatial resolution, hindering effective air quality management efforts. High-resolution air quality maps are pivotal for environmental stewardship and public awareness by filling the gaps in our understanding of urban air quality. These maps can help to identify pollution hotspots, offering new opportunities for pollution mitigation strategies and influencing both policy and individual behavior (Apte et al.2017; Schmitz et al.2019).

Mapping pollutant concentrations in urban areas requires fine-scale spatial interpolation of data collected at air quality monitoring stations, taking into account known emission sources and sinks to estimate the actual distribution of pollutants at ambient surface level. Moreover, changes in the composition of the atmosphere caused by urban agglomeration are highly variable in space and time, making their spatial variation difficult to assess with air quality monitoring instruments from ground-based networks or on board satellites (Hoek et al.2015). Fixed monitoring stations are suitable for recording the temporal variation of air pollution, including long-term trends, but unsuitable to capture the spatial variation of air pollution at the local level (Li et al.2019).

It has been demonstrated that gradients at the urban scale can be identified by mobile monitoring (Deshmukh et al.2020). High-resolution mapping of air quality can be done based on long-term averages of a significant number of repeated measurements (Upadhya et al.2024). However, mobile monitoring measurements to obtain reliable small-scale variations (for a street segment or a residential area) that are subsequently time-averaged to provide long-term concentrations is time-consuming, involving extended resources, due to the necessity to collect a large number of co-located data and at the same time to cover the whole relevant area (Yuan et al.2024). Several models have been developed to overcome the weakness of the limited availability of the observational data, collected either at fixed locations or during mobile campaigns or measured by instruments on board of satellites. Data from ground-based mobile and fixed stations as well as satellite data have been used in land-use regression (LUR) models (e.g., Apte et al.2017; Anand and Monks2017; Messier et al.2018; Shairsingh et al.2019; Kerckhoffs et al.2021, 2022a; Xu et al.2021b; Knibbs et al.2018; Lee2019) and dispersion models (e.g., Hamer et al.2020; Ramacher et al.2021; Snoun et al.2023). Both model types have emerged as very promising and efficient tools for high-resolution mapping of the changes in the composition of the atmosphere, as well as for quantifying the air quality by long-term averaging at a high spatial resolution.

The LUR model is more widely used in air quality studies compared to dispersion modeling because of the following: (a) it is a multivariate linear regression model built on significant covariates that can be further used to estimate pollutant concentration elsewhere; (b) linear regression is one of the most used fine-scale spatial interpolation methods because it is fast, easy to implement (Hoek et al.2008; Jerrett et al.2005), and does not require high computing power such as computational fluid dynamics based on large-eddy simulation or Reynolds-averaged Navier–Stokes approaches (Lin et al.2023, 2024); and (c) a LUR model does not require detailed information on atmospheric conditions or an emission inventory as input data. A LUR model usually requires measurement data and land-use predictor variables (e.g., CORINE dataset) (Kerckhoffs et al.2021, 2022b). Initially, LUR models were developed to estimate the concentration of air pollutants linked with traffic emissions, specifically NO2 and NOx (Briggs et al.1997; Stedman et al.1997; Hoek et al.2008; Eeftens et al.2012; Lu et al.2020; Zhang et al.2021). Lately, LUR models have been successfully expanded to include other air pollutants, such as particulate matter (PM) (Taheri Shahraiyni and Sodoudi2016; Karimi and Shokrinezhad2021; Zhao et al.2021; Wallek et al.2022), ozone (O3) (De Marco et al.2022; Wei et al.2022), carbon monoxide (CO) (Bi et al.2022) and sulfur dioxide (SO2) (Wu et al.2019; Mikeš et al.2023). LUR models can now estimate a wide range of air pollutants, including black carbon (BC) (Xu et al.2021a; Van den Bossche et al.2015), volatile organic compounds (VOCs) (Zapata-Marin et al.2022; Choi et al.2022) and ultrafine particles (UFP) (Ge et al.2022; Kerckhoffs et al.2021; Lloyd et al.2023; van Nunen et al.2020; Jones et al.2020; Saha et al.2019). LUR models can be used for both specific sites, such as highways (Lee et al.2013; Patton et al.2014) or neighborhoods (Lim et al.2019), and in detailed studies covering a wide range of land-use types across large city areas (Hatzopoulou et al.2017; Van den Hove et al.2020). Recent studies report on variations in pollutant levels across different times of the year, based on seasonal measurement campaigns (Xu et al.2021a; Miri et al.2019; Shi et al.2020).

In general, LUR models tend to smooth concentration levels over a wide area, leading to underestimation or overestimation of observed concentrations within each pixel. Therefore, one of the most feasible and robust approaches to map air quality at high resolution is to use the mixed-effects modeling framework that combines the advantages of measurement-only mapping and LUR modeling (Kerckhoffs et al.2022a, b). Mixed-effects modeling is mostly used in scenarios where data are hierarchical or clustered (e.g., Fokkema et al.2018; Seibold et al.2019). In air pollution research, mixed-effects models are powerful tools that can account for spatial or temporal clustering inherent in air quality data. They can accommodate factors like geographic regions or repeated measurements over time, providing a nuanced understanding of pollutant distribution. These models can be computationally complex, especially when dealing with large datasets or complicated random-effect structures (Kerckhoffs et al.2022a).

The mixed-effects model framework has been used in recent air quality studies for urban areas, like Amsterdam and Copenhagen (Kerckhoffs et al.2022b) or Oakland (USA) (Kerckhoffs et al.2024). Also, a similar mixed-effects approach has been used to estimate NO2 concentrations over Hong Kong SAR (Anand and Monks2017). Up to now, no high-resolution mapping of air pollutants at high spatial resolution has been performed for urban areas in Romania, although many cities face serious atmospheric pollution episodes (Marin et al.2019; Ilie et al.2023).

In this paper, we present the development and use of the mixed-effects modeling framework for high-resolution mapping of NO2, PM10, PM2.5 and UFP concentrations in Bucharest, the capital of Romania. Data from two mobile measurement campaigns, representative for warm and cold seasons, were combined with fine-scale land-use parameters to provide the spatiotemporal information necessary to predict seasonal surface concentrations. Results were validated against in situ measurements from the Magurele Center for Atmosphere and Radiation Studies (MARS) and eight fixed observation stations operated by the National Air Quality Monitoring Network (NAQMN) of Romania. A detailed description of the study area, measurements, and data treatment are given in Sect. 2, and the mixed-effects modeling framework tuning for Bucharest is given in Sect. 3, together with model performance evaluation and aggregated pollutant maps for the warm and cold seasons in Bucharest.

2 Materials and methods

2.1 Study area

Bucharest is the most populated urban area and the most important industrial and commercial center of Romania. According to the latest census, the population of Bucharest is approximately 2.1 million residents (INS2024), making it the sixth-largest city in the European Union by population. The city covers an area of about 240 km2 and has a dense urban structure. The land use of Bucharest is diverse (Fig. 1). The central and northern parts of the city are predominantly residential areas, characterized by a mix of old and new housing developments. Most of the production sectors (such as machinery, textile, chemical, and electronics industries, as well as business parks – all contributing significantly to the economic base of Bucharest) are located in the southern and western areas (Ilie et al.2023; Balaceanu et al.2018). The surroundings of Bucharest are mostly agricultural areas and rural/pre-urban residential areas.

https://acp.copernicus.org/articles/25/4639/2025/acp-25-4639-2025-f01

Figure 1Land-use distribution in Bucharest, according to CORINE, 2018. The black line overlapping the data represents the mobile measurement route, and the dashed rectangle represents the modeled area.

Located in the southeastern part of Romania, in the Romanian Plain, Bucharest has a humid continental climate, characterized by hot summers, cold winters and two short transitional seasons, spring and autumn. Due to atmospheric circulation patterns specific to the north midlatitude zone, episodes of long-range transport of aerosols from desert regions (Sahara, Arabian Peninsula and Persia) and from wildfires can affect Bucharest's air quality. However, the major pollution sources are local, influenced by the topography, the different local-scale wind regimes and anthropocentric activities (Fenger1999; Grønskei1998; Marmureanu et al.2017; Balaceanu et al.2018; Marin et al.2019; Ilie et al.2023).

2.2 Observational data

Data were collected during two intensive mobile measurement campaigns carried out in Bucharest during May–July 2022 and January–February 2023. For each campaign, at least 15 measurement routes of approximately 100 km long and around 8 h duration were carried out, under various meteorological conditions. In order to ensure consistent and quality data that highlight the variability of pollutants specific to warm or cold seasons, rainy and/or windy days were excluded from the measurement campaigns. Measurements were performed from Monday to Friday, from early morning to the afternoon, being therefore representative for daytime working times. The route comprised high-traffic streets; residential, industrial, and commercial districts; and suburban neighborhoods. The mobile measurement route is limited to the areas where the car had access, excluding some urban areas (e.g., parks, agricultural zones and water bodies). Portable instruments measuring UFP, particle matter fractions (PM1, PM2.5, PM10) and NO2 were employed in both campaigns, with a measurement rate of 1 s. An additional GPS system had been used to independently save the precise location. A Nafion dryer was also used during warm periods to reach a humidity below 40 % for UFP measurements.

The mobile data had been filtered before being ingested into the model. A moving-average filter with a 3-data-point window was used to remove data points with values exceeding 1.5 times the window mean (above or below).

2.3 High-resolution mapping model

We used the land-use regression and mixed-effects modeling framework to develop high-resolution air quality maps for Bucharest. In a LUR model, the concentration of a pollutant is expressed as a linear combination of variables that approximates the influence of different emission sources and sinks. Usually, LUR models are fixed-effects models because of the following: (a) they use predictor variables that are temporally invariant (e.g., classes from the land cover inventories, statistical values of population density or traffic intensity, and street networks) and, (b) they are applied to the average atmospheric state over the entire observation period. Therefore, LUR models based on fixed variables are not sensitive to unobserved heterogeneity arising from temporal variability in emissions and/or other environmental conditions.

The mixed-effects LUR model considered in this work is similar to the one developed in Kerckhoffs et al. (2022a). The daily mean concentration at a reference point i on day j (Yij) is assumed to be a linear function of the random effect (Aij) and the predictor variables (Xijk) computed at the same reference point:

(1) Y i j = α 0 + α 1 A i j + k 0 β k X i j k + i j .

In Eq. (1), α0 is the random intercept, while α1 is the random slopes of Aij, respectively. The βk represents the regression coefficients of predictor variables Xijk at reference point i and day j. The regression coefficients are the same for all measurement days. The error term of the model is represented by ij. Mixed-effects model results were averaged per point, similar to the average of the data-only approach. Mean NO2, PM10, PM2.5 and UFP concentrations from each reference point were used as the dependent variables Yij in Eq. (1).

The route taken during campaigns was divided into road segments with a length of approximately 250 m, equivalent to the distance traveled by a car with an average speed of 30 km h−1 in a time interval of 60 s. The reference points were considered at the midpoint of each road segment.

A temporal correction was applied on the data to synchronize the measurements. The correction factor is calculated for each point as the difference between the daily average values corresponding to the point and the whole-campaign average value corresponding to the same point. Values measured within a 250 m street segment were averaged for each day. Similar correction and averaging methods were reported in previous studies (e.g Kerckhoffs et al.2021, 2022a).

The performance of the model has been evaluated in three steps. First, a subset containing 15 % of the data collected through mobile measurements (and not used to tune the model) was used for cross-validation. This percentage represents the optimal value for which the models developed in this study can recognize the relationship between the attributes of the input data and the output variable with R2 score greater than 0.75. When selecting this percentage, providing as much quality data as possible (85 %) was considered an important factor in the learning process to increase the performance of the model, as well as to avoid data leakage between the learning process and cross-validation. Second, an independent set of data collected at fixed sites was used for validation. In addition to the MARS site, eight monitoring stations operated by NAQMN in Bucharest were selected for validation, based on data availability, representing all types of environments: two urban-type stations located in the east and northeast of the city, one suburban station located 4 km to the south of the capital city, three industrial-type stations situated in the southern half of the city, and two traffic-type stations located in the center of Bucharest. The evaluation involved direct comparison of statistical metrics (correlation, root-mean-square error, relative differences) between model outputs and pollutant direct measurements. The last step was the evaluation of the model performance to resolve different types of environment (traffic, urban, industrial). Details and results are provided in Sect. 3.1.

2.4 Tuning the mixed-effects land-use regression model for Bucharest

Based on the specificity of the climate in the region of Bucharest, we decided to use the residential areas (predictor variable) as time-dependent variables with major differences between the warm season and the cold season (Ilie et al.2023). Residential areas are considered sources of pollution due to household activities, heating being responsible for the major difference between the cold and the warm seasons. However, the time dependence of this variable is not sufficient to describe the day-to-day variability, because the residential heating is generally switched on in late autumn and off in late spring, with no real daily variability. To model NO2, PM10, PM2.5 and UFP concentrations over the Bucharest area, an additional variable was needed in the LUR models to cover the fast time dependencies (the so-called random effects). In this mixed-effects framework, the pollutant concentration can be expressed as a linear relationship between fixed variables and time-dependent variables, where the random effect is modeled by including a discrete dummy variable.

In our work, for each reference point, the random effect was modeled as the difference between the standard deviation calculated for the entire period of mobile measurements and the standard deviation calculated for each day of mobile measurements. Therefore, the magnitude and/or sign of the random effect were not the same over all reference points of measurements. The mixed-effects models were fitted to the observational dataset using the Python modules scipy, sklearn and statsmodels.

Other predictor variables used for the LUR models include vehicle traffic intensity (calculated separately for the warm and for the cold seasons) as well as aggregated values of spatial predictor variables calculated within circular buffers ranging from 25 m to 2 km in radius.

The mixed-effects LUR models (one model for each pollutant considered in this study) were adjusted and trained for Bucharest to obtain consistent datasets. For the training process, 85 % subsets of mobile measurements were randomly selected. The remaining 15 % of the mobile measurements were used to cross-validate the LUR models. By dividing the dataset used in the learning process into 85 % for training and 15 % for testing, it was followed, on the one hand, to increase the performance of the models developed in this study, and, on the other hand, the aim was to reduce the overfitting effect of the models by obtaining the smallest possible difference between the R2 score obtained during training and testing. The regression coefficients obtained as an output of the training were further used to generate high-resolution maps of seasonal concentrations of NO2, PM10, PM2.5 and UFP with a resolution of 100 m, over an area of approximately 240 km2 (the entire area of the city of Bucharest).

2.4.1 Spatial predictor variables

To define the optimal configurations of LUR architectures for Bucharest, for each 250 m street segment, spatial predictor variables were extracted using the following data sources: (i) CORINE for the land cover (European Environment Agency2018), (ii) Open Street Map (OSM) (OpenStreetMap2021) for the road network and (iii) the National Institute of Statistics (INS) of Romania for the population density.

There is no recent source quantifying the traffic intensity on road segments in Bucharest; therefore, the value for this predictor was estimated for each direction of the street segment i using the following relationship:

(2) Counts i = N L f c i v i ,

where N represents the total number of vehicles per day obtained from INS, L represents the total length of street segments (km), vi represents the speed on the street segment (km h−1) and fci represents cost function for the street segment (h) retrieved from the geofabrik.de database (Geofabrik2024).

In order to tune and train the mixed-effects models for the specifics of Bucharest, the spatial predictor variables were selected from a number of proxies that describe the possible sources and sinks of NO2, PM10, PM2.5, PM1 and UFP emissions. These variables are presented in Table 1. The column “Effect” represents the effect that the predictor variables have on the concentration of the pollutant in the atmosphere. Variables associated with emission sources have a positive effect, while variables associated with sinks, such as vegetation cover, have a negative effect. The circle radii most commonly used for buffering the variables describing the sources and sinks at a given location are given in column “Buffer sizes”.

Table 1Description of spatial predictor variables.

Download Print Version | Download XLSX

2.4.2 Predictor variable selection

For the selection of variables, we first removed proxies with a percentage of null values greater than 90 %. After applying this filter, proxies that describe the possible sources and sinks for LUR models tuned for Bucharest were associated with the following:

  • traffic, with predictor “COUNTS” (number of vehicles per day) and road length variables in buffers of 50 to 1000 m;

  • land use, with predictors industry, green, residential lower density (individual residential), residential higher density (collective residential), construction, and water in buffers of 100 to 2000 m;

  • population density in buffers of 100 to 2000 m.

To determine the optimal combination of predictor variables, the supervised forward stepwise regression approach proposed within the European Study of Cohorts for Air Pollution Effects (ESCAPE) project (ESCAPE2024) was used. The ESCAPE model starts from a constant value, and after that the predictor variables are added based on the goodness of fit given by the adjusted cross-correlation (R2) value. The direction of effect for all variables was kept as in Table 1. The variable with the highest adjusted R2 was included in Eq. (1) as Xk. The process of building the model stops when the new variables do not contribute significantly (more than 1 %) to the improvement of the adjusted R2 value. The LUR model configurations were generated using all possible combinations of generated predictor values. From the total of the models obtained, only the LUR models for which the adjusted R2 value was higher than 0.5 were selected.

In a second step, we calculated the confidence level (p value) and variance inflation factor (VIF) to identify that predictor's contribution to a collinearity problem. The statistically insignificant variables (p>0.05) and predictor variables where VIF > 5 sequentially were not used in the model.

The predictor variables and the size of the buffers used in this work are shown in Table 1. The predictors passing the above conditions are shown in bold. The sizes of the buffers are those established within the ESCAPE project for the development of LUR models. In the case where no buffer sizes were used, their value was noted with N/A. The “number of vehicles” was noted in the table with “veh”. These buffer sizes are used to determine the spatial proximity of different features by defining a distance zone around the features.

https://acp.copernicus.org/articles/25/4639/2025/acp-25-4639-2025-f02

Figure 2Relative difference between model-predicted values and mobile measurements of NO2 (a, b) and PM10 (c, d) during the warm (a, c) and cold (b, d) seasons.

3 Results and discussion

3.1 Evaluation of the model performances

For each individual pollutant, four configurations of LUR models were defined, out of those showing the adjusted R2 greater than 0.5. Furthermore, only the results and performances of the LUR models for which the highest adjusted R2 value was obtained are discussed.

The mixed-effects model tuned for Bucharest city has been evaluated using mobile measurements and fixed-site measurements. The performance assessment involved not only the averages at some points, but also the overall agreement on specific types of urban areas covering the entire pollutants concentration intervals. The performance of each model (one for each type of pollutant) was tested following three steps described in detail in Sect. 2.3. Moreover, in the first step, the 15 % kept for testing covers all possible situations regarding the spatial distribution of the predictor variables used in the model. Second, the average concentrations calculated by the model at the locations of the fixed observation sites have been validated by comparison with the seasonal-average concentrations measured at those sites. Also, model data clustered based on the type of environment (traffic, industrial, urban) have been compared to similarly clustered data collected at the fixed observation sites.

3.1.1 Cross-validation against mobile measurements

Cross-validation is performed for each model by comparing the model-predicted dataset against the observed data collected by mobile measurements. Agreement between the two datasets is quantified by calculating the adjusted R2 and root-mean-square error (RMSE). The RMSE of each model was calculated as the square root of the mean of the squared errors. We also present the relative differences between modeled and measured data as a general indicator of the model accuracy. Results show very good correlations (R2>0.91) for each model as a follow-up to the training process. The cross-validation shows higher correlations for the cold season (R2 = 0.81 for NO2 and R2 = 0.88 for PM10) than for the warm season (R2 = 0.59 for NO2 and R2 = 0.72 for PM10). The weaker performance of the models during the warm season can be explained by the large variability of NO2 and PM10 concentrations in the warm season compared to the cold season (Ilie et al.2023), which is not completely captured by our method. Moreover, this strong variability is also suggested by the RMSE values which for the NO2 are higher in the warm season (2.94 ppb) than in the cold season (2.64 ppb). In contrast, the RMSE values for PM10 are lower in the warm season (2.18 µg m−3) than in the cold season (3.06 µg m−3). Cross-validated scatterplots were also added in the Supplement.

The relative differences between mobile measurements and model retrievals, computed as “(Model  Observed)/Observed”, show the ability of the models to estimate PM10 and NO2 concentrations (Fig. 2). It can be seen that the model for NO2 tends to overestimate the predicted concentrations in the warm season, especially in urban agglomeration areas (upper left panel), while the model for PM10 tends to underestimate the predicted concentrations (lower left panel). However, such differences are acceptable considering the assumptions made, the uncertainty of the data used for training the models and the general performances reported in the literature (Ma et al.2024).

3.1.2 Validation against independent measurements at fixed observation sites

The model output was evaluated against the seasonal average of the hourly values of NO2 and PM10 measured at nine long-term in situ stations, out of which eight stations were operated by NAQMN. These sites are considered representative for urban, industrial and suburban areas as pictured in Fig. 3. The model could not be evaluated in the case of UFP, due to unavailability of the data at all fixed stations. More information about what type of variables are measured by NAQMN are given in  Ilie et al. (2023). A detailed description of the MARS site is given in Pîrloagă et al. (2023), where continuous PM concentrations are performed using optical particle counters (Mărmureanu et al.2019) and gas analyzers  (Castell et al.2018). Statistical metrics, like R2, RMSE and relative differences, were calculated similarly as for the cross-validation. Statistical parameters are summarized in Table 2.

In Fig. 3, the mean mass concentrations measured at the fixed observation sites are represented by the diameter of the circle, and the type of environment is represented by the color of the circle. The missing data in Fig. 3 are due to the fact that no measurements were available for these periods, due to various non-scientific reasons (technical problems with the measurement equipment, labor problems, etc.) It can be seen that the variation of the concentrations across the city is relatively low, especially for particulate matter during the cold season. PM10 concentrations range from 21 to 32 µg m−3 in both seasons. The highest PM10 concentrations are observed at the urban, suburban and traffic stations, while the lowest PM10 concentration is measured at the industrial stations. The western part of the city shows highest PM10 values. NO2 concentrations range from 8 to 20 ppb in the warm season and from 9 to 24 ppb in the cold season. The highest NO2 concentrations correspond to the areas with intense traffic and industrial activities, while the lowest concentrations are observed in the suburban areas, which are less impacted by traffic.

.

https://acp.copernicus.org/articles/25/4639/2025/acp-25-4639-2025-f03

Figure 3Average concentrations of NO2 (a, b, units ppb) and PM10 (c, d, units µg m−3) during warm (a, c) and cold (b, d) seasons at fixed sites: the diameter of the circle represents the mean mass concentrations measured at the site, and the color of the circle represents the type of environment at the site

The comparison between model-predicted values and the observed values at the nine fixed sites is presented in Table 2. Overall, the model performed well, even if the NO2 values tend to be slightly overestimated, and PM10 tends to be slightly underestimated when compared with measured mean concentrations. Mean values of modeled NO2 are within the range of the observed values, as indicated by the standard deviation. The differences can be explained by the local topography and the specifics of the land use. The road system in Bucharest is very dense, so the distance from a street to residential or industrial sectors is often very short, sometimes less than a few meters; therefore, the NO2 100×100 m grid resolution cannot always resolve the variations. This smoothing effect caused by insufficient spatial resolution of the model is pinpointed by the lower values of the standard deviations returned by the model in comparison with those returned by observations.

The trends and seasonal differences of both NO2 and PM10 are resolved well by the model, as shown by the comparison with observations. The R2 correlation between observed mean concentrations and modeled mean concentration is above 0.65 for all pollutants for the warm season and 0.75 for the cold season. These values can be attributed to the good performance of the model, in accordance with the values reported for other cities (Yuan et al.2023). The lowest correlation value is noted for particulate matter during the warm season. This result can be explained by the fact that, during the warm season, photochemical processes are intensified at the street level, and the model cannot capture the effects properly.

The RMSE is another important parameter for assessing the performance of the model, accounting for the level of absolute error. As in the case of R2, we noticed a better model performance for the cold season. The difficulties of the model to capture the small variations during the warm period for both NO2 and PM10 are depicted by higher RMSE values. High values of RMSE correspond to low R2 values, demonstrating that the model cannot fully capture the variations of NO2 or PM10 concentrations.

Table 2Comparison between model output and measured values at fixed sites for NO2 and PM10 in Bucharest (Romania) during the warm period of 2022 and the cold period of 2023, as well as statistical metrics.

Download Print Version | Download XLSX

https://acp.copernicus.org/articles/25/4639/2025/acp-25-4639-2025-f04

Figure 4Model (light color) versus measurement (dark color) mean concentrations of NO2 (a) and PM10 (b) during the warm (red) and cold (blue) seasons, along with the relative difference between the cold and warm seasons from model (grey diamond mark) or measurements (black diamond mark); symbols show the relative difference between model and measurements for the warm (purple circle mark) and cold (dark purple circle mark) seasons.

Download

3.1.3 Evaluation of the model performance to resolve different types of environment

The model was tested for its robustness in capturing differences between various types of environment such as urban (including pre-urban), industrial and traffic. Long-term datasets collected at the nine fixed observation sites were used for this purpose. Figure 4 shows mean concentrations of NO2 and PM10 as retrieved by the model and measured at the stations, clustered by the type of environment and separated for the warm and cold seasons. The relative differences between values modeled and measured in different environments are also highlighted.

NO2 is the most variable species, with high differences between seasons and between environment types. Lower NO2 concentrations are depicted for the urban group, whereas the highest NO2 concentration is associated with traffic, as anticipated. The traffic group has the lowest variability among seasons and is also better captured by the model. The model shows increased NO2 concentrations during the warm season for urban and industrial categories, while in the case of traffic areas the model underestimates a bit the measurement averages. The overall relative differences between the cold and warm seasons for both model and observational data inside each defined environment are less than 35 %. Higher relative differences between modeled and measured data are observed for the warm season for all areas, with the lowest difference in the case of traffic areas (around 10 %). Overall lower relative differences between modeled and measured data are observed during the cold season in comparison with the warm period, with NO2 average concentration in industrial and traffic areas underestimated by the model, highlighted by relative differences up to −20 %.

PM10 concentrations show an overall lower seasonal variability and also lower differences between model and observed data, with relative differences less than 20 %. Particulate matter concentrations for urban, industrial and traffic areas varied slightly among seasons. Modeled PM10 concentrations show higher values for industrial environment in comparison with observational data for both seasons. Moreover, industrial areas present the lowest PM10 concentrations during the cold season, as shown by both modeled and measured data. Urban and traffic environment PM10 concentrations are slightly underestimated by the model. The overall relative differences of PM10 concentration between cold and warm seasons for both model and observational data inside each defined environment is less than 15 %. Small relative differences between modeled and measured data are observed for PM10 concentrations, during both seasons. PM10 average concentration in urban and traffic areas are slightly underestimated by the model, highlighted by relative differences up to −10 %, which can also be influenced by the low number of available fixed stations in each environment.

https://acp.copernicus.org/articles/25/4639/2025/acp-25-4639-2025-f05

Figure 5Near-surface concentration maps for Bucharest, resulting from the model for NO2, PM10 and UFP during warm (a, c, e) and cold (b, d, f) seasons.

3.2 Mapping atmospheric pollution in Bucharest

The validated model was further used to produce NO2, PM10 and UFP concentration maps for Bucharest, representative of the warm and cold seasons (Fig. 5). The results are valid for daytime and working days of the week. It should also be taken into account that, in the absence of spatiotemporal emission inventories with a high spatial resolution and traffic data, modeled data were used. In analyzing these results, it must be noted that the near-surface concentrations of atmospheric pollutants are influenced not only by the emissions, but also by the height of the planetary boundary layer, transport from other regions, dry and wet deposition, and chemical processes, all in relation to relatively fast changing meteorological conditions (e.g., air temperature, wind field). Model results show that (overall and regardless of the season) NO2, PM10 and UFP concentrations are higher on the main road sections, with higher values in Bucharest's western area.

https://acp.copernicus.org/articles/25/4639/2025/acp-25-4639-2025-f06

Figure 6Model maps of PM2.5/ PM10 ratio during the warm and cold seasons.

NO2 concentration maps show that this pollutant is highly related to traffic, the road network of Bucharest being clearly visible both during the warm and during the cold season. The main roadways, especially the Bucharest Ring ring road, are depicted as the primary NO2 source. Also featured are the city's central routes, where traffic remains heavy throughout the day and seasons. The highest NO2 concentrations is noted around busy highways due to the presence of a large number of NO-emitting automobiles. Sinks related to the green areas and water bodies are identified in dark green colors. Overall, NO2 concentration is higher during the warm period, when concentrations are higher on key roadways (35.79 ± 8.38 ppb) and other sources in the city add up. The conversion of NO to NO2 in the presence of sunlight and ozone is significant. During cold months, the NO2 concentration is lower in absolute values across the city; however, the main roads are still depicted as major sources, followed by several industrial areas such as power plants serving the centralized heating of the city. The distribution of the NO2 concentration on all street segments is almost uniform during cold season, with a slight increase for the main street segments, including the Bucharest ring road (20.67 ± 3.44 ppb). At the level of the city of Bucharest, the average value of the NO2 concentration as estimated by the model for the cold season is 16.66 ± 4.04 ppb, while for the warm season it is 18.75 ± 1.98 ppb.

The spatial variation of PM in the city area is substantial, with an abundance of small particles and a high mass concentration of larger particles in densely populated residential areas. Significant concentrations of particles have been identified, mostly in industrial areas and anthropogenic agglomerations, but also along certain major transportation routes. During the cold periods, the PM fractions (all sizes) have larger loading and lower gradients, as reported also for other cities (Ndiaye et al.2024). This is related to increased emissions from residential heating. A gradient of PM10 concentrations is evidenced within the city, with higher loading in the western and southern areas. Average PM10 concentration in Bucharest during cold season is 1.2 times higher than in the warm season. During the warm periods, the PM10 clusters are localized around source areas, while during cold periods the source distributions are more homogeneous.

The average UFP number concentration throughout the mobile route exhibits an important spatial gradient, particularly during the warm season, with variations up to a factor of 2 in the mean, highlighting extensive human exposure to ultrafine particles. The UFP number concentrations are elevated on the main roads, as well as on some areas related to industrial activities in southern, northern and western city regions, mostly during the warm season. A more uniform distribution of UFP mean number concentration is observed during the cold season, when the house heating emissions add up to the traffic. Traffic sources have less impact during the cold season, as chemical processes diminish due to limited sunlight. House heating sources, more evenly spatially distributed than the roads, generate a more homogeneous distribution but also larger absolute values of UFP. Average seasonal concentration for Bucharest city during the cold season (29132 ± 4362 particles cm−3) is 1.4 times higher than in the case of the warm season (21469 ± 3528 particles cm−3).

Since measurements of PM2.5 were only available at a few fixed stations, we included the modeled ratio of PM2.5 to PM10 as the result to show what we expect for the fraction of fine particles from the model. The model shows the fine particle fraction (PM2.5/ PM10) to be larger during the cold periods compared to warm periods, with fine particles accounting for up to 95 % of the PM10 concentration (Fig. 6). This is explained by the fact that household activities generate predominantly small particles, and higher percentages are seen in the peri-urban regions (outside of Bucharest) where the house-heating sources are contributing more (lighter color of purple, Fig. 6 right panel) to PM2.5 concentrations. During warm periods, the fine particle fraction is approximately 50 % within the city and less than 40 % in the villages close to Bucharest, where the agricultural activities increase the PM10 fraction (Fig. 6 left panel). The main rivers and lakes within Bucharest’s perimeter are clearly sinks for small particles, producing lower fine mode fractions in both seasons.

4 Conclusions

The regression-based methods fed by mobile data can predict NO2, PM10 and UFP concentrations for regions which are not properly covered by observations. In order to do this, the right combination of data sampling frequency, duration and route, as well as the correct number and type of predictor factors (corresponding to the surrounding environment), must be considered. Mobile monitoring together with modeling tools can therefore compensate for spatial and temporal data gaps which are collected by the monitoring stations and can assist individuals and policymakers in identifying regions and causes of poor air quality. In this study, we demonstrate the effectiveness of combining mobile measurements with mixed-effects LUR models to derive seasonal maps of near-surface PM10, NO2 and UFP. The study shows first-time validated results for Bucharest, the capital city of Romania, which is characterized by a large, densely populated surface with a very dense and heavily used street network.

Despite the limited number of fixed stations available for this work (8 + MARS), the tuned mixed-effects LUR model proved to be robust and accurate in producing high-resolution mapping of NO2, PM10 and UFP for the warm and cold seasons. Overall, good model performance was observed for both seasons and all concentrations, similar to other studies. The slightly higher mean squared error values coupled with smaller cross-validation R2 values obtained for the warm season suggest that the mobile campaign data collected for this study did not capture all the important NO2 and PM10 concentration variations. Even if the route selected for the two mobile measurement campaigns included all urban structures, the limitation of car access remains a source of error, which can lead to an underestimation of the concentration of pollutants. The performance of this model can be greatly improved by the involvement of citizens (pedestrian or by bicycle) to collect data from areas where cars are not allowed. Datasets systematically collected by citizens during daily (repeated) activities or walks could provide improved estimates of spatial variability for these areas (Snyder et al.2013; Hankey and Marshall2015; Van den Bossche et al.2016, 2018). The citizen involvement could increase the pollutant data collection in areas restricted for cars or bicycles and could enable the possibility to study the sinks on green or water areas, but these methods would be only based on low-cost sensors. Further improvements could also be made by the inclusion of spatiotemporal emission data with a high spatial resolution, such as traffic volumes or emission inventories.

The results provided by the model show that high concentrations of particulate matter during the cold season are representative for Bucharest city, due to the added effect of house heating (either dispersed in residential areas or localized at the city's power plants). Fine particles dominate during the cold season, although they remain at high levels during the warm season as well. NO2 is less challenging but still an important factor, especially during the warm season and along the main roads. The seasonal high-resolution air quality maps for Bucharest based on mixed-effects modeling pinpointed pollutant variability mostly during the warm season and higher concentrations and fine particle ratios during the cold season. Water and vegetation areas are evidenced as effective sinks for NO2 and fine particles, while traffic and residential heating are evidenced as effective sources in Bucharest. Based on these findings, a more extended certified air quality station network would be beneficial for human-health-related pollutant monitoring, as well as inclusion of fine particle measurements among these.

The approach presented in this paper can be adjusted for high-resolution mapping of NO2, PM and UFP in other cities as well, using the series of predictor variables identified in this study as necessary. This is feasible as long as the urban structures are characterized well and there is a fairly dense and diverse network of in situ monitoring stations, whose observational data can be used for model calibration and validation.

Code availability

Codes developed for this study are available from the main author (Camelia Talianu, camelia@inoe.ro) upon request.

Data availability

Data used in this study are available from the main author (Camelia Talianu, camelia@inoe.ro) upon request.

Supplement

The supplement related to this article is available online at https://doi.org/10.5194/acp-25-4639-2025-supplement.

Author contributions

CT: Conceptualization, Methodology, Formal analysis, Software, Writing (original draft), Writing (review and editing). JV: Conceptualization, Formal Analysis, Methodology, Resources, Supervision, Writing (original draft), Writing (review and editing). DN: Conceptualization, Funding acquisition, Project administration, Resources, Supervision, Writing (original draft), Writing (review and editing). AI: Data curation, Investigation, Visualization, Writing (original draft). AD: Formal analysis, Investigation, Writing (original draft). AN: Writing (review and editing). LB: Data curation, Investigation, Formal analysis.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher’s note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.

Special issue statement

This article is part of the special issue “Air quality research at street level – Part II (ACP/GMD inter-journal SI)”. It is not associated with a conference.

Acknowledgements

We thank INCAS and INOESY for making the mobile instruments available. Also, we thank BOKU-Met, Vienna, Austria, for facilitating access to the supercomputer from the IT infrastructure used for the LUR models. This publication has been prepared using the European Union’s Copernicus Land Monitoring Service. We gratefully acknowledge the National Air Quality Monitoring Network, part of the Romanian Ministry of Environment, and the Romanian Institute of Statistics for providing invaluable data essential to this research. We appreciate the anonymous reviewers and editors for their time and effort to improve the paper.

Financial support

This work was carried out through RI-URBANS project (Research Infrastructures Services Reinforcing Air Quality Monitoring Capacities in European Urban & Industrial Areas, European Union's Horizon 2020 research and innovation program under grant agreement, grant agreement number 101036245), ATMO-ACCESS project (European Commission under the EU Horizon 2020 – Research and Innovation Framework Programme, H2020-INFRAIA-2020-1, grant agreement number: 10100800) and the Core Program within the National Research Development and Innovation Plan 2022–2027, carried out with the support of MCID, project no. PN 23 05.

Review statement

This paper was edited by Stelios Kazadzis and reviewed by five anonymous referees.

References

Almetwally, A., Bin-Jumah, M., and Allam, A.: Ambient air pollution and its influence on human health and welfare: an overview, Environ. Sci. Pollut. Res., 27, 24815–24830, https://doi.org/10.1007/s11356-020-09042-2, 2020. a

Anand, J. S. and Monks, P. S.: Estimating daily surface NO2 concentrations from satellite data – a case study over Hong Kong using land use regression models, Atmos. Chem. Phys., 17, 8211–8230, https://doi.org/10.5194/acp-17-8211-2017, 2017. a, b

Apte, J. S., Messier, K. P., Gani, S., Brauer, M., Kirchstetter, T. W., Lunden, M. M., Marshall, J. D., Portier, C. J., Vermeulen, R. C., and Hamburg, S. P.: High-resolution air pollution mapping with Google street view cars: exploiting big data, Environ. Sci. Technol., 51, 6999–7008, https://doi.org/10.1021/acs.est.7b00891, 2017. a, b

Balaceanu, C., Mitiu, M., Marcus, I., and Mincu, M.: Pollution Impact of the Grozavesti Thermoelectric Power Plant on the Urban Agglomeration Bucharest, Rev. Chim.-Bucharest, 69, 350–353, https://doi.org/10.37358/RC.18.2.6105, 2018. a, b

Bernstein, J. A., Alexis, N., Barnes, C., Bernstein, I. L., Nel, A., Peden, D., Diaz-Sanchez, D., Tarlo, S. M., Williams, P. B., and Bernstein, J. A.: Health effects of air pollution, J. Allergy Clin. Immun., 114, 1116–1123, https://doi.org/10.1016/j.jaci.2004.08.030, 2004. a

Bi, J., Zuidema, C., Clausen, D., Kirwa, K., Young, M., Gassett, A., Seto, E., Sampson, P., Larson, T., Szpiro, A., Sheppard, L., and Kaufman, J.: Within-City Variation in Ambient Carbon Monoxide Concentrations: Leveraging Low-Cost Monitors in a Spatiotemporal Modeling Framework, Environ. Health Persp., 130, 97008, https://doi.org/10.1289/EHP10889, 2022. a

Briggs, D. J., Collins, S., Elliott, P., Fischer, P., Kingham, S., Lebret, E., Pryl, K., Reeuwijk, H. V., Smallbone, K., and Veen, A. V. D.: Mapping urban air pollution using GIS: a regression-based approach, Int. J. Geogr. Info. Sci., 11, 699–718, https://doi.org/10.1080/136588197242158, 1997. a

Brunekreef, B. and Holgate, S.: Air Pollution and Health, Lancet, 360, 1233–1242, https://doi.org/10.1016/S0140-6736(02)11274-8, 2002. a

Castell, N., Schneider, P., Grossberndt, S., Fredriksen, M. F., Sousa-Santos, G., Vogt, M., and Bartonova, A.: Localized real-time information on outdoor air quality at kindergartens in Oslo, Norway using low-cost sensor nodes, Environ. Res., 165, 410–419, https://doi.org/10.1016/j.envres.2017.10.019, 2018. a

Choi, J., Henze, D. K., Cao, H., Nowlan, C. R., González Abad, G., Kwon, H.-A., Lee, H.-M., Oak, Y. J., Park, R. J., Bates, K. H., Maasakkers, J. D., Wisthaler, A., and Weinheimer, A. J.: An Inversion Framework for Optimizing Non-Methane VOC Emissions Using Remote Sensing and Airborne Observations in Northeast Asia During the KORUS-AQ Field Campaign, J. Geophys. Res.-Atmos., 127, e2021JD035844, https://doi.org/10.1029/2021JD035844, 2022. a

De Marco, A., Garcia-Gomez, H., Collalti, A., Khaniabadi, Y. O., Feng, Z., Proietti, C., Sicard, P., Vitale, M., Anav, A., and Paoletti, E.: Ozone modelling and mapping for risk assessment: An overview of different approaches for human and ecosystems health, Environ. Res., 211, 113048, https://doi.org/10.1016/j.envres.2022.113048, 2022. a

Deshmukh, P., Kimbrough, S., Krabbe, S., Logan, R., Isakov, V., and Baldauf, R.: Identifying air pollution source impacts in urban communities using mobile monitoring, Sci. Total Environ., 715, 136979, https://doi.org/10.1016/j.scitotenv.2020.136979, 2020. a

Eeftens, M., Tsai, M.-Y., Ampe, C., Anwander, B., Beelen, R., Bellander, T., Cesaroni, G., Cirach, M., Cyrys, J., de Hoogh, K., Nazelle, A. D., de Vocht, F., Declercq, C., Dėdelė, A., Eriksen, K., Galassi, C., Gražulevičienė, R., Grivas, G., Heinrich, J., Hoffmann, B., Iakovides, M., Ineichen, A., Katsouyanni, K., Korek, M., Krämer, U., Kuhlbusch, T., Lanki, T., Madsen, C., Meliefste, K., Mölter, A., Mosler, G., Nieuwenhuijsen, M., Oldenwening, M., Pennanen, A., Probst-Hensch, N., Quass, U., Raaschou-Nielsen, O., Ranzi, A., Stephanou, E., Sugiri, D., Udvardy, O., Éva Vaskövi, Weinmayr, G., Brunekreef, B., and Hoek, G.: Spatial variation of PM2.5, PM10, PM2.5 absorbance and PM coarse concentrations between and within 20 European study areas and the relationship with NO2 – Results of the ESCAPE project, Atmos. Environ., 62, 303–317, https://doi.org/10.1016/j.atmosenv.2012.08.038, 2012. a

ESCAPE: ESCAPE – Enhancing Sales Capacity for Agrifood Products in Europe,
urlhttp://escapeproject.eu/ (last access: 11 July 2024), 2024. a

European Environment Agency: CORINE Land Cover (CLC) 2018 dataset, Data file, https://www.eea.europa.eu/publications/COR0-landcover (last access: 13 August 2024), 2018. a

Fenger, J.: Urban air quality, Atmospheric environment, 33, 4877–4900, 1999. a

Fokkema, M., Smits, N., Zeileis, A., Hothorn, T., and Kelderman, H.: Detecting treatment-subgroup interactions in clustered data with generalized linear mixed-effects model trees, Behav. Res. Methods, 50, 2016–2034, https://doi.org/10.3758/s13428-017-0971-x, 2018. a

Ge, Y., Fu, Q., Yi, M., Chao, Y., Lei, X., Xu, X., Yang, Z., Hu, J., Kan, H., and Cai, J.: High spatial resolution land-use regression model for urban ultrafine particle exposure assessment in Shanghai, China, Sci. Total Environ., 816, 151633, https://doi.org/10.1016/j.scitotenv.2021.151633, 2022. a

Geofabrik: Geofabrik GmbH, https://download.geofabrik.de/ (last access: 11 July 2024), 2024. a

Grønskei, K. E.: Europe and its cities, in: Urban Air Pollution – European Aspects, 21–32 pp., Springer, https://doi.org/10.1007/978-94-015-9080-8_3, 1998. a

Hamanaka, R. B. and Mutlu, G. M.: Particulate Matter Air Pollution: Effects on the Cardiovascular System, Front. Endocrinol., 9, 680, https://doi.org/10.3389/fendo.2018.00680, 2018. a, b, c

Hamer, P. D., Walker, S.-E., Sousa-Santos, G., Vogt, M., Vo-Thanh, D., Lopez-Aparicio, S., Schneider, P., Ramacher, M. O. P., and Karl, M.: The urban dispersion model EPISODE v10.0 – Part 1: An Eulerian and sub-grid-scale air quality model and its application in Nordic winter conditions, Geosci. Model Dev., 13, 4323–4353, https://doi.org/10.5194/gmd-13-4323-2020, 2020. a

Hankey, S. and Marshall, J. D.: Land use regression models of on-road particulate air pollution (particle number, black carbon, PM2. 5, particle size) using mobile monitoring, Environmental science & technology, 49, 9194–9202, https://doi.org/10.1021/acs.est.5b01209, 2015. a

Hasegawa, K., Tsukahara, T., and Nomiyama, T.: Short-term associations of low-level fine particulate matter (PM2.5) with cardiorespiratory hospitalizations in 139 Japanese cities, Ecotoxicol. Environ. Safe., 258, 114961, https://doi.org/10.1016/j.ecoenv.2023.114961, 2023. a, b

Hatzopoulou, M., Valois, M.-F., Levy, I., Mihele, C., Lu, G., Bagg, S., Minet, L., and Brook, J.: Robustness of Land-Use Regression Models Developed from Mobile Air Pollutant Measurements, Environ. Sci. Technol., 51, 3938–3947, https://doi.org/10.1021/acs.est.7b00366, 2017. a

Hoek, G., Beelen, R., de Hoogh, K., Vienneau, D., Gulliver, J., Fischer, P., and Briggs, D.: A review of land-use regression models to assess spatial variation of outdoor air pollution, Atmos. Environ., 42, 7561–7578, https://doi.org/10.1016/j.atmosenv.2008.05.057, 2008. a, b

Hoek, G., Eeftens, M., Beelen, R., Fischer, P., Brunekreef, B., Boersma, K., and Veefkind, P.: Satellite NO2 data improve national land use regression models for ambient NO2 in a small densely populated country, Atmos. Environ., 105, 173–180, https://doi.org/10.1016/j.atmosenv.2015.01.053, 2015. a

Ilie, A., Vasilescu, J., Talianu, C., Ioja, C., and Nemuc, A.: Spatiotemporal Variability of Urban Air Pollution in Bucharest City, Atmosphere, 14, 1759, https://doi.org/10.3390/atmos14121759, 2023. a, b, c, d, e, f, g

INS: National Institute of Statistics of Romania, https://insse.ro/cms/en (last access: 11 July 2024), 2024. a

Jerrett, M., Arain, A., Kanaroglou, P., Beckerman, B., Potoglou, D., Sahsuvaroglu, T., Morrison, J., and Giovis, C.: A review and evaluation of intraurban air pollution exposure models, J. Expo. Sci. Env. Epid., 15, 185–204, https://doi.org/10.1038/sj.jea.7500388, 2005. a

Jones, R. R., Hoek, G., Fisher, J. A., Hasheminassab, S., Wang, D., Ward, M. H., Sioutas, C., Vermeulen, R., and Silverman, D. T.: Land use regression models for ultrafine particles, fine particles, and black carbon in Southern California, Sci. Total Environ., 699, 134234, https://doi.org/10.1016/j.scitotenv.2019.134234, 2020. a

Karimi, B. and Shokrinezhad, B.: Spatial variation of ambient PM2.5 and PM10 in the industrial city of Arak, Iran: A land-use regression, Atmos. Pollut. Res., 12, 101235, https://doi.org/10.1016/j.apr.2021.101235, 2021. a

Kerckhoffs, J., Hoek, G., Gehring, U., and Vermeulen, R.: Modelling nationwide spatial variation of ultrafine particles based on mobile monitoring, Environ. Int., 154, 106569, https://doi.org/10.1016/j.envint.2021.106569, 2021. a, b, c, d

Kerckhoffs, J., Khan, J., Hoek, G., Yuan, Z., Ellermann, T., Hertel, O., Ketzel, M., Jensen, S. S., Meliefste, K., and Vermeulen, R.: Mixed-effects modeling framework for Amsterdam and Copenhagen for outdoor NO2 concentrations using measurements sampled with google street view cars, Environ. Sci. Technol., 56, 7174–7184, https://doi.org/10.1021/acs.est.1c05806, 2022a. a, b, c, d, e

Kerckhoffs, J., Khan, J., Hoek, G., Yuan, Z., Hertel, O., Ketzel, M., Jensen, S. S., Al Hasan, F., Meliefste, K., and Vermeulen, R.: Hyperlocal variation of nitrogen dioxide, black carbon, and ultrafine particles measured with Google Street View cars in Amsterdam and Copenhagen, Environ. Int., 170, 107575, https://doi.org/10.1016/j.envint.2022.107575, 2022b. a, b, c

Kerckhoffs, J., Hoek, G., and Vermeulen, R.: Mobile monitoring of air pollutants; performance evaluation of a mixed-model land use regression framework in relation to the number of drive days, Environ. Res., 240, 117457, https://doi.org/10.1016/j.envres.2023.117457, 2024. a

Knibbs, L. D., van Donkelaar, A., Martin, R. V., Bechle, M. J., Brauer, M., Cohen, D. D., Cowie, C. T., Dirgawati, M., Guo, Y., Hanigan, I. C., Johnston, F. H., Marks, G. B., Marshall, J. D., Pereira, G., Jalaludin, B., Heyworth, J. S., Morgan, G. G., and Barnett, A. G.: Satellite-Based Land-Use Regression for Continental-Scale Long-Term Ambient PM2.5 Exposure Assessment in Australia, Environ. Sci. Technol., 52, 12445–12455, https://doi.org/10.1021/acs.est.8b02328, 2018. a

Kokkalis, P., Amiridis, V., Allan, J., Papayannis, A., Solomos, S., Binietoglou, I., Bougiatioti, A., Tsekeri, A., Nenes, A., Rosenberg, P., Marenco, F., Marinou, E., Vasilescu, J., Nicolae, D., Coe, H., Bacak, A., and Chaikovsky, A.: Validation of LIRIC aerosol concentration retrievals using airborne measurements during a biomass burning episode over Athens, Atmos. Res., 183, 255–267, https://doi.org/10.1016/j.atmosres.2016.09.007, 2016. a

Lee, H. J.: Benefits of High Resolution PM2.5 Prediction using Satellite MAIAC AOD and Land Use Regression for Exposure Assessment: California Examples, Environ. Sci. Technol., 53, 12774–12783, https://doi.org/10.1021/acs.est.9b03799, 2019. a

Lee, K., Wu, J., Hudda, N., Fruin, S., and Delfino, R.: Modeling the Concentrations of On-Road Air Pollutants in Southern California, Environ. Sci. Technol., 47, 9291–9299, https://doi.org/10.1021/es401281r, 2013. a

Li, H. Z., Gu, P., Ye, Q., Zimmerman, N., Robinson, E. S., Subramanian, R., Apte, J. S., Robinson, A. L., and Presto, A. A.: Spatially dense air pollutant sampling: Implications of spatial variability on the representativeness of stationary air pollutant monitors, Atmos. Environ. X, 2, 100012, https://doi.org/10.1016/j.aeaoa.2019.100012, 2019. a

Lim, C., Kim, H., Vilcassim, M. J. R., Thurston, G., Gordon, T., Chen, L.-C., Lee, K., Heimbinder, M., and Kim, S.-Y.: Mapping urban air quality using mobile sampling with low-cost sensors and machine learning in Seoul, South Korea, Environ. Int., 131, 105022, https://doi.org/10.1016/j.envint.2019.105022, 2019. a

Lin, C., Wang, Y., Ooka, R., Flageul, C., Kim, Y., Kikumoto, H., Wang, Z., and Sartelet, K.: Modeling of street-scale pollutant dispersion by coupled simulation of chemical reaction, aerosol dynamics, and CFD, Atmos. Chem. Phys., 23, 1421–1436, https://doi.org/10.5194/acp-23-1421-2023, 2023. a

Lin, C., Ooka, R., Kikumoto, H., Flageul, C., Kim, Y., Zhang, Y., and Sartelet, K.: Impact of solid road barriers on reactive pollutant dispersion in an idealized urban canyon: A large-eddy simulation coupled with chemistry, Urban Clim., 55, 101989, https://doi.org/10.1016/j.uclim.2024.101989, 2024. a

Lloyd, M., Ganji, A., Xu, J., Venuta, A., Simon, L., Zhang, M., Saeedi, M., Yamanouchi, S., Apte, J., Hong, K., Hatzopoulou, M., and Weichenthal, S.: Predicting spatial variations in annual average outdoor ultrafine particle concentrations in Montreal and Toronto, Canada: Integrating land use regression and deep learning models, Environ. Int., 178, 108106, https://doi.org/10.1016/j.envint.2023.108106, 2023. a

Lu, M., Soenario, I., Helbich, M., Schmitz, O., Hoek, G., van der Molen, M., and Karssenberg, D.: Land use regression models revealing spatiotemporal co-variation in NO2, NO, and O3 in the Netherlands, Atmospheric Environment, 223, 117238, https://doi.org/10.1016/j.atmosenv.2019.117238, 2020. a

Ma, X., Zou, B., Deng, J., Gao, J., Longley, I., Xiao, S., Guo, B., Wu, Y., Xu, T., Xu, X., Yang, X., Wang, X., Tan, Z., Wang, Y., Morawska, L., and Salmond, J.: A comprehensive review of the development of land use regression approaches for modeling spatiotemporal variations of ambient air pollution: A perspective from 2011 to 2023, Environ. Int., 183, 108430, https://doi.org/10.1016/j.envint.2024.108430, 2024. a

Marin, C., Marmureanu, L., Radu, C., Dandocsi, A., Stan, C., Toanca, F., Preda, L., and Antonescu, B.: Wintertime Variations of Gaseous Atmospheric Constituents in Bucharest Peri-Urban Area, Atmosphere, 10, 478, https://doi.org/10.3390/atmos10080478, 2019. a, b

Marmureanu, L., Vasilescu, J., Stefanie, H., and Talianu, C.: Chemical and optical characterization of submicronic aerosol sources, Environ. Eng. Manage. J., 16, 2165–2172, https://doi.org/10.30638/eemj.2017.223, 2017. a

Mărmureanu, L., Marin, C. A., Andrei, S., Antonescu, B., Ene, D., Boldeanu, M., Vasilescu, J., Viţelaru, C., Cadar, O., and Levei, E.: Orange snow – a Saharan dust intrusion over Romania during winter conditions, Remote Sens., 11, 2466, https://doi.org/10.3390/rs11212466, 2019. a

Messier, K. P., Chambliss, S. E., Gani, S., Alvarez, R., Brauer, M., Choi, J. J., Hamburg, S. P., Kerckhoffs, J., LaFranchi, B., Lunden, M. M., et al.: Mapping air pollution with Google Street View cars: Efficient approaches with mobile monitoring and land use regression, Environ. Sci. Technol., 52, 12563–12572, https://doi.org/10.1021/acs.est.8b03395, 2018. a

Mikeš, O., Sanka, O., Rafajová, A., Vlaanderen, J., Chen, J., Hoek, G., Klanova, J., and Cupr, P.: Development of historic monthly land use regression models of SO2, NOx and suspended particulate matter for birth cohort ELSPAC, Atmos. Environ., 301, 119688, https://doi.org/10.1016/j.atmosenv.2023.119688, 2023. a

Miri, M., Ghassoun, Y., Dowlatabadi, A., Ebrahimnejad, A., and Löwner, M.-O.: Estimate annual and seasonal PM1, PM2.5 and PM10 concentrations using land use regression model, Ecotoxicol. Environm. Safe., 174, 137–145, https://doi.org/10.1016/j.ecoenv.2019.02.070, 2019. a

Ndiaye, A., Shen, Y., Kyriakou, K., Karssenberg, D., Schmitz, O., Flückiger, B., Hoogh, K. d., and Hoek, G.: Hourly land-use regression modeling for NO2 and PM2.5 in the Netherlands, Environ. Res., 256, 119233, https://doi.org/10.1016/j.envres.2024.119233, 2024. a

Nemuc, A., Stachlewska, I., Vasilescu, J., Górska, A., Nicolae, D., and Talianu, C.: Optical Properties of Long-Range Transported Volcanic Ash over Romania and Poland During Eyjafjallajökull Eruption in 2010, Acta Geophys., 62, 350–366, https://doi.org/10.2478/s11600-013-0180-7, 2013. a

OpenStreetMap: OpenStreetMap database [PostgreSQL via API], © OpenStreetMap contributors, available under the Open Database Licence from: https://openstreetmap.org (last access: 22 February 2024), Data mining by Overpass turbo, available at https://overpass-turbo.eu (last access: 22 February 2024), 2021. a

Patton, A., Collins, C., Naumova, E., Zamore, W., Brugge, D., and Durant, J.: An Hourly Regression Model for Ultrafine Particles in a Near-Highway Urban Area, Environ. Sci. Technol., 48, 3272–3280, https://doi.org/10.1021/es404838k, 2014. a

Pîrloagă, R., Adam, M., Antonescu, B., Andrei, S., and Ştefan, S.: Ground-Based Measurements of Wind and Turbulence at Bucharest–Măgurele: First Results, Remote Sens., 15, 1514, https://doi.org/10.3390/rs15061514, 2023. a

Ramacher, M., Kakouri, A., Speyer, O., Steiling, J., Karl, M., Timmermans, R., Denier van der Gon, H., Kuenen, J., Gerasopoulos, E., and Athanasopoulou, E.: The UrbEm Hybrid Method to Derive High-Resolution Emissions for City-Scale Air Quality Modeling, Atmosphere, 12, 1404, https://doi.org/10.3390/atmos12111404, 2021. a

Rongqi Abbie, L., Yaguang, W., Xinye, Q., Anna, K., and Joel D., S.: Short term exposure to air pollution and mortality in the US: a double negative control analysis, Environ. Health, 21, 81, https://doi.org/10.1186/s12940-022-00886-4, 2022. a

Saha, P. K., Li, H. Z., Apte, J. S., Robinson, A. L., and Presto, A. A.: Urban Ultrafine Particle Exposure Assessment with Land-Use Regression: Influence of Sampling Strategy, Environ. Sci. Technol., 53, 7326–7336, https://doi.org/10.1021/acs.est.9b02086, 2019. a

Schmitz, O., Beelen, R., Strak, M., Hoek, G., Soenario, I., Brunekreef, B., Vaartjes, I., Dijst, M., Grobbee, D., and Karssenberg, D.: High resolution annual average air pollution concentration maps for the Netherlands, Sci. Data, 6, 190035, https://doi.org/10.1038/sdata.2019.35, 2019. a

Seibold, H., Hothorn, T., and Zeileis, A.: Detecting treatment-subgroup interactions in clustered data with generalized linear mixed-effects model trees, Adv. Data Anal. Classi., 13, 703–725, https://doi.org/10.1007/s11634-018-0342-1, 2019. a

Seihei, N., Farhadi, M., Takdastan, A., Asban, P., Kiani, F., and Mohammadi, M. J.: Short-term and long-term effects of exposure to PM10, Clinic. Epidemiol. Global Health, 27, 101611, https://doi.org/10.1016/j.cegh.2024.101611, 2024. a

Shairsingh, K. K., Jeong, C.-H., Wang, J. M., Brook, J. R., and Evans, G. J.: Urban land use regression models: can temporal deconvolution of traffic pollution measurements extend the urban LUR to suburban areas?, Atmos. Environ., 196, 143–151, https://doi.org/10.1016/j.atmosenv.2018.10.013, 2019. a

Shi, T., Hu, Y., Liu, M., Li, C., Zhang, C., and Liu, C.: Land use regression modelling of PM2.5 spatial variations in different seasons in urban areas, Sci. Total Environ., 743, 140744, https://doi.org/10.1016/j.scitotenv.2020.140744, 2020. a

Snoun, H., Krichen, M., and Chérif, H.: A comprehensive review of Gaussian atmospheric dispersion models: current usage and future perspectives, Euro-Mediterran. J. Environ. Int., 8, 219–242, https://doi.org/10.1007/s41207-023-00354-6, 2023. a

Snyder, E. G., Watkins, T. H., Solomon, P. A., Thoma, E. D., Williams, R. W., Hagler, G. S. W., Shelow, D., Hindin, D. A., Kilaru, V. J., and Preuss, P. W.: The Changing Paradigm of Air Pollution Monitoring, Environ. Sci. Technol., 47, 11369–11377, https://doi.org/10.1021/es4022602, 2013. a

Stedman, J. R., Vincent, K. J., Campbell, G. W., Goodwin, J. W., and Downing, C. E.: New high resolution maps of estimated background ambient NOx and NO2 concentrations in the U.K., Atmos. Environ., 31, 3591–3602, https://doi.org/10.1016/S1352-2310(97)00159-3, 1997. a

Taheri Shahraiyni, H. and Sodoudi, S.: Statistical Modeling Approaches for PM10 Prediction in Urban Areas; A Review of 21st-Century Studies, Atmosphere, 7, 15, https://doi.org/10.3390/atmos7020015, 2016. a

Upadhya, A. R., Kushwaha, M., Agrawal, P., Gingrich, J. D., Asundi, J., Sreekanth, V., Marshall, J. D., and Apte, J. S.: Multi-season mobile monitoring campaign of on-road air pollution in Bengaluru, India: High-resolution mapping and estimation of quasi-emission factors, Sci. Total Environ., 914, 169987, https://doi.org/10.1016/j.scitotenv.2024.169987, 2024. a

Van den Bossche, J., Peters, J., Verwaeren, J., Botteldooren, D., Theunis, J., and De Baets, B.: Mobile monitoring for mapping spatial variation in urban air quality: Development and validation of a methodology based on an extensive dataset, Atmos. Environ., 105, 148–161, https://doi.org/10.1016/j.atmosenv.2015.01.017, 2015. a

Van den Bossche, J., Theunis, J., Elen, B., Peters, J., Botteldooren, D., and De Baets, B.: Opportunistic mobile air pollution monitoring: A case study with city wardens in Antwerp, Atmos. Environ., 141, 408–421, https://doi.org/10.1016/j.atmosenv.2016.06.063, 2016. a

Van den Bossche, J., De Baets, B., Verwaeren, J., Botteldooren, D., and Theunis, J.: Development and evaluation of land use regression models for black carbon based on bicycle and pedestrian measurements in the urban environment, Environ. Modell. Softw., 99, 58–69, https://doi.org/10.1016/j.envsoft.2017.09.019, 2018. a

Van den Hove, A., Verwaeren, J., Van den Bossche, J., Theunis, J., and De Baets, B.: Development of a land use regression model for black carbon using mobile monitoring data and its application to pollution-avoiding routing, Environ. Res., 183, 108619, https://doi.org/10.1016/j.envres.2019.108619, 2020. a

van Nunen, E., Vermeulen, R., Tsai, M.-Y., Probst-Hensch, N., Ineichen, A., Imboden, M., Naccarati, A., Tarallo, S., Raffaele, D., Ranzi, A., Nieuwenhuijsen, M., Jarvis, D., Amaral, A. F., Vlaanderen, J., Meliefste, K., Brunekreef, B., Vineis, P., Gulliver, J., and Hoek, G.: Associations between modeled residential outdoor and measured personal exposure to ultrafine particles in four European study areas, Atmos. Environ., 226, 117353, https://doi.org/10.1016/j.atmosenv.2020.117353, 2020. a

Wallek, S., Langner, M., Schubert, S., and Schneider, C.: Modelling Hourly Particulate Matter (PM10) Concentrations at High Spatial Resolution in Germany Using Land Use Regression and Open Data, Atmosphere, 13, 1282, https://doi.org/10.3390/atmos13081282, 2022. a

Wei, J., Li, Z., Li, K., Dickerson, R. R., Pinker, R. T., Wang, J., Liu, X., Sun, L., Xue, W., and Cribb, M.: Full-coverage mapping and spatiotemporal variations of ground-level ozone (O3) pollution from 2013 to 2020 across China, Remote Sens. Environ., 270, 112775, https://doi.org/10.1016/j.rse.2021.112775, 2022.  a

West, J., Cohen, A., Dentener, F., Brunekreef, B., Zhu, T., Armstrong, B., Bell, M., Brauer, M., Carmichael, G., Costa, D., Dockery, D., Kleeman, M., Krzyzanowski, M., Künzli, N., Liousse, C., Candice Lung, S.-C., Martin, R., Pöschl, U., Pope, C., and Wiedinmyer, C.: “What We Breathe Impacts Our Health: Improving Understanding of the Link between Air Pollution and Health”, Environ. Sci. Technol., 50, https://doi.org/10.1021/acs.est.5b03827, 2016. a

Wu, Y., Li, R., Cui, L., Meng, Y., Cheng, H., and Fu, H.: The high-resolution estimation of sulfur dioxide (SO2) concentration, health effect and monetary costs in Beijing, Chemosphere, 241, 125031, https://doi.org/10.1016/j.chemosphere.2019.125031, 2019. a

Xu, X., Qin, N., Qi, L., Zou, B., Cao, S., Zhang, K., Yang, Z., Liu, Y., Zhang, Y., and Duan, X.: Development of season-dependent land use regression models to estimate BC and PM1 exposure, Sci. Total Environ., 793, 148 540, https://doi.org/10.1016/j.scitotenv.2021.148540, 2021a. a, b

Xu, X., Qin, N., Yang, Z., Liu, Y., Cao, S., Zou, B., Jin, L., Zhang, Y., and Duan, X.: Potential for developing independent daytime/nighttime LUR models based on short-term mobile monitoring to improve model performance, Environ. Pollut., 268, 115951, https://doi.org/10.1016/j.envpol.2020.115951, 2021b. a

Yuan, Z., Kerckhoffs, J., Shen, Y., Hoogh, K. d., Hoek, G., and Vermeulen, R.: Integrating large-scale stationary and local mobile measurements to estimate hyperlocal long-term air pollution using transfer learning methods, Environ. Res., 228, 115836, https://doi.org/10.1016/j.envres.2023.115836, 2023. a

Yuan, Z., Kerckhoffs, J., Li, H., Khan, J., Hoek, G., and Vermeulen, R.: Hyperlocal Air Pollution Mapping: A Scalable Transfer Learning LUR Approach for Mobile Monitoring, Environ. Sci. Technol., 58, 14372–14383, https://doi.org/10.1021/acs.est.4c06144, 2024. a

Zapata-Marin, S., Schmidt, A. M., Crouse, D., Ho, V., Labrèche, F., Lavigne, E., Parent, M.-E., and Goldberg, M. S.: Spatial modeling of ambient concentrations of volatile organic compounds in Montreal, Canada, Environ. Epidemiol., 6, e226, https://doi.org/10.1097/EE9.0000000000000226, 2022. a

Zhang, L., Tian, X., Zhao, Y., Liu, L., Li, Z., Tao, L., Wang, X., Guo, X., and Luo, Y.: Application of nonlinear land use regression models for ambient air pollutants and air quality index, Atmos. Pollut. Res., 12, 101186, https://doi.org/10.1016/j.apr.2021.101186, 2021. a

Zhao, Z., Jing, Y., Luo, X.-S., and Li, Hanhan andTang, M.: Overview and Research Progresses in Chemical Speciation and In Vitro Bioaccessibility Analyses of Airborne Particulate Trace Metals, Curr. Pollut. Rep., 7, 540–548, https://doi.org/10.1007/s40726-021-00200-9, 2021. a

Download
Short summary
For Bucharest, Romania's capital, mobile measurements during two intensive campaigns and mixed-effect LUR (land-use regression) models to derive seasonal maps of near-surface PM10, NO2 and UFPs (ultrafine particles) have successfully been used. The model's performance was evaluated, demonstrating its potential for high-resolution mapping in other cities with well-characterized urban structures and diverse in situ monitoring stations.
Share
Altmetrics
Final-revised paper
Preprint