Unraveling pathways of elevated ozone induced by the 2020 lockdown in Europe by an observationally constrained regional model using TROPOMI

Souri, Amir H.; Chance, Kelly; Bak, Juseon; Nowlan, Caroline R.; González Abad, Gonzalo; Jung, Yeonjin; Wong, David C.; Mao, Jingqiu; Liu, Xiong

doi:https://doi.org/10.5194/acp-21-18227-2021

Articles | Volume 21, issue 24

https://doi.org/10.5194/acp-21-18227-2021

© Author(s) 2021. This work is distributed under
the Creative Commons Attribution 4.0 License.

https://doi.org/10.5194/acp-21-18227-2021

© Author(s) 2021. This work is distributed under
the Creative Commons Attribution 4.0 License.

Articles | Volume 21, issue 24

Research article

|

16 Dec 2021

Research article |

| 16 Dec 2021

Unraveling pathways of elevated ozone induced by the 2020 lockdown in Europe by an observationally constrained regional model using TROPOMI

Amir H. Souri, Kelly Chance, Juseon Bak, Caroline R. Nowlan, Gonzalo González Abad, Yeonjin Jung, David C. Wong, Jingqiu Mao, and Xiong Liu

Download

Final revised paper (published on 16 Dec 2021)
Supplement to the final revised paper
Preprint (discussion started on 12 Mar 2021)
Supplement to the preprint

Interactive discussion

Status: closed

RC1:
'Comment on acp-2021-121', Anonymous Referee #3, 02 Apr 2021

This paper is an extensive and intensive study of the NOx, VOC and O3 changes over Europe due to the Lockdown using WRF-CMAQ and TROPOMI data. In a sense, it is 2 papers combined into 1. The first part is the assimilation of satellite data to adjust emission inventories of NOx and VOC. The second part is a process analysis of ozone formation.

The paper seems rigorous and is well written, I am happy to recommend publication. Below are some minor comments that you may wish to consider.

Minor Comments:

Table 2 and 3: Do you mean that these are the differences between 2019 and 2020 after assimilation?

The inconvenience of having so much material in a single paper is that the paper brushes over a fair amount of the information on the assimilation.

I think it would be good to show the emissions in the prior as well as the emissions in the 2019 posterior and the 2020 posterior.

The caption should clarify that these are estimates based on inversions using TROPOMI (vs. estimates based on ratios of TROPOMI data,for example).

Fig. 10: There is a big difference in ozone production rates in Eastern Europe between rural and urban areas. This is discussed in Section 3.4. Given the importance of the question of ozone sensitivity, would it be possible to provide average values of ozone changes for NW Europe, Rural East Europe and Urban East Europe? This could be integrated into a discussion of NOX/VOC sensitivity.

Fig. 10: It would be instructive to see the corresponding average MDA8 Ozone maps. We see many difference plots, but without seeing the actual average values that these are departures from, it is hard to get a sense of what is going on.

Technical Comments:

Fig. 3: “Estimate” not “Estmate”

Fig. 4: I think you mean Delta X = X_2020 – X_2019 – this is what you have in the text and elsewhere.

Fig. 5: Could zoom in on the area with data, which would make the figure more legible.

Fig. 10: There is room to spell out Delta in the title. I think this would make it easier for the casual reader to follow, eg. Delta O3P = O3P_2020 - O3P_2019

Citation: https://doi.org/10.5194/acp-2021-121-RC1
- AC1: 'Reply on RC1', Amir Souri, 17 May 2021
  
  The comment was uploaded in the form of a supplement: https://acp.copernicus.org/preprints/acp-2021-121/acp-2021-121-AC1-supplement.pdf
  
  Citation: https://doi.org/10.5194/acp-2021-121-AC1
RC2:
'Comment on acp-2021-121', Anonymous Referee #1, 06 Apr 2021

Souri and co-authors present an analysis of lockdown-induced changes in NO2, HCHO, and O3 over Europe based on a data assimilation approach involving TROPOMI measurements (NO2, HCHO) and the WRF-CMAQ model. An advantage of the approach is that it explicitly accounts for meteorological influences and so forth in assessing the causes of AQ changes during the COVID period.

The paper topic is suitable for ACP and will make a useful contribution to the literature. There are some methodological and science comments and questions that I feel should be addressed before publication; these are listed below. In a number of cases things are described in a confusing way and need to be clarified. Finally, in many places the writing can be clearer or is overly wordy. For example, many of the paragraphs are about a page long and cover multiple topics, which really does not help communication. Once these issues are addressed I recommend publication in ACP.

=====================================

Science comments. Numbering refers to line numbers.

=====================================

1. The abstract is very long (almost 500 words!) which partly defeats the point of an abstract. I suggest reducing it by approximately half.

56. “Earth’s atmosphere has exponentially become more polluted during previous decades”. Too vague / sweeping. What do you mean by “previous decades”? Some parts of the world have become significantly less polluted (for PM and ozone) over the last 2-3 decades. Elsewhere I do not think the word “exponentially” is necessarily accurate.

78. “whereas the concentrations of several secondarily formed compounds such as ozone increased due to emissions and/or meteorology”. This is not universally true; e.g., see Bekbulat et al., https://doi.org/10.1016/j.scitotenv.2020.144693.

767. For reports, please list references in a way that readers can readily access them, e.g. with a doi or persistent URL.

138-139. RMSE already has a definition; what you are reporting here is not the RMSE.

148-150. “There are challenges…” Unclear what you mean here. Please rewrite for clarity.

153. “we uniformly scale up NO2 pixels by 25% based on the low bias determined by Verhoelst et al. [2021] while considering the potential reduction in the bias through the use of higher spatial resolution trace gas a priori profiles.” Not clear what is meant here by “while considering”. How was this considered? Do you mean it was considered by choosing 25% rather than the median value of 37% reported by Verhoelst? Or is something different being implied here? Please clarify.

153. The choice of a 25% bias correction for NO2 seems a bit arbitrary. As I understand it, the argument being presented is as follows: “the bias was reported previously to vary from -23 to -51%, with a median of -37%. But the use of higher-resolution shape factors here should reduce the bias. So, we use +25%.” I agree that higher-resolution shape factors will reduce the bias, but there is no quantification of that effect here, so the 25% value seems to be pulled out of a hat. There is also the fact that the TROPOMI bias was shown previously to vary between rural and polluted environments, but this is not accounted for here. Overall, there needs to be either a more rigorous justification for the bias correction being employed, and/or some sensitivity analysis to quantify the degree to which this assumption affects the results.

183-187. A similar comment applies to HCHO. I appreciate that the authors pay close attention to uncertainty and bias in the satellite data. But in the end the employed corrections are chosen a bit haphazardly from the range of reported biases. How can this choice be better justified, or if it is necessarily a little arbitrary, how can the impact of that assumption be quantified?

156. “we set the RMSE to 1.1x10^15 molec/cm2 in clear regions and 3.5x10^15 in moderately to highly polluted regions.” This is confusing because at this point in the text we don’t know what is meant by “set the RMSE”. We learn later that these values will be used to populate the error covariance matrices for the inversion; please clarify that here so the reader understands what is happening.

199-202. Does this mean that you only use the dark blue product? Please clarify.

The TROPOMI retrievals do not account for aerosols in the scattering weights. Yet I presume that aerosol loadings over Europe changed between the COVID and reference period. To what degree does this bias the retrieval differences and therefore the NO2 and HCHO comparisons between these periods?

236. “We nudge moisture, wind and temperature fields toward the reanalysis data used only outside of the PBL layer.” Wording is unclear, as is the reason for doing this. Please clarify.

239. “Extensive model evaluations based upon surface observations show a striking correspondence”. The model temperature bias appears to be 50% smaller in 2020 than 2019 (0.8 vs 1.2 degrees). Does this have any impact on the model interpretation of changes between years? For example, assessing changes in anthropogenic VOCs relies on distinguishing changes in biogenic emissions which depend exponentially on temperature.

239. PBL height is a major factor for model performance in simulating AQ-relevant species. How well does the simulation capture measured PBL depths over your domain?

242. Please state the time resolution at which you are optimizing emissions. I guess there is a single 3-month mean value being derived for each grid cell but unless I missed it I don’t think this is stated anywhere.

250. Please state how the Jacobian is calculated. Is there a finite difference run for every model grid cell, each tracer, and each iteration?

252. “In terms of the prior errors, we use the numbers reported in Souri et al. [2020a].” Since this is an important aspect of the inversion please briefly summarize here.

257. “here we iterate Eq 1 3 times.” How do you know this is sufficient? As you know the emission-concentration relationship for NOx in particular is highly non-linear. Do you employ a test for convergence?

275. “faster vertical mixing due to larger sensible fluxes (more diluted columns due to stronger advection in higher altitudes)”. This is a little convoluted. Faster vertical mixing by itself wouldn’t change the column amount, and faster winds during summer (really?) would only smear the columns.

277-280. Wording is quite awkward here.

281. “pronounced decreases”, please clarify that you mean in 2020 vs 2019.

282. “In contrast, we see negligible reductions…” actually some of the regions mentioned seem to show a clear March increase.

296. “suggests an abrupt hiatus in the ongoing reduced NOx emissions”. Unclear if this means the emissions went into hiatus or the reduction went into hiatus.

317. “but also stems from the fact that isoprene reactivity significantly increases by rising temperature [Pusede et al. 2015].” This is a bit oddly worded; I think you simply mean that OH is increasing seasonally along with isoprene emissions.

335-337. These evaluation statistics should be displayed in SI in a table or figures.

341-344. “However, in practical terms, the magnitude of these anomalies is not as drastic as the ratio of observation to model ratio because of the consideration of observational errors and chemical feedback [Souri et al., 2020a], which always leaves some doubt about the practicality of direct mass balance methods.” I am unsure what the authors are trying to say here.

358-360. The optimization naturally improves the simulation of HCHO with respect to TROPOMI, that is the whole point of the optimization. Does it also improve the simulation with respect to independent observations?

358-378. This paragraph is really unclear; I had to read it multiple times to try and parse what is being argued. It sounds like you’re arguing that the chemistry changed the emissions. Please rewrite.

395. “Horizontal transport (shown as wind vectors) plays a critical role in explaining the spatial variations in emissions downwind.” Why would wind affect the emissions?

397-418. This section is all quite speculative and unconvincing. It does not appear that there is much required information conveyed here, recommend deleting.

409. “This in turn will provide an opportunity for the volume of air to become dispersed”. Poor wording. The VOC lifetimes do not affect how a “volume of air is dispersed”.

422. “Unfortunately we limit the analysis to NO2 due to the lack of routinely measured HCHO observations.” The HCHO data are ultimately being used to constrain VOC emissions; so are there VOC measurements that can be used for this purpose?

440. “The surface measurements reinforce the less pronounced reduction in NO2 in northern Germany and UK, although the magnitudes are not as large as those suggested by the model.” This is not clear from the figure. For example the observations suggest that decreases over the UK in April and May are quite large compared to the rest of Europe.

492-496. “This apparent discrepancy is caused by the differences in boundary and initial conditions which are not quantifiable by the process analysis and would require additional sensitivity test.” Is it just the ICs and BCs, or is it that these processes being examined are not strictly independent and additive?

Equation 6 is incorrect (the wrong rate constant is indicated).

544. “This analysis strongly coincides with Lee et al. [2020] and Wyche et al. [2021] who observed roughly constant O3+NO2 concentrations over the UK before and during the lockdown 2020.” With this in mind, why not actually just show the modeled Ox = O3 + NO2 change (and measured change too, if available)? This seems like the most direct way to make this point.

572. “The reduced anthropogenic VOC emissions were a result of two key assumptions: the reduced NOx emissions in NOx-rich areas increased HCHO made from VOCs (evident in larger Jacobians derived from the regional model), and TROPOMI HCHO suggested a negligible difference in HCHO concentration between the two years.” Again the wording here is really confusing. It appears to be arguing that changes in NOx emissions and in the ensuing chemistry changed the actual VOC emission rates. I think I know what is meant (i.e., that these factors change the emission rates that one infers for a given HCHO level) but it really needs clearer description.

Minor edits.

73. “atmospheric composition” not “compositions”

78. “particulate matter”

157 and 185. “clean regions” and “clean areas” rather than “clear”

177. “mainly located” or “predominantly located”

437. “by only considering grid cells”

Citation: https://doi.org/10.5194/acp-2021-121-RC2
- AC2: 'Reply on RC2', Amir Souri, 17 May 2021
  
  The comment was uploaded in the form of a supplement: https://acp.copernicus.org/preprints/acp-2021-121/acp-2021-121-AC2-supplement.pdf
  
  Citation: https://doi.org/10.5194/acp-2021-121-AC2
RC3:
'Comment on acp-2021-121', Anonymous Referee #2, 15 Apr 2021

Major comments

This manuscript presents an inverse modeling study of NOx and VOC emissions over Europe in the spring of 2019 and 2020, based on TROPOMI NO2 and HCHO data. The focus is on the differences between the two years and on the detection of Covid-related effects. In agreement with previous studies, large NOx emission decreases are derived over most European countries in April 2020. In March and May, however, the picture is less clear, with some regions (e.g. large parts of UK and Germany in March) showing large emission increases in 2020 (Figure 4). Those changes are unrealistic and are contradicted by comparisons with surface NO2 measurements (Figure 7). The authors present the disparity between regions as a consequence of the different timing of Covid-lockdown measures over different regions of Europe, but the discussion is poor and does not present concrete arguments for the inferred patterns. Those patterns are probably related to the inability of the model to match the observed NO2 column distribution, in particular (but not only) over N-W Europe (see Figure S3). The prior model strongly overestimates NO2 over northern Germany and strongly underestimates NO2 in southern Germany and in many other regions. It would be most enlightening to examine the top-down emission increments for each year and month (and not just the differences between the two years). I suspect there will be huge disparities within several countries, especially Germany. It is highly unlikely that bottom-up emission inventory could perform so badly in terms of spatial distributions.

The model compares also very poorly against TROPOMI HCHO, although this is partly due to issues with the observations. As detailed further below, the high HCHO values in April (also May) in Scandinavia and Russia might be artefacts as evidenced by comparisons of TROPOMI with FTIR data (Vigouroux et al. 2020). The authors apply a crude "bias correction" to the data (decrease by 25% the values below 2.5E15 cm-2, increase by 30% the values above 7.5E15 cm-2) but it is inappropriate as it probably increases values which are already too high (e.g. in April over parts of Russia and Scandinavia) and it leads to different corrections being applied in 2019 and 2020, thereby creating artificial patterns in the differences between the two years. Given the magnitude of the biases between the model and the satellite, the inferred top-down emission differences (Figure 5) have no credibility at all. Top-down anthropogenic VOC emissions in April appear to be much higher than in March, without justification. In the abstract and conclusions, much case is made of the small VOC emission decrease in March 2020, compared to March 2019, as if it could be related to Covid. This is highly misleading since the averaging kernel (AK, Figure 5) is lowest (close to zero) in March, i.e. the decrease is not well constrained by the data. The authors emphasize very much the importance of the NOx inversion on the results for VOCs in March, but their arguments (that VOC emissions in Spain and Italy are decreased only to compensate for the stronger VOC oxidation above cities due to NOx decreases) do not stand scrutiny. As shown by Figure S5, the inversion does increase HCHO columns over Spain and Italy in 2019, bringing the model (a little bit) closer to the data. For some unknown reason, this HCHO increase does not happen in 2020 despite similarly high TROPOMI columns. This explains the March patterns of Figure 5. There is an influence of the NOx optimization on the VOC results, but probably not the onedescribed by the paper.

The paper is too long and, at many instances, not clearly written. I have provided a number of suggestions for improvement, but I encourage the authors to make a general effort towards more clarity. Many sentences and entire paragraphs are given which do not add much to the discussion. For example, Figure 6 is very long to describe, but I am not sure whether it really helps to interpret the results. If it does, please make it more clear and remove unnecessary parts. On the other hand, much information about the model and methodology is incomplete or clearly wrong (e.g. the adopted errors for TROPOMI HCHO).

In conclusion, I do not recommend the paper for publication (in its present form) since its conclusions are not well supported by analysis of the data. I recommend to scale down the ambition of the paper. The HCHO (and AOD) data do not seem to help constraining the emissions. The NOx part could be interesting if presented honestly with its caveats. Sensitivity inversions would help to appreciate the uncertainties and robustness of the conclusions.

Minor comments

Abstract: very long, should be shortened

l 122-124 "Since vertical column densities (...° depend on assumed gas profile shape (...), we recalculate those shape factors using profiles from our (...) model": the air mass factors being a complex function of profile shapes, cloudiness, albedo, etc., more details are needed to describe how the profile shapes are taken into account.

l 156 Why the RMSE? Do you mean the assumed uncertainty on the NO2 columns from TROPOMI?

l 156 The values of 1.1E15 and 3.5E15 molec cm-2 seem arbitrary. Please provide better explanation of how those were derived (as they play an important role in the emission inversion)

l 176-178 "Vigouroux et al (...) majorly located over pristine areas and 9 MAX-DOAS stations..." : wrong. The paper concerns FTIR stations, not MAX-DOAS. Furthermore, many of the FTIR sites are in cities (e.g. Paris, Bremen, Mexico city, etc.). Please check the references you cite.

l 181-183 "The agreement between MAX-DOAS...": again, this paper concerns FTIR data only. Please provide the correct references to your statements.

l 184-186 Please provide the precise procedure used for deriving those numbers.

l 186-187 A value of 4% seems extremely low and unlikely given the large biases and scatter of the FTIR-TROPOMI comparisons.

l 188-201 The motivation for using MODIS AOD is not made clear. Clarify. If it does not bring anything, why this complication?

l 209-210 The assumption that the interferences are similar in 2019 and 2020 due to low photochemistry is crude. In Lamsal et al 2008, the correction factors in spring over the U.S. range typically between 0.4 and 0.7. Since CMAQ calculates the interefering species (PAN etc.), why don't you apply the correction proposed by Lamsal et al.? It is a rough correction but it would be better than no crrection at all.

l 228 Do you use gridded maps of the emission factors or PFT distributions in conjunction with the emission factors from Table 2 in Guenther et al 2012?

l 230 Does the model include soil and lightning NO emissions? The use of fertilizers could be a significant source in spring.

l 230 Is the diurnal cycle of anthropogenic emissions taken into account? Regarding the biogenic emissions from MEGAN, are diurnal and day-to-day variations included? What VOC species are emitted (besides isoprene)?

l 247 Is y a collection of monthly NO2 columns or daily NO2 columns? Is the model sampled at the satellite overpass time? Specify the temporal tolerance window.

l 257 Why three times? How do know whether this is sufficient?

l 263-267 The rationale for this assimilation of MODIS AOD is not clear.

l 275 The faster vertical mixing should generally lead to higher NO2 columns due to the higher sensitivity of TROPOMI to NO2 at higher altitudes. Stronger advection does not change much when averaged over a sufficiently large area. Clearly, the increased photochemical activity is by far the main reason for lower NO2 columns in later months.

l 282 "we see negligible reductions..." : there is no reduction at all. There is a significant increase in these regions (boxes B, C and D). Rephrase.

l 289 "northern Germany is associated with less populated areas": quite an extraordinary statement. Please look at population density maps. Please focus on relevant information, e.g. the timing of the lockdown, besides meteorological variability. When did lockdown measures take effect in Germany, France, Italy, etc.?

l 308 A detection limit of 7E15 cm-2 is not "very low", since it is higher than the TROPOMI columns at most locations in March-May (Figure S5)

l 315 The reference Karlsson et al. 2013 does not inform on the occurrence of biomass burning in 2019

l 315 Over St Petersburg, the FTIR HCHO column in April 2019 is about 4.2E15 cm-2 (Vigouroux et al. 2020), a factor of 1.6 below the TROPOMI column. A similar overestimation is found at Kiruna. In May, the discrepancy is even higher at Scandinavian sites. Clearly, TROPOMI data over Northern Europe and Russia in spring need to be considered with extreme caution. The "dipole anomaly" (line 319) might very well be an artefact (at least quantitatively)

l 317 "the fact that isoprene reactivity significantly increases by rising temperature": the OH-rate constant actually decreases at higher temperatures. The chemical lifetime of isoprene is always short enough that it is oxidized close to the emission area. Nevertheless, there is a longer delay in winter/spring before oxidation products like MACR and MVK get oxidized and form HCHO. But this should not play a significant role compared with the temperature-dependence of biogenic emissions. Note furthermore that over Russia and Scandinavia, where coniferous trees are dominant, monoterpenes (not isoprene) might be the main biogenic precursors of HCHO. Are those emissions (and their subsequent chemical oxidation) considered in the model? If not, what could be the consequence of their omission?

Section 3.2 Before discussing the top-down emissions, the paper should discuss the performance of the a priori model against satellite data. I find striking that the model fails at reproducing many prominent features of NO2 column distributions. Why is CMAQ NO2 so high along the coasts of Germany and Holland whereas it is notably too low e.g. over southern Germany? Over Ukraine and other regions, the model is too low by a very large factor (>4). The paper should show not just the top-down emissions but also the emission increments and discuss whether those increments have any plausibility. I have serious doubts on that matter. The a priori emission distribution (from CEDS) might have some uncertainty but cannot be completely wrong.

l 328 "elsewhere": elsewhere in the paper or in a further study? I would guess that these aerosol changes have only limited impacts on NO2 and HCHO. If so, the impact of AOD assimilation should be either briefly mentioned or dropped entirely from the paper. If not, it would be interesting to discuss more in detail.

l 335-336 "large reduction (...) in the bias associated with simulated surface NO2": why not show this, e.g. in the Supplement?

l 338-339 "the discrepancies between the simulated tropospheric NO2 columns versus TROPOMI are largely mitigated by the inversion": only in region with highest emissions, not at all elsewhere.

l 342 "because of the consideration of observational errors": but the choice of NO2 column errors was pretty arbitrary (as far as I understand). It could be useful to show inversion results adopting alternative choices of those errors and other setup parameters.

l 343 During summer and even in spring (at least in southern Europe), the feedback would be: if NOx increase, then OH increase, then the NOx lifetime decreases, implying a larger NOx emission increase is needed to match the NO2 enhancement. Therefore, it does not seem obvious that chemical feedbacks would decrease the magnitude of the anomalies. Please clarify, or drop the mention of chemical feedback.

l 344 "some doubt the practicality of direct mass balance methods": at least, such methods provide a direct answer independent on assumptions regarding uncertainties.

Table 2: The absolute differences of top-down NOx emissions are not really useful and could be dropped.

l 358 As for NOx, a discussion of the model performance is needed for HCHO, before discussing the inversion. In addition, the a priori VOC emissions should be shown for the 3 months. Generally speaking, there is a huge underestimation of the model against TROPOMI HCHO (Fig S5). That might be partly due to biases in the data (see above regarding FTIR vs TROPOMI comparisons) but should clearly be mentioned. My guess is that the model would underestimate the FTIR HCHO columns at sites like Paris and Bremen. In any case, the large uncertainties in TROPOMI HCHO make the inversion results unreliable (except maybe at low latitudes in May). The differences "Lockdown-Baseline" (Figure 5) are even more uncertain. I think they should not be shown at all as they might mislead the reader.

l 366-367 "This tendency, which is striking, mainly stems from the indirect impacts of the reduced NOx emissions on HCHO": the reasoning is overly simplistic. E.g. over Spain, the largest change is not seen over Madrid but in an area to the west of the city. Over Italy as well, the changes are spread over wide areas. Furthermore, in April the VOC emissions are found to increase quite a lot over cities like Paris, Rome, Milano, etc. Obviously the patterns are primarily dictated by the large differences between TROPOMI HCHO and the model, despite the large HCHO uncertainty adopted in the inversion.

An additional worry concerns the seasonality of top-down emissions in 2019. According to Fig 5, VOC emissions in April are considerably higher than in March over most countries. The retrieved emission patterns indicate primarily anthropogenic emissions. How can this be justified?

Section 3.3.1 could be shortened. Get to the point!

l 440 In March, surface NO2 is higher in 2020 than in 2019 according to the model (not the data), consistent with the column changes (Fig 4). This confirms the suspicion that the NOx emission changes in these regions are unreliable due to large model errors.

Figure 8: is this given for 2019 or 2020? I suggest providing both years, but in the Supplement instead of the main article

l 566-568 "large spatial and temporal variability associated with the reduction in NOx was evident as each country might have different level and timeline of restrictions": however, the discussion of this aspect is poor in the paper. "emissions decreased in April rather than March in some portions of UK, northern Germany...": do you really mean that the UK and Germany both showed significant regional differences in the lockdown measures? I doubt very much that it was the case, but if it is, it should be discussed and better justified.

l 569-571 "we showed that anthropogenic VOC emissions over Paris (...) decreased in March (...) achievable through jointly using NO2 and HCHO observations" as noted above, this is very doubtful. You have not made your case that the VOC emission changes are due to NOx emission changes. For that, you should realize a separate inversion using only TROPOMI HCHO and compare with the standard inversion.

Technical/language comments

l 32 "estimate of the NO2 reduction is underestimated": rephrase

l 32 "a picture that correlates with the TROPOMI etc.": unclear

l 37 "TROPOMI HCHO sets an upper limit for HCHO changes such that the chemical feedback (...) reveals a non-negligible decline..." : unclear. That a feedback reveals a decline doesn't make sense.

l 44 "Results of integrated process rates of MDA8 surface ozone": unclear

l 51 "capture the essential character changes..." essential in what sense? Unclear.

l 57 "has exponentially become more polluted during previous decades": wrong over Europe

l 59 "impulsive and sweeping" : not clear what is meant

l 78 "particulate matter" (drop the s)

l 100 Why the indentation?

l 143 low spatial resolution (remove hyphen)

l. 154 "while considering" unclear

l 156 The sentence should make more clear that "clear" is <6x1015 molec/cm2 and polluted is above that level. Use the proper symbol for >=

l 176 "Those biases oscillates around 8x1015 molec/cm2": completely unclear.

l 177 "majorly" -> mainly

l 236-237 The sentence "We nudge moisture (...) data used only outside of the PBL layer" is a bit ambiguous,

please rephrase

l 237 "PBL layer" is redundant

l 240 the correspondence is good, not striking.

l 255 are assumed diagonal

l 277 "unintended" is weird. NO2 columns have no intention. Rephrase.

l 278 "are first the free-tropospheric region complication": what does this mean? Not clear at all.

l 279 "a barrier to obtaining high amount of information from the sensor..." unclear, rephrase.

l 296 "suggests an abrupt hiatus in the ongoing reduced NOx emissions": unclear

l 300 Why "potential"?

l 302 "leading to striking HCHO column patterns with large variations" does not tell anything, please remove.

l 303 "higher chance": is it really a matter of chance? Rephrase.

l 304 "looking at": rephrase

l 312-313 "are below the detection limit (...) to relate them to the lockdown...": lousy wording, please rephrase

(e.g. remove the last part of sentence"

l 313 "nonetheless TROPOMI sets an upper limit of these changes": not useful

l 363 "we surprisingly observe": weird wording

l. 426-427 "ignoring spatial representivity function to directly compare point measurements...": unclear, rephrase

l 430 "then are then"

l 433 "heterogenicity" --> "heterogeneity"

l 440 "The surface measurements reinforce the less pronounced reduction in NO2 in northern Germany and UK": unclear.

L 458-460 The sentence "This tendency potentially is driven (...) has drawn much attention" is grammatically incorrect.

l 461 "The challenge is to simulate a model": unclear

l 463 "essential character": unclear, rephrase

l 475 "namely as": delete "as"

l 523 In Equation (6), the rate constant should be k(O3+NO), not k(OH+NO2+M)

l 560 Remove comma at end of sentence

l 1009 "explain" --> "describe"

Table 1 hyphen in MAX-DOAS

Table 2: too many significant digits are given.

Citation: https://doi.org/10.5194/acp-2021-121-RC3
- AC3: 'Reply on RC3', Amir Souri, 18 May 2021
  
  The comment was uploaded in the form of a supplement: https://acp.copernicus.org/preprints/acp-2021-121/acp-2021-121-AC3-supplement.pdf
  
  Citation: https://doi.org/10.5194/acp-2021-121-AC3

Peer review completion

AR: Author's response | RR: Referee report | ED: Editor decision | EF: Editorial file upload

AR by Amir Souri on behalf of the Authors (18 May 2021) Author's response Author's tracked changes Manuscript

ED: Referee Nomination & Report Request started (07 Jun 2021) by Michel Van Roozendael

RR by Anonymous Referee #2 (28 Jul 2021)

Suggestions for revision or reasons for rejection

The authors did address many of my comments, and I thank them for that. The paper now provides a much more complete description of what the model and the inversions do, and this facilitates the interpretation of the results. The paper also new comparisons with surface NO2 data (Fig. S7-S10).

Still, regarding many concerns, I remain unconvinced, as explained below. The authors keep insisting in their response that "the results in the manuscript are based on the a posteriori, not the a priori" as if the large (sometimes huge) model biases were simply irrelevant. It is obvious from Fig. S11-S14 that both the prior and the posterior model fail to match the observations at most locations except the most polluted. Over many regions (e.g. Ukraine, etc.), the model-data difference for NO2 is systematic and exceeds the TROPOMI error (~1.1E15, Verhoelst et al.). This implies serious issues in the model and/or in the data (e.g. in the bias correction). Things are even much worse regarding HCHO. If we are clueless as to the causes for such discrepancies over moderately polluted regions, why should one trust the results in very polluted areas? True, the AKs provide an indication regarding where and when the inversion results are most reliable (if we accept the hypothesis that model and data are not too biased). However following that guideline, one would have to accept as very credible the NOx results for Germany in March (Figure 2 and Figure 4) indicating a strong emission increase in Northern Germany (in 2020 with respect to 2019) and an emission decrease in Southern Germany. This discrepancy between regions in the same country is an obvious artefact, as the authors implicitly admitted by removing the discussion on that region. This is a "COVID-19 paper" and the reader should be given some clues regarding patterns of emission changes which are obviously wrong. No need for cell-phone, traffic or industrial data for that. The paper does not provide any clue, probably because of the too many issues with the data (largely due to cloudiness, Fig. S16-17) and especially with the model. For example, the very wrong distribution of NO2 columns in the model over Germany (Fig S11) should have prompted the authors to try to explore its possible causes instead of relying exclusively on the power of inverse modelling. Maybe the inversion is correct, but how does it help anyone if we don't understand why?

That being said, the authors have accounted for many of my concerns and updated the text accordingly. The authors realized that the NO2 results are less reliable in March and May, which is why the ozone analysis is restricted to April, as it should. They state to their defense that "remote sensing data provide limited information for optimizing emissions [many references]" which I find contradictory since emission optimization is precisely the methodology adopted in this paper, and the paper provides in great detail relative and absolute differences (2020-2019) of top-down emissions (Figure 4 and Table 2) despite those limitations.

The new figures S7-S11 with the comparisons with surface NO2 data are interesting but I wonder about their significance. There is an overall bias reduction (31%-45%, although the precise meaning of this range is not made clear) but, despite the TROPOMI NO2 constraint, a large systematic underestimation of modelled NO2 remains with respect to the data. Why is that? Could this be due to representativity issues? Or interferences in the NO2 measurements? Note that daily averages are strongly influenced by night-time chemistry. For a proper evaluation of TROPOMI-constrained model values, it would be preferable to sample the data as well as the model in the early afternoon (say between 12 and 15 LT), since night-time chemistry is not well constrained by TROPOMI. In addition, it would be useful to summarize the comparison over the different subregions of Fig. S7-S11 in a table (including mean bias, mean absolute deviation and correlation coefficient). The authors state in their response "The absolute amount of CEDS is too low in many places" as explanation for the model underestimation. This is not a comparison of CEDS with NO2 data. There might be issues in the model (PBL mixing, chemistry) or in the data (interferences, representativity issues) which should be acknowledged first before holding the emissions as responsible.

The authors complain that they "do not understand why this reviewer is concerned about the prior emissions. This paper is not about validating CEDS (...) We could have used EDGAR emissions and reached similar results (...) [or] even used constant emission rates throughout Europe and induce the emission changes by TROPOMI". This is wrong. The striking similarity between prior and post NO2 columns (Fig S11-S14) indicates clearly that both TROPOMI and the prior determine the solution, and therefore the choice of the prior does matter a great deal. Even in areas with high AKs where the inversion is mostly driven by the observations, the changes in the patterns of the emissions induced by the inversion must be questioned: are those real or could they be related to issues in the model or in the observations? I'm not asking you to solve these issues but only to consider and discuss them in a more balanced way.

Regarding the HCHO inversion I still believe that the results of the inversion have very little value given the high differences between the model and TROPOMI. If they think that the emission differences 2020-2019 from the inversion are significant, please provide a quantitative estimation of the uncertainties on the retrieved emissions.

The authors insist that the high TROPOMI HCHO values in April in Northern and Eastern Europe are not too high, citing the negative bias over high-HCHO level sites reported by Vigouroux et al. 2020. The hypothesis that the negative bias is due to aerosol effects is what it is, a hypothesis. In April 2019 over Saint Petersburg (a megacity right in the big HCHO plume on Figure S13), TROPOMI is overestimated by about 35% based on comparisons by Vigouroux et al. Same thing over Sodankyla and Kiruna. Those direct measurements inside the HCHO hot spot are much more relevant than speculations about the possible role of biomass burning. Your manuscript should acknowledge that TROPOMI HCHO is very probably overestimated in that area, for reasons unknown. I do not dispute the fact that the direction of the emission increment in April 2019 goes in the right direction. Of course it does. But you do not need a sophisticated inversion system to infer that emissions were higher than the model prior in April 2019.

The authors did not address my comment that the bias correction might lead to different corrections being applied in 2019 and 2020, thereby creating artificial patterns in the differences between the two years.

I strongly recommend to drop that part on HCHO and the VOCs which, despite relying on a sophisticated inversion scheme, cannot do better than a simple visual inspection of model results and observations. For the NOx part, I recommend strongly to drop Table 2 and reword many parts (including discussion, abstract and conclusions) in order to convey the known limitations and uncertainties of inverse modeling (as acknowledged by the authors, see above).

Minor comments:

- Thanks for the clarification on the AMF and the profile shapes. How does that relate to the averaging kernels (in the definition of e.g. Eskes and Boersma 2003, www.atmos-chem-phys.org/acp/3/1285/) used by other groups to derive total columns from model profiles?

- Thanks for the clarification on the observation error. Note that the TROPOMI precision estimation might actually contain non-random parts. Your inversion system does not account for model errors. Can some crude estimate be provided for those? How could their omission impact the results?

- The HCHO TROPOMI product provides random and systematic error estimates, why not using those? The 4% seems too low considering the large variability among the different sites in Vigouroux et al.

- What are the implications of neglecting diurnal variations of anthropogenic emissions?

- Table 2: please provide uncertainties if you provide numbers which you think could be used by regulatory agencies. There is ample evidence that these numbers should be taken with great caution. Or delete this table if you cannot estimate the uncertainty.

- Seasonality of top-down VOC emissions in 2019: I reiterate my comment. The retrieved patterns indicate primarily anthropogenic emissions, with hot spots in Ruhr and Rhine Valleys, Southern Holland, London, etc. Those are not biogenic emission hotspots. The inversion system is simply unable to bring useful information on the emissions, except that the total VOC emissions were much higher than the prior (and than their 2020 counterparts) in Northern Europe (which is of course very obvious from TROPOMI).

Hide

ED: Reconsider after major revisions (09 Aug 2021) by Michel Van Roozendael

AR by Amir Souri on behalf of the Authors (07 Sep 2021) Author's response Author's tracked changes Manuscript

ED: Referee Nomination & Report Request started (27 Sep 2021) by Michel Van Roozendael

RR by Anonymous Referee #2 (12 Oct 2021)

Suggestions for revision or reasons for rejection

I thank the authors for addressing my comments and updating the manuscript. There are still a few important points which deserve attention (see below), but overall the paper has improved a lot and is now closer to its final form. My main comments are as follows:

1) NO2 model underestimation in rural/moderately polluted areas

As I said in my previous review, "Over many regions (e.g. Ukraine, etc.), the model-data difference for NO2 is systematic and exceeds the TROPOMI error (~1.1E15, Verhoelst et al.)". In their response, the authors seem to imply that the data are just "garbage" there (to use their words), which explains why the inversion system does not take those data into account. I understand that the inversion does what it should do given the specified covariance matrices, but I do not agree that the data are just garbage there.
If we follow Equation (1), the errors of observations consist in the quadratic sum of a fixed error based on comparisons with ground-based data (1.1E15 in clean regions, Verhoelst et al.) and the precision error provided by TROPOMI. But the latter becomes very small when we consider a large number of measurements. Since I am referring, not a single pixel, but to a large area (and a monthly average), this number of measurements is very large, and therefore the precision error is very small. It makes very little doubt that the difference between model and observation is (in this case at least) NOT garbage. Your inversion system does not 'see' this because the covariance matrix S0 is assumed diagonal, but the fact is that the model-data discrepancy is found over a very wide area (and several consecutive months!). The systematic nature of the difference and the fact that it exceeds the TROPOMI dispersion over rural/moderately polluted areas make me conclude that the model underestimation is significant. Is this due to the biogenic emissions or something else, I don't know. I agree that it might not be wise to correct for this with higher biogenic emission errors (although it could be interesting to check, but I admit this is time-consuming). Nevertheless, the authors should discuss this and propose possible explanations, instead of pretending that there is nothing to see.
It is probably difficult to take into account non-diagonal elements (correlations) of the matrix S0. It would be useful to discuss whether this would be possible or not. A possible get-around is to introduce correlations in the a priori emission errors (S_e matrix). It is not the same thing of course, but it can have the effect of allowing larger emission increments without increasing the a priori emission errors. Non-zero correlations would make much sense in the case of smoothly-varying natural emissions. The use of decorrelation lengths is very common in assimilation.

2) Comparisons with surface measurements

Thanks for expanding the discussion and including statistics. There are valid points in the argumentation, although I find the reasoning a bit confusing. It is first stated that, certainly, the emissions "are the primary culprit" for the strong underestimation of the ground-based data. By the way, this underestimation is found for the model constrained by TROPOMI in regions where the signal is very strong. Therefore if the emissions are indeed much too low (to account for the strong underestimation of ground-based measurements), what does it say about the comparison with TROPOMI? There seems to be a contradiction but the authors avoid that question. Instead, they state that there is a fundamental issue, even "a taboo", the spatial representativeness of point measurements, as if this notion was totally new. The representativeness issue was the first possible cause that I mentioned in my comment, besides interferences and model errors. But if the point measurements have such a very low representativeness and make comparisons with models inconclusive, then why include this comparison at all ? Or, if something can be concluded from this comparison, what is it? This is really not clear to me. Sure, it provides indications on the 2020-2019 differences as seen by the stations, but if their representativeness is so low (as suggested by the model underestimation despite the TROPOMI constraint), the stations might disproportionately reflect urban air, which the Covid lockdowns might have affected differently from other areas.

If the authors keep that section, they should provide a more coherent discussion of the possible causes for discrepancies.

Among others, change "is most likely due to the underestimation of CEDS" into "could be partly due to the underestimation of CEDS". The study of Sun et al. is just one study on Po Valley emissions. In Table S3 and S4, mean biases etc. in ppbv are not very useful. Better show relative deviations.

I reiterate my question regarding the implications of neglecting diurnal variations of anthropogenic emissions. You provided an elaborate response regarding the impact on ozone, but I am interested in the comparisons with TROPOMI NO2 (overpass at 13h30).

l. 55 : "physiochemical" : surely you mean physicochemical?
l. 76 : same thing

Hide

ED: Publish as is (16 Nov 2021) by Michel Van Roozendael

AR by Amir Souri on behalf of the Authors (17 Nov 2021) Manuscript

Download

Article (25363 KB)
Full-text XML

Short summary

The global pandemic is believed to have an impact on emissions of air pollutants such as nitrogen dioxide (NO₂) and formaldehyde (HCHO). This study quantifies the changes in the amount of NO_x and VOC emissions via state-of-the-art inverse modeling technique using satellite observations during the lockdown 2020 with respect to a baseline over Europe, which in turn, it permits unraveling atmospheric processes being responsible for ozone formation in a less cloudy month.