Articles | Volume 25, issue 13
https://doi.org/10.5194/acp-25-7485-2025
© Author(s) 2025. This work is distributed under the Creative Commons Attribution 4.0 License.
Technical note: Reconstructing missing surface aerosol elemental carbon data in long-term series with ensemble learning
Download
- Final revised paper (published on 15 Jul 2025)
- Supplement to the final revised paper
- Preprint (discussion started on 25 Nov 2024)
- Supplement to the preprint
Interactive discussion
Status: closed
Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor
| : Report abuse
-
RC1: 'Comment on egusphere-2024-2776', Anonymous Referee #2, 20 Dec 2024
- AC1: 'Reply on RC1', Yunjiang Zhang, 06 Apr 2025
-
RC2: 'Comment on egusphere-2024-2776', Anonymous Referee #3, 29 Jan 2025
- AC2: 'Reply on RC2', Yunjiang Zhang, 06 Apr 2025
-
RC3: 'Comment on egusphere-2024-2776', Anonymous Referee #4, 08 Feb 2025
- AC3: 'Reply on RC3', Yunjiang Zhang, 06 Apr 2025
Peer review completion
AR: Author's response | RR: Referee report | ED: Editor decision | EF: Editorial file upload
AR by Yunjiang Zhang on behalf of the Authors (06 Apr 2025)
Author's response
Author's tracked changes
Manuscript
ED: Publish as is (10 Apr 2025) by Dantong Liu
AR by Yunjiang Zhang on behalf of the Authors (16 Apr 2025)
Manuscript
Long-term in-situ observations of black carbon aerosols are crucial for studying their environmental and climatic effects. However, in real-world observational studies, there are several inevitable technical challenges, such as data gaps. This paper proposes a machine learning method that elegantly addresses this issue. The method is applied to reconstruct time-series data of elemental carbon (EC) aerosols from four cities in eastern China. The results are also validated by comparing them with other datasets. Furthermore, the paper introduces a novel method for assessing the driving factors of long-term trends in elemental carbon, as well as evaluating the uncertainty associated with this approach. I believe both methods hold significant value for the field of atmospheric monitoring. Overall, the paper is well designed and written. However, I have the following points that the authors should address:
The authors introduce MERRA-2 black carbon column concentration data as one of the predictor variables. They also compare MERRA-2 near-surface black carbon concentrations and find that the MERRA-2 data tends to overestimate the site's elemental carbon data. I suggest that the authors conduct a sensitivity test by training the machine learning model without using MERRA-2 black carbon column concentration as a predictor variable and compare the results with the current ones.
The trend changes in EC aerosols are influenced by both meteorological conditions and emissions. In eastern China, the sources of black carbon generally include vehicle emissions and industrial coal combustion. While the paper quantifies the overall anthropogenic emission trend drivers, there is relatively little information on specific emission sectors, which may be a limitation of the method employed. The paper analyzes the daily variation of EC over the years and suggests that the reduction of motor vehicle emissions may be a major factor driving the decline in EC levels. I suggest that the authors could try to extend this analysis by investigating the trend changes of EC during vehicle emission rush hours or by quantifying the driving factors for these peak periods. This could provide a more detailed understanding of the trend changes.
The authors use the ridge regression algorithm for the multivariate regression analysis but do not employ the traditional multiple linear regression algorithm. I recommend that the authors clarify this choice. Additionally, regarding Equation 1, the expression may cause confusion because GBRTs, XGBoost, and RF are abbreviations for different machine learning algorithms, yet they are presented as variables in the formula. I suggest the authors optimize the notation for clarity.
Line 148 – 149: Appropriate references should be cited to support the use of these pollutants as tracers for source characterization.
Line 239: The phrase "Reconstruction of missing data of EC and trend analysis" should be revised to "Reconstruction of missing data of EC and comparison".
Line 336 – 337: The discussion on the impact of COVID-19 lockdowns on EC trend changes is well noted as a factual observation. Could the authors further discussion or quantify such impact?