Atmospheric Pollution Research | |
A comparison of statistical and machine-learning approaches for spatiotemporal modeling of nitrogen dioxide across Switzerland | |
article | |
Tze-Li Liu1  Benjamin Flückiger1  Kees de Hoogh1  | |
[1] Swiss Tropical and Public Health Institute;University of Basel | |
关键词: Spatiotemporal models; Air pollution; Satellite; Exposure assessment; Land use regression; Machine learning; | |
DOI : 10.1016/j.apr.2022.101611 | |
学科分类:农业科学(综合) | |
来源: Dokuz Eylul Universitesi * Department of Environmental Engineering | |
【 摘 要 】
Land use regression modeling has commonly been used to model ambient air pollutant concentrations in environmental epidemiological studies. Recently, other statistical and machine-learning methods have also been applied to model air pollution, but their relative strengths and limitations have not been extensively investigated. In this study, we developed and compared land-use statistical and machine-learning models at annual, monthly and daily scales estimating ground-level NO 2 concentrations across Switzerland (at high spatial resolution 100 × 100 m). Our study showed that the best model type varies with context, particularly with temporal resolution and training data size. Linear-regression-based models were useful in predicting long-term (annual, monthly) spatial distribution of NO 2 and outperformed machine-learning models. However, linear-regression-based models were limited in representing short-term temporal variation even when predictor variables with temporal variability were provided. Machine-learning models showed high capability in predicting short-term temporal variation and outperformed linear-regression-based models for modeling NO 2 variation at high temporal resolution (daily). However, the best performing models, XGBoost and LightGBM, constantly overfit on training data and may result in erratic patterns in the model-estimated concentration surfaces. Therefore, the temporal and spatial scale of the study is an important factor on which the choice of the suitable model type should be based and validation is required whatever approach is used.
【 授权许可】
CC BY
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
RO202302100000040ZK.pdf | 4248KB | download |