Climate Change Data Portal
DOI | 10.1016/j.atmosenv.2020.117479 |
Spatial modelling of particulate matter air pollution sensor measurements collected by community scientists while cycling, land use regression with spatial cross-validation, and applications of machine learning for data correction | |
Adams M.D.; Massey F.; Chastko K.; Cupini C. | |
发表日期 | 2020 |
ISSN | 1352-2310 |
卷号 | 230 |
英文摘要 | Fine particulate matter air pollution is a global issue; cycling is a global activity. In our paper, particulate matter less than 2.5 μm (PM2.5) air pollution data obtained by community scientists while cycling is used to develop high-resolution spatial air pollution maps. Mapping is completed using a land use regression model for Charlotte, North Carolina. The air pollution observations were obtained with a low-cost sensor. We evaluated the accuracy of the sensor through a collocation study for 3203 h, which identified the sensor had a mean bias of 7.25 μg/m3 and a correlation of r = 0.77 with an US EPA Federal Equivalent Monitor. A machine learning model was developed to adjust the sensor observations, which demonstrated their highest errors during periods of high humidity. The adjustment was able to reduce the root mean squared error from 12 μg/m3 to 3.8 μg/m3, and the mean bias was reduced to −0.5 μg/m3. Cycling times were not balanced throughout the day nor the year. We applied a temporal adjustment algorithm to account for this imbalance in observation periods with the intention of producing long-term estimates representing the sampling period of 2016 and 2017. The long-term air pollution surface for the city was generated with a land use regression model. Both linear regression and machine learning approaches were applied. The linear regression model performed poorly with a training R2 of 0.15 and a cross-validation R2 of 0.15. A stacked ensemble model was developed using machine learning, which had a training 5-fold cross-validation mean residual deviance of 3.82 μg/m3, a root mean squared error of 1.95 μg/m3, and a mean absolute error of 0.95 μg/m3. Performance remained strong during cross-validation, which included both a random sample approach (RMSE = 1.52 μg/m3) and a spatial blocking cross-validation method (RMSE = 2.8 μg/m3). © 2020 Elsevier Ltd |
关键词 | Air pollutionCitizen scienceCommunity scienceCross-validationCyclingLand use regressionMachine learningParticulate matter |
语种 | 英语 |
scopus关键词 | Air pollution; Errors; Land use; Learning algorithms; Logistic regression; Machine learning; Mean square error; Charlotte , North Carolina; Cross-validation methods; Land-use regression models; Linear regression models; Machine learning approaches; Particulate matter air pollution; Root mean squared errors; Spatial cross validations; Particles (particulate matter); air quality; atmospheric pollution; land use change; land use planning; machine learning; particulate matter; relative humidity; sampling; sensor; air pollution; Article; controlled study; cross validation; cycling; cyclist; data processing; human; humidity; information processing; land use; linear regression analysis; machine learning; measurement accuracy; particulate matter; pollution monitoring; priority journal; scientist; statistical model; Charlotte; North Carolina; United States |
来源期刊 | ATMOSPHERIC ENVIRONMENT |
文献类型 | 期刊论文 |
条目标识符 | http://gcip.llas.ac.cn/handle/2XKMVOVA/249191 |
作者单位 | Department of Geography, University of Toronto Mississauga, 3359 Mississauga Rd, Mississauga, ON L5L 1C6, Canada; Clean Air Carolina, PO Box 5311, Charlotte, NC 28299, United States |
推荐引用方式 GB/T 7714 | Adams M.D.,Massey F.,Chastko K.,et al. Spatial modelling of particulate matter air pollution sensor measurements collected by community scientists while cycling, land use regression with spatial cross-validation, and applications of machine learning for data correction[J],2020,230. |
APA | Adams M.D.,Massey F.,Chastko K.,&Cupini C..(2020).Spatial modelling of particulate matter air pollution sensor measurements collected by community scientists while cycling, land use regression with spatial cross-validation, and applications of machine learning for data correction.ATMOSPHERIC ENVIRONMENT,230. |
MLA | Adams M.D.,et al."Spatial modelling of particulate matter air pollution sensor measurements collected by community scientists while cycling, land use regression with spatial cross-validation, and applications of machine learning for data correction".ATMOSPHERIC ENVIRONMENT 230(2020). |
条目包含的文件 | 条目无相关文件。 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。