CCPortal
DOI10.1016/j.atmosenv.2020.117479
Spatial modelling of particulate matter air pollution sensor measurements collected by community scientists while cycling, land use regression with spatial cross-validation, and applications of machine learning for data correction
Adams M.D.; Massey F.; Chastko K.; Cupini C.
发表日期2020
ISSN1352-2310
卷号230
英文摘要Fine particulate matter air pollution is a global issue; cycling is a global activity. In our paper, particulate matter less than 2.5 μm (PM2.5) air pollution data obtained by community scientists while cycling is used to develop high-resolution spatial air pollution maps. Mapping is completed using a land use regression model for Charlotte, North Carolina. The air pollution observations were obtained with a low-cost sensor. We evaluated the accuracy of the sensor through a collocation study for 3203 h, which identified the sensor had a mean bias of 7.25 μg/m3 and a correlation of r = 0.77 with an US EPA Federal Equivalent Monitor. A machine learning model was developed to adjust the sensor observations, which demonstrated their highest errors during periods of high humidity. The adjustment was able to reduce the root mean squared error from 12 μg/m3 to 3.8 μg/m3, and the mean bias was reduced to −0.5 μg/m3. Cycling times were not balanced throughout the day nor the year. We applied a temporal adjustment algorithm to account for this imbalance in observation periods with the intention of producing long-term estimates representing the sampling period of 2016 and 2017. The long-term air pollution surface for the city was generated with a land use regression model. Both linear regression and machine learning approaches were applied. The linear regression model performed poorly with a training R2 of 0.15 and a cross-validation R2 of 0.15. A stacked ensemble model was developed using machine learning, which had a training 5-fold cross-validation mean residual deviance of 3.82 μg/m3, a root mean squared error of 1.95 μg/m3, and a mean absolute error of 0.95 μg/m3. Performance remained strong during cross-validation, which included both a random sample approach (RMSE = 1.52 μg/m3) and a spatial blocking cross-validation method (RMSE = 2.8 μg/m3). © 2020 Elsevier Ltd
英文关键词Air pollution; Citizen science; Community science; Cross-validation; Cycling; Land use regression; Machine learning; Particulate matter
语种英语
来源期刊Atmospheric Environment
文献类型期刊论文
条目标识符http://gcip.llas.ac.cn/handle/2XKMVOVA/129525
作者单位Department of Geography, University of Toronto Mississauga, 3359 Mississauga Rd, Mississauga, ON L5L 1C6, Canada; Clean Air Carolina, PO Box 5311, Charlotte, NC 28299, United States
推荐引用方式
GB/T 7714
Adams M.D.,Massey F.,Chastko K.,et al. Spatial modelling of particulate matter air pollution sensor measurements collected by community scientists while cycling, land use regression with spatial cross-validation, and applications of machine learning for data correction[J],2020,230.
APA Adams M.D.,Massey F.,Chastko K.,&Cupini C..(2020).Spatial modelling of particulate matter air pollution sensor measurements collected by community scientists while cycling, land use regression with spatial cross-validation, and applications of machine learning for data correction.Atmospheric Environment,230.
MLA Adams M.D.,et al."Spatial modelling of particulate matter air pollution sensor measurements collected by community scientists while cycling, land use regression with spatial cross-validation, and applications of machine learning for data correction".Atmospheric Environment 230(2020).
条目包含的文件
条目无相关文件。
个性服务
推荐该条目
保存到收藏夹
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Adams M.D.]的文章
[Massey F.]的文章
[Chastko K.]的文章
百度学术
百度学术中相似的文章
[Adams M.D.]的文章
[Massey F.]的文章
[Chastko K.]的文章
必应学术
必应学术中相似的文章
[Adams M.D.]的文章
[Massey F.]的文章
[Chastko K.]的文章
相关权益政策
暂无数据
收藏/分享

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。