Climate Change Data Portal
DOI | 10.1016/j.compenvurbsys.2019.01.006 |
Assessing the spatial sensitivity of a random forest model: Application in gridded population modeling | |
Sinha, Parmanand1; Gaughan, Andrea E.1; Stevens, Forrest R.1; Nieves, Jeremiah J.2,3; Sorichetta, Alessandro2,3; Tatem, Andrew J.2,3 | |
发表日期 | 2019 |
ISSN | 0198-9715 |
EISSN | 1873-7587 |
卷号 | 75页码:132-145 |
英文摘要 | Gridded human population data provide a spatial denominator to identify populations at risk, quantify burdens, and inform our understanding of human-environment systems. When modeling gridded population, the information used for training the model may differ in spatial resolution than what is produced by the model prediction. This case arises when approaching population modeling from a top-down, dasymetric approach in which one redistributes coarse administrative unit level population data (i.e., source unit) to a finer scale (i.e., target unit). However, often overlooked are issues associated with the differing variance across the scale, spatial autocorrelation and bias in sampling techniques. In this study, we examine the effects of intentionally biasing our sampling from the source to target scale Within the context of a weighted, dasymetric mapping approach. The weighted component is based on a Random Forest estimator, which is a non-parametric ensemble-based prediction model. We investigate issues of autocorrelation and heterogeneity in the training data using 18 different types of samples to show the variations in training, census-level (i.e., source) and output, grid-level (i.e., target) predictions. We compare results to simple random sampling and geographically stratified random sampling. Results indicate that the Random Forest model is sensitive to the spatial autocorrelation inherent in the training data, which leads to an increase in the variance of the residuals. Sample training datasets that are at a spatial scale representative of the true population produced the best fitting models. However, the true representative dataset varied in autocorrelation for both scales. More attention is needed with ensemble-based learning and spatially-heterogeneous data as underlying issues of spatial autocorrelation influence results for both the census-level and grid-level estimations. |
WOS研究方向 | Computer Science ; Engineering ; Environmental Sciences & Ecology ; Geography ; Operations Research & Management Science ; Public Administration |
来源期刊 | COMPUTERS ENVIRONMENT AND URBAN SYSTEMS
![]() |
文献类型 | 期刊论文 |
条目标识符 | http://gcip.llas.ac.cn/handle/2XKMVOVA/97401 |
作者单位 | 1.Univ Louisville, Dept Geog & Geosci, Louisville, KY 40292 USA; 2.Univ Southampton, Dept Geog & Environm, WorldPop, Southampton SO17 1BJ, Hants, England; 3.Flowminder Fdn, Stockholm, Sweden |
推荐引用方式 GB/T 7714 | Sinha, Parmanand,Gaughan, Andrea E.,Stevens, Forrest R.,et al. Assessing the spatial sensitivity of a random forest model: Application in gridded population modeling[J],2019,75:132-145. |
APA | Sinha, Parmanand,Gaughan, Andrea E.,Stevens, Forrest R.,Nieves, Jeremiah J.,Sorichetta, Alessandro,&Tatem, Andrew J..(2019).Assessing the spatial sensitivity of a random forest model: Application in gridded population modeling.COMPUTERS ENVIRONMENT AND URBAN SYSTEMS,75,132-145. |
MLA | Sinha, Parmanand,et al."Assessing the spatial sensitivity of a random forest model: Application in gridded population modeling".COMPUTERS ENVIRONMENT AND URBAN SYSTEMS 75(2019):132-145. |
条目包含的文件 | 条目无相关文件。 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。