Handling Multicollinearity on Social Spatial Data Using Geographically Weighted Random Forest

SAR Journal Pub Date : 2023-09-26 DOI:10.18421/sar63-02
Binti Kurniati, Yuliani Setia Dewi, Alfian Futuhul Hadi
{"title":"Handling Multicollinearity on Social Spatial Data Using Geographically Weighted Random Forest","authors":"Binti Kurniati, Yuliani Setia Dewi, Alfian Futuhul Hadi","doi":"10.18421/sar63-02","DOIUrl":null,"url":null,"abstract":"Crime includes all kinds of harmful acts that violate the laws in force in Indonesia as well as social and religious norms. The crime total is the number of incidents reported to the police, obtained from public reports and events where the perpetrators were caught red-handed by the police. We can use the Poisson model to analyze the data, but the existence of spatial heterogeneity in the data makes the model less accurate. This research investigates the methods when there is spatial heterogeneity in the data by using Geographically weighted regression (GWR), Geographically Weighted Poisson Regression (GWPR) and Geographically Weighted Random Forest (GW-RF). We compare the GWR, GWPR, and GW-RF models for criminal cases in East Java in handling multicollinearity in the data. The results of this study indicate that the GW-RF model is better for modeling criminal cases with the smallest RMSE and MAPE values and an R-Square value close to 1. Based on the three most important variables in each location, they form six groups of regencies/cities in East Java, Indonesia. The variables vary between groups and the poverty severity index is not included in the three most important variables in all locations.","PeriodicalId":487006,"journal":{"name":"SAR Journal","volume":"44 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"SAR Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18421/sar63-02","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Crime includes all kinds of harmful acts that violate the laws in force in Indonesia as well as social and religious norms. The crime total is the number of incidents reported to the police, obtained from public reports and events where the perpetrators were caught red-handed by the police. We can use the Poisson model to analyze the data, but the existence of spatial heterogeneity in the data makes the model less accurate. This research investigates the methods when there is spatial heterogeneity in the data by using Geographically weighted regression (GWR), Geographically Weighted Poisson Regression (GWPR) and Geographically Weighted Random Forest (GW-RF). We compare the GWR, GWPR, and GW-RF models for criminal cases in East Java in handling multicollinearity in the data. The results of this study indicate that the GW-RF model is better for modeling criminal cases with the smallest RMSE and MAPE values and an R-Square value close to 1. Based on the three most important variables in each location, they form six groups of regencies/cities in East Java, Indonesia. The variables vary between groups and the poverty severity index is not included in the three most important variables in all locations.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用地理加权随机森林处理社会空间数据的多重共线性
犯罪包括违反印度尼西亚现行法律以及社会和宗教规范的各种有害行为。犯罪总数是指向警方报告的事件数量,这些事件来自公开报告和肇事者被警方当场抓获的事件。我们可以使用泊松模型对数据进行分析,但由于数据存在空间异质性,使得模型的精度降低。本文采用地理加权回归(GWR)、地理加权泊松回归(GWPR)和地理加权随机森林(GW-RF)对数据存在空间异质性时的处理方法进行了探讨。我们比较了东爪哇刑事案件GWR、GWPR和GW-RF模型在处理数据多重共线性方面的效果。研究结果表明,当RMSE和MAPE值最小,且r平方值接近1时,GW-RF模型更适合对刑事案件进行建模。根据每个地点的三个最重要的变量,它们在印度尼西亚东爪哇形成了六组摄政/城市。这些变量因群体而异,贫困严重程度指数并没有包括在所有地区最重要的三个变量中。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
The Technology Acceptance of Intelligent Silaturrahmi-based Collaboration Gamification Mechanic (ISb-GM) in Small Medium Enterprise Dеvising a Modеl AI-UTAUT by Combining Artificial Inteligence AI with Unifiеd Thеory of Accеptancе and Usе of Tеchnology (UTAUT) Strengthening Loyalty and Performance of Government Office Employees: Exploring Leadership Strategies The Profile Ecoliteracy of Students at Adiwiyata School Handling Multicollinearity on Social Spatial Data Using Geographically Weighted Random Forest
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1