PyGRF: An Improved Python Geographical Random Forest Model and Case Studies in Public Health and Natural Disasters

IF 2.1 3区 地球科学 Q2 GEOGRAPHY Transactions in GIS Pub Date : 2024-09-17 DOI:10.1111/tgis.13248
Kai Sun, Ryan Zhenqi Zhou, Jiyeon Kim, Yingjie Hu
{"title":"PyGRF: An Improved Python Geographical Random Forest Model and Case Studies in Public Health and Natural Disasters","authors":"Kai Sun, Ryan Zhenqi Zhou, Jiyeon Kim, Yingjie Hu","doi":"10.1111/tgis.13248","DOIUrl":null,"url":null,"abstract":"Geographical random forest (GRF) is a recently developed and spatially explicit machine learning model. With the ability to provide more accurate predictions and local interpretations, GRF has already been used in many studies. The current GRF model, however, has limitations in its determination of the local model weight and bandwidth hyperparameters, potentially insufficient numbers of local training samples, and sometimes high local prediction errors. Also, implemented as an R package, GRF currently does not have a Python version which limits its adoption among machine learning practitioners who prefer Python. This work addresses these limitations by introducing theory‐informed hyperparameter determination, local training sample expansion, and spatially weighted local prediction. We also develop a Python‐based GRF model and package, PyGRF, to facilitate the use of the model. We evaluate the performance of PyGRF on an example dataset and further demonstrate its use in two case studies in public health and natural disasters.","PeriodicalId":47842,"journal":{"name":"Transactions in GIS","volume":null,"pages":null},"PeriodicalIF":2.1000,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transactions in GIS","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.1111/tgis.13248","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GEOGRAPHY","Score":null,"Total":0}
引用次数: 0

Abstract

Geographical random forest (GRF) is a recently developed and spatially explicit machine learning model. With the ability to provide more accurate predictions and local interpretations, GRF has already been used in many studies. The current GRF model, however, has limitations in its determination of the local model weight and bandwidth hyperparameters, potentially insufficient numbers of local training samples, and sometimes high local prediction errors. Also, implemented as an R package, GRF currently does not have a Python version which limits its adoption among machine learning practitioners who prefer Python. This work addresses these limitations by introducing theory‐informed hyperparameter determination, local training sample expansion, and spatially weighted local prediction. We also develop a Python‐based GRF model and package, PyGRF, to facilitate the use of the model. We evaluate the performance of PyGRF on an example dataset and further demonstrate its use in two case studies in public health and natural disasters.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
PyGRF:改进的 Python 地理随机森林模型及公共卫生和自然灾害案例研究
地理随机森林(GRF)是最近开发的一种空间明确的机器学习模型。由于能够提供更准确的预测和局部解释,GRF 已被许多研究采用。然而,当前的 GRF 模型在确定本地模型权重和带宽超参数方面存在局限性,可能存在本地训练样本数量不足的问题,有时本地预测误差较高。此外,GRF 是作为 R 软件包实现的,目前还没有 Python 版本,这限制了它在偏好 Python 的机器学习从业者中的应用。本研究通过引入基于理论的超参数确定、局部训练样本扩展和空间加权局部预测来解决这些局限性。我们还开发了基于 Python 的 GRF 模型和软件包 PyGRF,以方便模型的使用。我们在一个示例数据集上评估了 PyGRF 的性能,并在公共卫生和自然灾害的两个案例研究中进一步展示了其用途。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Transactions in GIS
Transactions in GIS GEOGRAPHY-
CiteScore
4.60
自引率
8.30%
发文量
116
期刊介绍: Transactions in GIS is an international journal which provides a forum for high quality, original research articles, review articles, short notes and book reviews that focus on: - practical and theoretical issues influencing the development of GIS - the collection, analysis, modelling, interpretation and display of spatial data within GIS - the connections between GIS and related technologies - new GIS applications which help to solve problems affecting the natural or built environments, or business
期刊最新文献
Knowledge‐Guided Automated Cartographic Generalization Process Construction: A Case Study Based on Map Analysis of Public Maps of China City Influence Network: Mining and Analyzing the Influence of Chinese Cities Based on Social Media PyGRF: An Improved Python Geographical Random Forest Model and Case Studies in Public Health and Natural Disasters Neural Sensing: Toward a New Approach to Understanding Emotional Responses to Place Construction of Earth Observation Knowledge Hub Based on Knowledge Graph
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1