Machine Learning for Mapping and Forecasting Poverty in North Sumatera: A Data-Driven Approach

IF 0.7 | Zone 4 (multidisciplinary journals) | JCR Q3, MULTIDISCIPLINARY SCIENCES | Sains Malaysiana | Pub Date: 2024-07-31 | DOI: 10.17576/jsm-2024-5307-18
Arnita Arnita, F. Marpaung, Fanny Ramadhani, Dewan Dinata
Citations: 0

Abstract

Poverty is a crucial topic because it affects many facets of society, including socioeconomic disparity, crime, and the inability to obtain high-quality education. North Sumatra is one of the provinces with the highest poverty rates in Indonesia. Reducing poverty effectively requires a strategy for gathering accurate data. This study carried out poverty mapping and prediction for North Sumatra using machine learning (ML) to obtain a precise spatial distribution of poverty, a working poverty model, and forecasts. Poverty mapping used the K-Means algorithm, and poverty prediction used a random forest (RF) algorithm. In the elbow graph, the inertia value declined markedly at three and four clusters. On the silhouette index, the three-cluster solution (0.313) outperformed the four-cluster solution (0.244), so three poverty clusters (low, medium, and high) were used in the model. The RF model was tuned with grid search cross-validation, and the best predictions were obtained with the following parameters: n-estimator = 50, max depth = 10, min samples split = 2, and min samples leaf = 1. The mean squared error (MSE) of the RF model's predictions was 0.002617, which the authors describe as satisfactory precision.
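The two steps described in the abstract (choosing a cluster count from inertia and silhouette values, then tuning a random forest by grid search cross-validation) could be sketched roughly as follows. This is an illustration only, not the authors' code: it assumes scikit-learn, which the abstract does not name, treats the forecast as a regression because the result is reported as an MSE, and runs on synthetic data. The helper names score_cluster_counts and tune_random_forest, the feature matrix, the target, and the parameter grid are all hypothetical.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.ensemble import RandomForestRegressor
from sklearn.metrics import mean_squared_error, silhouette_score
from sklearn.model_selection import GridSearchCV, train_test_split


def score_cluster_counts(X, k_values, seed=0):
    """Return {k: (inertia, silhouette)} for each candidate cluster count."""
    scores = {}
    for k in k_values:
        km = KMeans(n_clusters=k, n_init=10, random_state=seed).fit(X)
        scores[k] = (km.inertia_, silhouette_score(X, km.labels_))
    return scores


def tune_random_forest(X_train, y_train, param_grid, seed=0):
    """Grid-search a random forest regressor, scored by cross-validated MSE."""
    search = GridSearchCV(
        RandomForestRegressor(random_state=seed),
        param_grid,
        scoring="neg_mean_squared_error",
        cv=3,
    )
    return search.fit(X_train, y_train)


# Synthetic stand-in for regional indicators (NOT the paper's data).
X, _ = make_blobs(n_samples=150, centers=3, n_features=4, random_state=0)
rng = np.random.default_rng(0)
# Hypothetical target, loosely tied to the first feature plus noise.
y = 1 / (1 + np.exp(-0.3 * X[:, 0])) + rng.normal(0, 0.02, size=len(X))

# Mapping step: inertia feeds the elbow graph, silhouette ranks the candidates.
cluster_scores = score_cluster_counts(X, k_values=range(2, 7))
best_k = max(cluster_scores, key=lambda k: cluster_scores[k][1])

# Forecasting step: the grid below is an assumption chosen to contain the
# parameter values quoted in the abstract; the paper's actual grid is not given.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0
)
param_grid = {
    "n_estimators": [50, 100],
    "max_depth": [5, 10],
    "min_samples_split": [2, 5],
    "min_samples_leaf": [1, 2],
}
search = tune_random_forest(X_train, y_train, param_grid)
test_mse = mean_squared_error(y_test, search.best_estimator_.predict(X_test))

for k, (inertia, sil) in cluster_scores.items():
    print(f"k={k}: inertia={inertia:.1f}, silhouette={sil:.3f}")
print("k with highest silhouette:", best_k)
print("best parameters:", search.best_params_)
print(f"held-out MSE: {test_mse:.6f}")
```

Scoring the search with neg_mean_squared_error keeps model selection aligned with the metric the abstract reports. The numbers printed here come from the synthetic data and are not comparable to the paper's 0.313, 0.244, or 0.002617.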
Source journal: Sains Malaysiana (MULTIDISCIPLINARY SCIENCES)
CiteScore: 1.60
Self-citation rate: 12.50%
Articles published: 196
Review time: 3-6 weeks
About the journal: Sains Malaysiana is a refereed journal committed to the advancement of scholarly knowledge and research findings in the several branches of science and technology. It contains articles on Earth Sciences, Health Sciences, Life Sciences, Mathematical Sciences and Physical Sciences. The journal publishes articles, reviews, and research notes whose content and approach are of interest to a wide range of scholars. Sains Malaysiana is published by UKM Press, and its autonomous Editorial Board is drawn from the Faculty of Science and Technology, Universiti Kebangsaan Malaysia. In addition, distinguished scholars from local and foreign universities are appointed to serve as advisory board members and referees.