Clustrering of BPJS National Health Insurance Participant Using DBSCAN Algorithm

Wiwit Pura Nurmayanti, D. Ratnaningsih, Sausan Nisrina, Abdul Rahim, Muhammad Malthuf, Wirajaya Kusuma
{"title":"Clustrering of BPJS National Health Insurance Participant Using DBSCAN Algorithm","authors":"Wiwit Pura Nurmayanti, D. Ratnaningsih, Sausan Nisrina, Abdul Rahim, Muhammad Malthuf, Wirajaya Kusuma","doi":"10.30812/varian.v6i1.1886","DOIUrl":null,"url":null,"abstract":"In the current era of Big Data, getting data is no longer a difficult thing because they can access easily it via the internet, which is open access. A large amount of data can cause many problems in the data, such as data that deviates too far from the average (outliers). The method used to handle outlier data is DBSCAN which is density based clustering. The DBSCAN can be applied in various fields, one of which is the social sector, namely the participation of the JKN BPJS Health in West Nusa Tenggara. This study sees the distribution of BPJS Health participation groups, and to detect outliers so that objects with noise are not included in the cluster. The results of the study using the DBSCAN algorithm show that the optimal epsilon value is between 0.37 points by observing the knee of a curve. and MinPts 3, with the highest silhouette value of 0.2763. The highest JKN BPJS participants are in cluster 1 with 5 sub-districts, the second highest cluster is cluster 3 with 5 sub-districts, while the lowest cluster is cluster 2 with 93 sub-districts. The 13 sub-districts are not included in any group because they are noise data.","PeriodicalId":188119,"journal":{"name":"Jurnal Varian","volume":"24 7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Jurnal Varian","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.30812/varian.v6i1.1886","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

In the current era of Big Data, getting data is no longer a difficult thing because they can access easily it via the internet, which is open access. A large amount of data can cause many problems in the data, such as data that deviates too far from the average (outliers). The method used to handle outlier data is DBSCAN which is density based clustering. The DBSCAN can be applied in various fields, one of which is the social sector, namely the participation of the JKN BPJS Health in West Nusa Tenggara. This study sees the distribution of BPJS Health participation groups, and to detect outliers so that objects with noise are not included in the cluster. The results of the study using the DBSCAN algorithm show that the optimal epsilon value is between 0.37 points by observing the knee of a curve. and MinPts 3, with the highest silhouette value of 0.2763. The highest JKN BPJS participants are in cluster 1 with 5 sub-districts, the second highest cluster is cluster 3 with 5 sub-districts, while the lowest cluster is cluster 2 with 93 sub-districts. The 13 sub-districts are not included in any group because they are noise data.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用 DBSCAN 算法对 BPJS 国民健康保险参保人进行聚类
在当前的大数据时代,获取数据不再是一件困难的事情,因为他们可以通过开放的互联网轻松获取数据。大量数据会给数据带来很多问题,比如数据偏离平均值太远(离群值)。处理离群数据的方法是 DBSCAN,这是一种基于密度的聚类方法。DBSCAN 可应用于多个领域,其中之一是社会领域,即西努沙登加拉省 JKN BPJS 健康的参与情况。本研究旨在了解 BPJS 健康参与群体的分布情况,并检测异常值,从而避免将带有噪声的对象纳入聚类。使用 DBSCAN 算法的研究结果表明,通过观察曲线的膝盖,最佳ε值介于 0.37 点和 MinPts 3 之间,最高剪影值为 0.2763。JKN BPJS 参与者人数最多的是有 5 个分区的第 1 群组,第二多的是有 5 个分区的第 3 群组,而最少的是有 93 个分区的第 2 群组。13 个分区因属于噪音数据而未被纳入任何组别。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Application of Principal Component Regression in Analyzing Factors Affecting Human Development Index Impact of SST Anomalies on Coral Reefs Damage Based on Copula Analysis The NADI Mathematical Model on the Danger Level of the Bili-Bili Dam Regression Model of Land Area and Amount of Production to the Selling Price of Corn K-Means – Resilient Backpropagation Neural Network in Predicting Poverty Levels
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1