使用 DBSCAN 算法对 BPJS 国民健康保险参保人进行聚类

Jurnal Varian Pub Date : 2022-11-13 DOI:10.30812/varian.v6i1.1886

Wiwit Pura Nurmayanti, D. Ratnaningsih, Sausan Nisrina, Abdul Rahim, Muhammad Malthuf, Wirajaya Kusuma

{"title":"使用 DBSCAN 算法对 BPJS 国民健康保险参保人进行聚类","authors":"Wiwit Pura Nurmayanti, D. Ratnaningsih, Sausan Nisrina, Abdul Rahim, Muhammad Malthuf, Wirajaya Kusuma","doi":"10.30812/varian.v6i1.1886","DOIUrl":null,"url":null,"abstract":"In the current era of Big Data, getting data is no longer a difficult thing because they can access easily it via the internet, which is open access. A large amount of data can cause many problems in the data, such as data that deviates too far from the average (outliers). The method used to handle outlier data is DBSCAN which is density based clustering. The DBSCAN can be applied in various fields, one of which is the social sector, namely the participation of the JKN BPJS Health in West Nusa Tenggara. This study sees the distribution of BPJS Health participation groups, and to detect outliers so that objects with noise are not included in the cluster. The results of the study using the DBSCAN algorithm show that the optimal epsilon value is between 0.37 points by observing the knee of a curve. and MinPts 3, with the highest silhouette value of 0.2763. The highest JKN BPJS participants are in cluster 1 with 5 sub-districts, the second highest cluster is cluster 3 with 5 sub-districts, while the lowest cluster is cluster 2 with 93 sub-districts. The 13 sub-districts are not included in any group because they are noise data.","PeriodicalId":188119,"journal":{"name":"Jurnal Varian","volume":"24 7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Clustrering of BPJS National Health Insurance Participant Using DBSCAN Algorithm\",\"authors\":\"Wiwit Pura Nurmayanti, D. Ratnaningsih, Sausan Nisrina, Abdul Rahim, Muhammad Malthuf, Wirajaya Kusuma\",\"doi\":\"10.30812/varian.v6i1.1886\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the current era of Big Data, getting data is no longer a difficult thing because they can access easily it via the internet, which is open access. A large amount of data can cause many problems in the data, such as data that deviates too far from the average (outliers). The method used to handle outlier data is DBSCAN which is density based clustering. The DBSCAN can be applied in various fields, one of which is the social sector, namely the participation of the JKN BPJS Health in West Nusa Tenggara. This study sees the distribution of BPJS Health participation groups, and to detect outliers so that objects with noise are not included in the cluster. The results of the study using the DBSCAN algorithm show that the optimal epsilon value is between 0.37 points by observing the knee of a curve. and MinPts 3, with the highest silhouette value of 0.2763. The highest JKN BPJS participants are in cluster 1 with 5 sub-districts, the second highest cluster is cluster 3 with 5 sub-districts, while the lowest cluster is cluster 2 with 93 sub-districts. The 13 sub-districts are not included in any group because they are noise data.\",\"PeriodicalId\":188119,\"journal\":{\"name\":\"Jurnal Varian\",\"volume\":\"24 7 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Jurnal Varian\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.30812/varian.v6i1.1886\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Jurnal Varian","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.30812/varian.v6i1.1886","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

在当前的大数据时代，获取数据不再是一件困难的事情，因为他们可以通过开放的互联网轻松获取数据。大量数据会给数据带来很多问题，比如数据偏离平均值太远（离群值）。处理离群数据的方法是 DBSCAN，这是一种基于密度的聚类方法。DBSCAN 可应用于多个领域，其中之一是社会领域，即西努沙登加拉省 JKN BPJS 健康的参与情况。本研究旨在了解 BPJS 健康参与群体的分布情况，并检测异常值，从而避免将带有噪声的对象纳入聚类。使用 DBSCAN 算法的研究结果表明，通过观察曲线的膝盖，最佳ε值介于 0.37 点和 MinPts 3 之间，最高剪影值为 0.2763。JKN BPJS 参与者人数最多的是有 5 个分区的第 1 群组，第二多的是有 5 个分区的第 3 群组，而最少的是有 93 个分区的第 2 群组。13 个分区因属于噪音数据而未被纳入任何组别。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Clustrering of BPJS National Health Insurance Participant Using DBSCAN Algorithm

In the current era of Big Data, getting data is no longer a difficult thing because they can access easily it via the internet, which is open access. A large amount of data can cause many problems in the data, such as data that deviates too far from the average (outliers). The method used to handle outlier data is DBSCAN which is density based clustering. The DBSCAN can be applied in various fields, one of which is the social sector, namely the participation of the JKN BPJS Health in West Nusa Tenggara. This study sees the distribution of BPJS Health participation groups, and to detect outliers so that objects with noise are not included in the cluster. The results of the study using the DBSCAN algorithm show that the optimal epsilon value is between 0.37 points by observing the knee of a curve. and MinPts 3, with the highest silhouette value of 0.2763. The highest JKN BPJS participants are in cluster 1 with 5 sub-districts, the second highest cluster is cluster 3 with 5 sub-districts, while the lowest cluster is cluster 2 with 93 sub-districts. The 13 sub-districts are not included in any group because they are noise data.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Jurnal Varian

自引率

0.00%

发文量