基于k -最近邻算法的心脏病分类模型

2021 Sixth International Conference on Informatics and Computing (ICIC) Pub Date : 2021-11-03 DOI:10.1109/ICIC54025.2021.9632918

Ben Rahman, H. L. Hendric Spits Warnars, Boy Subirosa Sabarguna, W. Budiharto

{"title":"基于k -最近邻算法的心脏病分类模型","authors":"Ben Rahman, H. L. Hendric Spits Warnars, Boy Subirosa Sabarguna, W. Budiharto","doi":"10.1109/ICIC54025.2021.9632918","DOIUrl":null,"url":null,"abstract":"Heart disease is a disease that needs to be watched out for and is of particular concern. Seeing to the WHO report, in 2018, as many as 17.9 million people died from heart disease, and especially in Indonesia, heart disease in 2020 became the highest cause of death. This study uses data mining techniques to pull out information from the data used. This research provides a scientific contribution, namely detecting heart disease as early as possible. In this case, the author uses the K-Nearest Neighbor Algorithm to classify the data based on the nearest neighbor data. The database is own in a reasonably high volume, so it should note that irrelevant attributes will be removed over or noise. If they are still used, data processing results will not be optimal, so data cleaning needs to be done carefully. The selection of the data used was 1243 records, and after being selected the data were taken in this study as many as 366 records, with parameters using 12 attributes, actual data from hospitals, data consisting of data from patients under surveillance for cardiac care, and data from patients who underwent surgery and Data from Medical Examination. Therefore, it is necessary to develop a decision support system that assists doctors in taking steps for early detection. Research conducted with the K-Nearest Neighbors algorithm accuracy up to 77% with a value of K = 7.","PeriodicalId":189541,"journal":{"name":"2021 Sixth International Conference on Informatics and Computing (ICIC)","volume":"395 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Heart Disease Classification Model Using K-Nearest Neighbor Algorithm\",\"authors\":\"Ben Rahman, H. L. Hendric Spits Warnars, Boy Subirosa Sabarguna, W. Budiharto\",\"doi\":\"10.1109/ICIC54025.2021.9632918\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Heart disease is a disease that needs to be watched out for and is of particular concern. Seeing to the WHO report, in 2018, as many as 17.9 million people died from heart disease, and especially in Indonesia, heart disease in 2020 became the highest cause of death. This study uses data mining techniques to pull out information from the data used. This research provides a scientific contribution, namely detecting heart disease as early as possible. In this case, the author uses the K-Nearest Neighbor Algorithm to classify the data based on the nearest neighbor data. The database is own in a reasonably high volume, so it should note that irrelevant attributes will be removed over or noise. If they are still used, data processing results will not be optimal, so data cleaning needs to be done carefully. The selection of the data used was 1243 records, and after being selected the data were taken in this study as many as 366 records, with parameters using 12 attributes, actual data from hospitals, data consisting of data from patients under surveillance for cardiac care, and data from patients who underwent surgery and Data from Medical Examination. Therefore, it is necessary to develop a decision support system that assists doctors in taking steps for early detection. Research conducted with the K-Nearest Neighbors algorithm accuracy up to 77% with a value of K = 7.\",\"PeriodicalId\":189541,\"journal\":{\"name\":\"2021 Sixth International Conference on Informatics and Computing (ICIC)\",\"volume\":\"395 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 Sixth International Conference on Informatics and Computing (ICIC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICIC54025.2021.9632918\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 Sixth International Conference on Informatics and Computing (ICIC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIC54025.2021.9632918","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

摘要

心脏病是一种需要警惕和特别关注的疾病。根据世卫组织的报告，2018年有多达1790万人死于心脏病，特别是在印度尼西亚，心脏病在2020年成为最大的死亡原因。本研究使用数据挖掘技术从所使用的数据中提取信息。这项研究提供了一项科学贡献，即尽早发现心脏病。在这种情况下，作者使用k -最近邻算法根据最近邻数据对数据进行分类。数据库在相当大的容量中是自己的，因此应该注意不相关的属性将被删除。如果仍然使用它们，数据处理结果将不是最优的，因此需要仔细进行数据清理。所用数据的选取为1243条记录，选取后本研究的数据多达366条记录，参数采用12个属性，医院的实际数据，心脏监护监护患者的数据，手术患者的数据和医学检查的数据。因此，有必要开发一种决策支持系统，帮助医生采取措施及早发现。研究表明，K近邻算法在K = 7时准确率高达77%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Heart Disease Classification Model Using K-Nearest Neighbor Algorithm

Heart disease is a disease that needs to be watched out for and is of particular concern. Seeing to the WHO report, in 2018, as many as 17.9 million people died from heart disease, and especially in Indonesia, heart disease in 2020 became the highest cause of death. This study uses data mining techniques to pull out information from the data used. This research provides a scientific contribution, namely detecting heart disease as early as possible. In this case, the author uses the K-Nearest Neighbor Algorithm to classify the data based on the nearest neighbor data. The database is own in a reasonably high volume, so it should note that irrelevant attributes will be removed over or noise. If they are still used, data processing results will not be optimal, so data cleaning needs to be done carefully. The selection of the data used was 1243 records, and after being selected the data were taken in this study as many as 366 records, with parameters using 12 attributes, actual data from hospitals, data consisting of data from patients under surveillance for cardiac care, and data from patients who underwent surgery and Data from Medical Examination. Therefore, it is necessary to develop a decision support system that assists doctors in taking steps for early detection. Research conducted with the K-Nearest Neighbors algorithm accuracy up to 77% with a value of K = 7.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2021 Sixth International Conference on Informatics and Computing (ICIC)

自引率

0.00%

发文量