基于k -最近邻算法的心脏病分类模型

Ben Rahman, H. L. Hendric Spits Warnars, Boy Subirosa Sabarguna, W. Budiharto
{"title":"基于k -最近邻算法的心脏病分类模型","authors":"Ben Rahman, H. L. Hendric Spits Warnars, Boy Subirosa Sabarguna, W. Budiharto","doi":"10.1109/ICIC54025.2021.9632918","DOIUrl":null,"url":null,"abstract":"Heart disease is a disease that needs to be watched out for and is of particular concern. Seeing to the WHO report, in 2018, as many as 17.9 million people died from heart disease, and especially in Indonesia, heart disease in 2020 became the highest cause of death. This study uses data mining techniques to pull out information from the data used. This research provides a scientific contribution, namely detecting heart disease as early as possible. In this case, the author uses the K-Nearest Neighbor Algorithm to classify the data based on the nearest neighbor data. The database is own in a reasonably high volume, so it should note that irrelevant attributes will be removed over or noise. If they are still used, data processing results will not be optimal, so data cleaning needs to be done carefully. The selection of the data used was 1243 records, and after being selected the data were taken in this study as many as 366 records, with parameters using 12 attributes, actual data from hospitals, data consisting of data from patients under surveillance for cardiac care, and data from patients who underwent surgery and Data from Medical Examination. Therefore, it is necessary to develop a decision support system that assists doctors in taking steps for early detection. Research conducted with the K-Nearest Neighbors algorithm accuracy up to 77% with a value of K = 7.","PeriodicalId":189541,"journal":{"name":"2021 Sixth International Conference on Informatics and Computing (ICIC)","volume":"395 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Heart Disease Classification Model Using K-Nearest Neighbor Algorithm\",\"authors\":\"Ben Rahman, H. L. Hendric Spits Warnars, Boy Subirosa Sabarguna, W. Budiharto\",\"doi\":\"10.1109/ICIC54025.2021.9632918\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Heart disease is a disease that needs to be watched out for and is of particular concern. Seeing to the WHO report, in 2018, as many as 17.9 million people died from heart disease, and especially in Indonesia, heart disease in 2020 became the highest cause of death. This study uses data mining techniques to pull out information from the data used. This research provides a scientific contribution, namely detecting heart disease as early as possible. In this case, the author uses the K-Nearest Neighbor Algorithm to classify the data based on the nearest neighbor data. The database is own in a reasonably high volume, so it should note that irrelevant attributes will be removed over or noise. If they are still used, data processing results will not be optimal, so data cleaning needs to be done carefully. The selection of the data used was 1243 records, and after being selected the data were taken in this study as many as 366 records, with parameters using 12 attributes, actual data from hospitals, data consisting of data from patients under surveillance for cardiac care, and data from patients who underwent surgery and Data from Medical Examination. Therefore, it is necessary to develop a decision support system that assists doctors in taking steps for early detection. Research conducted with the K-Nearest Neighbors algorithm accuracy up to 77% with a value of K = 7.\",\"PeriodicalId\":189541,\"journal\":{\"name\":\"2021 Sixth International Conference on Informatics and Computing (ICIC)\",\"volume\":\"395 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 Sixth International Conference on Informatics and Computing (ICIC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICIC54025.2021.9632918\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 Sixth International Conference on Informatics and Computing (ICIC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIC54025.2021.9632918","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

摘要

心脏病是一种需要警惕和特别关注的疾病。根据世卫组织的报告,2018年有多达1790万人死于心脏病,特别是在印度尼西亚,心脏病在2020年成为最大的死亡原因。本研究使用数据挖掘技术从所使用的数据中提取信息。这项研究提供了一项科学贡献,即尽早发现心脏病。在这种情况下,作者使用k -最近邻算法根据最近邻数据对数据进行分类。数据库在相当大的容量中是自己的,因此应该注意不相关的属性将被删除。如果仍然使用它们,数据处理结果将不是最优的,因此需要仔细进行数据清理。所用数据的选取为1243条记录,选取后本研究的数据多达366条记录,参数采用12个属性,医院的实际数据,心脏监护监护患者的数据,手术患者的数据和医学检查的数据。因此,有必要开发一种决策支持系统,帮助医生采取措施及早发现。研究表明,K近邻算法在K = 7时准确率高达77%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Heart Disease Classification Model Using K-Nearest Neighbor Algorithm
Heart disease is a disease that needs to be watched out for and is of particular concern. Seeing to the WHO report, in 2018, as many as 17.9 million people died from heart disease, and especially in Indonesia, heart disease in 2020 became the highest cause of death. This study uses data mining techniques to pull out information from the data used. This research provides a scientific contribution, namely detecting heart disease as early as possible. In this case, the author uses the K-Nearest Neighbor Algorithm to classify the data based on the nearest neighbor data. The database is own in a reasonably high volume, so it should note that irrelevant attributes will be removed over or noise. If they are still used, data processing results will not be optimal, so data cleaning needs to be done carefully. The selection of the data used was 1243 records, and after being selected the data were taken in this study as many as 366 records, with parameters using 12 attributes, actual data from hospitals, data consisting of data from patients under surveillance for cardiac care, and data from patients who underwent surgery and Data from Medical Examination. Therefore, it is necessary to develop a decision support system that assists doctors in taking steps for early detection. Research conducted with the K-Nearest Neighbors algorithm accuracy up to 77% with a value of K = 7.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Analysis of IoT adoption on Trucking Logistics in Various Industry in Indonesia Design of Blockchain Implementation for Supervision of Vaccine Distribution: Indonesia Case [ICIC 2021 Back Cover] Design and Simulation of Antipodal Vivaldi Antenna (AVA) AT 2.6 GHz For 5G Communication Optimation Classification of Chili Leaf Disease Using the Gray Level Co-occurrence Matrix (GLCM) and the Support Vector Machine (SVM) Methods
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1