Prediction of Heart Disease using Machine Learning Techniques

H. Singh, Tushar Gupta, J. Sidhu
{"title":"Prediction of Heart Disease using Machine Learning Techniques","authors":"H. Singh, Tushar Gupta, J. Sidhu","doi":"10.1109/ICIIP53038.2021.9702625","DOIUrl":null,"url":null,"abstract":"Heart attacks and strokes account for 85 percent of these fatalities. Unhealthy food, lack of physical exercise, cigarette smoking, and excessive alcohol use are all major behavioral risk factors for CVDs. These variables can lead to high blood pressure, high blood glucose, high blood cholesterol, and obesity. It is critical to identify cardiac illness as soon as possible, as well as swiftly and correctly as possible. Complex medical data is analyzed by various data mining and machine learning techniques in literature. The findings of the in-depth examination of these research articles are extremely convincing and accurate, but the future scope of these papers reflects the need for more significant characteristics and abundant standardized data, as well as the employment of different algorithms to achieve better accuracy rates. This research paper compares Random Forest algorithm with nearest neighbor (KNN) and Naïve Bayes on standard datasets from Cleveland database and Statlog Heart Disease of University of California Irvine (UCI) repository. The major goal of the research study is to get meaningful outcomes. With only 13 characteristics, we were able to get some extremely encouraging outcomes. The results validate Random Forest Classifier with accuracy of 93.02 %, significantly outperformed Naive Bayes and KNN which have accuracy of 83.72% and 90.69% respectively.","PeriodicalId":431272,"journal":{"name":"2021 Sixth International Conference on Image Information Processing (ICIIP)","volume":"73 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 Sixth International Conference on Image Information Processing (ICIIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIIP53038.2021.9702625","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Heart attacks and strokes account for 85 percent of these fatalities. Unhealthy food, lack of physical exercise, cigarette smoking, and excessive alcohol use are all major behavioral risk factors for CVDs. These variables can lead to high blood pressure, high blood glucose, high blood cholesterol, and obesity. It is critical to identify cardiac illness as soon as possible, as well as swiftly and correctly as possible. Complex medical data is analyzed by various data mining and machine learning techniques in literature. The findings of the in-depth examination of these research articles are extremely convincing and accurate, but the future scope of these papers reflects the need for more significant characteristics and abundant standardized data, as well as the employment of different algorithms to achieve better accuracy rates. This research paper compares Random Forest algorithm with nearest neighbor (KNN) and Naïve Bayes on standard datasets from Cleveland database and Statlog Heart Disease of University of California Irvine (UCI) repository. The major goal of the research study is to get meaningful outcomes. With only 13 characteristics, we were able to get some extremely encouraging outcomes. The results validate Random Forest Classifier with accuracy of 93.02 %, significantly outperformed Naive Bayes and KNN which have accuracy of 83.72% and 90.69% respectively.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用机器学习技术预测心脏病
心脏病发作和中风占这些死亡人数的85%。不健康的食物、缺乏体育锻炼、吸烟和过度饮酒都是心血管疾病的主要行为风险因素。这些变量会导致高血压、高血糖、高胆固醇和肥胖。尽快、尽可能迅速和正确地识别心脏疾病至关重要。在文献中,复杂的医疗数据通过各种数据挖掘和机器学习技术进行分析。对这些研究文章进行深入研究后得出的结论是非常有说服力和准确的,但这些论文的未来范围反映出需要更显著的特征和丰富的标准化数据,以及采用不同的算法来达到更好的准确率。本文将随机森林算法与最近邻算法(KNN)和Naïve贝叶斯算法在Cleveland数据库和加州大学欧文分校(UCI)心脏病数据库的标准数据集上进行比较。这项研究的主要目标是获得有意义的结果。只有13个特征,我们就能得到一些非常令人鼓舞的结果。结果表明,随机森林分类器的准确率为93.02%,显著优于朴素贝叶斯和KNN,后者的准确率分别为83.72%和90.69%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Weather Station Using Raspberry Pi Prediction of Heart Disease using Machine Learning Techniques A Brief Review on Existing Techniques for Detecting Digital Image Forgery A Novel Approach for Excavating Communication Using Taxonomy and Outline Mechanisms A Novel Rough Set Based Image Denoising Algorithm
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1