使用机器学习算法预测心脏病

Md. Julker Nayeem, Sohel Rana, Md. Rabiul Islam
{"title":"使用机器学习算法预测心脏病","authors":"Md. Julker Nayeem, Sohel Rana, Md. Rabiul Islam","doi":"10.24018/ejai.2022.1.3.13","DOIUrl":null,"url":null,"abstract":"Heart disease has become one of the alarming issues of death. It is accountable for fatty plaques in the arteries. If this fatal condition can be identified early, we can preserve many people’s arteries. Different types of supervised machine learning algorithms are applied in our research paper in order to predict heart disease existence in patient body. Besides this, we have focused on an efficient way to improve the performance of our applied classifiers. Imputing mean value technique is applied to handle null values present in our dataset. The features which are unnecessary are removed by using the info-gain feature selection technique. In order to calculate prediction accuracy, K-Nearest Neighbors (KNN), Naive Bayes and Random Forest are applied to the heart disease dataset. Accuracy, precision, recall, F1-score, and ROC are calculated which help us to compare the performance of the classification models. Handling null values on a particular column by imputing mean values of that column and our applied info-gain feature selection technique has aided us in improving the accuracy of our prediction models. Random Forest among all has given the best classification accuracy which is 95.63% with precision, recall, F1-score and ROC are 0.93, 0.92, 0.92 and 0.9, respectively.","PeriodicalId":360205,"journal":{"name":"European Journal of Artificial Intelligence and Machine Learning","volume":"81 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Prediction of Heart Disease Using Machine Learning Algorithms\",\"authors\":\"Md. Julker Nayeem, Sohel Rana, Md. Rabiul Islam\",\"doi\":\"10.24018/ejai.2022.1.3.13\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Heart disease has become one of the alarming issues of death. It is accountable for fatty plaques in the arteries. If this fatal condition can be identified early, we can preserve many people’s arteries. Different types of supervised machine learning algorithms are applied in our research paper in order to predict heart disease existence in patient body. Besides this, we have focused on an efficient way to improve the performance of our applied classifiers. Imputing mean value technique is applied to handle null values present in our dataset. The features which are unnecessary are removed by using the info-gain feature selection technique. In order to calculate prediction accuracy, K-Nearest Neighbors (KNN), Naive Bayes and Random Forest are applied to the heart disease dataset. Accuracy, precision, recall, F1-score, and ROC are calculated which help us to compare the performance of the classification models. Handling null values on a particular column by imputing mean values of that column and our applied info-gain feature selection technique has aided us in improving the accuracy of our prediction models. Random Forest among all has given the best classification accuracy which is 95.63% with precision, recall, F1-score and ROC are 0.93, 0.92, 0.92 and 0.9, respectively.\",\"PeriodicalId\":360205,\"journal\":{\"name\":\"European Journal of Artificial Intelligence and Machine Learning\",\"volume\":\"81 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"European Journal of Artificial Intelligence and Machine Learning\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.24018/ejai.2022.1.3.13\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Artificial Intelligence and Machine Learning","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.24018/ejai.2022.1.3.13","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

摘要

心脏病已成为令人担忧的死亡问题之一。它会导致动脉中的脂肪斑块。如果这种致命的疾病能及早发现,我们就能保住许多人的动脉。在我们的研究中应用了不同类型的监督机器学习算法来预测患者体内是否存在心脏病。除此之外,我们还专注于一种有效的方法来提高我们应用的分类器的性能。应用均值计算技术处理数据集中存在的空值。利用信息增益特征选择技术去除不需要的特征。为了计算预测精度,将k近邻(KNN)、朴素贝叶斯和随机森林应用于心脏病数据集。计算了准确率、精密度、召回率、f1得分和ROC,这有助于我们比较分类模型的性能。通过输入该列的平均值来处理特定列上的空值,我们应用的信息增益特征选择技术帮助我们提高了预测模型的准确性。其中Random Forest的分类准确率最高,达到95.63%,其中precision为0.93,recall为0.92,F1-score为0.92,ROC为0.9。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Prediction of Heart Disease Using Machine Learning Algorithms
Heart disease has become one of the alarming issues of death. It is accountable for fatty plaques in the arteries. If this fatal condition can be identified early, we can preserve many people’s arteries. Different types of supervised machine learning algorithms are applied in our research paper in order to predict heart disease existence in patient body. Besides this, we have focused on an efficient way to improve the performance of our applied classifiers. Imputing mean value technique is applied to handle null values present in our dataset. The features which are unnecessary are removed by using the info-gain feature selection technique. In order to calculate prediction accuracy, K-Nearest Neighbors (KNN), Naive Bayes and Random Forest are applied to the heart disease dataset. Accuracy, precision, recall, F1-score, and ROC are calculated which help us to compare the performance of the classification models. Handling null values on a particular column by imputing mean values of that column and our applied info-gain feature selection technique has aided us in improving the accuracy of our prediction models. Random Forest among all has given the best classification accuracy which is 95.63% with precision, recall, F1-score and ROC are 0.93, 0.92, 0.92 and 0.9, respectively.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Assessment of the Accuracy of Various Machine Learning Algorithms for Classifying Urban Areas through Google Earth Engine: A Case Study of Kabul City, Afghanistan Enhancing Arabic Handwritten Recognition System-Based CNN-BLSTM Using Generative Adversarial Networks Shape Recognition and Corner Points Detection in 2D Drawings Using a Machine Learning Long Short-Term Memory (LSTM) Approach Review on Technologies Applied to Classification of Tomato Leaf Virus Diseases The Development of Artificial Intelligence in Career Initiation Education and Implications for China
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1