各种分类器模型在冠状动脉疾病预测中的综合性能分析

IF 0.6 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE International Journal of Cognitive Informatics and Natural Intelligence Pub Date : 2021-10-01 DOI:10.4018/IJCINI.20211001.OA36

Baranidharan Balakrishnan, C. Kumar

{"title":"各种分类器模型在冠状动脉疾病预测中的综合性能分析","authors":"Baranidharan Balakrishnan, C. Kumar","doi":"10.4018/IJCINI.20211001.OA36","DOIUrl":null,"url":null,"abstract":"Cardio vascular diseases (CVD) are the major reason for the death of the majority of the people in the world. Earlier diagnosis of disease will reduce the mortality rate. Machine learning (ML) algorithms are giving promising results in the disease diagnosis, and they are now widely accepted by medical experts as their clinical decision support system. In this work, the most popular ML models are investigated and compared with one other for heart disease prediction based on various metrics. The base classifiers such as support vector machine (SVM), logistic regression, naïve Bayes, decision tree, k-nearest neighbour are used for predicting heart disease. In this paper, bagging and boosting techniques are applied over these individual classifiers to improve the performance of the system. With the Cleveland and Statlog datasets, naive Bayes as the individual classifier gives the maximum accuracy of 85.13%and 84.81%, respectively. Bagging technique improves the accuracy of the decision tree, which is identified as a weak classifier by 7%, and it is a significant improvement in identifying CVD. that Bayes, Support Vector Machine and Logistic are strong classifiers more than 80% accuracy and Decision Tree and K Nearest Neighbours as weak classifiers. Bagging and boosting techniques the performance of weak classifiers Decision Tree and K Nearest Neighbours. Bagging technique improved the accuracy of the decision tree algorithm 7.77% maximum for Statlog dataset. In future, feature selection is to be applied to find out the most relevant features of the data set and applying over the ensemble models over it will give better-improved accuracy.","PeriodicalId":43637,"journal":{"name":"International Journal of Cognitive Informatics and Natural Intelligence","volume":"43 1","pages":"1-14"},"PeriodicalIF":0.6000,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A Comprehensive Performance Analysis of Various Classifier Models for Coronary Artery Disease Prediction\",\"authors\":\"Baranidharan Balakrishnan, C. Kumar\",\"doi\":\"10.4018/IJCINI.20211001.OA36\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Cardio vascular diseases (CVD) are the major reason for the death of the majority of the people in the world. Earlier diagnosis of disease will reduce the mortality rate. Machine learning (ML) algorithms are giving promising results in the disease diagnosis, and they are now widely accepted by medical experts as their clinical decision support system. In this work, the most popular ML models are investigated and compared with one other for heart disease prediction based on various metrics. The base classifiers such as support vector machine (SVM), logistic regression, naïve Bayes, decision tree, k-nearest neighbour are used for predicting heart disease. In this paper, bagging and boosting techniques are applied over these individual classifiers to improve the performance of the system. With the Cleveland and Statlog datasets, naive Bayes as the individual classifier gives the maximum accuracy of 85.13%and 84.81%, respectively. Bagging technique improves the accuracy of the decision tree, which is identified as a weak classifier by 7%, and it is a significant improvement in identifying CVD. that Bayes, Support Vector Machine and Logistic are strong classifiers more than 80% accuracy and Decision Tree and K Nearest Neighbours as weak classifiers. Bagging and boosting techniques the performance of weak classifiers Decision Tree and K Nearest Neighbours. Bagging technique improved the accuracy of the decision tree algorithm 7.77% maximum for Statlog dataset. In future, feature selection is to be applied to find out the most relevant features of the data set and applying over the ensemble models over it will give better-improved accuracy.\",\"PeriodicalId\":43637,\"journal\":{\"name\":\"International Journal of Cognitive Informatics and Natural Intelligence\",\"volume\":\"43 1\",\"pages\":\"1-14\"},\"PeriodicalIF\":0.6000,\"publicationDate\":\"2021-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Cognitive Informatics and Natural Intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.4018/IJCINI.20211001.OA36\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Cognitive Informatics and Natural Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4018/IJCINI.20211001.OA36","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 1

摘要

心血管疾病(CVD)是世界上大多数人死亡的主要原因。疾病的早期诊断将降低死亡率。机器学习(ML)算法在疾病诊断方面取得了可喜的成果，作为临床决策支持系统已被医学专家广泛接受。在这项工作中，研究了最流行的ML模型，并基于各种指标对心脏病预测进行了比较。支持向量机(SVM)、逻辑回归、naïve贝叶斯、决策树、k近邻等基本分类器用于心脏病预测。在本文中，在这些单独的分类器上应用bagging和boosting技术来提高系统的性能。对于Cleveland和Statlog数据集，朴素贝叶斯作为单个分类器的最大准确率分别为85.13%和84.81%。Bagging技术提高了决策树的准确率，使其被识别为弱分类器的准确率提高了7%，在识别CVD方面有了显著的提高。贝叶斯、支持向量机和Logistic是准确率超过80%的强分类器，决策树和K近邻是弱分类器。弱分类器决策树和K近邻的装袋和增强技术。对于Statlog数据集，Bagging技术将决策树算法的准确率提高了7.77%。未来，特征选择将用于找出数据集最相关的特征，并将其应用于集成模型上，将获得更好的准确性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

A Comprehensive Performance Analysis of Various Classifier Models for Coronary Artery Disease Prediction

Cardio vascular diseases (CVD) are the major reason for the death of the majority of the people in the world. Earlier diagnosis of disease will reduce the mortality rate. Machine learning (ML) algorithms are giving promising results in the disease diagnosis, and they are now widely accepted by medical experts as their clinical decision support system. In this work, the most popular ML models are investigated and compared with one other for heart disease prediction based on various metrics. The base classifiers such as support vector machine (SVM), logistic regression, naïve Bayes, decision tree, k-nearest neighbour are used for predicting heart disease. In this paper, bagging and boosting techniques are applied over these individual classifiers to improve the performance of the system. With the Cleveland and Statlog datasets, naive Bayes as the individual classifier gives the maximum accuracy of 85.13%and 84.81%, respectively. Bagging technique improves the accuracy of the decision tree, which is identified as a weak classifier by 7%, and it is a significant improvement in identifying CVD. that Bayes, Support Vector Machine and Logistic are strong classifiers more than 80% accuracy and Decision Tree and K Nearest Neighbours as weak classifiers. Bagging and boosting techniques the performance of weak classifiers Decision Tree and K Nearest Neighbours. Bagging technique improved the accuracy of the decision tree algorithm 7.77% maximum for Statlog dataset. In future, feature selection is to be applied to find out the most relevant features of the data set and applying over the ensemble models over it will give better-improved accuracy.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International Journal of Cognitive Informatics and Natural Intelligence COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-

CiteScore

2.00

自引率

11.10%

发文量

期刊介绍： The International Journal of Cognitive Informatics and Natural Intelligence (IJCINI) encourages submissions that transcends disciplinary boundaries, and is devoted to rapid publication of high quality papers. The themes of IJCINI are natural intelligence, autonomic computing, and neuroinformatics. IJCINI is expected to provide the first forum and platform in the world for researchers, practitioners, and graduate students to investigate cognitive mechanisms and processes of human information processing, and to stimulate the transdisciplinary effort on cognitive informatics and natural intelligent research and engineering applications.