Ensemble classification technique for heart disease prediction with meta-heuristic-enabled training system

IF 1.2 Q3 Computer Science Bio-Algorithms and Med-Systems Pub Date : 2020-11-26 DOI:10.1515/bams-2020-0033
Parvathaneni Rajendra Kumar, S. Ravichandran, S. Narayana
{"title":"Ensemble classification technique for heart disease prediction with meta-heuristic-enabled training system","authors":"Parvathaneni Rajendra Kumar, S. Ravichandran, S. Narayana","doi":"10.1515/bams-2020-0033","DOIUrl":null,"url":null,"abstract":"Abstract Objectives This research work exclusively aims to develop a novel heart disease prediction framework including three major phases, namely proposed feature extraction, dimensionality reduction, and proposed ensemble-based classification. Methods As the novelty, the training of NN is carried out by a new enhanced optimization algorithm referred to as Sea Lion with Canberra Distance (S-CDF) via tuning the optimal weights. The improved S-CDF algorithm is the extended version of the existing “Sea Lion Optimization (SLnO)”. Initially, the statistical and higher-order statistical features are extracted including central tendency, degree of dispersion, and qualitative variation, respectively. However, in this scenario, the “curse of dimensionality” seems to be the greatest issue, such that there is a necessity of dimensionality reduction in the extracted features. Hence, the principal component analysis (PCA)-based feature reduction approach is deployed here. Finally, the dimensional concentrated features are fed as the input to the proposed ensemble technique with “Support Vector Machine (SVM), Random Forest (RF), K-Nearest Neighbor (KNN)” with optimized Neural Network (NN) as the final classifier. Results An elaborative analyses as well as discussion have been provided by concerning the parameters, like evaluation metrics, year of publication, accuracy, implementation tool, and utilized datasets obtained by various techniques. Conclusions From the experiment outcomes, it is proved that the accuracy of the proposed work with the proposed feature set is 5, 42.85, and 10% superior to the performance with other feature sets like central tendency + dispersion feature, central tendency qualitative variation, and dispersion qualitative variation, respectively. Results Finally, the comparative evaluation shows that the presented work is appropriate for heart disease prediction as it has high accuracy than the traditional works.","PeriodicalId":42620,"journal":{"name":"Bio-Algorithms and Med-Systems","volume":"17 1","pages":"119 - 136"},"PeriodicalIF":1.2000,"publicationDate":"2020-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1515/bams-2020-0033","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bio-Algorithms and Med-Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1515/bams-2020-0033","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Computer Science","Score":null,"Total":0}
引用次数: 6

Abstract

Abstract Objectives This research work exclusively aims to develop a novel heart disease prediction framework including three major phases, namely proposed feature extraction, dimensionality reduction, and proposed ensemble-based classification. Methods As the novelty, the training of NN is carried out by a new enhanced optimization algorithm referred to as Sea Lion with Canberra Distance (S-CDF) via tuning the optimal weights. The improved S-CDF algorithm is the extended version of the existing “Sea Lion Optimization (SLnO)”. Initially, the statistical and higher-order statistical features are extracted including central tendency, degree of dispersion, and qualitative variation, respectively. However, in this scenario, the “curse of dimensionality” seems to be the greatest issue, such that there is a necessity of dimensionality reduction in the extracted features. Hence, the principal component analysis (PCA)-based feature reduction approach is deployed here. Finally, the dimensional concentrated features are fed as the input to the proposed ensemble technique with “Support Vector Machine (SVM), Random Forest (RF), K-Nearest Neighbor (KNN)” with optimized Neural Network (NN) as the final classifier. Results An elaborative analyses as well as discussion have been provided by concerning the parameters, like evaluation metrics, year of publication, accuracy, implementation tool, and utilized datasets obtained by various techniques. Conclusions From the experiment outcomes, it is proved that the accuracy of the proposed work with the proposed feature set is 5, 42.85, and 10% superior to the performance with other feature sets like central tendency + dispersion feature, central tendency qualitative variation, and dispersion qualitative variation, respectively. Results Finally, the comparative evaluation shows that the presented work is appropriate for heart disease prediction as it has high accuracy than the traditional works.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于元启发式训练系统的心脏病预测集成分类技术
摘要目的本研究工作旨在开发一种新的心脏病预测框架,包括三个主要阶段,即提出的特征提取、降维和提出的基于集合的分类。方法新颖的是,通过调整最优权值,采用一种新的增强优化算法,即具有堪培拉距离的海狮算法(S-CDF)对神经网络进行训练。改进的S-CDF算法是现有“海狮优化(SLnO)”的扩展版本。首先,提取统计特征和高阶统计特征,分别包括中心趋势、离散度和定性变化。然而,在这种情况下,“维度诅咒”似乎是最大的问题,因此有必要对提取的特征进行降维。因此,本文采用了基于主成分分析的特征约简方法。最后,将维度集中的特征作为输入提供给所提出的集成技术,该技术以“支持向量机(SVM)、随机森林(RF)、K-最近邻(KNN)”和优化神经网络(NN)作为最终分类器。结果对评价指标、发表年份、准确性、实施工具和通过各种技术获得的使用数据集等参数进行了详细的分析和讨论。结论从实验结果来看,所提出的特征集的精度分别比其他特征集(如中心趋势+分散特征、中心趋势定性变化和分散定性变化)的精度高5%、42.85%和10%。结果最后,比较评估表明,所提出的工作适合于心脏病预测,因为它比传统工作具有更高的准确性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Bio-Algorithms and Med-Systems
Bio-Algorithms and Med-Systems MATHEMATICAL & COMPUTATIONAL BIOLOGY-
CiteScore
3.80
自引率
0.00%
发文量
3
期刊介绍: The journal Bio-Algorithms and Med-Systems (BAMS), edited by the Jagiellonian University Medical College, provides a forum for the exchange of information in the interdisciplinary fields of computational methods applied in medicine, presenting new algorithms and databases that allows the progress in collaborations between medicine, informatics, physics, and biochemistry. Projects linking specialists representing these disciplines are welcome to be published in this Journal. Articles in BAMS are published in English. Topics Bioinformatics Systems biology Telemedicine E-Learning in Medicine Patient''s electronic record Image processing Medical databases.
期刊最新文献
Propagation of electrical signals by fungi Light induced spiking of proteinoids Transfer Functions of Proteinoid Microspheres Application of quantum entanglement induced polarization for dual-positron and prompt gamma imaging. Transcriptomic data analysis of melanocytes and melanoma cell lines of LAT transporter genes for precise medicine
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1