利用正则化主值和二次熵提升进行心脏病预测的大数据分析

P. Muthulakshmi, M. Parveen
{"title":"利用正则化主值和二次熵提升进行心脏病预测的大数据分析","authors":"P. Muthulakshmi, M. Parveen","doi":"10.17485/ijst/v17i6.2928","DOIUrl":null,"url":null,"abstract":"Objectives: Over the past few years, there prevails an abundance wealth of big data obtained via patients' electronic health records. One of the leading causes of mortality globally is the cardiovascular disease. Based on the present test and history cardiovascular disease diagnosing of patients can be done. Therefore, early and quick diagnosis can reduce the mortality rate. To address their needs, several machine learning methods have been employed in the recent past in cardiovascular disease diagnosis and prediction. Previous research was also concentrated on acquiring the significant features to heart disease prediction however less importance was given to the time involved and error rate to identifying the strength of these features. Methods: In this work we plan to develop a method called, Regularized Principal Component and Quadratic Weighted Entropy Boosting (RPC-QWEB) for predicting heart disease. Initially in RPC-QWEB, relevant features are selected to avoid missing values in the input database by employing Regularized Principal Component Regressive Feature Selection (RPCRFS). Second, with the obtained dimensionality reduced features, Quadratic Weighted Entropy Boosting Classification (QWEBC) process is carried out to classify the patient data as normal or abnormal. The QWEBC process is an ensemble of several weak classifiers (i.e., Quadratic Classifier). The weak classifier results are combined to form strong classifier and provide final prediction results as normal or abnormal condition with minimal error rate. Findings: Experimental evaluation is carried out on factors with the cardiovascular disease dataset such as heart disease prediction accuracy, heart disease prediction time, sensitivity, error rate with respect to distinct numbers of patient data. The proposed RPC-QWEB method was compared with existing Heart Disease Prediction Framework (HDPF) and Swarm Artificial Neural Network (Swarm-ANN). Novelty: RPC-QWEB method outperforms the conventional learning methods in terms of numerous performance matrices. The RPC-QWEB method produces 3% and 5% increase in terms of accuracy and sensitivity and 7% and 29% reduced prediction time and error rate as compared to the existing benchmark methods. We may use this method to predict the heart disease at early stage there by we can reduce the death rate. Keywords: Big data, Regularized Principal Component, Quadratic Weighted Entropy Boosting, Regressive Feature Selection, Classification","PeriodicalId":508200,"journal":{"name":"Indian Journal Of Science And Technology","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Big Data Analytics for Heart Disease Prediction using Regularized Principal and Quadratic Entropy Boosting\",\"authors\":\"P. Muthulakshmi, M. Parveen\",\"doi\":\"10.17485/ijst/v17i6.2928\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Objectives: Over the past few years, there prevails an abundance wealth of big data obtained via patients' electronic health records. One of the leading causes of mortality globally is the cardiovascular disease. Based on the present test and history cardiovascular disease diagnosing of patients can be done. Therefore, early and quick diagnosis can reduce the mortality rate. To address their needs, several machine learning methods have been employed in the recent past in cardiovascular disease diagnosis and prediction. Previous research was also concentrated on acquiring the significant features to heart disease prediction however less importance was given to the time involved and error rate to identifying the strength of these features. Methods: In this work we plan to develop a method called, Regularized Principal Component and Quadratic Weighted Entropy Boosting (RPC-QWEB) for predicting heart disease. Initially in RPC-QWEB, relevant features are selected to avoid missing values in the input database by employing Regularized Principal Component Regressive Feature Selection (RPCRFS). Second, with the obtained dimensionality reduced features, Quadratic Weighted Entropy Boosting Classification (QWEBC) process is carried out to classify the patient data as normal or abnormal. The QWEBC process is an ensemble of several weak classifiers (i.e., Quadratic Classifier). The weak classifier results are combined to form strong classifier and provide final prediction results as normal or abnormal condition with minimal error rate. Findings: Experimental evaluation is carried out on factors with the cardiovascular disease dataset such as heart disease prediction accuracy, heart disease prediction time, sensitivity, error rate with respect to distinct numbers of patient data. The proposed RPC-QWEB method was compared with existing Heart Disease Prediction Framework (HDPF) and Swarm Artificial Neural Network (Swarm-ANN). Novelty: RPC-QWEB method outperforms the conventional learning methods in terms of numerous performance matrices. The RPC-QWEB method produces 3% and 5% increase in terms of accuracy and sensitivity and 7% and 29% reduced prediction time and error rate as compared to the existing benchmark methods. We may use this method to predict the heart disease at early stage there by we can reduce the death rate. Keywords: Big data, Regularized Principal Component, Quadratic Weighted Entropy Boosting, Regressive Feature Selection, Classification\",\"PeriodicalId\":508200,\"journal\":{\"name\":\"Indian Journal Of Science And Technology\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-02-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Indian Journal Of Science And Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.17485/ijst/v17i6.2928\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Indian Journal Of Science And Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.17485/ijst/v17i6.2928","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

目的:在过去几年中,通过病人的电子健康记录获得了大量丰富的大数据。心血管疾病是导致全球死亡的主要原因之一。根据目前的检查和病史,可以对患者进行心血管疾病诊断。因此,早期快速诊断可以降低死亡率。为了满足这些需求,近年来在心血管疾病诊断和预测方面采用了多种机器学习方法。以往的研究也集中在获取心脏病预测的重要特征上,但对识别这些特征的强度所需的时间和错误率却不太重视。方法:在这项工作中,我们计划开发一种名为正则化主成分和二次加权熵提升(RPC-QWEB)的方法,用于预测心脏病。在 RPC-QWEB 中,首先通过使用正则化主成分递归特征选择(RPCRFS)来选择相关特征,以避免输入数据库中的缺失值。其次,利用获得的降维特征,进行二次加权熵提升分类(Quadratic Weighted Entropy Boosting Classification,QWEBC)过程,将患者数据分类为正常或异常。QWEBC 流程是多个弱分类器(即二次分类器)的集合。弱分类器的结果组合成强分类器,以最小的错误率提供正常或异常的最终预测结果。研究结果利用心血管疾病数据集对各种因素进行了实验评估,如心脏病预测准确率、心脏病预测时间、灵敏度、与不同数量患者数据相关的错误率。提出的 RPC-QWEB 方法与现有的心脏病预测框架(HDPF)和蜂群人工神经网络(Swarm-ANN)进行了比较。新颖性:RPC-QWEB 方法在众多性能矩阵方面优于传统学习方法。与现有的基准方法相比,RPC-QWEB 方法的准确度和灵敏度分别提高了 3% 和 5%,预测时间和错误率分别减少了 7% 和 29%。我们可以利用这种方法在早期预测心脏病,从而降低死亡率。关键词大数据 正则化主成分 二次加权熵提升 回归特征选择 分类
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Big Data Analytics for Heart Disease Prediction using Regularized Principal and Quadratic Entropy Boosting
Objectives: Over the past few years, there prevails an abundance wealth of big data obtained via patients' electronic health records. One of the leading causes of mortality globally is the cardiovascular disease. Based on the present test and history cardiovascular disease diagnosing of patients can be done. Therefore, early and quick diagnosis can reduce the mortality rate. To address their needs, several machine learning methods have been employed in the recent past in cardiovascular disease diagnosis and prediction. Previous research was also concentrated on acquiring the significant features to heart disease prediction however less importance was given to the time involved and error rate to identifying the strength of these features. Methods: In this work we plan to develop a method called, Regularized Principal Component and Quadratic Weighted Entropy Boosting (RPC-QWEB) for predicting heart disease. Initially in RPC-QWEB, relevant features are selected to avoid missing values in the input database by employing Regularized Principal Component Regressive Feature Selection (RPCRFS). Second, with the obtained dimensionality reduced features, Quadratic Weighted Entropy Boosting Classification (QWEBC) process is carried out to classify the patient data as normal or abnormal. The QWEBC process is an ensemble of several weak classifiers (i.e., Quadratic Classifier). The weak classifier results are combined to form strong classifier and provide final prediction results as normal or abnormal condition with minimal error rate. Findings: Experimental evaluation is carried out on factors with the cardiovascular disease dataset such as heart disease prediction accuracy, heart disease prediction time, sensitivity, error rate with respect to distinct numbers of patient data. The proposed RPC-QWEB method was compared with existing Heart Disease Prediction Framework (HDPF) and Swarm Artificial Neural Network (Swarm-ANN). Novelty: RPC-QWEB method outperforms the conventional learning methods in terms of numerous performance matrices. The RPC-QWEB method produces 3% and 5% increase in terms of accuracy and sensitivity and 7% and 29% reduced prediction time and error rate as compared to the existing benchmark methods. We may use this method to predict the heart disease at early stage there by we can reduce the death rate. Keywords: Big data, Regularized Principal Component, Quadratic Weighted Entropy Boosting, Regressive Feature Selection, Classification
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Theoretical and Finite Element Analysis of Pressure Vessel Image Analysis and Classification Using HRSVM-CNN for Land-Cover Classification by Using Remote Sensing Images Lattice Ordered G􀀀Semirings On the Solution of Blasius Boundary Layer Equations of Prandtl-Eyring Fluid Flow Past a Stretching Sheet Optimizing High Data Rate Up to 2.5 Tbps Transmission using 64-Channel DWDM System with DCF and NRZ Modulation
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1