An Improved Framework for Reliable Cardiovascular Disease Prediction Using Hybrid Ensemble Learning
Authors: Tanjim Mahmud, Anik Barua, M. Begum, Eipshita Chakma, Sudhakar Das, Nahed Sharmen
Published in: 2023 International Conference on Electrical, Computer and Communication Engineering (ECCE)
Publication date: 2023-02-23
DOI: 10.1109/ECCE57851.2023.10101564 (https://doi.org/10.1109/ECCE57851.2023.10101564)
Citations: 5
Abstract
Cardiovascular diseases (CVDs), which include heart disorders, are the most prevalent and significant causes of death worldwide, including in Bangladesh. They include arterial problems, rhythm disorders, chest pain, heart attacks, strokes, and erratic blood pressure. In Bangladesh, cardiovascular disease is the leading cause of death among both men and women. Heart disease and strokes are the predominant causes, accounting for more than 80% of CVD deaths. To compare the effectiveness of different models, this paper describes the underlying methods: Support Vector Machine (SVM), K-Nearest Neighbors (KNN), Logistic Regression (LR), Random Forest (RF), Decision Tree (DT), and XGBoost (XGB). Random Forest performs better when its hyperparameters are tuned with RandomizedSearchCV. Ensemble techniques such as Bagging, Voting, and Stacking are also suggested. Additionally, it is suggested that a hybrid strategy combining the Bagging and Stacking ensemble approaches can improve the prediction of cardiovascular disease. For this analysis, we used a dataset from Kaggle that comprises 70,000 unique data values. In the experiments, the proposed model achieved the best disease prediction accuracy, at 84.03%.
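The abstract does not spell out how Bagging and Stacking are combined, so the following is only a minimal sketch of one plausible reading, written with scikit-learn: bagged base learners are placed under a stacking meta-learner, and a Random Forest is tuned separately with RandomizedSearchCV. The function names (build_hybrid_ensemble, tune_random_forest, run_demo), the choice of base learners, the logistic-regression meta-learner, the parameter grid, and the synthetic data from make_classification are all assumptions made for illustration. None of them is taken from the paper, and the scores this sketch produces say nothing about the reported 84.03%.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import (
    BaggingClassifier,
    RandomForestClassifier,
    StackingClassifier,
)
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import RandomizedSearchCV, train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.tree import DecisionTreeClassifier


def build_hybrid_ensemble(random_state=0):
    """Stack bagged base learners under a logistic-regression meta-learner.

    The base learners and their settings are illustrative assumptions.
    """
    bagged_tree = BaggingClassifier(
        DecisionTreeClassifier(max_depth=5, random_state=random_state),
        n_estimators=10,
        random_state=random_state,
    )
    # KNN is distance-based, so features are standardized inside the pipeline.
    bagged_knn = BaggingClassifier(
        make_pipeline(StandardScaler(), KNeighborsClassifier(n_neighbors=5)),
        n_estimators=10,
        random_state=random_state,
    )
    forest = RandomForestClassifier(n_estimators=50, random_state=random_state)
    return StackingClassifier(
        estimators=[("bag_dt", bagged_tree), ("bag_knn", bagged_knn), ("rf", forest)],
        final_estimator=LogisticRegression(max_iter=1000),
        cv=3,  # out-of-fold predictions feed the meta-learner
    )


def tune_random_forest(X, y, random_state=0):
    """Randomized hyperparameter search over a small, assumed grid."""
    search = RandomizedSearchCV(
        RandomForestClassifier(random_state=random_state),
        param_distributions={
            "n_estimators": [25, 50, 100],
            "max_depth": [3, 5, 10, None],
            "min_samples_leaf": [1, 2, 4],
        },
        n_iter=5,
        cv=3,
        random_state=random_state,
    )
    search.fit(X, y)
    return search.best_estimator_, search.best_params_


def run_demo(n_samples=600, random_state=0):
    """Fit both models on synthetic stand-in data and return test accuracy."""
    # Synthetic data only; the feature count is arbitrary.
    X, y = make_classification(
        n_samples=n_samples, n_features=10, n_informative=6, random_state=random_state
    )
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.25, stratify=y, random_state=random_state
    )
    tuned_rf, best_params = tune_random_forest(X_train, y_train, random_state)
    hybrid = build_hybrid_ensemble(random_state).fit(X_train, y_train)
    return {
        "tuned_rf": tuned_rf.score(X_test, y_test),
        "hybrid": hybrid.score(X_test, y_test),
        "best_params": best_params,
    }


if __name__ == "__main__":
    print(run_demo())
```

XGBoost is left out of the sketch so that it depends on scikit-learn alone; an XGBoost classifier could be added to the estimators list in the same way. To try this on real data, the make_classification call would be replaced by loading the dataset's feature matrix and labels.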