Khairul Eahsun Fahim, Hayati Yassin, Md Hasnatul Amin, Priyanka Das Dewan, Aminul Islam
{"title":"Detection of Cardiovascular Disease of Patients at an Early Stage Using Machine Learning Algorithms","authors":"Khairul Eahsun Fahim, Hayati Yassin, Md Hasnatul Amin, Priyanka Das Dewan, Aminul Islam","doi":"10.1109/ICHE55634.2022.10179871","DOIUrl":null,"url":null,"abstract":"Among the significant causes of death worldwide, cardiovascular diseases (CVD) account for 80 percent of deaths in low- and middle-income countries such as Bangladesh, according to the World Health Organization (WHO). In Bangladesh, the prevalence of HIV/AIDS and the mortality linked with it have climbed considerably over the past few decades. The rising incidence of cardiovascular disease in Bangladesh needs a complete understanding of the epidemiology of CVD risk among the population. Clinical data analysis is a significant concern for someone dealing with cardiovascular illness. When it comes to generating decisions and making predictions from the vast volumes of data generated by the healthcare industry, machine learning (ML) is to be extremely useful. It is proposed in this research to apply a supervised machine learning algorithm to detect cardiovascular disease (CVD) in individuals early on, allowing them to become concerned about their medical status and avert significant illnesses. When it comes to detecting the disease, four different machine learning methods have been used. The dataset of patients was used, and various machine learning methods, including K-nearest neighbors, Random Forest, Decision trees, and XGBoost, were used to make predictions. As a consequence of the tests, the XGBoost method is superior to the other three tactics (73.72 percent). Moreover, for the modified dataset where smoking, alcohol intake, and physical activity are positive, the percentage is 81.14% to show the effect of smoking and alcohol consumption in a physically active person in terms of cardiovascular disease. Furthermore, these strategies have been evaluated regarding their ability to detect early-stage CVD inpatients. This paper examined the Kaggle dataset to observe the trait and suitability to implement the system for primary data collected from Bangladeshi patients.","PeriodicalId":289905,"journal":{"name":"2022 International Conference on Healthcare Engineering (ICHE)","volume":"69 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Healthcare Engineering (ICHE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICHE55634.2022.10179871","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Among the significant causes of death worldwide, cardiovascular diseases (CVD) account for 80 percent of deaths in low- and middle-income countries such as Bangladesh, according to the World Health Organization (WHO). In Bangladesh, the prevalence of HIV/AIDS and the mortality linked with it have climbed considerably over the past few decades. The rising incidence of cardiovascular disease in Bangladesh needs a complete understanding of the epidemiology of CVD risk among the population. Clinical data analysis is a significant concern for someone dealing with cardiovascular illness. When it comes to generating decisions and making predictions from the vast volumes of data generated by the healthcare industry, machine learning (ML) is to be extremely useful. It is proposed in this research to apply a supervised machine learning algorithm to detect cardiovascular disease (CVD) in individuals early on, allowing them to become concerned about their medical status and avert significant illnesses. When it comes to detecting the disease, four different machine learning methods have been used. The dataset of patients was used, and various machine learning methods, including K-nearest neighbors, Random Forest, Decision trees, and XGBoost, were used to make predictions. As a consequence of the tests, the XGBoost method is superior to the other three tactics (73.72 percent). Moreover, for the modified dataset where smoking, alcohol intake, and physical activity are positive, the percentage is 81.14% to show the effect of smoking and alcohol consumption in a physically active person in terms of cardiovascular disease. Furthermore, these strategies have been evaluated regarding their ability to detect early-stage CVD inpatients. This paper examined the Kaggle dataset to observe the trait and suitability to implement the system for primary data collected from Bangladeshi patients.