{"title":"Prediction of Heart Disease using Machine Learning Techniques","authors":"H. Singh, Tushar Gupta, J. Sidhu","doi":"10.1109/ICIIP53038.2021.9702625","DOIUrl":null,"url":null,"abstract":"Heart attacks and strokes account for 85 percent of these fatalities. Unhealthy food, lack of physical exercise, cigarette smoking, and excessive alcohol use are all major behavioral risk factors for CVDs. These variables can lead to high blood pressure, high blood glucose, high blood cholesterol, and obesity. It is critical to identify cardiac illness as soon as possible, as well as swiftly and correctly as possible. Complex medical data is analyzed by various data mining and machine learning techniques in literature. The findings of the in-depth examination of these research articles are extremely convincing and accurate, but the future scope of these papers reflects the need for more significant characteristics and abundant standardized data, as well as the employment of different algorithms to achieve better accuracy rates. This research paper compares Random Forest algorithm with nearest neighbor (KNN) and Naïve Bayes on standard datasets from Cleveland database and Statlog Heart Disease of University of California Irvine (UCI) repository. The major goal of the research study is to get meaningful outcomes. With only 13 characteristics, we were able to get some extremely encouraging outcomes. The results validate Random Forest Classifier with accuracy of 93.02 %, significantly outperformed Naive Bayes and KNN which have accuracy of 83.72% and 90.69% respectively.","PeriodicalId":431272,"journal":{"name":"2021 Sixth International Conference on Image Information Processing (ICIIP)","volume":"73 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 Sixth International Conference on Image Information Processing (ICIIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIIP53038.2021.9702625","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Heart attacks and strokes account for 85 percent of these fatalities. Unhealthy food, lack of physical exercise, cigarette smoking, and excessive alcohol use are all major behavioral risk factors for CVDs. These variables can lead to high blood pressure, high blood glucose, high blood cholesterol, and obesity. It is critical to identify cardiac illness as soon as possible, as well as swiftly and correctly as possible. Complex medical data is analyzed by various data mining and machine learning techniques in literature. The findings of the in-depth examination of these research articles are extremely convincing and accurate, but the future scope of these papers reflects the need for more significant characteristics and abundant standardized data, as well as the employment of different algorithms to achieve better accuracy rates. This research paper compares Random Forest algorithm with nearest neighbor (KNN) and Naïve Bayes on standard datasets from Cleveland database and Statlog Heart Disease of University of California Irvine (UCI) repository. The major goal of the research study is to get meaningful outcomes. With only 13 characteristics, we were able to get some extremely encouraging outcomes. The results validate Random Forest Classifier with accuracy of 93.02 %, significantly outperformed Naive Bayes and KNN which have accuracy of 83.72% and 90.69% respectively.