{"title":"Applying Propensity Score and Support Vector Machine to Construct a Predictive Model for Heart Disease","authors":"Hsueh-Yi Lu","doi":"10.1145/3418094.3418117","DOIUrl":null,"url":null,"abstract":"Exercise ECG is currently the best way for diagnosing heart disease, but it is not suitable for everyone. This study used data mining to establish a model to predict the risk of heart disease. Maximum oxygen uptake (VO2max) was used as an indicator of determine that the person was a high-risk or low-risk heart disease patient. Data of the National Health and Nutrition Examination Survey from the United States were used in this study. Due to scattered distribution of the data, which diminished the prediction performance, this study proposed a novel method to stratify data with the propensity scores. The subsets of data were trained by the support vector machine to establish the prediction model. The results of this study showed that the model had an AUC of 0.899. Our model can make a more accurate prediction to identify whether a patient has a higher risk in heart disease.","PeriodicalId":192804,"journal":{"name":"Proceedings of the 4th International Conference on Medical and Health Informatics","volume":"44 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 4th International Conference on Medical and Health Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3418094.3418117","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Exercise ECG is currently the best way for diagnosing heart disease, but it is not suitable for everyone. This study used data mining to establish a model to predict the risk of heart disease. Maximum oxygen uptake (VO2max) was used as an indicator of determine that the person was a high-risk or low-risk heart disease patient. Data of the National Health and Nutrition Examination Survey from the United States were used in this study. Due to scattered distribution of the data, which diminished the prediction performance, this study proposed a novel method to stratify data with the propensity scores. The subsets of data were trained by the support vector machine to establish the prediction model. The results of this study showed that the model had an AUC of 0.899. Our model can make a more accurate prediction to identify whether a patient has a higher risk in heart disease.