Runzhi Li, Hongling Zhao, Yusong Lin, Andrew S. Maxwell, Chaoyang Zhang
{"title":"智能健康风险预测的多标签分类","authors":"Runzhi Li, Hongling Zhao, Yusong Lin, Andrew S. Maxwell, Chaoyang Zhang","doi":"10.1109/BIBM.2016.7822657","DOIUrl":null,"url":null,"abstract":"A Multi-Label Problem Transformation Joint Classification (MLPTJC) method is developed to solve the multi-label classification problem for the health and disease risk prediction based on physical examination records. We adopt a multi-class classification problem transformation method to transform the multi-label classification problem to a multi-class classification problem. Then We propose a Joint Decomposition Subset Classifier method to reduce the infrequent label sets to deal with the imbalance learning problem. Based on MLPTJC, existing cost-sensitive multi-class classification algorithms can be used to train the prediction models. We conduct some experiments to evaluate the performance of the MLPTJC method. The Support Vector Machine (SVM) and Random Forest (RF) algorithms are used for multi-class classification learning. We use the 10-fold cross-validation and metrics such as Average Accuracy, Precision, Recall and F-measure to evaluate the performance. The real physical examination records were employed, which include 62 examination items and 110, 300 anonymous patients. 8 types of diseases were predicted. The experimental results show that the MLPTJC method has better performance in terms of accuracy.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"75 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":"{\"title\":\"Multi-label classification for intelligent health risk prediction\",\"authors\":\"Runzhi Li, Hongling Zhao, Yusong Lin, Andrew S. Maxwell, Chaoyang Zhang\",\"doi\":\"10.1109/BIBM.2016.7822657\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A Multi-Label Problem Transformation Joint Classification (MLPTJC) method is developed to solve the multi-label classification problem for the health and disease risk prediction based on physical examination records. We adopt a multi-class classification problem transformation method to transform the multi-label classification problem to a multi-class classification problem. Then We propose a Joint Decomposition Subset Classifier method to reduce the infrequent label sets to deal with the imbalance learning problem. Based on MLPTJC, existing cost-sensitive multi-class classification algorithms can be used to train the prediction models. We conduct some experiments to evaluate the performance of the MLPTJC method. The Support Vector Machine (SVM) and Random Forest (RF) algorithms are used for multi-class classification learning. We use the 10-fold cross-validation and metrics such as Average Accuracy, Precision, Recall and F-measure to evaluate the performance. The real physical examination records were employed, which include 62 examination items and 110, 300 anonymous patients. 8 types of diseases were predicted. The experimental results show that the MLPTJC method has better performance in terms of accuracy.\",\"PeriodicalId\":345384,\"journal\":{\"name\":\"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)\",\"volume\":\"75 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"11\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/BIBM.2016.7822657\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBM.2016.7822657","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Multi-label classification for intelligent health risk prediction
A Multi-Label Problem Transformation Joint Classification (MLPTJC) method is developed to solve the multi-label classification problem for the health and disease risk prediction based on physical examination records. We adopt a multi-class classification problem transformation method to transform the multi-label classification problem to a multi-class classification problem. Then We propose a Joint Decomposition Subset Classifier method to reduce the infrequent label sets to deal with the imbalance learning problem. Based on MLPTJC, existing cost-sensitive multi-class classification algorithms can be used to train the prediction models. We conduct some experiments to evaluate the performance of the MLPTJC method. The Support Vector Machine (SVM) and Random Forest (RF) algorithms are used for multi-class classification learning. We use the 10-fold cross-validation and metrics such as Average Accuracy, Precision, Recall and F-measure to evaluate the performance. The real physical examination records were employed, which include 62 examination items and 110, 300 anonymous patients. 8 types of diseases were predicted. The experimental results show that the MLPTJC method has better performance in terms of accuracy.