{"title":"Feature Identification from Imbalanced Data Sets for Diagnosis of Cardiac Arrhythmia","authors":"Lijun Liang, Tingting Jin, Meiya Huo","doi":"10.1109/ISCID.2018.10113","DOIUrl":null,"url":null,"abstract":"Considering the problem of class imbalance, the method of feature identification for disease diagnosis based on equalization methods and elastic net is proposed. First, the cardiac arrhythmia dataset from UCI machine learning repository is dealt with by random under-sampling method and synthetic minority over-sampling technique (SMOTE). Then, the feature sequence of the dataset is identified by elastic net. Finally, “one-against-one” classification strategy of multi-classes SVM classifier is employed to test the classification accuracies of different diagnosis rules. Experiments show that the average accuracy achieves 77.36%, when the first nine attributes in the feature sequence are used as diagnostic rules. Compared with other approaches in the literatures, the presented algorithm is more effective in removing redundant features and can better identify the type of arrhythmia.","PeriodicalId":294370,"journal":{"name":"International Symposium on Computational Intelligence and Design","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Symposium on Computational Intelligence and Design","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCID.2018.10113","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Considering the problem of class imbalance, the method of feature identification for disease diagnosis based on equalization methods and elastic net is proposed. First, the cardiac arrhythmia dataset from UCI machine learning repository is dealt with by random under-sampling method and synthetic minority over-sampling technique (SMOTE). Then, the feature sequence of the dataset is identified by elastic net. Finally, “one-against-one” classification strategy of multi-classes SVM classifier is employed to test the classification accuracies of different diagnosis rules. Experiments show that the average accuracy achieves 77.36%, when the first nine attributes in the feature sequence are used as diagnostic rules. Compared with other approaches in the literatures, the presented algorithm is more effective in removing redundant features and can better identify the type of arrhythmia.