{"title":"The Classification of Tumor Using Gene Expression Profile Based on Support Vector Machines and Factor Analysis","authors":"Shulin Wang, Ji Wang, Huowang Chen, Wensheng Tang","doi":"10.1109/ISDA.2006.253882","DOIUrl":null,"url":null,"abstract":"Gene expression data that is being used to gather information from tissue samples is expected to significantly improve the development of efficient tumor diagnosis and to provide understanding and insight into tumor related cellular processes. In this paper, we propose a novel feature selection approach which integrates the feature score criterion with factor analysis to further improve the SVM-based classification performance of gene expression data. We examine two sets of published gene expression data to validate the novel feature selection method by means of SVM classifier with different parameters. Experiments show that the proposed hybrid method can select a small quantity of principal factors to represent a large number of genes and SVM has a superior classification performance with the common factors which are extracted from gene expression data. Moreover, experiment results demonstrate successful cross-validation accuracy of 92% for the colon dataset and 100% for the leukemia dataset","PeriodicalId":116729,"journal":{"name":"Sixth International Conference on Intelligent Systems Design and Applications","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2006-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Sixth International Conference on Intelligent Systems Design and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISDA.2006.253882","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
Gene expression data that is being used to gather information from tissue samples is expected to significantly improve the development of efficient tumor diagnosis and to provide understanding and insight into tumor related cellular processes. In this paper, we propose a novel feature selection approach which integrates the feature score criterion with factor analysis to further improve the SVM-based classification performance of gene expression data. We examine two sets of published gene expression data to validate the novel feature selection method by means of SVM classifier with different parameters. Experiments show that the proposed hybrid method can select a small quantity of principal factors to represent a large number of genes and SVM has a superior classification performance with the common factors which are extracted from gene expression data. Moreover, experiment results demonstrate successful cross-validation accuracy of 92% for the colon dataset and 100% for the leukemia dataset