{"title":"基于支持向量机和因子分析的基因表达谱的肿瘤分类","authors":"Shulin Wang, Ji Wang, Huowang Chen, Wensheng Tang","doi":"10.1109/ISDA.2006.253882","DOIUrl":null,"url":null,"abstract":"Gene expression data that is being used to gather information from tissue samples is expected to significantly improve the development of efficient tumor diagnosis and to provide understanding and insight into tumor related cellular processes. In this paper, we propose a novel feature selection approach which integrates the feature score criterion with factor analysis to further improve the SVM-based classification performance of gene expression data. We examine two sets of published gene expression data to validate the novel feature selection method by means of SVM classifier with different parameters. Experiments show that the proposed hybrid method can select a small quantity of principal factors to represent a large number of genes and SVM has a superior classification performance with the common factors which are extracted from gene expression data. Moreover, experiment results demonstrate successful cross-validation accuracy of 92% for the colon dataset and 100% for the leukemia dataset","PeriodicalId":116729,"journal":{"name":"Sixth International Conference on Intelligent Systems Design and Applications","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2006-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"The Classification of Tumor Using Gene Expression Profile Based on Support Vector Machines and Factor Analysis\",\"authors\":\"Shulin Wang, Ji Wang, Huowang Chen, Wensheng Tang\",\"doi\":\"10.1109/ISDA.2006.253882\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Gene expression data that is being used to gather information from tissue samples is expected to significantly improve the development of efficient tumor diagnosis and to provide understanding and insight into tumor related cellular processes. In this paper, we propose a novel feature selection approach which integrates the feature score criterion with factor analysis to further improve the SVM-based classification performance of gene expression data. We examine two sets of published gene expression data to validate the novel feature selection method by means of SVM classifier with different parameters. Experiments show that the proposed hybrid method can select a small quantity of principal factors to represent a large number of genes and SVM has a superior classification performance with the common factors which are extracted from gene expression data. Moreover, experiment results demonstrate successful cross-validation accuracy of 92% for the colon dataset and 100% for the leukemia dataset\",\"PeriodicalId\":116729,\"journal\":{\"name\":\"Sixth International Conference on Intelligent Systems Design and Applications\",\"volume\":\"11 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2006-10-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Sixth International Conference on Intelligent Systems Design and Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISDA.2006.253882\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Sixth International Conference on Intelligent Systems Design and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISDA.2006.253882","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
The Classification of Tumor Using Gene Expression Profile Based on Support Vector Machines and Factor Analysis
Gene expression data that is being used to gather information from tissue samples is expected to significantly improve the development of efficient tumor diagnosis and to provide understanding and insight into tumor related cellular processes. In this paper, we propose a novel feature selection approach which integrates the feature score criterion with factor analysis to further improve the SVM-based classification performance of gene expression data. We examine two sets of published gene expression data to validate the novel feature selection method by means of SVM classifier with different parameters. Experiments show that the proposed hybrid method can select a small quantity of principal factors to represent a large number of genes and SVM has a superior classification performance with the common factors which are extracted from gene expression data. Moreover, experiment results demonstrate successful cross-validation accuracy of 92% for the colon dataset and 100% for the leukemia dataset