{"title":"监督基因聚类从微阵列数据中提取判别特征","authors":"C. Das, P. Maji, Samiran Chattopadhyay","doi":"10.1109/INDCON.2010.5712629","DOIUrl":null,"url":null,"abstract":"Among the large number of genes presented in microarray data, only a small fraction of them are effective for performing a certain diagnostic test. However, it is very difficult to identify these genes for disease diagnosis. In this regard, a new supervised gene clustering algorithm is proposed to cluster genes from microarray data. The proposed method directly incorporates the information of response variables in the grouping process for finding such groups of genes. Significant cluster representatives are then taken to form the reduced feature set that can be used to build the classifiers with very high classification accuracy. The effectiveness of the proposed method, along with a comparison with existing methods, is demonstrated on three microarray data sets based on predictive accuracy of the naive Bayes'classifier, the K-nearest neighbor rule, and the support vector machine.","PeriodicalId":109071,"journal":{"name":"2010 Annual IEEE India Conference (INDICON)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Supervised gene clustering for extraction of discriminative features from microarray data\",\"authors\":\"C. Das, P. Maji, Samiran Chattopadhyay\",\"doi\":\"10.1109/INDCON.2010.5712629\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Among the large number of genes presented in microarray data, only a small fraction of them are effective for performing a certain diagnostic test. However, it is very difficult to identify these genes for disease diagnosis. In this regard, a new supervised gene clustering algorithm is proposed to cluster genes from microarray data. The proposed method directly incorporates the information of response variables in the grouping process for finding such groups of genes. Significant cluster representatives are then taken to form the reduced feature set that can be used to build the classifiers with very high classification accuracy. The effectiveness of the proposed method, along with a comparison with existing methods, is demonstrated on three microarray data sets based on predictive accuracy of the naive Bayes'classifier, the K-nearest neighbor rule, and the support vector machine.\",\"PeriodicalId\":109071,\"journal\":{\"name\":\"2010 Annual IEEE India Conference (INDICON)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 Annual IEEE India Conference (INDICON)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/INDCON.2010.5712629\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 Annual IEEE India Conference (INDICON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INDCON.2010.5712629","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Supervised gene clustering for extraction of discriminative features from microarray data
Among the large number of genes presented in microarray data, only a small fraction of them are effective for performing a certain diagnostic test. However, it is very difficult to identify these genes for disease diagnosis. In this regard, a new supervised gene clustering algorithm is proposed to cluster genes from microarray data. The proposed method directly incorporates the information of response variables in the grouping process for finding such groups of genes. Significant cluster representatives are then taken to form the reduced feature set that can be used to build the classifiers with very high classification accuracy. The effectiveness of the proposed method, along with a comparison with existing methods, is demonstrated on three microarray data sets based on predictive accuracy of the naive Bayes'classifier, the K-nearest neighbor rule, and the support vector machine.