{"title":"A Novel and Efficient Rough Set Based Clustering Technique for Gene Expression Data","authors":"K. Adhikary, Suman Das, Samir Roy","doi":"10.1109/ICBIM.2014.6970930","DOIUrl":null,"url":null,"abstract":"Gene expressions with similar patterns are clustered, which help us to understand the functions of unknown and abnormal patterns of genes in future. The major task of gene expression data clustering is to identify groups of co-expressed genes. In this regard a new gene expression clustering method, termed as A Novel and Efficient Rough Set Based Clustering Technique for Gene Expression Data (NRSBCGE), is proposed based on the Rough set theory. This method is designed intelligently as it itself detects the optimum number of clusters. The proposed clustering method provides an efficient way of finding the unique gene expression patterns. The method was experimented with two publicly available cancer datasets and the results were compared with two existing methods of clustering. The effectiveness of the proposed method, along with a comparison with existing Rough set based gene selection and clustering algorithms, is demonstrated based on the silhouette index, which provides better result than the previously proposed methods.","PeriodicalId":6549,"journal":{"name":"2014 2nd International Conference on Business and Information Management (ICBIM)","volume":"24 3","pages":"41-46"},"PeriodicalIF":0.0000,"publicationDate":"2014-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 2nd International Conference on Business and Information Management (ICBIM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICBIM.2014.6970930","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Gene expressions with similar patterns are clustered, which help us to understand the functions of unknown and abnormal patterns of genes in future. The major task of gene expression data clustering is to identify groups of co-expressed genes. In this regard a new gene expression clustering method, termed as A Novel and Efficient Rough Set Based Clustering Technique for Gene Expression Data (NRSBCGE), is proposed based on the Rough set theory. This method is designed intelligently as it itself detects the optimum number of clusters. The proposed clustering method provides an efficient way of finding the unique gene expression patterns. The method was experimented with two publicly available cancer datasets and the results were compared with two existing methods of clustering. The effectiveness of the proposed method, along with a comparison with existing Rough set based gene selection and clustering algorithms, is demonstrated based on the silhouette index, which provides better result than the previously proposed methods.