{"title":"Application of reactive GRASP to the biclustering of gene expression data","authors":"Shyama Das, S. M. Idicula","doi":"10.1145/1722024.1722041","DOIUrl":null,"url":null,"abstract":"A bicluster in gene expression dataset is a subset of genes that exhibit similar expression patterns through a subset of conditions. In this work biclusters are identified in two steps. In the first step high quality bicluster seeds are generated using KMeans clustering algorithm. These seeds are then enlarged using Reactive Greedy Randomized Adaptive Search Procedure (RGRASP) which is a multi-start metaheuristic method in which there are two phases, construction and local search. The objective here is to identify biclusters of maximum size with MSR lower than a given threshold. Experiments are conducted on both Yeast and Human Lymphoma datasets. The Experimental results on the benchmark datasets demonstrate that RGRASP is capable of identifying high quality biclusters compared to many of the already existing biclustering algorithms. Compared to the already existing algorithm based on the same RGRASP metaheuristics biclusters with larger size and lower mean squared residue are obtained using this algorithm in Yeast dataset. Moreover in this study the RGRASP is applied for the first time to find biclusters from the Human Lymphoma dataset.","PeriodicalId":39379,"journal":{"name":"In Silico Biology","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2010-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1145/1722024.1722041","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"In Silico Biology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/1722024.1722041","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 7
Abstract
A bicluster in gene expression dataset is a subset of genes that exhibit similar expression patterns through a subset of conditions. In this work biclusters are identified in two steps. In the first step high quality bicluster seeds are generated using KMeans clustering algorithm. These seeds are then enlarged using Reactive Greedy Randomized Adaptive Search Procedure (RGRASP) which is a multi-start metaheuristic method in which there are two phases, construction and local search. The objective here is to identify biclusters of maximum size with MSR lower than a given threshold. Experiments are conducted on both Yeast and Human Lymphoma datasets. The Experimental results on the benchmark datasets demonstrate that RGRASP is capable of identifying high quality biclusters compared to many of the already existing biclustering algorithms. Compared to the already existing algorithm based on the same RGRASP metaheuristics biclusters with larger size and lower mean squared residue are obtained using this algorithm in Yeast dataset. Moreover in this study the RGRASP is applied for the first time to find biclusters from the Human Lymphoma dataset.
In Silico BiologyComputer Science-Computational Theory and Mathematics
CiteScore
2.20
自引率
0.00%
发文量
1
期刊介绍:
The considerable "algorithmic complexity" of biological systems requires a huge amount of detailed information for their complete description. Although far from being complete, the overwhelming quantity of small pieces of information gathered for all kind of biological systems at the molecular and cellular level requires computational tools to be adequately stored and interpreted. Interpretation of data means to abstract them as much as allowed to provide a systematic, an integrative view of biology. Most of the presently available scientific journals focus either on accumulating more data from elaborate experimental approaches, or on presenting new algorithms for the interpretation of these data. Both approaches are meritorious.