{"title":"Interrelated two-way clustering: an unsupervised approach for gene expression data analysis","authors":"Chun Tang, Li Zhang, A. Zhang, M. Ramanathan","doi":"10.1109/BIBE.2001.974410","DOIUrl":null,"url":null,"abstract":"DNA arrays can be used to measure the expression levels of thousands of genes simultaneously. Most research is focusing on interpretation of the meaning of the data. However, the majority of methods are supervised, with less attention having been paid to unsupervised approaches which are important when domain knowledge is incomplete or hard to obtain. In this paper we present a new framework for unsupervised analysis of gene expression data which applies an interrelated two-way clustering approach to the gene expression matrices. The goal of clustering is to find important gene patterns and perform cluster discovery on samples. The advantage of this approach is that we can dynamically use the relationships between the groups of genes and samples while iteratively clustering through both gene-dimension and sample-dimension. We illustrate the method on gene expression data from a study of multiple sclerosis patients. The experiments demonstrate the effectiveness of this approach.","PeriodicalId":405124,"journal":{"name":"Proceedings 2nd Annual IEEE International Symposium on Bioinformatics and Bioengineering (BIBE 2001)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2001-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"189","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings 2nd Annual IEEE International Symposium on Bioinformatics and Bioengineering (BIBE 2001)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBE.2001.974410","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 189
Abstract
DNA arrays can be used to measure the expression levels of thousands of genes simultaneously. Most research is focusing on interpretation of the meaning of the data. However, the majority of methods are supervised, with less attention having been paid to unsupervised approaches which are important when domain knowledge is incomplete or hard to obtain. In this paper we present a new framework for unsupervised analysis of gene expression data which applies an interrelated two-way clustering approach to the gene expression matrices. The goal of clustering is to find important gene patterns and perform cluster discovery on samples. The advantage of this approach is that we can dynamically use the relationships between the groups of genes and samples while iteratively clustering through both gene-dimension and sample-dimension. We illustrate the method on gene expression data from a study of multiple sclerosis patients. The experiments demonstrate the effectiveness of this approach.