{"title":"Incremental non-gaussian analysis of microarray gene expression data","authors":"Kam Swee Ng, Hyung-Jeong Yang, Sun-Hee Kim","doi":"10.1145/1651318.1651334","DOIUrl":null,"url":null,"abstract":"The microarray is gaining popularity in biomedical research due to its ability to analyze hundreds to thousands of genes simultaneously in one experiment. However, the unique nature of microarray data, with a large number of features but relative small number of samples, poses challenges to process the microarray data effectively. The curse of dimensionality introduces the importance of feature extraction in analyzing microarray data. Therefore, we propose a novel incremental method to discover the non-Gaussian weight from the microarray gene expression data with high efficiency. Our proposed method can discover a small number of compact features from a huge number of genes and can still achieve good predictive performance. It integrates non-gaussianity and an adaptive incremental model in an unsupervised way to extract informative features. It is also plausible to analyze microarray data with the number of features much larger than number of observations with promising results.","PeriodicalId":143937,"journal":{"name":"Data and Text Mining in Bioinformatics","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data and Text Mining in Bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/1651318.1651334","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The microarray is gaining popularity in biomedical research due to its ability to analyze hundreds to thousands of genes simultaneously in one experiment. However, the unique nature of microarray data, with a large number of features but relative small number of samples, poses challenges to process the microarray data effectively. The curse of dimensionality introduces the importance of feature extraction in analyzing microarray data. Therefore, we propose a novel incremental method to discover the non-Gaussian weight from the microarray gene expression data with high efficiency. Our proposed method can discover a small number of compact features from a huge number of genes and can still achieve good predictive performance. It integrates non-gaussianity and an adaptive incremental model in an unsupervised way to extract informative features. It is also plausible to analyze microarray data with the number of features much larger than number of observations with promising results.