{"title":"Selecting informative genes by Lasso and Dantzig selector for linear classifiers","authors":"Songfeng Zheng, Weixiang Liu","doi":"10.1109/BIBM.2010.5706651","DOIUrl":null,"url":null,"abstract":"Automatically selecting a subset of genes with strong discriminative power is a very important step in classification problems based on gene expression data. Lasso and Dantzig selector are known to have automatic variable selection ability in linear regression analysis. This paper employs Lasso and Dantzig selector to select most informative genes for representing the class label as a linear function of gene expression data. The selected genes are further used to fit linear classifiers for cancer classification. On 3 publicly available cancer datasets, the experimental results show that in general, Lasso is more capable than Dantzig selector in selecting informative genes for classification.","PeriodicalId":275098,"journal":{"name":"2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBM.2010.5706651","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Automatically selecting a subset of genes with strong discriminative power is a very important step in classification problems based on gene expression data. Lasso and Dantzig selector are known to have automatic variable selection ability in linear regression analysis. This paper employs Lasso and Dantzig selector to select most informative genes for representing the class label as a linear function of gene expression data. The selected genes are further used to fit linear classifiers for cancer classification. On 3 publicly available cancer datasets, the experimental results show that in general, Lasso is more capable than Dantzig selector in selecting informative genes for classification.