{"title":"ESR: Optimizing Gene Feature Selection for scRNA-seq Data","authors":"Tao Huang, Xiang Chen, Li Peng","doi":"10.1109/CSCloud-EdgeCom58631.2023.00079","DOIUrl":null,"url":null,"abstract":"The rapid development of single-cell RNA sequencing (scRNA-seq) technology has enabled researchers to explore gene expression differences at the level of individual cells, revealing more refined cell types and states. However, due to the low expression and high noise of scRNA-seq data, feature selection has become particularly important in the analysis of single-cell data. Here, we introduce the Entropy Stepwise Regression (ESR) method for feature selection. This method utilizes the correlation between genes and the entropy values of each feature to filter out genes that are conducive to downstream analysis. In mouse kidney samples, we compared the performance of three methods in terms of Adjusted Rand Index and achieved good results. This indicates that the method can improve the accuracy of downstream analysis.","PeriodicalId":56007,"journal":{"name":"Journal of Cloud Computing-Advances Systems and Applications","volume":"20 1","pages":"429-433"},"PeriodicalIF":3.7000,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Cloud Computing-Advances Systems and Applications","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1109/CSCloud-EdgeCom58631.2023.00079","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
The rapid development of single-cell RNA sequencing (scRNA-seq) technology has enabled researchers to explore gene expression differences at the level of individual cells, revealing more refined cell types and states. However, due to the low expression and high noise of scRNA-seq data, feature selection has become particularly important in the analysis of single-cell data. Here, we introduce the Entropy Stepwise Regression (ESR) method for feature selection. This method utilizes the correlation between genes and the entropy values of each feature to filter out genes that are conducive to downstream analysis. In mouse kidney samples, we compared the performance of three methods in terms of Adjusted Rand Index and achieved good results. This indicates that the method can improve the accuracy of downstream analysis.
期刊介绍:
The Journal of Cloud Computing: Advances, Systems and Applications (JoCCASA) will publish research articles on all aspects of Cloud Computing. Principally, articles will address topics that are core to Cloud Computing, focusing on the Cloud applications, the Cloud systems, and the advances that will lead to the Clouds of the future. Comprehensive review and survey articles that offer up new insights, and lay the foundations for further exploratory and experimental work, are also relevant.