{"title":"Gene Selection for Single-Cell RNA-Seq Data Based on Information Gain and Genetic Algorithm","authors":"Jie Zhang, Junhong Feng","doi":"10.1109/CIS2018.2018.00021","DOIUrl":null,"url":null,"abstract":"Single-cell RNA-seq data often contain tens of and thousands genes, while too many of them are redundant genes as well as inferior genes. In this study, the information gain is used to coarsely remove those redundant and inferior genes, then a new designed genetic algorithm with a dynamic crossover operator is used to finely select the most important genes. The new feature selection algorithm is abbreviated as IGGA. The difference between the IGGA and the existing methods lies in that IGGA is designed at the first time to select genes from single-cell RNA-seq data. Experimental results performing on several real datasets demonstrate that the proposed algorithm can efficiently select the most important genes.","PeriodicalId":185099,"journal":{"name":"2018 14th International Conference on Computational Intelligence and Security (CIS)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 14th International Conference on Computational Intelligence and Security (CIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIS2018.2018.00021","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Single-cell RNA-seq data often contain tens of and thousands genes, while too many of them are redundant genes as well as inferior genes. In this study, the information gain is used to coarsely remove those redundant and inferior genes, then a new designed genetic algorithm with a dynamic crossover operator is used to finely select the most important genes. The new feature selection algorithm is abbreviated as IGGA. The difference between the IGGA and the existing methods lies in that IGGA is designed at the first time to select genes from single-cell RNA-seq data. Experimental results performing on several real datasets demonstrate that the proposed algorithm can efficiently select the most important genes.