{"title":"基于信息增益和遗传算法的单细胞RNA-Seq数据基因选择","authors":"Jie Zhang, Junhong Feng","doi":"10.1109/CIS2018.2018.00021","DOIUrl":null,"url":null,"abstract":"Single-cell RNA-seq data often contain tens of and thousands genes, while too many of them are redundant genes as well as inferior genes. In this study, the information gain is used to coarsely remove those redundant and inferior genes, then a new designed genetic algorithm with a dynamic crossover operator is used to finely select the most important genes. The new feature selection algorithm is abbreviated as IGGA. The difference between the IGGA and the existing methods lies in that IGGA is designed at the first time to select genes from single-cell RNA-seq data. Experimental results performing on several real datasets demonstrate that the proposed algorithm can efficiently select the most important genes.","PeriodicalId":185099,"journal":{"name":"2018 14th International Conference on Computational Intelligence and Security (CIS)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Gene Selection for Single-Cell RNA-Seq Data Based on Information Gain and Genetic Algorithm\",\"authors\":\"Jie Zhang, Junhong Feng\",\"doi\":\"10.1109/CIS2018.2018.00021\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Single-cell RNA-seq data often contain tens of and thousands genes, while too many of them are redundant genes as well as inferior genes. In this study, the information gain is used to coarsely remove those redundant and inferior genes, then a new designed genetic algorithm with a dynamic crossover operator is used to finely select the most important genes. The new feature selection algorithm is abbreviated as IGGA. The difference between the IGGA and the existing methods lies in that IGGA is designed at the first time to select genes from single-cell RNA-seq data. Experimental results performing on several real datasets demonstrate that the proposed algorithm can efficiently select the most important genes.\",\"PeriodicalId\":185099,\"journal\":{\"name\":\"2018 14th International Conference on Computational Intelligence and Security (CIS)\",\"volume\":\"9 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 14th International Conference on Computational Intelligence and Security (CIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CIS2018.2018.00021\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 14th International Conference on Computational Intelligence and Security (CIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIS2018.2018.00021","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Gene Selection for Single-Cell RNA-Seq Data Based on Information Gain and Genetic Algorithm
Single-cell RNA-seq data often contain tens of and thousands genes, while too many of them are redundant genes as well as inferior genes. In this study, the information gain is used to coarsely remove those redundant and inferior genes, then a new designed genetic algorithm with a dynamic crossover operator is used to finely select the most important genes. The new feature selection algorithm is abbreviated as IGGA. The difference between the IGGA and the existing methods lies in that IGGA is designed at the first time to select genes from single-cell RNA-seq data. Experimental results performing on several real datasets demonstrate that the proposed algorithm can efficiently select the most important genes.