基于L1/2+2正则化的特征基因选择

Zhenyu He, Yong Liang, Ling Huang, Wenzhong Wang, Jinfeng Wang
{"title":"基于L1/2+2正则化的特征基因选择","authors":"Zhenyu He, Yong Liang, Ling Huang, Wenzhong Wang, Jinfeng Wang","doi":"10.1145/3498731.3498737","DOIUrl":null,"url":null,"abstract":"Cancer is one of the great medical problems that mankind is facing today. With the help of DNA microarray, we can analyze thousands of genes simultaneously. The analysis of cancer samples with microarray technique is a hot topic in the field of bioinformatics. There are usually quite a lot genes in the microarray datasets, so it is time-consuming for us to classify samples with all these genes. For this reason, it is necessary for us to conduct feature gene selection. Regularization can serve as a method for feature selection. In this paper, we proposed a method called L1/2+2 and Fuzzy Measure Gene Selection (LFMGS). The method can be divided into two parts. Firstly, the L1/2+2 regularization is adopted to remove most of genes. Then L1/2+2 regularization and fuzzy measure are combined to obtain the sparse solution of fuzzy measures, and then a small number of genes are eliminated based on the final gene rank. Experimental results on seven datasets show the superiority of our method over the other four methods comprehensively considering accuracy, sensitivity and specificity, and the number of selected genes.","PeriodicalId":166893,"journal":{"name":"Proceedings of the 2021 10th International Conference on Bioinformatics and Biomedical Science","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Feature Gene Selection based on L1/2+2 Regularization\",\"authors\":\"Zhenyu He, Yong Liang, Ling Huang, Wenzhong Wang, Jinfeng Wang\",\"doi\":\"10.1145/3498731.3498737\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Cancer is one of the great medical problems that mankind is facing today. With the help of DNA microarray, we can analyze thousands of genes simultaneously. The analysis of cancer samples with microarray technique is a hot topic in the field of bioinformatics. There are usually quite a lot genes in the microarray datasets, so it is time-consuming for us to classify samples with all these genes. For this reason, it is necessary for us to conduct feature gene selection. Regularization can serve as a method for feature selection. In this paper, we proposed a method called L1/2+2 and Fuzzy Measure Gene Selection (LFMGS). The method can be divided into two parts. Firstly, the L1/2+2 regularization is adopted to remove most of genes. Then L1/2+2 regularization and fuzzy measure are combined to obtain the sparse solution of fuzzy measures, and then a small number of genes are eliminated based on the final gene rank. Experimental results on seven datasets show the superiority of our method over the other four methods comprehensively considering accuracy, sensitivity and specificity, and the number of selected genes.\",\"PeriodicalId\":166893,\"journal\":{\"name\":\"Proceedings of the 2021 10th International Conference on Bioinformatics and Biomedical Science\",\"volume\":\"4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2021 10th International Conference on Bioinformatics and Biomedical Science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3498731.3498737\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2021 10th International Conference on Bioinformatics and Biomedical Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3498731.3498737","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

癌症是当今人类面临的重大医学问题之一。在DNA微阵列的帮助下,我们可以同时分析数千个基因。利用微阵列技术对肿瘤样本进行分析是生物信息学领域的研究热点。通常在微阵列数据集中有相当多的基因,因此我们对所有这些基因的样本进行分类是非常耗时的。因此,我们有必要进行特征基因选择。正则化可以作为特征选择的一种方法。本文提出了一种名为L1/2+2和模糊测度基因选择(LFMGS)的方法。该方法可分为两部分。首先,采用L1/2+2正则化去除大部分基因;然后将L1/2+2正则化与模糊测度相结合,得到模糊测度的稀疏解,再根据最终的基因秩剔除少量基因。在7个数据集上的实验结果表明,综合考虑准确性、敏感性和特异性以及所选基因的数量,我们的方法优于其他4种方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Feature Gene Selection based on L1/2+2 Regularization
Cancer is one of the great medical problems that mankind is facing today. With the help of DNA microarray, we can analyze thousands of genes simultaneously. The analysis of cancer samples with microarray technique is a hot topic in the field of bioinformatics. There are usually quite a lot genes in the microarray datasets, so it is time-consuming for us to classify samples with all these genes. For this reason, it is necessary for us to conduct feature gene selection. Regularization can serve as a method for feature selection. In this paper, we proposed a method called L1/2+2 and Fuzzy Measure Gene Selection (LFMGS). The method can be divided into two parts. Firstly, the L1/2+2 regularization is adopted to remove most of genes. Then L1/2+2 regularization and fuzzy measure are combined to obtain the sparse solution of fuzzy measures, and then a small number of genes are eliminated based on the final gene rank. Experimental results on seven datasets show the superiority of our method over the other four methods comprehensively considering accuracy, sensitivity and specificity, and the number of selected genes.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A Meshfree Method for Deformation Field Reconstruction of Soft Tissue in Needle Insertion A Systematic Review of National Drug Negotiations Use of machine learning to predict abandonment rates in an emergency department A study of healthcare associated infections in the Intensive Care Unit of “Federico II” University Hospital through Logistic Regression The Role of Circulating Tumor Cells in Diagnosis of Cancer: Cancer and Circulating Tumor Cells
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1