增强新冠肺炎微阵列数据的特征选择优化

COVID Pub Date : 2023-09-04 DOI:10.3390/covid3090093
Gayani Krishanthi, H. Jayetileke, Jinran Wu, Chanjuan Liu, You-Gan Wang
{"title":"增强新冠肺炎微阵列数据的特征选择优化","authors":"Gayani Krishanthi, H. Jayetileke, Jinran Wu, Chanjuan Liu, You-Gan Wang","doi":"10.3390/covid3090093","DOIUrl":null,"url":null,"abstract":"The utilization of gene selection techniques is crucial when dealing with extensive datasets containing limited cases and numerous genes, as they enhance the learning processes and improve overall outcomes. In this research, we introduce a hybrid method that combines the binary reptile search algorithm (BRSA) with the LASSO regression method to effectively filter and reduce the dimensionality of a gene expression dataset. Our primary objective was to pinpoint genes associated with COVID-19 by examining the GSE149273 dataset, which focuses on respiratory viral (RV) infections in individuals with asthma. This dataset suggested a potential increase in ACE2 expression, a critical receptor for the SARS-CoV-2 virus, along with the activation of cytokine pathways linked to COVID-19. Our proposed BRSA method successfully identified six significant genes, including ACE2, IFIT5, and TRIM14, that are closely related to COVID-19, achieving an impressive maximum classification accuracy of 87.22%. By conducting a comparative analysis against four existing binary feature selection algorithms, we demonstrated the effectiveness of our hybrid approach in reducing the dimensionality of features, while maintaining a high classification accuracy. As a result, our hybrid approach shows great promise for identifying COVID-19-related genes and could be an invaluable tool for other studies dealing with very large gene expression datasets.","PeriodicalId":72714,"journal":{"name":"COVID","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Enhancing Feature Selection Optimization for COVID-19 Microarray Data\",\"authors\":\"Gayani Krishanthi, H. Jayetileke, Jinran Wu, Chanjuan Liu, You-Gan Wang\",\"doi\":\"10.3390/covid3090093\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The utilization of gene selection techniques is crucial when dealing with extensive datasets containing limited cases and numerous genes, as they enhance the learning processes and improve overall outcomes. In this research, we introduce a hybrid method that combines the binary reptile search algorithm (BRSA) with the LASSO regression method to effectively filter and reduce the dimensionality of a gene expression dataset. Our primary objective was to pinpoint genes associated with COVID-19 by examining the GSE149273 dataset, which focuses on respiratory viral (RV) infections in individuals with asthma. This dataset suggested a potential increase in ACE2 expression, a critical receptor for the SARS-CoV-2 virus, along with the activation of cytokine pathways linked to COVID-19. Our proposed BRSA method successfully identified six significant genes, including ACE2, IFIT5, and TRIM14, that are closely related to COVID-19, achieving an impressive maximum classification accuracy of 87.22%. By conducting a comparative analysis against four existing binary feature selection algorithms, we demonstrated the effectiveness of our hybrid approach in reducing the dimensionality of features, while maintaining a high classification accuracy. As a result, our hybrid approach shows great promise for identifying COVID-19-related genes and could be an invaluable tool for other studies dealing with very large gene expression datasets.\",\"PeriodicalId\":72714,\"journal\":{\"name\":\"COVID\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-09-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"COVID\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3390/covid3090093\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"COVID","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/covid3090093","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

在处理包含有限病例和大量基因的广泛数据集时,基因选择技术的使用至关重要,因为它们可以增强学习过程并提高整体结果。在本研究中,我们引入了一种混合方法,将二进制爬行动物搜索算法(BRSA)与LASSO回归方法相结合,以有效地过滤和降低基因表达数据集的维数。我们的主要目标是通过检查GSE149273数据集来确定与新冠肺炎相关的基因,该数据集专注于哮喘患者的呼吸道病毒(RV)感染。该数据集表明,ACE2表达可能增加,ACE2是SARS-CoV-2病毒的关键受体,同时激活与新冠肺炎相关的细胞因子途径。我们提出的BRSA方法成功识别了六个与新冠肺炎密切相关的重要基因,包括ACE2、IFIT5和TRIM14,达到了令人印象深刻的最高分类准确率87.22%,我们证明了我们的混合方法在降低特征维度的同时保持高分类精度方面的有效性。因此,我们的混合方法在识别COVID-19相关基因方面显示出巨大的前景,并可能成为其他处理超大基因表达数据集的研究的宝贵工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Enhancing Feature Selection Optimization for COVID-19 Microarray Data
The utilization of gene selection techniques is crucial when dealing with extensive datasets containing limited cases and numerous genes, as they enhance the learning processes and improve overall outcomes. In this research, we introduce a hybrid method that combines the binary reptile search algorithm (BRSA) with the LASSO regression method to effectively filter and reduce the dimensionality of a gene expression dataset. Our primary objective was to pinpoint genes associated with COVID-19 by examining the GSE149273 dataset, which focuses on respiratory viral (RV) infections in individuals with asthma. This dataset suggested a potential increase in ACE2 expression, a critical receptor for the SARS-CoV-2 virus, along with the activation of cytokine pathways linked to COVID-19. Our proposed BRSA method successfully identified six significant genes, including ACE2, IFIT5, and TRIM14, that are closely related to COVID-19, achieving an impressive maximum classification accuracy of 87.22%. By conducting a comparative analysis against four existing binary feature selection algorithms, we demonstrated the effectiveness of our hybrid approach in reducing the dimensionality of features, while maintaining a high classification accuracy. As a result, our hybrid approach shows great promise for identifying COVID-19-related genes and could be an invaluable tool for other studies dealing with very large gene expression datasets.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Impact of COVID-19 on Dental Students’ Mental Health Status and Perception of SARS-CoV-2 Vaccine SARS-CoV-2-Related Parotitis in Children: A Narrative-Focused Review 3Cs: Unleashing Capsule Networks for Robust COVID-19 Detection Using CT Images EFCAB4B (CRACR2A/Rab46) Genetic Variants Associated with COVID-19 Fatality Comparison of the Psychological Impact of COVID-19 on Healthcare Workers between 2022 and 2023 in a Romanian COVID-19 Hub Hospital
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1