GOG-MBSHO: multi-strategy fusion binary sea-horse optimizer with Gaussian transfer function for feature selection of cancer gene expression data

IF 10.7 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Artificial Intelligence Review Pub Date : 2024-10-28 DOI:10.1007/s10462-024-10954-5
Yu-Cai Wang, Hao-Ming Song, Jie-Sheng Wang, Yu-Wei Song, Yu-Liang Qi, Xin-Ru Ma
{"title":"GOG-MBSHO: multi-strategy fusion binary sea-horse optimizer with Gaussian transfer function for feature selection of cancer gene expression data","authors":"Yu-Cai Wang,&nbsp;Hao-Ming Song,&nbsp;Jie-Sheng Wang,&nbsp;Yu-Wei Song,&nbsp;Yu-Liang Qi,&nbsp;Xin-Ru Ma","doi":"10.1007/s10462-024-10954-5","DOIUrl":null,"url":null,"abstract":"<div><p>Cancer gene expression data has the characteristics of high-dimensional, multi-text and multi-classification. The problem of cancer subtype diagnosis can be solved by selecting the most representative and predictive genes from a large number of gene expression data. Feature selection technology can effectively reduce the dimension of data, which helps analyze the information on cancer gene expression data. A multi-strategy fusion binary sea-horse optimizer based on Gaussian transfer function (GOG-MBSHO) is proposed to solve the feature selection problem of cancer gene expression data. Firstly, the multi-strategy includes golden sine strategy, hippo escape strategy and multiple inertia weight strategies. The sea-horse optimizer with the golden sine strategy does not disrupt the structure of the original algorithm. Embedding the golden sine strategy within the spiral motion of the sea-horse optimizer enhances the movement of the algorithm and improves its global exploration and local exploitation capabilities. The hippo escape strategy is introduced for random selection, which avoids the algorithm from falling into local optima, increases the search diversity, and improves the optimization accuracy of the algorithm. The advantage of multiple inertial weight strategies is that dynamic exploitation and exploration can be carried out to accelerate the convergence speed and improve the performance of the algorithm. Then, the effectiveness of multi-strategy fusion was demonstrated by 15 UCI datasets. The simulation results show that the proposed Gaussian transfer function is better than the commonly used S-type and V-type transfer functions, which can improve the classification accuracy, effectively reduce the number of features, and obtain better fitness value. Finally, comparing with other binary swarm intelligent optimization algorithms on 15 cancer gene expression datasets, it is proved that the proposed GOG1-MBSHO has great advantages in the feature selection of cancer gene expression data.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"57 12","pages":""},"PeriodicalIF":10.7000,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-024-10954-5.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence Review","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s10462-024-10954-5","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Cancer gene expression data has the characteristics of high-dimensional, multi-text and multi-classification. The problem of cancer subtype diagnosis can be solved by selecting the most representative and predictive genes from a large number of gene expression data. Feature selection technology can effectively reduce the dimension of data, which helps analyze the information on cancer gene expression data. A multi-strategy fusion binary sea-horse optimizer based on Gaussian transfer function (GOG-MBSHO) is proposed to solve the feature selection problem of cancer gene expression data. Firstly, the multi-strategy includes golden sine strategy, hippo escape strategy and multiple inertia weight strategies. The sea-horse optimizer with the golden sine strategy does not disrupt the structure of the original algorithm. Embedding the golden sine strategy within the spiral motion of the sea-horse optimizer enhances the movement of the algorithm and improves its global exploration and local exploitation capabilities. The hippo escape strategy is introduced for random selection, which avoids the algorithm from falling into local optima, increases the search diversity, and improves the optimization accuracy of the algorithm. The advantage of multiple inertial weight strategies is that dynamic exploitation and exploration can be carried out to accelerate the convergence speed and improve the performance of the algorithm. Then, the effectiveness of multi-strategy fusion was demonstrated by 15 UCI datasets. The simulation results show that the proposed Gaussian transfer function is better than the commonly used S-type and V-type transfer functions, which can improve the classification accuracy, effectively reduce the number of features, and obtain better fitness value. Finally, comparing with other binary swarm intelligent optimization algorithms on 15 cancer gene expression datasets, it is proved that the proposed GOG1-MBSHO has great advantages in the feature selection of cancer gene expression data.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
GOG-MBSHO:采用高斯传递函数的多策略融合二元海马优化器,用于癌症基因表达数据的特征选择
癌症基因表达数据具有高维、多文本、多分类等特点。从大量基因表达数据中筛选出最具代表性和预测性的基因,可以解决癌症亚型诊断问题。特征选择技术能有效降低数据维度,有助于分析癌症基因表达数据信息。本文提出了一种基于高斯传递函数的多策略融合二元海马优化器(GOG-MBSHO)来解决癌症基因表达数据的特征选择问题。首先,多策略包括黄金正弦策略、河马逃逸策略和多惯性权重策略。采用金正弦策略的海马优化器不会破坏原始算法的结构。在海马优化器的螺旋运动中嵌入金正弦策略,增强了算法的运动能力,提高了全局探索和局部开发能力。引入河马逃逸策略进行随机选择,避免了算法陷入局部最优,增加了搜索多样性,提高了算法的优化精度。多惯性权重策略的优势在于可以进行动态利用和探索,加快收敛速度,提高算法性能。然后,通过 15 个 UCI 数据集证明了多策略融合的有效性。仿真结果表明,所提出的高斯传递函数优于常用的 S 型和 V 型传递函数,可以提高分类精度,有效减少特征数量,获得更好的适配值。最后,在 15 个癌症基因表达数据集上与其他二元蜂群智能优化算法进行比较,证明所提出的 GOG1-MBSHO 在癌症基因表达数据的特征选择方面具有很大优势。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Artificial Intelligence Review
Artificial Intelligence Review 工程技术-计算机:人工智能
CiteScore
22.00
自引率
3.30%
发文量
194
审稿时长
5.3 months
期刊介绍: Artificial Intelligence Review, a fully open access journal, publishes cutting-edge research in artificial intelligence and cognitive science. It features critical evaluations of applications, techniques, and algorithms, providing a platform for both researchers and application developers. The journal includes refereed survey and tutorial articles, along with reviews and commentary on significant developments in the field.
期刊最新文献
Federated learning design and functional models: survey A systematic literature review of recent advances on context-aware recommender systems Escape: an optimization method based on crowd evacuation behaviors A multi-strategy boosted bald eagle search algorithm for global optimization and constrained engineering problems: case study on MLP classification problems Innovative solution suggestions for financing electric vehicle charging infrastructure investments with a novel artificial intelligence-based fuzzy decision-making modelling
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1