Optimal Feature Selection from High-dimensional Microarray Dataset Employing Hybrid IG-Jaya Model

Bibhuprasad Sahu, S. Dash
{"title":"Optimal Feature Selection from High-dimensional Microarray Dataset Employing \nHybrid IG-Jaya Model","authors":"Bibhuprasad Sahu, S. Dash","doi":"10.2174/2666145416666230124143912","DOIUrl":null,"url":null,"abstract":"\n\nThis paper proposed a hybrid information gain and a Jaya algorithm-based\nmodel to identify the informative genes from the high dimensional microarray\ndata set.\n\n\n\nMetaheuristic algorithms need to tune the parameters to achieve better accuracy, and it is a tidy and sensitive job for all researchers. To solve the difficulties mentioned above, we proposed IG-Jaya, a new hybrid FS model based on wrapping information gain with the Jaya optimization algorithm (parameterless) to obtain the optimal features from the microarray\ndata set.\n\n\n\nThe objective behind considering Jaya is to minimize the\ncomputing cost and the risk of tuning the algorithm’s parameters to achieve\nbetter accuracy. This algorithm’s main strength is that it identifies the best\nfeature subset by updating the worst ones. The resulting feature subset can\nbe considered an input for the classification model.The primary contributions of the study are 1.Using IG as a technique, we developed a filter-based paradigm to employ feature selection by removing redundant and irrelevant features from microarray cancer datasets. 2. This new hybrid meta-heuristic FS model, namely IG-Jaya, is proposed for the efficient diagnosis of cancer disease. 3. Different metrics such as sensitivity, specificity, accuracy, and AUC-ROC Curve are used to study the performance of the hybrid model with various classifiers such as SVM, LDA, DT, NB, etc.\n\n\n\nThis\npaper’s overall investigation is divided into two phases: In the first\npart, without including any filter, we have used the parameter-less\nJA to identify featured gene subsets. And the performance of JA is\nevaluated using various classifiers like SVM, LDA, NB, and DT.\n\n\n\nFrom the resulting study, it is noteworthy to state that IG-JAYA performs better as compared to the existing models\n\n\n\nThis paper proposed a hybrid information gain and a Jaya algorithm-based model to identify the informative genes from the high dimensional microarray data set. The performance evaluation of the proposed model is done with 13 different benchmark data sets. To achieve better performance, we have focused on one of the best meta-heuristic parameter-less algorithms called JAYA. It used the solution''s fitness to gather the most feasible informative genes. And from the comparison table, we can also ensure the model''s performance. For some datasets, our proposed model cannot provide the best accuracy compared to other existing approaches; it is pretty steady and sound. The same model will be tested with different filter methods and real-time datasets in the subsequent study. A hybrid multi-filter Jaya algorithm will be proposed to check the efficiency of the proposed one. And it would be better to choose any other hybrid model with JAYA to enhance the feature selection accuracy with a high dimensional dataset.\n\n\n\nIn the future , it would be better to choose any other hybrid model (chaos-based) with JAYA to enhance the feature selection accuracy with a high-dimensional dataset.\n","PeriodicalId":36699,"journal":{"name":"Current Materials Science","volume":"20 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Current Materials Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2174/2666145416666230124143912","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

Abstract

This paper proposed a hybrid information gain and a Jaya algorithm-based model to identify the informative genes from the high dimensional microarray data set. Metaheuristic algorithms need to tune the parameters to achieve better accuracy, and it is a tidy and sensitive job for all researchers. To solve the difficulties mentioned above, we proposed IG-Jaya, a new hybrid FS model based on wrapping information gain with the Jaya optimization algorithm (parameterless) to obtain the optimal features from the microarray data set. The objective behind considering Jaya is to minimize the computing cost and the risk of tuning the algorithm’s parameters to achieve better accuracy. This algorithm’s main strength is that it identifies the best feature subset by updating the worst ones. The resulting feature subset can be considered an input for the classification model.The primary contributions of the study are 1.Using IG as a technique, we developed a filter-based paradigm to employ feature selection by removing redundant and irrelevant features from microarray cancer datasets. 2. This new hybrid meta-heuristic FS model, namely IG-Jaya, is proposed for the efficient diagnosis of cancer disease. 3. Different metrics such as sensitivity, specificity, accuracy, and AUC-ROC Curve are used to study the performance of the hybrid model with various classifiers such as SVM, LDA, DT, NB, etc. This paper’s overall investigation is divided into two phases: In the first part, without including any filter, we have used the parameter-less JA to identify featured gene subsets. And the performance of JA is evaluated using various classifiers like SVM, LDA, NB, and DT. From the resulting study, it is noteworthy to state that IG-JAYA performs better as compared to the existing models This paper proposed a hybrid information gain and a Jaya algorithm-based model to identify the informative genes from the high dimensional microarray data set. The performance evaluation of the proposed model is done with 13 different benchmark data sets. To achieve better performance, we have focused on one of the best meta-heuristic parameter-less algorithms called JAYA. It used the solution''s fitness to gather the most feasible informative genes. And from the comparison table, we can also ensure the model''s performance. For some datasets, our proposed model cannot provide the best accuracy compared to other existing approaches; it is pretty steady and sound. The same model will be tested with different filter methods and real-time datasets in the subsequent study. A hybrid multi-filter Jaya algorithm will be proposed to check the efficiency of the proposed one. And it would be better to choose any other hybrid model with JAYA to enhance the feature selection accuracy with a high dimensional dataset. In the future , it would be better to choose any other hybrid model (chaos-based) with JAYA to enhance the feature selection accuracy with a high-dimensional dataset.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于混合IG-Jaya模型的高维微阵列数据集特征优选
本文提出了一种混合信息增益和基于Jaya算法的模型来从高维微阵列数据集中识别信息基因。元启发式算法需要调整参数以达到更好的精度,这对所有研究人员来说都是一项整洁而敏感的工作。为了解决上述困难,我们提出了一种新的混合FS模型IG-Jaya,该模型基于Jaya优化算法(无参数)包裹信息增益,以从微阵列数据集中获得最优特征。考虑Jaya的目的是最小化计算成本和调整算法参数的风险,以达到更好的精度。该算法的主要优点是通过更新最差的特征子集来识别最佳特征子集。得到的特征子集可以作为分类模型的输入。本研究的主要贡献是:1。使用IG作为技术,我们开发了一个基于过滤器的范例,通过从微阵列癌症数据集中去除冗余和不相关的特征来进行特征选择。2. 提出了一种新的混合元启发式FS模型,即IG-Jaya,用于癌症疾病的有效诊断。3.本文采用敏感性、特异性、准确性、AUC-ROC曲线等不同的指标来研究混合模型与SVM、LDA、DT、NB等各种分类器的性能。本文的整体研究分为两个阶段:第一部分,我们在不包含任何滤波器的情况下,使用无参数ja来识别特征基因子集。并使用各种分类器(如SVM、LDA、NB和DT)评估JA的性能。从研究结果来看,值得注意的是,与现有模型相比,IG-JAYA表现更好。本文提出了一种混合信息增益和基于Jaya算法的模型来从高维微阵列数据集中识别信息基因。用13个不同的基准数据集对所提出的模型进行了性能评估。为了获得更好的性能,我们专注于一种最好的元启发式无参数算法,称为JAYA。它利用溶液的适应度来收集最可行的信息基因。通过对比表,我们也可以确定模型的性能。对于某些数据集,与其他现有方法相比,我们提出的模型不能提供最好的精度;这是相当稳定和健全的。在后续的研究中,同一模型将使用不同的滤波方法和实时数据集进行测试。提出一种混合多滤波器Jaya算法来检验所提算法的效率。在高维数据集上,为了提高特征选择的准确性,最好选择与JAYA混合的模型。未来,为了提高高维数据集的特征选择精度,最好选择其他混合模型(基于混沌)与JAYA。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Current Materials Science
Current Materials Science Materials Science-Materials Science (all)
CiteScore
0.80
自引率
0.00%
发文量
38
期刊最新文献
An Experimental Study on Compressive Properties of Composite Fiber Geopolymer Concrete Mechanical Properties of Fly Ash Geopolymer with Macadamia Nutshell Aggregates Synthesis of Form-stable Phase Change Materials for Application in Lunch Box to Keep the Food Warm Potential Biomolecule Fisetin: Molecular and Pharmacological Perspectives Investigating Thermal Decomposition Kinetics and Thermodynamic Parameters of Hydroxyl-Terminated Polybutadiene-based Energetic Composite
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1