Improving Gene Expression Prediction of Cancer Data Using Nature Inspired Optimization Algorithms

Payal Patel, K. Passi, Chakresh Kumar Jain
{"title":"Improving Gene Expression Prediction of Cancer Data Using Nature Inspired Optimization Algorithms","authors":"Payal Patel, K. Passi, Chakresh Kumar Jain","doi":"10.1109/CSCI54926.2021.00128","DOIUrl":null,"url":null,"abstract":"Cancer being one of the most vital diseases in the medical history needs adequate focus on its causes, symptoms and detection. Various algorithms and software have been designed so far to predict the cancer at cellular level. The most crucial aspect for sorting the cancerous tissues is the classification of such tissues based on the gene expression data. Gene expression data consists of high amount of genetic data as compared to the number of data samples. Thus, sample size and dimensions are a major challenge for researchers. In this work, four different types of cancer microarray datasets are analyzed viz., breast cancer, lung cancer, leukemia and colon cancer. The analysis of the cancer microarray datasets was done using various nature-inspired algorithms like Grasshopper Optimization (GOA), Particle Swarm Optimization (PSO), and Interval Value-based Particle Swarm Optimization (IVPSO). To study the accuracy of the prediction, five different classifiers were used: Random Forest, K-Nearest Neighborhood (KNN), Neural Network, Naïve Bayes and Support Vector Machine (SVM). The Grasshopper Optimization (GOA) outperforms in accuracy compared to the other two optimization algorithms with SVM classifier on leukemia, lung and breast cancer datasets selecting the best genes/attributes to correctly classify the dataset.","PeriodicalId":206881,"journal":{"name":"2021 International Conference on Computational Science and Computational Intelligence (CSCI)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Computational Science and Computational Intelligence (CSCI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSCI54926.2021.00128","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Cancer being one of the most vital diseases in the medical history needs adequate focus on its causes, symptoms and detection. Various algorithms and software have been designed so far to predict the cancer at cellular level. The most crucial aspect for sorting the cancerous tissues is the classification of such tissues based on the gene expression data. Gene expression data consists of high amount of genetic data as compared to the number of data samples. Thus, sample size and dimensions are a major challenge for researchers. In this work, four different types of cancer microarray datasets are analyzed viz., breast cancer, lung cancer, leukemia and colon cancer. The analysis of the cancer microarray datasets was done using various nature-inspired algorithms like Grasshopper Optimization (GOA), Particle Swarm Optimization (PSO), and Interval Value-based Particle Swarm Optimization (IVPSO). To study the accuracy of the prediction, five different classifiers were used: Random Forest, K-Nearest Neighborhood (KNN), Neural Network, Naïve Bayes and Support Vector Machine (SVM). The Grasshopper Optimization (GOA) outperforms in accuracy compared to the other two optimization algorithms with SVM classifier on leukemia, lung and breast cancer datasets selecting the best genes/attributes to correctly classify the dataset.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用自然启发的优化算法改进癌症数据的基因表达预测
癌症作为医学史上最重要的疾病之一,需要充分关注其病因、症状和检测。到目前为止,已经设计了各种算法和软件来预测细胞水平上的癌症。癌组织分类最关键的方面是基于基因表达数据对癌组织进行分类。基因表达数据由大量的基因数据组成,与数据样本的数量相比。因此,样本的大小和尺寸是研究人员面临的主要挑战。在这项工作中,我们分析了四种不同类型的癌症微阵列数据集,即乳腺癌、肺癌、白血病和结肠癌。癌症微阵列数据集的分析使用了各种受自然启发的算法,如Grasshopper Optimization (GOA)、Particle Swarm Optimization (PSO)和Interval Value-based Particle Swarm Optimization (IVPSO)。为了研究预测的准确性,我们使用了五种不同的分类器:随机森林、k近邻(KNN)、神经网络、Naïve贝叶斯和支持向量机(SVM)。在白血病、肺癌和乳腺癌数据集上,Grasshopper Optimization (GOA)通过选择最佳的基因/属性对数据集进行正确分类,在准确率上优于其他两种SVM分类算法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Remote Video Surveillance Effects of Social Distancing Intention, Affective Risk Perception, and Cabin Fever Syndrome on Perceived Value of E-learning : Type of submission: Late Breaking Paper / Most relevant symposium: CSCI-ISED Cybersecurity Integration: Deploying Critical Infrastructure Security and Resilience Topics into the Undergraduate Curriculum Distributed Algorithms for k-Coverage in Mobile Sensor Networks Software Development Methodologies for Virtual Reality
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1