Pancreatic cancer biomarker detection using recursive feature elimination based on Support Vector Machine and large margin distribution machine

Yidan Lv, Yan Wang, Yongfei Tan, Wei Du, Keke Liu, Hao Wang
{"title":"Pancreatic cancer biomarker detection using recursive feature elimination based on Support Vector Machine and large margin distribution machine","authors":"Yidan Lv, Yan Wang, Yongfei Tan, Wei Du, Keke Liu, Hao Wang","doi":"10.1109/ICSAI.2017.8248514","DOIUrl":null,"url":null,"abstract":"Pancreatic cancer acts as one of the leading causes of cancer-connective deaths. Its five-year overall survival rate being reported is about 7.7% from 2006 to 2012 by the National Cancer Institute. One of the main causes for its poor prognosis is because its non-typical symptoms make early diagnosis very challenging. Therefore, a predominant strategy for early accurate detection and prognostication on pancreatic cancer is vital to the whole course of comprehensive therapy. In this research, we proposed a method which combined Recursive Feature Elimination (RFE) method based on Support Vector Machine (SVM) and Large Margin Distribution Machine (LDM) to identify potential biomarkers for pancreatic cancer. In our experiments, we have strengthened the process of RFE to achieve better performance. The dataset GSE15471 we adopted are from GEO database with 39 pairs of pancreatic ductal carcinoma and adjacent control pancreatic tissues. Through experiments, a panel of twelve genes was identified as biomarkers in pancreatic cancer with 91.28% classification accuracy. The universality of the candidate genes was examined on another dataset GSE28735 and the classification accuracy was higher than 80%. In addition, by using the SVM, LDM and BP classifiers, we compared the ordered feature sets generated by our proposed method with T-test, SVM-RFE and LDM-RFE, and the results indicated our proposed method obtained higher average classification accuracy.","PeriodicalId":285726,"journal":{"name":"2017 4th International Conference on Systems and Informatics (ICSAI)","volume":"123 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 4th International Conference on Systems and Informatics (ICSAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSAI.2017.8248514","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

Abstract

Pancreatic cancer acts as one of the leading causes of cancer-connective deaths. Its five-year overall survival rate being reported is about 7.7% from 2006 to 2012 by the National Cancer Institute. One of the main causes for its poor prognosis is because its non-typical symptoms make early diagnosis very challenging. Therefore, a predominant strategy for early accurate detection and prognostication on pancreatic cancer is vital to the whole course of comprehensive therapy. In this research, we proposed a method which combined Recursive Feature Elimination (RFE) method based on Support Vector Machine (SVM) and Large Margin Distribution Machine (LDM) to identify potential biomarkers for pancreatic cancer. In our experiments, we have strengthened the process of RFE to achieve better performance. The dataset GSE15471 we adopted are from GEO database with 39 pairs of pancreatic ductal carcinoma and adjacent control pancreatic tissues. Through experiments, a panel of twelve genes was identified as biomarkers in pancreatic cancer with 91.28% classification accuracy. The universality of the candidate genes was examined on another dataset GSE28735 and the classification accuracy was higher than 80%. In addition, by using the SVM, LDM and BP classifiers, we compared the ordered feature sets generated by our proposed method with T-test, SVM-RFE and LDM-RFE, and the results indicated our proposed method obtained higher average classification accuracy.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于支持向量机和大边际分布机的递归特征消除胰腺癌生物标志物检测
胰腺癌是癌症相关死亡的主要原因之一。据美国国家癌症研究所(National Cancer Institute)报道,从2006年到2012年,它的五年总体存活率约为7.7%。其预后不良的主要原因之一是其非典型症状使早期诊断非常困难。因此,胰腺癌的早期准确检测和预后对整个综合治疗过程至关重要。本研究提出了一种基于支持向量机(SVM)和大边际分布机(LDM)的递归特征消除(RFE)方法相结合的胰腺癌潜在生物标志物识别方法。在我们的实验中,我们加强了RFE的过程,以获得更好的性能。我们采用的数据集GSE15471来自GEO数据库,包含39对胰腺导管癌和邻近对照胰腺组织。通过实验,确定了一组12个基因作为胰腺癌的生物标志物,分类准确率为91.28%。在另一个数据集GSE28735上检验候选基因的通用性,分类准确率高于80%。此外,通过使用SVM、LDM和BP分类器,将本文方法生成的有序特征集与t检验、SVM- rfe和LDM- rfe进行比较,结果表明本文方法获得了更高的平均分类准确率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Adaptive coverage control with Guaranteed Power Voronoi Diagrams Gray relativity analysis used to track association in passive sonar system Music visualization based on the MIDI specifications for multidimensional polyphonic expression Modeling of a data modification cyber-attack in an IEC 61850 scenario using stochastic colored Petri Nets Four nonlinear multi-input multi-output ADHDP constructions and algorithms based on topology principle
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1