A Support Vector Machine Classification of Thyroid Bioptic Specimens Using MALDI-MSI Data.

Q1 Biochemistry, Genetics and Molecular Biology Advances in Bioinformatics Pub Date : 2016-01-01 Epub Date: 2016-05-17 DOI:10.1155/2016/3791214
Manuel Galli, Italo Zoppis, Gabriele De Sio, Clizia Chinello, Fabio Pagni, Fulvio Magni, Giancarlo Mauri
{"title":"A Support Vector Machine Classification of Thyroid Bioptic Specimens Using MALDI-MSI Data.","authors":"Manuel Galli,&nbsp;Italo Zoppis,&nbsp;Gabriele De Sio,&nbsp;Clizia Chinello,&nbsp;Fabio Pagni,&nbsp;Fulvio Magni,&nbsp;Giancarlo Mauri","doi":"10.1155/2016/3791214","DOIUrl":null,"url":null,"abstract":"<p><p>Biomarkers able to characterise and predict multifactorial diseases are still one of the most important targets for all the \"omics\" investigations. In this context, Matrix-Assisted Laser Desorption/Ionisation-Mass Spectrometry Imaging (MALDI-MSI) has gained considerable attention in recent years, but it also led to a huge amount of complex data to be elaborated and interpreted. For this reason, computational and machine learning procedures for biomarker discovery are important tools to consider, both to reduce data dimension and to provide predictive markers for specific diseases. For instance, the availability of protein and genetic markers to support thyroid lesion diagnoses would impact deeply on society due to the high presence of undetermined reports (THY3) that are generally treated as malignant patients. In this paper we show how an accurate classification of thyroid bioptic specimens can be obtained through the application of a state-of-the-art machine learning approach (i.e., Support Vector Machines) on MALDI-MSI data, together with a particular wrapper feature selection algorithm (i.e., recursive feature elimination). The model is able to provide an accurate discriminatory capability using only 20 out of 144 features, resulting in an increase of the model performances, reliability, and computational efficiency. Finally, tissue areas rather than average proteomic profiles are classified, highlighting potential discriminating areas of clinical interest. </p>","PeriodicalId":39059,"journal":{"name":"Advances in Bioinformatics","volume":"2016 ","pages":"3791214"},"PeriodicalIF":0.0000,"publicationDate":"2016-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1155/2016/3791214","citationCount":"17","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advances in Bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1155/2016/3791214","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2016/5/17 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"Biochemistry, Genetics and Molecular Biology","Score":null,"Total":0}
引用次数: 17

Abstract

Biomarkers able to characterise and predict multifactorial diseases are still one of the most important targets for all the "omics" investigations. In this context, Matrix-Assisted Laser Desorption/Ionisation-Mass Spectrometry Imaging (MALDI-MSI) has gained considerable attention in recent years, but it also led to a huge amount of complex data to be elaborated and interpreted. For this reason, computational and machine learning procedures for biomarker discovery are important tools to consider, both to reduce data dimension and to provide predictive markers for specific diseases. For instance, the availability of protein and genetic markers to support thyroid lesion diagnoses would impact deeply on society due to the high presence of undetermined reports (THY3) that are generally treated as malignant patients. In this paper we show how an accurate classification of thyroid bioptic specimens can be obtained through the application of a state-of-the-art machine learning approach (i.e., Support Vector Machines) on MALDI-MSI data, together with a particular wrapper feature selection algorithm (i.e., recursive feature elimination). The model is able to provide an accurate discriminatory capability using only 20 out of 144 features, resulting in an increase of the model performances, reliability, and computational efficiency. Finally, tissue areas rather than average proteomic profiles are classified, highlighting potential discriminating areas of clinical interest.

Abstract Image

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于MALDI-MSI数据的甲状腺活检标本支持向量机分类。
能够表征和预测多因子疾病的生物标志物仍然是所有“组学”研究中最重要的目标之一。在这种背景下,基质辅助激光解吸/电离-质谱成像(MALDI-MSI)近年来获得了相当大的关注,但它也导致了大量复杂的数据需要阐述和解释。因此,用于生物标志物发现的计算和机器学习程序是需要考虑的重要工具,既可以降低数据维数,又可以为特定疾病提供预测标记。例如,支持甲状腺病变诊断的蛋白质和遗传标记的可用性将对社会产生深远影响,因为不确定报告(THY3)的高存在通常被视为恶性患者。在本文中,我们展示了如何通过在MALDI-MSI数据上应用最先进的机器学习方法(即支持向量机)以及特定的包装特征选择算法(即递归特征消除)来获得甲状腺活检标本的准确分类。该模型仅使用144个特征中的20个特征就能提供准确的区分能力,从而提高了模型的性能、可靠性和计算效率。最后,组织区域,而不是平均蛋白质组谱进行分类,突出潜在的区别领域的临床兴趣。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Advances in Bioinformatics
Advances in Bioinformatics Biochemistry, Genetics and Molecular Biology-Biochemistry, Genetics and Molecular Biology (miscellaneous)
自引率
0.00%
发文量
0
期刊最新文献
Computational Genomics A Guide to RNAseq Data Analysis Using Bioinformatics Approaches Computational Metabolomics Bioinformatics in Personalized Medicine Bioinformatics Tools for Gene and Genome Annotation Analysis of Microbes for Synthetic Biology and Cancer Biology Applications
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1