利用机器学习推进 MS/MS 图谱预测工作

IF 3.1 2区 化学 Q2 BIOCHEMICAL RESEARCH METHODS Journal of the American Society for Mass Spectrometry Pub Date : 2024-09-11 DOI:10.1021/jasms.4c0015410.1021/jasms.4c00154
Julia Nguyen, Richard Overstreet, Ethan King and Danielle Ciesielski*, 
{"title":"利用机器学习推进 MS/MS 图谱预测工作","authors":"Julia Nguyen,&nbsp;Richard Overstreet,&nbsp;Ethan King and Danielle Ciesielski*,&nbsp;","doi":"10.1021/jasms.4c0015410.1021/jasms.4c00154","DOIUrl":null,"url":null,"abstract":"<p >Tandem mass spectrometry (MS/MS) is an important tool for the identification of small molecules and metabolites where resultant spectra are most commonly identified by matching them with spectra in MS/MS reference libraries. While popular, this strategy is limited by the contents of existing reference libraries. In response to this limitation, various methods are being developed for the <i>in silico</i> generation of spectra to augment existing libraries. Recently, machine learning and deep learning techniques have been applied to predict spectra with greater speed and accuracy. Here, we investigate the challenges these algorithms face in achieving fast and accurate predictions on a wide range of small molecules. The challenges are often amplified by the use of generic machine learning benchmarking tactics, which lead to misleading accuracy scores. Curating data sets, only predicting spectra for sufficiently high collision energies, and working more closely with experimental mass spectrometrists are recommended strategies to improve overall prediction accuracy in this nuanced field.</p>","PeriodicalId":672,"journal":{"name":"Journal of the American Society for Mass Spectrometry","volume":null,"pages":null},"PeriodicalIF":3.1000,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Advancing the Prediction of MS/MS Spectra Using Machine Learning\",\"authors\":\"Julia Nguyen,&nbsp;Richard Overstreet,&nbsp;Ethan King and Danielle Ciesielski*,&nbsp;\",\"doi\":\"10.1021/jasms.4c0015410.1021/jasms.4c00154\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p >Tandem mass spectrometry (MS/MS) is an important tool for the identification of small molecules and metabolites where resultant spectra are most commonly identified by matching them with spectra in MS/MS reference libraries. While popular, this strategy is limited by the contents of existing reference libraries. In response to this limitation, various methods are being developed for the <i>in silico</i> generation of spectra to augment existing libraries. Recently, machine learning and deep learning techniques have been applied to predict spectra with greater speed and accuracy. Here, we investigate the challenges these algorithms face in achieving fast and accurate predictions on a wide range of small molecules. The challenges are often amplified by the use of generic machine learning benchmarking tactics, which lead to misleading accuracy scores. Curating data sets, only predicting spectra for sufficiently high collision energies, and working more closely with experimental mass spectrometrists are recommended strategies to improve overall prediction accuracy in this nuanced field.</p>\",\"PeriodicalId\":672,\"journal\":{\"name\":\"Journal of the American Society for Mass Spectrometry\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":3.1000,\"publicationDate\":\"2024-09-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of the American Society for Mass Spectrometry\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://pubs.acs.org/doi/10.1021/jasms.4c00154\",\"RegionNum\":2,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"BIOCHEMICAL RESEARCH METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the American Society for Mass Spectrometry","FirstCategoryId":"92","ListUrlMain":"https://pubs.acs.org/doi/10.1021/jasms.4c00154","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0

摘要

串联质谱(MS/MS)是鉴定小分子和代谢物的重要工具,其结果光谱通常通过与 MS/MS 参考文献库中的光谱进行比对来鉴定。这种策略虽然很受欢迎,但却受到现有参考文献库内容的限制。针对这一局限性,人们正在开发各种方法,用于在硅学中生成光谱,以扩充现有的参考文献库。最近,机器学习和深度学习技术已被用于更快速、更准确地预测光谱。在此,我们研究了这些算法在对各种小分子进行快速准确预测时所面临的挑战。使用通用的机器学习基准策略往往会导致误导性的准确度分数,从而加大了挑战。整理数据集、只预测足够高碰撞能量的光谱,以及与实验质谱专家更紧密地合作,是提高这一细微领域总体预测准确性的建议策略。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Advancing the Prediction of MS/MS Spectra Using Machine Learning

Tandem mass spectrometry (MS/MS) is an important tool for the identification of small molecules and metabolites where resultant spectra are most commonly identified by matching them with spectra in MS/MS reference libraries. While popular, this strategy is limited by the contents of existing reference libraries. In response to this limitation, various methods are being developed for the in silico generation of spectra to augment existing libraries. Recently, machine learning and deep learning techniques have been applied to predict spectra with greater speed and accuracy. Here, we investigate the challenges these algorithms face in achieving fast and accurate predictions on a wide range of small molecules. The challenges are often amplified by the use of generic machine learning benchmarking tactics, which lead to misleading accuracy scores. Curating data sets, only predicting spectra for sufficiently high collision energies, and working more closely with experimental mass spectrometrists are recommended strategies to improve overall prediction accuracy in this nuanced field.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
5.50
自引率
9.40%
发文量
257
审稿时长
1 months
期刊介绍: The Journal of the American Society for Mass Spectrometry presents research papers covering all aspects of mass spectrometry, incorporating coverage of fields of scientific inquiry in which mass spectrometry can play a role. Comprehensive in scope, the journal publishes papers on both fundamentals and applications of mass spectrometry. Fundamental subjects include instrumentation principles, design, and demonstration, structures and chemical properties of gas-phase ions, studies of thermodynamic properties, ion spectroscopy, chemical kinetics, mechanisms of ionization, theories of ion fragmentation, cluster ions, and potential energy surfaces. In addition to full papers, the journal offers Communications, Application Notes, and Accounts and Perspectives
期刊最新文献
Infrared Laser Ablation and Capture of Formalin-Fixed Paraffin-Embedded Tissue. Improved Rapid Equilibrium Dialysis-Mass Spectrometry (RED-MS) Method for Measuring Small Molecule-Protein Complex Binding Affinities in Solution. Quantitative Analysis of Drugs in a Mimetic Tissue Model Using Nano-DESI on a Triple Quadrupole Mass Spectrometer. Development of a Novel Label-Free Subunit HILIC-MS Method for Domain-Specific Free Thiol Identification and Quantitation in Therapeutic Monoclonal Antibodies. Single Cell MALDI-MSI Analysis of Lipids and Proteins within a Replicative Senescence Fibroblast Model.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1