植物毒素保留时间预测模型的最优机器学习算法

IF 6.3 1区 农林科学 Q1 FOOD SCIENCE & TECHNOLOGY Food Control Pub Date : 2025-08-01 Epub Date: 2025-02-24 DOI:10.1016/j.foodcont.2025.111251
Masaru Taniguchi , Shoichiro Noguchi , Hidenobu Kawashima , Jun Sugiura , Tomoyuki Tsuchiyama , Tomiaki Minatani , Hitoshi Miyazaki , Kei Zaitsu
{"title":"植物毒素保留时间预测模型的最优机器学习算法","authors":"Masaru Taniguchi ,&nbsp;Shoichiro Noguchi ,&nbsp;Hidenobu Kawashima ,&nbsp;Jun Sugiura ,&nbsp;Tomoyuki Tsuchiyama ,&nbsp;Tomiaki Minatani ,&nbsp;Hitoshi Miyazaki ,&nbsp;Kei Zaitsu","doi":"10.1016/j.foodcont.2025.111251","DOIUrl":null,"url":null,"abstract":"<div><div>In suspect screening or nontargeted analysis via LC high-resolution MS (LC-HRMS), high-accuracy identification typically relies on retention times (RTs) and MS–MS spectra. However, RTs are difficult to obtain due to the scarcity of reference standards. Here, we developed a Quantitative Structure Retention Relationships (QSRR) -based RT prediction model using machine learning, specifically for plant toxins implicated in accidental food poisoning. A dataset for QSRR model development was generated using the molecular descriptors (MDs) and experimental RTs of 524 compounds. QSRR models were constructed as regression models derived from the relationship between experimental RTs and MDs using 10 machine learning algorithms. The QSRR model with support vector regression (SVR) outperformed the other QSRR models in generalization on the analyzed dataset (<em>R</em><sup>2</sup>: 0.972, mean absolute error: 183 [approximately 1.6 min], mean absolute percentage error [MAPE]: 6%; <em>Q</em><sup>2</sup>: 0.875, MAE: 584 [approximately 2.0 min], MAPE: 15%). Furthermore, the SVR QSRR model successfully predicted the RTs of nine plant toxins with errors of ±0.5 min. Thus, this model enhances the confidence level of plant toxin identification via LC-HRMS.</div></div>","PeriodicalId":319,"journal":{"name":"Food Control","volume":"174 ","pages":"Article 111251"},"PeriodicalIF":6.3000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Optimal machine learning algorithm for prediction model for retention times of plant toxins\",\"authors\":\"Masaru Taniguchi ,&nbsp;Shoichiro Noguchi ,&nbsp;Hidenobu Kawashima ,&nbsp;Jun Sugiura ,&nbsp;Tomoyuki Tsuchiyama ,&nbsp;Tomiaki Minatani ,&nbsp;Hitoshi Miyazaki ,&nbsp;Kei Zaitsu\",\"doi\":\"10.1016/j.foodcont.2025.111251\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In suspect screening or nontargeted analysis via LC high-resolution MS (LC-HRMS), high-accuracy identification typically relies on retention times (RTs) and MS–MS spectra. However, RTs are difficult to obtain due to the scarcity of reference standards. Here, we developed a Quantitative Structure Retention Relationships (QSRR) -based RT prediction model using machine learning, specifically for plant toxins implicated in accidental food poisoning. A dataset for QSRR model development was generated using the molecular descriptors (MDs) and experimental RTs of 524 compounds. QSRR models were constructed as regression models derived from the relationship between experimental RTs and MDs using 10 machine learning algorithms. The QSRR model with support vector regression (SVR) outperformed the other QSRR models in generalization on the analyzed dataset (<em>R</em><sup>2</sup>: 0.972, mean absolute error: 183 [approximately 1.6 min], mean absolute percentage error [MAPE]: 6%; <em>Q</em><sup>2</sup>: 0.875, MAE: 584 [approximately 2.0 min], MAPE: 15%). Furthermore, the SVR QSRR model successfully predicted the RTs of nine plant toxins with errors of ±0.5 min. Thus, this model enhances the confidence level of plant toxin identification via LC-HRMS.</div></div>\",\"PeriodicalId\":319,\"journal\":{\"name\":\"Food Control\",\"volume\":\"174 \",\"pages\":\"Article 111251\"},\"PeriodicalIF\":6.3000,\"publicationDate\":\"2025-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Food Control\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0956713525001203\",\"RegionNum\":1,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/2/24 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"FOOD SCIENCE & TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Food Control","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0956713525001203","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/24 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"FOOD SCIENCE & TECHNOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

在通过LC- hrms进行可疑分子筛选或非靶向分析时,高精度鉴定通常依赖于保留时间(rt)和MS - MS光谱。然而,由于参考标准的缺乏,rt很难获得。在这里,我们使用机器学习开发了一个基于定量结构保留关系(QSRR)的RT预测模型,特别是与意外食物中毒有关的植物毒素。利用524个化合物的分子描述符(MDs)和实验RTs生成了QSRR模型开发的数据集。QSRR模型是利用10种机器学习算法根据实验RTs和MDs之间的关系构建的回归模型。基于支持向量回归(SVR)的QSRR模型在分析数据集上的泛化效果优于其他QSRR模型(R2: 0.972,平均绝对误差:183[约1.6 min],平均绝对百分比误差[MAPE]: 6%;Q2: 0.875, MAE: 584[约2.0分钟],MAPE: 15%)。此外,SVR QSRR模型成功预测了9种植物毒素的rt值,误差为±0.5 min,从而提高了LC-HRMS鉴定植物毒素的置信水平。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Optimal machine learning algorithm for prediction model for retention times of plant toxins
In suspect screening or nontargeted analysis via LC high-resolution MS (LC-HRMS), high-accuracy identification typically relies on retention times (RTs) and MS–MS spectra. However, RTs are difficult to obtain due to the scarcity of reference standards. Here, we developed a Quantitative Structure Retention Relationships (QSRR) -based RT prediction model using machine learning, specifically for plant toxins implicated in accidental food poisoning. A dataset for QSRR model development was generated using the molecular descriptors (MDs) and experimental RTs of 524 compounds. QSRR models were constructed as regression models derived from the relationship between experimental RTs and MDs using 10 machine learning algorithms. The QSRR model with support vector regression (SVR) outperformed the other QSRR models in generalization on the analyzed dataset (R2: 0.972, mean absolute error: 183 [approximately 1.6 min], mean absolute percentage error [MAPE]: 6%; Q2: 0.875, MAE: 584 [approximately 2.0 min], MAPE: 15%). Furthermore, the SVR QSRR model successfully predicted the RTs of nine plant toxins with errors of ±0.5 min. Thus, this model enhances the confidence level of plant toxin identification via LC-HRMS.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Food Control
Food Control 工程技术-食品科技
CiteScore
12.20
自引率
6.70%
发文量
758
审稿时长
33 days
期刊介绍: Food Control is an international journal that provides essential information for those involved in food safety and process control. Food Control covers the below areas that relate to food process control or to food safety of human foods: • Microbial food safety and antimicrobial systems • Mycotoxins • Hazard analysis, HACCP and food safety objectives • Risk assessment, including microbial and chemical hazards • Quality assurance • Good manufacturing practices • Food process systems design and control • Food Packaging technology and materials in contact with foods • Rapid methods of analysis and detection, including sensor technology • Codes of practice, legislation and international harmonization • Consumer issues • Education, training and research needs. The scope of Food Control is comprehensive and includes original research papers, authoritative reviews, short communications, comment articles that report on new developments in food control, and position papers.
期刊最新文献
Isomer-resolved lipid fingerprints for authenticity and processing quality of edible fats and oils An interpretable multi-task incremental learning method for simultaneous vintage and grade assessment of Liupao tea using terahertz spectroscopy From formulation to application: An eco-friendly carnosic acid nanoemulsion for Colletotrichum viniferum control and postharvest grape preservation Effect of natural biopolymers-based packaging materials on shelf life of dairy products Smartphone-enabled fluorescent probes for food freshness detection: A comprehensive review
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1