{"title":"Optimal machine learning algorithm for prediction model for retention times of plant toxins","authors":"Masaru Taniguchi , Shoichiro Noguchi , Hidenobu Kawashima , Jun Sugiura , Tomoyuki Tsuchiyama , Tomiaki Minatani , Hitoshi Miyazaki , Kei Zaitsu","doi":"10.1016/j.foodcont.2025.111251","DOIUrl":null,"url":null,"abstract":"<div><div>In suspect screening or nontargeted analysis via LC high-resolution MS (LC-HRMS), high-accuracy identification typically relies on retention times (RTs) and MS–MS spectra. However, RTs are difficult to obtain due to the scarcity of reference standards. Here, we developed a Quantitative Structure Retention Relationships (QSRR) -based RT prediction model using machine learning, specifically for plant toxins implicated in accidental food poisoning. A dataset for QSRR model development was generated using the molecular descriptors (MDs) and experimental RTs of 524 compounds. QSRR models were constructed as regression models derived from the relationship between experimental RTs and MDs using 10 machine learning algorithms. The QSRR model with support vector regression (SVR) outperformed the other QSRR models in generalization on the analyzed dataset (<em>R</em><sup>2</sup>: 0.972, mean absolute error: 183 [approximately 1.6 min], mean absolute percentage error [MAPE]: 6%; <em>Q</em><sup>2</sup>: 0.875, MAE: 584 [approximately 2.0 min], MAPE: 15%). Furthermore, the SVR QSRR model successfully predicted the RTs of nine plant toxins with errors of ±0.5 min. Thus, this model enhances the confidence level of plant toxin identification via LC-HRMS.</div></div>","PeriodicalId":319,"journal":{"name":"Food Control","volume":"174 ","pages":"Article 111251"},"PeriodicalIF":5.6000,"publicationDate":"2025-02-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Food Control","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0956713525001203","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"FOOD SCIENCE & TECHNOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
In suspect screening or nontargeted analysis via LC high-resolution MS (LC-HRMS), high-accuracy identification typically relies on retention times (RTs) and MS–MS spectra. However, RTs are difficult to obtain due to the scarcity of reference standards. Here, we developed a Quantitative Structure Retention Relationships (QSRR) -based RT prediction model using machine learning, specifically for plant toxins implicated in accidental food poisoning. A dataset for QSRR model development was generated using the molecular descriptors (MDs) and experimental RTs of 524 compounds. QSRR models were constructed as regression models derived from the relationship between experimental RTs and MDs using 10 machine learning algorithms. The QSRR model with support vector regression (SVR) outperformed the other QSRR models in generalization on the analyzed dataset (R2: 0.972, mean absolute error: 183 [approximately 1.6 min], mean absolute percentage error [MAPE]: 6%; Q2: 0.875, MAE: 584 [approximately 2.0 min], MAPE: 15%). Furthermore, the SVR QSRR model successfully predicted the RTs of nine plant toxins with errors of ±0.5 min. Thus, this model enhances the confidence level of plant toxin identification via LC-HRMS.
期刊介绍:
Food Control is an international journal that provides essential information for those involved in food safety and process control.
Food Control covers the below areas that relate to food process control or to food safety of human foods:
• Microbial food safety and antimicrobial systems
• Mycotoxins
• Hazard analysis, HACCP and food safety objectives
• Risk assessment, including microbial and chemical hazards
• Quality assurance
• Good manufacturing practices
• Food process systems design and control
• Food Packaging technology and materials in contact with foods
• Rapid methods of analysis and detection, including sensor technology
• Codes of practice, legislation and international harmonization
• Consumer issues
• Education, training and research needs.
The scope of Food Control is comprehensive and includes original research papers, authoritative reviews, short communications, comment articles that report on new developments in food control, and position papers.