Ready-to-use Models Built Using a Diverse Set of 266 Aroma Compounds for the Estimation of Gas Chromatographic Retention Indices for the 50%-Cyanopropylphenyl-50%-Dimethylpolysiloxane Stationary Phase

IF 2.8 3区 工程技术 Q2 CHEMISTRY, ANALYTICAL Journal of separation science Pub Date : 2024-11-04 DOI:10.1002/jssc.70016
Anastasia Yu. Sholokhova, Dmitriy D. Matyushin
{"title":"Ready-to-use Models Built Using a Diverse Set of 266 Aroma Compounds for the Estimation of Gas Chromatographic Retention Indices for the 50%-Cyanopropylphenyl-50%-Dimethylpolysiloxane Stationary Phase","authors":"Anastasia Yu. Sholokhova,&nbsp;Dmitriy D. Matyushin","doi":"10.1002/jssc.70016","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Retention index prediction based on the molecule structure is not often used in practice due to low accuracy, the need to use paid software to calculate molecular descriptors (MD), and the narrow applicability domain of many models. In recent years, relatively accurate and versatile deep learning (DL)-based models have emerged. These models are now used in practice as an additional criterion in gas chromatography-mass spectrometry identification. The DB-225ms stationary phase (usually described as 50%-cyanopropylphenyl-50%-dimethylpolysiloxane in available sources) is widely used, but ready-to-use retention index estimation models are not available for it. This study presents such models. The models are linear and use simple constitutional MD and retention indices predicted by DL for the DB-WAX and DB-624 stationary phases as MD (we show that it is their use that allows us to achieve satisfactory accuracy). The accuracy obtained for a completely unseen hold-out test set: root mean square error 73.2; mean absolute error 45.7; median absolute error 22.0. The models were trained using a retention data set of 266 volatile compounds. All calculations can be performed using the convenient open-source software CHERESHNYA. The final equations are implemented as a spreadsheet and a code snippet and are available online: https://doi.org/10.6084/m9.figshare.26800789.</p>\n </div>","PeriodicalId":17098,"journal":{"name":"Journal of separation science","volume":"47 21","pages":""},"PeriodicalIF":2.8000,"publicationDate":"2024-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of separation science","FirstCategoryId":"5","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/jssc.70016","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
引用次数: 0

Abstract

Retention index prediction based on the molecule structure is not often used in practice due to low accuracy, the need to use paid software to calculate molecular descriptors (MD), and the narrow applicability domain of many models. In recent years, relatively accurate and versatile deep learning (DL)-based models have emerged. These models are now used in practice as an additional criterion in gas chromatography-mass spectrometry identification. The DB-225ms stationary phase (usually described as 50%-cyanopropylphenyl-50%-dimethylpolysiloxane in available sources) is widely used, but ready-to-use retention index estimation models are not available for it. This study presents such models. The models are linear and use simple constitutional MD and retention indices predicted by DL for the DB-WAX and DB-624 stationary phases as MD (we show that it is their use that allows us to achieve satisfactory accuracy). The accuracy obtained for a completely unseen hold-out test set: root mean square error 73.2; mean absolute error 45.7; median absolute error 22.0. The models were trained using a retention data set of 266 volatile compounds. All calculations can be performed using the convenient open-source software CHERESHNYA. The final equations are implemented as a spreadsheet and a code snippet and are available online: https://doi.org/10.6084/m9.figshare.26800789.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用多种多样的 266 种芳香化合物建立的即用模型,用于估算 50%-Cyanopropylphenyl-50%-Dimethylpolysiloxane 固定相的气相色谱保留指数。
基于分子结构的保留指数预测由于准确度低、需要使用付费软件计算分子描述符(MD)以及许多模型的适用范围较窄等原因,在实践中并不常用。近年来,出现了基于深度学习(DL)的相对准确和通用的模型。目前,这些模型已被实际用作气相色谱-质谱鉴定的附加标准。DB-225ms 固定相(在现有资料中通常描述为 50%-氰丙基苯基-50%-二甲基聚硅氧烷)被广泛使用,但却没有现成可用的保留指数估算模型。本研究提出了这样的模型。这些模型是线性的,使用简单的宪法 MD 和 DL 预测的 DB-WAX 和 DB-624 固定相的保留指数作为 MD(我们证明,正是使用了这些模型,我们才能获得令人满意的准确度)。在完全未见的保留测试集上获得的准确度为:均方根误差 73.2;平均绝对误差 45.7;中位数绝对误差 22.0。这些模型是使用 266 种挥发性化合物的保留数据集进行训练的。所有计算均可使用方便的开源软件 CHERESHNYA 进行。最终方程以电子表格和代码片段的形式实现,可在线查阅:https://doi.org/10.6084/m9.figshare.26800789。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Journal of separation science
Journal of separation science 化学-分析化学
CiteScore
6.30
自引率
16.10%
发文量
408
审稿时长
1.8 months
期刊介绍: The Journal of Separation Science (JSS) is the most comprehensive source in separation science, since it covers all areas of chromatographic and electrophoretic separation methods in theory and practice, both in the analytical and in the preparative mode, solid phase extraction, sample preparation, and related techniques. Manuscripts on methodological or instrumental developments, including detection aspects, in particular mass spectrometry, as well as on innovative applications will also be published. Manuscripts on hyphenation, automation, and miniaturization are particularly welcome. Pre- and post-separation facets of a total analysis may be covered as well as the underlying logic of the development or application of a method.
期刊最新文献
Primaquine Analysis in Pharmaceutical Formulation Using Multiple and Short-End Injections by Capillary Zone Electrophoresis-Ultraviolet Detection Issue Information Quality Assessment of Corydalis saxicola Bunting Using Quantitative Analysis of Multi-Components by Single Marker and Fingerprint Analysis Chemical Screening, Identification, and Comparison of Tripterygium Hypoglaucum Hutch Preparations by Ultra-High-Performance Liquid Chromatography-Quadrupole Time-of-Flight Mass Spectrometry Combined With Multivariate Statistical Analysis Extraction of Penicillin G From Dairy Products Using Deep Eutectic Solvent/Alginate Beads Enhanced via Layered Double Hydroxides
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1