In Silico High-Performance Liquid Chromatography Method Development via Machine Learning

IF 6.7 1区 化学 Q1 CHEMISTRY, ANALYTICAL Analytical Chemistry Pub Date : 2025-03-28 DOI:10.1021/acs.analchem.4c03466
Alberto Marchetto, Monica Tirapelle, Luca Mazzei, Eva Sorensen, Maximilian O. Besenhard
{"title":"In Silico High-Performance Liquid Chromatography Method Development via Machine Learning","authors":"Alberto Marchetto, Monica Tirapelle, Luca Mazzei, Eva Sorensen, Maximilian O. Besenhard","doi":"10.1021/acs.analchem.4c03466","DOIUrl":null,"url":null,"abstract":"High-performance liquid chromatography (HPLC) remains the gold standard for analyzing and purifying molecular components in solutions. However, developing HPLC methods is material- and time-consuming, so computer-aided shortcuts are highly desirable. In line with the digitalization of process development and the growth of HPLC databases, we propose a data-driven methodology to predict molecule retention factors as a function of mobile phase composition without the need for any new experiments, solely relying on molecular descriptors (MDs) obtained via simplified molecular input line entry system (SMILES) string representations of molecules. This new approach combines: (a) quantitative structure–property relationships (QSPR) using MDs to predict solute-dependent parameters in (b) linear solvation energy relationships (LSER) and (c) linear solvent strength (LSS) theory. We demonstrate the potential of this computational methodology using experimental data for retention factors of small molecules made available by the research community for which the MDs were obtained via SMILES string representations determined by the structural formulas of the molecules. This method can be adopted directly to predict elution times of molecular components; however, in combination with first-principle-based mechanistic transport models, the method can also be employed to optimize HPLC methods in-silico. Both options can reduce the experimental load and accelerate HPLC method development significantly, lowering the time and cost of the drug manufacturing cycle and reducing the time to market. Given the growing number and quality of HPLC databases, the predictive power of this methodology will only increase in the coming years.","PeriodicalId":27,"journal":{"name":"Analytical Chemistry","volume":"33 1","pages":""},"PeriodicalIF":6.7000,"publicationDate":"2025-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Analytical Chemistry","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1021/acs.analchem.4c03466","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
引用次数: 0

Abstract

High-performance liquid chromatography (HPLC) remains the gold standard for analyzing and purifying molecular components in solutions. However, developing HPLC methods is material- and time-consuming, so computer-aided shortcuts are highly desirable. In line with the digitalization of process development and the growth of HPLC databases, we propose a data-driven methodology to predict molecule retention factors as a function of mobile phase composition without the need for any new experiments, solely relying on molecular descriptors (MDs) obtained via simplified molecular input line entry system (SMILES) string representations of molecules. This new approach combines: (a) quantitative structure–property relationships (QSPR) using MDs to predict solute-dependent parameters in (b) linear solvation energy relationships (LSER) and (c) linear solvent strength (LSS) theory. We demonstrate the potential of this computational methodology using experimental data for retention factors of small molecules made available by the research community for which the MDs were obtained via SMILES string representations determined by the structural formulas of the molecules. This method can be adopted directly to predict elution times of molecular components; however, in combination with first-principle-based mechanistic transport models, the method can also be employed to optimize HPLC methods in-silico. Both options can reduce the experimental load and accelerate HPLC method development significantly, lowering the time and cost of the drug manufacturing cycle and reducing the time to market. Given the growing number and quality of HPLC databases, the predictive power of this methodology will only increase in the coming years.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于机器学习的高效液相色谱方法开发
高效液相色谱法(HPLC)仍然是分析和纯化溶液中分子成分的金标准。然而,开发高效液相色谱方法需要耗费大量的材料和时间,因此计算机辅助的捷径是非常可取的。随着工艺开发的数字化和HPLC数据库的增长,我们提出了一种数据驱动的方法来预测分子保留因子作为流动相组成的函数,而无需任何新的实验,仅依赖于通过简化分子输入线输入系统(SMILES)字符串表示分子获得的分子描述符(MDs)。这种新方法结合了:(a)定量结构-性质关系(QSPR)使用MDs来预测溶质相关参数;(b)线性溶剂化能关系(LSER)和(c)线性溶剂强度(LSS)理论。我们利用研究团体提供的小分子保留因子的实验数据证明了这种计算方法的潜力,其中MDs是通过由分子结构公式确定的SMILES字符串表示获得的。该方法可直接预测分子组分的洗脱次数;然而,结合第一性原理的机制传递模型,该方法也可用于优化HPLC方法。这两种选择都可以显著减少实验负荷,加快HPLC方法的开发,降低药物生产周期的时间和成本,缩短上市时间。鉴于HPLC数据库的数量和质量不断增长,这种方法的预测能力在未来几年只会增加。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Analytical Chemistry
Analytical Chemistry 化学-分析化学
CiteScore
12.10
自引率
12.20%
发文量
1949
审稿时长
1.4 months
期刊介绍: Analytical Chemistry, a peer-reviewed research journal, focuses on disseminating new and original knowledge across all branches of analytical chemistry. Fundamental articles may explore general principles of chemical measurement science and need not directly address existing or potential analytical methodology. They can be entirely theoretical or report experimental results. Contributions may cover various phases of analytical operations, including sampling, bioanalysis, electrochemistry, mass spectrometry, microscale and nanoscale systems, environmental analysis, separations, spectroscopy, chemical reactions and selectivity, instrumentation, imaging, surface analysis, and data processing. Papers discussing known analytical methods should present a significant, original application of the method, a notable improvement, or results on an important analyte.
期刊最新文献
A Fluorescent Palladium–Ligand Platform for In Situ Monitoring and Bayesian Optimization of Sonogashira Coupling Reactions Advanced Analytical Methodologies for Monitoring Lithium in Environmental and Biological Systems for Sustainable Assessment Enhancing the Flow Dynamics and Sensitivity of Paper-Based Lateral Flow Immunoassays Through Zwitterionic Antifouling Modification Stem-Engineered Aptamers with Enhanced Stability and Affinity for Tumor-Targeted Imaging Ionophore-Based Ion-Selective Optodes Using Hydrocarbons as Ultralow-Polarity Media
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1