MLR Data-Driven for the Prediction of Infinite Dilution Activity Coefficient of Water in Ionic Liquids (ILs) Using QSPR-Based COSMO Descriptors.

IF 5.3 2区 化学 Q1 CHEMISTRY, MEDICINAL Journal of Chemical Information and Modeling Pub Date : 2025-03-10 Epub Date: 2025-02-21 DOI:10.1021/acs.jcim.4c02095
Ali Ebrahimpoor Gorji, Juho-Pekka Laakso, Ville Alopaeus, Petri Uusi-Kyyny
{"title":"MLR Data-Driven for the Prediction of Infinite Dilution Activity Coefficient of Water in Ionic Liquids (ILs) Using QSPR-Based COSMO Descriptors.","authors":"Ali Ebrahimpoor Gorji, Juho-Pekka Laakso, Ville Alopaeus, Petri Uusi-Kyyny","doi":"10.1021/acs.jcim.4c02095","DOIUrl":null,"url":null,"abstract":"<p><p>To predict the partial molar excess enthalpy, entropy at infinite dilution, and phase equilibria, the availability of an infinite dilution activity coefficient is vital. The \"quantitative structure-activity/property relationship\" (QSAR/QSPR) approach has been used for the prediction of infinite dilution activity coefficient of water in ionic liquids using an extensive data set. The data set comprised 380 data points including 68 unique ILs at a wide range of temperatures, which is more extensive than previously published data sets. Moreover, new predictive QSAR/QSPR models including novel molecular descriptors, called \"COSMO-RS descriptors\", have been developed. Using two different techniques of external validation, the data set was divided to the training set for the development of models and to the validation set for external validation. Unlike former available models, internal validation using leave one/multi out-cross validations (LOO-CV/LMO-CV) and Y-scrambling methods were performed on the models using statistical parameters for further assessment. According to the obtained results of statistical parameters (<i>R</i><sup>2</sup> = 0.99 and <i>Q</i><sup>2</sup><sub>LOO-CV</sub> = 0.99), the predictive capability of the developed QSPR model was excellent for training set. Regarding the external validation, other statistical parameters such as AAD = 0.283 and AARD % = 30 were also satisfactory for the validation set. While the values of γ<sub>H<sub>2</sub></sub><sub>O</sub><sup>∞</sup> increase or decrease with increasing temperature, the QSAR/QSPR models based on the van't Hoff equation takes into account the negative and positive effects of temperature on the γ<sub>H<sub>2</sub></sub><sub>O</sub><sup>∞</sup> in ILs well, depending on the nature of ILs. It was also shown that γ<sub>H<sub>2</sub></sub><sub>O</sub><sup>∞</sup> in some new ILs which had not been experimentally studied before can be predicted using the QSPR model.</p>","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":" ","pages":"2530-2542"},"PeriodicalIF":5.3000,"publicationDate":"2025-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11898046/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Chemical Information and Modeling ","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1021/acs.jcim.4c02095","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/21 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"CHEMISTRY, MEDICINAL","Score":null,"Total":0}
引用次数: 0

Abstract

To predict the partial molar excess enthalpy, entropy at infinite dilution, and phase equilibria, the availability of an infinite dilution activity coefficient is vital. The "quantitative structure-activity/property relationship" (QSAR/QSPR) approach has been used for the prediction of infinite dilution activity coefficient of water in ionic liquids using an extensive data set. The data set comprised 380 data points including 68 unique ILs at a wide range of temperatures, which is more extensive than previously published data sets. Moreover, new predictive QSAR/QSPR models including novel molecular descriptors, called "COSMO-RS descriptors", have been developed. Using two different techniques of external validation, the data set was divided to the training set for the development of models and to the validation set for external validation. Unlike former available models, internal validation using leave one/multi out-cross validations (LOO-CV/LMO-CV) and Y-scrambling methods were performed on the models using statistical parameters for further assessment. According to the obtained results of statistical parameters (R2 = 0.99 and Q2LOO-CV = 0.99), the predictive capability of the developed QSPR model was excellent for training set. Regarding the external validation, other statistical parameters such as AAD = 0.283 and AARD % = 30 were also satisfactory for the validation set. While the values of γH2O increase or decrease with increasing temperature, the QSAR/QSPR models based on the van't Hoff equation takes into account the negative and positive effects of temperature on the γH2O in ILs well, depending on the nature of ILs. It was also shown that γH2O in some new ILs which had not been experimentally studied before can be predicted using the QSPR model.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用基于 QSPR 的 COSMO 描述子预测离子液体 (ILs) 中水的无限稀释活性系数的 MLR 数据驱动。
为了预测部分摩尔过量焓、无限稀释熵和相平衡,无限稀释活度系数的可用性是至关重要的。“定量构效关系”(QSAR/QSPR)方法利用大量数据集对离子液体中水的无限稀释活度系数进行了预测。该数据集由380个数据点组成,其中包括68个独特的温度范围,比以前发布的数据集更广泛。此外,还开发了新的预测QSAR/QSPR模型,包括新的分子描述子,称为cosmos - rs描述子。使用两种不同的外部验证技术,将数据集划分为用于模型开发的训练集和用于外部验证的验证集。与以往的可用模型不同,采用左一/多外交叉验证(LOO-CV/LMO-CV)和y -置乱方法对模型进行内部验证,并使用统计参数进行进一步评估。根据得到的统计参数结果(R2 = 0.99, q2o - cv = 0.99),所建立的QSPR模型对训练集的预测能力较好。外部验证方面,AAD = 0.283、AARD % = 30等其他统计参数也满足验证集。虽然γH2O∞随温度升高而增大或减小,但基于范霍夫方程的QSAR/QSPR模型根据il的性质,很好地考虑了温度对il中γH2O∞的负影响和正影响。结果还表明,在一些未被实验研究的新型离子中,γH2O∞可以用QSPR模型进行预测。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
9.80
自引率
10.70%
发文量
529
审稿时长
1.4 months
期刊介绍: The Journal of Chemical Information and Modeling publishes papers reporting new methodology and/or important applications in the fields of chemical informatics and molecular modeling. Specific topics include the representation and computer-based searching of chemical databases, molecular modeling, computer-aided molecular design of new materials, catalysts, or ligands, development of new computational methods or efficient algorithms for chemical software, and biopharmaceutical chemistry including analyses of biological activity and other issues related to drug discovery. Astute chemists, computer scientists, and information specialists look to this monthly’s insightful research studies, programming innovations, and software reviews to keep current with advances in this integral, multidisciplinary field. As a subscriber you’ll stay abreast of database search systems, use of graph theory in chemical problems, substructure search systems, pattern recognition and clustering, analysis of chemical and physical data, molecular modeling, graphics and natural language interfaces, bibliometric and citation analysis, and synthesis design and reactions databases.
期刊最新文献
A Relative Binding Free Energy Framework for Structurally Dissimilar Molecules. G.AI.A: An Integrated Machine-Learning Platform for Predicting Bioaccumulation and Ecotoxicity of Pharmaceuticals. DynoPore─A Package to Analyze Molecular Dynamics Trajectories of Confined Liquids. Prediction of Atropisomerism for Drug-like Molecules. Unified Graph-Based Interatomic Potential for Perovskite Structure Optimization.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1