{"title":"Prediction of acetylene solubility by a mechanism-data hybrid-driven machine learning model constructed based on COSMO-RS theory","authors":"Yao Mu , Tianying Dai , Jiahe Fan, Yi Cheng","doi":"10.1016/j.molliq.2024.126194","DOIUrl":null,"url":null,"abstract":"<div><div>In this work, a mechanism-data hybrid-driven machine learning model is built to predict acetylene solubility from COSMO-derived molecular descriptors, overcoming the challenge of limited training samples. We have successfully enhanced the data-learning capabilities of the model by mechanism-level modelling, and generated new mechanism-level understanding from the results of data learning. On the basis of the mechanism-level understanding of molecular interactions, this model gives more accurate predictions than the baseline models, even though it contains only 37 model parameters. Meanwhile, it presents better generalization to predict solubility in solvents beyond the training sample. Furthermore, by analyzing the implication of the parameters of this model, we find a charged segment interaction energy form different from that described by the classic COSMO-RS theory. This work not only produces available acetylene solubility data, but also presents a proposed framework to build a mechanism-data hybrid-driven model for thermodynamic properties and discovering interpretable chemical laws from small training samples.</div></div>","PeriodicalId":371,"journal":{"name":"Journal of Molecular Liquids","volume":"414 ","pages":"Article 126194"},"PeriodicalIF":5.3000,"publicationDate":"2024-10-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Molecular Liquids","FirstCategoryId":"92","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167732224022530","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}
引用次数: 0
Abstract
In this work, a mechanism-data hybrid-driven machine learning model is built to predict acetylene solubility from COSMO-derived molecular descriptors, overcoming the challenge of limited training samples. We have successfully enhanced the data-learning capabilities of the model by mechanism-level modelling, and generated new mechanism-level understanding from the results of data learning. On the basis of the mechanism-level understanding of molecular interactions, this model gives more accurate predictions than the baseline models, even though it contains only 37 model parameters. Meanwhile, it presents better generalization to predict solubility in solvents beyond the training sample. Furthermore, by analyzing the implication of the parameters of this model, we find a charged segment interaction energy form different from that described by the classic COSMO-RS theory. This work not only produces available acetylene solubility data, but also presents a proposed framework to build a mechanism-data hybrid-driven model for thermodynamic properties and discovering interpretable chemical laws from small training samples.
期刊介绍:
The journal includes papers in the following areas:
– Simple organic liquids and mixtures
– Ionic liquids
– Surfactant solutions (including micelles and vesicles) and liquid interfaces
– Colloidal solutions and nanoparticles
– Thermotropic and lyotropic liquid crystals
– Ferrofluids
– Water, aqueous solutions and other hydrogen-bonded liquids
– Lubricants, polymer solutions and melts
– Molten metals and salts
– Phase transitions and critical phenomena in liquids and confined fluids
– Self assembly in complex liquids.– Biomolecules in solution
The emphasis is on the molecular (or microscopic) understanding of particular liquids or liquid systems, especially concerning structure, dynamics and intermolecular forces. The experimental techniques used may include:
– Conventional spectroscopy (mid-IR and far-IR, Raman, NMR, etc.)
– Non-linear optics and time resolved spectroscopy (psec, fsec, asec, ISRS, etc.)
– Light scattering (Rayleigh, Brillouin, PCS, etc.)
– Dielectric relaxation
– X-ray and neutron scattering and diffraction.
Experimental studies, computer simulations (MD or MC) and analytical theory will be considered for publication; papers just reporting experimental results that do not contribute to the understanding of the fundamentals of molecular and ionic liquids will not be accepted. Only papers of a non-routine nature and advancing the field will be considered for publication.