V. Villazón-León , R.R. Suárez , A. Bonilla-Petriciolet , J.C. Tapia-Picazo
{"title":"A group contribution-based machine learning model to estimate the triple-point temperature","authors":"V. Villazón-León , R.R. Suárez , A. Bonilla-Petriciolet , J.C. Tapia-Picazo","doi":"10.1016/j.fluid.2025.114395","DOIUrl":null,"url":null,"abstract":"<div><div>This manuscript reports a new thermodynamic model for calculating triple-point temperature using a machine-learning algorithm and group contribution theory. The model was developed using the Auto-Machine Learning approach available in the auto-sklearn library in Python to identify the best algorithm for estimating this relevant property of pure compounds. Different input variables and ensembles of machine learning algorithms were assessed. The limitations and gaps in the proposed model are highlighted for different chemical families and functional groups. The results demonstrate that the Gradient Boosting algorithm achieved the best performance in estimating the triple-point temperature. The average absolute relative deviation <span><math><mrow><mo>(</mo><mtext>AARD</mtext><mo>)</mo></mrow></math></span> of this model ranged from 0.85 to 5.73 % for the main chemical families included in the data analysis. The proposed model is reliable for calculating the triple-point temperatures of alkanes, alkynes, alcohols, cycloalkenes, polyols, nitriles, and anhydrides. However, the estimation of the triple-point temperature was challenging for polar compounds containing halogens and NO<sub>2</sub>, which showed a non-ideal thermodynamic behavior. This study represents an initial step towards the development of an improved thermodynamic framework based on machine learning algorithms and group contribution theory for the accurate estimation of triple-point temperature.</div></div>","PeriodicalId":12170,"journal":{"name":"Fluid Phase Equilibria","volume":"595 ","pages":"Article 114395"},"PeriodicalIF":2.8000,"publicationDate":"2025-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Fluid Phase Equilibria","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0378381225000652","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}
引用次数: 0
Abstract
This manuscript reports a new thermodynamic model for calculating triple-point temperature using a machine-learning algorithm and group contribution theory. The model was developed using the Auto-Machine Learning approach available in the auto-sklearn library in Python to identify the best algorithm for estimating this relevant property of pure compounds. Different input variables and ensembles of machine learning algorithms were assessed. The limitations and gaps in the proposed model are highlighted for different chemical families and functional groups. The results demonstrate that the Gradient Boosting algorithm achieved the best performance in estimating the triple-point temperature. The average absolute relative deviation of this model ranged from 0.85 to 5.73 % for the main chemical families included in the data analysis. The proposed model is reliable for calculating the triple-point temperatures of alkanes, alkynes, alcohols, cycloalkenes, polyols, nitriles, and anhydrides. However, the estimation of the triple-point temperature was challenging for polar compounds containing halogens and NO2, which showed a non-ideal thermodynamic behavior. This study represents an initial step towards the development of an improved thermodynamic framework based on machine learning algorithms and group contribution theory for the accurate estimation of triple-point temperature.
期刊介绍:
Fluid Phase Equilibria publishes high-quality papers dealing with experimental, theoretical, and applied research related to equilibrium and transport properties of fluids, solids, and interfaces. Subjects of interest include physical/phase and chemical equilibria; equilibrium and nonequilibrium thermophysical properties; fundamental thermodynamic relations; and stability. The systems central to the journal include pure substances and mixtures of organic and inorganic materials, including polymers, biochemicals, and surfactants with sufficient characterization of composition and purity for the results to be reproduced. Alloys are of interest only when thermodynamic studies are included, purely material studies will not be considered. In all cases, authors are expected to provide physical or chemical interpretations of the results.
Experimental research can include measurements under all conditions of temperature, pressure, and composition, including critical and supercritical. Measurements are to be associated with systems and conditions of fundamental or applied interest, and may not be only a collection of routine data, such as physical property or solubility measurements at limited pressures and temperatures close to ambient, or surfactant studies focussed strictly on micellisation or micelle structure. Papers reporting common data must be accompanied by new physical insights and/or contemporary or new theory or techniques.