Machine learning analysis of a large set of homopolymers to predict glass transition temperatures

IF 5.9 2区 化学 Q1 CHEMISTRY, MULTIDISCIPLINARY Communications Chemistry Pub Date : 2024-10-02 DOI:10.1038/s42004-024-01305-0
Gerardo M. Casanola-Martin, Anas Karuth, Hai Pham-The, Humbert González-Díaz, Dean C. Webster, Bakhtiyor Rasulev
{"title":"Machine learning analysis of a large set of homopolymers to predict glass transition temperatures","authors":"Gerardo M. Casanola-Martin, Anas Karuth, Hai Pham-The, Humbert González-Díaz, Dean C. Webster, Bakhtiyor Rasulev","doi":"10.1038/s42004-024-01305-0","DOIUrl":null,"url":null,"abstract":"Glass transition temperature of polymers, Tg, is an important thermophysical property, which sometimes can be difficult to measure experimentally. In this regard, data-driven machine learning approaches are important alternatives to assess Tg values, in a high-throughput way. In this study, a large dataset of more than 900 polymers with reported glass transition temperature (Tg) was assembled from various public sources in order to develop a predictive model depicting the structure-property relationships. The collected dataset was curated, explored via cluster analysis, and then split into training and test sets for validation purposes and then polymer structures characterized by molecular descriptors. To find the models, several machine learning techniques, including multiple linear regression (MLR), k-nearest neighbor (k-NN), support vector machine (SVM), random forest (RF), gaussian processes for regression (GPR), and multi-layer perceptron (MLP) were explored. As result, a model with the subset of 15 descriptors accurately predicting the glass transition temperatures was developed. The electronic effect indices were determined to be important properties that positively contribute to the Tg values. The SVM-based model showed the best performance with determination coefficients (R2) of 0.813 and 0.770, for training and test sets, respectively. Also, the SVM model showed the lowest estimation error, RMSE = 0.062. In addition, the developed structure-property model was implemented as a web app to be used as an online computational tool to design and evaluate new homopolymers with desired glass transition profiles. Glass transition temperatures (Tg) of polymers are important thermophysical descriptors, but they can be difficult to determine experimentally. Here, the authors develop a data-driven support vector machine structure-property model to assess Tg values in a high-throughput manner, and implement the model into a web app.","PeriodicalId":10529,"journal":{"name":"Communications Chemistry","volume":" ","pages":"1-9"},"PeriodicalIF":5.9000,"publicationDate":"2024-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.nature.com/articles/s42004-024-01305-0.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Communications Chemistry","FirstCategoryId":"92","ListUrlMain":"https://www.nature.com/articles/s42004-024-01305-0","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Glass transition temperature of polymers, Tg, is an important thermophysical property, which sometimes can be difficult to measure experimentally. In this regard, data-driven machine learning approaches are important alternatives to assess Tg values, in a high-throughput way. In this study, a large dataset of more than 900 polymers with reported glass transition temperature (Tg) was assembled from various public sources in order to develop a predictive model depicting the structure-property relationships. The collected dataset was curated, explored via cluster analysis, and then split into training and test sets for validation purposes and then polymer structures characterized by molecular descriptors. To find the models, several machine learning techniques, including multiple linear regression (MLR), k-nearest neighbor (k-NN), support vector machine (SVM), random forest (RF), gaussian processes for regression (GPR), and multi-layer perceptron (MLP) were explored. As result, a model with the subset of 15 descriptors accurately predicting the glass transition temperatures was developed. The electronic effect indices were determined to be important properties that positively contribute to the Tg values. The SVM-based model showed the best performance with determination coefficients (R2) of 0.813 and 0.770, for training and test sets, respectively. Also, the SVM model showed the lowest estimation error, RMSE = 0.062. In addition, the developed structure-property model was implemented as a web app to be used as an online computational tool to design and evaluate new homopolymers with desired glass transition profiles. Glass transition temperatures (Tg) of polymers are important thermophysical descriptors, but they can be difficult to determine experimentally. Here, the authors develop a data-driven support vector machine structure-property model to assess Tg values in a high-throughput manner, and implement the model into a web app.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
对大量均聚物进行机器学习分析,以预测玻璃化转变温度。
聚合物的玻璃化温度(Tg)是一项重要的热物理性质,有时难以通过实验测量。在这方面,数据驱动的机器学习方法是以高通量方式评估 Tg 值的重要替代方法。在这项研究中,我们从各种公共资源中收集了超过 900 种已报告玻璃化转变温度(Tg)的聚合物数据集,以便开发一个描述结构-性能关系的预测模型。对收集到的数据集进行了整理,通过聚类分析进行了探索,然后出于验证目的将其分成训练集和测试集,再通过分子描述符对聚合物结构进行表征。为了找到模型,研究人员采用了多种机器学习技术,包括多元线性回归(MLR)、k-近邻(k-NN)、支持向量机(SVM)、随机森林(RF)、高斯过程回归(GPR)和多层感知器(MLP)。结果,利用 15 个描述子集建立了一个模型,该模型可准确预测玻璃化转变温度。电子效应指数被确定为对 Tg 值有积极影响的重要属性。基于 SVM 的模型表现最佳,训练集和测试集的判定系数 (R2) 分别为 0.813 和 0.770。SVM 模型的估计误差也最小,RMSE = 0.062。此外,所开发的结构-性能模型已作为网络应用程序实施,可作为在线计算工具用于设计和评估具有理想玻璃化转变曲线的新型均聚物。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Communications Chemistry
Communications Chemistry Chemistry-General Chemistry
CiteScore
7.70
自引率
1.70%
发文量
146
审稿时长
13 weeks
期刊介绍: Communications Chemistry is an open access journal from Nature Research publishing high-quality research, reviews and commentary in all areas of the chemical sciences. Research papers published by the journal represent significant advances bringing new chemical insight to a specialized area of research. We also aim to provide a community forum for issues of importance to all chemists, regardless of sub-discipline.
期刊最新文献
Enabling systemic identification and functionality profiling for Cdc42 homeostatic modulators. Shortcut to chemically accurate quantum computing via density-based basis-set correction A biomimetic phosphor that can build a rigid microenvironment for its long-lived afterglow in aqueous medium Leveraging infrared spectroscopy for automated structure elucidation Immiscible metamorphic water and methane fluids preserved in carbonated eclogite
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1