{"title":"聚酰亚胺的多性能预测和高通量筛选:可解释机器学习的应用案例","authors":"","doi":"10.1016/j.polymer.2024.127603","DOIUrl":null,"url":null,"abstract":"<div><p>Polyimide (PI), as a high-performance polymer widely used in aerospace, optoelectronics, microelectronics, etc., the properties it focused on in different areas of application were diverse. However, most of the past machine learning studies in polyimide property prediction were focused only on the prediction of a single property. This study focused on four major categories of properties of polyimide, including thermal (Tg, T<sub>d5</sub>, T<sub>d10</sub>, and CTE), mechanical (Ts and TM), dielectric (ε), and optical (λ<sub>cutoff</sub>, T400, n<sub>av</sub>, and Δn), totaling eleven properties. PI data synthesized from previous studies were collected. MorGan fingerprints, improved MorGan fingerprints, RDkit and Mordred descriptors were selected as feature representations. Four ML models such as DNN, RF, XGBoost and BT were also built. Collectively, 176 machine learning models were trained for 11 predictions of properties. The performance and generalization ability of the models were confirmed by experimental validation, external validation and leave-one-out cross-validation. SHAP analysis was used to explain the optimal model for each property prediction from a physicochemical point of view and structural aspects, and three PIs with different structures were designed accordingly. Finally, a high-throughput virtual screening of nearly 7.6 million polyimides was performed based on the trained model. SA_scores was used to evaluate the ease of synthesis of each PI, and finally high-performance PIs with potential and easy to synthesize were selected for each field. This study could be expected to provide a guideline and a design framework for the application of PIs in various fields in the future.</p></div>","PeriodicalId":405,"journal":{"name":"Polymer","volume":null,"pages":null},"PeriodicalIF":4.1000,"publicationDate":"2024-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multi-property prediction and high-throughput screening of polyimides: An application case for interpretable machine learning\",\"authors\":\"\",\"doi\":\"10.1016/j.polymer.2024.127603\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Polyimide (PI), as a high-performance polymer widely used in aerospace, optoelectronics, microelectronics, etc., the properties it focused on in different areas of application were diverse. However, most of the past machine learning studies in polyimide property prediction were focused only on the prediction of a single property. This study focused on four major categories of properties of polyimide, including thermal (Tg, T<sub>d5</sub>, T<sub>d10</sub>, and CTE), mechanical (Ts and TM), dielectric (ε), and optical (λ<sub>cutoff</sub>, T400, n<sub>av</sub>, and Δn), totaling eleven properties. PI data synthesized from previous studies were collected. MorGan fingerprints, improved MorGan fingerprints, RDkit and Mordred descriptors were selected as feature representations. Four ML models such as DNN, RF, XGBoost and BT were also built. Collectively, 176 machine learning models were trained for 11 predictions of properties. The performance and generalization ability of the models were confirmed by experimental validation, external validation and leave-one-out cross-validation. SHAP analysis was used to explain the optimal model for each property prediction from a physicochemical point of view and structural aspects, and three PIs with different structures were designed accordingly. Finally, a high-throughput virtual screening of nearly 7.6 million polyimides was performed based on the trained model. SA_scores was used to evaluate the ease of synthesis of each PI, and finally high-performance PIs with potential and easy to synthesize were selected for each field. This study could be expected to provide a guideline and a design framework for the application of PIs in various fields in the future.</p></div>\",\"PeriodicalId\":405,\"journal\":{\"name\":\"Polymer\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":4.1000,\"publicationDate\":\"2024-09-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Polymer\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S003238612400939X\",\"RegionNum\":2,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"POLYMER SCIENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Polymer","FirstCategoryId":"92","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S003238612400939X","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"POLYMER SCIENCE","Score":null,"Total":0}
引用次数: 0
摘要
聚酰亚胺(PI)作为一种高性能聚合物,广泛应用于航空航天、光电子、微电子等领域,其在不同应用领域所关注的特性也多种多样。然而,以往对聚酰亚胺性能预测的机器学习研究大多只关注单一性能的预测。本研究主要关注聚酰亚胺的四大类性能,包括热性能(Tg、Td5、Td10 和 CTE)、机械性能(Ts 和 TM)、介电性能(ε)和光学性能(λcutoff、T400、nav 和 Δn),共计 11 项性能。从以前的研究中收集了综合的 PI 数据。选择 MorGan 指纹、改进的 MorGan 指纹、RDkit 和 Mordred 描述符作为特征表示。此外,还建立了 DNN、RF、XGBoost 和 BT 等四种 ML 模型。总共为 11 项属性预测训练了 176 个机器学习模型。模型的性能和泛化能力通过实验验证、外部验证和留空交叉验证得到了确认。通过 SHAP 分析,从理化角度和结构方面解释了每种性质预测的最佳模型,并据此设计了三种不同结构的 PI。最后,根据训练好的模型对近 760 万种聚酰亚胺进行了高通量虚拟筛选。利用 SA_scores 对每种聚酰亚胺的合成难易程度进行评估,最终在各个领域选出了具有潜力且易于合成的高性能聚酰亚胺。这项研究有望为今后聚合离子在各个领域的应用提供指导和设计框架。
Multi-property prediction and high-throughput screening of polyimides: An application case for interpretable machine learning
Polyimide (PI), as a high-performance polymer widely used in aerospace, optoelectronics, microelectronics, etc., the properties it focused on in different areas of application were diverse. However, most of the past machine learning studies in polyimide property prediction were focused only on the prediction of a single property. This study focused on four major categories of properties of polyimide, including thermal (Tg, Td5, Td10, and CTE), mechanical (Ts and TM), dielectric (ε), and optical (λcutoff, T400, nav, and Δn), totaling eleven properties. PI data synthesized from previous studies were collected. MorGan fingerprints, improved MorGan fingerprints, RDkit and Mordred descriptors were selected as feature representations. Four ML models such as DNN, RF, XGBoost and BT were also built. Collectively, 176 machine learning models were trained for 11 predictions of properties. The performance and generalization ability of the models were confirmed by experimental validation, external validation and leave-one-out cross-validation. SHAP analysis was used to explain the optimal model for each property prediction from a physicochemical point of view and structural aspects, and three PIs with different structures were designed accordingly. Finally, a high-throughput virtual screening of nearly 7.6 million polyimides was performed based on the trained model. SA_scores was used to evaluate the ease of synthesis of each PI, and finally high-performance PIs with potential and easy to synthesize were selected for each field. This study could be expected to provide a guideline and a design framework for the application of PIs in various fields in the future.
期刊介绍:
Polymer is an interdisciplinary journal dedicated to publishing innovative and significant advances in Polymer Physics, Chemistry and Technology. We welcome submissions on polymer hybrids, nanocomposites, characterisation and self-assembly. Polymer also publishes work on the technological application of polymers in energy and optoelectronics.
The main scope is covered but not limited to the following core areas:
Polymer Materials
Nanocomposites and hybrid nanomaterials
Polymer blends, films, fibres, networks and porous materials
Physical Characterization
Characterisation, modelling and simulation* of molecular and materials properties in bulk, solution, and thin films
Polymer Engineering
Advanced multiscale processing methods
Polymer Synthesis, Modification and Self-assembly
Including designer polymer architectures, mechanisms and kinetics, and supramolecular polymerization
Technological Applications
Polymers for energy generation and storage
Polymer membranes for separation technology
Polymers for opto- and microelectronics.