{"title":"Multi-property prediction and high-throughput screening of polyimides: An application case for interpretable machine learning","authors":"","doi":"10.1016/j.polymer.2024.127603","DOIUrl":null,"url":null,"abstract":"<div><p>Polyimide (PI), as a high-performance polymer widely used in aerospace, optoelectronics, microelectronics, etc., the properties it focused on in different areas of application were diverse. However, most of the past machine learning studies in polyimide property prediction were focused only on the prediction of a single property. This study focused on four major categories of properties of polyimide, including thermal (Tg, T<sub>d5</sub>, T<sub>d10</sub>, and CTE), mechanical (Ts and TM), dielectric (ε), and optical (λ<sub>cutoff</sub>, T400, n<sub>av</sub>, and Δn), totaling eleven properties. PI data synthesized from previous studies were collected. MorGan fingerprints, improved MorGan fingerprints, RDkit and Mordred descriptors were selected as feature representations. Four ML models such as DNN, RF, XGBoost and BT were also built. Collectively, 176 machine learning models were trained for 11 predictions of properties. The performance and generalization ability of the models were confirmed by experimental validation, external validation and leave-one-out cross-validation. SHAP analysis was used to explain the optimal model for each property prediction from a physicochemical point of view and structural aspects, and three PIs with different structures were designed accordingly. Finally, a high-throughput virtual screening of nearly 7.6 million polyimides was performed based on the trained model. SA_scores was used to evaluate the ease of synthesis of each PI, and finally high-performance PIs with potential and easy to synthesize were selected for each field. This study could be expected to provide a guideline and a design framework for the application of PIs in various fields in the future.</p></div>","PeriodicalId":405,"journal":{"name":"Polymer","volume":null,"pages":null},"PeriodicalIF":4.1000,"publicationDate":"2024-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Polymer","FirstCategoryId":"92","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S003238612400939X","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"POLYMER SCIENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Polyimide (PI), as a high-performance polymer widely used in aerospace, optoelectronics, microelectronics, etc., the properties it focused on in different areas of application were diverse. However, most of the past machine learning studies in polyimide property prediction were focused only on the prediction of a single property. This study focused on four major categories of properties of polyimide, including thermal (Tg, Td5, Td10, and CTE), mechanical (Ts and TM), dielectric (ε), and optical (λcutoff, T400, nav, and Δn), totaling eleven properties. PI data synthesized from previous studies were collected. MorGan fingerprints, improved MorGan fingerprints, RDkit and Mordred descriptors were selected as feature representations. Four ML models such as DNN, RF, XGBoost and BT were also built. Collectively, 176 machine learning models were trained for 11 predictions of properties. The performance and generalization ability of the models were confirmed by experimental validation, external validation and leave-one-out cross-validation. SHAP analysis was used to explain the optimal model for each property prediction from a physicochemical point of view and structural aspects, and three PIs with different structures were designed accordingly. Finally, a high-throughput virtual screening of nearly 7.6 million polyimides was performed based on the trained model. SA_scores was used to evaluate the ease of synthesis of each PI, and finally high-performance PIs with potential and easy to synthesize were selected for each field. This study could be expected to provide a guideline and a design framework for the application of PIs in various fields in the future.
期刊介绍:
Polymer is an interdisciplinary journal dedicated to publishing innovative and significant advances in Polymer Physics, Chemistry and Technology. We welcome submissions on polymer hybrids, nanocomposites, characterisation and self-assembly. Polymer also publishes work on the technological application of polymers in energy and optoelectronics.
The main scope is covered but not limited to the following core areas:
Polymer Materials
Nanocomposites and hybrid nanomaterials
Polymer blends, films, fibres, networks and porous materials
Physical Characterization
Characterisation, modelling and simulation* of molecular and materials properties in bulk, solution, and thin films
Polymer Engineering
Advanced multiscale processing methods
Polymer Synthesis, Modification and Self-assembly
Including designer polymer architectures, mechanisms and kinetics, and supramolecular polymerization
Technological Applications
Polymers for energy generation and storage
Polymer membranes for separation technology
Polymers for opto- and microelectronics.