Haripriyan Uthayakumar, Rahul Krishna K, Raj Jain, Rajnish Kumar and Tarak K. Patra*,
{"title":"QRChEM: A Deep Learning Framework for Materials Property Prediction and Design Using QR Codes","authors":"Haripriyan Uthayakumar, Rahul Krishna K, Raj Jain, Rajnish Kumar and Tarak K. Patra*, ","doi":"10.1021/acsengineeringau.3c00055","DOIUrl":null,"url":null,"abstract":"<p >Machine learning (ML) surrogate models are used for the rapid prediction of materials properties and are promising tools for accelerating new materials design and development. The performance and accuracy of these surrogate models appear to be intricately connected to the molecular representation that is employed. Developing efficient numerical representations of molecules is vital for the success of surrogate models in predicting materials' properties. Here, we propose a new machine-readable molecular representation, namely a molecular quick response (QR) code, for the deep learning of materials structure–property correlations. We built a convolutional deep neural network (CNN) model based on molecular QR codes, which is abbreviated as QRChEM. QRChEM was trained and validated using ∼21 000 data for four representative properties of small molecules, namely specific heat, enthalpy, zero-point vibrational energy, and HOMO–LUMO band gap. We show that QRChEM outperforms the commonly used Morgan fingerprint-based and one-hot encoding (OHE)-based deep learning frameworks. We further performed UMAP (uniform manifold approximation and projection) on the molecular QR codes to demonstrate the differentiability of the molecular topologies, which is vital for high-fidelity surrogate model development.</p>","PeriodicalId":29804,"journal":{"name":"ACS Engineering Au","volume":null,"pages":null},"PeriodicalIF":4.3000,"publicationDate":"2023-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.acs.org/doi/epdf/10.1021/acsengineeringau.3c00055","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Engineering Au","FirstCategoryId":"1085","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acsengineeringau.3c00055","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, CHEMICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Machine learning (ML) surrogate models are used for the rapid prediction of materials properties and are promising tools for accelerating new materials design and development. The performance and accuracy of these surrogate models appear to be intricately connected to the molecular representation that is employed. Developing efficient numerical representations of molecules is vital for the success of surrogate models in predicting materials' properties. Here, we propose a new machine-readable molecular representation, namely a molecular quick response (QR) code, for the deep learning of materials structure–property correlations. We built a convolutional deep neural network (CNN) model based on molecular QR codes, which is abbreviated as QRChEM. QRChEM was trained and validated using ∼21 000 data for four representative properties of small molecules, namely specific heat, enthalpy, zero-point vibrational energy, and HOMO–LUMO band gap. We show that QRChEM outperforms the commonly used Morgan fingerprint-based and one-hot encoding (OHE)-based deep learning frameworks. We further performed UMAP (uniform manifold approximation and projection) on the molecular QR codes to demonstrate the differentiability of the molecular topologies, which is vital for high-fidelity surrogate model development.
期刊介绍:
)ACS Engineering Au is an open access journal that reports significant advances in chemical engineering applied chemistry and energy covering fundamentals processes and products. The journal's broad scope includes experimental theoretical mathematical computational chemical and physical research from academic and industrial settings. Short letters comprehensive articles reviews and perspectives are welcome on topics that include:Fundamental research in such areas as thermodynamics transport phenomena (flow mixing mass & heat transfer) chemical reaction kinetics and engineering catalysis separations interfacial phenomena and materialsProcess design development and intensification (e.g. process technologies for chemicals and materials synthesis and design methods process intensification multiphase reactors scale-up systems analysis process control data correlation schemes modeling machine learning Artificial Intelligence)Product research and development involving chemical and engineering aspects (e.g. catalysts plastics elastomers fibers adhesives coatings paper membranes lubricants ceramics aerosols fluidic devices intensified process equipment)Energy and fuels (e.g. pre-treatment processing and utilization of renewable energy resources; processing and utilization of fuels; properties and structure or molecular composition of both raw fuels and refined products; fuel cells hydrogen batteries; photochemical fuel and energy production; decarbonization; electrification; microwave; cavitation)Measurement techniques computational models and data on thermo-physical thermodynamic and transport properties of materials and phase equilibrium behaviorNew methods models and tools (e.g. real-time data analytics multi-scale models physics informed machine learning models machine learning enhanced physics-based models soft sensors high-performance computing)