Pub Date : 2024-02-18DOI: 10.1016/j.aichem.2024.100056
Yan Shen, Zihan Wang, Yihan Wang, Cheng Wang
The electrocatalytic conversion of CO2 (CO2RR) to multi-carbon products has been an appealing strategy to reduce carbon emissions. However, rapid experimental discovery of efficient CO2RR electrocatalysts and fast recording of full product distribution information is non-trivial. Herein, we used an electrocatalyst testing platform featuring a home-built automatic flow cell to accelerate catalysts screening. Based on 364 effective data points from 42 Cu-lanthanide bimetallic catalysts obtained within 21 working hours, we found that Eu modification over Cu can promote C2+ faradaic efficiency (FE). We have previously reported part of the screening data and the optimization of the Mg-Cu catalyst(Angew. Chem.2022, 134, e202213423). Here we augmented the dataset by adding the lanthanide modifiers and reported the Eu-Cu catalyst resulted from the high-throughput investigation. Our characterizations revealed that the Eu2+ reduced from Eu3+ during the catalyst synthesis prevented the agglomeration of nanoparticles, thus making europium modifications stand out from other lanthanide metal modifiers on FE C2+ enhancement. We then optimized the Eu-CuOx catalyst based on the above understanding to achieve ∼80% C2+ FE at a high current density of 1.25 A cm−2.
{"title":"Rapid screening of copper-based bimetallic catalysts via automatic electrocatalysis platform: Electrocatalytic reduction of CO2 to C2+ products on europium-modified copper","authors":"Yan Shen, Zihan Wang, Yihan Wang, Cheng Wang","doi":"10.1016/j.aichem.2024.100056","DOIUrl":"https://doi.org/10.1016/j.aichem.2024.100056","url":null,"abstract":"<div><p>The electrocatalytic conversion of CO<sub>2</sub> (CO<sub>2</sub>RR) to multi-carbon products has been an appealing strategy to reduce carbon emissions. However, rapid experimental discovery of efficient CO<sub>2</sub>RR electrocatalysts and fast recording of full product distribution information is non-trivial. Herein, we used an electrocatalyst testing platform featuring a home-built automatic flow cell to accelerate catalysts screening. Based on 364 effective data points from 42 Cu-lanthanide bimetallic catalysts obtained within 21 working hours, we found that Eu modification over Cu can promote C<sub>2+</sub> faradaic efficiency (FE). We have previously reported part of the screening data and the optimization of the Mg-Cu catalyst(<em>Angew. Chem.</em> <strong>2022</strong>, <em>134</em>, e202213423). Here we augmented the dataset by adding the lanthanide modifiers and reported the Eu-Cu catalyst resulted from the high-throughput investigation. Our characterizations revealed that the Eu<sup>2+</sup> reduced from Eu<sup>3+</sup> during the catalyst synthesis prevented the agglomeration of nanoparticles, thus making europium modifications stand out from other lanthanide metal modifiers on FE C<sub>2+</sub> enhancement. We then optimized the Eu-CuO<sub>x</sub> catalyst based on the above understanding to achieve ∼80% C<sub>2+</sub> FE at a high current density of 1.25 A cm<sup>−2</sup>.</p></div>","PeriodicalId":72302,"journal":{"name":"Artificial intelligence chemistry","volume":"2 1","pages":"Article 100056"},"PeriodicalIF":0.0,"publicationDate":"2024-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2949747724000149/pdfft?md5=d1c6b7f6973c2f825f4024a496be4cd7&pid=1-s2.0-S2949747724000149-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139935586","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-02-17DOI: 10.1016/j.aichem.2024.100055
Mario Villares , Carla M. Saunders , Natalie Fey
We have used a Ligand Knowledge Base for bidentate P,P-donor ligands of potential interest to homogeneous catalysis to compare three dimensionality reduction techniques, namely Principal Component Analysis (PCA), Uniform Manifold Approximation and Projection (UMAP) and t-distributed Stochastic Neighbor Embedding (t-SNE). While our previous work on Ligand Knowledge Bases has focused on PCA, here we compare this approach with more recently-published approaches and assess the information retention, visualization, clustering and interpretability which can be achieved for each approach. We find that potential advantages of t-SNE are not realized with a database of the current size (275 entries), and that there is a degree of complementarity between PCA and UMAP. The statistics underlying PCA rely on linear relationships, making interpretation of the resulting plots comparatively straightforward. Since much of chemistry relies on linear structure-property relationships and low-dimensional visualization, the explainability and information retention achieved is attractive. UMAP proved more challenging to interpret, but achieved clear clustering which was often chemically meaningful, and it would be a useful approach for ensuring that distinct subsets of compounds are sampled in a machine-learning context. This analysis also highlighted that the tunability of catalysis achieved through ligand exchange maps well onto some areas of chemical space where closely related ligands cluster, while others represent outliers; these arise from different combinations of steric and electronic effects which chemists will find intuitive.
{"title":"Comparison of dimensionality reduction techniques for the visualisation of chemical space in organometallic catalysis","authors":"Mario Villares , Carla M. Saunders , Natalie Fey","doi":"10.1016/j.aichem.2024.100055","DOIUrl":"https://doi.org/10.1016/j.aichem.2024.100055","url":null,"abstract":"<div><p>We have used a Ligand Knowledge Base for bidentate P,P-donor ligands of potential interest to homogeneous catalysis to compare three dimensionality reduction techniques, namely Principal Component Analysis (PCA), Uniform Manifold Approximation and Projection (UMAP) and t-distributed Stochastic Neighbor Embedding (t-SNE). While our previous work on Ligand Knowledge Bases has focused on PCA, here we compare this approach with more recently-published approaches and assess the information retention, visualization, clustering and interpretability which can be achieved for each approach. We find that potential advantages of t-SNE are not realized with a database of the current size (275 entries), and that there is a degree of complementarity between PCA and UMAP. The statistics underlying PCA rely on linear relationships, making interpretation of the resulting plots comparatively straightforward. Since much of chemistry relies on linear structure-property relationships and low-dimensional visualization, the explainability and information retention achieved is attractive. UMAP proved more challenging to interpret, but achieved clear clustering which was often chemically meaningful, and it would be a useful approach for ensuring that distinct subsets of compounds are sampled in a machine-learning context. This analysis also highlighted that the tunability of catalysis achieved through ligand exchange maps well onto some areas of chemical space where closely related ligands cluster, while others represent outliers; these arise from different combinations of steric and electronic effects which chemists will find intuitive.</p></div>","PeriodicalId":72302,"journal":{"name":"Artificial intelligence chemistry","volume":"2 1","pages":"Article 100055"},"PeriodicalIF":0.0,"publicationDate":"2024-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2949747724000137/pdfft?md5=d22dd66b98e698544ad12f66b7d830c4&pid=1-s2.0-S2949747724000137-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139943030","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-02-12DOI: 10.1016/j.aichem.2024.100054
Wei Deng , Yuehua Zhao , Yafang Zheng , Yuan Yin , Yan Huan , Lijun Liu , Dapeng Wang
Designing rubber formulations can greatly benefit from using a database that stores the formulations and corresponding property data of rubber composites. Such a database can expedite the decision-making process by swiftly identifying the most suitable formulations for specific applications. However, the management of a rubber formulation database encounters various issues, including missing formulation and property data, as well as data entry errors. These issues can impede the decision-making processes and even result in incorrect decisions being made. In this study, machine learning (ML) algorithms were applied to analyze rubber formulation databases. Our findings highlight the success of the ML algorithm in effectively filling in missing data and identifying erroneous data. Furthermore, it demonstrates the accurate prediction of properties for untested formulations within the pre-determined database space. The results underline the outstanding performance of ML algorithms in expediting the rubber formulation design process and emphasize their immense potential to play a prominent role in the advancement of rubber composites.
使用可存储橡胶复合材料配方和相应属性数据的数据库,可大大有利于橡胶配方的设计。此类数据库可迅速确定最适合特定应用的配方,从而加快决策过程。然而,橡胶配方数据库的管理会遇到各种问题,包括配方和属性数据缺失以及数据输入错误。这些问题会阻碍决策过程,甚至导致做出错误的决策。本研究采用机器学习(ML)算法分析橡胶配方数据库。我们的研究结果凸显了 ML 算法在有效填补缺失数据和识别错误数据方面的成功。此外,它还证明了在预先确定的数据库空间内对未经测试的配方特性进行准确预测的能力。这些结果凸显了 ML 算法在加快橡胶配方设计过程中的出色表现,并强调了其在推动橡胶复合材料发展方面发挥突出作用的巨大潜力。
{"title":"Machine learning assisted analysis and prediction of rubber formulation using existing databases","authors":"Wei Deng , Yuehua Zhao , Yafang Zheng , Yuan Yin , Yan Huan , Lijun Liu , Dapeng Wang","doi":"10.1016/j.aichem.2024.100054","DOIUrl":"https://doi.org/10.1016/j.aichem.2024.100054","url":null,"abstract":"<div><p>Designing rubber formulations can greatly benefit from using a database that stores the formulations and corresponding property data of rubber composites. Such a database can expedite the decision-making process by swiftly identifying the most suitable formulations for specific applications. However, the management of a rubber formulation database encounters various issues, including missing formulation and property data, as well as data entry errors. These issues can impede the decision-making processes and even result in incorrect decisions being made. In this study, machine learning (ML) algorithms were applied to analyze rubber formulation databases. Our findings highlight the success of the ML algorithm in effectively filling in missing data and identifying erroneous data. Furthermore, it demonstrates the accurate prediction of properties for untested formulations within the pre-determined database space. The results underline the outstanding performance of ML algorithms in expediting the rubber formulation design process and emphasize their immense potential to play a prominent role in the advancement of rubber composites.</p></div>","PeriodicalId":72302,"journal":{"name":"Artificial intelligence chemistry","volume":"2 1","pages":"Article 100054"},"PeriodicalIF":0.0,"publicationDate":"2024-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2949747724000125/pdfft?md5=c058446a90f81b469ca59bff1d08c2a1&pid=1-s2.0-S2949747724000125-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139749640","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-02-06DOI: 10.1016/j.aichem.2024.100053
Yuanzhe Zhou , Shi-Jie Chen
RNA molecules play multifaceted functional and regulatory roles within cells and have garnered significant attention in recent years as promising therapeutic targets. With remarkable successes achieved by artificial intelligence (AI) in different fields such as computer vision and natural language processing, there is a growing imperative to harness AI’s potential in computer-aided drug design (CADD) to discover novel drug compounds that target RNA. Although machine-learning (ML) approaches have been widely adopted in the discovery of small molecules targeting proteins, the application of ML approaches to model interactions between RNA and small molecule is still in its infancy. Compared to protein-targeted drug discovery, the major challenges in ML-based RNA-targeted drug discovery stem from the scarcity of available data resources. With the growing interest and the development of curated databases focusing on interactions between RNA and small molecule, the field anticipates a rapid growth and the opening of a new avenue for disease treatment. In this review, we aim to provide an overview of recent advancements in computationally modeling RNA-small molecule interactions within the context of RNA-targeted drug discovery, with a particular emphasis on methodologies employing ML techniques.
RNA 分子在细胞内发挥着多方面的功能和调控作用,近年来作为有前景的治疗靶点备受关注。随着人工智能(AI)在计算机视觉和自然语言处理等不同领域取得了令人瞩目的成就,利用人工智能在计算机辅助药物设计(CADD)中的潜力来发现靶向 RNA 的新型药物化合物的需求日益迫切。尽管机器学习(ML)方法已被广泛应用于发现靶向蛋白质的小分子化合物,但将 ML 方法应用于 RNA 与小分子化合物之间的相互作用建模仍处于起步阶段。与蛋白质靶向药物发现相比,基于 ML 的 RNA 靶向药物发现面临的主要挑战来自于可用数据资源的稀缺。随着人们对 RNA 与小分子相互作用的兴趣与日俱增,以及以 RNA 与小分子相互作用为重点的研究数据库的开发,该领域有望迅速发展,并为疾病治疗开辟一条新途径。在这篇综述中,我们旨在概述在 RNA 靶向药物发现背景下 RNA-小分子相互作用计算建模的最新进展,并特别强调采用 ML 技术的方法。
{"title":"Advances in machine-learning approaches to RNA-targeted drug design","authors":"Yuanzhe Zhou , Shi-Jie Chen","doi":"10.1016/j.aichem.2024.100053","DOIUrl":"https://doi.org/10.1016/j.aichem.2024.100053","url":null,"abstract":"<div><p>RNA molecules play multifaceted functional and regulatory roles within cells and have garnered significant attention in recent years as promising therapeutic targets. With remarkable successes achieved by artificial intelligence (AI) in different fields such as computer vision and natural language processing, there is a growing imperative to harness AI’s potential in computer-aided drug design (CADD) to discover novel drug compounds that target RNA. Although machine-learning (ML) approaches have been widely adopted in the discovery of small molecules targeting proteins, the application of ML approaches to model interactions between RNA and small molecule is still in its infancy. Compared to protein-targeted drug discovery, the major challenges in ML-based RNA-targeted drug discovery stem from the scarcity of available data resources. With the growing interest and the development of curated databases focusing on interactions between RNA and small molecule, the field anticipates a rapid growth and the opening of a new avenue for disease treatment. In this review, we aim to provide an overview of recent advancements in computationally modeling RNA-small molecule interactions within the context of RNA-targeted drug discovery, with a particular emphasis on methodologies employing ML techniques.</p></div>","PeriodicalId":72302,"journal":{"name":"Artificial intelligence chemistry","volume":"2 1","pages":"Article 100053"},"PeriodicalIF":0.0,"publicationDate":"2024-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2949747724000113/pdfft?md5=300db5aa459794dcdbc0972a40d0ca02&pid=1-s2.0-S2949747724000113-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139714050","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-02-03DOI: 10.1016/j.aichem.2024.100052
Duncan Bossion , Gunnar Nyman , Yohann Scribano
In this work, we investigate the possibility to use an artificial neural network to predict a large number of accurate state-to-state rate constants for atom-diatom collisions, from available rates obtained at two different accuracy levels, using a few accurate rates and many low-accuracy rates. The H + H2 → H2 + H chemical reaction is used to benchmark our neural network, as both low and high accuracy state-to-state rates are available in the literature. Our artificial neural network is a multilayer perceptron, using 8 input neurons including the low-accuracy rate constants, with the high accuracy rate constants as the output neuron. The use of machine learning to predict rate constants is very encouraged, as the rates obtained are accurate, even using as low as 1% of the full dataset to train the neural network, and improve greatly the low accuracy rates previously available. This approach can be used to generate full rate constant datasets with a consistent accuracy, from sparse rates obtained with various methods of different accuracies.
{"title":"Machine learning prediction of state-to-state rate constants for astrochemistry","authors":"Duncan Bossion , Gunnar Nyman , Yohann Scribano","doi":"10.1016/j.aichem.2024.100052","DOIUrl":"https://doi.org/10.1016/j.aichem.2024.100052","url":null,"abstract":"<div><p>In this work, we investigate the possibility to use an artificial neural network to predict a large number of accurate state-to-state rate constants for atom-diatom collisions, from available rates obtained at two different accuracy levels, using a few accurate rates and many low-accuracy rates. The H + H<sub>2</sub> → H<sub>2</sub> + H chemical reaction is used to benchmark our neural network, as both low and high accuracy state-to-state rates are available in the literature. Our artificial neural network is a multilayer perceptron, using 8 input neurons including the low-accuracy rate constants, with the high accuracy rate constants as the output neuron. The use of machine learning to predict rate constants is very encouraged, as the rates obtained are accurate, even using as low as 1% of the full dataset to train the neural network, and improve greatly the low accuracy rates previously available. This approach can be used to generate full rate constant datasets with a consistent accuracy, from sparse rates obtained with various methods of different accuracies.</p></div>","PeriodicalId":72302,"journal":{"name":"Artificial intelligence chemistry","volume":"2 1","pages":"Article 100052"},"PeriodicalIF":0.0,"publicationDate":"2024-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2949747724000101/pdfft?md5=be9d938fa5886a1544bcda53427c4f6f&pid=1-s2.0-S2949747724000101-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139714051","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-01-24DOI: 10.1016/j.aichem.2024.100051
Yixi Zhang , Jin-Da Luo , Hong-Bin Yao , Bin Jiang
Solid-state electrolytes are key ingredients in next-generation devices for energy storage and release. Machine learning molecular dynamics (MLMD) has shown great promise in studying the diffusivity of mobile ions in solid-state electrolytes, with much higher efficiency than conventional ab initio molecular dynamics (AIMD). In this work, we combine an efficient embedded atom neural network (EANN) approach and an uncertainty-driven active learning algorithm that optimally selects data points from high-temperature AIMD trajectories to construct ML potentials for solid-state electrolytes and validate this strategy in a benchmark system, Li3YCl6, for which several controversy theoretical results exist. Through systematic MLMD simulations, we find that a typically used small supercell in AIMD simulations fails to predict the supersonic transition at a critical temperature, leading to a significant overestimation of the Li+ conductivity in Li3YCl6 at room temperature. Fortunately, thanks to the scalability of the EANN potential, extended MLMD simulations in a sufficiently large cell does yield a notable change of temperature-dependence in conductivity at ∼420 K and a much lower room-temperature conductivity in excellent with experiment. Interestingly, our results are all based on a semi-local PBE density functional, which was argued unable to predict the superionic transition. We analyze possible reasons of the seemingly inconsistent MLMD results reported in literature with different ML potentials. This work paves the way of simply using high-temperature AIMD data to generate more reliable MLMD results of low-temperature ionic conductivities in solid-state electrolytes.
{"title":"Size dependent lithium-ion conductivity of solid electrolytes in machine learning molecular dynamics simulations","authors":"Yixi Zhang , Jin-Da Luo , Hong-Bin Yao , Bin Jiang","doi":"10.1016/j.aichem.2024.100051","DOIUrl":"10.1016/j.aichem.2024.100051","url":null,"abstract":"<div><p>Solid-state electrolytes are key ingredients in next-generation devices for energy storage and release. Machine learning molecular dynamics (MLMD) has shown great promise in studying the diffusivity of mobile ions in solid-state electrolytes, with much higher efficiency than conventional ab initio molecular dynamics (AIMD). In this work, we combine an efficient embedded atom neural network (EANN) approach and an uncertainty-driven active learning algorithm that optimally selects data points from high-temperature AIMD trajectories to construct ML potentials for solid-state electrolytes and validate this strategy in a benchmark system, Li<sub>3</sub>YCl<sub>6</sub>, for which several controversy theoretical results exist. Through systematic MLMD simulations, we find that a typically used small supercell in AIMD simulations fails to predict the supersonic transition at a critical temperature, leading to a significant overestimation of the Li<sup>+</sup> conductivity in Li<sub>3</sub>YCl<sub>6</sub> at room temperature. Fortunately, thanks to the scalability of the EANN potential, extended MLMD simulations in a sufficiently large cell does yield a notable change of temperature-dependence in conductivity at ∼420 K and a much lower room-temperature conductivity in excellent with experiment. Interestingly, our results are all based on a semi-local PBE density functional, which was argued unable to predict the superionic transition. We analyze possible reasons of the seemingly inconsistent MLMD results reported in literature with different ML potentials. This work paves the way of simply using high-temperature AIMD data to generate more reliable MLMD results of low-temperature ionic conductivities in solid-state electrolytes.</p></div>","PeriodicalId":72302,"journal":{"name":"Artificial intelligence chemistry","volume":"2 1","pages":"Article 100051"},"PeriodicalIF":0.0,"publicationDate":"2024-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2949747724000095/pdfft?md5=ff9758425c151a024cd1c50e2503eb45&pid=1-s2.0-S2949747724000095-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139635555","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-01-19DOI: 10.1016/j.aichem.2024.100049
Rizvi Syed Aal E Ali , Jiaolong Meng , Muhammad Ehtisham Ibraheem Khan , Xuefeng Jiang
Artificial intelligence (AI) is driving a revolution in chemistry, reshaping the landscape of molecular design. This review explores AI’s pivotal roles in the field of organic synthesis applications. AI accurately predicts reaction outcomes, controls chemical selectivity, simplifies synthesis planning, accelerates catalyst discovery, and fuels material innovation and so on. It seamlessly integrates data-driven algorithms with chemical intuition to redefine molecular design. As AI chemistry advances, it promises accelerated research, sustainability, and innovative solutions to chemistry’s pressing challenges. The fusion of AI and chemistry is poised to shape the field’s future profoundly, offering new horizons in precision and efficiency. This review encapsulates the transformation of AI in chemistry, marking a pivotal moment where algorithms and data converge to revolutionize the world of molecules.
{"title":"Machine learning advancements in organic synthesis: A focused exploration of artificial intelligence applications in chemistry","authors":"Rizvi Syed Aal E Ali , Jiaolong Meng , Muhammad Ehtisham Ibraheem Khan , Xuefeng Jiang","doi":"10.1016/j.aichem.2024.100049","DOIUrl":"https://doi.org/10.1016/j.aichem.2024.100049","url":null,"abstract":"<div><p>Artificial intelligence (AI) is driving a revolution in chemistry, reshaping the landscape of molecular design. This review explores AI’s pivotal roles in the field of organic synthesis applications. AI accurately predicts reaction outcomes, controls chemical selectivity, simplifies synthesis planning, accelerates catalyst discovery, and fuels material innovation and so on. It seamlessly integrates data-driven algorithms with chemical intuition to redefine molecular design. As AI chemistry advances, it promises accelerated research, sustainability, and innovative solutions to chemistry’s pressing challenges. The fusion of AI and chemistry is poised to shape the field’s future profoundly, offering new horizons in precision and efficiency. This review encapsulates the transformation of AI in chemistry, marking a pivotal moment where algorithms and data converge to revolutionize the world of molecules.</p></div>","PeriodicalId":72302,"journal":{"name":"Artificial intelligence chemistry","volume":"2 1","pages":"Article 100049"},"PeriodicalIF":0.0,"publicationDate":"2024-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2949747724000071/pdfft?md5=ca6a79f1c6ae5ed3980ec0ff3589b022&pid=1-s2.0-S2949747724000071-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139549589","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-01-19DOI: 10.1016/j.aichem.2024.100050
Chen Qu, Barry I. Schneider, Anthony J. Kearsley, Walid Keyrouz, Thomas C. Allison
Graph neural networks have been successfully applied to machine learning models related to molecules and crystals, due to the similarity between a molecule/crystal and a graph. In this paper, we present three models that are trained with high-quality experimental data to predict three molecular properties (Kováts retention index, normal boiling point, and mass spectrum), using the same GNN architecture. We show that graph representations of molecules, combined with deep learning methodologies and high-quality data sets, lead to accurate machine learning models to predict molecular properties.
{"title":"Applying graph neural network models to molecular property prediction using high-quality experimental data","authors":"Chen Qu, Barry I. Schneider, Anthony J. Kearsley, Walid Keyrouz, Thomas C. Allison","doi":"10.1016/j.aichem.2024.100050","DOIUrl":"https://doi.org/10.1016/j.aichem.2024.100050","url":null,"abstract":"<div><p>Graph neural networks have been successfully applied to machine learning models related to molecules and crystals, due to the similarity between a molecule/crystal and a graph. In this paper, we present three models that are trained with high-quality experimental data to predict three molecular properties (Kováts retention index, normal boiling point, and mass spectrum), using the same GNN architecture. We show that graph representations of molecules, combined with deep learning methodologies and high-quality data sets, lead to accurate machine learning models to predict molecular properties.</p></div>","PeriodicalId":72302,"journal":{"name":"Artificial intelligence chemistry","volume":"2 1","pages":"Article 100050"},"PeriodicalIF":0.0,"publicationDate":"2024-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2949747724000083/pdfft?md5=d755fd2f616c83e07982edec2890d06c&pid=1-s2.0-S2949747724000083-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139549590","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-01-11DOI: 10.1016/j.aichem.2024.100048
Megha Rajeevan, Rotti Srinivasamurthy Swathi
Gas adsorption on one-atom-thick membranes is a growing technology for separation applications owing to its excellent energy efficiency. Herein, we investigate the adsorption of the noble gases, Ne, Ar and Kr on graphynes (GYs), a novel class of one-atom-thick carbon membranes using a swarm intelligence technique, namely particle swarm optimization (PSO). Modeling the adsorption of noble gas clusters on two-dimensional substrates requires a thorough examination of the energy landscape. The high dimensionality of the problem makes it tricky to employ ab initio methods for such studies, necessitating the use of a metaheuristic global optimization technique such as PSO. We explored the adsorption of 1–30 atoms of Ne, Ar and Kr on α-, β-, γ- and rhombic-GYs to predict the most suitable form of GY for the adsorption of each of the gases. Employing the dispersion-corrected density functional theory (DFT-D) data for the adsorption of single gas atoms as the reference data, we parametrized two empirical pairwise potentials, namely, Lennard-Jones (LJ) and improved Lennard-Jones (ILJ) potentials. We then analyzed the growth pattern as well as the energetics of adsorption using the parametrized potentials, in combination with the PSO technique, which enabled us to predict the best possible membrane for the adsorption of the noble gases: α-GY for Ne and γ-GY for Ar and Kr. The accuracy of our modeling approach is further validated against DFT-D computations thereby establishing that PSO, when combined with the ILJ potential, can serve as a computationally feasible approach for modeling gas adsorption on GYs.
{"title":"Exploring the energy landscape of graphynes for noble gas adsorption using swarm intelligence","authors":"Megha Rajeevan, Rotti Srinivasamurthy Swathi","doi":"10.1016/j.aichem.2024.100048","DOIUrl":"https://doi.org/10.1016/j.aichem.2024.100048","url":null,"abstract":"<div><p>Gas adsorption on one-atom-thick membranes is a growing technology for separation applications owing to its excellent energy efficiency. Herein, we investigate the adsorption of the noble gases, Ne, Ar and Kr on graphynes (GYs), a novel class of one-atom-thick carbon membranes using a swarm intelligence technique, namely particle swarm optimization (PSO). Modeling the adsorption of noble gas clusters on two-dimensional substrates requires a thorough examination of the energy landscape. The high dimensionality of the problem makes it tricky to employ ab initio methods for such studies, necessitating the use of a metaheuristic global optimization technique such as PSO. We explored the adsorption of 1–30 atoms of Ne, Ar and Kr on α-, β-, γ- and rhombic-GYs to predict the most suitable form of GY for the adsorption of each of the gases. Employing the dispersion-corrected density functional theory (DFT-D) data for the adsorption of single gas atoms as the reference data, we parametrized two empirical pairwise potentials, namely, Lennard-Jones (LJ) and improved Lennard-Jones (ILJ) potentials. We then analyzed the growth pattern as well as the energetics of adsorption using the parametrized potentials, in combination with the PSO technique, which enabled us to predict the best possible membrane for the adsorption of the noble gases: α-GY for Ne and γ-GY for Ar and Kr. The accuracy of our modeling approach is further validated against DFT-D computations thereby establishing that PSO, when combined with the ILJ potential, can serve as a computationally feasible approach for modeling gas adsorption on GYs.</p></div>","PeriodicalId":72302,"journal":{"name":"Artificial intelligence chemistry","volume":"2 1","pages":"Article 100048"},"PeriodicalIF":0.0,"publicationDate":"2024-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S294974772400006X/pdfft?md5=13e8fd3ef313b8180bab9c56f7c85352&pid=1-s2.0-S294974772400006X-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139504245","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-01-10DOI: 10.1016/j.aichem.2024.100047
Jakob Gamper , Hans Georg Gallmetzer , Alexander K.H. Weiss , Thomas S. Hofer
In this work, the previously introduced NeuralSchrödinger PINN is extended towards the use of analytical gradient expressions of the loss function. It is shown that the analytical gradients derived in this work increase the convergence properties for both the BFGS and ADAM optimizers compared to the previously employed numerical gradient implementation. In addition, the use of parallelised GPU computations via CUDA greatly increased the computational performance over the previous implementation using single-core CPU computations. As a consequence, an extension of the NeuralSchrödinger PINN towards two-dimensional quantum systems became feasible as also demonstrated in this work.
在这项工作中,先前介绍的神经薛定谔 PINN 被扩展到使用损失函数的分析梯度表达式。结果表明,与之前使用的数值梯度实现相比,本研究中得出的分析梯度提高了 BFGS 和 ADAM 优化器的收敛特性。此外,通过 CUDA 使用 GPU 并行计算,与之前使用单核 CPU 计算相比,大大提高了计算性能。因此,将神经薛定谔 PINN 扩展到二维量子系统是可行的,这也在本研究中得到了证明。
{"title":"A general strategy for improving the performance of PINNs -- Analytical gradients and advanced optimizers in the NeuralSchrödinger framework","authors":"Jakob Gamper , Hans Georg Gallmetzer , Alexander K.H. Weiss , Thomas S. Hofer","doi":"10.1016/j.aichem.2024.100047","DOIUrl":"https://doi.org/10.1016/j.aichem.2024.100047","url":null,"abstract":"<div><p>In this work, the previously introduced NeuralSchrödinger PINN is extended towards the use of analytical gradient expressions of the loss function. It is shown that the analytical gradients derived in this work increase the convergence properties for both the BFGS and ADAM optimizers compared to the previously employed numerical gradient implementation. In addition, the use of parallelised GPU computations <em>via</em> CUDA greatly increased the computational performance over the previous implementation using single-core CPU computations. As a consequence, an extension of the NeuralSchrödinger PINN towards two-dimensional quantum systems became feasible as also demonstrated in this work.</p></div>","PeriodicalId":72302,"journal":{"name":"Artificial intelligence chemistry","volume":"2 1","pages":"Article 100047"},"PeriodicalIF":0.0,"publicationDate":"2024-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2949747724000058/pdfft?md5=a75b7d5a2a3dee17cd82180444493827&pid=1-s2.0-S2949747724000058-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139434123","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}