首页 > 最新文献

Artificial intelligence chemistry最新文献

英文 中文
Orders of coupling representations as a versatile framework for machine learning from sparse data in high-dimensional spaces 耦合表示的阶数作为高维空间中稀疏数据机器学习的通用框架
Pub Date : 2023-07-17 DOI: 10.1016/j.aichem.2023.100008
Sergei Manzhos , Tucker Carrington , Manabu Ihara

Machine learning (ML) techniques are already widely and increasingly used in diverse applications in science and technology, including computational chemistry. Specifically in computational chemistry, neural networks (NN) and kernel methods such as Gaussian process regressions (GPR) have been increasingly used for the construction of potential functions and functionals for density functional theory. While ML techniques have a number of advantages vs intuition-based models, notably their generality and black-box nature, they are still challenged when faced with high dimensionality of the feature space or low and uneven data density – in part because of their general nature. We review recent works using methods such as NNs and GPR as building blocks of composite methods in the framework of an expansion over orders of coupling. We introduce models using NN or GPR-based components as part of HDMR (high-dimensional model representations)-based structures. HDMR is a formalization of orders-of-coupling representations that include the many-body and N-mode representations well known in computational chemistry and allows, in particular, building all terms from one dataset of arbitrarily distributed data. The resulting HDMR-NN and HDMR-GPR combinations and NN with HDMR-GPR derived neuron activation functions not requiring non-linear optimization enhance machine learning capabilities in high dimensional spaces and or with sparse data.

机器学习(ML)技术已经被广泛且越来越多地用于科学技术的各种应用,包括计算化学。特别是在计算化学中,神经网络(NN)和高斯过程回归(GPR)等核方法越来越多地用于构建密度泛函理论的势函数和泛函。虽然ML技术与基于直觉的模型相比有很多优势,特别是它们的通用性和黑匣子性质,但当面临高维度的特征空间或低且不均匀的数据密度时,它们仍然面临挑战——部分原因是它们的一般性。我们回顾了最近的工作,使用神经网络和GPR等方法作为复合方法的构建块,在耦合阶数扩展的框架中。我们介绍了使用基于NN或GPR的组件作为基于HDMR(高维模型表示)的结构的一部分的模型。HDMR是耦合表示顺序的形式化,包括计算化学中众所周知的多体和N模表示,并且特别允许从任意分布数据的一个数据集构建所有项。所得到的HDMR-NN和HDMR-GPR组合以及不需要非线性优化的具有HDMR-GPR-衍生的神经元激活函数的NN增强了高维空间和/或稀疏数据中的机器学习能力。
{"title":"Orders of coupling representations as a versatile framework for machine learning from sparse data in high-dimensional spaces","authors":"Sergei Manzhos ,&nbsp;Tucker Carrington ,&nbsp;Manabu Ihara","doi":"10.1016/j.aichem.2023.100008","DOIUrl":"https://doi.org/10.1016/j.aichem.2023.100008","url":null,"abstract":"<div><p>Machine learning (ML) techniques are already widely and increasingly used in diverse applications in science and technology, including computational chemistry. Specifically in computational chemistry, neural networks (NN) and kernel methods such as Gaussian process regressions (GPR) have been increasingly used for the construction of potential functions and functionals for density functional theory. While ML techniques have a number of advantages vs intuition-based models, notably their generality and black-box nature, they are still challenged when faced with high dimensionality of the feature space or low and uneven data density – in part because of their general nature. We review recent works using methods such as NNs and GPR as building blocks of composite methods in the framework of an expansion over orders of coupling. We introduce models using NN or GPR-based components as part of HDMR (high-dimensional model representations)-based structures. HDMR is a formalization of orders-of-coupling representations that include the many-body and N-mode representations well known in computational chemistry and allows, in particular, building all terms from one dataset of arbitrarily distributed data. The resulting HDMR-NN and HDMR-GPR combinations and NN with HDMR-GPR derived neuron activation functions not requiring non-linear optimization enhance machine learning capabilities in high dimensional spaces and or with sparse data.</p></div>","PeriodicalId":72302,"journal":{"name":"Artificial intelligence chemistry","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49764054","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
How do centrality measures help to predict similarity patterns in molecular chemical structural graphs? 中心性测量如何帮助预测分子化学结构图中的相似模式?
Pub Date : 2023-07-13 DOI: 10.1016/j.aichem.2023.100007
Nirmala Parisutham

The proposed work uses centrality measures based heuristic method to improve the efficiency of the solution for the similarity search problem in molecular chemical graphs by effectively identifying central candidate or representative candidate nodes, which simplify the complex processes involved while detecting a large-sized maximal common connected edge subgraph. After analyzing the structure of the two input molecular chemical graphs, a Tensor Product graph is created. This newly built graph is further analyzed to get the similarity pattern of the input graphs. It is an open problem to decide which centrality measure selects the best central candidate node in Tensor Product graphs to get a large maximal common connected edge graph. Since each centrality measure is analyses, the given graph is uniquely based on its own specific aspects. The proposed work offers directions on using various centrality measures to determine a big-sized maximal common connected subgraph for two molecular chemical input graphs. It also analyses seven centrality measures to select the best candidate node in the Tensor Product graph of two input chemical molecular graphs. Based on the obtained results, the betweenness centrality and degree centrality measures exclusively help to get large-sized similarity patterns.

该工作使用基于中心性测度的启发式方法,通过有效识别中心候选或代表性候选节点,提高了分子化学图中相似性搜索问题的求解效率,简化了检测大型最大公共连接边子图时涉及的复杂过程。在分析了两个输入分子化学图的结构后,建立了张量乘积图。对这个新建立的图进行进一步的分析,得到输入图的相似模式。在张量乘积图中,决定哪个中心性测度选择最佳的中心候选节点来得到一个大的最大公共连通边图是一个开放问题。由于每个中心性度量都是分析的,因此给定的图基于其自身的特定方面是唯一的。所提出的工作为使用各种中心性度量来确定两个分子化学输入图的大尺寸最大公共连通子图提供了指导。它还分析了在两个输入化学分子图的张量乘积图中选择最佳候选节点的七个中心性度量。基于所获得的结果,介数中心性和度中心性度量完全有助于获得大尺度的相似模式。
{"title":"How do centrality measures help to predict similarity patterns in molecular chemical structural graphs?","authors":"Nirmala Parisutham","doi":"10.1016/j.aichem.2023.100007","DOIUrl":"https://doi.org/10.1016/j.aichem.2023.100007","url":null,"abstract":"<div><p>The proposed work uses centrality measures based heuristic method to improve the efficiency of the solution for the similarity search problem in molecular chemical graphs by effectively identifying central candidate or representative candidate nodes, which simplify the complex processes involved while detecting a large-sized maximal common connected edge subgraph. After analyzing the structure of the two input molecular chemical graphs, a Tensor Product graph is created. This newly built graph is further analyzed to get the similarity pattern of the input graphs. It is an open problem to decide which centrality measure selects the best central candidate node in Tensor Product graphs to get a large maximal common connected edge graph. Since each centrality measure is analyses, the given graph is uniquely based on its own specific aspects. The proposed work offers directions on using various centrality measures to determine a big-sized maximal common connected subgraph for two molecular chemical input graphs. It also analyses seven centrality measures to select the best candidate node in the Tensor Product graph of two input chemical molecular graphs. Based on the obtained results, the betweenness centrality and degree centrality measures exclusively help to get large-sized similarity patterns.</p></div>","PeriodicalId":72302,"journal":{"name":"Artificial intelligence chemistry","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49764053","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ML meets MLn: Machine learning in ligand promoted homogeneous catalysis ML与MLn相遇:配体促进均相催化的机器学习
Pub Date : 2023-07-11 DOI: 10.1016/j.aichem.2023.100006
Jonathan D. Hirst , Samuel Boobier , Jennifer Coughlan , Jessica Streets , Philippa L. Jacob , Oska Pugh , Ender Özcan , Simon Woodward

The benefits of using machine learning approaches in the design, optimisation and understanding of homogeneous catalytic processes are being increasingly realised. We focus on the understanding and implementation of key concepts, which serve as conduits to more advanced chemical machine learning literature, much of which is (presently) outside the area of homogeneous catalysis. Potential pitfalls in the ‘workflow’ procedures needed in the machine learning process are identified and all the examples provided are in a chemical sciences context, including several from ‘real world’ catalyst systems. Finally, potential areas of expansion and impact for machine learning in homogeneous catalysis in the future are considered.

在均匀催化过程的设计、优化和理解中使用机器学习方法的好处正在日益实现。我们专注于理解和实施关键概念,这些概念是通往更先进的化学机器学习文献的渠道,其中大部分(目前)不在均相催化领域。识别了机器学习过程中所需的“工作流程”程序中的潜在陷阱,提供的所有示例都是在化学科学背景下提供的,包括来自“现实世界”催化剂系统的几个示例。最后,考虑了机器学习在均相催化领域未来的潜在扩展和影响。
{"title":"ML meets MLn: Machine learning in ligand promoted homogeneous catalysis","authors":"Jonathan D. Hirst ,&nbsp;Samuel Boobier ,&nbsp;Jennifer Coughlan ,&nbsp;Jessica Streets ,&nbsp;Philippa L. Jacob ,&nbsp;Oska Pugh ,&nbsp;Ender Özcan ,&nbsp;Simon Woodward","doi":"10.1016/j.aichem.2023.100006","DOIUrl":"https://doi.org/10.1016/j.aichem.2023.100006","url":null,"abstract":"<div><p>The benefits of using machine learning approaches in the design, optimisation and understanding of homogeneous catalytic processes are being increasingly realised. We focus on the understanding and implementation of key concepts, which serve as conduits to more advanced chemical machine learning literature, much of which is (presently) outside the area of homogeneous catalysis. Potential pitfalls in the ‘workflow’ procedures needed in the machine learning process are identified and all the examples provided are in a chemical sciences context, including several from ‘real world’ catalyst systems. Finally, potential areas of expansion and impact for machine learning in homogeneous catalysis in the future are considered.</p></div>","PeriodicalId":72302,"journal":{"name":"Artificial intelligence chemistry","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49764052","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Evaluating point-prediction uncertainties in neural networks for protein-ligand binding prediction 评估神经网络在蛋白质配体结合预测中的点预测不确定性
Pub Date : 2023-06-01 DOI: 10.1016/j.aichem.2023.100004
Ya Ju Fan , Jonathan E. Allen , Kevin S. McLoughlin , Da Shi , Brian J. Bennion , Xiaohua Zhang , Felice C. Lightstone

Neural Network (NN) models provide potential to speed up the drug discovery process and reduce its failure rates. The success of NN models requires uncertainty quantification (UQ) as drug discovery explores chemical space beyond the training data distribution. Standard NN models do not provide uncertainty information. Some methods require changing the NN architecture or training procedure, limiting the selection of NN models. Moreover, predictive uncertainty can come from different sources. It is important to have the ability to separately model different types of predictive uncertainty, as the model can take assorted actions depending on the source of uncertainty. In this paper, we examine UQ methods that estimate different sources of predictive uncertainty for NN models aiming at protein-ligand binding prediction. We use our prior knowledge on chemical compounds to design the experiments. By utilizing a visualization method we create non-overlapping and chemically diverse partitions from a collection of chemical compounds. These partitions are used as training and test set splits to explore NN model uncertainty. We demonstrate how the uncertainties estimated by the selected methods describe different sources of uncertainty under different partitions and featurization schemes and the relationship to prediction error.

神经网络(NN)模型提供了加快药物发现过程并降低其失败率的潜力。神经网络模型的成功需要不确定性量化(UQ),因为药物发现探索了训练数据分布之外的化学空间。标准NN模型不提供不确定性信息。一些方法需要改变神经网络架构或训练程序,从而限制神经网络模型的选择。此外,预测的不确定性可能来自不同的来源。重要的是要有能力分别对不同类型的预测不确定性进行建模,因为模型可以根据不确定性的来源采取各种行动。在本文中,我们检验了UQ方法,这些方法估计了针对蛋白质配体结合预测的NN模型的不同预测不确定性来源。我们利用我们对化合物的先验知识来设计实验。通过使用可视化方法,我们从一组化合物中创建了不重叠和化学多样的分区。这些分区被用作训练和测试集分割,以探索神经网络模型的不确定性。我们展示了所选方法估计的不确定性如何在不同的划分和特征化方案下描述不同的不确定性来源,以及与预测误差的关系。
{"title":"Evaluating point-prediction uncertainties in neural networks for protein-ligand binding prediction","authors":"Ya Ju Fan ,&nbsp;Jonathan E. Allen ,&nbsp;Kevin S. McLoughlin ,&nbsp;Da Shi ,&nbsp;Brian J. Bennion ,&nbsp;Xiaohua Zhang ,&nbsp;Felice C. Lightstone","doi":"10.1016/j.aichem.2023.100004","DOIUrl":"10.1016/j.aichem.2023.100004","url":null,"abstract":"<div><p>Neural Network (NN) models provide potential to speed up the drug discovery process and reduce its failure rates. The success of NN models requires uncertainty quantification (UQ) as drug discovery explores chemical space beyond the training data distribution. Standard NN models do not provide uncertainty information. Some methods require changing the NN architecture or training procedure, limiting the selection of NN models. Moreover, predictive uncertainty can come from different sources. It is important to have the ability to separately model different types of predictive uncertainty, as the model can take assorted actions depending on the source of uncertainty. In this paper, we examine UQ methods that estimate different sources of predictive uncertainty for NN models aiming at protein-ligand binding prediction. We use our prior knowledge on chemical compounds to design the experiments. By utilizing a visualization method we create non-overlapping and chemically diverse partitions from a collection of chemical compounds. These partitions are used as training and test set splits to explore NN model uncertainty. We demonstrate how the uncertainties estimated by the selected methods describe different sources of uncertainty under different partitions and featurization schemes and the relationship to prediction error.</p></div>","PeriodicalId":72302,"journal":{"name":"Artificial intelligence chemistry","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/9f/25/nihms-1912151.PMC10426331.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10019861","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reproducing the color with reformulated recipe 用重新配方再现颜色
Pub Date : 2023-06-01 DOI: 10.1016/j.aichem.2023.100003
Jinming Fan , Chao Qian , Shaodong Zhou

A reverse molecule contribution (reMC) - molecule contribution (MC) – Machine learning (ML) protocol for disassemble and reproduce the spectrum is presented. By splitting the mixture spectrum with monochromophoric spectra in the database in a “Peeling-Onion” manner, a new recipe can be obtained. Upon comparison of the reproduced spectrum (with the forward molecular contribution - machine learning method) with the original one, the reliability of the proposed method is justified.

提出了一种用于分解和再现光谱的反向分子贡献(reMC)-分子贡献(MC)-机器学习(ML)协议。通过在数据库中以“剥洋葱皮”的方式将混合物光谱与单色光谱分开,可以获得新的配方。通过将再现的光谱(采用正向分子贡献-机器学习方法)与原始光谱进行比较,证明了所提出方法的可靠性。
{"title":"Reproducing the color with reformulated recipe","authors":"Jinming Fan ,&nbsp;Chao Qian ,&nbsp;Shaodong Zhou","doi":"10.1016/j.aichem.2023.100003","DOIUrl":"https://doi.org/10.1016/j.aichem.2023.100003","url":null,"abstract":"<div><p>A reverse molecule contribution (reMC) - molecule contribution (MC) – Machine learning (ML) protocol for disassemble and reproduce the spectrum is presented. By splitting the mixture spectrum with monochromophoric spectra in the database in a “Peeling-Onion” manner, a new recipe can be obtained. Upon comparison of the reproduced spectrum (with the forward molecular contribution - machine learning method) with the original one, the reliability of the proposed method is justified.</p></div>","PeriodicalId":72302,"journal":{"name":"Artificial intelligence chemistry","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49817436","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine learning modeling of the absorption properties of azobenzene molecules 偶氮苯分子吸收特性的机器学习建模
Pub Date : 2023-06-01 DOI: 10.1016/j.aichem.2023.100002
Valentin Stanev , Ryota Maehashi , Yoshimi Ohta , Ichiro Takeuchi

We present a machine learning framework for modeling the absorption properties of azobenzene molecules – an important class of organic compounds with many potential photochemical applications. The framework utilizes predictors based on the chemical composition and structure of each molecule and consists of separate regression models trained to predict the absorption at distinct wavelengths, covering the UV and visible light ranges. Despite the relatively small size of the dataset (330 molecule-absorption spectrum pairs), the models were able to learn to accurately predict the absorption at fixed wavelengths, as well as the position and intensity of the maximum absorption. These predictions can be used to rapidly screen thousands of candidate molecules for a variety of potential applications, reducing the need for time-consuming and expensive experiments or first-principles computations.

我们提出了一个机器学习框架来模拟偶氮苯分子的吸收特性,偶氮苯是一类重要的有机化合物,具有许多潜在的光化学应用。该框架利用了基于每个分子的化学组成和结构的预测因子,并由单独的回归模型组成,这些模型经过训练,可以预测不同波长的吸收,覆盖紫外线和可见光范围。尽管数据集的大小相对较小(330个分子吸收光谱对),但模型能够学习准确预测固定波长下的吸收,以及最大吸收的位置和强度。这些预测可以用于快速筛选数千个候选分子,用于各种潜在的应用,减少了耗时和昂贵的实验或第一性原理计算的需要。
{"title":"Machine learning modeling of the absorption properties of azobenzene molecules","authors":"Valentin Stanev ,&nbsp;Ryota Maehashi ,&nbsp;Yoshimi Ohta ,&nbsp;Ichiro Takeuchi","doi":"10.1016/j.aichem.2023.100002","DOIUrl":"https://doi.org/10.1016/j.aichem.2023.100002","url":null,"abstract":"<div><p>We present a machine learning framework for modeling the absorption properties of azobenzene molecules – an important class of organic compounds with many potential photochemical applications. The framework utilizes predictors based on the chemical composition and structure of each molecule and consists of separate regression models trained to predict the absorption at distinct wavelengths, covering the UV and visible light ranges. Despite the relatively small size of the dataset (330 molecule-absorption spectrum pairs), the models were able to learn to accurately predict the absorption at fixed wavelengths, as well as the position and intensity of the maximum absorption. These predictions can be used to rapidly screen thousands of candidate molecules for a variety of potential applications, reducing the need for time-consuming and expensive experiments or first-principles computations.</p></div>","PeriodicalId":72302,"journal":{"name":"Artificial intelligence chemistry","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49817437","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Starting the new journal of “Artificial Intelligence Chemistry” 创办新期刊《人工智能化学》
Pub Date : 2023-06-01 DOI: 10.1016/j.aichem.2023.100001
Jun Jiang , Song Wang , Shaul Mukamel
{"title":"Starting the new journal of “Artificial Intelligence Chemistry”","authors":"Jun Jiang ,&nbsp;Song Wang ,&nbsp;Shaul Mukamel","doi":"10.1016/j.aichem.2023.100001","DOIUrl":"https://doi.org/10.1016/j.aichem.2023.100001","url":null,"abstract":"","PeriodicalId":72302,"journal":{"name":"Artificial intelligence chemistry","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49817438","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Probing the origin of higher efficiency of terphenyl phosphine over the biaryl framework in Pd-catalyzed C-N coupling: A combined DFT and machine learning study 在钯催化的C-N偶联中,三苯基膦在双芳基骨架上的高效性的来源:DFT和机器学习的结合研究
Pub Date : 2023-06-01 DOI: 10.1016/j.aichem.2023.100005
Qingfu Ye , Yu Zhao , Jun Zhu

The Pd-catalyzed Buchwald–Hartwig coupling reaction is important in the construction of the C-N bond due to various applications in organic synthesis. Quantum chemical calculations are widely used in understanding reaction mechanisms whereas the machine learning method is extremely popular in recognizing the relationships of data. Here, we combine density functional theory calculations with the support vector regression method to probe the origin of the higher efficiency of terphenyl phosphine ligand over the biaryl counterpart in the Buchwald–Hartwig C-N coupling reaction. By quantum chemical calculations, the turnover frequency-determining transition states are located and ligand features are calculated with high accuracy. By machine learning, the relationship between the reaction barrier and ligand features has been examined. It is found that the interplay of the charge on the metal center, the cone angle of the ligands, and the Sterimol L parameters of the ligand determines the catalytic performance of the palladium catalysts with different phosphine ligands. Our findings could help experimental chemists to design the ligands for Pd-catalyzed C-N coupling reactions with high efficiency.

钯催化的Buchwald–Hartwig偶联反应在有机合成中的各种应用,在C-N键的构建中具有重要意义。量子化学计算被广泛用于理解反应机制,而机器学习方法在识别数据关系方面非常流行。在这里,我们将密度泛函理论计算与支持向量回归方法相结合,以探索三苯基膦配体在Buchwald–Hartwig C-N偶联反应中比二芳基配体效率更高的原因。通过量子化学计算,定位了决定跃迁态的翻转频率,并高精度地计算了配体的特征。通过机器学习,研究了反应屏障和配体特征之间的关系。研究发现,金属中心上的电荷、配体的锥角和配体的Sterimol L参数的相互作用决定了具有不同膦配体的钯催化剂的催化性能。我们的发现可以帮助实验化学家设计高效的钯催化的C-N偶联反应的配体。
{"title":"Probing the origin of higher efficiency of terphenyl phosphine over the biaryl framework in Pd-catalyzed C-N coupling: A combined DFT and machine learning study","authors":"Qingfu Ye ,&nbsp;Yu Zhao ,&nbsp;Jun Zhu","doi":"10.1016/j.aichem.2023.100005","DOIUrl":"https://doi.org/10.1016/j.aichem.2023.100005","url":null,"abstract":"<div><p>The Pd-catalyzed Buchwald–Hartwig coupling reaction is important in the construction of the C-N bond due to various applications in organic synthesis. Quantum chemical calculations are widely used in understanding reaction mechanisms whereas the machine learning method is extremely popular in recognizing the relationships of data. Here, we combine density functional theory calculations with the support vector regression method to probe the origin of the higher efficiency of terphenyl phosphine ligand over the biaryl counterpart in the Buchwald–Hartwig C-N coupling reaction. By quantum chemical calculations, the turnover frequency-determining transition states are located and ligand features are calculated with high accuracy. By machine learning, the relationship between the reaction barrier and ligand features has been examined. It is found that the interplay of the charge on the metal center, the cone angle of the ligands, and the Sterimol L parameters of the ligand determines the catalytic performance of the palladium catalysts with different phosphine ligands. Our findings could help experimental chemists to design the ligands for Pd-catalyzed C-N coupling reactions with high efficiency.</p></div>","PeriodicalId":72302,"journal":{"name":"Artificial intelligence chemistry","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49817020","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Artificial intelligence chemistry
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1