Kaile Dou, Weibo Zhao, Chenyue Wang, Yuanchen Fan, Chunlin He, Lei Zhang, Siping Pang
{"title":"开启胺底物对六氮杂环戊烷笼的反应性:量身定制的机器学习模型带来的启示","authors":"Kaile Dou, Weibo Zhao, Chenyue Wang, Yuanchen Fan, Chunlin He, Lei Zhang, Siping Pang","doi":"10.1016/j.cej.2024.157677","DOIUrl":null,"url":null,"abstract":"The efficient synthesis of novel hexaazaisowurtzitane cage compounds has remained a formidable challenge for years due to the complicated reaction mechanism and the uncertainty of amine substrate selection. Here, we developed a tailored machine learning model to predict the reactivity of amine substrates towards hexaazaisowurtzitane cage based on high-throughput quantum mechanical calculations of 3428 property parameters of 118 amine substrates. The customized model was developed through an appropriately weighted fusion of advanced universal models, achieving comprehensive predictive capability with an accuracy of 91.4 %, an F1 score of 89.1 %, and a recall of 91.4 %. Further, the customized model exhibits a narrow interquartile range of accuracy, surpassing universal models by 30.6–54.4 % and demonstrating robustness across various data splits. The data-driven analysis identified that electronic and geometric features are the dominant regulating factors of amine’s reactivity. Further, physics-driven insights revealed that a low electron-density environment near the nitrogen in the amine group is a key for switching on the reactivity of the amine substrates, which can be characterized by a sufficiently high NMR signal around 225.7 ppm with a narrow fluctuation of 2.6 ppm. Based on the revealed guiding factors and regulating mechanism, we selected 27 commercially available amine substrates for reactivity assessment and recommended 5 candidates with a probability exceeding 90 % for synthesis trials. This work pioneers machine learning and high-throughput quantum mechanical computationally assisted prediction of substrate selection for the rational synthesis of hexaazaisowurtzitane cages.","PeriodicalId":270,"journal":{"name":"Chemical Engineering Journal","volume":null,"pages":null},"PeriodicalIF":13.3000,"publicationDate":"2024-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Switch on amine substrate reactivity towards hexaazaisowurtzitane cage: Insights from a tailored machine learning model\",\"authors\":\"Kaile Dou, Weibo Zhao, Chenyue Wang, Yuanchen Fan, Chunlin He, Lei Zhang, Siping Pang\",\"doi\":\"10.1016/j.cej.2024.157677\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The efficient synthesis of novel hexaazaisowurtzitane cage compounds has remained a formidable challenge for years due to the complicated reaction mechanism and the uncertainty of amine substrate selection. Here, we developed a tailored machine learning model to predict the reactivity of amine substrates towards hexaazaisowurtzitane cage based on high-throughput quantum mechanical calculations of 3428 property parameters of 118 amine substrates. The customized model was developed through an appropriately weighted fusion of advanced universal models, achieving comprehensive predictive capability with an accuracy of 91.4 %, an F1 score of 89.1 %, and a recall of 91.4 %. Further, the customized model exhibits a narrow interquartile range of accuracy, surpassing universal models by 30.6–54.4 % and demonstrating robustness across various data splits. The data-driven analysis identified that electronic and geometric features are the dominant regulating factors of amine’s reactivity. Further, physics-driven insights revealed that a low electron-density environment near the nitrogen in the amine group is a key for switching on the reactivity of the amine substrates, which can be characterized by a sufficiently high NMR signal around 225.7 ppm with a narrow fluctuation of 2.6 ppm. Based on the revealed guiding factors and regulating mechanism, we selected 27 commercially available amine substrates for reactivity assessment and recommended 5 candidates with a probability exceeding 90 % for synthesis trials. This work pioneers machine learning and high-throughput quantum mechanical computationally assisted prediction of substrate selection for the rational synthesis of hexaazaisowurtzitane cages.\",\"PeriodicalId\":270,\"journal\":{\"name\":\"Chemical Engineering Journal\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":13.3000,\"publicationDate\":\"2024-11-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Chemical Engineering Journal\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1016/j.cej.2024.157677\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, CHEMICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chemical Engineering Journal","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1016/j.cej.2024.157677","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CHEMICAL","Score":null,"Total":0}
Switch on amine substrate reactivity towards hexaazaisowurtzitane cage: Insights from a tailored machine learning model
The efficient synthesis of novel hexaazaisowurtzitane cage compounds has remained a formidable challenge for years due to the complicated reaction mechanism and the uncertainty of amine substrate selection. Here, we developed a tailored machine learning model to predict the reactivity of amine substrates towards hexaazaisowurtzitane cage based on high-throughput quantum mechanical calculations of 3428 property parameters of 118 amine substrates. The customized model was developed through an appropriately weighted fusion of advanced universal models, achieving comprehensive predictive capability with an accuracy of 91.4 %, an F1 score of 89.1 %, and a recall of 91.4 %. Further, the customized model exhibits a narrow interquartile range of accuracy, surpassing universal models by 30.6–54.4 % and demonstrating robustness across various data splits. The data-driven analysis identified that electronic and geometric features are the dominant regulating factors of amine’s reactivity. Further, physics-driven insights revealed that a low electron-density environment near the nitrogen in the amine group is a key for switching on the reactivity of the amine substrates, which can be characterized by a sufficiently high NMR signal around 225.7 ppm with a narrow fluctuation of 2.6 ppm. Based on the revealed guiding factors and regulating mechanism, we selected 27 commercially available amine substrates for reactivity assessment and recommended 5 candidates with a probability exceeding 90 % for synthesis trials. This work pioneers machine learning and high-throughput quantum mechanical computationally assisted prediction of substrate selection for the rational synthesis of hexaazaisowurtzitane cages.
期刊介绍:
The Chemical Engineering Journal is an international research journal that invites contributions of original and novel fundamental research. It aims to provide an international platform for presenting original fundamental research, interpretative reviews, and discussions on new developments in chemical engineering. The journal welcomes papers that describe novel theory and its practical application, as well as those that demonstrate the transfer of techniques from other disciplines. It also welcomes reports on carefully conducted experimental work that is soundly interpreted. The main focus of the journal is on original and rigorous research results that have broad significance. The Catalysis section within the Chemical Engineering Journal focuses specifically on Experimental and Theoretical studies in the fields of heterogeneous catalysis, molecular catalysis, and biocatalysis. These studies have industrial impact on various sectors such as chemicals, energy, materials, foods, healthcare, and environmental protection.