{"title":"ChemCNet: An Explainable Integrated Model for Intelligent Analyzing Chemistry Synthesis Reactions","authors":"Lanfeng Wang, Hengzhe Wang, Shuoshi Liu, Zixin Li, Yaping Yu, Yun Chai, Xiaohui Yang","doi":"10.46793/match.91-1.041w","DOIUrl":null,"url":null,"abstract":"Palladium (Pd)-catalyzed cross coupling reactions are of great significance in organic synthesis. However, the reaction route is more complex, time-consuming and costly. For addressing the above problems, a model-related feature selection strategy is introduced, focusing on iterative optimization of feature description and prediction to guide and strengthen each other. Then, we combine the lightweight convolution neural network (CNN) driven by attention mechanism with CatBoost to build an intelligent chemical synthesis reaction analysis model-ChemCNet. Moreover, we conduct the interpretability analysis based on ChemCNet model. The results show that ChemCNet model has achieved relatively high prediction accuracy and generalization, and it is helpful to provide reliable decision-making information for the experimenter or institution.","PeriodicalId":51115,"journal":{"name":"Match-Communications in Mathematical and in Computer Chemistry","volume":"33 1","pages":"0"},"PeriodicalIF":2.9000,"publicationDate":"2023-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Match-Communications in Mathematical and in Computer Chemistry","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.46793/match.91-1.041w","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Palladium (Pd)-catalyzed cross coupling reactions are of great significance in organic synthesis. However, the reaction route is more complex, time-consuming and costly. For addressing the above problems, a model-related feature selection strategy is introduced, focusing on iterative optimization of feature description and prediction to guide and strengthen each other. Then, we combine the lightweight convolution neural network (CNN) driven by attention mechanism with CatBoost to build an intelligent chemical synthesis reaction analysis model-ChemCNet. Moreover, we conduct the interpretability analysis based on ChemCNet model. The results show that ChemCNet model has achieved relatively high prediction accuracy and generalization, and it is helpful to provide reliable decision-making information for the experimenter or institution.
期刊介绍:
MATCH Communications in Mathematical and in Computer Chemistry publishes papers of original research as well as reviews on chemically important mathematical results and non-routine applications of mathematical techniques to chemical problems. A paper acceptable for publication must contain non-trivial mathematics or communicate non-routine computer-based procedures AND have a clear connection to chemistry. Papers are published without any processing or publication charge.