{"title":"Bayesian Meta-Learning for Few-Shot Reaction Outcome Prediction of Asymmetric Hydrogenation of Olefins","authors":"Sukriti Singh, José Miguel Hernández-Lobato","doi":"10.1002/anie.202503821","DOIUrl":null,"url":null,"abstract":"<p>Recent years have witnessed the increasing application of machine learning (ML) in chemical reaction development. These ML methods, in general, require huge training set examples. The published literature has large amounts of data, but there are modelling challenges due to the sparse nature of these datasets. Herein, we report a meta-learning workflow that can utilize the literature-mined data and return accurate predictions with limited data. A literature dataset comprising of over 12 000 transition metal catalyzed asymmetric hydrogenation of olefins (AHO) is chosen to demonstrate the utility of our protocol. A meta-model is trained in a binary classification setting to identify highly enantioselective AHO reactions. Two Bayesian meta-learning approaches are considered, namely, deep kernel transfer (DKT) and adaptive deep kernel fitting (ADKF). Both these methods returned better predictions compared to prototypical network, which is another popular meta-learning approach. Single-task methods, such as random forest, graph neural network, and deep kernel learning, performed worse than meta-learning methods even when trained on full training data. Additionally, we propose another meta-learning approach called ADKF-prior that is shown to further improve the performance in low-data settings. The generalizability of our meta-model is also evaluated on substrate- and time-based splits. Our meta-learning workflow can be utilized to build a pretrained meta-model for any reaction of interest, which can then be useful to predict the outcome of new but related reactions in a few-shot manners.</p>","PeriodicalId":125,"journal":{"name":"Angewandte Chemie International Edition","volume":"64 27","pages":""},"PeriodicalIF":16.9000,"publicationDate":"2025-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/anie.202503821","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Angewandte Chemie International Edition","FirstCategoryId":"92","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/anie.202503821","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Recent years have witnessed the increasing application of machine learning (ML) in chemical reaction development. These ML methods, in general, require huge training set examples. The published literature has large amounts of data, but there are modelling challenges due to the sparse nature of these datasets. Herein, we report a meta-learning workflow that can utilize the literature-mined data and return accurate predictions with limited data. A literature dataset comprising of over 12 000 transition metal catalyzed asymmetric hydrogenation of olefins (AHO) is chosen to demonstrate the utility of our protocol. A meta-model is trained in a binary classification setting to identify highly enantioselective AHO reactions. Two Bayesian meta-learning approaches are considered, namely, deep kernel transfer (DKT) and adaptive deep kernel fitting (ADKF). Both these methods returned better predictions compared to prototypical network, which is another popular meta-learning approach. Single-task methods, such as random forest, graph neural network, and deep kernel learning, performed worse than meta-learning methods even when trained on full training data. Additionally, we propose another meta-learning approach called ADKF-prior that is shown to further improve the performance in low-data settings. The generalizability of our meta-model is also evaluated on substrate- and time-based splits. Our meta-learning workflow can be utilized to build a pretrained meta-model for any reaction of interest, which can then be useful to predict the outcome of new but related reactions in a few-shot manners.
期刊介绍:
Angewandte Chemie, a journal of the German Chemical Society (GDCh), maintains a leading position among scholarly journals in general chemistry with an impressive Impact Factor of 16.6 (2022 Journal Citation Reports, Clarivate, 2023). Published weekly in a reader-friendly format, it features new articles almost every day. Established in 1887, Angewandte Chemie is a prominent chemistry journal, offering a dynamic blend of Review-type articles, Highlights, Communications, and Research Articles on a weekly basis, making it unique in the field.