{"title":"Protocol to generate dual-target compounds using a transformer chemical language model.","authors":"Sanjana Srinivasan, Jürgen Bajorath","doi":"10.1016/j.xpro.2024.103584","DOIUrl":null,"url":null,"abstract":"<p><p>Here, we present a protocol to generate dual-target compounds (DT-CPDs) interacting with two distinct target proteins using a transformer-based chemical language model. We describe steps for installing software, preparing data, and pre-training the model on pairs of single-target compounds (ST-CPDs), which bind to an individual protein, and DT-CPDs. We then detail procedures for assembling ST- and corresponding DT-CPD data for specific protein pairs and evaluating the model's performance on hold-out test sets. For complete details on the use and execution of this protocol, please refer to Srinivasan and Bajorath.<sup>1</sup>.</p>","PeriodicalId":34214,"journal":{"name":"STAR Protocols","volume":"6 1","pages":"103584"},"PeriodicalIF":1.3000,"publicationDate":"2025-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"STAR Protocols","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/j.xpro.2024.103584","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
Here, we present a protocol to generate dual-target compounds (DT-CPDs) interacting with two distinct target proteins using a transformer-based chemical language model. We describe steps for installing software, preparing data, and pre-training the model on pairs of single-target compounds (ST-CPDs), which bind to an individual protein, and DT-CPDs. We then detail procedures for assembling ST- and corresponding DT-CPD data for specific protein pairs and evaluating the model's performance on hold-out test sets. For complete details on the use and execution of this protocol, please refer to Srinivasan and Bajorath.1.