Extension of multi-site analogue series with potent compounds using a bidirectional transformer-based chemical language model

IF 3.597 | Q2 Pharmacology, Toxicology and Pharmaceutics | MedChemComm | Pub Date: 2024-06-17 | DOI: 10.1039/D4MD00423J
Hengwei Chen, Atsushi Yoshimori and Jürgen Bajorath
MedChemComm, volume 7, pages 2527-2537. https://pubs.rsc.org/en/content/articlelanding/2024/md/d4md00423j
Citation count: 0

Abstract

Generating potent compounds for evolving analogue series (AS) is a key challenge in medicinal chemistry. The versatility of chemical language models (CLMs) makes it possible to formulate this challenge as an off-the-beaten-path prediction task. In this work, we have devised a coding and tokenization scheme for evolving AS with multiple substitution sites (multi-site AS) and implemented a bidirectional transformer to predict new potent analogues for such series. Scientific foundations of this approach are discussed and, as a benchmark, the transformer model is compared to a recurrent neural network (RNN) for the prediction of analogues of AS with single substitution sites. Furthermore, the transformer is shown to successfully predict potent analogues with varying R-group combinations for multi-site AS having activity against many different targets. Prediction of R-group combinations for extending AS with potent compounds represents a novel approach for compound optimization.
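
To make the idea of coding a multi-site AS as a token sequence more concrete, the following is a minimal sketch, not the authors' scheme. It assumes compounds are written as a core SMILES with numbered attachment points (for example `[*:1]`) plus one R-group per site. The regular expression, the `<sep>` separator token, and the function names `tokenize_smiles` and `encode_analogue` are hypothetical choices made for illustration only.

```python
import re

# Hypothetical token pattern: bracket atoms (including attachment points such
# as [*:1]), two-letter halogens, two-digit ring closures, then any single
# remaining character.
TOKEN_PATTERN = re.compile(r"\[[^\]]+\]|Br|Cl|%\d{2}|.")

SEP = "<sep>"  # hypothetical separator placed before each R-group


def tokenize_smiles(smiles: str) -> list[str]:
    """Split a SMILES string into tokens using TOKEN_PATTERN."""
    return TOKEN_PATTERN.findall(smiles)


def encode_analogue(core: str, r_groups: dict[int, str]) -> list[str]:
    """Encode a core and its R-groups (ordered by site index) as one token list."""
    tokens = tokenize_smiles(core)
    for site in sorted(r_groups):
        tokens.append(SEP)
        tokens.extend(tokenize_smiles(r_groups[site]))
    return tokens


# Toy two-site example: a benzene core with substituents at sites 1 and 2.
core = "c1cc([*:1])ccc1[*:2]"
analogue = encode_analogue(core, {1: "[*:1]Cl", 2: "[*:2]OC"})
print(analogue)
```

In a sequence of this kind, a model could be asked to fill in the tokens of one or more R-groups given the core and the remaining substituents. How the published model actually encodes series and potency information is described in the full article, not in this sketch.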

Source journal: MedChemComm
Subject category: BIOCHEMISTRY & MOLECULAR BIOLOGY-CHEMISTRY, MEDICINAL
CiteScore: 4.70
Self-citation rate: 0.00%
Number of publications: 0
Review time: 2.2 months
Journal description: Research and review articles in medicinal chemistry and related drug discovery science; the official journal of the European Federation for Medicinal Chemistry. In 2020, MedChemComm will change its name to RSC Medicinal Chemistry. Issue 12, 2019 will be the last issue as MedChemComm.
Latest articles from this journal
Back cover
Introduction to the themed collection in honour of Professor Christian Leumann
Back cover
Correction: computational design, synthesis, and assessment of 3-(4-(4-(1,3,4-oxadiazol-2-yl)-1H-imidazol-2-yl)phenyl)-1,2,4-oxadiazole derivatives as effective epidermal growth factor receptor inhibitors: a prospective strategy for anticancer therapy
Introduction to the themed collection on ‘AI in Medicinal Chemistry’