AiGPro: a multi-tasks model for profiling of GPCRs for agonist and antagonist

IF 5.7 2区 化学 Q1 CHEMISTRY, MULTIDISCIPLINARY Journal of Cheminformatics Pub Date : 2025-01-29 DOI:10.1186/s13321-024-00945-7
Rahul Brahma, Sunghyun Moon, Jae-Min Shin, Kwang-Hwi Cho
{"title":"AiGPro: a multi-tasks model for profiling of GPCRs for agonist and antagonist","authors":"Rahul Brahma,&nbsp;Sunghyun Moon,&nbsp;Jae-Min Shin,&nbsp;Kwang-Hwi Cho","doi":"10.1186/s13321-024-00945-7","DOIUrl":null,"url":null,"abstract":"<p>G protein-coupled receptors (GPCRs) play vital roles in various physiological processes, making them attractive drug discovery targets. Meanwhile, deep learning techniques have revolutionized drug discovery by facilitating efficient tools for expediting the identification and optimization of ligands. However, existing models for the GPCRs often focus on single-target or a small subset of GPCRs or employ binary classification, constraining their applicability for high throughput virtual screening. To address these issues, we introduce AiGPro, a novel multitask model designed to predict small molecule agonists (EC<sub>50</sub>) and antagonists (IC<sub>50</sub>) across the 231 human GPCRs, making it a first-in-class solution for large-scale GPCR profiling.</p><p>Leveraging multi-scale context aggregation and bidirectional multi-head cross-attention mechanisms, our approach demonstrates that ensemble models may not be necessary for predicting complex GPCR states and small molecule interactions. Through extensive validation using stratified tenfold cross-validation, AiGPro achieves robust performance with Pearson's correlation coefficient of 0.91, indicating broad generalizability. This breakthrough sets a new standard in the GPCR studies, outperforming previous studies. Moreover, our first-in-class multi-tasking model can predict agonist and antagonist activities across a wide range of GPCRs, offering a comprehensive perspective on ligand bioactivity within this diverse superfamily. To facilitate easy accessibility, we have deployed a web-based platform for model access at https://aicadd.ssu.ac.kr/AiGPro.</p><p><b>Scientific Contribution </b>We introduce a deep learning-based multi-task model to generalize the agonist and antagonist bioactivity prediction for GPCRs accurately. The model is implemented on a user-friendly web server to facilitate rapid screening of small-molecule libraries, expediting GPCR-targeted drug discovery. Covering a diverse set of 231 GPCR targets, the platform delivers a robust, scalable solution for advancing GPCR-focused therapeutic development.</p><p>The proposed framework incorporates an innovative dual-label prediction strategy, enabling the simultaneous classification of molecules as agonists, antagonists, or both. Each prediction is further accompanied by a confidence score, offering a quantitative measure of activity likelihood. This advancement moves beyond conventional models focusing solely on binding affinity, providing a more comprehensive understanding of ligand-receptor interactions.</p><p>At the core of our model lies the Bi-Directional Multi-Head Cross-Attention (BMCA) module, a novel architecture that captures forward and backward contextual embeddings of protein and ligand features. By leveraging BMCA, the model effectively integrates structural and sequence-level information, ensuring a precise representation of molecular interactions. Results show that this approach is highly accurate in binding affinity predictions and consistent across diverse GPCR families.</p><p>By unifying agonist and antagonist bioactivity prediction into a single model architecture, we bridge a critical gap in GPCR modeling. This enhances prediction accuracy and accelerates virtual screening workflows, offering a valuable and innovative solution for advancing GPCR-targeted drug discovery.</p>","PeriodicalId":617,"journal":{"name":"Journal of Cheminformatics","volume":"17 1","pages":""},"PeriodicalIF":5.7000,"publicationDate":"2025-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://jcheminf.biomedcentral.com/counter/pdf/10.1186/s13321-024-00945-7","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Cheminformatics","FirstCategoryId":"92","ListUrlMain":"https://link.springer.com/article/10.1186/s13321-024-00945-7","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

G protein-coupled receptors (GPCRs) play vital roles in various physiological processes, making them attractive drug discovery targets. Meanwhile, deep learning techniques have revolutionized drug discovery by facilitating efficient tools for expediting the identification and optimization of ligands. However, existing models for the GPCRs often focus on single-target or a small subset of GPCRs or employ binary classification, constraining their applicability for high throughput virtual screening. To address these issues, we introduce AiGPro, a novel multitask model designed to predict small molecule agonists (EC50) and antagonists (IC50) across the 231 human GPCRs, making it a first-in-class solution for large-scale GPCR profiling.

Leveraging multi-scale context aggregation and bidirectional multi-head cross-attention mechanisms, our approach demonstrates that ensemble models may not be necessary for predicting complex GPCR states and small molecule interactions. Through extensive validation using stratified tenfold cross-validation, AiGPro achieves robust performance with Pearson's correlation coefficient of 0.91, indicating broad generalizability. This breakthrough sets a new standard in the GPCR studies, outperforming previous studies. Moreover, our first-in-class multi-tasking model can predict agonist and antagonist activities across a wide range of GPCRs, offering a comprehensive perspective on ligand bioactivity within this diverse superfamily. To facilitate easy accessibility, we have deployed a web-based platform for model access at https://aicadd.ssu.ac.kr/AiGPro.

Scientific Contribution We introduce a deep learning-based multi-task model to generalize the agonist and antagonist bioactivity prediction for GPCRs accurately. The model is implemented on a user-friendly web server to facilitate rapid screening of small-molecule libraries, expediting GPCR-targeted drug discovery. Covering a diverse set of 231 GPCR targets, the platform delivers a robust, scalable solution for advancing GPCR-focused therapeutic development.

The proposed framework incorporates an innovative dual-label prediction strategy, enabling the simultaneous classification of molecules as agonists, antagonists, or both. Each prediction is further accompanied by a confidence score, offering a quantitative measure of activity likelihood. This advancement moves beyond conventional models focusing solely on binding affinity, providing a more comprehensive understanding of ligand-receptor interactions.

At the core of our model lies the Bi-Directional Multi-Head Cross-Attention (BMCA) module, a novel architecture that captures forward and backward contextual embeddings of protein and ligand features. By leveraging BMCA, the model effectively integrates structural and sequence-level information, ensuring a precise representation of molecular interactions. Results show that this approach is highly accurate in binding affinity predictions and consistent across diverse GPCR families.

By unifying agonist and antagonist bioactivity prediction into a single model architecture, we bridge a critical gap in GPCR modeling. This enhances prediction accuracy and accelerates virtual screening workflows, offering a valuable and innovative solution for advancing GPCR-targeted drug discovery.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
AiGPro:一个多任务模型,用于分析gpcr的激动剂和拮抗剂
G蛋白偶联受体(gpcr)在各种生理过程中发挥着重要作用,使其成为有吸引力的药物发现靶点。与此同时,深度学习技术通过促进快速识别和优化配体的有效工具,彻底改变了药物发现。然而,现有的gpcr模型往往侧重于单靶点或一小部分gpcr,或采用二元分类,限制了其对高通量虚拟筛选的适用性。为了解决这些问题,我们引入了AiGPro,这是一种新的多任务模型,旨在预测231种人类GPCR中的小分子激动剂(EC50)和拮抗剂(IC50),使其成为大规模GPCR分析的一流解决方案。利用多尺度上下文聚合和双向多头交叉注意机制,我们的方法表明,集成模型可能不是预测复杂GPCR状态和小分子相互作用所必需的。通过分层十倍交叉验证的广泛验证,AiGPro实现了稳健的性能,Pearson相关系数为0.91,表明了广泛的泛化性。这一突破为GPCR研究树立了新的标准,超越了以往的研究。此外,我们一流的多任务模型可以预测多种gpcr的激动剂和拮抗剂活性,为这个多样化超家族中的配体生物活性提供了一个全面的视角。为了方便访问,我们在https://aicadd.ssu.ac.kr/AiGPro部署了一个基于web的模型访问平台。我们引入了一个基于深度学习的多任务模型来准确地推广gpcr的激动剂和拮抗剂生物活性预测。该模型在一个用户友好的web服务器上实现,以促进小分子文库的快速筛选,加快gpcr靶向药物的发现。该平台涵盖了231个GPCR靶点,为推进以GPCR为重点的治疗开发提供了一个强大的、可扩展的解决方案。提出的框架结合了一种创新的双标签预测策略,可以同时将分子分类为激动剂、拮抗剂或两者。每一个预测都进一步附有一个信心分数,提供活动可能性的定量度量。这一进展超越了仅关注结合亲和力的传统模型,提供了对配体-受体相互作用的更全面的理解。我们模型的核心是双向多头交叉注意(BMCA)模块,这是一种新颖的架构,可以捕获蛋白质和配体特征的向前和向后上下文嵌入。通过利用BMCA,该模型有效地集成了结构和序列级信息,确保了分子相互作用的精确表示。结果表明,这种方法在结合亲和力预测方面非常准确,并且在不同的GPCR家族中是一致的。通过将激动剂和拮抗剂的生物活性预测统一到一个单一的模型结构中,我们弥合了GPCR建模的关键空白。这提高了预测准确性,加速了虚拟筛选工作流程,为推进gpcr靶向药物的发现提供了有价值的创新解决方案。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Journal of Cheminformatics
Journal of Cheminformatics CHEMISTRY, MULTIDISCIPLINARY-COMPUTER SCIENCE, INFORMATION SYSTEMS
CiteScore
14.10
自引率
7.00%
发文量
82
审稿时长
3 months
期刊介绍: Journal of Cheminformatics is an open access journal publishing original peer-reviewed research in all aspects of cheminformatics and molecular modelling. Coverage includes, but is not limited to: chemical information systems, software and databases, and molecular modelling, chemical structure representations and their use in structure, substructure, and similarity searching of chemical substance and chemical reaction databases, computer and molecular graphics, computer-aided molecular design, expert systems, QSAR, and data mining techniques.
期刊最新文献
Proteolysis-targeting Chimera efficacy prediction using a deep-learning-QSP model. Multimodal graph fusion with statistically guided parsimonious descriptor selection for molecular property prediction. Empowering federated learning for robust compound-protein interaction prediction across heterogeneous cross-pharma domains. Analysis of cyclohexane, cyclopentane, and benzene conformations in ligands for PDB X-ray structures using the Hill-Reilly approach. Learnable protein representations in computational biology for predicting drug-target affinity.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1