Predicting the location of coordinated metal ion-ligand binding sites using geometry-aware graph neural networks

IF 4.4 2区 生物学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY Computational and structural biotechnology journal Pub Date : 2025-01-01 DOI:10.1016/j.csbj.2024.12.016
Clement Essien , Ning Wang , Yang Yu , Salhuldin Alqarghuli , Yongfang Qin , Negin Manshour , Fei He , Dong Xu
{"title":"Predicting the location of coordinated metal ion-ligand binding sites using geometry-aware graph neural networks","authors":"Clement Essien ,&nbsp;Ning Wang ,&nbsp;Yang Yu ,&nbsp;Salhuldin Alqarghuli ,&nbsp;Yongfang Qin ,&nbsp;Negin Manshour ,&nbsp;Fei He ,&nbsp;Dong Xu","doi":"10.1016/j.csbj.2024.12.016","DOIUrl":null,"url":null,"abstract":"<div><div>More than 50 % of proteins bind to metal ions. Interactions between metal ions and proteins, especially coordinated interactions, are essential for biological functions, such as maintaining protein structure and signal transport. Physiological metal-ion binding prediction is pivotal for both elucidating the biological functions of proteins and for the design of new drugs. However, accurately predicting these interactions remains challenging. In this study, we proposed GPred, a novel structure-based method that transforms the 3-dimensional structure of a protein into a point cloud representation and then designs a geometry-aware graph neural network to learn the local structural properties of each amino acid residue under specific ligand-binding supervision. We trained our model to predict the location of coordinated binding sites for five essential metal ions: Zn<sup>2+</sup>, Ca<sup>2+</sup>, Mg<sup>2+</sup>, Mn<sup>2+</sup>, and Fe<sup>2+</sup>. We further demonstrated the versatility of GPred by applying transfer learning to predict the binding sites of 2 heavy metal ions, that is, cadmium (Cd<sup>2+</sup>) and mercury (Hg<sup>2+</sup>). We achieved greater than 19.62 %, 14.32 %, 36.62 %, and 40.69 % improvement in the area under the precision-recall curve (AUPR) of Zn<sup>2+</sup>, Ca<sup>2+</sup>, Mg<sup>2+</sup>, Mn<sup>2+</sup>, and Fe<sup>2+</sup>, respectively, when compared with 6 current accessible state-of-the-art sequence-based or structure-based tools. We also validated the proposed approach on protein structures predicted by AlphaFold2, and its performance was similar to experimental protein structures. In both cases, achieving a low false discovery rate for proteins without annotated ion-binding sites was demonstrated. © 2017 Elsevier Inc. All rights reserved.</div></div>","PeriodicalId":10715,"journal":{"name":"Computational and structural biotechnology journal","volume":"27 ","pages":"Pages 137-148"},"PeriodicalIF":4.4000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11750443/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational and structural biotechnology journal","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2001037024004410","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

More than 50 % of proteins bind to metal ions. Interactions between metal ions and proteins, especially coordinated interactions, are essential for biological functions, such as maintaining protein structure and signal transport. Physiological metal-ion binding prediction is pivotal for both elucidating the biological functions of proteins and for the design of new drugs. However, accurately predicting these interactions remains challenging. In this study, we proposed GPred, a novel structure-based method that transforms the 3-dimensional structure of a protein into a point cloud representation and then designs a geometry-aware graph neural network to learn the local structural properties of each amino acid residue under specific ligand-binding supervision. We trained our model to predict the location of coordinated binding sites for five essential metal ions: Zn2+, Ca2+, Mg2+, Mn2+, and Fe2+. We further demonstrated the versatility of GPred by applying transfer learning to predict the binding sites of 2 heavy metal ions, that is, cadmium (Cd2+) and mercury (Hg2+). We achieved greater than 19.62 %, 14.32 %, 36.62 %, and 40.69 % improvement in the area under the precision-recall curve (AUPR) of Zn2+, Ca2+, Mg2+, Mn2+, and Fe2+, respectively, when compared with 6 current accessible state-of-the-art sequence-based or structure-based tools. We also validated the proposed approach on protein structures predicted by AlphaFold2, and its performance was similar to experimental protein structures. In both cases, achieving a low false discovery rate for proteins without annotated ion-binding sites was demonstrated. © 2017 Elsevier Inc. All rights reserved.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用几何感知图形神经网络预测金属离子配体结合位点的位置。
超过50% %的蛋白质与金属离子结合。金属离子与蛋白质之间的相互作用,特别是协同相互作用,对于维持蛋白质结构和信号传递等生物功能至关重要。生理金属离子结合预测对于阐明蛋白质的生物学功能和设计新药至关重要。然而,准确预测这些相互作用仍然具有挑战性。在这项研究中,我们提出了一种新的基于结构的方法GPred,它将蛋白质的三维结构转化为点云表示,然后设计一个几何感知的图神经网络,在特定的配体结合监督下学习每个氨基酸残基的局部结构性质。我们训练我们的模型来预测五种基本金属离子:Zn2+, Ca2+, Mg2+, Mn2+和Fe2+的协调结合位点的位置。通过迁移学习预测镉(Cd2+)和汞(Hg2+)这两种重金属离子的结合位点,进一步证明了GPred的通用性。与目前可获得的6种基于序列或结构的工具相比,我们在Zn2+、Ca2+、Mg2+、Mn2+和Fe2+的精确召回曲线(AUPR)下的面积分别提高了19.62% %、14.32% %、36.62% %和40.69 %。我们还对AlphaFold2预测的蛋白质结构进行了验证,其性能与实验蛋白质结构相似。在这两种情况下,没有注释离子结合位点的蛋白质的错误发现率都很低。©2017 Elsevier Inc.版权所有。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Computational and structural biotechnology journal
Computational and structural biotechnology journal Biochemistry, Genetics and Molecular Biology-Biophysics
CiteScore
9.30
自引率
3.30%
发文量
540
审稿时长
6 weeks
期刊介绍: Computational and Structural Biotechnology Journal (CSBJ) is an online gold open access journal publishing research articles and reviews after full peer review. All articles are published, without barriers to access, immediately upon acceptance. The journal places a strong emphasis on functional and mechanistic understanding of how molecular components in a biological process work together through the application of computational methods. Structural data may provide such insights, but they are not a pre-requisite for publication in the journal. Specific areas of interest include, but are not limited to: Structure and function of proteins, nucleic acids and other macromolecules Structure and function of multi-component complexes Protein folding, processing and degradation Enzymology Computational and structural studies of plant systems Microbial Informatics Genomics Proteomics Metabolomics Algorithms and Hypothesis in Bioinformatics Mathematical and Theoretical Biology Computational Chemistry and Drug Discovery Microscopy and Molecular Imaging Nanotechnology Systems and Synthetic Biology
期刊最新文献
An ecological and stochastic perspective on persisters resuscitation CTGCT, Centre of Excellence for the Technologies of Gene and Cell Therapy: Collaborative translation of scientific discoveries into advanced treatments for neurological rare genetic diseases and cancer Identification of cellular signatures associated with chinese hamster ovary cell adaptation for secretion of antibodies Intelligent biology and medicine: Accelerating innovative computational approaches Withdrawal notice to “Corrigendum to: 'Cryo-EM reveals architectural diversity in active Rotavirus particles'. Comput. Struct. Biotechnol. J. 2019 Jul 31; 17:1178-1183” [Comput. Struct. Biotechnol. J. 23 (2024) 3702]
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1