下载PDF
{"title":"Predicting the location of coordinated metal ion-ligand binding sites using geometry-aware graph neural networks","authors":"Clement Essien , Ning Wang , Yang Yu , Salhuldin Alqarghuli , Yongfang Qin , Negin Manshour , Fei He , Dong Xu","doi":"10.1016/j.csbj.2024.12.016","DOIUrl":null,"url":null,"abstract":"<div><div>More than 50 % of proteins bind to metal ions. Interactions between metal ions and proteins, especially coordinated interactions, are essential for biological functions, such as maintaining protein structure and signal transport. Physiological metal-ion binding prediction is pivotal for both elucidating the biological functions of proteins and for the design of new drugs. However, accurately predicting these interactions remains challenging. In this study, we proposed GPred, a novel structure-based method that transforms the 3-dimensional structure of a protein into a point cloud representation and then designs a geometry-aware graph neural network to learn the local structural properties of each amino acid residue under specific ligand-binding supervision. We trained our model to predict the location of coordinated binding sites for five essential metal ions: Zn<sup>2+</sup>, Ca<sup>2+</sup>, Mg<sup>2+</sup>, Mn<sup>2+</sup>, and Fe<sup>2+</sup>. We further demonstrated the versatility of GPred by applying transfer learning to predict the binding sites of 2 heavy metal ions, that is, cadmium (Cd<sup>2+</sup>) and mercury (Hg<sup>2+</sup>). We achieved greater than 19.62 %, 14.32 %, 36.62 %, and 40.69 % improvement in the area under the precision-recall curve (AUPR) of Zn<sup>2+</sup>, Ca<sup>2+</sup>, Mg<sup>2+</sup>, Mn<sup>2+</sup>, and Fe<sup>2+</sup>, respectively, when compared with 6 current accessible state-of-the-art sequence-based or structure-based tools. We also validated the proposed approach on protein structures predicted by AlphaFold2, and its performance was similar to experimental protein structures. In both cases, achieving a low false discovery rate for proteins without annotated ion-binding sites was demonstrated. © 2017 Elsevier Inc. All rights reserved.</div></div>","PeriodicalId":10715,"journal":{"name":"Computational and structural biotechnology journal","volume":"27 ","pages":"Pages 137-148"},"PeriodicalIF":4.4000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11750443/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational and structural biotechnology journal","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2001037024004410","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
引用
批量引用
Abstract
More than 50 % of proteins bind to metal ions. Interactions between metal ions and proteins, especially coordinated interactions, are essential for biological functions, such as maintaining protein structure and signal transport. Physiological metal-ion binding prediction is pivotal for both elucidating the biological functions of proteins and for the design of new drugs. However, accurately predicting these interactions remains challenging. In this study, we proposed GPred, a novel structure-based method that transforms the 3-dimensional structure of a protein into a point cloud representation and then designs a geometry-aware graph neural network to learn the local structural properties of each amino acid residue under specific ligand-binding supervision. We trained our model to predict the location of coordinated binding sites for five essential metal ions: Zn2+ , Ca2+ , Mg2+ , Mn2+ , and Fe2+ . We further demonstrated the versatility of GPred by applying transfer learning to predict the binding sites of 2 heavy metal ions, that is, cadmium (Cd2+ ) and mercury (Hg2+ ). We achieved greater than 19.62 %, 14.32 %, 36.62 %, and 40.69 % improvement in the area under the precision-recall curve (AUPR) of Zn2+ , Ca2+ , Mg2+ , Mn2+ , and Fe2+ , respectively, when compared with 6 current accessible state-of-the-art sequence-based or structure-based tools. We also validated the proposed approach on protein structures predicted by AlphaFold2, and its performance was similar to experimental protein structures. In both cases, achieving a low false discovery rate for proteins without annotated ion-binding sites was demonstrated. © 2017 Elsevier Inc. All rights reserved.
利用几何感知图形神经网络预测金属离子配体结合位点的位置。
超过50% %的蛋白质与金属离子结合。金属离子与蛋白质之间的相互作用,特别是协同相互作用,对于维持蛋白质结构和信号传递等生物功能至关重要。生理金属离子结合预测对于阐明蛋白质的生物学功能和设计新药至关重要。然而,准确预测这些相互作用仍然具有挑战性。在这项研究中,我们提出了一种新的基于结构的方法GPred,它将蛋白质的三维结构转化为点云表示,然后设计一个几何感知的图神经网络,在特定的配体结合监督下学习每个氨基酸残基的局部结构性质。我们训练我们的模型来预测五种基本金属离子:Zn2+, Ca2+, Mg2+, Mn2+和Fe2+的协调结合位点的位置。通过迁移学习预测镉(Cd2+)和汞(Hg2+)这两种重金属离子的结合位点,进一步证明了GPred的通用性。与目前可获得的6种基于序列或结构的工具相比,我们在Zn2+、Ca2+、Mg2+、Mn2+和Fe2+的精确召回曲线(AUPR)下的面积分别提高了19.62% %、14.32% %、36.62% %和40.69 %。我们还对AlphaFold2预测的蛋白质结构进行了验证,其性能与实验蛋白质结构相似。在这两种情况下,没有注释离子结合位点的蛋白质的错误发现率都很低。©2017 Elsevier Inc.版权所有。
本文章由计算机程序翻译,如有差异,请以英文原文为准。