TCRcost:利用 TCR 三维结构增强 TCR 肽结合的深度学习模型。

IF 2.8 3区 生物学 Q2 GENETICS & HEREDITY Frontiers in Genetics Pub Date : 2024-10-02 eCollection Date: 2024-01-01 DOI:10.3389/fgene.2024.1346784
Fan Li, Xinyang Qian, Xiaoyan Zhu, Xin Lai, Xuanping Zhang, Jiayin Wang
{"title":"TCRcost:利用 TCR 三维结构增强 TCR 肽结合的深度学习模型。","authors":"Fan Li, Xinyang Qian, Xiaoyan Zhu, Xin Lai, Xuanping Zhang, Jiayin Wang","doi":"10.3389/fgene.2024.1346784","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Predicting TCR-peptide binding is a complex and significant computational problem in systems immunology. During the past decade, a series of computational methods have been developed for better predicting TCR-peptide binding from amino acid sequences. However, the performance of sequence-based methods appears to have hit a bottleneck. Considering the 3D structures of TCR-peptide complexes, which provide much more information, could potentially lead to better prediction outcomes.</p><p><strong>Methods: </strong>In this study, we developed TCRcost, a deep learning method, to predict TCR-peptide binding by incorporating 3D structures. TCRcost overcomes two significant challenges: acquiring a sufficient number of high-quality TCR-peptide structures and effectively extracting information from these structures for binding prediction. TCRcost corrects TCR 3D structures generated by protein structure tools, significantly extending the available datasets. The main and side chains of a TCR structure are separately corrected using a long short-term memory (LSTM) model. This approach prevents interference between the chains and accurately extracts interactions among both adjacent and global atoms. A 3D convolutional neural network (CNN) is designed to extract the atomic features relevant to TCR-peptide binding. The spatial features extracted by the 3DCNN are then processed through a fully connected layer to estimate the probability of TCR-peptide binding.</p><p><strong>Results: </strong>Test results demonstrated that predicting TCR-peptide binding from 3D TCR structures is both efficient and highly accurate with an average accuracy of 0.974 on precise structures. Furthermore, the average accuracy on corrected structures was 0.762, significantly higher than the average accuracy of 0.375 on uncorrected original structures. Additionally, the average root mean square distance (RMSD) to precise structures was significantly reduced from 12.753 Å for predicted structures to 8.785 Å for corrected structures.</p><p><strong>Discussion: </strong>Thus, utilizing structural information of TCR-peptide complexes is a promising approach to improve the accuracy of binding predictions.</p>","PeriodicalId":12750,"journal":{"name":"Frontiers in Genetics","volume":null,"pages":null},"PeriodicalIF":2.8000,"publicationDate":"2024-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11479912/pdf/","citationCount":"0","resultStr":"{\"title\":\"TCRcost: a deep learning model utilizing TCR 3D structure for enhanced of TCR-peptide binding.\",\"authors\":\"Fan Li, Xinyang Qian, Xiaoyan Zhu, Xin Lai, Xuanping Zhang, Jiayin Wang\",\"doi\":\"10.3389/fgene.2024.1346784\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Introduction: </strong>Predicting TCR-peptide binding is a complex and significant computational problem in systems immunology. During the past decade, a series of computational methods have been developed for better predicting TCR-peptide binding from amino acid sequences. However, the performance of sequence-based methods appears to have hit a bottleneck. Considering the 3D structures of TCR-peptide complexes, which provide much more information, could potentially lead to better prediction outcomes.</p><p><strong>Methods: </strong>In this study, we developed TCRcost, a deep learning method, to predict TCR-peptide binding by incorporating 3D structures. TCRcost overcomes two significant challenges: acquiring a sufficient number of high-quality TCR-peptide structures and effectively extracting information from these structures for binding prediction. TCRcost corrects TCR 3D structures generated by protein structure tools, significantly extending the available datasets. The main and side chains of a TCR structure are separately corrected using a long short-term memory (LSTM) model. This approach prevents interference between the chains and accurately extracts interactions among both adjacent and global atoms. A 3D convolutional neural network (CNN) is designed to extract the atomic features relevant to TCR-peptide binding. The spatial features extracted by the 3DCNN are then processed through a fully connected layer to estimate the probability of TCR-peptide binding.</p><p><strong>Results: </strong>Test results demonstrated that predicting TCR-peptide binding from 3D TCR structures is both efficient and highly accurate with an average accuracy of 0.974 on precise structures. Furthermore, the average accuracy on corrected structures was 0.762, significantly higher than the average accuracy of 0.375 on uncorrected original structures. Additionally, the average root mean square distance (RMSD) to precise structures was significantly reduced from 12.753 Å for predicted structures to 8.785 Å for corrected structures.</p><p><strong>Discussion: </strong>Thus, utilizing structural information of TCR-peptide complexes is a promising approach to improve the accuracy of binding predictions.</p>\",\"PeriodicalId\":12750,\"journal\":{\"name\":\"Frontiers in Genetics\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":2.8000,\"publicationDate\":\"2024-10-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11479912/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Frontiers in Genetics\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.3389/fgene.2024.1346784\",\"RegionNum\":3,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"GENETICS & HEREDITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Genetics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.3389/fgene.2024.1346784","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

摘要

导言:预测 TCR 与肽的结合是系统免疫学中一个复杂而重要的计算问题。过去十年间,人们开发了一系列计算方法,以便更好地从氨基酸序列预测 TCR 肽结合。然而,基于序列的方法的性能似乎遇到了瓶颈。考虑到 TCR-多肽复合物的三维结构能提供更多信息,有可能带来更好的预测结果:在这项研究中,我们开发了一种深度学习方法 TCRcost,通过结合三维结构来预测 TCR 与多肽的结合。TCRcost克服了两个重大挑战:获取足够数量的高质量TCR-多肽结构,以及有效提取这些结构中的信息用于结合预测。TCRcost 修正了蛋白质结构工具生成的 TCR 三维结构,大大扩展了可用数据集。使用长短期记忆(LSTM)模型分别校正 TCR 结构的主链和侧链。这种方法可以防止链之间的干扰,并准确提取相邻原子和全局原子之间的相互作用。设计了一个三维卷积神经网络(CNN)来提取与 TCR 肽结合相关的原子特征。然后通过全连接层处理 3DCNN 提取的空间特征,以估计 TCR 肽结合的概率:测试结果表明,从三维 TCR 结构预测 TCR 肽结合既高效又准确,精确结构的平均准确率为 0.974。此外,校正结构的平均准确率为 0.762,明显高于未校正原始结构的平均准确率 0.375。此外,精确结构的平均均方根距离(RMSD)也从预测结构的 12.753 Å 显著减少到校正结构的 8.785 Å:因此,利用 TCR 肽复合物的结构信息是提高结合预测准确性的一种可行方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
TCRcost: a deep learning model utilizing TCR 3D structure for enhanced of TCR-peptide binding.

Introduction: Predicting TCR-peptide binding is a complex and significant computational problem in systems immunology. During the past decade, a series of computational methods have been developed for better predicting TCR-peptide binding from amino acid sequences. However, the performance of sequence-based methods appears to have hit a bottleneck. Considering the 3D structures of TCR-peptide complexes, which provide much more information, could potentially lead to better prediction outcomes.

Methods: In this study, we developed TCRcost, a deep learning method, to predict TCR-peptide binding by incorporating 3D structures. TCRcost overcomes two significant challenges: acquiring a sufficient number of high-quality TCR-peptide structures and effectively extracting information from these structures for binding prediction. TCRcost corrects TCR 3D structures generated by protein structure tools, significantly extending the available datasets. The main and side chains of a TCR structure are separately corrected using a long short-term memory (LSTM) model. This approach prevents interference between the chains and accurately extracts interactions among both adjacent and global atoms. A 3D convolutional neural network (CNN) is designed to extract the atomic features relevant to TCR-peptide binding. The spatial features extracted by the 3DCNN are then processed through a fully connected layer to estimate the probability of TCR-peptide binding.

Results: Test results demonstrated that predicting TCR-peptide binding from 3D TCR structures is both efficient and highly accurate with an average accuracy of 0.974 on precise structures. Furthermore, the average accuracy on corrected structures was 0.762, significantly higher than the average accuracy of 0.375 on uncorrected original structures. Additionally, the average root mean square distance (RMSD) to precise structures was significantly reduced from 12.753 Å for predicted structures to 8.785 Å for corrected structures.

Discussion: Thus, utilizing structural information of TCR-peptide complexes is a promising approach to improve the accuracy of binding predictions.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Frontiers in Genetics
Frontiers in Genetics Biochemistry, Genetics and Molecular Biology-Molecular Medicine
CiteScore
5.50
自引率
8.10%
发文量
3491
审稿时长
14 weeks
期刊介绍: Frontiers in Genetics publishes rigorously peer-reviewed research on genes and genomes relating to all the domains of life, from humans to plants to livestock and other model organisms. Led by an outstanding Editorial Board of the world’s leading experts, this multidisciplinary, open-access journal is at the forefront of communicating cutting-edge research to researchers, academics, clinicians, policy makers and the public. The study of inheritance and the impact of the genome on various biological processes is well documented. However, the majority of discoveries are still to come. A new era is seeing major developments in the function and variability of the genome, the use of genetic and genomic tools and the analysis of the genetic basis of various biological phenomena.
期刊最新文献
Identification of m5C-Related gene diagnostic biomarkers for sepsis: a machine learning study. Number of human protein interactions correlates with structural, but not regulatory conservation of the respective genes. Comparison of blood parameters in two genetically different groups of horses for functional longevity in show jumping. Editorial: Non-coding RNAs and human diseases volume 2 -long non-coding RNAs and pathogenesis of human disease. Editorial: Epigenetic modification in neurological diseases.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1