GDFold2: A fast and parallelizable protein folding environment with freely defined objective functions.

IF 5.2 3区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Protein Science Pub Date : 2025-02-01 DOI:10.1002/pro.70041
Tianyu Mi, Nan Xiao, Haipeng Gong
{"title":"GDFold2: A fast and parallelizable protein folding environment with freely defined objective functions.","authors":"Tianyu Mi, Nan Xiao, Haipeng Gong","doi":"10.1002/pro.70041","DOIUrl":null,"url":null,"abstract":"<p><p>An important step of mainstream protein structure prediction is to model the 3D protein structure based on the predicted 2D inter-residue geometric information. This folding step has been integrated into a unified neural network to allow end-to-end training in state-of-the-art methods like AlphaFold2, but is separately implemented using the Rosetta folding environment in some traditional methods like trRosetta. Despite the inferiority in prediction accuracy, the conventional approach allows for the sampling of various protein conformations compatible with the predicted geometric constraints, partially capturing the dynamic information. Here, we propose GDFold2, a novel protein folding environment, to address the limitations of Rosetta. On the one hand, GDFold2 is highly computationally efficient, capable of accomplishing multiple folding processes in parallel within the time scale of minutes for generic proteins. On the other hand, GDFold2 supports freely defined objective functions to fulfill diversified optimization requirements. Moreover, we propose a quality assessment (QA) model to provide reliable prediction on the quality of protein structures folded by GDFold2, thus substantially simplifying the selection of structural models. GDFold2 and the QA model could be combined to investigate the transition path between protein conformational states, and the online server is available at https://structpred.life.tsinghua.edu.cn/server_gdfold2.html.</p>","PeriodicalId":20761,"journal":{"name":"Protein Science","volume":"34 2","pages":"e70041"},"PeriodicalIF":5.2000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11773392/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Protein Science","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1002/pro.70041","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

An important step of mainstream protein structure prediction is to model the 3D protein structure based on the predicted 2D inter-residue geometric information. This folding step has been integrated into a unified neural network to allow end-to-end training in state-of-the-art methods like AlphaFold2, but is separately implemented using the Rosetta folding environment in some traditional methods like trRosetta. Despite the inferiority in prediction accuracy, the conventional approach allows for the sampling of various protein conformations compatible with the predicted geometric constraints, partially capturing the dynamic information. Here, we propose GDFold2, a novel protein folding environment, to address the limitations of Rosetta. On the one hand, GDFold2 is highly computationally efficient, capable of accomplishing multiple folding processes in parallel within the time scale of minutes for generic proteins. On the other hand, GDFold2 supports freely defined objective functions to fulfill diversified optimization requirements. Moreover, we propose a quality assessment (QA) model to provide reliable prediction on the quality of protein structures folded by GDFold2, thus substantially simplifying the selection of structural models. GDFold2 and the QA model could be combined to investigate the transition path between protein conformational states, and the online server is available at https://structpred.life.tsinghua.edu.cn/server_gdfold2.html.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
GDFold2:一个快速和可并行的蛋白质折叠环境,具有自由定义的目标函数。
主流蛋白质结构预测的一个重要步骤是基于预测的二维残基间几何信息对蛋白质的三维结构进行建模。这个折叠步骤已经集成到一个统一的神经网络中,允许在最先进的方法(如AlphaFold2)中进行端到端训练,但在一些传统方法(如trRosetta)中使用Rosetta折叠环境单独实现。尽管预测精度较低,但传统方法允许对与预测几何约束兼容的各种蛋白质构象进行采样,部分捕获动态信息。在这里,我们提出了一种新的蛋白质折叠环境GDFold2,以解决Rosetta的局限性。一方面,GDFold2具有很高的计算效率,能够在几分钟的时间尺度内并行完成通用蛋白的多个折叠过程。另一方面,GDFold2支持自由定义的目标函数,以满足多样化的优化需求。此外,我们提出了一个质量评估(QA)模型,为GDFold2折叠的蛋白质结构的质量提供可靠的预测,从而大大简化了结构模型的选择。GDFold2和QA模型可以结合起来研究蛋白质构象状态之间的过渡路径,在线服务器可在https://structpred.life.tsinghua.edu.cn/server_gdfold2.html上获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Protein Science
Protein Science 生物-生化与分子生物学
CiteScore
12.40
自引率
1.20%
发文量
246
审稿时长
1 months
期刊介绍: Protein Science, the flagship journal of The Protein Society, is a publication that focuses on advancing fundamental knowledge in the field of protein molecules. The journal welcomes original reports and review articles that contribute to our understanding of protein function, structure, folding, design, and evolution. Additionally, Protein Science encourages papers that explore the applications of protein science in various areas such as therapeutics, protein-based biomaterials, bionanotechnology, synthetic biology, and bioelectronics. The journal accepts manuscript submissions in any suitable format for review, with the requirement of converting the manuscript to journal-style format only upon acceptance for publication. Protein Science is indexed and abstracted in numerous databases, including the Agricultural & Environmental Science Database (ProQuest), Biological Science Database (ProQuest), CAS: Chemical Abstracts Service (ACS), Embase (Elsevier), Health & Medical Collection (ProQuest), Health Research Premium Collection (ProQuest), Materials Science & Engineering Database (ProQuest), MEDLINE/PubMed (NLM), Natural Science Collection (ProQuest), and SciTech Premium Collection (ProQuest).
期刊最新文献
Absolute quantification of fluorescent protein fusions by mass spectrometry. Hydrophobic interactions of a substrate with L193 on the flexible substrate-binding loop contribute to 3α-hydroxysteroid dehydrogenase/carbonyl reductase catalytic efficiency. Positional scanning and computational modeling reveal determinants of legumain transpeptidase activity. Phosphate- and pH-dependent self-assembly of recombinant spider silk proteins. Multiple disulfide-bonded states confer extensive conformational diversity in fibrinogen.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1