MassiveFold: unveiling AlphaFold’s hidden potential with optimized and parallelized massive sampling

IF 18.3 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Nature computational science Pub Date : 2024-11-11 DOI:10.1038/s43588-024-00714-4

Nessim Raouraoua, Claudio Mirabello, Thibaut Véry, Christophe Blanchet, Björn Wallner, Marc F. Lensink, Guillaume Brysbaert

引用次数: 0

Abstract

Massive sampling in AlphaFold enables access to increased structural diversity. In combination with its efficient confidence ranking, this unlocks elevated modeling capabilities for monomeric structures and foremost for protein assemblies. However, the approach struggles with GPU cost and data storage. Here we introduce MassiveFold, an optimized and customizable version of AlphaFold that runs predictions in parallel, reducing the computing time from several months to hours. MassiveFold is scalable and able to run on anything from a single computer to a large GPU infrastructure, where it can fully benefit from all the computing nodes. Although AlphaFold is very efficient for protein structure prediction, massive sampling is a very GPU demanding task. MassiveFold overcomes this limitation, being capable of parallelizing structure prediction computation.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

MassiveFold：通过优化和并行化的大规模采样挖掘 AlphaFold 隐藏的潜力。

AlphaFold 中的大规模采样可以提高结构的多样性。结合其高效的置信度排序，这就为单体结构和最重要的蛋白质组装释放了更高的建模能力。然而，这种方法在 GPU 成本和数据存储方面存在困难。在这里，我们介绍了MassiveFold，它是AlphaFold的优化和定制版本，可以并行运行预测，将计算时间从几个月缩短到几个小时。MassiveFold具有可扩展性，可以运行在从单台计算机到大型GPU基础架构的任何地方，从而充分受益于所有计算节点。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Nature computational science

CiteScore

11.70

自引率

0.00%

发文量