Before and after AlphaFold2: An overview of protein structure prediction.

IF 3.9 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY Frontiers in bioinformatics Pub Date : 2023-02-28 eCollection Date: 2023-01-01 DOI:10.3389/fbinf.2023.1120370

Letícia M F Bertoline, Angélica N Lima, Jose E Krieger, Samantha K Teixeira

{"title":"Before and after AlphaFold2: An overview of protein structure prediction.","authors":"Letícia M F Bertoline, Angélica N Lima, Jose E Krieger, Samantha K Teixeira","doi":"10.3389/fbinf.2023.1120370","DOIUrl":null,"url":null,"abstract":"<p><p>Three-dimensional protein structure is directly correlated with its function and its determination is critical to understanding biological processes and addressing human health and life science problems in general. Although new protein structures are experimentally obtained over time, there is still a large difference between the number of protein sequences placed in Uniprot and those with resolved tertiary structure. In this context, studies have emerged to predict protein structures by methods based on a template or free modeling. In the last years, different methods have been combined to overcome their individual limitations, until the emergence of AlphaFold2, which demonstrated that predicting protein structure with high accuracy at unprecedented scale is possible. Despite its current impact in the field, AlphaFold2 has limitations. Recently, new methods based on protein language models have promised to revolutionize the protein structural biology allowing the discovery of protein structure and function only from evolutionary patterns present on protein sequence. Even though these methods do not reach AlphaFold2 accuracy, they already covered some of its limitations, being able to predict with high accuracy more than 200 million proteins from metagenomic databases. In this mini-review, we provide an overview of the breakthroughs in protein structure prediction before and after AlphaFold2 emergence.</p>","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"3 ","pages":"1120370"},"PeriodicalIF":3.9000,"publicationDate":"2023-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10011655/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/fbinf.2023.1120370","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Three-dimensional protein structure is directly correlated with its function and its determination is critical to understanding biological processes and addressing human health and life science problems in general. Although new protein structures are experimentally obtained over time, there is still a large difference between the number of protein sequences placed in Uniprot and those with resolved tertiary structure. In this context, studies have emerged to predict protein structures by methods based on a template or free modeling. In the last years, different methods have been combined to overcome their individual limitations, until the emergence of AlphaFold2, which demonstrated that predicting protein structure with high accuracy at unprecedented scale is possible. Despite its current impact in the field, AlphaFold2 has limitations. Recently, new methods based on protein language models have promised to revolutionize the protein structural biology allowing the discovery of protein structure and function only from evolutionary patterns present on protein sequence. Even though these methods do not reach AlphaFold2 accuracy, they already covered some of its limitations, being able to predict with high accuracy more than 200 million proteins from metagenomic databases. In this mini-review, we provide an overview of the breakthroughs in protein structure prediction before and after AlphaFold2 emergence.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

AlphaFold2 前后：蛋白质结构预测概述。

蛋白质的三维结构与其功能直接相关，确定蛋白质的三维结构对于理解生物过程、解决人类健康和生命科学问题至关重要。尽管随着时间的推移不断有新的蛋白质结构通过实验获得，但在 Uniprot 中的蛋白质序列数量与已解析三级结构的蛋白质序列数量之间仍存在很大差距。在这种情况下，出现了通过基于模板或自由建模的方法预测蛋白质结构的研究。在过去几年中，不同的方法被结合起来以克服各自的局限性，直到 AlphaFold2 的出现，它证明了以前所未有的规模高精度预测蛋白质结构是可能的。尽管 AlphaFold2 目前在该领域颇具影响力，但它也有局限性。最近，基于蛋白质语言模型的新方法有望彻底改变蛋白质结构生物学，使人们能够仅从蛋白质序列的进化模式中发现蛋白质的结构和功能。尽管这些方法达不到 AlphaFold2 的精确度，但它们已经克服了 AlphaFold2 的一些局限性，能够从元基因组数据库中高精度预测 2 亿多个蛋白质。在这篇小型综述中，我们将概述 AlphaFold2 出现前后蛋白质结构预测领域取得的突破。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Frontiers in bioinformatics

CiteScore

2.60

自引率

0.00%

发文量