How the technologies behind self-driving cars, social networks, ChatGPT, and DALL-E2 are changing structural biology

IF 3.2 3区生物学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY BioEssays Pub Date : 2024-10-15 DOI:10.1002/bies.202400155

Matthias Bochtler

{"title":"How the technologies behind self-driving cars, social networks, ChatGPT, and DALL-E2 are changing structural biology","authors":"Matthias Bochtler","doi":"10.1002/bies.202400155","DOIUrl":null,"url":null,"abstract":"<p>The performance of deep Neural Networks (NNs) in the text (ChatGPT) and image (DALL-E2) domains has attracted worldwide attention. Convolutional NNs (CNNs), Large Language Models (LLMs), Denoising Diffusion Probabilistic Models (DDPMs)/Noise Conditional Score Networks (NCSNs), and Graph NNs (GNNs) have impacted computer vision, language editing and translation, automated conversation, image generation, and social network management. Proteins can be viewed as texts written with the alphabet of amino acids, as images, or as graphs of interacting residues. Each of these perspectives suggests the use of tools from a different area of deep learning for protein structural biology. Here, I review how CNNs, LLMs, DDPMs/NCSNs, and GNNs have led to major advances in protein structure prediction, inverse folding, protein design, and small molecule design. This review is primarily intended as a deep learning primer for practicing experimental structural biologists. However, extensive references to the deep learning literature should also make it relevant to readers who have a background in machine learning, physics or statistics, and an interest in protein structural biology.</p>","PeriodicalId":9264,"journal":{"name":"BioEssays","volume":"47 1","pages":""},"PeriodicalIF":3.2000,"publicationDate":"2024-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/bies.202400155","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BioEssays","FirstCategoryId":"99","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/bies.202400155","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

The performance of deep Neural Networks (NNs) in the text (ChatGPT) and image (DALL-E2) domains has attracted worldwide attention. Convolutional NNs (CNNs), Large Language Models (LLMs), Denoising Diffusion Probabilistic Models (DDPMs)/Noise Conditional Score Networks (NCSNs), and Graph NNs (GNNs) have impacted computer vision, language editing and translation, automated conversation, image generation, and social network management. Proteins can be viewed as texts written with the alphabet of amino acids, as images, or as graphs of interacting residues. Each of these perspectives suggests the use of tools from a different area of deep learning for protein structural biology. Here, I review how CNNs, LLMs, DDPMs/NCSNs, and GNNs have led to major advances in protein structure prediction, inverse folding, protein design, and small molecule design. This review is primarily intended as a deep learning primer for practicing experimental structural biologists. However, extensive references to the deep learning literature should also make it relevant to readers who have a background in machine learning, physics or statistics, and an interest in protein structural biology.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

自动驾驶汽车、社交网络、ChatGPT 和 DALL-E2 背后的技术如何改变生物结构。

深度神经网络（NN）在文本（ChatGPT）和图像（DALL-E2）领域的表现引起了全世界的关注。卷积神经网络（CNN）、大型语言模型（LLM）、去噪扩散概率模型（DDPM）/噪声条件得分网络（NCSN）和图神经网络（GNN）已经对计算机视觉、语言编辑和翻译、自动对话、图像生成和社交网络管理产生了影响。蛋白质可被视为用氨基酸字母表书写的文本、图像或相互作用的残基图。这些视角中的每一个都建议将深度学习不同领域的工具用于蛋白质结构生物学。在此，我将回顾 CNN、LLM、DDPM/NCSN 和 GNN 是如何在蛋白质结构预测、反折叠、蛋白质设计和小分子设计方面取得重大进展的。本综述的主要目的是为实验结构生物学家提供深度学习入门指南。不过，对深度学习文献的广泛引用也应该使它与具有机器学习、物理学或统计学背景并对蛋白质结构生物学感兴趣的读者息息相关。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

BioEssays 生物-生化与分子生物学

CiteScore

7.30

自引率

2.50%

发文量

167

审稿时长

4-8 weeks

期刊介绍： molecular – cellular – biomedical – physiology – translational research – systems - hypotheses encouraged BioEssays is a peer-reviewed, review-and-discussion journal. Our aims are to publish novel insights, forward-looking reviews and commentaries in contemporary biology with a molecular, genetic, cellular, or physiological dimension, and serve as a discussion forum for new ideas in these areas. An additional goal is to encourage transdisciplinarity and integrative biology in the context of organismal studies, systems approaches, through to ecosystems, where appropriate.