Nature Machine Intelligence最新文献_第9页

Efficient generation of protein pockets with PocketGen 利用 PocketGen 高效生成蛋白质口袋

IF 18.8 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Nature Machine Intelligence

Pub Date : 2024-11-15 DOI: 10.1038/s42256-024-00920-9

Zaixi Zhang, Wan Xiang Shen, Qi Liu, Marinka Zitnik

Designing protein-binding proteins is critical for drug discovery. However, artificial-intelligence-based design of such proteins is challenging due to the complexity of protein–ligand interactions, the flexibility of ligand molecules and amino acid side chains, and sequence–structure dependencies. We introduce PocketGen, a deep generative model that produces residue sequence and atomic structure of the protein regions in which ligand interactions occur. PocketGen promotes consistency between protein sequence and structure by using a graph transformer for structural encoding and a sequence refinement module based on a protein language model. The graph transformer captures interactions at multiple scales, including atom, residue and ligand levels. For sequence refinement, PocketGen integrates a structural adapter into the protein language model, ensuring that structure-based predictions align with sequence-based predictions. PocketGen can generate high-fidelity protein pockets with enhanced binding affinity and structural validity. It operates ten times faster than physics-based methods and achieves a 97% success rate, defined as the percentage of generated pockets with higher binding affinity than reference pockets. Additionally, it attains an amino acid recovery rate exceeding 63%. A generative model that leverages a graph transformer and protein language model to generate residue sequences and full-atom structures of protein pockets is introduced, which outperforms state-of-the-art approaches.

设计蛋白质结合蛋白对药物发现至关重要。然而，由于蛋白质-配体相互作用的复杂性、配体分子和氨基酸侧链的灵活性以及序列-结构的依赖性，基于人工智能的此类蛋白质设计极具挑战性。我们介绍的 PocketGen 是一种深度生成模型，它能生成发生配体相互作用的蛋白质区域的残基序列和原子结构。PocketGen 通过使用结构编码的图转换器和基于蛋白质语言模型的序列细化模块，促进蛋白质序列和结构之间的一致性。图转换器捕捉多个尺度的相互作用，包括原子、残基和配体水平。在序列细化方面，PocketGen 将结构适配器集成到蛋白质语言模型中，确保基于结构的预测与基于序列的预测相一致。PocketGen 可以生成高保真蛋白质口袋，增强结合亲和力和结构有效性。它的运行速度比基于物理的方法快十倍，成功率高达 97%，成功率的定义是生成的口袋的结合亲和力高于参考口袋的百分比。此外，它的氨基酸回收率超过 63%。

{"title":"Efficient generation of protein pockets with PocketGen","authors":"Zaixi Zhang, Wan Xiang Shen, Qi Liu, Marinka Zitnik","doi":"10.1038/s42256-024-00920-9","DOIUrl":"10.1038/s42256-024-00920-9","url":null,"abstract":"Designing protein-binding proteins is critical for drug discovery. However, artificial-intelligence-based design of such proteins is challenging due to the complexity of protein–ligand interactions, the flexibility of ligand molecules and amino acid side chains, and sequence–structure dependencies. We introduce PocketGen, a deep generative model that produces residue sequence and atomic structure of the protein regions in which ligand interactions occur. PocketGen promotes consistency between protein sequence and structure by using a graph transformer for structural encoding and a sequence refinement module based on a protein language model. The graph transformer captures interactions at multiple scales, including atom, residue and ligand levels. For sequence refinement, PocketGen integrates a structural adapter into the protein language model, ensuring that structure-based predictions align with sequence-based predictions. PocketGen can generate high-fidelity protein pockets with enhanced binding affinity and structural validity. It operates ten times faster than physics-based methods and achieves a 97% success rate, defined as the percentage of generated pockets with higher binding affinity than reference pockets. Additionally, it attains an amino acid recovery rate exceeding 63%. A generative model that leverages a graph transformer and protein language model to generate residue sequences and full-atom structures of protein pockets is introduced, which outperforms state-of-the-art approaches.","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"6 11","pages":"1382-1395"},"PeriodicalIF":18.8,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.nature.com/articles/s42256-024-00920-9.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142637798","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Fast and generalizable micromagnetic simulation with deep neural nets 利用深度神经网络进行快速、通用的微磁模拟

IF 18.8 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Nature Machine Intelligence

Pub Date : 2024-11-14 DOI: 10.1038/s42256-024-00914-7

Yunqi Cai, Jiangnan Li, Dong Wang

Important progress has been made in micromagnetics, driven by its wide-ranging applications in magnetic storage design. Numerical simulation, a cornerstone of micromagnetics research, relies on first-principles rules to compute the dynamic evolution of micromagnetic systems using the renowned Landau–Lifshitz–Gilbert equation, named after Landau, Lifshitz and Gilbert. However, these simulations are often hindered by their slow speeds. Although fast Fourier transformation calculations reduce the computational complexity to O(Nlog(N)), it remains impractical for large-scale simulations. Here we introduce NeuralMAG, a deep learning approach to micromagnetic simulation. Our approach follows the Landau–Lifshitz–Gilbert iterative framework but accelerates computation of demagnetizing fields by employing a U-shaped neural network. This neural network architecture comprises an encoder that extracts aggregated spins at various scales and learns the local interaction at each scale, followed by a decoder that accumulates the local interactions at different scales to approximate the global convolution. This divide-and-accumulate scheme achieves a time complexity of O(N), notably enhancing the speed and feasibility of large-scale simulations. Unlike existing neural methods, NeuralMAG concentrates on the core computation—rather than an end-to-end approximation for a specific task—making it inherently generalizable. To validate the new approach, we trained a single model and evaluated it on two micromagnetics tasks with various sample sizes, shapes and material settings. Many physical systems involve long-range interactions, which present a considerable obstacle to large-scale simulations. Cai, Li and Wang introduce NeuralMAG, a deep learning approach to reduce complexity and accelerate micromagnetic simulations.

在磁存储设计的广泛应用推动下，微磁学取得了重要进展。数值模拟是微磁学研究的基石，它依赖于第一原理规则，利用以 Landau、Lifshitz 和 Gilbert 命名的著名的 Landau-Lifshitz-Gilbert 方程计算微磁系统的动态演化。然而，这些模拟往往因速度慢而受阻。虽然快速傅立叶变换计算能将计算复杂度降低到 O(Nlog(N))，但对于大规模仿真来说仍然不切实际。在此，我们介绍一种用于微磁模拟的深度学习方法--NeuralMAG。我们的方法遵循 Landau-Lifshitz-Gilbert 迭代框架，但通过采用 U 型神经网络来加速消磁场的计算。这种神经网络架构由一个编码器和一个解码器组成，编码器负责提取不同尺度的聚合自旋，并学习每个尺度的局部相互作用，解码器则负责累积不同尺度的局部相互作用，以近似全局卷积。这种 "分割-累积 "方案的时间复杂度为 O(N)，显著提高了大规模模拟的速度和可行性。与现有的神经方法不同，NeuralMAG 专注于核心计算，而不是针对特定任务的端到端近似，因此具有内在的通用性。为了验证这种新方法，我们训练了一个单一模型，并在两个具有不同样本大小、形状和材料设置的微观磁学任务中对其进行了评估。

{"title":"Fast and generalizable micromagnetic simulation with deep neural nets","authors":"Yunqi Cai, Jiangnan Li, Dong Wang","doi":"10.1038/s42256-024-00914-7","DOIUrl":"10.1038/s42256-024-00914-7","url":null,"abstract":"Important progress has been made in micromagnetics, driven by its wide-ranging applications in magnetic storage design. Numerical simulation, a cornerstone of micromagnetics research, relies on first-principles rules to compute the dynamic evolution of micromagnetic systems using the renowned Landau–Lifshitz–Gilbert equation, named after Landau, Lifshitz and Gilbert. However, these simulations are often hindered by their slow speeds. Although fast Fourier transformation calculations reduce the computational complexity to O(Nlog(N)), it remains impractical for large-scale simulations. Here we introduce NeuralMAG, a deep learning approach to micromagnetic simulation. Our approach follows the Landau–Lifshitz–Gilbert iterative framework but accelerates computation of demagnetizing fields by employing a U-shaped neural network. This neural network architecture comprises an encoder that extracts aggregated spins at various scales and learns the local interaction at each scale, followed by a decoder that accumulates the local interactions at different scales to approximate the global convolution. This divide-and-accumulate scheme achieves a time complexity of O(N), notably enhancing the speed and feasibility of large-scale simulations. Unlike existing neural methods, NeuralMAG concentrates on the core computation—rather than an end-to-end approximation for a specific task—making it inherently generalizable. To validate the new approach, we trained a single model and evaluated it on two micromagnetics tasks with various sample sizes, shapes and material settings. Many physical systems involve long-range interactions, which present a considerable obstacle to large-scale simulations. Cai, Li and Wang introduce NeuralMAG, a deep learning approach to reduce complexity and accelerate micromagnetic simulations.","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"6 11","pages":"1330-1343"},"PeriodicalIF":18.8,"publicationDate":"2024-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142610316","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Guidelines for ethical use and acknowledgement of large language models in academic writing 在学术写作中合乎道德地使用和认可大型语言模型的准则

IF 18.8 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Nature Machine Intelligence

Pub Date : 2024-11-13 DOI: 10.1038/s42256-024-00922-7

Sebastian Porsdam Mann, Anuraag A. Vazirani, Mateo Aboy, Brian D. Earp, Timo Minssen, I. Glenn Cohen, Julian Savulescu

In this Comment, we propose a cumulative set of three essential criteria for the ethical use of LLMs in academic writing, and present a statement that researchers can quote when submitting LLM-assisted manuscripts in order to testify to their adherence to them.

在这篇评论中，我们为在学术写作中合乎伦理地使用法学硕士提出了一套累积性的三项基本标准，并提交了一份声明，供研究人员在提交法学硕士辅助稿件时引用，以证明他们遵守了这些标准。

引用次数: 0

Foundation models in healthcare require rethinking reliability 医疗保健领域的基础模式需要重新思考可靠性问题

IF 18.8 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Nature Machine Intelligence

Pub Date : 2024-11-11 DOI: 10.1038/s42256-024-00924-5

Thomas Grote, Timo Freiesleben, Philipp Berens

A new class of AI models, called foundation models, has entered healthcare. Foundation models violate several basic principles of the standard machine learning paradigm for assessing reliability, making it necessary to rethink what guarantees are required to establish warranted trust in them.

一种被称为基础模型的新型人工智能模型已进入医疗保健领域。基础模型违反了用于评估可靠性的标准机器学习范式的几项基本原则，因此有必要重新思考需要哪些保证才能建立对基础模型的充分信任。

引用次数: 0

Reusability report: exploring the utility of variational graph encoders for predicting molecular toxicity in drug design 可重用性报告：探索变异图编码器在药物设计中预测分子毒性的实用性

IF 18.8 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Nature Machine Intelligence

Pub Date : 2024-11-08 DOI: 10.1038/s42256-024-00923-6

Ruijiang Li, Jiang Lu, Ziyi Liu, Duoyun Yi, Mengxuan Wan, Yixin Zhang, Peng Zan, Song He, Xiaochen Bo

Variational graph encoders effectively combine graph convolutional networks with variational autoencoders, and have been widely employed for biomedical graph-structured data. Lam and colleagues developed a framework based on the variational graph encoder, NYAN, to facilitate the prediction of molecular properties in computer-assisted drug design. In NYAN, the low-dimensional latent variables derived from the variational graph autoencoder are leveraged as a universal molecular representation, yielding remarkable performance and versatility throughout the drug discovery process. In this study we assess the reusability of NYAN and investigate its applicability within the context of specific chemical toxicity prediction. The prediction accuracy—based on NYAN latent representations and other popular molecular feature representations—is benchmarked across a broad spectrum of toxicity datasets, and the adaptation of NYAN latent representation to other surrogate models is also explored. NYAN, equipped with common surrogate models, shows competitive or better performance in toxicity prediction compared with other state-of-the-art molecular property prediction methods. We also devise a multi-task learning strategy with feature enhancement and consensus inference by leveraging the low dimensionality and feature diversity of NYAN latent space, further boosting the multi-endpoint acute toxicity estimation. The analysis delves into the adaptability of the generic graph variational model, showcasing its aptitude for tailored tasks within the realm of drug discovery. Ruijiang Li et al. assess the reproducibility of a variational graph encoder-based framework and examines its reusability for chemical toxicity prediction. It explores how a generalist model can function as a specialist model with adaptation.

变异图编码器有效地结合了图卷积网络和变异自编码器，已被广泛用于生物医学图结构数据。Lam 及其同事开发了一个基于变异图编码器的框架 NYAN，以促进计算机辅助药物设计中的分子特性预测。在 NYAN 中，从变异图自动编码器中得到的低维潜在变量被用作通用的分子表示，在整个药物发现过程中产生了显著的性能和多功能性。在本研究中，我们评估了 NYAN 的可重用性，并研究了其在特定化学毒性预测中的适用性。基于 NYAN 潜在表征和其他常用分子特征表征的预测准确性在广泛的毒性数据集中进行了基准测试，同时还探讨了 NYAN 潜在表征对其他代用模型的适应性。与其他最先进的分子特性预测方法相比，配备了常用代用模型的 NYAN 在毒性预测方面具有竞争力或更好的性能。我们还利用 NYAN 潜在空间的低维度和特征多样性，设计了一种具有特征增强和共识推断功能的多任务学习策略，进一步提高了多端点急性毒性预测的能力。分析深入探讨了通用图变分法模型的适应性，展示了它在药物发现领域中执行定制任务的能力。

{"title":"Reusability report: exploring the utility of variational graph encoders for predicting molecular toxicity in drug design","authors":"Ruijiang Li, Jiang Lu, Ziyi Liu, Duoyun Yi, Mengxuan Wan, Yixin Zhang, Peng Zan, Song He, Xiaochen Bo","doi":"10.1038/s42256-024-00923-6","DOIUrl":"10.1038/s42256-024-00923-6","url":null,"abstract":"Variational graph encoders effectively combine graph convolutional networks with variational autoencoders, and have been widely employed for biomedical graph-structured data. Lam and colleagues developed a framework based on the variational graph encoder, NYAN, to facilitate the prediction of molecular properties in computer-assisted drug design. In NYAN, the low-dimensional latent variables derived from the variational graph autoencoder are leveraged as a universal molecular representation, yielding remarkable performance and versatility throughout the drug discovery process. In this study we assess the reusability of NYAN and investigate its applicability within the context of specific chemical toxicity prediction. The prediction accuracy—based on NYAN latent representations and other popular molecular feature representations—is benchmarked across a broad spectrum of toxicity datasets, and the adaptation of NYAN latent representation to other surrogate models is also explored. NYAN, equipped with common surrogate models, shows competitive or better performance in toxicity prediction compared with other state-of-the-art molecular property prediction methods. We also devise a multi-task learning strategy with feature enhancement and consensus inference by leveraging the low dimensionality and feature diversity of NYAN latent space, further boosting the multi-endpoint acute toxicity estimation. The analysis delves into the adaptability of the generic graph variational model, showcasing its aptitude for tailored tasks within the realm of drug discovery. Ruijiang Li et al. assess the reproducibility of a variational graph encoder-based framework and examines its reusability for chemical toxicity prediction. It explores how a generalist model can function as a specialist model with adaptation.","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"6 12","pages":"1457-1466"},"PeriodicalIF":18.8,"publicationDate":"2024-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.nature.com/articles/s42256-024-00923-6.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142596493","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

General-purpose foundation models for increased autonomy in robot-assisted surgery 提高机器人辅助手术自主性的通用基础模型

IF 18.8 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Nature Machine Intelligence

Pub Date : 2024-11-01 DOI: 10.1038/s42256-024-00917-4

Samuel Schmidgall, Ji Woong Kim, Alan Kuntz, Ahmed Ezzat Ghazi, Axel Krieger

The dominant paradigm for end-to-end robot learning focuses on optimizing task-specific objectives that solve a single robotic problem such as picking up an object or reaching a target position. However, recent work on high-capacity models in robotics has shown promise towards being trained on large collections of diverse and task-agnostic datasets of video demonstrations. These models have shown impressive levels of generalization to unseen circumstances, especially as the amount of data and the model complexity scale. Surgical robot systems that learn from data have struggled to advance as quickly as other fields of robot learning for a few reasons: there is a lack of existing large-scale open-source data to train models; it is challenging to model the soft-body deformations that these robots work with during surgery because simulation cannot match the physical and visual complexity of biological tissue; and surgical robots risk harming patients when tested in clinical trials and require more extensive safety measures. This Perspective aims to provide a path towards increasing robot autonomy in robot-assisted surgery through the development of a multi-modal, multi-task, vision–language–action model for surgical robots. Ultimately, we argue that surgical robots are uniquely positioned to benefit from general-purpose models and provide four guiding actions towards increased autonomy in robot-assisted surgery. Schmidgall et al. describe a pathway for building general-purpose machine learning models for robot-assisted surgery, including mechanisms for avoiding risk and handing over control to surgeons, and improving safety and outcomes beyond demonstration data.

端到端机器人学习的主流模式侧重于优化特定任务目标，以解决单一的机器人问题，如拾取物体或到达目标位置。然而，最近在机器人大容量模型方面的研究表明，在大量不同的、与任务无关的视频演示数据集上进行训练很有前途。这些模型对未知环境的泛化程度令人印象深刻，尤其是在数据量和模型复杂度不断增加的情况下。从数据中学习的手术机器人系统一直难以像其他机器人学习领域那样快速发展，原因有以下几点：缺乏现有的大规模开源数据来训练模型；由于模拟无法与生物组织的物理和视觉复杂性相匹配，因此对这些机器人在手术过程中的软体变形进行建模具有挑战性；手术机器人在临床试验中存在伤害患者的风险，因此需要采取更广泛的安全措施。本视角旨在通过为手术机器人开发多模式、多任务、视觉-语言-动作模型，为提高机器人辅助手术中的机器人自主性提供一条途径。最终，我们认为手术机器人具有得天独厚的优势，可以从通用模型中获益，并为提高机器人辅助手术的自主性提供四项指导行动。

{"title":"General-purpose foundation models for increased autonomy in robot-assisted surgery","authors":"Samuel Schmidgall, Ji Woong Kim, Alan Kuntz, Ahmed Ezzat Ghazi, Axel Krieger","doi":"10.1038/s42256-024-00917-4","DOIUrl":"10.1038/s42256-024-00917-4","url":null,"abstract":"The dominant paradigm for end-to-end robot learning focuses on optimizing task-specific objectives that solve a single robotic problem such as picking up an object or reaching a target position. However, recent work on high-capacity models in robotics has shown promise towards being trained on large collections of diverse and task-agnostic datasets of video demonstrations. These models have shown impressive levels of generalization to unseen circumstances, especially as the amount of data and the model complexity scale. Surgical robot systems that learn from data have struggled to advance as quickly as other fields of robot learning for a few reasons: there is a lack of existing large-scale open-source data to train models; it is challenging to model the soft-body deformations that these robots work with during surgery because simulation cannot match the physical and visual complexity of biological tissue; and surgical robots risk harming patients when tested in clinical trials and require more extensive safety measures. This Perspective aims to provide a path towards increasing robot autonomy in robot-assisted surgery through the development of a multi-modal, multi-task, vision–language–action model for surgical robots. Ultimately, we argue that surgical robots are uniquely positioned to benefit from general-purpose models and provide four guiding actions towards increased autonomy in robot-assisted surgery. Schmidgall et al. describe a pathway for building general-purpose machine learning models for robot-assisted surgery, including mechanisms for avoiding risk and handing over control to surgeons, and improving safety and outcomes beyond demonstration data.","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"6 11","pages":"1275-1283"},"PeriodicalIF":18.8,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142562164","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Results from the autoPET challenge on fully automated lesion segmentation in oncologic PET/CT imaging 肿瘤 PET/CT 成像中全自动病灶分割的 autoPET 挑战赛结果

IF 18.8 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Nature Machine Intelligence

Pub Date : 2024-10-30 DOI: 10.1038/s42256-024-00912-9

Sergios Gatidis, Marcel Früh, Matthias P. Fabritius, Sijing Gu, Konstantin Nikolaou, Christian La Fougère, Jin Ye, Junjun He, Yige Peng, Lei Bi, Jun Ma, Bo Wang, Jia Zhang, Yukun Huang, Lars Heiliger, Zdravko Marinov, Rainer Stiefelhagen, Jan Egger, Jens Kleesiek, Ludovic Sibille, Lei Xiang, Simone Bendazzoli, Mehdi Astaraki, Michael Ingrisch, Clemens C. Cyran, Thomas Küstner

Automated detection of tumour lesions on positron emission tomography–computed tomography (PET/CT) image data is a clinically relevant but highly challenging task. Progress in this field has been hampered in the past owing to the lack of publicly available annotated data and limited availability of platforms for inter-institutional collaboration. Here we describe the results of the autoPET challenge, a biomedical image analysis challenge aimed to motivate research in the field of automated PET/CT image analysis. The challenge task was the automated segmentation of metabolically active tumour lesions on whole-body 18F-fluorodeoxyglucose PET/CT. Challenge participants had access to a large publicly available annotated PET/CT dataset for algorithm training. All algorithms submitted to the final challenge phase were based on deep learning methods, mostly using three-dimensional U-Net architectures. Submitted algorithms were evaluated on a private test set composed of 150 PET/CT studies from two institutions. An ensemble model of the highest-ranking algorithms achieved favourable performance compared with individual algorithms. Algorithm performance was dependent on the quality and quantity of data and on algorithm design choices, such as tailored post-processing of predicted segmentations. Future iterations of this challenge will focus on generalization and clinical translation. Automating the image analysis process for oncologic whole-body positron emission tomography–computed tomography data is a key area of interest. Gatidis et al. describe the autoPET 2022 challenge, an international competition focused on the segmentation of metabolically active tumour lesions, aiming to advance techniques in the field.

在正电子发射计算机断层扫描（PET/CT）图像数据上自动检测肿瘤病灶是一项与临床相关但极具挑战性的任务。过去，由于缺乏公开可用的注释数据以及机构间合作平台有限，这一领域的研究进展一直受阻。我们在此介绍 autoPET 挑战赛的结果，这是一项生物医学图像分析挑战赛，旨在激励 PET/CT 图像自动分析领域的研究。挑战任务是自动分割全身 18F 氟脱氧葡萄糖 PET/CT 上代谢活跃的肿瘤病灶。挑战赛的参赛者可以访问大量公开的注释 PET/CT 数据集，进行算法训练。提交到最后挑战阶段的所有算法都基于深度学习方法，大多使用三维 U-Net 架构。提交的算法在一个私人测试集上进行了评估，该测试集由来自两个机构的 150 项 PET/CT 研究组成。与单个算法相比，排名最高算法的集合模型取得了良好的性能。算法的性能取决于数据的质量和数量以及算法设计的选择，例如对预测分割进行量身定制的后处理。这项挑战赛的未来迭代将侧重于推广和临床转化。

{"title":"Results from the autoPET challenge on fully automated lesion segmentation in oncologic PET/CT imaging","authors":"Sergios Gatidis, Marcel Früh, Matthias P. Fabritius, Sijing Gu, Konstantin Nikolaou, Christian La Fougère, Jin Ye, Junjun He, Yige Peng, Lei Bi, Jun Ma, Bo Wang, Jia Zhang, Yukun Huang, Lars Heiliger, Zdravko Marinov, Rainer Stiefelhagen, Jan Egger, Jens Kleesiek, Ludovic Sibille, Lei Xiang, Simone Bendazzoli, Mehdi Astaraki, Michael Ingrisch, Clemens C. Cyran, Thomas Küstner","doi":"10.1038/s42256-024-00912-9","DOIUrl":"10.1038/s42256-024-00912-9","url":null,"abstract":"Automated detection of tumour lesions on positron emission tomography–computed tomography (PET/CT) image data is a clinically relevant but highly challenging task. Progress in this field has been hampered in the past owing to the lack of publicly available annotated data and limited availability of platforms for inter-institutional collaboration. Here we describe the results of the autoPET challenge, a biomedical image analysis challenge aimed to motivate research in the field of automated PET/CT image analysis. The challenge task was the automated segmentation of metabolically active tumour lesions on whole-body 18F-fluorodeoxyglucose PET/CT. Challenge participants had access to a large publicly available annotated PET/CT dataset for algorithm training. All algorithms submitted to the final challenge phase were based on deep learning methods, mostly using three-dimensional U-Net architectures. Submitted algorithms were evaluated on a private test set composed of 150 PET/CT studies from two institutions. An ensemble model of the highest-ranking algorithms achieved favourable performance compared with individual algorithms. Algorithm performance was dependent on the quality and quantity of data and on algorithm design choices, such as tailored post-processing of predicted segmentations. Future iterations of this challenge will focus on generalization and clinical translation. Automating the image analysis process for oncologic whole-body positron emission tomography–computed tomography data is a key area of interest. Gatidis et al. describe the autoPET 2022 challenge, an international competition focused on the segmentation of metabolically active tumour lesions, aiming to advance techniques in the field.","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"6 11","pages":"1396-1405"},"PeriodicalIF":18.8,"publicationDate":"2024-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142541712","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Author Correction: Predicting equilibrium distributions for molecular systems with deep learning 作者更正：用深度学习预测分子系统的平衡分布

IF 18.8 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Nature Machine Intelligence

Pub Date : 2024-10-29 DOI: 10.1038/s42256-024-00933-4

Shuxin Zheng, Jiyan He, Chang Liu, Yu Shi, Ziheng Lu, Weitao Feng, Fusong Ju, Jiaxi Wang, Jianwei Zhu, Yaosen Min, He Zhang, Shidi Tang, Hongxia Hao, Peiran Jin, Chi Chen, Frank Noé, Haiguang Liu, Tie-Yan Liu

引用次数: 0

Deep learning prediction of ribosome profiling with Translatomer reveals translational regulation and interprets disease variants 利用 Translatomer 对核糖体图谱进行深度学习预测，揭示翻译调控并解释疾病变异

IF 18.8 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Nature Machine Intelligence

Pub Date : 2024-10-23 DOI: 10.1038/s42256-024-00915-6

Jialin He, Lei Xiong, Shaohui Shi, Chengyu Li, Kexuan Chen, Qianchen Fang, Jiuhong Nan, Ke Ding, Yuanhui Mao, Carles A. Boix, Xinyang Hu, Manolis Kellis, Jingyun Li, Xushen Xiong

Gene expression involves transcription and translation. Despite large datasets and increasingly powerful methods devoted to calculating genetic variants’ effects on transcription, discrepancy between messenger RNA and protein levels hinders the systematic interpretation of the regulatory effects of disease-associated variants. Accurate models of the sequence determinants of translation are needed to close this gap and to interpret disease-associated variants that act on translation. Here we present Translatomer, a multimodal transformer framework that predicts cell-type-specific translation from messenger RNA expression and gene sequence. We train the Translatomer on 33 tissues and cell lines, and show that the inclusion of sequence improves the prediction of ribosome profiling signal, indicating that the Translatomer captures sequence-dependent translational regulatory information. The Translatomer achieves accuracies of 0.72 to 0.80 for the de novo prediction of cell-type-specific ribosome profiling. We develop an in silico mutagenesis tool to estimate mutational effects on translation and demonstrate that variants associated with translation regulation are evolutionarily constrained, both in the human population and across species. In particular, we identify cell-type-specific translational regulatory mechanisms independent of the expression quantitative trait loci for 3,041 non-coding and synonymous variants associated with complex diseases, including Alzheimer’s disease, schizophrenia and congenital heart disease. The Translatomer accurately models the genetic underpinnings of translation, bridging the gap between messenger RNA and protein levels as well as providing valuable mechanistic insights for uninterpreted disease variants. A transformer-based approach called Translatomer is presented, which models cell-type-specific translation from messenger RNA expression and gene sequence, bridging the gap between messenger RNA and protein levels as well as providing a mechanistic insight into the genetic regulation of translation.

基因表达涉及转录和翻译。尽管有大量的数据集和越来越强大的方法来计算基因变异对转录的影响，但信使 RNA 和蛋白质水平之间的差异阻碍了对疾病相关变异的调控效应的系统解释。要缩小这一差距，解释作用于翻译的疾病相关变异，需要翻译序列决定因素的精确模型。我们在此介绍 Translatomer，这是一个多模式转换器框架，可从信使 RNA 表达和基因序列预测细胞类型特异性翻译。我们在 33 种组织和细胞系上对 Translatomer 进行了训练，结果表明，加入序列能改善核糖体剖析信号的预测，这表明 Translatomer 捕捉到了依赖序列的翻译调控信息。Translatomer 对细胞类型特异性核糖体图谱的从头预测准确率达到了 0.72 到 0.80。我们开发了一种硅学诱变工具来估计突变对翻译的影响，并证明与翻译调控相关的变体在人类群体和不同物种中都受到进化的限制。特别是，我们确定了细胞类型特异性翻译调控机制，这些机制独立于与阿尔茨海默病、精神分裂症和先天性心脏病等复杂疾病相关的 3,041 个非编码变异和同义变异的表达量性状位点。Translatomer 能准确模拟翻译的遗传基础，弥合信使 RNA 和蛋白质水平之间的差距，并为无法解读的疾病变异提供有价值的机理见解。

{"title":"Deep learning prediction of ribosome profiling with Translatomer reveals translational regulation and interprets disease variants","authors":"Jialin He, Lei Xiong, Shaohui Shi, Chengyu Li, Kexuan Chen, Qianchen Fang, Jiuhong Nan, Ke Ding, Yuanhui Mao, Carles A. Boix, Xinyang Hu, Manolis Kellis, Jingyun Li, Xushen Xiong","doi":"10.1038/s42256-024-00915-6","DOIUrl":"10.1038/s42256-024-00915-6","url":null,"abstract":"Gene expression involves transcription and translation. Despite large datasets and increasingly powerful methods devoted to calculating genetic variants’ effects on transcription, discrepancy between messenger RNA and protein levels hinders the systematic interpretation of the regulatory effects of disease-associated variants. Accurate models of the sequence determinants of translation are needed to close this gap and to interpret disease-associated variants that act on translation. Here we present Translatomer, a multimodal transformer framework that predicts cell-type-specific translation from messenger RNA expression and gene sequence. We train the Translatomer on 33 tissues and cell lines, and show that the inclusion of sequence improves the prediction of ribosome profiling signal, indicating that the Translatomer captures sequence-dependent translational regulatory information. The Translatomer achieves accuracies of 0.72 to 0.80 for the de novo prediction of cell-type-specific ribosome profiling. We develop an in silico mutagenesis tool to estimate mutational effects on translation and demonstrate that variants associated with translation regulation are evolutionarily constrained, both in the human population and across species. In particular, we identify cell-type-specific translational regulatory mechanisms independent of the expression quantitative trait loci for 3,041 non-coding and synonymous variants associated with complex diseases, including Alzheimer’s disease, schizophrenia and congenital heart disease. The Translatomer accurately models the genetic underpinnings of translation, bridging the gap between messenger RNA and protein levels as well as providing valuable mechanistic insights for uninterpreted disease variants. A transformer-based approach called Translatomer is presented, which models cell-type-specific translation from messenger RNA expression and gene sequence, bridging the gap between messenger RNA and protein levels as well as providing a mechanistic insight into the genetic regulation of translation.","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"6 11","pages":"1314-1329"},"PeriodicalIF":18.8,"publicationDate":"2024-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142488349","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Epitope-anchored contrastive transfer learning for paired CD8+ T cell receptor–antigen recognition CD8+T细胞受体-抗原配对识别的表位锚定对比迁移学习

IF 18.8 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Nature Machine Intelligence

Pub Date : 2024-10-22 DOI: 10.1038/s42256-024-00913-8

Yumeng Zhang, Zhikang Wang, Yunzhe Jiang, Dene R. Littler, Mark Gerstein, Anthony W. Purcell, Jamie Rossjohn, Hong-Yu Ou, Jiangning Song

Understanding the mechanisms of T cell antigen recognition that underpin adaptive immune responses is critical for developing vaccines, immunotherapies and treatments against autoimmune diseases. Despite extensive research efforts, accurate prediction of T cell receptor (TCR)–antigen binding pairs remains a great challenge due to the vast diversity and cross-reactivity of TCRs. Here we propose a deep-learning-based framework termed epitope-anchored contrastive transfer learning (EPACT) tailored to paired human CD8+ TCRs. Harnessing the pretrained representations and co-embeddings of peptide–major histocompatibility complex (pMHC) and TCR, EPACT demonstrated generalizability in predicting binding specificity for unseen epitopes and distinct TCR repertoires. Contrastive learning enabled highly precise predictions for immunodominant epitopes and interpretable analysis of epitope-specific T cells. We applied EPACT to SARS-CoV-2-responsive T cells, and the predicted binding strength aligned well with the surge in spike-specific immune responses after vaccination. We further fine-tuned EPACT on structural data to decipher the residue-level interactions involved in TCR–antigen recognition. EPACT was capable of quantifying interchain distance matrices and identifying contact residues, corroborating the presence of TCR cross-reactivity across multiple tumour-associated antigens. Together, EPACT can serve as a useful artificial intelligence approach with important potential in practical applications and contribute towards the development of TCR-based immunotherapies. Accurate prediction of T cell receptor (TCR)–antigen recognition remains a challenge. Zhang et al. propose a contrastive transfer learning model to predict TCR–pMHC binding that enables interpretable analyses of epitope-specific T cells and can decipher residue-level interactions.

了解支撑适应性免疫反应的 T 细胞抗原识别机制对于开发疫苗、免疫疗法和治疗自身免疫性疾病至关重要。尽管开展了大量研究工作，但由于 TCR 的多样性和交叉反应性，准确预测 T 细胞受体（TCR）与抗原的结合对仍然是一项巨大的挑战。在这里，我们提出了一种基于深度学习的框架，称为表位锚定对比转移学习（EPACT），专门针对成对的人类 CD8+ TCR。利用肽-主要组织相容性复合体（pMHC）和TCR的预训练表征和共嵌入，EPACT在预测未知表位和不同TCR复合物的结合特异性方面展示了通用性。对比学习可以对免疫优势表位进行高度精确的预测，并对表位特异性 T 细胞进行可解释的分析。我们将 EPACT 应用于 SARS-CoV-2 反应性 T 细胞，预测的结合强度与接种疫苗后尖峰特异性免疫反应的激增非常吻合。我们根据结构数据进一步微调了 EPACT，以破译 TCR 与抗原识别中涉及的残基级相互作用。EPACT 能够量化链间距离矩阵并识别接触残基，从而证实多种肿瘤相关抗原之间存在 TCR 交叉反应。总之，EPACT可以作为一种有用的人工智能方法，在实际应用中具有重要潜力，并有助于开发基于TCR的免疫疗法。

{"title":"Epitope-anchored contrastive transfer learning for paired CD8+ T cell receptor–antigen recognition","authors":"Yumeng Zhang, Zhikang Wang, Yunzhe Jiang, Dene R. Littler, Mark Gerstein, Anthony W. Purcell, Jamie Rossjohn, Hong-Yu Ou, Jiangning Song","doi":"10.1038/s42256-024-00913-8","DOIUrl":"10.1038/s42256-024-00913-8","url":null,"abstract":"Understanding the mechanisms of T cell antigen recognition that underpin adaptive immune responses is critical for developing vaccines, immunotherapies and treatments against autoimmune diseases. Despite extensive research efforts, accurate prediction of T cell receptor (TCR)–antigen binding pairs remains a great challenge due to the vast diversity and cross-reactivity of TCRs. Here we propose a deep-learning-based framework termed epitope-anchored contrastive transfer learning (EPACT) tailored to paired human CD8+ TCRs. Harnessing the pretrained representations and co-embeddings of peptide–major histocompatibility complex (pMHC) and TCR, EPACT demonstrated generalizability in predicting binding specificity for unseen epitopes and distinct TCR repertoires. Contrastive learning enabled highly precise predictions for immunodominant epitopes and interpretable analysis of epitope-specific T cells. We applied EPACT to SARS-CoV-2-responsive T cells, and the predicted binding strength aligned well with the surge in spike-specific immune responses after vaccination. We further fine-tuned EPACT on structural data to decipher the residue-level interactions involved in TCR–antigen recognition. EPACT was capable of quantifying interchain distance matrices and identifying contact residues, corroborating the presence of TCR cross-reactivity across multiple tumour-associated antigens. Together, EPACT can serve as a useful artificial intelligence approach with important potential in practical applications and contribute towards the development of TCR-based immunotherapies. Accurate prediction of T cell receptor (TCR)–antigen recognition remains a challenge. Zhang et al. propose a contrastive transfer learning model to predict TCR–pMHC binding that enables interpretable analyses of epitope-specific T cells and can decipher residue-level interactions.","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"6 11","pages":"1344-1358"},"PeriodicalIF":18.8,"publicationDate":"2024-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142486862","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0