首页 > 最新文献

Nature Machine Intelligence最新文献

英文 中文
Inferring spatial single-cell-level interactions through interpreting cell state and niche correlations learned by self-supervised graph transformer 通过解释由自监督图转换器学习的细胞状态和生态位相关性来推断空间单细胞水平的相互作用
IF 23.9 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-31 DOI: 10.1038/s42256-025-01161-0
Xiao Xiao, Le Zhang, Hongyu Zhao, Zuoheng Wang
Cell–cell interactions (CCI), driven by distance-dependent signalling, are important for tissue development and organ function. While imaging-based spatial transcriptomics offers unprecedented opportunities to unravel CCI at single-cell resolution, current analyses face challenges such as limited ligand–receptor pairs measured, insufficient spatial encoding and low interpretability. We present GITIII (graph inductive bias transformer for intercellular interaction investigation), a lightweight, interpretable, self-supervised graph transformer-based model that conceptualizes cells as words and their surrounding cellular neighbourhood as context that shapes the meaning or state of the central cell. GITIII infers CCI by examining the correlation between a cell’s state and its niche, enabling us to understand how sender cells influence the gene expression of receiver cells, visualize spatial CCI patterns, perform CCI-informed cell clustering and construct CCI networks. Applied to four spatial transcriptomics datasets across multiple species, organs and platforms, GITIII effectively identified and statistically interpreted CCI patterns in the brain and tumour microenvironments. Xiao et al. present GITIII, a lightweight and interpretable graph transformer for inferring spatial single-cell-level interactions and quantifying the influence of neighbouring cells on the gene expression of receiver cells in spatial transcriptomics.
由距离依赖信号驱动的细胞-细胞相互作用(CCI)对组织发育和器官功能至关重要。虽然基于成像的空间转录组学为在单细胞分辨率上解开CCI提供了前所未有的机会,但目前的分析面临着诸如测量的配体-受体对有限、空间编码不足和可解释性低等挑战。我们提出了GITIII(用于细胞间相互作用研究的图感应偏置变压器),这是一种轻量级、可解释、自监督的基于图转换器的模型,它将细胞概念化为单词,将其周围的细胞邻居概念化为塑造中心细胞的意义或状态的上下文。GITIII通过检查细胞状态与其生态位之间的相关性来推断CCI,使我们能够了解发送细胞如何影响接收细胞的基因表达,可视化空间CCI模式,执行CCI信息细胞聚类并构建CCI网络。应用于跨多个物种、器官和平台的四个空间转录组学数据集,GITIII有效地识别并统计解释了脑和肿瘤微环境中的CCI模式。
{"title":"Inferring spatial single-cell-level interactions through interpreting cell state and niche correlations learned by self-supervised graph transformer","authors":"Xiao Xiao, Le Zhang, Hongyu Zhao, Zuoheng Wang","doi":"10.1038/s42256-025-01161-0","DOIUrl":"10.1038/s42256-025-01161-0","url":null,"abstract":"Cell–cell interactions (CCI), driven by distance-dependent signalling, are important for tissue development and organ function. While imaging-based spatial transcriptomics offers unprecedented opportunities to unravel CCI at single-cell resolution, current analyses face challenges such as limited ligand–receptor pairs measured, insufficient spatial encoding and low interpretability. We present GITIII (graph inductive bias transformer for intercellular interaction investigation), a lightweight, interpretable, self-supervised graph transformer-based model that conceptualizes cells as words and their surrounding cellular neighbourhood as context that shapes the meaning or state of the central cell. GITIII infers CCI by examining the correlation between a cell’s state and its niche, enabling us to understand how sender cells influence the gene expression of receiver cells, visualize spatial CCI patterns, perform CCI-informed cell clustering and construct CCI networks. Applied to four spatial transcriptomics datasets across multiple species, organs and platforms, GITIII effectively identified and statistically interpreted CCI patterns in the brain and tumour microenvironments. Xiao et al. present GITIII, a lightweight and interpretable graph transformer for inferring spatial single-cell-level interactions and quantifying the influence of neighbouring cells on the gene expression of receiver cells in spatial transcriptomics.","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"8 1","pages":"42-58"},"PeriodicalIF":23.9,"publicationDate":"2025-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145894631","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Assessing the potential of deep learning for protein–ligand docking 评估深度学习在蛋白质配体对接中的潜力
IF 23.9 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-31 DOI: 10.1038/s42256-025-01160-1
Alex Morehead, Nabin Giri, Jian Liu, Pawan Neupane, Jianlin Cheng
The effects of ligand binding on protein structures and their in vivo functions carry numerous implications for modern biomedical research and biotechnology development efforts such as drug discovery. Although several deep learning (DL) methods and benchmarks designed for protein–ligand docking have recently been introduced, so far no previous works have systematically studied the behaviour of the latest docking and structure prediction methods within the broadly applicable context of: (1) using predicted (apo) protein structures for docking (for example, for applicability to new proteins); (2) binding multiple (cofactor) ligands concurrently to a given target protein (for example, for enzyme design); and (3) having no previous knowledge of binding pockets (for example, for generalization to unknown pockets). To enable a deeper understanding of the real-world utility of docking methods, we introduce PoseBench, a comprehensive benchmark for broadly applicable protein–ligand docking. PoseBench enables researchers to rigorously and systematically evaluate DL methods for apo-to-holo protein–ligand docking and protein–ligand structure prediction using both primary ligand and multiligand benchmark datasets, the latter of which we introduce to the DL community. Empirically, using PoseBench, we find that: (1) DL cofolding methods generally outperform comparable conventional and DL docking baseline algorithms, but popular methods such as AlphaFold 3 are still challenged by prediction targets with new protein–ligand binding poses; (2) certain DL cofolding methods are highly sensitive to their input multiple sequence alignments, whereas others are not; and (3) DL methods struggle to strike a balance between structural accuracy and chemical specificity when predicting new or multiligand protein targets. Morehead et al. introduce the benchmark PoseBench and evaluate the strengths and limitations of current AI-based protein–ligand docking and structure prediction methods.
配体结合对蛋白质结构及其体内功能的影响对现代生物医学研究和生物技术开发工作(如药物发现)具有许多意义。虽然最近已经引入了几种为蛋白质-配体对接设计的深度学习(DL)方法和基准,但到目前为止,还没有以前的工作系统地研究了最新对接和结构预测方法在广泛适用的背景下的行为:(1)使用预测的(载脂蛋白)蛋白质结构进行对接(例如,适用于新蛋白质);(2)将多个(辅因子)配体同时结合到给定的靶蛋白上(例如,用于酶设计);(3)以前没有绑定口袋的知识(例如,为了推广到未知口袋)。为了更深入地了解对接方法在现实世界中的效用,我们引入了PoseBench,这是一个广泛适用的蛋白质-配体对接的综合基准。PoseBench使研究人员能够使用初级配体和多配体基准数据集,严格和系统地评估载脂蛋白到全息蛋白配体对接和蛋白质配体结构预测的DL方法,我们将后者介绍给DL社区。利用PoseBench,我们发现:(1)深度学习共折叠方法总体上优于传统和深度学习对接基线算法,但流行的方法(如AlphaFold 3)仍然面临着用新的蛋白质-配体结合姿态预测目标的挑战;(2)某些DL编码方法对输入序列比对高度敏感,而其他DL编码方法对输入序列比对不敏感;(3) DL方法在预测新的或多配体蛋白靶点时难以在结构准确性和化学特异性之间取得平衡。
{"title":"Assessing the potential of deep learning for protein–ligand docking","authors":"Alex Morehead, Nabin Giri, Jian Liu, Pawan Neupane, Jianlin Cheng","doi":"10.1038/s42256-025-01160-1","DOIUrl":"10.1038/s42256-025-01160-1","url":null,"abstract":"The effects of ligand binding on protein structures and their in vivo functions carry numerous implications for modern biomedical research and biotechnology development efforts such as drug discovery. Although several deep learning (DL) methods and benchmarks designed for protein–ligand docking have recently been introduced, so far no previous works have systematically studied the behaviour of the latest docking and structure prediction methods within the broadly applicable context of: (1) using predicted (apo) protein structures for docking (for example, for applicability to new proteins); (2) binding multiple (cofactor) ligands concurrently to a given target protein (for example, for enzyme design); and (3) having no previous knowledge of binding pockets (for example, for generalization to unknown pockets). To enable a deeper understanding of the real-world utility of docking methods, we introduce PoseBench, a comprehensive benchmark for broadly applicable protein–ligand docking. PoseBench enables researchers to rigorously and systematically evaluate DL methods for apo-to-holo protein–ligand docking and protein–ligand structure prediction using both primary ligand and multiligand benchmark datasets, the latter of which we introduce to the DL community. Empirically, using PoseBench, we find that: (1) DL cofolding methods generally outperform comparable conventional and DL docking baseline algorithms, but popular methods such as AlphaFold 3 are still challenged by prediction targets with new protein–ligand binding poses; (2) certain DL cofolding methods are highly sensitive to their input multiple sequence alignments, whereas others are not; and (3) DL methods struggle to strike a balance between structural accuracy and chemical specificity when predicting new or multiligand protein targets. Morehead et al. introduce the benchmark PoseBench and evaluate the strengths and limitations of current AI-based protein–ligand docking and structure prediction methods.","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"8 1","pages":"32-41"},"PeriodicalIF":23.9,"publicationDate":"2025-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.nature.comhttps://www.nature.com/articles/s42256-025-01160-1.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145894622","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learning intermediate physical states for inverse metasurface design 学习中间物理状态用于逆超表面设计
IF 23.9 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-31 DOI: 10.1038/s42256-025-01167-8
Chun-Teh Chen
Deep generative models that learn intermediate surface-current maps, rather than layouts directly, offer a more stable route to inverse design of tunable and stacked metasurfaces.
深度生成模型学习中间表面电流图,而不是直接布局,为可调和堆叠元表面的反向设计提供了更稳定的途径。
{"title":"Learning intermediate physical states for inverse metasurface design","authors":"Chun-Teh Chen","doi":"10.1038/s42256-025-01167-8","DOIUrl":"10.1038/s42256-025-01167-8","url":null,"abstract":"Deep generative models that learn intermediate surface-current maps, rather than layouts directly, offer a more stable route to inverse design of tunable and stacked metasurfaces.","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"8 1","pages":"2-3"},"PeriodicalIF":23.9,"publicationDate":"2025-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145895501","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reusability report: Optimizing T count in general quantum circuits with AlphaTensor-Quantum 可重用性报告:利用alphatsensor - quantum优化一般量子电路中的T计数
IF 23.9 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-31 DOI: 10.1038/s42256-025-01166-9
Remmy Zen, Maximilian Nägele, Florian Marquardt
Quantum computing has the potential to solve problems that are intractable for classical computers, with possible applications in areas such as drug discovery and high-energy physics. However, the practical implementation of quantum computation is hindered by the complexity of executing quantum circuits on hardware. In particular, minimizing the number of T gates is crucial for implementing efficient quantum algorithms. AlphaTensor-Quantum1 is a reinforcement-learning-based method designed to optimize the T count of quantum circuits by formulating the problem as a tensor decomposition task. Although it has demonstrated superior performance over existing methods on benchmark quantum arithmetic circuits, its applicability has so far been restricted to specific circuit families, requiring separate, time-intensive training for each new application. This report reproduces some of the key results of the original work and extends AlphaTensor-Quantum’s capabilities to simplify random quantum circuits with varying qubit counts, eliminating the need for retraining on new circuits. Our experiments show that a general agent trained on five- to eight-qubit circuits achieves greater T-count reduction than previous methods for a large fraction of quantum circuits. Furthermore, we demonstrate that a general agent trained on circuits with varying qubit numbers outperforms agents trained on fixed qubit numbers, highlighting the method’s generalizability and its potential for broader quantum circuit optimization tasks. The reusability of AlphaTensor-Quantum is tested and the method is extended to optimize a broad range of quantum circuits without retraining, achieving greater T-count reductions and demonstrating generalizable and efficient quantum circuit optimization.
量子计算有可能解决经典计算机难以解决的问题,并可能应用于药物发现和高能物理等领域。然而,在硬件上执行量子电路的复杂性阻碍了量子计算的实际实现。特别是,最小化T门的数量对于实现高效的量子算法至关重要。alphatensensor - quantum1是一种基于强化学习的方法,旨在通过将问题表述为张量分解任务来优化量子电路的T计数。尽管它在基准量子算法电路上的表现优于现有方法,但其适用性迄今仅限于特定的电路系列,每个新应用都需要单独的、耗时的训练。该报告再现了原始工作的一些关键结果,并扩展了alphatsensor - quantum的功能,以简化具有不同量子位计数的随机量子电路,从而消除了对新电路进行再培训的需要。我们的实验表明,在5到8个量子比特电路上训练的一般智能体比以前的方法在很大一部分量子电路上实现了更大的t计数减少。此外,我们证明了在不同量子比特数的电路上训练的一般智能体优于在固定量子比特数上训练的智能体,突出了该方法的泛化性及其在更广泛的量子电路优化任务中的潜力。
{"title":"Reusability report: Optimizing T count in general quantum circuits with AlphaTensor-Quantum","authors":"Remmy Zen, Maximilian Nägele, Florian Marquardt","doi":"10.1038/s42256-025-01166-9","DOIUrl":"10.1038/s42256-025-01166-9","url":null,"abstract":"Quantum computing has the potential to solve problems that are intractable for classical computers, with possible applications in areas such as drug discovery and high-energy physics. However, the practical implementation of quantum computation is hindered by the complexity of executing quantum circuits on hardware. In particular, minimizing the number of T gates is crucial for implementing efficient quantum algorithms. AlphaTensor-Quantum1 is a reinforcement-learning-based method designed to optimize the T count of quantum circuits by formulating the problem as a tensor decomposition task. Although it has demonstrated superior performance over existing methods on benchmark quantum arithmetic circuits, its applicability has so far been restricted to specific circuit families, requiring separate, time-intensive training for each new application. This report reproduces some of the key results of the original work and extends AlphaTensor-Quantum’s capabilities to simplify random quantum circuits with varying qubit counts, eliminating the need for retraining on new circuits. Our experiments show that a general agent trained on five- to eight-qubit circuits achieves greater T-count reduction than previous methods for a large fraction of quantum circuits. Furthermore, we demonstrate that a general agent trained on circuits with varying qubit numbers outperforms agents trained on fixed qubit numbers, highlighting the method’s generalizability and its potential for broader quantum circuit optimization tasks. The reusability of AlphaTensor-Quantum is tested and the method is extended to optimize a broad range of quantum circuits without retraining, achieving greater T-count reductions and demonstrating generalizable and efficient quantum circuit optimization.","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"8 1","pages":"113-117"},"PeriodicalIF":23.9,"publicationDate":"2025-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.nature.comhttps://www.nature.com/articles/s42256-025-01166-9.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145894628","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Harnessing the power of single-cell large language models with parameter-efficient fine-tuning using scPEFT 利用单细胞大型语言模型的功能,使用scPEFT进行参数高效微调
IF 23.9 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-31 DOI: 10.1038/s42256-025-01170-z
Fei He, Ruixin Fei, Jordan E. Krull, Yang Yu, Xinyu Zhang, Xianyu Wang, Hao Cheng, Mingyue Gao, Li Su, Yibo Chen, Jinpu Li, Baichuan Jin, Yuzhou Chang, Anjun Ma, Qin Ma, Dong Xu
Single-cell large language models (scLLMs) capture essential biological insights from vast single-cell atlases but struggle in out-of-context applications, where zero-shot predictions can be unreliable. To address this, here we introduce a single-cell parameter-efficient fine-tuning (scPEFT) framework that integrates learnable, low-dimensional adapters into scLLMs. By freezing the backbone model and updating only the adapter parameters, scPEFT efficiently adapts to specific tasks using limited custom data. This approach mitigates catastrophic forgetting, reduces parameter tuning by over 96% and decreases GPU memory usage by more than half, thus substantially enhancing the accessibility of scLLMs for resource-constrained researchers. When validated across diverse datasets, scPEFT outperformed zero-shot models and traditional fine-tuning in disease-specific, cross-species and undercharacterized cell population tasks. Its attention-mechanism analysis identified COVID-related genes associated with specific cell states and uncovered unique blood cell subpopulations, demonstrating the capacity of scPEFT for condition-specific interpretations. These findings position scPEFT as an efficient solution for enhancing the utility of scLLMs in general single-cell analyses. He et al. present a parameter-efficient fine-tuning method for single-cell language models that improves performance on unseen diseases, treatments and cell types.
单细胞大语言模型(scLLMs)从庞大的单细胞图谱中获取重要的生物学见解,但在脱离上下文的应用中却很困难,因为零概率预测可能不可靠。为了解决这个问题,我们在这里引入了一个单单元参数有效微调(scPEFT)框架,该框架将可学习的低维适配器集成到scllm中。通过冻结骨干模型并仅更新适配器参数,scPEFT可以使用有限的自定义数据有效地适应特定的任务。这种方法减轻了灾难性遗忘,减少了96%以上的参数调整,并将GPU内存使用量减少了一半以上,从而大大提高了资源受限研究人员对scllm的可访问性。当在不同的数据集上进行验证时,scPEFT在疾病特异性、跨物种和未充分表征的细胞群任务中优于零射击模型和传统微调。其注意力机制分析确定了与特定细胞状态相关的covid - 19相关基因,并揭示了独特的血细胞亚群,证明了scPEFT对疾病特异性解释的能力。这些发现将scPEFT定位为提高scllm在一般单细胞分析中的效用的有效解决方案。
{"title":"Harnessing the power of single-cell large language models with parameter-efficient fine-tuning using scPEFT","authors":"Fei He, Ruixin Fei, Jordan E. Krull, Yang Yu, Xinyu Zhang, Xianyu Wang, Hao Cheng, Mingyue Gao, Li Su, Yibo Chen, Jinpu Li, Baichuan Jin, Yuzhou Chang, Anjun Ma, Qin Ma, Dong Xu","doi":"10.1038/s42256-025-01170-z","DOIUrl":"10.1038/s42256-025-01170-z","url":null,"abstract":"Single-cell large language models (scLLMs) capture essential biological insights from vast single-cell atlases but struggle in out-of-context applications, where zero-shot predictions can be unreliable. To address this, here we introduce a single-cell parameter-efficient fine-tuning (scPEFT) framework that integrates learnable, low-dimensional adapters into scLLMs. By freezing the backbone model and updating only the adapter parameters, scPEFT efficiently adapts to specific tasks using limited custom data. This approach mitigates catastrophic forgetting, reduces parameter tuning by over 96% and decreases GPU memory usage by more than half, thus substantially enhancing the accessibility of scLLMs for resource-constrained researchers. When validated across diverse datasets, scPEFT outperformed zero-shot models and traditional fine-tuning in disease-specific, cross-species and undercharacterized cell population tasks. Its attention-mechanism analysis identified COVID-related genes associated with specific cell states and uncovered unique blood cell subpopulations, demonstrating the capacity of scPEFT for condition-specific interpretations. These findings position scPEFT as an efficient solution for enhancing the utility of scLLMs in general single-cell analyses. He et al. present a parameter-efficient fine-tuning method for single-cell language models that improves performance on unseen diseases, treatments and cell types.","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"8 1","pages":"118-133"},"PeriodicalIF":23.9,"publicationDate":"2025-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145894629","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ImmunoStruct enables multimodal deep learning for immunogenicity prediction 免疫结构为免疫原性预测提供了多模态深度学习
IF 23.9 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-31 DOI: 10.1038/s42256-025-01163-y
Kevin Bijan Givechian, João Felipe Rocha, Chen Liu, Edward Yang, Sidharth Tyagi, Kerrie Greene, Rex Ying, Etienne Caron, Akiko Iwasaki, Smita Krishnaswamy
Epitope-based vaccines are promising therapeutic modalities for infectious diseases and cancer, but identifying immunogenic epitopes is challenging. Most prediction methods only use amino acid sequence information, and do not incorporate wide-scale structure data and biochemical properties across each peptide–major histocompatibility complex (MHC). We present ImmunoStruct, a deep learning model that integrates sequence, structural and biochemical information to predict multi-allele class I peptide–MHC immunogenicity. By leveraging a multimodal dataset of 26,049 peptide–MHCs, we demonstrate that ImmunoStruct improves immunogenicity prediction performance and interpretability beyond existing methods, across infectious disease epitopes and cancer neoepitopes. We further show strong alignment with in vitro assay results for a set of SARS-CoV-2 epitopes, as well as strong performance in peptide–MHC-based survival prediction for patients with cancer. Overall, this work also presents an architecture that incorporates equivariant graph processing and multimodal data integration for a long-standing challenge in immunotherapy. A multimodal deep learning model combines molecular sequence, structure and biochemical properties to predict immunogenicity in an interpretable way, providing a framework for smarter molecular prediction and hypothesis generation.
基于表位的疫苗是传染病和癌症的有希望的治疗方式,但确定免疫原性表位是具有挑战性的。大多数预测方法仅使用氨基酸序列信息,而不包括每个肽-主要组织相容性复合体(MHC)的大范围结构数据和生化特性。我们提出了一种深度学习模型ImmunoStruct,它集成了序列、结构和生化信息来预测多等位基因I类肽- mhc免疫原性。通过利用26,049个多肽mhc的多模态数据集,我们证明了ImmunoStruct在传染病表位和癌症新表位上比现有方法提高了免疫原性预测性能和可解释性。我们进一步表明,该方法与一组SARS-CoV-2表位的体外检测结果高度一致,并且在基于多肽mhc的癌症患者生存预测中表现出色。总的来说,这项工作还提出了一个架构,该架构结合了免疫治疗中长期存在的挑战的等变图处理和多模态数据集成。
{"title":"ImmunoStruct enables multimodal deep learning for immunogenicity prediction","authors":"Kevin Bijan Givechian, João Felipe Rocha, Chen Liu, Edward Yang, Sidharth Tyagi, Kerrie Greene, Rex Ying, Etienne Caron, Akiko Iwasaki, Smita Krishnaswamy","doi":"10.1038/s42256-025-01163-y","DOIUrl":"10.1038/s42256-025-01163-y","url":null,"abstract":"Epitope-based vaccines are promising therapeutic modalities for infectious diseases and cancer, but identifying immunogenic epitopes is challenging. Most prediction methods only use amino acid sequence information, and do not incorporate wide-scale structure data and biochemical properties across each peptide–major histocompatibility complex (MHC). We present ImmunoStruct, a deep learning model that integrates sequence, structural and biochemical information to predict multi-allele class I peptide–MHC immunogenicity. By leveraging a multimodal dataset of 26,049 peptide–MHCs, we demonstrate that ImmunoStruct improves immunogenicity prediction performance and interpretability beyond existing methods, across infectious disease epitopes and cancer neoepitopes. We further show strong alignment with in vitro assay results for a set of SARS-CoV-2 epitopes, as well as strong performance in peptide–MHC-based survival prediction for patients with cancer. Overall, this work also presents an architecture that incorporates equivariant graph processing and multimodal data integration for a long-standing challenge in immunotherapy. A multimodal deep learning model combines molecular sequence, structure and biochemical properties to predict immunogenicity in an interpretable way, providing a framework for smarter molecular prediction and hypothesis generation.","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"8 1","pages":"70-83"},"PeriodicalIF":23.9,"publicationDate":"2025-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145895500","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Author Correction: Scalable and robust DNA-based storage via coding theory and deep learning 作者更正:通过编码理论和深度学习可扩展和健壮的基于dna的存储
IF 23.9 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-30 DOI: 10.1038/s42256-025-01175-8
Daniella Bar-Lev, Itai Orr, Omer Sabary, Tuvi Etzion, Eitan Yaakobi
{"title":"Author Correction: Scalable and robust DNA-based storage via coding theory and deep learning","authors":"Daniella Bar-Lev, Itai Orr, Omer Sabary, Tuvi Etzion, Eitan Yaakobi","doi":"10.1038/s42256-025-01175-8","DOIUrl":"10.1038/s42256-025-01175-8","url":null,"abstract":"","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"8 1","pages":"134-134"},"PeriodicalIF":23.9,"publicationDate":"2025-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.nature.comhttps://www.nature.com/articles/s42256-025-01175-8.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145894634","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Versatile cardiovascular signal generation with a unified diffusion transformer 多功能心血管信号生成与统一的扩散变压器
IF 23.9 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-29 DOI: 10.1038/s42256-025-01147-y
Zehua Chen, Yuyang Miao, Liyuan Wang, Luyun Fan, Danilo P. Mandic, Jun Zhu
Cardiovascular signals such as photoplethysmography, electrocardiography and blood pressure are inherently correlated and complementary, together reflecting the health of the cardiovascular system. However, their joint utilization in real-time monitoring is severely limited by diverse acquisition challenges from noisy wearable recordings to burdened invasive procedures. Here we propose UniCardio, a multimodal diffusion transformer that reconstructs low-quality signals and synthesizes unrecorded signals in a unified generative framework. Its key innovations include a specialized model architecture to manage the signal modalities involved in generation tasks and a continual learning paradigm to incorporate varying modality combinations. By exploiting the complementary nature of cardiovascular signals, UniCardio clearly outperforms recent task-specific baselines in signal denoising, imputation and translation. The generated signals match the performance of ground-truth signals in detecting abnormal health conditions and estimating vital signs, even in unseen domains, as well as ensuring interpretability for human experts. These advantages establish UniCardio as a practical and robust framework for advancing artificial-intelligence-assisted healthcare. UniCardio is a unified framework for versatile multimodal cardiovascular signal generation, enabling robust signal restoration and cross-modal translation to detect abnormal conditions and estimate vital signs in real-time health monitoring.
光容积脉搏波、心电图、血压等心血管信号具有内在的相关性和互补性,共同反映心血管系统的健康状况。然而,它们在实时监测中的联合应用受到各种采集挑战的严重限制,从嘈杂的可穿戴录音到负担沉重的侵入性手术。在这里,我们提出UniCardio,一个多模态扩散变压器,重建低质量的信号,并在一个统一的生成框架中合成未记录的信号。它的关键创新包括一个专门的模型架构,用于管理生成任务中涉及的信号模态,以及一个持续学习范式,以纳入不同的模态组合。通过利用心血管信号的互补性,UniCardio在信号去噪、输入和翻译方面明显优于最近的特定任务基线。生成的信号在检测异常健康状况和估计生命体征方面的表现与地面真值信号相匹配,即使在看不见的领域也是如此,并确保人类专家的可解释性。这些优势使UniCardio成为推进人工智能辅助医疗保健的实用而强大的框架。
{"title":"Versatile cardiovascular signal generation with a unified diffusion transformer","authors":"Zehua Chen, Yuyang Miao, Liyuan Wang, Luyun Fan, Danilo P. Mandic, Jun Zhu","doi":"10.1038/s42256-025-01147-y","DOIUrl":"10.1038/s42256-025-01147-y","url":null,"abstract":"Cardiovascular signals such as photoplethysmography, electrocardiography and blood pressure are inherently correlated and complementary, together reflecting the health of the cardiovascular system. However, their joint utilization in real-time monitoring is severely limited by diverse acquisition challenges from noisy wearable recordings to burdened invasive procedures. Here we propose UniCardio, a multimodal diffusion transformer that reconstructs low-quality signals and synthesizes unrecorded signals in a unified generative framework. Its key innovations include a specialized model architecture to manage the signal modalities involved in generation tasks and a continual learning paradigm to incorporate varying modality combinations. By exploiting the complementary nature of cardiovascular signals, UniCardio clearly outperforms recent task-specific baselines in signal denoising, imputation and translation. The generated signals match the performance of ground-truth signals in detecting abnormal health conditions and estimating vital signs, even in unseen domains, as well as ensuring interpretability for human experts. These advantages establish UniCardio as a practical and robust framework for advancing artificial-intelligence-assisted healthcare. UniCardio is a unified framework for versatile multimodal cardiovascular signal generation, enabling robust signal restoration and cross-modal translation to detect abnormal conditions and estimate vital signs in real-time health monitoring.","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"8 1","pages":"6-19"},"PeriodicalIF":23.9,"publicationDate":"2025-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145895542","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learning cell dynamics with neural differential equations 用神经微分方程学习细胞动力学
IF 23.9 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-18 DOI: 10.1038/s42256-025-01150-3
Michael E. Vinyard, Anders W. Rasmussen, Ruitong Li, Allon M. Klein, Gad Getz, Luca Pinello
Single-cell sequencing measurements facilitate the reconstruction of dynamic biology by capturing snapshot molecular profiles of individual cells. Cell fate decisions in development and disease are orchestrated through an intricate balance of deterministic and stochastic regulatory events. Drift-diffusion equations are effective in modelling single-cell dynamics from high-dimensional single-cell measurements. While existing solutions describe the deterministic dynamics associated with the drift term of these equations at the level of cell state, diffusion is modelled as a constant across cell states. To fully understand the dynamic regulatory logic in development and disease, models explicitly attuned to the balance between deterministic and stochastic biology are required. To address these limitations, we introduce scDiffEq, a generative framework for learning neural stochastic differential equations that approximate biology’s deterministic and stochastic dynamics. Using lineage-traced single-cell data, we demonstrate that scDiffEq offers an improved reconstruction of cell trajectories and prediction of cell fate from multipotent progenitors during haematopoiesis. By imparting in silico perturbations to multipotent progenitor cells, we find that scDiffEq accurately recapitulates the dynamics of CRISPR-perturbed haematopoiesis. We generalize this approach beyond lineage-traced or multi-time-point datasets to model the dynamics of single-cell data from a single time point. Using scDiffEq, we simulate high-resolution developmental cell trajectories, which can model their drift and diffusion, enabling us to study their time-dependent gene-level dynamics. Vinyard et al. present a generative method to model cell dynamics using neural stochastic differential equations that learn state-dependent drift and diffusion, outperforming existing approaches and enabling perturbation studies of development and disease.
单细胞测序测量通过捕获单个细胞的快照分子概况,促进动态生物学的重建。细胞在发育和疾病中的命运决定是通过确定性和随机调节事件的复杂平衡而精心安排的。漂移扩散方程在模拟高维单细胞测量的单细胞动力学方面是有效的。虽然现有的解决方案描述了在细胞状态水平上与这些方程的漂移项相关的确定性动力学,但扩散被建模为跨细胞状态的常数。为了充分理解发育和疾病中的动态调控逻辑,需要明确调整确定性生物学和随机生物学之间平衡的模型。为了解决这些限制,我们引入了scDiffEq,这是一个用于学习神经随机微分方程的生成框架,它近似于生物学的确定性和随机动力学。利用单细胞谱系追踪数据,我们证明scDiffEq在造血过程中提供了更好的细胞轨迹重建和来自多能祖细胞命运预测。通过对多能祖细胞施加硅干扰,我们发现scDiffEq准确地概括了crispr干扰的造血动力学。我们将这种方法推广到谱系跟踪或多时间点数据集之外,以从单个时间点对单细胞数据的动态建模。利用scDiffEq,我们模拟了高分辨率的发育细胞轨迹,可以模拟它们的漂移和扩散,使我们能够研究它们的时间依赖性基因水平动力学。Vinyard等人提出了一种生成方法,利用神经随机微分方程来模拟细胞动力学,该方程学习依赖状态的漂移和扩散,优于现有方法,并使发育和疾病的扰动研究成为可能。
{"title":"Learning cell dynamics with neural differential equations","authors":"Michael E. Vinyard, Anders W. Rasmussen, Ruitong Li, Allon M. Klein, Gad Getz, Luca Pinello","doi":"10.1038/s42256-025-01150-3","DOIUrl":"10.1038/s42256-025-01150-3","url":null,"abstract":"Single-cell sequencing measurements facilitate the reconstruction of dynamic biology by capturing snapshot molecular profiles of individual cells. Cell fate decisions in development and disease are orchestrated through an intricate balance of deterministic and stochastic regulatory events. Drift-diffusion equations are effective in modelling single-cell dynamics from high-dimensional single-cell measurements. While existing solutions describe the deterministic dynamics associated with the drift term of these equations at the level of cell state, diffusion is modelled as a constant across cell states. To fully understand the dynamic regulatory logic in development and disease, models explicitly attuned to the balance between deterministic and stochastic biology are required. To address these limitations, we introduce scDiffEq, a generative framework for learning neural stochastic differential equations that approximate biology’s deterministic and stochastic dynamics. Using lineage-traced single-cell data, we demonstrate that scDiffEq offers an improved reconstruction of cell trajectories and prediction of cell fate from multipotent progenitors during haematopoiesis. By imparting in silico perturbations to multipotent progenitor cells, we find that scDiffEq accurately recapitulates the dynamics of CRISPR-perturbed haematopoiesis. We generalize this approach beyond lineage-traced or multi-time-point datasets to model the dynamics of single-cell data from a single time point. Using scDiffEq, we simulate high-resolution developmental cell trajectories, which can model their drift and diffusion, enabling us to study their time-dependent gene-level dynamics. Vinyard et al. present a generative method to model cell dynamics using neural stochastic differential equations that learn state-dependent drift and diffusion, outperforming existing approaches and enabling perturbation studies of development and disease.","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"7 12","pages":"1969-1984"},"PeriodicalIF":23.9,"publicationDate":"2025-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145770795","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A psychometric framework for evaluating and shaping personality traits in large language models 在大型语言模型中评估和塑造人格特征的心理测量框架
IF 23.9 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-18 DOI: 10.1038/s42256-025-01115-6
Gregory Serapio-García, Mustafa Safdari, Clément Crepy, Luning Sun, Stephen Fitz, Peter Romero, Marwa Abdulhai, Aleksandra Faust, Maja Matarić
The advent of large language models (LLMs) has revolutionized natural language processing, enabling the generation of coherent and contextually relevant human-like text. As LLMs increasingly power conversational agents used by the general public worldwide, the synthetic personality traits embedded in these models by virtue of training on large amounts of human data are becoming increasingly important to evaluate. The style in which LLMs respond can mimic different human personality traits. Here, as these patterns can be a key factor determining the effectiveness of communication, we present a comprehensive psychometric methodology for administering and validating personality tests on widely used LLMs, as well as for shaping personality in the generated text of such LLMs. Applying this method to 18 LLMs, we found that: personality measurements in the outputs of some LLMs under specific prompting configurations are reliable and valid; evidence of reliability and validity of synthetic LLM personality is stronger for larger and instruction-fine-tuned models; and personality in LLM outputs can be shaped along desired dimensions to mimic specific human personality profiles. We discuss the application and ethical implications of the measurement and shaping method, in particular regarding responsible artificial intelligence. Serapio-García, Safdari and colleagues develop a method based on psychometric tests to measure and validate personality-like traits in LLMs. Large, instruction-tuned models give reliable personality measurement results, and specific personality profiles can be mimicked in downstream tasks.
大型语言模型(llm)的出现彻底改变了自然语言处理,使生成连贯且与上下文相关的类人文本成为可能。随着法学硕士越来越强大的对话代理被全世界的公众所使用,通过对大量人类数据的训练,嵌入在这些模型中的综合人格特征的评估变得越来越重要。法学硕士的回应风格可以模仿不同的人类性格特征。在这里,由于这些模式可能是决定沟通有效性的关键因素,我们提出了一种全面的心理测量方法,用于管理和验证广泛使用的法学硕士的性格测试,以及在这些法学硕士生成的文本中塑造个性。将该方法应用于18个法学硕士,我们发现:在特定提示配置下,一些法学硕士的输出中人格测量是可靠有效的;综合LLM人格的信度和效度证据在更大的模型和教学微调模型中更强;法学硕士输出中的人格可以沿着所需的维度进行塑造,以模仿特定的人类人格特征。我们讨论了测量和塑造方法的应用和伦理含义,特别是关于负责任的人工智能。在Serapio-García, Safdari和他的同事开发了一种基于心理测试的方法来测量和验证法学硕士的人格特征。大型的、指令调优的模型提供可靠的人格测量结果,并且特定的人格概况可以在下游任务中模仿。
{"title":"A psychometric framework for evaluating and shaping personality traits in large language models","authors":"Gregory Serapio-García, Mustafa Safdari, Clément Crepy, Luning Sun, Stephen Fitz, Peter Romero, Marwa Abdulhai, Aleksandra Faust, Maja Matarić","doi":"10.1038/s42256-025-01115-6","DOIUrl":"10.1038/s42256-025-01115-6","url":null,"abstract":"The advent of large language models (LLMs) has revolutionized natural language processing, enabling the generation of coherent and contextually relevant human-like text. As LLMs increasingly power conversational agents used by the general public worldwide, the synthetic personality traits embedded in these models by virtue of training on large amounts of human data are becoming increasingly important to evaluate. The style in which LLMs respond can mimic different human personality traits. Here, as these patterns can be a key factor determining the effectiveness of communication, we present a comprehensive psychometric methodology for administering and validating personality tests on widely used LLMs, as well as for shaping personality in the generated text of such LLMs. Applying this method to 18 LLMs, we found that: personality measurements in the outputs of some LLMs under specific prompting configurations are reliable and valid; evidence of reliability and validity of synthetic LLM personality is stronger for larger and instruction-fine-tuned models; and personality in LLM outputs can be shaped along desired dimensions to mimic specific human personality profiles. We discuss the application and ethical implications of the measurement and shaping method, in particular regarding responsible artificial intelligence. Serapio-García, Safdari and colleagues develop a method based on psychometric tests to measure and validate personality-like traits in LLMs. Large, instruction-tuned models give reliable personality measurement results, and specific personality profiles can be mimicked in downstream tasks.","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"7 12","pages":"1954-1968"},"PeriodicalIF":23.9,"publicationDate":"2025-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.nature.comhttps://www.nature.com/articles/s42256-025-01115-6.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145770796","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Nature Machine Intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1