Nature Machine Intelligence最新文献

英文中文

Investigating machine moral judgement through the Delphi experiment 通过德尔菲实验研究机器的道德判断

IF 18.8 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Nature Machine Intelligence

Pub Date : 2025-01-13 DOI: 10.1038/s42256-024-00969-6

Liwei Jiang, Jena D. Hwang, Chandra Bhagavatula, Ronan Le Bras, Jenny T. Liang, Sydney Levine, Jesse Dodge, Keisuke Sakaguchi, Maxwell Forbes, Jack Hessel, Jon Borchardt, Taylor Sorensen, Saadia Gabriel, Yulia Tsvetkov, Oren Etzioni, Maarten Sap, Regina Rini, Yejin Choi

As our society adopts increasingly powerful artificial intelligence (AI) systems for pervasive use, there are growing concerns about machine morality—or lack thereof. Millions of users already rely on the outputs of AI systems, such as chatbots, as decision aids. Meanwhile, AI researchers continue to grapple with the challenge of aligning these systems with human morality and values. In response to this challenge, we build and test Delphi, an open-source AI system trained to predict the moral judgements of US participants. The computational framework of Delphi is grounded in the framework proposed by the prominent moral philosopher John Rawls. Our results speak to the promises and limits of teaching machines about human morality. Delphi demonstrates improved generalization capabilities over those exhibited by off-the-shelf neural language models. At the same time, Delphi’s failures also underscore important challenges in this arena. For instance, Delphi has limited cultural awareness and is susceptible to pervasive biases. Despite these shortcomings, we demonstrate several compelling use cases of Delphi, including its incorporation as a component within an ensemble of AI systems. Finally, we computationally demonstrate the potential of Rawls’s prospect of hybrid approaches for reliable moral reasoning, inspiring future research in computational morality. Aligning artificial intelligence systems with human morality poses scientific, societal and ethical challenges. Delphi, an artificial intelligence system designed to predict human moral judgements based on John Rawls’s philosophical framework, is developed and tested, highlighting its potential for ethical applications and emphasizing the need to address its limitations and biases.

随着我们的社会采用越来越强大的人工智能（AI）系统进行广泛使用，人们越来越担心机器道德——或者缺乏道德。数以百万计的用户已经依赖聊天机器人等人工智能系统的输出作为决策辅助。与此同时，人工智能研究人员继续努力应对使这些系统与人类道德和价值观保持一致的挑战。为了应对这一挑战，我们建立并测试了Delphi，这是一个开源的人工智能系统，经过训练可以预测美国参与者的道德判断。德尔菲的计算框架以著名道德哲学家约翰·罗尔斯提出的框架为基础。我们的研究结果说明了教导机器关于人类道德的承诺和局限性。Delphi展示了比现成的神经语言模型更好的泛化能力。与此同时，德尔福的失败也凸显了这一领域的重大挑战。例如，德尔福的文化意识有限，容易受到普遍偏见的影响。尽管存在这些缺点，我们还是展示了Delphi的几个引人注目的用例，包括将其作为AI系统集成中的一个组件。最后，我们在计算上展示了罗尔斯对可靠道德推理的混合方法的前景的潜力，启发了计算道德的未来研究。

{"title":"Investigating machine moral judgement through the Delphi experiment","authors":"Liwei Jiang, Jena D. Hwang, Chandra Bhagavatula, Ronan Le Bras, Jenny T. Liang, Sydney Levine, Jesse Dodge, Keisuke Sakaguchi, Maxwell Forbes, Jack Hessel, Jon Borchardt, Taylor Sorensen, Saadia Gabriel, Yulia Tsvetkov, Oren Etzioni, Maarten Sap, Regina Rini, Yejin Choi","doi":"10.1038/s42256-024-00969-6","DOIUrl":"10.1038/s42256-024-00969-6","url":null,"abstract":"As our society adopts increasingly powerful artificial intelligence (AI) systems for pervasive use, there are growing concerns about machine morality—or lack thereof. Millions of users already rely on the outputs of AI systems, such as chatbots, as decision aids. Meanwhile, AI researchers continue to grapple with the challenge of aligning these systems with human morality and values. In response to this challenge, we build and test Delphi, an open-source AI system trained to predict the moral judgements of US participants. The computational framework of Delphi is grounded in the framework proposed by the prominent moral philosopher John Rawls. Our results speak to the promises and limits of teaching machines about human morality. Delphi demonstrates improved generalization capabilities over those exhibited by off-the-shelf neural language models. At the same time, Delphi’s failures also underscore important challenges in this arena. For instance, Delphi has limited cultural awareness and is susceptible to pervasive biases. Despite these shortcomings, we demonstrate several compelling use cases of Delphi, including its incorporation as a component within an ensemble of AI systems. Finally, we computationally demonstrate the potential of Rawls’s prospect of hybrid approaches for reliable moral reasoning, inspiring future research in computational morality. Aligning artificial intelligence systems with human morality poses scientific, societal and ethical challenges. Delphi, an artificial intelligence system designed to predict human moral judgements based on John Rawls’s philosophical framework, is developed and tested, highlighting its potential for ethical applications and emphasizing the need to address its limitations and biases.","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"7 1","pages":"145-160"},"PeriodicalIF":18.8,"publicationDate":"2025-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.nature.com/articles/s42256-024-00969-6.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142968287","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Towards highly sensitive deep learning-based end-to-end database search for tandem mass spectrometry 迈向高度敏感的基于深度学习的端到端串联质谱数据库搜索

IF 18.8 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Nature Machine Intelligence

Pub Date : 2025-01-06 DOI: 10.1038/s42256-024-00960-1

Yonghan Yu, Ming Li

Peptide identification in mass spectrometry-based proteomics is crucial for understanding protein function and dynamics. Traditional database search methods, though widely used, rely on heuristic scoring functions, and statistical estimations must be introduced to achieve a higher identification rate. Here we introduce DeepSearch, a deep learning-based end-to-end database search method for tandem mass spectrometry. DeepSearch leverages a modified transformer-based encoder–decoder architecture under the contrastive learning framework. Unlike conventional methods, which rely on ion-to-ion matching, DeepSearch adopts a data-driven approach to score peptide–spectrum matches. DeepSearch can also profile variable post-translational modifications in a zero-shot manner. We show that DeepSearch’s scoring scheme expresses less bias and does not require any statistical estimation. We validate DeepSearch’s accuracy and robustness across various datasets, including those from species with diverse protein compositions and a modification-enriched dataset. DeepSearch sheds new light on database search methods in tandem mass spectrometry. Yu and Li present DeepSearch, a deep learning-based method for peptide identification in mass spectrometry, offering unbiased, data-driven scoring without statistical estimation. It accurately profiles post-translational modifications in a zero-shot manner.

基于质谱的蛋白质组学中的肽鉴定对于理解蛋白质的功能和动力学至关重要。传统的数据库搜索方法虽然被广泛使用，但依赖于启发式评分函数，必须引入统计估计才能达到更高的识别率。本文介绍了一种基于深度学习的串联质谱端到端数据库搜索方法DeepSearch。DeepSearch在对比学习框架下利用改进的基于变压器的编码器-解码器架构。与依赖离子对离子匹配的传统方法不同，DeepSearch采用数据驱动的方法对肽谱匹配进行评分。DeepSearch还可以以零采样的方式分析翻译后的变化。我们证明了DeepSearch的评分方案表达了更少的偏差，并且不需要任何统计估计。我们在各种数据集上验证了DeepSearch的准确性和稳健性，包括来自具有不同蛋白质组成的物种的数据集和修改丰富的数据集。DeepSearch为串联质谱的数据库搜索方法提供了新的思路。

{"title":"Towards highly sensitive deep learning-based end-to-end database search for tandem mass spectrometry","authors":"Yonghan Yu, Ming Li","doi":"10.1038/s42256-024-00960-1","DOIUrl":"10.1038/s42256-024-00960-1","url":null,"abstract":"Peptide identification in mass spectrometry-based proteomics is crucial for understanding protein function and dynamics. Traditional database search methods, though widely used, rely on heuristic scoring functions, and statistical estimations must be introduced to achieve a higher identification rate. Here we introduce DeepSearch, a deep learning-based end-to-end database search method for tandem mass spectrometry. DeepSearch leverages a modified transformer-based encoder–decoder architecture under the contrastive learning framework. Unlike conventional methods, which rely on ion-to-ion matching, DeepSearch adopts a data-driven approach to score peptide–spectrum matches. DeepSearch can also profile variable post-translational modifications in a zero-shot manner. We show that DeepSearch’s scoring scheme expresses less bias and does not require any statistical estimation. We validate DeepSearch’s accuracy and robustness across various datasets, including those from species with diverse protein compositions and a modification-enriched dataset. DeepSearch sheds new light on database search methods in tandem mass spectrometry. Yu and Li present DeepSearch, a deep learning-based method for peptide identification in mass spectrometry, offering unbiased, data-driven scoring without statistical estimation. It accurately profiles post-translational modifications in a zero-shot manner.","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"7 1","pages":"85-95"},"PeriodicalIF":18.8,"publicationDate":"2025-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142929493","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Reusability report: Deep learning-based analysis of images and spectroscopy data with AtomAI 可重用性报告：使用AtomAI进行基于深度学习的图像和光谱数据分析

IF 18.8 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Nature Machine Intelligence

Pub Date : 2025-01-02 DOI: 10.1038/s42256-024-00958-9

Pragalbh Vashishtha, Hitesh Gupta Kattamuri, Nikhil Thawari, Murugaiyan Amirthalingam, Rohit Batra

Machine learning (ML) techniques are gaining traction for materials image processing applications. In this context, Ziatdinov et al. developed AtomAI, a user-friendly and comprehensive Python library designed for a wide range of materials imaging tasks, including image segmentation, denoising, image generation, image-to-spectrum mapping (and vice versa) and subsequent atomistic modelling of image-resolved structures. Given its broad applicability, this report aims to reproduce key aspects of the authors’ original work, extend its capabilities to new materials datasets and enhance certain features to improve model performance. We have not only successfully replicated parts of the original study, but also developed improved ML models for multiple datasets across different image processing tasks. The AtomAI library was found to be easy to use and extensible for custom applications. We believe that AtomAI holds significant potential for the microscopy and spectroscopy communities, and further development—such as semi-automated image segmentation—could broaden its utility and impact. Vashishtha and colleagues test and reuse AtomAI, a machine learning framework developed for analysing microscopy data, for a range of materials characterization tasks.

机器学习（ML）技术正在获得材料图像处理应用的牵引力。在此背景下，Ziatdinov等人开发了AtomAI，这是一个用户友好且全面的Python库，专为广泛的材料成像任务而设计，包括图像分割，去噪，图像生成，图像到光谱映射（反之亦然）以及随后的图像分辨率结构的原子建模。鉴于其广泛的适用性，本报告旨在重现作者原始工作的关键方面，将其功能扩展到新材料数据集，并增强某些特征以提高模型性能。我们不仅成功地复制了原始研究的部分内容，而且还针对不同图像处理任务的多个数据集开发了改进的ML模型。人们发现AtomAI库易于使用，并且可扩展到自定义应用程序。我们相信AtomAI在显微镜和光谱学领域具有巨大的潜力，进一步的发展，比如半自动图像分割，可以扩大它的效用和影响。

{"title":"Reusability report: Deep learning-based analysis of images and spectroscopy data with AtomAI","authors":"Pragalbh Vashishtha, Hitesh Gupta Kattamuri, Nikhil Thawari, Murugaiyan Amirthalingam, Rohit Batra","doi":"10.1038/s42256-024-00958-9","DOIUrl":"10.1038/s42256-024-00958-9","url":null,"abstract":"Machine learning (ML) techniques are gaining traction for materials image processing applications. In this context, Ziatdinov et al. developed AtomAI, a user-friendly and comprehensive Python library designed for a wide range of materials imaging tasks, including image segmentation, denoising, image generation, image-to-spectrum mapping (and vice versa) and subsequent atomistic modelling of image-resolved structures. Given its broad applicability, this report aims to reproduce key aspects of the authors’ original work, extend its capabilities to new materials datasets and enhance certain features to improve model performance. We have not only successfully replicated parts of the original study, but also developed improved ML models for multiple datasets across different image processing tasks. The AtomAI library was found to be easy to use and extensible for custom applications. We believe that AtomAI holds significant potential for the microscopy and spectroscopy communities, and further development—such as semi-automated image segmentation—could broaden its utility and impact. Vashishtha and colleagues test and reuse AtomAI, a machine learning framework developed for analysing microscopy data, for a range of materials characterization tasks.","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"7 1","pages":"79-84"},"PeriodicalIF":18.8,"publicationDate":"2025-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142911796","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

ARNLE model identifies prevalence potential of SARS-CoV-2 variants ARNLE模型确定了SARS-CoV-2变体的流行潜力

IF 18.8 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Nature Machine Intelligence

Pub Date : 2024-12-31 DOI: 10.1038/s42256-024-00919-2

Yuqi Liu, Jing Li, Peihan Li, Yehong Yang, Kaiying Wang, Jinhui Li, Lang Yang, Jiangfeng Liu, Leili Jia, Aiping Wu, Juntao Yang, Peng Li, Hongbin Song

SARS-CoV-2 mutations accumulated during the COVID-19 pandemic, posing significant challenges for immune prevention. An optimistic perspective suggests that SARS-CoV-2 will become more tropic to humans with weaker virulence and stronger infectivity. However, tracing a quantified trajectory of this process remains difficult. Here we introduce an attentional recurrent network based on language embedding (ARNLE) framework to analyse the shift in SARS-CoV-2 host tropism towards humans. ARNLE incorporates a language model for self-supervised learning to capture the features of amino acid sequences, alongside a supervised bidirectional long-short-term-memory-based network to discern the relationship between mutations and host tropism among coronaviruses. We identified a shift in SARS-CoV-2 tropism from weak to strong, transitioning from an approximate Chiroptera coronavirus to a primate-tropic coronavirus. Delta variants were closer to other common primate coronaviruses than previous SARS-CoV-2 variants. A similar phenomenon was observed among the Omicron variants. We employed a Bayesian-based post hoc explanation method to analyse key mutations influencing the human tropism of SARS-CoV-2. ARNLE identified pivotal mutations in the spike proteins, including T478K, L452R, G142D and so on, as the top determinants of human tropism. Our findings suggest that language models like ARNLE will significantly facilitate the identification of potentially prevalent variants and provide important support for screening key mutations, aiding in timely update of vaccines to protect against future emerging SARS-CoV-2 variants. Liu et al. developed a framework called ARNLE to explore host tropism of SARS-CoV-2 and found a shift from weak to strong primate tropism. Key mutations involved in this shift can be analysed to advance research on emerging viruses.

SARS-CoV-2突变在COVID-19大流行期间积累，给免疫预防带来重大挑战。从乐观的角度来看，SARS-CoV-2对人类的影响将更大，毒性更弱，传染性更强。然而，追踪这一过程的量化轨迹仍然很困难。在这里，我们引入了一个基于语言嵌入的关注循环网络（ARNLE）框架来分析SARS-CoV-2宿主向人类的转移。ARNLE结合了一个用于自我监督学习的语言模型来捕捉氨基酸序列的特征，以及一个基于监督的双向长短期记忆网络，以识别冠状病毒之间的突变与宿主亲和性之间的关系。我们发现SARS-CoV-2的趋向性从弱向强转变，从一种近似翼目冠状病毒过渡到一种灵长类趋向性冠状病毒。Delta变体比以前的SARS-CoV-2变体更接近其他常见的灵长类冠状病毒。在欧米克隆变异中也观察到类似的现象。我们采用基于贝叶斯的事后解释方法分析了影响SARS-CoV-2人类趋向性的关键突变。ARNLE发现刺突蛋白中的关键突变，包括T478K、L452R、G142D等，是人类向性的主要决定因素。我们的研究结果表明，像ARNLE这样的语言模型将极大地促进潜在流行变体的识别，并为筛选关键突变提供重要支持，帮助及时更新疫苗，以防止未来出现的SARS-CoV-2变体。

{"title":"ARNLE model identifies prevalence potential of SARS-CoV-2 variants","authors":"Yuqi Liu, Jing Li, Peihan Li, Yehong Yang, Kaiying Wang, Jinhui Li, Lang Yang, Jiangfeng Liu, Leili Jia, Aiping Wu, Juntao Yang, Peng Li, Hongbin Song","doi":"10.1038/s42256-024-00919-2","DOIUrl":"10.1038/s42256-024-00919-2","url":null,"abstract":"SARS-CoV-2 mutations accumulated during the COVID-19 pandemic, posing significant challenges for immune prevention. An optimistic perspective suggests that SARS-CoV-2 will become more tropic to humans with weaker virulence and stronger infectivity. However, tracing a quantified trajectory of this process remains difficult. Here we introduce an attentional recurrent network based on language embedding (ARNLE) framework to analyse the shift in SARS-CoV-2 host tropism towards humans. ARNLE incorporates a language model for self-supervised learning to capture the features of amino acid sequences, alongside a supervised bidirectional long-short-term-memory-based network to discern the relationship between mutations and host tropism among coronaviruses. We identified a shift in SARS-CoV-2 tropism from weak to strong, transitioning from an approximate Chiroptera coronavirus to a primate-tropic coronavirus. Delta variants were closer to other common primate coronaviruses than previous SARS-CoV-2 variants. A similar phenomenon was observed among the Omicron variants. We employed a Bayesian-based post hoc explanation method to analyse key mutations influencing the human tropism of SARS-CoV-2. ARNLE identified pivotal mutations in the spike proteins, including T478K, L452R, G142D and so on, as the top determinants of human tropism. Our findings suggest that language models like ARNLE will significantly facilitate the identification of potentially prevalent variants and provide important support for screening key mutations, aiding in timely update of vaccines to protect against future emerging SARS-CoV-2 variants. Liu et al. developed a framework called ARNLE to explore host tropism of SARS-CoV-2 and found a shift from weak to strong primate tropism. Key mutations involved in this shift can be analysed to advance research on emerging viruses.","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"7 1","pages":"18-28"},"PeriodicalIF":18.8,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142904754","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Sequential memory improves sample and memory efficiency in episodic control 顺序记忆提高了情景控制的样本和记忆效率

IF 18.8 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Nature Machine Intelligence

Pub Date : 2024-12-31 DOI: 10.1038/s42256-024-00950-3

Ismael T. Freire, Adrián F. Amil, Paul F. M. J. Verschure

Deep reinforcement learning algorithms are known for their sample inefficiency, requiring extensive episodes to reach optimal performance. Episodic reinforcement learning algorithms aim to overcome this issue by using extended memory systems to leverage past experiences. However, these memory augmentations are often used as mere buffers, from which isolated events are resampled for offline learning (for example, replay). In this Article, we introduce Sequential Episodic Control (SEC), a hippocampal-inspired model that stores entire event sequences in their temporal order and employs a sequential bias in their retrieval to guide actions. We evaluate SEC across various benchmarks from the Animal-AI testbed, demonstrating its superior performance and sample efficiency compared to several state-of-the-art models, including Model-Free Episodic Control, Deep Q-Network and Episodic Reinforcement Learning with Associative Memory. Our experiments show that SEC achieves higher rewards and faster policy convergence in tasks requiring memory and decision-making. Additionally, we investigate the effects of memory constraints and forgetting mechanisms, revealing that prioritized forgetting enhances both performance and policy stability. Further, ablation studies demonstrate the critical role of the sequential memory component in SEC. Finally, we discuss how fast, sequential hippocampal-like episodic memory systems could support both habit formation and deliberation in artificial and biological systems. Previous studies have explored the integration of episodic memory into reinforcement learning and control. Inspired by hippocampal memory, Freire et al. develop a model that improves learning speed and stability by storing experiences as sequences, demonstrating resilience and efficiency under memory constraints.

深度强化学习算法以其样本效率低下而闻名，需要大量的剧集才能达到最佳性能。情景强化学习算法旨在通过使用扩展记忆系统来利用过去的经验来克服这个问题。然而，这些内存增强通常仅仅用作缓冲区，从中重新采样孤立的事件以进行离线学习（例如，重播）。在本文中，我们介绍了顺序情景控制（SEC），这是一种受海马启发的模型，它按时间顺序存储整个事件序列，并在检索过程中使用顺序偏差来指导行动。我们通过动物人工智能测试平台的各种基准对SEC进行了评估，与几种最先进的模型（包括无模型情景控制、深度q -网络和带有联想记忆的情景强化学习）相比，证明了SEC的卓越性能和样本效率。我们的实验表明，SEC在需要记忆和决策的任务中实现了更高的奖励和更快的策略收敛。此外，我们研究了记忆约束和遗忘机制的影响，揭示了优先遗忘提高了性能和策略稳定性。此外，消融研究证明了顺序记忆成分在SEC中的关键作用。最后，我们讨论了在人工和生物系统中，顺序海马样情景记忆系统如何快速地支持习惯形成和思考。

{"title":"Sequential memory improves sample and memory efficiency in episodic control","authors":"Ismael T. Freire, Adrián F. Amil, Paul F. M. J. Verschure","doi":"10.1038/s42256-024-00950-3","DOIUrl":"10.1038/s42256-024-00950-3","url":null,"abstract":"Deep reinforcement learning algorithms are known for their sample inefficiency, requiring extensive episodes to reach optimal performance. Episodic reinforcement learning algorithms aim to overcome this issue by using extended memory systems to leverage past experiences. However, these memory augmentations are often used as mere buffers, from which isolated events are resampled for offline learning (for example, replay). In this Article, we introduce Sequential Episodic Control (SEC), a hippocampal-inspired model that stores entire event sequences in their temporal order and employs a sequential bias in their retrieval to guide actions. We evaluate SEC across various benchmarks from the Animal-AI testbed, demonstrating its superior performance and sample efficiency compared to several state-of-the-art models, including Model-Free Episodic Control, Deep Q-Network and Episodic Reinforcement Learning with Associative Memory. Our experiments show that SEC achieves higher rewards and faster policy convergence in tasks requiring memory and decision-making. Additionally, we investigate the effects of memory constraints and forgetting mechanisms, revealing that prioritized forgetting enhances both performance and policy stability. Further, ablation studies demonstrate the critical role of the sequential memory component in SEC. Finally, we discuss how fast, sequential hippocampal-like episodic memory systems could support both habit formation and deliberation in artificial and biological systems. Previous studies have explored the integration of episodic memory into reinforcement learning and control. Inspired by hippocampal memory, Freire et al. develop a model that improves learning speed and stability by storing experiences as sequences, demonstrating resilience and efficiency under memory constraints.","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"7 1","pages":"43-55"},"PeriodicalIF":18.8,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142904749","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Delineating the effective use of self-supervised learning in single-cell genomics 描述自我监督学习在单细胞基因组学中的有效应用

IF 18.8 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Nature Machine Intelligence

Pub Date : 2024-12-27 DOI: 10.1038/s42256-024-00934-3

Till Richter, Mojtaba Bahrami, Yufan Xia, David S. Fischer, Fabian J. Theis

Self-supervised learning (SSL) has emerged as a powerful method for extracting meaningful representations from vast, unlabelled datasets, transforming computer vision and natural language processing. In single-cell genomics (SCG), representation learning offers insights into the complex biological data, especially with emerging foundation models. However, identifying scenarios in SCG where SSL outperforms traditional learning methods remains a nuanced challenge. Furthermore, selecting the most effective pretext tasks within the SSL framework for SCG is a critical yet unresolved question. Here we address this gap by adapting and benchmarking SSL methods in SCG, including masked autoencoders with multiple masking strategies and contrastive learning methods. Models trained on over 20 million cells were examined across multiple downstream tasks, including cell-type prediction, gene-expression reconstruction, cross-modality prediction and data integration. Our empirical analyses underscore the nuanced role of SSL, namely, in transfer learning scenarios leveraging auxiliary data or analysing unseen datasets. Masked autoencoders excel over contrastive methods in SCG, diverging from computer vision trends. Moreover, our findings reveal the notable capabilities of SSL in zero-shot settings and its potential in cross-modality prediction and data integration. In summary, we study SSL methods in SCG on fully connected networks and benchmark their utility across key representation learning scenarios. Self-supervised learning techniques are powerful assets for enabling deep insights into complex, unlabelled single-cell genomic data. Richter et al. here benchmark the applicability of self-supervised architectures into key downstream representation learning scenarios.

自监督学习（SSL）已经成为一种强大的方法，可以从大量未标记的数据集中提取有意义的表示，从而改变计算机视觉和自然语言处理。在单细胞基因组学（SCG）中，表示学习提供了对复杂生物数据的见解，特别是新兴的基础模型。然而，在SCG中确定SSL优于传统学习方法的场景仍然是一个微妙的挑战。此外，在SSL框架中为SCG选择最有效的借口任务是一个关键但尚未解决的问题。在这里，我们通过在SCG中对SSL方法进行调整和基准测试来解决这一差距，包括具有多种屏蔽策略的屏蔽自编码器和对比学习方法。在超过2000万个细胞上训练的模型在多个下游任务中进行了检查，包括细胞类型预测、基因表达重建、跨模态预测和数据整合。我们的实证分析强调了SSL的微妙作用，即在迁移学习场景中利用辅助数据或分析未见过的数据集。掩码自编码器优于对比方法在SCG，偏离计算机视觉趋势。此外，我们的研究结果揭示了SSL在零射击设置中的显着能力及其在跨模态预测和数据集成方面的潜力。总之，我们在完全连接的网络上研究了SCG中的SSL方法，并对它们在关键表示学习场景中的效用进行了基准测试。

{"title":"Delineating the effective use of self-supervised learning in single-cell genomics","authors":"Till Richter, Mojtaba Bahrami, Yufan Xia, David S. Fischer, Fabian J. Theis","doi":"10.1038/s42256-024-00934-3","DOIUrl":"10.1038/s42256-024-00934-3","url":null,"abstract":"Self-supervised learning (SSL) has emerged as a powerful method for extracting meaningful representations from vast, unlabelled datasets, transforming computer vision and natural language processing. In single-cell genomics (SCG), representation learning offers insights into the complex biological data, especially with emerging foundation models. However, identifying scenarios in SCG where SSL outperforms traditional learning methods remains a nuanced challenge. Furthermore, selecting the most effective pretext tasks within the SSL framework for SCG is a critical yet unresolved question. Here we address this gap by adapting and benchmarking SSL methods in SCG, including masked autoencoders with multiple masking strategies and contrastive learning methods. Models trained on over 20 million cells were examined across multiple downstream tasks, including cell-type prediction, gene-expression reconstruction, cross-modality prediction and data integration. Our empirical analyses underscore the nuanced role of SSL, namely, in transfer learning scenarios leveraging auxiliary data or analysing unseen datasets. Masked autoencoders excel over contrastive methods in SCG, diverging from computer vision trends. Moreover, our findings reveal the notable capabilities of SSL in zero-shot settings and its potential in cross-modality prediction and data integration. In summary, we study SSL methods in SCG on fully connected networks and benchmark their utility across key representation learning scenarios. Self-supervised learning techniques are powerful assets for enabling deep insights into complex, unlabelled single-cell genomic data. Richter et al. here benchmark the applicability of self-supervised architectures into key downstream representation learning scenarios.","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"7 1","pages":"68-78"},"PeriodicalIF":18.8,"publicationDate":"2024-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.nature.com/articles/s42256-024-00934-3.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142888262","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Seeking clarity rather than strong opinions on intelligence 在情报问题上寻求清晰而不是强烈的意见

IF 18.8 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Nature Machine Intelligence

Pub Date : 2024-12-18 DOI: 10.1038/s42256-024-00968-7

Clear descriptions of intelligence in both living organisms and machines are essential to avoid confusion, sharpen thinking and guide interdisciplinary research. A Comment in this issue encourages researchers to answer key questions to improve clarity on the terms they use.

要避免混淆、锐化思维并指导跨学科研究，就必须对生物体和机器的智能进行清晰的描述。本期的一篇评论鼓励研究人员回答关键问题，使他们使用的术语更加清晰。

引用次数: 0

Strategies needed to counter potential AI misuse 应对潜在的人工智能滥用的战略

IF 18.8 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Nature Machine Intelligence

Pub Date : 2024-12-18 DOI: 10.1038/s42256-024-00967-8

Researchers urgently need more guidance to help them identify and mitigate potential risks when designing projects that involve AI developments.

研究人员迫切需要更多的指导，以帮助他们在设计涉及人工智能开发的项目时识别和减轻潜在风险。

引用次数: 0

Leveraging ancestral sequence reconstruction for protein representation learning 利用祖先序列重建进行蛋白质表示学习

IF 18.8 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Nature Machine Intelligence

Pub Date : 2024-12-18 DOI: 10.1038/s42256-024-00935-2

D. S. Matthews, M. A. Spence, A. C. Mater, J. Nichols, S. B. Pulsford, M. Sandhu, J. A. Kaczmarski, C. M. Miton, N. Tokuriki, C. J. Jackson

Protein language models (PLMs) convert amino acid sequences into the numerical representations required to train machine learning models. Many PLMs are large (>600 million parameters) and trained on a broad span of protein sequence space. However, these models have limitations in terms of predictive accuracy and computational cost. Here we use multiplexed ancestral sequence reconstruction to generate small but focused functional protein sequence datasets for PLM training. Compared to large PLMs, this local ancestral sequence embedding produces representations with higher predictive accuracy. We show that due to the evolutionary nature of the ancestral sequence reconstruction data, local ancestral sequence embedding produces smoother fitness landscapes, in which protein variants that are closer in fitness value become numerically closer in representation space. This work contributes to the implementation of machine learning-based protein design in real-world settings, where data are sparse and computational resources are limited. Matthews et al. present a protein sequence embedding based on data from ancestral sequences that allows machine learning to be used for tasks where training data are scarce or expensive.

蛋白质语言模型（PLM）将氨基酸序列转换为训练机器学习模型所需的数字表示。许多蛋白质语言模型都很庞大（6 亿个参数），并在广泛的蛋白质序列空间中进行训练。然而，这些模型在预测准确性和计算成本方面存在局限性。在这里，我们使用多路复用祖先序列重建来生成小而集中的功能蛋白质序列数据集，用于PLM训练。与大型 PLM 相比，这种局部祖先序列嵌入产生的表征具有更高的预测准确性。我们的研究表明，由于祖先序列重构数据的进化性质，局部祖先序列嵌入会产生更平滑的适配性景观，在这种景观中，适配值更接近的蛋白质变体在表示空间的数值上也更接近。这项工作有助于在数据稀少、计算资源有限的实际环境中实现基于机器学习的蛋白质设计。

{"title":"Leveraging ancestral sequence reconstruction for protein representation learning","authors":"D. S. Matthews, M. A. Spence, A. C. Mater, J. Nichols, S. B. Pulsford, M. Sandhu, J. A. Kaczmarski, C. M. Miton, N. Tokuriki, C. J. Jackson","doi":"10.1038/s42256-024-00935-2","DOIUrl":"10.1038/s42256-024-00935-2","url":null,"abstract":"Protein language models (PLMs) convert amino acid sequences into the numerical representations required to train machine learning models. Many PLMs are large (>600 million parameters) and trained on a broad span of protein sequence space. However, these models have limitations in terms of predictive accuracy and computational cost. Here we use multiplexed ancestral sequence reconstruction to generate small but focused functional protein sequence datasets for PLM training. Compared to large PLMs, this local ancestral sequence embedding produces representations with higher predictive accuracy. We show that due to the evolutionary nature of the ancestral sequence reconstruction data, local ancestral sequence embedding produces smoother fitness landscapes, in which protein variants that are closer in fitness value become numerically closer in representation space. This work contributes to the implementation of machine learning-based protein design in real-world settings, where data are sparse and computational resources are limited. Matthews et al. present a protein sequence embedding based on data from ancestral sequences that allows machine learning to be used for tasks where training data are scarce or expensive.","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"6 12","pages":"1542-1555"},"PeriodicalIF":18.8,"publicationDate":"2024-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142841572","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Reply to: Limitations in odour recognition and generalization in a neuromorphic olfactory circuit 回复：气味识别的局限性和神经形态嗅觉回路的泛化

IF 18.8 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Nature Machine Intelligence

Pub Date : 2024-12-16 DOI: 10.1038/s42256-024-00951-2

Roy Moyal, Nabil Imam, Thomas A. Cleland

引用次数: 0

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Nature Machine Intelligence

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀