首页 > 最新文献

Nature Machine Intelligence最新文献

英文 中文
Structure as an inductive bias for brain–model alignment 结构作为脑模型对齐的归纳偏差
IF 23.9 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-04 DOI: 10.1038/s42256-025-01155-y
Binxu Wang, Carlos R. Ponce
Even before training, convolutional neural networks may reflect the brain’s visual processing principles. A study now shows how structure alone can help to explain the alignment between brains and models.
甚至在训练之前,卷积神经网络可能反映了大脑的视觉处理原理。现在的一项研究表明,结构本身如何有助于解释大脑和模型之间的一致性。
{"title":"Structure as an inductive bias for brain–model alignment","authors":"Binxu Wang, Carlos R. Ponce","doi":"10.1038/s42256-025-01155-y","DOIUrl":"10.1038/s42256-025-01155-y","url":null,"abstract":"Even before training, convolutional neural networks may reflect the brain’s visual processing principles. A study now shows how structure alone can help to explain the alignment between brains and models.","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"7 12","pages":"1895-1896"},"PeriodicalIF":23.9,"publicationDate":"2025-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145680424","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Empowering artificial intelligence with homomorphic encryption for secure deep reinforcement learning 用同态加密增强人工智能,实现安全的深度强化学习
IF 23.9 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-01 DOI: 10.1038/s42256-025-01135-2
Chi-Hieu Nguyen, Thai Hoang Dinh, Diep N. Nguyen, Kristin Lauter, Miran Kim
Deep reinforcement learning (DRL) demonstrates significant potential in solving complex control and decision-making problems, but it may inadvertently expose sensitive, environment-specific information, raising privacy and security concerns for computer systems, humans and organizations. This work introduces a privacy-preserving framework using homomorphic encryption and advanced learning algorithms to secure DRL processes. Our framework enables the encryption of sensitive information, including states, actions and rewards, before sharing it with an untrusted processing platform. This encryption ensures data privacy, prevents unauthorized access and maintains compliance with data protection laws throughout the learning process. In addition, we develop innovative algorithms to efficiently handle a wide range of encrypted control tasks. Our core innovation is the homomorphic encryption-compatible Adam optimizer, which reparameterizes momentum values to bypass the need for high-degree polynomial approximations of inverse square roots on encrypted data. This adaptation, previously unexplored in homomorphic encryption-based ML research, enables stable and efficient training with adaptive learning rates in encrypted domains, addressing a critical bottleneck for privacy-preserving DRL with sparse rewards. Evaluations on standard DRL benchmarks demonstrate that our encrypted DRL performs comparably with its unencrypted counterpart (with a gap of less than 10%) and maintaining data confidentiality with homomorphic encryption. This work facilitates the integration of privacy-preserving DRL into real-world applications, addressing critical privacy concerns, and promoting the ethical advancement of artificial intelligence. A secure artificial intelligence framework is introduced that leverages homomorphic encryption to safeguard sensitive information in deep reinforcement learning, achieving accurate decision-making and ensuring data privacy and confidentiality.
深度强化学习(DRL)在解决复杂的控制和决策问题方面显示出巨大的潜力,但它可能会无意中暴露敏感的、特定于环境的信息,提高计算机系统、人类和组织的隐私和安全问题。这项工作引入了一个使用同态加密和高级学习算法来保护DRL进程的隐私保护框架。我们的框架允许在与不受信任的处理平台共享之前对敏感信息(包括状态、操作和奖励)进行加密。这种加密可确保数据隐私,防止未经授权的访问,并在整个学习过程中遵守数据保护法。此外,我们还开发了创新的算法来有效地处理各种加密控制任务。我们的核心创新是兼容同态加密的Adam优化器,它重新参数化动量值,以绕过对加密数据的反平方根的高次多项式近似的需要。这种适应,以前未在基于同态加密的机器学习研究中探索过,能够在加密领域中以自适应学习率进行稳定有效的训练,解决了具有稀疏奖励的隐私保护DRL的关键瓶颈。对标准DRL基准的评估表明,我们加密的DRL的性能与未加密的DRL相当(差距小于10%),并且使用同态加密保持数据机密性。这项工作促进了将保护隐私的DRL集成到现实世界的应用中,解决了关键的隐私问题,并促进了人工智能的伦理进步。介绍了一种安全的人工智能框架,利用同态加密保护深度强化学习中的敏感信息,实现准确决策,确保数据隐私和机密性。
{"title":"Empowering artificial intelligence with homomorphic encryption for secure deep reinforcement learning","authors":"Chi-Hieu Nguyen, Thai Hoang Dinh, Diep N. Nguyen, Kristin Lauter, Miran Kim","doi":"10.1038/s42256-025-01135-2","DOIUrl":"10.1038/s42256-025-01135-2","url":null,"abstract":"Deep reinforcement learning (DRL) demonstrates significant potential in solving complex control and decision-making problems, but it may inadvertently expose sensitive, environment-specific information, raising privacy and security concerns for computer systems, humans and organizations. This work introduces a privacy-preserving framework using homomorphic encryption and advanced learning algorithms to secure DRL processes. Our framework enables the encryption of sensitive information, including states, actions and rewards, before sharing it with an untrusted processing platform. This encryption ensures data privacy, prevents unauthorized access and maintains compliance with data protection laws throughout the learning process. In addition, we develop innovative algorithms to efficiently handle a wide range of encrypted control tasks. Our core innovation is the homomorphic encryption-compatible Adam optimizer, which reparameterizes momentum values to bypass the need for high-degree polynomial approximations of inverse square roots on encrypted data. This adaptation, previously unexplored in homomorphic encryption-based ML research, enables stable and efficient training with adaptive learning rates in encrypted domains, addressing a critical bottleneck for privacy-preserving DRL with sparse rewards. Evaluations on standard DRL benchmarks demonstrate that our encrypted DRL performs comparably with its unencrypted counterpart (with a gap of less than 10%) and maintaining data confidentiality with homomorphic encryption. This work facilitates the integration of privacy-preserving DRL into real-world applications, addressing critical privacy concerns, and promoting the ethical advancement of artificial intelligence. A secure artificial intelligence framework is introduced that leverages homomorphic encryption to safeguard sensitive information in deep reinforcement learning, achieving accurate decision-making and ensuring data privacy and confidentiality.","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"7 12","pages":"1913-1926"},"PeriodicalIF":23.9,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145645246","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
What neuroscience can tell AI about learning in continuously changing environments 神经科学可以告诉人工智能如何在不断变化的环境中学习
IF 23.9 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-28 DOI: 10.1038/s42256-025-01146-z
Daniel Durstewitz, Bruno Averbeck, Georgia Koppe
Modern artificial intelligence (AI) models, such as large language models, are usually trained once on a huge corpus of data, potentially fine-tuned for a specific task and then deployed with fixed parameters. Their training is costly, slow and gradual, requiring billions of repetitions. In stark contrast, animals continuously adapt to the ever-changing contingencies in their environments. This is particularly important for social species, where behavioural policies and reward outcomes may frequently change in interaction with peers. The underlying computational processes are often marked by rapid shifts in an animal’s behaviour and rather sudden transitions in neuronal population activity. Such computational capacities are of growing importance for AI systems operating in the real world, like those guiding robots or autonomous vehicles, or for agentic AI interacting with humans online. Can AI learn from neuroscience? This Perspective explores this question, integrating the literature on continual and in-context learning in AI with the neuroscience of learning on behavioural tasks with shifting rules, reward probabilities or outcomes. We outline an agenda for how the links between neuroscience and AI could be tightened, thus supporting the transfer of ideas and findings between both areas and contributing to the evolving field of NeuroAI. Durstewitz et al. explore what artificial intelligence can learn from the brain’s ability to adjust quickly to changing environments. By linking neuroscience studies of flexible behaviour with advances in continual and in-context learning, this Perspective outlines ways to strengthen the exchange of ideas between the two fields and advance NeuroAI.
现代人工智能(AI)模型,如大型语言模型,通常在一个巨大的数据语料库上进行一次训练,可能会针对特定任务进行微调,然后使用固定的参数进行部署。它们的训练是昂贵的、缓慢的、渐进的,需要几十亿次的重复。与之形成鲜明对比的是,动物不断适应环境中不断变化的突发事件。这对群居物种尤其重要,因为它们的行为政策和奖励结果可能在与同伴的互动中经常发生变化。潜在的计算过程通常以动物行为的快速变化和神经元群活动的突然转变为特征。这种计算能力对于在现实世界中运行的人工智能系统越来越重要,比如那些引导机器人或自动驾驶汽车的系统,或者用于在线与人类互动的人工智能。人工智能能从神经科学中学习吗?本观点探讨了这个问题,将人工智能中持续和情境学习的文献与具有变化规则、奖励概率或结果的行为任务学习的神经科学相结合。我们概述了如何加强神经科学和人工智能之间的联系的议程,从而支持两个领域之间的思想和发现的转移,并为神经人工智能领域的发展做出贡献。Durstewitz等人探索了人工智能可以从大脑快速适应不断变化的环境的能力中学到什么。通过将灵活行为的神经科学研究与持续和情境学习的进展联系起来,本展望概述了加强两个领域之间思想交流和推进神经人工智能的方法。
{"title":"What neuroscience can tell AI about learning in continuously changing environments","authors":"Daniel Durstewitz, Bruno Averbeck, Georgia Koppe","doi":"10.1038/s42256-025-01146-z","DOIUrl":"10.1038/s42256-025-01146-z","url":null,"abstract":"Modern artificial intelligence (AI) models, such as large language models, are usually trained once on a huge corpus of data, potentially fine-tuned for a specific task and then deployed with fixed parameters. Their training is costly, slow and gradual, requiring billions of repetitions. In stark contrast, animals continuously adapt to the ever-changing contingencies in their environments. This is particularly important for social species, where behavioural policies and reward outcomes may frequently change in interaction with peers. The underlying computational processes are often marked by rapid shifts in an animal’s behaviour and rather sudden transitions in neuronal population activity. Such computational capacities are of growing importance for AI systems operating in the real world, like those guiding robots or autonomous vehicles, or for agentic AI interacting with humans online. Can AI learn from neuroscience? This Perspective explores this question, integrating the literature on continual and in-context learning in AI with the neuroscience of learning on behavioural tasks with shifting rules, reward probabilities or outcomes. We outline an agenda for how the links between neuroscience and AI could be tightened, thus supporting the transfer of ideas and findings between both areas and contributing to the evolving field of NeuroAI. Durstewitz et al. explore what artificial intelligence can learn from the brain’s ability to adjust quickly to changing environments. By linking neuroscience studies of flexible behaviour with advances in continual and in-context learning, this Perspective outlines ways to strengthen the exchange of ideas between the two fields and advance NeuroAI.","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"7 12","pages":"1897-1912"},"PeriodicalIF":23.9,"publicationDate":"2025-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145611610","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep generative classification of blood cell morphology 血细胞形态的深层生成分类
IF 23.9 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-19 DOI: 10.1038/s42256-025-01122-7
Simon Deltadahl, Julian Gilbey, Christine Van Laer, Nancy Boeckx, Mathie P. G. Leers, Tanya Freeman, Laura Aiken, Timothy Farren, Matthew Smith, Mohamad Zeina, Stephen MacDonald, Daniel Gleghorn, BloodCounts! consortium, James HF Rudd, Concetta Piazzese, Joseph Taylor, Nicholas Gleadall, Carola-Bibiane Schönlieb, Suthesh Sivapalaratnam, Michael Roberts, Parashkev Nachev
Blood cell morphology assessment via light microscopy constitutes a cornerstone of haematological diagnostics, providing crucial insights into diverse pathological conditions. This complex task demands expert interpretation owing to subtle morphological variations, biological heterogeneity and technical imaging factors that obstruct automated approaches. Conventional machine learning methods using discriminative models struggle with domain shifts, intraclass variability and rare morphological variants, constraining their clinical utility. We introduce CytoDiffusion, a diffusion-based generative classifier that faithfully models the distribution of blood cell morphology, combining accurate classification with robust anomaly detection, resistance to distributional shifts, interpretability, data efficiency and uncertainty quantification that surpasses clinical experts. Our approach outperforms state-of-the-art discriminative models in anomaly detection (area under the curve, 0.990 versus 0.916), resistance to domain shifts (0.854 versus 0.738 accuracy) and performance in low-data regimes (0.962 versus 0.924 balanced accuracy). In particular, CytoDiffusion generates synthetic blood cell images that expert haematologists cannot distinguish from real ones (accuracy, 0.523; 95% confidence interval: [0.505, 0.542]), demonstrating good command of the underlying distribution. Furthermore, we enhance model explainability through directly interpretable counterfactual heat maps. Our comprehensive evaluation framework establishes a multidimensional benchmark for medical image analysis in haematology, ultimately enabling improved diagnostic accuracy in clinical settings. Diffusion models are reframed by developing a generative blood cell classifier that performs reliably in low-data regimes, adapts to domain shifts, detects anomalies with robustness and provides uncertainty estimates that surpass clinical expert benchmarks.
通过光学显微镜进行血细胞形态评估是血液学诊断的基石,为不同的病理状况提供了重要的见解。由于细微的形态变化、生物异质性和技术成像因素阻碍了自动化方法,这项复杂的任务需要专家解释。使用判别模型的传统机器学习方法与领域转移、类内变异性和罕见的形态变异作斗争,限制了它们的临床应用。我们介绍了CytoDiffusion,这是一种基于扩散的生成分类器,它忠实地模拟了血细胞形态的分布,将准确的分类与鲁棒的异常检测、对分布变化的抵抗、可解释性、数据效率和不确定性量化相结合,超越了临床专家。我们的方法在异常检测(曲线下面积,0.990 vs 0.916)、抗域移(0.854 vs 0.738精度)和低数据状态下的性能(0.962 vs 0.924平衡精度)方面优于最先进的判别模型。特别是,CytoDiffusion生成的合成血细胞图像,血液专家无法将其与真实的血细胞图像区分开来(准确率为0.523;95%置信区间:[0.505,0.542]),显示出对潜在分布的良好掌握。此外,我们通过直接可解释的反事实热图增强了模型的可解释性。我们的综合评估框架建立了血液学医学图像分析的多维基准,最终提高了临床诊断的准确性。扩散模型通过开发再生血细胞分类器进行重构,该分类器在低数据状态下可靠地执行,适应域转移,鲁棒性检测异常,并提供超过临床专家基准的不确定性估计。
{"title":"Deep generative classification of blood cell morphology","authors":"Simon Deltadahl, Julian Gilbey, Christine Van Laer, Nancy Boeckx, Mathie P. G. Leers, Tanya Freeman, Laura Aiken, Timothy Farren, Matthew Smith, Mohamad Zeina, Stephen MacDonald, Daniel Gleghorn, BloodCounts! consortium, James HF Rudd, Concetta Piazzese, Joseph Taylor, Nicholas Gleadall, Carola-Bibiane Schönlieb, Suthesh Sivapalaratnam, Michael Roberts, Parashkev Nachev","doi":"10.1038/s42256-025-01122-7","DOIUrl":"10.1038/s42256-025-01122-7","url":null,"abstract":"Blood cell morphology assessment via light microscopy constitutes a cornerstone of haematological diagnostics, providing crucial insights into diverse pathological conditions. This complex task demands expert interpretation owing to subtle morphological variations, biological heterogeneity and technical imaging factors that obstruct automated approaches. Conventional machine learning methods using discriminative models struggle with domain shifts, intraclass variability and rare morphological variants, constraining their clinical utility. We introduce CytoDiffusion, a diffusion-based generative classifier that faithfully models the distribution of blood cell morphology, combining accurate classification with robust anomaly detection, resistance to distributional shifts, interpretability, data efficiency and uncertainty quantification that surpasses clinical experts. Our approach outperforms state-of-the-art discriminative models in anomaly detection (area under the curve, 0.990 versus 0.916), resistance to domain shifts (0.854 versus 0.738 accuracy) and performance in low-data regimes (0.962 versus 0.924 balanced accuracy). In particular, CytoDiffusion generates synthetic blood cell images that expert haematologists cannot distinguish from real ones (accuracy, 0.523; 95% confidence interval: [0.505, 0.542]), demonstrating good command of the underlying distribution. Furthermore, we enhance model explainability through directly interpretable counterfactual heat maps. Our comprehensive evaluation framework establishes a multidimensional benchmark for medical image analysis in haematology, ultimately enabling improved diagnostic accuracy in clinical settings. Diffusion models are reframed by developing a generative blood cell classifier that performs reliably in low-data regimes, adapts to domain shifts, detects anomalies with robustness and provides uncertainty estimates that surpass clinical expert benchmarks.","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"7 11","pages":"1791-1803"},"PeriodicalIF":23.9,"publicationDate":"2025-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.nature.comhttps://www.nature.com/articles/s42256-025-01122-7.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145547308","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reusability report: A distributed strategy for solving combinatorial optimization problems with hypergraph neural networks 可重用性报告:一种用超图神经网络解决组合优化问题的分布式策略
IF 23.9 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-19 DOI: 10.1038/s42256-025-01141-4
Xiaodi Li, Jianfeng Gui, Wei Xue, Baochuan Wang, Kai Chen, Pijing Wei, Junfeng Xia, Zhenyu Yue
The scalable solution to constrained combinatorial problems in high dimensions can address many challenges encountered in scientific and engineering disciplines. Inspired by the use of graph neural networks for quadratic-cost combinatorial optimization problems, Heydaribeni and colleagues proposed HypOp, which aims to efficiently solve general problems with higher-order constraints by leveraging hypergraph neural networks to extend previous algorithms to arbitrary cost functions. It incorporates a distributed training architecture to handle larger-scale tasks efficiently. Here we reproduce the primary experiments of HypOp and examine its robustness with respect to the number of graphics processing units, distributed partitioning strategies and fine-tuning methods. We also assess its transferability by applying it to the maximum clique problem and the quadratic assignment problem. The results validate the reusability of HypOp across diverse application scenarios. Furthermore, we provide guidelines offering practical insights for effectively applying it to multiple combinatorial optimization problems. HypOp is a scalable method for solving complex combinatorial problems. This study reproduces its results, tests its robustness, extends it to new tasks and provides practical guidelines for broader scientific applications.
高维约束组合问题的可扩展解决方案可以解决科学和工程学科中遇到的许多挑战。Heydaribeni及其同事受到图神经网络用于二次代价组合优化问题的启发,提出了HypOp,旨在利用超图神经网络将以前的算法扩展到任意代价函数,从而有效地解决具有高阶约束的一般问题。它结合了分布式训练架构来有效地处理大规模任务。在这里,我们重现了HypOp的主要实验,并检查了它在图形处理单元数量、分布式分区策略和微调方法方面的鲁棒性。并将其应用于最大团问题和二次分配问题,评价了其可转移性。结果验证了HypOp在不同应用场景中的可重用性。此外,我们还提供了指导方针,为有效地将其应用于多个组合优化问题提供了实际见解。假设假设是一种求解复杂组合问题的可扩展方法。这项研究再现了其结果,测试了其稳健性,将其扩展到新的任务中,并为更广泛的科学应用提供了实用指导。
{"title":"Reusability report: A distributed strategy for solving combinatorial optimization problems with hypergraph neural networks","authors":"Xiaodi Li, Jianfeng Gui, Wei Xue, Baochuan Wang, Kai Chen, Pijing Wei, Junfeng Xia, Zhenyu Yue","doi":"10.1038/s42256-025-01141-4","DOIUrl":"10.1038/s42256-025-01141-4","url":null,"abstract":"The scalable solution to constrained combinatorial problems in high dimensions can address many challenges encountered in scientific and engineering disciplines. Inspired by the use of graph neural networks for quadratic-cost combinatorial optimization problems, Heydaribeni and colleagues proposed HypOp, which aims to efficiently solve general problems with higher-order constraints by leveraging hypergraph neural networks to extend previous algorithms to arbitrary cost functions. It incorporates a distributed training architecture to handle larger-scale tasks efficiently. Here we reproduce the primary experiments of HypOp and examine its robustness with respect to the number of graphics processing units, distributed partitioning strategies and fine-tuning methods. We also assess its transferability by applying it to the maximum clique problem and the quadratic assignment problem. The results validate the reusability of HypOp across diverse application scenarios. Furthermore, we provide guidelines offering practical insights for effectively applying it to multiple combinatorial optimization problems. HypOp is a scalable method for solving complex combinatorial problems. This study reproduces its results, tests its robustness, extends it to new tasks and provides practical guidelines for broader scientific applications.","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"7 11","pages":"1870-1878"},"PeriodicalIF":23.9,"publicationDate":"2025-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145547309","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Pseudodata-based molecular structure generator to reveal unknown chemicals 伪数据为基础的分子结构生成器揭示未知的化学物质
IF 23.9 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-14 DOI: 10.1038/s42256-025-01140-5
Nanyang Yu, Zheng Ma, Qi Shao, Laihui Li, Xuebing Wang, Bingcai Pan, Hongxia Yu, Si Wei
Translating mass spectra into chemical structures is a central challenge in exposomics, making it difficult to quickly track the millions of chemicals found in humans and the environment. Unlike metabolomics, key problems in developing models for chemicals with a larger molecular space include data scarcity, model complexity and proper query strategy. Here we present a molecular structure generator (MSGo) that can generate structures directly from mass spectra and discover unknown polyfluorinated chemicals in the exposome. Trained with only virtual spectra using a transformer neural network, MSGo correctly identified 48% of structures in a validation set and was better at discovering new polyfluorinated chemicals in wastewater samples reported in the literature than experts. Applying probability-oriented masking to the virtual spectra is key to MSGo’s performance. Rapid discovery of chemicals with limited experimental mass spectral data using automated tools such as MSGo is key to tackling the current unknown polyfluorinated chemical crisis. Yu and colleagues present MSGo, an artificial intelligence exposomics tool trained on virtual mass spectra with masking that identifies pollutants by generating chemical structures that match measured spectral data.
将质谱转化为化学结构是暴露学的核心挑战,这使得快速追踪人类和环境中发现的数百万种化学物质变得困难。与代谢组学不同,开发具有更大分子空间的化学物质模型的关键问题包括数据稀缺、模型复杂性和适当的查询策略。本文介绍了一种分子结构发生器(MSGo),它可以直接从质谱中产生结构,并在暴露物中发现未知的多氟化学物质。仅使用变压器神经网络进行虚拟光谱训练,MSGo在验证集中正确识别了48%的结构,并且在发现文献中报道的废水样品中新的多氟化化学物质方面比专家更好。对虚谱应用面向概率的掩模是提高MSGo性能的关键。使用自动化工具(如MSGo)在有限的实验质谱数据下快速发现化学品是解决当前未知多氟化化学品危机的关键。Yu及其同事介绍了MSGo,这是一种人工智能暴露学工具,通过屏蔽虚拟质谱进行训练,通过生成与测量光谱数据相匹配的化学结构来识别污染物。
{"title":"Pseudodata-based molecular structure generator to reveal unknown chemicals","authors":"Nanyang Yu, Zheng Ma, Qi Shao, Laihui Li, Xuebing Wang, Bingcai Pan, Hongxia Yu, Si Wei","doi":"10.1038/s42256-025-01140-5","DOIUrl":"10.1038/s42256-025-01140-5","url":null,"abstract":"Translating mass spectra into chemical structures is a central challenge in exposomics, making it difficult to quickly track the millions of chemicals found in humans and the environment. Unlike metabolomics, key problems in developing models for chemicals with a larger molecular space include data scarcity, model complexity and proper query strategy. Here we present a molecular structure generator (MSGo) that can generate structures directly from mass spectra and discover unknown polyfluorinated chemicals in the exposome. Trained with only virtual spectra using a transformer neural network, MSGo correctly identified 48% of structures in a validation set and was better at discovering new polyfluorinated chemicals in wastewater samples reported in the literature than experts. Applying probability-oriented masking to the virtual spectra is key to MSGo’s performance. Rapid discovery of chemicals with limited experimental mass spectral data using automated tools such as MSGo is key to tackling the current unknown polyfluorinated chemical crisis. Yu and colleagues present MSGo, an artificial intelligence exposomics tool trained on virtual mass spectra with masking that identifies pollutants by generating chemical structures that match measured spectral data.","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"7 11","pages":"1879-1887"},"PeriodicalIF":23.9,"publicationDate":"2025-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145509633","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Large language models still struggle with false beliefs 大型语言模型仍在与错误信念作斗争
IF 23.9 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-13 DOI: 10.1038/s42256-025-01145-0
Kristian Kersting
A new benchmark, KaBLE (knowledge and belief language evaluation), indicates that some large language models are unable to accurately distinguish belief from knowledge and fact, calling into question their use in real-word applications such as medicine and law.
一个新的基准,able(知识和信念语言评估)表明,一些大型语言模型无法准确区分信念、知识和事实,这使得它们在现实世界中的应用(如医学和法律)受到质疑。
{"title":"Large language models still struggle with false beliefs","authors":"Kristian Kersting","doi":"10.1038/s42256-025-01145-0","DOIUrl":"10.1038/s42256-025-01145-0","url":null,"abstract":"A new benchmark, KaBLE (knowledge and belief language evaluation), indicates that some large language models are unable to accurately distinguish belief from knowledge and fact, calling into question their use in real-word applications such as medicine and law.","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"7 11","pages":"1778-1779"},"PeriodicalIF":23.9,"publicationDate":"2025-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145498438","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Solving sparse finite element problems on neuromorphic hardware 求解神经形态硬件稀疏有限元问题
IF 23.9 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-13 DOI: 10.1038/s42256-025-01143-2
Bradley H. Theilman, James B. Aimone
The finite element method (FEM) is one of the most important and ubiquitous numerical methods for solving partial differential equations (PDEs) on computers for scientific and engineering discovery. Applying the FEM to larger and more detailed scientific models has driven advances in high-performance computing for decades. Here we demonstrate that scalable spiking neuromorphic hardware can directly implement the FEM by constructing a spiking neural network that solves the large, sparse, linear systems of equations at the core of the FEM. We show that for the Poisson equation, a fundamental PDE in science and engineering, our neural circuit achieves meaningful levels of numerical accuracy and close to ideal scaling on modern, inherently parallel and energy-efficient neuromorphic hardware, specifically Intel’s Loihi 2 neuromorphic platform. We illustrate extensions to irregular mesh geometries in both two and three dimensions as well as other PDEs such as linear elasticity. Our spiking neural network is constructed from a recurrent network model of the brain’s motor cortex and, in contrast to black-box deep artificial neural network-based methods for PDEs, directly translates the well-understood and trusted mathematics of the FEM to a natively spiking neuromorphic algorithm. Theilman and Aimone introduce a natively spiking algorithm for solving partial differential equations on large-scale neuromorphic computers and demonstrate the algorithm on Intel’s Loihi 2 neuromorphic research chip.
有限元法(FEM)是在科学和工程发现的计算机上求解偏微分方程(PDEs)的最重要和最普遍的数值方法之一。几十年来,将FEM应用于更大、更详细的科学模型已经推动了高性能计算的进步。在这里,我们证明了可扩展的尖峰神经形态硬件可以通过构建一个尖峰神经网络来直接实现FEM,该网络可以解决FEM核心的大型、稀疏、线性方程组。我们表明,对于泊松方程(科学和工程中的基本PDE),我们的神经回路在现代、内在并行和节能的神经形态硬件(特别是英特尔的Loihi 2神经形态平台)上达到了有意义的数值精度水平,并接近理想的缩放。我们说明了扩展到不规则网格几何在二维和三维以及其他偏微分方程,如线性弹性。我们的脉冲神经网络是由大脑运动皮层的循环网络模型构建而成的,与基于黑箱深度人工神经网络的pde方法不同,它直接将FEM的良好理解和可信的数学转化为原生的脉冲神经形态算法。
{"title":"Solving sparse finite element problems on neuromorphic hardware","authors":"Bradley H. Theilman, James B. Aimone","doi":"10.1038/s42256-025-01143-2","DOIUrl":"10.1038/s42256-025-01143-2","url":null,"abstract":"The finite element method (FEM) is one of the most important and ubiquitous numerical methods for solving partial differential equations (PDEs) on computers for scientific and engineering discovery. Applying the FEM to larger and more detailed scientific models has driven advances in high-performance computing for decades. Here we demonstrate that scalable spiking neuromorphic hardware can directly implement the FEM by constructing a spiking neural network that solves the large, sparse, linear systems of equations at the core of the FEM. We show that for the Poisson equation, a fundamental PDE in science and engineering, our neural circuit achieves meaningful levels of numerical accuracy and close to ideal scaling on modern, inherently parallel and energy-efficient neuromorphic hardware, specifically Intel’s Loihi 2 neuromorphic platform. We illustrate extensions to irregular mesh geometries in both two and three dimensions as well as other PDEs such as linear elasticity. Our spiking neural network is constructed from a recurrent network model of the brain’s motor cortex and, in contrast to black-box deep artificial neural network-based methods for PDEs, directly translates the well-understood and trusted mathematics of the FEM to a natively spiking neuromorphic algorithm. Theilman and Aimone introduce a natively spiking algorithm for solving partial differential equations on large-scale neuromorphic computers and demonstrate the algorithm on Intel’s Loihi 2 neuromorphic research chip.","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"7 11","pages":"1845-1857"},"PeriodicalIF":23.9,"publicationDate":"2025-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.nature.comhttps://www.nature.com/articles/s42256-025-01143-2.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145498437","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
South Asian biases in language and vision models 语言和视觉模型中的南亚偏见
IF 23.9 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-13 DOI: 10.1038/s42256-025-01144-1
Mohammad Nadeem, Shahab Saquib Sohail, Erik Cambria, Shagufta Afreen
Biases in artificial intelligence models have been studied predominantly through Western lenses, overlooking South Asia’s unique contexts of caste, religion, colourism and representation. This Comment highlights region-specific biases in language and vision models and calls for fairness frameworks grounded in South Asian realities.
人工智能模型中的偏见主要是通过西方视角来研究的,忽视了南亚种姓、宗教、肤色歧视和代表性等独特背景。本评论强调了语言和视觉模型中的区域特定偏见,并呼吁建立基于南亚现实的公平框架。
{"title":"South Asian biases in language and vision models","authors":"Mohammad Nadeem, Shahab Saquib Sohail, Erik Cambria, Shagufta Afreen","doi":"10.1038/s42256-025-01144-1","DOIUrl":"10.1038/s42256-025-01144-1","url":null,"abstract":"Biases in artificial intelligence models have been studied predominantly through Western lenses, overlooking South Asia’s unique contexts of caste, religion, colourism and representation. This Comment highlights region-specific biases in language and vision models and calls for fairness frameworks grounded in South Asian realities.","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"7 11","pages":"1775-1777"},"PeriodicalIF":23.9,"publicationDate":"2025-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145498436","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Convolutional architectures are cortex-aligned de novo 卷积架构是从头对齐的
IF 23.9 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-13 DOI: 10.1038/s42256-025-01142-3
Atlas Kazemian, Eric Elmoznino, Michael F. Bonner
What underlies the emergence of cortex-aligned representations in deep neural network models of vision? Earlier work suggested that shared architectural constraints were a major factor, but the success of widely varied architectures after pretraining raises critical questions about the importance of architectural constraints. Here we show that in wide networks with minimal training, architectural inductive biases have a prominent role. We examined networks with varied architectures but no pretraining and quantified their ability to predict image representations in the visual cortices of monkeys and humans. We found that cortex-aligned representations emerge in convolutional architectures that combine two key manipulations of dimensionality: compression in the spatial domain, through pooling, and expansion in the feature domain by increasing the number of channels. We further show that the inductive biases of convolutional architectures are critical for obtaining performance gains from feature expansion—dimensionality manipulations were relatively ineffective in other architectures and in convolutional models with targeted lesions. Our findings suggest that the architectural constraints of convolutional networks are sufficiently close to the constraints of biological vision to allow many aspects of cortical visual representation to emerge even before synaptic connections have been tuned through experience. Kazemian et al. report that untrained convolutional networks with wide layers predict primate visual cortex responses nearly as well as task-optimized networks, revealing how architectural constraints shape brain-like representations in deep networks.
在视觉的深度神经网络模型中,是什么导致了皮层对齐表征的出现?早期的工作表明,共享的体系结构约束是一个主要因素,但是在预训练之后,广泛变化的体系结构的成功提出了关于体系结构约束重要性的关键问题。在这里,我们表明,在具有最少训练的广泛网络中,架构归纳偏差具有突出的作用。我们研究了不同架构的网络,但没有进行预训练,并量化了它们在猴子和人类视觉皮层中预测图像表征的能力。我们发现,在结合了两种关键维数操作的卷积架构中出现了与上下文对齐的表示:通过池化在空间域中压缩,以及通过增加通道数量在特征域中扩展。我们进一步表明,卷积架构的归纳偏差对于从特征扩展中获得性能增益至关重要——在其他架构和具有目标病变的卷积模型中,维数操作相对无效。我们的研究结果表明,卷积网络的结构约束与生物视觉的约束足够接近,甚至在突触连接通过经验调整之前,皮层视觉表征的许多方面就已经出现了。Kazemian等人报告说,未经训练的宽层卷积网络预测灵长类动物视觉皮层的反应几乎与任务优化网络一样好,揭示了架构约束如何在深度网络中塑造类脑表征。
{"title":"Convolutional architectures are cortex-aligned de novo","authors":"Atlas Kazemian, Eric Elmoznino, Michael F. Bonner","doi":"10.1038/s42256-025-01142-3","DOIUrl":"10.1038/s42256-025-01142-3","url":null,"abstract":"What underlies the emergence of cortex-aligned representations in deep neural network models of vision? Earlier work suggested that shared architectural constraints were a major factor, but the success of widely varied architectures after pretraining raises critical questions about the importance of architectural constraints. Here we show that in wide networks with minimal training, architectural inductive biases have a prominent role. We examined networks with varied architectures but no pretraining and quantified their ability to predict image representations in the visual cortices of monkeys and humans. We found that cortex-aligned representations emerge in convolutional architectures that combine two key manipulations of dimensionality: compression in the spatial domain, through pooling, and expansion in the feature domain by increasing the number of channels. We further show that the inductive biases of convolutional architectures are critical for obtaining performance gains from feature expansion—dimensionality manipulations were relatively ineffective in other architectures and in convolutional models with targeted lesions. Our findings suggest that the architectural constraints of convolutional networks are sufficiently close to the constraints of biological vision to allow many aspects of cortical visual representation to emerge even before synaptic connections have been tuned through experience. Kazemian et al. report that untrained convolutional networks with wide layers predict primate visual cortex responses nearly as well as task-optimized networks, revealing how architectural constraints shape brain-like representations in deep networks.","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"7 11","pages":"1834-1844"},"PeriodicalIF":23.9,"publicationDate":"2025-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145498435","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Nature Machine Intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1