首页 > 最新文献

Nature Machine Intelligence最新文献

英文 中文
Limitations in odour recognition and generalization in a neuromorphic olfactory circuit 气味识别的局限性和神经形态嗅觉回路的泛化
IF 18.8 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-12-16 DOI: 10.1038/s42256-024-00952-1
Nik Dennler, André van Schaik, Michael Schmuker
{"title":"Limitations in odour recognition and generalization in a neuromorphic olfactory circuit","authors":"Nik Dennler, André van Schaik, Michael Schmuker","doi":"10.1038/s42256-024-00952-1","DOIUrl":"10.1038/s42256-024-00952-1","url":null,"abstract":"","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"6 12","pages":"1451-1453"},"PeriodicalIF":18.8,"publicationDate":"2024-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142825245","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Stable Cox regression for survival analysis under distribution shifts 分布变化条件下用于生存分析的稳定考克斯回归
IF 18.8 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-12-13 DOI: 10.1038/s42256-024-00932-5
Shaohua Fan, Renzhe Xu, Qian Dong, Yue He, Cheng Chang, Peng Cui
Survival analysis aims to estimate the impact of covariates on the expected time until an event occurs, which is broadly utilized in disciplines such as life sciences and healthcare, substantially influencing decision-making and improving survival outcomes. Existing methods, usually assuming similar training and testing distributions, nevertheless face challenges with real-world varying data sources, creating unpredictable shifts that undermine their reliability. This urgently necessitates that survival analysis methods should utilize stable features across diverse cohorts for predictions, rather than relying on spurious correlations. To this end, we propose a stable Cox model with theoretical guarantees to identify stable variables, which jointly optimizes an independence-driven sample reweighting module and a weighted Cox regression model. Through extensive evaluation on simulated and real-world omics and clinical data, stable Cox not only shows strong generalization ability across diverse independent test sets but also stratifies the subtype of patients significantly with the identified biomarker panels. Survival prediction models used in healthcare usually assume that training and test data share a similar distribution, which is not true in real-world settings. Cui and colleagues develop a stable Cox regression model that can identify stable variables for predicting survival outcomes under distribution shifts.
生存分析的目的是估计协变量对事件发生前预期时间的影响,这在生命科学和医疗保健等学科中被广泛应用,极大地影响决策并改善生存结果。现有的方法,通常假设类似的训练和测试分布,然而面对现实世界中不同数据源的挑战,产生不可预测的变化,破坏了它们的可靠性。这迫切需要生存分析方法应该利用不同群体的稳定特征进行预测,而不是依赖于虚假的相关性。为此,我们提出了一个具有理论保证的稳定Cox模型来识别稳定变量,该模型联合优化了独立性驱动的样本重赋权模块和加权Cox回归模型。通过对模拟组学和现实组学以及临床数据的广泛评估,稳定的Cox不仅在不同的独立测试集上表现出较强的泛化能力,而且可以通过鉴定的生物标志物面板显著地划分患者的亚型。
{"title":"Stable Cox regression for survival analysis under distribution shifts","authors":"Shaohua Fan, Renzhe Xu, Qian Dong, Yue He, Cheng Chang, Peng Cui","doi":"10.1038/s42256-024-00932-5","DOIUrl":"10.1038/s42256-024-00932-5","url":null,"abstract":"Survival analysis aims to estimate the impact of covariates on the expected time until an event occurs, which is broadly utilized in disciplines such as life sciences and healthcare, substantially influencing decision-making and improving survival outcomes. Existing methods, usually assuming similar training and testing distributions, nevertheless face challenges with real-world varying data sources, creating unpredictable shifts that undermine their reliability. This urgently necessitates that survival analysis methods should utilize stable features across diverse cohorts for predictions, rather than relying on spurious correlations. To this end, we propose a stable Cox model with theoretical guarantees to identify stable variables, which jointly optimizes an independence-driven sample reweighting module and a weighted Cox regression model. Through extensive evaluation on simulated and real-world omics and clinical data, stable Cox not only shows strong generalization ability across diverse independent test sets but also stratifies the subtype of patients significantly with the identified biomarker panels. Survival prediction models used in healthcare usually assume that training and test data share a similar distribution, which is not true in real-world settings. Cui and colleagues develop a stable Cox regression model that can identify stable variables for predicting survival outcomes under distribution shifts.","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"6 12","pages":"1525-1541"},"PeriodicalIF":18.8,"publicationDate":"2024-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.nature.com/articles/s42256-024-00932-5.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142815895","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Kernel approximation using analogue in-memory computing 核近似使用模拟内存计算
IF 18.8 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-12-13 DOI: 10.1038/s42256-024-00943-2
Julian Büchel, Giacomo Camposampiero, Athanasios Vasilopoulos, Corey Lammie, Manuel Le Gallo, Abbas Rahimi, Abu Sebastian
Kernel functions are vital ingredients of several machine learning (ML) algorithms but often incur substantial memory and computational costs. We introduce an approach to kernel approximation in ML algorithms suitable for mixed-signal analogue in-memory computing (AIMC) architectures. Analogue in-memory kernel approximation addresses the performance bottlenecks of conventional kernel-based methods by executing most operations in approximate kernel methods directly in memory. The IBM HERMES project chip, a state-of-the-art phase-change memory-based AIMC chip, is utilized for the hardware demonstration of kernel approximation. Experimental results show that our method maintains high accuracy, with less than a 1% drop in kernel-based ridge classification benchmarks and within 1% accuracy on the long-range arena benchmark for kernelized attention in transformer neural networks. Compared to traditional digital accelerators, our approach is estimated to deliver superior energy efficiency and lower power consumption. These findings highlight the potential of heterogeneous AIMC architectures to enhance the efficiency and scalability of ML applications. A kernel approximation method that enables linear-complexity attention computation via analogue in-memory computing (AIMC) to deliver superior energy efficiency is demonstrated on a multicore AIMC chip.
核函数是几种机器学习(ML)算法的重要组成部分,但通常会产生大量的内存和计算成本。我们介绍了一种适用于混合信号模拟内存计算(AIMC)架构的机器学习算法中的核逼近方法。模拟内存核近似通过直接在内存中执行近似核方法中的大多数操作来解决传统基于核的方法的性能瓶颈。IBM HERMES项目芯片是一种最先进的基于相变存储器的AIMC芯片,用于内核逼近的硬件演示。实验结果表明,该方法保持了较高的准确率,在变压器神经网络中,基于核的脊分类基准的准确率下降不到1%,在核关注的远程竞技场基准的准确率在1%以内。与传统的数字加速器相比,我们的方法估计可以提供更高的能源效率和更低的功耗。这些发现突出了异构AIMC架构在提高机器学习应用程序的效率和可扩展性方面的潜力。
{"title":"Kernel approximation using analogue in-memory computing","authors":"Julian Büchel, Giacomo Camposampiero, Athanasios Vasilopoulos, Corey Lammie, Manuel Le Gallo, Abbas Rahimi, Abu Sebastian","doi":"10.1038/s42256-024-00943-2","DOIUrl":"10.1038/s42256-024-00943-2","url":null,"abstract":"Kernel functions are vital ingredients of several machine learning (ML) algorithms but often incur substantial memory and computational costs. We introduce an approach to kernel approximation in ML algorithms suitable for mixed-signal analogue in-memory computing (AIMC) architectures. Analogue in-memory kernel approximation addresses the performance bottlenecks of conventional kernel-based methods by executing most operations in approximate kernel methods directly in memory. The IBM HERMES project chip, a state-of-the-art phase-change memory-based AIMC chip, is utilized for the hardware demonstration of kernel approximation. Experimental results show that our method maintains high accuracy, with less than a 1% drop in kernel-based ridge classification benchmarks and within 1% accuracy on the long-range arena benchmark for kernelized attention in transformer neural networks. Compared to traditional digital accelerators, our approach is estimated to deliver superior energy efficiency and lower power consumption. These findings highlight the potential of heterogeneous AIMC architectures to enhance the efficiency and scalability of ML applications. A kernel approximation method that enables linear-complexity attention computation via analogue in-memory computing (AIMC) to deliver superior energy efficiency is demonstrated on a multicore AIMC chip.","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"6 12","pages":"1605-1615"},"PeriodicalIF":18.8,"publicationDate":"2024-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142815893","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Envisioning better benchmarks for machine learning PDE solvers 为机器学习PDE求解器设想更好的基准
IF 18.8 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-12-13 DOI: 10.1038/s42256-024-00962-z
Johannes Brandstetter
Tackling partial differential equations with machine learning solvers is a promising direction, but recent analysis reveals challenges with making fair comparisons to previous methods. Stronger benchmark problems are needed for the field to advance.
用机器学习求解器解决偏微分方程是一个很有前途的方向,但最近的分析显示,与以前的方法进行公平比较存在挑战。该领域的进步需要更强有力的基准问题。
{"title":"Envisioning better benchmarks for machine learning PDE solvers","authors":"Johannes Brandstetter","doi":"10.1038/s42256-024-00962-z","DOIUrl":"10.1038/s42256-024-00962-z","url":null,"abstract":"Tackling partial differential equations with machine learning solvers is a promising direction, but recent analysis reveals challenges with making fair comparisons to previous methods. Stronger benchmark problems are needed for the field to advance.","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"7 1","pages":"2-3"},"PeriodicalIF":18.8,"publicationDate":"2024-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142815847","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Discussions of machine versus living intelligence need more clarity 关于机器智能与生活智能的讨论需要更加明确
IF 18.8 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-12-13 DOI: 10.1038/s42256-024-00955-y
Nicolas Rouleau, Michael Levin
Sharp distinctions often drawn between machine and biological intelligences have not tracked advances in the fields of developmental biology and hybrid robotics. We call for conceptual clarity driven by the science of diverse intelligences in unconventional spaces and at unfamiliar scales and embodiments that blur conventional categories.
通常对机器智能和生物智能的明确区分并没有跟上发育生物学和混合机器人学领域的进步。我们呼吁,在非传统空间、陌生尺度和体现方式下的多样化智能科学,应使概念更加清晰,从而模糊传统范畴。
{"title":"Discussions of machine versus living intelligence need more clarity","authors":"Nicolas Rouleau, Michael Levin","doi":"10.1038/s42256-024-00955-y","DOIUrl":"10.1038/s42256-024-00955-y","url":null,"abstract":"Sharp distinctions often drawn between machine and biological intelligences have not tracked advances in the fields of developmental biology and hybrid robotics. We call for conceptual clarity driven by the science of diverse intelligences in unconventional spaces and at unfamiliar scales and embodiments that blur conventional categories.","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"6 12","pages":"1424-1426"},"PeriodicalIF":18.8,"publicationDate":"2024-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142815848","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reply to: Deeper evaluation of a single-cell foundation model 回复:单细胞基础模型的更深层次评价
IF 18.8 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-12-12 DOI: 10.1038/s42256-024-00948-x
Fan Yang, Fang Wang, Longkai Huang, Linjing Liu, Junzhou Huang, Jianhua Yao
{"title":"Reply to: Deeper evaluation of a single-cell foundation model","authors":"Fan Yang, Fang Wang, Longkai Huang, Linjing Liu, Junzhou Huang, Jianhua Yao","doi":"10.1038/s42256-024-00948-x","DOIUrl":"10.1038/s42256-024-00948-x","url":null,"abstract":"","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"6 12","pages":"1447-1450"},"PeriodicalIF":18.8,"publicationDate":"2024-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142809749","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deeper evaluation of a single-cell foundation model 单细胞地基模型的更深层次评价
IF 18.8 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-12-12 DOI: 10.1038/s42256-024-00949-w
Rebecca Boiarsky, Nalini M. Singh, Alejandro Buendia, Ava P. Amini, Gad Getz, David Sontag
{"title":"Deeper evaluation of a single-cell foundation model","authors":"Rebecca Boiarsky, Nalini M. Singh, Alejandro Buendia, Ava P. Amini, Gad Getz, David Sontag","doi":"10.1038/s42256-024-00949-w","DOIUrl":"10.1038/s42256-024-00949-w","url":null,"abstract":"","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"6 12","pages":"1443-1446"},"PeriodicalIF":18.8,"publicationDate":"2024-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142809746","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Successful implementation of the EU AI Act requires interdisciplinary efforts 欧盟人工智能法案的成功实施需要跨学科的努力
IF 18.8 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-12-10 DOI: 10.1038/s42256-024-00954-z
Christian Montag, Michèle Finck
The EU Artificial Intelligence Act bans certain “subliminal techniques beyond a person’s consciousness”, but uses undefined legal terms. Interdisciplinary efforts are needed to ensure effective implementation of the legal text.
欧盟《人工智能法》禁止某些 "超越个人意识的潜意识技术",但使用了未定义的法律术语。要确保法律条文的有效实施,需要跨学科的努力。
{"title":"Successful implementation of the EU AI Act requires interdisciplinary efforts","authors":"Christian Montag, Michèle Finck","doi":"10.1038/s42256-024-00954-z","DOIUrl":"10.1038/s42256-024-00954-z","url":null,"abstract":"The EU Artificial Intelligence Act bans certain “subliminal techniques beyond a person’s consciousness”, but uses undefined legal terms. Interdisciplinary efforts are needed to ensure effective implementation of the legal text.","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"6 12","pages":"1415-1417"},"PeriodicalIF":18.8,"publicationDate":"2024-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142796788","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An interpretable RNA foundation model for exploring functional RNA motifs in plants 探索植物功能性RNA基序的可解释RNA基础模型
IF 18.8 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-12-09 DOI: 10.1038/s42256-024-00946-z
Haopeng Yu, Heng Yang, Wenqing Sun, Zongyun Yan, Xiaofei Yang, Huakun Zhang, Yiliang Ding, Ke Li
The complex ‘language’ of plant RNA encodes a vast array of biological regulatory elements that orchestrate crucial aspects of plant growth, development and adaptation to environmental stresses. Recent advancements in foundation models (FMs) have demonstrated their unprecedented potential to decipher complex ‘language’ in biology. In this study, we introduced PlantRNA-FM, a high-performance and interpretable RNA FM specifically designed for plants. PlantRNA-FM was pretrained on an extensive dataset, integrating RNA sequences and RNA structure information from 1,124 distinct plant species. PlantRNA-FM exhibits superior performance in plant-specific downstream tasks. PlantRNA-FM achieves an F1 score of 0.974 for genic region annotation, whereas the current best-performing model achieves 0.639. Our PlantRNA-FM is empowered by our interpretable framework that facilitates the identification of biologically functional RNA sequence and structure motifs, including both RNA secondary and tertiary structure motifs across transcriptomes. Through experimental validations, we revealed translation-associated RNA motifs in plants. Our PlantRNA-FM also highlighted the importance of the position information of these functional RNA motifs in genic regions. Taken together, our PlantRNA-FM facilitates the exploration of functional RNA motifs across the complexity of transcriptomes, empowering plant scientists with capabilities for programming RNA codes in plants. Approaches are needed to explore regulatory RNA motifs in plants. An interpretable RNA foundation model is developed, trained on thousands of plant transcriptomes, which achieves superior performance in plant RNA biology tasks and enables the discovery of functional RNA sequence and structure motifs across transcriptomes.
植物RNA的复杂“语言”编码了大量的生物调控元件,这些元件协调了植物生长、发育和适应环境胁迫的关键方面。基础模型(FMs)的最新进展显示了它们在破译生物学中复杂“语言”方面前所未有的潜力。在这项研究中,我们引入了PlantRNA-FM,一种专门为植物设计的高性能、可解释的RNA FM。PlantRNA-FM是在一个广泛的数据集上进行预训练的,该数据集整合了来自1,124种不同植物的RNA序列和RNA结构信息。PlantRNA-FM在植物特异性下游任务中表现出优越的性能。PlantRNA-FM对基因区域注释的F1得分为0.974,而目前表现最好的模型为0.639。我们的PlantRNA-FM由我们的可解释框架授权,有助于识别生物功能RNA序列和结构基序,包括转录组中的RNA二级和三级结构基序。通过实验验证,我们揭示了植物中翻译相关的RNA基序。我们的PlantRNA-FM也强调了这些功能RNA基序在基因区域的位置信息的重要性。综上所述,我们的PlantRNA-FM有助于在转录组的复杂性中探索功能性RNA基序,使植物科学家能够编程植物中的RNA编码。
{"title":"An interpretable RNA foundation model for exploring functional RNA motifs in plants","authors":"Haopeng Yu, Heng Yang, Wenqing Sun, Zongyun Yan, Xiaofei Yang, Huakun Zhang, Yiliang Ding, Ke Li","doi":"10.1038/s42256-024-00946-z","DOIUrl":"10.1038/s42256-024-00946-z","url":null,"abstract":"The complex ‘language’ of plant RNA encodes a vast array of biological regulatory elements that orchestrate crucial aspects of plant growth, development and adaptation to environmental stresses. Recent advancements in foundation models (FMs) have demonstrated their unprecedented potential to decipher complex ‘language’ in biology. In this study, we introduced PlantRNA-FM, a high-performance and interpretable RNA FM specifically designed for plants. PlantRNA-FM was pretrained on an extensive dataset, integrating RNA sequences and RNA structure information from 1,124 distinct plant species. PlantRNA-FM exhibits superior performance in plant-specific downstream tasks. PlantRNA-FM achieves an F1 score of 0.974 for genic region annotation, whereas the current best-performing model achieves 0.639. Our PlantRNA-FM is empowered by our interpretable framework that facilitates the identification of biologically functional RNA sequence and structure motifs, including both RNA secondary and tertiary structure motifs across transcriptomes. Through experimental validations, we revealed translation-associated RNA motifs in plants. Our PlantRNA-FM also highlighted the importance of the position information of these functional RNA motifs in genic regions. Taken together, our PlantRNA-FM facilitates the exploration of functional RNA motifs across the complexity of transcriptomes, empowering plant scientists with capabilities for programming RNA codes in plants. Approaches are needed to explore regulatory RNA motifs in plants. An interpretable RNA foundation model is developed, trained on thousands of plant transcriptomes, which achieves superior performance in plant RNA biology tasks and enables the discovery of functional RNA sequence and structure motifs across transcriptomes.","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"6 12","pages":"1616-1625"},"PeriodicalIF":18.8,"publicationDate":"2024-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.nature.com/articles/s42256-024-00946-z.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142793852","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluating generalizability of artificial intelligence models for molecular datasets 评估人工智能模型在分子数据集上的泛化性
IF 18.8 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-12-06 DOI: 10.1038/s42256-024-00931-6
Yasha Ektefaie, Andrew Shen, Daria Bykova, Maximillian G. Marin, Marinka Zitnik, Maha Farhat
Deep learning has made rapid advances in modelling molecular sequencing data. Despite achieving high performance on benchmarks, it remains unclear to what extent deep learning models learn general principles and generalize to previously unseen sequences. Benchmarks traditionally interrogate model generalizability by generating metadata- or sequence similarity-based train and test splits of input data before assessing model performance. Here we show that this approach mischaracterizes model generalizability by failing to consider the full spectrum of cross-split overlap, that is, similarity between train and test splits. We introduce SPECTRA, the spectral framework for model evaluation. Given a model and a dataset, SPECTRA plots model performance as a function of decreasing cross-split overlap and reports the area under this curve as a measure of generalizability. We use SPECTRA with 18 sequencing datasets and phenotypes ranging from antibiotic resistance in tuberculosis to protein–ligand binding and evaluate the generalizability of 19 state-of-the-art deep learning models, including large language models, graph neural networks, diffusion models and convolutional neural networks. We show that sequence similarity- and metadata-based splits provide an incomplete assessment of model generalizability. Using SPECTRA, we find that as cross-split overlap decreases, deep learning models consistently show reduced performance, varying by task and model. Although no model consistently achieved the highest performance across all tasks, deep learning models can, in some cases, generalize to previously unseen sequences on specific tasks. SPECTRA advances our understanding of how foundation models generalize in biological applications. Ektefaie and colleagues introduce the spectral framework for models evaluation (SPECTRA) to measure the generalizability of machine learning models for molecular sequences.
深度学习在分子测序数据建模方面取得了快速进展。尽管在基准测试中取得了很高的性能,但深度学习模型在多大程度上学习了一般原理,并将其推广到以前未见过的序列,这一点尚不清楚。在评估模型性能之前,基准测试通常通过生成基于元数据或序列相似性的训练和测试输入数据的分割来询问模型的泛化性。在这里,我们表明,这种方法错误地描述了模型的可泛化性,因为它没有考虑交叉分割重叠的全谱,即训练和测试分割之间的相似性。我们介绍了用于模型评估的光谱框架SPECTRA。给定一个模型和一个数据集,SPECTRA将模型性能绘制为减少交叉分裂重叠的函数,并报告该曲线下的面积作为泛化性的度量。我们将SPECTRA与18个测序数据集和表型(从结核病的抗生素耐药性到蛋白质配体结合)一起使用,并评估了19个最先进的深度学习模型的泛化性,包括大型语言模型、图神经网络、扩散模型和卷积神经网络。我们表明,序列相似性和基于元数据的分裂提供了模型泛化的不完全评估。使用SPECTRA,我们发现随着交叉分割重叠的减少,深度学习模型的性能持续下降,因任务和模型而异。虽然没有模型在所有任务中都能达到最高的性能,但在某些情况下,深度学习模型可以在特定任务中推广到以前未见过的序列。SPECTRA提高了我们对基础模型如何在生物学应用中推广的理解。
{"title":"Evaluating generalizability of artificial intelligence models for molecular datasets","authors":"Yasha Ektefaie, Andrew Shen, Daria Bykova, Maximillian G. Marin, Marinka Zitnik, Maha Farhat","doi":"10.1038/s42256-024-00931-6","DOIUrl":"10.1038/s42256-024-00931-6","url":null,"abstract":"Deep learning has made rapid advances in modelling molecular sequencing data. Despite achieving high performance on benchmarks, it remains unclear to what extent deep learning models learn general principles and generalize to previously unseen sequences. Benchmarks traditionally interrogate model generalizability by generating metadata- or sequence similarity-based train and test splits of input data before assessing model performance. Here we show that this approach mischaracterizes model generalizability by failing to consider the full spectrum of cross-split overlap, that is, similarity between train and test splits. We introduce SPECTRA, the spectral framework for model evaluation. Given a model and a dataset, SPECTRA plots model performance as a function of decreasing cross-split overlap and reports the area under this curve as a measure of generalizability. We use SPECTRA with 18 sequencing datasets and phenotypes ranging from antibiotic resistance in tuberculosis to protein–ligand binding and evaluate the generalizability of 19 state-of-the-art deep learning models, including large language models, graph neural networks, diffusion models and convolutional neural networks. We show that sequence similarity- and metadata-based splits provide an incomplete assessment of model generalizability. Using SPECTRA, we find that as cross-split overlap decreases, deep learning models consistently show reduced performance, varying by task and model. Although no model consistently achieved the highest performance across all tasks, deep learning models can, in some cases, generalize to previously unseen sequences on specific tasks. SPECTRA advances our understanding of how foundation models generalize in biological applications. Ektefaie and colleagues introduce the spectral framework for models evaluation (SPECTRA) to measure the generalizability of machine learning models for molecular sequences.","PeriodicalId":48533,"journal":{"name":"Nature Machine Intelligence","volume":"6 12","pages":"1512-1524"},"PeriodicalIF":18.8,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142782801","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Nature Machine Intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1