首页 > 最新文献

Nature computational science最新文献

英文 中文
Predicting the regulatory impacts of noncoding variants on gene expression through epigenomic integration across tissues and single-cell landscapes 通过跨组织和单细胞景观的表观基因组整合预测非编码变异对基因表达的调控影响。
IF 18.3 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-09-26 DOI: 10.1038/s43588-025-00878-7
Zhe Liu  (, ), Yihang Bao  (, ), An Gu  (, ), Weichen Song  (, ), Guan Ning Lin  (, )
Noncoding mutations play a critical role in regulating gene expression, yet predicting their effects across diverse tissues and cell types remains a challenge. Here we present EMO, a transformer-based model that integrates DNA sequence with chromatin accessibility data (assay for transposase-accessible chromatin with sequencing) to predict the regulatory impact of noncoding single nucleotide polymorphisms on gene expression. A key component of EMO is its ability to incorporate personalized functional genomic profiles, enabling individual-level and disease-contextual predictions and addressing critical limitations of current approaches. EMO generalizes across tissues and cell types by modeling both short- and long-range regulatory interactions and capturing dynamic gene expression changes associated with disease progression. In benchmark evaluations, the pretraining-based EMO framework outperformed existing models, with fine-tuning small-sample tissues enhancing the model’s ability to fit target tissues. In single-cell contexts, EMO accurately identified cell-type-specific regulatory patterns and successfully captured the effects of disease-associated single nucleotide polymorphisms in conditions, linking genetic variation to disease-relevant pathways. EMO integrates DNA sequence and chromatin accessibility data to predict how noncoding variants regulate gene expression across tissues and single cells, enabling context-aware personalized insights into genetic effects for precision medicine.
非编码突变在调节基因表达中起着至关重要的作用,然而预测它们在不同组织和细胞类型中的作用仍然是一个挑战。在这里,我们提出了EMO,这是一种基于转换器的模型,将DNA序列与染色质可接近性数据(转座酶可接近染色质的测定与测序)整合在一起,以预测非编码单核苷酸多态性对基因表达的调节影响。EMO的一个关键组成部分是其整合个性化功能基因组图谱的能力,能够实现个体水平和疾病背景预测,并解决当前方法的关键局限性。EMO通过模拟短期和长期的调节相互作用以及捕捉与疾病进展相关的动态基因表达变化,从而推广到各种组织和细胞类型。在基准评估中,基于预训练的EMO框架优于现有模型,微调小样本组织增强了模型拟合目标组织的能力。在单细胞环境下,EMO准确识别细胞类型特异性调节模式,并成功捕获疾病相关单核苷酸多态性在条件下的影响,将遗传变异与疾病相关途径联系起来。
{"title":"Predicting the regulatory impacts of noncoding variants on gene expression through epigenomic integration across tissues and single-cell landscapes","authors":"Zhe Liu \u0000 (, ), Yihang Bao \u0000 (, ), An Gu \u0000 (, ), Weichen Song \u0000 (, ), Guan Ning Lin \u0000 (, )","doi":"10.1038/s43588-025-00878-7","DOIUrl":"10.1038/s43588-025-00878-7","url":null,"abstract":"Noncoding mutations play a critical role in regulating gene expression, yet predicting their effects across diverse tissues and cell types remains a challenge. Here we present EMO, a transformer-based model that integrates DNA sequence with chromatin accessibility data (assay for transposase-accessible chromatin with sequencing) to predict the regulatory impact of noncoding single nucleotide polymorphisms on gene expression. A key component of EMO is its ability to incorporate personalized functional genomic profiles, enabling individual-level and disease-contextual predictions and addressing critical limitations of current approaches. EMO generalizes across tissues and cell types by modeling both short- and long-range regulatory interactions and capturing dynamic gene expression changes associated with disease progression. In benchmark evaluations, the pretraining-based EMO framework outperformed existing models, with fine-tuning small-sample tissues enhancing the model’s ability to fit target tissues. In single-cell contexts, EMO accurately identified cell-type-specific regulatory patterns and successfully captured the effects of disease-associated single nucleotide polymorphisms in conditions, linking genetic variation to disease-relevant pathways. EMO integrates DNA sequence and chromatin accessibility data to predict how noncoding variants regulate gene expression across tissues and single cells, enabling context-aware personalized insights into genetic effects for precision medicine.","PeriodicalId":74246,"journal":{"name":"Nature computational science","volume":"5 10","pages":"927-939"},"PeriodicalIF":18.3,"publicationDate":"2025-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145180816","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Rhythm-based hierarchical predictive computations support acoustic−semantic transformation in speech processing 基于节奏的分层预测计算支持语音处理中的声-语义转换。
IF 18.3 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-09-26 DOI: 10.1038/s43588-025-00876-9
Olesia Dogonasheva, Keith B. Doelling, Denis Zakharov, Anne-Lise Giraud, Boris Gutkin
Unraveling how humans understand speech despite distortions has long intrigued researchers. A prominent hypothesis highlights the role of multiple endogenous brain rhythms in forming the computational context to predict speech structure and content. Yet how neural processes may implement rhythm-based context formation remains unclear. Here we propose the brain rhythm-based inference model (BRyBI) as a possible neural implementation of speech processing in the auditory cortex based on the interaction of endogenous brain rhythms in a predictive coding framework. BRyBI encodes key rhythmic processes for parsing spectro-temporal representations of the speech signal into phoneme sequences and to govern the formation of the phrasal context. BRyBI matches patterns of human performance in speech recognition tasks and explains contradictory experimental observations of rhythms during speech listening and their dependence on the informational aspect of speech (uncertainty and surprise). This work highlights the computational role of multiscale brain rhythms in predictive speech processing. This study presents a brain rhythm-based inference model (BRyBI) for speech processing in the auditory cortex. BRyBI shows how rhythmic neural activity enables robust speech processing by dynamically predicting context and elucidates mechanistic principles that allow robust speech parsing in the brain.
长期以来,研究人员一直好奇人类是如何在扭曲的情况下理解语言的。一个突出的假设强调了多种内源性大脑节律在形成预测语音结构和内容的计算环境中的作用。然而,神经过程如何实现基于节奏的上下文形成仍不清楚。在此,我们提出基于脑节奏的推理模型(BRyBI)作为一种基于内源性脑节奏在预测编码框架中的相互作用的听觉皮层语音处理的可能神经实现。BRyBI对关键的节奏过程进行编码,以将语音信号的光谱时间表征解析为音素序列,并控制短语上下文的形成。BRyBI与人类在语音识别任务中的表现模式相匹配,并解释了语音听力过程中节奏的矛盾实验观察及其对语音信息方面(不确定性和惊喜)的依赖。这项工作强调了多尺度脑节律在预测语音处理中的计算作用。
{"title":"Rhythm-based hierarchical predictive computations support acoustic−semantic transformation in speech processing","authors":"Olesia Dogonasheva, Keith B. Doelling, Denis Zakharov, Anne-Lise Giraud, Boris Gutkin","doi":"10.1038/s43588-025-00876-9","DOIUrl":"10.1038/s43588-025-00876-9","url":null,"abstract":"Unraveling how humans understand speech despite distortions has long intrigued researchers. A prominent hypothesis highlights the role of multiple endogenous brain rhythms in forming the computational context to predict speech structure and content. Yet how neural processes may implement rhythm-based context formation remains unclear. Here we propose the brain rhythm-based inference model (BRyBI) as a possible neural implementation of speech processing in the auditory cortex based on the interaction of endogenous brain rhythms in a predictive coding framework. BRyBI encodes key rhythmic processes for parsing spectro-temporal representations of the speech signal into phoneme sequences and to govern the formation of the phrasal context. BRyBI matches patterns of human performance in speech recognition tasks and explains contradictory experimental observations of rhythms during speech listening and their dependence on the informational aspect of speech (uncertainty and surprise). This work highlights the computational role of multiscale brain rhythms in predictive speech processing. This study presents a brain rhythm-based inference model (BRyBI) for speech processing in the auditory cortex. BRyBI shows how rhythmic neural activity enables robust speech processing by dynamically predicting context and elucidates mechanistic principles that allow robust speech parsing in the brain.","PeriodicalId":74246,"journal":{"name":"Nature computational science","volume":"5 10","pages":"915-926"},"PeriodicalIF":18.3,"publicationDate":"2025-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145180835","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The rise of large language models 大型语言模型的兴起
IF 18.3 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-09-24 DOI: 10.1038/s43588-025-00890-x
This issue of Nature Computational Science features a Focus that highlights both the promises and perils of large language models, their emerging applications across diverse scientific domains, and the opportunities to overcome the challenges that lie ahead.
这一期的《自然计算科学》重点介绍了大型语言模型的前景和危险,它们在不同科学领域的新兴应用,以及克服未来挑战的机会。
{"title":"The rise of large language models","authors":"","doi":"10.1038/s43588-025-00890-x","DOIUrl":"10.1038/s43588-025-00890-x","url":null,"abstract":"This issue of Nature Computational Science features a Focus that highlights both the promises and perils of large language models, their emerging applications across diverse scientific domains, and the opportunities to overcome the challenges that lie ahead.","PeriodicalId":74246,"journal":{"name":"Nature computational science","volume":"5 9","pages":"689-690"},"PeriodicalIF":18.3,"publicationDate":"2025-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.nature.comhttps://www.nature.com/articles/s43588-025-00890-x.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145129541","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Neuromorphic principles in self-attention hardware for efficient transformers 高效变压器自注意硬件的神经形态原理。
IF 18.3 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-09-16 DOI: 10.1038/s43588-025-00868-9
Nathan Leroux, Jan Finkbeiner, Emre Neftci
Strong barriers remain between neuromorphic engineering and machine learning, especially with regard to recent large language models (LLMs) and transformers. This Comment makes the case that neuromorphic engineering may hold the keys to more efficient inference with transformer-like models.
神经形态工程和机器学习之间仍然存在很大的障碍,特别是在最近的大型语言模型(llm)和变形器方面。这篇评论表明,神经形态工程可能是对类似变形器的模型进行更有效推理的关键。
{"title":"Neuromorphic principles in self-attention hardware for efficient transformers","authors":"Nathan Leroux, Jan Finkbeiner, Emre Neftci","doi":"10.1038/s43588-025-00868-9","DOIUrl":"10.1038/s43588-025-00868-9","url":null,"abstract":"Strong barriers remain between neuromorphic engineering and machine learning, especially with regard to recent large language models (LLMs) and transformers. This Comment makes the case that neuromorphic engineering may hold the keys to more efficient inference with transformer-like models.","PeriodicalId":74246,"journal":{"name":"Nature computational science","volume":"5 9","pages":"708-710"},"PeriodicalIF":18.3,"publicationDate":"2025-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145076704","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On the compatibility of generative AI and generative linguistics 论生成人工智能与生成语言学的兼容性。
IF 18.3 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-09-16 DOI: 10.1038/s43588-025-00861-2
Eva Portelance, Masoud Jasbi
Chomsky’s generative linguistics has made substantial contributions to cognitive science and symbolic artificial intelligence. With the rise of neural language models, however, the compatibility between generative artificial intelligence and generative linguistics has come under debate. Here we outline three ways in which generative artificial intelligence aligns with and supports the core ideas of generative linguistics. In turn, generative linguistics can provide criteria to evaluate and improve neural language models as models of human language and cognition. This Perspective discusses that generative AI aligns with generative linguistics by showing that neural language models (NLMs) are formal generative models. Furthermore, generative linguistics offers a framework for evaluating and improving NLMs.
乔姆斯基的生成语言学对认知科学和符号人工智能做出了重大贡献。然而,随着神经语言模型的兴起,生成式人工智能和生成式语言学之间的兼容性已经成为争论的焦点。在这里,我们概述了生成人工智能与生成语言学的核心思想相一致和支持的三种方式。反过来,生成语言学可以为评估和改进作为人类语言和认知模型的神经语言模型提供标准。
{"title":"On the compatibility of generative AI and generative linguistics","authors":"Eva Portelance, Masoud Jasbi","doi":"10.1038/s43588-025-00861-2","DOIUrl":"10.1038/s43588-025-00861-2","url":null,"abstract":"Chomsky’s generative linguistics has made substantial contributions to cognitive science and symbolic artificial intelligence. With the rise of neural language models, however, the compatibility between generative artificial intelligence and generative linguistics has come under debate. Here we outline three ways in which generative artificial intelligence aligns with and supports the core ideas of generative linguistics. In turn, generative linguistics can provide criteria to evaluate and improve neural language models as models of human language and cognition. This Perspective discusses that generative AI aligns with generative linguistics by showing that neural language models (NLMs) are formal generative models. Furthermore, generative linguistics offers a framework for evaluating and improving NLMs.","PeriodicalId":74246,"journal":{"name":"Nature computational science","volume":"5 9","pages":"745-753"},"PeriodicalIF":18.3,"publicationDate":"2025-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145076692","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Increasing alignment of large language models with language processing in the human brain 大型语言模型与人类大脑中语言处理的日益一致。
IF 18.3 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-09-16 DOI: 10.1038/s43588-025-00863-0
Changjiang Gao, Zhengwu Ma, Jiajun Chen, Ping Li, Shujian Huang, Jixing Li
Transformer-based large language models (LLMs) have considerably advanced our understanding of how meaning is represented in the human brain; however, the validity of increasingly large LLMs is being questioned due to their extensive training data and their ability to access context thousands of words long. In this study we investigated whether instruction tuning—another core technique in recent LLMs that goes beyond mere scaling—can enhance models’ ability to capture linguistic information in the human brain. We compared base and instruction-tuned LLMs of varying sizes against human behavioral and brain activity measured with eye-tracking and functional magnetic resonance imaging during naturalistic reading. We show that simply making LLMs larger leads to a closer match with the human brain than fine-tuning them with instructions. These finding have substantial implications for understanding the cognitive plausibility of LLMs and their role in studying naturalistic language comprehension. Larger LLMs’ self-attention more accurately predicts readers’ regressive saccades and fMRI responses in language regions, whereas instruction tuning adds no benefit.
基于变压器的大型语言模型(llm)大大提高了我们对人类大脑中意义如何表示的理解;然而,越来越多的大型法学硕士的有效性受到质疑,因为它们有大量的训练数据和数千字长的上下文访问能力。在这项研究中,我们调查了指令调整——最近法学硕士的另一项核心技术,超越了单纯的缩放——是否能提高模型在人脑中捕捉语言信息的能力。在自然阅读过程中,我们通过眼动追踪和功能性磁共振成像来测量人类的行为和大脑活动。我们表明,简单地将llm变大比用指令对其进行微调更接近人类大脑。这些发现对于理解法学硕士的认知合理性及其在研究自然语言理解中的作用具有重要意义。
{"title":"Increasing alignment of large language models with language processing in the human brain","authors":"Changjiang Gao, Zhengwu Ma, Jiajun Chen, Ping Li, Shujian Huang, Jixing Li","doi":"10.1038/s43588-025-00863-0","DOIUrl":"10.1038/s43588-025-00863-0","url":null,"abstract":"Transformer-based large language models (LLMs) have considerably advanced our understanding of how meaning is represented in the human brain; however, the validity of increasingly large LLMs is being questioned due to their extensive training data and their ability to access context thousands of words long. In this study we investigated whether instruction tuning—another core technique in recent LLMs that goes beyond mere scaling—can enhance models’ ability to capture linguistic information in the human brain. We compared base and instruction-tuned LLMs of varying sizes against human behavioral and brain activity measured with eye-tracking and functional magnetic resonance imaging during naturalistic reading. We show that simply making LLMs larger leads to a closer match with the human brain than fine-tuning them with instructions. These finding have substantial implications for understanding the cognitive plausibility of LLMs and their role in studying naturalistic language comprehension. Larger LLMs’ self-attention more accurately predicts readers’ regressive saccades and fMRI responses in language regions, whereas instruction tuning adds no benefit.","PeriodicalId":74246,"journal":{"name":"Nature computational science","volume":"5 11","pages":"1080-1090"},"PeriodicalIF":18.3,"publicationDate":"2025-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.nature.comhttps://www.nature.com/articles/s43588-025-00863-0.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145076701","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Applying weighted Cox regression to genome-wide association studies of time-to-event phenotypes 将加权Cox回归应用于事件时间表型的全基因组关联研究。
IF 18.3 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-09-12 DOI: 10.1038/s43588-025-00864-z
Ying Li, Yuzhuo Ma, He Xu, Yaoyao Sun, Min Zhu, Weihua Yue, Wei Zhou, Wenjian Bi
With the growing availability of time-stamped electronic health records linked to genetic data in large biobanks and cohorts, time-to-event phenotypes are increasingly studied in genome-wide association studies. Although numerous Cox-regression-based methods have been proposed for a large-scale genome-wide association study, case ascertainment in time-to-event phenotypes has not been well addressed. Here we propose a computationally efficient Cox-based method, named WtCoxG, that accounts for case ascertainment by fitting a weighted Cox proportional hazards null model. A hybrid strategy incorporating saddlepoint approximation largely increases its accuracy when analyzing low-frequency and rare variants. Notably, by leveraging external minor allele frequencies from public resources, WtCoxG further boosts statistical power. Extensive simulation studies demonstrated that WtCoxG is more powerful than ADuLT and other Cox-based methods, while effectively controlling type I error rates. UK Biobank real data analysis validated that leveraging external minor allele frequencies contributes to the power gains of WtCoxG compared with ADuLT in the analysis of type 2 diabetes and coronary atherosclerosis. This study introduces WtCoxG, an efficient genetic analysis method for time-to-event data, which improves statistical power by addressing case ascertainment and leveraging external allele frequency information.
随着大型生物库和队列中与遗传数据相关的带时间戳的电子健康记录越来越多,全基因组关联研究越来越多地研究事件时间表型。尽管许多基于cox回归的方法已被提出用于大规模全基因组关联研究,但事件时间表型的病例确定尚未得到很好的解决。在这里,我们提出了一种计算效率高的基于Cox的方法,名为WtCoxG,它通过拟合加权Cox比例风险零模型来确定病例。结合鞍点近似的混合策略在分析低频和罕见变异时大大提高了准确性。值得注意的是,通过利用来自公共资源的外部小等位基因频率,WtCoxG进一步提高了统计能力。大量的仿真研究表明,WtCoxG比ADuLT和其他基于cox的方法更强大,同时有效地控制了I型错误率。UK Biobank的真实数据分析证实,与ADuLT相比,利用外部次要等位基因频率有助于WtCoxG在2型糖尿病和冠状动脉粥样硬化分析中的功率增益。
{"title":"Applying weighted Cox regression to genome-wide association studies of time-to-event phenotypes","authors":"Ying Li, Yuzhuo Ma, He Xu, Yaoyao Sun, Min Zhu, Weihua Yue, Wei Zhou, Wenjian Bi","doi":"10.1038/s43588-025-00864-z","DOIUrl":"10.1038/s43588-025-00864-z","url":null,"abstract":"With the growing availability of time-stamped electronic health records linked to genetic data in large biobanks and cohorts, time-to-event phenotypes are increasingly studied in genome-wide association studies. Although numerous Cox-regression-based methods have been proposed for a large-scale genome-wide association study, case ascertainment in time-to-event phenotypes has not been well addressed. Here we propose a computationally efficient Cox-based method, named WtCoxG, that accounts for case ascertainment by fitting a weighted Cox proportional hazards null model. A hybrid strategy incorporating saddlepoint approximation largely increases its accuracy when analyzing low-frequency and rare variants. Notably, by leveraging external minor allele frequencies from public resources, WtCoxG further boosts statistical power. Extensive simulation studies demonstrated that WtCoxG is more powerful than ADuLT and other Cox-based methods, while effectively controlling type I error rates. UK Biobank real data analysis validated that leveraging external minor allele frequencies contributes to the power gains of WtCoxG compared with ADuLT in the analysis of type 2 diabetes and coronary atherosclerosis. This study introduces WtCoxG, an efficient genetic analysis method for time-to-event data, which improves statistical power by addressing case ascertainment and leveraging external allele frequency information.","PeriodicalId":74246,"journal":{"name":"Nature computational science","volume":"5 11","pages":"1064-1079"},"PeriodicalIF":18.3,"publicationDate":"2025-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145056460","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Real-time raw signal genomic analysis using fully integrated memristor hardware 实时原始信号基因组分析使用完全集成的忆阻器硬件。
IF 18.3 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-09-12 DOI: 10.1038/s43588-025-00867-w
Peiyi He, Shengbo Wang, Ruibin Mao, Mingrui Jiang, Sebastian Siegel, Giacomo Pedretti, Jim Ignowski, John Paul Strachan, Ruibang Luo, Can Li
Advances in third-generation sequencing have enabled portable and real-time genomic sequencing, but real-time data processing remains a bottleneck, hampering on-site genomic analysis. These technologies generate noisy analog signals that traditionally require basecalling and read mapping, both demanding costly data movement on von Neumann hardware. Here, to overcome this, we present a memristor-based hardware–software codesign that processes raw sequencer signals directly in analog memory, combining the two separated steps. By exploiting intrinsic device noise for locality-sensitive hashing and implementing parallel approximate searches in content-addressable memory, we experimentally showcase on-site applications, including infectious disease detection and metagenomic classification on a fully integrated memristor chip. Our experimentally validated analysis confirms the effectiveness of this approach on real-world tasks, achieving a 97.15% F1 score in virus raw signal mapping, with 51× speed-up and 477× energy saving over an application-specific integrated circuit. These results demonstrate that in-memory computing hardware provides a viable solution for integration with portable sequencers, enabling real-time and on-site genomic analysis. The authors report a memristor-based system that analyzes raw analog signals from a genomic sequencer directly in memory. By bypassing slow data conversion, the system achieves substantial improvements in speed and efficiency, enabling real-time, on-site genomic analysis.
第三代测序技术的进步使便携式和实时基因组测序成为可能,但实时数据处理仍然是一个瓶颈,阻碍了现场基因组分析。这些技术产生嘈杂的模拟信号,传统上需要基调用和读取映射,两者都需要在冯·诺伊曼硬件上进行昂贵的数据移动。在这里,为了克服这个问题,我们提出了一个基于忆阻器的硬件软件协同设计,直接在模拟存储器中处理原始序列器信号,结合两个分离的步骤。通过利用固有设备噪声进行位置敏感散列,并在内容可寻址存储器中实现并行近似搜索,我们实验展示了在完全集成的忆阻芯片上的现场应用,包括传染病检测和宏基因组分类。我们的实验验证分析证实了该方法在实际任务中的有效性,在病毒原始信号映射中获得97.15%的F1分数,在特定应用集成电路上加速51倍,节能477倍。这些结果表明,内存计算硬件为与便携式测序仪集成提供了可行的解决方案,实现了实时和现场基因组分析。
{"title":"Real-time raw signal genomic analysis using fully integrated memristor hardware","authors":"Peiyi He, Shengbo Wang, Ruibin Mao, Mingrui Jiang, Sebastian Siegel, Giacomo Pedretti, Jim Ignowski, John Paul Strachan, Ruibang Luo, Can Li","doi":"10.1038/s43588-025-00867-w","DOIUrl":"10.1038/s43588-025-00867-w","url":null,"abstract":"Advances in third-generation sequencing have enabled portable and real-time genomic sequencing, but real-time data processing remains a bottleneck, hampering on-site genomic analysis. These technologies generate noisy analog signals that traditionally require basecalling and read mapping, both demanding costly data movement on von Neumann hardware. Here, to overcome this, we present a memristor-based hardware–software codesign that processes raw sequencer signals directly in analog memory, combining the two separated steps. By exploiting intrinsic device noise for locality-sensitive hashing and implementing parallel approximate searches in content-addressable memory, we experimentally showcase on-site applications, including infectious disease detection and metagenomic classification on a fully integrated memristor chip. Our experimentally validated analysis confirms the effectiveness of this approach on real-world tasks, achieving a 97.15% F1 score in virus raw signal mapping, with 51× speed-up and 477× energy saving over an application-specific integrated circuit. These results demonstrate that in-memory computing hardware provides a viable solution for integration with portable sequencers, enabling real-time and on-site genomic analysis. The authors report a memristor-based system that analyzes raw analog signals from a genomic sequencer directly in memory. By bypassing slow data conversion, the system achieves substantial improvements in speed and efficiency, enabling real-time, on-site genomic analysis.","PeriodicalId":74246,"journal":{"name":"Nature computational science","volume":"5 10","pages":"940-951"},"PeriodicalIF":18.3,"publicationDate":"2025-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145056487","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A complete photonic integrated neuron for nonlinear all-optical computing 用于非线性全光计算的完整光子集成神经元。
IF 18.3 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-09-12 DOI: 10.1038/s43588-025-00866-x
Tao Yan, Yanchen Guo, Tiankuang Zhou, Guocheng Shao, Shanglong Li, Ruqi Huang, Qionghai Dai, Lu Fang
The field of photonic neural networks has experienced substantial growth, driven by its potential to enable ultrafast artificial intelligence inference and address the escalating demand for computing speed and energy efficiency. However, realizing nonlinearity-complete all-optical neurons is still challenging, constraining the performance of photonic neural networks. Here we report a complete photonic integrated neuron (PIN) with spatiotemporal feature learning capabilities and reconfigurable structures for nonlinear all-optical computing. By interleaving the spatiotemporal dimension of photons and leveraging the Kerr effect, PIN performs high-order temporal convolution and all-optical nonlinear activation monolithically on a silicon-nitride photonic chip, achieving neuron completeness of weighted interconnects and nonlinearities. We develop the PIN chip system and demonstrate its remarkable performance in high-accuracy image classification and human motion generation. PIN enables ultrafast spatialtemporal processing with a latency as low as 240 ps, paving the way for advancing machine intelligence into the subnanosecond regime. This study reports a complete photonic neuron integrated on a silicon-nitride chip, enabling ultrafast all-optical computing with nonlinear multi-kernel convolution for image recognition and motion generation.
光子神经网络在实现超快速人工智能推理和解决对计算速度和能源效率不断增长的需求方面具有巨大的潜力,因此该领域经历了大幅增长。然而,实现非线性完全全光神经元仍然具有挑战性,这限制了光子神经网络的性能。本文报道了一种具有时空特征学习能力和可重构结构的完整光子集成神经元(PIN),用于非线性全光计算。通过交错光子的时空维度和利用克尔效应,PIN在氮化硅光子芯片上单片执行高阶时间卷积和全光非线性激活,实现加权互连和非线性的神经元完备性。我们开发了PIN芯片系统,并证明了其在高精度图像分类和人体运动生成方面的卓越性能。PIN实现了超快的时空处理,延迟低至240 ps,为将机器智能推进到亚纳秒级铺平了道路。
{"title":"A complete photonic integrated neuron for nonlinear all-optical computing","authors":"Tao Yan, Yanchen Guo, Tiankuang Zhou, Guocheng Shao, Shanglong Li, Ruqi Huang, Qionghai Dai, Lu Fang","doi":"10.1038/s43588-025-00866-x","DOIUrl":"10.1038/s43588-025-00866-x","url":null,"abstract":"The field of photonic neural networks has experienced substantial growth, driven by its potential to enable ultrafast artificial intelligence inference and address the escalating demand for computing speed and energy efficiency. However, realizing nonlinearity-complete all-optical neurons is still challenging, constraining the performance of photonic neural networks. Here we report a complete photonic integrated neuron (PIN) with spatiotemporal feature learning capabilities and reconfigurable structures for nonlinear all-optical computing. By interleaving the spatiotemporal dimension of photons and leveraging the Kerr effect, PIN performs high-order temporal convolution and all-optical nonlinear activation monolithically on a silicon-nitride photonic chip, achieving neuron completeness of weighted interconnects and nonlinearities. We develop the PIN chip system and demonstrate its remarkable performance in high-accuracy image classification and human motion generation. PIN enables ultrafast spatialtemporal processing with a latency as low as 240 ps, paving the way for advancing machine intelligence into the subnanosecond regime. This study reports a complete photonic neuron integrated on a silicon-nitride chip, enabling ultrafast all-optical computing with nonlinear multi-kernel convolution for image recognition and motion generation.","PeriodicalId":74246,"journal":{"name":"Nature computational science","volume":"5 12","pages":"1202-1213"},"PeriodicalIF":18.3,"publicationDate":"2025-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145056517","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Confidential computing for population-scale genome-wide association studies with SECRET-GWAS SECRET-GWAS用于群体规模全基因组关联研究的保密计算。
IF 18.3 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-09-12 DOI: 10.1038/s43588-025-00856-z
Jonah Rosenblum, Juechu Dong, Satish Narayanasamy
Genomic data from a single institution lacks global diversity representation, especially for rare variants and diseases. Confidential computing can enable collaborative genome-wide association studies (GWAS) without compromising privacy or accuracy. However, due to limited secure memory space and performance overheads, previous solutions fail to support widely used regression methods. Here we present SECRET-GWAS—a rapid, privacy-preserving, population-scale, collaborative GWAS tool. We discuss several system optimizations, including streaming, batching, data parallelization and reducing trusted hardware overheads to efficiently scale linear and logistic regression to over a thousand processor cores on an Intel SGX-based cloud platform. In addition, we protect SECRET-GWAS against several hardware side-channel attacks. SECRET-GWAS is an open-source tool and works with the widely used Hail genomic analysis framework. Our experiments on Azure’s Confidential Computing platform demonstrate that SECRET-GWAS enables multivariate linear and logistic regression GWAS queries on population-scale datasets from ten independent sources in just 4.5 and 29 minutes, respectively. Secure collaborative genome-wide association studies (GWAS) with population-scale datasets address gaps in genomic data. This work proposes SECRET-GWAS and system optimizations that overcome resource constraints and exploit parallelism, while maintaining privacy and accuracy.
来自单一机构的基因组数据缺乏全球多样性代表,特别是对于罕见变异和疾病。保密计算可以使协作性全基因组关联研究(GWAS)在不损害隐私或准确性的情况下实现。但是,由于有限的安全内存空间和性能开销,以前的解决方案无法支持广泛使用的回归方法。在这里,我们提出了secret -GWAS-一个快速,隐私保护,人口规模,协作的GWAS工具。我们讨论了几个系统优化,包括流、批处理、数据并行化和减少可信硬件开销,以便在基于Intel sgx的云平台上有效地将线性和逻辑回归扩展到超过1000个处理器内核。此外,我们还保护SECRET-GWAS免受几种硬件侧信道攻击。SECRET-GWAS是一个开源工具,与广泛使用的Hail基因组分析框架一起工作。我们在Azure的机密计算平台上的实验表明,SECRET-GWAS可以在4.5分钟和29分钟内分别对来自10个独立来源的人口规模数据集进行多元线性和逻辑回归GWAS查询。
{"title":"Confidential computing for population-scale genome-wide association studies with SECRET-GWAS","authors":"Jonah Rosenblum, Juechu Dong, Satish Narayanasamy","doi":"10.1038/s43588-025-00856-z","DOIUrl":"10.1038/s43588-025-00856-z","url":null,"abstract":"Genomic data from a single institution lacks global diversity representation, especially for rare variants and diseases. Confidential computing can enable collaborative genome-wide association studies (GWAS) without compromising privacy or accuracy. However, due to limited secure memory space and performance overheads, previous solutions fail to support widely used regression methods. Here we present SECRET-GWAS—a rapid, privacy-preserving, population-scale, collaborative GWAS tool. We discuss several system optimizations, including streaming, batching, data parallelization and reducing trusted hardware overheads to efficiently scale linear and logistic regression to over a thousand processor cores on an Intel SGX-based cloud platform. In addition, we protect SECRET-GWAS against several hardware side-channel attacks. SECRET-GWAS is an open-source tool and works with the widely used Hail genomic analysis framework. Our experiments on Azure’s Confidential Computing platform demonstrate that SECRET-GWAS enables multivariate linear and logistic regression GWAS queries on population-scale datasets from ten independent sources in just 4.5 and 29 minutes, respectively. Secure collaborative genome-wide association studies (GWAS) with population-scale datasets address gaps in genomic data. This work proposes SECRET-GWAS and system optimizations that overcome resource constraints and exploit parallelism, while maintaining privacy and accuracy.","PeriodicalId":74246,"journal":{"name":"Nature computational science","volume":"5 9","pages":"825-835"},"PeriodicalIF":18.3,"publicationDate":"2025-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145056513","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Nature computational science
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1