Pub Date : 2026-01-13DOI: 10.1038/s43588-025-00941-3
Shuyan Wang, Hao Xu, Junyu Wang, Yao Xiao, Shanghao Dai, Junyi Lu, Ruoxuan Cao, Xuejin Chen, Kun Qu
Spatial epigenomics (SE) technologies profile epigenomic landscapes within intact tissues, preserving spatial context and enabling the study of gene regulatory mechanisms in situ. However, current SE datasets typically suffer from low signal detection, substantial noise and extremely sparse peak matrices, which pose considerable challenges for downstream analysis. Here we introduce SPEED (spatial epigenomic data denoising), a deep matrix factorization framework that leverages atlas-level single-cell epigenomic data and spatial context to impute and denoise SE data. In comprehensive benchmarks on both simulated data and real SE tissue datasets, SPEED outperformed five state-of-the-art methods across diverse tissues and technologies. Moreover, SPEED's denoised outputs facilitated downstream analyses such as differential chromatin accessibility analysis, epigenomic spatial domain identification and gene activity inference. Collectively, our results indicate that SPEED is a generalizable tool for improving data quality and biological insights in SE.
{"title":"Denoising spatial epigenomic data via deep matrix factorization.","authors":"Shuyan Wang, Hao Xu, Junyu Wang, Yao Xiao, Shanghao Dai, Junyi Lu, Ruoxuan Cao, Xuejin Chen, Kun Qu","doi":"10.1038/s43588-025-00941-3","DOIUrl":"https://doi.org/10.1038/s43588-025-00941-3","url":null,"abstract":"<p><p>Spatial epigenomics (SE) technologies profile epigenomic landscapes within intact tissues, preserving spatial context and enabling the study of gene regulatory mechanisms in situ. However, current SE datasets typically suffer from low signal detection, substantial noise and extremely sparse peak matrices, which pose considerable challenges for downstream analysis. Here we introduce SPEED (spatial epigenomic data denoising), a deep matrix factorization framework that leverages atlas-level single-cell epigenomic data and spatial context to impute and denoise SE data. In comprehensive benchmarks on both simulated data and real SE tissue datasets, SPEED outperformed five state-of-the-art methods across diverse tissues and technologies. Moreover, SPEED's denoised outputs facilitated downstream analyses such as differential chromatin accessibility analysis, epigenomic spatial domain identification and gene activity inference. Collectively, our results indicate that SPEED is a generalizable tool for improving data quality and biological insights in SE.</p>","PeriodicalId":74246,"journal":{"name":"Nature computational science","volume":" ","pages":""},"PeriodicalIF":18.3,"publicationDate":"2026-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145967973","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-13DOI: 10.1038/s43588-025-00939-x
Yu Zhang, Yichen Yao, Yuanhao Tang, Yuan Cheng, Yinghui Xu, Ying He, Yuan Qi, Li Jin
Conventional epigenetic clocks encounter challenges in generalizability, especially when there are pronounced batch effects between the training and test datasets, restricting their clinical applicability for aging assessment. Here we present MAPLE, a robust computational framework for methylation age and disease-risk prediction through pairwise learning. MAPLE utilizes pairwise learning to discern the relative relationships between two DNA methylation profiles regarding age or disease risk. It effectively identifies aging- or disease-related biological signals while mitigating technical biases in the data. MAPLE outperforms five competing methods, achieving a median absolute error of 1.6 years across 31 benchmark tests from diverse studies, sequencing platforms, data preprocessing methods and tissue types. Furthermore, MAPLE performs well when assessing aging-related disease risk, with mean areas under the curve of 0.97 for disease identification and 0.85 for pre-disease status detection. Overall, we show that MAPLE has great potential for assessing epigenetic age and aging-related disease risk clinically.
{"title":"A robust computational framework for methylation age and disease-risk prediction based on pairwise learning.","authors":"Yu Zhang, Yichen Yao, Yuanhao Tang, Yuan Cheng, Yinghui Xu, Ying He, Yuan Qi, Li Jin","doi":"10.1038/s43588-025-00939-x","DOIUrl":"https://doi.org/10.1038/s43588-025-00939-x","url":null,"abstract":"<p><p>Conventional epigenetic clocks encounter challenges in generalizability, especially when there are pronounced batch effects between the training and test datasets, restricting their clinical applicability for aging assessment. Here we present MAPLE, a robust computational framework for methylation age and disease-risk prediction through pairwise learning. MAPLE utilizes pairwise learning to discern the relative relationships between two DNA methylation profiles regarding age or disease risk. It effectively identifies aging- or disease-related biological signals while mitigating technical biases in the data. MAPLE outperforms five competing methods, achieving a median absolute error of 1.6 years across 31 benchmark tests from diverse studies, sequencing platforms, data preprocessing methods and tissue types. Furthermore, MAPLE performs well when assessing aging-related disease risk, with mean areas under the curve of 0.97 for disease identification and 0.85 for pre-disease status detection. Overall, we show that MAPLE has great potential for assessing epigenetic age and aging-related disease risk clinically.</p>","PeriodicalId":74246,"journal":{"name":"Nature computational science","volume":" ","pages":""},"PeriodicalIF":18.3,"publicationDate":"2026-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145967968","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Spatial transcriptomics has transformed the mapping of gene expression within intact tissues, yet current sequencing-based platforms are limited by coarse spot-level resolution and sparse sampling that leaves large interspot regions unmeasured. Here we introduce PanoSpace, a computational framework that integrates low-resolution spatial transcriptomics with high-resolution histology and matched single-cell RNA sequencing to reconstruct a continuous, single-cell-level map across entire tissue sections. Originally developed for tumors, PanoSpace accurately reconstructs cellular locations, cell identities and gene expression profiles, enabling detailed characterization of intracell-type heterogeneity and spatially organized cell-cell interactions. Application to breast and prostate cancers reveals complex cellular architectures and tumor microenvironment dynamics mediated by cancer-associated fibroblasts. Thanks to its modular design, PanoSpace can be seamlessly adapted to noncancerous tissues, as demonstrated by precise spatial reconstruction in mouse brain. Together, these results demonstrate that PanoSpace enables comprehensive spatial transcriptomic analysis and facilitates biological discovery.
{"title":"Unlocking single-cell level and continuous whole-slide insights in spatial transcriptomics with PanoSpace.","authors":"Hui-Feng He, Pai Peng, Shi-Tong Yang, Meng-Guo Wang, Xiao-Fei Zhang, Luonan Chen","doi":"10.1038/s43588-025-00938-y","DOIUrl":"https://doi.org/10.1038/s43588-025-00938-y","url":null,"abstract":"<p><p>Spatial transcriptomics has transformed the mapping of gene expression within intact tissues, yet current sequencing-based platforms are limited by coarse spot-level resolution and sparse sampling that leaves large interspot regions unmeasured. Here we introduce PanoSpace, a computational framework that integrates low-resolution spatial transcriptomics with high-resolution histology and matched single-cell RNA sequencing to reconstruct a continuous, single-cell-level map across entire tissue sections. Originally developed for tumors, PanoSpace accurately reconstructs cellular locations, cell identities and gene expression profiles, enabling detailed characterization of intracell-type heterogeneity and spatially organized cell-cell interactions. Application to breast and prostate cancers reveals complex cellular architectures and tumor microenvironment dynamics mediated by cancer-associated fibroblasts. Thanks to its modular design, PanoSpace can be seamlessly adapted to noncancerous tissues, as demonstrated by precise spatial reconstruction in mouse brain. Together, these results demonstrate that PanoSpace enables comprehensive spatial transcriptomic analysis and facilitates biological discovery.</p>","PeriodicalId":74246,"journal":{"name":"Nature computational science","volume":" ","pages":""},"PeriodicalIF":18.3,"publicationDate":"2026-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145914034","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-05DOI: 10.1038/s43588-025-00934-2
Lulu Yan, Dongyan Zhang, Xiaoqiang Sun
In multicellular systems, cell fate determination emerges from the integration of intracellular signaling and intercellular communication. Spatial transcriptomics (ST) provides opportunities to elucidate these regulatory processes, yet inferring the spatiotemporal dynamics of cell state transitions (CSTs) governed by cell-cell communication (CCC) remains a challenge. Here we introduce CCCvelo to reconstruct CCC-driven CST dynamics by jointly optimizing a dynamic CCC signaling network and a latent CST clock. CCCvelo formulates a unified multiscale nonlinear kinetic model that integrates intercellular ligand-receptor signaling gradients with intracellular transcription-factor activation cascades to capture gene expression dynamics encoding CSTs. Moreover, we devise PINN-CELL, a physics-informed neural-network-based coevolution learning algorithm, which simultaneously optimizes model parameters and pseudotemporal ordering. Application of CCCvelo to high-resolution ST datasets, including mouse cortex, embryonic trunk development and human prostate cancer datasets, demonstrates its ability to successfully recover known morphogenetic trajectories while uncovering dynamic CCC signaling rewiring that orchestrates CST progression.
{"title":"Decoding cell state transitions driven by dynamic cell-cell communication in spatial transcriptomics.","authors":"Lulu Yan, Dongyan Zhang, Xiaoqiang Sun","doi":"10.1038/s43588-025-00934-2","DOIUrl":"https://doi.org/10.1038/s43588-025-00934-2","url":null,"abstract":"<p><p>In multicellular systems, cell fate determination emerges from the integration of intracellular signaling and intercellular communication. Spatial transcriptomics (ST) provides opportunities to elucidate these regulatory processes, yet inferring the spatiotemporal dynamics of cell state transitions (CSTs) governed by cell-cell communication (CCC) remains a challenge. Here we introduce CCCvelo to reconstruct CCC-driven CST dynamics by jointly optimizing a dynamic CCC signaling network and a latent CST clock. CCCvelo formulates a unified multiscale nonlinear kinetic model that integrates intercellular ligand-receptor signaling gradients with intracellular transcription-factor activation cascades to capture gene expression dynamics encoding CSTs. Moreover, we devise PINN-CELL, a physics-informed neural-network-based coevolution learning algorithm, which simultaneously optimizes model parameters and pseudotemporal ordering. Application of CCCvelo to high-resolution ST datasets, including mouse cortex, embryonic trunk development and human prostate cancer datasets, demonstrates its ability to successfully recover known morphogenetic trajectories while uncovering dynamic CCC signaling rewiring that orchestrates CST progression.</p>","PeriodicalId":74246,"journal":{"name":"Nature computational science","volume":" ","pages":""},"PeriodicalIF":18.3,"publicationDate":"2026-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145907321","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-02DOI: 10.1038/s43588-025-00919-1
Jeheon Woo, Seonghwan Kim, Jun Hyeong Kim, Woo Youn Kim
Here we introduce a framework for molecular structure optimization using a denoising model on a physics-informed Riemannian manifold (R-DM). Unlike conventional approaches operating in Euclidean space, our method leverages a Riemannian metric that better aligns with molecular energy change, enabling more robust modeling of potential energy surfaces. By incorporating internal coordinates reflective of energetic properties, R-DM achieves chemical accuracy with an energy error below 1 kcal mol-1. Comparative evaluations on QM9, QM7-X and GEOM datasets demonstrate improvements in both structural and energetic accuracy, surpassing conventional Euclidean-based denoising models. This approach highlights the potential of physics-informed coordinates for tackling complex molecular optimization problems, with implications for tasks in computational chemistry and materials science.
{"title":"Riemannian denoising model for molecular structure optimization with chemical accuracy.","authors":"Jeheon Woo, Seonghwan Kim, Jun Hyeong Kim, Woo Youn Kim","doi":"10.1038/s43588-025-00919-1","DOIUrl":"https://doi.org/10.1038/s43588-025-00919-1","url":null,"abstract":"<p><p>Here we introduce a framework for molecular structure optimization using a denoising model on a physics-informed Riemannian manifold (R-DM). Unlike conventional approaches operating in Euclidean space, our method leverages a Riemannian metric that better aligns with molecular energy change, enabling more robust modeling of potential energy surfaces. By incorporating internal coordinates reflective of energetic properties, R-DM achieves chemical accuracy with an energy error below 1 kcal mol<sup>-1</sup>. Comparative evaluations on QM9, QM7-X and GEOM datasets demonstrate improvements in both structural and energetic accuracy, surpassing conventional Euclidean-based denoising models. This approach highlights the potential of physics-informed coordinates for tackling complex molecular optimization problems, with implications for tasks in computational chemistry and materials science.</p>","PeriodicalId":74246,"journal":{"name":"Nature computational science","volume":" ","pages":""},"PeriodicalIF":18.3,"publicationDate":"2026-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145892627","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-02DOI: 10.1038/s43588-025-00915-5
Shenghui Wu, Zhiwei Song, Xiang Zhang, Yifan Huang, Shuhang Chen, Xiang Shen, Jieyuan Tan, Mingdong Li, Ziyi Wang, Yujun Chen, Kai Liu, Dario Farina, Jose C Principe, Yiwen Wang
Prediction models that generate neuronal spikes from upstream neural activities offer a promising way to re-establish neural functional connectivity. Traditional methods train these models by supervised learning, which requires downstream recordings as ground truth. However, functional downstream activity cannot be recorded when neurological disorders exist. Here we introduce a reinforcement learning (RL)-based point process framework to generate spike trains that directly maximize behavior-level rewards, thus bypassing downstream recordings. This yields a generative spike model that directly transforms upstream activity into spike patterns modulated to desired behavior. We show that these RL-based generative models produce movement-modulated spike patterns akin to downstream recordings from healthy subjects, providing a biomimetic spike encoding framework. This RL framework outperforms existing methods and demonstrates a strong adaptation capability across different decoder settings, highlighting its potential for neural prostheses in restoring transregional communication with biomimetic cortical stimulation.
{"title":"A generative spike prediction model using behavioral reinforcement for re-establishing neural functional connectivity.","authors":"Shenghui Wu, Zhiwei Song, Xiang Zhang, Yifan Huang, Shuhang Chen, Xiang Shen, Jieyuan Tan, Mingdong Li, Ziyi Wang, Yujun Chen, Kai Liu, Dario Farina, Jose C Principe, Yiwen Wang","doi":"10.1038/s43588-025-00915-5","DOIUrl":"https://doi.org/10.1038/s43588-025-00915-5","url":null,"abstract":"<p><p>Prediction models that generate neuronal spikes from upstream neural activities offer a promising way to re-establish neural functional connectivity. Traditional methods train these models by supervised learning, which requires downstream recordings as ground truth. However, functional downstream activity cannot be recorded when neurological disorders exist. Here we introduce a reinforcement learning (RL)-based point process framework to generate spike trains that directly maximize behavior-level rewards, thus bypassing downstream recordings. This yields a generative spike model that directly transforms upstream activity into spike patterns modulated to desired behavior. We show that these RL-based generative models produce movement-modulated spike patterns akin to downstream recordings from healthy subjects, providing a biomimetic spike encoding framework. This RL framework outperforms existing methods and demonstrates a strong adaptation capability across different decoder settings, highlighting its potential for neural prostheses in restoring transregional communication with biomimetic cortical stimulation.</p>","PeriodicalId":74246,"journal":{"name":"Nature computational science","volume":" ","pages":""},"PeriodicalIF":18.3,"publicationDate":"2026-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145893534","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-31DOI: 10.1038/s43588-025-00908-4
Tong Zhao (, ), Yan Zeng
Digital twins of self-driving chemistry laboratories may help reduce reliance on costly real-world experimentation and enable the testing of hypothetical automated workflows in silico.
{"title":"Digital twins for self-driving chemistry laboratories","authors":"Tong Zhao \u0000 (, ), Yan Zeng","doi":"10.1038/s43588-025-00908-4","DOIUrl":"10.1038/s43588-025-00908-4","url":null,"abstract":"Digital twins of self-driving chemistry laboratories may help reduce reliance on costly real-world experimentation and enable the testing of hypothetical automated workflows in silico.","PeriodicalId":74246,"journal":{"name":"Nature computational science","volume":"6 1","pages":"15-16"},"PeriodicalIF":18.3,"publicationDate":"2025-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145879561","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-31DOI: 10.1038/s43588-025-00924-4
Kourosh Darvish, Arjun Sohal, Abhijoy Mandal, Hatem Fakhruldeen, Nikola Radulov, Zhengxue Zhou, Satheeshkumar Veeramani, Joshua Choi, Sijie Han, Brayden Zhang, Jeeyeoun Chae, Alex Wright, Yijie Wang, Hossein Darvish, Yuchi Zhao, Gary Tom, Han Hao, Miroslav Bogdanovic, Gabriella Pizzuto, Andrew I. Cooper, Alán Aspuru Guzik, Florian Shkurti, Animesh Garg
Accelerated materials discovery is critical for addressing global challenges. However, developing new laboratory workflows relies heavily on real-world experimental trials, and this can hinder scalability because of the need for numerous physical make-and-test iterations. Here we present MATTERIX, a multiscale, graphics processing unit-accelerated robotic simulation framework designed to create high-fidelity digital twins of chemistry laboratories, thus accelerating workflow development. This multiscale digital twin simulates robotic physical manipulation, powder and liquid dynamics, device functionalities, heat transfer and basic chemical reaction kinetics. This is enabled by integrating realistic physics simulation and photorealistic rendering with a modular graphics processing unit-accelerated semantics engine, which models logical states and continuous behaviors to simulate chemistry workflows across different levels of abstraction. MATTERIX streamlines the creation of digital twin environments through open-source asset libraries and interfaces, while enabling flexible workflow design via hierarchical plan definition and a modular skill library that incorporates learning-based methods. Our approach demonstrates sim-to-real transfer in robotic chemistry setups, reducing reliance on costly real-world experiments and enabling the testing of hypothetical automated workflows in silico. MATTERIX, a multiscale graphics processing unit-accelerated framework for high-fidelity digital twins and workflows of chemistry laboratories, is presented, simulating robot and device operation, fluids and powders, and processes such as heat transfer and chemical kinetics.
{"title":"MATTERIX: toward a digital twin for robotics-assisted chemistry laboratory automation","authors":"Kourosh Darvish, Arjun Sohal, Abhijoy Mandal, Hatem Fakhruldeen, Nikola Radulov, Zhengxue Zhou, Satheeshkumar Veeramani, Joshua Choi, Sijie Han, Brayden Zhang, Jeeyeoun Chae, Alex Wright, Yijie Wang, Hossein Darvish, Yuchi Zhao, Gary Tom, Han Hao, Miroslav Bogdanovic, Gabriella Pizzuto, Andrew I. Cooper, Alán Aspuru Guzik, Florian Shkurti, Animesh Garg","doi":"10.1038/s43588-025-00924-4","DOIUrl":"10.1038/s43588-025-00924-4","url":null,"abstract":"Accelerated materials discovery is critical for addressing global challenges. However, developing new laboratory workflows relies heavily on real-world experimental trials, and this can hinder scalability because of the need for numerous physical make-and-test iterations. Here we present MATTERIX, a multiscale, graphics processing unit-accelerated robotic simulation framework designed to create high-fidelity digital twins of chemistry laboratories, thus accelerating workflow development. This multiscale digital twin simulates robotic physical manipulation, powder and liquid dynamics, device functionalities, heat transfer and basic chemical reaction kinetics. This is enabled by integrating realistic physics simulation and photorealistic rendering with a modular graphics processing unit-accelerated semantics engine, which models logical states and continuous behaviors to simulate chemistry workflows across different levels of abstraction. MATTERIX streamlines the creation of digital twin environments through open-source asset libraries and interfaces, while enabling flexible workflow design via hierarchical plan definition and a modular skill library that incorporates learning-based methods. Our approach demonstrates sim-to-real transfer in robotic chemistry setups, reducing reliance on costly real-world experiments and enabling the testing of hypothetical automated workflows in silico. MATTERIX, a multiscale graphics processing unit-accelerated framework for high-fidelity digital twins and workflows of chemistry laboratories, is presented, simulating robot and device operation, fluids and powders, and processes such as heat transfer and chemical kinetics.","PeriodicalId":74246,"journal":{"name":"Nature computational science","volume":"6 1","pages":"67-82"},"PeriodicalIF":18.3,"publicationDate":"2025-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145879554","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}