Pub Date : 2026-02-18DOI: 10.1186/s13059-026-03973-8
Alexandre Segers, Jeroen Gilis, Mattias Van Heetvelde, Davide Risso, Elfride De Baere, Lieven Clement
RNA-seq data analysis relies on many different tools, each tailored to specific applications and coming with unique assumptions and limitations. Indeed, tools for differential transcript usage or rare disease diagnosis through splicing and expression outliers, either lack performance, discard information, or do not scale to large datasets. We show that replacing normalization offsets unlocks bulk RNA-seq tools for differential usage and aberrant splicing, providing a single framework for various short- and long-read applications. We then introduce saseR, a tool for prioritizing expression and usage outliers that is much faster than state-of-the-art methods, and significantly outperforms these for aberrant splicing detection.
{"title":"saseR: juggling offsets unlocks RNA-seq tools for fast and scalable differential usage, aberrant splicing and expression retrieval.","authors":"Alexandre Segers, Jeroen Gilis, Mattias Van Heetvelde, Davide Risso, Elfride De Baere, Lieven Clement","doi":"10.1186/s13059-026-03973-8","DOIUrl":"10.1186/s13059-026-03973-8","url":null,"abstract":"<p><p>RNA-seq data analysis relies on many different tools, each tailored to specific applications and coming with unique assumptions and limitations. Indeed, tools for differential transcript usage or rare disease diagnosis through splicing and expression outliers, either lack performance, discard information, or do not scale to large datasets. We show that replacing normalization offsets unlocks bulk RNA-seq tools for differential usage and aberrant splicing, providing a single framework for various short- and long-read applications. We then introduce saseR, a tool for prioritizing expression and usage outliers that is much faster than state-of-the-art methods, and significantly outperforms these for aberrant splicing detection.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":" ","pages":""},"PeriodicalIF":12.3,"publicationDate":"2026-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146221485","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-18DOI: 10.1186/s13059-026-04001-5
Feng Yan, Pedro L Baldoni, James Lancaster, Matthew E Ritchie, Mathew G Lewsey, Quentin Gouil, Nadia M Davidson
Introduction: Recently, de novo transcriptome assembly methods have been developed to utilise long-read data in cases where a reference genome is unavailable, such as in non-model organisms. Despite the potential of these tools, there remains a lack of benchmarking and established protocols for optimal reference-free, long-read transcriptome assembly and differential expression analysis.
Results: Here, we evaluate the long-read de novo transcriptome assembly tools, RATTLE, RNA-Bloom2 and isONform, and compare their performance to one of the leading short-read assemblers, Trinity. We assess various metrics across a range of datasets, which include simulated data and spike-in sequin transcripts, where ground truth is known, and real data from human and pea (Pisum sativum) samples, using a reference-based approach to define truth. To represent contemporary analysis scenarios, the datasets cover depths from 6 to 60 million reads, Oxford Nanopore Technologies (ONT) cDNA, ONT direct RNA and Pacific Biosciences (PacBio) 10 × single-cell sequencing. Critically, we assess the downstream impact of assembly choice on the detection of differential gene and transcript expression.
Conclusions: Our results confirm that long reads generate longer assembled transcripts than short-reads for reference-free analysis, though limitations remain compared to reference-guided approaches, and suggest scope for improved accuracy and reduced redundancy. Of the de novo pipelines, RNA-Bloom2, coupled with Corset for transcript clustering, was the best performing in terms of both accuracy and computational efficiency. Our findings offer guidance when selecting the most effective strategy for long-read differential expression analysis, when a high-quality reference genome is unavailable.
{"title":"A comprehensive evaluation of long-read de novo transcriptome assembly.","authors":"Feng Yan, Pedro L Baldoni, James Lancaster, Matthew E Ritchie, Mathew G Lewsey, Quentin Gouil, Nadia M Davidson","doi":"10.1186/s13059-026-04001-5","DOIUrl":"https://doi.org/10.1186/s13059-026-04001-5","url":null,"abstract":"<p><strong>Introduction: </strong>Recently, de novo transcriptome assembly methods have been developed to utilise long-read data in cases where a reference genome is unavailable, such as in non-model organisms. Despite the potential of these tools, there remains a lack of benchmarking and established protocols for optimal reference-free, long-read transcriptome assembly and differential expression analysis.</p><p><strong>Results: </strong>Here, we evaluate the long-read de novo transcriptome assembly tools, RATTLE, RNA-Bloom2 and isONform, and compare their performance to one of the leading short-read assemblers, Trinity. We assess various metrics across a range of datasets, which include simulated data and spike-in sequin transcripts, where ground truth is known, and real data from human and pea (Pisum sativum) samples, using a reference-based approach to define truth. To represent contemporary analysis scenarios, the datasets cover depths from 6 to 60 million reads, Oxford Nanopore Technologies (ONT) cDNA, ONT direct RNA and Pacific Biosciences (PacBio) 10 × single-cell sequencing. Critically, we assess the downstream impact of assembly choice on the detection of differential gene and transcript expression.</p><p><strong>Conclusions: </strong>Our results confirm that long reads generate longer assembled transcripts than short-reads for reference-free analysis, though limitations remain compared to reference-guided approaches, and suggest scope for improved accuracy and reduced redundancy. Of the de novo pipelines, RNA-Bloom2, coupled with Corset for transcript clustering, was the best performing in terms of both accuracy and computational efficiency. Our findings offer guidance when selecting the most effective strategy for long-read differential expression analysis, when a high-quality reference genome is unavailable.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":" ","pages":""},"PeriodicalIF":12.3,"publicationDate":"2026-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146221496","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-10DOI: 10.1186/s13059-026-03990-7
Zengdong Tan, Yunhao Liu, Xiaowei Wu, Jingyan Song, Bingjie Lu, Yongqi Chen, Ruyi Fan, Jie Chen, Wanneng Yang, Hui Feng, Liang Guo, Xuan Yao
Background: Brassica napus (B. napus) is globally important oilseed crop, yet traditional approaches for phenotyping of seed traits are labor-intensive and destructive.
Results: Here, we establish a non-destructive analytical framework integrating hyperspectral imaging (HSI) with machine learning for characterizing seed-related traits. We collect HSI data from seeds of 393 B. napus accessions over two consecutive years, generating 1,944 spectral indices per sample. We identify significant correlations between 1,293 hyperspectral indices and 956 seed metabolites. Flavonoid metabolites exhibit the most consistent interannual correlations with hyperspectral indices. Systematic benchmarking of 19 machine learning algorithms identifies nine optimal models for metabolite prediction, with 73.44% of metabolites achieving significant associations. Hyperspectral indices effectively predict nine key seed-related traits, including oil content, seed coat content, glucosinolate content and six fatty acid components. Genome-wide association studies (GWAS) of hyperspectral indices uncover three stable quantitative trait loci (QTL) hotspots, qHSI.hotA09, qHSI.hotA05 and qHSI.hotC05, that co-localize with QTLs for seed oil and seed coat content. Integration of GWAS with POCKET prioritization identifies BnaA09.MYB52 and BnaC05.PMT6 as candidate genes for the hotspots, qHSI.hotA09 and qHSI.hotC05, respectively. Functional validation using mutants demonstrates that both genes significantly influence seed flavonoid metabolites and hyperspectral profiles. BnaPMT6 is characterized as a novel positive regulator of seed coat content, similar to BnaMYB52.
Conclusions: This study establishes a novel, non-destructive approach for seed traits and metabolite assessment in B. napus seeds. It also provides a theoretical foundation and genetic basis for breeding of B. napus varieties with high oil content and improved nutritional quality.
背景:甘蓝型油菜(Brassica napus, B. napus)是全球重要的油料作物,但传统的种子性状表型分析方法是劳动密集型和破坏性的。结果:在这里,我们建立了一个非破坏性的分析框架,将高光谱成像(HSI)与机器学习相结合,用于表征种子相关性状。我们连续两年收集了393份甘蓝型油菜种子的HSI数据,每个样本生成了1944个光谱指数。我们发现1293个高光谱指数与956个种子代谢物之间存在显著相关性。黄酮类代谢产物与高光谱指数的年际相关性最为一致。对19种机器学习算法进行系统基准测试,确定了9种代谢物预测的最佳模型,其中73.44%的代谢物实现了显著关联。高光谱指数能有效预测种子相关的9个关键性状,包括含油量、种皮含量、硫代葡萄糖苷含量和6种脂肪酸组分。高光谱指数的全基因组关联研究(GWAS)揭示了三个稳定的数量性状位点(QTL)热点。hotA09 qHSI。hotA05和qHSI。hotC05,与种子油和种皮含量的qtl共定位。GWAS与POCKET优先级的集成确定了BnaA09。MYB52和BnaC05。PMT6作为热点qHSI的候选基因。hotA09和qHSI。分别hotC05。突变体的功能验证表明,这两个基因显著影响种子类黄酮代谢产物和高光谱特征。与BnaMYB52类似,BnaPMT6被认为是种皮含量的一种新的正调节因子。结论:本研究建立了一种新的、无损的甘蓝型油菜种子性状和代谢物评价方法。这也为选育高含油量、高营养品质的甘蓝型油菜品种提供了理论基础和遗传基础。
{"title":"Dissecting the genetic architecture of seed-related traits in Brassica napus by integrating multi-omics analysis and VIS-NIR hyperspectral imaging.","authors":"Zengdong Tan, Yunhao Liu, Xiaowei Wu, Jingyan Song, Bingjie Lu, Yongqi Chen, Ruyi Fan, Jie Chen, Wanneng Yang, Hui Feng, Liang Guo, Xuan Yao","doi":"10.1186/s13059-026-03990-7","DOIUrl":"10.1186/s13059-026-03990-7","url":null,"abstract":"<p><strong>Background: </strong>Brassica napus (B. napus) is globally important oilseed crop, yet traditional approaches for phenotyping of seed traits are labor-intensive and destructive.</p><p><strong>Results: </strong>Here, we establish a non-destructive analytical framework integrating hyperspectral imaging (HSI) with machine learning for characterizing seed-related traits. We collect HSI data from seeds of 393 B. napus accessions over two consecutive years, generating 1,944 spectral indices per sample. We identify significant correlations between 1,293 hyperspectral indices and 956 seed metabolites. Flavonoid metabolites exhibit the most consistent interannual correlations with hyperspectral indices. Systematic benchmarking of 19 machine learning algorithms identifies nine optimal models for metabolite prediction, with 73.44% of metabolites achieving significant associations. Hyperspectral indices effectively predict nine key seed-related traits, including oil content, seed coat content, glucosinolate content and six fatty acid components. Genome-wide association studies (GWAS) of hyperspectral indices uncover three stable quantitative trait loci (QTL) hotspots, qHSI.hotA09, qHSI.hotA05 and qHSI.hotC05, that co-localize with QTLs for seed oil and seed coat content. Integration of GWAS with POCKET prioritization identifies BnaA09.MYB52 and BnaC05.PMT6 as candidate genes for the hotspots, qHSI.hotA09 and qHSI.hotC05, respectively. Functional validation using mutants demonstrates that both genes significantly influence seed flavonoid metabolites and hyperspectral profiles. BnaPMT6 is characterized as a novel positive regulator of seed coat content, similar to BnaMYB52.</p><p><strong>Conclusions: </strong>This study establishes a novel, non-destructive approach for seed traits and metabolite assessment in B. napus seeds. It also provides a theoretical foundation and genetic basis for breeding of B. napus varieties with high oil content and improved nutritional quality.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":" ","pages":""},"PeriodicalIF":12.3,"publicationDate":"2026-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12990562/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146158851","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-10DOI: 10.1186/s13059-026-03997-0
Peng Xu, Hantao Zhang, Siyao Zhu, Yimeng Kong
Detecting senescent cells from single-cell RNA-seq data remains challenging due to the weak and non-specific expression of canonical markers. Here, we demonstrate that simple expansion of these low-signal marker sets does not improve detection accuracy. To address this limitation, we develop ICE (Imputation-based Cell Enrichment), a computational framework that integrates expression imputation with marker refinement. ICE improves the detection of senescent cells in pancreatic β cells and microglia from Alzheimer's disease samples. This tool enables reliable identification of senescence-associated cell populations, facilitating more detailed analyses of their heterogeneity and temporal dynamics across human tissues and disease contexts.
{"title":"ICE: robust detection of cellular senescence from weak single-cell signatures using imputation-based marker refinement.","authors":"Peng Xu, Hantao Zhang, Siyao Zhu, Yimeng Kong","doi":"10.1186/s13059-026-03997-0","DOIUrl":"10.1186/s13059-026-03997-0","url":null,"abstract":"<p><p>Detecting senescent cells from single-cell RNA-seq data remains challenging due to the weak and non-specific expression of canonical markers. Here, we demonstrate that simple expansion of these low-signal marker sets does not improve detection accuracy. To address this limitation, we develop ICE (Imputation-based Cell Enrichment), a computational framework that integrates expression imputation with marker refinement. ICE improves the detection of senescent cells in pancreatic β cells and microglia from Alzheimer's disease samples. This tool enables reliable identification of senescence-associated cell populations, facilitating more detailed analyses of their heterogeneity and temporal dynamics across human tissues and disease contexts.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":" ","pages":""},"PeriodicalIF":12.3,"publicationDate":"2026-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12990438/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146158839","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-10DOI: 10.1186/s13059-026-03979-2
Moonia Ammari, Aanchal Choudhary, Mark Zander
Elucidating transcription factor (TF) function is essential for advancing our understanding and manipulation of the mechanisms that orchestrate gene expression programs underlying plant growth, development, and resilience. Defining the genomic binding sites of a TF-its cistrome-is particularly critical, as it reveals where TF activity can occur and, when integrated with gene expression and chromatin landscape data, delineates the full scope of TF function. In this review, we highlight the biological factors that shape plant cistromes and, importantly, the potential alterations in cistrome composition that may arise during experimental mapping. We further emphasize recent methodological advances now available to the plant science community and outline future directions for the emerging field of plant cistromics.
{"title":"Many roads lead to a plant cistrome: mapping and interpreting transcription factor binding in plants.","authors":"Moonia Ammari, Aanchal Choudhary, Mark Zander","doi":"10.1186/s13059-026-03979-2","DOIUrl":"10.1186/s13059-026-03979-2","url":null,"abstract":"<p><p>Elucidating transcription factor (TF) function is essential for advancing our understanding and manipulation of the mechanisms that orchestrate gene expression programs underlying plant growth, development, and resilience. Defining the genomic binding sites of a TF-its cistrome-is particularly critical, as it reveals where TF activity can occur and, when integrated with gene expression and chromatin landscape data, delineates the full scope of TF function. In this review, we highlight the biological factors that shape plant cistromes and, importantly, the potential alterations in cistrome composition that may arise during experimental mapping. We further emphasize recent methodological advances now available to the plant science community and outline future directions for the emerging field of plant cistromics.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":" ","pages":""},"PeriodicalIF":12.3,"publicationDate":"2026-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12990482/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146158842","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-06DOI: 10.1186/s13059-026-03983-6
Xiang Li, Yitian Fang, You Wu, Xiang Yu
Antifungal peptides (AFPs) are crucial for plant defense against biotic stress. Yet, no artificial intelligence tool specifically classifies plant AFPs. To fill this gap, we develop FungiGuard, which integrates Random Forest, Long Short-Term Memory, and attention mechanisms to identify AFPs using functionally annotated plant small peptides. FungiGuard outperforms existing generalized AFP model in classifying plant AFPs, and detects candidate AFPs in Arabidopsis, wheat, rice, and maize. It also discovers novel AFPs through randomly generated sequences. Experimental validation confirms the antifungal activity of candidate AFP against Botrytis cinerea. This tool deepens plant AFP understanding and facilitates novel AFP discovery.
{"title":"FungiGuard: identification of plant antifungal peptides with artificial intelligence.","authors":"Xiang Li, Yitian Fang, You Wu, Xiang Yu","doi":"10.1186/s13059-026-03983-6","DOIUrl":"10.1186/s13059-026-03983-6","url":null,"abstract":"<p><p>Antifungal peptides (AFPs) are crucial for plant defense against biotic stress. Yet, no artificial intelligence tool specifically classifies plant AFPs. To fill this gap, we develop FungiGuard, which integrates Random Forest, Long Short-Term Memory, and attention mechanisms to identify AFPs using functionally annotated plant small peptides. FungiGuard outperforms existing generalized AFP model in classifying plant AFPs, and detects candidate AFPs in Arabidopsis, wheat, rice, and maize. It also discovers novel AFPs through randomly generated sequences. Experimental validation confirms the antifungal activity of candidate AFP against Botrytis cinerea. This tool deepens plant AFP understanding and facilitates novel AFP discovery.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":" ","pages":""},"PeriodicalIF":12.3,"publicationDate":"2026-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12977696/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146127073","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-05DOI: 10.1186/s13059-026-03985-4
Hayoung Cho, Michael L Nielsen, Jesper V Olsen
Background: Glucocorticoids are corticosteroid hormones that are commonly used for treating systemic inflammatory diseases and acute infections. Immunosuppressive effects of glucocorticoids have been studied in many cell types, particularly macrophages and T cells. Despite the importance and abundance of neutrophils in the human immune system, glucocorticoid responses remain understudied in neutrophils.
Results: Here, we perform quantitative mass spectrometry-based proteomics of primary neutrophils and neutrophil-like cells differentiated from human HL-60 promyelocyte cells. Primary neutrophils exhibited CK2 kinase activation and increase phosphorylation of HSP90 following 2-h incubation, highlighting potential effects of short-term ex vivo handling. Proteome and flow cytometry analysis show that neutrophil-like cells share features of neutrophils. Quantitative proteomics and phosphoproteomics of neutrophil-like cells treated with two synthetic glucocorticoid compounds, the clinical drugs dexamethasone and prednisolone, identify higher numbers of significantly regulated proteins and phosphosites compared to parental HL-60 cells. Glucocorticoid treatments modulated toll-like receptor signaling and CXCR4 serine phosphorylation. In addition, we identify RIPOR2 as a glucocorticoid-regulated protein associated with Rho GTPase signaling networks and actin cytoskeletal remodeling in neutrophils and neutrophil-like cells, though its exact functional role requires further investigation.
Conclusions: Our results not only reveal unconventional regulatory mechanisms of glucocorticoids in the human immune system but also provide valuable resources for discovering novel glucocorticoid-responsive protein targets in neutrophils.
{"title":"Quantitative proteomics and phosphoproteomics reveal glucocorticoid stimulation of TLR and Rho GTPase signaling in neutrophil-like cells.","authors":"Hayoung Cho, Michael L Nielsen, Jesper V Olsen","doi":"10.1186/s13059-026-03985-4","DOIUrl":"10.1186/s13059-026-03985-4","url":null,"abstract":"<p><strong>Background: </strong>Glucocorticoids are corticosteroid hormones that are commonly used for treating systemic inflammatory diseases and acute infections. Immunosuppressive effects of glucocorticoids have been studied in many cell types, particularly macrophages and T cells. Despite the importance and abundance of neutrophils in the human immune system, glucocorticoid responses remain understudied in neutrophils.</p><p><strong>Results: </strong>Here, we perform quantitative mass spectrometry-based proteomics of primary neutrophils and neutrophil-like cells differentiated from human HL-60 promyelocyte cells. Primary neutrophils exhibited CK2 kinase activation and increase phosphorylation of HSP90 following 2-h incubation, highlighting potential effects of short-term ex vivo handling. Proteome and flow cytometry analysis show that neutrophil-like cells share features of neutrophils. Quantitative proteomics and phosphoproteomics of neutrophil-like cells treated with two synthetic glucocorticoid compounds, the clinical drugs dexamethasone and prednisolone, identify higher numbers of significantly regulated proteins and phosphosites compared to parental HL-60 cells. Glucocorticoid treatments modulated toll-like receptor signaling and CXCR4 serine phosphorylation. In addition, we identify RIPOR2 as a glucocorticoid-regulated protein associated with Rho GTPase signaling networks and actin cytoskeletal remodeling in neutrophils and neutrophil-like cells, though its exact functional role requires further investigation.</p><p><strong>Conclusions: </strong>Our results not only reveal unconventional regulatory mechanisms of glucocorticoids in the human immune system but also provide valuable resources for discovering novel glucocorticoid-responsive protein targets in neutrophils.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":" ","pages":""},"PeriodicalIF":12.3,"publicationDate":"2026-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12964602/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146127087","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-04DOI: 10.1186/s13059-026-03952-z
Robin Grolaux, Macsue Jacques, Bernadette Jones-Freeman, Steve Horvath, Andrew Teschendorff, Nir Eynon
Background: Aging is a multi-modal process, leaving distinct molecular signatures across the epigenome. DNA methylation is among the most robust biomarkers of biological aging, yet most studies assume linear age relationships and analyze mixed-sex cohorts, overlooking known sex differences. Such approaches risk obscuring critical nonlinear transitions and sex-specific trajectories.
Results: We develop SNITCH, a computational framework to detect complex nonlinear methylation trajectories and disentangle shared from sex-divergent patterns. Applied to the array-derived whole-blood methylomes from 252 females and 246 males (ages 19-90 years), SNITCH reveals convergent and divergent epigenetic aging pathways independent of immune cell composition. Nonlinear trajectories are enriched for developmental transcription factor motifs, including NF1/CTF and REST, with known oncogenic roles. Importantly, a female-specific nonlinear cluster is prospectively associated with cancer onset and systemic inflammation in an independent cohort, nominating clinically relevant biomarkers. We replicate the analysis in an additional cohort and highlight consistent nonlinear trajectories.
Conclusions: Our results uncover sex-specific, nonlinear aging programs that capture the dynamics of epigenetic change beyond linear models. These findings provide potential candidate biomarkers for early disease risk and advance understanding of how aging trajectories diverge between sexes.
{"title":"Sex-specific nonlinear DNA methylation aging trajectories reveal biomarkers of cancer risk and inflammation.","authors":"Robin Grolaux, Macsue Jacques, Bernadette Jones-Freeman, Steve Horvath, Andrew Teschendorff, Nir Eynon","doi":"10.1186/s13059-026-03952-z","DOIUrl":"10.1186/s13059-026-03952-z","url":null,"abstract":"<p><strong>Background: </strong>Aging is a multi-modal process, leaving distinct molecular signatures across the epigenome. DNA methylation is among the most robust biomarkers of biological aging, yet most studies assume linear age relationships and analyze mixed-sex cohorts, overlooking known sex differences. Such approaches risk obscuring critical nonlinear transitions and sex-specific trajectories.</p><p><strong>Results: </strong>We develop SNITCH, a computational framework to detect complex nonlinear methylation trajectories and disentangle shared from sex-divergent patterns. Applied to the array-derived whole-blood methylomes from 252 females and 246 males (ages 19-90 years), SNITCH reveals convergent and divergent epigenetic aging pathways independent of immune cell composition. Nonlinear trajectories are enriched for developmental transcription factor motifs, including NF1/CTF and REST, with known oncogenic roles. Importantly, a female-specific nonlinear cluster is prospectively associated with cancer onset and systemic inflammation in an independent cohort, nominating clinically relevant biomarkers. We replicate the analysis in an additional cohort and highlight consistent nonlinear trajectories.</p><p><strong>Conclusions: </strong>Our results uncover sex-specific, nonlinear aging programs that capture the dynamics of epigenetic change beyond linear models. These findings provide potential candidate biomarkers for early disease risk and advance understanding of how aging trajectories diverge between sexes.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":"27 1","pages":"2"},"PeriodicalIF":12.3,"publicationDate":"2026-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12870970/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146114737","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-04DOI: 10.1186/s13059-026-03965-8
Zhenghao Zhang, Jiamin Chen, Haoran Wu, Kelly Yichen Li, Peter D Adams, Pamela Itkin-Ansari, Kevin Y Yip
Dimensionality reduction is routinely applied to single-cell transcriptomic data to improve interpretability, remove noise and redundancy, and enable visualization. Most existing methods aim at preserving the most prominent data properties, which can lead to omission of rare but important signals. Here we propose a novel framework, SAKURA, that uses knowledge-derived genes of interest to guide dimensionality reduction, which can help cluster rare cells and separate highly similar cell subpopulations. We demonstrate the utility of our framework in identifying endocrine cell subtypes in the pancreatic islet, highly similar hematopoietic subpopulations, and rare senescent cells.
{"title":"SAKURA: a knowledge-guided approach to recovering important, rare signals from single-cell data.","authors":"Zhenghao Zhang, Jiamin Chen, Haoran Wu, Kelly Yichen Li, Peter D Adams, Pamela Itkin-Ansari, Kevin Y Yip","doi":"10.1186/s13059-026-03965-8","DOIUrl":"10.1186/s13059-026-03965-8","url":null,"abstract":"<p><p>Dimensionality reduction is routinely applied to single-cell transcriptomic data to improve interpretability, remove noise and redundancy, and enable visualization. Most existing methods aim at preserving the most prominent data properties, which can lead to omission of rare but important signals. Here we propose a novel framework, SAKURA, that uses knowledge-derived genes of interest to guide dimensionality reduction, which can help cluster rare cells and separate highly similar cell subpopulations. We demonstrate the utility of our framework in identifying endocrine cell subtypes in the pancreatic islet, highly similar hematopoietic subpopulations, and rare senescent cells.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":" ","pages":""},"PeriodicalIF":12.3,"publicationDate":"2026-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12964657/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146120262","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-04DOI: 10.1186/s13059-026-03972-9
Yaojia Chen, Yumeng Zhang, Mengting Niu, Jiacheng Wang, Zhonghao Ren, Quan Zou, Jiangning Song, Ximei Luo
Drug combinations can improve cancer therapy by boosting efficacy, limiting dose-related toxicity, and delaying resistance. We present UniSyn, an interpretable multi-modal deep learning framework that transfers knowledge from monotherapy responses to enhance drug-synergy prediction. Through hybrid attention-based integration of drug and cell-line features, UniSyn supports multi-task learning and yields mechanistic insights. It generalizes robustly to unseen drug pairs and cell types, maintaining consistent performance across multiple synergy scoring metrics. Applied at scale to tumor cell lines, UniSyn captures context-specific synergy signals and prioritizes therapeutic combinations with translational potential.
{"title":"UniSyn: a multi-modal framework with knowledge transfer for anti-cancer drug synergy prediction.","authors":"Yaojia Chen, Yumeng Zhang, Mengting Niu, Jiacheng Wang, Zhonghao Ren, Quan Zou, Jiangning Song, Ximei Luo","doi":"10.1186/s13059-026-03972-9","DOIUrl":"10.1186/s13059-026-03972-9","url":null,"abstract":"<p><p>Drug combinations can improve cancer therapy by boosting efficacy, limiting dose-related toxicity, and delaying resistance. We present UniSyn, an interpretable multi-modal deep learning framework that transfers knowledge from monotherapy responses to enhance drug-synergy prediction. Through hybrid attention-based integration of drug and cell-line features, UniSyn supports multi-task learning and yields mechanistic insights. It generalizes robustly to unseen drug pairs and cell types, maintaining consistent performance across multiple synergy scoring metrics. Applied at scale to tumor cell lines, UniSyn captures context-specific synergy signals and prioritizes therapeutic combinations with translational potential.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":" ","pages":""},"PeriodicalIF":12.3,"publicationDate":"2026-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12964627/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146120519","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}