Pub Date : 2026-02-11Epub Date: 2025-12-01DOI: 10.1016/j.xgen.2025.101074
Mohab Helmy, Jin U Li, Xinyu F Yan, Rachel K Meade, Elizabeth Anderson, Patrick B Chen, Anne M Czechanski, Tomás Di Domenico, Jonathan Flint, Erik Garrison, Marco T P Gontijo, Andrea Guarracino, Leanne Haggerty, Edith Heard, Kerstin Howe, Narendra Meena, Fergal J Martin, Eric A Miska, Isabell Rall, Navin B Ramakrishna, Alexandra Sapetschnig, Swati Sinha, Diandian Sun, Francesca F Tricomi, Runjia Qu, Jonathan M D Wood, Tianzhen Wu, Dian J Zhou, Laura Reinholdt, David J Adams, Clare M Smith, Jingtao Lilue, Thomas M Keane
We present a collection of 17 high-quality long-read inbred mouse strain genomes with complete annotation (contig N50s of 0.8-33.9 Mbp). This collection includes 12 widely used classical laboratory strains and 5 wild-derived strains. We have resolved previously incomplete genomic regions, including the major histocompatibility complex (MHC), defensin cluster, T cell receptor, and Ly49 complexes. Hundreds of non-reference genes from previous publications not found in GRCm39, such as Defa1, Raet1a, and Klra20 (Ly49T), were localized in the new reference genomes. We conducted a genome-wide scan of variable number tandem repeats (VNTRs) within the coding regions, identifying over 400 genes with VNTR polymorphisms with up to 600 repeat copies and repeat units reaching 990 nucleotides. Our strain-specific annotations enhance RNA sequencing (RNA-seq) analyses, as demonstrated in PWK/PhJ, where we observed a 5.1% improvement in read mapping and expression-level differences in 2.1% of coding genes compared to using GRCm39.
{"title":"High-quality mouse reference genomes reveal the structural complexity of the murine protein-coding landscape.","authors":"Mohab Helmy, Jin U Li, Xinyu F Yan, Rachel K Meade, Elizabeth Anderson, Patrick B Chen, Anne M Czechanski, Tomás Di Domenico, Jonathan Flint, Erik Garrison, Marco T P Gontijo, Andrea Guarracino, Leanne Haggerty, Edith Heard, Kerstin Howe, Narendra Meena, Fergal J Martin, Eric A Miska, Isabell Rall, Navin B Ramakrishna, Alexandra Sapetschnig, Swati Sinha, Diandian Sun, Francesca F Tricomi, Runjia Qu, Jonathan M D Wood, Tianzhen Wu, Dian J Zhou, Laura Reinholdt, David J Adams, Clare M Smith, Jingtao Lilue, Thomas M Keane","doi":"10.1016/j.xgen.2025.101074","DOIUrl":"10.1016/j.xgen.2025.101074","url":null,"abstract":"<p><p>We present a collection of 17 high-quality long-read inbred mouse strain genomes with complete annotation (contig N50s of 0.8-33.9 Mbp). This collection includes 12 widely used classical laboratory strains and 5 wild-derived strains. We have resolved previously incomplete genomic regions, including the major histocompatibility complex (MHC), defensin cluster, T cell receptor, and Ly49 complexes. Hundreds of non-reference genes from previous publications not found in GRCm39, such as Defa1, Raet1a, and Klra20 (Ly49T), were localized in the new reference genomes. We conducted a genome-wide scan of variable number tandem repeats (VNTRs) within the coding regions, identifying over 400 genes with VNTR polymorphisms with up to 600 repeat copies and repeat units reaching 990 nucleotides. Our strain-specific annotations enhance RNA sequencing (RNA-seq) analyses, as demonstrated in PWK/PhJ, where we observed a 5.1% improvement in read mapping and expression-level differences in 2.1% of coding genes compared to using GRCm39.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101074"},"PeriodicalIF":11.1,"publicationDate":"2026-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145662898","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-11Epub Date: 2025-12-10DOI: 10.1016/j.xgen.2025.101072
Jiazheng Zhu, Georgios Kalantzis, Ali Pazokitoroudi, Árni Freyr Gunnarsson, Hrushikesh Loya, Han Chen, Sriram Sankararaman, Pier Francesco Palamara
Recent algorithmic advances have enabled the inference of genome-wide ancestral recombination graphs (ARGs) from large genomic cohorts, providing detailed models of genealogical relatedness along the genome. These inferred ARGs can complement genotype imputation by capturing the effects of unobserved variants, but their use in large-scale linear mixed-model analyses has been computationally prohibitive. Here, we develop methods that leverage the ARG to perform genotype-matrix multiplications in sublinear time and implement scalable randomized algorithms for mixed-model analyses. We introduce ARG-RHE, a randomized Haseman-Elston approach for estimating narrow-sense heritability and performing region-based association testing using ARGs, enabling parallel analysis of multiple quantitative traits. Through extensive simulations, we demonstrate the computational efficiency and statistical power of this approach. Applied to 21,159 genes and 52 blood traits in 337,464 UK Biobank participants, ARG-RHE identifies 8% more gene-trait associations than imputation alone, demonstrating that genome-wide genealogies may be leveraged to complement genotype imputation in complex trait analyses.
{"title":"Leveraging ancestral recombination graphs for scalable mixed-model analysis of complex traits.","authors":"Jiazheng Zhu, Georgios Kalantzis, Ali Pazokitoroudi, Árni Freyr Gunnarsson, Hrushikesh Loya, Han Chen, Sriram Sankararaman, Pier Francesco Palamara","doi":"10.1016/j.xgen.2025.101072","DOIUrl":"10.1016/j.xgen.2025.101072","url":null,"abstract":"<p><p>Recent algorithmic advances have enabled the inference of genome-wide ancestral recombination graphs (ARGs) from large genomic cohorts, providing detailed models of genealogical relatedness along the genome. These inferred ARGs can complement genotype imputation by capturing the effects of unobserved variants, but their use in large-scale linear mixed-model analyses has been computationally prohibitive. Here, we develop methods that leverage the ARG to perform genotype-matrix multiplications in sublinear time and implement scalable randomized algorithms for mixed-model analyses. We introduce ARG-RHE, a randomized Haseman-Elston approach for estimating narrow-sense heritability and performing region-based association testing using ARGs, enabling parallel analysis of multiple quantitative traits. Through extensive simulations, we demonstrate the computational efficiency and statistical power of this approach. Applied to 21,159 genes and 52 blood traits in 337,464 UK Biobank participants, ARG-RHE identifies 8% more gene-trait associations than imputation alone, demonstrating that genome-wide genealogies may be leveraged to complement genotype imputation in complex trait analyses.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101072"},"PeriodicalIF":11.1,"publicationDate":"2026-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145745872","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-11Epub Date: 2025-12-01DOI: 10.1016/j.xgen.2025.101071
Mingrui Yu, Qian Zhang, Kai Yuan, Aleksejs Sazonovs, Christine R Stevens, Laura Fachal, Christopher A Lamb, Carl A Anderson, Mark J Daly, Hailiang Huang
Genetic mutations that yield a defective cystic fibrosis (CF) transmembrane regulator (CFTR) protein cause CF, a life-limiting autosomal-recessive Mendelian disorder. A protective role of CFTR loss-of-function mutations in inflammatory bowel disease (IBD) has been suggested, but its evidence has been inconclusive and contradictory. Here, leveraging a large IBD exome sequencing dataset comprising 38,558 cases and 66,945 controls of European ancestry in the discovery stage and a combined total of 42,475 cases and 192,050 controls across diverse ancestry groups in the replication stage, we established a protective role of CF-risk variants against IBD based on the association test of CFTR deltaF508 (p = 8.96E-11) and the gene-based burden test of CF-risk variants (p = 3.9E-07). Furthermore, we assessed variant prioritization methods, including AlphaMissense, using clinically annotated CF-risk variants as the gold standard. Our findings highlight the critical and unmet need for effective variant prioritization in gene-based burden tests.
{"title":"Cystic fibrosis risk variants confer protection against inflammatory bowel disease.","authors":"Mingrui Yu, Qian Zhang, Kai Yuan, Aleksejs Sazonovs, Christine R Stevens, Laura Fachal, Christopher A Lamb, Carl A Anderson, Mark J Daly, Hailiang Huang","doi":"10.1016/j.xgen.2025.101071","DOIUrl":"10.1016/j.xgen.2025.101071","url":null,"abstract":"<p><p>Genetic mutations that yield a defective cystic fibrosis (CF) transmembrane regulator (CFTR) protein cause CF, a life-limiting autosomal-recessive Mendelian disorder. A protective role of CFTR loss-of-function mutations in inflammatory bowel disease (IBD) has been suggested, but its evidence has been inconclusive and contradictory. Here, leveraging a large IBD exome sequencing dataset comprising 38,558 cases and 66,945 controls of European ancestry in the discovery stage and a combined total of 42,475 cases and 192,050 controls across diverse ancestry groups in the replication stage, we established a protective role of CF-risk variants against IBD based on the association test of CFTR deltaF508 (p = 8.96E-11) and the gene-based burden test of CF-risk variants (p = 3.9E-07). Furthermore, we assessed variant prioritization methods, including AlphaMissense, using clinically annotated CF-risk variants as the gold standard. Our findings highlight the critical and unmet need for effective variant prioritization in gene-based burden tests.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101071"},"PeriodicalIF":11.1,"publicationDate":"2026-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145662956","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-11Epub Date: 2026-01-22DOI: 10.1016/j.xgen.2025.101133
Alla Karpova, Xiang Li, Chien-Wei Peng, Kelsey L Gallant, Daniel R Rapp, Daniel M Alligood, Andrew J Houston, Alex Park, Andre Luiz N Targino da Costa, Wen-Hung Chou, Michael D Iglesia, John M Herndon, Kathleen Byrnes, Nataly Naser Al Deen, Preet Lal, Xiangwei Fang, Reyka G Jayasinghe, Jeffrey A Blatnik, William G Hawkins, Dominic E Sanford, J Chris Eagon, Natasha Leigh, Maria Bernadette M Doyle, L Michael Brunt, William C Chapman, Roheena Z Panni, Darren R Cullinan, Sherri R Davies, Jun Guo, Matthew A Wyczalkowski, Michael C Wendl, Hong Zhang, Colin A Martin, Brad W Warner, Milan G Chheda, Sheila A Stewart, Feng Chen, Ryan C Fields, Li Ding
Cellular senescence, a stress-induced program causing stable cell-cycle arrest, is a hallmark of liver aging, fibrosis, and cancer. However, the cell-type-specific mechanisms, spatial organization, and cancer-associated alterations in the liver remain unclear. We profiled 43 normal human livers spanning ages and fibrosis stages using a single-cell multiome, Xenium spatial transcriptomics, and CODEX, complemented by fibrotic mouse models and 24 colorectal cancer liver metastases. We found CDKN1A+ senescent hepatocytes, fibroblasts, cholangiocytes, and endothelial cells associated with age, liver disease, or cancer. Senescence differed between aged and fibrotic livers, with similar patterns in mice. Spatially, CDKN1A+ hepatocytes localized periportally, while SERPINE1+ aging-associated hepatocytes formed spatial clusters, potentially mediated by Claudins and THBS1. Fibrotic regions contained CXCL12+ senescent fibroblasts interacting with CXCR4+ immune cells. Chemotherapy intensified senescence in hepatocytes by 5-fold relative to aging and led to unique CDKN2A+ populations. Across conditions, senescent cells shared AP-1 activation, pro-inflammatory cytokines, and apoptosis resistance, suggesting therapeutic opportunities.
{"title":"Cellular senescence in human liver under normal aging and cancer.","authors":"Alla Karpova, Xiang Li, Chien-Wei Peng, Kelsey L Gallant, Daniel R Rapp, Daniel M Alligood, Andrew J Houston, Alex Park, Andre Luiz N Targino da Costa, Wen-Hung Chou, Michael D Iglesia, John M Herndon, Kathleen Byrnes, Nataly Naser Al Deen, Preet Lal, Xiangwei Fang, Reyka G Jayasinghe, Jeffrey A Blatnik, William G Hawkins, Dominic E Sanford, J Chris Eagon, Natasha Leigh, Maria Bernadette M Doyle, L Michael Brunt, William C Chapman, Roheena Z Panni, Darren R Cullinan, Sherri R Davies, Jun Guo, Matthew A Wyczalkowski, Michael C Wendl, Hong Zhang, Colin A Martin, Brad W Warner, Milan G Chheda, Sheila A Stewart, Feng Chen, Ryan C Fields, Li Ding","doi":"10.1016/j.xgen.2025.101133","DOIUrl":"10.1016/j.xgen.2025.101133","url":null,"abstract":"<p><p>Cellular senescence, a stress-induced program causing stable cell-cycle arrest, is a hallmark of liver aging, fibrosis, and cancer. However, the cell-type-specific mechanisms, spatial organization, and cancer-associated alterations in the liver remain unclear. We profiled 43 normal human livers spanning ages and fibrosis stages using a single-cell multiome, Xenium spatial transcriptomics, and CODEX, complemented by fibrotic mouse models and 24 colorectal cancer liver metastases. We found CDKN1A+ senescent hepatocytes, fibroblasts, cholangiocytes, and endothelial cells associated with age, liver disease, or cancer. Senescence differed between aged and fibrotic livers, with similar patterns in mice. Spatially, CDKN1A+ hepatocytes localized periportally, while SERPINE1+ aging-associated hepatocytes formed spatial clusters, potentially mediated by Claudins and THBS1. Fibrotic regions contained CXCL12+ senescent fibroblasts interacting with CXCR4+ immune cells. Chemotherapy intensified senescence in hepatocytes by 5-fold relative to aging and led to unique CDKN2A+ populations. Across conditions, senescent cells shared AP-1 activation, pro-inflammatory cytokines, and apoptosis resistance, suggesting therapeutic opportunities.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101133"},"PeriodicalIF":11.1,"publicationDate":"2026-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146042198","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-10DOI: 10.1016/j.xgen.2026.101164
Bettina Welz, Saul Pierotti, Tomas Fitzgerald, Thomas Thumberger, Risa Suzuki, Philip Watson, Jana Fuss, Tiago Cordeiro da Trindade, Fanny Defranoux, Marcio Ferreira, Kiyoshi Naruse, Felix Loosli, Jakob Gierten, Joachim Wittbrodt, Ewan Birney
Phenotypic variation arises from interactions between genetic and environmental factors, but disentangling these effects for complex traits remains challenging in observational cohorts like human biobanks. Model organisms with controlled genetic and environmental variation complement human studies in analyzing higher-order effects such as gene-by-environment (G×E) interactions, dominance, and epistasis. We utilized 76 medaka strains from the Medaka Inbred Kiyosu-Karlsruhe (MIKK) panel to compare heart rate plasticity across temperatures. An F2 segregation analysis identified 16 quantitative trait loci (QTLs), many exhibiting dominance, G×E, G×G, and G×G×E interactions. We experimentally validated four candidate genes, revealing temperature-sensitive heart rate effects. Finally, we simulated how genome-wide association study (GWAS) discovery power depends on statistical model choice. Our results suggest that the limited detection of non-additive effects in human GWASs stems from current study designs and sample sizes. This work demonstrates the value of controlled model organism studies for dissecting complex trait genetics and informing association study design.
{"title":"Discovery and characterization of gene-by-environment and epistatic genetic effects in a vertebrate model.","authors":"Bettina Welz, Saul Pierotti, Tomas Fitzgerald, Thomas Thumberger, Risa Suzuki, Philip Watson, Jana Fuss, Tiago Cordeiro da Trindade, Fanny Defranoux, Marcio Ferreira, Kiyoshi Naruse, Felix Loosli, Jakob Gierten, Joachim Wittbrodt, Ewan Birney","doi":"10.1016/j.xgen.2026.101164","DOIUrl":"https://doi.org/10.1016/j.xgen.2026.101164","url":null,"abstract":"<p><p>Phenotypic variation arises from interactions between genetic and environmental factors, but disentangling these effects for complex traits remains challenging in observational cohorts like human biobanks. Model organisms with controlled genetic and environmental variation complement human studies in analyzing higher-order effects such as gene-by-environment (G×E) interactions, dominance, and epistasis. We utilized 76 medaka strains from the Medaka Inbred Kiyosu-Karlsruhe (MIKK) panel to compare heart rate plasticity across temperatures. An F2 segregation analysis identified 16 quantitative trait loci (QTLs), many exhibiting dominance, G×E, G×G, and G×G×E interactions. We experimentally validated four candidate genes, revealing temperature-sensitive heart rate effects. Finally, we simulated how genome-wide association study (GWAS) discovery power depends on statistical model choice. Our results suggest that the limited detection of non-additive effects in human GWASs stems from current study designs and sample sizes. This work demonstrates the value of controlled model organism studies for dissecting complex trait genetics and informing association study design.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101164"},"PeriodicalIF":11.1,"publicationDate":"2026-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146167734","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-06DOI: 10.1016/j.xgen.2026.101142
Peiyong Jiang, Mary-Jane L Ma, Rong Qiao, Yuwei Shi, Jing Liu, Qing Zhou, Wenlei Peng, W K Jacky Lam, Jinyue Bai, L Y Lois Choy, W H Adrian Tsui, Yasine Malki, Guannan Kang, Stephanie C Y Yu, Dongyan Xiong, Grace L H Wong, Landon L Chan, John Wong, Stephen L Chan, Vincent W S Wong, K C Allen Chan, Y M Dennis Lo
Cell-free DNA (cfDNA) end motifs serve as fragmentomics biomarkers for cancer. Prior studies primarily focused on 5' ends, whereas 3' ends were overlooked due to artifactual modification in existing sequencing protocols. We utilized single-stranded library preparation ("2-end sequencing") to assess the native 5' and 3' end motifs (EM5 and EM3, respectively). Additionally, we demonstrated diagnostic power from the nucleotide motifs located immediately upstream and downstream of 5' and 3' ends, named pre-end motifs (PREMs) and post-end motifs (POEMs). These fragmentomics markers collectively achieved an area under the curve (AUC) of 0.95 for hepatocellular carcinoma (HCC) detection. Fragmentomics-based methylation analysis of 3' ends (3' FRAGMA) improved detection of HCC (AUC: 0.97). We further developed "4-end sequencing" to interrogate both ends of both strands of a double-stranded cfDNA molecule, enhancing fragmentomics-based cancer detection. Holistic end profiling adds to the armamentarium of liquid biopsy and sheds light on the biology of cfDNA fragmentation.
{"title":"Holistic determination of ends of cfDNA molecules.","authors":"Peiyong Jiang, Mary-Jane L Ma, Rong Qiao, Yuwei Shi, Jing Liu, Qing Zhou, Wenlei Peng, W K Jacky Lam, Jinyue Bai, L Y Lois Choy, W H Adrian Tsui, Yasine Malki, Guannan Kang, Stephanie C Y Yu, Dongyan Xiong, Grace L H Wong, Landon L Chan, John Wong, Stephen L Chan, Vincent W S Wong, K C Allen Chan, Y M Dennis Lo","doi":"10.1016/j.xgen.2026.101142","DOIUrl":"https://doi.org/10.1016/j.xgen.2026.101142","url":null,"abstract":"<p><p>Cell-free DNA (cfDNA) end motifs serve as fragmentomics biomarkers for cancer. Prior studies primarily focused on 5' ends, whereas 3' ends were overlooked due to artifactual modification in existing sequencing protocols. We utilized single-stranded library preparation (\"2-end sequencing\") to assess the native 5' and 3' end motifs (EM5 and EM3, respectively). Additionally, we demonstrated diagnostic power from the nucleotide motifs located immediately upstream and downstream of 5' and 3' ends, named pre-end motifs (PREMs) and post-end motifs (POEMs). These fragmentomics markers collectively achieved an area under the curve (AUC) of 0.95 for hepatocellular carcinoma (HCC) detection. Fragmentomics-based methylation analysis of 3' ends (3' FRAGMA) improved detection of HCC (AUC: 0.97). We further developed \"4-end sequencing\" to interrogate both ends of both strands of a double-stranded cfDNA molecule, enhancing fragmentomics-based cancer detection. Holistic end profiling adds to the armamentarium of liquid biopsy and sheds light on the biology of cfDNA fragmentation.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101142"},"PeriodicalIF":11.1,"publicationDate":"2026-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146138138","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-06DOI: 10.1016/j.xgen.2026.101161
Wioletta I Nawrocka, Shouqiang Cheng, Bingjie Hao, Matthew C Rosen, Elena Cortés, Elana E Baltrusaitis, Zainab Aziz, Leo T H Tang, István A Kovács, Engin Özkan
The nematode Caenorhabditis elegans is a favorable model for studying cell-surface protein interactomes, given its well-defined and stereotyped intercellular contacts. Here, we report an extracellular interactome dataset for C. elegans. Most of these interactions were unknown, despite recent datasets for flies and humans, as our collection contains a larger selection of protein families. We uncover interactions for all four major axon guidance pathways, including ectodomain interactions between three of them. We demonstrate that a protein family, previously known for maintaining axon positioning, functions as secreted binders for insulins and that their overexpression in vivo extends lifespan, consistent with inhibition of insulin signaling. We reveal interactions of cystine-knot proteins with putative signaling receptors, which may extend the study of neurotrophins and growth factors to nematodes. Finally, our dataset constitutes a resource for uncovering the logic of neuronal connectivity, intercellular communication and adhesion, and signaling pathways involved in aging and disease.
{"title":"Nematode extracellular protein interactome expands connections between signaling pathways.","authors":"Wioletta I Nawrocka, Shouqiang Cheng, Bingjie Hao, Matthew C Rosen, Elena Cortés, Elana E Baltrusaitis, Zainab Aziz, Leo T H Tang, István A Kovács, Engin Özkan","doi":"10.1016/j.xgen.2026.101161","DOIUrl":"10.1016/j.xgen.2026.101161","url":null,"abstract":"<p><p>The nematode Caenorhabditis elegans is a favorable model for studying cell-surface protein interactomes, given its well-defined and stereotyped intercellular contacts. Here, we report an extracellular interactome dataset for C. elegans. Most of these interactions were unknown, despite recent datasets for flies and humans, as our collection contains a larger selection of protein families. We uncover interactions for all four major axon guidance pathways, including ectodomain interactions between three of them. We demonstrate that a protein family, previously known for maintaining axon positioning, functions as secreted binders for insulins and that their overexpression in vivo extends lifespan, consistent with inhibition of insulin signaling. We reveal interactions of cystine-knot proteins with putative signaling receptors, which may extend the study of neurotrophins and growth factors to nematodes. Finally, our dataset constitutes a resource for uncovering the logic of neuronal connectivity, intercellular communication and adhesion, and signaling pathways involved in aging and disease.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101161"},"PeriodicalIF":11.1,"publicationDate":"2026-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146138132","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-05DOI: 10.1016/j.xgen.2026.101140
Aaron D Goldman, Gregory P Fournier, Betül Kaçar
The study of early evolutionary history provides an account of how the foundational features of life as we know it first emerged. Phylogenetic analysis is a powerful method in the study of early evolution because it uses molecular evidence that has been inherited from the ancient organisms themselves. Here, we describe an important yet understudied type of protein family, universal paralogs, that retain phylogenetic signals from evolutionary events predating the last universal common ancestor of life, offering a unique window into early evolution. We survey recent advances in the study of universal paralogs and discuss how emerging computational tools enhance our ability to use these protein families to describe the very earliest stages of evolution with increasing detail and accuracy. Such research will greatly improve our understanding of how life emerged and subsequently evolved on the ancient Earth.
{"title":"Universal paralogs provide a window into evolution before the last universal common ancestor.","authors":"Aaron D Goldman, Gregory P Fournier, Betül Kaçar","doi":"10.1016/j.xgen.2026.101140","DOIUrl":"https://doi.org/10.1016/j.xgen.2026.101140","url":null,"abstract":"<p><p>The study of early evolutionary history provides an account of how the foundational features of life as we know it first emerged. Phylogenetic analysis is a powerful method in the study of early evolution because it uses molecular evidence that has been inherited from the ancient organisms themselves. Here, we describe an important yet understudied type of protein family, universal paralogs, that retain phylogenetic signals from evolutionary events predating the last universal common ancestor of life, offering a unique window into early evolution. We survey recent advances in the study of universal paralogs and discuss how emerging computational tools enhance our ability to use these protein families to describe the very earliest stages of evolution with increasing detail and accuracy. Such research will greatly improve our understanding of how life emerged and subsequently evolved on the ancient Earth.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101140"},"PeriodicalIF":11.1,"publicationDate":"2026-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146133523","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-05DOI: 10.1016/j.xgen.2026.101141
Chenfeng Mo, Xiufen Zou, Suoqin Jin
Recent breakthroughs in spatial multi-omics enable simultaneous profiling of different modalities while preserving tissue architecture, providing unprecedented opportunities to explore tissue complexity. However, due to the sparse and noisy nature of the data, interpreting these complex tissue structures and cellular communication remains challenging. We present MultiSP, a deep learning framework that enhances data representation through efficient spatial and feature similarity fusion, modality-specific probabilistic generative modeling, and cross-modality adversarial learning. Applied to various spatial multi-omics datasets, it outperforms existing methods in capturing biologically interpretable spatial domains. MultiSP also denoises spatial data, uncovers modality-specific spatial variations, and reveals gene regulation mechanisms. In the tumor microenvironment, it unravels fine-resolution cellular distribution maps, such as spatially neighboring macrophage-enriched sub-regions with distinct prognosis outcomes. Additionally, MultiSP facilitates the inference of spatially multimodal cell-cell communication. Together, MultiSP serves as a powerful framework for uncovering spatially multimodal heterogeneity and communication by integrating complementary information from multiple modalities.
{"title":"MultiSP deciphers tissue structure and multicellular communication from spatial multi-omics data.","authors":"Chenfeng Mo, Xiufen Zou, Suoqin Jin","doi":"10.1016/j.xgen.2026.101141","DOIUrl":"https://doi.org/10.1016/j.xgen.2026.101141","url":null,"abstract":"<p><p>Recent breakthroughs in spatial multi-omics enable simultaneous profiling of different modalities while preserving tissue architecture, providing unprecedented opportunities to explore tissue complexity. However, due to the sparse and noisy nature of the data, interpreting these complex tissue structures and cellular communication remains challenging. We present MultiSP, a deep learning framework that enhances data representation through efficient spatial and feature similarity fusion, modality-specific probabilistic generative modeling, and cross-modality adversarial learning. Applied to various spatial multi-omics datasets, it outperforms existing methods in capturing biologically interpretable spatial domains. MultiSP also denoises spatial data, uncovers modality-specific spatial variations, and reveals gene regulation mechanisms. In the tumor microenvironment, it unravels fine-resolution cellular distribution maps, such as spatially neighboring macrophage-enriched sub-regions with distinct prognosis outcomes. Additionally, MultiSP facilitates the inference of spatially multimodal cell-cell communication. Together, MultiSP serves as a powerful framework for uncovering spatially multimodal heterogeneity and communication by integrating complementary information from multiple modalities.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101141"},"PeriodicalIF":11.1,"publicationDate":"2026-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146133586","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-28DOI: 10.1016/j.xgen.2026.101139
John Daly, Lidia Piatnitca, Mohammed Al-Seragi, Vignesh Krishnamoorthy, Simon Wisnovsky
Many cancers upregulate the expression of sialic acid-containing glycans. These oligosaccharides engage inhibitory sialic acid-binding immunoglobulin-like lectin (Siglec) receptors on immune cells, allowing cancer cells to evade immune surveillance. The genetic mechanisms underlying this process remain poorly defined. In this study, we performed gain-of-function CRISPR activation (CRISPRa) screens to define genetic pathways that regulate expression of Siglec-binding glycans. We show that Siglec ligand expression is controlled through genetic competition between genes that catalyze α2-3 sialylation and GlcNAcylation of galactose residues. Cancer glycome remodeling is also aided by the overexpression of "professional ligands" that facilitate Siglec-glycan binding. Notably, we also find that expression of the CD24 gene is genetically dispensable for cell surface binding of the inhibitory receptor Siglec-10. Finally, we identify the sulfotransferase enzyme GAL3ST4 as a potential driver of immune evasion in glioma cells. Our study provides a unique genomic atlas of cancer-associated glycosylation and identifies immediately actionable targets for cancer immunotherapy.
{"title":"CRISPR activation screens map the genomic landscape of cancer glycome remodeling.","authors":"John Daly, Lidia Piatnitca, Mohammed Al-Seragi, Vignesh Krishnamoorthy, Simon Wisnovsky","doi":"10.1016/j.xgen.2026.101139","DOIUrl":"https://doi.org/10.1016/j.xgen.2026.101139","url":null,"abstract":"<p><p>Many cancers upregulate the expression of sialic acid-containing glycans. These oligosaccharides engage inhibitory sialic acid-binding immunoglobulin-like lectin (Siglec) receptors on immune cells, allowing cancer cells to evade immune surveillance. The genetic mechanisms underlying this process remain poorly defined. In this study, we performed gain-of-function CRISPR activation (CRISPRa) screens to define genetic pathways that regulate expression of Siglec-binding glycans. We show that Siglec ligand expression is controlled through genetic competition between genes that catalyze α2-3 sialylation and GlcNAcylation of galactose residues. Cancer glycome remodeling is also aided by the overexpression of \"professional ligands\" that facilitate Siglec-glycan binding. Notably, we also find that expression of the CD24 gene is genetically dispensable for cell surface binding of the inhibitory receptor Siglec-10. Finally, we identify the sulfotransferase enzyme GAL3ST4 as a potential driver of immune evasion in glioma cells. Our study provides a unique genomic atlas of cancer-associated glycosylation and identifies immediately actionable targets for cancer immunotherapy.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101139"},"PeriodicalIF":11.1,"publicationDate":"2026-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146088268","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}