Genome Biology最新文献

英文中文

SCITO-seq2: ultra-high-throughput single-cell transcriptome and epitope sequencing. SCITO-seq2：超高通量单细胞转录组和表位测序。

IF 12.3 1区生物学 Q1 Agricultural and Biological Sciences

Genome Biology

Pub Date : 2026-01-27 DOI: 10.1186/s13059-026-03954-x

Su-Hyeon Lee, Bo-Yeong Jin, Cho-Rong Lee, Doo Ri Kim, Areum Shin, Sung-Gyoo Park, Yae-Jean Kim, Seong Heon Kim, Murim Choi, Byungjin Hwang

We introduce SCITO-seq2, an enhanced successor to SCITO-seq that integrates probe-based RNA detection with the established ultra-high-throughput protein profiling. SCITO-seq2 achieves robust quantification of transcripts and surface proteins across more than 100,000 cells, with a shared pool barcoding strategy ensuring precise matching of molecular profiles within multiplexed droplets. SCITO-seq2 is compatible with cell hashing technology, allowing efficient sample multiplexing. We demonstrate its utility in autoimmune diseases, including childhood systemic lupus erythematosus and CTLA4 haploinsufficiency with autoimmune infiltration, enabling the detection of minor immune clusters and disease-specific protein signatures. This platform establishes a scalable, streamlined, and cost-effective next-generation single-cell multi-omics workflow.

我们推出SCITO-seq2，这是SCITO-seq的增强型后继产品，将基于探针的RNA检测与已建立的超高通量蛋白质分析相结合。SCITO-seq2在超过100,000个细胞中实现转录本和表面蛋白的强大定量，具有共享池条形码策略，确保多路液滴内分子谱的精确匹配。SCITO-seq2兼容cell哈希技术，允许高效的样本多路复用。我们证明了它在自身免疫性疾病中的应用，包括儿童系统性红斑狼疮和自身免疫性浸润的CTLA4单倍功能不全，能够检测轻微的免疫簇和疾病特异性蛋白质特征。该平台建立了一个可扩展的、精简的、经济高效的下一代单细胞多组学工作流程。

引用次数: 0

Expression spectrum of TE-driven transcripts in human adult tissues. te驱动转录本在成人组织中的表达谱。

IF 12.3 1区生物学 Q1 Agricultural and Biological Sciences

Genome Biology

Pub Date : 2026-01-27 DOI: 10.1186/s13059-025-03898-8

Benpeng Miao, Xinlong Luo, Amina Ademovic, Yushan Yang, Tao P Wu, Bo A Zhang

引用次数: 0

Campolina: a deep neural framework for accurate segmentation of nanopore signals. Campolina：用于精确分割纳米孔信号的深度神经框架。

IF 12.3 1区生物学 Q1 Agricultural and Biological Sciences

Genome Biology

Pub Date : 2026-01-27 DOI: 10.1186/s13059-026-03950-1

Sara Bakić, Krešimir Friganović, Bryan Hooi, Mile Šikić

Nanopore sequencing enables real-time, long-read analysis by processing raw signals as they are produced. A key step, segmentation of signals into events, is typically handled algorithmically, struggling in noisy regions. We present Campolina, a first deep-learning framework for accurate segmentation of raw nanopore signals. Campolina uses a convolutional model to identify event boundaries and significantly outperforms the traditional Scrappie algorithm on R9.4.1 and R10.4.1 datasets. We introduce a comprehensive evaluation pipeline and show that Campolina aligns better with reference-guided ground-truth segmentation. We show that integrating Campolina segmentation into real-time frameworks, Sigmoni and RawHash2, improves their performance while maintaining time efficiency.

纳米孔测序通过处理产生的原始信号，实现实时、长读分析。关键的一步，将信号分割成事件，通常是通过算法处理的，在噪声区域中挣扎。我们提出Campolina，这是第一个用于准确分割原始纳米孔信号的深度学习框架。Campolina使用卷积模型来识别事件边界，在R9.4.1和R10.4.1数据集上显著优于传统的Scrappie算法。我们介绍了一个全面的评估管道，并表明Campolina与参考引导的地面真值分割更一致。我们证明了将Campolina分割集成到实时框架Sigmoni和RawHash2中，在保持时间效率的同时提高了它们的性能。

引用次数: 0

Engineering hypercompact IscB nucleases for efficient and versatile genome editing in rice. 工程超紧凑IscB核酸酶用于水稻高效和通用的基因组编辑。

IF 12.3 1区生物学 Q1 Agricultural and Biological Sciences

Genome Biology

Pub Date : 2026-01-27 DOI: 10.1186/s13059-026-03953-y

Jiahui Zhu, Yuying Li, Cao Yu, Weixi Huang, Junming Chen, Xiaoshuang Liu, Ruiying Qin, Juan Li, Rongfang Xu, Pengcheng Wei

Background: IscB (Insertion sequences Cas9-like OrfB) represents a novel class of RNA-guided nucleases, approximately one-third the size of Cas9 proteins. Despite the limited natural efficiency in eukaryotic cells, recent advances have led to the engineering of several IscBs for mammalian genome editing.

Results: In this study, we screen and identify high-activity IscB variants for rice. A version of pIscB-v3, combining enOgeuIscB and ωRNA-v13, demonstrated superior mutagenesis efficiency compared to other systems. The average editing efficiency of pIscB-v3 is 17.61% from ten endogenous targets, and we obtain edited lines in up to 83.33% of T0 generation with 33.33% of homozygous and bi-allelic mutations. Further analysis reveals that pIscB-v3 exhibits high editing specificity and relaxed target-adjacent motif (TAM) compatibility in rice. Beyond gene knockout systems, we develop cytosine base editors (CBEs) and adenine base editors (ABEs) from pIscB-v3. We find that the ssDNA-targeting SCP1.201 family deaminase Sdd7 outperformed human APOBEC3A in IscB-CBEs for C-to-T conversions in rice. The Sdd7-nIscB achieves precise edits in 22.92% of lines on average, with a maximum frequency of 47.92%. Additionally, TadA8e-nIscB exhibits limited activity. However, fusing an extra copy of TadA-8e to either terminus of TadA8e-nIsc significantly enhances A-to-G conversions.

Conclusions: Collectively, our results demonstrate the robust capabilities of IscB to develop an efficient and versatile miniature plant genome editing toolkit to substantially facilitate crop breeding.

背景：IscB（插入序列Cas9样OrfB）是一类新的rna引导核酸酶，大约是Cas9蛋白大小的三分之一。尽管真核细胞的天然效率有限，但最近的进展已经导致了几种用于哺乳动物基因组编辑的iscb的工程。结果：在本研究中，我们筛选并鉴定了水稻的高活性IscB变异。与其他系统相比，结合了enOgeuIscB和ωRNA-v13的pIscB-v3版本显示出更高的诱变效率。pIscB-v3对10个内源靶点的平均编辑效率为17.61%，T0代的编辑率高达83.33%，纯合子和双等位基因突变率为33.33%。进一步分析表明，pIscB-v3在水稻中具有较高的编辑特异性和宽松的靶邻近基序（TAM）相容性。除了基因敲除系统，我们还从pIscB-v3中开发了胞嘧啶碱基编辑器（CBEs）和腺嘌呤碱基编辑器（ABEs）。我们发现，在水稻中，靶向ssdna的SCP1.201家族脱氨酶Sdd7在C-to-T转化方面优于人类APOBEC3A。Sdd7-nIscB平均在22.92%的行中实现精确编辑，最高频率为47.92%。此外，TadA8e-nIscB表现出有限的活性。然而，将一个额外的TadA-8e拷贝融合到TadA8e-nIsc的任何一端，都能显著增强A-to-G转换。总之，我们的研究结果证明了IscB开发高效、通用的微型植物基因组编辑工具包的强大能力，从而极大地促进了作物育种。

{"title":"Engineering hypercompact IscB nucleases for efficient and versatile genome editing in rice.","authors":"Jiahui Zhu, Yuying Li, Cao Yu, Weixi Huang, Junming Chen, Xiaoshuang Liu, Ruiying Qin, Juan Li, Rongfang Xu, Pengcheng Wei","doi":"10.1186/s13059-026-03953-y","DOIUrl":"10.1186/s13059-026-03953-y","url":null,"abstract":"Background: IscB (Insertion sequences Cas9-like OrfB) represents a novel class of RNA-guided nucleases, approximately one-third the size of Cas9 proteins. Despite the limited natural efficiency in eukaryotic cells, recent advances have led to the engineering of several IscBs for mammalian genome editing.Results: In this study, we screen and identify high-activity IscB variants for rice. A version of pIscB-v3, combining enOgeuIscB and ωRNA-v13, demonstrated superior mutagenesis efficiency compared to other systems. The average editing efficiency of pIscB-v3 is 17.61% from ten endogenous targets, and we obtain edited lines in up to 83.33% of T0 generation with 33.33% of homozygous and bi-allelic mutations. Further analysis reveals that pIscB-v3 exhibits high editing specificity and relaxed target-adjacent motif (TAM) compatibility in rice. Beyond gene knockout systems, we develop cytosine base editors (CBEs) and adenine base editors (ABEs) from pIscB-v3. We find that the ssDNA-targeting SCP1.201 family deaminase Sdd7 outperformed human APOBEC3A in IscB-CBEs for C-to-T conversions in rice. The Sdd7-nIscB achieves precise edits in 22.92% of lines on average, with a maximum frequency of 47.92%. Additionally, TadA8e-nIscB exhibits limited activity. However, fusing an extra copy of TadA-8e to either terminus of TadA8e-nIsc significantly enhances A-to-G conversions.Conclusions: Collectively, our results demonstrate the robust capabilities of IscB to develop an efficient and versatile miniature plant genome editing toolkit to substantially facilitate crop breeding.","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":" ","pages":"49"},"PeriodicalIF":12.3,"publicationDate":"2026-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146067879","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Circularization of 23S rRNA but not 16S rRNA within archaeal ribosomes. 古细菌核糖体中23S rRNA环状化而16S rRNA不环状化。

IF 12.3 1区生物学 Q1 Agricultural and Biological Sciences

Genome Biology

Pub Date : 2026-01-27 DOI: 10.1186/s13059-025-03903-0

Ling-Dong Shi, Petar I Penev, Amos J Nissley, Dipti D Nayak, Rohan Sachdeva, Jamie H D Cate, Jillian F Banfield

Background: Processing of archaeal 16S and 23S rRNAs is believed to involve excision of individual rRNAs from polycistronic precursors, circularization of excised rRNAs, and re-linearization before the incorporation into ribosomes. However, all the knowledge is derived from several isolated species, leaving open the possibility that different processes may occur in other archaeal groups.

Results: Here, we investigate rRNAs from diverse and mostly uncultivated archaea. Sequencing of total cellular RNA from eight phylum-level lineages indicates that archaeal circular 23S rRNA transcript abundances vastly exceed those of linear counterparts, and linear versions are often undetectable. As the majority of rRNAs derive from mature ribosomes, the data suggest that ribosomes contain circular 23S rRNAs. Thus, we directly sequence RNA extracted from isolated ribosomes of a model archaeon, Methanosarcina acetivorans, and confirm that the 23S rRNAs in the ribosomes are circular. Structural modeling places the 5' and 3' ends of the linear precursors of archaeal 23S rRNAs in close proximity to form a GNRA tetraloop (in which N is A, C, G, or U and R is A or G), consistent with their existence as circular molecules. We also confirm the existence of circular 16S rRNA intermediates in transcriptomes of most archaea, yet a circular form is not evident in some distinct archaeal groups, suggesting that certain archaea do not circularize 16S rRNA during processing.

Conclusions: Our findings uncover unexpected variations in the processing required to generate mature rRNAs and the conformation of functional molecules in archaeal ribosomes.

背景：古细菌16S和23S rnas的加工被认为涉及从多顺反电子前体中切除单个rnas，将切除的rnas循环化，并在纳入核糖体之前重新线性化。然而，所有的知识都来自几个孤立的物种，留下了不同的过程可能发生在其他古菌群的可能性。结果：在这里，我们研究了来自多种和大多数未培养的古细菌的rrna。来自8个门水平谱系的总细胞RNA测序表明，古细菌环状23S rRNA转录物丰度大大超过线性对应物，而线性版本通常无法检测到。由于大多数rnas来自成熟核糖体，数据表明核糖体含有环状23S rnas。因此，我们直接对从模型古菌Methanosarcina acetivorans分离的核糖体中提取的RNA进行测序，并证实核糖体中的23S rrna是环状的。结构建模将古细菌23S rrna的线性前体的5‘和3’端靠近，形成一个GNRA四环（其中N为a、C、G或U， R为a或G），与它们作为圆形分子的存在一致。我们还证实了在大多数古菌的转录组中存在环状16S rRNA中间体，但在某些特定的古菌群中不明显存在环状16S rRNA，这表明某些古菌在加工过程中不环状16S rRNA。结论：我们的发现揭示了古细菌核糖体中产生成熟rnas所需的加工和功能分子构象的意外变化。

{"title":"Circularization of 23S rRNA but not 16S rRNA within archaeal ribosomes.","authors":"Ling-Dong Shi, Petar I Penev, Amos J Nissley, Dipti D Nayak, Rohan Sachdeva, Jamie H D Cate, Jillian F Banfield","doi":"10.1186/s13059-025-03903-0","DOIUrl":"10.1186/s13059-025-03903-0","url":null,"abstract":"Background: Processing of archaeal 16S and 23S rRNAs is believed to involve excision of individual rRNAs from polycistronic precursors, circularization of excised rRNAs, and re-linearization before the incorporation into ribosomes. However, all the knowledge is derived from several isolated species, leaving open the possibility that different processes may occur in other archaeal groups.Results: Here, we investigate rRNAs from diverse and mostly uncultivated archaea. Sequencing of total cellular RNA from eight phylum-level lineages indicates that archaeal circular 23S rRNA transcript abundances vastly exceed those of linear counterparts, and linear versions are often undetectable. As the majority of rRNAs derive from mature ribosomes, the data suggest that ribosomes contain circular 23S rRNAs. Thus, we directly sequence RNA extracted from isolated ribosomes of a model archaeon, Methanosarcina acetivorans, and confirm that the 23S rRNAs in the ribosomes are circular. Structural modeling places the 5' and 3' ends of the linear precursors of archaeal 23S rRNAs in close proximity to form a GNRA tetraloop (in which N is A, C, G, or U and R is A or G), consistent with their existence as circular molecules. We also confirm the existence of circular 16S rRNA intermediates in transcriptomes of most archaea, yet a circular form is not evident in some distinct archaeal groups, suggesting that certain archaea do not circularize 16S rRNA during processing.Conclusions: Our findings uncover unexpected variations in the processing required to generate mature rRNAs and the conformation of functional molecules in archaeal ribosomes.","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":" ","pages":"42"},"PeriodicalIF":12.3,"publicationDate":"2026-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146067926","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Unveiling the early defense response dynamics in grapevines against Plasmopara viticola by single-cell transcriptomics. 利用单细胞转录组学揭示葡萄对葡萄浆原菌的早期防御反应动力学。

IF 12.3 1区生物学 Q1 Agricultural and Biological Sciences

Genome Biology

Pub Date : 2026-01-27 DOI: 10.1186/s13059-025-03904-z

Xiukun Yao, Zhizhuo Xu, Yasheng Xi, Xinyue He, Qifei Gao, Jiang Lu, Peining Fu

Background: The viticulture has long suffered from the downy mildew caused by Plasmopara viticola, a strictly obligate biotrophic oomycete. Numerous studies have been performed to reveal how grapevine defends against Plasmopara viticola, but they mainly investigate the plant defense responses on the whole tissue level, not on the cellular level.

Results: Here we employ single-cell RNA sequencing and spatial RNA sequencing to profile approximately 100,000 individual cells (~ 89,000 from scRNA-seq and ~ 11,000 from spRNA-seq), generating the first single-cell transcriptome atlas of grapevine leaves during Plasmopara viticola infection. This high-resolution atlas reveals the dynamic and distinct defense responses of plant cells at early stages of oomycete infection. Notably, we find that Plasmopara viticola reprograms the guard cell transcriptome to facilitate successful invasion, likely by altering the expression of ABA negative regulators and modulating a potassium channel regulatory pathway to influence stomatal movement.

Conclusions: Overall, our work reveals differential and dynamic responses of grapevine to the Plasmopara viticola infection at a single-cell level, providing valuable clues for dissecting the interaction between plants and oomycetes.

背景：葡萄栽培长期遭受由葡萄浆原菌（Plasmopara viticola）引起的霜霉病的困扰，这是一种严格的生物营养性卵菌。大量研究揭示了葡萄如何防御葡萄浆原菌，但它们主要是在整个组织水平上研究植物的防御反应，而不是在细胞水平上。结果：本研究采用单细胞RNA测序和空间RNA测序技术，对大约10万个细胞（约89000个来自scRNA-seq，约11000个来自spRNA-seq）进行了分析，获得了葡萄浆原菌感染期间葡萄叶片的首个单细胞转录组图谱。这个高分辨率的图谱揭示了植物细胞在卵霉菌感染的早期阶段的动态和独特的防御反应。值得注意的是，我们发现葡萄浆原对保护细胞转录组进行了重编程，以促进成功入侵，可能是通过改变ABA负调节因子的表达和调节钾通道调节途径来影响气孔运动。结论：总的来说，我们的工作揭示了葡萄在单细胞水平上对葡萄浆原菌感染的差异和动态反应，为解剖植物与卵菌之间的相互作用提供了有价值的线索。

{"title":"Unveiling the early defense response dynamics in grapevines against Plasmopara viticola by single-cell transcriptomics.","authors":"Xiukun Yao, Zhizhuo Xu, Yasheng Xi, Xinyue He, Qifei Gao, Jiang Lu, Peining Fu","doi":"10.1186/s13059-025-03904-z","DOIUrl":"10.1186/s13059-025-03904-z","url":null,"abstract":"Background: The viticulture has long suffered from the downy mildew caused by Plasmopara viticola, a strictly obligate biotrophic oomycete. Numerous studies have been performed to reveal how grapevine defends against Plasmopara viticola, but they mainly investigate the plant defense responses on the whole tissue level, not on the cellular level.Results: Here we employ single-cell RNA sequencing and spatial RNA sequencing to profile approximately 100,000 individual cells (~ 89,000 from scRNA-seq and ~ 11,000 from spRNA-seq), generating the first single-cell transcriptome atlas of grapevine leaves during Plasmopara viticola infection. This high-resolution atlas reveals the dynamic and distinct defense responses of plant cells at early stages of oomycete infection. Notably, we find that Plasmopara viticola reprograms the guard cell transcriptome to facilitate successful invasion, likely by altering the expression of ABA negative regulators and modulating a potassium channel regulatory pathway to influence stomatal movement.Conclusions: Overall, our work reveals differential and dynamic responses of grapevine to the Plasmopara viticola infection at a single-cell level, providing valuable clues for dissecting the interaction between plants and oomycetes.","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":" ","pages":"43"},"PeriodicalIF":12.3,"publicationDate":"2026-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146068020","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Accurate variant effect estimation in FACS-based deep mutational scanning data with Lilace. 基于facs的深度突变扫描数据中变异效应的精确估计。

IF 12.3 1区生物学 Q1 Agricultural and Biological Sciences

Genome Biology

Pub Date : 2026-01-27 DOI: 10.1186/s13059-026-03934-1

Jerome Freudenberg, Jingyou Rao, Matthew K Howard, Christian Macdonald, Noah F Greenwald, Willow Coyote-Maestas, Harold Pimentel

Deep mutational scanning (DMS) coupled with fluorescence-activated cell sorting (FACS) provides a high-throughput method to link genetic variants with quantitative molecular phenotypes. Analysis of these experiments is challenging due to measurement variance and the multidimensional FACS readout. However, no statistical method has yet been developed to address these challenges. Here we present Lilace, a Bayesian statistical model to estimate variant effects with uncertainty quantification from FACS-based DMS experiments. We validate Lilace's performance and robustness using simulated data and apply it to OCT1 and Kir2.1 DMS datasets, demonstrating an improved false discovery rate while largely maintaining sensitivity.

深度突变扫描（DMS）与荧光激活细胞分选（FACS）相结合，提供了一种高通量方法，将遗传变异与定量分子表型联系起来。由于测量方差和多维FACS读数，这些实验的分析具有挑战性。然而，目前还没有统计方法来解决这些挑战。在这里，我们提出了一个贝叶斯统计模型Lilace，用于估计基于facs的DMS实验的不确定性量化的变异效应。我们使用模拟数据验证了Lilace的性能和鲁棒性，并将其应用于OCT1和Kir2.1 DMS数据集，证明了在很大程度上保持灵敏度的同时提高了错误发现率。

引用次数: 0

Whole genome sequence analysis of pulmonary function and COPD in 44,287 multi-ancestry participants. 44,287名多祖先参与者肺功能和COPD的全基因组序列分析。

IF 12.3 1区生物学 Q1 Agricultural and Biological Sciences

Genome Biology

Pub Date : 2026-01-15 DOI: 10.1186/s13059-025-03921-y

Wonji Kim, Xiaowei Hu, Kangjin Kim, Sung Chun, Peter Orchard, Dandi Qiao, Ingo Ruczinski, Aabida Saferali, Francois Aguet, Lucinda Antonacci-Fulton, Pallavi P Balte, Traci M Bartz, Wardatul Jannat Anamika, Xiaobo Zhou, JunYi Duan, Jennifer A Brody, Brian E Cade, Martha L Daviglus, Harshavadran Doddapaneni, Shannon Dugan-Perez, Susan K Dutcher, Christian D Frazar, Stacey B Gabriel, Sina A Gharib, Namrata Gupta, Brian D Hobbs, Silva Kasela, Laura R Loehr, Ginger A Metcalf, Donna M Muzny, Elizabeth C Oelsner, Laura J Rasmussen-Torvik, Colleen M Sitlani, Joshua Smith, Tamar Sofer, Hanfei Xu, Bing Yu, David Zhang, John Ziniti, R Graham Barr, April P Carson, Myriam Fornage, Lifang Hou, Ravi Kalhan, Robert Kaplan, Tuuli Lappalainen, Stephanie J London, Alanna C Morrison, George T O'Connor, Bruce M Psaty, Laura M Raffield, Susan Redline, Stephen S Rich, Jerome I Rotter, Edwin K Silverman, Ani Manichaikul, Michael H Cho

Background: Whole genome sequence (WGS) data in multi-ancestry samples supports discovery of low-frequency or population-specific genetic variants associated with chronic obstructive pulmonary disease (COPD) and lung function.

Results: We performed single variant, structural variant, and gene-based analysis of pulmonary function (FEV₁, FVC and FEV₁/FVC) and COPD case-control status in 44,287 multi-ancestry participants from the NHLBI Trans-Omics for Precision Medicine (TOPMed) Program. We validated findings using the UK Biobank and assessed implicated genes using lung single-cell RNA-seq (scRNA-seq) data sets. Applying a genome-wide significance threshold (P < 5 × 10^-9), we replicated known loci and identified novel associations near LY86, MAGI1, GRK7, and LINC02668. Colocalization with gene expression quantitative trait loci (eQTL) from the Lung Tissue Research Consortium highlighted known candidate genes including ADAM19, THSD4, C4B, and PSMA4, which were not identified through other eQTL sources. Multi-ancestry analysis improved fine-mapping resolution (e.g., HTR4 and RIN3). Gene-based analysis identified and replicated HMCN1. In human lung scRNA-seq data sets, lung epithelial cells and immune cell types showed enriched expression, while fibroblasts showed higher expression for HMCN1. CRISPR targeting HMCN1 in IMR90 demonstrated reduced expression of collagen genes.

Conclusions: Large-scale multi-ancestry WGS analysis improves variant discovery and fine-mapping resolution for lung function and COPD and highlights biologically relevant genes and pathways.

背景：多祖先样本的全基因组序列（WGS）数据支持发现与慢性阻塞性肺疾病（COPD）和肺功能相关的低频或人群特异性遗传变异。结果：我们对来自NHLBI Trans-Omics for Precision Medicine （TOPMed）项目的44,287名多血统参与者进行了单变异、结构变异和基于基因的肺功能（FEV1、FVC和FEV1/FVC）和COPD病例对照状态的分析。我们使用UK Biobank验证了研究结果，并使用肺单细胞RNA-seq （scRNA-seq）数据集评估了相关基因。应用全基因组显著性阈值（P -9），我们复制了已知的位点，并在LY86、MAGI1、GRK7和LINC02668附近发现了新的关联。与来自肺组织研究联盟的基因表达数量性状位点（eQTL）共定位突出了已知的候选基因，包括ADAM19、THSD4、C4B和PSMA4，这些基因未通过其他eQTL来源鉴定。多祖先分析提高了精细制图的分辨率（例如，HTR4和RIN3）。基于基因的分析鉴定并复制了HMCN1。在人肺scRNA-seq数据集中，肺上皮细胞和免疫细胞类型表达丰富，而成纤维细胞表达HMCN1较高。在IMR90中靶向HMCN1的CRISPR显示胶原基因表达降低。结论：大规模多祖先WGS分析提高了肺功能和COPD的变异发现和精细定位分辨率，并突出了生物学相关基因和途径。

{"title":"Whole genome sequence analysis of pulmonary function and COPD in 44,287 multi-ancestry participants.","authors":"Wonji Kim, Xiaowei Hu, Kangjin Kim, Sung Chun, Peter Orchard, Dandi Qiao, Ingo Ruczinski, Aabida Saferali, Francois Aguet, Lucinda Antonacci-Fulton, Pallavi P Balte, Traci M Bartz, Wardatul Jannat Anamika, Xiaobo Zhou, JunYi Duan, Jennifer A Brody, Brian E Cade, Martha L Daviglus, Harshavadran Doddapaneni, Shannon Dugan-Perez, Susan K Dutcher, Christian D Frazar, Stacey B Gabriel, Sina A Gharib, Namrata Gupta, Brian D Hobbs, Silva Kasela, Laura R Loehr, Ginger A Metcalf, Donna M Muzny, Elizabeth C Oelsner, Laura J Rasmussen-Torvik, Colleen M Sitlani, Joshua Smith, Tamar Sofer, Hanfei Xu, Bing Yu, David Zhang, John Ziniti, R Graham Barr, April P Carson, Myriam Fornage, Lifang Hou, Ravi Kalhan, Robert Kaplan, Tuuli Lappalainen, Stephanie J London, Alanna C Morrison, George T O'Connor, Bruce M Psaty, Laura M Raffield, Susan Redline, Stephen S Rich, Jerome I Rotter, Edwin K Silverman, Ani Manichaikul, Michael H Cho","doi":"10.1186/s13059-025-03921-y","DOIUrl":"10.1186/s13059-025-03921-y","url":null,"abstract":"Background: Whole genome sequence (WGS) data in multi-ancestry samples supports discovery of low-frequency or population-specific genetic variants associated with chronic obstructive pulmonary disease (COPD) and lung function.Results: We performed single variant, structural variant, and gene-based analysis of pulmonary function (FEV1, FVC and FEV1/FVC) and COPD case-control status in 44,287 multi-ancestry participants from the NHLBI Trans-Omics for Precision Medicine (TOPMed) Program. We validated findings using the UK Biobank and assessed implicated genes using lung single-cell RNA-seq (scRNA-seq) data sets. Applying a genome-wide significance threshold (P < 5 × 10-9), we replicated known loci and identified novel associations near LY86, MAGI1, GRK7, and LINC02668. Colocalization with gene expression quantitative trait loci (eQTL) from the Lung Tissue Research Consortium highlighted known candidate genes including ADAM19, THSD4, C4B, and PSMA4, which were not identified through other eQTL sources. Multi-ancestry analysis improved fine-mapping resolution (e.g., HTR4 and RIN3). Gene-based analysis identified and replicated HMCN1. In human lung scRNA-seq data sets, lung epithelial cells and immune cell types showed enriched expression, while fibroblasts showed higher expression for HMCN1. CRISPR targeting HMCN1 in IMR90 demonstrated reduced expression of collagen genes.Conclusions: Large-scale multi-ancestry WGS analysis improves variant discovery and fine-mapping resolution for lung function and COPD and highlights biologically relevant genes and pathways.","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":" ","pages":"28"},"PeriodicalIF":12.3,"publicationDate":"2026-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12888468/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145971500","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

VIST: variational inference for single cell time series. 单细胞时间序列的变分推理。

IF 12.3 1区生物学 Q1 Agricultural and Biological Sciences

Genome Biology

Pub Date : 2026-01-15 DOI: 10.1186/s13059-025-03874-2

Bingxian Xu, Rosemary Braun

引用次数: 0

Non-negative matrix factorization and deconvolution as a dual simplex problem. 非负矩阵分解和反褶积作为对偶单纯形问题。

IF 12.3 1区生物学 Q1 Agricultural and Biological Sciences

Genome Biology

Pub Date : 2026-01-14 DOI: 10.1186/s13059-025-03910-1

Denis Kleverov, Ekaterina Aladyeva, Alexey Serdyukov, Maxim N Artyomov

Background: Non-negative matrix factorization is a powerful linear algebra tool used in multiple areas of data analysis, including computational biology. Despite numerous optimization methods devised for non-negative matrix factorization, our understanding of the inherent topological structure within factorizable matrices remains limited.

Results: This study reveals the topological properties of linear mixture data, leading to a remarkable reduction of the non-negative matrix factorization optimization problem to a search for K(K-1) variables, where K represents the number of pure components, regardless of the initial matrix size. This is achieved by revealing complementary simplex structures existing in both feature and sample spaces and leveraging the Sinkhorn transformation to find the relationship between these simplexes. We validate this approach in the context of an unconstrained mixed images scenario and achieve a significant improvement in decomposition accuracy. Furthermore, we successfully applied the proposed approach in the biological context of bulk RNA-seq gene expression deconvolution.

Conclusions: The Dual Simplex unified analytical framework improves robustness to noise and enhances optimization stability, enabling accurate recovery of component proportions and expression profiles. Importantly, the framework naturally accommodates both reference-free and marker-based deconvolution settings, providing a general and efficient solution for analyzing complex biological mixtures such as bulk RNA-seq and single-cell derived data.

背景：非负矩阵分解是一种强大的线性代数工具，用于数据分析的多个领域，包括计算生物学。尽管为非负矩阵分解设计了许多优化方法，但我们对可分解矩阵内固有拓扑结构的理解仍然有限。结果：该研究揭示了线性混合数据的拓扑特性，使得非负矩阵分解优化问题显著减少为搜索K（K-1）个变量，其中K表示纯成分的数量，而与初始矩阵大小无关。这是通过揭示存在于特征空间和样本空间中的互补单纯形结构，并利用Sinkhorn变换来找到这些单纯形之间的关系来实现的。我们在无约束的混合图像场景中验证了这种方法，并显著提高了分解精度。此外，我们成功地将所提出的方法应用于大量RNA-seq基因表达反卷积的生物学背景。结论：Dual Simplex统一分析框架提高了对噪声的鲁棒性，增强了优化的稳定性，能够准确地恢复组分比例和表达谱。重要的是，该框架自然地适应无参考和基于标记的反褶积设置，为分析复杂的生物混合物（如大量RNA-seq和单细胞衍生数据）提供了通用和有效的解决方案。

{"title":"Non-negative matrix factorization and deconvolution as a dual simplex problem.","authors":"Denis Kleverov, Ekaterina Aladyeva, Alexey Serdyukov, Maxim N Artyomov","doi":"10.1186/s13059-025-03910-1","DOIUrl":"10.1186/s13059-025-03910-1","url":null,"abstract":"Background: Non-negative matrix factorization is a powerful linear algebra tool used in multiple areas of data analysis, including computational biology. Despite numerous optimization methods devised for non-negative matrix factorization, our understanding of the inherent topological structure within factorizable matrices remains limited.Results: This study reveals the topological properties of linear mixture data, leading to a remarkable reduction of the non-negative matrix factorization optimization problem to a search for K(K-1) variables, where K represents the number of pure components, regardless of the initial matrix size. This is achieved by revealing complementary simplex structures existing in both feature and sample spaces and leveraging the Sinkhorn transformation to find the relationship between these simplexes. We validate this approach in the context of an unconstrained mixed images scenario and achieve a significant improvement in decomposition accuracy. Furthermore, we successfully applied the proposed approach in the biological context of bulk RNA-seq gene expression deconvolution.Conclusions: The Dual Simplex unified analytical framework improves robustness to noise and enhances optimization stability, enabling accurate recovery of component proportions and expression profiles. Importantly, the framework naturally accommodates both reference-free and marker-based deconvolution settings, providing a general and efficient solution for analyzing complex biological mixtures such as bulk RNA-seq and single-cell derived data.","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":" ","pages":"25"},"PeriodicalIF":12.3,"publicationDate":"2026-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12888666/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145971485","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Genome Biology

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀