首页 > 最新文献

Genome research最新文献

英文 中文
Artificial intelligence and machine learning in cell-free-DNA-based diagnostics.
IF 6.2 2区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2025-01-22 DOI: 10.1101/gr.278413.123
W H Adrian Tsui, Spencer C Ding, Peiyong Jiang, Y M Dennis Lo

The discovery of circulating fetal and tumor cell-free DNA (cfDNA) molecules in plasma has opened up tremendous opportunities in noninvasive diagnostics such as the detection of fetal chromosomal aneuploidies and cancers and in posttransplantation monitoring. The advent of high-throughput sequencing technologies makes it possible to scrutinize the characteristics of cfDNA molecules, opening up the fields of cfDNA genetics, epigenetics, transcriptomics, and fragmentomics, providing a plethora of biomarkers. Machine learning (ML) and/or artificial intelligence (AI) technologies that are known for their ability to integrate high-dimensional features have recently been applied to the field of liquid biopsy. In this review, we highlight various AI and ML approaches in cfDNA-based diagnostics. We first introduce the biology of cell-free DNA and basic concepts of ML and AI technologies. We then discuss selected examples of ML- or AI-based applications in noninvasive prenatal testing and cancer liquid biopsy. These applications include the deduction of fetal DNA fraction, plasma DNA tissue mapping, and cancer detection and localization. Finally, we offer perspectives on the future direction of using ML and AI technologies to leverage cfDNA fragmentation patterns in terms of methylomic and transcriptional investigations.

{"title":"Artificial intelligence and machine learning in cell-free-DNA-based diagnostics.","authors":"W H Adrian Tsui, Spencer C Ding, Peiyong Jiang, Y M Dennis Lo","doi":"10.1101/gr.278413.123","DOIUrl":"10.1101/gr.278413.123","url":null,"abstract":"<p><p>The discovery of circulating fetal and tumor cell-free DNA (cfDNA) molecules in plasma has opened up tremendous opportunities in noninvasive diagnostics such as the detection of fetal chromosomal aneuploidies and cancers and in posttransplantation monitoring. The advent of high-throughput sequencing technologies makes it possible to scrutinize the characteristics of cfDNA molecules, opening up the fields of cfDNA genetics, epigenetics, transcriptomics, and fragmentomics, providing a plethora of biomarkers. Machine learning (ML) and/or artificial intelligence (AI) technologies that are known for their ability to integrate high-dimensional features have recently been applied to the field of liquid biopsy. In this review, we highlight various AI and ML approaches in cfDNA-based diagnostics. We first introduce the biology of cell-free DNA and basic concepts of ML and AI technologies. We then discuss selected examples of ML- or AI-based applications in noninvasive prenatal testing and cancer liquid biopsy. These applications include the deduction of fetal DNA fraction, plasma DNA tissue mapping, and cancer detection and localization. Finally, we offer perspectives on the future direction of using ML and AI technologies to leverage cfDNA fragmentation patterns in terms of methylomic and transcriptional investigations.</p>","PeriodicalId":12678,"journal":{"name":"Genome research","volume":"35 1","pages":"1-19"},"PeriodicalIF":6.2,"publicationDate":"2025-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11789496/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143023247","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Analysis of a cell-free DNA-based cancer screening cohort links fragmentomic profiles, nuclease levels, and plasma DNA concentrations. 分析基于无细胞 DNA 的癌症筛查队列,将片段组图谱、核酸酶水平和血浆 DNA 浓度联系起来。
IF 6.2 2区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2025-01-22 DOI: 10.1101/gr.279667.124
Yasine Malki, Guannan Kang, W K Jacky Lam, Qing Zhou, Suk Hang Cheng, Peter P H Cheung, Jinyue Bai, Ming Lok Chan, Chui Ting Lee, Wenlei Peng, Yiqiong Zhang, Wanxia Gai, Winsome W S Wong, Mary-Jane L Ma, Wenshuo Li, Xinzhou Xu, Zhuoran Gao, Irene O L Tse, Huimin Shang, L Y Lois Choy, Peiyong Jiang, K C Allen Chan, Y M Dennis Lo

The concentration of circulating cell-free DNA (cfDNA) in plasma is an important determinant of the robustness of liquid biopsies. However, biological mechanisms that lead to inter-individual differences in cfDNA concentrations remain unexplored. The concentration of plasma cfDNA is governed by an interplay between its release and clearance. We hypothesized that cfDNA clearance by nucleases might be one mechanism that contributes toward inter-individual variations in cfDNA concentrations. We performed fragmentomic analysis of the plasma cfDNA from 862 healthy individuals, with a cfDNA concentration range of 1.61-41.01 ng/mL. We observed an increase in large DNA fragments (231-600 bp), a decreased frequencies of shorter DNA fragments (20-160 bp), and an increased frequency of G-end motifs with increasing cfDNA concentrations. End motif deconvolution analysis revealed a decreased contribution of DNASE1L3 and DFFB in subjects with higher cfDNA concentration. The five subjects with the highest plasma DNA concentration (top 0.58%) had aberrantly decreased levels of DNASE1L3 protein in plasma. The cfDNA concentration could be inferred from the fragmentomic profile through machine learning and was well correlated to the measured cfDNA concentration. Such an approach could infer the fractional DNA concentration from particular tissue types, such as the fetal and tumor fraction. This work shows that individuals with different cfDNA concentrations are associated with characteristic fragmentomic patterns of the cfDNA pool and that nuclease-mediated clearance of DNA is a key parameter that affects cfDNA concentration. Understanding these mechanisms has facilitated the enhanced measurement of cfDNA species of clinical interest, including circulating fetal and tumor DNA.

血浆中循环游离细胞 DNA(cfDNA)的浓度是决定液体活检可靠性的重要因素。然而,导致个体间 cfDNA 浓度差异的生物学机制仍有待探索。血浆中 cfDNA 的浓度受其释放和清除之间相互作用的影响。我们假设,核酸酶清除 cfDNA 可能是导致 cfDNA 浓度个体间差异的机制之一。我们对 862 名健康人的血浆 cfDNA 进行了片段分析,其 cfDNA 浓度范围为 1.61 - 41.01 纳克/毫升。我们观察到,随着 cfDNA 浓度的增加,大 DNA 片段(231-600 bp)的频率增加,短 DNA 片段(20-160 bp)的频率减少,G 端基序的频率增加。末端基团解卷积分析表明,在 cfDNA 浓度较高的受试者中,DNASE1L3 和 DFFB 的贡献率有所下降。血浆 DNA 浓度最高的五名受试者(前 0.58%)血浆中的 DNASE1L3 蛋白水平异常降低。通过机器学习,可以从片段组图谱推断出cfDNA浓度,而且与测量的cfDNA浓度有很好的相关性。这种方法可以推断出特定组织类型(如胎儿和肿瘤部分)的DNA分数浓度。这项工作表明,不同cfDNA浓度的个体与cfDNA池的特征片段组模式有关;核酸酶介导的DNA清除是影响cfDNA浓度的关键参数。对这些机制的了解有助于加强对临床感兴趣的 cfDNA 种类(包括循环中的胎儿 DNA 和肿瘤 DNA)的测量。
{"title":"Analysis of a cell-free DNA-based cancer screening cohort links fragmentomic profiles, nuclease levels, and plasma DNA concentrations.","authors":"Yasine Malki, Guannan Kang, W K Jacky Lam, Qing Zhou, Suk Hang Cheng, Peter P H Cheung, Jinyue Bai, Ming Lok Chan, Chui Ting Lee, Wenlei Peng, Yiqiong Zhang, Wanxia Gai, Winsome W S Wong, Mary-Jane L Ma, Wenshuo Li, Xinzhou Xu, Zhuoran Gao, Irene O L Tse, Huimin Shang, L Y Lois Choy, Peiyong Jiang, K C Allen Chan, Y M Dennis Lo","doi":"10.1101/gr.279667.124","DOIUrl":"10.1101/gr.279667.124","url":null,"abstract":"<p><p>The concentration of circulating cell-free DNA (cfDNA) in plasma is an important determinant of the robustness of liquid biopsies. However, biological mechanisms that lead to inter-individual differences in cfDNA concentrations remain unexplored. The concentration of plasma cfDNA is governed by an interplay between its release and clearance. We hypothesized that cfDNA clearance by nucleases might be one mechanism that contributes toward inter-individual variations in cfDNA concentrations. We performed fragmentomic analysis of the plasma cfDNA from 862 healthy individuals, with a cfDNA concentration range of 1.61-41.01 ng/mL. We observed an increase in large DNA fragments (231-600 bp), a decreased frequencies of shorter DNA fragments (20-160 bp), and an increased frequency of G-end motifs with increasing cfDNA concentrations. End motif deconvolution analysis revealed a decreased contribution of DNASE1L3 and DFFB in subjects with higher cfDNA concentration. The five subjects with the highest plasma DNA concentration (top 0.58%) had aberrantly decreased levels of DNASE1L3 protein in plasma. The cfDNA concentration could be inferred from the fragmentomic profile through machine learning and was well correlated to the measured cfDNA concentration. Such an approach could infer the fractional DNA concentration from particular tissue types, such as the fetal and tumor fraction. This work shows that individuals with different cfDNA concentrations are associated with characteristic fragmentomic patterns of the cfDNA pool and that nuclease-mediated clearance of DNA is a key parameter that affects cfDNA concentration. Understanding these mechanisms has facilitated the enhanced measurement of cfDNA species of clinical interest, including circulating fetal and tumor DNA.</p>","PeriodicalId":12678,"journal":{"name":"Genome research","volume":" ","pages":"31-42"},"PeriodicalIF":6.2,"publicationDate":"2025-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11789642/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142739223","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Common and specific gene regulatory programs in zebrafish caudal fin regeneration at single-cell resolution 单细胞分辨率下斑马鱼尾鳍再生过程中的常见和特异基因调控程序
IF 7 2区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2025-01-14 DOI: 10.1101/gr.279372.124
Yujie Chen, Yiran Hou, Qinglin Zeng, Irene Wang, Meiru Shang, Kwangdeok Shin, Christopher Hemauer, Xiaoyun Xing, Junsu Kang, Guoyan Zhao, Ting Wang
Following amputation, zebrafish regenerate their injured caudal fin through lineage-restricted reprogramming. Although previous studies have charted various genetic and epigenetic dimensions of this process, the intricate gene regulatory programs shared by, or unique to, different regenerating cell types remain underinvestigated. Here, we mapped the regulatory landscape of fin regeneration by applying paired snRNA-seq and snATAC-seq on uninjured and regenerating fins. This map delineates the regulatory dynamics of predominant cell populations at multiple stages of regeneration. We observe a marked increase in the accessibility of chromatin regions associated with regenerative and developmental processes at 1 dpa, followed by a gradual closure across major cell types at later stages. This pattern is distinct from that of transcriptomic dynamics, which is characterized by several waves of gene upregulation and downregulation. We identified and in vivo validated cell-type-specific and position-specific regeneration-responsive enhancers and constructed regulatory networks by cell type and stage. Our single-cell resolution transcriptomic and chromatin accessibility map across regenerative stages provides new insights into regeneration regulatory mechanisms and serves as a valuable resource for the community.
截肢后,斑马鱼通过谱系限制重编程再生其受伤的尾鳍。尽管先前的研究已经绘制了这一过程的各种遗传和表观遗传维度,但不同再生细胞类型共享或独特的复杂基因调控程序仍未得到充分研究。在这里,我们通过在未受伤和再生的鳍上应用配对的snRNA-seq和snATAC-seq,绘制了鳍再生的调控图景。这张图描绘了在再生的多个阶段优势细胞群的调控动态。我们观察到在1 dpa时,与再生和发育过程相关的染色质区域的可及性显著增加,随后在后期阶段,主要细胞类型逐渐关闭。这种模式与转录组动力学不同,转录组动力学的特点是基因上调和下调几波。我们鉴定并在体内验证了细胞类型特异性和位置特异性再生反应增强子,并根据细胞类型和阶段构建了调控网络。我们的单细胞分辨率转录组和染色质可及性图谱跨越再生阶段,为再生调控机制提供了新的见解,并为社区提供了宝贵的资源。
{"title":"Common and specific gene regulatory programs in zebrafish caudal fin regeneration at single-cell resolution","authors":"Yujie Chen, Yiran Hou, Qinglin Zeng, Irene Wang, Meiru Shang, Kwangdeok Shin, Christopher Hemauer, Xiaoyun Xing, Junsu Kang, Guoyan Zhao, Ting Wang","doi":"10.1101/gr.279372.124","DOIUrl":"https://doi.org/10.1101/gr.279372.124","url":null,"abstract":"Following amputation, zebrafish regenerate their injured caudal fin through lineage-restricted reprogramming. Although previous studies have charted various genetic and epigenetic dimensions of this process, the intricate gene regulatory programs shared by, or unique to, different regenerating cell types remain underinvestigated. Here, we mapped the regulatory landscape of fin regeneration by applying paired snRNA-seq and snATAC-seq on uninjured and regenerating fins. This map delineates the regulatory dynamics of predominant cell populations at multiple stages of regeneration. We observe a marked increase in the accessibility of chromatin regions associated with regenerative and developmental processes at 1 dpa, followed by a gradual closure across major cell types at later stages. This pattern is distinct from that of transcriptomic dynamics, which is characterized by several waves of gene upregulation and downregulation. We identified and in vivo validated cell-type-specific and position-specific regeneration-responsive enhancers and constructed regulatory networks by cell type and stage. Our single-cell resolution transcriptomic and chromatin accessibility map across regenerative stages provides new insights into regeneration regulatory mechanisms and serves as a valuable resource for the community.","PeriodicalId":12678,"journal":{"name":"Genome research","volume":"36 1","pages":""},"PeriodicalIF":7.0,"publicationDate":"2025-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142974823","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Characterization of the role of spatial proximity of DNA double-strand breaks in the formation of CRISPR-Cas9-induced large structural variations DNA双链断裂的空间邻近性在crispr - cas9诱导的大结构变异形成中的作用
IF 7 2区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2025-01-13 DOI: 10.1101/gr.278575.123
Mikkel Dahl-Jessen, Thorkild Terkelsen, Rasmus O Bak, Uffe Birk Jensen
Structural variations (SVs) play important roles in genetic diversity, evolution, and carcinogenesis and are, as such, important for human health. However, it remains unclear how spatial proximity of double-strand breaks (DSBs) affects the formation of SVs. To investigate if spatial proximity between two DSBs affects DNA repair, we used data from 3C experiments (Hi-C, ChIA-PET, and ChIP-seq) to identify highly interacting loci on six different chromosomes. The target regions correlate with the borders of mega-base sized Topologically Associated Domains (TADs), and we used CRISPR-Cas9 nuclease and pairs of single guide RNAs (sgRNAs) against these targets to generate DSBs in both K562 cells and H9 human embryonic stem cells (hESC). Droplet Digital PCR (ddPCR) was used to quantify the resulting recombination events, and high-throughput sequencing was used to analyze the chimeric junctions created between the two DSBs. We observe a significantly higher formation frequency of deletions and inversions with DSBs in proximity as compared to deletions and inversions with DSBs not in proximity in K562 cells. Additionally, our results suggest that DSB proximity may affect the ligation of chimeric deletion junctions. Taken together, spatial proximity between DSBs is a significant predictor of large-scale deletion and inversion frequency induced by CRISPR-Cas9 in K562 cells. This finding has implications for understanding SVs in the human genome and for the future application of CRISPR-Cas9 in gene editing and the modelling of rare SVs.
结构变异(SVs)在遗传多样性、进化和致癌作用中发挥着重要作用,因此对人类健康非常重要。然而,双链断裂(dsb)的空间邻近性如何影响sv的形成尚不清楚。为了研究两个dsb之间的空间接近是否会影响DNA修复,我们使用了3C实验(Hi-C, china - pet和ChIP-seq)的数据来鉴定六条不同染色体上高度相互作用的位点。靶区与大碱基大小的拓扑相关结构域(TADs)的边界相关,我们使用CRISPR-Cas9核酸酶和针对这些靶点的单向导rna对(sgRNAs)在K562细胞和H9人胚胎干细胞(hESC)中生成dsb。利用液滴数字PCR (ddPCR)对重组事件进行定量分析,并利用高通量测序对两个dsb之间的嵌合连接进行分析。我们观察到在K562细胞中,与dsb邻近的缺失和反转相比,dsb邻近的缺失和反转的形成频率明显更高。此外,我们的结果表明,DSB邻近可能影响嵌合缺失连接的连接。综上所述,dsb之间的空间接近性是CRISPR-Cas9在K562细胞中诱导的大规模缺失和反转频率的重要预测因子。这一发现对理解人类基因组中的SVs以及CRISPR-Cas9在基因编辑和罕见SVs建模中的未来应用具有重要意义。
{"title":"Characterization of the role of spatial proximity of DNA double-strand breaks in the formation of CRISPR-Cas9-induced large structural variations","authors":"Mikkel Dahl-Jessen, Thorkild Terkelsen, Rasmus O Bak, Uffe Birk Jensen","doi":"10.1101/gr.278575.123","DOIUrl":"https://doi.org/10.1101/gr.278575.123","url":null,"abstract":"Structural variations (SVs) play important roles in genetic diversity, evolution, and carcinogenesis and are, as such, important for human health. However, it remains unclear how spatial proximity of double-strand breaks (DSBs) affects the formation of SVs. To investigate if spatial proximity between two DSBs affects DNA repair, we used data from 3C experiments (Hi-C, ChIA-PET, and ChIP-seq) to identify highly interacting loci on six different chromosomes. The target regions correlate with the borders of mega-base sized Topologically Associated Domains (TADs), and we used CRISPR-Cas9 nuclease and pairs of single guide RNAs (sgRNAs) against these targets to generate DSBs in both K562 cells and H9 human embryonic stem cells (hESC). Droplet Digital PCR (ddPCR) was used to quantify the resulting recombination events, and high-throughput sequencing was used to analyze the chimeric junctions created between the two DSBs. We observe a significantly higher formation frequency of deletions and inversions with DSBs in proximity as compared to deletions and inversions with DSBs not in proximity in K562 cells. Additionally, our results suggest that DSB proximity may affect the ligation of chimeric deletion junctions. Taken together, spatial proximity between DSBs is a significant predictor of large-scale deletion and inversion frequency induced by CRISPR-Cas9 in K562 cells. This finding has implications for understanding SVs in the human genome and for the future application of CRISPR-Cas9 in gene editing and the modelling of rare SVs.","PeriodicalId":12678,"journal":{"name":"Genome research","volume":"29 1","pages":""},"PeriodicalIF":7.0,"publicationDate":"2025-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142974716","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Nanopore-based consensus sequencing enables accurate multimodal tumor cell-free DNA profiling 基于纳米孔的共识测序能够实现准确的多模态肿瘤无细胞DNA分析
IF 7 2区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2025-01-13 DOI: 10.1101/gr.279144.124
Li-Ting Chen, Myrthe Jager, Dàmi Rebergen, Geertruid J. Brink, Tom van den Ende, Willem Vanderlinden, Pauline Kolbeck, Marc Pagès-Gallego, Ymke van der Pol, Nicolle Besselink, Norbert Moldovan, Nizar Hami, Wigard P. Kloosterman, Hanneke van Laarhoven, Florent Mouliere, Ronald Zweemer, Jan Lipfert, Sarah Derks, Alessio Marcozzi, Jeroen de Ridder
Shallow genome-wide cell-free DNA (cfDNA) sequencing holds great promise for non-invasive cancer monitoring by providing reliable copy number alteration (CNA) and fragmentomic profiles. Single nucleotide variations (SNVs) are, however, much harder to identify with low sequencing depth due to sequencing errors. Here we present Nanopore Rolling Circle Amplification (RCA)-enhanced Consensus Sequencing (NanoRCS), which leverages RCA and consensus calling based on genome-wide long-read nanopore sequencing to enable simultaneous multimodal tumor fraction estimation through SNVs, CNAs, and fragmentomics. Efficacy of NanoRCS is tested on 18 cancer patient samples and seven healthy controls, demonstrating its ability to reliably detect tumor fractions as low as 0.24%. In vitro experiments confirm that SNV measurements are essential for detecting tumor fractions below 3%. NanoRCS provides the opportunity for cost-effective and rapid processing, which aligns well with clinical needs, particularly in settings where quick and accurate cancer monitoring is essential for personalized treatment strategies.
浅层全基因组无细胞DNA(cfDNA)测序可提供可靠的拷贝数变异(CNA)和片段组图谱,因此在无创癌症监测方面大有可为。然而,由于测序误差,单核苷酸变异(SNV)在测序深度较低的情况下更难识别。在这里,我们介绍了纳米孔滚动圈扩增(RCA)增强共识测序(NanoRCS),它利用基于全基因组长线程纳米孔测序的 RCA 和共识调用,通过 SNV、CNA 和片段组学同时进行多模态肿瘤分型估计。NanoRCS 的功效在 18 个癌症患者样本和 7 个健康对照样本上进行了测试,证明它能可靠地检测出低至 0.24% 的肿瘤成分。体外实验证实,SNV 测量对于检测低于 3% 的肿瘤片段至关重要。NanoRCS 为成本效益型快速处理提供了机会,这非常符合临床需求,尤其是在快速准确的癌症监测对个性化治疗策略至关重要的情况下。
{"title":"Nanopore-based consensus sequencing enables accurate multimodal tumor cell-free DNA profiling","authors":"Li-Ting Chen, Myrthe Jager, Dàmi Rebergen, Geertruid J. Brink, Tom van den Ende, Willem Vanderlinden, Pauline Kolbeck, Marc Pagès-Gallego, Ymke van der Pol, Nicolle Besselink, Norbert Moldovan, Nizar Hami, Wigard P. Kloosterman, Hanneke van Laarhoven, Florent Mouliere, Ronald Zweemer, Jan Lipfert, Sarah Derks, Alessio Marcozzi, Jeroen de Ridder","doi":"10.1101/gr.279144.124","DOIUrl":"https://doi.org/10.1101/gr.279144.124","url":null,"abstract":"Shallow genome-wide cell-free DNA (cfDNA) sequencing holds great promise for non-invasive cancer monitoring by providing reliable copy number alteration (CNA) and fragmentomic profiles. Single nucleotide variations (SNVs) are, however, much harder to identify with low sequencing depth due to sequencing errors. Here we present Nanopore Rolling Circle Amplification (RCA)-enhanced Consensus Sequencing (NanoRCS), which leverages RCA and consensus calling based on genome-wide long-read nanopore sequencing to enable simultaneous multimodal tumor fraction estimation through SNVs, CNAs, and fragmentomics. Efficacy of NanoRCS is tested on 18 cancer patient samples and seven healthy controls, demonstrating its ability to reliably detect tumor fractions as low as 0.24%. In vitro experiments confirm that SNV measurements are essential for detecting tumor fractions below 3%. NanoRCS provides the opportunity for cost-effective and rapid processing, which aligns well with clinical needs, particularly in settings where quick and accurate cancer monitoring is essential for personalized treatment strategies.","PeriodicalId":12678,"journal":{"name":"Genome research","volume":"83 1","pages":""},"PeriodicalIF":7.0,"publicationDate":"2025-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142974713","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Interactive visualization and interpretation of pangenome graphs by linear-reference-based coordinate projection and annotation integration 通过基于线性参照的坐标投影和注释整合,实现庞基因组图谱的交互式可视化和解读
IF 7 2区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2025-01-13 DOI: 10.1101/gr.279461.124
Zepu Miao, Jia-Xing Yue
With the increasing availability of high-quality genome assemblies, pangenome graphs emerged as a new paradigm in the genomics field for identifying, encoding, and presenting genomic variation at both population and species levels. However, it remains challenging to truly dissect and interpret pangenome graphs via biologically informative visualization. To facilitate better exploration and understanding of pangenome graphs towards novel biological insights, here we present a web-based interactive Visualization and interpretation framework for linear-Reference-projected Pangenome Graphs (VRPG). VRPG provides efficient and intuitive supports for exploring and annotating pangenome graphs along a linear-genome-based coordinate system (e.g., that of a primary linear reference genome). Moreover, VRPG offers many unique features such as in-graph path highlighting for graph-constituent input assemblies, copy number characterization for graph-embedding nodes, graph-based mapping for query sequences, all of which are highly valuable for researchers working with pangenome graphs. Additionally, VRPG enables side-by-side visualization between the graph-based pangenome representation and the conventional primary-linear-reference-genome-based feature annotations, therefore seamlessly bridging the graph and linear genomic contexts. To further demonstrate its functionality and scalability, we applied VRPG to the cutting-edge yeast and human reference pangenome graphs derived from hundreds of high-quality genome assemblies via a dedicated web portal and examined their local genome diversity in the graph contexts.
随着高质量基因组汇编的日益普及,泛基因组图谱成为基因组学领域的一种新范式,用于在种群和物种水平上识别、编码和展示基因组变异。然而,通过生物信息可视化来真正剖析和解释庞基因组图谱仍然具有挑战性。为了促进更好地探索和理解庞基因组图,从而获得新的生物学见解,我们在此提出了一个基于网络的线性参考投射庞基因组图(VRPG)交互式可视化和解释框架。VRPG 为沿着基于线性基因组的坐标系(如主要线性参考基因组的坐标系)探索和注释庞基因组图提供了高效、直观的支持。此外,VRPG 还提供了许多独特的功能,如图内路径高亮显示组成图的输入集合、图嵌入节点的拷贝数特征、基于图的查询序列映射等,所有这些功能对研究人员处理庞基因组图都非常有价值。此外,VRPG 还能将基于图的庞基因组表示法与传统的基于主线性参考基因组的特征注释并排可视化,从而无缝连接图和线性基因组上下文。为了进一步证明 VRPG 的功能性和可扩展性,我们将 VRPG 应用于通过专用门户网站从数百个高质量基因组组装中获得的最先进的酵母和人类参考泛基因组图谱,并在图谱上下文中研究了它们的局部基因组多样性。
{"title":"Interactive visualization and interpretation of pangenome graphs by linear-reference-based coordinate projection and annotation integration","authors":"Zepu Miao, Jia-Xing Yue","doi":"10.1101/gr.279461.124","DOIUrl":"https://doi.org/10.1101/gr.279461.124","url":null,"abstract":"With the increasing availability of high-quality genome assemblies, pangenome graphs emerged as a new paradigm in the genomics field for identifying, encoding, and presenting genomic variation at both population and species levels. However, it remains challenging to truly dissect and interpret pangenome graphs via biologically informative visualization. To facilitate better exploration and understanding of pangenome graphs towards novel biological insights, here we present a web-based interactive Visualization and interpretation framework for linear-Reference-projected Pangenome Graphs (VRPG). VRPG provides efficient and intuitive supports for exploring and annotating pangenome graphs along a linear-genome-based coordinate system (e.g., that of a primary linear reference genome). Moreover, VRPG offers many unique features such as in-graph path highlighting for graph-constituent input assemblies, copy number characterization for graph-embedding nodes, graph-based mapping for query sequences, all of which are highly valuable for researchers working with pangenome graphs. Additionally, VRPG enables side-by-side visualization between the graph-based pangenome representation and the conventional primary-linear-reference-genome-based feature annotations, therefore seamlessly bridging the graph and linear genomic contexts. To further demonstrate its functionality and scalability, we applied VRPG to the cutting-edge yeast and human reference pangenome graphs derived from hundreds of high-quality genome assemblies via a dedicated web portal and examined their local genome diversity in the graph contexts.","PeriodicalId":12678,"journal":{"name":"Genome research","volume":"27 1","pages":""},"PeriodicalIF":7.0,"publicationDate":"2025-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142974715","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Single-cell Rapid Capture Hybridization sequencing to reliably detect isoform usage and coding mutations in targeted genes 单细胞快速捕获杂交测序可靠地检测同种异构体的使用和编码突变的目标基因
IF 7 2区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2025-01-10 DOI: 10.1101/gr.279322.124
Hongke Peng, Jafar S. Jabbari, Luyi Tian, Changqing Wang, Yupei You, Chong Chyn Chua, Natasha S. Anstee, Noorul Amin, Andrew H. Wei, Nadia Davidson, Andrew W. Roberts, David Huang, Matthew E Ritchie, Rachel Thijssen
Single-cell long-read sequencing has transformed our understanding of isoform usage and the mutation heterogeneity between cells. Despite unbiased in-depth analysis, the low sequencing throughput often results in insufficient read coverage thereby limiting our ability to perform mutation calling for specific genes. Here, we developed a single-cell Rapid Capture Hybridization sequencing (scRaCH-seq) method that demonstrated high specificity and efficiency in capturing targeted transcripts using long-read sequencing, allowing an in-depth analysis of mutation status and transcript usage for genes of interest. The method includes creating a probe panel for transcript capture, using barcoded primers for pooling and efficient sequencing via Oxford Nanopore Technologies platforms. scRaCH-seq is applicable to stored and indexed single-cell cDNA which allows analysis to be combined with existing short-read RNA-seq datasets. In our investigation of BTK and SF3B1 genes in samples from patients with chronic lymphocytic leukaemia (CLL), we detected SF3B1 isoforms and mutations with high sensitivity. Integration with short-read scRNA-seq data revealed significant gene expression differences in SF3B1-mutated CLL cells, though it did not impact the sensitivity of the anti-cancer drug venetoclax. scRaCH-seq's capability to study long-read transcripts of multiple genes makes it a powerful tool for single-cell genomics.
单细胞长读测序改变了我们对异构体使用和细胞间突变异质性的理解。尽管进行了无偏见的深入分析,但低测序通量往往导致读取覆盖率不足,从而限制了我们对特定基因进行突变调用的能力。在这里,我们开发了一种单细胞快速捕获杂交测序(scRaCH-seq)方法,该方法在使用长读测序捕获目标转录本方面表现出高特异性和高效率,允许对感兴趣基因的突变状态和转录本使用进行深入分析。该方法包括创建一个用于转录捕获的探针面板,使用条形码引物通过Oxford Nanopore Technologies平台进行池化和高效测序。scRaCH-seq适用于存储和索引的单细胞cDNA,允许分析与现有的短读RNA-seq数据集相结合。在我们对慢性淋巴细胞白血病(CLL)患者样本中BTK和SF3B1基因的研究中,我们以高灵敏度检测到SF3B1亚型和突变。与短读scRNA-seq数据的整合显示,sf3b1突变的CLL细胞中基因表达存在显著差异,但不影响抗癌药物venetoclax的敏感性。scRaCH-seq研究多基因长读转录本的能力使其成为单细胞基因组学的强大工具。
{"title":"Single-cell Rapid Capture Hybridization sequencing to reliably detect isoform usage and coding mutations in targeted genes","authors":"Hongke Peng, Jafar S. Jabbari, Luyi Tian, Changqing Wang, Yupei You, Chong Chyn Chua, Natasha S. Anstee, Noorul Amin, Andrew H. Wei, Nadia Davidson, Andrew W. Roberts, David Huang, Matthew E Ritchie, Rachel Thijssen","doi":"10.1101/gr.279322.124","DOIUrl":"https://doi.org/10.1101/gr.279322.124","url":null,"abstract":"Single-cell long-read sequencing has transformed our understanding of isoform usage and the mutation heterogeneity between cells. Despite unbiased in-depth analysis, the low sequencing throughput often results in insufficient read coverage thereby limiting our ability to perform mutation calling for specific genes. Here, we developed a single-cell Rapid Capture Hybridization sequencing (scRaCH-seq) method that demonstrated high specificity and efficiency in capturing targeted transcripts using long-read sequencing, allowing an in-depth analysis of mutation status and transcript usage for genes of interest. The method includes creating a probe panel for transcript capture, using barcoded primers for pooling and efficient sequencing via Oxford Nanopore Technologies platforms. scRaCH-seq is applicable to stored and indexed single-cell cDNA which allows analysis to be combined with existing short-read RNA-seq datasets. In our investigation of <em>BTK</em> and <em>SF3B1</em> genes in samples from patients with chronic lymphocytic leukaemia (CLL), we detected <em>SF3B1</em> isoforms and mutations with high sensitivity. Integration with short-read scRNA-seq data revealed significant gene expression differences in <em>SF3B1</em>-mutated CLL cells, though it did not impact the sensitivity of the anti-cancer drug venetoclax. scRaCH-seq's capability to study long-read transcripts of multiple genes makes it a powerful tool for single-cell genomics.","PeriodicalId":12678,"journal":{"name":"Genome research","volume":"36 1","pages":""},"PeriodicalIF":7.0,"publicationDate":"2025-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142961752","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Proxy panels enable privacy-aware outsourcing of genotype imputation 代理面板使隐私意识外包的基因型插入
IF 7 2区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2025-01-10 DOI: 10.1101/gr.278934.124
Degui Zhi, Xiaoqian Jiang, Arif O. Harmanci
One of the major challenges in genomic data sharing is protecting participants' privacy in collaborative studies and when genomic data is outsourced to perform analysis tasks, e.g., genotype imputation services and federated collaborations genomic analysis. Although numerous cryptographic methods have been developed, these methods may not yet be practical for population-scale tasks in terms of computational requirements, rely on high-level expertise in security, and require each algorithm to be implemented from scratch. In this study, we focus on outsourcing of genotype imputation, a fundamental task that utilizes population-level reference panels, and develop protocols that rely on using "proxy-panels" to protect genotype panels while imputation task is being outsourced at servers. The proxy panels are generated through a series of protection mechanisms such as haplotype sampling, allele hashing, and coordinate anonymization to protect the underlying sensitive panel's genetic variant coordinates, genetic maps, and chromosome-wide haplotypes. While the resulting proxy panels are almost distinct from the sensitive panels, they are valid panels that can be used as input to imputation methods such as Beagle. We demonstrate that proxy-based imputation protects against well-known attacks with a minor decrease in imputation accuracy for variants in a wide range of allele frequencies.
基因组数据共享的主要挑战之一是在合作研究中保护参与者的隐私,以及当基因组数据被外包来执行分析任务时,例如基因型插入服务和联合合作基因组分析。尽管已经开发了许多加密方法,但就计算需求而言,这些方法可能对于人口规模的任务还不实用,依赖于安全性方面的高级专业知识,并且需要从头实现每个算法。在本研究中,我们将重点放在外包基因型输入上,这是一项利用人口水平参考面板的基本任务,并制定了依赖于“代理面板”的协议,以保护基因型面板,而输入任务是在服务器上外包的。代理面板是通过单倍型采样、等位基因哈希和坐标匿名化等一系列保护机制生成的,以保护底层敏感面板的遗传变异坐标、遗传图谱和全染色体单倍型。虽然生成的代理面板与敏感面板几乎不同,但它们都是有效的面板,可以用作输入到像Beagle这样的imputation方法。我们证明了基于代理的imputation可以防止众所周知的攻击,并且在广泛的等位基因频率范围内的变体的imputation准确性略有下降。
{"title":"Proxy panels enable privacy-aware outsourcing of genotype imputation","authors":"Degui Zhi, Xiaoqian Jiang, Arif O. Harmanci","doi":"10.1101/gr.278934.124","DOIUrl":"https://doi.org/10.1101/gr.278934.124","url":null,"abstract":"One of the major challenges in genomic data sharing is protecting participants' privacy in collaborative studies and when genomic data is outsourced to perform analysis tasks, e.g., genotype imputation services and federated collaborations genomic analysis. Although numerous cryptographic methods have been developed, these methods may not yet be practical for population-scale tasks in terms of computational requirements, rely on high-level expertise in security, and require each algorithm to be implemented from scratch. In this study, we focus on outsourcing of genotype imputation, a fundamental task that utilizes population-level reference panels, and develop protocols that rely on using \"proxy-panels\" to protect genotype panels while imputation task is being outsourced at servers. The proxy panels are generated through a series of protection mechanisms such as haplotype sampling, allele hashing, and coordinate anonymization to protect the underlying sensitive panel's genetic variant coordinates, genetic maps, and chromosome-wide haplotypes. While the resulting proxy panels are almost distinct from the sensitive panels, they are valid panels that can be used as input to imputation methods such as Beagle. We demonstrate that proxy-based imputation protects against well-known attacks with a minor decrease in imputation accuracy for variants in a wide range of allele frequencies.","PeriodicalId":12678,"journal":{"name":"Genome research","volume":"7 1","pages":""},"PeriodicalIF":7.0,"publicationDate":"2025-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142961755","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Post-transcriptional cross- and auto-regulation buffer expression of the human RNA helicases DDX3X and DDX3Y 人RNA解旋酶DDX3X和DDX3Y的转录后交叉和自动调节缓冲表达
IF 7 2区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2025-01-10 DOI: 10.1101/gr.279707.124
Shruthi Rengarajan, Jason Derks, Daniel W. Bellott, Nikolai Slavov, David C. Page
The Y-linked gene DDX3Y and its X-linked homolog DDX3X survived the evolution of the human sex chromosomes from ordinary autosomes. DDX3X encodes a multifunctional RNA helicase, with mutations causing developmental disorders and cancers. We find that, among X-linked genes with surviving Y homologs, DDX3X is extraordinarily dosage sensitive. Studying cells of individuals with sex chromosome aneuploidy, we observe that when the number of Y Chromosomes increases, DDX3X transcript levels fall; conversely, when the number of X Chromosomes increases, DDX3Y transcript levels fall. In 46,XY cells, CRISPRi knockdown of either DDX3X or DDX3Y causes transcript levels of the homologous gene to rise. In 46,XX cells, chemical inhibition of DDX3X protein activity elicits an increase in DDX3X transcript levels. Thus, perturbation of either DDX3X or DDX3Y expression is buffered: by negative cross-regulation of DDX3X and DDX3Y in 46,XY cells and by negative auto-regulation of DDX3X in 46,XX cells. DDX3XDDX3Y cross-regulation is mediated through mRNA destabilization—as shown by metabolic labeling of newly transcribed RNA—and buffers total levels of DDX3X and DDX3Y protein in human cells. We infer that post-transcriptional auto-regulation of the ancestral (autosomal) DDX3X gene transmuted into auto- and cross-regulation of DDX3X and DDX3Y as these sex-linked genes evolved from ordinary alleles of their autosomal precursor.
y连锁基因DDX3Y及其x连锁同源基因DDX3X在人类性染色体从普通常染色体进化而来的过程中幸存下来。DDX3X编码一种多功能RNA解旋酶,其突变可导致发育障碍和癌症。我们发现,在具有存活的Y同源物的x连锁基因中,DDX3X对剂量非常敏感。研究性染色体非整倍体个体的细胞,我们观察到当Y染色体数量增加时,DDX3X转录物水平下降;相反,当X染色体数量增加时,DDX3Y转录物水平下降。在46xy细胞中,CRISPRi敲低DDX3X或DDX3Y都会导致同源基因的转录水平升高。在46xx细胞中,DDX3X蛋白活性的化学抑制引起DDX3X转录物水平的增加。因此,DDX3X或DDX3Y表达的扰动被缓冲:在46,XY细胞中,DDX3X和DDX3Y负向交叉调节,在46,XX细胞中,DDX3X负向自我调节。DDX3X - DDX3Y交叉调节是通过mRNA不稳定介导的(如新转录rna的代谢标记所示),并缓冲人细胞中DDX3X和DDX3Y蛋白的总水平。我们推断,祖先(常染色体)DDX3X基因的转录后自调控转变为DDX3X和DDX3Y的自调控和交叉调控,因为这些性别连锁基因是从常染色体前体的普通等位基因进化而来的。
{"title":"Post-transcriptional cross- and auto-regulation buffer expression of the human RNA helicases DDX3X and DDX3Y","authors":"Shruthi Rengarajan, Jason Derks, Daniel W. Bellott, Nikolai Slavov, David C. Page","doi":"10.1101/gr.279707.124","DOIUrl":"https://doi.org/10.1101/gr.279707.124","url":null,"abstract":"The Y-linked gene <em>DDX3Y</em> and its X-linked homolog <em>DDX3X</em> survived the evolution of the human sex chromosomes from ordinary autosomes. <em>DDX3X</em> encodes a multifunctional RNA helicase, with mutations causing developmental disorders and cancers. We find that, among X-linked genes with surviving Y homologs, <em>DDX3X</em> is extraordinarily dosage sensitive. Studying cells of individuals with sex chromosome aneuploidy, we observe that when the number of Y Chromosomes increases, <em>DDX3X</em> transcript levels fall; conversely, when the number of X Chromosomes increases, <em>DDX3Y</em> transcript levels fall. In 46,XY cells, CRISPRi knockdown of either <em>DDX3X</em> or <em>DDX3Y</em> causes transcript levels of the homologous gene to rise. In 46,XX cells, chemical inhibition of DDX3X protein activity elicits an increase in <em>DDX3X</em> transcript levels. Thus, perturbation of either <em>DDX3X</em> or <em>DDX3Y</em> expression is buffered: by negative cross-regulation of <em>DDX3X</em> and <em>DDX3Y</em> in 46,XY cells and by negative auto-regulation of <em>DDX3X</em> in 46,XX cells. <em>DDX3X</em>–<em>DDX3Y</em> cross-regulation is mediated through mRNA destabilization—as shown by metabolic labeling of newly transcribed RNA—and buffers total levels of DDX3X and DDX3Y protein in human cells. We infer that post-transcriptional auto-regulation of the ancestral (autosomal) <em>DDX3X</em> gene transmuted into auto- and cross-regulation of <em>DDX3X</em> and <em>DDX3Y</em> as these sex-linked genes evolved from ordinary alleles of their autosomal precursor.","PeriodicalId":12678,"journal":{"name":"Genome research","volume":"26 1","pages":""},"PeriodicalIF":7.0,"publicationDate":"2025-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142961769","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Timescale and genetic linkage explain the variable impact of defense systems on horizontal gene transfer 时间尺度和遗传连锁解释了防御系统对水平基因转移的可变影响
IF 7 2区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2025-01-10 DOI: 10.1101/gr.279300.124
Yang Liu, Joao Botelho, Jaime Iranzo
Prokaryotes have evolved a wide repertoire of defense systems to prevent invasion by mobile genetic elements (MGE). However, because MGE are vehicles for the exchange of beneficial accessory genes, defense systems could consequently impede rapid adaptation in microbial populations. Here, we study how defense systems impact horizontal gene transfer (HGT) in the short and long terms. By combining comparative genomics and phylogeny-aware statistical methods, we quantified the association between the presence of 7 widespread defense systems and the abundance of MGE in the genomes of 196 bacterial and 1 archaeal species. We also calculated the differences in the rates of gene gain and loss between lineages that possess and lack each defense system. Our results show that the impact of defense systems on HGT is highly taxon- and system-dependent, and in most cases not statistically significant. Timescale analysis reveals that defense systems must persist in a lineage for a relatively long time to exert an appreciable negative impact on HGT. In contrast, for shorter evolutionary timescales, frequent co-acquisition of MGE and defense systems results in a net positive association of the latter with HGT. Given the high turnover rates experienced by defense systems, we propose that the inhibitory effect of most defense systems on HGT is masked by their strong linkage with MGE. These findings help explain the contradictory conclusions of previous research by pointing at mobility and within-host retention times as key factors that determine the impact of defense systems on genome plasticity.
原核生物已经进化出广泛的防御系统来防止移动遗传元件(MGE)的入侵。然而,由于MGE是交换有益附属基因的载体,防御系统可能因此阻碍微生物种群的快速适应。在这里,我们研究防御系统如何影响水平基因转移(HGT)在短期和长期。通过比较基因组学和系统发育意识统计方法的结合,我们量化了196种细菌和1种古细菌基因组中7种广泛存在的防御系统与MGE丰度之间的关系。我们还计算了拥有和缺乏每种防御系统的世系之间基因获得和丢失率的差异。研究结果表明,防御系统对HGT的影响是高度依赖于分类群和系统的,在大多数情况下不具有统计学意义。时间尺度分析表明,防御系统必须在一个谱系中持续相对较长的时间才能对HGT产生明显的负面影响。相比之下,对于较短的进化时间尺度,MGE和防御系统的频繁共同获取导致后者与HGT的净正相关。鉴于防御系统的高流失率,我们认为大多数防御系统对HGT的抑制作用被它们与MGE的强联系所掩盖。这些发现有助于解释先前研究的矛盾结论,指出流动性和宿主内保留时间是决定防御系统对基因组可塑性影响的关键因素。
{"title":"Timescale and genetic linkage explain the variable impact of defense systems on horizontal gene transfer","authors":"Yang Liu, Joao Botelho, Jaime Iranzo","doi":"10.1101/gr.279300.124","DOIUrl":"https://doi.org/10.1101/gr.279300.124","url":null,"abstract":"Prokaryotes have evolved a wide repertoire of defense systems to prevent invasion by mobile genetic elements (MGE). However, because MGE are vehicles for the exchange of beneficial accessory genes, defense systems could consequently impede rapid adaptation in microbial populations. Here, we study how defense systems impact horizontal gene transfer (HGT) in the short and long terms. By combining comparative genomics and phylogeny-aware statistical methods, we quantified the association between the presence of 7 widespread defense systems and the abundance of MGE in the genomes of 196 bacterial and 1 archaeal species. We also calculated the differences in the rates of gene gain and loss between lineages that possess and lack each defense system. Our results show that the impact of defense systems on HGT is highly taxon- and system-dependent, and in most cases not statistically significant. Timescale analysis reveals that defense systems must persist in a lineage for a relatively long time to exert an appreciable negative impact on HGT. In contrast, for shorter evolutionary timescales, frequent co-acquisition of MGE and defense systems results in a net positive association of the latter with HGT. Given the high turnover rates experienced by defense systems, we propose that the inhibitory effect of most defense systems on HGT is masked by their strong linkage with MGE. These findings help explain the contradictory conclusions of previous research by pointing at mobility and within-host retention times as key factors that determine the impact of defense systems on genome plasticity.","PeriodicalId":12678,"journal":{"name":"Genome research","volume":"26 1","pages":""},"PeriodicalIF":7.0,"publicationDate":"2025-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142961756","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Genome research
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1