bioRxiv - Genomics最新文献_第7页

Efficiently Constructing Complete Genomes with CycloneSEQ to Fill Gaps in Bacterial Draft Assemblies 利用 CycloneSEQ 高效构建完整基因组，填补细菌草案组装的空白

bioRxiv - Genomics

Pub Date : 2024-09-07 DOI: 10.1101/2024.09.05.611410

Hewei Liang, Mengmeng Wang, Tongyuan Hu, Haoyu Wang, Wenxin He, Yanmei Ju, Ruijin Guo, Junyi Chen, Fei Guo, Tao Zeng, Yuliang Dong, Bo Wang, Chuanyu Liu, Xin Jin, Wenwei Zhang, Yuanqiang Zou, Xun Xu, Liang Xiao

Current microbial sequencing relies on short-read platforms like Illumina and DNBSEQ, favored for their low cost and high accuracy. However, these methods often produce fragmented draft genomes, hindering comprehensive bacterial function analysis. CycloneSEQ, a novel long-read sequencing platform developed by BGI-Research, its sequencing performance and assembly improvements has been evaluated. Using CycloneSEQ long-read sequencing, the type strain produced long reads with an average length of 11.6 kbp and an average quality score of 14.4. After hybrid assembly with short-read data, the assembled genome exhibited an error rate of only 0.04 mismatches and 0.08 indels per 100 kbp compared to the reference genome. This method was validated across 9 diverse species, successfully assembling complete circular genomes. Hybrid assembly significantly enhances genome completeness by using long reads to fill gaps and accurately assemble multi-copy rRNA genes, which unable be achieved by short reads solely. Through data subsampling, we found that over 500 Mbp of short-read data combined with 100 Mbp of long-read data can result in a high-quality circular assembly. Additionally, using CycloneSEQ long reads effectively improves the assembly of circular complete genomes from mixed microbial communities. CycloneSEQ's read length is sufficient for circular bacterial genomes, but its base quality needs improvement. Integrating DNBSEQ short reads improved accuracy, resulting in complete and accurate assemblies. This efficient approach can be widely applied in microbial sequencing.

目前的微生物测序依赖于 Illumina 和 DNBSEQ 等短线程平台，这些平台因成本低、准确性高而备受青睐。然而，这些方法通常会产生片段化的基因组草案，阻碍了细菌功能的全面分析。CycloneSEQ 是 BGI-Research 开发的一种新型长读程测序平台，对其测序性能和组装改进进行了评估。利用 CycloneSEQ 长读数测序技术，该类型菌株产生的长读数平均长度为 11.6 kbp，平均质量分数为 14.4。在与短读数数据进行混合组装后，与参考基因组相比，组装后的基因组每 100 kbp 的错误率仅为 0.04 个错配和 0.08 个吲哚。这种方法在 9 个不同物种中得到了验证，成功地组装出了完整的环状基因组。通过使用长读数填补空白并准确组装多拷贝 rRNA 基因，混合组装大大提高了基因组的完整性，而这是短读数无法实现的。通过数据子采样，我们发现超过 500 Mbp 的短读数数据与 100 Mbp 的长读数数据相结合，可以得到高质量的循环组装结果。此外，使用 CycloneSEQ 长读数还能有效改善混合微生物群落循环全基因组的组装。CycloneSEQ 的读数长度对于循环细菌基因组来说是足够的，但其碱基质量有待提高。整合 DNBSEQ 短读数提高了准确性，从而实现了完整而准确的组装。这种高效的方法可广泛应用于微生物测序。

{"title":"Efficiently Constructing Complete Genomes with CycloneSEQ to Fill Gaps in Bacterial Draft Assemblies","authors":"Hewei Liang, Mengmeng Wang, Tongyuan Hu, Haoyu Wang, Wenxin He, Yanmei Ju, Ruijin Guo, Junyi Chen, Fei Guo, Tao Zeng, Yuliang Dong, Bo Wang, Chuanyu Liu, Xin Jin, Wenwei Zhang, Yuanqiang Zou, Xun Xu, Liang Xiao","doi":"10.1101/2024.09.05.611410","DOIUrl":"https://doi.org/10.1101/2024.09.05.611410","url":null,"abstract":"Current microbial sequencing relies on short-read platforms like Illumina and DNBSEQ, favored for their low cost and high accuracy. However, these methods often produce fragmented draft genomes, hindering comprehensive bacterial function analysis. CycloneSEQ, a novel long-read sequencing platform developed by BGI-Research, its sequencing performance and assembly improvements has been evaluated. Using CycloneSEQ long-read sequencing, the type strain produced long reads with an average length of 11.6 kbp and an average quality score of 14.4. After hybrid assembly with short-read data, the assembled genome exhibited an error rate of only 0.04 mismatches and 0.08 indels per 100 kbp compared to the reference genome. This method was validated across 9 diverse species, successfully assembling complete circular genomes. Hybrid assembly significantly enhances genome completeness by using long reads to fill gaps and accurately assemble multi-copy rRNA genes, which unable be achieved by short reads solely. Through data subsampling, we found that over 500 Mbp of short-read data combined with 100 Mbp of long-read data can result in a high-quality circular assembly. Additionally, using CycloneSEQ long reads effectively improves the assembly of circular complete genomes from mixed microbial communities. CycloneSEQ's read length is sufficient for circular bacterial genomes, but its base quality needs improvement. Integrating DNBSEQ short reads improved accuracy, resulting in complete and accurate assemblies. This efficient approach can be widely applied in microbial sequencing.","PeriodicalId":501161,"journal":{"name":"bioRxiv - Genomics","volume":"3 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142213867","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Dynamics and determinants of the gut mobilome in early life 生命早期肠道移动组的动态和决定因素

bioRxiv - Genomics

Pub Date : 2024-09-07 DOI: 10.1101/2024.09.06.611692

Asier Fernandez-Pato, Trishla Sinha, Sanzhima Garmaeva, Anastasia Gulyaeva, Nataliia Kuzub, Simon Roux, Jingyuan Fu, Alexander Kurilshikov, Alexandra Zhernakova

As mobile genetic elements (MGE) are critical yet understudied determinants of gut microbiome composition, we characterized the gut virome and plasmidome in 195 samples from 28 mother-infant dyads delivered by caesarean section. Infant mobilome increased in richness over the first 6 postnatal weeks, demonstrating high individual-specificity and temporal stability, establishing a personal persistent mobilome. Formula-fed infants exhibited greater mobilome richness than breastfed infants, with plasmid composition influenced by antibiotic exposure and birth weight. Plasmids constituted a significant reservoir of antibiotic resistance genes (ARG), with around 5% of infant gut plasmid taxonomic units carrying ARG. Notably, ARG profiles did not differ with antibiotic exposure at birth. We found that mother-infant sharing of viral and plasmid strains primarily occurred after 6 months of age. Overall, our integrative analysis offers novel insights into the dynamics, modulation, origin, and clinical implications of MGE in the developing gut microbiome.

移动遗传因子（MGE）是决定肠道微生物组组成的关键因素，但对其的研究却不够深入，因此我们对来自28对剖腹产母婴的195份样本中的肠道病毒组和质粒组进行了鉴定。婴儿移动组的丰富度在出生后的前 6 周有所增加，显示出高度的个体特异性和时间稳定性，从而建立了个人持久的移动组。配方奶喂养的婴儿比母乳喂养的婴儿表现出更丰富的移动组，质粒的组成受抗生素暴露和出生体重的影响。质粒是抗生素耐药基因（ARG）的重要储存库，约有5%的婴儿肠道质粒分类单元携带ARG。值得注意的是，ARG特征与出生时的抗生素暴露量并无差异。我们发现，病毒株和质粒株的母婴共享主要发生在婴儿 6 个月后。总之，我们的综合分析为了解发育中肠道微生物组中 MGE 的动态、调节、起源和临床意义提供了新的视角。

引用次数: 0

Enformation Theory: A Framework for Evaluating Genomic AI 信息论：评估基因组人工智能的框架

bioRxiv - Genomics

Pub Date : 2024-09-07 DOI: 10.1101/2024.09.03.611127

Eyes S Robson, Nilah M. Ioannidis

The nascent field of genomic AI is rapidly expanding with new models, benchmarks, and findings. As the field diversifies, there is an increased need for a common set of measurement tools and perspectives to standardize model evaluation. Here, we present a statistically grounded framework for performance evaluation, visualization, and interpretation using the prominent genomic AI model Enformer as a case study. The Enformer model has been used for a range of applications from mechanism discovery to variant effect prediction, but what makes it better or worse than precedent models at particular tasks? Our goal is not merely to answer these questions for Enformer, but to propose how we should think about new models in general. We start by reporting Enformer's few-shot performance on the GUANinE benchmark, which emphasizes complex genome interpretation tasks, and discuss its gains and deficits compared to precedent models. We follow this analysis with visualizations of Enformer's embeddings in low-dimensional space, where, among other insights, we diagnose features of the embeddings that may limit model generalization to synthetic biology tasks. Finally, we present a novel, theory-backed probe of Enformer embeddings, where variance decomposition allows for holistic interpretation and partial 'backtracking' to explanatory causal features. Through this case study, we illustrate a new framework, Enformation Theory, for analyzing and interpreting genomic AI models.

新生的基因组人工智能领域正随着新模型、新基准和新发现的出现而迅速扩展。随着该领域的多样化，越来越需要一套通用的测量工具和视角来规范模型评估。在这里，我们以著名的基因组人工智能模型 Enformer 为案例，介绍了一个基于统计的性能评估、可视化和解释框架。Enformer 模型已被用于从机制发现到变异效应预测等一系列应用，但在特定任务中，它比先例模型好在哪里？我们的目标不仅仅是回答 Enformer 的这些问题，而是提出我们应该如何看待一般的新模型。我们首先报告了 Enformer 在 GUANinE 基准（强调复杂的基因组解读任务）上的少量表现，并讨论了它与先例模型相比的优势和不足。在分析之后，我们对 Enformer 在低维空间中的嵌入进行了可视化展示，其中除其他见解外，我们还诊断了可能限制模型推广到合成生物学任务的嵌入特征。最后，我们提出了一种新颖的、以理论为基础的 Enformer 嵌入探究方法，通过方差分解可以对解释性因果特征进行整体解释和部分 "回溯"。通过这个案例研究，我们展示了一个新的框架--"Enformation 理论"，用于分析和解释基因组人工智能模型。

{"title":"Enformation Theory: A Framework for Evaluating Genomic AI","authors":"Eyes S Robson, Nilah M. Ioannidis","doi":"10.1101/2024.09.03.611127","DOIUrl":"https://doi.org/10.1101/2024.09.03.611127","url":null,"abstract":"The nascent field of genomic AI is rapidly expanding with new models, benchmarks, and findings. As the field diversifies, there is an increased need for a common set of measurement tools and perspectives to standardize model evaluation. Here, we present a statistically grounded framework for performance evaluation, visualization, and interpretation using the prominent genomic AI model Enformer as a case study. The Enformer model has been used for a range of applications from mechanism discovery to variant effect prediction, but what makes it better or worse than precedent models at particular tasks? Our goal is not merely to answer these questions for Enformer, but to propose how we should think about new models in general. We start by reporting Enformer's few-shot performance on the GUANinE benchmark, which emphasizes complex genome interpretation tasks, and discuss its gains and deficits compared to precedent models. We follow this analysis with visualizations of Enformer's embeddings in low-dimensional space, where, among other insights, we diagnose features of the embeddings that may limit model generalization to synthetic biology tasks. Finally, we present a novel, theory-backed probe of Enformer embeddings, where variance decomposition allows for holistic interpretation and partial 'backtracking' to explanatory causal features. Through this case study, we illustrate a new framework, Enformation Theory, for analyzing and interpreting genomic AI models.","PeriodicalId":501161,"journal":{"name":"bioRxiv - Genomics","volume":"3 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142213868","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

CRISPRi with barcoded expression reporters dissects regulatory networks in human cells 带有条形码表达报告器的 CRISPRi 可剖析人类细胞中的调控网络

bioRxiv - Genomics

Pub Date : 2024-09-06 DOI: 10.1101/2024.09.06.611573

Jinyoung Kim, Ryan Y. Muller, Eliana R Bondra, Nicholas Ingolia

Genome-wide CRISPR screens have emerged as powerful tools for uncovering the genetic underpinnings of diverse biological processes. Incisive screens often depend on directly measuring molecular phenotypes, such as regulated gene expression changes, provoked by CRISPR-mediated genetic perturbations. Here, we provide quantitative measurements of transcriptional responses in human cells across genome-scale perturbation libraries by coupling CRISPR interference (CRISPRi) with barcoded expression reporter sequencing (CiBER-seq). To enable CiBER-seq in mammalian cells, we optimize the integration of highly complex, barcoded sgRNA libraries into a defined genomic context. CiBER-seq profiling of a nuclear factor kappa B (NF-kappaB) reporter delineates the canonical signaling cascade linking the transmembrane TNF- alpha receptor to inflammatory gene activation and highlights cell-type-specific factors in this response. Importantly, CiBER-seq relies solely on bulk RNA sequencing to capture the regulatory circuit driving this rapid transcriptional response. Our work demonstrates the accuracy of CiBER-seq and its potential for dissecting genetic networks in mammalian cells with superior time resolution.

全基因组 CRISPR 筛选已成为揭示各种生物过程遗传基础的有力工具。决定性的筛选通常依赖于直接测量分子表型，如 CRISPR 介导的遗传扰动引起的调控基因表达变化。在这里，我们通过将 CRISPR 干扰（CRISPRi）与条形码表达报告测序（CiBER-seq）相结合，在基因组规模的扰动文库中对人类细胞的转录反应进行了定量测量。为了在哺乳动物细胞中实现 CiBER-seq，我们优化了将高度复杂的条形码 sgRNA 文库整合到定义的基因组上下文中的过程。核因子卡巴B（NF-kappaB）报告基因的CiBER-seq图谱描绘了连接跨膜TNF-α受体与炎症基因激活的典型信号级联，并突出了这一反应中细胞类型特异性的因素。重要的是，CiBER-seq 完全依靠大容量 RNA 测序来捕捉驱动这种快速转录反应的调控回路。我们的研究工作证明了 CiBER-seq 的准确性及其以卓越的时间分辨率剖析哺乳动物细胞遗传网络的潜力。

{"title":"CRISPRi with barcoded expression reporters dissects regulatory networks in human cells","authors":"Jinyoung Kim, Ryan Y. Muller, Eliana R Bondra, Nicholas Ingolia","doi":"10.1101/2024.09.06.611573","DOIUrl":"https://doi.org/10.1101/2024.09.06.611573","url":null,"abstract":"Genome-wide CRISPR screens have emerged as powerful tools for uncovering the genetic underpinnings of diverse biological processes. Incisive screens often depend on directly measuring molecular phenotypes, such as regulated gene expression changes, provoked by CRISPR-mediated genetic perturbations. Here, we provide quantitative measurements of transcriptional responses in human cells across genome-scale perturbation libraries by coupling CRISPR interference (CRISPRi) with barcoded expression reporter sequencing (CiBER-seq). To enable CiBER-seq in mammalian cells, we optimize the integration of highly complex, barcoded sgRNA libraries into a defined genomic context. CiBER-seq profiling of a nuclear factor kappa B (NF-kappaB) reporter delineates the canonical signaling cascade linking the transmembrane TNF- alpha receptor to inflammatory gene activation and highlights cell-type-specific factors in this response. Importantly, CiBER-seq relies solely on bulk RNA sequencing to capture the regulatory circuit driving this rapid transcriptional response. Our work demonstrates the accuracy of CiBER-seq and its potential for dissecting genetic networks in mammalian cells with superior time resolution.","PeriodicalId":501161,"journal":{"name":"bioRxiv - Genomics","volume":"24 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142213644","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Genotype inference from aggregated chromatin accessibility data reveals genetic regulatory mechanisms 从染色质可及性聚合数据推断基因型，揭示遗传调控机制

bioRxiv - Genomics

Pub Date : 2024-09-05 DOI: 10.1101/2024.09.04.610850

Brandon M. Wenz, Yuan He, Nae-Chyun Chen, Joseph K. Pickrell, Jeremiah H Li, Max F. Dudek, Taibo Li, Rebecca Keener, Benjamin F. Voight, Christopher D. Brown, Alexis Battle

BackgroundUnderstanding the genetic causes for variability in chromatin accessibility can shed light on the molecular mechanisms through which genetic variants may affect complex traits. Thousands of ATAC-seq samples have been collected that hold information about chromatin accessibility across diverse cell types and contexts, but most of these are not paired with genetic information and come from diverse distinct projects and laboratories. ResultsWe report here joint genotyping, chromatin accessibility peak calling, and discovery of quantitative trait loci which influence chromatin accessibility (caQTLs), demonstrating the capability of performing caQTL analysis on a large scale in a diverse sample set without pre-existing genotype information. Using 10,293 profiling samples representing 1,454 unique donor individuals across 653 studies from public databases, we catalog 23,381 caQTLs in total. After joint discovery analysis, we cluster samples based on accessible chromatin profiles to identify context-specific caQTLs. We find that caQTLs are strongly enriched for annotations of gene regulatory elements across diverse cell types and tissues and are often strongly linked with genetic variation associated with changes in expression (eQTLs), indicating that caQTLs can mediate genetic effects on gene expression. We demonstrate sharing of causal variants for chromatin accessibility and diverse complex human traits, enabling a more complete picture of the genetic mechanisms underlying complex human phenotypes. ConclusionsOur work provides a proof of principle for caQTL calling from previously ungenotyped samples, and represents one of the largest, most diverse caQTL resources currently available, informing mechanisms of genetic regulation of gene expression and contribution to disease.

背景了解染色质可及性变异的遗传原因可以揭示遗传变异可能影响复杂性状的分子机制。目前已收集了成千上万的 ATAC-seq 样本，这些样本包含了不同细胞类型和环境中染色质可及性的信息，但其中大部分样本没有与遗传信息配对，而且来自不同的项目和实验室。结果我们在此报告了联合基因分型、染色质可及性峰值调用以及影响染色质可及性的定量性状位点（caQTLs）的发现，展示了在没有预先存在的基因型信息的情况下在不同样本集中大规模执行 caQTL 分析的能力。我们利用来自公共数据库的 653 项研究的 10,293 个分析样本，代表了 1,454 个独特的捐赠者个体，共对 23,381 个 caQTL 进行了编目。在联合发现分析之后，我们根据可访问的染色质图谱对样本进行聚类，以确定特定背景的 caQTL。我们发现，caQTLs 在不同细胞类型和组织的基因调控元件注释中具有很强的富集性，而且往往与表达变化相关的遗传变异（eQTLs）密切相关，这表明 caQTLs 可以介导基因表达的遗传效应。我们证明了染色质可及性和人类各种复杂性状的因果变异的共享性，从而能够更全面地了解人类复杂表型的遗传机制。结论我们的工作证明了从以前未分型的样本中进行 caQTL 调用的原理，是目前可用的最大、最多样化的 caQTL 资源之一，为基因表达的遗传调控机制和对疾病的影响提供了信息。

{"title":"Genotype inference from aggregated chromatin accessibility data reveals genetic regulatory mechanisms","authors":"Brandon M. Wenz, Yuan He, Nae-Chyun Chen, Joseph K. Pickrell, Jeremiah H Li, Max F. Dudek, Taibo Li, Rebecca Keener, Benjamin F. Voight, Christopher D. Brown, Alexis Battle","doi":"10.1101/2024.09.04.610850","DOIUrl":"https://doi.org/10.1101/2024.09.04.610850","url":null,"abstract":"Background\u0000Understanding the genetic causes for variability in chromatin accessibility can shed light on the molecular mechanisms through which genetic variants may affect complex traits. Thousands of ATAC-seq samples have been collected that hold information about chromatin accessibility across diverse cell types and contexts, but most of these are not paired with genetic information and come from diverse distinct projects and laboratories. Results\u0000We report here joint genotyping, chromatin accessibility peak calling, and discovery of quantitative trait loci which influence chromatin accessibility (caQTLs), demonstrating the capability of performing caQTL analysis on a large scale in a diverse sample set without pre-existing genotype information. Using 10,293 profiling samples representing 1,454 unique donor individuals across 653 studies from public databases, we catalog 23,381 caQTLs in total. After joint discovery analysis, we cluster samples based on accessible chromatin profiles to identify context-specific caQTLs. We find that caQTLs are strongly enriched for annotations of gene regulatory elements across diverse cell types and tissues and are often strongly linked with genetic variation associated with changes in expression (eQTLs), indicating that caQTLs can mediate genetic effects on gene expression. We demonstrate sharing of causal variants for chromatin accessibility and diverse complex human traits, enabling a more complete picture of the genetic mechanisms underlying complex human phenotypes. Conclusions\u0000Our work provides a proof of principle for caQTL calling from previously ungenotyped samples, and represents one of the largest, most diverse caQTL resources currently available, informing mechanisms of genetic regulation of gene expression and contribution to disease.","PeriodicalId":501161,"journal":{"name":"bioRxiv - Genomics","volume":"2 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142213649","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Frequent genetic exchanges revealed by a pan-mitogenome graph of a fungal plant pathogen 植物真菌病原体的泛基因组图谱揭示了频繁的基因交换

bioRxiv - Genomics

Pub Date : 2024-09-05 DOI: 10.1101/2024.06.19.599757

Anouk van Westerhoven, Jelmer Dijkstra, Luis Aznar Palop, Kyran Wissink, Jasper Bell, Gert Kema, Michael F. Seidl

Mitochondria are present in almost all eukaryotic lineages. The mitochondrial genomes (mitogenomes) evolve separately from nuclear genomes, and they can therefore provide relevant insights into the evolution of their host species. Fusarium oxysporum is a major fungal plant pathogen that is assumed to reproduce clonally. However, horizontal chromosome transfer between strains can occur through heterokaryon formation, and recently signs of sexual recombination have been observed. Similarly, signs of recombination in F. oxysporum mitogenomes challenged the prevailing assumption of clonal reproduction in this species. Here, we construct, to our knowledge, the first fungal pan-mitogenome graph of nearly 500 F. oxysporum mitogenome assemblies to uncover the variation and evolution. In general, the gene order of fungal mitogenomes is not well conserved, yet the mitogenome of F. oxysporum and related species are highly co-linear. We observed two strikingly contrasting regions in the Fusarium oxysporum pan-mitogenome, comprising a highly conserved core mitogenome and a long variable region (6-16 kb in size), of which we identified three distinct types. The pan-mitogenome graph reveals that only five intron insertions occurred in the core mitogenome and that the long variable regions drive the difference between mitogenomes. Moreover, we observed that their evolution is neither concurrent with the core mitogenome nor with the nuclear genome. Our large-scale analysis of long variable regions uncovers frequent recombination between mitogenomes, even between strains that belong to different taxonomic clades. This challenges the common assumption of incompatibility between genetically diverse F. oxysporum strains and provides new insights into the evolution of this fungal species.

线粒体存在于几乎所有真核生物系中。线粒体基因组（有丝分裂基因组）的进化与核基因组的进化是分开的，因此它们可以为宿主物种的进化提供相关信息。Fusarium oxysporum 是一种主要的植物真菌病原体，被认为会进行克隆繁殖。然而，菌株之间的水平染色体转移可以通过异核子的形成而发生，最近还观察到了有性重组的迹象。同样，在 F. oxysporum 有丝分裂基因组中出现的重组迹象也对该物种克隆繁殖的普遍假设提出了挑战。据我们所知，我们在这里构建了第一个真菌泛有丝分裂基因组图谱，包含了近 500 个有丝分裂真菌有丝分裂基因组，以揭示其变异和进化。一般来说，真菌有丝分裂基因组的基因顺序并不十分保守，然而氧孢子菌和相关物种的有丝分裂基因组却高度共线。我们观察到氧孢镰刀菌泛有丝分裂基因组中有两个截然不同的区域，包括一个高度保守的核心有丝分裂基因组和一个长的可变区域（大小为 6-16 kb），我们发现其中有三种不同的类型。泛有丝分裂基因组图显示，核心有丝分裂基因组中只有五个内含子插入，而长可变区则导致了有丝分裂基因组之间的差异。此外，我们观察到它们的进化既不与核心有丝分裂基因组同步，也不与核基因组同步。我们对长可变区的大规模分析发现了有丝分裂基因组之间频繁的重组，甚至在属于不同分类支系的菌株之间也是如此。这挑战了基因不同的 F. oxysporum 菌株之间不相容的普遍假设，并为该真菌物种的进化提供了新的见解。

{"title":"Frequent genetic exchanges revealed by a pan-mitogenome graph of a fungal plant pathogen","authors":"Anouk van Westerhoven, Jelmer Dijkstra, Luis Aznar Palop, Kyran Wissink, Jasper Bell, Gert Kema, Michael F. Seidl","doi":"10.1101/2024.06.19.599757","DOIUrl":"https://doi.org/10.1101/2024.06.19.599757","url":null,"abstract":"Mitochondria are present in almost all eukaryotic lineages. The mitochondrial genomes (mitogenomes) evolve separately from nuclear genomes, and they can therefore provide relevant insights into the evolution of their host species. Fusarium oxysporum is a major fungal plant pathogen that is assumed to reproduce clonally. However, horizontal chromosome transfer between strains can occur through heterokaryon formation, and recently signs of sexual recombination have been observed. Similarly, signs of recombination in F. oxysporum mitogenomes challenged the prevailing assumption of clonal reproduction in this species. Here, we construct, to our knowledge, the first fungal pan-mitogenome graph of nearly 500 F. oxysporum mitogenome assemblies to uncover the variation and evolution. In general, the gene order of fungal mitogenomes is not well conserved, yet the mitogenome of F. oxysporum and related species are highly co-linear. We observed two strikingly contrasting regions in the Fusarium oxysporum pan-mitogenome, comprising a highly conserved core mitogenome and a long variable region (6-16 kb in size), of which we identified three distinct types. The pan-mitogenome graph reveals that only five intron insertions occurred in the core mitogenome and that the long variable regions drive the difference between mitogenomes. Moreover, we observed that their evolution is neither concurrent with the core mitogenome nor with the nuclear genome. Our large-scale analysis of long variable regions uncovers frequent recombination between mitogenomes, even between strains that belong to different taxonomic clades. This challenges the common assumption of incompatibility between genetically diverse F. oxysporum strains and provides new insights into the evolution of this fungal species.","PeriodicalId":501161,"journal":{"name":"bioRxiv - Genomics","volume":"31 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142213648","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Genomic Epidemiology of SARS-CoV-2 in Norfolk, UK, March 2020 – December 2022 2020 年 3 月至 2022 年 12 月英国诺福克郡 SARS-CoV-2 基因组流行病学研究

bioRxiv - Genomics

Pub Date : 2024-09-05 DOI: 10.1101/2024.09.05.611382

Eleanor H Hayles, Andrew J Page, Javier Guitian, Robert A Kingsley, The COVID-19 Genomics UK Consortium, Gemma C Langridge

Background: In the UK, the COVID-19 Genomics UK Consortium (COG-UK) established a real time national genomic surveillance system during the COVID-19 pandemic, producing centralised data for monitoring SARS-CoV-2. As a COG-UK partner, Quadram Institute Bioscience (QIB) in Norfolk sequenced over 87,000 SARS-CoV-2 genomes, contributing to the region becoming densely sequenced. Retrospective analysis of SARS-CoV-2 lineage dynamics in this region may contribute to preparedness for future pandemics. Methods: 29,406 SARS-CoV-2 whole genome sequences and corresponding metadata from Norfolk were extracted from the COG-UK dataset, sampled between March 2020 and December 2022, representing 9.9% of regional COVID-19 cases. Sequences were lineage typed using Pangolin, and subsequent lineage analysis carried out in R using RStudio and related packages, including graphical analysis using ggplot2. Results: 401 global lineages were identified, with 69.8% appearing more than once and 31.2% over ten times. Temporal clustering identified six lineage communities based on first lineage emergence. Alpha, Delta, and Omicron variants of concern (VOC) accounted for 8.6%, 34.9% and 48.5% of sequences respectively. These formed four regional epidemic waves alongside the remaining lineages which appeared in the early pandemic prior to VOC designation and were termed pre-VOC lineages. Regional comparison highlighted variability in VOC epidemic wave dates dependent on location. Conclusion: This study is the first to assess SARS-CoV-2 diversity in Norfolk across a large timescale within the COVID-19 pandemic. SARS-CoV-2 was both highly diverse and dynamic throughout the Norfolk region between March 2020 – December 2022, with a strong VOC presence within the latter two thirds of the study period. The study also displays the utility of incorporating genomic epidemiological methods into pandemic response.

背景：在英国，COVID-19 基因组学英国联合会（COG-UK）在 COVID-19 大流行期间建立了实时国家基因组监测系统，为监测 SARS-CoV-2 提供集中数据。作为 COG-UK 的合作伙伴，诺福克的 Quadram Institute Bioscience (QIB) 对超过 87,000 个 SARS-CoV-2 基因组进行了测序，从而使该地区成为测序密集的地区。对该地区 SARS-CoV-2 世系动态的回顾性分析可能有助于为未来的大流行做好准备。方法：从 COG-UK 数据集中提取了来自诺福克郡的 29,406 个 SARS-CoV-2 全基因组序列和相应的元数据，采样时间为 2020 年 3 月至 2022 年 12 月，占 COVID-19 地区病例的 9.9%。使用 Pangolin 对序列进行了系谱分型，随后使用 RStudio 和相关软件包在 R 中进行了系谱分析，包括使用 ggplot2 进行图形分析。结果确定了 401 个全球谱系，其中 69.8%的谱系出现过一次以上，31.2%的谱系出现过十次以上。根据首次出现的世系，时间聚类确定了六个世系群落。α、δ和Ω变种（VOC）分别占序列的 8.6%、34.9% 和 48.5%。这些变异株与其余变异株一起形成了四个区域性流行波，这些变异株出现在 VOC 确定之前的早期流行中，被称为前 VOC 变异株。区域比较突显了 VOC 流行波日期因地点而异。结论本研究首次评估了 COVID-19 大流行期间诺福克地区 SARS-CoV-2 的多样性。在 2020 年 3 月至 2022 年 12 月期间，SARS-CoV-2 在整个诺福克地区具有高度的多样性和动态性，在研究期间的后三分之二时间段内出现了大量的 VOC。这项研究还显示了将基因组流行病学方法纳入大流行病应对措施的实用性。

{"title":"Genomic Epidemiology of SARS-CoV-2 in Norfolk, UK, March 2020 – December 2022","authors":"Eleanor H Hayles, Andrew J Page, Javier Guitian, Robert A Kingsley, The COVID-19 Genomics UK Consortium, Gemma C Langridge","doi":"10.1101/2024.09.05.611382","DOIUrl":"https://doi.org/10.1101/2024.09.05.611382","url":null,"abstract":"Background: In the UK, the COVID-19 Genomics UK Consortium (COG-UK) established a real time national genomic surveillance system during the COVID-19 pandemic, producing centralised data for monitoring SARS-CoV-2. As a COG-UK partner, Quadram Institute Bioscience (QIB) in Norfolk sequenced over 87,000 SARS-CoV-2 genomes, contributing to the region becoming densely sequenced. Retrospective analysis of SARS-CoV-2 lineage dynamics in this region may contribute to preparedness for future pandemics. Methods: 29,406 SARS-CoV-2 whole genome sequences and corresponding metadata from Norfolk were extracted from the COG-UK dataset, sampled between March 2020 and December 2022, representing 9.9% of regional COVID-19 cases. Sequences were lineage typed using Pangolin, and subsequent lineage analysis carried out in R using RStudio and related packages, including graphical analysis using ggplot2. Results: 401 global lineages were identified, with 69.8% appearing more than once and 31.2% over ten times. Temporal clustering identified six lineage communities based on first lineage emergence. Alpha, Delta, and Omicron variants of concern (VOC) accounted for 8.6%, 34.9% and 48.5% of sequences respectively. These formed four regional epidemic waves alongside the remaining lineages which appeared in the early pandemic prior to VOC designation and were termed pre-VOC lineages. Regional comparison highlighted variability in VOC epidemic wave dates dependent on location. Conclusion: This study is the first to assess SARS-CoV-2 diversity in Norfolk across a large timescale within the COVID-19 pandemic. SARS-CoV-2 was both highly diverse and dynamic throughout the Norfolk region between March 2020 – December 2022, with a strong VOC presence within the latter two thirds of the study period. The study also displays the utility of incorporating genomic epidemiological methods into pandemic response.","PeriodicalId":501161,"journal":{"name":"bioRxiv - Genomics","volume":"12 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142213647","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A genome assembly and annotation for the Australian alpine skink Bassiana duperreyi using long-read technologies 利用长读数技术对澳大利亚高山鼬 Bassiana duperreyi 进行基因组组装和注释

bioRxiv - Genomics

Pub Date : 2024-09-05 DOI: 10.1101/2024.09.05.611471

Benjamin J Hanrahan, Kirat Alreja, Andre L. M. Reis, J King Chang, Duminda S. B. Dissanayake, Richard J Edwards, Terry Bertozzi, Jillian M Hammond, Denis O'Meally, Ira W Deveson, Arthur Georges, Paul D. Waters, Hardip Rameshbhai Patel

The eastern three-lined skink (Bassiana duperreyi) inhabits the Australian high country in the southwest of the continent including Tasmania. It is an oviparous species that is distinctive because it undergoes sex reversal (from XX genotypic females to phenotypic males) at low incubation temperatures. We present a chromosome-scale genome assembly of a Bassiana duperreyi XY male individual, constructed using a combination of PacBio HiFi and ONT long reads scaffolded using Illumina HiC data. The genome assembly length is 1.57 Gb with a scaffold N50 of 222 Mbp, N90 of 26 Mbp, 200 gaps and 43.10% GC content. Most (95%) of the assembly is scaffolded into 6 macrochromosomes, 8 microchromosomes and the X chromosome, corresponding to the karyotype. Fragmented Y chromosome scaffolds (n=11 ≥1 Mbp) were identified using Y-specific contigs generated by genome subtraction. We identified two novel alpha-satellite repeats of 187 bp and 199 bp in the putative centromeres that did not form higher order repeats. The genome assembly exceeds the standard recommended by the Earth Biogenome Project; 0.02% false expansions, 99.63% kmer completeness, 94.66% complete single copy BUSCO genes and an average 98.42% of transcriptome data mappable to the genome assembly. The mitochondrial genome (17,506 bp) and the model rDNA repeat unit (15,154 bp) were assembled. The B. duperreyi genome assembly has one of the highest completeness levels for a skink and will provide a resource for research focused on sex determination and thermolabile sex reversal, as an oviparous foundation species for studies of the evolution of viviparity, and for other comparative genomics studies of the Scincidae.

东部三线石龙子（Bassiana duperreyi）栖息在澳大利亚大陆西南部的高原地区，包括塔斯马尼亚。它是一种卵生物种，在低温孵化条件下会发生性别逆转（从 XX 基因型的雌性变为表型的雄性），因而与众不同。我们展示了一个 Bassiana duperreyi XY 雄性个体的染色体级基因组组装，该组装是利用 Illumina HiC 数据结合 PacBio HiFi 和 ONT 长读数构建的。基因组组装长度为 1.57 Gb，支架 N50 为 222 Mbp，N90 为 26 Mbp，间隙为 200，GC 含量为 43.10%。大部分（95%）的装配支架分为 6 条大染色体、8 条小染色体和 X 染色体，与核型相对应。利用基因组减法生成的 Y 染色体特异性等位基因，鉴定出了破碎的 Y 染色体支架（n=11 ≥1 Mbp）。我们在推定的中心粒中发现了两个新的α-卫星重复序列，分别为187 bp和199 bp，它们没有形成高阶重复序列。基因组组装超过了地球生物基因组计划推荐的标准：0.02%的错误扩展、99.63%的kmer完整性、94.66%的完整单拷贝BUSCO基因以及平均98.42%的转录组数据可映射到基因组组装。线粒体基因组（17,506 bp）和模式 rDNA 重复单元（15,154 bp）已组装完成。B. duperreyi 的基因组组装是目前完成度最高的石龙子基因组组装之一，将为性别决定和热性反转研究提供资源，并可作为研究胎生进化的卵生基础物种，还可用于其他石龙子科动物的比较基因组学研究。

{"title":"A genome assembly and annotation for the Australian alpine skink Bassiana duperreyi using long-read technologies","authors":"Benjamin J Hanrahan, Kirat Alreja, Andre L. M. Reis, J King Chang, Duminda S. B. Dissanayake, Richard J Edwards, Terry Bertozzi, Jillian M Hammond, Denis O'Meally, Ira W Deveson, Arthur Georges, Paul D. Waters, Hardip Rameshbhai Patel","doi":"10.1101/2024.09.05.611471","DOIUrl":"https://doi.org/10.1101/2024.09.05.611471","url":null,"abstract":"The eastern three-lined skink (Bassiana duperreyi) inhabits the Australian high country in the southwest of the continent including Tasmania. It is an oviparous species that is distinctive because it undergoes sex reversal (from XX genotypic females to phenotypic males) at low incubation temperatures. We present a chromosome-scale genome assembly of a Bassiana duperreyi XY male individual, constructed using a combination of PacBio HiFi and ONT long reads scaffolded using Illumina HiC data. The genome assembly length is 1.57 Gb with a scaffold N50 of 222 Mbp, N90 of 26 Mbp, 200 gaps and 43.10% GC content. Most (95%) of the assembly is scaffolded into 6 macrochromosomes, 8 microchromosomes and the X chromosome, corresponding to the karyotype. Fragmented Y chromosome scaffolds (n=11 ≥1 Mbp) were identified using Y-specific contigs generated by genome subtraction. We identified two novel alpha-satellite repeats of 187 bp and 199 bp in the putative centromeres that did not form higher order repeats. The genome assembly exceeds the standard recommended by the Earth Biogenome Project; 0.02% false expansions, 99.63% kmer completeness, 94.66% complete single copy BUSCO genes and an average 98.42% of transcriptome data mappable to the genome assembly. The mitochondrial genome (17,506 bp) and the model rDNA repeat unit (15,154 bp) were assembled. The B. duperreyi genome assembly has one of the highest completeness levels for a skink and will provide a resource for research focused on sex determination and thermolabile sex reversal, as an oviparous foundation species for studies of the evolution of viviparity, and for other comparative genomics studies of the Scincidae.","PeriodicalId":501161,"journal":{"name":"bioRxiv - Genomics","volume":"130 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142213645","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Characterization and bioinformatic filtering of ambient gRNAs in single-cell CRISPR screens using CLEANSER 利用 CLEANSER 对单细胞 CRISPR 筛选中的环境 gRNA 进行表征和生物信息学筛选

bioRxiv - Genomics

Pub Date : 2024-09-04 DOI: 10.1101/2024.09.04.611293

Siyan Liu, Marisa C Hamilton, Thomas N Cowart, Alejandro Barrera, Lexi R Bounds, Alexander C Nelson, Richard W Doty, Andrew S Allen, Gregory E Crawford, William H Majoros, Charles A. Gersbach

Recent technological developments in single-cell RNA-seq CRISPR screens enable high-throughput investigation of the genome. Through transduction of a gRNA library to a cell population followed by transcriptomic profiling by scRNA-seq, it is possible to characterize the effects of thousands of genomic perturbations on global gene expression. A major source of noise in scRNA-seq CRISPR screens are ambient gRNAs, which are contaminating gRNAs that likely originate from other cells. If not properly filtered, ambient gRNAs can result in an excess of false positive gRNA assignments. Here, we utilize CRISPR barnyard assays to characterize ambient gRNA noise in single-cell CRISPR screens. We use these datasets to develop and train CLEANSER, a mixture model that identifies and filters ambient gRNA noise. This model takes advantage of the bimodal distribution between native and ambient gRNAs and includes both gRNA and cell-specific normalization parameters, correcting for confounding technical factors that affect individual gRNAs and cells. The output of CLEANSER is the probability that a gRNA-cell assignment is in the native distribution over the ambient distribution. We find that ambient gRNA filtering methods impact differential gene expression analysis outcomes and that CLEANSER outperforms alternate approaches by increasing gRNA-cell assignment accuracy.

单细胞 RNA-seq CRISPR 筛选技术的最新发展实现了对基因组的高通量研究。通过将 gRNA 文库转导到细胞群，然后用 scRNA-seq 进行转录组分析，可以确定成千上万个基因组扰动对全局基因表达的影响。在 scRNA-seq CRISPR 筛查中，环境 gRNA 是噪音的一个主要来源。如果不适当过滤，环境 gRNA 会导致过多的假阳性 gRNA 分配。在这里，我们利用 CRISPR barnyard 检测来描述单细胞 CRISPR 筛选中的环境 gRNA 噪声。我们利用这些数据集来开发和训练 CLEANSER，这是一种能识别和过滤环境 gRNA 噪音的混合模型。该模型利用了原生和环境 gRNA 之间的双峰分布，并包含 gRNA 和细胞特异性归一化参数，纠正了影响单个 gRNA 和细胞的混杂技术因素。CLEANSER 的输出结果是 gRNA-细胞分配处于原生分布而非环境分布的概率。我们发现，环境 gRNA 过滤方法会影响差异基因表达分析结果，而 CLEANSER 通过提高 gRNA-细胞分配的准确性优于其他方法。

{"title":"Characterization and bioinformatic filtering of ambient gRNAs in single-cell CRISPR screens using CLEANSER","authors":"Siyan Liu, Marisa C Hamilton, Thomas N Cowart, Alejandro Barrera, Lexi R Bounds, Alexander C Nelson, Richard W Doty, Andrew S Allen, Gregory E Crawford, William H Majoros, Charles A. Gersbach","doi":"10.1101/2024.09.04.611293","DOIUrl":"https://doi.org/10.1101/2024.09.04.611293","url":null,"abstract":"Recent technological developments in single-cell RNA-seq CRISPR screens enable high-throughput investigation of the genome. Through transduction of a gRNA library to a cell population followed by transcriptomic profiling by scRNA-seq, it is possible to characterize the effects of thousands of genomic perturbations on global gene expression. A major source of noise in scRNA-seq CRISPR screens are ambient gRNAs, which are contaminating gRNAs that likely originate from other cells. If not properly filtered, ambient gRNAs can result in an excess of false positive gRNA assignments. Here, we utilize CRISPR barnyard assays to characterize ambient gRNA noise in single-cell CRISPR screens. We use these datasets to develop and train CLEANSER, a mixture model that identifies and filters ambient gRNA noise. This model takes advantage of the bimodal distribution between native and ambient gRNAs and includes both gRNA and cell-specific normalization parameters, correcting for confounding technical factors that affect individual gRNAs and cells. The output of CLEANSER is the probability that a gRNA-cell assignment is in the native distribution over the ambient distribution. We find that ambient gRNA filtering methods impact differential gene expression analysis outcomes and that CLEANSER outperforms alternate approaches by increasing gRNA-cell assignment accuracy.","PeriodicalId":501161,"journal":{"name":"bioRxiv - Genomics","volume":"12 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142213650","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Pre and Post antibiotic epoch: insights into the historical spread of antimicrobial resistance 抗生素时代前后：抗菌药耐药性历史传播的启示

bioRxiv - Genomics

Pub Date : 2024-09-04 DOI: 10.1101/2024.09.03.610986

Adrian Cazares, Wendy Figueroa, Daniel Cazares, Leandro Lima, Jake D. Turnbull, Hannah McGregor, Jo Dicks, Sarah Alexander, Zamin Iqbal, Nicholas Thomson

Plasmids are now the primary vectors of antimicrobial resistance, but our understanding of how human industrialisation of antibiotics influenced this is limited by a paucity of data predating the antibiotic era (PAE). By investigating plasmids from clinically relevant bacteria isolated between 1917 and 1954 and comparing them to modern plasmids, we captured over 100 years of evolution. We show that while all PAE plasmids were devoid of resistance genes and most never acquired them, a small minority evolved to drive the global spread of resistance to first-line and last-resort antibiotics in Gram-negative bacteria. They have evolved through complex microevolution and fusion events into a distinct group of highly recombinogenic, multi-replicon, self-transmissible plasmids that now pose the highest risk to resistance dissemination, and therefore human health.

质粒现在是抗菌药耐药性的主要载体，但由于抗生素时代（PAE）之前的数据很少，我们对人类抗生素工业化如何影响抗菌药耐药性的了解受到了限制。通过研究 1917 年至 1954 年间从临床相关细菌中分离出来的质粒，并将它们与现代质粒进行比较，我们捕捉到了 100 多年的进化过程。我们发现，虽然所有 PAE 质粒都不含抗性基因，而且大多数质粒从未获得抗性基因，但少数质粒的进化推动了革兰氏阴性细菌对一线和最后一线抗生素抗性的全球传播。它们通过复杂的微进化和融合事件进化成了一组独特的高重组性、多复制子、可自我传播的质粒，目前对耐药性传播和人类健康构成了最高风险。

引用次数: 0