Identification of diversity-generating retroelements in host-associated and environmental genomes: prevalence, diversity, and roles.

IF 3.7 2区 生物学 Q2 BIOTECHNOLOGY & APPLIED MICROBIOLOGY BMC Genomics Pub Date : 2024-12-20 DOI:10.1186/s12864-024-11124-1
Mariela Carrasco-Villanueva, Chaoxian Wang, Chaochun Wei
{"title":"Identification of diversity-generating retroelements in host-associated and environmental genomes: prevalence, diversity, and roles.","authors":"Mariela Carrasco-Villanueva, Chaoxian Wang, Chaochun Wei","doi":"10.1186/s12864-024-11124-1","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The diversity-generating retroelements (DGRs) are a family of genetic elements that can produce mutations in target genes often related to ligand-binding functions, which possess a C-type lectin (CLec) domain that tolerates massive variations. They were first identified in viruses, then in bacteria and archaea from human-associated and environmental genomes. This DGR mechanism represents a fast adaptation of organisms to ever- changing environments. However, their existence, phylogenetic and structural diversity, and functions in a wide range of environments are largely unknown.</p><p><strong>Results: </strong>Here we present a study of DGR systems based on metagenome-assembled genomes (MAGs) from host-associated, aquatic, terrestrial and engineered environments. In total, we identified 861 non-redundant DGR-RTs and ~ 5.7% are new. We found that microbes associated with human hosts harbor the highest number of DGRs and also exhibit a higher prevalence of DGRs. After normalizing with genome size and including more genome data, we found that DGRs occur more frequently in organisms with smaller genomes. Overall, we identified nine main clades in the phylogenetic tree of reverse transcriptases (RTs), some comprising specific phyla and cassette architectures. We identified 38 different cassette patterns and 6 of them were shown in at least 10 DGRs, showing differences in terms of the numbers, arrangements, and orientations of their components. Finally, most of the target genes were related to ligand-binding and signaling functions, but we discovered a few cases in which the VRs were situated in domains different from the CLec.</p><p><strong>Conclusions: </strong>Our research sheds light on the widespread prevalence of DGRs within environments and taxa, and supports the DGR phylogenetic divergence in different organisms. These variations might also occur in their structures since some cassette architectures were common in specific underrepresented phyla. In addition, we suggest that VRs could be found in domains different to the CLec, which should be further explored for organisms in scarcely studied environments.</p>","PeriodicalId":9030,"journal":{"name":"BMC Genomics","volume":"25 1","pages":"1227"},"PeriodicalIF":3.7000,"publicationDate":"2024-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11661182/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Genomics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s12864-024-11124-1","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOTECHNOLOGY & APPLIED MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: The diversity-generating retroelements (DGRs) are a family of genetic elements that can produce mutations in target genes often related to ligand-binding functions, which possess a C-type lectin (CLec) domain that tolerates massive variations. They were first identified in viruses, then in bacteria and archaea from human-associated and environmental genomes. This DGR mechanism represents a fast adaptation of organisms to ever- changing environments. However, their existence, phylogenetic and structural diversity, and functions in a wide range of environments are largely unknown.

Results: Here we present a study of DGR systems based on metagenome-assembled genomes (MAGs) from host-associated, aquatic, terrestrial and engineered environments. In total, we identified 861 non-redundant DGR-RTs and ~ 5.7% are new. We found that microbes associated with human hosts harbor the highest number of DGRs and also exhibit a higher prevalence of DGRs. After normalizing with genome size and including more genome data, we found that DGRs occur more frequently in organisms with smaller genomes. Overall, we identified nine main clades in the phylogenetic tree of reverse transcriptases (RTs), some comprising specific phyla and cassette architectures. We identified 38 different cassette patterns and 6 of them were shown in at least 10 DGRs, showing differences in terms of the numbers, arrangements, and orientations of their components. Finally, most of the target genes were related to ligand-binding and signaling functions, but we discovered a few cases in which the VRs were situated in domains different from the CLec.

Conclusions: Our research sheds light on the widespread prevalence of DGRs within environments and taxa, and supports the DGR phylogenetic divergence in different organisms. These variations might also occur in their structures since some cassette architectures were common in specific underrepresented phyla. In addition, we suggest that VRs could be found in domains different to the CLec, which should be further explored for organisms in scarcely studied environments.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
宿主相关和环境基因组中产生多样性的逆转录因子的鉴定:流行、多样性和作用。
背景:多样性生成逆转录因子(DGRs)是一类可以在靶基因中产生突变的遗传元件,通常与配体结合功能相关,其具有耐受大量变异的c型凝集素(CLec)结构域。它们首先在病毒中被发现,然后在人类相关基因组和环境基因组中的细菌和古细菌中被发现。这种DGR机制代表了生物体对不断变化的环境的快速适应。然而,它们的存在、系统发育和结构多样性以及在广泛环境中的功能在很大程度上是未知的。结果:在这里,我们提出了一项基于宿主相关、水生、陆地和工程环境的宏基因组组装基因组(MAGs)的DGR系统研究。我们共发现861例非冗余dgr - rt,约5.7%为新发。我们发现与人类宿主相关的微生物拥有最多的dgr,并且dgr的患病率也更高。在将基因组大小归一化并包含更多基因组数据后,我们发现dgr在基因组较小的生物体中更频繁地发生。总的来说,我们在逆转录酶(RTs)的系统发育树中确定了9个主要的进化支,其中一些包括特定的门和盒结构。我们确定了38种不同的卡带模式,其中6种至少在10个dgr中出现,显示出其组件在数量,排列和方向方面的差异。最后,大多数靶基因与配体结合和信号功能有关,但我们发现少数VRs位于与CLec不同的结构域。结论:我们的研究揭示了DGR在不同环境和分类群中的普遍存在,并支持了DGR在不同生物中的系统发育差异。这些变化也可能发生在它们的结构上,因为一些盒式结构在特定的代表性不足的门中很常见。此外,我们认为VRs可能存在于与CLec不同的结构域,这应该在研究较少的环境中进一步探索。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
BMC Genomics
BMC Genomics 生物-生物工程与应用微生物
CiteScore
7.40
自引率
4.50%
发文量
769
审稿时长
6.4 months
期刊介绍: BMC Genomics is an open access, peer-reviewed journal that considers articles on all aspects of genome-scale analysis, functional genomics, and proteomics. BMC Genomics is part of the BMC series which publishes subject-specific journals focused on the needs of individual research communities across all areas of biology and medicine. We offer an efficient, fair and friendly peer review service, and are committed to publishing all sound science, provided that there is some advance in knowledge presented by the work.
期刊最新文献
Assessment of forensic individual identification and kinship analysis using transcript SNPs derived from public transcriptome sequencing data. Genome-wide identification and functional characterization of MAPKK gene family in Zanthoxylum armatum DC. reveal its potential role in ecological adaptive evolution. Integrated analysis of ATAC-seq and RNA-seq reveals the TCP-ARF molecular module related to pathogenic process of phytoplasma infection in Paulownia fortunei. DSS-PPI: a self-supervised graph learning framework for protein-protein interaction prediction via multimodal sequence semantics. crocketa: an automated Snakemake framework for integrated single-cell transcriptome and immune-repertoire analysis.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1