通过SACRA预处理嵌合reads对低生物量人肠道噬菌体多位移扩增DNA进行长读宏基因组学研究。

IF 3.9 2区 生物学 Q1 GENETICS & HEREDITY DNA Research Pub Date : 2021-10-11 DOI:10.1093/dnares/dsab019
Yuya Kiguchi, Suguru Nishijima, Naveen Kumar, Masahira Hattori, Wataru Suda
{"title":"通过SACRA预处理嵌合reads对低生物量人肠道噬菌体多位移扩增DNA进行长读宏基因组学研究。","authors":"Yuya Kiguchi,&nbsp;Suguru Nishijima,&nbsp;Naveen Kumar,&nbsp;Masahira Hattori,&nbsp;Wataru Suda","doi":"10.1093/dnares/dsab019","DOIUrl":null,"url":null,"abstract":"<p><p>The human gut bacteriophage community (phageome) plays an important role in the host's health and disease; however, the entire structure is poorly understood, partly owing to the generation of many incomplete genomes in conventional short-read metagenomics. Here, we show long-read metagenomics of amplified DNA of low-biomass phageomes with multiple displacement amplification (MDA), involving the development of a novel bioinformatics tool, split amplified chimeric read algorithm (SACRA), that efficiently pre-processed numerous chimeric reads generated through MDA. Using five samples, SACRA markedly reduced the average chimera ratio from 72% to 1.5% in PacBio reads with an average length of 1.8 kb. De novo assembly of chimera-less PacBio long reads reconstructed contigs of ≥5 kb with an average proportion of 27%, which was 1% in contigs from MiSeq short reads, thereby dramatically improving contig length and genome completeness. Comparison of PacBio and MiSeq contigs found MiSeq contig fragmentations frequently near local repeats and hypervariable regions in the phage genomes, and those caused by multiple homologous phage genomes coexisting in the community. We also developed a reference-independent method to assess the completeness of the linear phage genomes. Overall, we established a SACRA-coupled long-read metagenomics robust to highly diverse gut phageomes, identifying high-quality circular and linear phage genomes with adequate sequence quantity.</p>","PeriodicalId":51014,"journal":{"name":"DNA Research","volume":"28 6","pages":""},"PeriodicalIF":3.9000,"publicationDate":"2021-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8513251/pdf/dsab019.pdf","citationCount":"10","resultStr":"{\"title\":\"Long-read metagenomics of multiple displacement amplified DNA of low-biomass human gut phageomes by SACRA pre-processing chimeric reads.\",\"authors\":\"Yuya Kiguchi,&nbsp;Suguru Nishijima,&nbsp;Naveen Kumar,&nbsp;Masahira Hattori,&nbsp;Wataru Suda\",\"doi\":\"10.1093/dnares/dsab019\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>The human gut bacteriophage community (phageome) plays an important role in the host's health and disease; however, the entire structure is poorly understood, partly owing to the generation of many incomplete genomes in conventional short-read metagenomics. Here, we show long-read metagenomics of amplified DNA of low-biomass phageomes with multiple displacement amplification (MDA), involving the development of a novel bioinformatics tool, split amplified chimeric read algorithm (SACRA), that efficiently pre-processed numerous chimeric reads generated through MDA. Using five samples, SACRA markedly reduced the average chimera ratio from 72% to 1.5% in PacBio reads with an average length of 1.8 kb. De novo assembly of chimera-less PacBio long reads reconstructed contigs of ≥5 kb with an average proportion of 27%, which was 1% in contigs from MiSeq short reads, thereby dramatically improving contig length and genome completeness. Comparison of PacBio and MiSeq contigs found MiSeq contig fragmentations frequently near local repeats and hypervariable regions in the phage genomes, and those caused by multiple homologous phage genomes coexisting in the community. We also developed a reference-independent method to assess the completeness of the linear phage genomes. Overall, we established a SACRA-coupled long-read metagenomics robust to highly diverse gut phageomes, identifying high-quality circular and linear phage genomes with adequate sequence quantity.</p>\",\"PeriodicalId\":51014,\"journal\":{\"name\":\"DNA Research\",\"volume\":\"28 6\",\"pages\":\"\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2021-10-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8513251/pdf/dsab019.pdf\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"DNA Research\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1093/dnares/dsab019\",\"RegionNum\":2,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"GENETICS & HEREDITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"DNA Research","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/dnares/dsab019","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 10

摘要

人类肠道噬菌体群落(phageome)在宿主的健康和疾病中起着重要作用;然而,人们对整个结构知之甚少,部分原因是传统的短读宏基因组学产生了许多不完整的基因组。在这里,我们展示了使用多重位移扩增(MDA)的低生物量噬菌体扩增DNA的长读元基因组学,涉及一种新的生物信息学工具的开发,分裂扩增嵌合读取算法(SACRA),该算法有效地预处理了通过MDA产生的大量嵌合读取。在5个样本中,SACRA显著降低了平均长度为1.8 kb的PacBio reads的平均嵌合体比例,从72%降至1.5%。无嵌合体的PacBio长reads从头组装后,平均重建的contigs≥5kb的比例为27%,而MiSeq短reads的比例为1%,从而显著提高了contigs长度和基因组的完整性。PacBio和MiSeq序列比较发现,MiSeq序列片段经常出现在噬菌体基因组的局部重复和高变区附近,并且这些片段是由多个同源噬菌体基因组共存造成的。我们还开发了一种独立于参考的方法来评估线性噬菌体基因组的完整性。总体而言,我们建立了一个sacra偶联的长读宏基因组,对高度多样化的肠道噬菌体具有强大的功能,鉴定出具有足够序列量的高质量圆形和线性噬菌体基因组。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Long-read metagenomics of multiple displacement amplified DNA of low-biomass human gut phageomes by SACRA pre-processing chimeric reads.

The human gut bacteriophage community (phageome) plays an important role in the host's health and disease; however, the entire structure is poorly understood, partly owing to the generation of many incomplete genomes in conventional short-read metagenomics. Here, we show long-read metagenomics of amplified DNA of low-biomass phageomes with multiple displacement amplification (MDA), involving the development of a novel bioinformatics tool, split amplified chimeric read algorithm (SACRA), that efficiently pre-processed numerous chimeric reads generated through MDA. Using five samples, SACRA markedly reduced the average chimera ratio from 72% to 1.5% in PacBio reads with an average length of 1.8 kb. De novo assembly of chimera-less PacBio long reads reconstructed contigs of ≥5 kb with an average proportion of 27%, which was 1% in contigs from MiSeq short reads, thereby dramatically improving contig length and genome completeness. Comparison of PacBio and MiSeq contigs found MiSeq contig fragmentations frequently near local repeats and hypervariable regions in the phage genomes, and those caused by multiple homologous phage genomes coexisting in the community. We also developed a reference-independent method to assess the completeness of the linear phage genomes. Overall, we established a SACRA-coupled long-read metagenomics robust to highly diverse gut phageomes, identifying high-quality circular and linear phage genomes with adequate sequence quantity.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
DNA Research
DNA Research 生物-遗传学
CiteScore
6.00
自引率
4.90%
发文量
39
审稿时长
4.5 months
期刊介绍: DNA Research is an internationally peer-reviewed journal which aims at publishing papers of highest quality in broad aspects of DNA and genome-related research. Emphasis will be made on the following subjects: 1) Sequencing and characterization of genomes/important genomic regions, 2) Comprehensive analysis of the functions of genes, gene families and genomes, 3) Techniques and equipments useful for structural and functional analysis of genes, gene families and genomes, 4) Computer algorithms and/or their applications relevant to structural and functional analysis of genes and genomes. The journal also welcomes novel findings in other scientific disciplines related to genomes.
期刊最新文献
Chromosome-scale genome assembly of acerola (Malpighia emarginata DC.). The burst of satellite DNA in Leptidea wood white butterflies and their putative role in karyotype evolution. Time-dependent changes in genome-wide gene expression and post-transcriptional regulation across the post-death process in silkworm. A fully phased, chromosome-scale genome of sugar beet line FC309 enables the discovery of Fusarium yellows resistance QTL. Insights from the first chromosome-level genome assembly of the alpine gentian Gentiana straminea Maxim.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1