De novo genome assembly of two tomato ancestors, Solanum pimpinellifolium and Solanum lycopersicum var. cerasiforme, by long-read sequencing.

DNA Research; IF 3.9; Zone 2 (Biology); JCR Q1 (Genetics & Heredity); Pub Date: 2021-01-19; DOI: 10.1093/dnares/dsaa029
Hitomi Takei, Kenta Shirasawa, Kosuke Kuwabara, Atsushi Toyoda, Yuma Matsuzawa, Shinji Iioka, Tohru Ariizumi
Citations: 18

De novo genome assembly of two tomato ancestors, Solanum pimpinellifolium and Solanum lycopersicum var. cerasiforme, by long-read sequencing.

The ancestral tomato species are known to possess genes that are valuable for improving traits in breeding. Here, we aimed to construct high-quality de novo genome assemblies of Solanum pimpinellifolium 'LA1670' and S. lycopersicum var. cerasiforme 'LA1673', originating from Peru. The Pacific Biosciences (PacBio) long-read sequences with 110× and 104× coverages were assembled and polished to generate 244 and 202 contigs spanning 808.8 Mbp for 'LA1670' and 804.5 Mbp for 'LA1673', respectively. After chromosome-level scaffolding with reference guiding, 14 scaffold sequences corresponding to 12 tomato chromosomes and 2 unassigned sequences were constructed. High-quality genome assemblies were confirmed using the Benchmarking Universal Single-Copy Orthologs and long terminal repeat assembly index. The protein-coding sequences were then predicted, and their transcriptomes were confirmed. The de novo assembled genomes of S. pimpinellifolium and S. lycopersicum var. cerasiforme were predicted to have 71,945 and 75,230 protein-coding genes, including 29,629 and 29,185 non-redundant genes, respectively, as supported by the transcriptome analysis results. The chromosome-level genome assemblies coupled with transcriptome data sets of the two accessions would be valuable for gaining insights into tomato domestication and understanding genome-scale breeding.

Source journal: DNA Research (Biology: Genetics)
CiteScore: 6.00
Self-citation rate: 4.90%
Articles published: 39
Review time: 4.5 months
About the journal: DNA Research is an internationally peer-reviewed journal that aims to publish papers of the highest quality on broad aspects of DNA and genome-related research. Emphasis is placed on the following subjects: 1) Sequencing and characterization of genomes/important genomic regions, 2) Comprehensive analysis of the functions of genes, gene families and genomes, 3) Techniques and equipment useful for structural and functional analysis of genes, gene families and genomes, 4) Computer algorithms and/or their applications relevant to structural and functional analysis of genes and genomes. The journal also welcomes novel findings in other scientific disciplines related to genomes.
Latest articles from this journal:
Chromosome-scale genome assembly of acerola (Malpighia emarginata DC.).
The burst of satellite DNA in Leptidea wood white butterflies and their putative role in karyotype evolution.
Time-dependent changes in genome-wide gene expression and post-transcriptional regulation across the post-death process in silkworm.
A fully phased, chromosome-scale genome of sugar beet line FC309 enables the discovery of Fusarium yellows resistance QTL.
Insights from the first chromosome-level genome assembly of the alpine gentian Gentiana straminea Maxim.