Draft genome of Castanopsis chinensis, a dominant species safeguarding biodiversity in subtropical broadleaved evergreen forests

IF 2.9 Q2 Biochemistry, Genetics and Molecular Biology BMC Genetics Pub Date : 2023-12-14 DOI:10.1186/s12863-023-01183-w
Pan Chen, Ju-Yu Lian, Bin Wu, Hong-Lin Cao, Zhi-Hong Li, Zheng-Feng Wang
{"title":"Draft genome of Castanopsis chinensis, a dominant species safeguarding biodiversity in subtropical broadleaved evergreen forests","authors":"Pan Chen, Ju-Yu Lian, Bin Wu, Hong-Lin Cao, Zhi-Hong Li, Zheng-Feng Wang","doi":"10.1186/s12863-023-01183-w","DOIUrl":null,"url":null,"abstract":"Castanopsis is the third largest genus in the Fagaceae family and is essentially tropical or subtropical in origin. The species in this genus are mainly canopy-dominant trees, and the key components of evergreen broadleaved forests play a crucial role in the maintenance of local biodiversity. Castanopsis chinensis, distributed from South China to Vietnam, is a representative species. It currently suffers from a high disturbance of human activity and climate change. Here, we present its assembled genome to facilitate its preliminary conservation and breeding on the genome level. The C. chinensis genome was assembled and annotated by Nanopore and MGI whole-genome sequencing and RNA-seq reads using leaf tissues. The assembly was 888,699,661 bp in length, consisting of 133 contigs and a contig N50 of 23,395,510 bp. A completeness assessment of the assembly with Benchmarking Universal Single-Copy Orthologs (BUSCO) indicated a score of 98.3%. Repetitive elements comprised 471,006,885 bp, accounting for 55.9% of the assembled sequences. A total of 51,406 genes that coded for 54,310 proteins were predicted. Multiple databases were used to functionally annotate the protein sequences.","PeriodicalId":9197,"journal":{"name":"BMC Genetics","volume":"167 1","pages":""},"PeriodicalIF":2.9000,"publicationDate":"2023-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Genetics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1186/s12863-023-01183-w","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Biochemistry, Genetics and Molecular Biology","Score":null,"Total":0}
引用次数: 0

Abstract

Castanopsis is the third largest genus in the Fagaceae family and is essentially tropical or subtropical in origin. The species in this genus are mainly canopy-dominant trees, and the key components of evergreen broadleaved forests play a crucial role in the maintenance of local biodiversity. Castanopsis chinensis, distributed from South China to Vietnam, is a representative species. It currently suffers from a high disturbance of human activity and climate change. Here, we present its assembled genome to facilitate its preliminary conservation and breeding on the genome level. The C. chinensis genome was assembled and annotated by Nanopore and MGI whole-genome sequencing and RNA-seq reads using leaf tissues. The assembly was 888,699,661 bp in length, consisting of 133 contigs and a contig N50 of 23,395,510 bp. A completeness assessment of the assembly with Benchmarking Universal Single-Copy Orthologs (BUSCO) indicated a score of 98.3%. Repetitive elements comprised 471,006,885 bp, accounting for 55.9% of the assembled sequences. A total of 51,406 genes that coded for 54,310 proteins were predicted. Multiple databases were used to functionally annotate the protein sequences.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
保护亚热带常绿阔叶林生物多样性的优势物种栲的基因组草案
栲属是落叶松科第三大属,主要产于热带或亚热带地区。该属的物种主要是树冠优势树种,是常绿阔叶林的重要组成部分,在维护当地生物多样性方面发挥着至关重要的作用。分布于华南至越南的栲树是其中的代表树种。目前,它受到人类活动和气候变化的严重干扰。在此,我们展示了其组装的基因组,以促进其在基因组水平上的初步保护和育种。通过对叶片组织进行 Nanopore 和 MGI 全基因组测序和 RNA-seq 读取,组装并注释了 chinensis 的基因组。该序列长度为 888,699,661 bp,由 133 个等位基因组成,等位基因 N50 为 23,395,510 bp。利用通用单拷贝同源物基准(BUSCO)对该序列进行的完整性评估显示,得分率为 98.3%。重复元件包括 471,006,885 bp,占组装序列的 55.9%。共预测出 51,406 个基因,编码 54,310 种蛋白质。多个数据库被用来对蛋白质序列进行功能注释。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
BMC Genetics
BMC Genetics 生物-遗传学
CiteScore
4.30
自引率
0.00%
发文量
77
审稿时长
4-8 weeks
期刊介绍: BMC Genetics is an open access, peer-reviewed journal that considers articles on all aspects of inheritance and variation in individuals and among populations.
期刊最新文献
Association between human blood metabolome and the risk of hypertension Diverse WGBS profiles of longissimus dorsi muscle in Hainan black goats and hybrid goats Draft genome of Castanopsis chinensis, a dominant species safeguarding biodiversity in subtropical broadleaved evergreen forests Chromosome-scale genome assembly of the ‘Munstead’ cultivar of Lavandula angustifolia Investigating the impact of non-additive genetic effects in the estimation of variance components and genomic predictions for heat tolerance and performance traits in crossbred and purebred pig populations
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1