A dataset of 40 assembled and annotated transcriptomes from 34 species in Silene and related genera.

Data in Brief (IF 1.4, Q3, Multidisciplinary Sciences) · Pub Date: 2024-11-01 · eCollection Date: 2024-12-01 · DOI: 10.1016/j.dib.2024.111094
Patrik Cangren, Yann J K Bertrand, John M Braverman, Gregor Duncan Gilfillan, Matthew B Hamilton, Bengt Oxelman
Data in Brief, volume 57, article 111094. Journal Article. Full text (PMC): https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11615531/pdf/

Abstract

This dataset comprises 40 assembled and annotated transcriptomes from 34 species sampled from phylogenetically diverse parts of the flowering plant genus Silene (Caryophyllaceae) and the related genera Agrostemma, Atocion, Eudianthe, Heliosperma, Petrocoptis and Viscaria. RNA extracted from roots, stems, leaves, buds and flowers was sequenced using paired-end reads on the Illumina HiSeq platform. A total of 716 million raw reads were produced and assembled into 2.67 million isogroups ("genes"). Contigs from all samples were annotated using UniProt/SwissProt and assigned GO terms. In total, 974,274 annotations were made (per-sample average 24,357, SD 7,034), giving an annotation proportion of 37% (per-sample average 39%, SD 9.75%). Of these, 741,087 annotations had taxonomic identities within Magnoliopsida (per-sample average 18,527, SD 3,931), resulting in the assignment of 4,519,488 GO terms (per-sample average 112,987, SD 22,536). The dataset can support further biological research, including phylogenetic and evolutionary studies, functional analyses of genes, studies of polyploidy, and marker development.
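The summary figures in the abstract can be cross-checked with a little arithmetic. The sketch below is not part of the published dataset or its pipeline; the variable names are illustrative, the isogroup count uses the rounded 2.67 million from the abstract, and the GO-terms-per-annotation ratio assumes the GO terms were assigned to the Magnoliopsida annotations, as the wording suggests.

```python
# Totals as reported in the abstract. The isogroup count is the rounded
# figure (2.67 million), so ratios that use it are approximate.
N_SAMPLES = 40
TOTAL_ANNOTATIONS = 974_274
MAGNOLIOPSIDA_ANNOTATIONS = 741_087
GO_TERMS = 4_519_488
ISOGROUPS_APPROX = 2.67e6


def per_sample_mean(total, n_samples=N_SAMPLES):
    """Arithmetic mean of a pooled total across the samples."""
    return total / n_samples


mean_annotations = per_sample_mean(TOTAL_ANNOTATIONS)
mean_magnoliopsida = per_sample_mean(MAGNOLIOPSIDA_ANNOTATIONS)
mean_go_terms = per_sample_mean(GO_TERMS)

# Pooled proportion: all annotations over all isogroups.
pooled_annotation_proportion = TOTAL_ANNOTATIONS / ISOGROUPS_APPROX

# Share of annotations whose best hit falls within Magnoliopsida.
magnoliopsida_share = MAGNOLIOPSIDA_ANNOTATIONS / TOTAL_ANNOTATIONS

# Assumption: GO terms were assigned to the Magnoliopsida annotations.
go_terms_per_annotation = GO_TERMS / MAGNOLIOPSIDA_ANNOTATIONS

print(f"annotations per sample:      {mean_annotations:.0f}")
print(f"Magnoliopsida per sample:    {mean_magnoliopsida:.0f}")
print(f"GO terms per sample:         {mean_go_terms:.0f}")
print(f"pooled annotation share:     {pooled_annotation_proportion:.1%}")
print(f"Magnoliopsida share:         {magnoliopsida_share:.1%}")
print(f"GO terms per annotation:     {go_terms_per_annotation:.2f}")
```

The per-sample means reproduce the reported averages after rounding. The pooled proportion comes out near 36.5% with the rounded isogroup count, which is consistent with the reported 37%. It differs from the 39% per-sample average because the pooled figure weights each sample by its number of isogroups, whereas the per-sample average weights all 40 samples equally.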
