
GigaByte (Hong Kong, China): latest articles

SAW: an efficient and accurate data analysis workflow for Stereo-seq spatial transcriptomics.
Pub Date : 2024-02-20 eCollection Date: 2024-01-01 DOI: 10.46471/gigabyte.111
Chun Gong, Shengkang Li, Leying Wang, Fuxiang Zhao, Shuangsang Fang, Dong Yuan, Zijian Zhao, Qiqi He, Mei Li, Weiqing Liu, Zhaoxun Li, Hongqing Xie, Sha Liao, Ao Chen, Yong Zhang, Yuxiang Li, Xun Xu

The basic analysis steps of spatial transcriptomics require obtaining gene expression information from both space and cells. The existing tools for these analyses incur performance issues when dealing with large datasets. These issues involve computationally intensive spatial localization, RNA genome alignment, and excessive memory usage in large chip scenarios. These problems affect the applicability and efficiency of the analysis. Here, a high-performance and accurate spatial transcriptomics data analysis workflow, called Stereo-seq Analysis Workflow (SAW), was developed for the Stereo-seq technology developed at BGI. SAW includes mRNA spatial position reconstruction, genome alignment, gene expression matrix generation, and clustering. The workflow outputs files in a universal format for subsequent personalized analysis. The execution time for the entire analysis is ∼148 min with 1 GB reads 1 × 1 cm chip test data, 1.8 times faster than with an unoptimized workflow.

Citations: 0
The genome assembly and annotation of the white-lipped tree pit viper Trimeresurus albolabris.
Pub Date : 2024-01-25 eCollection Date: 2024-01-01 DOI: 10.46471/gigabyte.106
Xiaotong Niu, Yakui Lv, Jin Chen, Yueheng Feng, Yilin Cui, Haorong Lu, Hui Liu

Trimeresurus albolabris, also known as the white-lipped pit viper or white-lipped tree viper, is a highly venomous snake distributed across Southeast Asia and the cause of many snakebite cases. In this study, we report the first whole genome assembly of T. albolabris obtained with next-generation sequencing from a specimen collected in Mengzi, Yunnan, China. After genome sequencing and assembly, the genome of this male T. albolabris individual was 1.51 Gb in length and included 38.42% repeat-element content. Using this genome, 21,695 genes were identified, and 99.17% of genes could be annotated using gene functional databases. Our genome assembly and annotation process was validated using a phylogenetic tree, which included six species and focused on single-copy genes of nuclear genomes. This research will contribute to future studies on Trimeresurus biology and the genetic basis of snake venom.

Citations: 0
Near chromosome-level and highly repetitive genome assembly of the snake pipefish Entelurus aequoreus (Syngnathiformes: Syngnathidae).
Pub Date : 2024-01-11 eCollection Date: 2024-01-01 DOI: 10.46471/gigabyte.105
Magnus Wolf, Bruno Lopes da Silva Ferrette, Raphael T F Coimbra, Menno de Jong, Marcel Nebenführ, David Prochotta, Yannis Schöneberg, Konstantin Zapf, Jessica Rosenbaum, Hannah A Mc Intyre, Julia Maier, Clara C S de Souza, Lucas M Gehlhaar, Melina J Werner, Henrik Oechler, Marie Wittekind, Moritz Sonnewald, Maria A Nilsson, Axel Janke, Sven Winter

The snake pipefish, Entelurus aequoreus (Linnaeus, 1758), is a northern Atlantic fish inhabiting open seagrass environments that recently expanded its distribution range. Here, we present a highly contiguous, near chromosome-scale genome of E. aequoreus. The final assembly spans 1.6 Gbp in 7,391 scaffolds, with a scaffold N50 of 62.3 Mbp and L50 of 12. The 28 largest scaffolds (>21 Mbp) span 89.7% of the assembly length. A BUSCO completeness score of 94.1% and a mapping rate above 98% suggest a high assembly completeness. Repetitive elements cover 74.93% of the genome, one of the highest proportions identified in vertebrates. Our demographic modeling identified a peak in population size during the last interglacial period, suggesting the species might benefit from warmer water conditions. Our updated snake pipefish assembly is essential for future analyses of the morphological and molecular changes unique to the Syngnathidae.
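
The scaffold N50 and L50 figures quoted in this abstract follow the usual assembly-statistics definitions: sort the scaffolds from longest to shortest and accumulate their lengths until at least half of the total assembly length is covered; N50 is the length of the scaffold that crosses that threshold, and L50 is the number of scaffolds needed to reach it. The sketch below is only an illustration of those definitions. The function name `n50_l50` and the toy lengths are made up for the example and are not taken from the paper or its pipeline.

```python
def n50_l50(lengths):
    """Return (N50, L50) for a list of scaffold lengths (any consistent unit)."""
    if not lengths:
        raise ValueError("no scaffold lengths given")
    ordered = sorted(lengths, reverse=True)   # longest scaffold first
    half_total = sum(ordered) / 2             # threshold: 50% of assembly length
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        if running >= half_total:
            # `length` is the N50; `count` scaffolds were needed, which is the L50
            return length, count


# Hypothetical toy assembly, not real data.
toy_lengths = [80, 70, 50, 40, 30, 20, 10]
print(n50_l50(toy_lengths))  # (70, 2)
```

Read this way, an L50 of 12 with an N50 of 62.3 Mbp means that the 12 longest scaffolds, each at least 62.3 Mbp long, together cover at least half of the 1.6 Gbp assembly.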

Citations: 0
Species composition and distribution of the Anopheles gambiae complex circulating in Kinshasa.
Pub Date : 2024-01-03 eCollection Date: 2024-01-01 DOI: 10.46471/gigabyte.104
Josue Zanga, Emery Metelo, Nono Mvuama, Victoire Nsabatien, Vanessa Mvudi, Degani Banzulu, Osée Mansiangi, Maxwel Bamba, Narcisse Basosila, Rodrigue Agossa, Roger Wumba

Understanding the distribution of Anopheles species is essential for planning and implementing malaria control programmes. This study assessed the composition and distribution of cryptic species of the main malaria vector, the Anopheles gambiae complex, in different districts of Kinshasa. Anopheles were sampled using CDC light traps in the four Kinshasa districts between July 2021 and June 2022, and then morphologically identified. Equal proportions of Anopheles gambiae s.l. per site were subjected to polymerase chain reaction to identify the cryptic species of the Anopheles gambiae complex. Anopheles gambiae complex specimens were identified throughout Kinshasa. The average density significantly differed inside and outside households. Two species of this complex circulate in Kinshasa: Anopheles gambiae and Anopheles coluzzii. In all the study sites, Anopheles gambiae was the most widespread species. Our results provide an important basis for future studies on the ecology and dynamics of cryptic species of the Anopheles gambiae complex in Kinshasa.

Citations: 0
Nanopore adaptive sampling enriches for antimicrobial resistance genes in microbial communities.
Pub Date : 2023-12-11 eCollection Date: 2023-01-01 DOI: 10.46471/gigabyte.103
Danielle C Wrenn, Devin M Drown

Antimicrobial resistance (AMR) is a global public health threat. Environmental microbial communities act as reservoirs for AMR, containing genes associated with resistance, their precursors, and the selective pressures promoting their persistence. Genomic surveillance could provide insights into how these reservoirs change and impact public health. Enriching for AMR genomic signatures in complex microbial communities would strengthen surveillance efforts and reduce time-to-answer. Here, we tested the ability of nanopore sequencing and adaptive sampling to enrich for AMR genes in a mock community of environmental origin. Our setup implemented the MinION mk1B, an NVIDIA Jetson Xavier GPU, and Flongle flow cells. Using adaptive sampling, we observed consistent enrichment by composition. On average, adaptive sampling resulted in a target composition 4× higher than without adaptive sampling. Despite a decrease in total sequencing output, adaptive sampling increased target yield in most replicates. We also demonstrate enrichment in a diverse community using an environmental sample. This method enables rapid and flexible genomic surveillance.

Citations: 0
Genome assembly and annotation of the king ratsnake, Elaphe carinata.
Pub Date : 2023-12-07 eCollection Date: 2023-01-01 DOI: 10.46471/gigabyte.101
Jiale Fan, Ruyi Huang, Diancheng Yang, Yanan Gong, Zhangbo Cui, Xinge Wang, Zicheng Su, Jing Yu, Yi Zhang, Tierui Zhang, Zhihao Jiang, Tianming Lan, He Wang, Song Huang

The king ratsnake (Elaphe carinata) of the genus Elaphe is a common large, non-venomous snake widely distributed in Southeast and East Asia. It is an economically important farmed species. Although non-venomous itself, the king ratsnake preys on venomous snakes, such as cobras and pit vipers. However, the immune and digestive mechanisms of the king ratsnake remain unclear. Despite the species' economic and research importance, we lack genomic resources that would benefit toxicology, phylogeography, and immunogenetics studies. Here, we used single-tube long fragment read sequencing to generate the first contiguous genome of a king ratsnake from Huangshan City, Anhui province, China. The genome size is 1.56 Gb with a scaffold N50 of 6.53 Mb. The total length of the genome is approximately 621 Mb, while the repeat content is 42.26%. Additionally, we predicted 22,339 protein-coding genes, including 22,065 with functional annotations. Our genome is a potentially useful addition to those available for snakes.

Citations: 0
Sampling collections and metadata of Planorbidae (Mollusca: Gastropoda) in Brazil: a comprehensive analysis of the Oswaldo Cruz Institute's Mollusk Collection from 1948 to 2023.
Pub Date : 2023-12-07 eCollection Date: 2023-01-01 DOI: 10.46471/gigabyte.102
Silvana Carvalho Thiengo, Mariana Gomes Lima, Alexandre Bonfim Pinheiro da Silva, Raiany Thuler Nogueira, Flávia Cristina Dos Santos Rangel, Suzete Rodrigues Gomes

Planorbidae comprises approximately 40 genera of freshwater gastropods, including roughly 250 species. Among the Planorbidae subfamilies, the significance of Planorbinae is due to its genus Biomphalaria, whose species are intermediate hosts of the trematode Schistosoma mansoni Sambon, 1907, which causes schistosomiasis in humans and animals. Here, we present the analysis of the dataset of Planorbidae housed in the Collection of Mollusks of the Oswaldo Cruz Institute, with a special focus on Biomphalaria species. This dataset includes 7,267 lots originating from 55 countries, representing 20 genera and 75 species collected from 1948 to 2023. Collections were performed in all regions of Brazil, comprising specimens from 26 states and the Federal District, particularly from the Southeast and Northeast. Within the dataset, Biomphalaria includes 3,926 lots of 31 species from 42 countries. These records will help improve our comprehension of schistosomiasis transmission dynamics and the geographic distributions of these medically important species.

Citations: 0
A dataset of small-mammal detections in West Africa and their associated micro-organisms.
Pub Date : 2023-12-04 eCollection Date: 2023-01-01 DOI: 10.46471/gigabyte.100
David Simons, Lauren A Attfield, Kate E Jones, Deborah Watson-Jones, Richard Kock

Rodents, a globally distributed and ecologically important mammalian order, serve as hosts for various zoonotic pathogens. However, sampling of rodents and their pathogens suffers from taxonomic and spatial biases. This affects consolidated databases, such as IUCN and GBIF, limiting inference regarding the spillover hazard of zoonotic pathogens into human populations. Here, we synthesised data from 127 rodent trapping studies conducted in 14 West African countries between 1964 and 2022. We combined occurrence data with pathogen screening results to produce a dataset containing detection/non-detection data for 65,628 individual small mammals identified to the species level from at least 1,611 trapping sites. We also included 32 microorganisms, identified to the species or genus levels, that are known or potential pathogens. The dataset is formatted to Darwin Core Standard with associated metadata. This dataset can mitigate spatial and taxonomic biases of current databases, improving understanding of rodent-associated zoonotic pathogen spillover across West Africa.

Citations: 0
The genome assembly and annotation of the Chinese cobra, Naja atra.
Pub Date : 2023-11-20 eCollection Date: 2023-01-01 DOI: 10.46471/gigabyte.99
Jiangang Wang, Yuxin Wu, Shiqing Wang, Weiwu Mu, Wenmei Zeng, Xi Chen, Kangfeng Jiang, Liangyu Yang, Guohai Hu, Fengping He

In China, 65 types of venomous snakes exist, with the Chinese Cobra Naja atra being prominent and a major cause of snakebites in humans. Furthermore, N. atra is a protected animal in some areas, as it has been listed as vulnerable by the International Union for Conservation of Nature. Recently, due to the medical value of snake venoms, venomics has experienced growing research interest. In particular, genomic resources are crucial for understanding the molecular mechanisms of venom production. Here, we report a highly continuous genome assembly of N. atra, based on a snake sample from Huangshan, Anhui, China. The size of this genome is 1.67 Gb, while its repeat content constitutes 37.8% of the genome. A total of 26,432 functional genes were annotated. This data provides an essential resource for studying venom production in N. atra. It may also provide guidance for the protection of this species.

Citations: 0
A reference assembly for the legume cover crop hairy vetch (Vicia villosa)
Pub Date : 2023-11-13 DOI: 10.46471/gigabyte.98
Tyson Fuller, Derek M. Bickhart, Lisa M. Koch, Lisa Kissing Kucek, Shahjahan Ali, Haley Mangelson, Maria J. Monteros, Timothy Hernandez, Timothy P. L. Smith, Heathcliffe Riday, Michael L. Sullivan
Vicia villosa is an incompletely domesticated annual legume of the Fabaceae family native to Europe and Western Asia. V. villosa is widely used as a cover crop and forage due to its ability to withstand harsh winters. Here, we generated a reference-quality genome assembly (Vvill1.0) from low error-rate long-sequence reads to improve the genetic-based trait selection of this species. Our Vvill1.0 assembly includes seven scaffolds corresponding to the seven estimated linkage groups and comprising approximately 68% of the total genome size of 2.03 Gbp. This assembly is expected to be a useful resource for genetically improving this emerging cover crop species and provide useful insights into legume genomics and plant genome evolution.
Citations: 1