Danielle Dana Mitchell, Jo-Marie Vreulink, Alaric Prins, Marilize Le Roes-Hill
{"title":"从潮汐池沉积物中分离出的 Streptomyces griseoincarnatus 菌株 R-35 的基因组数据集草案。","authors":"Danielle Dana Mitchell, Jo-Marie Vreulink, Alaric Prins, Marilize Le Roes-Hill","doi":"10.1016/j.dib.2024.111235","DOIUrl":null,"url":null,"abstract":"<div><div>The marine isolate, <em>Streptomyces griseoincarnatus</em> strain R-35, was isolated from marine sediments collected from the Glencairn Tidal Pool, Table Mountain National Park, Cape Town, South Africa. The genomic DNA was sequenced using the Ion Torrent GeneStudio™ S5 platform, and the <em>de novo</em> assembly was performed using the SPAdes assembler on the Centre for High Performance Computing (CHPC) Lengau Cluster located at the CSIR, Rosebank, South Africa. The draft genome assembly consisted of 722 contigs totaling 7,625,174 base pairs and a G+C% content of 72.2 mol%. Genome completeness and genome contamination were determined as 99.12% and 0.92%, respectively. Genome annotations performed using the Rapid Annotation with Subsystem Technology (RAST) and the Bacterial and Viral Bioinformatics Resource Centre (BV-BRC) determined the presence of 7996 coding sequences (CDS), 63 transfer RNAs (tRNAs), and six ribosomal RNAs (rRNAs). A total of 2570 hypothetical proteins were assigned, and 5246 proteins were assigned to function. The phylogenomic positioning of <em>S. griseoincarnatus</em> strain R-35 was determined using the Type Strain Genome Server (TYGS) and was found to be related to <em>S. griseoincarnatus</em> JCM 4381<sup>T</sup>, with a digital DNA-DNA hybridisation (dDDH) value of 84.1%, and an OrthoANIu value of 98.22%. The CARD RGI algorithm on Proksee predicted the presence of 6,107 antimicrobial resistance (AMR) features, 27 biosynthetic gene clusters (BGCs) were predicted using antiSMASH, while 189 carbohydrate-active enzymes (CAZymes) were predicted using dbCAN3. The raw genome sequencing data has been submitted to the National Center for Biotechnology (NCBI) under the BioProject ID PRJNA1129156 (BioSample ID Accession Number: SAMN42145163; Short Read Archive (SRA) Accession: SRR29633055; <span><span>https://www.ncbi.nlm.nih.gov/bioproject/PRJNA1129156</span><svg><path></path></svg></span>).</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"58 ","pages":"Article 111235"},"PeriodicalIF":1.0000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11731734/pdf/","citationCount":"0","resultStr":"{\"title\":\"Draft genome dataset of Streptomyces griseoincarnatus strain R-35 isolated from tidal pool sediments\",\"authors\":\"Danielle Dana Mitchell, Jo-Marie Vreulink, Alaric Prins, Marilize Le Roes-Hill\",\"doi\":\"10.1016/j.dib.2024.111235\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The marine isolate, <em>Streptomyces griseoincarnatus</em> strain R-35, was isolated from marine sediments collected from the Glencairn Tidal Pool, Table Mountain National Park, Cape Town, South Africa. The genomic DNA was sequenced using the Ion Torrent GeneStudio™ S5 platform, and the <em>de novo</em> assembly was performed using the SPAdes assembler on the Centre for High Performance Computing (CHPC) Lengau Cluster located at the CSIR, Rosebank, South Africa. The draft genome assembly consisted of 722 contigs totaling 7,625,174 base pairs and a G+C% content of 72.2 mol%. Genome completeness and genome contamination were determined as 99.12% and 0.92%, respectively. Genome annotations performed using the Rapid Annotation with Subsystem Technology (RAST) and the Bacterial and Viral Bioinformatics Resource Centre (BV-BRC) determined the presence of 7996 coding sequences (CDS), 63 transfer RNAs (tRNAs), and six ribosomal RNAs (rRNAs). A total of 2570 hypothetical proteins were assigned, and 5246 proteins were assigned to function. The phylogenomic positioning of <em>S. griseoincarnatus</em> strain R-35 was determined using the Type Strain Genome Server (TYGS) and was found to be related to <em>S. griseoincarnatus</em> JCM 4381<sup>T</sup>, with a digital DNA-DNA hybridisation (dDDH) value of 84.1%, and an OrthoANIu value of 98.22%. The CARD RGI algorithm on Proksee predicted the presence of 6,107 antimicrobial resistance (AMR) features, 27 biosynthetic gene clusters (BGCs) were predicted using antiSMASH, while 189 carbohydrate-active enzymes (CAZymes) were predicted using dbCAN3. The raw genome sequencing data has been submitted to the National Center for Biotechnology (NCBI) under the BioProject ID PRJNA1129156 (BioSample ID Accession Number: SAMN42145163; Short Read Archive (SRA) Accession: SRR29633055; <span><span>https://www.ncbi.nlm.nih.gov/bioproject/PRJNA1129156</span><svg><path></path></svg></span>).</div></div>\",\"PeriodicalId\":10973,\"journal\":{\"name\":\"Data in Brief\",\"volume\":\"58 \",\"pages\":\"Article 111235\"},\"PeriodicalIF\":1.0000,\"publicationDate\":\"2025-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11731734/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Data in Brief\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2352340924011971\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data in Brief","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352340924011971","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
Draft genome dataset of Streptomyces griseoincarnatus strain R-35 isolated from tidal pool sediments
The marine isolate, Streptomyces griseoincarnatus strain R-35, was isolated from marine sediments collected from the Glencairn Tidal Pool, Table Mountain National Park, Cape Town, South Africa. The genomic DNA was sequenced using the Ion Torrent GeneStudio™ S5 platform, and the de novo assembly was performed using the SPAdes assembler on the Centre for High Performance Computing (CHPC) Lengau Cluster located at the CSIR, Rosebank, South Africa. The draft genome assembly consisted of 722 contigs totaling 7,625,174 base pairs and a G+C% content of 72.2 mol%. Genome completeness and genome contamination were determined as 99.12% and 0.92%, respectively. Genome annotations performed using the Rapid Annotation with Subsystem Technology (RAST) and the Bacterial and Viral Bioinformatics Resource Centre (BV-BRC) determined the presence of 7996 coding sequences (CDS), 63 transfer RNAs (tRNAs), and six ribosomal RNAs (rRNAs). A total of 2570 hypothetical proteins were assigned, and 5246 proteins were assigned to function. The phylogenomic positioning of S. griseoincarnatus strain R-35 was determined using the Type Strain Genome Server (TYGS) and was found to be related to S. griseoincarnatus JCM 4381T, with a digital DNA-DNA hybridisation (dDDH) value of 84.1%, and an OrthoANIu value of 98.22%. The CARD RGI algorithm on Proksee predicted the presence of 6,107 antimicrobial resistance (AMR) features, 27 biosynthetic gene clusters (BGCs) were predicted using antiSMASH, while 189 carbohydrate-active enzymes (CAZymes) were predicted using dbCAN3. The raw genome sequencing data has been submitted to the National Center for Biotechnology (NCBI) under the BioProject ID PRJNA1129156 (BioSample ID Accession Number: SAMN42145163; Short Read Archive (SRA) Accession: SRR29633055; https://www.ncbi.nlm.nih.gov/bioproject/PRJNA1129156).
期刊介绍:
Data in Brief provides a way for researchers to easily share and reuse each other''s datasets by publishing data articles that: -Thoroughly describe your data, facilitating reproducibility. -Make your data, which is often buried in supplementary material, easier to find. -Increase traffic towards associated research articles and data, leading to more citations. -Open up doors for new collaborations. Because you never know what data will be useful to someone else, Data in Brief welcomes submissions that describe data from all research areas.