Anton Kraege, Edgar Chavarro-Carrero, Eva Schnell, Stefanie Heilmann-Heimbach, Kerstin Becker, Karl Köhrer, Bruno Huettel, Nafiseh Sargheini, Philipp Schiffer, Ann-Marie Waldvogel, Bart P H J Thomma, Hanna Rovenich
{"title":"真核淡水微藻 Coccomyxa elongata SAG 216-3b 的高质量基因组组装和注释 (v1)。","authors":"Anton Kraege, Edgar Chavarro-Carrero, Eva Schnell, Stefanie Heilmann-Heimbach, Kerstin Becker, Karl Köhrer, Bruno Huettel, Nafiseh Sargheini, Philipp Schiffer, Ann-Marie Waldvogel, Bart P H J Thomma, Hanna Rovenich","doi":"10.1093/g3journal/jkae294","DOIUrl":null,"url":null,"abstract":"<p><p>Unicellular green algae of the genus Coccomyxa are recognized for their worldwide distribution and ecological versatility. Coccomyxa elongata is a freshwater species of the Coccomyxa simplex clade, which also includes lichen symbionts. To facilitate future molecular and phylogenomic studies of this versatile clade of algae, we generated a high-quality genome assembly for C. elongata Chodat & Jaag SAG 216-3b within the framework of the Biodiversity Genomics Center Cologne (BioC2) initiative. A combination of long-read PacBio HiFi and Oxford Nanopore Technologies with chromatin conformation capture (Hi-C) sequencing led to the assembly of the genome into 21 scaffolds with a total length of 51.4 Mb and an N50 of 2.8 Mb. Nineteen of the scaffolds represent highly complete nuclear chromosomes delimited by telomeric repeats, while the two additional scaffolds represent the mitochondrial and plastid genomes. Transcriptome-guided gene annotation resulted in the identification of 14,811 protein-coding genes, of which 61% have annotated protein family domains and 841 are predicted to be secreted. Benchmarking universal single-copy orthologs analysis against the Chlorophyta database identified a total of 1,494 (98.4%) complete gene models, suggesting a highly complete genome annotation.</p>","PeriodicalId":12468,"journal":{"name":"G3: Genes|Genomes|Genetics","volume":" ","pages":""},"PeriodicalIF":2.1000,"publicationDate":"2025-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11797067/pdf/","citationCount":"0","resultStr":"{\"title\":\"High quality genome assembly and annotation (v1) of the eukaryotic freshwater microalga Coccomyxa elongata SAG 216-3b.\",\"authors\":\"Anton Kraege, Edgar Chavarro-Carrero, Eva Schnell, Stefanie Heilmann-Heimbach, Kerstin Becker, Karl Köhrer, Bruno Huettel, Nafiseh Sargheini, Philipp Schiffer, Ann-Marie Waldvogel, Bart P H J Thomma, Hanna Rovenich\",\"doi\":\"10.1093/g3journal/jkae294\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Unicellular green algae of the genus Coccomyxa are recognized for their worldwide distribution and ecological versatility. Coccomyxa elongata is a freshwater species of the Coccomyxa simplex clade, which also includes lichen symbionts. To facilitate future molecular and phylogenomic studies of this versatile clade of algae, we generated a high-quality genome assembly for C. elongata Chodat & Jaag SAG 216-3b within the framework of the Biodiversity Genomics Center Cologne (BioC2) initiative. A combination of long-read PacBio HiFi and Oxford Nanopore Technologies with chromatin conformation capture (Hi-C) sequencing led to the assembly of the genome into 21 scaffolds with a total length of 51.4 Mb and an N50 of 2.8 Mb. Nineteen of the scaffolds represent highly complete nuclear chromosomes delimited by telomeric repeats, while the two additional scaffolds represent the mitochondrial and plastid genomes. Transcriptome-guided gene annotation resulted in the identification of 14,811 protein-coding genes, of which 61% have annotated protein family domains and 841 are predicted to be secreted. Benchmarking universal single-copy orthologs analysis against the Chlorophyta database identified a total of 1,494 (98.4%) complete gene models, suggesting a highly complete genome annotation.</p>\",\"PeriodicalId\":12468,\"journal\":{\"name\":\"G3: Genes|Genomes|Genetics\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":2.1000,\"publicationDate\":\"2025-02-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11797067/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"G3: Genes|Genomes|Genetics\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1093/g3journal/jkae294\",\"RegionNum\":3,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"GENETICS & HEREDITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"G3: Genes|Genomes|Genetics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/g3journal/jkae294","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
High quality genome assembly and annotation (v1) of the eukaryotic freshwater microalga Coccomyxa elongata SAG 216-3b.
Unicellular green algae of the genus Coccomyxa are recognized for their worldwide distribution and ecological versatility. Coccomyxa elongata is a freshwater species of the Coccomyxa simplex clade, which also includes lichen symbionts. To facilitate future molecular and phylogenomic studies of this versatile clade of algae, we generated a high-quality genome assembly for C. elongata Chodat & Jaag SAG 216-3b within the framework of the Biodiversity Genomics Center Cologne (BioC2) initiative. A combination of long-read PacBio HiFi and Oxford Nanopore Technologies with chromatin conformation capture (Hi-C) sequencing led to the assembly of the genome into 21 scaffolds with a total length of 51.4 Mb and an N50 of 2.8 Mb. Nineteen of the scaffolds represent highly complete nuclear chromosomes delimited by telomeric repeats, while the two additional scaffolds represent the mitochondrial and plastid genomes. Transcriptome-guided gene annotation resulted in the identification of 14,811 protein-coding genes, of which 61% have annotated protein family domains and 841 are predicted to be secreted. Benchmarking universal single-copy orthologs analysis against the Chlorophyta database identified a total of 1,494 (98.4%) complete gene models, suggesting a highly complete genome annotation.
期刊介绍:
G3: Genes, Genomes, Genetics provides a forum for the publication of high‐quality foundational research, particularly research that generates useful genetic and genomic information such as genome maps, single gene studies, genome‐wide association and QTL studies, as well as genome reports, mutant screens, and advances in methods and technology. The Editorial Board of G3 believes that rapid dissemination of these data is the necessary foundation for analysis that leads to mechanistic insights.
G3, published by the Genetics Society of America, meets the critical and growing need of the genetics community for rapid review and publication of important results in all areas of genetics. G3 offers the opportunity to publish the puzzling finding or to present unpublished results that may not have been submitted for review and publication due to a perceived lack of a potential high-impact finding. G3 has earned the DOAJ Seal, which is a mark of certification for open access journals, awarded by DOAJ to journals that achieve a high level of openness, adhere to Best Practice and high publishing standards.