{"title":"Chromosome-level genome assembly of Salvia sclarea.","authors":"Sehyun Choi, Yuna Kang, Changsoo Kim","doi":"10.1038/s41597-024-04347-0","DOIUrl":null,"url":null,"abstract":"<p><p>Salvia sclarea is a medicinal herb from the Lamiaceae family, valued for its essential oil which contains sclareol, linalool, linalyl acetate, and other compounds. Despite its extensive use, the genetic mechanisms of S. sclarea are not well understood. This study presents a chromosome-level genome assembly of S. sclarea using the Oxford Nanopore Technology, Illumina short reads, and Pore-C technology. The assembled genome spans 499.03 Mbp with a scaffold N50 of 50.5 Mbp, forming 11 pseudochromosomes. The genome assembly was validated by BUSCO analysis, which indicated a high completeness marked at 98.7%. The genome contains 68.73% repetitive sequences, with long terminal repeats (LTRs) accounting for 33.01%. A total of 17,202 protein-coding genes were predicted. Among them, 16,846 genes were annotated in the NCBI NR database, the highest among all databases, covering 97.93% of the predicted genes. The phylogenetic analysis revealed that S. sclarea shares a close evolutionary relationship with S. officinalis within the Salvia genus, while species in the genus have evolved independently within the Lamiaceae family. This high-resolution genome assembly offers fresh insights into the biosynthesis of essential oils and other aromatic compounds in S. sclarea, establishing a basis for genetic preservation.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"14"},"PeriodicalIF":5.8000,"publicationDate":"2025-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scientific Data","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1038/s41597-024-04347-0","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Salvia sclarea is a medicinal herb from the Lamiaceae family, valued for its essential oil which contains sclareol, linalool, linalyl acetate, and other compounds. Despite its extensive use, the genetic mechanisms of S. sclarea are not well understood. This study presents a chromosome-level genome assembly of S. sclarea using the Oxford Nanopore Technology, Illumina short reads, and Pore-C technology. The assembled genome spans 499.03 Mbp with a scaffold N50 of 50.5 Mbp, forming 11 pseudochromosomes. The genome assembly was validated by BUSCO analysis, which indicated a high completeness marked at 98.7%. The genome contains 68.73% repetitive sequences, with long terminal repeats (LTRs) accounting for 33.01%. A total of 17,202 protein-coding genes were predicted. Among them, 16,846 genes were annotated in the NCBI NR database, the highest among all databases, covering 97.93% of the predicted genes. The phylogenetic analysis revealed that S. sclarea shares a close evolutionary relationship with S. officinalis within the Salvia genus, while species in the genus have evolved independently within the Lamiaceae family. This high-resolution genome assembly offers fresh insights into the biosynthesis of essential oils and other aromatic compounds in S. sclarea, establishing a basis for genetic preservation.
期刊介绍:
Scientific Data is an open-access journal focused on data, publishing descriptions of research datasets and articles on data sharing across natural sciences, medicine, engineering, and social sciences. Its goal is to enhance the sharing and reuse of scientific data, encourage broader data sharing, and acknowledge those who share their data.
The journal primarily publishes Data Descriptors, which offer detailed descriptions of research datasets, including data collection methods and technical analyses validating data quality. These descriptors aim to facilitate data reuse rather than testing hypotheses or presenting new interpretations, methods, or in-depth analyses.