{"title":"Chromosome-scale genome assembly of Korean goosegrass (Eleusine indica).","authors":"Solji Lee, Changsoo Kim","doi":"10.1038/s41597-025-04490-2","DOIUrl":null,"url":null,"abstract":"<p><p>Goosegrass, belonging to the genus Eleusine within the Chloridoideae subfamily, is often one of the problematic weeds with strong invasiveness, competing with crops for essential survival resources. Although a chromosome-level genome assembly of E. indica from China was published last year, the present research focuses on a population of E. indica native to South Korea. Considering the high genetic variability among wild E. indica populations, constructing multi-reference genomes from geographically distinct populations is crucial for comprehensive weed management strategies. In this study, we sequenced and assembled the whole genome using PacBio long read and Illumina short read sequencing platforms. We then combined Pore-C sequencing technology to successfully anchor 255 contigs to nine pseudochromosomes. The chromosome-level genome assembly showed a high level of completeness with a 97% score according to BUSCO analysis results. Repetitive sequences accounted for 97% of the genome assembly, and 26,836 protein-coding genes were predicted. The high-quality genome assembly of E. indica will serve as a valuable genetic resource to enhance our understanding of weed control research.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"156"},"PeriodicalIF":5.8000,"publicationDate":"2025-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scientific Data","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1038/s41597-025-04490-2","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Goosegrass, belonging to the genus Eleusine within the Chloridoideae subfamily, is often one of the problematic weeds with strong invasiveness, competing with crops for essential survival resources. Although a chromosome-level genome assembly of E. indica from China was published last year, the present research focuses on a population of E. indica native to South Korea. Considering the high genetic variability among wild E. indica populations, constructing multi-reference genomes from geographically distinct populations is crucial for comprehensive weed management strategies. In this study, we sequenced and assembled the whole genome using PacBio long read and Illumina short read sequencing platforms. We then combined Pore-C sequencing technology to successfully anchor 255 contigs to nine pseudochromosomes. The chromosome-level genome assembly showed a high level of completeness with a 97% score according to BUSCO analysis results. Repetitive sequences accounted for 97% of the genome assembly, and 26,836 protein-coding genes were predicted. The high-quality genome assembly of E. indica will serve as a valuable genetic resource to enhance our understanding of weed control research.
期刊介绍:
Scientific Data is an open-access journal focused on data, publishing descriptions of research datasets and articles on data sharing across natural sciences, medicine, engineering, and social sciences. Its goal is to enhance the sharing and reuse of scientific data, encourage broader data sharing, and acknowledge those who share their data.
The journal primarily publishes Data Descriptors, which offer detailed descriptions of research datasets, including data collection methods and technical analyses validating data quality. These descriptors aim to facilitate data reuse rather than testing hypotheses or presenting new interpretations, methods, or in-depth analyses.