Zhonglai Luo, Libo Jiang, Jianing Xu, Jinhuan Wang, Wenhui Nie, Zemin Ning, Fengtang Yang
{"title":"Haplotype-phased genome assemblies and annotation of the northern white-cheeked gibbon (Nomascus leucogenys).","authors":"Zhonglai Luo, Libo Jiang, Jianing Xu, Jinhuan Wang, Wenhui Nie, Zemin Ning, Fengtang Yang","doi":"10.1038/s41597-024-04073-7","DOIUrl":null,"url":null,"abstract":"<p><p>Nomascus leucogenys is a critically endangered species of small apes. Here, we sequenced and assembled the male genome of N. leucogenys, using PacBio and Hi-C datasets, with a particular focus on its Y-chromosome. The resulting high-quality haplotype-phased assemblies are at chromosome-scale, with scaffold/contig N50 values of 124.2/102.2 Mb for Haplotype 1 and 121.2/85.67 Mb for Haplotype 2. The assembled Y-chromosome spans 16.06 Mb. BUSCO assessment indicated completeness scores exceeding 95%. We predicted 18,925 protein-coding genes (23,783 mRNAs), including 58 genes on the Y-chromosome. Approximately 50% of the genome comprises repetitive elements. These comprehensive genome datasets will serve as a valuable resource for future studies on the genetics and protection of gibbons and improve our understanding on the evolution of Y-chromosome-related genes in primates.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"11 1","pages":"1279"},"PeriodicalIF":5.8000,"publicationDate":"2024-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scientific Data","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1038/s41597-024-04073-7","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Nomascus leucogenys is a critically endangered species of small apes. Here, we sequenced and assembled the male genome of N. leucogenys, using PacBio and Hi-C datasets, with a particular focus on its Y-chromosome. The resulting high-quality haplotype-phased assemblies are at chromosome-scale, with scaffold/contig N50 values of 124.2/102.2 Mb for Haplotype 1 and 121.2/85.67 Mb for Haplotype 2. The assembled Y-chromosome spans 16.06 Mb. BUSCO assessment indicated completeness scores exceeding 95%. We predicted 18,925 protein-coding genes (23,783 mRNAs), including 58 genes on the Y-chromosome. Approximately 50% of the genome comprises repetitive elements. These comprehensive genome datasets will serve as a valuable resource for future studies on the genetics and protection of gibbons and improve our understanding on the evolution of Y-chromosome-related genes in primates.
期刊介绍:
Scientific Data is an open-access journal focused on data, publishing descriptions of research datasets and articles on data sharing across natural sciences, medicine, engineering, and social sciences. Its goal is to enhance the sharing and reuse of scientific data, encourage broader data sharing, and acknowledge those who share their data.
The journal primarily publishes Data Descriptors, which offer detailed descriptions of research datasets, including data collection methods and technical analyses validating data quality. These descriptors aim to facilitate data reuse rather than testing hypotheses or presenting new interpretations, methods, or in-depth analyses.