{"title":"利用无参考基因组的 Hi-C 数据对单体型解析组装进行染色体级支架构建","authors":"Xiaofei Zeng, Zili Yi, Xingtan Zhang, Yuhui Du, Yu Li, Zhiqing Zhou, Sijie Chen, Huijie Zhao, Sai Yang, Yibin Wang, Guoan Chen","doi":"10.1038/s41477-024-01755-3","DOIUrl":null,"url":null,"abstract":"Scaffolding is crucial for constructing most chromosome-level genomes. The high-throughput chromatin conformation capture (Hi-C) technology has become the primary scaffolding strategy due to its convenience and cost-effectiveness. As sequencing technologies and assembly algorithms advance, constructing haplotype-resolved genomes is increasingly preferred because haplotypes can provide additional genetic information on allelic and non-allelic variations. ALLHiC is a widely used allele-aware scaffolding tool designed for this purpose. However, its dependence on chromosome-level reference genomes and a higher chromosome misassignment rate still impede the unravelling of haplotype-resolved genomes. Here we present HapHiC, a reference-independent allele-aware scaffolding tool with superior performance on chromosome assignment as well as contig ordering and orientation. In addition, we provide new insights into the challenges in allele-aware scaffolding by conducting comprehensive analyses on various adverse factors. Finally, with the help of HapHiC, we constructed the haplotype-resolved allotriploid genome for Miscanthus × giganteus, an important lignocellulosic bioenergy crop. This study uncovers key challenges in allele-aware genome scaffolding with Hi-C data. A reference-independent Hi-C scaffolder showing superior performance was developed to construct the haplotype-resolved genome of triploid Miscanthus × giganteus.","PeriodicalId":18904,"journal":{"name":"Nature Plants","volume":null,"pages":null},"PeriodicalIF":15.8000,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Chromosome-level scaffolding of haplotype-resolved assemblies using Hi-C data without reference genomes\",\"authors\":\"Xiaofei Zeng, Zili Yi, Xingtan Zhang, Yuhui Du, Yu Li, Zhiqing Zhou, Sijie Chen, Huijie Zhao, Sai Yang, Yibin Wang, Guoan Chen\",\"doi\":\"10.1038/s41477-024-01755-3\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Scaffolding is crucial for constructing most chromosome-level genomes. The high-throughput chromatin conformation capture (Hi-C) technology has become the primary scaffolding strategy due to its convenience and cost-effectiveness. As sequencing technologies and assembly algorithms advance, constructing haplotype-resolved genomes is increasingly preferred because haplotypes can provide additional genetic information on allelic and non-allelic variations. ALLHiC is a widely used allele-aware scaffolding tool designed for this purpose. However, its dependence on chromosome-level reference genomes and a higher chromosome misassignment rate still impede the unravelling of haplotype-resolved genomes. Here we present HapHiC, a reference-independent allele-aware scaffolding tool with superior performance on chromosome assignment as well as contig ordering and orientation. In addition, we provide new insights into the challenges in allele-aware scaffolding by conducting comprehensive analyses on various adverse factors. Finally, with the help of HapHiC, we constructed the haplotype-resolved allotriploid genome for Miscanthus × giganteus, an important lignocellulosic bioenergy crop. This study uncovers key challenges in allele-aware genome scaffolding with Hi-C data. A reference-independent Hi-C scaffolder showing superior performance was developed to construct the haplotype-resolved genome of triploid Miscanthus × giganteus.\",\"PeriodicalId\":18904,\"journal\":{\"name\":\"Nature Plants\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":15.8000,\"publicationDate\":\"2024-08-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Nature Plants\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://www.nature.com/articles/s41477-024-01755-3\",\"RegionNum\":1,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"PLANT SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature Plants","FirstCategoryId":"99","ListUrlMain":"https://www.nature.com/articles/s41477-024-01755-3","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PLANT SCIENCES","Score":null,"Total":0}
Chromosome-level scaffolding of haplotype-resolved assemblies using Hi-C data without reference genomes
Scaffolding is crucial for constructing most chromosome-level genomes. The high-throughput chromatin conformation capture (Hi-C) technology has become the primary scaffolding strategy due to its convenience and cost-effectiveness. As sequencing technologies and assembly algorithms advance, constructing haplotype-resolved genomes is increasingly preferred because haplotypes can provide additional genetic information on allelic and non-allelic variations. ALLHiC is a widely used allele-aware scaffolding tool designed for this purpose. However, its dependence on chromosome-level reference genomes and a higher chromosome misassignment rate still impede the unravelling of haplotype-resolved genomes. Here we present HapHiC, a reference-independent allele-aware scaffolding tool with superior performance on chromosome assignment as well as contig ordering and orientation. In addition, we provide new insights into the challenges in allele-aware scaffolding by conducting comprehensive analyses on various adverse factors. Finally, with the help of HapHiC, we constructed the haplotype-resolved allotriploid genome for Miscanthus × giganteus, an important lignocellulosic bioenergy crop. This study uncovers key challenges in allele-aware genome scaffolding with Hi-C data. A reference-independent Hi-C scaffolder showing superior performance was developed to construct the haplotype-resolved genome of triploid Miscanthus × giganteus.
期刊介绍:
Nature Plants is an online-only, monthly journal publishing the best research on plants — from their evolution, development, metabolism and environmental interactions to their societal significance.