DPImpute: A Genotype Imputation Framework for Ultra-Low Coverage Whole-Genome Sequencing and its Application in Genomic Selection

IF 14.1 1区 材料科学 Q1 CHEMISTRY, MULTIDISCIPLINARY Advanced Science Pub Date : 2025-02-27 DOI:10.1002/advs.202412482
Weigang Zheng, Wenlong Ma, Zhilong Chen, Chao Wang, Tao Sun, Wenjun Dong, Wenjing Zhang, Song Zhang, Zhonglin Tang, Kui Li, Yunxiang Zhao, Yuwen Liu
{"title":"DPImpute: A Genotype Imputation Framework for Ultra-Low Coverage Whole-Genome Sequencing and its Application in Genomic Selection","authors":"Weigang Zheng,&nbsp;Wenlong Ma,&nbsp;Zhilong Chen,&nbsp;Chao Wang,&nbsp;Tao Sun,&nbsp;Wenjun Dong,&nbsp;Wenjing Zhang,&nbsp;Song Zhang,&nbsp;Zhonglin Tang,&nbsp;Kui Li,&nbsp;Yunxiang Zhao,&nbsp;Yuwen Liu","doi":"10.1002/advs.202412482","DOIUrl":null,"url":null,"abstract":"<p>Whole-genome sequencing is pivotal for elucidating the complex relationships between genotype and phenotype. However, its widespread application is hindered by the high sequencing depth and large sample sizes required, especially for genomic selection (GS) reliant on precise phenotype prediction from high-density genotype data. To address this, DPImpute (Dual-Phase Impute) is developed, an two-step imputation pipeline enabling accurate whole-genome SNP genotyping under ultra-low coverage whole-genome sequencing (ulcWGS) depths, small testing sample sizes, and limited reference populations. DPImpute achieved 98.06% SNP imputation accuracy with minimal testing samples (≤10), reference samples (≤100), and an ultra-low sequencing depth of 0.3X, surpassing the accuracy of existing imputation methods. Moreover, this high accuracy is maintained across multi-ancestry human populations. Remarkably, DPImpute demonstrated accurate SNP imputation from low-coverage sequencing data from single blood cells and single blastocyst cells, highlighting its potential in embryo GS. To enhance the accessibility of DPImpute, a user-friendly web server (https://agdb.ecenr.com/DPImpute/home) is developed and a Docker container for seamless implementation. In summary, DPImpute can significantly expedite breeding programs through precise and cost-effective genotyping and serve as a valuable tool for diverse population genotyping, encompassing both human and animal studies.</p>","PeriodicalId":117,"journal":{"name":"Advanced Science","volume":"12 16","pages":""},"PeriodicalIF":14.1000,"publicationDate":"2025-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/advs.202412482","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Science","FirstCategoryId":"88","ListUrlMain":"https://advanced.onlinelibrary.wiley.com/doi/10.1002/advs.202412482","RegionNum":1,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Whole-genome sequencing is pivotal for elucidating the complex relationships between genotype and phenotype. However, its widespread application is hindered by the high sequencing depth and large sample sizes required, especially for genomic selection (GS) reliant on precise phenotype prediction from high-density genotype data. To address this, DPImpute (Dual-Phase Impute) is developed, an two-step imputation pipeline enabling accurate whole-genome SNP genotyping under ultra-low coverage whole-genome sequencing (ulcWGS) depths, small testing sample sizes, and limited reference populations. DPImpute achieved 98.06% SNP imputation accuracy with minimal testing samples (≤10), reference samples (≤100), and an ultra-low sequencing depth of 0.3X, surpassing the accuracy of existing imputation methods. Moreover, this high accuracy is maintained across multi-ancestry human populations. Remarkably, DPImpute demonstrated accurate SNP imputation from low-coverage sequencing data from single blood cells and single blastocyst cells, highlighting its potential in embryo GS. To enhance the accessibility of DPImpute, a user-friendly web server (https://agdb.ecenr.com/DPImpute/home) is developed and a Docker container for seamless implementation. In summary, DPImpute can significantly expedite breeding programs through precise and cost-effective genotyping and serve as a valuable tool for diverse population genotyping, encompassing both human and animal studies.

Abstract Image

Abstract Image

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
DPImpute:一种超低覆盖率全基因组测序的基因型插入框架及其在基因组选择中的应用。
全基因组测序是阐明基因型和表型之间复杂关系的关键。然而,它的广泛应用受到高测序深度和所需的大样本量的阻碍,特别是对于依赖于高密度基因型数据的精确表型预测的基因组选择(GS)。为了解决这个问题,DPImpute (Dual-Phase Impute)被开发出来,这是一种两步的imputation管道,可以在超低覆盖全基因组测序(ulcWGS)深度、小测试样本量和有限的参考人群下准确地进行全基因组SNP基因分型。DPImpute以最少的检测样本(≤10个)和参考样本(≤100个),以及0.3X的超低测序深度,实现了98.06%的SNP插入精度,超过了现有的插入方法的精度。此外,这种高准确性在多祖先人群中保持不变。值得注意的是,DPImpute从单个血细胞和单个囊胚细胞的低覆盖率测序数据中证明了准确的SNP imputation,突出了其在胚胎GS中的潜力。为了增强DPImpute的可访问性,开发了一个用户友好的web服务器(https://agdb.ecenr.com/DPImpute/home)和一个无缝实现的Docker容器。总之,DPImpute可以通过精确和具有成本效益的基因分型大大加快育种计划,并可作为包括人类和动物研究在内的多种群体基因分型的宝贵工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Advanced Science
Advanced Science CHEMISTRY, MULTIDISCIPLINARYNANOSCIENCE &-NANOSCIENCE & NANOTECHNOLOGY
CiteScore
18.90
自引率
2.60%
发文量
1602
审稿时长
1.9 months
期刊介绍: Advanced Science is a prestigious open access journal that focuses on interdisciplinary research in materials science, physics, chemistry, medical and life sciences, and engineering. The journal aims to promote cutting-edge research by employing a rigorous and impartial review process. It is committed to presenting research articles with the highest quality production standards, ensuring maximum accessibility of top scientific findings. With its vibrant and innovative publication platform, Advanced Science seeks to revolutionize the dissemination and organization of scientific knowledge.
期刊最新文献
The Dynamic Energy Band Model of Contact-Separation and Sliding Mode Triboelectric Charging of Polymers at the Metal-Polymer Interface. Volcano-Inspired Dual-Carbon Network Aerogel for High-Performance Solar Evaporation With Edge-Directed Salt Crystallization and Recovery. The Transcription Factor FgSge1 Harnesses the SAGA Complex to Activate Mycotoxin Biosynthesis and Fungal Virulence. Scalable Laser Processing Enables Transparent, Accretion Scale-Independent, Ice-Shedding Glass. Why Computational Photochemistry Is Challenging and Will Probably Remain So: A Quantum Chemist's Perspective.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1