Using feedback in pooled experiments augmented with imputation for high genotyping accuracy at reduced cost.

IF 2.2 3区 生物学 Q3 GENETICS & HEREDITY G3: Genes|Genomes|Genetics Pub Date : 2025-03-18 DOI:10.1093/g3journal/jkaf010
Camille Clouard, Carl Nettelblad
{"title":"Using feedback in pooled experiments augmented with imputation for high genotyping accuracy at reduced cost.","authors":"Camille Clouard, Carl Nettelblad","doi":"10.1093/g3journal/jkaf010","DOIUrl":null,"url":null,"abstract":"<p><p>Conducting genomic selection (GS) in plant breeding programs can substantially speed up the development of new varieties. GS provides more reliable insights when it is based on dense marker data, in which the rare variants can be particularly informative. Despite the availability of new technologies, the cost of large-scale genotyping remains a major limitation to the implementation of GS. We suggest to combine pooled genotyping with population-based imputation as a cost-effective computational strategy for genotyping SNPs. Pooling saves genotyping tests and has proven to accurately capture the rare variants that are usually missed by imputation. In this study, we investigate adding iterative coupling to a joint model of pooling and imputation that we have previously proposed. In each iteration, the imputed genotype probabilities serve as feedback input for adjusting the per-sample prior genotype probabilities, before running a new imputation based on these adjusted data. This flexible setup indirectly imposes consistency between the imputed genotypes and the pooled observations. We demonstrate that repeated cycles of feedback can take advantage of the strengths in both pooling and imputation when an appropriate set of reference haplotypes is available for imputation. The iterations improve greatly upon the initial genotype predictions, achieving very high genotype accuracy for both low- and high-frequency variants. We enhance the average concordance from 94.5% to 98.4% at limited computational cost and without requiring any additional genotype testing.</p>","PeriodicalId":12468,"journal":{"name":"G3: Genes|Genomes|Genetics","volume":" ","pages":""},"PeriodicalIF":2.2000,"publicationDate":"2025-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11917477/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"G3: Genes|Genomes|Genetics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/g3journal/jkaf010","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

Abstract

Conducting genomic selection (GS) in plant breeding programs can substantially speed up the development of new varieties. GS provides more reliable insights when it is based on dense marker data, in which the rare variants can be particularly informative. Despite the availability of new technologies, the cost of large-scale genotyping remains a major limitation to the implementation of GS. We suggest to combine pooled genotyping with population-based imputation as a cost-effective computational strategy for genotyping SNPs. Pooling saves genotyping tests and has proven to accurately capture the rare variants that are usually missed by imputation. In this study, we investigate adding iterative coupling to a joint model of pooling and imputation that we have previously proposed. In each iteration, the imputed genotype probabilities serve as feedback input for adjusting the per-sample prior genotype probabilities, before running a new imputation based on these adjusted data. This flexible setup indirectly imposes consistency between the imputed genotypes and the pooled observations. We demonstrate that repeated cycles of feedback can take advantage of the strengths in both pooling and imputation when an appropriate set of reference haplotypes is available for imputation. The iterations improve greatly upon the initial genotype predictions, achieving very high genotype accuracy for both low- and high-frequency variants. We enhance the average concordance from 94.5% to 98.4% at limited computational cost and without requiring any additional genotype testing.

Abstract Image

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
在混合实验中使用反馈,在较低的成本下增加了高基因分型准确性。
在植物育种计划中进行基因组选择可以大大加快新品种的开发。当基因组选择基于密集的标记数据时,它提供了更可靠的见解,其中罕见的变异可以提供特别的信息。尽管有了新技术,但大规模基因分型的成本仍然是实施基因组选择的主要限制。我们建议将混合基因分型与基于群体的归算相结合,作为一种成本效益高的snp基因分型计算策略。汇集节省了基因分型测试,并已被证明准确地捕获了通常由归算错过的罕见变异。在本研究中,我们研究了将迭代耦合添加到我们之前提出的池化和imputation联合模型中。在每次迭代中,输入的基因型概率作为反馈输入,用于调整每个样本的先前基因型概率,然后基于这些调整后的数据运行新的输入。这种灵活的设置间接地增加了输入基因型和汇总观察结果之间的一致性。我们证明,当有一组合适的参考单倍型可供输入时,重复的反馈循环可以利用池化和输入的优势。迭代大大改进了最初的基因型预测,在低频率和高频率变异中实现了非常高的基因型准确性。我们在有限的计算成本下将平均一致性从94.5%提高到98.4%,并且不需要任何额外的基因型检测。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
G3: Genes|Genomes|Genetics
G3: Genes|Genomes|Genetics GENETICS & HEREDITY-
CiteScore
5.10
自引率
3.80%
发文量
305
审稿时长
3-8 weeks
期刊介绍: G3: Genes, Genomes, Genetics provides a forum for the publication of high‐quality foundational research, particularly research that generates useful genetic and genomic information such as genome maps, single gene studies, genome‐wide association and QTL studies, as well as genome reports, mutant screens, and advances in methods and technology. The Editorial Board of G3 believes that rapid dissemination of these data is the necessary foundation for analysis that leads to mechanistic insights. G3, published by the Genetics Society of America, meets the critical and growing need of the genetics community for rapid review and publication of important results in all areas of genetics. G3 offers the opportunity to publish the puzzling finding or to present unpublished results that may not have been submitted for review and publication due to a perceived lack of a potential high-impact finding. G3 has earned the DOAJ Seal, which is a mark of certification for open access journals, awarded by DOAJ to journals that achieve a high level of openness, adhere to Best Practice and high publishing standards.
期刊最新文献
Genomic Resources for Two Apex Avian Predators from Saudi Arabia: Falco biarmicus and Falco peregrinus. Snail plots are badges of genome assembly quality. Genetic Dissection of Protein Content in Cowpea Using Custom-made NIRS equations and GWAS as a Model for Nutritional Breeding and Undergraduate Research Training. Genes shielded, repeats exposed: mutation bias in the midge Chironomus riparius. The genome of an enigmatic sea urchin parasite Echinomermella matsi Jones & Hagen, 1987 resolves its place among other invertebrate parasitic nematodes.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1