GA-GBLUP: leveraging the genetic algorithm to improve the predictability of genomic selection.

Impact factor 7.7; Zone 2 (Biology); JCR Q1 (Biochemical Research Methods); Briefings in Bioinformatics; Pub Date: 2024-07-25; DOI: 10.1093/bib/bbae385

Yang Xu, Yuxiang Zhang, Yanru Cui, Kai Zhou, Guangning Yu, Wenyan Yang, Xin Wang, Furong Li, Xiusheng Guan, Xuecai Zhang, Zefeng Yang, Shizhong Xu, Chenwu Xu

Briefings in Bioinformatics, volume 25, issue 5. Full text (PMC): https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11299030/pdf/

Citations: 0

Abstract

Genomic selection (GS) has emerged as an effective technology for accelerating crop hybrid breeding by enabling early selection before phenotypes are collected. Genomic best linear unbiased prediction (GBLUP) is a robust method that is routinely used in GS breeding programs. However, GBLUP assumes that all markers contribute equally to the total genetic variance, which may not hold in practice. In this study, we developed a novel GS method, GA-GBLUP, that uses a genetic algorithm (GA) to select markers related to the target trait. To improve predictability, we defined four fitness functions for the optimization (AIC, BIC, R², and HAT); to reduce model dimension, we binned adjacent markers based on the principle of linkage disequilibrium. The results demonstrate that the GA-GBLUP model, equipped with the R² and HAT fitness functions, achieves much higher predictability than GBLUP for most traits in the rice and maize datasets, particularly for traits with low heritability. We have also developed a user-friendly R package, GAGBLUP, for GS, which is freely available on CRAN (https://CRAN.R-project.org/package=GAGBLUP).



Source journal: Briefings in Bioinformatics (Biology: Biochemical Research Methods)

CiteScore: 13.20

Self-citation rate: 13.70%

Articles published: 549

Review time: 6 months

Journal description: Briefings in Bioinformatics is an international journal serving as a platform for researchers and educators in the life sciences. It also appeals to mathematicians, statisticians, and computer scientists applying their expertise to biological challenges. The journal focuses on reviews tailored for users of databases and analytical tools in contemporary genetics, molecular and systems biology. It stands out by offering practical assistance and guidance to non-specialists in computerized methodologies. Covering a wide range from introductory concepts to specific protocols and analyses, the papers address bacterial, plant, fungal, animal, and human data. The journal's detailed subject areas include genetic studies of phenotypes and genotypes, mapping, DNA sequencing, expression profiling, gene expression studies, microarrays, alignment methods, protein profiles and HMMs, lipids, metabolic and signaling pathways, structure determination and function prediction, phylogenetic studies, and education and training.