Integrating multi-layered biological priors to improve genomic prediction accuracy in beef cattle.

IF 5.7 2区 生物学 Q1 BIOLOGY Biology Direct Pub Date : 2024-12-31 DOI:10.1186/s13062-024-00574-y
Zhida Zhao, Qunhao Niu, Jiayuan Wu, Tianyi Wu, Xueyuan Xie, Zezhao Wang, Lupei Zhang, Huijiang Gao, Xue Gao, Lingyang Xu, Bo Zhu, Junya Li
{"title":"Integrating multi-layered biological priors to improve genomic prediction accuracy in beef cattle.","authors":"Zhida Zhao, Qunhao Niu, Jiayuan Wu, Tianyi Wu, Xueyuan Xie, Zezhao Wang, Lupei Zhang, Huijiang Gao, Xue Gao, Lingyang Xu, Bo Zhu, Junya Li","doi":"10.1186/s13062-024-00574-y","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Integrating multi-layered information can enhance the accuracy of genomic prediction for complex traits. However, the improvement and application of effective strategies for genomic prediction (GP) using multi-omics data remains challenging.</p><p><strong>Methods: </strong>We generated 11 feature sets for sequencing variants from genomics, transcriptomics, metabolomics, and epigenetics data in beef cattle, then we assessed the contribution of functional variants using genomic restricted maximum likelihood (GREML). We next estimated and ranked variant scores for 43 economically important traits, and compared the prediction accuracy of the top and bottom sets using genomic best linear unbiased prediction (GBLUP) and BayesB model. In addition, we annotated the variants from GWAS with functional feature sets and performed enrichment analysis.</p><p><strong>Results: </strong>We observed significant enrichments for 32 functional categories in 11 feature sets. The evolutionary related sets (conservation regions and selection signatures) contributed significantly to heritability (31.78-fold and 14.48-fold enrichment), while metabolomics and transcriptomics showed low heritability enrichments. We observed a significant increase in prediction accuracy using the top feature set variants compared to whole-genome sequencing (WGS) data. The prediction accuracy based on the top 10% variant set showed an average increase of 11.6% and 7.54% using BayesB and GBLUP across traits, respectively. Notably, the greatest increase of 31.52% was obtained for spleen weight (SW) using BayesB. Also, we found that the top 10% of variants show strong enrichment with weight related QTLs based on the Cattle QTL database.</p><p><strong>Conclusions: </strong>Our findings suggest that integrating biological prior information from multiple layers can enhance our understanding of the genetic architecture underlying complex traits and further improve genomic prediction in beef cattle.</p>","PeriodicalId":9164,"journal":{"name":"Biology Direct","volume":"19 1","pages":"147"},"PeriodicalIF":5.7000,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11686921/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biology Direct","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s13062-024-00574-y","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Integrating multi-layered information can enhance the accuracy of genomic prediction for complex traits. However, the improvement and application of effective strategies for genomic prediction (GP) using multi-omics data remains challenging.

Methods: We generated 11 feature sets for sequencing variants from genomics, transcriptomics, metabolomics, and epigenetics data in beef cattle, then we assessed the contribution of functional variants using genomic restricted maximum likelihood (GREML). We next estimated and ranked variant scores for 43 economically important traits, and compared the prediction accuracy of the top and bottom sets using genomic best linear unbiased prediction (GBLUP) and BayesB model. In addition, we annotated the variants from GWAS with functional feature sets and performed enrichment analysis.

Results: We observed significant enrichments for 32 functional categories in 11 feature sets. The evolutionary related sets (conservation regions and selection signatures) contributed significantly to heritability (31.78-fold and 14.48-fold enrichment), while metabolomics and transcriptomics showed low heritability enrichments. We observed a significant increase in prediction accuracy using the top feature set variants compared to whole-genome sequencing (WGS) data. The prediction accuracy based on the top 10% variant set showed an average increase of 11.6% and 7.54% using BayesB and GBLUP across traits, respectively. Notably, the greatest increase of 31.52% was obtained for spleen weight (SW) using BayesB. Also, we found that the top 10% of variants show strong enrichment with weight related QTLs based on the Cattle QTL database.

Conclusions: Our findings suggest that integrating biological prior information from multiple layers can enhance our understanding of the genetic architecture underlying complex traits and further improve genomic prediction in beef cattle.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
整合多层生物学先验提高肉牛基因组预测精度。
背景:整合多层信息可以提高复杂性状基因组预测的准确性。然而,利用多组学数据进行基因组预测的有效策略的改进和应用仍然具有挑战性。方法:我们从肉牛基因组学、转录组学、代谢组学和表观遗传学数据中生成了11个特征集,用于测序变异,然后我们使用基因组限制最大似然(GREML)评估功能变异的贡献。接下来,我们对43个重要经济性状的变异得分进行了估计和排序,并使用基因组最佳线性无偏预测(GBLUP)和BayesB模型比较了顶部和底部集的预测精度。此外,我们用功能特征集注释了GWAS的变体,并进行了富集分析。结果:我们在11个特征集中观察到32个功能类别的显著富集。进化相关集(保护区和选择特征)对遗传力有显著贡献(富集度分别为31.78倍和14.48倍),而代谢组学和转录组学的遗传力富集度较低。我们观察到与全基因组测序(WGS)数据相比,使用顶级特征集变体的预测准确性显着提高。BayesB和GBLUP对前10%变异集的预测精度在各性状上平均分别提高11.6%和7.54%。其中,BayesB可使脾脏重量(SW)提高31.52%。此外,我们发现基于牛QTL数据库的前10%的变异显示出与体重相关的QTL的强富集。结论:综合多层生物学先验信息可以增强我们对复杂性状遗传结构的理解,进一步提高肉牛基因组预测水平。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Biology Direct
Biology Direct 生物-生物学
CiteScore
6.40
自引率
10.90%
发文量
32
审稿时长
7 months
期刊介绍: Biology Direct serves the life science research community as an open access, peer-reviewed online journal, providing authors and readers with an alternative to the traditional model of peer review. Biology Direct considers original research articles, hypotheses, comments, discovery notes and reviews in subject areas currently identified as those most conducive to the open review approach, primarily those with a significant non-experimental component.
期刊最新文献
Exploring the role of oxidative stress in carotid atherosclerosis: insights from transcriptomic data and single-cell sequencing combined with machine learning. BRCA1 is involved in sustaining rapid antler growth possibly via balancing of the p53/endoplasmic reticulum stress signaling pathway. Correction: Integrated Mendelian randomization and single-cell RNA-sequencing analyses identified OAS1 as a novel therapeutic target for erectile dysfunction via targeting fibroblasts. Functional genomic imaging (FGI), a virtual tool for visualization of functional gene expression modules in heterogeneous tumor samples. Uncovering glycolysis-driven molecular subtypes in diabetic nephropathy: a WGCNA and machine learning approach for diagnostic precision.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1