Using machine learning and partial dependence to evaluate robustness of best linear unbiased prediction (BLUP) for phenotypic values.

IF 2 3区 生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Journal of Applied Genetics Pub Date : 2024-05-01 Epub Date: 2024-01-03 DOI:10.1007/s13353-023-00815-2
Prashant Bhandari, Tong Geon Lee
{"title":"Using machine learning and partial dependence to evaluate robustness of best linear unbiased prediction (BLUP) for phenotypic values.","authors":"Prashant Bhandari, Tong Geon Lee","doi":"10.1007/s13353-023-00815-2","DOIUrl":null,"url":null,"abstract":"<p><p>Best linear unbiased prediction (BLUP) is widely used in plant research to address experimental variation. For phenotypic values, BLUP accuracy is largely dependent on properly controlled experimental repetition and how variable components are outlined in the model. Thus, determining BLUP robustness implies the need to evaluate contributions from each repetition. Here, we assessed the robustness of BLUP values for simulated or empirical phenotypic datasets, where the BLUP value and each experimental repetition served as dependent and independent (feature) variables, respectively. Our technique incorporated machine learning and partial dependence. First, we compared the feature importance estimated with the neural networks. Second, we compared estimated average marginal effects of individual repetitions, calculated with a partial dependence analysis. We showed that contributions of experimental repetitions are unequal in a phenotypic dataset, suggesting that the calculated BLUP value is likely to be influenced by some repetitions more than others (such as failing to detect simulated true positive associations). To resolve disproportionate sources, variable components in the BLUP model must be further outlined.</p>","PeriodicalId":14891,"journal":{"name":"Journal of Applied Genetics","volume":" ","pages":"283-286"},"PeriodicalIF":2.0000,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Applied Genetics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1007/s13353-023-00815-2","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/3 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"BIOTECHNOLOGY & APPLIED MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Best linear unbiased prediction (BLUP) is widely used in plant research to address experimental variation. For phenotypic values, BLUP accuracy is largely dependent on properly controlled experimental repetition and how variable components are outlined in the model. Thus, determining BLUP robustness implies the need to evaluate contributions from each repetition. Here, we assessed the robustness of BLUP values for simulated or empirical phenotypic datasets, where the BLUP value and each experimental repetition served as dependent and independent (feature) variables, respectively. Our technique incorporated machine learning and partial dependence. First, we compared the feature importance estimated with the neural networks. Second, we compared estimated average marginal effects of individual repetitions, calculated with a partial dependence analysis. We showed that contributions of experimental repetitions are unequal in a phenotypic dataset, suggesting that the calculated BLUP value is likely to be influenced by some repetitions more than others (such as failing to detect simulated true positive associations). To resolve disproportionate sources, variable components in the BLUP model must be further outlined.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用机器学习和部分依赖性评估表型值最佳线性无偏预测(BLUP)的稳健性。
最佳线性无偏预测(BLUP)被广泛应用于植物研究,以解决实验变异问题。对于表型值而言,最佳线性无偏预测的准确性在很大程度上取决于对实验重复的适当控制以及如何在模型中概述变量成分。因此,确定 BLUP 的稳健性意味着需要评估每次重复的贡献。在这里,我们评估了模拟或经验表型数据集的 BLUP 值的稳健性,其中 BLUP 值和每次实验重复分别作为因变量和自变量(特征)。我们的技术结合了机器学习和部分依赖性。首先,我们比较了用神经网络估计的特征重要性。其次,我们比较了通过部分依赖分析计算出的各个重复的平均边际效应。我们发现,在表型数据集中,实验重复的贡献是不平等的,这表明计算出的 BLUP 值很可能受某些重复的影响大于其他重复(如无法检测到模拟的真阳性关联)。为了解决比例失调的问题,必须进一步概述 BLUP 模型中的变量成分。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Journal of Applied Genetics
Journal of Applied Genetics 生物-生物工程与应用微生物
CiteScore
4.30
自引率
4.20%
发文量
62
审稿时长
6-12 weeks
期刊介绍: The Journal of Applied Genetics is an international journal on genetics and genomics. It publishes peer-reviewed original papers, short communications (including case reports) and review articles focused on the research of applicative aspects of plant, human, animal and microbial genetics and genomics.
期刊最新文献
Hfq influences ciprofloxacin accumulation in Escherichia coli independently of ompC and ompF post-transcriptional regulation. Scope for a threshold animal model for genetic evaluation for hip dysplasia. QTL mapping of Fusarium ear rot resistance using genotyping by target sequencing (GBTS) in maize. Correction to: The analysis of transcriptomic signature of TNBC-searching for the potential RNA‑based predictive biomarkers to determine the chemotherapy sensitivity. Bioinformatics to analyze the differentially expressed genes in different degrees of Alzheimer's disease and their roles in progress of the disease.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1