将健康性状的估计育种值从观测值转换为概率值。

IF 3.7 1区农林科学 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE Journal of Dairy Science Pub Date : 2024-11-01 DOI:10.3168/jds.2024-24767

Jorge Hidalgo , Shogo Tsuruta , Dianelys Gonzalez , Gerson de Oliveira , Miguel Sanchez , Asmita Kulkarni , Cory Przybyla , Giovana Vargas , Natascha Vukasinovic , Ignacy Misztal , Daniela Lourenco

{"title":"将健康性状的估计育种值从观测值转换为概率值。","authors":"Jorge Hidalgo , Shogo Tsuruta , Dianelys Gonzalez , Gerson de Oliveira , Miguel Sanchez , Asmita Kulkarni , Cory Przybyla , Giovana Vargas , Natascha Vukasinovic , Ignacy Misztal , Daniela Lourenco","doi":"10.3168/jds.2024-24767","DOIUrl":null,"url":null,"abstract":"<div><div>Dairy cattle health traits are paramount from a welfare and economic viewpoint, and modern breeding programs therefore prioritize the genetic improvement of these traits. Estimated breeding values for health traits are published as the probability of animals staying healthy. They are obtained using threshold models, which assume that the observed binary phenotype (i.e., healthy or sick) is dictated by an underlying normally distributed liability exceeding or not exceeding a threshold. This methodology requires significant computing time and faces convergence challenges, as it implies a nonlinear system of equations. Linear models have more straightforward computations and provide a robust approximation to threshold models; thus, they could be used to overcome these challenges. However, linear models yield estimated breeding values on the observed scale, requiring an approximation to the liability scale analogous to that from threshold models to later obtain the estimated breeding values on the probability scale. In addition, the robustness of the approximation of linear to threshold models depends on the amount of information and the incidence of the trait, with extreme incidence (i.e., ≤5%) deviating from optimal approximation. Our objective was to test a transformation from the observed to the liability, and then to the probability scale, in the genetic evaluation of health traits with moderate and very low (extreme) incidence. Data comprised displaced abomasum (5.1 million), ketosis (3.6 million), lameness (5 million), and mastitis (6.3 million) records from a Holstein population with a pedigree of 6 million animals, of which 1.7 million were genotyped. Univariate threshold and linear models were performed to predict breeding values. The agreement between estimated breeding values on the probability scale derived from threshold and linear models was assessed using Spearman rank correlations and comparison of estimated breeding values distributions. Correlations were at least 0.95, and estimated breeding value distributions almost entirely overlapped for all the traits but displaced abomasum, the trait with the lowest incidence (2%). Computing time was ∼3 times longer for threshold than for linear models. In this Holstein population, the approximation was suboptimal for a trait with extreme incidence (2%). However, when the incidence was ≥6%, the approximation was robust, and its use is recommended along with linear models for analyzing categorical traits in large populations to ease the computational burden.</div></div>","PeriodicalId":354,"journal":{"name":"Journal of Dairy Science","volume":"107 11","pages":"Pages 9628-9637"},"PeriodicalIF":3.7000,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Converting estimated breeding values from the observed to probability scale for health traits\",\"authors\":\"Jorge Hidalgo , Shogo Tsuruta , Dianelys Gonzalez , Gerson de Oliveira , Miguel Sanchez , Asmita Kulkarni , Cory Przybyla , Giovana Vargas , Natascha Vukasinovic , Ignacy Misztal , Daniela Lourenco\",\"doi\":\"10.3168/jds.2024-24767\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Dairy cattle health traits are paramount from a welfare and economic viewpoint, and modern breeding programs therefore prioritize the genetic improvement of these traits. Estimated breeding values for health traits are published as the probability of animals staying healthy. They are obtained using threshold models, which assume that the observed binary phenotype (i.e., healthy or sick) is dictated by an underlying normally distributed liability exceeding or not exceeding a threshold. This methodology requires significant computing time and faces convergence challenges, as it implies a nonlinear system of equations. Linear models have more straightforward computations and provide a robust approximation to threshold models; thus, they could be used to overcome these challenges. However, linear models yield estimated breeding values on the observed scale, requiring an approximation to the liability scale analogous to that from threshold models to later obtain the estimated breeding values on the probability scale. In addition, the robustness of the approximation of linear to threshold models depends on the amount of information and the incidence of the trait, with extreme incidence (i.e., ≤5%) deviating from optimal approximation. Our objective was to test a transformation from the observed to the liability, and then to the probability scale, in the genetic evaluation of health traits with moderate and very low (extreme) incidence. Data comprised displaced abomasum (5.1 million), ketosis (3.6 million), lameness (5 million), and mastitis (6.3 million) records from a Holstein population with a pedigree of 6 million animals, of which 1.7 million were genotyped. Univariate threshold and linear models were performed to predict breeding values. The agreement between estimated breeding values on the probability scale derived from threshold and linear models was assessed using Spearman rank correlations and comparison of estimated breeding values distributions. Correlations were at least 0.95, and estimated breeding value distributions almost entirely overlapped for all the traits but displaced abomasum, the trait with the lowest incidence (2%). Computing time was ∼3 times longer for threshold than for linear models. In this Holstein population, the approximation was suboptimal for a trait with extreme incidence (2%). However, when the incidence was ≥6%, the approximation was robust, and its use is recommended along with linear models for analyzing categorical traits in large populations to ease the computational burden.</div></div>\",\"PeriodicalId\":354,\"journal\":{\"name\":\"Journal of Dairy Science\",\"volume\":\"107 11\",\"pages\":\"Pages 9628-9637\"},\"PeriodicalIF\":3.7000,\"publicationDate\":\"2024-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Dairy Science\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0022030224009949\",\"RegionNum\":1,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AGRICULTURE, DAIRY & ANIMAL SCIENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Dairy Science","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0022030224009949","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, DAIRY & ANIMAL SCIENCE","Score":null,"Total":0}

引用次数: 0

摘要

从福利和经济角度来看，奶牛的健康性状至关重要；因此，现代育种计划优先考虑这些性状的遗传改良。健康性状的估计育种值是作为动物保持健康的概率公布的。它们是通过阈值模型获得的，该模型假定观察到的二元表型（即健康或生病）是由超过或未超过阈值的基本正态分布责任决定的。这种方法需要大量的计算时间，并面临收敛性挑战，因为它意味着一个非线性方程组。线性模型的计算更为直接，并能提供对阈值模型的稳健近似，因此可用于克服上述挑战。然而，线性模型得到的是观测尺度上的估计繁殖值，需要对责任尺度进行近似，类似于阈值模型，才能得到概率尺度上的估计繁殖值。此外，线性模型与阈值模型近似的稳健性取决于信息量和性状发生率，极端发生率（即≤5%）会偏离最佳近似值。我们的目标是测试在中度和极度（极端）发病率的健康性状遗传评估中，从观测值到责任值再到概率标度的转换。数据包括来自荷斯坦种群的腹腔移位（510 万）、酮病（360 万）、跛足（500 万）和乳腺炎（630 万）记录，该种群有 600 万只动物的血统，其中 170 万只进行了基因分型。采用单变量阈值和线性模型预测育种值。利用斯皮尔曼等级相关性和估计育种值分布的比较，评估了阈值模型和线性模型得出的概率标尺上的估计育种值之间的一致性。相关系数至少为 0.95，所有性状的估计育种值分布几乎完全重合，但发生率最低（2%）的移位性状除外。阈值模型的计算时间是线性模型的 3 倍。在该荷斯坦种群中，对于发病率极高的性状（2%）来说，近似值是次优的。然而，当发生率≥6%时，近似值是稳健的，建议在分析大量群体的分类性状时将其与线性模型一起使用，以减轻计算负担。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Converting estimated breeding values from the observed to probability scale for health traits

Dairy cattle health traits are paramount from a welfare and economic viewpoint, and modern breeding programs therefore prioritize the genetic improvement of these traits. Estimated breeding values for health traits are published as the probability of animals staying healthy. They are obtained using threshold models, which assume that the observed binary phenotype (i.e., healthy or sick) is dictated by an underlying normally distributed liability exceeding or not exceeding a threshold. This methodology requires significant computing time and faces convergence challenges, as it implies a nonlinear system of equations. Linear models have more straightforward computations and provide a robust approximation to threshold models; thus, they could be used to overcome these challenges. However, linear models yield estimated breeding values on the observed scale, requiring an approximation to the liability scale analogous to that from threshold models to later obtain the estimated breeding values on the probability scale. In addition, the robustness of the approximation of linear to threshold models depends on the amount of information and the incidence of the trait, with extreme incidence (i.e., ≤5%) deviating from optimal approximation. Our objective was to test a transformation from the observed to the liability, and then to the probability scale, in the genetic evaluation of health traits with moderate and very low (extreme) incidence. Data comprised displaced abomasum (5.1 million), ketosis (3.6 million), lameness (5 million), and mastitis (6.3 million) records from a Holstein population with a pedigree of 6 million animals, of which 1.7 million were genotyped. Univariate threshold and linear models were performed to predict breeding values. The agreement between estimated breeding values on the probability scale derived from threshold and linear models was assessed using Spearman rank correlations and comparison of estimated breeding values distributions. Correlations were at least 0.95, and estimated breeding value distributions almost entirely overlapped for all the traits but displaced abomasum, the trait with the lowest incidence (2%). Computing time was ∼3 times longer for threshold than for linear models. In this Holstein population, the approximation was suboptimal for a trait with extreme incidence (2%). However, when the incidence was ≥6%, the approximation was robust, and its use is recommended along with linear models for analyzing categorical traits in large populations to ease the computational burden.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of Dairy Science 农林科学-奶制品与动物科学

CiteScore

7.90

自引率

17.10%

发文量

784

审稿时长

4.2 months

期刊介绍： The official journal of the American Dairy Science Association®, Journal of Dairy Science® (JDS) is the leading peer-reviewed general dairy research journal in the world. JDS readers represent education, industry, and government agencies in more than 70 countries with interests in biochemistry, breeding, economics, engineering, environment, food science, genetics, microbiology, nutrition, pathology, physiology, processing, public health, quality assurance, and sanitation.