首页 > 最新文献

Genetics Selection Evolution最新文献

英文 中文
High performance imputation of structural and single nucleotide variants using low-coverage whole genome sequencing 使用低覆盖率全基因组测序的结构和单核苷酸变异的高性能插入
IF 4.1 1区 农林科学 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE Pub Date : 2025-03-28 DOI: 10.1186/s12711-025-00962-6
Manu Kumar Gundappa, Diego Robledo, Alastair Hamilton, Ross D. Houston, James G. D. Prendergast, Daniel J. Macqueen
Whole genome sequencing (WGS), despite its advantages, is yet to replace methods for genotyping single nucleotide variants (SNVs) such as SNP arrays and targeted genotyping assays. Structural variants (SVs) have larger effects on traits than SNVs, but are more challenging to accurately genotype. Using low-coverage WGS with genotype imputation offers a cost-effective strategy to achieve genome-wide variant coverage, but is yet to be tested for SVs. Here, we investigate combined SNV and SV imputation with low-coverage WGS data in Atlantic salmon (Salmo salar). As the reference panel, we used genotypes for high-confidence SVs and SNVs for n = 365 wild individuals sampled from diverse populations. We also generated 15 × WGS data (n = 20 samples) for a commercial population external to the reference panel, and called SVs and SNVs with gold-standard approaches. An imputation method selected for its established performance using low-coverage sequencing data (GLIMPSE) was tested at WGS depths of 1 × , 2 × , 3 × , and 4 × for samples within and external to the reference panel. SNVs were imputed with high accuracy and recall across all WGS depths, including for samples out-with the reference panel. For SVs, we compared imputation based purely on linkage disequilibrium (LD) with SNVs, to that supplemented with SV genotype likelihoods (GLs) from low-coverage WGS. Including SV GLs increased imputation accuracy, but as a trade-off with recall, requiring 3–4 × depth for best performance. Combining strategies allowed us to capture 84% of the reference panel deletions with 87% accuracy at 1 × depth. We also show that SV length affects imputation performance, with provision of SV GLs greatly enhancing accuracy for the longest SVs in the dataset. This study highlights the promise of reference panel imputation using low-coverage WGS, including novel opportunities to enhance the resolution of genome-wide association studies by capturing SVs.
尽管全基因组测序(WGS)有其优势,但它尚未取代单核苷酸变异(snv)的基因分型方法,如SNP阵列和靶向基因分型分析。结构变异(SVs)对性状的影响比snv更大,但更难以准确地进行基因分型。使用低覆盖率WGS与基因型插补提供了一种经济有效的策略来实现全基因组变异覆盖,但尚未对sv进行测试。在此,我们利用低覆盖率的WGS数据对大西洋鲑鱼(Salmo salar)的SNV和SV进行了综合估算。作为参考面板,我们对来自不同种群的n = 365个野生个体使用高置信度的SVs和snv基因型。我们还为参考面板外的商业人群生成了15 × WGS数据(n = 20个样本),并使用金标准方法称为SVs和snv。利用低覆盖率测序数据(GLIMPSE)选择一种具有既定性能的插补方法,在参考面板内外样品的WGS深度为1 ×、2 ×、3 ×和4 ×进行测试。snv在所有WGS深度上都具有很高的准确性和召回率,包括与参考面板外的样品。对于SV,我们比较了纯粹基于连锁不平衡(LD)和snv的估算结果,与低覆盖率WGS补充SV基因型可能性(GLs)的估算结果。包括SV GLs增加了输入精度,但作为召回率的权衡,需要3-4倍的深度才能获得最佳性能。组合策略使我们能够在1倍深度下以87%的准确率捕获84%的参考面板删除。我们还表明,SV长度会影响imputation性能,提供SV GLs大大提高了数据集中最长SV的准确性。这项研究强调了使用低覆盖率WGS进行参考面板插入的前景,包括通过捕获sv来提高全基因组关联研究的分辨率的新机会。
{"title":"High performance imputation of structural and single nucleotide variants using low-coverage whole genome sequencing","authors":"Manu Kumar Gundappa, Diego Robledo, Alastair Hamilton, Ross D. Houston, James G. D. Prendergast, Daniel J. Macqueen","doi":"10.1186/s12711-025-00962-6","DOIUrl":"https://doi.org/10.1186/s12711-025-00962-6","url":null,"abstract":"Whole genome sequencing (WGS), despite its advantages, is yet to replace methods for genotyping single nucleotide variants (SNVs) such as SNP arrays and targeted genotyping assays. Structural variants (SVs) have larger effects on traits than SNVs, but are more challenging to accurately genotype. Using low-coverage WGS with genotype imputation offers a cost-effective strategy to achieve genome-wide variant coverage, but is yet to be tested for SVs. Here, we investigate combined SNV and SV imputation with low-coverage WGS data in Atlantic salmon (Salmo salar). As the reference panel, we used genotypes for high-confidence SVs and SNVs for n = 365 wild individuals sampled from diverse populations. We also generated 15 × WGS data (n = 20 samples) for a commercial population external to the reference panel, and called SVs and SNVs with gold-standard approaches. An imputation method selected for its established performance using low-coverage sequencing data (GLIMPSE) was tested at WGS depths of 1 × , 2 × , 3 × , and 4 × for samples within and external to the reference panel. SNVs were imputed with high accuracy and recall across all WGS depths, including for samples out-with the reference panel. For SVs, we compared imputation based purely on linkage disequilibrium (LD) with SNVs, to that supplemented with SV genotype likelihoods (GLs) from low-coverage WGS. Including SV GLs increased imputation accuracy, but as a trade-off with recall, requiring 3–4 × depth for best performance. Combining strategies allowed us to capture 84% of the reference panel deletions with 87% accuracy at 1 × depth. We also show that SV length affects imputation performance, with provision of SV GLs greatly enhancing accuracy for the longest SVs in the dataset. This study highlights the promise of reference panel imputation using low-coverage WGS, including novel opportunities to enhance the resolution of genome-wide association studies by capturing SVs.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"57 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2025-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143723122","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multitrait genome-wide association best linear unbiased prediction of genetic values 多性状全基因组关联对遗传价值的最佳线性无偏预测
IF 4.1 1区 农林科学 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE Pub Date : 2025-03-21 DOI: 10.1186/s12711-025-00964-4
Theo Meuwissen, Vinzent Boerner
The GWABLUP (Genome-Wide Association based Best Linear Unbiased Prediction) approach used GWA analysis results to differentially weigh the SNPs in genomic prediction, and was found to improve the reliabilities of genomic predictions. However, the proposed multitrait GWABLUP method assumed that the SNP weights were the same across the traits. Here we extended and validated the multitrait GWABLUP method towards using trait specific SNP weights. In a 3-trait dairy data set, multitrait GWAS estimates of SNP effects and their standard errors were translated into trait specific likelihood ratios for the SNPs having trait effects, and posterior probabilities using the GWABLUP approach. This produced trait specific prior (co)variance matrices for each SNP, which were applied in a SNP-BLUP model for genomic predictions, implemented in the APEX linear model suite. In a validation population, the trait specific SNP weights resulted in more reliable predictions for all three traits. Especially, for somatic cell count, which was hardly related to the other traits, the use of the same weights across all traits was harming genomic predictions. The use of trait specific SNP weights overcame this problem. In multitrait GWABLUP analyses of ~ 30,000 reference population cows, trait specific SNP weights resulted in up to 13% more reliable genomic predictions than unweighted SNP-BLUP, and improved genomic predictions for all three studied traits.
GWABLUP(基于全基因组关联的最佳线性无偏预测)方法利用GWA分析结果对基因组预测中的snp进行差异性加权,提高了基因组预测的可靠性。然而,所提出的多性状GWABLUP方法假设各性状之间的SNP权重相同。在这里,我们扩展并验证了多性状GWABLUP方法,以使用性状特异性SNP权重。在3个性状的乳制品数据集中,多性状GWAS估计的SNP效应及其标准误差被转化为具有性状效应的SNP的性状特定似然比,以及使用GWABLUP方法的后验概率。这产生了每个SNP的性状特异性先验(co)方差矩阵,这些矩阵应用于SNP- blup模型中进行基因组预测,并在APEX线性模型套件中实现。在验证群体中,性状特异性SNP权重导致对所有三个性状的预测更可靠。特别是,对于与其他性状几乎没有关系的体细胞计数,在所有性状中使用相同的权重会损害基因组预测。使用性状特异性SNP权重克服了这个问题。在对约30,000头参考种群奶牛的多性状GWABLUP分析中,性状特异性SNP权重比未加权SNP- blup预测的可靠性提高了13%,并且改进了对所有三种所研究性状的基因组预测。
{"title":"Multitrait genome-wide association best linear unbiased prediction of genetic values","authors":"Theo Meuwissen, Vinzent Boerner","doi":"10.1186/s12711-025-00964-4","DOIUrl":"https://doi.org/10.1186/s12711-025-00964-4","url":null,"abstract":"The GWABLUP (Genome-Wide Association based Best Linear Unbiased Prediction) approach used GWA analysis results to differentially weigh the SNPs in genomic prediction, and was found to improve the reliabilities of genomic predictions. However, the proposed multitrait GWABLUP method assumed that the SNP weights were the same across the traits. Here we extended and validated the multitrait GWABLUP method towards using trait specific SNP weights. In a 3-trait dairy data set, multitrait GWAS estimates of SNP effects and their standard errors were translated into trait specific likelihood ratios for the SNPs having trait effects, and posterior probabilities using the GWABLUP approach. This produced trait specific prior (co)variance matrices for each SNP, which were applied in a SNP-BLUP model for genomic predictions, implemented in the APEX linear model suite. In a validation population, the trait specific SNP weights resulted in more reliable predictions for all three traits. Especially, for somatic cell count, which was hardly related to the other traits, the use of the same weights across all traits was harming genomic predictions. The use of trait specific SNP weights overcame this problem. In multitrait GWABLUP analyses of ~ 30,000 reference population cows, trait specific SNP weights resulted in up to 13% more reliable genomic predictions than unweighted SNP-BLUP, and improved genomic predictions for all three studied traits.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"61 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2025-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143665977","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Erosion of estimated genomic breeding values with generations is due to long distance associations between markers and QTL 由于标记和QTL之间的长距离关联,估计的基因组育种值随着世代的变化而下降
IF 4.1 1区 农林科学 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE Pub Date : 2025-03-21 DOI: 10.1186/s12711-025-00963-5
Didier Boichard, Sébastien Fritz, Pascal Croiseau, Vincent Ducrocq, Thierry Tribout, Beatriz C. D. Cuyabano
Most validation studies of genomic evaluations on candidates (prior to observing phenotypes) present inflation of their predicted breeding values, i.e., regression coefficients of their later observed phenotypes on the early predictions are smaller than one. The aim of this study was to show that this inflation pattern reflects at least partly long-distance associations between markers and quantitative trait loci (QTL) in the reference population and to propose methods to estimate the corresponding “erosion” coefficient. Across-chromosome linkage disequilibrium (LD) is observed in different dairy cattle breeds, being a result from limited effective population size and from relationships within the reference population. Due to this long distance LD, the estimated SNP effects capture non-zero contributions from distant QTLs, some located on other chromosomes than the SNP itself. Therefore, corresponding SNP effects are partly lost in the next generations and we refer to this loss as “erosion”. With the concept of QTL contribution to SNP effects derived from mixed model equations, we show with simulation that this long range LD explains 6–25% of the variance of the estimated genomic breeding values, a proportion that is unchanged when the evaluation model includes a residual polygenic effect. Two methods are proposed to predict this erosion factor assuming known simulated QTL effects. In Method 1, one generation of progeny is simulated from the reference population and the GEBV of these progeny based on SNP effects estimated in this newly simulated generation are regressed on the GEBV of the same progeny based on SNP effects estimated in the reference population. In Method 2 all the QTL contributions to SNP effects are regressed based on SNP-QTL recombination rates and summed to predict the GEBV at the next generation. The regression coefficient of the GEBV based on eroded contributions on the raw GEBV is also an estimate of erosion. An illustration is given with the French Normande female reference bovine population in 2021, showing erosion factors ranging from 0.84 to 0.87. Accounting for erosion is important to avoid inflation and biased predictions. The ways to both reduce inflation and to correct for it in the prediction are discussed.
大多数候选基因评估的验证研究(在观察表型之前)存在其预测育种值的膨胀,即其后期观察到的表型对早期预测的回归系数小于1。本研究的目的是表明这种膨胀模式至少部分反映了参考群体中标记和数量性状位点(QTL)之间的长距离关联,并提出了估算相应“侵蚀”系数的方法。在不同的奶牛品种中观察到跨染色体连锁不平衡(LD),这是有限的有效群体规模和参考群体内部关系的结果。由于这种长距离LD,估计的SNP效应捕获了远端qtl的非零贡献,其中一些位于其他染色体上,而不是SNP本身。因此,相应的SNP效应在下一代中部分丢失,我们将这种丢失称为“侵蚀”。根据混合模型方程推导出的QTL对SNP效应贡献的概念,我们通过模拟表明,这一长期LD解释了估计基因组育种值方差的6-25%,当评估模型包括残余多基因效应时,这一比例不变。假设已知的模拟QTL效应,提出了两种预测侵蚀因子的方法。在方法1中,从参考群体中模拟一代后代,根据新模拟一代中估计的SNP效应对这些后代的GEBV进行回归,基于参考群体中估计的SNP效应对同一后代的GEBV进行回归。在方法2中,根据SNP-QTL重组率对所有对SNP效应的QTL贡献进行回归,并求和以预测下一代的GEBV。基于侵蚀对原始GEBV贡献的GEBV回归系数也是对侵蚀的估计。以2021年法国诺曼底雌性参考牛种群为例,显示侵蚀因子范围为0.84至0.87。考虑侵蚀对于避免通胀和有偏见的预测很重要。讨论了在预测中减少通货膨胀和修正通货膨胀的方法。
{"title":"Erosion of estimated genomic breeding values with generations is due to long distance associations between markers and QTL","authors":"Didier Boichard, Sébastien Fritz, Pascal Croiseau, Vincent Ducrocq, Thierry Tribout, Beatriz C. D. Cuyabano","doi":"10.1186/s12711-025-00963-5","DOIUrl":"https://doi.org/10.1186/s12711-025-00963-5","url":null,"abstract":"Most validation studies of genomic evaluations on candidates (prior to observing phenotypes) present inflation of their predicted breeding values, i.e., regression coefficients of their later observed phenotypes on the early predictions are smaller than one. The aim of this study was to show that this inflation pattern reflects at least partly long-distance associations between markers and quantitative trait loci (QTL) in the reference population and to propose methods to estimate the corresponding “erosion” coefficient. Across-chromosome linkage disequilibrium (LD) is observed in different dairy cattle breeds, being a result from limited effective population size and from relationships within the reference population. Due to this long distance LD, the estimated SNP effects capture non-zero contributions from distant QTLs, some located on other chromosomes than the SNP itself. Therefore, corresponding SNP effects are partly lost in the next generations and we refer to this loss as “erosion”. With the concept of QTL contribution to SNP effects derived from mixed model equations, we show with simulation that this long range LD explains 6–25% of the variance of the estimated genomic breeding values, a proportion that is unchanged when the evaluation model includes a residual polygenic effect. Two methods are proposed to predict this erosion factor assuming known simulated QTL effects. In Method 1, one generation of progeny is simulated from the reference population and the GEBV of these progeny based on SNP effects estimated in this newly simulated generation are regressed on the GEBV of the same progeny based on SNP effects estimated in the reference population. In Method 2 all the QTL contributions to SNP effects are regressed based on SNP-QTL recombination rates and summed to predict the GEBV at the next generation. The regression coefficient of the GEBV based on eroded contributions on the raw GEBV is also an estimate of erosion. An illustration is given with the French Normande female reference bovine population in 2021, showing erosion factors ranging from 0.84 to 0.87. Accounting for erosion is important to avoid inflation and biased predictions. The ways to both reduce inflation and to correct for it in the prediction are discussed.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"1 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2025-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143666295","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Molecular breeding of pigs in the genome editing era 基因组编辑时代猪的分子育种
IF 4.1 1区 农林科学 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE Pub Date : 2025-03-10 DOI: 10.1186/s12711-025-00961-7
Jiahuan Chen, Jiaqi Wang, Haoran Zhao, Xiao Tan, Shihan Yan, Huanyu Zhang, Tiefeng Wang, Xiaochun Tang
To address the increasing demand for high-quality pork protein, it is essential to implement strategies that enhance diets and produce pigs with excellent production traits. Selective breeding and crossbreeding are the primary methods used for genetic improvement in modern agriculture. However, these methods face challenges due to long breeding cycles and the necessity for beneficial genetic variation associated with high-quality traits within the population. This limitation restricts the transfer of desirable alleles across different genera and species. This article systematically reviews past and current research advancements in porcine molecular breeding. It discusses the screening of clustered regularly interspaced short palindromic repeats (CRISPR) to identify resistance loci in swine and the challenges and future applications of genetically modified pigs. The emergence of transgenic and gene editing technologies has prompted researchers to apply these methods to pig breeding. These advancements allow for alterations in the pig genome through various techniques, ranging from random integration into the genome to site-specific insertion and from target gene knockout (KO) to precise base and prime editing. As a result, numerous desirable traits, such as disease resistance, high meat yield, improved feed efficiency, reduced fat deposition, and lower environmental waste, can be achieved easily and effectively by genetic modification. These traits can serve as valuable resources to enhance swine breeding programmes. In the era of genome editing, molecular breeding of pigs is critical to the future of agriculture. Long-term and multidomain analyses of genetically modified pigs by researchers, related policy development by regulatory agencies, and public awareness and acceptance of their safety are the keys to realizing the transition of genetically modified products from the laboratory to the market.
为了满足对优质猪肉蛋白质日益增长的需求,必须实施改善日粮和生产优良生产性状猪的战略。选择育种和杂交育种是现代农业遗传改良的主要方法。然而,这些方法面临着挑战,因为育种周期长,需要与群体内高质量性状相关的有益遗传变异。这一限制限制了理想等位基因在不同属和种之间的转移。本文系统地综述了猪分子育种的研究进展。它讨论了聚集规律间隔短回文重复序列(CRISPR)的筛选,以确定猪的抗性位点,以及转基因猪的挑战和未来应用。转基因和基因编辑技术的出现促使研究人员将这些方法应用于猪的育种。这些进步允许通过各种技术改变猪基因组,从随机整合到基因组到特定位点插入,从靶基因敲除(KO)到精确的碱基和引物编辑。因此,许多理想的性状,如抗病性、高肉产量、提高饲料效率、减少脂肪沉积和减少环境浪费,可以通过基因改造轻松有效地实现。这些性状可以作为宝贵的资源来加强猪的育种计划。在基因组编辑时代,猪的分子育种对农业的未来至关重要。研究人员对转基因猪的长期和多领域分析、监管机构的相关政策制定以及公众对其安全性的认识和接受是实现转基因产品从实验室向市场过渡的关键。
{"title":"Molecular breeding of pigs in the genome editing era","authors":"Jiahuan Chen, Jiaqi Wang, Haoran Zhao, Xiao Tan, Shihan Yan, Huanyu Zhang, Tiefeng Wang, Xiaochun Tang","doi":"10.1186/s12711-025-00961-7","DOIUrl":"https://doi.org/10.1186/s12711-025-00961-7","url":null,"abstract":"To address the increasing demand for high-quality pork protein, it is essential to implement strategies that enhance diets and produce pigs with excellent production traits. Selective breeding and crossbreeding are the primary methods used for genetic improvement in modern agriculture. However, these methods face challenges due to long breeding cycles and the necessity for beneficial genetic variation associated with high-quality traits within the population. This limitation restricts the transfer of desirable alleles across different genera and species. This article systematically reviews past and current research advancements in porcine molecular breeding. It discusses the screening of clustered regularly interspaced short palindromic repeats (CRISPR) to identify resistance loci in swine and the challenges and future applications of genetically modified pigs. The emergence of transgenic and gene editing technologies has prompted researchers to apply these methods to pig breeding. These advancements allow for alterations in the pig genome through various techniques, ranging from random integration into the genome to site-specific insertion and from target gene knockout (KO) to precise base and prime editing. As a result, numerous desirable traits, such as disease resistance, high meat yield, improved feed efficiency, reduced fat deposition, and lower environmental waste, can be achieved easily and effectively by genetic modification. These traits can serve as valuable resources to enhance swine breeding programmes. In the era of genome editing, molecular breeding of pigs is critical to the future of agriculture. Long-term and multidomain analyses of genetically modified pigs by researchers, related policy development by regulatory agencies, and public awareness and acceptance of their safety are the keys to realizing the transition of genetically modified products from the laboratory to the market.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"19 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2025-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143582976","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Genomic selection in pig breeding: comparative analysis of machine learning algorithms 猪育种中的基因组选择:机器学习算法的比较分析
IF 4.1 1区 农林科学 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE Pub Date : 2025-03-10 DOI: 10.1186/s12711-025-00957-3
Ruilin Su, Jingbo Lv, Yahui Xue, Sheng Jiang, Lei Zhou, Li Jiang, Junyan Tan, Zhencai Shen, Ping Zhong, Jianfeng Liu
The effectiveness of genomic prediction (GP) significantly influences breeding progress, and employing SNP markers to predict phenotypic values is a pivotal aspect of pig breeding. Machine learning (ML) methods are usually used to predict phenotypic values since their advantages in processing high dimensional data. While, the existing researches have not indicated which ML methods are suitable for most pig genomic prediction. Therefore, it is necessary to select appropriate methods from a large number of ML methods as long as genomic prediction is performed. This paper compared the performance of popular ML methods in predicting pig phenotypes and then found out suitable methods for most traits. In this paper, five commonly used datasets from other literatures were utilized to compare the performance of different ML methods. The experimental results demonstrate that Stacking performs best on the PIC dataset where the trait information is hidden, and the performs of kernel ridge regression with rbf kernel (KRR-rbf) closely follows. Support vector regression (SVR) performs best in predicting reproductive traits, followed by genomic best linear unbiased prediction (GBLUP). GBLUP achieves the best performance on growth traits, with SVR as the second best. GBLUP achieves good performance for GP problems. Similarly, the Stacking, SVR, and KRR-RBF methods also achieve high prediction accuracy. Moreover, LR statistical analysis shows that Stacking, SVR and KRR are stable. When applying ML methods for phenotypic values prediction in pigs, we recommend these three approaches.
基因组预测(GP)的有效性显著影响育种进展,利用SNP标记预测表型值是猪育种的关键方面。由于机器学习方法在处理高维数据方面具有优势,因此通常用于预测表型值。然而,现有的研究并没有表明哪些ML方法适合于大多数猪基因组预测。因此,只要进行基因组预测,就有必要从大量ML方法中选择合适的方法。本文比较了几种常用的机器学习方法在猪表型预测方面的性能,找出了适合大多数性状的预测方法。本文利用其他文献中五个常用的数据集来比较不同ML方法的性能。实验结果表明,在特征信息隐藏的PIC数据集上,Stacking的效果最好,rbf核脊回归(KRR-rbf)的效果紧随其后。支持向量回归(SVR)对生殖性状的预测效果最好,其次是基因组最佳线性无偏预测(GBLUP)。GBLUP在生长性状上表现最佳,SVR次之。GBLUP在GP问题上取得了良好的性能。同样,Stacking、SVR和KRR-RBF方法也能达到较高的预测精度。此外,LR统计分析表明,叠加、SVR和KRR是稳定的。当应用ML方法预测猪的表型值时,我们推荐这三种方法。
{"title":"Genomic selection in pig breeding: comparative analysis of machine learning algorithms","authors":"Ruilin Su, Jingbo Lv, Yahui Xue, Sheng Jiang, Lei Zhou, Li Jiang, Junyan Tan, Zhencai Shen, Ping Zhong, Jianfeng Liu","doi":"10.1186/s12711-025-00957-3","DOIUrl":"https://doi.org/10.1186/s12711-025-00957-3","url":null,"abstract":"The effectiveness of genomic prediction (GP) significantly influences breeding progress, and employing SNP markers to predict phenotypic values is a pivotal aspect of pig breeding. Machine learning (ML) methods are usually used to predict phenotypic values since their advantages in processing high dimensional data. While, the existing researches have not indicated which ML methods are suitable for most pig genomic prediction. Therefore, it is necessary to select appropriate methods from a large number of ML methods as long as genomic prediction is performed. This paper compared the performance of popular ML methods in predicting pig phenotypes and then found out suitable methods for most traits. In this paper, five commonly used datasets from other literatures were utilized to compare the performance of different ML methods. The experimental results demonstrate that Stacking performs best on the PIC dataset where the trait information is hidden, and the performs of kernel ridge regression with rbf kernel (KRR-rbf) closely follows. Support vector regression (SVR) performs best in predicting reproductive traits, followed by genomic best linear unbiased prediction (GBLUP). GBLUP achieves the best performance on growth traits, with SVR as the second best. GBLUP achieves good performance for GP problems. Similarly, the Stacking, SVR, and KRR-RBF methods also achieve high prediction accuracy. Moreover, LR statistical analysis shows that Stacking, SVR and KRR are stable. When applying ML methods for phenotypic values prediction in pigs, we recommend these three approaches.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"38 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2025-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143582977","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Merging metabolomics and genomics provides a catalog of genetic factors that influence molecular phenotypes in pigs linking relevant metabolic pathways 代谢组学和基因组学的结合提供了影响猪分子表型的遗传因素目录,这些遗传因素连接了相关的代谢途径
IF 4.1 1区 农林科学 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE Pub Date : 2025-03-06 DOI: 10.1186/s12711-025-00960-8
Samuele Bovo, Anisa Ribani, Flaminia Fanelli, Giuliano Galimberti, Pier Luigi Martelli, Paolo Trevisi, Francesca Bertolini, Matteo Bolner, Rita Casadio, Stefania Dall’Olio, Maurizio Gallo, Diana Luise, Gianluca Mazzoni, Giuseppina Schiavo, Valeria Taurisano, Paolo Zambonelli, Paolo Bosi, Uberto Pagotto, Luca Fontanesi
Metabolomics opens novel avenues to study the basic biological mechanisms underlying complex traits, starting from characterization of metabolites. Metabolites and their levels in a biofluid represent simple molecular phenotypes (metabotypes) that are direct products of enzyme activities and relate to all metabolic pathways, including catabolism and anabolism of nutrients. In this study, we demonstrated the utility of merging metabolomics and genomics in pigs to uncover a large list of genetic factors that influence mammalian metabolism. We obtained targeted characterization of the plasma metabolome of more than 1300 pigs from two populations of Large White and Duroc pig breeds. The metabolomic profiles of these pigs were used to identify genetically influenced metabolites by estimating the heritability of the level of 188 metabolites. Then, combining breed-specific genome-wide association studies of single metabolites and their ratios and across breed meta-analyses, we identified a total of 97 metabolite quantitative trait loci (mQTL), associated with 126 metabolites. Using these results, we constructed a human-pig comparative catalog of genetic factors influencing the metabolomic profile. Whole genome resequencing data identified several putative causative mutations for these mQTL. Additionally, based on a major mQTL for kynurenine level, we designed a nutrigenetic study feeding piglets that carried different genotypes at the candidate gene kynurenine 3-monooxygenase (KMO) varying levels of tryptophan and demonstrated the effect of this genetic factor on the kynurenine pathway. Furthermore, we used metabolomic profiles of Large White and Duroc pigs to reconstruct metabolic pathways using Gaussian Graphical Models, which included perturbation of the identified mQTL. This study has provided the first catalog of genetic factors affecting molecular phenotypes that describe the pig blood metabolome, with links to important metabolic pathways, opening novel avenues to merge genetics and nutrition in this livestock species. The obtained results are relevant for basic and applied biology and to evaluate the pig as a biomedical model. Genetically influenced metabolites can be further exploited in nutrigenetic approaches in pigs. The described molecular phenotypes can be useful to dissect complex traits and design novel feeding, breeding and selection programs in pigs.
代谢组学为研究复杂性状的基本生物学机制开辟了新的途径,从代谢物的表征开始。生物流体中的代谢物及其水平代表了简单的分子表型(代谢型),它们是酶活性的直接产物,与所有代谢途径有关,包括营养物质的分解代谢和合成代谢。在这项研究中,我们展示了将代谢组学和基因组学结合在猪身上的效用,以揭示影响哺乳动物代谢的大量遗传因素。我们对来自大白猪和杜洛克猪两个品种的1300多头猪的血浆代谢组进行了针对性的表征。通过估计188种代谢物水平的遗传能力,利用这些猪的代谢组学特征来鉴定受遗传影响的代谢物。然后,结合单个代谢物及其比例的品种特异性全基因组关联研究和跨品种荟萃分析,我们共鉴定出97个代谢物数量性状位点(mQTL),与126种代谢物相关。利用这些结果,我们构建了影响代谢组学特征的遗传因素的人猪比较目录。全基因组重测序数据确定了这些mQTL的几个假定的致病突变。此外,基于犬尿氨酸水平的主要mQTL,我们设计了一项营养遗传学研究,饲养不同水平色氨酸的候选基因kynurenine 3-monooxygenase (KMO)携带不同基因型的仔猪,并证明了该遗传因素对犬尿氨酸途径的影响。此外,我们利用大白猪和杜洛克猪的代谢组学特征,使用高斯图形模型重建代谢途径,其中包括已识别的mQTL的扰动。这项研究首次提供了影响猪血液代谢组分子表型的遗传因素目录,并与重要的代谢途径相关联,为将这种牲畜物种的遗传和营养结合起来开辟了新的途径。所获得的结果与基础生物学和应用生物学以及评价猪作为生物医学模型有关。受遗传影响的代谢物可以在猪的营养遗传学方法中进一步利用。所描述的分子表型可以用于解剖复杂的性状和设计新的饲养,育种和选择方案的猪。
{"title":"Merging metabolomics and genomics provides a catalog of genetic factors that influence molecular phenotypes in pigs linking relevant metabolic pathways","authors":"Samuele Bovo, Anisa Ribani, Flaminia Fanelli, Giuliano Galimberti, Pier Luigi Martelli, Paolo Trevisi, Francesca Bertolini, Matteo Bolner, Rita Casadio, Stefania Dall’Olio, Maurizio Gallo, Diana Luise, Gianluca Mazzoni, Giuseppina Schiavo, Valeria Taurisano, Paolo Zambonelli, Paolo Bosi, Uberto Pagotto, Luca Fontanesi","doi":"10.1186/s12711-025-00960-8","DOIUrl":"https://doi.org/10.1186/s12711-025-00960-8","url":null,"abstract":"Metabolomics opens novel avenues to study the basic biological mechanisms underlying complex traits, starting from characterization of metabolites. Metabolites and their levels in a biofluid represent simple molecular phenotypes (metabotypes) that are direct products of enzyme activities and relate to all metabolic pathways, including catabolism and anabolism of nutrients. In this study, we demonstrated the utility of merging metabolomics and genomics in pigs to uncover a large list of genetic factors that influence mammalian metabolism. We obtained targeted characterization of the plasma metabolome of more than 1300 pigs from two populations of Large White and Duroc pig breeds. The metabolomic profiles of these pigs were used to identify genetically influenced metabolites by estimating the heritability of the level of 188 metabolites. Then, combining breed-specific genome-wide association studies of single metabolites and their ratios and across breed meta-analyses, we identified a total of 97 metabolite quantitative trait loci (mQTL), associated with 126 metabolites. Using these results, we constructed a human-pig comparative catalog of genetic factors influencing the metabolomic profile. Whole genome resequencing data identified several putative causative mutations for these mQTL. Additionally, based on a major mQTL for kynurenine level, we designed a nutrigenetic study feeding piglets that carried different genotypes at the candidate gene kynurenine 3-monooxygenase (KMO) varying levels of tryptophan and demonstrated the effect of this genetic factor on the kynurenine pathway. Furthermore, we used metabolomic profiles of Large White and Duroc pigs to reconstruct metabolic pathways using Gaussian Graphical Models, which included perturbation of the identified mQTL. This study has provided the first catalog of genetic factors affecting molecular phenotypes that describe the pig blood metabolome, with links to important metabolic pathways, opening novel avenues to merge genetics and nutrition in this livestock species. The obtained results are relevant for basic and applied biology and to evaluate the pig as a biomedical model. Genetically influenced metabolites can be further exploited in nutrigenetic approaches in pigs. The described molecular phenotypes can be useful to dissect complex traits and design novel feeding, breeding and selection programs in pigs.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"36 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2025-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143560636","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluation of genomic selection models using whole genome sequence data and functional annotation in Belgian Blue cattle 利用比利时蓝牛全基因组序列数据和功能注释评估基因组选择模型
IF 4.1 1区 农林科学 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE Pub Date : 2025-03-04 DOI: 10.1186/s12711-025-00955-5
Can Yuan, Alain Gillon, José Luis Gualdrón Duarte, Haruko Takeda, Wouter Coppieters, Michel Georges, Tom Druet
The availability of large cohorts of whole-genome sequenced individuals, combined with functional annotation, is expected to provide opportunities to improve the accuracy of genomic selection (GS). However, such benefits have not often been observed in initial applications. The reference population for GS in Belgian Blue Cattle (BBC) continues to grow. Combined with the availability of reference panels of sequenced individuals, it provides an opportunity to evaluate GS models using whole genome sequence (WGS) data and functional annotation. Here, we used data from 16,508 cows, with phenotypes for five muscular development traits and imputed at the WGS level, in combination with in silico functional annotation and catalogs of putative regulatory variants obtained from experimental data. We evaluated first GS models using the entire WGS data, with or without functional annotation. At this marker density, we were able to run two approaches, assuming either a highly polygenic architecture (GBLUP) or allowing some variants to have larger effects (BayesRR-RC, a Bayesian mixture model), and observed an increased reliability compared to the official GBLUP model at medium marker density (on average 0.016 and 0.018 for GBLUP and BayesRR-RC, respectively). When functional annotation was used, we observed slightly higher reliabilities with an extension of GBLUP that included multiple polygenic terms (one per functional group), while reliabilities decreased with BayesRR-RC. We then used large subsets of variants selected based on functional information or with a linkage disequilibrium (LD) pruning approach, which allowed us to evaluate two additional approaches, BayesCπ and Bayesian Sparse Linear Mixed Model (BSLMM). Reliabilities were higher for these panels than for the WGS data, with the highest accuracies obtained when markers were selected based on functional information. In our setting, BSLMM systematically achieved higher reliabilities than other methods. GS with large panels of functional variants selected from WGS data allowed a significant increase in reliability compared to the official genomic evaluation approach. However, the benefits of using WGS and functional data remained modest, indicating that there is still room for improvement, for example by further refining the functional annotation in the BBC breed.
大量全基因组测序个体的可用性,结合功能注释,有望为提高基因组选择(GS)的准确性提供机会。然而,这些好处在最初的应用中并不经常被观察到。比利时蓝牛(BBC)中GS的参考种群继续增长。结合已测序个体参考面板的可用性,它提供了利用全基因组序列(WGS)数据和功能注释来评估GS模型的机会。在这里,我们使用了16,508头奶牛的数据,这些数据具有5种肌肉发育性状的表型,并在WGS水平上进行了估算,结合了从实验数据中获得的计算机功能注释和推测的调节变异目录。我们使用完整的WGS数据评估了第一个GS模型,有或没有功能注释。在这个标记密度下,我们能够运行两种方法,假设一个高多基因结构(GBLUP)或允许一些变体有更大的影响(BayesRR-RC,贝叶斯混合模型),并观察到与中等标记密度下的官方GBLUP模型相比,可靠性有所提高(GBLUP和BayesRR-RC的平均可靠性分别为0.016和0.018)。当使用功能注释时,我们观察到使用包含多个多基因术语(每个功能组一个)的GBLUP扩展的可靠性略高,而使用BayesRR-RC的可靠性降低。然后,我们使用基于功能信息或链接不平衡(LD)修剪方法选择的变体的大子集,这使我们能够评估另外两种方法,bayescc π和贝叶斯稀疏线性混合模型(BSLMM)。这些面板的可靠性高于WGS数据,根据功能信息选择标记时获得的准确性最高。在我们的设置中,BSLMM系统地获得了比其他方法更高的可靠性。与官方基因组评估方法相比,从WGS数据中选择大量功能变异的GS可以显著提高可靠性。然而,使用WGS和功能数据的好处仍然有限,这表明仍有改进的空间,例如,可以进一步完善BBC品种中的功能注释。
{"title":"Evaluation of genomic selection models using whole genome sequence data and functional annotation in Belgian Blue cattle","authors":"Can Yuan, Alain Gillon, José Luis Gualdrón Duarte, Haruko Takeda, Wouter Coppieters, Michel Georges, Tom Druet","doi":"10.1186/s12711-025-00955-5","DOIUrl":"https://doi.org/10.1186/s12711-025-00955-5","url":null,"abstract":"The availability of large cohorts of whole-genome sequenced individuals, combined with functional annotation, is expected to provide opportunities to improve the accuracy of genomic selection (GS). However, such benefits have not often been observed in initial applications. The reference population for GS in Belgian Blue Cattle (BBC) continues to grow. Combined with the availability of reference panels of sequenced individuals, it provides an opportunity to evaluate GS models using whole genome sequence (WGS) data and functional annotation. Here, we used data from 16,508 cows, with phenotypes for five muscular development traits and imputed at the WGS level, in combination with in silico functional annotation and catalogs of putative regulatory variants obtained from experimental data. We evaluated first GS models using the entire WGS data, with or without functional annotation. At this marker density, we were able to run two approaches, assuming either a highly polygenic architecture (GBLUP) or allowing some variants to have larger effects (BayesRR-RC, a Bayesian mixture model), and observed an increased reliability compared to the official GBLUP model at medium marker density (on average 0.016 and 0.018 for GBLUP and BayesRR-RC, respectively). When functional annotation was used, we observed slightly higher reliabilities with an extension of GBLUP that included multiple polygenic terms (one per functional group), while reliabilities decreased with BayesRR-RC. We then used large subsets of variants selected based on functional information or with a linkage disequilibrium (LD) pruning approach, which allowed us to evaluate two additional approaches, BayesCπ and Bayesian Sparse Linear Mixed Model (BSLMM). Reliabilities were higher for these panels than for the WGS data, with the highest accuracies obtained when markers were selected based on functional information. In our setting, BSLMM systematically achieved higher reliabilities than other methods. GS with large panels of functional variants selected from WGS data allowed a significant increase in reliability compared to the official genomic evaluation approach. However, the benefits of using WGS and functional data remained modest, indicating that there is still room for improvement, for example by further refining the functional annotation in the BBC breed.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"34 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2025-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143538512","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Genome-wide association analysis using multiple Atlantic salmon populations 使用多个大西洋鲑鱼种群的全基因组关联分析
IF 4.1 1区 农林科学 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE Pub Date : 2025-02-27 DOI: 10.1186/s12711-025-00959-1
Afees A. Ajasa, Hans M. Gjøen, Solomon A. Boison, Marie Lillehammer
In a previous study, we found low persistence of linkage disequilibrium (LD) phase across breeding populations of Atlantic salmon. Accordingly, we observed no increase in accuracy from combining these populations for genomic prediction. In this study, we aimed to examine if the same were true for detection power in genome-wide association studies (GWAS), in terms of reduction in p-values, and if the precision of mapping quantitative trait loci (QTL) would improve from such analysis. Since individual records may not always be available, e.g. due to proprietorship or confidentiality, we also compared mega-analysis and meta-analysis. Mega-analysis needs access to all individual records, whereas meta-analysis utilizes parameters, such as p-values or allele substitution effects, from multiple studies or populations. Furthermore, different methods for determining the presence or absence of independent or secondary signals, such as conditional association analysis, approximate conditional and joint analysis (COJO), and the clumping approach, were assessed. Mega-analysis resulted in increased detection power, in terms of reduction in p-values, and increased precision, compared to the within-population GWAS. Only one QTL was detected using conditional association analysis, both within populations and in mega-analysis, while the number of QTL detected with COJO and the clumping approach ranged from 1 to 19. The allele substitution effect and -log10p-values obtained from mega-analysis were highly correlated with the corresponding values from various meta-analysis methods. Compared to mega-analysis, a higher detection power and reduced precision were obtained with the meta-analysis methods. Our results show that combining multiple datasets or populations in a mega-analysis can increase detection power and mapping precision. With meta-analysis, a higher detection power was obtained compared to mega-analysis. However, care must be taken in the interpretation of the meta-analysis results from multiple populations because their test statistics might be inflated due to population structure or cryptic relatedness.
在之前的一项研究中,我们发现大西洋鲑繁殖种群间的连锁不平衡(LD)相位持续性较低。因此,我们发现结合这些种群进行基因组预测的准确性并没有提高。在这项研究中,我们的目的是考察全基因组关联研究(GWAS)的检测能力是否也是如此,即 p 值是否会降低,以及绘制定量性状位点(QTL)的精确度是否会从此类分析中得到提高。由于个人记录不一定总能获得,例如由于所有权或保密原因,我们还对巨型分析和元分析进行了比较。巨量分析需要获取所有的个体记录,而元分析则利用多个研究或群体的参数,如 p 值或等位基因替代效应。此外,还评估了确定是否存在独立或次要信号的不同方法,如条件关联分析、近似条件和联合分析(COJO)以及聚类方法。与群体内 GWAS 相比,巨量分析在降低 p 值方面提高了检测能力,并提高了精确度。在种群内和大规模分析中,使用条件关联分析只检测到一个 QTL,而使用 COJO 和聚类方法检测到的 QTL 数量从 1 个到 19 个不等。大型分析得出的等位基因替代效应和-log10p-值与各种元分析方法得出的相应值高度相关。与巨量分析相比,荟萃分析方法的检测能力更高,精确度更低。我们的研究结果表明,在超大规模分析中结合多个数据集或人群可以提高检测能力和绘图精度。与超大规模分析相比,元分析的检测能力更高。然而,在解释来自多个种群的荟萃分析结果时必须小心谨慎,因为种群结构或隐性亲缘关系可能会导致测试统计量膨胀。
{"title":"Genome-wide association analysis using multiple Atlantic salmon populations","authors":"Afees A. Ajasa, Hans M. Gjøen, Solomon A. Boison, Marie Lillehammer","doi":"10.1186/s12711-025-00959-1","DOIUrl":"https://doi.org/10.1186/s12711-025-00959-1","url":null,"abstract":"In a previous study, we found low persistence of linkage disequilibrium (LD) phase across breeding populations of Atlantic salmon. Accordingly, we observed no increase in accuracy from combining these populations for genomic prediction. In this study, we aimed to examine if the same were true for detection power in genome-wide association studies (GWAS), in terms of reduction in p-values, and if the precision of mapping quantitative trait loci (QTL) would improve from such analysis. Since individual records may not always be available, e.g. due to proprietorship or confidentiality, we also compared mega-analysis and meta-analysis. Mega-analysis needs access to all individual records, whereas meta-analysis utilizes parameters, such as p-values or allele substitution effects, from multiple studies or populations. Furthermore, different methods for determining the presence or absence of independent or secondary signals, such as conditional association analysis, approximate conditional and joint analysis (COJO), and the clumping approach, were assessed. Mega-analysis resulted in increased detection power, in terms of reduction in p-values, and increased precision, compared to the within-population GWAS. Only one QTL was detected using conditional association analysis, both within populations and in mega-analysis, while the number of QTL detected with COJO and the clumping approach ranged from 1 to 19. The allele substitution effect and -log10p-values obtained from mega-analysis were highly correlated with the corresponding values from various meta-analysis methods. Compared to mega-analysis, a higher detection power and reduced precision were obtained with the meta-analysis methods. Our results show that combining multiple datasets or populations in a mega-analysis can increase detection power and mapping precision. With meta-analysis, a higher detection power was obtained compared to mega-analysis. However, care must be taken in the interpretation of the meta-analysis results from multiple populations because their test statistics might be inflated due to population structure or cryptic relatedness.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"210 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2025-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143506911","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The ability to race barefoot is a heritable trait in Standardbred and Coldblooded trotters 赤足赛跑的能力是标准品种和冷血蹄马的遗传特征
IF 4.1 1区 农林科学 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE Pub Date : 2025-02-25 DOI: 10.1186/s12711-025-00958-2
Paulina Berglund, Sreten Andonov, Anna Jansson, Christina Olsson, Therese Lundqvist, Erling Strandberg, Susanne Eriksson
In equine sports, shoes are used to protect the hooves from wear and tear. In Swedish trotting races, pulling off the shoes to race barefoot is popular because it improves racing time. Good hoof quality is essential for high-performance horses, but not all trotting horses have hooves that tolerate barefoot racing. The ability to race barefoot is a complex trait that is known to be influenced by environmental factors, but the genetic basis of this trait has not been studied. The aim of this study was to estimate genetic parameters and correlations between estimated breeding values for three novel traits: two related to the proportion of barefoot races and “barefoot status”, a binary trait that reflects the probability of racing unshod in a race, in Swedish Standardbred trotters (SB) and Swedish-Norwegian Coldblooded trotters (CB). For the two traits describing the proportion of barefoot races, single-trait mixed linear animal models were used to estimate variance components for up to 24,958 SB and up to 4050 CB. Estimates of heritability ranged from 0.17 to 0.28. For barefoot status, a binary trait with repeated measurements, 875,056 observations from 25,973 SB, and 93,376 observations from 3384 CB were included. Using a single-trait mixed animal threshold model estimates of heritability for barefoot status were 0.07 and 0.08. The Pearson correlation coefficient between the estimated breeding values for barefoot status and each of the traits describing the proportion of barefoot races for breeding stallions was 0.63 and 0.64 for SB and 0.82 and 0.76 for CB. The traits analyzed reflecting the ability to race barefoot are heritable, with the traits for the proportion of barefoot races showing higher heritability estimates for both breeds than barefoot status. Estimated breeding values for breeding stallions were moderately to strongly correlated for the three traits. The average accuracy of estimated breeding values for breeding stallions was moderate to high for all traits. To breed for the ability to race barefoot, further studies on the genetic correlation of the ability to race barefoot with performance traits and the impact of racing barefoot on career length, are necessary.
在马的运动中,鞋子是用来保护蹄子免受磨损的。在瑞典的小跑比赛中,脱鞋赤脚比赛很受欢迎,因为这样可以缩短比赛时间。良好的蹄质量对于高性能的马来说是必不可少的,但并不是所有的小跑马都有能够忍受光脚比赛的蹄。赤脚比赛的能力是一种复杂的特征,已知会受到环境因素的影响,但这种特征的遗传基础尚未得到研究。本研究的目的是估计三种新性状的遗传参数和估计育种值之间的相关性:两种性状与赤足赛跑的比例和“赤足状态”有关,这是一种反映在比赛中不穿鞋的可能性的二元性状,在瑞典标准种马驹(SB)和瑞典-挪威冷血马驹(CB)中。对于描述赤足人种比例的两个性状,采用单性状混合线性动物模型估计方差分量,最大方差为24,958 SB,最大方差为4050 CB。遗传率估计在0.17到0.28之间。对于赤脚状态,一个重复测量的二元特征,包括来自25,973个SB的875,056个观测值和来自3384个CB的93,376个观测值。利用单性状混合动物阈值模型估计赤足状态的遗传率分别为0.07和0.08。赤足状态的估计育种值与描述赤足种所占比例的各性状之间的Pearson相关系数分别为:SB为0.63和0.64,CB为0.82和0.76。所分析的反映赤足赛跑能力的性状是可遗传的,赤足赛跑比例的性状对两个品种的遗传率估计都高于赤足赛跑状态。这三个性状对种马的估计育种值呈中至强相关。所有性状的种马育种值估计值的平均准确度均为中高。为了培养赤脚比赛的能力,有必要进一步研究赤脚比赛能力与表现特征的遗传相关性以及赤脚比赛对职业生涯长度的影响。
{"title":"The ability to race barefoot is a heritable trait in Standardbred and Coldblooded trotters","authors":"Paulina Berglund, Sreten Andonov, Anna Jansson, Christina Olsson, Therese Lundqvist, Erling Strandberg, Susanne Eriksson","doi":"10.1186/s12711-025-00958-2","DOIUrl":"https://doi.org/10.1186/s12711-025-00958-2","url":null,"abstract":"In equine sports, shoes are used to protect the hooves from wear and tear. In Swedish trotting races, pulling off the shoes to race barefoot is popular because it improves racing time. Good hoof quality is essential for high-performance horses, but not all trotting horses have hooves that tolerate barefoot racing. The ability to race barefoot is a complex trait that is known to be influenced by environmental factors, but the genetic basis of this trait has not been studied. The aim of this study was to estimate genetic parameters and correlations between estimated breeding values for three novel traits: two related to the proportion of barefoot races and “barefoot status”, a binary trait that reflects the probability of racing unshod in a race, in Swedish Standardbred trotters (SB) and Swedish-Norwegian Coldblooded trotters (CB). For the two traits describing the proportion of barefoot races, single-trait mixed linear animal models were used to estimate variance components for up to 24,958 SB and up to 4050 CB. Estimates of heritability ranged from 0.17 to 0.28. For barefoot status, a binary trait with repeated measurements, 875,056 observations from 25,973 SB, and 93,376 observations from 3384 CB were included. Using a single-trait mixed animal threshold model estimates of heritability for barefoot status were 0.07 and 0.08. The Pearson correlation coefficient between the estimated breeding values for barefoot status and each of the traits describing the proportion of barefoot races for breeding stallions was 0.63 and 0.64 for SB and 0.82 and 0.76 for CB. The traits analyzed reflecting the ability to race barefoot are heritable, with the traits for the proportion of barefoot races showing higher heritability estimates for both breeds than barefoot status. Estimated breeding values for breeding stallions were moderately to strongly correlated for the three traits. The average accuracy of estimated breeding values for breeding stallions was moderate to high for all traits. To breed for the ability to race barefoot, further studies on the genetic correlation of the ability to race barefoot with performance traits and the impact of racing barefoot on career length, are necessary.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"3 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2025-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143485956","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mendelian randomisation to uncover causal associations between conformation, metabolism, and production as potential exposure to reproduction in German Holstein dairy cattle 孟德尔随机化以揭示德国荷斯坦奶牛的构象、代谢和产量之间的因果关系,作为潜在的生殖暴露
IF 4.1 1区 农林科学 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE Pub Date : 2025-02-25 DOI: 10.1186/s12711-025-00950-w
Leopold Schwarz, Johannes Heise, Zengting Liu, Jörn Bennewitz, Georg Thaller, Jens Tetens
Reproduction is vital to welfare, health, and economics in animal husbandry and breeding. Health and reproduction are increasingly being considered because of the observed genetic correlations between reproduction, health, conformation, and performance traits in dairy cattle. Understanding the detailed genetic architecture underlying these traits would represent a major step in comprehending their interplay. Identifying known, putative or novel associations in genomics could improve animal health, welfare, and performance while allowing further adjustments in animal breeding. We conducted genome-wide association studies for 25 different traits belonging to four different complexes, namely reproduction (n = 13), conformation (n = 6), production (n = 3), and metabolism (n = 3), using a cohort of over 235,000 dairy cows. As a result, we identified genome-wide significant signals for all the studied traits. The obtained summary statistics collected served as the input for a Mendelian randomisation approach (GSMR) to infer causal associations between putative exposure and reproduction traits. The study considered conformation, production, and metabolism as exposure and reproduction as outcome. A range of 139 to 252 genome-wide significant SNPs per combination were identified as instrumental variables (IVs). Out of 156 trait combinations, 135 demonstrated statistically significant effects, thereby enabling the identification of the responsible IVs. Combinations of traits related to metabolism (38 out of 39), conformation (68 out of 78), or production (29 out of 39) were found to have significant effects on reproduction. These relationships were partially non-linear. Moreover, a separate variance component estimation supported these findings, strongly correlating with the GSMR results and offering suggestions for improvement. Downstream analyses of selected representative traits per complex resulted in identifying and investigating potential physiological mechanisms. Notably, we identified both trait-specific SNPs and genes that appeared to influence specific traits per complex, as well as more general SNPs that were common between exposure and outcome traits. Our study confirms the known genetic associations between reproduction traits and the three complexes tested. It provides new insights into causality, indicating a non-linear relationship between conformation and reproduction. In addition, the downstream analyses have identified several clustered genes that may mediate this association.
在畜牧业和养殖业中,繁殖对福利、健康和经济至关重要。由于观察到奶牛的繁殖、健康、构象和生产性能特征之间存在遗传相关性,健康和繁殖正日益受到重视。了解这些特征背后的详细遗传结构,将是理解它们之间相互作用的重要一步。确定基因组学中已知的、假定的或新的关联可以改善动物的健康、福利和性能,同时允许进一步调整动物育种。我们使用超过23.5万头奶牛进行了25种不同性状的全基因组关联研究,这些性状属于四个不同的复合体,即繁殖(n = 13)、构象(n = 6)、生产(n = 3)和代谢(n = 3)。结果,我们确定了所有研究性状的全基因组显著信号。收集到的汇总统计数据作为孟德尔随机化方法(GSMR)的输入,以推断假定暴露与生殖性状之间的因果关系。该研究将构象、生产和代谢视为暴露和繁殖作为结果。每个组合的139至252个全基因组显著snp范围被确定为工具变量(IVs)。在156个性状组合中,135个表现出统计学上显著的影响,从而能够识别出负责任的IVs。与代谢(39个中的38个)、构象(78个中的68个)或产量(39个中的29个)相关的性状组合被发现对繁殖有显著影响。这些关系部分是非线性的。此外,一个单独的方差分量估计支持这些发现,与GSMR结果强烈相关,并提供改进建议。对每个复合体选定的代表性性状进行下游分析,从而确定和研究潜在的生理机制。值得注意的是,我们确定了性状特异性snp和似乎影响每个复合物特定性状的基因,以及暴露和结果性状之间常见的更一般的snp。我们的研究证实了已知的生殖性状和测试的三种复合物之间的遗传关联。它为因果关系提供了新的见解,表明构象和繁殖之间存在非线性关系。此外,下游分析已经确定了几个可能介导这种关联的聚集基因。
{"title":"Mendelian randomisation to uncover causal associations between conformation, metabolism, and production as potential exposure to reproduction in German Holstein dairy cattle","authors":"Leopold Schwarz, Johannes Heise, Zengting Liu, Jörn Bennewitz, Georg Thaller, Jens Tetens","doi":"10.1186/s12711-025-00950-w","DOIUrl":"https://doi.org/10.1186/s12711-025-00950-w","url":null,"abstract":"Reproduction is vital to welfare, health, and economics in animal husbandry and breeding. Health and reproduction are increasingly being considered because of the observed genetic correlations between reproduction, health, conformation, and performance traits in dairy cattle. Understanding the detailed genetic architecture underlying these traits would represent a major step in comprehending their interplay. Identifying known, putative or novel associations in genomics could improve animal health, welfare, and performance while allowing further adjustments in animal breeding. We conducted genome-wide association studies for 25 different traits belonging to four different complexes, namely reproduction (n = 13), conformation (n = 6), production (n = 3), and metabolism (n = 3), using a cohort of over 235,000 dairy cows. As a result, we identified genome-wide significant signals for all the studied traits. The obtained summary statistics collected served as the input for a Mendelian randomisation approach (GSMR) to infer causal associations between putative exposure and reproduction traits. The study considered conformation, production, and metabolism as exposure and reproduction as outcome. A range of 139 to 252 genome-wide significant SNPs per combination were identified as instrumental variables (IVs). Out of 156 trait combinations, 135 demonstrated statistically significant effects, thereby enabling the identification of the responsible IVs. Combinations of traits related to metabolism (38 out of 39), conformation (68 out of 78), or production (29 out of 39) were found to have significant effects on reproduction. These relationships were partially non-linear. Moreover, a separate variance component estimation supported these findings, strongly correlating with the GSMR results and offering suggestions for improvement. Downstream analyses of selected representative traits per complex resulted in identifying and investigating potential physiological mechanisms. Notably, we identified both trait-specific SNPs and genes that appeared to influence specific traits per complex, as well as more general SNPs that were common between exposure and outcome traits. Our study confirms the known genetic associations between reproduction traits and the three complexes tested. It provides new insights into causality, indicating a non-linear relationship between conformation and reproduction. In addition, the downstream analyses have identified several clustered genes that may mediate this association.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"14 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2025-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143485964","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Genetics Selection Evolution
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1