首页 > 最新文献

Genetics Selection Evolution最新文献

英文 中文
Quantitative genetic analysis of late spring mortality in triploid Crassostrea virginica 三倍体长春花晚春死亡的定量遗传分析
IF 4.1 1区 农林科学 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE Pub Date : 2025-04-09 DOI: 10.1186/s12711-025-00965-3
Joseph L. Matt, Jessica Moss Small, Peter D. Kube, Standish K. Allen
Triploid oysters, bred by crossing tetraploid and diploid oysters, are common worldwide in commercial oyster aquaculture and make up much of the hatchery-produced Crassostrea virginica farmed in the mid-Atlantic and southeast of the United States. Breeding diploid and tetraploid animals for genetic improvement of triploid progeny is unique to oysters and can proceed via several possible breeding strategies. Triploid oysters, along with their diploid or tetraploid relatives, have yet been subject to quantitative genetic analyses that could inform a breeding strategy of triploid improvement. The importance of quantitative genetic analyses involving triploid C. virginica has been emphasized by the occurrence of mortality events of near-market sized triploids in late spring. Genetic parameters for survival and weight of triploid and tetraploid C. virginica were estimated from twenty paternal half-sib triploid families and thirty-nine full-sib tetraploid families reared at three sites in the Chesapeake Bay (USA). Traits were analyzed using linear mixed models in ASReml-R. Genetic relationship matrices appropriate for pedigrees with triploid and tetraploid animals were produced using the polyAinv package in R. A mortality event in triploids occurred at one site located on the bayside of the Eastern Shore of Virginia. Between early May and early July, three triploid families had survival of less than 0.70, while most had survival greater than 0.90. The heritability for survival during this period in triploids at this affected site was 0.57 ± 0.23. Triploid survival at the affected site was adversely related to triploid survival at the low salinity site (− 0.50 ± 0.23) and unrelated to tetraploid survival at the site with similar salinity (0.05 ± 0.39). Survival during a late spring mortality event in triploids had a substantial additive genetic basis, suggesting selective breeding of tetraploids can reduce triploid mortalities. Genetic correlations revealed evidence of genotype by environment interactions for triploid survival and weak genetic correlations between survival of tetraploids and triploids. A selective breeding strategy with phenotyping of tetraploid and triploid half-sibs is recommended for genetic improvement of triploid oysters.
三倍体牡蛎是由四倍体和二倍体牡蛎杂交而成的,在世界范围内的商业牡蛎养殖中很常见,在大西洋中部和美国东南部的孵化场养殖的珍珠贝中占很大比例。繁殖二倍体和四倍体动物以遗传改良三倍体后代是牡蛎特有的,可以通过几种可能的育种策略进行。三倍体牡蛎,以及它们的二倍体或四倍体亲属,还需要进行定量遗传分析,以便为三倍体改良的育种策略提供信息。晚春三倍体近市场大小三倍体的死亡事件的发生强调了三倍体锦葵定量遗传分析的重要性。对美国切萨皮克湾(Chesapeake Bay) 3个地点饲养的20个父系半兄弟三倍体家族和39个全兄弟四倍体家族进行了三倍体和四倍体弗吉尼亚锦鲤的存活率和体重遗传参数的估算。采用线性混合模型对ASReml-R性状进行分析。利用polyAinv包在R.建立了适合于三倍体和四倍体动物家系的亲缘关系矩阵。5月初至7月初,3个三倍体家庭的成活率低于0.70,而大多数三倍体家庭的成活率大于0.90。三倍体在这一时期的生存遗传率为0.57±0.23。患病部位的三倍体存活率与低盐度部位的三倍体存活率呈负相关(- 0.50±0.23),与相似盐度部位的四倍体存活率无关(0.05±0.39)。在晚春死亡事件中,三倍体的存活具有大量的加性遗传基础,表明四倍体的选择性育种可以降低三倍体的死亡率。遗传相关揭示了环境相互作用对三倍体存活的基因型影响,四倍体和三倍体存活的遗传相关性较弱。建议采用四倍体和三倍体半姐妹的选择育种策略进行三倍体牡蛎的遗传改良。
{"title":"Quantitative genetic analysis of late spring mortality in triploid Crassostrea virginica","authors":"Joseph L. Matt, Jessica Moss Small, Peter D. Kube, Standish K. Allen","doi":"10.1186/s12711-025-00965-3","DOIUrl":"https://doi.org/10.1186/s12711-025-00965-3","url":null,"abstract":"Triploid oysters, bred by crossing tetraploid and diploid oysters, are common worldwide in commercial oyster aquaculture and make up much of the hatchery-produced Crassostrea virginica farmed in the mid-Atlantic and southeast of the United States. Breeding diploid and tetraploid animals for genetic improvement of triploid progeny is unique to oysters and can proceed via several possible breeding strategies. Triploid oysters, along with their diploid or tetraploid relatives, have yet been subject to quantitative genetic analyses that could inform a breeding strategy of triploid improvement. The importance of quantitative genetic analyses involving triploid C. virginica has been emphasized by the occurrence of mortality events of near-market sized triploids in late spring. Genetic parameters for survival and weight of triploid and tetraploid C. virginica were estimated from twenty paternal half-sib triploid families and thirty-nine full-sib tetraploid families reared at three sites in the Chesapeake Bay (USA). Traits were analyzed using linear mixed models in ASReml-R. Genetic relationship matrices appropriate for pedigrees with triploid and tetraploid animals were produced using the polyAinv package in R. A mortality event in triploids occurred at one site located on the bayside of the Eastern Shore of Virginia. Between early May and early July, three triploid families had survival of less than 0.70, while most had survival greater than 0.90. The heritability for survival during this period in triploids at this affected site was 0.57 ± 0.23. Triploid survival at the affected site was adversely related to triploid survival at the low salinity site (− 0.50 ± 0.23) and unrelated to tetraploid survival at the site with similar salinity (0.05 ± 0.39). Survival during a late spring mortality event in triploids had a substantial additive genetic basis, suggesting selective breeding of tetraploids can reduce triploid mortalities. Genetic correlations revealed evidence of genotype by environment interactions for triploid survival and weak genetic correlations between survival of tetraploids and triploids. A selective breeding strategy with phenotyping of tetraploid and triploid half-sibs is recommended for genetic improvement of triploid oysters.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"108 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2025-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143813775","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Analysis of different genotyping and selection strategies in laying hen breeding programs 蛋鸡育种中不同基因分型及选择策略的分析
IF 4.1 1区 农林科学 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE Pub Date : 2025-04-07 DOI: 10.1186/s12711-025-00948-4
Lisa Büttgen, Henner Simianer, Torsten Pook
Genomic selection has become an integral component of modern animal breeding programs, having the potential to improve the efficiency of layer breeding programs both by obtaining higher prediction accuracies and reducing the generation interval, particularly for males, who cannot be phenotyped for sex-limited traits such as laying performance. In the current study, we investigate different strategies to reduce the generation interval either for both sexes or only for the male side of the breeding scheme based on stochastic simulation using the software MoBPS. Additionally, prediction accuracies based on varying proportions of genotyping and phenotype- and pedigree-based selection as well as genomic breeding values are compared. Selection of hens based on estimated breeding values, either pedigree-based or genomic, increased genetic gain compared to selection based on phenotypes only. The use of two time-shifted subpopulations with exchange of males between subpopulations to reduce the generation interval on the male side led to significantly higher genetic gains. Reducing the generation interval for both males and females was only efficient when population sizes were maintained, which result in doubling of the number of females to genotype and phenotype within the same time frame compared to the scenarios with the longer generation intervals. Although substantially higher gains were obtained by in particular pedigree-based selection of females and a reduction of generation intervals this led to substantially greater rates of inbreeding per year. The use of a genomic relationship matrix in breeding value estimation instead of a pedigree-based relationship matrix not only increased genetic gains but also reduced inbreeding rates. The use of optimum contribution selection led to basically the same genetic gains as without it but reduced inbreeding rates. However, overall differences obtained with optimal contribution selection were small compared to differences caused by the other effects that were considered. The reduction of the generation interval on the male side by the use of genomic estimated breeding values was highly beneficial. Reduction of the generation interval on the female side was only beneficial when a high proportion of hens was genotyped and housing capacities were increased. On the female side of a layer breeding program, selection based on pedigree-based estimated breeding values was inferior to phenotypic selection, as it resulted in a substantial increase in inbreeding rates.
基因组选择已经成为现代动物育种计划的一个组成部分,有可能通过获得更高的预测精度和缩短世代间隔来提高蛋鸡育种计划的效率,特别是对于雄性,因为它们不能在性别限制的性状(如产蛋性能)上表型化。在本研究中,我们利用MoBPS软件进行随机模拟,研究了不同的策略来减少繁殖方案中两性或仅雄性的世代间隔。此外,还比较了基于不同比例的基因分型、表型和基于家系的选择以及基因组育种值的预测准确性。与仅基于表型的选择相比,基于估计育种价值(无论是基于系谱还是基因组)的母鸡选择增加了遗传增益。利用两个时移亚群体,在亚群体之间交换雄性,减少雄性侧的世代间隔,可显著提高遗传收益。减小雄性和雌性的世代间隔只有在种群规模保持不变的情况下才有效,这导致在同一时间框架内具有基因型和表型的雌性数量比具有较长世代间隔的情况增加一倍。尽管通过以系谱为基础的雌性选择和世代间隔的缩短获得了更高的收益,但这导致了每年近亲繁殖率的大幅提高。在育种价值估计中使用基因组关系矩阵代替基于家系的关系矩阵不仅增加了遗传收益,而且降低了近交率。最优贡献选择的使用导致了与不使用它基本相同的遗传收益,但降低了近交率。然而,与考虑的其他影响所造成的差异相比,最优贡献选择所获得的总体差异很小。利用基因组估计育种值减少雄性侧的世代间隔是非常有益的。只有在高比例的母鸡进行基因分型和增加鸡舍容量的情况下,雌性侧的产蛋期缩短才有益。在蛋鸡育种计划的雌性方面,基于系谱的估计育种值的选择不如表型选择,因为它导致近交率的大幅增加。
{"title":"Analysis of different genotyping and selection strategies in laying hen breeding programs","authors":"Lisa Büttgen, Henner Simianer, Torsten Pook","doi":"10.1186/s12711-025-00948-4","DOIUrl":"https://doi.org/10.1186/s12711-025-00948-4","url":null,"abstract":"Genomic selection has become an integral component of modern animal breeding programs, having the potential to improve the efficiency of layer breeding programs both by obtaining higher prediction accuracies and reducing the generation interval, particularly for males, who cannot be phenotyped for sex-limited traits such as laying performance. In the current study, we investigate different strategies to reduce the generation interval either for both sexes or only for the male side of the breeding scheme based on stochastic simulation using the software MoBPS. Additionally, prediction accuracies based on varying proportions of genotyping and phenotype- and pedigree-based selection as well as genomic breeding values are compared. Selection of hens based on estimated breeding values, either pedigree-based or genomic, increased genetic gain compared to selection based on phenotypes only. The use of two time-shifted subpopulations with exchange of males between subpopulations to reduce the generation interval on the male side led to significantly higher genetic gains. Reducing the generation interval for both males and females was only efficient when population sizes were maintained, which result in doubling of the number of females to genotype and phenotype within the same time frame compared to the scenarios with the longer generation intervals. Although substantially higher gains were obtained by in particular pedigree-based selection of females and a reduction of generation intervals this led to substantially greater rates of inbreeding per year. The use of a genomic relationship matrix in breeding value estimation instead of a pedigree-based relationship matrix not only increased genetic gains but also reduced inbreeding rates. The use of optimum contribution selection led to basically the same genetic gains as without it but reduced inbreeding rates. However, overall differences obtained with optimal contribution selection were small compared to differences caused by the other effects that were considered. The reduction of the generation interval on the male side by the use of genomic estimated breeding values was highly beneficial. Reduction of the generation interval on the female side was only beneficial when a high proportion of hens was genotyped and housing capacities were increased. On the female side of a layer breeding program, selection based on pedigree-based estimated breeding values was inferior to phenotypic selection, as it resulted in a substantial increase in inbreeding rates.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"6 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2025-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143790194","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SIRE 2.0: a novel method for estimating polygenic host effects underlying infectious disease transmission, and analytical expressions for prediction accuracies SIRE 2.0:估算传染病传播所依赖的多基因宿主效应的新方法,以及预测精度的分析表达式
IF 4.1 1区 农林科学 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE Pub Date : 2025-04-01 DOI: 10.1186/s12711-025-00956-4
Christopher M. Pooley, Glenn Marion, Jamie Prentice, Ricardo Pong-Wong, Stephen C. Bishop, Andrea Doeschl-Wilson
Genetic selection of individuals that are less susceptible to infection, less infectious once infected, and recover faster, offers an effective and long-lasting solution to reduce the incidence and impact of infectious diseases in farmed animals. However, computational methods for simultaneously estimating genetic parameters for host susceptibility, infectivity and recoverability from real-word data have been lacking. Our previously developed methodology and software tool SIRE 1.0 (Susceptibility, Infectivity and Recoverability Estimator) allows estimation of host genetic effects of a single nucleotide polymorphism (SNP), or other fixed effects (e.g. breed, vaccination status), for these three host traits using individual disease data typically available from field studies and challenge experiments. SIRE 1.0, however, lacks the capability to estimate genetic parameters for these traits in the likely case of underlying polygenic control. This paper introduces novel Bayesian methodology and a new software tool SIRE 2.0 for estimating polygenic contributions (i.e. variance components and additive genetic effects) for host susceptibility, infectivity and recoverability from temporal epidemic data, assuming that pedigree or genomic relationships are known. Analytical expressions for prediction accuracies (PAs) for these traits are derived for simplified scenarios, revealing their dependence on genetic and phenotypic variances, and the distribution of related individuals within and between contact groups. PAs for infectivity are found to be critically dependent on the size of contact groups. Validation of the methodology with data from simulated epidemics demonstrates good agreement between numerically generated PAs and analytical predictions. Genetic correlations between infectivity and other traits substantially increase trait PAs. Incomplete data (e.g. time censored or infrequent sampling) generally yield only small reductions in PAs, except for when infection times are completely unknown, which results in a substantial reduction. The method presented can estimate genetic parameters for host susceptibility, infectivity and recoverability from individual disease records. The freely available SIRE 2.0 software provides a valuable extension to SIRE 1.0 for estimating host polygenic effects underlying infectious disease transmission. This tool will open up new possibilities for analysis and quantification of genetic determinates of disease dynamics.
对不易受感染、感染后传染性较低、恢复较快的个体进行遗传选择,为减少家畜传染病的发病率和影响提供了有效和持久的解决方案。然而,从实际数据中同时估计宿主易感性、传染性和可恢复性遗传参数的计算方法一直缺乏。我们之前开发的方法和软件工具SIRE 1.0(易感性,传染性和可恢复性估计器)允许使用通常从实地研究和挑战实验中获得的个体疾病数据来估计单核苷酸多态性(SNP)或其他固定效应(例如品种,接种状态)对这三种宿主性状的遗传影响。然而,在潜在多基因控制的可能情况下,SIRE 1.0缺乏估计这些性状遗传参数的能力。本文介绍了新的贝叶斯方法和新的软件工具SIRE 2.0,用于估计宿主易感性,传染性和可恢复性的多基因贡献(即方差成分和加性遗传效应),假设谱系或基因组关系是已知的。推导了这些性状预测精度的解析表达式,揭示了它们对遗传和表型变异的依赖,以及接触群体内部和群体之间相关个体的分布。发现传染性的PAs严重依赖于接触群体的规模。用模拟流行病的数据验证了该方法,表明数值生成的pa与分析预测之间具有良好的一致性。传染性和其他性状之间的遗传相关性大大增加了性状PAs。不完整的数据(例如,时间审查或不频繁的采样)通常只能产生少量的PAs减少,除非感染时间完全未知,否则会导致大量减少。所提出的方法可以从个体疾病记录中估计宿主易感性、传染性和可恢复性的遗传参数。免费提供的SIRE 2.0软件为SIRE 1.0提供了一个有价值的扩展,用于估计传染病传播的宿主多基因效应。该工具将为疾病动力学的遗传决定因素的分析和定量开辟新的可能性。
{"title":"SIRE 2.0: a novel method for estimating polygenic host effects underlying infectious disease transmission, and analytical expressions for prediction accuracies","authors":"Christopher M. Pooley, Glenn Marion, Jamie Prentice, Ricardo Pong-Wong, Stephen C. Bishop, Andrea Doeschl-Wilson","doi":"10.1186/s12711-025-00956-4","DOIUrl":"https://doi.org/10.1186/s12711-025-00956-4","url":null,"abstract":"Genetic selection of individuals that are less susceptible to infection, less infectious once infected, and recover faster, offers an effective and long-lasting solution to reduce the incidence and impact of infectious diseases in farmed animals. However, computational methods for simultaneously estimating genetic parameters for host susceptibility, infectivity and recoverability from real-word data have been lacking. Our previously developed methodology and software tool SIRE 1.0 (Susceptibility, Infectivity and Recoverability Estimator) allows estimation of host genetic effects of a single nucleotide polymorphism (SNP), or other fixed effects (e.g. breed, vaccination status), for these three host traits using individual disease data typically available from field studies and challenge experiments. SIRE 1.0, however, lacks the capability to estimate genetic parameters for these traits in the likely case of underlying polygenic control. This paper introduces novel Bayesian methodology and a new software tool SIRE 2.0 for estimating polygenic contributions (i.e. variance components and additive genetic effects) for host susceptibility, infectivity and recoverability from temporal epidemic data, assuming that pedigree or genomic relationships are known. Analytical expressions for prediction accuracies (PAs) for these traits are derived for simplified scenarios, revealing their dependence on genetic and phenotypic variances, and the distribution of related individuals within and between contact groups. PAs for infectivity are found to be critically dependent on the size of contact groups. Validation of the methodology with data from simulated epidemics demonstrates good agreement between numerically generated PAs and analytical predictions. Genetic correlations between infectivity and other traits substantially increase trait PAs. Incomplete data (e.g. time censored or infrequent sampling) generally yield only small reductions in PAs, except for when infection times are completely unknown, which results in a substantial reduction. The method presented can estimate genetic parameters for host susceptibility, infectivity and recoverability from individual disease records. The freely available SIRE 2.0 software provides a valuable extension to SIRE 1.0 for estimating host polygenic effects underlying infectious disease transmission. This tool will open up new possibilities for analysis and quantification of genetic determinates of disease dynamics.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"34 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143757995","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
High performance imputation of structural and single nucleotide variants using low-coverage whole genome sequencing 使用低覆盖率全基因组测序的结构和单核苷酸变异的高性能插入
IF 4.1 1区 农林科学 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE Pub Date : 2025-03-28 DOI: 10.1186/s12711-025-00962-6
Manu Kumar Gundappa, Diego Robledo, Alastair Hamilton, Ross D. Houston, James G. D. Prendergast, Daniel J. Macqueen
Whole genome sequencing (WGS), despite its advantages, is yet to replace methods for genotyping single nucleotide variants (SNVs) such as SNP arrays and targeted genotyping assays. Structural variants (SVs) have larger effects on traits than SNVs, but are more challenging to accurately genotype. Using low-coverage WGS with genotype imputation offers a cost-effective strategy to achieve genome-wide variant coverage, but is yet to be tested for SVs. Here, we investigate combined SNV and SV imputation with low-coverage WGS data in Atlantic salmon (Salmo salar). As the reference panel, we used genotypes for high-confidence SVs and SNVs for n = 365 wild individuals sampled from diverse populations. We also generated 15 × WGS data (n = 20 samples) for a commercial population external to the reference panel, and called SVs and SNVs with gold-standard approaches. An imputation method selected for its established performance using low-coverage sequencing data (GLIMPSE) was tested at WGS depths of 1 × , 2 × , 3 × , and 4 × for samples within and external to the reference panel. SNVs were imputed with high accuracy and recall across all WGS depths, including for samples out-with the reference panel. For SVs, we compared imputation based purely on linkage disequilibrium (LD) with SNVs, to that supplemented with SV genotype likelihoods (GLs) from low-coverage WGS. Including SV GLs increased imputation accuracy, but as a trade-off with recall, requiring 3–4 × depth for best performance. Combining strategies allowed us to capture 84% of the reference panel deletions with 87% accuracy at 1 × depth. We also show that SV length affects imputation performance, with provision of SV GLs greatly enhancing accuracy for the longest SVs in the dataset. This study highlights the promise of reference panel imputation using low-coverage WGS, including novel opportunities to enhance the resolution of genome-wide association studies by capturing SVs.
尽管全基因组测序(WGS)有其优势,但它尚未取代单核苷酸变异(snv)的基因分型方法,如SNP阵列和靶向基因分型分析。结构变异(SVs)对性状的影响比snv更大,但更难以准确地进行基因分型。使用低覆盖率WGS与基因型插补提供了一种经济有效的策略来实现全基因组变异覆盖,但尚未对sv进行测试。在此,我们利用低覆盖率的WGS数据对大西洋鲑鱼(Salmo salar)的SNV和SV进行了综合估算。作为参考面板,我们对来自不同种群的n = 365个野生个体使用高置信度的SVs和snv基因型。我们还为参考面板外的商业人群生成了15 × WGS数据(n = 20个样本),并使用金标准方法称为SVs和snv。利用低覆盖率测序数据(GLIMPSE)选择一种具有既定性能的插补方法,在参考面板内外样品的WGS深度为1 ×、2 ×、3 ×和4 ×进行测试。snv在所有WGS深度上都具有很高的准确性和召回率,包括与参考面板外的样品。对于SV,我们比较了纯粹基于连锁不平衡(LD)和snv的估算结果,与低覆盖率WGS补充SV基因型可能性(GLs)的估算结果。包括SV GLs增加了输入精度,但作为召回率的权衡,需要3-4倍的深度才能获得最佳性能。组合策略使我们能够在1倍深度下以87%的准确率捕获84%的参考面板删除。我们还表明,SV长度会影响imputation性能,提供SV GLs大大提高了数据集中最长SV的准确性。这项研究强调了使用低覆盖率WGS进行参考面板插入的前景,包括通过捕获sv来提高全基因组关联研究的分辨率的新机会。
{"title":"High performance imputation of structural and single nucleotide variants using low-coverage whole genome sequencing","authors":"Manu Kumar Gundappa, Diego Robledo, Alastair Hamilton, Ross D. Houston, James G. D. Prendergast, Daniel J. Macqueen","doi":"10.1186/s12711-025-00962-6","DOIUrl":"https://doi.org/10.1186/s12711-025-00962-6","url":null,"abstract":"Whole genome sequencing (WGS), despite its advantages, is yet to replace methods for genotyping single nucleotide variants (SNVs) such as SNP arrays and targeted genotyping assays. Structural variants (SVs) have larger effects on traits than SNVs, but are more challenging to accurately genotype. Using low-coverage WGS with genotype imputation offers a cost-effective strategy to achieve genome-wide variant coverage, but is yet to be tested for SVs. Here, we investigate combined SNV and SV imputation with low-coverage WGS data in Atlantic salmon (Salmo salar). As the reference panel, we used genotypes for high-confidence SVs and SNVs for n = 365 wild individuals sampled from diverse populations. We also generated 15 × WGS data (n = 20 samples) for a commercial population external to the reference panel, and called SVs and SNVs with gold-standard approaches. An imputation method selected for its established performance using low-coverage sequencing data (GLIMPSE) was tested at WGS depths of 1 × , 2 × , 3 × , and 4 × for samples within and external to the reference panel. SNVs were imputed with high accuracy and recall across all WGS depths, including for samples out-with the reference panel. For SVs, we compared imputation based purely on linkage disequilibrium (LD) with SNVs, to that supplemented with SV genotype likelihoods (GLs) from low-coverage WGS. Including SV GLs increased imputation accuracy, but as a trade-off with recall, requiring 3–4 × depth for best performance. Combining strategies allowed us to capture 84% of the reference panel deletions with 87% accuracy at 1 × depth. We also show that SV length affects imputation performance, with provision of SV GLs greatly enhancing accuracy for the longest SVs in the dataset. This study highlights the promise of reference panel imputation using low-coverage WGS, including novel opportunities to enhance the resolution of genome-wide association studies by capturing SVs.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"57 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2025-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143723122","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multitrait genome-wide association best linear unbiased prediction of genetic values 多性状全基因组关联对遗传价值的最佳线性无偏预测
IF 4.1 1区 农林科学 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE Pub Date : 2025-03-21 DOI: 10.1186/s12711-025-00964-4
Theo Meuwissen, Vinzent Boerner
The GWABLUP (Genome-Wide Association based Best Linear Unbiased Prediction) approach used GWA analysis results to differentially weigh the SNPs in genomic prediction, and was found to improve the reliabilities of genomic predictions. However, the proposed multitrait GWABLUP method assumed that the SNP weights were the same across the traits. Here we extended and validated the multitrait GWABLUP method towards using trait specific SNP weights. In a 3-trait dairy data set, multitrait GWAS estimates of SNP effects and their standard errors were translated into trait specific likelihood ratios for the SNPs having trait effects, and posterior probabilities using the GWABLUP approach. This produced trait specific prior (co)variance matrices for each SNP, which were applied in a SNP-BLUP model for genomic predictions, implemented in the APEX linear model suite. In a validation population, the trait specific SNP weights resulted in more reliable predictions for all three traits. Especially, for somatic cell count, which was hardly related to the other traits, the use of the same weights across all traits was harming genomic predictions. The use of trait specific SNP weights overcame this problem. In multitrait GWABLUP analyses of ~ 30,000 reference population cows, trait specific SNP weights resulted in up to 13% more reliable genomic predictions than unweighted SNP-BLUP, and improved genomic predictions for all three studied traits.
GWABLUP(基于全基因组关联的最佳线性无偏预测)方法利用GWA分析结果对基因组预测中的snp进行差异性加权,提高了基因组预测的可靠性。然而,所提出的多性状GWABLUP方法假设各性状之间的SNP权重相同。在这里,我们扩展并验证了多性状GWABLUP方法,以使用性状特异性SNP权重。在3个性状的乳制品数据集中,多性状GWAS估计的SNP效应及其标准误差被转化为具有性状效应的SNP的性状特定似然比,以及使用GWABLUP方法的后验概率。这产生了每个SNP的性状特异性先验(co)方差矩阵,这些矩阵应用于SNP- blup模型中进行基因组预测,并在APEX线性模型套件中实现。在验证群体中,性状特异性SNP权重导致对所有三个性状的预测更可靠。特别是,对于与其他性状几乎没有关系的体细胞计数,在所有性状中使用相同的权重会损害基因组预测。使用性状特异性SNP权重克服了这个问题。在对约30,000头参考种群奶牛的多性状GWABLUP分析中,性状特异性SNP权重比未加权SNP- blup预测的可靠性提高了13%,并且改进了对所有三种所研究性状的基因组预测。
{"title":"Multitrait genome-wide association best linear unbiased prediction of genetic values","authors":"Theo Meuwissen, Vinzent Boerner","doi":"10.1186/s12711-025-00964-4","DOIUrl":"https://doi.org/10.1186/s12711-025-00964-4","url":null,"abstract":"The GWABLUP (Genome-Wide Association based Best Linear Unbiased Prediction) approach used GWA analysis results to differentially weigh the SNPs in genomic prediction, and was found to improve the reliabilities of genomic predictions. However, the proposed multitrait GWABLUP method assumed that the SNP weights were the same across the traits. Here we extended and validated the multitrait GWABLUP method towards using trait specific SNP weights. In a 3-trait dairy data set, multitrait GWAS estimates of SNP effects and their standard errors were translated into trait specific likelihood ratios for the SNPs having trait effects, and posterior probabilities using the GWABLUP approach. This produced trait specific prior (co)variance matrices for each SNP, which were applied in a SNP-BLUP model for genomic predictions, implemented in the APEX linear model suite. In a validation population, the trait specific SNP weights resulted in more reliable predictions for all three traits. Especially, for somatic cell count, which was hardly related to the other traits, the use of the same weights across all traits was harming genomic predictions. The use of trait specific SNP weights overcame this problem. In multitrait GWABLUP analyses of ~ 30,000 reference population cows, trait specific SNP weights resulted in up to 13% more reliable genomic predictions than unweighted SNP-BLUP, and improved genomic predictions for all three studied traits.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"61 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2025-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143665977","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Erosion of estimated genomic breeding values with generations is due to long distance associations between markers and QTL 由于标记和QTL之间的长距离关联,估计的基因组育种值随着世代的变化而下降
IF 4.1 1区 农林科学 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE Pub Date : 2025-03-21 DOI: 10.1186/s12711-025-00963-5
Didier Boichard, Sébastien Fritz, Pascal Croiseau, Vincent Ducrocq, Thierry Tribout, Beatriz C. D. Cuyabano
Most validation studies of genomic evaluations on candidates (prior to observing phenotypes) present inflation of their predicted breeding values, i.e., regression coefficients of their later observed phenotypes on the early predictions are smaller than one. The aim of this study was to show that this inflation pattern reflects at least partly long-distance associations between markers and quantitative trait loci (QTL) in the reference population and to propose methods to estimate the corresponding “erosion” coefficient. Across-chromosome linkage disequilibrium (LD) is observed in different dairy cattle breeds, being a result from limited effective population size and from relationships within the reference population. Due to this long distance LD, the estimated SNP effects capture non-zero contributions from distant QTLs, some located on other chromosomes than the SNP itself. Therefore, corresponding SNP effects are partly lost in the next generations and we refer to this loss as “erosion”. With the concept of QTL contribution to SNP effects derived from mixed model equations, we show with simulation that this long range LD explains 6–25% of the variance of the estimated genomic breeding values, a proportion that is unchanged when the evaluation model includes a residual polygenic effect. Two methods are proposed to predict this erosion factor assuming known simulated QTL effects. In Method 1, one generation of progeny is simulated from the reference population and the GEBV of these progeny based on SNP effects estimated in this newly simulated generation are regressed on the GEBV of the same progeny based on SNP effects estimated in the reference population. In Method 2 all the QTL contributions to SNP effects are regressed based on SNP-QTL recombination rates and summed to predict the GEBV at the next generation. The regression coefficient of the GEBV based on eroded contributions on the raw GEBV is also an estimate of erosion. An illustration is given with the French Normande female reference bovine population in 2021, showing erosion factors ranging from 0.84 to 0.87. Accounting for erosion is important to avoid inflation and biased predictions. The ways to both reduce inflation and to correct for it in the prediction are discussed.
大多数候选基因评估的验证研究(在观察表型之前)存在其预测育种值的膨胀,即其后期观察到的表型对早期预测的回归系数小于1。本研究的目的是表明这种膨胀模式至少部分反映了参考群体中标记和数量性状位点(QTL)之间的长距离关联,并提出了估算相应“侵蚀”系数的方法。在不同的奶牛品种中观察到跨染色体连锁不平衡(LD),这是有限的有效群体规模和参考群体内部关系的结果。由于这种长距离LD,估计的SNP效应捕获了远端qtl的非零贡献,其中一些位于其他染色体上,而不是SNP本身。因此,相应的SNP效应在下一代中部分丢失,我们将这种丢失称为“侵蚀”。根据混合模型方程推导出的QTL对SNP效应贡献的概念,我们通过模拟表明,这一长期LD解释了估计基因组育种值方差的6-25%,当评估模型包括残余多基因效应时,这一比例不变。假设已知的模拟QTL效应,提出了两种预测侵蚀因子的方法。在方法1中,从参考群体中模拟一代后代,根据新模拟一代中估计的SNP效应对这些后代的GEBV进行回归,基于参考群体中估计的SNP效应对同一后代的GEBV进行回归。在方法2中,根据SNP-QTL重组率对所有对SNP效应的QTL贡献进行回归,并求和以预测下一代的GEBV。基于侵蚀对原始GEBV贡献的GEBV回归系数也是对侵蚀的估计。以2021年法国诺曼底雌性参考牛种群为例,显示侵蚀因子范围为0.84至0.87。考虑侵蚀对于避免通胀和有偏见的预测很重要。讨论了在预测中减少通货膨胀和修正通货膨胀的方法。
{"title":"Erosion of estimated genomic breeding values with generations is due to long distance associations between markers and QTL","authors":"Didier Boichard, Sébastien Fritz, Pascal Croiseau, Vincent Ducrocq, Thierry Tribout, Beatriz C. D. Cuyabano","doi":"10.1186/s12711-025-00963-5","DOIUrl":"https://doi.org/10.1186/s12711-025-00963-5","url":null,"abstract":"Most validation studies of genomic evaluations on candidates (prior to observing phenotypes) present inflation of their predicted breeding values, i.e., regression coefficients of their later observed phenotypes on the early predictions are smaller than one. The aim of this study was to show that this inflation pattern reflects at least partly long-distance associations between markers and quantitative trait loci (QTL) in the reference population and to propose methods to estimate the corresponding “erosion” coefficient. Across-chromosome linkage disequilibrium (LD) is observed in different dairy cattle breeds, being a result from limited effective population size and from relationships within the reference population. Due to this long distance LD, the estimated SNP effects capture non-zero contributions from distant QTLs, some located on other chromosomes than the SNP itself. Therefore, corresponding SNP effects are partly lost in the next generations and we refer to this loss as “erosion”. With the concept of QTL contribution to SNP effects derived from mixed model equations, we show with simulation that this long range LD explains 6–25% of the variance of the estimated genomic breeding values, a proportion that is unchanged when the evaluation model includes a residual polygenic effect. Two methods are proposed to predict this erosion factor assuming known simulated QTL effects. In Method 1, one generation of progeny is simulated from the reference population and the GEBV of these progeny based on SNP effects estimated in this newly simulated generation are regressed on the GEBV of the same progeny based on SNP effects estimated in the reference population. In Method 2 all the QTL contributions to SNP effects are regressed based on SNP-QTL recombination rates and summed to predict the GEBV at the next generation. The regression coefficient of the GEBV based on eroded contributions on the raw GEBV is also an estimate of erosion. An illustration is given with the French Normande female reference bovine population in 2021, showing erosion factors ranging from 0.84 to 0.87. Accounting for erosion is important to avoid inflation and biased predictions. The ways to both reduce inflation and to correct for it in the prediction are discussed.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"1 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2025-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143666295","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Molecular breeding of pigs in the genome editing era 基因组编辑时代猪的分子育种
IF 4.1 1区 农林科学 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE Pub Date : 2025-03-10 DOI: 10.1186/s12711-025-00961-7
Jiahuan Chen, Jiaqi Wang, Haoran Zhao, Xiao Tan, Shihan Yan, Huanyu Zhang, Tiefeng Wang, Xiaochun Tang
To address the increasing demand for high-quality pork protein, it is essential to implement strategies that enhance diets and produce pigs with excellent production traits. Selective breeding and crossbreeding are the primary methods used for genetic improvement in modern agriculture. However, these methods face challenges due to long breeding cycles and the necessity for beneficial genetic variation associated with high-quality traits within the population. This limitation restricts the transfer of desirable alleles across different genera and species. This article systematically reviews past and current research advancements in porcine molecular breeding. It discusses the screening of clustered regularly interspaced short palindromic repeats (CRISPR) to identify resistance loci in swine and the challenges and future applications of genetically modified pigs. The emergence of transgenic and gene editing technologies has prompted researchers to apply these methods to pig breeding. These advancements allow for alterations in the pig genome through various techniques, ranging from random integration into the genome to site-specific insertion and from target gene knockout (KO) to precise base and prime editing. As a result, numerous desirable traits, such as disease resistance, high meat yield, improved feed efficiency, reduced fat deposition, and lower environmental waste, can be achieved easily and effectively by genetic modification. These traits can serve as valuable resources to enhance swine breeding programmes. In the era of genome editing, molecular breeding of pigs is critical to the future of agriculture. Long-term and multidomain analyses of genetically modified pigs by researchers, related policy development by regulatory agencies, and public awareness and acceptance of their safety are the keys to realizing the transition of genetically modified products from the laboratory to the market.
为了满足对优质猪肉蛋白质日益增长的需求,必须实施改善日粮和生产优良生产性状猪的战略。选择育种和杂交育种是现代农业遗传改良的主要方法。然而,这些方法面临着挑战,因为育种周期长,需要与群体内高质量性状相关的有益遗传变异。这一限制限制了理想等位基因在不同属和种之间的转移。本文系统地综述了猪分子育种的研究进展。它讨论了聚集规律间隔短回文重复序列(CRISPR)的筛选,以确定猪的抗性位点,以及转基因猪的挑战和未来应用。转基因和基因编辑技术的出现促使研究人员将这些方法应用于猪的育种。这些进步允许通过各种技术改变猪基因组,从随机整合到基因组到特定位点插入,从靶基因敲除(KO)到精确的碱基和引物编辑。因此,许多理想的性状,如抗病性、高肉产量、提高饲料效率、减少脂肪沉积和减少环境浪费,可以通过基因改造轻松有效地实现。这些性状可以作为宝贵的资源来加强猪的育种计划。在基因组编辑时代,猪的分子育种对农业的未来至关重要。研究人员对转基因猪的长期和多领域分析、监管机构的相关政策制定以及公众对其安全性的认识和接受是实现转基因产品从实验室向市场过渡的关键。
{"title":"Molecular breeding of pigs in the genome editing era","authors":"Jiahuan Chen, Jiaqi Wang, Haoran Zhao, Xiao Tan, Shihan Yan, Huanyu Zhang, Tiefeng Wang, Xiaochun Tang","doi":"10.1186/s12711-025-00961-7","DOIUrl":"https://doi.org/10.1186/s12711-025-00961-7","url":null,"abstract":"To address the increasing demand for high-quality pork protein, it is essential to implement strategies that enhance diets and produce pigs with excellent production traits. Selective breeding and crossbreeding are the primary methods used for genetic improvement in modern agriculture. However, these methods face challenges due to long breeding cycles and the necessity for beneficial genetic variation associated with high-quality traits within the population. This limitation restricts the transfer of desirable alleles across different genera and species. This article systematically reviews past and current research advancements in porcine molecular breeding. It discusses the screening of clustered regularly interspaced short palindromic repeats (CRISPR) to identify resistance loci in swine and the challenges and future applications of genetically modified pigs. The emergence of transgenic and gene editing technologies has prompted researchers to apply these methods to pig breeding. These advancements allow for alterations in the pig genome through various techniques, ranging from random integration into the genome to site-specific insertion and from target gene knockout (KO) to precise base and prime editing. As a result, numerous desirable traits, such as disease resistance, high meat yield, improved feed efficiency, reduced fat deposition, and lower environmental waste, can be achieved easily and effectively by genetic modification. These traits can serve as valuable resources to enhance swine breeding programmes. In the era of genome editing, molecular breeding of pigs is critical to the future of agriculture. Long-term and multidomain analyses of genetically modified pigs by researchers, related policy development by regulatory agencies, and public awareness and acceptance of their safety are the keys to realizing the transition of genetically modified products from the laboratory to the market.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"19 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2025-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143582976","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Genomic selection in pig breeding: comparative analysis of machine learning algorithms 猪育种中的基因组选择:机器学习算法的比较分析
IF 4.1 1区 农林科学 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE Pub Date : 2025-03-10 DOI: 10.1186/s12711-025-00957-3
Ruilin Su, Jingbo Lv, Yahui Xue, Sheng Jiang, Lei Zhou, Li Jiang, Junyan Tan, Zhencai Shen, Ping Zhong, Jianfeng Liu
The effectiveness of genomic prediction (GP) significantly influences breeding progress, and employing SNP markers to predict phenotypic values is a pivotal aspect of pig breeding. Machine learning (ML) methods are usually used to predict phenotypic values since their advantages in processing high dimensional data. While, the existing researches have not indicated which ML methods are suitable for most pig genomic prediction. Therefore, it is necessary to select appropriate methods from a large number of ML methods as long as genomic prediction is performed. This paper compared the performance of popular ML methods in predicting pig phenotypes and then found out suitable methods for most traits. In this paper, five commonly used datasets from other literatures were utilized to compare the performance of different ML methods. The experimental results demonstrate that Stacking performs best on the PIC dataset where the trait information is hidden, and the performs of kernel ridge regression with rbf kernel (KRR-rbf) closely follows. Support vector regression (SVR) performs best in predicting reproductive traits, followed by genomic best linear unbiased prediction (GBLUP). GBLUP achieves the best performance on growth traits, with SVR as the second best. GBLUP achieves good performance for GP problems. Similarly, the Stacking, SVR, and KRR-RBF methods also achieve high prediction accuracy. Moreover, LR statistical analysis shows that Stacking, SVR and KRR are stable. When applying ML methods for phenotypic values prediction in pigs, we recommend these three approaches.
基因组预测(GP)的有效性显著影响育种进展,利用SNP标记预测表型值是猪育种的关键方面。由于机器学习方法在处理高维数据方面具有优势,因此通常用于预测表型值。然而,现有的研究并没有表明哪些ML方法适合于大多数猪基因组预测。因此,只要进行基因组预测,就有必要从大量ML方法中选择合适的方法。本文比较了几种常用的机器学习方法在猪表型预测方面的性能,找出了适合大多数性状的预测方法。本文利用其他文献中五个常用的数据集来比较不同ML方法的性能。实验结果表明,在特征信息隐藏的PIC数据集上,Stacking的效果最好,rbf核脊回归(KRR-rbf)的效果紧随其后。支持向量回归(SVR)对生殖性状的预测效果最好,其次是基因组最佳线性无偏预测(GBLUP)。GBLUP在生长性状上表现最佳,SVR次之。GBLUP在GP问题上取得了良好的性能。同样,Stacking、SVR和KRR-RBF方法也能达到较高的预测精度。此外,LR统计分析表明,叠加、SVR和KRR是稳定的。当应用ML方法预测猪的表型值时,我们推荐这三种方法。
{"title":"Genomic selection in pig breeding: comparative analysis of machine learning algorithms","authors":"Ruilin Su, Jingbo Lv, Yahui Xue, Sheng Jiang, Lei Zhou, Li Jiang, Junyan Tan, Zhencai Shen, Ping Zhong, Jianfeng Liu","doi":"10.1186/s12711-025-00957-3","DOIUrl":"https://doi.org/10.1186/s12711-025-00957-3","url":null,"abstract":"The effectiveness of genomic prediction (GP) significantly influences breeding progress, and employing SNP markers to predict phenotypic values is a pivotal aspect of pig breeding. Machine learning (ML) methods are usually used to predict phenotypic values since their advantages in processing high dimensional data. While, the existing researches have not indicated which ML methods are suitable for most pig genomic prediction. Therefore, it is necessary to select appropriate methods from a large number of ML methods as long as genomic prediction is performed. This paper compared the performance of popular ML methods in predicting pig phenotypes and then found out suitable methods for most traits. In this paper, five commonly used datasets from other literatures were utilized to compare the performance of different ML methods. The experimental results demonstrate that Stacking performs best on the PIC dataset where the trait information is hidden, and the performs of kernel ridge regression with rbf kernel (KRR-rbf) closely follows. Support vector regression (SVR) performs best in predicting reproductive traits, followed by genomic best linear unbiased prediction (GBLUP). GBLUP achieves the best performance on growth traits, with SVR as the second best. GBLUP achieves good performance for GP problems. Similarly, the Stacking, SVR, and KRR-RBF methods also achieve high prediction accuracy. Moreover, LR statistical analysis shows that Stacking, SVR and KRR are stable. When applying ML methods for phenotypic values prediction in pigs, we recommend these three approaches.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"38 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2025-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143582977","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Merging metabolomics and genomics provides a catalog of genetic factors that influence molecular phenotypes in pigs linking relevant metabolic pathways 代谢组学和基因组学的结合提供了影响猪分子表型的遗传因素目录,这些遗传因素连接了相关的代谢途径
IF 4.1 1区 农林科学 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE Pub Date : 2025-03-06 DOI: 10.1186/s12711-025-00960-8
Samuele Bovo, Anisa Ribani, Flaminia Fanelli, Giuliano Galimberti, Pier Luigi Martelli, Paolo Trevisi, Francesca Bertolini, Matteo Bolner, Rita Casadio, Stefania Dall’Olio, Maurizio Gallo, Diana Luise, Gianluca Mazzoni, Giuseppina Schiavo, Valeria Taurisano, Paolo Zambonelli, Paolo Bosi, Uberto Pagotto, Luca Fontanesi
Metabolomics opens novel avenues to study the basic biological mechanisms underlying complex traits, starting from characterization of metabolites. Metabolites and their levels in a biofluid represent simple molecular phenotypes (metabotypes) that are direct products of enzyme activities and relate to all metabolic pathways, including catabolism and anabolism of nutrients. In this study, we demonstrated the utility of merging metabolomics and genomics in pigs to uncover a large list of genetic factors that influence mammalian metabolism. We obtained targeted characterization of the plasma metabolome of more than 1300 pigs from two populations of Large White and Duroc pig breeds. The metabolomic profiles of these pigs were used to identify genetically influenced metabolites by estimating the heritability of the level of 188 metabolites. Then, combining breed-specific genome-wide association studies of single metabolites and their ratios and across breed meta-analyses, we identified a total of 97 metabolite quantitative trait loci (mQTL), associated with 126 metabolites. Using these results, we constructed a human-pig comparative catalog of genetic factors influencing the metabolomic profile. Whole genome resequencing data identified several putative causative mutations for these mQTL. Additionally, based on a major mQTL for kynurenine level, we designed a nutrigenetic study feeding piglets that carried different genotypes at the candidate gene kynurenine 3-monooxygenase (KMO) varying levels of tryptophan and demonstrated the effect of this genetic factor on the kynurenine pathway. Furthermore, we used metabolomic profiles of Large White and Duroc pigs to reconstruct metabolic pathways using Gaussian Graphical Models, which included perturbation of the identified mQTL. This study has provided the first catalog of genetic factors affecting molecular phenotypes that describe the pig blood metabolome, with links to important metabolic pathways, opening novel avenues to merge genetics and nutrition in this livestock species. The obtained results are relevant for basic and applied biology and to evaluate the pig as a biomedical model. Genetically influenced metabolites can be further exploited in nutrigenetic approaches in pigs. The described molecular phenotypes can be useful to dissect complex traits and design novel feeding, breeding and selection programs in pigs.
代谢组学为研究复杂性状的基本生物学机制开辟了新的途径,从代谢物的表征开始。生物流体中的代谢物及其水平代表了简单的分子表型(代谢型),它们是酶活性的直接产物,与所有代谢途径有关,包括营养物质的分解代谢和合成代谢。在这项研究中,我们展示了将代谢组学和基因组学结合在猪身上的效用,以揭示影响哺乳动物代谢的大量遗传因素。我们对来自大白猪和杜洛克猪两个品种的1300多头猪的血浆代谢组进行了针对性的表征。通过估计188种代谢物水平的遗传能力,利用这些猪的代谢组学特征来鉴定受遗传影响的代谢物。然后,结合单个代谢物及其比例的品种特异性全基因组关联研究和跨品种荟萃分析,我们共鉴定出97个代谢物数量性状位点(mQTL),与126种代谢物相关。利用这些结果,我们构建了影响代谢组学特征的遗传因素的人猪比较目录。全基因组重测序数据确定了这些mQTL的几个假定的致病突变。此外,基于犬尿氨酸水平的主要mQTL,我们设计了一项营养遗传学研究,饲养不同水平色氨酸的候选基因kynurenine 3-monooxygenase (KMO)携带不同基因型的仔猪,并证明了该遗传因素对犬尿氨酸途径的影响。此外,我们利用大白猪和杜洛克猪的代谢组学特征,使用高斯图形模型重建代谢途径,其中包括已识别的mQTL的扰动。这项研究首次提供了影响猪血液代谢组分子表型的遗传因素目录,并与重要的代谢途径相关联,为将这种牲畜物种的遗传和营养结合起来开辟了新的途径。所获得的结果与基础生物学和应用生物学以及评价猪作为生物医学模型有关。受遗传影响的代谢物可以在猪的营养遗传学方法中进一步利用。所描述的分子表型可以用于解剖复杂的性状和设计新的饲养,育种和选择方案的猪。
{"title":"Merging metabolomics and genomics provides a catalog of genetic factors that influence molecular phenotypes in pigs linking relevant metabolic pathways","authors":"Samuele Bovo, Anisa Ribani, Flaminia Fanelli, Giuliano Galimberti, Pier Luigi Martelli, Paolo Trevisi, Francesca Bertolini, Matteo Bolner, Rita Casadio, Stefania Dall’Olio, Maurizio Gallo, Diana Luise, Gianluca Mazzoni, Giuseppina Schiavo, Valeria Taurisano, Paolo Zambonelli, Paolo Bosi, Uberto Pagotto, Luca Fontanesi","doi":"10.1186/s12711-025-00960-8","DOIUrl":"https://doi.org/10.1186/s12711-025-00960-8","url":null,"abstract":"Metabolomics opens novel avenues to study the basic biological mechanisms underlying complex traits, starting from characterization of metabolites. Metabolites and their levels in a biofluid represent simple molecular phenotypes (metabotypes) that are direct products of enzyme activities and relate to all metabolic pathways, including catabolism and anabolism of nutrients. In this study, we demonstrated the utility of merging metabolomics and genomics in pigs to uncover a large list of genetic factors that influence mammalian metabolism. We obtained targeted characterization of the plasma metabolome of more than 1300 pigs from two populations of Large White and Duroc pig breeds. The metabolomic profiles of these pigs were used to identify genetically influenced metabolites by estimating the heritability of the level of 188 metabolites. Then, combining breed-specific genome-wide association studies of single metabolites and their ratios and across breed meta-analyses, we identified a total of 97 metabolite quantitative trait loci (mQTL), associated with 126 metabolites. Using these results, we constructed a human-pig comparative catalog of genetic factors influencing the metabolomic profile. Whole genome resequencing data identified several putative causative mutations for these mQTL. Additionally, based on a major mQTL for kynurenine level, we designed a nutrigenetic study feeding piglets that carried different genotypes at the candidate gene kynurenine 3-monooxygenase (KMO) varying levels of tryptophan and demonstrated the effect of this genetic factor on the kynurenine pathway. Furthermore, we used metabolomic profiles of Large White and Duroc pigs to reconstruct metabolic pathways using Gaussian Graphical Models, which included perturbation of the identified mQTL. This study has provided the first catalog of genetic factors affecting molecular phenotypes that describe the pig blood metabolome, with links to important metabolic pathways, opening novel avenues to merge genetics and nutrition in this livestock species. The obtained results are relevant for basic and applied biology and to evaluate the pig as a biomedical model. Genetically influenced metabolites can be further exploited in nutrigenetic approaches in pigs. The described molecular phenotypes can be useful to dissect complex traits and design novel feeding, breeding and selection programs in pigs.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"36 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2025-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143560636","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluation of genomic selection models using whole genome sequence data and functional annotation in Belgian Blue cattle 利用比利时蓝牛全基因组序列数据和功能注释评估基因组选择模型
IF 4.1 1区 农林科学 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE Pub Date : 2025-03-04 DOI: 10.1186/s12711-025-00955-5
Can Yuan, Alain Gillon, José Luis Gualdrón Duarte, Haruko Takeda, Wouter Coppieters, Michel Georges, Tom Druet
The availability of large cohorts of whole-genome sequenced individuals, combined with functional annotation, is expected to provide opportunities to improve the accuracy of genomic selection (GS). However, such benefits have not often been observed in initial applications. The reference population for GS in Belgian Blue Cattle (BBC) continues to grow. Combined with the availability of reference panels of sequenced individuals, it provides an opportunity to evaluate GS models using whole genome sequence (WGS) data and functional annotation. Here, we used data from 16,508 cows, with phenotypes for five muscular development traits and imputed at the WGS level, in combination with in silico functional annotation and catalogs of putative regulatory variants obtained from experimental data. We evaluated first GS models using the entire WGS data, with or without functional annotation. At this marker density, we were able to run two approaches, assuming either a highly polygenic architecture (GBLUP) or allowing some variants to have larger effects (BayesRR-RC, a Bayesian mixture model), and observed an increased reliability compared to the official GBLUP model at medium marker density (on average 0.016 and 0.018 for GBLUP and BayesRR-RC, respectively). When functional annotation was used, we observed slightly higher reliabilities with an extension of GBLUP that included multiple polygenic terms (one per functional group), while reliabilities decreased with BayesRR-RC. We then used large subsets of variants selected based on functional information or with a linkage disequilibrium (LD) pruning approach, which allowed us to evaluate two additional approaches, BayesCπ and Bayesian Sparse Linear Mixed Model (BSLMM). Reliabilities were higher for these panels than for the WGS data, with the highest accuracies obtained when markers were selected based on functional information. In our setting, BSLMM systematically achieved higher reliabilities than other methods. GS with large panels of functional variants selected from WGS data allowed a significant increase in reliability compared to the official genomic evaluation approach. However, the benefits of using WGS and functional data remained modest, indicating that there is still room for improvement, for example by further refining the functional annotation in the BBC breed.
大量全基因组测序个体的可用性,结合功能注释,有望为提高基因组选择(GS)的准确性提供机会。然而,这些好处在最初的应用中并不经常被观察到。比利时蓝牛(BBC)中GS的参考种群继续增长。结合已测序个体参考面板的可用性,它提供了利用全基因组序列(WGS)数据和功能注释来评估GS模型的机会。在这里,我们使用了16,508头奶牛的数据,这些数据具有5种肌肉发育性状的表型,并在WGS水平上进行了估算,结合了从实验数据中获得的计算机功能注释和推测的调节变异目录。我们使用完整的WGS数据评估了第一个GS模型,有或没有功能注释。在这个标记密度下,我们能够运行两种方法,假设一个高多基因结构(GBLUP)或允许一些变体有更大的影响(BayesRR-RC,贝叶斯混合模型),并观察到与中等标记密度下的官方GBLUP模型相比,可靠性有所提高(GBLUP和BayesRR-RC的平均可靠性分别为0.016和0.018)。当使用功能注释时,我们观察到使用包含多个多基因术语(每个功能组一个)的GBLUP扩展的可靠性略高,而使用BayesRR-RC的可靠性降低。然后,我们使用基于功能信息或链接不平衡(LD)修剪方法选择的变体的大子集,这使我们能够评估另外两种方法,bayescc π和贝叶斯稀疏线性混合模型(BSLMM)。这些面板的可靠性高于WGS数据,根据功能信息选择标记时获得的准确性最高。在我们的设置中,BSLMM系统地获得了比其他方法更高的可靠性。与官方基因组评估方法相比,从WGS数据中选择大量功能变异的GS可以显著提高可靠性。然而,使用WGS和功能数据的好处仍然有限,这表明仍有改进的空间,例如,可以进一步完善BBC品种中的功能注释。
{"title":"Evaluation of genomic selection models using whole genome sequence data and functional annotation in Belgian Blue cattle","authors":"Can Yuan, Alain Gillon, José Luis Gualdrón Duarte, Haruko Takeda, Wouter Coppieters, Michel Georges, Tom Druet","doi":"10.1186/s12711-025-00955-5","DOIUrl":"https://doi.org/10.1186/s12711-025-00955-5","url":null,"abstract":"The availability of large cohorts of whole-genome sequenced individuals, combined with functional annotation, is expected to provide opportunities to improve the accuracy of genomic selection (GS). However, such benefits have not often been observed in initial applications. The reference population for GS in Belgian Blue Cattle (BBC) continues to grow. Combined with the availability of reference panels of sequenced individuals, it provides an opportunity to evaluate GS models using whole genome sequence (WGS) data and functional annotation. Here, we used data from 16,508 cows, with phenotypes for five muscular development traits and imputed at the WGS level, in combination with in silico functional annotation and catalogs of putative regulatory variants obtained from experimental data. We evaluated first GS models using the entire WGS data, with or without functional annotation. At this marker density, we were able to run two approaches, assuming either a highly polygenic architecture (GBLUP) or allowing some variants to have larger effects (BayesRR-RC, a Bayesian mixture model), and observed an increased reliability compared to the official GBLUP model at medium marker density (on average 0.016 and 0.018 for GBLUP and BayesRR-RC, respectively). When functional annotation was used, we observed slightly higher reliabilities with an extension of GBLUP that included multiple polygenic terms (one per functional group), while reliabilities decreased with BayesRR-RC. We then used large subsets of variants selected based on functional information or with a linkage disequilibrium (LD) pruning approach, which allowed us to evaluate two additional approaches, BayesCπ and Bayesian Sparse Linear Mixed Model (BSLMM). Reliabilities were higher for these panels than for the WGS data, with the highest accuracies obtained when markers were selected based on functional information. In our setting, BSLMM systematically achieved higher reliabilities than other methods. GS with large panels of functional variants selected from WGS data allowed a significant increase in reliability compared to the official genomic evaluation approach. However, the benefits of using WGS and functional data remained modest, indicating that there is still room for improvement, for example by further refining the functional annotation in the BBC breed.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"34 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2025-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143538512","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Genetics Selection Evolution
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1