首页 > 最新文献

Genetics Selection Evolution最新文献

英文 中文
Unraveling the role of bacteria with heritable versus non-heritable relative abundance in the gut on boar semen quality 揭示肠道中遗传与非遗传相对丰度细菌对公猪精液质量的作用
IF 4.1 1区 农林科学 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE Pub Date : 2025-11-06 DOI: 10.1186/s12711-025-00990-2
Liangliang Guo, Xiaoqi Pei, Jiajian Tan, Haiqing Sun, Siwen Jiang, Hongkui Wei, Jian Peng
The relative abundance of some bacteria in the gut of pigs is heritable, suggesting that host genetics may recursively influence boar semen quality by affecting the composition and function of gut microbiota. Therefore, it is essential to elucidate the specific contributions of heritable versus non-heritable gut microbiota to semen quality traits. Our study aimed to identify heritable and non-heritable bacterial taxa at the genus level in the boar gut and to predict their functions and respective contributions to semen quality traits. At the genus level, 39 heritable and 91 non-heritable bacterial taxa were identified. Functional analysis revealed that predicted microbial functions in both groups were primarily enriched in carbohydrate, nucleotide, and amino acid metabolism. We further analyzed the average microbiability of heritable and non-heritable bacteria on short-chain fatty acids (SCFAs) and semen quality traits. The relative abundance of heritable bacteria was found to contribute more to SCFAs levels and semen quality than non-heritable bacteria. Mediation analysis revealed that SCFAs could mediate the influence of the relative abundance of heritable bacteria on host phenotypes, identifying 99 significant genus-SCFAs-semen quality trait mediation links. Our findings underscore the substantial role of the relative abundance of heritable gut bacteria in shaping porcine semen quality through SCFAs mediation. These results highlight the potential of targeted microbiome interventions to enhance reproductive traits in pigs.
猪肠道中某些细菌的相对丰度是可遗传的,这表明宿主遗传可能通过影响肠道微生物群的组成和功能而递归地影响猪精液质量。因此,有必要阐明遗传与非遗传肠道微生物群对精液质量性状的具体贡献。本研究旨在鉴定猪肠道中可遗传和不可遗传的细菌分类群,并预测它们的功能及其对精液质量性状的贡献。在属水平上鉴定出39个可遗传的细菌类群和91个不可遗传的细菌类群。功能分析显示,两组预测的微生物功能主要富集于碳水化合物、核苷酸和氨基酸代谢。我们进一步分析了遗传和非遗传细菌对短链脂肪酸(SCFAs)的平均微生物率和精液质量性状。可遗传细菌的相对丰度比不可遗传细菌对SCFAs水平和精液质量的贡献更大。中介分析显示,SCFAs可以介导可遗传细菌相对丰度对宿主表型的影响,鉴定出99个显著的属-SCFAs-精液质量性状中介联系。我们的研究结果强调了相对丰富的可遗传肠道细菌在通过SCFAs介导塑造猪精液质量中的重要作用。这些结果突出了靶向微生物组干预提高猪繁殖性状的潜力。
{"title":"Unraveling the role of bacteria with heritable versus non-heritable relative abundance in the gut on boar semen quality","authors":"Liangliang Guo, Xiaoqi Pei, Jiajian Tan, Haiqing Sun, Siwen Jiang, Hongkui Wei, Jian Peng","doi":"10.1186/s12711-025-00990-2","DOIUrl":"https://doi.org/10.1186/s12711-025-00990-2","url":null,"abstract":"The relative abundance of some bacteria in the gut of pigs is heritable, suggesting that host genetics may recursively influence boar semen quality by affecting the composition and function of gut microbiota. Therefore, it is essential to elucidate the specific contributions of heritable versus non-heritable gut microbiota to semen quality traits. Our study aimed to identify heritable and non-heritable bacterial taxa at the genus level in the boar gut and to predict their functions and respective contributions to semen quality traits. At the genus level, 39 heritable and 91 non-heritable bacterial taxa were identified. Functional analysis revealed that predicted microbial functions in both groups were primarily enriched in carbohydrate, nucleotide, and amino acid metabolism. We further analyzed the average microbiability of heritable and non-heritable bacteria on short-chain fatty acids (SCFAs) and semen quality traits. The relative abundance of heritable bacteria was found to contribute more to SCFAs levels and semen quality than non-heritable bacteria. Mediation analysis revealed that SCFAs could mediate the influence of the relative abundance of heritable bacteria on host phenotypes, identifying 99 significant genus-SCFAs-semen quality trait mediation links. Our findings underscore the substantial role of the relative abundance of heritable gut bacteria in shaping porcine semen quality through SCFAs mediation. These results highlight the potential of targeted microbiome interventions to enhance reproductive traits in pigs.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"36 2 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2025-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145447198","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Recombination events restored the functional horned haplotypes in the offspring of polled parents 重组事件恢复了被调查亲本后代的功能性角单倍型
IF 4.1 1区 农林科学 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE Pub Date : 2025-10-31 DOI: 10.1186/s12711-025-01009-6
Maulik Upadhyay, Alexander Graf, Neža Pogorevc, Doris Seichter, Ingolf Russ, Stefan Krebs, Saskia Meier, Ivica Medugorac
Breeding of genetically polled animals is a desirable approach in modern cattle husbandry for welfare and economic reasons. At least four different genetic variants associated with polledness in cattle have been identified, suggesting genetic heterogeneity. These dominant variants are located on chromosome 1 between approximately 2.42–2.73 Mb (reference: ARS-UCD1.3), also called the POLLED locus. Among these variants, Friesian (PF, ~ 80 kbp duplication) and Celtic (PC, 212 bp complex InDel) are the most common across breeds in the production systems globally, such as in Holstein–Friesian (HF) and Fleckvieh (FV). While studies have provided strong evidence supporting the association of the PF allele with the polledness, it has not yet been functionally validated, unlike the PC allele. Here, we conduct whole-genome sequencing analyses of two trios exhibiting unexpected inheritance patterns related to the PC and PF variants. In both instances, horned offspring were produced from mating pairs where one parent was homozygous for the polled variant and the other was homozygous for the ancestral horned variant. By analyzing the WGS data generated using Nanopore technology, we show that the de novo generation of the ancestral horned phenotype in both offspring was the result of distinct recombination events. Specifically, in the HF trio, it was the result of non-allelic homologous recombination in the gametes of the sire (PF/PF), while in the FV trio, it was the result of allelic homologous recombination in the gametes of the dam (PC/PF). The findings from the HF trio support the hypothesis that ~ 80-kbp duplication is the genetic variant responsible for the polled phenotype of Friesian origin. We show that different genomic arrangements in the POLLED locus can lead to the emergence of de novo ancestral horn phenotypes. Such arrangements can complicate phenotype prediction in offspring, even when sires or dams have been tested as genetically homozygous polled. Therefore, it is important, for a better understanding of the relationship between the POLLED locus and the POLLED phenotype, that any deviation from the expected result is critically analysed. Possibly, some of these cases can further narrow down the sequence motif that is essential for polledness in cattle.
从福利和经济的角度考虑,育种遗传投票动物是现代畜牧业的一种理想方法。至少有四种不同的遗传变异与牛的授粉有关,这表明遗传异质性。这些显性变异位于1号染色体约2.42-2.73 Mb之间(参考文献:ARS-UCD1.3),也称为轮询位点。在这些变异中,弗里西亚(PF, ~ 80 kbp重复)和凯尔特(PC, 212 bp复杂InDel)是全球生产系统中最常见的品种,如荷尔斯坦-弗里西亚(HF)和弗莱科维耶(FV)。虽然研究提供了强有力的证据支持PF等位基因与轮询性的关联,但它尚未得到功能验证,不像PC等位基因。在这里,我们对两个三重奏进行了全基因组测序分析,显示出与PC和PF变体相关的意想不到的遗传模式。在这两种情况下,有角的后代都是由一对交配对产生的,其中一方是纯合的,另一方是纯合的,是祖先的有角的变体。通过分析使用纳米孔技术生成的WGS数据,我们发现两个后代中祖先角型表型的重新产生是不同重组事件的结果。其中,高频三人组是父系(PF/PF)配子非等位基因同源重组的结果,而FV三人组是母系(PC/PF)配子等位基因同源重组的结果。HF三人组的研究结果支持了一种假设,即~ 80 kbp的重复是弗里西亚起源的投票表型的遗传变异。我们表明,在投票位点的不同基因组安排可以导致出现新的祖先角表型。这样的安排可能会使后代的表型预测复杂化,即使是在母系或母系已经被测试为基因纯合的情况下。因此,重要的是,为了更好地理解投票位点和投票表型之间的关系,任何偏离预期结果的情况都要进行严格的分析。可能,这些案例中的一些可以进一步缩小对牛授粉至关重要的序列基序。
{"title":"Recombination events restored the functional horned haplotypes in the offspring of polled parents","authors":"Maulik Upadhyay, Alexander Graf, Neža Pogorevc, Doris Seichter, Ingolf Russ, Stefan Krebs, Saskia Meier, Ivica Medugorac","doi":"10.1186/s12711-025-01009-6","DOIUrl":"https://doi.org/10.1186/s12711-025-01009-6","url":null,"abstract":"Breeding of genetically polled animals is a desirable approach in modern cattle husbandry for welfare and economic reasons. At least four different genetic variants associated with polledness in cattle have been identified, suggesting genetic heterogeneity. These dominant variants are located on chromosome 1 between approximately 2.42–2.73 Mb (reference: ARS-UCD1.3), also called the POLLED locus. Among these variants, Friesian (PF, ~ 80 kbp duplication) and Celtic (PC, 212 bp complex InDel) are the most common across breeds in the production systems globally, such as in Holstein–Friesian (HF) and Fleckvieh (FV). While studies have provided strong evidence supporting the association of the PF allele with the polledness, it has not yet been functionally validated, unlike the PC allele. Here, we conduct whole-genome sequencing analyses of two trios exhibiting unexpected inheritance patterns related to the PC and PF variants. In both instances, horned offspring were produced from mating pairs where one parent was homozygous for the polled variant and the other was homozygous for the ancestral horned variant. By analyzing the WGS data generated using Nanopore technology, we show that the de novo generation of the ancestral horned phenotype in both offspring was the result of distinct recombination events. Specifically, in the HF trio, it was the result of non-allelic homologous recombination in the gametes of the sire (PF/PF), while in the FV trio, it was the result of allelic homologous recombination in the gametes of the dam (PC/PF). The findings from the HF trio support the hypothesis that ~ 80-kbp duplication is the genetic variant responsible for the polled phenotype of Friesian origin. We show that different genomic arrangements in the POLLED locus can lead to the emergence of de novo ancestral horn phenotypes. Such arrangements can complicate phenotype prediction in offspring, even when sires or dams have been tested as genetically homozygous polled. Therefore, it is important, for a better understanding of the relationship between the POLLED locus and the POLLED phenotype, that any deviation from the expected result is critically analysed. Possibly, some of these cases can further narrow down the sequence motif that is essential for polledness in cattle.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"55 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2025-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145405123","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Genetics of reproductive performance across Porcine Reproductive and Respiratory Syndrome (PRRS) outbreak phases in purebred and crossbred sows 猪繁殖与呼吸综合征(PRRS)在纯种和杂交母猪爆发阶段繁殖性能的遗传学研究
IF 4.1 1区 农林科学 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE Pub Date : 2025-10-30 DOI: 10.1186/s12711-025-01011-y
Leticia F. de Oliveira, Jenelle Dunkelberger, Claudia A. Sevillano, Saranya Arirangan, Robbee Wedow, Matthew Tegtmeyer, Mitchell Tuinstra, Luiz F. Brito
Porcine Reproductive and Respiratory Syndrome (PRRS) is a major challenge for the worldwide pig industry. Therefore, genetic selection for enhanced disease resilience is a priority for pig breeding programs. The objectives of this study were to evaluate genetic variation in reproductive performance during a PRRS outbreak and to assess the impact of selecting for enhanced reproductive performance using data collected under non-challenged conditions on reproductive performance in a PRRS challenged environment. These objectives were addressed by identifying natural PRRS outbreak periods from longitudinal performance data and estimating genetic parameters for reproductive performance traits, before, during, and after a PRRS outbreak, using data collected from purebred and crossbred sows on multiplier farms. During PRRS outbreaks, the number of piglets born alive decreased, while the number of stillborn and mummified piglets increased for both purebred and crossbred sows. Additive genetic variance and heritability estimates for reproductive performance traits varied by phase. For most traits, additive genetic variance was highest during the outbreak. Estimates of genetic correlations between a given trait measured across phases ranged from 0.09 to 0.99, but were > 0.3 for most traits. In general, estimates of genetic correlations were greatest between a given trait before and after an outbreak. Results also indicated reranking of animals based on estimated breeding values across outbreak phases, with Spearman correlations below 0.50 for most traits and low proportion of top-ranking animals retained across phases. PRRS outbreak periods can be detected by evaluating phenotypic variation in reproductive performance from longitudinal data. Reproductive performance is heritable, whether evaluated before, during, or after a PRRS outbreak, but estimates varied by phase. Favorable moderate-to-high genetic correlations were estimated for reproductive performance traits measured before vs. during a PRRS outbreak, suggesting that selection for improved reproductive performance under non-challenged conditions is also expected to improve reproductive performance under PRRS challenge conditions. However, the estimates of genetic correlation for most of the reproductive traits indicated genotype-by-environment interactions between the PRRS-free and challenge conditions. Therefore, incorporating data collected under PRRS challenge will capture additional genetic variation in PRRS resilience and, ultimately, aid in selecting sows with increased PRRS resilience.
猪繁殖与呼吸综合征(PRRS)是全球养猪业面临的主要挑战。因此,提高抗病性的遗传选择是猪育种计划的优先事项。本研究的目的是评估PRRS暴发期间繁殖性能的遗传变异,并评估使用在非挑战条件下收集的数据选择提高繁殖性能对PRRS挑战环境中繁殖性能的影响。通过纵向性能数据确定PRRS自然爆发期,并利用从倍增器农场纯种和杂交母猪收集的数据估计PRRS爆发之前、期间和之后繁殖性能性状的遗传参数,实现了这些目标。在PRRS暴发期间,出生时活产的仔猪数量减少,而纯种和杂交母猪的死产和干尸仔猪数量增加。生殖性能性状的加性遗传变异和遗传力估计因期而异。对于大多数性状,加性遗传变异在暴发期间最高。在不同阶段测量的特定性状之间的遗传相关性估计在0.09到0.99之间,但大多数性状的遗传相关性为bb0 0.3。一般来说,在疫情爆发前后对某一特定性状的遗传相关性估计最大。结果还表明,在疫情爆发的各个阶段,根据估计的育种值对动物进行了重新排序,大多数性状的斯皮尔曼相关性低于0.50,在各个阶段保留的排名靠前的动物比例很低。可以通过纵向数据评估繁殖表现的表型变异来检测PRRS暴发期。无论是在PRRS暴发之前、期间还是之后进行评估,生殖表现都是可遗传的,但评估结果因阶段而异。据估计,在PRRS爆发前和爆发期间测量的繁殖性能性状具有良好的中等到高度的遗传相关性,这表明在非挑战条件下提高繁殖性能的选择也有望提高PRRS挑战条件下的繁殖性能。然而,大多数生殖性状的遗传相关性估计表明,无prrs和攻毒条件之间存在基因型-环境相互作用。因此,整合在PRRS挑战下收集的数据将捕获PRRS抗逆性的额外遗传变异,并最终有助于选择具有增强PRRS抗逆性的母猪。
{"title":"Genetics of reproductive performance across Porcine Reproductive and Respiratory Syndrome (PRRS) outbreak phases in purebred and crossbred sows","authors":"Leticia F. de Oliveira, Jenelle Dunkelberger, Claudia A. Sevillano, Saranya Arirangan, Robbee Wedow, Matthew Tegtmeyer, Mitchell Tuinstra, Luiz F. Brito","doi":"10.1186/s12711-025-01011-y","DOIUrl":"https://doi.org/10.1186/s12711-025-01011-y","url":null,"abstract":"Porcine Reproductive and Respiratory Syndrome (PRRS) is a major challenge for the worldwide pig industry. Therefore, genetic selection for enhanced disease resilience is a priority for pig breeding programs. The objectives of this study were to evaluate genetic variation in reproductive performance during a PRRS outbreak and to assess the impact of selecting for enhanced reproductive performance using data collected under non-challenged conditions on reproductive performance in a PRRS challenged environment. These objectives were addressed by identifying natural PRRS outbreak periods from longitudinal performance data and estimating genetic parameters for reproductive performance traits, before, during, and after a PRRS outbreak, using data collected from purebred and crossbred sows on multiplier farms. During PRRS outbreaks, the number of piglets born alive decreased, while the number of stillborn and mummified piglets increased for both purebred and crossbred sows. Additive genetic variance and heritability estimates for reproductive performance traits varied by phase. For most traits, additive genetic variance was highest during the outbreak. Estimates of genetic correlations between a given trait measured across phases ranged from 0.09 to 0.99, but were > 0.3 for most traits. In general, estimates of genetic correlations were greatest between a given trait before and after an outbreak. Results also indicated reranking of animals based on estimated breeding values across outbreak phases, with Spearman correlations below 0.50 for most traits and low proportion of top-ranking animals retained across phases. PRRS outbreak periods can be detected by evaluating phenotypic variation in reproductive performance from longitudinal data. Reproductive performance is heritable, whether evaluated before, during, or after a PRRS outbreak, but estimates varied by phase. Favorable moderate-to-high genetic correlations were estimated for reproductive performance traits measured before vs. during a PRRS outbreak, suggesting that selection for improved reproductive performance under non-challenged conditions is also expected to improve reproductive performance under PRRS challenge conditions. However, the estimates of genetic correlation for most of the reproductive traits indicated genotype-by-environment interactions between the PRRS-free and challenge conditions. Therefore, incorporating data collected under PRRS challenge will capture additional genetic variation in PRRS resilience and, ultimately, aid in selecting sows with increased PRRS resilience.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"80 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2025-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145396779","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prediction accuracy for feed intake and body weight gain using host genomic and rumen metagenomic data in beef cattle 利用宿主基因组和瘤胃宏基因组数据预测肉牛采食量和体重增加的准确性
IF 4.1 1区 农林科学 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE Pub Date : 2025-10-30 DOI: 10.1186/s12711-025-01007-8
Andrew Lakamp, Seidu Adams, Larry Kuehn, Warren Snelling, James Wells, Kristin Hales, Bryan Neville, Samodha Fernando, Matthew L. Spangler
Host genomic and rumen metagenome data can predict feed efficiency traits, supporting management decisions and increasing profitability. This study estimated the proportion of variation of average daily dry matter intake and average daily gain explained by the rumen metagenome in beef cattle, evaluated prediction accuracy using genomic data, metagenomic data, or their combination, and explored methods for modelling the rumen metagenome to improve phenotypic prediction accuracy. Data from 717 animals on four diets (two concentrate-based and two forage-based) were analyzed. Animal genotypes consisted of 749,922 imputed sequence variants, while metagenomic data comprised 16,583 open reading frames from ruminal microbiota. The metagenome was modelled using six (co)variance matrices, based on combinations of two creation methods and three modifications. Nineteen mixed linear models were used per trait: one with genomic effects only, six with metagenomic effects, six combining genomic and metagenomic effects, and six adding interaction effects. Two cross-validation schemes were applied to evaluate prediction accuracy: fourfold cross-validation balanced for diet type with 5 replicates and leave-one-diet-out cross-validation, where three diets served as training and the fourth as testing. Prediction accuracy was measured as the correlation between an animal’s summed random effects and its adjusted phenotype. Although minimal, differences existed in parameter estimates and validation accuracy depending on how the metagenome effect was modelled. Median phenotype prediction accuracy ranged from −0.01 to 0.28. No specific set of model characteristics consistently lead to the highest accuracies. Models which combined genome and metagenome data outperformed those using either data source alone. Models where the rumen metagenome (co)variances matrix was scaled within each diet composition generally led to lower prediction accuracies in this study. The rumen metagenome can explain a significant proportion of variation in beef cattle feed efficiency traits. Those traits can also be predicted using either host genome or rumen metagenome, though using both sources of information proved more accurate. Multiple methods of forming the metagenome (co)variance matrix can lead to similar prediction accuracies.
宿主基因组和瘤胃宏基因组数据可以预测饲料效率性状,支持管理决策和提高盈利能力。本研究估计了肉牛瘤胃宏基因组解释的平均日干物质采食量和平均日增重的变化比例,评估了使用基因组数据、宏基因组数据或它们的组合预测的准确性,并探索了建立瘤胃宏基因组模型的方法,以提高表型预测的准确性。对717只饲喂4种日粮(2种以精料为基础,2种以饲料为基础)的动物的数据进行了分析。动物基因型包括749,922个输入的序列变异,而宏基因组数据包括来自瘤胃微生物群的16,583个开放阅读框。基于两种创建方法和三种修改方法的组合,使用六个(co)方差矩阵对宏基因组进行建模。每个性状使用了19个混合线性模型:1个仅具有基因组效应,6个具有宏基因组效应,6个基因组与宏基因组效应相结合,6个添加相互作用效应。采用两种交叉验证方案评估预测准确性:四重交叉验证平衡饮食类型,5个重复;留一种饮食交叉验证,其中三种饮食作为训练,第四种作为测试。预测准确性是通过动物随机效应的总和与其调整后的表型之间的相关性来衡量的。虽然最小,但参数估计和验证准确性存在差异,这取决于宏基因组效应如何建模。中位表型预测精度范围为- 0.01 ~ 0.28。没有一组特定的模型特征能始终如一地导致最高的精度。结合基因组和宏基因组数据的模型优于单独使用任何一个数据源的模型。在本研究中,在每种日粮组成中缩放瘤胃宏基因组(co)方差矩阵的模型通常导致较低的预测准确性。瘤胃宏基因组可以解释肉牛饲料效率性状的很大一部分变异。这些特征也可以用宿主基因组或瘤胃宏基因组来预测,尽管使用这两种信息来源被证明更准确。形成宏基因组(co)方差矩阵的多种方法可以导致相似的预测精度。
{"title":"Prediction accuracy for feed intake and body weight gain using host genomic and rumen metagenomic data in beef cattle","authors":"Andrew Lakamp, Seidu Adams, Larry Kuehn, Warren Snelling, James Wells, Kristin Hales, Bryan Neville, Samodha Fernando, Matthew L. Spangler","doi":"10.1186/s12711-025-01007-8","DOIUrl":"https://doi.org/10.1186/s12711-025-01007-8","url":null,"abstract":"Host genomic and rumen metagenome data can predict feed efficiency traits, supporting management decisions and increasing profitability. This study estimated the proportion of variation of average daily dry matter intake and average daily gain explained by the rumen metagenome in beef cattle, evaluated prediction accuracy using genomic data, metagenomic data, or their combination, and explored methods for modelling the rumen metagenome to improve phenotypic prediction accuracy. Data from 717 animals on four diets (two concentrate-based and two forage-based) were analyzed. Animal genotypes consisted of 749,922 imputed sequence variants, while metagenomic data comprised 16,583 open reading frames from ruminal microbiota. The metagenome was modelled using six (co)variance matrices, based on combinations of two creation methods and three modifications. Nineteen mixed linear models were used per trait: one with genomic effects only, six with metagenomic effects, six combining genomic and metagenomic effects, and six adding interaction effects. Two cross-validation schemes were applied to evaluate prediction accuracy: fourfold cross-validation balanced for diet type with 5 replicates and leave-one-diet-out cross-validation, where three diets served as training and the fourth as testing. Prediction accuracy was measured as the correlation between an animal’s summed random effects and its adjusted phenotype. Although minimal, differences existed in parameter estimates and validation accuracy depending on how the metagenome effect was modelled. Median phenotype prediction accuracy ranged from −0.01 to 0.28. No specific set of model characteristics consistently lead to the highest accuracies. Models which combined genome and metagenome data outperformed those using either data source alone. Models where the rumen metagenome (co)variances matrix was scaled within each diet composition generally led to lower prediction accuracies in this study. The rumen metagenome can explain a significant proportion of variation in beef cattle feed efficiency traits. Those traits can also be predicted using either host genome or rumen metagenome, though using both sources of information proved more accurate. Multiple methods of forming the metagenome (co)variance matrix can lead to similar prediction accuracies.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"17 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2025-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145396846","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Estimation of (co)variance components for very large datasets and complex single-step genomic models 估计(co)方差成分的非常大的数据集和复杂的单步基因组模型
IF 4.1 1区 农林科学 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE Pub Date : 2025-10-30 DOI: 10.1186/s12711-025-01006-9
Matias Bermann, Andres Legarra, Ignacio Aguilar, Alejandra Alvarez-Munera, Ignacy Misztal, Daniela Lourenco
Variance components of linear mixed models should be estimated with all the data and information available for a specific statistical model to avoid bias. Due to computational limitations, the estimation for large datasets or complex models is often carried out by subsetting the data, removing genomic information, or simplifying the statistical model. Monte Carlo REML (MC-REML) is a method developed to lift computational limitations, but so far there was no extension for genomic information under the single-step genomic methods. In this study, we extended MC-REML to include large genomic information. We developed a method to estimate variance components named Monte Carlo single-step genomic REML (MC-ss-GREML). The core of the method includes repeatedly simulating breeding values under a ssGBLUP model and solving the mixed model equations to approximate traces involving prediction error variances. The REML optimization strategies include Expectation Maximization and Average Information. We tested the accuracy of MC-ss-GREML with a three-trait growth model in beef cattle with maternal effects, with 14 parameters to estimate. The data set had 100,000 animals in the pedigree, of which about 33,000 had records, and 10,000 were genotyped. There were no differences in estimates between MC-ss-GREML and ss-GREML with the exact calculation of the traces (exact ss-GREML). MC-ss-GREML took 14% of the computing time and used 1% of the memory compared to the exact ss-GREML. We tested the computing performance of MC-ss-GREML by estimating variance components in a birth weight model, with much larger data that included 7 million animals in the pedigree, from which 330,000 were genotyped. The estimation converged in 11 rounds and took 121 h, with a peak memory usage of 53 Gb. The new method, MC-ss-GREML, can estimate variance components with large datasets including many genotyped individuals, at affordable time and memory costs.
线性混合模型的方差成分应该用特定统计模型的所有数据和信息来估计,以避免偏差。由于计算的限制,对于大型数据集或复杂模型的估计通常是通过细分数据、去除基因组信息或简化统计模型来进行的。Monte Carlo REML (MC-REML)是一种消除计算限制的方法,但目前在单步基因组方法下还没有对基因组信息的扩展。在这项研究中,我们扩展了MC-REML以包含大的基因组信息。我们开发了一种估算方差分量的方法,称为蒙特卡罗单步基因组REML (MC-ss-GREML)。该方法的核心是在ssGBLUP模型下反复模拟育种值,并求解混合模型方程来逼近涉及预测误差方差的迹线。REML优化策略包括期望最大化和平均信息。我们以具有母系效应的肉牛为研究对象,采用三性状生长模型对MC-ss-GREML的准确性进行了检验,模型中有14个参数需要估计。该数据集有10万只动物的家谱,其中约3.3万只有记录,1万只进行了基因分型。MC-ss-GREML和ss-GREML之间的估计与轨迹的精确计算(精确ss-GREML)没有差异。与精确的ss-GREML相比,MC-ss-GREML花费了14%的计算时间和1%的内存。我们通过估算出生体重模型中的方差成分来测试MC-ss-GREML的计算性能,该模型使用了更大的数据,包括家谱中的700万只动物,其中33万只进行了基因分型。估计在11轮中收敛,耗时121小时,峰值内存使用量为53 Gb。新方法MC-ss-GREML可以在可承受的时间和内存成本下估算包含许多基因型个体的大型数据集的方差成分。
{"title":"Estimation of (co)variance components for very large datasets and complex single-step genomic models","authors":"Matias Bermann, Andres Legarra, Ignacio Aguilar, Alejandra Alvarez-Munera, Ignacy Misztal, Daniela Lourenco","doi":"10.1186/s12711-025-01006-9","DOIUrl":"https://doi.org/10.1186/s12711-025-01006-9","url":null,"abstract":"Variance components of linear mixed models should be estimated with all the data and information available for a specific statistical model to avoid bias. Due to computational limitations, the estimation for large datasets or complex models is often carried out by subsetting the data, removing genomic information, or simplifying the statistical model. Monte Carlo REML (MC-REML) is a method developed to lift computational limitations, but so far there was no extension for genomic information under the single-step genomic methods. In this study, we extended MC-REML to include large genomic information. We developed a method to estimate variance components named Monte Carlo single-step genomic REML (MC-ss-GREML). The core of the method includes repeatedly simulating breeding values under a ssGBLUP model and solving the mixed model equations to approximate traces involving prediction error variances. The REML optimization strategies include Expectation Maximization and Average Information. We tested the accuracy of MC-ss-GREML with a three-trait growth model in beef cattle with maternal effects, with 14 parameters to estimate. The data set had 100,000 animals in the pedigree, of which about 33,000 had records, and 10,000 were genotyped. There were no differences in estimates between MC-ss-GREML and ss-GREML with the exact calculation of the traces (exact ss-GREML). MC-ss-GREML took 14% of the computing time and used 1% of the memory compared to the exact ss-GREML. We tested the computing performance of MC-ss-GREML by estimating variance components in a birth weight model, with much larger data that included 7 million animals in the pedigree, from which 330,000 were genotyped. The estimation converged in 11 rounds and took 121 h, with a peak memory usage of 53 Gb. The new method, MC-ss-GREML, can estimate variance components with large datasets including many genotyped individuals, at affordable time and memory costs.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"8 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2025-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145396777","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Incorporating transcriptomic data into genomic prediction models to improve the prediction accuracy of phenotypes of efficiency traits. 将转录组学数据纳入基因组预测模型,提高效率性状表型预测的准确性。
IF 4.1 1区 农林科学 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE Pub Date : 2025-10-23 DOI: 10.1186/s12711-025-01008-7
Valentin P Haas,Robin Wellmann,Pascal Duenk,Michael Oster,Siriluck Ponsuksili,Jörn Bennewitz,Mario P L Calus
BACKGROUNDSince genomic selection has been established in animal breeding, attention has turned towards other omics layers that are seen as promising to improve prediction accuracy. Transcriptomic data provide insights into gene expression patterns, which are shaped by both genetic and environmental factors, offering a more comprehensive understanding of the expression of phenotypes. This study utilized various statistical methods to assess the applicability of transcriptomic data derived from intestinal tissue to the prediction of efficiency-related phenotypes. The focus was on formal derivation of the previously described GTCBLUP model, which was adapted to create GTCBLUPi and compared with other BLUP models. The GTCBLUPi model addresses redundant information between genomic and transcriptomic information. We compared estimated variance components and accuracies of prediction of phenotypes for efficiency-related traits in an F2 cross of 480 Japanese quail using different models. Additionally, we estimated transcriptomic correlations between the traits using animal effects based on transcriptomic similarity, and the effects of individual transcript abundances on the phenotypes.RESULTSThis study showed that transcript abundances from the ileum explain a larger portion of the phenotypic variance of the traits than host genetics. Models incorporating both genetic and transcriptomic information outperformed those using only one type of information, with regard to the phenotypic variances explained. The combination of both data types resulted in higher trait prediction accuracies, confirming that transcriptomic information complements genetic data effectively. The derived GTCBLUPi model proved to be a suitable framework for integrating both information types. Additionally, polygenic backgrounds were identified for the traits studied based on transcriptomic profiles, along with high transcriptomic correlations between the traits.CONCLUSIONSTranscriptomic data account for a high portion of phenotypic expression for all phenotypes and incorporating them enables more accurate predictions of phenotypes for efficiency and performance traits. Models that integrate both genetic and transcriptomic information are the most effective, offering valuable insights for improving phenotype prediction accuracy and insights in biological mechanisms underlying phenotypic variation of traits.
自从基因组选择在动物育种中建立以来,人们的注意力转向了其他有望提高预测准确性的组学层。转录组学数据提供了对基因表达模式的见解,这是由遗传和环境因素塑造的,提供了对表型表达的更全面的理解。本研究利用各种统计方法来评估来自肠道组织的转录组学数据在预测效率相关表型方面的适用性。重点是前面描述的GTCBLUP模型的形式化派生,该模型被用于创建GTCBLUPi,并与其他BLUP模型进行比较。GTCBLUPi模型解决了基因组和转录组信息之间的冗余信息。我们比较了480只日本鹌鹑F2杂交中效率相关性状的估计方差成分和预测精度。此外,我们利用基于转录组相似性的动物效应估计了性状之间的转录组相关性,以及个体转录丰度对表型的影响。结果本研究表明,来自回肠的转录物丰度比宿主遗传更能解释性状的表型变异。结合遗传和转录组信息的模型在解释表型差异方面优于仅使用一种信息的模型。两种数据类型的结合导致更高的性状预测准确性,证实转录组信息有效地补充了遗传数据。推导的GTCBLUPi模型被证明是集成这两种信息类型的合适框架。此外,基于转录组谱鉴定了所研究性状的多基因背景,以及性状之间的高转录组相关性。结论转录组学数据占所有表型的很大一部分表型表达,结合它们可以更准确地预测效率和性能性状的表型。整合遗传和转录组信息的模型是最有效的,为提高表型预测准确性和了解性状表型变异的生物学机制提供了有价值的见解。
{"title":"Incorporating transcriptomic data into genomic prediction models to improve the prediction accuracy of phenotypes of efficiency traits.","authors":"Valentin P Haas,Robin Wellmann,Pascal Duenk,Michael Oster,Siriluck Ponsuksili,Jörn Bennewitz,Mario P L Calus","doi":"10.1186/s12711-025-01008-7","DOIUrl":"https://doi.org/10.1186/s12711-025-01008-7","url":null,"abstract":"BACKGROUNDSince genomic selection has been established in animal breeding, attention has turned towards other omics layers that are seen as promising to improve prediction accuracy. Transcriptomic data provide insights into gene expression patterns, which are shaped by both genetic and environmental factors, offering a more comprehensive understanding of the expression of phenotypes. This study utilized various statistical methods to assess the applicability of transcriptomic data derived from intestinal tissue to the prediction of efficiency-related phenotypes. The focus was on formal derivation of the previously described GTCBLUP model, which was adapted to create GTCBLUPi and compared with other BLUP models. The GTCBLUPi model addresses redundant information between genomic and transcriptomic information. We compared estimated variance components and accuracies of prediction of phenotypes for efficiency-related traits in an F2 cross of 480 Japanese quail using different models. Additionally, we estimated transcriptomic correlations between the traits using animal effects based on transcriptomic similarity, and the effects of individual transcript abundances on the phenotypes.RESULTSThis study showed that transcript abundances from the ileum explain a larger portion of the phenotypic variance of the traits than host genetics. Models incorporating both genetic and transcriptomic information outperformed those using only one type of information, with regard to the phenotypic variances explained. The combination of both data types resulted in higher trait prediction accuracies, confirming that transcriptomic information complements genetic data effectively. The derived GTCBLUPi model proved to be a suitable framework for integrating both information types. Additionally, polygenic backgrounds were identified for the traits studied based on transcriptomic profiles, along with high transcriptomic correlations between the traits.CONCLUSIONSTranscriptomic data account for a high portion of phenotypic expression for all phenotypes and incorporating them enables more accurate predictions of phenotypes for efficiency and performance traits. Models that integrate both genetic and transcriptomic information are the most effective, offering valuable insights for improving phenotype prediction accuracy and insights in biological mechanisms underlying phenotypic variation of traits.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"140 1","pages":"59"},"PeriodicalIF":4.1,"publicationDate":"2025-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145351800","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A novel frameshift variant in ALS2 associated with segmental axonopathy in Merino sheep. 一种与美利奴羊节段性轴索病相关的ALS2新移码变异。
IF 4.1 1区 农林科学 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE Pub Date : 2025-10-23 DOI: 10.1186/s12711-025-01005-w
Katie L M Eager,Robert D Jolly,Leah Manning,Cali E Willet,Russell G Snell,Klaus Lehnert,Natasha E Mckean,Nick W Sneddon,Brendon A O'Rourke,Keren E Dittmer,Imke Tammen,Matt Littlejohn
BACKGROUNDSegmental axonopathy is a recessively inherited neurodegenerative disorder that has affected Merino sheep since the early 1930s. Despite its long-standing recognition, the genetic basis of the condition remained unknown. This study aimed to identify the genetic cause of segmental axonopathy and confirm its pathological features to improve diagnostic accuracy and inform breeding strategies.RESULTSWhole genome sequencing and genotyping of affected and unaffected Merino sheep identified a novel homozygous frameshift variant in the ALS2 gene that segregated with disease. RNA sequencing of cerebellar peduncle tissue confirmed the nonsense consequence on the ALS2 transcript. Histological analysis highlighted the hallmarks of the disease as large, foamy eosinophilic axonal swellings predominantly in the trigeminal ganglia, with additional degenerative changes in both the brain and spinal cord. These findings support the value of targeted sampling of sensory roots of the trigeminal nerve, spinal cord tracts, and dorsal nerve rootlets to enhance diagnostic accuracy. The same ALS2 variant was found across multiple unrelated flocks in both Australia and New Zealand, indicating a broader presence within the fine-wool Merino sheep population.CONCLUSIONSThis study identifies a novel ALS2 frameshift variant associated with segmental axonopathy in Merino sheep and provides both genetic and histological evidence supporting its role in disease pathology. The development of a DNA diagnostic test will enable more informed breeding decisions, reduce the prevalence of this condition, and improve animal welfare and productivity in the Merino industry. Moreover, the findings offer a potential large-animal model for exploring early-onset forms of human motor neuron diseases, including amyotrophic lateral sclerosis, in which ALS2 variants are implicated.
背景:节段性轴索病是一种隐性遗传的神经退行性疾病,自20世纪30年代初以来一直影响着美利奴羊。尽管人们早就认识到这一点,但这种疾病的遗传基础仍然未知。本研究旨在确定节段性轴索病的遗传原因,确认其病理特征,以提高诊断准确性,为育种策略提供依据。结果对患病和美利奴羊和未患病美利奴羊进行全基因组测序和基因分型,鉴定出一种新的ALS2基因纯合子移码变异。小脑脚组织的RNA测序证实了对ALS2转录本的无意义后果。组织学分析强调该疾病的特征为主要发生在三叉神经节的巨大泡沫性嗜酸性轴突肿胀,并伴有脑和脊髓的其他退行性改变。这些发现支持了三叉神经感觉神经根、脊髓束和背神经根的定向取样对提高诊断准确性的价值。在澳大利亚和新西兰的多个不相关的羊群中发现了相同的ALS2变异,表明在细毛美利奴羊种群中存在更广泛。结论本研究在美利奴羊中发现了一种新的与节段性轴索病相关的ALS2移胞体变异,并为其在疾病病理中的作用提供了遗传和组织学证据。DNA诊断测试的发展将使育种决策更加明智,减少这种情况的流行,并提高美利奴产业的动物福利和生产力。此外,这些发现为探索人类运动神经元疾病的早发形式提供了一个潜在的大型动物模型,包括肌萎缩侧索硬化症,其中涉及ALS2变异。
{"title":"A novel frameshift variant in ALS2 associated with segmental axonopathy in Merino sheep.","authors":"Katie L M Eager,Robert D Jolly,Leah Manning,Cali E Willet,Russell G Snell,Klaus Lehnert,Natasha E Mckean,Nick W Sneddon,Brendon A O'Rourke,Keren E Dittmer,Imke Tammen,Matt Littlejohn","doi":"10.1186/s12711-025-01005-w","DOIUrl":"https://doi.org/10.1186/s12711-025-01005-w","url":null,"abstract":"BACKGROUNDSegmental axonopathy is a recessively inherited neurodegenerative disorder that has affected Merino sheep since the early 1930s. Despite its long-standing recognition, the genetic basis of the condition remained unknown. This study aimed to identify the genetic cause of segmental axonopathy and confirm its pathological features to improve diagnostic accuracy and inform breeding strategies.RESULTSWhole genome sequencing and genotyping of affected and unaffected Merino sheep identified a novel homozygous frameshift variant in the ALS2 gene that segregated with disease. RNA sequencing of cerebellar peduncle tissue confirmed the nonsense consequence on the ALS2 transcript. Histological analysis highlighted the hallmarks of the disease as large, foamy eosinophilic axonal swellings predominantly in the trigeminal ganglia, with additional degenerative changes in both the brain and spinal cord. These findings support the value of targeted sampling of sensory roots of the trigeminal nerve, spinal cord tracts, and dorsal nerve rootlets to enhance diagnostic accuracy. The same ALS2 variant was found across multiple unrelated flocks in both Australia and New Zealand, indicating a broader presence within the fine-wool Merino sheep population.CONCLUSIONSThis study identifies a novel ALS2 frameshift variant associated with segmental axonopathy in Merino sheep and provides both genetic and histological evidence supporting its role in disease pathology. The development of a DNA diagnostic test will enable more informed breeding decisions, reduce the prevalence of this condition, and improve animal welfare and productivity in the Merino industry. Moreover, the findings offer a potential large-animal model for exploring early-onset forms of human motor neuron diseases, including amyotrophic lateral sclerosis, in which ALS2 variants are implicated.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"11 1","pages":"60"},"PeriodicalIF":4.1,"publicationDate":"2025-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145351789","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Application of a French cattle pangenome, from structural variant discovery to association studies on key phenotypes. 法国牛泛基因组的应用,从结构变异发现到关键表型的关联研究。
IF 3.1 1区 农林科学 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE Pub Date : 2025-10-23 DOI: 10.1186/s12711-025-01012-x
Valentin Sorin, Maulana Mughitz Naji, Clément Birbes, Cécile Grohs, Clémentine Escouflaire, Sébastien Fritz, Camille Eché, Camille Marcuzzo, Amandine Suin, Cécile Donnadieu, Christine Gaspin, Carole Iampietro, Denis Milan, Laurence Drouilhet, Gwenola Tosser-Klopp, Didier Boichard, Christophe Klopp, Marie-Pierre Sanchez, Mekki Boussaha

Background: The current cattle reference genome assembly, a pseudo-linear sequence produced using sequences from a single Hereford cow, represents a limitation when performing genetic studies, especially when investigating the whole spectrum of genetic variations within the species. Detecting structural variations (SVs) poses significant challenges when relying solely on conventional methods of sequencing read mapping to the current bovine genome assembly.

Results: In this study, we used long-reads (LR) and bioinformatic tools to construct a comprehensive bovine pangenome, using as a backbone the Hereford ARS-UCD1.2 reference genome assembly, and incorporating genetic diversity of 64 good quality de novo genome assemblies representing 14 French dairy and beef cattle breeds. Using a combination of complementary approaches, we explored the pangenome graph and identified 2.563 Gb of sequences common to all samples, and cumulated 0.295 Gb of variable sequences. Notably, we discovered 0.159 Gb of novel sequences not present in the current reference genome assembly. Our analysis also revealed 109,275 SVs, of which 84,612 were bi-allelic. These included 27,171 insertions and 24,592 deletions, while the remaining 32,849 SVs corresponded to alternate allele sequences defined as sequence substitutions between the reference genome and the sample sequence. Genome-wide association studies using SNPs and a panel of 221 SVs, shared between the pangenome and the EuroGMD chip, revealed well-known QTLs across the genome for the Holstein, Montbéliarde and Normande breeds. Among those, a QTL on chromosome 11 presents an SV with a highly significant effect on stature in the Holstein breed. This SV is a 6.2 kb deletion affecting the 5'UTR, first exon and part of the first intron of the MATN3 gene, suggesting a potential regulatory and coding effect.

Conclusions: Our study provides new insights into the genetic diversity of 14 French dairy and beef breeds and highlights the utility of pangenome graphs in capturing structural variation. The identified SV associated with stature highlights the importance of integrating SVs into GWAS for a more comprehensive understanding of complex traits.

背景:目前的牛参考基因组组合是使用来自赫里福德牛的序列产生的伪线性序列,在进行遗传研究时具有局限性,特别是在调查物种内遗传变异的全谱时。检测结构变异(SVs)提出了重大挑战,当仅仅依赖于传统的测序方法读图到当前的牛基因组组装。结果:本研究以herford ARS-UCD1.2参考基因组序列为骨架,结合代表14个法国乳牛和肉牛品种的64个优质从头基因组序列的遗传多样性,利用长reads (LR)和生物信息学工具构建了一个完整的牛泛基因组序列。利用互补互补的方法,我们对泛基因组图谱进行了探索,鉴定出所有样本共有的序列为2.563 Gb,累积的可变序列为0.295 Gb。值得注意的是,我们发现了0.159 Gb当前参考基因组中不存在的新序列。我们的分析还发现了109,275个sv,其中84,612个是双等位基因。其中包括27,171个插入和24,592个缺失,而剩余的32,849个SVs对应于替代等位基因序列,定义为参考基因组和样本序列之间的序列替换。利用泛基因组和EuroGMD芯片共享的单核苷酸多态性和221个SVs进行全基因组关联研究,揭示了荷尔斯坦、蒙巴姆比亚德和诺曼德品种基因组中众所周知的qtl。其中,11号染色体上的一个QTL对荷斯坦品种的身高有极显著影响。该SV是一个6.2 kb的缺失,影响MATN3基因的5'UTR,第一外显子和第一个内含子的一部分,表明潜在的调节和编码作用。结论:我们的研究为14个法国奶牛和牛肉品种的遗传多样性提供了新的见解,并强调了泛基因组图在捕获结构变异方面的实用性。已确定的与身高相关的SV突出了将SV整合到GWAS中对于更全面地理解复杂性状的重要性。
{"title":"Application of a French cattle pangenome, from structural variant discovery to association studies on key phenotypes.","authors":"Valentin Sorin, Maulana Mughitz Naji, Clément Birbes, Cécile Grohs, Clémentine Escouflaire, Sébastien Fritz, Camille Eché, Camille Marcuzzo, Amandine Suin, Cécile Donnadieu, Christine Gaspin, Carole Iampietro, Denis Milan, Laurence Drouilhet, Gwenola Tosser-Klopp, Didier Boichard, Christophe Klopp, Marie-Pierre Sanchez, Mekki Boussaha","doi":"10.1186/s12711-025-01012-x","DOIUrl":"10.1186/s12711-025-01012-x","url":null,"abstract":"<p><strong>Background: </strong>The current cattle reference genome assembly, a pseudo-linear sequence produced using sequences from a single Hereford cow, represents a limitation when performing genetic studies, especially when investigating the whole spectrum of genetic variations within the species. Detecting structural variations (SVs) poses significant challenges when relying solely on conventional methods of sequencing read mapping to the current bovine genome assembly.</p><p><strong>Results: </strong>In this study, we used long-reads (LR) and bioinformatic tools to construct a comprehensive bovine pangenome, using as a backbone the Hereford ARS-UCD1.2 reference genome assembly, and incorporating genetic diversity of 64 good quality de novo genome assemblies representing 14 French dairy and beef cattle breeds. Using a combination of complementary approaches, we explored the pangenome graph and identified 2.563 Gb of sequences common to all samples, and cumulated 0.295 Gb of variable sequences. Notably, we discovered 0.159 Gb of novel sequences not present in the current reference genome assembly. Our analysis also revealed 109,275 SVs, of which 84,612 were bi-allelic. These included 27,171 insertions and 24,592 deletions, while the remaining 32,849 SVs corresponded to alternate allele sequences defined as sequence substitutions between the reference genome and the sample sequence. Genome-wide association studies using SNPs and a panel of 221 SVs, shared between the pangenome and the EuroGMD chip, revealed well-known QTLs across the genome for the Holstein, Montbéliarde and Normande breeds. Among those, a QTL on chromosome 11 presents an SV with a highly significant effect on stature in the Holstein breed. This SV is a 6.2 kb deletion affecting the 5'UTR, first exon and part of the first intron of the MATN3 gene, suggesting a potential regulatory and coding effect.</p><p><strong>Conclusions: </strong>Our study provides new insights into the genetic diversity of 14 French dairy and beef breeds and highlights the utility of pangenome graphs in capturing structural variation. The identified SV associated with stature highlights the importance of integrating SVs into GWAS for a more comprehensive understanding of complex traits.</p>","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"57 1","pages":"61"},"PeriodicalIF":3.1,"publicationDate":"2025-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12551211/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145356826","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The impact of haplotypes derived from Chinese pigs on genetic variation and economic traits in the Duroc breed. 中国猪单倍型对杜洛克品种遗传变异和经济性状的影响。
IF 4.1 1区 农林科学 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE Pub Date : 2025-10-23 DOI: 10.1186/s12711-025-01010-z
Suyun Fang,Chao Guo,Hang Liu,Yuzhan Wang,Cheng Tan,Zhenfang Wu,Yiqiang Zhao,Xiaoxiang Hu,Ruifei Yang
BACKGROUNDThe historical importation of Chinese pigs into Western countries has facilitated the introduction of Chinese haplotypes into European pig breeds, thereby shaping their genetic diversity and phenotypic traits. However, the genetic and biological implications of this introgression remain poorly understood.RESULTSBased on SNP chip and resequencing data, we confirmed significant genetic introgression from Chinese pigs into commercial European lines. The genetic origins of the introgressed segments predominantly derive from Southern Chinese domestic pigs (CSDP), with additional contributions from other populations, such as Eastern Chinese domestic pigs (CEDP). Our study demonstrates that the selection pressure for Chinese pig introgression was stronger in Duroc pigs compared to the Large White and Landrace breeds. Based on ancestral haplotypes from CEDP and CSDP, we conducted a genome-wide association study (GWAS) and identified 10 quantitative trait loci (QTLs), five of which were not identified in previous studies or using SNPs. Expression genome-wide association studies (eGWAS) based on these introgressed haplotypes, using gene expression profiles from the duodenum, liver, and muscle tissues in the Duroc population, revealed eGWAS signals that were enriched near transcript start sites. By integrating GWAS signals for loin muscle depth with eGWAS signals in muscle tissue, we confirmed that a region 300 Kb from TAF11, which is enriched with open chromatin regions and encompasses a super-enhancer located within the same topologically associating domain as TAF11, was associated with both TAF11 expression and loin muscle depth, highlighting the profound influence of Chinese introgression.CONCLUSIONSThese findings offer valuable insights into the genetic influences of Chinese pig introgression on the Duroc breed, as well as the molecular basis for its effects on economically important traits in Duroc pigs.
中国猪向西方国家的历史输入促进了中国单倍型向欧洲猪品种的引入,从而塑造了它们的遗传多样性和表型性状。然而,这种基因渗入的遗传和生物学意义仍然知之甚少。结果基于SNP芯片和重测序数据,我们证实了中国猪向欧洲商业品系的显著遗传渗入。渐渗片段的遗传来源主要来自中国南方家猪(CSDP),其他群体如中国东部家猪(CEDP)也有额外的贡献。我们的研究表明,与大白猪和长白猪品种相比,杜洛克猪的中国猪遗传渗入的选择压力更大。基于来自CEDP和CSDP的祖先单倍型,我们进行了全基因组关联研究(GWAS),并鉴定了10个数量性状位点(qtl),其中5个在以前的研究中未被发现或使用snp。基于这些渗入单倍型的表达全基因组关联研究(eGWAS),利用杜洛克种群中十二指肠、肝脏和肌肉组织的基因表达谱,揭示了eGWAS信号在转录起始位点附近富集。通过整合肌肉组织中有关腰肌深度的GWAS信号和eGWAS信号,我们证实了来自TAF11的300 Kb区域与TAF11表达和腰肌深度相关,该区域富含开放染色质区域,并包含与TAF11位于相同拓扑相关结构域的超级增强子,这突出了中国基因渗入的深远影响。结论这些发现为中国猪基因渗入对杜洛克品种的遗传影响以及其对杜洛克猪重要经济性状影响的分子基础提供了有价值的见解。
{"title":"The impact of haplotypes derived from Chinese pigs on genetic variation and economic traits in the Duroc breed.","authors":"Suyun Fang,Chao Guo,Hang Liu,Yuzhan Wang,Cheng Tan,Zhenfang Wu,Yiqiang Zhao,Xiaoxiang Hu,Ruifei Yang","doi":"10.1186/s12711-025-01010-z","DOIUrl":"https://doi.org/10.1186/s12711-025-01010-z","url":null,"abstract":"BACKGROUNDThe historical importation of Chinese pigs into Western countries has facilitated the introduction of Chinese haplotypes into European pig breeds, thereby shaping their genetic diversity and phenotypic traits. However, the genetic and biological implications of this introgression remain poorly understood.RESULTSBased on SNP chip and resequencing data, we confirmed significant genetic introgression from Chinese pigs into commercial European lines. The genetic origins of the introgressed segments predominantly derive from Southern Chinese domestic pigs (CSDP), with additional contributions from other populations, such as Eastern Chinese domestic pigs (CEDP). Our study demonstrates that the selection pressure for Chinese pig introgression was stronger in Duroc pigs compared to the Large White and Landrace breeds. Based on ancestral haplotypes from CEDP and CSDP, we conducted a genome-wide association study (GWAS) and identified 10 quantitative trait loci (QTLs), five of which were not identified in previous studies or using SNPs. Expression genome-wide association studies (eGWAS) based on these introgressed haplotypes, using gene expression profiles from the duodenum, liver, and muscle tissues in the Duroc population, revealed eGWAS signals that were enriched near transcript start sites. By integrating GWAS signals for loin muscle depth with eGWAS signals in muscle tissue, we confirmed that a region 300 Kb from TAF11, which is enriched with open chromatin regions and encompasses a super-enhancer located within the same topologically associating domain as TAF11, was associated with both TAF11 expression and loin muscle depth, highlighting the profound influence of Chinese introgression.CONCLUSIONSThese findings offer valuable insights into the genetic influences of Chinese pig introgression on the Duroc breed, as well as the molecular basis for its effects on economically important traits in Duroc pigs.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"42 1","pages":"58"},"PeriodicalIF":4.1,"publicationDate":"2025-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145351821","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Genome landscape and genetic architecture of recombination in domestic goats (Capra hircus) 家山羊(Capra hircus)基因组景观与重组遗传结构
IF 4.1 1区 农林科学 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE Pub Date : 2025-10-16 DOI: 10.1186/s12711-025-01001-0
Alice Etourneau, Rachel Rupp, Bertrand Servin
Recombination is a fundamental biological process, both in participating to the creation of viable gametes and as a driver of genetic diversity. Characterising recombination is therefore of strong interest in breeding populations. In this study, we used ~ 50 K genotyped data and pedigree from two French populations (Alpine and Saanen) of domestic goats (Capra hircus) to build sex-specific recombination maps, and to explore the genetic basis of two recombination phenotypes: genome-wide recombination rate (GRR) and intra-chromosomal shuffling. Sex-specific recombination maps showed higher recombination in males than females for both breeds (Alpine autosomal map size = 35.1 M in males and 30.5 M in females; Saanen map size = 34.0 M in males and 29.0 M in females). Heterochiasmy is particularly notable on small chromosomes, and at both ends of the chromosomes. Yet, no difference in shuffling has been found between populations. Genetic parameters on recombination phenotypes could only be estimated in males, due to lack of data in females. Both phenotypes are significantly heritable (h2 = 0.12 (0.03) for GRR and h2 = 0.034 (0.015) for shuffling, when pooling breeds). GWAS on male GRR identified several significant loci, including RNF212, RNF212B and SSH1, the last one being a novel locus for this phenotype. Correlation of SNP effects between breeds is low for both recombination phenotypes (0.25 for GRR and 0.04 for shuffling), indicating different genetic determinants in the two breeds. This study contributes to the understanding of recombination evolution in ruminants, both between breeds and species.
重组是一个基本的生物学过程,既参与了可存活配子的产生,又作为遗传多样性的驱动因素。因此,在繁殖种群中,对重组进行表征具有浓厚的兴趣。本研究利用法国家山羊(Capra hircus)两个种群(Alpine和Saanen)的~ 50 K基因分型数据和家系,构建了性别特异性重组图谱,并探讨了两种重组表型(全基因组重组率(GRR)和染色体内重组)的遗传基础。性别特异性重组图谱显示,两个品种的雄性重组率均高于雌性(Alpine常染色体图谱大小为雄性35.1 M,雌性30.5 M; Saanen图谱大小为雄性34.0 M,雌性29.0 M)。异交现象在小染色体和染色体两端尤为明显。然而,在不同的人群中,洗牌并没有发现差异。由于缺乏女性的数据,重组表型的遗传参数只能在男性中估计。两种表型均具有显著的可遗传性(混合品种时,GRR的h2 = 0.12(0.03),洗牌的h2 = 0.034(0.015))。GWAS在男性GRR上发现了几个重要的位点,包括RNF212、RNF212B和SSH1,其中SSH1是该表型的新位点。两种重组表型的SNP效应在品种之间的相关性都很低(GRR为0.25,洗牌为0.04),表明两个品种存在不同的遗传决定因素。该研究有助于理解反刍动物的重组进化,包括品种和物种之间的重组进化。
{"title":"Genome landscape and genetic architecture of recombination in domestic goats (Capra hircus)","authors":"Alice Etourneau, Rachel Rupp, Bertrand Servin","doi":"10.1186/s12711-025-01001-0","DOIUrl":"https://doi.org/10.1186/s12711-025-01001-0","url":null,"abstract":"Recombination is a fundamental biological process, both in participating to the creation of viable gametes and as a driver of genetic diversity. Characterising recombination is therefore of strong interest in breeding populations. In this study, we used ~ 50 K genotyped data and pedigree from two French populations (Alpine and Saanen) of domestic goats (Capra hircus) to build sex-specific recombination maps, and to explore the genetic basis of two recombination phenotypes: genome-wide recombination rate (GRR) and intra-chromosomal shuffling. Sex-specific recombination maps showed higher recombination in males than females for both breeds (Alpine autosomal map size = 35.1 M in males and 30.5 M in females; Saanen map size = 34.0 M in males and 29.0 M in females). Heterochiasmy is particularly notable on small chromosomes, and at both ends of the chromosomes. Yet, no difference in shuffling has been found between populations. Genetic parameters on recombination phenotypes could only be estimated in males, due to lack of data in females. Both phenotypes are significantly heritable (h2 = 0.12 (0.03) for GRR and h2 = 0.034 (0.015) for shuffling, when pooling breeds). GWAS on male GRR identified several significant loci, including RNF212, RNF212B and SSH1, the last one being a novel locus for this phenotype. Correlation of SNP effects between breeds is low for both recombination phenotypes (0.25 for GRR and 0.04 for shuffling), indicating different genetic determinants in the two breeds. This study contributes to the understanding of recombination evolution in ruminants, both between breeds and species.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"10 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2025-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145295624","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Genetics Selection Evolution
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1