首页 > 最新文献

Genetics Selection Evolution最新文献

英文 中文
A comprehensive atlas of nuclear sequences of mitochondrial origin (NUMT) inserted into the pig genome 插入猪基因组的线粒体来源核序列(NUMT)综合图集
IF 4.1 1区 农林科学 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE Pub Date : 2024-09-16 DOI: 10.1186/s12711-024-00930-6
Matteo Bolner, Samuele Bovo, Mohamad Ballan, Giuseppina Schiavo, Valeria Taurisano, Anisa Ribani, Francesca Bertolini, Luca Fontanesi
The integration of nuclear mitochondrial DNA (mtDNA) into the mammalian genomes is an ongoing, yet rare evolutionary process that produces nuclear sequences of mitochondrial origin (NUMT). In this study, we identified and analysed NUMT inserted into the pig (Sus scrofa) genome and in the genomes of a few other Suinae species. First, we constructed a comparative distribution map of NUMT in the Sscrofa11.1 reference genome and in 22 other assembled S. scrofa genomes (from Asian and European pig breeds and populations), as well as the assembled genomes of the Visayan warty pig (Sus cebifrons) and warthog (Phacochoerus africanus). We then analysed a total of 485 whole genome sequencing datasets, from different breeds, populations, or Sus species, to discover polymorphic NUMT (inserted/deleted in the pig genome). The insertion age was inferred based on the presence or absence of orthologous NUMT in the genomes of different species, taking into account their evolutionary divergence. Additionally, the age of the NUMT was calculated based on sequence degradation compared to the authentic mtDNA sequence. We also validated a selected set of representative NUMT via PCR amplification. We have constructed an atlas of 418 NUMT regions, 70 of which were not present in any assembled genomes. We identified ancient NUMT regions (older than 55 million years ago, Mya) and NUMT that appeared at different time points along the Suinae evolutionary lineage. We identified very recent polymorphic NUMT (private to S. scrofa, with < 1 Mya), and more ancient polymorphic NUMT (3.5–10 Mya) present in various Sus species. These latest polymorphic NUMT regions, which segregate in European and Asian pig breeds and populations, are likely the results of interspecies admixture within the Sus genus. This study provided a first comprehensive analysis of NUMT present in the Sus scrofa genome, comparing them to NUMT found in other species within the order Cetartiodactyla. The NUMT-based evolutionary window that we reconstructed from NUMT integration ages could be useful to better understand the micro-evolutionary events that shaped the modern pig genome and enriched the genetic diversity of this species.
将核线粒体 DNA(mtDNA)整合到哺乳动物基因组中是一个持续但罕见的进化过程,这一过程会产生线粒体来源的核序列(NUMT)。在这项研究中,我们鉴定并分析了插入猪(Sus scrofa)基因组和其他几个蹄目物种基因组中的 NUMT。首先,我们构建了 NUMT 在 Sscrofa11.1 参考基因组、22 个其他已组装的 S. scrofa 基因组(来自亚洲和欧洲的猪种和种群)以及鄢陵疣猪(Sus cebifrons)和疣猪(Phacochoerus africanus)已组装基因组中的比较分布图。然后,我们分析了来自不同品种、种群或 Sus 种类的总共 485 个全基因组测序数据集,以发现多态 NUMT(在猪基因组中插入/删除)。根据不同物种基因组中是否存在同源 NUMT,并考虑到其进化差异,推断出插入年龄。此外,还根据与真实 mtDNA 序列相比的序列退化情况计算了 NUMT 的年龄。我们还通过 PCR 扩增验证了一组具有代表性的 NUMT。我们构建了一个包含 418 个 NUMT 区域的图集,其中 70 个区域不存在于任何已组装的基因组中。我们发现了古老的 NUMT 区域(早于 5,500 万年前,Mya)以及在 Suinae 进化过程中不同时间点出现的 NUMT。我们发现了非常新的多态 NUMT(S. scrofa 特有,小于 1 Mya),以及存在于不同苏氏物种中的更古老的多态 NUMT(3.5-10 Mya)。这些最新的多态 NUMT 区域在欧洲和亚洲的猪种和种群中出现分离,很可能是 Sus 属种间混杂的结果。本研究首次对苏门答腊猪基因组中的 NUMT 进行了全面分析,并将其与鲸目动物中其他物种的 NUMT 进行了比较。我们根据 NUMT 整合年龄重建的基于 NUMT 的进化窗口有助于更好地理解塑造现代猪基因组和丰富该物种遗传多样性的微进化事件。
{"title":"A comprehensive atlas of nuclear sequences of mitochondrial origin (NUMT) inserted into the pig genome","authors":"Matteo Bolner, Samuele Bovo, Mohamad Ballan, Giuseppina Schiavo, Valeria Taurisano, Anisa Ribani, Francesca Bertolini, Luca Fontanesi","doi":"10.1186/s12711-024-00930-6","DOIUrl":"https://doi.org/10.1186/s12711-024-00930-6","url":null,"abstract":"The integration of nuclear mitochondrial DNA (mtDNA) into the mammalian genomes is an ongoing, yet rare evolutionary process that produces nuclear sequences of mitochondrial origin (NUMT). In this study, we identified and analysed NUMT inserted into the pig (Sus scrofa) genome and in the genomes of a few other Suinae species. First, we constructed a comparative distribution map of NUMT in the Sscrofa11.1 reference genome and in 22 other assembled S. scrofa genomes (from Asian and European pig breeds and populations), as well as the assembled genomes of the Visayan warty pig (Sus cebifrons) and warthog (Phacochoerus africanus). We then analysed a total of 485 whole genome sequencing datasets, from different breeds, populations, or Sus species, to discover polymorphic NUMT (inserted/deleted in the pig genome). The insertion age was inferred based on the presence or absence of orthologous NUMT in the genomes of different species, taking into account their evolutionary divergence. Additionally, the age of the NUMT was calculated based on sequence degradation compared to the authentic mtDNA sequence. We also validated a selected set of representative NUMT via PCR amplification. We have constructed an atlas of 418 NUMT regions, 70 of which were not present in any assembled genomes. We identified ancient NUMT regions (older than 55 million years ago, Mya) and NUMT that appeared at different time points along the Suinae evolutionary lineage. We identified very recent polymorphic NUMT (private to S. scrofa, with < 1 Mya), and more ancient polymorphic NUMT (3.5–10 Mya) present in various Sus species. These latest polymorphic NUMT regions, which segregate in European and Asian pig breeds and populations, are likely the results of interspecies admixture within the Sus genus. This study provided a first comprehensive analysis of NUMT present in the Sus scrofa genome, comparing them to NUMT found in other species within the order Cetartiodactyla. The NUMT-based evolutionary window that we reconstructed from NUMT integration ages could be useful to better understand the micro-evolutionary events that shaped the modern pig genome and enriched the genetic diversity of this species.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"12 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142234457","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mitochondrial sequence variants: testing imputation accuracy and their association with dairy cattle milk traits 线粒体序列变异:测试估算的准确性及其与奶牛牛奶性状的关系
IF 4.1 1区 农林科学 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE Pub Date : 2024-09-12 DOI: 10.1186/s12711-024-00931-5
Jigme Dorji, Amanda J. Chamberlain, Coralie M. Reich, Christy J. VanderJagt, Tuan V. Nguyen, Hans D. Daetwyler, Iona M. MacLeod
Mitochondrial genomes differ from the nuclear genome and in humans it is known that mitochondrial variants contribute to genetic disorders. Prior to genomics, some livestock studies assessed the role of the mitochondrial genome but these were limited and inconclusive. Modern genome sequencing provides an opportunity to re-evaluate the potential impact of mitochondrial variation on livestock traits. This study first evaluated the empirical accuracy of mitochondrial sequence imputation and then used real and imputed mitochondrial sequence genotypes to study the role of mitochondrial variants on milk production traits of dairy cattle. The empirical accuracy of imputation from Single Nucleotide Polymorphism (SNP) panels to mitochondrial sequence genotypes was assessed in 516 test animals of Holstein, Jersey and Red breeds using Beagle software and a sequence reference of 1883 animals. The overall accuracy estimated as the Pearson’s correlation squared (R2) between all imputed and real genotypes across all animals was 0.454. The low accuracy was attributed partly to the majority of variants having low minor allele frequency (MAF < 0.005) but also due to variants in the hypervariable D-loop region showing poor imputation accuracy. Beagle software provides an internal estimate of imputation accuracy (DR2), and 10 percent of the total 1927 imputed positions showed DR2 greater than 0.9 (N = 201). There were 151 sites with empirical R2 > 0.9 (of 954 variants segregating in the test animals) and 138 of these overlapped the sites with DR2 > 0.9. This suggests that the DR2 statistic is a reasonable proxy to select sites that are imputed with higher accuracy for downstream analyses. Accordingly, in the second part of the study mitochondrial sequence variants were imputed from real mitochondrial SNP panel genotypes of 9515 Australian Holstein, Jersey and Red dairy cattle. Then, using only sites with DR2 > 0.900 and real genotypes, we undertook a genome-wide association study (GWAS) for milk, fat and protein yields. The GWAS mitochondrial SNP effects were not significant. The accuracy of imputation of mitochondrial genotypes from the SNP panel to sequence was generally low. The Beagle DR2 statistic enabled selection of sites imputed with higher empirical accuracy. We recommend building larger reference populations with mitochondrial sequence to improve the accuracy of imputing less common variants and ensuring that SNP panels include common variants in the D-loop region.
线粒体基因组不同于核基因组,在人类中,线粒体变异导致了遗传疾病。在基因组学出现之前,一些家畜研究对线粒体基因组的作用进行了评估,但评估结果有限,而且没有定论。现代基因组测序技术为重新评估线粒体变异对家畜性状的潜在影响提供了机会。本研究首先评估了线粒体序列估算的经验准确性,然后使用真实和估算的线粒体序列基因型研究线粒体变异对奶牛产奶性状的作用。使用 Beagle 软件和 1883 头动物的序列参照,对 516 头荷斯坦、娟珊和红种的测试动物进行了评估,结果表明,从单核苷酸多态性(SNP)面板到线粒体序列基因型的推算经验准确性很高。根据所有动物的所有估算基因型与真实基因型之间的皮尔逊相关平方(R2)估算,总体准确度为 0.454。准确率低的部分原因是大多数变异的小等位基因频率(MAF 0.9)较低(在测试动物中分离出 954 个变异),其中 138 个与 DR2 > 0.9 的位点重叠。这表明,DR2 统计量是一个合理的替代指标,可用于为下游分析选择更准确的估算位点。因此,在研究的第二部分,从 9515 头澳大利亚荷斯坦牛、娟珊牛和红奶牛的真实线粒体 SNP 面板基因型中推算线粒体序列变异。然后,我们仅使用 DR2 > 0.900 的位点和真实基因型,对牛奶、脂肪和蛋白质产量进行了全基因组关联研究(GWAS)。GWAS 的线粒体 SNP 影响并不显著。从 SNP 面板到序列的线粒体基因型估算准确率普遍较低。使用 Beagle DR2 统计量可以选择经验准确性较高的归因位点。我们建议利用线粒体序列建立更大的参考群体,以提高较不常见变异的归因准确性,并确保 SNP 面板包括 D 环区域的常见变异。
{"title":"Mitochondrial sequence variants: testing imputation accuracy and their association with dairy cattle milk traits","authors":"Jigme Dorji, Amanda J. Chamberlain, Coralie M. Reich, Christy J. VanderJagt, Tuan V. Nguyen, Hans D. Daetwyler, Iona M. MacLeod","doi":"10.1186/s12711-024-00931-5","DOIUrl":"https://doi.org/10.1186/s12711-024-00931-5","url":null,"abstract":"Mitochondrial genomes differ from the nuclear genome and in humans it is known that mitochondrial variants contribute to genetic disorders. Prior to genomics, some livestock studies assessed the role of the mitochondrial genome but these were limited and inconclusive. Modern genome sequencing provides an opportunity to re-evaluate the potential impact of mitochondrial variation on livestock traits. This study first evaluated the empirical accuracy of mitochondrial sequence imputation and then used real and imputed mitochondrial sequence genotypes to study the role of mitochondrial variants on milk production traits of dairy cattle. The empirical accuracy of imputation from Single Nucleotide Polymorphism (SNP) panels to mitochondrial sequence genotypes was assessed in 516 test animals of Holstein, Jersey and Red breeds using Beagle software and a sequence reference of 1883 animals. The overall accuracy estimated as the Pearson’s correlation squared (R2) between all imputed and real genotypes across all animals was 0.454. The low accuracy was attributed partly to the majority of variants having low minor allele frequency (MAF < 0.005) but also due to variants in the hypervariable D-loop region showing poor imputation accuracy. Beagle software provides an internal estimate of imputation accuracy (DR2), and 10 percent of the total 1927 imputed positions showed DR2 greater than 0.9 (N = 201). There were 151 sites with empirical R2 > 0.9 (of 954 variants segregating in the test animals) and 138 of these overlapped the sites with DR2 > 0.9. This suggests that the DR2 statistic is a reasonable proxy to select sites that are imputed with higher accuracy for downstream analyses. Accordingly, in the second part of the study mitochondrial sequence variants were imputed from real mitochondrial SNP panel genotypes of 9515 Australian Holstein, Jersey and Red dairy cattle. Then, using only sites with DR2 > 0.900 and real genotypes, we undertook a genome-wide association study (GWAS) for milk, fat and protein yields. The GWAS mitochondrial SNP effects were not significant. The accuracy of imputation of mitochondrial genotypes from the SNP panel to sequence was generally low. The Beagle DR2 statistic enabled selection of sites imputed with higher empirical accuracy. We recommend building larger reference populations with mitochondrial sequence to improve the accuracy of imputing less common variants and ensuring that SNP panels include common variants in the D-loop region.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"104 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142170438","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Genetic parameters and genotype-by-environment interaction estimates for growth and feed efficiency related traits in Chinook salmon, Oncorhynchus tshawytscha, reared under low and moderate flow regimes 低流量和中流量条件下饲养的大鳞鲑(Oncorhynchus tshawytscha)生长和饲料效率相关性状的遗传参数和基因型与环境相互作用估计值
IF 4.1 1区 农林科学 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE Pub Date : 2024-09-12 DOI: 10.1186/s12711-024-00929-z
Leteisha A. Prescott, Megan R. Scholtens, Seumas P. Walker, Shannon M. Clarke, Ken G. Dodds, Matthew R. Miller, Jayson M. Semmens, Chris G. Carter, Jane E. Symonds
A genotype-by-environment (G × E) interaction is defined as genotypes responding differently to different environments. In salmonids, G × E interactions can occur in different rearing conditions, including changes in salinity or temperature. However, water flow, an important variable that can influence metabolism, has yet to be considered for potential G × E interactions, although water flows differ across production stages. The salmonid industry is now manipulating flow in tanks to improve welfare and production performance, and expanding sea pen farming offshore, where flow dynamics are substantially greater. Therefore, there is a need to test whether G × E interactions occur under low and higher flow regimes to determine if industry should consider modifying their performance evaluation and selection criteria to account for different flow environments. Here, we used genotype-by-sequencing to create a genomic-relationship matrix of 37 Chinook salmon, Oncorhynchus tshawytscha, families to assess possible G × E interactions for production performance under two flow environments: a low flow regime (0.3 body lengths per second; bl s−1) and a moderate flow regime (0.8 bl s−1). Genetic correlations for the same production performance trait between flow regimes suggest there is minimal evidence of a G × E interaction between the low and moderate flow regimes tested in this study, for Chinook salmon reared from 82.9 ± 16.8 g ( $${overline{text{x}}}$$ ± s.d.) to 583.2 ± 117.1 g ( $${overline{text{x}}}$$ ± s.d.). Estimates of genetic and phenotypic correlations between traits did not reveal any unfavorable trait correlations for size- (weight and condition factor) and growth-related traits, regardless of the flow regime, but did suggest measuring feed intake would be the preferred approach to improve feed efficiency because of the strong correlations between feed intake and feed efficiency, consistent with previous studies. This new information suggests that Chinook salmon families do not need to be selected separately for performance across different flow regimes. However, further studies are needed to confirm this across a wider range of fish sizes and flows. This information is key for breeding programs to determine if separate evaluation groups are required for different flow regimes that are used for production (e.g., hatchery, post smolt recirculating aquaculture system, or offshore).
基因型与环境的相互作用(G × E)是指基因型对不同环境的不同反应。在鲑科鱼类中,G × E 相互作用可能发生在不同的饲养条件下,包括盐度或温度的变化。然而,水流是影响新陈代谢的一个重要变量,虽然不同生产阶段的水流不同,但尚未考虑潜在的 G × E 相互作用。目前,鲑鱼养殖业正在控制水箱中的水流,以提高福利和生产性能,并在近海扩大海栏养殖,因为那里的水流动态更大。因此,有必要测试在低流量和高流量条件下是否会发生 G × E 相互作用,以确定该行业是否应考虑修改其性能评估和选择标准,以适应不同的流量环境。在此,我们使用基因型测序方法创建了 37 个大鳞大麻哈鱼(Oncorhynchus tshawytscha)家系的基因组关系矩阵,以评估在两种水流环境(低水流环境(0.3 体长/秒;bl s-1)和中水流环境(0.8 bl s-1))下生产性能可能存在的 G × E 相互作用。对于饲养体重从 82.9 ± 16.8 g($${overline{text{x}}$±s.d.)到 583.2 ± 117.1 g($${overline{text{x}}$±s.d.)的大鳞大麻哈鱼而言,不同水流条件下同一生产性能特征的遗传相关性表明,在本研究测试的低水流条件和中等水流条件下,G × E 相互作用的证据极少。)对性状间遗传和表型相关性的估计并未发现任何不利于体型(体重和体况因子)和生长相关性状的性状相关性,与水流制度无关,但由于采食量和饲料效率之间的强相关性,表明测量采食量将是提高饲料效率的首选方法,这与之前的研究一致。这一新信息表明,大鳞大麻哈鱼家族不需要在不同水流条件下分别进行性能选择。不过,还需要进一步研究,以便在更广泛的鱼体大小和水流范围内证实这一点。这些信息对育种计划至关重要,有助于确定是否需要对用于生产的不同水流条件(如孵化场、蜕皮后循环水产养殖系统或近海)进行单独的评估分组。
{"title":"Genetic parameters and genotype-by-environment interaction estimates for growth and feed efficiency related traits in Chinook salmon, Oncorhynchus tshawytscha, reared under low and moderate flow regimes","authors":"Leteisha A. Prescott, Megan R. Scholtens, Seumas P. Walker, Shannon M. Clarke, Ken G. Dodds, Matthew R. Miller, Jayson M. Semmens, Chris G. Carter, Jane E. Symonds","doi":"10.1186/s12711-024-00929-z","DOIUrl":"https://doi.org/10.1186/s12711-024-00929-z","url":null,"abstract":"A genotype-by-environment (G × E) interaction is defined as genotypes responding differently to different environments. In salmonids, G × E interactions can occur in different rearing conditions, including changes in salinity or temperature. However, water flow, an important variable that can influence metabolism, has yet to be considered for potential G × E interactions, although water flows differ across production stages. The salmonid industry is now manipulating flow in tanks to improve welfare and production performance, and expanding sea pen farming offshore, where flow dynamics are substantially greater. Therefore, there is a need to test whether G × E interactions occur under low and higher flow regimes to determine if industry should consider modifying their performance evaluation and selection criteria to account for different flow environments. Here, we used genotype-by-sequencing to create a genomic-relationship matrix of 37 Chinook salmon, Oncorhynchus tshawytscha, families to assess possible G × E interactions for production performance under two flow environments: a low flow regime (0.3 body lengths per second; bl s−1) and a moderate flow regime (0.8 bl s−1). Genetic correlations for the same production performance trait between flow regimes suggest there is minimal evidence of a G × E interaction between the low and moderate flow regimes tested in this study, for Chinook salmon reared from 82.9 ± 16.8 g ( $${overline{text{x}}}$$ ± s.d.) to 583.2 ± 117.1 g ( $${overline{text{x}}}$$ ± s.d.). Estimates of genetic and phenotypic correlations between traits did not reveal any unfavorable trait correlations for size- (weight and condition factor) and growth-related traits, regardless of the flow regime, but did suggest measuring feed intake would be the preferred approach to improve feed efficiency because of the strong correlations between feed intake and feed efficiency, consistent with previous studies. This new information suggests that Chinook salmon families do not need to be selected separately for performance across different flow regimes. However, further studies are needed to confirm this across a wider range of fish sizes and flows. This information is key for breeding programs to determine if separate evaluation groups are required for different flow regimes that are used for production (e.g., hatchery, post smolt recirculating aquaculture system, or offshore).","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"10 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142174671","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Segregation GWAS to linearize a non-additive locus with incomplete penetrance: an example of horn status in sheep 通过分离 GWAS 对具有不完全渗透性的非加性基因座进行线性化:以绵羊的角状况为例
IF 4.1 1区 农林科学 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE Pub Date : 2024-09-03 DOI: 10.1186/s12711-024-00928-0
Naomi Duijvesteijn, Julius H. J. van der Werf, Brian P. Kinghorn
The objective of this study was to introduce a genome-wide association study (GWAS) in conjunction with segregation analysis on monogenic categorical traits. Genotype probabilities calculated from phenotypes, mode of inheritance and pedigree information, are expressed as the expected allele count (EAC) (range 0 to 2), and are inherited additively, by definition, unlike the original phenotypes, which are non-additive and could be of incomplete penetrance. The EAC are regressed on the single nucleotide polymorphism (SNP) genotypes, similar to an additive GWAS. In this study, horn phenotypes in Merino sheep are used to illustrate the advantages of using the segregation GWAS, a trait believed to be monogenic, affected by dominance, sex-dependent expression and likely affected by incomplete penetrance. We also used simulation to investigate whether incomplete penetrance can cause prediction errors in Merino sheep for horn status. Estimated penetrance values differed between the sexes, where males showed almost complete penetrance, especially for horned and polled phenotypes, while females had low penetrance values for the horned status. This suggests that females homozygous for the ‘horned allele’ have a horned phenotype in only 22% of the cases while 78% will be knobbed or have scurs. The GWAS using EAC on 4001 animals and 510,174 SNP genotypes from the Illumina Ovine high-density (600k) chip gave a stronger association compared to using actual phenotypes. The correlation between the EAC and the allele count of the SNP with the highest –log10(p-value) was 0.73 in males and 0.67 in females. Simulations using penetrance values found by the segregation analyses resulted in higher correlations between the EAC and the causative mutation (0.95 for males and 0.89 for females, respectively), suggesting that the most predictive SNP is not in full LD with the causative mutation. Our results show clear differences in penetrance values between males and female Merino sheep for horn status. Segregation analysis for a trait with mutually exclusive phenotypes, non-additive inheritance, and/or incomplete penetrance can lead to considerably more power in a GWAS because the linearized genotype probabilities are additive and can accommodate incomplete penetrance. This method can be extended to any monogenic controlled categorical trait of which the phenotypes are mutually exclusive.
本研究的目的是将全基因组关联研究(GWAS)与单基因分类性状的分离分析相结合。根据表型、遗传方式和血统信息计算出的基因型概率用预期等位基因数(EAC)表示(范围在 0 到 2 之间),根据定义,EAC 是加性遗传,这与原始表型不同,原始表型是非加性遗传,可能具有不完全渗透性。EAC对单核苷酸多态性(SNP)基因型进行回归,类似于加性 GWAS。在本研究中,我们利用美利奴羊的角表型来说明使用分离 GWAS 的优势,这种性状被认为是单基因性的,受显性遗传的影响,其表达依赖于性别,并可能受不完全渗透性的影响。我们还利用模拟研究了不完全渗透是否会导致美利奴羊角状况的预测错误。不同性别之间的估计穿透力值存在差异,雄性的穿透力几乎是完全的,尤其是在有角和有花粉的表型上,而雌性在有角的表型上穿透力值较低。这表明,等位基因 "有角 "的雌性只有 22% 的情况下会出现有角的表型,而 78% 的情况下会出现有节或有鳞。与使用实际表型相比,使用 4001 只动物的 EAC 和来自 Illumina Ovine 高密度(600k)芯片的 510,174 个 SNP 基因型进行的 GWAS 发现了更强的关联性。EAC与-log10(p值)最高的SNP等位基因数之间的相关性在雄性动物中为0.73,在雌性动物中为0.67。使用分离分析发现的渗透率值进行模拟,EAC 与致病突变之间的相关性更高(男性分别为 0.95,女性为 0.89),这表明最具预测性的 SNP 与致病突变不存在完全 LD。我们的研究结果表明,雄性美利奴羊和雌性美利奴羊的角状况渗透值存在明显差异。对具有互斥表型、非加性遗传和/或不完全穿透性的性状进行分离分析,可以大大提高 GWAS 的效率,因为线性化的基因型概率是加性的,而且可以适应不完全穿透性。这种方法可扩展到表型相互排斥的任何单基因受控分类性状。
{"title":"Segregation GWAS to linearize a non-additive locus with incomplete penetrance: an example of horn status in sheep","authors":"Naomi Duijvesteijn, Julius H. J. van der Werf, Brian P. Kinghorn","doi":"10.1186/s12711-024-00928-0","DOIUrl":"https://doi.org/10.1186/s12711-024-00928-0","url":null,"abstract":"The objective of this study was to introduce a genome-wide association study (GWAS) in conjunction with segregation analysis on monogenic categorical traits. Genotype probabilities calculated from phenotypes, mode of inheritance and pedigree information, are expressed as the expected allele count (EAC) (range 0 to 2), and are inherited additively, by definition, unlike the original phenotypes, which are non-additive and could be of incomplete penetrance. The EAC are regressed on the single nucleotide polymorphism (SNP) genotypes, similar to an additive GWAS. In this study, horn phenotypes in Merino sheep are used to illustrate the advantages of using the segregation GWAS, a trait believed to be monogenic, affected by dominance, sex-dependent expression and likely affected by incomplete penetrance. We also used simulation to investigate whether incomplete penetrance can cause prediction errors in Merino sheep for horn status. Estimated penetrance values differed between the sexes, where males showed almost complete penetrance, especially for horned and polled phenotypes, while females had low penetrance values for the horned status. This suggests that females homozygous for the ‘horned allele’ have a horned phenotype in only 22% of the cases while 78% will be knobbed or have scurs. The GWAS using EAC on 4001 animals and 510,174 SNP genotypes from the Illumina Ovine high-density (600k) chip gave a stronger association compared to using actual phenotypes. The correlation between the EAC and the allele count of the SNP with the highest –log10(p-value) was 0.73 in males and 0.67 in females. Simulations using penetrance values found by the segregation analyses resulted in higher correlations between the EAC and the causative mutation (0.95 for males and 0.89 for females, respectively), suggesting that the most predictive SNP is not in full LD with the causative mutation. Our results show clear differences in penetrance values between males and female Merino sheep for horn status. Segregation analysis for a trait with mutually exclusive phenotypes, non-additive inheritance, and/or incomplete penetrance can lead to considerably more power in a GWAS because the linearized genotype probabilities are additive and can accommodate incomplete penetrance. This method can be extended to any monogenic controlled categorical trait of which the phenotypes are mutually exclusive.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"6 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142123713","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Population structure and breed identification of Chinese indigenous sheep breeds using whole genome SNPs and InDels 利用全基因组SNPs和InDels鉴定中国土种羊的种群结构和品种
IF 4.1 1区 农林科学 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE Pub Date : 2024-09-03 DOI: 10.1186/s12711-024-00927-1
Chang-heng Zhao, Dan Wang, Cheng Yang, Yan Chen, Jun Teng, Xin-yi Zhang, Zhi Cao, Xian-ming Wei, Chao Ning, Qi-en Yang, Wen-fa Lv, Qin Zhang
Accurate breed identification is essential for the conservation and sustainable use of indigenous farm animal genetic resources. In this study, we evaluated the phylogenetic relationships and genomic breed compositions of 13 sheep breeds using SNP and InDel data from whole genome sequencing. The breeds included 11 Chinese indigenous and 2 foreign commercial breeds. We compared different strategies for breed identification with respect to different marker types, i.e. SNPs, InDels, and a combination of SNPs and InDels (named SIs), different breed-informative marker detection methods, and different machine learning classification methods. Using WGS-based SNPs and InDels, we revealed the phylogenetic relationships between 11 Chinese indigenous and two foreign sheep breeds and quantified their purities through estimated genomic breed compositions. We found that the optimal strategy for identifying these breeds was the combination of DFI_union for breed-informative marker detection, which integrated the methods of Delta, Pairwise Wright's FST, and Informativeness for Assignment (namely DFI) by merging the breed-informative markers derived from the three methods, and KSR for breed assignment, which integrated the methods of K-Nearest Neighbor, Support Vector Machine, and Random Forest (namely KSR) by intersecting their results. Using SI markers improved the identification accuracy compared to using SNPs or InDels alone. We achieved accuracies over 97.5% when using at least the 1000 most breed-informative (MBI) SI markers and even 100% when using 5000 SI markers. Our results provide not only an important foundation for conservation of these Chinese local sheep breeds, but also general approaches for breed identification of indigenous farm animal breeds.
准确的品种鉴定对于本土农场动物遗传资源的保护和可持续利用至关重要。在本研究中,我们利用全基因组测序的 SNP 和 InDel 数据评估了 13 个绵羊品种的系统发育关系和基因组品种组成。这些品种包括 11 个中国本土品种和 2 个国外商业品种。我们比较了不同标记类型(即SNPs、InDels以及SNPs和InDels的组合(命名为SIs))、不同品种信息标记检测方法以及不同机器学习分类方法的不同品种鉴定策略。利用基于 WGS 的 SNPs 和 InDels,我们揭示了 11 个中国本土绵羊品种和 2 个外国绵羊品种之间的系统发育关系,并通过估计的基因组品种组成量化了它们的纯度。我们发现,鉴定这些品种的最佳策略是将 DFI_union 与 KSR 结合起来,前者用于品种信息标记检测,通过合并三种方法得出的品种信息标记,整合了 Delta、配对赖特 FST 和分配信息度方法(即 DFI);后者用于品种分配,通过交叉它们的结果,整合了 K-近邻、支持向量机和随机森林方法(即 KSR)。与单独使用 SNP 或 InDels 相比,使用 SI 标记提高了鉴定准确率。当使用至少 1000 个最具品种信息(MBI)的 SI 标记时,我们的准确率超过了 97.5%,而当使用 5000 个 SI 标记时,准确率甚至达到了 100%。我们的研究结果不仅为这些中国地方绵羊品种的保护提供了重要依据,也为本土农畜品种的品种识别提供了一般方法。
{"title":"Population structure and breed identification of Chinese indigenous sheep breeds using whole genome SNPs and InDels","authors":"Chang-heng Zhao, Dan Wang, Cheng Yang, Yan Chen, Jun Teng, Xin-yi Zhang, Zhi Cao, Xian-ming Wei, Chao Ning, Qi-en Yang, Wen-fa Lv, Qin Zhang","doi":"10.1186/s12711-024-00927-1","DOIUrl":"https://doi.org/10.1186/s12711-024-00927-1","url":null,"abstract":"Accurate breed identification is essential for the conservation and sustainable use of indigenous farm animal genetic resources. In this study, we evaluated the phylogenetic relationships and genomic breed compositions of 13 sheep breeds using SNP and InDel data from whole genome sequencing. The breeds included 11 Chinese indigenous and 2 foreign commercial breeds. We compared different strategies for breed identification with respect to different marker types, i.e. SNPs, InDels, and a combination of SNPs and InDels (named SIs), different breed-informative marker detection methods, and different machine learning classification methods. Using WGS-based SNPs and InDels, we revealed the phylogenetic relationships between 11 Chinese indigenous and two foreign sheep breeds and quantified their purities through estimated genomic breed compositions. We found that the optimal strategy for identifying these breeds was the combination of DFI_union for breed-informative marker detection, which integrated the methods of Delta, Pairwise Wright's FST, and Informativeness for Assignment (namely DFI) by merging the breed-informative markers derived from the three methods, and KSR for breed assignment, which integrated the methods of K-Nearest Neighbor, Support Vector Machine, and Random Forest (namely KSR) by intersecting their results. Using SI markers improved the identification accuracy compared to using SNPs or InDels alone. We achieved accuracies over 97.5% when using at least the 1000 most breed-informative (MBI) SI markers and even 100% when using 5000 SI markers. Our results provide not only an important foundation for conservation of these Chinese local sheep breeds, but also general approaches for breed identification of indigenous farm animal breeds.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"25 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142123714","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Marker effect p-values for single-step GWAS with the algorithm for proven and young in large genotyped populations 在大量基因分型人群中使用成熟和年轻算法进行单步 GWAS 的标记效应 p 值
IF 4.1 1区 农林科学 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE Pub Date : 2024-08-22 DOI: 10.1186/s12711-024-00925-3
Natália Galoro Leite, Matias Bermann, Shogo Tsuruta, Ignacy Misztal, Daniela Lourenco
Single-nucleotide polymorphism (SNP) effects can be backsolved from ssGBLUP genomic estimated breeding values (GEBV) and used for genome-wide association studies (ssGWAS). However, obtaining p-values for those SNP effects relies on the inversion of dense matrices, which poses computational limitations in large genotyped populations. In this study, we present a method to approximate SNP p-values for ssGWAS with many genotyped animals. This method relies on the combination of a sparse approximation of the inverse of the genomic relationship matrix ( $${mathbf{G}}_{mathbf{A}mathbf{P}mathbf{Y}}^mathbf{-1}$$ ) built with the algorithm for proven and young ( $$text{APY}$$ ) and an approximation of the prediction error variance of SNP effects which does not require the inversion of the left-hand side (LHS) of the mixed model equations. To test the proposed p-value computing method, we used a reduced genotyped population of 50K genotyped animals and compared the approximated SNP p-values with benchmark p-values obtained with the direct inverse of LHS built with an exact genomic relationship matrix ( $${mathbf{G}}^mathbf{-1})$$ . Then, we applied the proposed approximation method to obtain SNP p-values for a larger genotyped population composed of 450K genotyped animals. The same genomic regions on chromosomes 7 and 20 were identified across all p-value computing methods when using 50K genotyped animals. In terms of computational requirements, obtaining p-values with the proposed approximation reduced the wall-clock time by 38 times and the memory requirement by ten times compared to using the exact inversion of the LHS. When the approximation was applied to a population of 450K genotyped animals, two new significant regions on chromosomes 6 and 14 were uncovered, indicating an increase in GWAS detection power when including more genotypes in the analyses. The process of obtaining p-values with the approximation and 450K genotyped individuals took 24.5 wall-clock hours and 87.66GB of memory, which is expected to increase linearly with the addition of noncore genotyped individuals. With the proposed method, obtaining p-values for SNP effects in ssGWAS is computationally feasible in large genotyped populations. The computational cost of obtaining p-values in ssGWAS may no longer be a limitation in extensive populations with many genotyped animals.
单核苷酸多态性(SNP)效应可以从 ssGBLUP 基因组估计育种值(GEBV)中反演算出来,并用于全基因组关联研究(ssGWAS)。然而,要获得这些 SNP 效应的 p 值,需要对密集矩阵进行反演,这给大型基因分型群体的计算带来了限制。在本研究中,我们提出了一种方法,用于近似许多基因分型动物的ssGWAS的SNP p值。该方法依赖于对基因组关系矩阵($${mathbf{G}}_{mathbf{A}mathbf{P}mathbf{Y}}^mathbf{-1}$$ )和 SNP 影响预测误差方差的近似值,后者不需要对混合模型方程的左手侧(LHS)进行反演。为了测试所提出的 p 值计算方法,我们使用了一个由 50K 只基因分型动物组成的缩小基因分型群体,并将近似 SNP p 值与使用精确基因组关系矩阵($${mathbf{G}}^mathbf{-1})建立的 LHS 直接反演得到的基准 p 值进行了比较。然后,我们应用所提出的近似方法获得了由 450K 个基因分型动物组成的更大基因分型群体的 SNP p 值。当使用 50K 只基因分型动物时,所有 p 值计算方法都能确定 7 号和 20 号染色体上的相同基因组区域。在计算要求方面,与使用 LHS 精确反转法相比,使用所提出的近似法获得 p 值的挂钟时间减少了 38 倍,内存需求减少了 10 倍。当把近似值应用于 450K 个基因分型的动物群体时,发现了 6 号和 14 号染色体上两个新的重要区域,这表明当分析中包含更多基因型时,GWAS 的检测能力会提高。利用近似方法和 450K 个基因分型个体获得 p 值的过程耗时 24.5 个壁钟小时,内存 87.66GB,预计随着非核心基因分型个体的增加,p 值将呈线性增长。采用所提出的方法,在ssGWAS中获取SNP效应的p值在大型基因分型群体中是可行的。在有许多基因分型动物的大种群中,在 ssGWAS 中获取 p 值的计算成本可能不再是一个限制因素。
{"title":"Marker effect p-values for single-step GWAS with the algorithm for proven and young in large genotyped populations","authors":"Natália Galoro Leite, Matias Bermann, Shogo Tsuruta, Ignacy Misztal, Daniela Lourenco","doi":"10.1186/s12711-024-00925-3","DOIUrl":"https://doi.org/10.1186/s12711-024-00925-3","url":null,"abstract":"Single-nucleotide polymorphism (SNP) effects can be backsolved from ssGBLUP genomic estimated breeding values (GEBV) and used for genome-wide association studies (ssGWAS). However, obtaining p-values for those SNP effects relies on the inversion of dense matrices, which poses computational limitations in large genotyped populations. In this study, we present a method to approximate SNP p-values for ssGWAS with many genotyped animals. This method relies on the combination of a sparse approximation of the inverse of the genomic relationship matrix ( $${mathbf{G}}_{mathbf{A}mathbf{P}mathbf{Y}}^mathbf{-1}$$ ) built with the algorithm for proven and young ( $$text{APY}$$ ) and an approximation of the prediction error variance of SNP effects which does not require the inversion of the left-hand side (LHS) of the mixed model equations. To test the proposed p-value computing method, we used a reduced genotyped population of 50K genotyped animals and compared the approximated SNP p-values with benchmark p-values obtained with the direct inverse of LHS built with an exact genomic relationship matrix ( $${mathbf{G}}^mathbf{-1})$$ . Then, we applied the proposed approximation method to obtain SNP p-values for a larger genotyped population composed of 450K genotyped animals. The same genomic regions on chromosomes 7 and 20 were identified across all p-value computing methods when using 50K genotyped animals. In terms of computational requirements, obtaining p-values with the proposed approximation reduced the wall-clock time by 38 times and the memory requirement by ten times compared to using the exact inversion of the LHS. When the approximation was applied to a population of 450K genotyped animals, two new significant regions on chromosomes 6 and 14 were uncovered, indicating an increase in GWAS detection power when including more genotypes in the analyses. The process of obtaining p-values with the approximation and 450K genotyped individuals took 24.5 wall-clock hours and 87.66GB of memory, which is expected to increase linearly with the addition of noncore genotyped individuals. With the proposed method, obtaining p-values for SNP effects in ssGWAS is computationally feasible in large genotyped populations. The computational cost of obtaining p-values in ssGWAS may no longer be a limitation in extensive populations with many genotyped animals.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"14 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142021889","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A computationally feasible multi-trait single-step genomic prediction model with trait-specific marker weights 具有特定性状标记权重、计算可行的多性状单步基因组预测模型
IF 4.1 1区 农林科学 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE Pub Date : 2024-08-16 DOI: 10.1186/s12711-024-00926-2
Ismo Strandén, Janez Jenko
Regions of genome-wide marker data may have differing influences on the evaluated traits. This can be reflected in the genomic models by assigning different weights to the markers, which can enhance the accuracy of genomic prediction. However, the standard multi-trait single-step genomic evaluation model can be computationally infeasible when the traits are allowed to have different marker weights. In this study, we developed and implemented a multi-trait single-step single nucleotide polymorphism best linear unbiased prediction (SNPBLUP) model for large genomic data evaluations that allows for the use of precomputed trait-specific marker weights. The modifications to the standard single-step SNPBLUP model were minor and did not significantly increase the preprocessing workload. The model was tested using simulated data and marker weights precomputed using BayesA. Based on the results, memory requirements and computing time per iteration slightly increased compared to the standard single-step model without weights. Moreover, convergence of the model was slower when using marker weights, which resulted in longer total computing time. The use of marker weights, however, improved prediction accuracy. We investigated a single-step SNPBLUP model that can be used to accommodate trait-specific marker weights. The marker-weighted single-step model improved prediction accuracy. The approach can be used for large genomic data evaluations using precomputed marker weights.
全基因组标记数据的区域可能对所评估的性状有不同的影响。这可以通过给标记分配不同权重反映在基因组模型中,从而提高基因组预测的准确性。然而,当允许性状具有不同的标记权重时,标准的多性状单步基因组评估模型在计算上可能是不可行的。在本研究中,我们开发并实施了一种用于大型基因组数据评估的多性状单步单核苷酸多态性最佳线性无偏预测(SNPBLUP)模型,该模型允许使用预先计算的性状特异性标记权重。对标准单步 SNPBLUP 模型的修改很小,不会显著增加预处理工作量。根据结果,与不带权重的标准单步模型相比,每次迭代所需的内存和计算时间略有增加。此外,使用标记权重时,模型的收敛速度较慢,导致总计算时间延长。不过,使用标记权重提高了预测准确率。我们研究了一种可用于容纳特异性标记权重的单步 SNPBLUP 模型。标记加权单步模型提高了预测准确性。这种方法可用于使用预计算标记权重的大型基因组数据评估。
{"title":"A computationally feasible multi-trait single-step genomic prediction model with trait-specific marker weights","authors":"Ismo Strandén, Janez Jenko","doi":"10.1186/s12711-024-00926-2","DOIUrl":"https://doi.org/10.1186/s12711-024-00926-2","url":null,"abstract":"Regions of genome-wide marker data may have differing influences on the evaluated traits. This can be reflected in the genomic models by assigning different weights to the markers, which can enhance the accuracy of genomic prediction. However, the standard multi-trait single-step genomic evaluation model can be computationally infeasible when the traits are allowed to have different marker weights. In this study, we developed and implemented a multi-trait single-step single nucleotide polymorphism best linear unbiased prediction (SNPBLUP) model for large genomic data evaluations that allows for the use of precomputed trait-specific marker weights. The modifications to the standard single-step SNPBLUP model were minor and did not significantly increase the preprocessing workload. The model was tested using simulated data and marker weights precomputed using BayesA. Based on the results, memory requirements and computing time per iteration slightly increased compared to the standard single-step model without weights. Moreover, convergence of the model was slower when using marker weights, which resulted in longer total computing time. The use of marker weights, however, improved prediction accuracy. We investigated a single-step SNPBLUP model that can be used to accommodate trait-specific marker weights. The marker-weighted single-step model improved prediction accuracy. The approach can be used for large genomic data evaluations using precomputed marker weights.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"19 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141991929","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Analysis of the genetic variance of fibre diameter measured along the wool staple for use as a potential indicator of resilience in sheep 分析沿羊毛短纤维测量的纤维直径的遗传变异,以作为绵羊抗逆性的潜在指标
IF 4.1 1区 农林科学 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE Pub Date : 2024-08-06 DOI: 10.1186/s12711-024-00924-4
Erin G. Smith, Dominic L. Waters, Samuel F. Walkom, Sam A. Clark
The effects of environmental disturbances on livestock are often observed indirectly through the variability patterns of repeated performance records over time. Sheep are frequently exposed to diverse extensive environments but currently lack appropriate measures of resilience (or sensitivity) towards environmental disturbance. In this study, random regression models were used to analyse repeated records of the fibre diameter of wool taken along the wool staple (bundle of wool fibres) to investigate how the genetic and environmental variance of fibre diameter changes with different growing environments. A model containing a fifth, fourth and second-order Legendre polynomial applied to the fixed, additive and permanent environmental effects, respectively, was optimal for modelling fibre diameter along the wool staple. The additive genetic and permanent environmental variance both showed variability across the staple length trajectory. The ranking of sire estimated breeding values (EBV) for fibre diameter was shown to change along the staple and the genetic correlations decreased as the distance between measurements along the staple increased. This result suggests that some genotypes were potentially more resilient towards the changes in the growing environment compared to others. In addition, the eigenfunctions of the random regression model implied the ability to change the fibre diameter trajectory to reduce its variability along the wool staple. These results show that genetic variation in fibre diameter measured along the wool staple exists and this could be used to provide greater insight into the ability to select for resilience in extensively raised sheep populations.
环境干扰对家畜的影响通常是通过长期重复性能记录的变化模式间接观察到的。绵羊经常暴露在多种多样的广阔环境中,但目前缺乏适当的方法来衡量其对环境干扰的适应性(或敏感性)。本研究采用随机回归模型分析沿羊毛短纤维(羊毛纤维束)采集的羊毛纤维直径的重复记录,以研究纤维直径的遗传和环境变异如何随不同的生长环境而变化。一个包含五阶、四阶和二阶 Legendre 多项式的模型分别适用于固定效应、加法效应和永久环境效应,是沿羊毛短纤维建立纤维直径模型的最佳选择。附加遗传变异和永久环境变异都显示了整个短绒长度轨迹的可变性。纤维直径的母系估计育种值(EBV)的排序随着短绒长度的变化而变化,遗传相关性随着短绒长度测量间距的增加而降低。这一结果表明,与其他基因型相比,某些基因型对生长环境变化的适应能力更强。此外,随机回归模型的特征函数暗示了改变纤维直径轨迹以减少其沿羊毛短纤变化的能力。这些结果表明,沿羊毛主干测量纤维直径的遗传变异是存在的,这可用于更深入地了解在广泛饲养的绵羊种群中选择抗逆性的能力。
{"title":"Analysis of the genetic variance of fibre diameter measured along the wool staple for use as a potential indicator of resilience in sheep","authors":"Erin G. Smith, Dominic L. Waters, Samuel F. Walkom, Sam A. Clark","doi":"10.1186/s12711-024-00924-4","DOIUrl":"https://doi.org/10.1186/s12711-024-00924-4","url":null,"abstract":"The effects of environmental disturbances on livestock are often observed indirectly through the variability patterns of repeated performance records over time. Sheep are frequently exposed to diverse extensive environments but currently lack appropriate measures of resilience (or sensitivity) towards environmental disturbance. In this study, random regression models were used to analyse repeated records of the fibre diameter of wool taken along the wool staple (bundle of wool fibres) to investigate how the genetic and environmental variance of fibre diameter changes with different growing environments. A model containing a fifth, fourth and second-order Legendre polynomial applied to the fixed, additive and permanent environmental effects, respectively, was optimal for modelling fibre diameter along the wool staple. The additive genetic and permanent environmental variance both showed variability across the staple length trajectory. The ranking of sire estimated breeding values (EBV) for fibre diameter was shown to change along the staple and the genetic correlations decreased as the distance between measurements along the staple increased. This result suggests that some genotypes were potentially more resilient towards the changes in the growing environment compared to others. In addition, the eigenfunctions of the random regression model implied the ability to change the fibre diameter trajectory to reduce its variability along the wool staple. These results show that genetic variation in fibre diameter measured along the wool staple exists and this could be used to provide greater insight into the ability to select for resilience in extensively raised sheep populations.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"55 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141895460","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Genetic diversity of United States Rambouillet, Katahdin and Dorper sheep 美国兰布依莱羊、卡塔丁羊和多尔帕羊的遗传多样性
IF 4.1 1区 农林科学 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE Pub Date : 2024-07-30 DOI: 10.1186/s12711-024-00905-7
Gabrielle M. Becker, Jacob W. Thorne, Joan M. Burke, Ronald M. Lewis, David R. Notter, James L. M. Morgan, Christopher S. Schauer, Whit C. Stewart, R. R. Redden, Brenda M. Murdoch
Managing genetic diversity is critically important for maintaining species fitness. Excessive homozygosity caused by the loss of genetic diversity can have detrimental effects on the reproduction and production performance of a breed. Analysis of genetic diversity can facilitate the identification of signatures of selection which may contribute to the specific characteristics regarding the health, production and physical appearance of a breed or population. In this study, breeds with well-characterized traits such as fine wool production (Rambouillet, N = 745), parasite resistance (Katahdin, N = 581) and environmental hardiness (Dorper, N = 265) were evaluated for inbreeding, effective population size (Ne), runs of homozygosity (ROH) and Wright’s fixation index (FST) outlier approach to identify differential signatures of selection at 36,113 autosomal single nucleotide polymorphisms (SNPs). Katahdin sheep had the largest current Ne at the most recent generation estimated with both the GONe and NeEstimator software. The most highly conserved ROH Island was identified in Rambouillet with a signature of selection on chromosome 6 containing 202 SNPs called in an ROH in 50 to 94% of the individuals. This region contained the DCAF16, LCORL and NCAPG genes that have been previously reported to be under selection and have biological roles related to milk production and growth traits. The outlier regions identified through the FST comparisons of Katahdin with Rambouillet and Dorper contained genes with known roles in milk production and mastitis resistance or susceptibility, and the FST comparisons of Rambouillet with Katahdin and Dorper identified genes related to wool growth, suggesting these traits have been under natural or artificial selection pressure in these populations. Genes involved in the cytokine-cytokine receptor interaction pathways were identified in all FST breed comparisons, which indicates the presence of allelic diversity between these breeds in genomic regions controlling cytokine signaling mechanisms. In this paper, we describe signatures of selection within diverse and economically important U.S. sheep breeds. The genes contained within these signatures are proposed for further study to understand their relevance to biological traits and improve understanding of breed diversity.
管理遗传多样性对维持物种的健康至关重要。遗传多样性丧失导致的同源性过高会对品种的繁殖和生产性能产生不利影响。对遗传多样性的分析有助于确定选择的特征,这些特征可能会导致一个品种或种群在健康、生产和外貌方面具有特定的特征。在这项研究中,对细毛羊(Rambouillet,N = 745)、抗寄生虫羊(Katahdin,N = 581)和耐环境羊(Dorper,N = 265)等性状特征良好的品种进行了近亲繁殖、有效种群规模(Ne)、同源杂合度(ROH)和赖特固定指数(FST)离群值评估,以确定 36,113 个常染色体单核苷酸多态性(SNPs)上的选择差异特征。用 GONe 和 NeEstimator 软件估计,卡塔丁绵羊最近一代的当前 Ne 值最大。兰布依莱羊的 ROH 岛具有最高的保守性,其 6 号染色体上有一个选择特征,在 50% 到 94% 的个体中,有 202 个 SNPs 在 ROH 中被调用。该区域包含 DCAF16、LCORL 和 NCAPG 基因,这些基因以前曾报道过受到选择,并具有与产奶量和生长性状相关的生物学作用。通过对卡塔丁牛与兰布依莱牛和多尔帕牛的 FST 比较发现的离群区包含了在产奶量和乳腺炎抗性或易感性方面具有已知作用的基因,而对兰布依莱牛与卡塔丁牛和多尔帕牛的 FST 比较则发现了与羊毛生长有关的基因,这表明在这些种群中这些性状受到了自然或人工选择的压力。在所有 FST 品种比较中都发现了细胞因子-细胞因子受体相互作用途径中的基因,这表明这些品种之间在控制细胞因子信号转导机制的基因组区域中存在等位基因多样性。在本文中,我们描述了美国具有重要经济价值的不同绵羊品种的选择特征。我们建议对这些特征中包含的基因进行进一步研究,以了解它们与生物性状的相关性,并加深对品种多样性的理解。
{"title":"Genetic diversity of United States Rambouillet, Katahdin and Dorper sheep","authors":"Gabrielle M. Becker, Jacob W. Thorne, Joan M. Burke, Ronald M. Lewis, David R. Notter, James L. M. Morgan, Christopher S. Schauer, Whit C. Stewart, R. R. Redden, Brenda M. Murdoch","doi":"10.1186/s12711-024-00905-7","DOIUrl":"https://doi.org/10.1186/s12711-024-00905-7","url":null,"abstract":"Managing genetic diversity is critically important for maintaining species fitness. Excessive homozygosity caused by the loss of genetic diversity can have detrimental effects on the reproduction and production performance of a breed. Analysis of genetic diversity can facilitate the identification of signatures of selection which may contribute to the specific characteristics regarding the health, production and physical appearance of a breed or population. In this study, breeds with well-characterized traits such as fine wool production (Rambouillet, N = 745), parasite resistance (Katahdin, N = 581) and environmental hardiness (Dorper, N = 265) were evaluated for inbreeding, effective population size (Ne), runs of homozygosity (ROH) and Wright’s fixation index (FST) outlier approach to identify differential signatures of selection at 36,113 autosomal single nucleotide polymorphisms (SNPs). Katahdin sheep had the largest current Ne at the most recent generation estimated with both the GONe and NeEstimator software. The most highly conserved ROH Island was identified in Rambouillet with a signature of selection on chromosome 6 containing 202 SNPs called in an ROH in 50 to 94% of the individuals. This region contained the DCAF16, LCORL and NCAPG genes that have been previously reported to be under selection and have biological roles related to milk production and growth traits. The outlier regions identified through the FST comparisons of Katahdin with Rambouillet and Dorper contained genes with known roles in milk production and mastitis resistance or susceptibility, and the FST comparisons of Rambouillet with Katahdin and Dorper identified genes related to wool growth, suggesting these traits have been under natural or artificial selection pressure in these populations. Genes involved in the cytokine-cytokine receptor interaction pathways were identified in all FST breed comparisons, which indicates the presence of allelic diversity between these breeds in genomic regions controlling cytokine signaling mechanisms. In this paper, we describe signatures of selection within diverse and economically important U.S. sheep breeds. The genes contained within these signatures are proposed for further study to understand their relevance to biological traits and improve understanding of breed diversity.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"10 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141794607","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Investigating the footprint of post-domestication dispersal on the diversity of modern European, African and Asian goats 调查驯化后扩散对现代欧洲、非洲和亚洲山羊多样性的影响
IF 4.1 1区 农林科学 Q1 AGRICULTURE, DAIRY & ANIMAL SCIENCE Pub Date : 2024-07-27 DOI: 10.1186/s12711-024-00923-5
Elena Petretto, Maria Luisa Dettori, María Gracia Luigi-Sierra, Antonia Noce, Michele Pazzola, Giuseppe Massimo Vacca, Antonio Molina, Amparo Martínez, Félix Goyache, Sean Carolan, Marcel Amills
Goats were domesticated in the Fertile Crescent about 10,000 years before present (YBP) and subsequently spread across Eurasia and Africa. This dispersal is expected to generate a gradient of declining genetic diversity with increasing distance from the areas of early livestock management. Previous studies have reported the existence of such genetic cline in European goat populations, but they were based on a limited number of microsatellite markers. Here, we have analyzed data generated by the AdaptMap project and other studies. More specifically, we have used the geographic coordinates and estimates of the observed (Ho) and expected (He) heterozygosities of 1077 European, 1187 African and 617 Asian goats belonging to 38, 43 and 22 different breeds, respectively, to find out whether genetic diversity and distance to Ganj Dareh, a Neolithic settlement in western Iran for which evidence of an early management of domestic goats has been obtained, are significantly correlated. Principal component and ADMIXTURE analyses revealed an incomplete regional differentiation of European breeds, but two genetic clusters representing Northern Europe and the British-Irish Isles were remarkably differentiated from the remaining European populations. In African breeds, we observed five main clusters: (1) North Africa, (2) West Africa, (3) East Africa, (4) South Africa, and (5) Madagascar. Regarding Asian breeds, three well differentiated West Asian, South Asian and East Asian groups were observed. For European and Asian goats, no strong evidence of significant correlations between Ho and He and distance to Ganj Dareh was found. In contrast, in African breeds we detected a significant gradient of diversity, which decreased with distance to Ganj Dareh. The detection of a genetic cline associated with distance to the Ganj Dareh in African but not in European or Asian goat breeds might reflect differences in the post-domestication dispersal process and subsequent migratory movements associated with the management of caprine populations from these three continents.
山羊是在距今约 10,000 年(YBP)前的新月沃地被驯化的,随后在欧亚大陆和非洲传播。随着与早期牲畜管理地区的距离越来越远,这种扩散预计会产生遗传多样性下降的梯度。以前的研究曾报道过欧洲山羊种群中存在这种遗传梯度,但这些研究都是基于数量有限的微卫星标记。在这里,我们分析了 AdaptMap 项目和其他研究产生的数据。更具体地说,我们使用了 1077 只欧洲山羊、1187 只非洲山羊和 617 只亚洲山羊(分别属于 38、43 和 22 个不同品种)的地理坐标以及观察到的杂合度(Ho)和预期杂合度(He)的估计值,以找出遗传多样性是否与与伊朗西部新石器时代定居点 Ganj Dareh 的距离显著相关。主成分和 ADMIXTURE 分析表明,欧洲品种的地区分化并不完全,但代表北欧和英爱群岛的两个基因群与其他欧洲种群有明显的分化。在非洲犬种中,我们观察到五个主要群落:(1) 北非,(2) 西非,(3) 东非,(4) 南非和 (5) 马达加斯加。在亚洲品种方面,我们观察到了西亚、南亚和东亚三个差异明显的群体。在欧洲和亚洲山羊中,Ho 和 He 与 Ganj Dareh 的距离之间没有发现显著相关的有力证据。相反,在非洲品种中,我们发现了明显的多样性梯度,这种梯度随着与甘杰达雷的距离而降低。在非洲而非欧洲或亚洲的山羊品种中发现了与甘杰达雷距离相关的遗传系,这可能反映了这三大洲山羊种群在驯化后的扩散过程和随后的迁徙过程中存在的差异。
{"title":"Investigating the footprint of post-domestication dispersal on the diversity of modern European, African and Asian goats","authors":"Elena Petretto, Maria Luisa Dettori, María Gracia Luigi-Sierra, Antonia Noce, Michele Pazzola, Giuseppe Massimo Vacca, Antonio Molina, Amparo Martínez, Félix Goyache, Sean Carolan, Marcel Amills","doi":"10.1186/s12711-024-00923-5","DOIUrl":"https://doi.org/10.1186/s12711-024-00923-5","url":null,"abstract":"Goats were domesticated in the Fertile Crescent about 10,000 years before present (YBP) and subsequently spread across Eurasia and Africa. This dispersal is expected to generate a gradient of declining genetic diversity with increasing distance from the areas of early livestock management. Previous studies have reported the existence of such genetic cline in European goat populations, but they were based on a limited number of microsatellite markers. Here, we have analyzed data generated by the AdaptMap project and other studies. More specifically, we have used the geographic coordinates and estimates of the observed (Ho) and expected (He) heterozygosities of 1077 European, 1187 African and 617 Asian goats belonging to 38, 43 and 22 different breeds, respectively, to find out whether genetic diversity and distance to Ganj Dareh, a Neolithic settlement in western Iran for which evidence of an early management of domestic goats has been obtained, are significantly correlated. Principal component and ADMIXTURE analyses revealed an incomplete regional differentiation of European breeds, but two genetic clusters representing Northern Europe and the British-Irish Isles were remarkably differentiated from the remaining European populations. In African breeds, we observed five main clusters: (1) North Africa, (2) West Africa, (3) East Africa, (4) South Africa, and (5) Madagascar. Regarding Asian breeds, three well differentiated West Asian, South Asian and East Asian groups were observed. For European and Asian goats, no strong evidence of significant correlations between Ho and He and distance to Ganj Dareh was found. In contrast, in African breeds we detected a significant gradient of diversity, which decreased with distance to Ganj Dareh. The detection of a genetic cline associated with distance to the Ganj Dareh in African but not in European or Asian goat breeds might reflect differences in the post-domestication dispersal process and subsequent migratory movements associated with the management of caprine populations from these three continents.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"106 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141768487","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Genetics Selection Evolution
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1