Pub Date : 2024-04-02DOI: 10.1186/s12711-024-00880-z
Meilin Jin, Huihua Wang, Gang Liu, Jian Lu, Zehu Yuan, Taotao Li, Engming Liu, Zengkui Lu, Lixin Du, Caihong Wei
Chinese indigenous sheep are valuable resources with unique features and characteristics. They are distributed across regions with different climates in mainland China; however, few reports have analyzed the environmental adaptability of sheep based on their genome. We examined the variants and signatures of selection involved in adaptation to extreme humidity, altitude, and temperature conditions in 173 sheep genomes from 41 phenotypically and geographically representative Chinese indigenous sheep breeds to characterize the genetic basis underlying environmental adaptation in these populations. Based on the analysis of population structure, we inferred that Chinese indigenous sheep are divided into four groups: Kazakh (KAZ), Mongolian (MON), Tibetan (TIB), and Yunnan (YUN). We also detected a set of candidate genes that are relevant to adaptation to extreme environmental conditions, such as drought-prone regions (TBXT, TG, and HOXA1), high-altitude regions (DYSF, EPAS1, JAZF1, PDGFD, and NF1) and warm-temperature regions (TSHR, ABCD4, and TEX11). Among all these candidate genes, eight ABCD4, CNTN4, DOCK10, LOC105608545, LOC121816479, SEM3A, SVIL, and TSHR overlap between extreme environmental conditions. The TSHR gene shows a strong signature for positive selection in the warm-temperature group and harbors a single nucleotide polymorphism (SNP) missense mutation located between positions 90,600,001 and 90,650,001 on chromosome 7, which leads to a change in the protein structure of TSHR and influences its stability. Analysis of the signatures of selection uncovered genes that are likely related to environmental adaptation and a SNP missense mutation in the TSHR gene that affects the protein structure and stability. It also provides information on the evolution of the phylogeographic structure of Chinese indigenous sheep populations. These results provide important genetic resources for future breeding studies and new perspectives on how animals can adapt to climate change.
{"title":"Whole-genome resequencing of Chinese indigenous sheep provides insight into the genetic basis underlying climate adaptation","authors":"Meilin Jin, Huihua Wang, Gang Liu, Jian Lu, Zehu Yuan, Taotao Li, Engming Liu, Zengkui Lu, Lixin Du, Caihong Wei","doi":"10.1186/s12711-024-00880-z","DOIUrl":"https://doi.org/10.1186/s12711-024-00880-z","url":null,"abstract":"Chinese indigenous sheep are valuable resources with unique features and characteristics. They are distributed across regions with different climates in mainland China; however, few reports have analyzed the environmental adaptability of sheep based on their genome. We examined the variants and signatures of selection involved in adaptation to extreme humidity, altitude, and temperature conditions in 173 sheep genomes from 41 phenotypically and geographically representative Chinese indigenous sheep breeds to characterize the genetic basis underlying environmental adaptation in these populations. Based on the analysis of population structure, we inferred that Chinese indigenous sheep are divided into four groups: Kazakh (KAZ), Mongolian (MON), Tibetan (TIB), and Yunnan (YUN). We also detected a set of candidate genes that are relevant to adaptation to extreme environmental conditions, such as drought-prone regions (TBXT, TG, and HOXA1), high-altitude regions (DYSF, EPAS1, JAZF1, PDGFD, and NF1) and warm-temperature regions (TSHR, ABCD4, and TEX11). Among all these candidate genes, eight ABCD4, CNTN4, DOCK10, LOC105608545, LOC121816479, SEM3A, SVIL, and TSHR overlap between extreme environmental conditions. The TSHR gene shows a strong signature for positive selection in the warm-temperature group and harbors a single nucleotide polymorphism (SNP) missense mutation located between positions 90,600,001 and 90,650,001 on chromosome 7, which leads to a change in the protein structure of TSHR and influences its stability. Analysis of the signatures of selection uncovered genes that are likely related to environmental adaptation and a SNP missense mutation in the TSHR gene that affects the protein structure and stability. It also provides information on the evolution of the phylogeographic structure of Chinese indigenous sheep populations. These results provide important genetic resources for future breeding studies and new perspectives on how animals can adapt to climate change.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"26 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140534486","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-03-29DOI: 10.1186/s12711-024-00889-4
Aurélie Vinet, Sophie Mattalia, Roxane Vallée, Christine Bertrand, Anne Barbat, Julie Promp, Beatriz C. D. Cuyabano, Didier Boichard
In the current context of climate change, livestock production faces many challenges to improve the sustainability of systems. Dairy farming, in particular, must find ways to select animals that will be able to achieve sufficient overall production while maintaining their reproductive ability in environments with increasing temperatures. With future forecasted climate conditions in mind, this study used data from Holstein and Montbeliarde dairy cattle to: (1) estimate the genetic-by-temperature-humidity index (THI) interactions for female fertility, and (2) evaluate the production-fertility trade-off with increasing values of THI. Two-trait random regression models were fitted for conception rate (fertility) and test-day protein yield (production). For fertility, genetic correlations between different THI values were generally above 0.75, suggesting weak genotype-by-THI interactions for conception rate in both breeds. However, the genetic correlations between the conception rate breeding values at the current average THI (THI = 50, corresponding to a 24-h average temperature of 8 °C at 50% relative humidity) and their slopes (i.e., potential reranking) for heat stress scenarios (THI > 70), were different for each breed. For Montbeliarde, this correlation tended to be positive (i.e., overall the best reproducers are less affected by heat stress), whereas for Holstein it was approximately zero. Finally, our results indicated a weak antagonism between production and fertility, although for Montbeliarde this antagonism intensified with increasing THI. Within the range of weather conditions studied, increasing temperatures are not expected to exacerbate the fertility-production trade-off. However, our results indicated that the animals with the best breeding values for production today will be the most affected by temperature increases, both in terms of fertility and production. Nonetheless, these animals should remain among the most productive ones during heat waves. For Montbeliarde, the current selection program for fertility seems to be adequate for ensuring the adaptation of fertility traits to temperature increases, without adverse effects on production. Such a conclusion cannot be drawn for Holstein. In the future, the incorporation of a heat tolerance index into dairy cattle breeding programs would be valuable to promote the selection of animals adapted to future climate conditions.
{"title":"Effect of temperature-humidity index on the evolution of trade-offs between fertility and production in dairy cattle","authors":"Aurélie Vinet, Sophie Mattalia, Roxane Vallée, Christine Bertrand, Anne Barbat, Julie Promp, Beatriz C. D. Cuyabano, Didier Boichard","doi":"10.1186/s12711-024-00889-4","DOIUrl":"https://doi.org/10.1186/s12711-024-00889-4","url":null,"abstract":"In the current context of climate change, livestock production faces many challenges to improve the sustainability of systems. Dairy farming, in particular, must find ways to select animals that will be able to achieve sufficient overall production while maintaining their reproductive ability in environments with increasing temperatures. With future forecasted climate conditions in mind, this study used data from Holstein and Montbeliarde dairy cattle to: (1) estimate the genetic-by-temperature-humidity index (THI) interactions for female fertility, and (2) evaluate the production-fertility trade-off with increasing values of THI. Two-trait random regression models were fitted for conception rate (fertility) and test-day protein yield (production). For fertility, genetic correlations between different THI values were generally above 0.75, suggesting weak genotype-by-THI interactions for conception rate in both breeds. However, the genetic correlations between the conception rate breeding values at the current average THI (THI = 50, corresponding to a 24-h average temperature of 8 °C at 50% relative humidity) and their slopes (i.e., potential reranking) for heat stress scenarios (THI > 70), were different for each breed. For Montbeliarde, this correlation tended to be positive (i.e., overall the best reproducers are less affected by heat stress), whereas for Holstein it was approximately zero. Finally, our results indicated a weak antagonism between production and fertility, although for Montbeliarde this antagonism intensified with increasing THI. Within the range of weather conditions studied, increasing temperatures are not expected to exacerbate the fertility-production trade-off. However, our results indicated that the animals with the best breeding values for production today will be the most affected by temperature increases, both in terms of fertility and production. Nonetheless, these animals should remain among the most productive ones during heat waves. For Montbeliarde, the current selection program for fertility seems to be adequate for ensuring the adaptation of fertility traits to temperature increases, without adverse effects on production. Such a conclusion cannot be drawn for Holstein. In the future, the incorporation of a heat tolerance index into dairy cattle breeding programs would be valuable to promote the selection of animals adapted to future climate conditions.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"28 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-03-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140322057","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-03-28DOI: 10.1186/s12711-024-00890-x
Thomas J. Lopdell, Alexander J. Trevarton, Janelle Moody, Claire Prowse-Wilkins, Sarah Knowles, Kathryn Tiplady, Amanda J. Chamberlain, Michael E. Goddard, Richard J. Spelman, Klaus Lehnert, Russell G. Snell, Stephen R. Davis, Mathew D. Littlejohn
Bovine lactoferrin (Lf) is an iron absorbing whey protein with antibacterial, antiviral, and antifungal activity. Lactoferrin is economically valuable and has an extremely variable concentration in milk, partly driven by environmental influences such as milking frequency, involution, or mastitis. A significant genetic influence has also been previously observed to regulate lactoferrin content in milk. Here, we conducted genetic mapping of lactoferrin protein concentration in conjunction with RNA-seq, ChIP-seq, and ATAC-seq data to pinpoint candidate causative variants that regulate lactoferrin concentrations in milk. We identified a highly-significant lactoferrin protein quantitative trait locus (pQTL), as well as a cis lactotransferrin (LTF) expression QTL (cis-eQTL) mapping to the LTF locus. Using ChIP-seq and ATAC-seq datasets representing lactating mammary tissue samples, we also report a number of regions where the openness of chromatin is under genetic influence. Several of these also show highly significant QTL with genetic signatures similar to those highlighted through pQTL and eQTL analysis. By performing correlation analysis between these QTL, we revealed an ATAC-seq peak in the putative promotor region of LTF, that highlights a set of 115 high-frequency variants that are potentially responsible for these effects. One of the 115 variants (rs110000337), which maps within the ATAC-seq peak, was predicted to alter binding sites of transcription factors known to be involved in lactation-related pathways. Here, we report a regulatory haplotype of 115 variants with conspicuously large impacts on milk lactoferrin concentration. These findings could enable the selection of animals for high-producing specialist herds.
{"title":"A common regulatory haplotype doubles lactoferrin concentration in milk","authors":"Thomas J. Lopdell, Alexander J. Trevarton, Janelle Moody, Claire Prowse-Wilkins, Sarah Knowles, Kathryn Tiplady, Amanda J. Chamberlain, Michael E. Goddard, Richard J. Spelman, Klaus Lehnert, Russell G. Snell, Stephen R. Davis, Mathew D. Littlejohn","doi":"10.1186/s12711-024-00890-x","DOIUrl":"https://doi.org/10.1186/s12711-024-00890-x","url":null,"abstract":"Bovine lactoferrin (Lf) is an iron absorbing whey protein with antibacterial, antiviral, and antifungal activity. Lactoferrin is economically valuable and has an extremely variable concentration in milk, partly driven by environmental influences such as milking frequency, involution, or mastitis. A significant genetic influence has also been previously observed to regulate lactoferrin content in milk. Here, we conducted genetic mapping of lactoferrin protein concentration in conjunction with RNA-seq, ChIP-seq, and ATAC-seq data to pinpoint candidate causative variants that regulate lactoferrin concentrations in milk. We identified a highly-significant lactoferrin protein quantitative trait locus (pQTL), as well as a cis lactotransferrin (LTF) expression QTL (cis-eQTL) mapping to the LTF locus. Using ChIP-seq and ATAC-seq datasets representing lactating mammary tissue samples, we also report a number of regions where the openness of chromatin is under genetic influence. Several of these also show highly significant QTL with genetic signatures similar to those highlighted through pQTL and eQTL analysis. By performing correlation analysis between these QTL, we revealed an ATAC-seq peak in the putative promotor region of LTF, that highlights a set of 115 high-frequency variants that are potentially responsible for these effects. One of the 115 variants (rs110000337), which maps within the ATAC-seq peak, was predicted to alter binding sites of transcription factors known to be involved in lactation-related pathways. Here, we report a regulatory haplotype of 115 variants with conspicuously large impacts on milk lactoferrin concentration. These findings could enable the selection of animals for high-producing specialist herds.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"5 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140310443","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-03-25DOI: 10.1186/s12711-024-00894-7
Laura Skrubbeltrang Hansen, Stine Frey Laursen, Simon Bahrndorff, Morten Kargo, Jesper Givskov Sørensen, Goutam Sahana, Hanne Marie Nielsen, Torsten Nygaard Kristensen
There is a burgeoning interest in using insects as a sustainable source of food and feed, particularly by capitalising on various waste materials and by-products that are typically considered of low value. Enhancing the commercial production of insects can be achieved through two main approaches: optimising environmental conditions and implementing selective breeding strategies. In order to successfully target desirable traits through selective breeding, having a thorough understanding of the genetic parameters pertaining to those traits is essential. In this study, a full-sib half-sib mating design was used to estimate variance components and heritabilities for larval size and survival at day seven of development, development time and survival from egg to adult, and to estimate correlations between these traits, within an outbred population of house flies (Musca domestica), using high-throughput phenotyping for data collection. The results revealed low to intermediate heritabilities and positive genetic correlations between all traits except development time and survival to day seven of development and from egg to adulthood. Surprisingly, larval size at day seven exhibited a comparatively low heritability (0.10) in contrast to development time (0.25), a trait that is believed to have a stronger association with overall fitness. A decline in family numbers resulting from low mating success and high overall mortality reduced the amount of available data which resulted in large standard errors for the estimated parameters. Environmental factors made a substantial contribution to the phenotypic variation, which was overall high for all traits. There is potential for genetic improvement in all studied traits and estimates of genetic correlations indicate a partly shared genetic architecture among the traits. All estimates have large standard errors. Implementing high-throughput phenotyping is imperative for the estimation of genetic parameters in fast developing insects, and facilitates age synchronisation, which is vital in a breeding population. In spite of endeavours to minimise non-genetic sources of variation, all traits demonstrated substantial influences from environmental components. This emphasises the necessity of thorough attention to the experimental design before breeding is initiated in insect populations.
{"title":"Estimation of genetic parameters for the implementation of selective breeding in commercial insect production","authors":"Laura Skrubbeltrang Hansen, Stine Frey Laursen, Simon Bahrndorff, Morten Kargo, Jesper Givskov Sørensen, Goutam Sahana, Hanne Marie Nielsen, Torsten Nygaard Kristensen","doi":"10.1186/s12711-024-00894-7","DOIUrl":"https://doi.org/10.1186/s12711-024-00894-7","url":null,"abstract":"There is a burgeoning interest in using insects as a sustainable source of food and feed, particularly by capitalising on various waste materials and by-products that are typically considered of low value. Enhancing the commercial production of insects can be achieved through two main approaches: optimising environmental conditions and implementing selective breeding strategies. In order to successfully target desirable traits through selective breeding, having a thorough understanding of the genetic parameters pertaining to those traits is essential. In this study, a full-sib half-sib mating design was used to estimate variance components and heritabilities for larval size and survival at day seven of development, development time and survival from egg to adult, and to estimate correlations between these traits, within an outbred population of house flies (Musca domestica), using high-throughput phenotyping for data collection. The results revealed low to intermediate heritabilities and positive genetic correlations between all traits except development time and survival to day seven of development and from egg to adulthood. Surprisingly, larval size at day seven exhibited a comparatively low heritability (0.10) in contrast to development time (0.25), a trait that is believed to have a stronger association with overall fitness. A decline in family numbers resulting from low mating success and high overall mortality reduced the amount of available data which resulted in large standard errors for the estimated parameters. Environmental factors made a substantial contribution to the phenotypic variation, which was overall high for all traits. There is potential for genetic improvement in all studied traits and estimates of genetic correlations indicate a partly shared genetic architecture among the traits. All estimates have large standard errors. Implementing high-throughput phenotyping is imperative for the estimation of genetic parameters in fast developing insects, and facilitates age synchronisation, which is vital in a breeding population. In spite of endeavours to minimise non-genetic sources of variation, all traits demonstrated substantial influences from environmental components. This emphasises the necessity of thorough attention to the experimental design before breeding is initiated in insect populations.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"7 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140209789","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-03-19DOI: 10.1186/s12711-024-00888-5
Tom V. L. Berghof, Nicolas Bedere, Katrijn Peeters, Marieke Poppe, Jeroen Visscher, Han A. Mulder
Resilience is the capacity of an animal to be minimally affected by disturbances or to rapidly return to its initial state before exposure to a disturbance. Resilient livestock are desired because of their improved health and increased economic profit. Genetic improvement of resilience may also lead to trade-offs with production traits. Recently, resilience indicators based on longitudinal data have been suggested, but they need further evaluation to determine whether they are indeed predictive of improved resilience, such as disease resilience. This study investigated different resilience indicators based on deviations between expected and observed egg production (EP) by exploring their genetic parameters, their possible trade-offs with production traits, and their relationships with antibody traits in chickens. Egg production in a nucleus breeding herd environment based on 1-week-, 2-week-, or 3-week-intervals of two purebred chicken lines, a white egg-laying (33,825 chickens) and a brown egg-laying line (34,397 chickens), were used to determine deviations between observed EP and expected average batch EP, and between observed EP and expected individual EP. These deviations were used to calculate three types of resilience indicators for two life periods of each individual: natural logarithm-transformed variance (ln(variance)), skewness, and lag-one autocorrelation (autocorrelation) of deviations from 25 to 83 weeks of age and from 83 weeks of age to end of life. Then, we estimated their genetic correlations with EP traits and with two antibody traits. The most promising resilience indicators were those based on 1-week-intervals, as they had the highest heritability estimates (0.02–0.12) and high genetic correlations (above 0.60) with the same resilience indicators based on longer intervals. The three types of resilience indicators differed genetically from each other, which indicates that they possibly capture different aspects of resilience. Genetic correlations of the resilience indicator traits based on 1-week-intervals with EP traits were favorable or zero, which means that trade-off effects were marginal. The resilience indicator traits based on 1-week-intervals also showed no genetic correlations with the antibody traits, which suggests that they are not informative for improved immunity or vice versa in the nucleus environment. This paper gives direction towards the evaluation and implementation of resilience indicators, i.e. to further investigate resilience indicator traits based on 1-week-intervals, in breeding programs for selecting genetically more resilient layer chickens.
{"title":"The genetics of resilience and its relationships with egg production traits and antibody traits in chickens","authors":"Tom V. L. Berghof, Nicolas Bedere, Katrijn Peeters, Marieke Poppe, Jeroen Visscher, Han A. Mulder","doi":"10.1186/s12711-024-00888-5","DOIUrl":"https://doi.org/10.1186/s12711-024-00888-5","url":null,"abstract":"Resilience is the capacity of an animal to be minimally affected by disturbances or to rapidly return to its initial state before exposure to a disturbance. Resilient livestock are desired because of their improved health and increased economic profit. Genetic improvement of resilience may also lead to trade-offs with production traits. Recently, resilience indicators based on longitudinal data have been suggested, but they need further evaluation to determine whether they are indeed predictive of improved resilience, such as disease resilience. This study investigated different resilience indicators based on deviations between expected and observed egg production (EP) by exploring their genetic parameters, their possible trade-offs with production traits, and their relationships with antibody traits in chickens. Egg production in a nucleus breeding herd environment based on 1-week-, 2-week-, or 3-week-intervals of two purebred chicken lines, a white egg-laying (33,825 chickens) and a brown egg-laying line (34,397 chickens), were used to determine deviations between observed EP and expected average batch EP, and between observed EP and expected individual EP. These deviations were used to calculate three types of resilience indicators for two life periods of each individual: natural logarithm-transformed variance (ln(variance)), skewness, and lag-one autocorrelation (autocorrelation) of deviations from 25 to 83 weeks of age and from 83 weeks of age to end of life. Then, we estimated their genetic correlations with EP traits and with two antibody traits. The most promising resilience indicators were those based on 1-week-intervals, as they had the highest heritability estimates (0.02–0.12) and high genetic correlations (above 0.60) with the same resilience indicators based on longer intervals. The three types of resilience indicators differed genetically from each other, which indicates that they possibly capture different aspects of resilience. Genetic correlations of the resilience indicator traits based on 1-week-intervals with EP traits were favorable or zero, which means that trade-off effects were marginal. The resilience indicator traits based on 1-week-intervals also showed no genetic correlations with the antibody traits, which suggests that they are not informative for improved immunity or vice versa in the nucleus environment. This paper gives direction towards the evaluation and implementation of resilience indicators, i.e. to further investigate resilience indicator traits based on 1-week-intervals, in breeding programs for selecting genetically more resilient layer chickens.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"105 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140161905","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-03-15DOI: 10.1186/s12711-024-00887-6
Marina Martínez-Álvaro, Jennifer Mattock, Óscar González-Recio, Alejandro Saborío-Montero, Ziqing Weng, Joana Lima, Carol-Anne Duthie, Richard Dewhurst, Matthew A. Cleveland, Mick Watson, Rainer Roehe
Growth rate is an important component of feed conversion efficiency in cattle and varies across the different stages of the finishing period. The metabolic effect of the rumen microbiome is essential for cattle growth, and investigating the genomic and microbial factors that underlie this temporal variation can help maximize feed conversion efficiency at each growth stage. By analysing longitudinal body weights during the finishing period and genomic and metagenomic data from 359 beef cattle, our study demonstrates that the influence of the host genome on the functional rumen microbiome contributes to the temporal variation in average daily gain (ADG) in different months (ADG1, ADG2, ADG3, ADG4). Five hundred and thirty-three additive log-ratio transformed microbial genes (alr-MG) had non-zero genomic correlations (rg) with at least one ADG-trait (ranging from |0.21| to |0.42|). Only a few alr-MG correlated with more than one ADG-trait, which suggests that a differential host-microbiome determinism underlies ADG at different stages. These alr-MG were involved in ribosomal biosynthesis, energy processes, sulphur and aminoacid metabolism and transport, or lipopolysaccharide signalling, among others. We selected two alternative subsets of 32 alr-MG that had a non-uniform or a uniform rg sign with all the ADG-traits, regardless of the rg magnitude, and used them to develop a microbiome-driven breeding strategy based on alr-MG only, or combined with ADG-traits, which was aimed at shaping the rumen microbiome towards increased ADG at all finishing stages. Combining alr-MG information with ADG records increased prediction accuracy of genomic estimated breeding values (GEBV) by 11 to 22% relative to the direct breeding strategy (using ADG-traits only), whereas using microbiome information, only, achieved lower accuracies (from 7 to 41%). Predicted selection responses varied consistently with accuracies. Restricting alr-MG based on their rg sign (uniform subset) did not yield a gain in the predicted response compared to the non-uniform subset, which is explained by the absence of alr-MG showing non-zero rg at least with more than one of the ADG-traits. Our work sheds light on the role of the microbial metabolism in the growth trajectory of beef cattle at the genomic level and provides insights into the potential benefits of using microbiome information in future genomic breeding programs to accurately estimate GEBV and increase ADG at each finishing stage in beef cattle.
{"title":"Including microbiome information in a multi-trait genomic evaluation: a case study on longitudinal growth performance in beef cattle","authors":"Marina Martínez-Álvaro, Jennifer Mattock, Óscar González-Recio, Alejandro Saborío-Montero, Ziqing Weng, Joana Lima, Carol-Anne Duthie, Richard Dewhurst, Matthew A. Cleveland, Mick Watson, Rainer Roehe","doi":"10.1186/s12711-024-00887-6","DOIUrl":"https://doi.org/10.1186/s12711-024-00887-6","url":null,"abstract":"Growth rate is an important component of feed conversion efficiency in cattle and varies across the different stages of the finishing period. The metabolic effect of the rumen microbiome is essential for cattle growth, and investigating the genomic and microbial factors that underlie this temporal variation can help maximize feed conversion efficiency at each growth stage. By analysing longitudinal body weights during the finishing period and genomic and metagenomic data from 359 beef cattle, our study demonstrates that the influence of the host genome on the functional rumen microbiome contributes to the temporal variation in average daily gain (ADG) in different months (ADG1, ADG2, ADG3, ADG4). Five hundred and thirty-three additive log-ratio transformed microbial genes (alr-MG) had non-zero genomic correlations (rg) with at least one ADG-trait (ranging from |0.21| to |0.42|). Only a few alr-MG correlated with more than one ADG-trait, which suggests that a differential host-microbiome determinism underlies ADG at different stages. These alr-MG were involved in ribosomal biosynthesis, energy processes, sulphur and aminoacid metabolism and transport, or lipopolysaccharide signalling, among others. We selected two alternative subsets of 32 alr-MG that had a non-uniform or a uniform rg sign with all the ADG-traits, regardless of the rg magnitude, and used them to develop a microbiome-driven breeding strategy based on alr-MG only, or combined with ADG-traits, which was aimed at shaping the rumen microbiome towards increased ADG at all finishing stages. Combining alr-MG information with ADG records increased prediction accuracy of genomic estimated breeding values (GEBV) by 11 to 22% relative to the direct breeding strategy (using ADG-traits only), whereas using microbiome information, only, achieved lower accuracies (from 7 to 41%). Predicted selection responses varied consistently with accuracies. Restricting alr-MG based on their rg sign (uniform subset) did not yield a gain in the predicted response compared to the non-uniform subset, which is explained by the absence of alr-MG showing non-zero rg at least with more than one of the ADG-traits. Our work sheds light on the role of the microbial metabolism in the growth trajectory of beef cattle at the genomic level and provides insights into the potential benefits of using microbiome information in future genomic breeding programs to accurately estimate GEBV and increase ADG at each finishing stage in beef cattle.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"364 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140139531","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Validation by data truncation is a common practice in genetic evaluations because of the interest in predicting the genetic merit of a set of young selection candidates. Two of the most used validation methods in genetic evaluations use a single data partition: predictivity or predictive ability (correlation between pre-adjusted phenotypes and estimated breeding values (EBV) divided by the square root of the heritability) and the linear regression (LR) method (comparison of “early” and “late” EBV). Both methods compare predictions with the whole dataset and a partial dataset that is obtained by removing the information related to a set of validation individuals. EBV obtained with the partial dataset are compared against adjusted phenotypes for the predictivity or EBV obtained with the whole dataset in the LR method. Confidence intervals for predictivity and the LR method can be obtained by replicating the validation for different samples (or folds), or bootstrapping. Analytical confidence intervals would be beneficial to avoid running several validations and to test the quality of the bootstrap intervals. However, analytical confidence intervals are unavailable for predictivity and the LR method. We derived standard errors and Wald confidence intervals for the predictivity and statistics included in the LR method (bias, dispersion, ratio of accuracies, and reliability). The confidence intervals for the bias, dispersion, and reliability depend on the relationships and prediction error variances and covariances across the individuals in the validation set. We developed approximations for large datasets that only need the reliabilities of the individuals in the validation set. The confidence intervals for the ratio of accuracies and predictivity were obtained through the Fisher transformation. We show the adequacy of both the analytical and approximated analytical confidence intervals and compare them versus bootstrap confidence intervals using two simulated examples. The analytical confidence intervals were closer to the simulated ones for both examples. Bootstrap confidence intervals tend to be narrower than the simulated ones. The approximated analytical confidence intervals were similar to those obtained by bootstrapping. Estimating the sampling variation of predictivity and the statistics in the LR method without replication or bootstrap is possible for any dataset with the formulas presented in this study.
通过数据截断进行验证是遗传评估中的常见做法,因为人们希望预测一组年轻候选品种的遗传优势。遗传评估中最常用的两种验证方法是使用单一数据分区:预测性或预测能力(预调整表型与估计育种值(EBV)之间的相关性除以遗传率的平方根)和线性回归(LR)方法("早期 "和 "晚期 "EBV 的比较)。这两种方法都是将预测结果与整个数据集和部分数据集进行比较,部分数据集是通过去除与一组验证个体相关的信息而得到的。用部分数据集获得的 EBV 与 LR 方法中预测性的调整表型或整个数据集获得的 EBV 进行比较。预测性和 LR 方法的置信区间可通过对不同样本(或褶皱)进行重复验证或引导获得。分析置信区间有助于避免多次验证,并检验自举区间的质量。然而,预测性和 LR 方法没有分析置信区间。我们推导出了预测性和 LR 方法所含统计量(偏差、离散度、精确度比率和可靠性)的标准误差和 Wald 置信区间。偏差、离散度和可靠性的置信区间取决于验证集中各个体之间的关系和预测误差方差和协方差。我们为大型数据集开发了近似值,只需要验证集中个体的可靠性。准确率和预测率之比的置信区间是通过费雪变换得到的。我们用两个模拟例子说明了分析置信区间和近似分析置信区间的充分性,并将它们与自引导置信区间进行了比较。在两个例子中,分析置信区间都更接近模拟置信区间。引导置信区间往往比模拟置信区间更窄。近似分析置信区间与引导法得到的置信区间相似。对于任何数据集,利用本研究提出的公式都可以估计预测性的抽样变化和 LR 方法中的统计量,而无需复制或引导。
{"title":"Confidence intervals for validation statistics with data truncation in genomic prediction","authors":"Matias Bermann, Andres Legarra, Alejandra Alvarez Munera, Ignacy Misztal, Daniela Lourenco","doi":"10.1186/s12711-024-00883-w","DOIUrl":"https://doi.org/10.1186/s12711-024-00883-w","url":null,"abstract":"Validation by data truncation is a common practice in genetic evaluations because of the interest in predicting the genetic merit of a set of young selection candidates. Two of the most used validation methods in genetic evaluations use a single data partition: predictivity or predictive ability (correlation between pre-adjusted phenotypes and estimated breeding values (EBV) divided by the square root of the heritability) and the linear regression (LR) method (comparison of “early” and “late” EBV). Both methods compare predictions with the whole dataset and a partial dataset that is obtained by removing the information related to a set of validation individuals. EBV obtained with the partial dataset are compared against adjusted phenotypes for the predictivity or EBV obtained with the whole dataset in the LR method. Confidence intervals for predictivity and the LR method can be obtained by replicating the validation for different samples (or folds), or bootstrapping. Analytical confidence intervals would be beneficial to avoid running several validations and to test the quality of the bootstrap intervals. However, analytical confidence intervals are unavailable for predictivity and the LR method. We derived standard errors and Wald confidence intervals for the predictivity and statistics included in the LR method (bias, dispersion, ratio of accuracies, and reliability). The confidence intervals for the bias, dispersion, and reliability depend on the relationships and prediction error variances and covariances across the individuals in the validation set. We developed approximations for large datasets that only need the reliabilities of the individuals in the validation set. The confidence intervals for the ratio of accuracies and predictivity were obtained through the Fisher transformation. We show the adequacy of both the analytical and approximated analytical confidence intervals and compare them versus bootstrap confidence intervals using two simulated examples. The analytical confidence intervals were closer to the simulated ones for both examples. Bootstrap confidence intervals tend to be narrower than the simulated ones. The approximated analytical confidence intervals were similar to those obtained by bootstrapping. Estimating the sampling variation of predictivity and the statistics in the LR method without replication or bootstrap is possible for any dataset with the formulas presented in this study.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"86 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140064146","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-03-01DOI: 10.1186/s12711-024-00881-y
Theo Meuwissen, Leiv Sigbjorn Eikje, Arne B. Gjuvsland
Since the very beginning of genomic selection, researchers investigated methods that improved upon SNP-BLUP (single nucleotide polymorphism best linear unbiased prediction). SNP-BLUP gives equal weight to all SNPs, whereas it is expected that many SNPs are not near causal variants and thus do not have substantial effects. A recent approach to remedy this is to use genome-wide association study (GWAS) findings and increase the weights of GWAS-top-SNPs in genomic predictions. Here, we employ a genome-wide approach to integrate GWAS results into genomic prediction, called GWABLUP. GWABLUP consists of the following steps: (1) performing a GWAS in the training data which results in likelihood ratios; (2) smoothing the likelihood ratios over the SNPs; (3) combining the smoothed likelihood ratio with the prior probability of SNPs having non-zero effects, which yields the posterior probability of the SNPs; (4) calculating a weighted genomic relationship matrix using the posterior probabilities as weights; and (5) performing genomic prediction using the weighted genomic relationship matrix. Using high-density genotypes and milk, fat, protein and somatic cell count phenotypes on dairy cows, GWABLUP was compared to GBLUP, GBLUP (topSNPs) with extra weights for GWAS top-SNPs, and BayesGC, i.e. a Bayesian variable selection model. The GWAS resulted in six, five, four, and three genome-wide significant peaks for milk, fat and protein yield and somatic cell count, respectively. GWABLUP genomic predictions were 10, 6, 7 and 1% more reliable than those of GBLUP for milk, fat and protein yield and somatic cell count, respectively. It was also more reliable than GBLUP (topSNPs) for all four traits, and more reliable than BayesGC for three of the traits. Although GWABLUP showed a tendency towards inflation bias for three of the traits, this was not statistically significant. In a multitrait analysis, GWABLUP yielded the highest accuracy for two of the traits. However, for SCC, which was relatively unrelated to the yield traits, including yield trait GWAS-results reduced the reliability compared to a single trait analysis. GWABLUP uses GWAS results to differentially weigh all the SNPs in a weighted GBLUP genomic prediction analysis. GWABLUP yielded up to 10% and 13% more reliable genomic predictions than GBLUP for single and multitrait analyses, respectively. Extension of GWABLUP to single-step analyses is straightforward.
{"title":"GWABLUP: genome-wide association assisted best linear unbiased prediction of genetic values","authors":"Theo Meuwissen, Leiv Sigbjorn Eikje, Arne B. Gjuvsland","doi":"10.1186/s12711-024-00881-y","DOIUrl":"https://doi.org/10.1186/s12711-024-00881-y","url":null,"abstract":"Since the very beginning of genomic selection, researchers investigated methods that improved upon SNP-BLUP (single nucleotide polymorphism best linear unbiased prediction). SNP-BLUP gives equal weight to all SNPs, whereas it is expected that many SNPs are not near causal variants and thus do not have substantial effects. A recent approach to remedy this is to use genome-wide association study (GWAS) findings and increase the weights of GWAS-top-SNPs in genomic predictions. Here, we employ a genome-wide approach to integrate GWAS results into genomic prediction, called GWABLUP. GWABLUP consists of the following steps: (1) performing a GWAS in the training data which results in likelihood ratios; (2) smoothing the likelihood ratios over the SNPs; (3) combining the smoothed likelihood ratio with the prior probability of SNPs having non-zero effects, which yields the posterior probability of the SNPs; (4) calculating a weighted genomic relationship matrix using the posterior probabilities as weights; and (5) performing genomic prediction using the weighted genomic relationship matrix. Using high-density genotypes and milk, fat, protein and somatic cell count phenotypes on dairy cows, GWABLUP was compared to GBLUP, GBLUP (topSNPs) with extra weights for GWAS top-SNPs, and BayesGC, i.e. a Bayesian variable selection model. The GWAS resulted in six, five, four, and three genome-wide significant peaks for milk, fat and protein yield and somatic cell count, respectively. GWABLUP genomic predictions were 10, 6, 7 and 1% more reliable than those of GBLUP for milk, fat and protein yield and somatic cell count, respectively. It was also more reliable than GBLUP (topSNPs) for all four traits, and more reliable than BayesGC for three of the traits. Although GWABLUP showed a tendency towards inflation bias for three of the traits, this was not statistically significant. In a multitrait analysis, GWABLUP yielded the highest accuracy for two of the traits. However, for SCC, which was relatively unrelated to the yield traits, including yield trait GWAS-results reduced the reliability compared to a single trait analysis. GWABLUP uses GWAS results to differentially weigh all the SNPs in a weighted GBLUP genomic prediction analysis. GWABLUP yielded up to 10% and 13% more reliable genomic predictions than GBLUP for single and multitrait analyses, respectively. Extension of GWABLUP to single-step analyses is straightforward.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"33 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140000861","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-02-29DOI: 10.1186/s12711-024-00876-9
Beatriz C. D. Cuyabano, Didier Boichard, Cedric Gondro
Genetic merit, or breeding values as referred to in livestock and crop breeding programs, is one of the keys to the successful selection of animals in commercial farming systems. The developments in statistical methods during the twentieth century and single nucleotide polymorphism (SNP) chip technologies in the twenty-first century have revolutionized agricultural production, by allowing highly accurate predictions of breeding values for selection candidates at a very early age. Nonetheless, for many breeding populations, realized accuracies of predicted breeding values (PBV) remain below the theoretical maximum, even when the reference population is sufficiently large, and SNPs included in the model are in sufficient linkage disequilibrium (LD) with the quantitative trait locus (QTL). This is particularly noticeable over generations, as we observe the so-called erosion of the effects of SNPs due to recombinations, accompanied by the erosion of the accuracy of prediction. While accurately quantifying the erosion at the individual SNP level is a difficult and unresolved task, quantifying the erosion of the accuracy of prediction is a more tractable problem. In this paper, we describe a method that uses the relationship between reference and target populations to calculate expected values for the accuracies of predicted breeding values for non-phenotyped individuals accounting for erosion. The accuracy of the expected values was evaluated through simulations, and a further evaluation was performed on real data. Using simulations, we empirically confirmed that our expected values for the accuracy of PBV accounting for erosion were able to correctly determine the prediction accuracy of breeding values for non-phenotyped individuals. When comparing the expected to the realized accuracies of PBV with real data, only one out of the four traits evaluated presented accuracies that were significantly higher than the expected, approaching $$sqrt{{{text{h}}}^{2}}$$ . We defined an index of genetic correlation between reference and target populations, which summarizes the expected overall erosion due to differences in allele frequencies and LD patterns between populations. We used this correlation along with a trait’s heritability to derive expected values for the accuracy ( $${text{R}}$$ ) of PBV accounting for the erosion, and demonstrated that our derived $${text{E}}left[{text{R}}|{text{erosion}}right]$$ is a reliable metric.
家畜和农作物育种计划中提到的遗传优势或育种价值,是商业化农业系统成功选育动物的关键之一。二十世纪统计方法和二十一世纪单核苷酸多态性(SNP)芯片技术的发展为农业生产带来了革命性的变化,使人们能够在很早的时候就高度准确地预测候选牲畜的育种价值。然而,对于许多育种群体来说,即使参考群体足够大,且模型中的 SNP 与数量性状位点(QTL)有足够的连锁不平衡(LD),预测育种值(PBV)的实际准确度仍低于理论最大值。这一点在世代交替过程中尤为明显,因为我们观察到所谓的重组导致的 SNP 效应侵蚀,同时伴随着预测准确性的降低。准确量化单个 SNP 水平上的侵蚀是一项困难且尚未解决的任务,而量化预测准确性的侵蚀则是一个更容易解决的问题。在本文中,我们介绍了一种方法,该方法利用参照群体和目标群体之间的关系来计算非表型个体的预测育种值精度的预期值,并将侵蚀考虑在内。通过模拟评估了预期值的准确性,并在真实数据上进行了进一步评估。通过模拟,我们经验性地证实,我们对考虑侵蚀因素的 PBV 精确度的预期值能够正确地确定非表型个体繁殖值的预测精确度。在用真实数据比较 PBV 的预期准确度和实际准确度时,在评估的四个性状中,只有一个性状的准确度明显高于预期,接近 $$sqrt{{text{h}}}^{2}}$$。我们定义了参照种群和目标种群之间的遗传相关性指数,该指数概括了由于种群间等位基因频率和 LD 模式的差异而导致的预期总体侵蚀。我们将这种相关性与性状的遗传率一起用于推导 PBV 计算侵蚀的准确性($${text{R}}$)的预期值,并证明我们推导出的$${text{E}}left[{text{R}}|{text{erosion}}right]$$是一个可靠的指标。
{"title":"Expected values for the accuracy of predicted breeding values accounting for genetic differences between reference and target populations","authors":"Beatriz C. D. Cuyabano, Didier Boichard, Cedric Gondro","doi":"10.1186/s12711-024-00876-9","DOIUrl":"https://doi.org/10.1186/s12711-024-00876-9","url":null,"abstract":"Genetic merit, or breeding values as referred to in livestock and crop breeding programs, is one of the keys to the successful selection of animals in commercial farming systems. The developments in statistical methods during the twentieth century and single nucleotide polymorphism (SNP) chip technologies in the twenty-first century have revolutionized agricultural production, by allowing highly accurate predictions of breeding values for selection candidates at a very early age. Nonetheless, for many breeding populations, realized accuracies of predicted breeding values (PBV) remain below the theoretical maximum, even when the reference population is sufficiently large, and SNPs included in the model are in sufficient linkage disequilibrium (LD) with the quantitative trait locus (QTL). This is particularly noticeable over generations, as we observe the so-called erosion of the effects of SNPs due to recombinations, accompanied by the erosion of the accuracy of prediction. While accurately quantifying the erosion at the individual SNP level is a difficult and unresolved task, quantifying the erosion of the accuracy of prediction is a more tractable problem. In this paper, we describe a method that uses the relationship between reference and target populations to calculate expected values for the accuracies of predicted breeding values for non-phenotyped individuals accounting for erosion. The accuracy of the expected values was evaluated through simulations, and a further evaluation was performed on real data. Using simulations, we empirically confirmed that our expected values for the accuracy of PBV accounting for erosion were able to correctly determine the prediction accuracy of breeding values for non-phenotyped individuals. When comparing the expected to the realized accuracies of PBV with real data, only one out of the four traits evaluated presented accuracies that were significantly higher than the expected, approaching $$sqrt{{{text{h}}}^{2}}$$ . We defined an index of genetic correlation between reference and target populations, which summarizes the expected overall erosion due to differences in allele frequencies and LD patterns between populations. We used this correlation along with a trait’s heritability to derive expected values for the accuracy ( $${text{R}}$$ ) of PBV accounting for the erosion, and demonstrated that our derived $${text{E}}left[{text{R}}|{text{erosion}}right]$$ is a reliable metric.","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"24 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-02-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139994288","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-02-29DOI: 10.1186/s12711-024-00886-7
Maxime Ben Braiek, Carole Moreno-Romieux, Céline André, Jean-Michel Astruc, Philippe Bardou, Arnaud Bordes, Frédéric Debat, Francis Fidelle, Itsasne Granado-Tajada, Chris Hozé, Florence Plisson-Petit, François Rivemale, Julien Sarry, Némuel Tadi, Florent Woloszyn, Stéphane Fabre
Background: Recessive deleterious variants are known to segregate in livestock populations, as in humans, and some may be lethal in the homozygous state.
Results: We used phased 50 k single nucleotide polymorphism (SNP) genotypes and pedigree data to scan the genome of 6845 Manech Tête Rousse dairy sheep to search for deficiency in homozygous haplotypes (DHH). Five Manech Tête Rousse deficient homozygous haplotypes (MTRDHH1 to 5) were identified, with a homozygous deficiency ranging from 84 to 100%. These haplotypes are located on Ovis aries chromosome (OAR)1 (MTRDHH2 and 3), OAR10 (MTRDHH4), OAR13 (MTRDHH5), and OAR20 (MTRDHH1), and have carrier frequencies ranging from 7.8 to 16.6%. When comparing at-risk matings between DHH carriers to safe matings between non-carriers, two DHH (MTRDHH1 and 2) were linked with decreased insemination success and/or increased stillbirth incidence. We investigated the MTRDHH1 haplotype, which substantially increased stillbirth rate, and identified a single nucleotide variant (SNV) inducing a premature stop codon (p.Gln409*) in the methylmalonyl-CoA mutase (MMUT) gene by using a whole-genome sequencing approach. We generated homozygous lambs for the MMUT mutation by at-risk mating between heterozygous carriers, and most of them died within the first 24 h after birth without any obvious clinical symptoms. Reverse transcriptase-qPCR and western blotting on post-mortem liver and kidney biological samples showed a decreased expression of MMUT mRNA in the liver and absence of a full-length MMUT protein in the mutant homozygous lambs.
Conclusions: We identified five homozygous deficient haplotypes that are likely to harbor five independent deleterious recessive variants in sheep. One of these was detected in the MMUT gene, which is associated with lamb lethality in the homozygous state. A specific management of these haplotypes/variants in the MTR dairy sheep selection program would help enhance the overall fertility and lamb survival.
{"title":"Searching for homozygous haplotype deficiency in Manech Tête Rousse dairy sheep revealed a nonsense variant in the MMUT gene affecting newborn lamb viability.","authors":"Maxime Ben Braiek, Carole Moreno-Romieux, Céline André, Jean-Michel Astruc, Philippe Bardou, Arnaud Bordes, Frédéric Debat, Francis Fidelle, Itsasne Granado-Tajada, Chris Hozé, Florence Plisson-Petit, François Rivemale, Julien Sarry, Némuel Tadi, Florent Woloszyn, Stéphane Fabre","doi":"10.1186/s12711-024-00886-7","DOIUrl":"10.1186/s12711-024-00886-7","url":null,"abstract":"<p><strong>Background: </strong>Recessive deleterious variants are known to segregate in livestock populations, as in humans, and some may be lethal in the homozygous state.</p><p><strong>Results: </strong>We used phased 50 k single nucleotide polymorphism (SNP) genotypes and pedigree data to scan the genome of 6845 Manech Tête Rousse dairy sheep to search for deficiency in homozygous haplotypes (DHH). Five Manech Tête Rousse deficient homozygous haplotypes (MTRDHH1 to 5) were identified, with a homozygous deficiency ranging from 84 to 100%. These haplotypes are located on Ovis aries chromosome (OAR)1 (MTRDHH2 and 3), OAR10 (MTRDHH4), OAR13 (MTRDHH5), and OAR20 (MTRDHH1), and have carrier frequencies ranging from 7.8 to 16.6%. When comparing at-risk matings between DHH carriers to safe matings between non-carriers, two DHH (MTRDHH1 and 2) were linked with decreased insemination success and/or increased stillbirth incidence. We investigated the MTRDHH1 haplotype, which substantially increased stillbirth rate, and identified a single nucleotide variant (SNV) inducing a premature stop codon (p.Gln409*) in the methylmalonyl-CoA mutase (MMUT) gene by using a whole-genome sequencing approach. We generated homozygous lambs for the MMUT mutation by at-risk mating between heterozygous carriers, and most of them died within the first 24 h after birth without any obvious clinical symptoms. Reverse transcriptase-qPCR and western blotting on post-mortem liver and kidney biological samples showed a decreased expression of MMUT mRNA in the liver and absence of a full-length MMUT protein in the mutant homozygous lambs.</p><p><strong>Conclusions: </strong>We identified five homozygous deficient haplotypes that are likely to harbor five independent deleterious recessive variants in sheep. One of these was detected in the MMUT gene, which is associated with lamb lethality in the homozygous state. A specific management of these haplotypes/variants in the MTR dairy sheep selection program would help enhance the overall fertility and lamb survival.</p>","PeriodicalId":55120,"journal":{"name":"Genetics Selection Evolution","volume":"56 1","pages":"16"},"PeriodicalIF":4.1,"publicationDate":"2024-02-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10905913/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139998264","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}