Pub Date : 2025-02-05DOI: 10.1093/genetics/iyae196
James Hose, Qi Zheng, Nathaniel P Sharp, Audrey P Gasch
Aneuploidy, arising from the gain or loss of chromosomes due to nondisjunction, is a special class of mutation. It can create significant phenotypic changes by altering the abundance of hundreds of genes in a single event, providing material for adaptive evolution. But it can also incur large fitness costs relative to other types of mutations. Understanding the mutational dynamics of aneuploidy is important for modeling its impact in nature, but aneuploidy rates are difficult to measure accurately. One challenge is that aneuploid karyotypes may revert back to euploidy, biasing forward mutation rate estimates-yet the rate of aneuploidy reversion is largely uncharacterized. Furthermore, current rate estimates are confounded because fitness differences between euploids and aneuploids are typically not accounted for in rate calculations. We developed a unique fluctuation assay in a wild-yeast model to measure the rate of extra-chromosome loss across 3 aneuploid chromosomes while accounting for fitness differences between aneuploid and euploid cells. We show that incorporating fitness effects is essential to obtain accurate estimates of aneuploidy rates. Furthermore, the rate of extra-chromosome loss, separate from karyotype fitness differences, varies across chromosomes. We also measured rates in a strain lacking RNA-binding protein Ssd1, important for aneuploidy tolerance and implicated in chromosome segregation. We found no role for Ssd1 in the loss of native aneuploid chromosomes, although it did impact an engineered chromosome XV with a perturbed centromeric sequence. We discuss the impacts and challenges of modeling aneuploidy dynamics in real-world situations.
{"title":"On the rate of aneuploidy reversion in a wild yeast model.","authors":"James Hose, Qi Zheng, Nathaniel P Sharp, Audrey P Gasch","doi":"10.1093/genetics/iyae196","DOIUrl":"10.1093/genetics/iyae196","url":null,"abstract":"<p><p>Aneuploidy, arising from the gain or loss of chromosomes due to nondisjunction, is a special class of mutation. It can create significant phenotypic changes by altering the abundance of hundreds of genes in a single event, providing material for adaptive evolution. But it can also incur large fitness costs relative to other types of mutations. Understanding the mutational dynamics of aneuploidy is important for modeling its impact in nature, but aneuploidy rates are difficult to measure accurately. One challenge is that aneuploid karyotypes may revert back to euploidy, biasing forward mutation rate estimates-yet the rate of aneuploidy reversion is largely uncharacterized. Furthermore, current rate estimates are confounded because fitness differences between euploids and aneuploids are typically not accounted for in rate calculations. We developed a unique fluctuation assay in a wild-yeast model to measure the rate of extra-chromosome loss across 3 aneuploid chromosomes while accounting for fitness differences between aneuploid and euploid cells. We show that incorporating fitness effects is essential to obtain accurate estimates of aneuploidy rates. Furthermore, the rate of extra-chromosome loss, separate from karyotype fitness differences, varies across chromosomes. We also measured rates in a strain lacking RNA-binding protein Ssd1, important for aneuploidy tolerance and implicated in chromosome segregation. We found no role for Ssd1 in the loss of native aneuploid chromosomes, although it did impact an engineered chromosome XV with a perturbed centromeric sequence. We discuss the impacts and challenges of modeling aneuploidy dynamics in real-world situations.</p>","PeriodicalId":48925,"journal":{"name":"Genetics","volume":" ","pages":""},"PeriodicalIF":3.3,"publicationDate":"2025-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142711576","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Standard measures of linkage disequilibrium (LD) are affected by admixture and population structure, such that loci that are not in LD within each ancestral population appear linked when considered jointly across the populations. The influence of population structure on LD can cause problems for downstream analysis methods, in particular those that rely on LD pruning or clumping. To address this issue, we propose a measure of LD that accommodates population structure using the top inferred principal components. We estimate LD from the correlation of genotype residuals and prove that this LD measure remains unaffected by population structure when analyzing multiple populations jointly, even with admixed individuals. Based on this adjusted measure of LD, we can perform LD pruning to remove the correlation between markers for downstream analysis. Traditional LD pruning is more likely to remove markers with high differences in allele frequencies between populations, which biases measures for genetic differentiation and removes markers that are not in LD in the ancestral populations. Using data from moderately differentiated human populations and highly differentiated giraffe populations we show that traditional LD pruning biases FST and principal component analysis (PCA), which can be alleviated with the adjusted LD measure. In addition, we show that the adjusted LD leads to better PCA when pruning and that LD clumping retains more sites with the retained sites having stronger associations.
{"title":"Measuring linkage disequilibrium and improvement of pruning and clumping in structured populations.","authors":"Ulises Bercovich, Malthe Sebro Rasmussen, Zilong Li, Carsten Wiuf, Anders Albrechtsen","doi":"10.1093/genetics/iyaf009","DOIUrl":"https://doi.org/10.1093/genetics/iyaf009","url":null,"abstract":"<p><p>Standard measures of linkage disequilibrium (LD) are affected by admixture and population structure, such that loci that are not in LD within each ancestral population appear linked when considered jointly across the populations. The influence of population structure on LD can cause problems for downstream analysis methods, in particular those that rely on LD pruning or clumping. To address this issue, we propose a measure of LD that accommodates population structure using the top inferred principal components. We estimate LD from the correlation of genotype residuals and prove that this LD measure remains unaffected by population structure when analyzing multiple populations jointly, even with admixed individuals. Based on this adjusted measure of LD, we can perform LD pruning to remove the correlation between markers for downstream analysis. Traditional LD pruning is more likely to remove markers with high differences in allele frequencies between populations, which biases measures for genetic differentiation and removes markers that are not in LD in the ancestral populations. Using data from moderately differentiated human populations and highly differentiated giraffe populations we show that traditional LD pruning biases FST and principal component analysis (PCA), which can be alleviated with the adjusted LD measure. In addition, we show that the adjusted LD leads to better PCA when pruning and that LD clumping retains more sites with the retained sites having stronger associations.</p>","PeriodicalId":48925,"journal":{"name":"Genetics","volume":" ","pages":""},"PeriodicalIF":3.3,"publicationDate":"2025-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143191112","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-02-05DOI: 10.1093/genetics/iyae208
Joana Teixeira, Anu-Mari Harju, Alaa Othman, Ove Eriksson, Brendan J Battersby, Susana M D A Garcia
Expansion of nucleotide repeat sequences is associated with more than 40 human neuromuscular disorders. The different pathogenic mechanisms associated with the expression of nucleotide repeats are not well understood. We use a Caenorhabditis elegans model that expresses expanded CUG repeats only in cells of the body wall muscle and recapitulate muscle dysfunction and impaired organismal motility to identify the basis by which expression of RNA repeats is toxic to muscle function. Here, we performed 2 consecutive RNA interference screens and uncovered coenzyme Q metabolism and mitochondrial dysfunction as critical genetic modifiers of the motility phenotype. Furthermore, coenzyme Q supplementation reduced the toxic phenotypes, ameliorating the motility impairment and mitochondrial phenotypes. Together our data show how the expression of expanded RNA repeats can be toxic to mitochondrial homeostasis.
{"title":"Coenzyme Q improves mitochondrial and muscle dysfunction caused by CUG expanded repeats in Caenorhabditis elegans.","authors":"Joana Teixeira, Anu-Mari Harju, Alaa Othman, Ove Eriksson, Brendan J Battersby, Susana M D A Garcia","doi":"10.1093/genetics/iyae208","DOIUrl":"10.1093/genetics/iyae208","url":null,"abstract":"<p><p>Expansion of nucleotide repeat sequences is associated with more than 40 human neuromuscular disorders. The different pathogenic mechanisms associated with the expression of nucleotide repeats are not well understood. We use a Caenorhabditis elegans model that expresses expanded CUG repeats only in cells of the body wall muscle and recapitulate muscle dysfunction and impaired organismal motility to identify the basis by which expression of RNA repeats is toxic to muscle function. Here, we performed 2 consecutive RNA interference screens and uncovered coenzyme Q metabolism and mitochondrial dysfunction as critical genetic modifiers of the motility phenotype. Furthermore, coenzyme Q supplementation reduced the toxic phenotypes, ameliorating the motility impairment and mitochondrial phenotypes. Together our data show how the expression of expanded RNA repeats can be toxic to mitochondrial homeostasis.</p>","PeriodicalId":48925,"journal":{"name":"Genetics","volume":" ","pages":""},"PeriodicalIF":3.3,"publicationDate":"2025-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142899614","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-02-05DOI: 10.1093/genetics/iyae193
Jeffrey B Endelman
Breeders have long appreciated the need to balance selection for short-term genetic gain with maintaining genetic variance for long-term gain. For outbred populations, the method called optimum contribution selection (OCS) chooses parental contributions to maximize the average breeding value at a prescribed inbreeding rate. With optimum mate allocation (OMA), the contribution of each mating is optimized, which allows for specific combining ability due to dominance. To enable OCS and OMA in polyploid species, new theoretical results were derived to (1) predict midparent heterosis due to dominance and (2) control inbreeding in a population of arbitrary ploidy. A new convex optimization framework for OMA, named COMA, was developed and released as public software. Under stochastic simulation of a genomic selection program, COMA maintained a target inbreeding rate of 0.5% using either pedigree or genomic IBD (identity-by-descent) kinship. Significantly more genetic gain was realized with pedigree kinship, which is consistent with previous studies showing the selective advantage of an individual under OCS is dominated by its Mendelian sampling term. Despite the higher accuracy (+0.2-0.3) when predicting mate performance with OMA compared with OCS, there was little long-term gain advantage. The sparsity of the COMA mating design and flexibility to incorporate mating constraints offer practical incentives over OCS. In a potato breeding case study with 170 candidates, the optimal solution at 0.5% inbreeding involved 43 parents but only 43 of the 903 possible matings.
{"title":"Genomic prediction of heterosis, inbreeding control, and mate allocation in outbred diploid and tetraploid populations.","authors":"Jeffrey B Endelman","doi":"10.1093/genetics/iyae193","DOIUrl":"10.1093/genetics/iyae193","url":null,"abstract":"<p><p>Breeders have long appreciated the need to balance selection for short-term genetic gain with maintaining genetic variance for long-term gain. For outbred populations, the method called optimum contribution selection (OCS) chooses parental contributions to maximize the average breeding value at a prescribed inbreeding rate. With optimum mate allocation (OMA), the contribution of each mating is optimized, which allows for specific combining ability due to dominance. To enable OCS and OMA in polyploid species, new theoretical results were derived to (1) predict midparent heterosis due to dominance and (2) control inbreeding in a population of arbitrary ploidy. A new convex optimization framework for OMA, named COMA, was developed and released as public software. Under stochastic simulation of a genomic selection program, COMA maintained a target inbreeding rate of 0.5% using either pedigree or genomic IBD (identity-by-descent) kinship. Significantly more genetic gain was realized with pedigree kinship, which is consistent with previous studies showing the selective advantage of an individual under OCS is dominated by its Mendelian sampling term. Despite the higher accuracy (+0.2-0.3) when predicting mate performance with OMA compared with OCS, there was little long-term gain advantage. The sparsity of the COMA mating design and flexibility to incorporate mating constraints offer practical incentives over OCS. In a potato breeding case study with 170 candidates, the optimal solution at 0.5% inbreeding involved 43 parents but only 43 of the 903 possible matings.</p>","PeriodicalId":48925,"journal":{"name":"Genetics","volume":" ","pages":""},"PeriodicalIF":3.3,"publicationDate":"2025-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142649033","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-02-05DOI: 10.1093/genetics/iyae195
Jacob D Washburn, José Ignacio Varela, Alencar Xavier, Qiuyue Chen, David Ertl, Joseph L Gage, James B Holland, Dayane Cristina Lima, Maria Cinta Romay, Marco Lopez-Cruz, Gustavo de Los Campos, Wesley Barber, Cristiano Zimmer, Ignacio Trucillo Silva, Fabiani Rocha, Renaud Rincent, Baber Ali, Haixiao Hu, Daniel E Runcie, Kirill Gusev, Andrei Slabodkin, Phillip Bax, Julie Aubert, Hugo Gangloff, Tristan Mary-Huard, Theodore Vanrenterghem, Carles Quesada-Traver, Steven Yates, Daniel Ariza-Suárez, Argeo Ulrich, Michele Wyler, Daniel R Kick, Emily S Bellis, Jason L Causey, Emilio Soriano Chavez, Yixing Wang, Ved Piyush, Gayara D Fernando, Robert K Hu, Rachit Kumar, Annan J Timon, Rasika Venkatesh, Kenia Segura Abá, Huan Chen, Thilanka Ranaweera, Shin-Han Shiu, Peiran Wang, Max J Gordon, B Kirtley Amos, Sebastiano Busato, Daniel Perondi, Abhishek Gogna, Dennis Psaroudakis, Chun-Peng James Chen, Hawlader A Al-Mamun, Monica F Danilevicz, Shriprabha R Upadhyaya, David Edwards, Natalia de Leon
Predicting phenotypes from a combination of genetic and environmental factors is a grand challenge of modern biology. Slight improvements in this area have the potential to save lives, improve food and fuel security, permit better care of the planet, and create other positive outcomes. In 2022 and 2023, the first open-to-the-public Genomes to Fields initiative Genotype by Environment prediction competition was held using a large dataset including genomic variation, phenotype and weather measurements, and field management notes gathered by the project over 9 years. The competition attracted registrants from around the world with representation from academic, government, industry, and nonprofit institutions as well as unaffiliated. These participants came from diverse disciplines, including plant science, animal science, breeding, statistics, computational biology, and others. Some participants had no formal genetics or plant-related training, and some were just beginning their graduate education. The teams applied varied methods and strategies, providing a wealth of modeling knowledge based on a common dataset. The winner's strategy involved 2 models combining machine learning and traditional breeding tools: 1 model emphasized environment using features extracted by random forest, ridge regression, and least squares, and 1 focused on genetics. Other high-performing teams' methods included quantitative genetics, machine learning/deep learning, mechanistic models, and model ensembles. The dataset factors used, such as genetics, weather, and management data, were also diverse, demonstrating that no single model or strategy is far superior to all others within the context of this competition.
{"title":"Global genotype by environment prediction competition reveals that diverse modeling strategies can deliver satisfactory maize yield estimates.","authors":"Jacob D Washburn, José Ignacio Varela, Alencar Xavier, Qiuyue Chen, David Ertl, Joseph L Gage, James B Holland, Dayane Cristina Lima, Maria Cinta Romay, Marco Lopez-Cruz, Gustavo de Los Campos, Wesley Barber, Cristiano Zimmer, Ignacio Trucillo Silva, Fabiani Rocha, Renaud Rincent, Baber Ali, Haixiao Hu, Daniel E Runcie, Kirill Gusev, Andrei Slabodkin, Phillip Bax, Julie Aubert, Hugo Gangloff, Tristan Mary-Huard, Theodore Vanrenterghem, Carles Quesada-Traver, Steven Yates, Daniel Ariza-Suárez, Argeo Ulrich, Michele Wyler, Daniel R Kick, Emily S Bellis, Jason L Causey, Emilio Soriano Chavez, Yixing Wang, Ved Piyush, Gayara D Fernando, Robert K Hu, Rachit Kumar, Annan J Timon, Rasika Venkatesh, Kenia Segura Abá, Huan Chen, Thilanka Ranaweera, Shin-Han Shiu, Peiran Wang, Max J Gordon, B Kirtley Amos, Sebastiano Busato, Daniel Perondi, Abhishek Gogna, Dennis Psaroudakis, Chun-Peng James Chen, Hawlader A Al-Mamun, Monica F Danilevicz, Shriprabha R Upadhyaya, David Edwards, Natalia de Leon","doi":"10.1093/genetics/iyae195","DOIUrl":"10.1093/genetics/iyae195","url":null,"abstract":"<p><p>Predicting phenotypes from a combination of genetic and environmental factors is a grand challenge of modern biology. Slight improvements in this area have the potential to save lives, improve food and fuel security, permit better care of the planet, and create other positive outcomes. In 2022 and 2023, the first open-to-the-public Genomes to Fields initiative Genotype by Environment prediction competition was held using a large dataset including genomic variation, phenotype and weather measurements, and field management notes gathered by the project over 9 years. The competition attracted registrants from around the world with representation from academic, government, industry, and nonprofit institutions as well as unaffiliated. These participants came from diverse disciplines, including plant science, animal science, breeding, statistics, computational biology, and others. Some participants had no formal genetics or plant-related training, and some were just beginning their graduate education. The teams applied varied methods and strategies, providing a wealth of modeling knowledge based on a common dataset. The winner's strategy involved 2 models combining machine learning and traditional breeding tools: 1 model emphasized environment using features extracted by random forest, ridge regression, and least squares, and 1 focused on genetics. Other high-performing teams' methods included quantitative genetics, machine learning/deep learning, mechanistic models, and model ensembles. The dataset factors used, such as genetics, weather, and management data, were also diverse, demonstrating that no single model or strategy is far superior to all others within the context of this competition.</p>","PeriodicalId":48925,"journal":{"name":"Genetics","volume":" ","pages":""},"PeriodicalIF":3.3,"publicationDate":"2025-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142688859","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-02-05DOI: 10.1093/genetics/iyae206
Anna Zhebrun, Julie Z Ni, Laura Corveleyn, Siddharth Ghosh Roy, Simone Sidoli, Sam G Gu
Nuclear RNAi in Caenorhabditis elegans induces a set of transgenerationally heritable marks of H3K9me3, H3K23me3, and H3K27me3 at the target genes. The function of H3K23me3 in the nuclear RNAi pathway is largely unknown due to the limited knowledge of H3K23 histone methyltransferase (HMT). In this study we identified SET-21 as a novel H3K23 HMT. By taking combined genetic, biochemical, imaging, and genomic approaches, we found that SET-21 functions synergistically with a previously reported H3K23 HMT SET-32 to deposit H3K23me3 at the native targets of germline nuclear RNAi. We identified a subset of native nuclear RNAi targets that are transcriptionally activated in the set-21;set-32 double mutant. SET-21 and SET-32 are also required for robust transgenerational gene silencing induced by exogenous dsRNA. The set-21;set-32 double mutant strain exhibits an enhanced temperature-sensitive mortal germline phenotype compared to the set-32 single mutant, while the set-21 single mutant animals are fertile. We also found that HRDE-1 and SET-32 are required for cosuppression, a transgene-induced gene silencing phenomenon, in C. elegans germline. Together, these results support a model in which H3K23 HMTs SET-21 and SET-32 function cooperatively as germline nuclear RNAi factors and promote the germline immortality under the heat stress.
{"title":"Two H3K23 histone methyltransferases, SET-32 and SET-21, function synergistically to promote nuclear RNAi-mediated transgenerational epigenetic inheritance in Caenorhabditis elegans.","authors":"Anna Zhebrun, Julie Z Ni, Laura Corveleyn, Siddharth Ghosh Roy, Simone Sidoli, Sam G Gu","doi":"10.1093/genetics/iyae206","DOIUrl":"10.1093/genetics/iyae206","url":null,"abstract":"<p><p>Nuclear RNAi in Caenorhabditis elegans induces a set of transgenerationally heritable marks of H3K9me3, H3K23me3, and H3K27me3 at the target genes. The function of H3K23me3 in the nuclear RNAi pathway is largely unknown due to the limited knowledge of H3K23 histone methyltransferase (HMT). In this study we identified SET-21 as a novel H3K23 HMT. By taking combined genetic, biochemical, imaging, and genomic approaches, we found that SET-21 functions synergistically with a previously reported H3K23 HMT SET-32 to deposit H3K23me3 at the native targets of germline nuclear RNAi. We identified a subset of native nuclear RNAi targets that are transcriptionally activated in the set-21;set-32 double mutant. SET-21 and SET-32 are also required for robust transgenerational gene silencing induced by exogenous dsRNA. The set-21;set-32 double mutant strain exhibits an enhanced temperature-sensitive mortal germline phenotype compared to the set-32 single mutant, while the set-21 single mutant animals are fertile. We also found that HRDE-1 and SET-32 are required for cosuppression, a transgene-induced gene silencing phenomenon, in C. elegans germline. Together, these results support a model in which H3K23 HMTs SET-21 and SET-32 function cooperatively as germline nuclear RNAi factors and promote the germline immortality under the heat stress.</p>","PeriodicalId":48925,"journal":{"name":"Genetics","volume":" ","pages":""},"PeriodicalIF":3.3,"publicationDate":"2025-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11796467/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142814689","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-02-05DOI: 10.1093/genetics/iyae200
{"title":"Correction to: A review of multimodal deep learning methods for genomic-enabled prediction in plant breeding.","authors":"","doi":"10.1093/genetics/iyae200","DOIUrl":"10.1093/genetics/iyae200","url":null,"abstract":"","PeriodicalId":48925,"journal":{"name":"Genetics","volume":" ","pages":""},"PeriodicalIF":3.3,"publicationDate":"2025-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11796459/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142865970","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-02-05DOI: 10.1093/genetics/iyae197
Wesley Wong, Lea Wang, Stephen F Schaffner, Xue Li, Ian Cheeseman, Timothy J C Anderson, Ashley Vaughan, Michael Ferdig, Sarah K Volkman, Daniel L Hartl, Dyann F Wirth
Pathogen genomics is a powerful tool for tracking infectious disease transmission. In malaria, identity-by-descent is used to assess the genetic relatedness between parasites and has been used to study transmission and importation. In theory, identity-by-descent can be used to distinguish genealogical relationships to reconstruct transmission history or identify parasites for QTL experiments. MalKinID (Malaria Kinship Identifier) is a new classification model designed to identify genealogical relationships among malaria parasites based on genome-wide identity-by-descent proportions and identity-by-descent segment distributions. MalKinID was calibrated to the genomic data from 3 laboratory-based genetic crosses (yielding 440 parent-child and 9060 full-sibling comparisons). MalKinID identified lab-generated F1 progeny with >80% sensitivity and showed that 0.39 (95% CI 0.28, 0.49) of the second-generation progeny of a NF54 and NHP4026 cross were F1s and 0.56 (0.45, 0.67) were backcrosses of an F1 with the parental NF54 strain. In simulated outcrossed importations, MalKinID reconstructs genealogy history with high precision and sensitivity, with F1-scores exceeding 0.84. However, when importation involves inbreeding, such as during serial co-transmission, the precision and sensitivity of MalKinID declined, with F1-scores (the harmonic mean of precision and sensitivity) of 0.76 (0.56, 0.92) and 0.23 (0.0, 0.4) for parent-child and full-sibling and <0.05 for second-degree and third-degree relatives. Disentangling inbred relationships required adapting MalKinID to perform multisample comparisons. Genealogical inference is most powered when (1) outcrossing is the norm or (2) multisample comparisons based on a predefined pedigree are used. MalKinID lays the foundations for using identity-by-descent to track parasite transmission history and for separating progeny for quantitative-trait-locus experiments.
病原体基因组学是追踪传染病传播的有力工具。在疟疾中,通过后代鉴定(IBD)可用于评估寄生虫之间的遗传亲缘关系,并已被用于研究传播和输入。从理论上讲,IBD 可用来区分谱系关系,以重建传播历史,或为定量性状-病灶实验识别寄生虫。MalKinID (疟疾亲缘关系识别器)是一种新的分类模型,旨在根据全基因组的 IBD 比例和 IBD 片段分布来识别疟疾寄生虫之间的系谱关系。MalKinID 根据三个实验室基因杂交的基因组数据进行了校准(产生了 440 个亲子 [PC] 和 9060 个全同胞 [FS] 比较)。MalKinID 识别实验室产生的 F1 后代的灵敏度大于 80%,并显示 NF54 和 NHP4026 杂交的第二代后代中有 0.39(95% CI 0.28,0.49)个是 F1 后代,0.56(0.45,0.67)个是 F1 与亲本 NF54 株系的回交后代。在模拟的外交进口中,MalKinID 能高精度、高灵敏度地重建系谱历史,F1 评分超过 0.84。然而,当导入涉及近亲繁殖时,如在连续共输过程中,MalKinID 的精确度和灵敏度下降,PC 和 FS 的 F1 分数(精确度和灵敏度的调和平均值)分别为 0.76(0.56,0.92)和 0.23(0.0,0.4),FS 和 PC 的 F1 分数(精确度和灵敏度的调和平均值)分别为 0.50(0.50,0.10)和 0.50(0.10,0.10)。
{"title":"MalKinID: A classification model for identifying malaria parasite genealogical relationships using identity-by-descent.","authors":"Wesley Wong, Lea Wang, Stephen F Schaffner, Xue Li, Ian Cheeseman, Timothy J C Anderson, Ashley Vaughan, Michael Ferdig, Sarah K Volkman, Daniel L Hartl, Dyann F Wirth","doi":"10.1093/genetics/iyae197","DOIUrl":"10.1093/genetics/iyae197","url":null,"abstract":"<p><p>Pathogen genomics is a powerful tool for tracking infectious disease transmission. In malaria, identity-by-descent is used to assess the genetic relatedness between parasites and has been used to study transmission and importation. In theory, identity-by-descent can be used to distinguish genealogical relationships to reconstruct transmission history or identify parasites for QTL experiments. MalKinID (Malaria Kinship Identifier) is a new classification model designed to identify genealogical relationships among malaria parasites based on genome-wide identity-by-descent proportions and identity-by-descent segment distributions. MalKinID was calibrated to the genomic data from 3 laboratory-based genetic crosses (yielding 440 parent-child and 9060 full-sibling comparisons). MalKinID identified lab-generated F1 progeny with >80% sensitivity and showed that 0.39 (95% CI 0.28, 0.49) of the second-generation progeny of a NF54 and NHP4026 cross were F1s and 0.56 (0.45, 0.67) were backcrosses of an F1 with the parental NF54 strain. In simulated outcrossed importations, MalKinID reconstructs genealogy history with high precision and sensitivity, with F1-scores exceeding 0.84. However, when importation involves inbreeding, such as during serial co-transmission, the precision and sensitivity of MalKinID declined, with F1-scores (the harmonic mean of precision and sensitivity) of 0.76 (0.56, 0.92) and 0.23 (0.0, 0.4) for parent-child and full-sibling and <0.05 for second-degree and third-degree relatives. Disentangling inbred relationships required adapting MalKinID to perform multisample comparisons. Genealogical inference is most powered when (1) outcrossing is the norm or (2) multisample comparisons based on a predefined pedigree are used. MalKinID lays the foundations for using identity-by-descent to track parasite transmission history and for separating progeny for quantitative-trait-locus experiments.</p>","PeriodicalId":48925,"journal":{"name":"Genetics","volume":" ","pages":""},"PeriodicalIF":3.3,"publicationDate":"2025-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11796457/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142695992","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-02-04DOI: 10.1093/genetics/iyaf021
Yvonne M Bradford, Ceri E Van Slyke, Jonathan B Muyskens, Wei-Chia Tseng, Douglas G Howe, David Fashena, Ryan Martin, Holly Paddock, Christian Pich, Sridhar Ramachandran, Leyla Ruzicka, Amy Singer, Ryan Taylor, Monte Westerfield
The Zebrafish Information Network (ZFIN, zfin.org) is the database resource for genetic, genomic, and phenotypic data from research using zebrafish, Danio rerio. ZFIN curates information about genetic perturbations, gene expression, phenotype, gene function, and human disease models from zebrafish research publications and makes these data available to researchers worldwide. Over the past 20 years, zebrafish have increasingly been used to investigate the effects of environmental exposures, becoming an ideal model to study toxicity, phenotypic outcomes, and gene-chemical interactions. Despite this, database resources supporting zebrafish toxicology and environmental exposure research are limited. To fill this gap, ZFIN has expanded functionality to incorporate and convey toxicology data better. ZFIN annotations for gene expression, phenotype, and human disease models include information about genotypes and experimental conditions used. One type of experimental condition the database captures is the application of chemicals to zebrafish. ZFIN annotates chemicals using the Chemical Entities of Biological Interest Ontology (ChEBI) along with the Zebrafish Experimental Conditions Ontology (ZECO) to denote route of exposure and other experimental conditions. These features allow researchers to search phenotypes and human disease models linked to chemicals more efficiently. Here we discuss how experimental conditions are displayed on ZFIN web pages, the data displayed on chemical term pages, and how to search and download data associated with chemical exposure experiments.
{"title":"ZFIN Updates to Support Zebrafish Environmental Exposure Data.","authors":"Yvonne M Bradford, Ceri E Van Slyke, Jonathan B Muyskens, Wei-Chia Tseng, Douglas G Howe, David Fashena, Ryan Martin, Holly Paddock, Christian Pich, Sridhar Ramachandran, Leyla Ruzicka, Amy Singer, Ryan Taylor, Monte Westerfield","doi":"10.1093/genetics/iyaf021","DOIUrl":"https://doi.org/10.1093/genetics/iyaf021","url":null,"abstract":"<p><p>The Zebrafish Information Network (ZFIN, zfin.org) is the database resource for genetic, genomic, and phenotypic data from research using zebrafish, Danio rerio. ZFIN curates information about genetic perturbations, gene expression, phenotype, gene function, and human disease models from zebrafish research publications and makes these data available to researchers worldwide. Over the past 20 years, zebrafish have increasingly been used to investigate the effects of environmental exposures, becoming an ideal model to study toxicity, phenotypic outcomes, and gene-chemical interactions. Despite this, database resources supporting zebrafish toxicology and environmental exposure research are limited. To fill this gap, ZFIN has expanded functionality to incorporate and convey toxicology data better. ZFIN annotations for gene expression, phenotype, and human disease models include information about genotypes and experimental conditions used. One type of experimental condition the database captures is the application of chemicals to zebrafish. ZFIN annotates chemicals using the Chemical Entities of Biological Interest Ontology (ChEBI) along with the Zebrafish Experimental Conditions Ontology (ZECO) to denote route of exposure and other experimental conditions. These features allow researchers to search phenotypes and human disease models linked to chemicals more efficiently. Here we discuss how experimental conditions are displayed on ZFIN web pages, the data displayed on chemical term pages, and how to search and download data associated with chemical exposure experiments.</p>","PeriodicalId":48925,"journal":{"name":"Genetics","volume":" ","pages":""},"PeriodicalIF":3.3,"publicationDate":"2025-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143191114","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-02-04DOI: 10.1093/genetics/iyaf010
{"title":"Correction to: Knockdown of NeuroD2 leads to seizure-like behavior, brain neuronal hyperactivity and a leaky blood-brain barrier in a Xenopus laevis tadpole model of DEE72.","authors":"","doi":"10.1093/genetics/iyaf010","DOIUrl":"https://doi.org/10.1093/genetics/iyaf010","url":null,"abstract":"","PeriodicalId":48925,"journal":{"name":"Genetics","volume":" ","pages":""},"PeriodicalIF":3.3,"publicationDate":"2025-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143191108","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}