首页 > 最新文献

Genetics最新文献

英文 中文
Detecting gene expression in Caenorhabditis elegans. 秀丽隐杆线虫基因表达的检测。
IF 3.3 3区 生物学 Q2 GENETICS & HEREDITY Pub Date : 2025-01-08 DOI: 10.1093/genetics/iyae167
John A Calarco, Seth R Taylor, David M Miller

Reliable methods for detecting and analyzing gene expression are necessary tools for understanding development and investigating biological responses to genetic and environmental perturbation. With its fully sequenced genome, invariant cell lineage, transparent body, wiring diagram, detailed anatomy, and wide array of genetic tools, Caenorhabditis elegans is an exceptionally useful model organism for linking gene expression to cellular phenotypes. The development of new techniques in recent years has greatly expanded our ability to detect gene expression at high resolution. Here, we provide an overview of gene expression methods for C. elegans, including techniques for detecting transcripts and proteins in situ, bulk RNA sequencing of whole worms and specific tissues and cells, single-cell RNA sequencing, and high-throughput proteomics. We discuss important considerations for choosing among these techniques and provide an overview of publicly available online resources for gene expression data.

可靠的检测和分析基因表达的方法是理解发育和研究遗传和环境扰动的生物反应的必要工具。秀丽隐杆线虫具有完全测序的基因组,不变的细胞谱系,透明的身体,接线图,详细的解剖结构和广泛的遗传工具,是将基因表达与细胞表型联系起来的非常有用的模式生物。近年来新技术的发展极大地扩展了我们以高分辨率检测基因表达的能力。在这里,我们概述了秀丽隐杆线虫的基因表达方法,包括原位检测转录本和蛋白质的技术,整个蠕虫和特定组织和细胞的大量RNA测序,单细胞RNA测序和高通量蛋白质组学。我们讨论了在这些技术中选择的重要考虑因素,并提供了基因表达数据的公开在线资源的概述。
{"title":"Detecting gene expression in Caenorhabditis elegans.","authors":"John A Calarco, Seth R Taylor, David M Miller","doi":"10.1093/genetics/iyae167","DOIUrl":"10.1093/genetics/iyae167","url":null,"abstract":"<p><p>Reliable methods for detecting and analyzing gene expression are necessary tools for understanding development and investigating biological responses to genetic and environmental perturbation. With its fully sequenced genome, invariant cell lineage, transparent body, wiring diagram, detailed anatomy, and wide array of genetic tools, Caenorhabditis elegans is an exceptionally useful model organism for linking gene expression to cellular phenotypes. The development of new techniques in recent years has greatly expanded our ability to detect gene expression at high resolution. Here, we provide an overview of gene expression methods for C. elegans, including techniques for detecting transcripts and proteins in situ, bulk RNA sequencing of whole worms and specific tissues and cells, single-cell RNA sequencing, and high-throughput proteomics. We discuss important considerations for choosing among these techniques and provide an overview of publicly available online resources for gene expression data.</p>","PeriodicalId":48925,"journal":{"name":"Genetics","volume":" ","pages":"1-108"},"PeriodicalIF":3.3,"publicationDate":"2025-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142856196","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Population size rescaling significantly biases outcomes of forward-in-time population genetic simulations. 种群规模的重新缩放会使前向时间种群遗传模拟的结果出现明显偏差。
IF 3.3 3区 生物学 Q2 GENETICS & HEREDITY Pub Date : 2025-01-08 DOI: 10.1093/genetics/iyae180
Amjad Dabi, Daniel R Schrider

Simulations are an essential tool in all areas of population genetic research, used in tasks such as the validation of theoretical analysis and the study of complex evolutionary models. Forward-in-time simulations are especially flexible, allowing for various types of natural selection, complex genetic architectures, and non-Wright-Fisher dynamics. However, their intense computational requirements can be prohibitive to simulating large populations and genomes. A popular method to alleviate this burden is to scale down the population size by some scaling factor while scaling up the mutation rate, selection coefficients, and recombination rate by the same factor. However, this rescaling approach may in some cases bias simulation results. To investigate the manner and degree to which rescaling impacts simulation outcomes, we carried out simulations with different demographic histories and distributions of fitness effects using several values of the rescaling factor, Q, and compared the deviation of key outcomes (fixation times, allele frequencies, linkage disequilibrium, and the fraction of mutations that fix during the simulation) between the scaled and unscaled simulations. Our results indicate that scaling introduces substantial biases to each of these measured outcomes, even at small values of Q. Moreover, the nature of these effects depends on the evolutionary model and scaling factor being examined. While increasing the scaling factor tends to increase the observed biases, this relationship is not always straightforward; thus, it may be difficult to know the impact of scaling on simulation outcomes a priori. However, it appears that for most models, only a small number of replicates was needed to accurately quantify the bias produced by rescaling for a given Q. In summary, while rescaling forward-in-time simulations may be necessary in many cases, researchers should be aware of the rescaling procedure's impact on simulation outcomes and consider investigating its magnitude in smaller scale simulations of the desired model(s) before selecting an appropriate value of Q.

模拟是群体遗传研究各个领域的重要工具,用于验证理论分析和研究复杂的进化模型等任务。时间前向模拟尤其灵活,可用于各种类型的自然选择、复杂的遗传结构和非赖特-费舍尔动力学。然而,其高昂的计算要求可能会阻碍大型种群和基因组的模拟。为减轻这一负担,一种流行的方法是按一定的比例系数缩小种群规模,同时按相同的系数提高突变率、选择系数和重组率。然而,这种重新缩放的方法在某些情况下可能会使模拟结果出现偏差。为了研究重新缩放对模拟结果的影响方式和程度,我们使用不同的重新缩放因子Ǫ值,对不同的人口历史和适存效应分布进行了模拟,并比较了缩放模拟和未缩放模拟的关键结果(固定时间、等位基因频率、连锁不平衡和模拟期间固定的突变比例)的偏差。我们的研究结果表明,即使Ʈ 的值很小,缩放也会给这些测量结果中的每一个带来很大的偏差。此外,这些影响的性质取决于所研究的进化模型和缩放因子。虽然增加缩放因子往往会增加观测到的偏差,但这种关系并不总是很直观,因此很难预先知道缩放因子对模拟结果的影响。不过,对于大多数模型来说,似乎只需要少量的重复,就可以准确量化在给定 Ʈ 条件下重新缩放所产生的偏差。总之,虽然在许多情况下需要对前向时间模拟进行重定标,但研究人员应意识到重定标程序对模拟结果的影响,并考虑在选择合适的Ʈ 值之前,在所需模型的较小规模模拟中调查其影响程度。
{"title":"Population size rescaling significantly biases outcomes of forward-in-time population genetic simulations.","authors":"Amjad Dabi, Daniel R Schrider","doi":"10.1093/genetics/iyae180","DOIUrl":"10.1093/genetics/iyae180","url":null,"abstract":"<p><p>Simulations are an essential tool in all areas of population genetic research, used in tasks such as the validation of theoretical analysis and the study of complex evolutionary models. Forward-in-time simulations are especially flexible, allowing for various types of natural selection, complex genetic architectures, and non-Wright-Fisher dynamics. However, their intense computational requirements can be prohibitive to simulating large populations and genomes. A popular method to alleviate this burden is to scale down the population size by some scaling factor while scaling up the mutation rate, selection coefficients, and recombination rate by the same factor. However, this rescaling approach may in some cases bias simulation results. To investigate the manner and degree to which rescaling impacts simulation outcomes, we carried out simulations with different demographic histories and distributions of fitness effects using several values of the rescaling factor, Q, and compared the deviation of key outcomes (fixation times, allele frequencies, linkage disequilibrium, and the fraction of mutations that fix during the simulation) between the scaled and unscaled simulations. Our results indicate that scaling introduces substantial biases to each of these measured outcomes, even at small values of Q. Moreover, the nature of these effects depends on the evolutionary model and scaling factor being examined. While increasing the scaling factor tends to increase the observed biases, this relationship is not always straightforward; thus, it may be difficult to know the impact of scaling on simulation outcomes a priori. However, it appears that for most models, only a small number of replicates was needed to accurately quantify the bias produced by rescaling for a given Q. In summary, while rescaling forward-in-time simulations may be necessary in many cases, researchers should be aware of the rescaling procedure's impact on simulation outcomes and consider investigating its magnitude in smaller scale simulations of the desired model(s) before selecting an appropriate value of Q.</p>","PeriodicalId":48925,"journal":{"name":"Genetics","volume":" ","pages":"1-57"},"PeriodicalIF":3.3,"publicationDate":"2025-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11708920/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142584680","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Network hub gene detection using the entire solution path information. 利用整个解决方案路径信息检测网络中心基因
IF 3.3 3区 生物学 Q2 GENETICS & HEREDITY Pub Date : 2025-01-08 DOI: 10.1093/genetics/iyae187
Markku Kuismin, Mikko J Sillanpää

Gene co-expression networks typically comprise modules and their associated hub genes, which are regulating numerous downstream interactions within the network. Methods for hub screening, as well as data-driven estimation of hub co-expression networks using graphical models, can serve as useful tools for identifying these hubs. Graphical model-based penalization methods typically have one or multiple regularization terms, each of which encourages some favorable characteristics (e.g. sparsity, hubs, and power-law) to the estimated complex gene network. It is common practice to find a single optimal graphical model corresponding to a specific value of the regularization parameter(s). However, instead of doing this, one could aggregate information across several graphical models, all of which depend on the same data set, along the solution path in the hub gene detection process. We propose a novel method for detecting hub genes that utilizes the information available in the solution path. Our procedure is related to stability selection, but we replace resampling with a simple statistic. This procedure amalgamates information from each node of the data-driven graphical models into a single influence statistic, similar to Cook's distance. We call this statistic the Mean Degree Squared Distance (MDSD). Our simulation and empirical studies demonstrate that the MDSD statistic maintains a good balance between false positive and true positive hubs. An R package MDSD is publicly available on GitHub under the General Public License https://github.com/markkukuismin/MDSD.

基因共表达网络通常由模块及其相关的中枢基因组成,这些基因调控着网络中众多的下游相互作用。枢纽筛选方法以及使用图形模型对枢纽共表达网络进行数据驱动估算,可作为识别这些枢纽的有用工具。基于图形模型的惩罚方法通常有一个或多个正则化项,每个正则化项都会对估计的复杂基因网络产生一些有利的影响(如稀疏性、集线器、幂律)。通常的做法是找到与正则化参数的特定值相对应的单一最优图形模型。然而,与其这样做,我们还不如在中心基因检测过程中,沿着求解路径将多个图形模型的信息汇总起来,所有这些模型都依赖于相同的数据集。我们提出了一种利用求解路径中可用信息来检测中心基因的新方法。我们的程序与稳定性选择有关,但我们用一个简单的统计量取代了重采样。这一程序将数据驱动图形模型中每个节点的信息合并为一个影响统计量,类似于库克距离。我们称这种统计量为平均度平方距离(MDSD)。我们的模拟和实证研究表明,MDSD 统计量在假阳性枢纽和真阳性枢纽之间保持了良好的平衡。MDSD 的 R 软件包以通用公共许可证 https://github.com/markkukuismin/MDSD 在 GitHub 上公开发布。
{"title":"Network hub gene detection using the entire solution path information.","authors":"Markku Kuismin, Mikko J Sillanpää","doi":"10.1093/genetics/iyae187","DOIUrl":"10.1093/genetics/iyae187","url":null,"abstract":"<p><p>Gene co-expression networks typically comprise modules and their associated hub genes, which are regulating numerous downstream interactions within the network. Methods for hub screening, as well as data-driven estimation of hub co-expression networks using graphical models, can serve as useful tools for identifying these hubs. Graphical model-based penalization methods typically have one or multiple regularization terms, each of which encourages some favorable characteristics (e.g. sparsity, hubs, and power-law) to the estimated complex gene network. It is common practice to find a single optimal graphical model corresponding to a specific value of the regularization parameter(s). However, instead of doing this, one could aggregate information across several graphical models, all of which depend on the same data set, along the solution path in the hub gene detection process. We propose a novel method for detecting hub genes that utilizes the information available in the solution path. Our procedure is related to stability selection, but we replace resampling with a simple statistic. This procedure amalgamates information from each node of the data-driven graphical models into a single influence statistic, similar to Cook's distance. We call this statistic the Mean Degree Squared Distance (MDSD). Our simulation and empirical studies demonstrate that the MDSD statistic maintains a good balance between false positive and true positive hubs. An R package MDSD is publicly available on GitHub under the General Public License https://github.com/markkukuismin/MDSD.</p>","PeriodicalId":48925,"journal":{"name":"Genetics","volume":" ","pages":"1-33"},"PeriodicalIF":3.3,"publicationDate":"2025-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11708912/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142630842","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Leveraging a new data resource to define the response of Cryptococcus neoformans to environmental signals. 利用新的数据资源确定新变形杆菌对环境信号的反应。
IF 3.3 3区 生物学 Q2 GENETICS & HEREDITY Pub Date : 2025-01-08 DOI: 10.1093/genetics/iyae178
Yu Sung Kang, Jeffery Jung, Holly L Brown, Chase Mateusiak, Tamara L Doering, Michael R Brent

Cryptococcus neoformans is an opportunistic fungal pathogen with a polysaccharide capsule that becomes greatly enlarged in the mammalian host and during in vitro growth under host-like conditions. To understand how individual environmental signals affect capsule size and gene expression, we grew cells in all combinations of 5 signals implicated in capsule size and systematically measured cell and capsule sizes. We also sampled these cultures over time and performed RNA-seq in quadruplicate, yielding 881 RNA-seq samples. Analysis of the resulting data sets showed that capsule induction in tissue culture medium, typically used to represent host-like conditions, requires the presence of either CO2 or exogenous cyclic AMP. Surprisingly, adding either of these pushes overall gene expression in the opposite direction from tissue culture media alone, even though both are required for capsule development. Another unexpected finding was that rich medium blocks capsule growth completely. Statistical analysis further revealed many genes whose expression is associated with capsule thickness; deletion of one of these significantly reduced capsule size. Beyond illuminating capsule induction, our massive, uniformly collected data set will be a significant resource for the research community.

新生隐球菌是一种机会性真菌病原体,其多糖囊在哺乳动物宿主体内和类似宿主条件的体外生长过程中会大大增大。为了了解个别环境信号如何影响胶囊大小和基因表达,我们在与胶囊大小有关的五种信号的所有组合中培养细胞,并系统地测量细胞和胶囊大小。我们还在一段时间内对这些培养物进行采样,并进行了四重 RNA-Seq,共获得 881 个 RNA-Seq 样本。对所得数据集的分析表明,在组织培养基(通常用来代表类似宿主的条件)中诱导蒴果需要二氧化碳或外源环磷酸腺苷(cAMP)的存在。令人惊讶的是,尽管蒴果的发育需要二氧化碳或外源环磷酸腺苷(cAMP),但加入这两种物质后,整体基因表达与单独加入组织培养基时相反。另一个意想不到的发现是,富培养基会完全阻止蒴果的生长。统计分析进一步揭示了许多基因的表达与蒴果厚度有关;删除其中一个基因会显著缩小蒴果大小。除了揭示蒴果的诱导过程,我们统一收集的大量数据集也将成为研究界的重要资源。
{"title":"Leveraging a new data resource to define the response of Cryptococcus neoformans to environmental signals.","authors":"Yu Sung Kang, Jeffery Jung, Holly L Brown, Chase Mateusiak, Tamara L Doering, Michael R Brent","doi":"10.1093/genetics/iyae178","DOIUrl":"10.1093/genetics/iyae178","url":null,"abstract":"<p><p>Cryptococcus neoformans is an opportunistic fungal pathogen with a polysaccharide capsule that becomes greatly enlarged in the mammalian host and during in vitro growth under host-like conditions. To understand how individual environmental signals affect capsule size and gene expression, we grew cells in all combinations of 5 signals implicated in capsule size and systematically measured cell and capsule sizes. We also sampled these cultures over time and performed RNA-seq in quadruplicate, yielding 881 RNA-seq samples. Analysis of the resulting data sets showed that capsule induction in tissue culture medium, typically used to represent host-like conditions, requires the presence of either CO2 or exogenous cyclic AMP. Surprisingly, adding either of these pushes overall gene expression in the opposite direction from tissue culture media alone, even though both are required for capsule development. Another unexpected finding was that rich medium blocks capsule growth completely. Statistical analysis further revealed many genes whose expression is associated with capsule thickness; deletion of one of these significantly reduced capsule size. Beyond illuminating capsule induction, our massive, uniformly collected data set will be a significant resource for the research community.</p>","PeriodicalId":48925,"journal":{"name":"Genetics","volume":" ","pages":"1-29"},"PeriodicalIF":3.3,"publicationDate":"2025-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11708910/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142562981","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Editor's Note: Ribosome Association and Stability of the Nascent Polypeptide-Associated Complex Is Dependent Upon Its Own Ubiquitination. 编者按:核糖体关联和新生多肽相关复合物的稳定性取决于其自身的泛素化。
IF 3.3 3区 生物学 Q2 GENETICS & HEREDITY Pub Date : 2025-01-08 DOI: 10.1093/genetics/iyae177
{"title":"Editor's Note: Ribosome Association and Stability of the Nascent Polypeptide-Associated Complex Is Dependent Upon Its Own Ubiquitination.","authors":"","doi":"10.1093/genetics/iyae177","DOIUrl":"10.1093/genetics/iyae177","url":null,"abstract":"","PeriodicalId":48925,"journal":{"name":"Genetics","volume":" ","pages":"1"},"PeriodicalIF":3.3,"publicationDate":"2025-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142668845","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MegaLMM improves genomic predictions in new environments using environmental covariates. MegaLMM 利用环境协变量改进了新环境中的基因组预测。
IF 3.3 3区 生物学 Q2 GENETICS & HEREDITY Pub Date : 2025-01-08 DOI: 10.1093/genetics/iyae171
Haixiao Hu, Renaud Rincent, Daniel E Runcie

Multienvironment trials (METs) are crucial for identifying varieties that perform well across a target population of environments. However, METs are typically too small to sufficiently represent all relevant environment-types, and face challenges from changing environment-types due to climate change. Statistical methods that enable prediction of variety performance for new environments beyond the METs are needed. We recently developed MegaLMM, a statistical model that can leverage hundreds of trials to significantly improve genetic value prediction accuracy within METs. Here, we extend MegaLMM to enable genomic prediction in new environments by learning regressions of latent factor loadings on Environmental Covariates (ECs) across trials. We evaluated the extended MegaLMM using the maize Genome-To-Fields dataset, consisting of 4,402 varieties cultivated in 195 trials with 87.1% of phenotypic values missing, and demonstrated its high accuracy in genomic prediction under various breeding scenarios. Furthermore, we showcased MegaLMM's superiority over univariate GBLUP in predicting trait performance of experimental genotypes in new environments. Finally, we explored the use of higher-dimensional quantitative ECs and discussed when and how detailed environmental data can be leveraged for genomic prediction from METs. We propose that MegaLMM can be applied to plant breeding of diverse crops and different fields of genetics where large-scale linear mixed models are utilized.

多环境试验(MET)对于确定在目标环境(TPE)中表现良好的品种至关重要。然而,多环境试验通常规模太小,无法充分代表所有相关的环境类型,而且还面临着气候变化导致环境类型不断变化的挑战。我们需要能预测 METs 以外新环境中品种表现的统计方法。我们最近开发了 MegaLMM,这是一种统计模型,可利用数百次试验,显著提高 METs 内遗传值预测的准确性。在此,我们对 MegaLMM 进行了扩展,通过学习各试验中环境协变量(ECs)的潜在因子载荷回归,实现了新环境下的基因组预测。我们使用玉米 "基因组-田间 "数据集对扩展的MegaLMM进行了评估,该数据集由195个试验中的4402个品种组成,表型值缺失率为87.1%。此外,在预测新环境中试验基因型的性状表现方面,我们展示了 MegaLMM 优于单变量 GBLUP 的优势。最后,我们探讨了高维定量EC的使用,并讨论了何时以及如何利用详细的环境数据通过MET进行基因组预测。我们建议将 MegaLMM 应用于不同作物的植物育种和使用大规模线性混合模型的不同遗传学领域。
{"title":"MegaLMM improves genomic predictions in new environments using environmental covariates.","authors":"Haixiao Hu, Renaud Rincent, Daniel E Runcie","doi":"10.1093/genetics/iyae171","DOIUrl":"10.1093/genetics/iyae171","url":null,"abstract":"<p><p>Multienvironment trials (METs) are crucial for identifying varieties that perform well across a target population of environments. However, METs are typically too small to sufficiently represent all relevant environment-types, and face challenges from changing environment-types due to climate change. Statistical methods that enable prediction of variety performance for new environments beyond the METs are needed. We recently developed MegaLMM, a statistical model that can leverage hundreds of trials to significantly improve genetic value prediction accuracy within METs. Here, we extend MegaLMM to enable genomic prediction in new environments by learning regressions of latent factor loadings on Environmental Covariates (ECs) across trials. We evaluated the extended MegaLMM using the maize Genome-To-Fields dataset, consisting of 4,402 varieties cultivated in 195 trials with 87.1% of phenotypic values missing, and demonstrated its high accuracy in genomic prediction under various breeding scenarios. Furthermore, we showcased MegaLMM's superiority over univariate GBLUP in predicting trait performance of experimental genotypes in new environments. Finally, we explored the use of higher-dimensional quantitative ECs and discussed when and how detailed environmental data can be leveraged for genomic prediction from METs. We propose that MegaLMM can be applied to plant breeding of diverse crops and different fields of genetics where large-scale linear mixed models are utilized.</p>","PeriodicalId":48925,"journal":{"name":"Genetics","volume":" ","pages":"1-41"},"PeriodicalIF":3.3,"publicationDate":"2025-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11708919/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142548474","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A path integral approach for allele frequency dynamics under polygenic selection. 多基因选择下等位基因频率动态的路径积分法
IF 3.3 3区 生物学 Q2 GENETICS & HEREDITY Pub Date : 2025-01-08 DOI: 10.1093/genetics/iyae182
Nathan W Anderson, Lloyd Kirk, Joshua G Schraiber, Aaron P Ragsdale

Many phenotypic traits have a polygenic genetic basis, making it challenging to learn their genetic architectures and predict individual phenotypes. One promising avenue to resolve the genetic basis of complex traits is through evolve-and-resequence (E&R) experiments, in which laboratory populations are exposed to some selective pressure and trait-contributing loci are identified by extreme frequency changes over the course of the experiment. However, small laboratory populations will experience substantial random genetic drift, and it is difficult to determine whether selection played a role in a given allele frequency change (AFC). Predicting AFCs under drift and selection, even for alleles contributing to simple, monogenic traits, has remained a challenging problem. Recently, there have been efforts to apply the path integral, a method borrowed from physics, to solve this problem. So far, this approach has been limited to genic selection, and is therefore inadequate to capture the complexity of quantitative, highly polygenic traits that are commonly studied. Here, we extend one of these path integral methods, the perturbation approximation, to selection scenarios that are of interest to quantitative genetics. We derive analytic expressions for the transition probability (i.e. the probability that an allele will change in frequency from x to y in time t) of an allele contributing to a trait subject to stabilizing selection, as well as that of an allele contributing to a trait rapidly adapting to a new phenotypic optimum. We use these expressions to characterize the use of AFC to test for selection, as well as explore optimal design choices for E&R experiments to uncover the genetic architecture of polygenic traits under selection.

许多表型性状都有多基因遗传基础,因此了解其遗传结构并预测个体表型具有挑战性。解决复杂性状遗传基础的一个有希望的途径是通过 "进化与序列 "实验,在实验中,实验室种群面临一定的选择性压力,通过实验过程中的极端频率变化来确定性状贡献位点。然而,小规模的实验室群体会经历大量的随机遗传漂移,因此很难确定选择是否在特定等位基因频率变化中起了作用。预测漂移和选择作用下等位基因频率的变化,即使是预测简单的单基因性状的等位基因频率变化,仍然是一个具有挑战性的问题。最近,人们开始应用路径积分(一种借用物理学的方法)来解决这个问题。迄今为止,这种方法仅限于基因选择,因此不足以捕捉通常研究的定量、高度多基因性状的复杂性。在这里,我们将这些路径积分方法之一--扰动近似--扩展到数量遗传学感兴趣的选择情景中。我们推导出了受稳定选择影响的等位基因性状的过渡概率(即等位基因的频率在 t 时间内从 x 变为 y 的概率)以及快速适应新表型最佳性状的等位基因性状的过渡概率的解析表达式。我们利用这些表达式来描述利用等位基因频率变化来测试选择的特点,并探索进化和序列实验的最佳设计选择,以揭示选择下多基因性状的遗传结构。
{"title":"A path integral approach for allele frequency dynamics under polygenic selection.","authors":"Nathan W Anderson, Lloyd Kirk, Joshua G Schraiber, Aaron P Ragsdale","doi":"10.1093/genetics/iyae182","DOIUrl":"10.1093/genetics/iyae182","url":null,"abstract":"<p><p>Many phenotypic traits have a polygenic genetic basis, making it challenging to learn their genetic architectures and predict individual phenotypes. One promising avenue to resolve the genetic basis of complex traits is through evolve-and-resequence (E&R) experiments, in which laboratory populations are exposed to some selective pressure and trait-contributing loci are identified by extreme frequency changes over the course of the experiment. However, small laboratory populations will experience substantial random genetic drift, and it is difficult to determine whether selection played a role in a given allele frequency change (AFC). Predicting AFCs under drift and selection, even for alleles contributing to simple, monogenic traits, has remained a challenging problem. Recently, there have been efforts to apply the path integral, a method borrowed from physics, to solve this problem. So far, this approach has been limited to genic selection, and is therefore inadequate to capture the complexity of quantitative, highly polygenic traits that are commonly studied. Here, we extend one of these path integral methods, the perturbation approximation, to selection scenarios that are of interest to quantitative genetics. We derive analytic expressions for the transition probability (i.e. the probability that an allele will change in frequency from x to y in time t) of an allele contributing to a trait subject to stabilizing selection, as well as that of an allele contributing to a trait rapidly adapting to a new phenotypic optimum. We use these expressions to characterize the use of AFC to test for selection, as well as explore optimal design choices for E&R experiments to uncover the genetic architecture of polygenic traits under selection.</p>","PeriodicalId":48925,"journal":{"name":"Genetics","volume":" ","pages":"1-63"},"PeriodicalIF":3.3,"publicationDate":"2025-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142630841","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Allopolyploidy expanded gene content but not pangenomic variation in the hexaploid oilseed Camelina sativa. 异源多倍体扩大了六倍体油菜籽荠菜的基因含量,但没有扩大泛基因组变异。
IF 3.3 3区 生物学 Q2 GENETICS & HEREDITY Pub Date : 2025-01-08 DOI: 10.1093/genetics/iyae183
Kevin A Bird, Jordan R Brock, Paul P Grabowski, Avril M Harder, Adam L Healy, Shengqiang Shu, Kerrie Barry, LoriBeth Boston, Christopher Daum, Jie Guo, Anna Lipzen, Rachel Walstead, Jane Grimwood, Jeremy Schmutz, Chaofu Lu, Luca Comai, John K McKay, J Chris Pires, Patrick P Edger, John T Lovell, Daniel J Kliebenstein

Ancient whole-genome duplications are believed to facilitate novelty and adaptation by providing the raw fuel for new genes. However, it is unclear how recent whole-genome duplications may contribute to evolvability within recent polyploids. Hybridization accompanying some whole-genome duplications may combine divergent gene content among diploid species. Some theory and evidence suggest that polyploids have a greater accumulation and tolerance of gene presence-absence and genomic structural variation, but it is unclear to what extent either is true. To test how recent polyploidy may influence pangenomic variation, we sequenced, assembled, and annotated 12 complete, chromosome-scale genomes of Camelina sativa, an allohexaploid biofuel crop with 3 distinct subgenomes. Using pangenomic comparative analyses, we characterized gene presence-absence and genomic structural variation both within and between the subgenomes. We found over 75% of ortholog gene clusters are core in C. sativa and <10% of sequence space was affected by genomic structural rearrangements. In contrast, 19% of gene clusters were unique to one subgenome, and the majority of these were Camelina specific (no ortholog in Arabidopsis). We identified an inversion that may contribute to vernalization requirements in winter-type Camelina and an enrichment of Camelina-specific genes with enzymatic processes related to seed oil quality and Camelina's unique glucosinolate profile. Genes related to these traits exhibited little presence-absence variation. Our results reveal minimal pangenomic variation in this species and instead show how hybridization accompanied by whole-genome duplication may benefit polyploids by merging diverged gene content of different species.

古老的全基因组重复(WGDs)被认为通过为新基因提供原始燃料而促进了新颖性和适应性。然而,目前还不清楚近期的WGD如何促进了近期多倍体的进化。伴随某些 WGDs 的杂交可能会结合二倍体物种之间的不同基因内容。一些理论和证据表明,多倍体对基因的存在-不存在和基因组结构变异有更大的积累和耐受性,但目前还不清楚二者在多大程度上是正确的。为了测试最近的多倍体如何影响泛基因组变异,我们对具有三个不同亚基因组的异源六倍体生物燃料作物荠菜的十二个完整的染色体级基因组进行了测序、组装和注释。通过泛基因组比较分析,我们确定了亚基因组内和亚基因组间基因存在与否以及基因组结构变异的特征。我们发现超过 75% 的直向同源基因簇在荠菜中是核心基因,而在荠菜中则是核心基因。
{"title":"Allopolyploidy expanded gene content but not pangenomic variation in the hexaploid oilseed Camelina sativa.","authors":"Kevin A Bird, Jordan R Brock, Paul P Grabowski, Avril M Harder, Adam L Healy, Shengqiang Shu, Kerrie Barry, LoriBeth Boston, Christopher Daum, Jie Guo, Anna Lipzen, Rachel Walstead, Jane Grimwood, Jeremy Schmutz, Chaofu Lu, Luca Comai, John K McKay, J Chris Pires, Patrick P Edger, John T Lovell, Daniel J Kliebenstein","doi":"10.1093/genetics/iyae183","DOIUrl":"10.1093/genetics/iyae183","url":null,"abstract":"<p><p>Ancient whole-genome duplications are believed to facilitate novelty and adaptation by providing the raw fuel for new genes. However, it is unclear how recent whole-genome duplications may contribute to evolvability within recent polyploids. Hybridization accompanying some whole-genome duplications may combine divergent gene content among diploid species. Some theory and evidence suggest that polyploids have a greater accumulation and tolerance of gene presence-absence and genomic structural variation, but it is unclear to what extent either is true. To test how recent polyploidy may influence pangenomic variation, we sequenced, assembled, and annotated 12 complete, chromosome-scale genomes of Camelina sativa, an allohexaploid biofuel crop with 3 distinct subgenomes. Using pangenomic comparative analyses, we characterized gene presence-absence and genomic structural variation both within and between the subgenomes. We found over 75% of ortholog gene clusters are core in C. sativa and <10% of sequence space was affected by genomic structural rearrangements. In contrast, 19% of gene clusters were unique to one subgenome, and the majority of these were Camelina specific (no ortholog in Arabidopsis). We identified an inversion that may contribute to vernalization requirements in winter-type Camelina and an enrichment of Camelina-specific genes with enzymatic processes related to seed oil quality and Camelina's unique glucosinolate profile. Genes related to these traits exhibited little presence-absence variation. Our results reveal minimal pangenomic variation in this species and instead show how hybridization accompanied by whole-genome duplication may benefit polyploids by merging diverged gene content of different species.</p>","PeriodicalId":48925,"journal":{"name":"Genetics","volume":" ","pages":"1-44"},"PeriodicalIF":3.3,"publicationDate":"2025-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142639957","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A previously reported bottleneck in human ancestry 900 kya is likely a statistical artifact. 之前报告的人类祖先 900 千年的瓶颈很可能是统计上的假象。
IF 3.3 3区 生物学 Q2 GENETICS & HEREDITY Pub Date : 2025-01-08 DOI: 10.1093/genetics/iyae192
Yun Deng, Rasmus Nielsen, Yun S Song

It was recently reported that a severe ancient bottleneck occurred around 900 thousand years ago in the ancestry of African populations, while this signal is absent in non-African populations. Here, we present evidence to show that this finding is likely a statistical artifact.

最近有报道称,非洲种群的祖先在大约 90 万年前出现了严重的古代瓶颈,而在非洲以外的种群中却没有这种信号。在这里,我们提出证据表明,这一发现很可能是统计上的假象。
{"title":"A previously reported bottleneck in human ancestry 900 kya is likely a statistical artifact.","authors":"Yun Deng, Rasmus Nielsen, Yun S Song","doi":"10.1093/genetics/iyae192","DOIUrl":"10.1093/genetics/iyae192","url":null,"abstract":"<p><p>It was recently reported that a severe ancient bottleneck occurred around 900 thousand years ago in the ancestry of African populations, while this signal is absent in non-African populations. Here, we present evidence to show that this finding is likely a statistical artifact.</p>","PeriodicalId":48925,"journal":{"name":"Genetics","volume":" ","pages":"1-3"},"PeriodicalIF":3.3,"publicationDate":"2025-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11708913/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142830566","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Parental-effect gene-drive elements under partial selfing, or why do Caenorhabditis genomes have hyperdivergent regions? 部分自交条件下的亲本效应基因驱动元素,或为什么 Caenorhabditis 基因组具有超分化区域?
IF 3.3 3区 生物学 Q2 GENETICS & HEREDITY Pub Date : 2025-01-08 DOI: 10.1093/genetics/iyae175
Matthew V Rockman

Self-fertile Caenorhabditis nematodes carry a surprising number of Medea elements, alleles that act in heterozygous mothers and cause death or developmental delay in offspring that do not inherit them. At some loci, both alleles in a cross operate as independent Medeas, affecting all the homozygous progeny of a selfing heterozygote. The genomic coincidence of Medea elements and ancient, deeply coalescing haplotypes, which pepper the otherwise homogeneous genomes of these animals, raises questions about how these apparent gene-drive elements persist for long periods of time. Here, I investigate how mating system affects the evolution of Medeas, and their paternal-effect counterparts, peels. Despite an intuition that antagonistic alleles should induce balancing selection by killing homozygotes, models show that, under partial selfing, antagonistic elements experience positive frequency dependence: the common allele drives the rare one extinct, even if the rare one is more penetrant. Analytical results for the threshold frequency required for one allele to invade a population show that a very weakly penetrant allele, one whose effects would escape laboratory detection, could nevertheless prevent a much more penetrant allele from invading under high rates of selfing. Ubiquitous weak antagonistic Medeas and peels could then act as localized barriers to gene flow between populations, generating genomic islands of deep coalescence. Analysis of gene expression data, however, suggests that this cannot be the whole story. A complementary explanation is that ordinary ecological balancing selection generates ancient haplotypes on which Medeas can evolve, while high homozygosity in these selfers minimizes the role of gene drive in their evolution.

自交不育的 Caenorhabditis 线虫携带着数量惊人的美狄亚因子,这些等位基因作用于杂合母本,导致没有遗传到这些等位基因的后代死亡或发育迟缓。在某些位点,杂交中的两个等位基因都是独立的美狄亚,会影响自交杂合子的所有同源后代。美狄亚基因与古老的、深度凝聚的单倍型在基因组上的重合,使这些动物原本同质的基因组变得更加丰富,这就提出了一个问题:这些明显的基因驱动基因是如何长期存在的?在这里,我研究了交配系统如何影响鳉鱼及其父系效应对应物鳞皮的进化。尽管根据直觉,拮抗等位基因应该通过杀死同源基因来诱导平衡选择,但模型显示,在部分自交的情况下,拮抗基因会出现正的频率依赖性:普通等位基因会促使稀有等位基因灭绝,即使稀有等位基因的穿透力更强。对一个等位基因入侵一个种群所需的阈值频率的分析结果表明,一个穿透力很弱的等位基因,其影响会逃过实验室的检测,但在自交率很高的情况下,可以阻止一个穿透力更强的等位基因入侵。这样,无处不在的弱拮抗梅迪亚种和等位基因就会成为种群间基因流动的局部障碍,产生深度聚合的基因组孤岛。然而,对基因表达数据的分析表明,这并不是故事的全部。一种补充解释是,普通的生态平衡选择产生了古老的单倍型,Medeas 可以在这些单倍型上进化,而这些自体中的高同源性将基因驱动在其进化中的作用降至最低。
{"title":"Parental-effect gene-drive elements under partial selfing, or why do Caenorhabditis genomes have hyperdivergent regions?","authors":"Matthew V Rockman","doi":"10.1093/genetics/iyae175","DOIUrl":"10.1093/genetics/iyae175","url":null,"abstract":"<p><p>Self-fertile Caenorhabditis nematodes carry a surprising number of Medea elements, alleles that act in heterozygous mothers and cause death or developmental delay in offspring that do not inherit them. At some loci, both alleles in a cross operate as independent Medeas, affecting all the homozygous progeny of a selfing heterozygote. The genomic coincidence of Medea elements and ancient, deeply coalescing haplotypes, which pepper the otherwise homogeneous genomes of these animals, raises questions about how these apparent gene-drive elements persist for long periods of time. Here, I investigate how mating system affects the evolution of Medeas, and their paternal-effect counterparts, peels. Despite an intuition that antagonistic alleles should induce balancing selection by killing homozygotes, models show that, under partial selfing, antagonistic elements experience positive frequency dependence: the common allele drives the rare one extinct, even if the rare one is more penetrant. Analytical results for the threshold frequency required for one allele to invade a population show that a very weakly penetrant allele, one whose effects would escape laboratory detection, could nevertheless prevent a much more penetrant allele from invading under high rates of selfing. Ubiquitous weak antagonistic Medeas and peels could then act as localized barriers to gene flow between populations, generating genomic islands of deep coalescence. Analysis of gene expression data, however, suggests that this cannot be the whole story. A complementary explanation is that ordinary ecological balancing selection generates ancient haplotypes on which Medeas can evolve, while high homozygosity in these selfers minimizes the role of gene drive in their evolution.</p>","PeriodicalId":48925,"journal":{"name":"Genetics","volume":" ","pages":"1-36"},"PeriodicalIF":3.3,"publicationDate":"2025-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11708918/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142548475","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Genetics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1