首页 > 最新文献

Molecular biology and evolution最新文献

英文 中文
Insertions and Deletions: Computational Methods, Evolutionary Dynamics, and Biological Applications. 吲哚:计算方法、进化动力学和生物应用。
IF 11 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-09-04 DOI: 10.1093/molbev/msae177
Benjamin D Redelings, Ian Holmes, Gerton Lunter, Tal Pupko, Maria Anisimova

Insertions and deletions constitute the second most important source of natural genomic variation. Insertions and deletions make up to 25% of genomic variants in humans and are involved in complex evolutionary processes including genomic rearrangements, adaptation, and speciation. Recent advances in long-read sequencing technologies allow detailed inference of insertions and deletion variation in species and populations. Yet, despite their importance, evolutionary studies have traditionally ignored or mishandled insertions and deletions due to a lack of comprehensive methodologies and statistical models of insertions and deletion dynamics. Here, we discuss methods for describing insertions and deletion variation and modeling insertions and deletions over evolutionary time. We provide practical advice for tackling insertions and deletions in genomic sequences and illustrate our discussion with examples of insertions and deletion-induced effects in human and other natural populations and their contribution to evolutionary processes. We outline promising directions for future developments in statistical methodologies that would allow researchers to analyze insertions and deletion variation and their effects in large genomic data sets and to incorporate insertions and deletions in evolutionary inference.

插入和缺失(indels)是自然基因组变异的第二大来源。在人类基因组变异中,吲哚占 25%,它参与了复杂的进化过程,包括基因组重排、适应和物种分化。长读数测序技术的最新进展可以详细推断物种和种群中的吲哚变异。然而,尽管吲哚非常重要,但由于缺乏全面的方法和吲哚动态统计模型,进化研究历来忽视或错误处理吲哚。在这里,我们讨论了描述吲哚变异的方法,以及对进化过程中的插入和缺失进行建模的方法。我们为处理基因组序列中的吲哚提供了实用建议,并以人类和其他自然种群中吲哚诱导效应及其对进化过程的贡献为例进行了说明。我们概述了统计方法的未来发展方向,这些方法将使研究人员能够分析大型基因组数据集中的吲哚变异及其影响,并将吲哚纳入进化推断中。
{"title":"Insertions and Deletions: Computational Methods, Evolutionary Dynamics, and Biological Applications.","authors":"Benjamin D Redelings, Ian Holmes, Gerton Lunter, Tal Pupko, Maria Anisimova","doi":"10.1093/molbev/msae177","DOIUrl":"10.1093/molbev/msae177","url":null,"abstract":"<p><p>Insertions and deletions constitute the second most important source of natural genomic variation. Insertions and deletions make up to 25% of genomic variants in humans and are involved in complex evolutionary processes including genomic rearrangements, adaptation, and speciation. Recent advances in long-read sequencing technologies allow detailed inference of insertions and deletion variation in species and populations. Yet, despite their importance, evolutionary studies have traditionally ignored or mishandled insertions and deletions due to a lack of comprehensive methodologies and statistical models of insertions and deletion dynamics. Here, we discuss methods for describing insertions and deletion variation and modeling insertions and deletions over evolutionary time. We provide practical advice for tackling insertions and deletions in genomic sequences and illustrate our discussion with examples of insertions and deletion-induced effects in human and other natural populations and their contribution to evolutionary processes. We outline promising directions for future developments in statistical methodologies that would allow researchers to analyze insertions and deletion variation and their effects in large genomic data sets and to incorporate insertions and deletions in evolutionary inference.</p>","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":11.0,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11385596/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142036386","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Genetic Architecture of Recombination Rates is Polygenic and Differs Between the Sexes in Wild House Sparrows (Passer domesticus). 在野生家雀(Passer domesticus)中,重组率的遗传结构是多基因的,并且在性别之间存在差异。
IF 11 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-09-04 DOI: 10.1093/molbev/msae179
John B McAuley, Bertrand Servin, Hamish A Burnett, Cathrine Brekke, Lucy Peters, Ingerid J Hagen, Alina K Niskanen, Thor Harald Ringsby, Arild Husby, Henrik Jensen, Susan E Johnston

Meiotic recombination through chromosomal crossing-over is a fundamental feature of sex and an important driver of genomic diversity. It ensures proper disjunction, allows increased selection responses, and prevents mutation accumulation; however, it is also mutagenic and can break up favorable haplotypes. This cost-benefit dynamic is likely to vary depending on mechanistic and evolutionary contexts, and indeed, recombination rates show huge variation in nature. Identifying the genetic architecture of this variation is key to understanding its causes and consequences. Here, we investigate individual recombination rate variation in wild house sparrows (Passer domesticus). We integrate genomic and pedigree data to identify autosomal crossover counts (ACCs) and intrachromosomal allelic shuffling (r¯intra) in 13,056 gametes transmitted from 2,653 individuals to their offspring. Females had 1.37 times higher ACC, and 1.55 times higher r¯intra than males. ACC and r¯intra were heritable in females and males (ACC h2 = 0.23 and 0.11; r¯intra  h2 = 0.12 and 0.14), but cross-sex additive genetic correlations were low (rA = 0.29 and 0.32 for ACC and r¯intra). Conditional bivariate analyses showed that all measures remained heritable after accounting for genetic values in the opposite sex, indicating that sex-specific ACC and r¯intra can evolve somewhat independently. Genome-wide models showed that ACC and r¯intra are polygenic and driven by many small-effect loci, many of which are likely to act in trans as global recombination modifiers. Our findings show that recombination rates of females and males can have different evolutionary potential in wild birds, providing a compelling mechanism for the evolution of sexual dimorphism in recombination.

通过染色体交叉进行的减数分裂重组是性别的基本特征,也是基因组多样性的重要驱动力。它能确保适当的分离,允许更多的选择反应,并防止突变积累;然而,它也具有诱变作用,并能破坏有利的单倍型。这种成本/收益的动态变化可能因机制和进化背景的不同而不同,事实上,重组率在自然界中表现出巨大的差异。确定这种变异的遗传结构是了解其原因和后果的关键。在这里,我们研究了野生家雀(Passer domesticus)的个体重组率变异。我们整合了基因组和血统数据,以确定从 2,653 个个体传给后代的 13,056 个配子中的常染色体交叉计数(ACC)和染色体内等位基因洗牌(r¯intra)。雌性的ACC和r¯intra分别是雄性的1.37倍和1.55倍。ACC和r¯intra分别为0.23和0.11;r¯intra h2分别为0.12和0.14),但跨性别加性遗传相关性较低(ACC和r¯intra的rA分别为0.29和0.32)。条件双变量分析表明,在考虑了异性的遗传值后,所有指标仍然是可遗传的,这表明性别特异性 ACC 和 r¯intra 的进化在某种程度上是独立的。全基因组模型显示,ACC和r¯intra是多基因的,由许多小效应位点驱动,其中许多位点可能反式充当全局重组调节因子。我们的研究结果表明,在野生鸟类中,雌性和雄性的重组率可能具有不同的进化潜力,这为重组中的性二态进化提供了一个令人信服的机制。
{"title":"The Genetic Architecture of Recombination Rates is Polygenic and Differs Between the Sexes in Wild House Sparrows (Passer domesticus).","authors":"John B McAuley, Bertrand Servin, Hamish A Burnett, Cathrine Brekke, Lucy Peters, Ingerid J Hagen, Alina K Niskanen, Thor Harald Ringsby, Arild Husby, Henrik Jensen, Susan E Johnston","doi":"10.1093/molbev/msae179","DOIUrl":"10.1093/molbev/msae179","url":null,"abstract":"<p><p>Meiotic recombination through chromosomal crossing-over is a fundamental feature of sex and an important driver of genomic diversity. It ensures proper disjunction, allows increased selection responses, and prevents mutation accumulation; however, it is also mutagenic and can break up favorable haplotypes. This cost-benefit dynamic is likely to vary depending on mechanistic and evolutionary contexts, and indeed, recombination rates show huge variation in nature. Identifying the genetic architecture of this variation is key to understanding its causes and consequences. Here, we investigate individual recombination rate variation in wild house sparrows (Passer domesticus). We integrate genomic and pedigree data to identify autosomal crossover counts (ACCs) and intrachromosomal allelic shuffling (r¯intra) in 13,056 gametes transmitted from 2,653 individuals to their offspring. Females had 1.37 times higher ACC, and 1.55 times higher r¯intra than males. ACC and r¯intra were heritable in females and males (ACC h2 = 0.23 and 0.11; r¯intra  h2 = 0.12 and 0.14), but cross-sex additive genetic correlations were low (rA = 0.29 and 0.32 for ACC and r¯intra). Conditional bivariate analyses showed that all measures remained heritable after accounting for genetic values in the opposite sex, indicating that sex-specific ACC and r¯intra can evolve somewhat independently. Genome-wide models showed that ACC and r¯intra are polygenic and driven by many small-effect loci, many of which are likely to act in trans as global recombination modifiers. Our findings show that recombination rates of females and males can have different evolutionary potential in wild birds, providing a compelling mechanism for the evolution of sexual dimorphism in recombination.</p>","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":11.0,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11385585/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142056094","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Novel Expression Domain of extradenticle Underlies the Evolutionary Developmental Origin of the Chelicerate Patella. 髌骨外的新表达域是螯肢髌骨进化发展起源的基础。
IF 11 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-09-04 DOI: 10.1093/molbev/msae188
Benjamin C Klementz, Georg Brenneis, Isaac A Hinne, Ethan M Laumer, Sophie M Neu, Grace M Hareid, Guilherme Gainett, Emily V W Setton, Catalina Simian, David E Vrech, Isabella Joyce, Austen A Barnett, Nipam H Patel, Mark S Harvey, Alfredo V Peretti, Monika Gulia-Nuss, Prashant P Sharma

Neofunctionalization of duplicated gene copies is thought to be an important process underlying the origin of evolutionary novelty and provides an elegant mechanism for the origin of new phenotypic traits. One putative case where a new gene copy has been linked to a novel morphological trait is the origin of the arachnid patella, a taxonomically restricted leg segment. In spiders, the origin of this segment has been linked to the origin of the paralog dachshund-2, suggesting that a new gene facilitated the expression of a new trait. However, various arachnid groups that possess patellae do not have a copy of dachshund-2, disfavoring the direct link between gene origin and trait origin. We investigated the developmental genetic basis for patellar patterning in the harvestman Phalangium opilio, which lacks dachshund-2. Here, we show that the harvestman patella is established by a novel expression domain of the transcription factor extradenticle. Leveraging this definition of patellar identity, we surveyed targeted groups across chelicerate phylogeny to assess when this trait evolved. We show that a patellar homolog is present in Pycnogonida (sea spiders) and various arachnid orders, suggesting a single origin of the patella in the ancestor of Chelicerata. A potential loss of the patella is observed in Ixodida. Our results suggest that the modification of an ancient gene, rather than the neofunctionalization of a new gene copy, underlies the origin of the patella. Broadly, this work underscores the value of comparative data and broad taxonomic sampling when testing hypotheses in evolutionary developmental biology.

复制基因拷贝的新功能化被认为是进化新特性起源的一个重要过程,并为新表型特征的起源提供了一个优雅的机制。新的基因拷贝与新的形态特征相关联的一个推测案例是蛛形纲动物髌骨的起源,这是一个在分类学上受到限制的腿节。在蜘蛛中,该腿节的起源与旁系亲属腊肠狗-2 的起源有关,这表明新基因促进了新性状的表达。然而,拥有髌骨的各种蛛形纲都没有腊肠犬-2的拷贝,这就否定了基因起源与性状起源之间的直接联系。我们研究了缺乏 dachshund-2 的禾本科动物 Phalangium opilio 的髌骨花纹的发育遗传基础。在这里,我们发现禾本科动物的髌骨是由转录因子extracenticle的一个新表达域建立的。根据髌骨特征的这一定义,我们调查了螯足类系统发育中的目标群,以评估这一特征的进化时间。我们发现海蜘蛛(Pycnogonida)和各种蛛形纲中都有髌骨同源物,这表明髌骨起源于螯足类的祖先。髌骨可能在伊科动物中消失。我们的研究结果表明,髌骨起源的基础是一个古老基因的改变,而不是一个新基因拷贝的新功能化。总的来说,这项研究强调了比较数据和广泛的分类取样在检验进化发育生物学假说时的价值。
{"title":"A Novel Expression Domain of extradenticle Underlies the Evolutionary Developmental Origin of the Chelicerate Patella.","authors":"Benjamin C Klementz, Georg Brenneis, Isaac A Hinne, Ethan M Laumer, Sophie M Neu, Grace M Hareid, Guilherme Gainett, Emily V W Setton, Catalina Simian, David E Vrech, Isabella Joyce, Austen A Barnett, Nipam H Patel, Mark S Harvey, Alfredo V Peretti, Monika Gulia-Nuss, Prashant P Sharma","doi":"10.1093/molbev/msae188","DOIUrl":"10.1093/molbev/msae188","url":null,"abstract":"<p><p>Neofunctionalization of duplicated gene copies is thought to be an important process underlying the origin of evolutionary novelty and provides an elegant mechanism for the origin of new phenotypic traits. One putative case where a new gene copy has been linked to a novel morphological trait is the origin of the arachnid patella, a taxonomically restricted leg segment. In spiders, the origin of this segment has been linked to the origin of the paralog dachshund-2, suggesting that a new gene facilitated the expression of a new trait. However, various arachnid groups that possess patellae do not have a copy of dachshund-2, disfavoring the direct link between gene origin and trait origin. We investigated the developmental genetic basis for patellar patterning in the harvestman Phalangium opilio, which lacks dachshund-2. Here, we show that the harvestman patella is established by a novel expression domain of the transcription factor extradenticle. Leveraging this definition of patellar identity, we surveyed targeted groups across chelicerate phylogeny to assess when this trait evolved. We show that a patellar homolog is present in Pycnogonida (sea spiders) and various arachnid orders, suggesting a single origin of the patella in the ancestor of Chelicerata. A potential loss of the patella is observed in Ixodida. Our results suggest that the modification of an ancient gene, rather than the neofunctionalization of a new gene copy, underlies the origin of the patella. Broadly, this work underscores the value of comparative data and broad taxonomic sampling when testing hypotheses in evolutionary developmental biology.</p>","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":11.0,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11422720/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142133203","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Patterns of Fitness and Gene Expression Epistasis Generated by Beneficial Mutations in the rho and rpoB Genes of Escherichia coli during High-Temperature Adaptation. 高温适应过程中大肠杆菌 rho 和 rpoB 基因有益突变产生的适应性和基因表达表观模式。
IF 11 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-09-04 DOI: 10.1093/molbev/msae187
Andrea González-González, Tiffany N Batarseh, Alejandra Rodríguez-Verdugo, Brandon S Gaut

Epistasis is caused by genetic interactions among mutations that affect fitness. To characterize properties and potential mechanisms of epistasis, we engineered eight double mutants that combined mutations from the rho and rpoB genes of Escherichia coli. The two genes encode essential functions for transcription, and the mutations in each gene were chosen because they were beneficial for adaptation to thermal stress (42.2 °C). The double mutants exhibited patterns of fitness epistasis that included diminishing returns epistasis at 42.2 °C, stronger diminishing returns between mutations with larger beneficial effects and both negative and positive (sign) epistasis across environments (20.0 °C and 37.0 °C). By assessing gene expression between single and double mutants, we detected hundreds of genes with gene expression epistasis. Previous work postulated that highly connected hub genes in coexpression networks have low epistasis, but we found the opposite: hub genes had high epistasis values in both coexpression and protein-protein interaction networks. We hypothesized that elevated epistasis in hub genes reflected that they were enriched for targets of Rho termination but that was not the case. Altogether, gene expression and coexpression analyses revealed that thermal adaptation occurred in modules, through modulation of ribonucleotide biosynthetic processes and ribosome assembly, the attenuation of expression in genes related to heat shock and stress responses, and with an overall trend toward restoring gene expression toward the unstressed state.

外显性是由影响适存性的突变之间的遗传相互作用引起的。为了描述表观遗传的特性和潜在机制,我们设计了八个结合了大肠杆菌 rho 和 rpoB 基因突变的双突变体。这两个基因编码转录的基本功能,选择每个基因的突变是因为它们有利于适应热应激(42.2°C)。双突变体表现出适应性外显的模式,包括在42.2°C时的递减回报外显,在具有较大有益效应的突变之间的较强递减回报,以及在不同环境(20.0°C和37.0°C)下的负外显和正(符号)外显。通过评估单突变体和双突变体之间的基因表达,我们发现了数百个具有基因表达外显性的基因。以前的研究推测,共表达网络中高度连接的中心基因具有较低的表观性,但我们发现情况恰恰相反:中心基因在共表达和蛋白质相互作用网络中都具有较高的表观性。我们假设中枢基因的上位性升高反映了它们富集了 Rho 终止的靶标,但事实并非如此。总之,基因表达和共表达分析表明,热适应是通过调节核糖核苷酸生物合成过程和核糖体组装、减弱与热休克和应激反应有关的基因的表达以及使基因表达恢复到非应激状态的总体趋势等模块实现的。
{"title":"Patterns of Fitness and Gene Expression Epistasis Generated by Beneficial Mutations in the rho and rpoB Genes of Escherichia coli during High-Temperature Adaptation.","authors":"Andrea González-González, Tiffany N Batarseh, Alejandra Rodríguez-Verdugo, Brandon S Gaut","doi":"10.1093/molbev/msae187","DOIUrl":"10.1093/molbev/msae187","url":null,"abstract":"<p><p>Epistasis is caused by genetic interactions among mutations that affect fitness. To characterize properties and potential mechanisms of epistasis, we engineered eight double mutants that combined mutations from the rho and rpoB genes of Escherichia coli. The two genes encode essential functions for transcription, and the mutations in each gene were chosen because they were beneficial for adaptation to thermal stress (42.2 °C). The double mutants exhibited patterns of fitness epistasis that included diminishing returns epistasis at 42.2 °C, stronger diminishing returns between mutations with larger beneficial effects and both negative and positive (sign) epistasis across environments (20.0 °C and 37.0 °C). By assessing gene expression between single and double mutants, we detected hundreds of genes with gene expression epistasis. Previous work postulated that highly connected hub genes in coexpression networks have low epistasis, but we found the opposite: hub genes had high epistasis values in both coexpression and protein-protein interaction networks. We hypothesized that elevated epistasis in hub genes reflected that they were enriched for targets of Rho termination but that was not the case. Altogether, gene expression and coexpression analyses revealed that thermal adaptation occurred in modules, through modulation of ribonucleotide biosynthetic processes and ribosome assembly, the attenuation of expression in genes related to heat shock and stress responses, and with an overall trend toward restoring gene expression toward the unstressed state.</p>","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":11.0,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11414761/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142133204","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Epigenetics Research in Evolutionary Biology: Perspectives on Timescales and Mechanisms. 进化生物学中的表观遗传学研究:关于时间尺度和机制的观点。
IF 11 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-09-03 DOI: 10.1093/molbev/msae170
Soojin V Yi

Epigenetics research in evolutionary biology encompasses a variety of research areas, from regulation of gene expression to inheritance of environmentally mediated phenotypes. Such divergent research foci can occasionally render the umbrella term "epigenetics" ambiguous. Here I discuss several areas of contemporary epigenetics research in the context of evolutionary biology, aiming to provide balanced views across timescales and molecular mechanisms. The importance of epigenetics in development is now being assessed in many nonmodel species. These studies not only confirm the importance of epigenetic marks in developmental processes, but also highlight the significant diversity in epigenetic regulatory mechanisms across taxa. Further, these comparative epigenomic studies have begun to show promise toward enhancing our understanding of how regulatory programs evolve. A key property of epigenetic marks is that they can be inherited along mitotic cell lineages, and epigenetic differences that occur during early development can have lasting consequences on the organismal phenotypes. Thus, epigenetic marks may play roles in short-term (within an organism's lifetime or to the next generation) adaptation and phenotypic plasticity. However, the extent to which observed epigenetic variation occurs independently of genetic influences remains uncertain, due to the widespread impact of genetics on epigenetic variation and the limited availability of comprehensive (epi)genomic resources from most species. While epigenetic marks can be inherited independently of genetic sequences in some species, there is little evidence that such "transgenerational inheritance" is a general phenomenon. Rather, molecular mechanisms of epigenetic inheritance are highly variable between species.

进化生物学中的表观遗传学研究涵盖从基因表达调控到环境介导表型遗传等多个研究领域。这些不同的研究重点有时会使 "表观遗传学 "这个总括术语变得模糊不清。在此,我将结合进化生物学讨论当代表观遗传学研究的几个领域,旨在提供跨越时间尺度和分子机制的平衡观点。表观遗传学在发育过程中的重要性目前正在许多非模式物种中得到评估。这些研究不仅证实了表观遗传标记在发育过程中的重要性,而且还凸显了不同类群之间表观遗传调控机制的显著多样性。此外,这些表观基因组比较研究已开始显示出希望,能增进我们对调控程序如何进化的了解。表观遗传标记的一个关键特性是,它们可以沿着有丝分裂细胞系遗传,早期发育过程中出现的表观遗传差异会对生物体的表型产生持久的影响。因此,表观遗传标记可能在短期(生物体一生或下一代)适应和表型可塑性方面发挥作用。然而,由于遗传对表观遗传变异的广泛影响,以及大多数物种的全面(表)基因组资源有限,观察到的表观遗传变异在多大程度上独立于遗传影响仍然不确定。虽然在某些物种中,表观遗传标记可以独立于基因序列遗传,但几乎没有证据表明这种 "跨代遗传 "是一种普遍现象。相反,不同物种之间的表观遗传分子机制差异很大。
{"title":"Epigenetics Research in Evolutionary Biology: Perspectives on Timescales and Mechanisms.","authors":"Soojin V Yi","doi":"10.1093/molbev/msae170","DOIUrl":"10.1093/molbev/msae170","url":null,"abstract":"<p><p>Epigenetics research in evolutionary biology encompasses a variety of research areas, from regulation of gene expression to inheritance of environmentally mediated phenotypes. Such divergent research foci can occasionally render the umbrella term \"epigenetics\" ambiguous. Here I discuss several areas of contemporary epigenetics research in the context of evolutionary biology, aiming to provide balanced views across timescales and molecular mechanisms. The importance of epigenetics in development is now being assessed in many nonmodel species. These studies not only confirm the importance of epigenetic marks in developmental processes, but also highlight the significant diversity in epigenetic regulatory mechanisms across taxa. Further, these comparative epigenomic studies have begun to show promise toward enhancing our understanding of how regulatory programs evolve. A key property of epigenetic marks is that they can be inherited along mitotic cell lineages, and epigenetic differences that occur during early development can have lasting consequences on the organismal phenotypes. Thus, epigenetic marks may play roles in short-term (within an organism's lifetime or to the next generation) adaptation and phenotypic plasticity. However, the extent to which observed epigenetic variation occurs independently of genetic influences remains uncertain, due to the widespread impact of genetics on epigenetic variation and the limited availability of comprehensive (epi)genomic resources from most species. While epigenetic marks can be inherited independently of genetic sequences in some species, there is little evidence that such \"transgenerational inheritance\" is a general phenomenon. Rather, molecular mechanisms of epigenetic inheritance are highly variable between species.</p>","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":"41 9","pages":""},"PeriodicalIF":11.0,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11376073/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142133205","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Invasion and Amplification of Endogenous Retroviruses in Dasyuridae Marsupial Genomes. 有袋动物基因组中内源性逆转录病毒的入侵和扩增。
IF 11 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-08-02 DOI: 10.1093/molbev/msae160
Emma F Harding, Lewis K Mercer, Grace J H Yan, Paul D Waters, Peter A White

Retroviruses are an ancient viral family that have globally coevolved with vertebrates and impacted their evolution. In Australia, a continent that has been geographically isolated for millions of years, little is known about retroviruses in wildlife, despite the devastating impacts of a retrovirus on endangered koala populations. We therefore sought to identify and characterize Australian retroviruses through reconstruction of endogenous retroviruses from marsupial genomes, in particular the Tasmanian devil due to its high cancer incidence. We screened 19 marsupial genomes and identified over 80,000 endogenous retrovirus fragments which we classified into eight retrovirus clades. The retroviruses were similar to either Betaretrovirus (5/8) or Gammaretrovirus (3/8) retroviruses, but formed distinct phylogenetic clades compared to extant retroviruses. One of the clades (MEBrv 3) lost an envelope but retained retrotranspositional activity, subsequently amplifying throughout all Dasyuridae genomes. Overall, we provide insights into Australian retrovirus evolution and identify a highly active endogenous retrovirus within Dasyuridae genomes.

逆转录病毒是一个古老的病毒家族,在全球范围内与脊椎动物共同进化,影响着脊椎动物的进化。澳大利亚是一个地理上与世隔绝了数百万年的大陆,尽管逆转录病毒对濒临灭绝的考拉种群造成了毁灭性的影响,但人们对野生动物中的逆转录病毒知之甚少。因此,我们试图通过重建有袋动物基因组中的内源性逆转录病毒(ERVs)来识别澳大利亚的逆转录病毒并确定其特征,特别是塔斯马尼亚袋魔,因为它的癌症发病率很高。我们筛选了 19 个有袋动物基因组,发现了 8 万多个 ERV 片段,并将其分为 8 个逆转录病毒支系。这些逆转录病毒与 Betaretrovirus(5/8)或 Gammaretrovirus(3/8)逆转录病毒相似,但与现存的逆转录病毒相比形成了不同的系统发育支系。其中一个支系(MEBrv 3)失去了包膜,但保留了逆转录活性,随后在所有 Dasyuridae 基因组中扩增。总之,我们对澳大利亚逆转录病毒的进化有了深入的了解,并在 Dasyuridae 基因组中发现了一种高度活跃的 ERV。
{"title":"Invasion and Amplification of Endogenous Retroviruses in Dasyuridae Marsupial Genomes.","authors":"Emma F Harding, Lewis K Mercer, Grace J H Yan, Paul D Waters, Peter A White","doi":"10.1093/molbev/msae160","DOIUrl":"10.1093/molbev/msae160","url":null,"abstract":"<p><p>Retroviruses are an ancient viral family that have globally coevolved with vertebrates and impacted their evolution. In Australia, a continent that has been geographically isolated for millions of years, little is known about retroviruses in wildlife, despite the devastating impacts of a retrovirus on endangered koala populations. We therefore sought to identify and characterize Australian retroviruses through reconstruction of endogenous retroviruses from marsupial genomes, in particular the Tasmanian devil due to its high cancer incidence. We screened 19 marsupial genomes and identified over 80,000 endogenous retrovirus fragments which we classified into eight retrovirus clades. The retroviruses were similar to either Betaretrovirus (5/8) or Gammaretrovirus (3/8) retroviruses, but formed distinct phylogenetic clades compared to extant retroviruses. One of the clades (MEBrv 3) lost an envelope but retained retrotranspositional activity, subsequently amplifying throughout all Dasyuridae genomes. Overall, we provide insights into Australian retrovirus evolution and identify a highly active endogenous retrovirus within Dasyuridae genomes.</p>","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":11.0,"publicationDate":"2024-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11334065/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141889758","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evolution of Virus-like Features and Intrinsically Disordered Regions in Retrotransposon-derived Mammalian Genes. 逆转录转座子衍生哺乳动物基因中病毒样特征和内在紊乱区的进化
IF 11 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-08-02 DOI: 10.1093/molbev/msae154
Rachele Cagliani, Diego Forni, Alessandra Mozzi, Rotem Fuchs, Dafna Tussia-Cohen, Federica Arrigoni, Uberto Pozzoli, Luca De Gioia, Tzachi Hagai, Manuela Sironi

Several mammalian genes have originated from the domestication of retrotransposons, selfish mobile elements related to retroviruses. Some of the proteins encoded by these genes have maintained virus-like features; including self-processing, capsid structure formation, and the generation of different isoforms through -1 programmed ribosomal frameshifting. Using quantitative approaches in molecular evolution and biophysical analyses, we studied 28 retrotransposon-derived genes, with a focus on the evolution of virus-like features. By analyzing the rate of synonymous substitutions, we show that the -1 programmed ribosomal frameshifting mechanism in three of these genes (PEG10, PNMA3, and PNMA5) is conserved across mammals and originates alternative proteins. These genes were targets of positive selection in primates, and one of the positively selected sites affects a B-cell epitope on the spike domain of the PNMA5 capsid, a finding reminiscent of observations in infectious viruses. More generally, we found that retrotransposon-derived proteins vary in their intrinsically disordered region content and this is directly associated with their evolutionary rates. Most positively selected sites in these proteins are located in intrinsically disordered regions and some of them impact protein posttranslational modifications, such as autocleavage and phosphorylation. Detailed analyses of the biophysical properties of intrinsically disordered regions showed that positive selection preferentially targeted regions with lower conformational entropy. Furthermore, positive selection introduces variation in binary sequence patterns across orthologues, as well as in chain compaction. Our results shed light on the evolutionary trajectories of a unique class of mammalian genes and suggest a novel approach to study how intrinsically disordered region biophysical characteristics are affected by evolution.

哺乳动物的一些基因来源于逆转录病毒的驯化,逆转录病毒是一种与逆转录病毒有关的自私的移动元素。这些基因编码的一些蛋白质保持了类似病毒的特征,包括自我处理、形成囊膜结构,以及通过-1程序核糖体框架转换产生不同的异构体。利用分子进化和生物物理分析的定量方法,我们研究了 28 个反转座子衍生基因,重点关注病毒样特征的进化。通过分析同义替换率,我们发现其中三个基因(PEG10、PNMA3 和 PNMA5)的-1 程序化核糖体框架转换机制在哺乳动物中是保守的,并产生了替代蛋白。这些基因在灵长类动物中是正选择的目标,其中一个正选择位点影响了 PNMA5 包囊尖峰结构域上的 B 细胞表位,这一发现让人联想到在传染性病毒中的观察结果。更广泛地说,我们发现逆转录病毒载体衍生的蛋白质在其内在无序区的含量上各不相同,这与其进化速度直接相关。这些蛋白质中的大多数正选择位点都位于内在无序区,其中一些位点会影响蛋白质的翻译后修饰,如自裂解和磷酸化。对内在无序区生物物理特性的详细分析显示,正向选择优先针对构象熵较低的区域。此外,正选择还引入了同源物之间二元序列模式的变异以及链的压缩。我们的研究结果揭示了一类独特的哺乳动物基因的进化轨迹,并提出了一种研究内在无序区生物物理特征如何受进化影响的新方法。
{"title":"Evolution of Virus-like Features and Intrinsically Disordered Regions in Retrotransposon-derived Mammalian Genes.","authors":"Rachele Cagliani, Diego Forni, Alessandra Mozzi, Rotem Fuchs, Dafna Tussia-Cohen, Federica Arrigoni, Uberto Pozzoli, Luca De Gioia, Tzachi Hagai, Manuela Sironi","doi":"10.1093/molbev/msae154","DOIUrl":"10.1093/molbev/msae154","url":null,"abstract":"<p><p>Several mammalian genes have originated from the domestication of retrotransposons, selfish mobile elements related to retroviruses. Some of the proteins encoded by these genes have maintained virus-like features; including self-processing, capsid structure formation, and the generation of different isoforms through -1 programmed ribosomal frameshifting. Using quantitative approaches in molecular evolution and biophysical analyses, we studied 28 retrotransposon-derived genes, with a focus on the evolution of virus-like features. By analyzing the rate of synonymous substitutions, we show that the -1 programmed ribosomal frameshifting mechanism in three of these genes (PEG10, PNMA3, and PNMA5) is conserved across mammals and originates alternative proteins. These genes were targets of positive selection in primates, and one of the positively selected sites affects a B-cell epitope on the spike domain of the PNMA5 capsid, a finding reminiscent of observations in infectious viruses. More generally, we found that retrotransposon-derived proteins vary in their intrinsically disordered region content and this is directly associated with their evolutionary rates. Most positively selected sites in these proteins are located in intrinsically disordered regions and some of them impact protein posttranslational modifications, such as autocleavage and phosphorylation. Detailed analyses of the biophysical properties of intrinsically disordered regions showed that positive selection preferentially targeted regions with lower conformational entropy. Furthermore, positive selection introduces variation in binary sequence patterns across orthologues, as well as in chain compaction. Our results shed light on the evolutionary trajectories of a unique class of mammalian genes and suggest a novel approach to study how intrinsically disordered region biophysical characteristics are affected by evolution.</p>","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":"41 8","pages":""},"PeriodicalIF":11.0,"publicationDate":"2024-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11299033/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141889761","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
phyloBARCODER: A Web Tool for Phylogenetic Classification of Eukaryote Metabarcodes Using Custom Reference Databases. phyloBARCODER:利用定制参考数据库对真核生物代谢条进行系统发育分类的网络工具。
IF 11 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-08-02 DOI: 10.1093/molbev/msae111
Jun Inoue, Chuya Shinzato, Junya Hirai, Sachihiko Itoh, Yuki Minegishi, Shin-Ichi Ito, Susumu Hyodo

We developed phyloBARCODER (https://github.com/jun-inoue/phyloBARCODER), a new web tool that can identify short DNA sequences to the species level using metabarcoding. phyloBARCODER estimates phylogenetic trees based on the uploaded anonymous DNA sequences and reference sequences from databases. Without such phylogenetic contexts, alternative, similarity-based methods independently identify species names and anonymous sequences of the same group by pairwise comparisons between queries and database sequences, with the caveat that they must match exactly or very closely. By putting metabarcoding sequences into a phylogenetic context, phyloBARCODER accurately identifies (i) species or classification of query sequences and (ii) anonymous sequences associated with the same species or even with populations of query sequences, with clear and accurate explanations. Version 1 of phyloBARCODER stores a database comprising all eukaryotic mitochondrial gene sequences. Moreover, by uploading their own databases, phyloBARCODER users can conduct species identification specialized for sequences obtained from a local geographic region or those of nonmitochondrial genes, e.g. ITS or rbcL.

我们开发了phyloBARCODER (https://github.com/jun-inoue/phyloBARCODER),这是一种新的网络工具,可以利用元条码技术识别物种水平的短DNA序列。phyloBARCODER根据上传的匿名DNA序列和数据库中的参考序列估算系统发生树。如果没有这样的系统发育背景,其他基于相似性的方法只能通过对查询序列和数据库序列进行配对比较来独立识别同组的物种名称和匿名序列,但需要注意的是,这些序列必须完全匹配或非常接近。通过将元条码序列置于系统发育的背景下,phyloBARCODER 可以准确识别(1)查询序列的物种或分类,以及(2)与同一物种甚至与查询序列的种群相关的匿名序列,并给出清晰准确的解释。第一版 phyloBARCODER 存储了一个包含所有真核生物线粒体基因序列的数据库。此外,通过上传自己的数据库,phyloBARCODER 用户还可以对从本地地理区域获得的序列或非线粒体基因(如 ITS 或 rbcL)序列进行专门的物种鉴定。
{"title":"phyloBARCODER: A Web Tool for Phylogenetic Classification of Eukaryote Metabarcodes Using Custom Reference Databases.","authors":"Jun Inoue, Chuya Shinzato, Junya Hirai, Sachihiko Itoh, Yuki Minegishi, Shin-Ichi Ito, Susumu Hyodo","doi":"10.1093/molbev/msae111","DOIUrl":"10.1093/molbev/msae111","url":null,"abstract":"<p><p>We developed phyloBARCODER (https://github.com/jun-inoue/phyloBARCODER), a new web tool that can identify short DNA sequences to the species level using metabarcoding. phyloBARCODER estimates phylogenetic trees based on the uploaded anonymous DNA sequences and reference sequences from databases. Without such phylogenetic contexts, alternative, similarity-based methods independently identify species names and anonymous sequences of the same group by pairwise comparisons between queries and database sequences, with the caveat that they must match exactly or very closely. By putting metabarcoding sequences into a phylogenetic context, phyloBARCODER accurately identifies (i) species or classification of query sequences and (ii) anonymous sequences associated with the same species or even with populations of query sequences, with clear and accurate explanations. Version 1 of phyloBARCODER stores a database comprising all eukaryotic mitochondrial gene sequences. Moreover, by uploading their own databases, phyloBARCODER users can conduct species identification specialized for sequences obtained from a local geographic region or those of nonmitochondrial genes, e.g. ITS or rbcL.</p>","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":11.0,"publicationDate":"2024-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11297486/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141293566","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploring the Origins and Evolution of Oxygenic and Anoxygenic Photosynthesis in Deeply Branched Cyanobacteriota. 探索深枝蓝细菌含氧和缺氧光合作用的起源和进化。
IF 11 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-08-02 DOI: 10.1093/molbev/msae151
Sha Tan, Lan Liu, Jian-Yu Jiao, Meng-Meng Li, Chao-Jian Hu, Ai-Ping Lv, Yan-Ling Qi, Yu-Xian Li, Yang-Zhi Rao, Yan-Ni Qu, Hong-Chen Jiang, Rochelle M Soo, Paul N Evans, Zheng-Shuang Hua, Wen-Jun Li

Cyanobacteriota, the sole prokaryotes capable of oxygenic photosynthesis (OxyP), occupy a unique and pivotal role in Earth's history. While the notion that OxyP may have originated from Cyanobacteriota is widely accepted, its early evolution remains elusive. Here, by using both metagenomics and metatranscriptomics, we explore 36 metagenome-assembled genomes from hot spring ecosystems, belonging to two deep-branching cyanobacterial orders: Thermostichales and Gloeomargaritales. Functional investigation reveals that Thermostichales encode the crucial thylakoid membrane biogenesis protein, vesicle-inducing protein in plastids 1 (Vipp1). Based on the phylogenetic results, we infer that the evolution of the thylakoid membrane predates the divergence of Thermostichales from other cyanobacterial groups and that Thermostichales may be the most ancient lineage known to date to have inherited this feature from their common ancestor. Apart from OxyP, both lineages are potentially capable of sulfide-driven AnoxyP by linking sulfide oxidation to the photosynthetic electron transport chain. Unexpectedly, this AnoxyP capacity appears to be an acquired feature, as the key gene sqr was horizontally transferred from later-evolved cyanobacterial lineages. The presence of two D1 protein variants in Thermostichales suggests the functional flexibility of photosystems, ensuring their survival in fluctuating redox environments. Furthermore, all MAGs feature streamlined phycobilisomes with a preference for capturing longer-wavelength light, implying a unique evolutionary trajectory. Collectively, these results reveal the photosynthetic flexibility in these early-diverging cyanobacterial lineages, shedding new light on the early evolution of Cyanobacteriota and their photosynthetic processes.

蓝细菌群是唯一能够进行含氧光合作用(OxyP)的原核生物,在地球历史上扮演着独特而关键的角色。虽然人们普遍认为氧光合作用可能起源于蓝细菌群,但其早期演化仍然难以捉摸。在这里,我们利用元基因组学和元转录组学,探索了来自温泉生态系统的 36 个元基因组组装基因组(MAGs),它们分别属于两个深分支蓝藻目:Thermostichales 和 Gloeomargaritales。功能性研究发现,Thermostichales编码关键的类囊体膜生物生成蛋白Vipp1。根据系统发育的结果,我们推断出类囊体膜的进化要早于恒温动物群与其他蓝藻群的分化,恒温动物群可能是迄今所知从其共同祖先继承了这一特征的最古老的类群。除了 OxyP 外,通过将硫化物氧化与光合电子传递链连接起来,这两个品系都有可能进行硫化物驱动的无氧光合作用(AnoxyP)。意想不到的是,这种 AnoxyP 能力似乎是后天获得的,因为关键基因 sqr 是由后来进化的蓝藻品系水平转移而来的。嗜热菌中存在两种 D1 蛋白变体,这表明光系统具有功能灵活性,可确保其在波动的氧化还原环境中生存。此外,所有 MAGs 都具有流线型的藻体,偏好捕捉长波长的光,这意味着它们有着独特的进化轨迹。总之,这些结果揭示了这些早期分化的蓝藻品系的光合作用灵活性,为蓝藻生物群及其光合作用过程的早期演化提供了新的线索。
{"title":"Exploring the Origins and Evolution of Oxygenic and Anoxygenic Photosynthesis in Deeply Branched Cyanobacteriota.","authors":"Sha Tan, Lan Liu, Jian-Yu Jiao, Meng-Meng Li, Chao-Jian Hu, Ai-Ping Lv, Yan-Ling Qi, Yu-Xian Li, Yang-Zhi Rao, Yan-Ni Qu, Hong-Chen Jiang, Rochelle M Soo, Paul N Evans, Zheng-Shuang Hua, Wen-Jun Li","doi":"10.1093/molbev/msae151","DOIUrl":"10.1093/molbev/msae151","url":null,"abstract":"<p><p>Cyanobacteriota, the sole prokaryotes capable of oxygenic photosynthesis (OxyP), occupy a unique and pivotal role in Earth's history. While the notion that OxyP may have originated from Cyanobacteriota is widely accepted, its early evolution remains elusive. Here, by using both metagenomics and metatranscriptomics, we explore 36 metagenome-assembled genomes from hot spring ecosystems, belonging to two deep-branching cyanobacterial orders: Thermostichales and Gloeomargaritales. Functional investigation reveals that Thermostichales encode the crucial thylakoid membrane biogenesis protein, vesicle-inducing protein in plastids 1 (Vipp1). Based on the phylogenetic results, we infer that the evolution of the thylakoid membrane predates the divergence of Thermostichales from other cyanobacterial groups and that Thermostichales may be the most ancient lineage known to date to have inherited this feature from their common ancestor. Apart from OxyP, both lineages are potentially capable of sulfide-driven AnoxyP by linking sulfide oxidation to the photosynthetic electron transport chain. Unexpectedly, this AnoxyP capacity appears to be an acquired feature, as the key gene sqr was horizontally transferred from later-evolved cyanobacterial lineages. The presence of two D1 protein variants in Thermostichales suggests the functional flexibility of photosystems, ensuring their survival in fluctuating redox environments. Furthermore, all MAGs feature streamlined phycobilisomes with a preference for capturing longer-wavelength light, implying a unique evolutionary trajectory. Collectively, these results reveal the photosynthetic flexibility in these early-diverging cyanobacterial lineages, shedding new light on the early evolution of Cyanobacteriota and their photosynthetic processes.</p>","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":11.0,"publicationDate":"2024-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11304991/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141748581","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fast and Accurate Estimation of Selection Coefficients and Allele Histories from Ancient and Modern DNA. 从古代和现代 DNA 中快速准确地估算选择系数和等位基因历史。
IF 11 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-08-02 DOI: 10.1093/molbev/msae156
Andrew H Vaughn, Rasmus Nielsen

We here present CLUES2, a full-likelihood method to infer natural selection from sequence data that is an extension of the method CLUES. We make several substantial improvements to the CLUES method that greatly increases both its applicability and its speed. We add the ability to use ancestral recombination graphs on ancient data as emissions to the underlying hidden Markov model, which enables CLUES2 to use both temporal and linkage information to make estimates of selection coefficients. We also fully implement the ability to estimate distinct selection coefficients in different epochs, which allows for the analysis of changes in selective pressures through time, as well as selection with dominance. In addition, we greatly increase the computational efficiency of CLUES2 over CLUES using several approximations to the forward-backward algorithms and develop a new way to reconstruct historic allele frequencies by integrating over the uncertainty in the estimation of the selection coefficients. We illustrate the accuracy of CLUES2 through extensive simulations and validate the importance sampling framework for integrating over the uncertainty in the inference of gene trees. We also show that CLUES2 is well-calibrated by showing that under the null hypothesis, the distribution of log-likelihood ratios follows a χ2 distribution with the appropriate degrees of freedom. We run CLUES2 on a set of recently published ancient human data from Western Eurasia and test for evidence of changing selection coefficients through time. We find significant evidence of changing selective pressures in several genes correlated with the introduction of agriculture to Europe and the ensuing dietary and demographic shifts of that time. In particular, our analysis supports previous hypotheses of strong selection on lactase persistence during periods of ancient famines and attenuated selection in more modern periods.

我们在此介绍 CLUES2,这是一种从序列数据推断自然选择的全似然方法,是 CLUES 方法的扩展。我们对 CLUES 方法进行了多项重大改进,大大提高了其适用性和速度。我们增加了在古代数据上使用 ARG 作为底层 HMM 排放的能力,这使得 CLUES2 能够使用时间信息和联系信息来估计选择系数。我们还完全实现了在不同时代估算不同选择系数的功能,从而可以分析选择压力随时间的变化以及优势选择。此外,与 CLUES 相比,我们使用了几种近似的前向后向算法,大大提高了 CLUES2 的计算效率,并开发了一种新方法,通过对选择系数估计中的不确定性进行积分,重建历史等位基因频率。我们通过大量模拟说明了 CLUES2 的准确性,并验证了在推断基因树时对不确定性进行整合的重要性采样框架。我们还通过证明在零假设下,对数似然比的分布遵循具有适当自由度的秩方分布,证明 CLUES2 经过了良好校准。我们在一组最近公布的欧亚大陆西部古人类数据上运行了 CLUES2,并检验了选择系数随时间变化的证据。我们发现了一些基因的选择压力发生变化的重要证据,这些变化与农业传入欧洲以及当时随之而来的饮食和人口变化有关。特别是,我们的分析支持了之前的假设,即在古代饥荒时期,乳糖酶的持久性受到了强烈的选择,而在更现代的时期,选择则有所减弱。
{"title":"Fast and Accurate Estimation of Selection Coefficients and Allele Histories from Ancient and Modern DNA.","authors":"Andrew H Vaughn, Rasmus Nielsen","doi":"10.1093/molbev/msae156","DOIUrl":"10.1093/molbev/msae156","url":null,"abstract":"<p><p>We here present CLUES2, a full-likelihood method to infer natural selection from sequence data that is an extension of the method CLUES. We make several substantial improvements to the CLUES method that greatly increases both its applicability and its speed. We add the ability to use ancestral recombination graphs on ancient data as emissions to the underlying hidden Markov model, which enables CLUES2 to use both temporal and linkage information to make estimates of selection coefficients. We also fully implement the ability to estimate distinct selection coefficients in different epochs, which allows for the analysis of changes in selective pressures through time, as well as selection with dominance. In addition, we greatly increase the computational efficiency of CLUES2 over CLUES using several approximations to the forward-backward algorithms and develop a new way to reconstruct historic allele frequencies by integrating over the uncertainty in the estimation of the selection coefficients. We illustrate the accuracy of CLUES2 through extensive simulations and validate the importance sampling framework for integrating over the uncertainty in the inference of gene trees. We also show that CLUES2 is well-calibrated by showing that under the null hypothesis, the distribution of log-likelihood ratios follows a χ2 distribution with the appropriate degrees of freedom. We run CLUES2 on a set of recently published ancient human data from Western Eurasia and test for evidence of changing selection coefficients through time. We find significant evidence of changing selective pressures in several genes correlated with the introduction of agriculture to Europe and the ensuing dietary and demographic shifts of that time. In particular, our analysis supports previous hypotheses of strong selection on lactase persistence during periods of ancient famines and attenuated selection in more modern periods.</p>","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":11.0,"publicationDate":"2024-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11321360/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141792878","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Molecular biology and evolution
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1