Ryan Seamus McGee, Grant Kinsler, Dmitri Petrov, Mikhail Tikhonov
Measuring the fitnesses of genetic variants is a fundamental objective in evolutionary biology. A standard approach for measuring microbial fitnesses in bulk involves labeling a library of genetic variants with unique sequence barcodes, competing the labeled strains in batch culture, and using deep sequencing to track changes in the barcode abundances over time. However, idiosyncratic properties of barcodes can induce nonuniform amplification or uneven sequencing coverage that causes some barcodes to be over- or under-represented in samples. This systematic bias can result in erroneous read count trajectories and misestimates of fitness. Here, we develop a computational method, named REBAR (Removing the Effects of Bias through Analysis of Residuals), for inferring the effects of barcode processing bias by leveraging the structure of systematic deviations in the data. We illustrate this approach by applying it to two independent data sets, and demonstrate that this method estimates and corrects for bias more accurately than standard proxies, such as GC-based corrections. REBAR mitigates bias and improves fitness estimates in high-throughput assays without introducing additional complexity to the experimental protocols, with potential applications in a range of experimental evolution and mutation screening contexts.
{"title":"Improving the Accuracy of Bulk Fitness Assays by Correcting Barcode Processing Biases.","authors":"Ryan Seamus McGee, Grant Kinsler, Dmitri Petrov, Mikhail Tikhonov","doi":"10.1093/molbev/msae152","DOIUrl":"10.1093/molbev/msae152","url":null,"abstract":"<p><p>Measuring the fitnesses of genetic variants is a fundamental objective in evolutionary biology. A standard approach for measuring microbial fitnesses in bulk involves labeling a library of genetic variants with unique sequence barcodes, competing the labeled strains in batch culture, and using deep sequencing to track changes in the barcode abundances over time. However, idiosyncratic properties of barcodes can induce nonuniform amplification or uneven sequencing coverage that causes some barcodes to be over- or under-represented in samples. This systematic bias can result in erroneous read count trajectories and misestimates of fitness. Here, we develop a computational method, named REBAR (Removing the Effects of Bias through Analysis of Residuals), for inferring the effects of barcode processing bias by leveraging the structure of systematic deviations in the data. We illustrate this approach by applying it to two independent data sets, and demonstrate that this method estimates and corrects for bias more accurately than standard proxies, such as GC-based corrections. REBAR mitigates bias and improves fitness estimates in high-throughput assays without introducing additional complexity to the experimental protocols, with potential applications in a range of experimental evolution and mutation screening contexts.</p>","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":11.0,"publicationDate":"2024-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11316221/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141748582","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Parallel evolution occurs when distinct lineages with similar ancestral states converge on a new phenotype. Parallel evolution has been well documented at the organ, gene pathway, and amino acid sequence level but in theory, it can also occur at individual nucleotides within noncoding regions. To examine the role of parallel evolution in shaping the biology of mammalian complex traits, we used data on single-nucleotide polymorphisms (SNPs) influencing human intraspecific variation to predict trait values in other species for 11 complex traits. We found that the alleles at SNP positions associated with human intraspecific height and red blood cell (RBC) count variation are associated with interspecific variation in the corresponding traits across mammals. These associations hold for deeper branches of mammalian evolution as well as between strains of collaborative cross mice. While variation in RBC count between primates uses both ancient and more recently evolved genomic regions, we found that only primate-specific elements were correlated with primate body size. We show that the SNP positions driving these signals are flanked by conserved sequences, maintain synteny with target genes, and overlap transcription factor binding sites. This work highlights the potential of conserved but tunable regulatory elements to be reused in parallel to facilitate evolutionary adaptation in mammals.
当具有相似祖先状态的不同血统汇聚到一个新的表型上时,平行进化就发生了。平行进化在器官、基因通路和氨基酸序列水平上都有详细记录,但在理论上,平行进化也可能发生在非编码区的单个核苷酸上。为了研究平行进化在塑造哺乳动物复杂性状生物学中的作用,我们利用影响人类种内变异的单核苷酸多态性(SNPs)数据来预测其他物种 11 个复杂性状的性状值。我们发现,与人类种内身高和红细胞计数变异相关的 SNP 位点等位基因与哺乳动物相应性状的种间变异相关。这些关联在哺乳动物进化的更深分支以及合作杂交小鼠品系之间都存在。虽然灵长类动物之间的红细胞计数变异使用了古老的和最近进化的基因组区域,但我们发现只有灵长类动物特有的元素与灵长类动物的体型相关。我们发现,驱动这些信号的 SNP 位置两侧是保守序列,与目标基因保持着同源关系,并与转录因子结合位点重叠。这项工作凸显了保守但可调整的调控元件被并行重复使用以促进哺乳动物进化适应的潜力。
{"title":"Parallel Evolution at the Regulatory Base-Pair Level Contributes to Mammalian Interspecific Differences in Polygenic Traits.","authors":"Alexander S Okamoto, Terence D Capellini","doi":"10.1093/molbev/msae157","DOIUrl":"10.1093/molbev/msae157","url":null,"abstract":"<p><p>Parallel evolution occurs when distinct lineages with similar ancestral states converge on a new phenotype. Parallel evolution has been well documented at the organ, gene pathway, and amino acid sequence level but in theory, it can also occur at individual nucleotides within noncoding regions. To examine the role of parallel evolution in shaping the biology of mammalian complex traits, we used data on single-nucleotide polymorphisms (SNPs) influencing human intraspecific variation to predict trait values in other species for 11 complex traits. We found that the alleles at SNP positions associated with human intraspecific height and red blood cell (RBC) count variation are associated with interspecific variation in the corresponding traits across mammals. These associations hold for deeper branches of mammalian evolution as well as between strains of collaborative cross mice. While variation in RBC count between primates uses both ancient and more recently evolved genomic regions, we found that only primate-specific elements were correlated with primate body size. We show that the SNP positions driving these signals are flanked by conserved sequences, maintain synteny with target genes, and overlap transcription factor binding sites. This work highlights the potential of conserved but tunable regulatory elements to be reused in parallel to facilitate evolutionary adaptation in mammals.</p>","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":11.0,"publicationDate":"2024-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11321361/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141788559","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Marc Beringer, Rimjhim Roy Choudhury, Terezie Mandáková, Sandra Grünig, Manuel Poretti, Ilia J Leitch, Martin A Lysak, Christian Parisod
The molecular underpinnings and consequences of cycles of whole-genome duplication (WGD) and subsequent gene loss through subgenome fractionation remain largely elusive. Endogenous drivers, such as transposable elements (TEs), have been postulated to shape genome-wide dominance and biased fractionation, leading to a conserved least-fractionated (LF) subgenome and a degenerated most-fractionated (MF) subgenome. In contrast, the role of exogenous factors, such as those induced by environmental stresses, has been overlooked. In this study, a chromosome-scale assembly of the alpine buckler mustard (Biscutella laevigata; Brassicaceae) that underwent a WGD event about 11 million years ago is coupled with transcriptional responses to heat, cold, drought, and herbivory to assess how gene expression is associated with differential gene retention across the MF and LF subgenomes. Counteracting the impact of TEs in reducing the expression and retention of nearby genes across the MF subgenome, dosage balance is highlighted as a main endogenous promoter of the retention of duplicated gene products under purifying selection. Consistent with the "turn a hobby into a job" model, about one-third of environment-responsive duplicates exhibit novel expression patterns, with one copy typically remaining conditionally expressed, whereas the other copy has evolved constitutive expression, highlighting exogenous factors as a major driver of gene retention. Showing uneven patterns of fractionation, with regions remaining unbiased, but with others showing high bias and significant enrichment in environment-responsive genes, this mesopolyploid genome presents evolutionary signatures consistent with an interplay of endogenous and exogenous factors having driven gene content following WGD-fractionation cycles.
{"title":"Biased Retention of Environment-Responsive Genes Following Genome Fractionation.","authors":"Marc Beringer, Rimjhim Roy Choudhury, Terezie Mandáková, Sandra Grünig, Manuel Poretti, Ilia J Leitch, Martin A Lysak, Christian Parisod","doi":"10.1093/molbev/msae155","DOIUrl":"10.1093/molbev/msae155","url":null,"abstract":"<p><p>The molecular underpinnings and consequences of cycles of whole-genome duplication (WGD) and subsequent gene loss through subgenome fractionation remain largely elusive. Endogenous drivers, such as transposable elements (TEs), have been postulated to shape genome-wide dominance and biased fractionation, leading to a conserved least-fractionated (LF) subgenome and a degenerated most-fractionated (MF) subgenome. In contrast, the role of exogenous factors, such as those induced by environmental stresses, has been overlooked. In this study, a chromosome-scale assembly of the alpine buckler mustard (Biscutella laevigata; Brassicaceae) that underwent a WGD event about 11 million years ago is coupled with transcriptional responses to heat, cold, drought, and herbivory to assess how gene expression is associated with differential gene retention across the MF and LF subgenomes. Counteracting the impact of TEs in reducing the expression and retention of nearby genes across the MF subgenome, dosage balance is highlighted as a main endogenous promoter of the retention of duplicated gene products under purifying selection. Consistent with the \"turn a hobby into a job\" model, about one-third of environment-responsive duplicates exhibit novel expression patterns, with one copy typically remaining conditionally expressed, whereas the other copy has evolved constitutive expression, highlighting exogenous factors as a major driver of gene retention. Showing uneven patterns of fractionation, with regions remaining unbiased, but with others showing high bias and significant enrichment in environment-responsive genes, this mesopolyploid genome presents evolutionary signatures consistent with an interplay of endogenous and exogenous factors having driven gene content following WGD-fractionation cycles.</p>","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":11.0,"publicationDate":"2024-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11306978/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141788631","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Lester J Pérez, Guy Baele, Samuel L Hong, Gavin A Cloherty, Michael G Berg
Severe fever with thrombocytopenia syndrome virus (SFTSV) is a tick-borne virus recognized by the World Health Organization as an emerging infectious disease of growing concern. Utilizing phylodynamic and phylogeographic methods, we have reconstructed the origin and transmission patterns of SFTSV lineages and the roles demographic, ecological, and climatic factors have played in shaping its emergence and spread throughout Asia. Environmental changes and fluctuations in tick populations, exacerbated by the widespread use of pesticides, have contributed significantly to its geographic expansion. The increased adaptability of Lineage L2 strains to the Haemaphysalis longicornis vector has facilitated the dispersal of SFTSV through Southeast Asia. Increased surveillance and proactive measures are needed to prevent further spread to Australia, Indonesia, and North America.
{"title":"Ecological Changes Exacerbating the Spread of Invasive Ticks has Driven the Dispersal of Severe Fever with Thrombocytopenia Syndrome Virus Throughout Southeast Asia.","authors":"Lester J Pérez, Guy Baele, Samuel L Hong, Gavin A Cloherty, Michael G Berg","doi":"10.1093/molbev/msae173","DOIUrl":"10.1093/molbev/msae173","url":null,"abstract":"<p><p>Severe fever with thrombocytopenia syndrome virus (SFTSV) is a tick-borne virus recognized by the World Health Organization as an emerging infectious disease of growing concern. Utilizing phylodynamic and phylogeographic methods, we have reconstructed the origin and transmission patterns of SFTSV lineages and the roles demographic, ecological, and climatic factors have played in shaping its emergence and spread throughout Asia. Environmental changes and fluctuations in tick populations, exacerbated by the widespread use of pesticides, have contributed significantly to its geographic expansion. The increased adaptability of Lineage L2 strains to the Haemaphysalis longicornis vector has facilitated the dispersal of SFTSV through Southeast Asia. Increased surveillance and proactive measures are needed to prevent further spread to Australia, Indonesia, and North America.</p>","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":"41 8","pages":""},"PeriodicalIF":11.0,"publicationDate":"2024-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11349436/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142080920","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Thomas Lund Koch, Samuel D Robinson, Paula Flórez Salcedo, Kevin Chase, Jason Biggs, Alexander E Fedosov, Mark Yandell, Baldomero M Olivera, Helena Safavi-Hemami
Venom systems are complex traits that have independently emerged multiple times in diverse plant and animal phyla. Within each venomous lineage there typically exists interspecific variation in venom composition where several factors have been proposed as drivers of variation, including phylogeny and diet. Understanding these factors is of broad biological interest and has implications for the development of antivenom therapies and venom-based drug discovery. Because of their high species richness and the presence of several major evolutionary prey shifts, venomous marine cone snails (genus Conus) provide an ideal system to investigate drivers of interspecific venom variation. Here, by analyzing the venom gland expression profiles of ∼3,000 toxin genes from 42 species of cone snail, we elucidate the role of prey-specific selection pressures in shaping venom variation. By analyzing overall venom composition and individual toxin structures, we demonstrate that the shifts from vermivory to piscivory in Conus are complemented by distinct changes in venom composition independent of phylogeny. In vivo injections of venom from piscivorous cone snails in fish further showed a higher potency compared with venom of nonpiscivores demonstrating a selective advantage. Together, our findings provide compelling evidence for the role of prey shifts in directing the venom composition of cone snails and expand our understanding of the mechanisms of venom variation and diversification.
{"title":"Prey Shifts Drive Venom Evolution in Cone Snails.","authors":"Thomas Lund Koch, Samuel D Robinson, Paula Flórez Salcedo, Kevin Chase, Jason Biggs, Alexander E Fedosov, Mark Yandell, Baldomero M Olivera, Helena Safavi-Hemami","doi":"10.1093/molbev/msae120","DOIUrl":"10.1093/molbev/msae120","url":null,"abstract":"<p><p>Venom systems are complex traits that have independently emerged multiple times in diverse plant and animal phyla. Within each venomous lineage there typically exists interspecific variation in venom composition where several factors have been proposed as drivers of variation, including phylogeny and diet. Understanding these factors is of broad biological interest and has implications for the development of antivenom therapies and venom-based drug discovery. Because of their high species richness and the presence of several major evolutionary prey shifts, venomous marine cone snails (genus Conus) provide an ideal system to investigate drivers of interspecific venom variation. Here, by analyzing the venom gland expression profiles of ∼3,000 toxin genes from 42 species of cone snail, we elucidate the role of prey-specific selection pressures in shaping venom variation. By analyzing overall venom composition and individual toxin structures, we demonstrate that the shifts from vermivory to piscivory in Conus are complemented by distinct changes in venom composition independent of phylogeny. In vivo injections of venom from piscivorous cone snails in fish further showed a higher potency compared with venom of nonpiscivores demonstrating a selective advantage. Together, our findings provide compelling evidence for the role of prey shifts in directing the venom composition of cone snails and expand our understanding of the mechanisms of venom variation and diversification.</p>","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":11.0,"publicationDate":"2024-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11296725/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141469542","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Mitchell Cummins, Cadel Watson, Richard J Edwards, John S Mattick
Ultraconserved elements (UCEs) were discovered two decades ago, arbitrarily defined as sequences that are identical over a length ≥200 bp in the human, mouse and rat genomes. The definition was subsequently extended to sequences ≥100 bp identical in at least three of five mammalian genomes (including dog and cow), and shown to have undergone rapid expansion from ancestors in fish and strong negative selection in birds and mammals. Since then, many more genomes have become available, allowing better definition and more thorough examination of UCE distribution and evolutionary history. We developed a fast and flexible analytical pipeline for identifying UCEs in multiple genomes, dedUCE, which allows manipulation of minimum length, sequence identity, and number of species with a detectable UCE according to specified parameters. We suggest an updated definition of UCEs as sequences ≥100 bp and ≥97% sequence identity in ≥50% of placental mammal orders (12813 UCEs). By mapping UCEs to ∼200 species we find that placental UCEs appeared early in vertebrate evolution, well before land colonisation, suggesting the evolutionary pressures driving UCE selection were present in aquatic environments in the Cambrian-Devonian periods. Most (>90%) UCEs likely appeared after the divergence of gnathostomes from jawless predecessors, were largely established in sequence identity by early Sarcopterygii evolution - before the divergence of lobe-finned fishes from tetrapods - and became near fixed in the amniotes. UCEs are mainly located in the introns of protein-coding and non-coding genes involved in neurological and skeletomuscular development, enriched in regulatory elements, and dynamically expressed throughout embryonic development.
{"title":"The evolution of ultraconserved elements in vertebrates.","authors":"Mitchell Cummins, Cadel Watson, Richard J Edwards, John S Mattick","doi":"10.1093/molbev/msae146","DOIUrl":"10.1093/molbev/msae146","url":null,"abstract":"<p><p>Ultraconserved elements (UCEs) were discovered two decades ago, arbitrarily defined as sequences that are identical over a length ≥200 bp in the human, mouse and rat genomes. The definition was subsequently extended to sequences ≥100 bp identical in at least three of five mammalian genomes (including dog and cow), and shown to have undergone rapid expansion from ancestors in fish and strong negative selection in birds and mammals. Since then, many more genomes have become available, allowing better definition and more thorough examination of UCE distribution and evolutionary history. We developed a fast and flexible analytical pipeline for identifying UCEs in multiple genomes, dedUCE, which allows manipulation of minimum length, sequence identity, and number of species with a detectable UCE according to specified parameters. We suggest an updated definition of UCEs as sequences ≥100 bp and ≥97% sequence identity in ≥50% of placental mammal orders (12813 UCEs). By mapping UCEs to ∼200 species we find that placental UCEs appeared early in vertebrate evolution, well before land colonisation, suggesting the evolutionary pressures driving UCE selection were present in aquatic environments in the Cambrian-Devonian periods. Most (>90%) UCEs likely appeared after the divergence of gnathostomes from jawless predecessors, were largely established in sequence identity by early Sarcopterygii evolution - before the divergence of lobe-finned fishes from tetrapods - and became near fixed in the amniotes. UCEs are mainly located in the introns of protein-coding and non-coding genes involved in neurological and skeletomuscular development, enriched in regulatory elements, and dynamically expressed throughout embryonic development.</p>","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":11.0,"publicationDate":"2024-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141627180","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Clara Iglhaut, Jūlija Pečerska, Manuel Gil, Maria Anisimova
Despite having important biological implications, insertion, and deletion (indel) events are often disregarded or mishandled during phylogenetic inference. In multiple sequence alignment, indels are represented as gaps and are estimated without considering the distinct evolutionary history of insertions and deletions. Consequently, indels are usually excluded from subsequent inference steps, such as ancestral sequence reconstruction and phylogenetic tree search. Here, we introduce indel-aware parsimony (indelMaP), a novel way to treat gaps under the parsimony criterion by considering insertions and deletions as separate evolutionary events and accounting for long indels. By identifying the precise location of an evolutionary event on the tree, we can separate overlapping indel events and use affine gap penalties for long indel modeling. Our indel-aware approach harnesses the phylogenetic signal from indels, including them into all inference stages. Validation and comparison to state-of-the-art inference tools on simulated data show that indelMaP is most suitable for densely sampled datasets with closely to moderately related sequences, where it can reach alignment quality comparable to probabilistic methods and accurately infer ancestral sequences, including indel patterns. Due to its remarkable speed, our method is well suited for epidemiological datasets, eliminating the need for downsampling and enabling the exploitation of the additional information provided by dense taxonomic sampling. Moreover, indelMaP offers new insights into the indel patterns of biologically significant sequences and advances our understanding of genetic variability by considering gaps as crucial evolutionary signals rather than mere artefacts.
{"title":"Please Mind the Gap: Indel-Aware Parsimony for Fast and Accurate Ancestral Sequence Reconstruction and Multiple Sequence Alignment Including Long Indels.","authors":"Clara Iglhaut, Jūlija Pečerska, Manuel Gil, Maria Anisimova","doi":"10.1093/molbev/msae109","DOIUrl":"10.1093/molbev/msae109","url":null,"abstract":"<p><p>Despite having important biological implications, insertion, and deletion (indel) events are often disregarded or mishandled during phylogenetic inference. In multiple sequence alignment, indels are represented as gaps and are estimated without considering the distinct evolutionary history of insertions and deletions. Consequently, indels are usually excluded from subsequent inference steps, such as ancestral sequence reconstruction and phylogenetic tree search. Here, we introduce indel-aware parsimony (indelMaP), a novel way to treat gaps under the parsimony criterion by considering insertions and deletions as separate evolutionary events and accounting for long indels. By identifying the precise location of an evolutionary event on the tree, we can separate overlapping indel events and use affine gap penalties for long indel modeling. Our indel-aware approach harnesses the phylogenetic signal from indels, including them into all inference stages. Validation and comparison to state-of-the-art inference tools on simulated data show that indelMaP is most suitable for densely sampled datasets with closely to moderately related sequences, where it can reach alignment quality comparable to probabilistic methods and accurately infer ancestral sequences, including indel patterns. Due to its remarkable speed, our method is well suited for epidemiological datasets, eliminating the need for downsampling and enabling the exploitation of the additional information provided by dense taxonomic sampling. Moreover, indelMaP offers new insights into the indel patterns of biologically significant sequences and advances our understanding of genetic variability by considering gaps as crucial evolutionary signals rather than mere artefacts.</p>","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":11.0,"publicationDate":"2024-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11221656/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141262020","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Shi-Zhi Wang, Yu Yan, Malin Widlund, Chen-Chang Qian, Liang-Liang Zhang, Shao-Jie Zhang, Zi-Mai Li, Peng Cao, Qing-Yan Dai, Xiao-Tian Feng, Feng Liu, Lu Wang, Chao Gao, Qiao-Mei Fu, Marjo K Hytönen, Hannes Lohi, Peter Savolainen, Guo-Dong Wang
The origins and extreme morphological evolution of the modern dog breeds are poorly studied because the founder populations are extinct. Here, we analyse eight 100 to 200 years old dog fur samples obtained from traditional North Swedish clothing, to explore the origin and artificial selection of the modern Nordic Lapphund and Elkhound dog breeds. Population genomic analysis confirmed the Lapphund and Elkhound breeds to originate from the local dog population, and showed a distinct decrease in genetic diversity in agreement with intense breeding. We identified eleven genes under positive selection during the breed development. In particular, the MSRB3 gene, associated with breed-related ear morphology, was selected in all Lapphund and Elkhound breeds, and functional assays showed that a SNP mutation in the 3'UTR region suppresses its expression through miRNA regulation. Our findings demonstrate analysis of near-modern dog artifacts as an effective tool for interpreting the origin and artificial selection of the modern dog breeds.
{"title":"Historic dog Furs Unravel the Origin and Artificial Selection of Modern Nordic Lapphund and Elkhound dog Breeds.","authors":"Shi-Zhi Wang, Yu Yan, Malin Widlund, Chen-Chang Qian, Liang-Liang Zhang, Shao-Jie Zhang, Zi-Mai Li, Peng Cao, Qing-Yan Dai, Xiao-Tian Feng, Feng Liu, Lu Wang, Chao Gao, Qiao-Mei Fu, Marjo K Hytönen, Hannes Lohi, Peter Savolainen, Guo-Dong Wang","doi":"10.1093/molbev/msae108","DOIUrl":"10.1093/molbev/msae108","url":null,"abstract":"<p><p>The origins and extreme morphological evolution of the modern dog breeds are poorly studied because the founder populations are extinct. Here, we analyse eight 100 to 200 years old dog fur samples obtained from traditional North Swedish clothing, to explore the origin and artificial selection of the modern Nordic Lapphund and Elkhound dog breeds. Population genomic analysis confirmed the Lapphund and Elkhound breeds to originate from the local dog population, and showed a distinct decrease in genetic diversity in agreement with intense breeding. We identified eleven genes under positive selection during the breed development. In particular, the MSRB3 gene, associated with breed-related ear morphology, was selected in all Lapphund and Elkhound breeds, and functional assays showed that a SNP mutation in the 3'UTR region suppresses its expression through miRNA regulation. Our findings demonstrate analysis of near-modern dog artifacts as an effective tool for interpreting the origin and artificial selection of the modern dog breeds.</p>","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":11.0,"publicationDate":"2024-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11226788/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141261992","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Thomas Decroly, Roger Vila, Konrad Lohse, Alexander Mackintosh
Natural selection is less efficient in the absence of recombination. As a result, nonrecombining sequences, such as sex chromosomes, tend to degenerate over time. Although the outcomes of recombination arrest are typically observed after many millions of generations, recent neo-sex chromosomes can give insight into the early stages of this process. Here, we investigate the evolution of neo-sex chromosomes in the Spanish marbled white butterfly, Melanargia ines, where a Z-autosome fusion has turned the homologous autosome into a nonrecombining neo-W chromosome. We show that these neo-sex chromosomes are likely limited to the Iberian population of M. ines, and that they arose around the time when this population split from North-African populations, around 1.5 million years ago. Recombination arrest of the neo-W chromosome has led to an excess of premature stop-codons and frame-shift mutations, and reduced gene expression compared to the neo-Z chromosome. Surprisingly, we identified two regions of ∼1 Mb at one end of the neo-W that are both less diverged from the neo-Z and less degraded than the rest of the chromosome, suggesting a history of rare but repeated genetic exchange between the two neo-sex chromosomes. These plateaus of neo-sex chromosome divergence suggest that neo-W degradation can be locally reversed by rare recombination between neo-W and neo-Z chromosomes.
在没有重组的情况下,自然选择的效率较低。因此,非重组序列(如性染色体)往往会随着时间的推移而退化。虽然通常要经过数百万代才能观察到重组停止的结果,但最近的新性染色体可以让人们了解这一过程的早期阶段。在这里,我们研究了西班牙大理石纹白丁鱼(Melanargia ines)中新性染色体的演化过程,Z-自体融合将同源自体变成了非重组的新W染色体。我们的研究表明,这些新性染色体很可能仅限于伊比利亚种群中的M. ines,而且它们是在大约150万年前该种群从北非种群中分离出来时出现的。与新 Z 染色体相比,新 W 染色体的重组停滞导致过早终止密码子和框架转换突变过多,基因表达减少。令人惊奇的是,我们在新W染色体的一端发现了两个长约1 Mb的区域,它们与新Z染色体的分化程度较低,退化程度也低于染色体的其他部分,这表明两条新性染色体之间存在着罕见但重复的基因交换历史。这些新性染色体差异的高原表明,新W染色体和新Z染色体之间的罕见重组可以局部逆转新W染色体的退化。
{"title":"Rewinding the Ratchet: Rare Recombination Locally Rescues Neo-W Degeneration and Generates Plateaus of Sex-Chromosome Divergence.","authors":"Thomas Decroly, Roger Vila, Konrad Lohse, Alexander Mackintosh","doi":"10.1093/molbev/msae124","DOIUrl":"10.1093/molbev/msae124","url":null,"abstract":"<p><p>Natural selection is less efficient in the absence of recombination. As a result, nonrecombining sequences, such as sex chromosomes, tend to degenerate over time. Although the outcomes of recombination arrest are typically observed after many millions of generations, recent neo-sex chromosomes can give insight into the early stages of this process. Here, we investigate the evolution of neo-sex chromosomes in the Spanish marbled white butterfly, Melanargia ines, where a Z-autosome fusion has turned the homologous autosome into a nonrecombining neo-W chromosome. We show that these neo-sex chromosomes are likely limited to the Iberian population of M. ines, and that they arose around the time when this population split from North-African populations, around 1.5 million years ago. Recombination arrest of the neo-W chromosome has led to an excess of premature stop-codons and frame-shift mutations, and reduced gene expression compared to the neo-Z chromosome. Surprisingly, we identified two regions of ∼1 Mb at one end of the neo-W that are both less diverged from the neo-Z and less degraded than the rest of the chromosome, suggesting a history of rare but repeated genetic exchange between the two neo-sex chromosomes. These plateaus of neo-sex chromosome divergence suggest that neo-W degradation can be locally reversed by rare recombination between neo-W and neo-Z chromosomes.</p>","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":" ","pages":""},"PeriodicalIF":11.0,"publicationDate":"2024-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11232697/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141476886","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Natalia A S Przelomska, Rudy A Diaz, Fabio Andrés Ávila, Gustavo A Ballen, Rocío Cortés-B, Logan Kistler, Daniel H Chitwood, Martha Charitonidou, Susanne S Renner, Oscar A Pérez-Escobar, Alexandre Antonelli
South American coca (Erythroxylum coca and E. novogranatense) has been a keystone crop for many Andean and Amazonian communities for at least 8,000 years. However, over the last half-century, global demand for its alkaloid cocaine has driven intensive agriculture of this plant and placed it in the center of armed conflict and deforestation. To monitor the changing landscape of coca plantations, the United Nations Office on Drugs and Crime collects annual data on their areas of cultivation. However, attempts to delineate areas in which different varieties are grown have failed due to limitations around identification. In the absence of flowers, identification relies on leaf morphology, yet the extent to which this is reflected in taxonomy is uncertain. Here, we analyze the consistency of the current naming system of coca and its four closest wild relatives (the "coca clade"), using morphometrics, phylogenomics, molecular clocks, and population genomics. We include name-bearing type specimens of coca's closest wild relatives E. gracilipes and E. cataractarum. Morphometrics of 342 digitized herbarium specimens show that leaf shape and size fail to reliably discriminate between species and varieties. However, the statistical analyses illuminate that rounder and more obovate leaves of certain varieties could be associated with the subtle domestication syndrome of coca. Our phylogenomic data indicate extensive gene flow involving E. gracilipes which, combined with morphometrics, supports E. gracilipes being retained as a single species. Establishing a robust evolutionary-taxonomic framework for the coca clade will facilitate the development of cost-effective genotyping methods to support reliable identification.
至少在 8000 年前,南美洲古柯(Erythroxylum coca 和 E. novogranatense)一直是许多安第斯和亚马逊社区的主要作物。然而,在过去的半个世纪里,全球对可卡因生物碱的需求推动了这种植物的集约化种植,并将其置于武装冲突和森林砍伐的中心。为了监测古柯种植园地貌的变化,联合国毒品和犯罪问题办公室每年都会收集古柯种植面积的数据。然而,由于识别方面的限制,划定不同品种种植区域的尝试都以失败告终。在没有花朵的情况下,识别依赖于叶片形态,但这在分类学中的反映程度并不确定。在这里,我们利用形态计量学、系统发生组学、分子钟和种群基因组学分析了古柯及其四种近缘野生植物("古柯支系")当前命名系统的一致性。我们将古柯最近的野生近缘植物 E. gracilipes 和 E. cataractarum 的命名模式标本包括在内。对 342 份数字化标本馆标本的形态计量学分析表明,叶片形状和大小不能可靠地区分物种和变种。不过,统计分析表明,某些品种的叶片更圆、更倒卵形,可能与古柯的微妙驯化综合症有关。我们的系统发生组数据表明,涉及 E. gracilipes 的基因流非常广泛,结合形态计量学,支持将 E. gracilipes 保留为一个物种。为古柯支系建立一个强大的进化-分类框架将有助于开发具有成本效益的基因分型方法,以支持可靠的鉴定。
{"title":"Morphometrics and Phylogenomics of Coca (Erythroxylum spp.) Illuminate Its Reticulate Evolution, With Implications for Taxonomy.","authors":"Natalia A S Przelomska, Rudy A Diaz, Fabio Andrés Ávila, Gustavo A Ballen, Rocío Cortés-B, Logan Kistler, Daniel H Chitwood, Martha Charitonidou, Susanne S Renner, Oscar A Pérez-Escobar, Alexandre Antonelli","doi":"10.1093/molbev/msae114","DOIUrl":"10.1093/molbev/msae114","url":null,"abstract":"<p><p>South American coca (Erythroxylum coca and E. novogranatense) has been a keystone crop for many Andean and Amazonian communities for at least 8,000 years. However, over the last half-century, global demand for its alkaloid cocaine has driven intensive agriculture of this plant and placed it in the center of armed conflict and deforestation. To monitor the changing landscape of coca plantations, the United Nations Office on Drugs and Crime collects annual data on their areas of cultivation. However, attempts to delineate areas in which different varieties are grown have failed due to limitations around identification. In the absence of flowers, identification relies on leaf morphology, yet the extent to which this is reflected in taxonomy is uncertain. Here, we analyze the consistency of the current naming system of coca and its four closest wild relatives (the \"coca clade\"), using morphometrics, phylogenomics, molecular clocks, and population genomics. We include name-bearing type specimens of coca's closest wild relatives E. gracilipes and E. cataractarum. Morphometrics of 342 digitized herbarium specimens show that leaf shape and size fail to reliably discriminate between species and varieties. However, the statistical analyses illuminate that rounder and more obovate leaves of certain varieties could be associated with the subtle domestication syndrome of coca. Our phylogenomic data indicate extensive gene flow involving E. gracilipes which, combined with morphometrics, supports E. gracilipes being retained as a single species. Establishing a robust evolutionary-taxonomic framework for the coca clade will facilitate the development of cost-effective genotyping methods to support reliable identification.</p>","PeriodicalId":18730,"journal":{"name":"Molecular biology and evolution","volume":"41 7","pages":""},"PeriodicalIF":11.0,"publicationDate":"2024-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11233275/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141563759","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}