Robert Kesälahti, Timo A Kumpula, Sandra Cervantes, Sonja T Kujala, Tiina M Mattila, Jaakko S Tyrmi, Alina K Niskanen, Pasi Rastas, Outi Savolainen, Tanja Pyhäjärvi
Large and highly repetitive genomes are common. However, research interests usually lie within the non-repetitive parts of the genome, as they are more likely functional, and can be used to answer questions related to adaptation, selection and evolutionary history. Exome capture is a cost-effective method for providing sequencing data from protein-coding parts of the genes. C0t-1 DNA blockers consist of repetitive DNA and are used in exome captures to prevent the hybridisation of repetitive DNA sequences to capture baits or bait-bound genomic DNA. Universal blockers target repetitive regions shared by many species, while species-specific c0t-1 DNA is prepared from the DNA of the studied species, thus perfectly matching the repetitive DNA contents of the species. So far, the use of species-specific c0t-1 DNA has been limited to a few model species. Here, we evaluated the performance of blocker treatments in exome captures of Pinus sylvestris, a widely distributed conifer species with a large (> 20 Gbp) and highly repetitive genome. We compared treatment with a commercial universal blocker to treatments with species-specific c0t-1 (30,000 and 60,000 ng). Species-specific c0t-1 captured more unique exons than the initial set of targets leading to increased SNP discovery and reduced sequencing of tandem repeats compared to the universal blocker. Based on our results, we recommend optimising exome captures using at least 60,000 ng of species-specific c0t-1 DNA. It is relatively easy and fast to prepare and can also be used with existing bait set designs.
{"title":"Optimising Exome Captures in Species With Large Genomes Using Species-Specific Repetitive DNA Blocker.","authors":"Robert Kesälahti, Timo A Kumpula, Sandra Cervantes, Sonja T Kujala, Tiina M Mattila, Jaakko S Tyrmi, Alina K Niskanen, Pasi Rastas, Outi Savolainen, Tanja Pyhäjärvi","doi":"10.1111/1755-0998.14053","DOIUrl":"https://doi.org/10.1111/1755-0998.14053","url":null,"abstract":"<p><p>Large and highly repetitive genomes are common. However, research interests usually lie within the non-repetitive parts of the genome, as they are more likely functional, and can be used to answer questions related to adaptation, selection and evolutionary history. Exome capture is a cost-effective method for providing sequencing data from protein-coding parts of the genes. C0t-1 DNA blockers consist of repetitive DNA and are used in exome captures to prevent the hybridisation of repetitive DNA sequences to capture baits or bait-bound genomic DNA. Universal blockers target repetitive regions shared by many species, while species-specific c0t-1 DNA is prepared from the DNA of the studied species, thus perfectly matching the repetitive DNA contents of the species. So far, the use of species-specific c0t-1 DNA has been limited to a few model species. Here, we evaluated the performance of blocker treatments in exome captures of Pinus sylvestris, a widely distributed conifer species with a large (> 20 Gbp) and highly repetitive genome. We compared treatment with a commercial universal blocker to treatments with species-specific c0t-1 (30,000 and 60,000 ng). Species-specific c0t-1 captured more unique exons than the initial set of targets leading to increased SNP discovery and reduced sequencing of tandem repeats compared to the universal blocker. Based on our results, we recommend optimising exome captures using at least 60,000 ng of species-specific c0t-1 DNA. It is relatively easy and fast to prepare and can also be used with existing bait set designs.</p>","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":" ","pages":"e14053"},"PeriodicalIF":5.5,"publicationDate":"2024-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142845520","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Landscape genomic approaches for detecting genotype-environment associations (GEA), isolation by distance (IBD) and isolation by environment (IBE) have seen a dramatic increase in use, but there have been few thorough analyses of the influence of sampling strategy on their performance under realistic genomic and environmental conditions. We simulated 24,000 datasets across a range of scenarios with complex population dynamics and realistic landscape structure to evaluate the effects of the spatial distribution and number of samples on common landscape genomics methods. Our results show that common analyses are relatively robust to sampling scheme as long as sampling covers enough environmental and geographic space. We found that for detecting adaptive loci and estimating IBE, sampling schemes that were explicitly designed to increase coverage of available environmental space matched or outperformed sampling schemes that only considered geographic space. When sampling does not cover adequate geographic and environmental space, such as with transect-based sampling, we detected fewer adaptive loci and had higher error when estimating IBD and IBE. We found that IBD could be detected with as few as nine sampling sites, while large sample sizes (e.g., greater than 100 individuals) were crucial for detecting adaptive loci and IBE. We also demonstrate that, even with optimal sampling strategies, landscape genomic analyses are highly sensitive to landscape structure and migration-when spatial autocorrelation and migration are weak, common GEA methods fail to detect adaptive loci.
{"title":"Optimising Sampling Design for Landscape Genomics.","authors":"Anusha P Bishop, Drew E Terasaki Hart, Ian J Wang","doi":"10.1111/1755-0998.14052","DOIUrl":"https://doi.org/10.1111/1755-0998.14052","url":null,"abstract":"<p><p>Landscape genomic approaches for detecting genotype-environment associations (GEA), isolation by distance (IBD) and isolation by environment (IBE) have seen a dramatic increase in use, but there have been few thorough analyses of the influence of sampling strategy on their performance under realistic genomic and environmental conditions. We simulated 24,000 datasets across a range of scenarios with complex population dynamics and realistic landscape structure to evaluate the effects of the spatial distribution and number of samples on common landscape genomics methods. Our results show that common analyses are relatively robust to sampling scheme as long as sampling covers enough environmental and geographic space. We found that for detecting adaptive loci and estimating IBE, sampling schemes that were explicitly designed to increase coverage of available environmental space matched or outperformed sampling schemes that only considered geographic space. When sampling does not cover adequate geographic and environmental space, such as with transect-based sampling, we detected fewer adaptive loci and had higher error when estimating IBD and IBE. We found that IBD could be detected with as few as nine sampling sites, while large sample sizes (e.g., greater than 100 individuals) were crucial for detecting adaptive loci and IBE. We also demonstrate that, even with optimal sampling strategies, landscape genomic analyses are highly sensitive to landscape structure and migration-when spatial autocorrelation and migration are weak, common GEA methods fail to detect adaptive loci.</p>","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":" ","pages":"e14052"},"PeriodicalIF":5.5,"publicationDate":"2024-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142845344","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Ellie E Armstrong, Chenyang Li, Michael G Campana, Tessa Ferrari, Joanna L Kelley, Dmitri A Petrov, Katherine A Solari, Jazlyn A Mooney
Despite substantial reductions in the cost of sequencing over the last decade, genetic panels remain relevant due to their cost-effectiveness and flexibility across a variety of sample types. In particular, single nucleotide polymorphism (SNP) panels are increasingly favoured for conservation applications. SNP panels are often used because of their adaptability, effectiveness with low-quality samples, and cost-efficiency for population monitoring and forensics. However, the selection of diagnostic SNPs for population assignment and individual identification can be challenging. The consequences of poor SNP selection are under-powered panels, inaccurate results, and monetary loss. Here, we develop a novel and user-friendly SNP selection pipeline (mPCRselect) that can be used to select SNPs for population assignment and/or individual identification. mPCRselect allows any researcher, who has sufficient SNP-level data, to design a successful and cost-effective SNP panel for a diploid species of conservation concern.
{"title":"A Pipeline and Recommendations for Population and Individual Diagnostic SNP Selection in Non-Model Species.","authors":"Ellie E Armstrong, Chenyang Li, Michael G Campana, Tessa Ferrari, Joanna L Kelley, Dmitri A Petrov, Katherine A Solari, Jazlyn A Mooney","doi":"10.1111/1755-0998.14048","DOIUrl":"https://doi.org/10.1111/1755-0998.14048","url":null,"abstract":"<p><p>Despite substantial reductions in the cost of sequencing over the last decade, genetic panels remain relevant due to their cost-effectiveness and flexibility across a variety of sample types. In particular, single nucleotide polymorphism (SNP) panels are increasingly favoured for conservation applications. SNP panels are often used because of their adaptability, effectiveness with low-quality samples, and cost-efficiency for population monitoring and forensics. However, the selection of diagnostic SNPs for population assignment and individual identification can be challenging. The consequences of poor SNP selection are under-powered panels, inaccurate results, and monetary loss. Here, we develop a novel and user-friendly SNP selection pipeline (mPCRselect) that can be used to select SNPs for population assignment and/or individual identification. mPCRselect allows any researcher, who has sufficient SNP-level data, to design a successful and cost-effective SNP panel for a diploid species of conservation concern.</p>","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":" ","pages":"e14048"},"PeriodicalIF":5.5,"publicationDate":"2024-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142749503","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Ya Wang, Wei Dong, Yufan Liang, Weiwei Lin, Junhao Chen, Robert Henry, Fei Chen
The dimensions of phylogenetic research have expanded to encompass the study of large-scale populations at the microevolutionary level and comparisons between different species or taxonomic units at the macroevolutionary level. Traditional phylogenetic tools often struggle to handle the diverse and complex data required for these different evolutionary scales. In response to this challenge, we introduce PhyloForge, a robust tool designed to seamlessly integrate the demands of both micro- and macroevolution, comprehensively utilising diverse phylogenomic signals, such as genes, SNPs, and structural variations, as well as mitochondrial and chloroplast genomes. PhyloForge's innovation lies in its capability to seamlessly integrate multiple phylogenomic signals, enabling the unified analysis of multidimensional genomic data. This unique feature empowers researchers to gain a more comprehensive understanding of diverse aspects of biological evolution. PhyloForge not only provides highly customisable analysis tools for experienced researchers but also features an intuitively designed interface, facilitating effortless phylogenetic analysis for beginners. Extensive testing across various domains, including animals, plants and fungi, attests to its broad applicability in the field of phylogenetics. In summary, PhyloForge has significant potential in the era of large-scale genomics, offering a new perspective and toolset for a deeper understanding of the evolution of life. PhyloForge codes could be found in GitHub (https://github.com/wangyayaya/PhyloForge/), and the program could be installed in Conda (https://anaconda.org/wangxiaobei/phyloforge).
{"title":"PhyloForge: Unifying Micro- and Macroevolution With Comprehensive Genomic Signals.","authors":"Ya Wang, Wei Dong, Yufan Liang, Weiwei Lin, Junhao Chen, Robert Henry, Fei Chen","doi":"10.1111/1755-0998.14050","DOIUrl":"https://doi.org/10.1111/1755-0998.14050","url":null,"abstract":"<p><p>The dimensions of phylogenetic research have expanded to encompass the study of large-scale populations at the microevolutionary level and comparisons between different species or taxonomic units at the macroevolutionary level. Traditional phylogenetic tools often struggle to handle the diverse and complex data required for these different evolutionary scales. In response to this challenge, we introduce PhyloForge, a robust tool designed to seamlessly integrate the demands of both micro- and macroevolution, comprehensively utilising diverse phylogenomic signals, such as genes, SNPs, and structural variations, as well as mitochondrial and chloroplast genomes. PhyloForge's innovation lies in its capability to seamlessly integrate multiple phylogenomic signals, enabling the unified analysis of multidimensional genomic data. This unique feature empowers researchers to gain a more comprehensive understanding of diverse aspects of biological evolution. PhyloForge not only provides highly customisable analysis tools for experienced researchers but also features an intuitively designed interface, facilitating effortless phylogenetic analysis for beginners. Extensive testing across various domains, including animals, plants and fungi, attests to its broad applicability in the field of phylogenetics. In summary, PhyloForge has significant potential in the era of large-scale genomics, offering a new perspective and toolset for a deeper understanding of the evolution of life. PhyloForge codes could be found in GitHub (https://github.com/wangyayaya/PhyloForge/), and the program could be installed in Conda (https://anaconda.org/wangxiaobei/phyloforge).</p>","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":" ","pages":"e14050"},"PeriodicalIF":5.5,"publicationDate":"2024-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142714806","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Daniel E Ruzzante, Gregory R McCracken, Dylan J Fraser, John MacMillan, Colin Buhariwalla, Joanna Mills Flemming
<p><p>Although efforts to estimate effective population size, census size and their ratio in wild populations are expanding, few empirical studies investigate interannual changes in these parameters. Hence, we do not know how repeatable or representative many estimates may be. Answering this question requires studies of long-term population dynamics. Here we took advantage of a rich dataset of seven brook trout (Salvelinus fontinalis) populations, 5 consecutive years and 5400 individuals genotyped at 33 microsatellites to examine variation in estimates of effective and census size and in their ratio. We first estimated the annual effective number of breeders ( <math> <semantics> <mrow><mover><mi>N</mi> <mo>̂</mo></mover> </mrow> <annotation>$$ hat{N} $$</annotation></semantics> </math> <sub>b</sub>) using individuals aged 1+. We then adjusted these estimates using two life history traits, to obtain <math> <semantics> <mrow> <msub><mover><mi>N</mi> <mo>̂</mo></mover> <mrow><mi>b</mi> <mfenced><mrow><mi>adj</mi> <mn>2</mn></mrow> </mfenced> </mrow> </msub> </mrow> <annotation>$$ {hat{N}}_{b(adj2)} $$</annotation></semantics> </math> and subsequently, <math> <semantics> <mrow> <msub><mover><mi>N</mi> <mo>̂</mo></mover> <mrow><mi>e</mi> <mfenced><mrow><mi>adj</mi> <mn>2</mn></mrow> </mfenced> </mrow> </msub> </mrow> <annotation>$$ {hat{N}}_{e(adj2)} $$</annotation></semantics> </math> following Waples et al. (2013). <math> <semantics> <mrow> <msub><mover><mi>N</mi> <mo>̂</mo></mover> <mrow><mi>e</mi> <mfenced><mrow><mi>adj</mi> <mn>2</mn></mrow> </mfenced> </mrow> </msub> </mrow> <annotation>$$ {hat{N}}_{e(adj2)} $$</annotation></semantics> </math> was estimated for the years 2014 to 2019. Census size was estimated by mark recapture using double-pass electrofishing ( <math> <semantics> <mrow> <msub><mover><mi>N</mi> <mo>̂</mo></mover> <mrow><mi>c</mi> <mfenced><mi>MR</mi></mfenced> </mrow> </msub> </mrow> <annotation>$$ {hat{N}}_{c(MR)} $$</annotation></semantics> </math> ) (years 2014-2018) as well as by the Close Kin Mark Recapture approach ( <math> <semantics> <mrow> <msub><mover><mi>N</mi> <mo>̂</mo></mover> <mrow><mi>c</mi> <mfenced><mtext>CKMR</mtext></mfenced> </mrow> </msub> </mrow> <annotation>$$ {hat{N}}_{c(CKMR)} $$</annotation></semantics> </math> ) (years 2015-2017). Within populations, annual variation in <math> <semantics> <mrow> <msub><mover><mi>N</mi> <mo>̂</mo></mover> <mrow><mi>e</mi> <mfenced><mrow><mi>adj</mi> <mn>2</mn></mrow> </mfenced> </mrow> </msub> </mrow> <annotation>$$ {hat{N}}_{e(adj2)} $$</annotation></semantics> </math> (ratio of maximum to minimum <math> <semantics> <mrow> <msub><mover><mi>N</mi> <mo>̂</mo></mover> <mrow><mi>e</mi> <mfenced><mrow><mi>adj</mi> <mn>2</mn></mrow> </mfenced> </mrow> </msub> </mrow> <annotation>$$ {hat{N}}_{e(adj2)} $$</annotation></semantics> </math> ) ranged from 1.6-fold to 58-fold. Over all 7 populations, the median annual variation in <math> <semantics> <mrow> <msub><mover><mi>N</mi>
尽管估算野生种群有效种群数量、普查规模及其比例的工作正在扩大,但很少有实证研究调查这些参数的年际变化。因此,我们不知道许多估计值的可重复性或代表性如何。要回答这个问题,需要对长期种群动态进行研究。在这里,我们利用了一个包含 7 个溪鳟(Salvelinus fontinalis)种群、连续 5 年、5400 个个体、33 个微卫星基因分型的丰富数据集,来研究有效种群规模和普查种群规模的估计值及其比例的变化。我们首先使用 1 岁以上的个体估算了繁殖者的年有效数量(N ̂ $hat{N} $$ b)。然后,我们利用两种生活史特征对这些估计值进行调整,得出 N ̂ b adj 2 $$ {hat{N}}_{b(adj2)} $$,随后按照 Waples 等人(2013 年)的方法得出 N ̂ e adj 2 $$ {hat{N}}_{e(adj2)} $$。 N ̂ e adj 2 $$ {hat{N}}_{e(adj2)}$ 是 2014 年至 2019 年的估计值。普查规模是通过使用双通电鱼的标记再捕法(N ̂ c MR $$ {hat{N}}_{c(MR)} $$ )(2014-2018年)以及近亲标记再捕法(N ̂ c CKMR $$ {hat{N}}_{c(CKMR)} $$ )(2015-2017年)估算的。在种群内部,N ̂ e adj 2 $$ {hat{N}}_{e(adj2)} $$ (最大 N ̂ e adj 2 $$ {hat{N}}_{e(adj2)} $$ 与最小 N ̂ e adj 2 $$ {hat{N}}_{e(adj2)} $ 之比)的年变化范围从 1.6 倍到 58 倍不等。在所有 7 个种群中,N ̂ e adj 2 $$ {hat{N}}_{e(adj2)}$ 的年变化中位数约为 5 倍。这些结果反映了繁殖成功率差异的重要年际变化,以及更普遍的种群动态变化。种群内 N ̂ c MR $$ {hat{N}}_{c(MR)}$ 在不同年份的变化系数(中位数)为 2.7,范围在 2 到 4.3 之间。因此,估计有效规模的变化几乎是估计普查规模变化的两倍。因此,我们的结果表明,至少在本研究考察的小型种群中,任何单一的 N ̂ e adj 2 $$ {hat{N}}_{e(adj2)}$ 的年度估计值都不太可能代表长期动态。至少需要 3-4 个年度估计值才能真正代表当代有效规模的估计值。然后,我们将 N ̂ c MR $$ {hat{N}}_{c(MR)}$ 与 N ̂ c CKMR $$ {hat{N}}_{c(CKMR)}$ 进行了比较。对于 7 个种群中的 5 个种群,基于标记重捕的种群丰度估计值(N ̂ c MR $$ {hat{N}}_{c(MR)} $$)与基于近亲标记重捕的种群丰度估计值(N ̂ c CKMR $$ {hat{N}}_{c(CKMR)} $$)无法区分。N ̂ c MR $$ {hat{N}}_{c(MR)}$ 和 N ̂ c CKMR $ {hat{N}}_{c(CKMR)}$ 不一致的两个种群表现出极低的 N ̂ e adj2 / N ̂ c MR $$ {hat{N}}_{e(adj2)}/{hat{N}}_{c(MR)}$ 的比率和 N ̂ e adj 2 $$ {hat{ N}}_{e(adj2)}$ 的年度变化最大(58-.倍和 35.4 倍)。这些结果表明,在这两条溪流中的取样工作可能不足以正确捕获整个种群的遗传多样性,而且所取样的个体在种群中不具有代表性。我们的研究进一步验证了 CKMR 作为估算野生种群丰度的方法的有效性,并证明了如何利用 N ̂ e $$ {hat{N}}_e $ 的时间变化知识来识别 N ̂ c MR $$ {hat{N}}_{c(MR)}$ 与 N ̂ c CKMR $$ {hat{N}}_{c(CKMR)}$ 之间差异的潜在来源。
{"title":"<ArticleTitle xmlns:ns0=\"http://www.w3.org/1998/Math/MathML\">Temporal Variability in Effective Size ( <ns0:math> <ns0:semantics> <ns0:mrow> <ns0:msub><ns0:mover><ns0:mi>N</ns0:mi> <ns0:mo>̂</ns0:mo></ns0:mover> <ns0:mi>e</ns0:mi></ns0:msub> </ns0:mrow> <ns0:annotation>$$ {hat{N}}_e $$</ns0:annotation></ns0:semantics> </ns0:math> ) Identifies Potential Sources of Discrepancies Between Mark Recapture and Close Kin Mark Recapture Estimates of Population Abundance.","authors":"Daniel E Ruzzante, Gregory R McCracken, Dylan J Fraser, John MacMillan, Colin Buhariwalla, Joanna Mills Flemming","doi":"10.1111/1755-0998.14047","DOIUrl":"https://doi.org/10.1111/1755-0998.14047","url":null,"abstract":"<p><p>Although efforts to estimate effective population size, census size and their ratio in wild populations are expanding, few empirical studies investigate interannual changes in these parameters. Hence, we do not know how repeatable or representative many estimates may be. Answering this question requires studies of long-term population dynamics. Here we took advantage of a rich dataset of seven brook trout (Salvelinus fontinalis) populations, 5 consecutive years and 5400 individuals genotyped at 33 microsatellites to examine variation in estimates of effective and census size and in their ratio. We first estimated the annual effective number of breeders ( <math> <semantics> <mrow><mover><mi>N</mi> <mo>̂</mo></mover> </mrow> <annotation>$$ hat{N} $$</annotation></semantics> </math> <sub>b</sub>) using individuals aged 1+. We then adjusted these estimates using two life history traits, to obtain <math> <semantics> <mrow> <msub><mover><mi>N</mi> <mo>̂</mo></mover> <mrow><mi>b</mi> <mfenced><mrow><mi>adj</mi> <mn>2</mn></mrow> </mfenced> </mrow> </msub> </mrow> <annotation>$$ {hat{N}}_{b(adj2)} $$</annotation></semantics> </math> and subsequently, <math> <semantics> <mrow> <msub><mover><mi>N</mi> <mo>̂</mo></mover> <mrow><mi>e</mi> <mfenced><mrow><mi>adj</mi> <mn>2</mn></mrow> </mfenced> </mrow> </msub> </mrow> <annotation>$$ {hat{N}}_{e(adj2)} $$</annotation></semantics> </math> following Waples et al. (2013). <math> <semantics> <mrow> <msub><mover><mi>N</mi> <mo>̂</mo></mover> <mrow><mi>e</mi> <mfenced><mrow><mi>adj</mi> <mn>2</mn></mrow> </mfenced> </mrow> </msub> </mrow> <annotation>$$ {hat{N}}_{e(adj2)} $$</annotation></semantics> </math> was estimated for the years 2014 to 2019. Census size was estimated by mark recapture using double-pass electrofishing ( <math> <semantics> <mrow> <msub><mover><mi>N</mi> <mo>̂</mo></mover> <mrow><mi>c</mi> <mfenced><mi>MR</mi></mfenced> </mrow> </msub> </mrow> <annotation>$$ {hat{N}}_{c(MR)} $$</annotation></semantics> </math> ) (years 2014-2018) as well as by the Close Kin Mark Recapture approach ( <math> <semantics> <mrow> <msub><mover><mi>N</mi> <mo>̂</mo></mover> <mrow><mi>c</mi> <mfenced><mtext>CKMR</mtext></mfenced> </mrow> </msub> </mrow> <annotation>$$ {hat{N}}_{c(CKMR)} $$</annotation></semantics> </math> ) (years 2015-2017). Within populations, annual variation in <math> <semantics> <mrow> <msub><mover><mi>N</mi> <mo>̂</mo></mover> <mrow><mi>e</mi> <mfenced><mrow><mi>adj</mi> <mn>2</mn></mrow> </mfenced> </mrow> </msub> </mrow> <annotation>$$ {hat{N}}_{e(adj2)} $$</annotation></semantics> </math> (ratio of maximum to minimum <math> <semantics> <mrow> <msub><mover><mi>N</mi> <mo>̂</mo></mover> <mrow><mi>e</mi> <mfenced><mrow><mi>adj</mi> <mn>2</mn></mrow> </mfenced> </mrow> </msub> </mrow> <annotation>$$ {hat{N}}_{e(adj2)} $$</annotation></semantics> </math> ) ranged from 1.6-fold to 58-fold. Over all 7 populations, the median annual variation in <math> <semantics> <mrow> <msub><mover><mi>N</mi> ","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":" ","pages":"e14047"},"PeriodicalIF":5.5,"publicationDate":"2024-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142708718","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Shao Shao, Yulong Li, Xiao Feng, Chuanfeng Jin, Min Liu, Ranran Zhu, Miles E Tracy, Zixiao Guo, Ziwen He, Suhua Shi, Shaohua Xu
Increased human activity and climate change have significantly impacted wild habitats and increased the number of endangered species. Exploring evolutionary history and predicting adaptive potential using genomic data will facilitate species conservation and biodiversity recovery. Here, we examined the genome evolution of a critically endangered tree Pellacalyx yunnanensis, a plant species with extremely small populations (PSESP) that is narrowly distributed in Xishuangbanna, China. The species has neared extinction due to economic exploitation in recent decades. We assembled a chromosome-level genome of 334 Mb, with the N50 length of 20.5 Mb. Using the genome, we discovered that P. yunnanensis has undergone several population size reductions, leading to excess deleterious mutations. The species may possess low adaptive potential due to reduced genetic diversity and the loss of stress-responsive genes. We estimate that P. yunnanensis is the basal species of its genus and diverged from its relatives during global cooling, suggesting it was stranded in unsuitable environments during periods of dramatic climate change. In particular, the loss of seed dormancy leads to germination under unfavourable conditions and reproduction challenges. This dormancy loss may have occurred through genetic changes that suppress ABA signalling and the loss of genes involved in seed maturation. The high-quality genome has also enabled us to reveal phenotypic trait evolution in Rhizophoraceae and identify divergent adaptation to intertidal and inland habitats. In summary, our study elucidates mechanisms underlying the decline and evaluates the adaptive potential of P. yunnanensis to future climate change, informing future conservation efforts.
人类活动的增加和气候变化严重影响了野生栖息地,并增加了濒危物种的数量。利用基因组数据探索进化历史和预测适应潜力将有助于物种保护和生物多样性恢复。在这里,我们研究了一种极度濒危树种云南糙叶树的基因组进化。由于近几十年来的经济开发,该物种已濒临灭绝。我们组装了一个 334 Mb 的染色体级基因组,N50 长度为 20.5 Mb。利用该基因组,我们发现云南滇金丝猴经历了多次种群规模缩小,导致了过多的有害突变。由于遗传多样性的减少和应激反应基因的丢失,该物种可能具有较低的适应潜力。我们估计,云南滇金丝猴是其属中的基干种,在全球变冷期间与其亲缘种发生了分化,这表明它在气候剧变期间被困在了不适宜的环境中。特别是,种子休眠的丧失导致其在不利的条件下发芽并面临繁殖挑战。这种休眠的丧失可能是通过抑制 ABA 信号的遗传变化和参与种子成熟的基因的丧失而发生的。高质量的基因组还使我们能够揭示根瘤菌科植物的表型性状进化,并确定对潮间带和内陆生境的不同适应。总之,我们的研究阐明了云南红豆杉衰退的机制,评估了其对未来气候变化的适应潜力,为未来的保护工作提供了信息。
{"title":"Chromosomal-Level Genome Suggests Adaptive Constraints Leading to the Historical Population Decline in an Extremely Endangered Plant.","authors":"Shao Shao, Yulong Li, Xiao Feng, Chuanfeng Jin, Min Liu, Ranran Zhu, Miles E Tracy, Zixiao Guo, Ziwen He, Suhua Shi, Shaohua Xu","doi":"10.1111/1755-0998.14045","DOIUrl":"https://doi.org/10.1111/1755-0998.14045","url":null,"abstract":"<p><p>Increased human activity and climate change have significantly impacted wild habitats and increased the number of endangered species. Exploring evolutionary history and predicting adaptive potential using genomic data will facilitate species conservation and biodiversity recovery. Here, we examined the genome evolution of a critically endangered tree Pellacalyx yunnanensis, a plant species with extremely small populations (PSESP) that is narrowly distributed in Xishuangbanna, China. The species has neared extinction due to economic exploitation in recent decades. We assembled a chromosome-level genome of 334 Mb, with the N50 length of 20.5 Mb. Using the genome, we discovered that P. yunnanensis has undergone several population size reductions, leading to excess deleterious mutations. The species may possess low adaptive potential due to reduced genetic diversity and the loss of stress-responsive genes. We estimate that P. yunnanensis is the basal species of its genus and diverged from its relatives during global cooling, suggesting it was stranded in unsuitable environments during periods of dramatic climate change. In particular, the loss of seed dormancy leads to germination under unfavourable conditions and reproduction challenges. This dormancy loss may have occurred through genetic changes that suppress ABA signalling and the loss of genes involved in seed maturation. The high-quality genome has also enabled us to reveal phenotypic trait evolution in Rhizophoraceae and identify divergent adaptation to intertidal and inland habitats. In summary, our study elucidates mechanisms underlying the decline and evaluates the adaptive potential of P. yunnanensis to future climate change, informing future conservation efforts.</p>","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":" ","pages":"e14045"},"PeriodicalIF":5.5,"publicationDate":"2024-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142685385","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Anne Beemelmanns, Raphaël Bouchard, Sozos Michaelides, Eric Normandeau, Hyung-Bae Jeon, Badrouyk Chamlian, Charles Babin, Philippe Hénault, Océane Perrot, Les N Harris, Xinhua Zhu, Dylan Fraser, Louis Bernatchez, Jean-Sébastien Moore
Single nucleotide polymorphism (SNP) panels are powerful tools for assessing the genetic population structure and dispersal of fishes and can enhance management practices for commercial, recreational and subsistence mixed-stock fisheries. Arctic Char (Salvelinus alpinus), Brook Trout (Salvelinus fontinalis) and Lake Whitefish (Coregonus clupeaformis) are among the most harvested and consumed fish species in Northern Indigenous communities in Canada, contributing significantly to food security, culture, tradition and economy. However, genetic resources supporting Indigenous fisheries have not been widely accessible to northern communities (e.g. Inuit, Cree, Dene). Here, we developed Genotyping-in-Thousands by sequencing (GT-seq) panels for population assignment and mixed-stock analyses of three salmonids, to support fisheries stewardship or co-management in Northern Canada. Using low-coverage Whole Genome Sequencing data from 418 individuals across source populations in Cambridge Bay (Nunavut), Great Slave Lake (Northwest Territories), James Bay (Québec) and Mistassini Lake (Québec), we developed a bioinformatic SNP filtering workflow to select informative SNP markers from genotype likelihoods. These markers were then used to design GT-seq panels, thus enabling high-throughput genotyping for these species. The three GT-seq panels yielded an average of 413 autosomal loci and were validated using 525 individuals with an average assignment accuracy of 83%. Thus, these GT-seq panels are powerful tools for assessing population structure and quantifying the relative contributions of populations/stocks in mixed-stock fisheries across multiple regions. Interweaving genomic data derived from these tools with Traditional Ecological Knowledge will ensure the sustainable harvest of three culturally important salmonids in Indigenous communities, contributing to food security programmes and the economy in Northern Canada.
{"title":"Development of SNP Panels from Low-Coverage Whole Genome Sequencing (lcWGS) to Support Indigenous Fisheries for Three Salmonid Species in Northern Canada.","authors":"Anne Beemelmanns, Raphaël Bouchard, Sozos Michaelides, Eric Normandeau, Hyung-Bae Jeon, Badrouyk Chamlian, Charles Babin, Philippe Hénault, Océane Perrot, Les N Harris, Xinhua Zhu, Dylan Fraser, Louis Bernatchez, Jean-Sébastien Moore","doi":"10.1111/1755-0998.14040","DOIUrl":"https://doi.org/10.1111/1755-0998.14040","url":null,"abstract":"<p><p>Single nucleotide polymorphism (SNP) panels are powerful tools for assessing the genetic population structure and dispersal of fishes and can enhance management practices for commercial, recreational and subsistence mixed-stock fisheries. Arctic Char (Salvelinus alpinus), Brook Trout (Salvelinus fontinalis) and Lake Whitefish (Coregonus clupeaformis) are among the most harvested and consumed fish species in Northern Indigenous communities in Canada, contributing significantly to food security, culture, tradition and economy. However, genetic resources supporting Indigenous fisheries have not been widely accessible to northern communities (e.g. Inuit, Cree, Dene). Here, we developed Genotyping-in-Thousands by sequencing (GT-seq) panels for population assignment and mixed-stock analyses of three salmonids, to support fisheries stewardship or co-management in Northern Canada. Using low-coverage Whole Genome Sequencing data from 418 individuals across source populations in Cambridge Bay (Nunavut), Great Slave Lake (Northwest Territories), James Bay (Québec) and Mistassini Lake (Québec), we developed a bioinformatic SNP filtering workflow to select informative SNP markers from genotype likelihoods. These markers were then used to design GT-seq panels, thus enabling high-throughput genotyping for these species. The three GT-seq panels yielded an average of 413 autosomal loci and were validated using 525 individuals with an average assignment accuracy of 83%. Thus, these GT-seq panels are powerful tools for assessing population structure and quantifying the relative contributions of populations/stocks in mixed-stock fisheries across multiple regions. Interweaving genomic data derived from these tools with Traditional Ecological Knowledge will ensure the sustainable harvest of three culturally important salmonids in Indigenous communities, contributing to food security programmes and the economy in Northern Canada.</p>","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":" ","pages":"e14040"},"PeriodicalIF":5.5,"publicationDate":"2024-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142646387","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The ammonia monooxygenase subunit A (amoA) gene has been used to investigate the phylogenetic diversity, spatial distribution and activity of ammonia-oxidising archaeal (AOA) and bacterial (AOB), which contribute significantly to the nitrogen cycle in various ecosystems. Amplicon sequencing of amoA is a widely used method; however, it produces inaccurate results owing to the lack of a 'universal' primer set. Moreover, currently available primer sets suffer from amplification biases, which can lead to severe misinterpretation. Although shotgun metagenomic and metatranscriptomic analyses are alternative approaches without amplification bias, the low abundance of target genes in heterogeneous environmental DNA restricts a comprehensive analysis to a realisable sequencing depth. In this study, we developed a probe set and bioinformatics workflow for amoA enrichment sequencing using a hybridisation capture technique. Using metagenomic mock community samples, our approach effectively enriched amoA genes with low compositional changes, outperforming amplification and meta-omics sequencing analyses. Following the analysis of metatranscriptomic marine samples, we predicted 80 operational taxonomic units (OTUs) assigned to either AOA or AOB, of which 30 OTUs were unidentified using simple metatranscriptomic or amoA gene amplicon sequencing. Mapped read ratios to all the detected OTUs were significantly higher for the capture samples (50.4 ± 27.2%) than for non-capture samples (0.05 ± 0.02%), demonstrating the high enrichment efficiency of the method. The analysis also revealed the spatial diversity of AOA ecotypes with high sensitivity and phylogenetic resolution, which are difficult to examine using conventional approaches.
氨单加氧酶亚基 A(amoA)基因已被用于研究氨氧化古细菌(AOA)和细菌(AOB)的系统发育多样性、空间分布和活性。氨氧化古细菌(amoA)的扩增子测序是一种广泛使用的方法;然而,由于缺乏 "通用 "引物集,这种方法产生的结果并不准确。此外,目前可用的引物组存在扩增偏差,可能导致严重的误读。虽然散弹枪元基因组和元转录组分析是没有扩增偏差的替代方法,但目标基因在异质环境 DNA 中的低丰度限制了全面分析的可实现测序深度。在这项研究中,我们利用杂交捕获技术为 amoA 富集测序开发了探针组和生物信息学工作流程。利用元基因组模拟群落样本,我们的方法有效地富集了组成变化较小的amoA基因,优于扩增和元组学测序分析。在对海洋样本进行元转录组学分析后,我们预测了 80 个可操作的分类单元(OTU),这些单元被归入 AOA 或 AOB,其中 30 个 OTU 通过简单的元转录组学或 amoA 基因扩增片段测序无法识别。捕获样本与所有检测到的 OTU 的映射读数比(50.4 ± 27.2%)明显高于非捕获样本(0.05 ± 0.02%),这表明该方法具有很高的富集效率。该分析还揭示了 AOA 生态型的空间多样性,具有较高的灵敏度和系统发育分辨率,这是传统方法难以研究的。
{"title":"Probe Capture Enrichment Sequencing of amoA Genes Improves the Detection of Diverse Ammonia-Oxidising Archaeal and Bacterial Populations.","authors":"Satoshi Hiraoka, Minoru Ijichi, Hirohiko Takeshima, Yohei Kumagai, Ching-Chia Yang, Yoko Makabe-Kobayashi, Hideki Fukuda, Susumu Yoshizawa, Wataru Iwasaki, Kazuhiro Kogure, Takuhei Shiozaki","doi":"10.1111/1755-0998.14042","DOIUrl":"https://doi.org/10.1111/1755-0998.14042","url":null,"abstract":"<p><p>The ammonia monooxygenase subunit A (amoA) gene has been used to investigate the phylogenetic diversity, spatial distribution and activity of ammonia-oxidising archaeal (AOA) and bacterial (AOB), which contribute significantly to the nitrogen cycle in various ecosystems. Amplicon sequencing of amoA is a widely used method; however, it produces inaccurate results owing to the lack of a 'universal' primer set. Moreover, currently available primer sets suffer from amplification biases, which can lead to severe misinterpretation. Although shotgun metagenomic and metatranscriptomic analyses are alternative approaches without amplification bias, the low abundance of target genes in heterogeneous environmental DNA restricts a comprehensive analysis to a realisable sequencing depth. In this study, we developed a probe set and bioinformatics workflow for amoA enrichment sequencing using a hybridisation capture technique. Using metagenomic mock community samples, our approach effectively enriched amoA genes with low compositional changes, outperforming amplification and meta-omics sequencing analyses. Following the analysis of metatranscriptomic marine samples, we predicted 80 operational taxonomic units (OTUs) assigned to either AOA or AOB, of which 30 OTUs were unidentified using simple metatranscriptomic or amoA gene amplicon sequencing. Mapped read ratios to all the detected OTUs were significantly higher for the capture samples (50.4 ± 27.2%) than for non-capture samples (0.05 ± 0.02%), demonstrating the high enrichment efficiency of the method. The analysis also revealed the spatial diversity of AOA ecotypes with high sensitivity and phylogenetic resolution, which are difficult to examine using conventional approaches.</p>","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":" ","pages":"e14042"},"PeriodicalIF":5.5,"publicationDate":"2024-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142646394","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Symbiotic microbiota strongly impact host physiology. Amphibians and reptiles occupy a pivotal role in the evolutionary history of Animalia, and they are of significant ecological, economic, and scientific value. Many prior studies have found that symbiotic microbiota in herpetofaunal species are closely associated with host phylogeny, physiological traits, and environmental factors; however, insufficient integrated databases hinder researchers from querying, accessing, and reanalyzing these resources. To rectify this, we built the first herpetofaunal microbiota database (HMicroDB; https://herpdb.com/) that integrates 11,697 microbiological samples from 337 host species (covering 23 body sites and associated with 23 host phenotypic or environmental factors), and we identified 11,084 microbial taxa by consistent annotation. The standardised analysis process, cross-dataset integration, user-friendly interface, and interactive visualisation make the HMicroDB a powerful resource for researchers to search, browse, and explore the relationships between symbiotic microbiota, hosts, and environment. This facilitates research in host-microbiota coevolution, biological conservation, and resource utilisation.
{"title":"HMicroDB: A Comprehensive Database of Herpetofaunal Microbiota With a Focus on Host Phylogeny, Physiological Traits, and Environment Factors.","authors":"Jiaying Li, Yuze Gao, Guocheng Shu, Xuanzhong Chen, Jiahao Zhu, Si Zheng, Ting Chen","doi":"10.1111/1755-0998.14046","DOIUrl":"10.1111/1755-0998.14046","url":null,"abstract":"<p><p>Symbiotic microbiota strongly impact host physiology. Amphibians and reptiles occupy a pivotal role in the evolutionary history of Animalia, and they are of significant ecological, economic, and scientific value. Many prior studies have found that symbiotic microbiota in herpetofaunal species are closely associated with host phylogeny, physiological traits, and environmental factors; however, insufficient integrated databases hinder researchers from querying, accessing, and reanalyzing these resources. To rectify this, we built the first herpetofaunal microbiota database (HMicroDB; https://herpdb.com/) that integrates 11,697 microbiological samples from 337 host species (covering 23 body sites and associated with 23 host phenotypic or environmental factors), and we identified 11,084 microbial taxa by consistent annotation. The standardised analysis process, cross-dataset integration, user-friendly interface, and interactive visualisation make the HMicroDB a powerful resource for researchers to search, browse, and explore the relationships between symbiotic microbiota, hosts, and environment. This facilitates research in host-microbiota coevolution, biological conservation, and resource utilisation.</p>","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":" ","pages":"e14046"},"PeriodicalIF":5.5,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142613334","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}