首页 > 最新文献

Molecular Ecology Resources最新文献

英文 中文
Quantitative assessment of reef foraminifera community from metabarcoding data 利用代谢编码数据对珊瑚礁有孔虫群落进行定量评估。
IF 5.5 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-07-23 DOI: 10.1111/1755-0998.14000
Elsa B. Girard, Emilie A. Didaskalou, Andi M. A. Pratama, Carolina Rattner, Raphaël Morard, Willem Renema

Describing living community compositions is essential to monitor ecosystems in a rapidly changing world, but it is challenging to produce fast and accurate depiction of ecosystems due to methodological limitations. Morphological methods provide absolute abundances with limited throughput, whereas metabarcoding provides relative abundances of genes that may not correctly represent living communities from environmental DNA assessed with morphological methods. However, it has the potential to deliver fast descriptions of living communities provided that it is interpreted with validated species-specific calibrations and reference databases. Here, we developed a quantitative approach to retrieve from metabarcoding data the assemblages of living large benthic foraminifera (LBF), photosymbiotic calcifying protists, from Indonesian coral reefs that are under increasing anthropogenic pressure. To depict the diversity, we calculated taxon-specific correction factors to reduce biological biases by comparing surface area, biovolume and calcite volume, and the number of mitochondrial gene copies in seven common LBF species. To validate the approach, we compared calibrated datasets of morphological communities from mock samples with bulk reef sediment; both sample types were metabarcoded. The calibration of the data significantly improved the estimations of genus relative abundance, with a difference of ±5% on average, allowing for comparison of past morphological datasets with future molecular ones. Our results also highlight the application of our quantitative approach to support reef monitoring operations by capturing fine-scale processes, such as seasonal and pollution-driven dynamics, that require high-throughput sampling treatment.

在瞬息万变的世界中,描述生物群落组成对监测生态系统至关重要,但由于方法的局限性,要快速准确地描述生态系统具有挑战性。形态学方法提供的是绝对丰度,但通量有限,而元条码提供的是基因的相对丰度,可能无法从形态学方法评估的环境 DNA 中正确代表生物群落。然而,如果使用经过验证的物种特异性定标和参考数据库进行解释,代谢标码有可能快速描述生物群落。在此,我们开发了一种定量方法,从代谢编码数据中检索印度尼西亚珊瑚礁中活的大型底栖有孔虫(LBF)--光合共生的钙化原生生物--的集合。为了描绘多样性,我们通过比较七个常见 LBF 物种的表面积、生物体积和方解石体积以及线粒体基因拷贝数,计算了特定类群的校正因子,以减少生物偏差。为了验证这种方法,我们比较了来自模拟样本和大块珊瑚礁沉积物的形态群落校准数据集;两种样本类型都进行了代谢标记。数据校准大大提高了对物种相对丰度的估计,平均差异为±5%,从而可以将过去的形态数据集与未来的分子数据集进行比较。我们的研究结果还突显了我们的定量方法在支持珊瑚礁监测行动中的应用,即捕捉需要高通量采样处理的细尺度过程,如季节性和污染驱动的动态过程。
{"title":"Quantitative assessment of reef foraminifera community from metabarcoding data","authors":"Elsa B. Girard,&nbsp;Emilie A. Didaskalou,&nbsp;Andi M. A. Pratama,&nbsp;Carolina Rattner,&nbsp;Raphaël Morard,&nbsp;Willem Renema","doi":"10.1111/1755-0998.14000","DOIUrl":"10.1111/1755-0998.14000","url":null,"abstract":"<p>Describing living community compositions is essential to monitor ecosystems in a rapidly changing world, but it is challenging to produce fast and accurate depiction of ecosystems due to methodological limitations. Morphological methods provide absolute abundances with limited throughput, whereas metabarcoding provides relative abundances of genes that may not correctly represent living communities from environmental DNA assessed with morphological methods. However, it has the potential to deliver fast descriptions of living communities provided that it is interpreted with validated species-specific calibrations and reference databases. Here, we developed a quantitative approach to retrieve from metabarcoding data the assemblages of living large benthic foraminifera (LBF), photosymbiotic calcifying protists, from Indonesian coral reefs that are under increasing anthropogenic pressure. To depict the diversity, we calculated taxon-specific correction factors to reduce biological biases by comparing surface area, biovolume and calcite volume, and the number of mitochondrial gene copies in seven common LBF species. To validate the approach, we compared calibrated datasets of morphological communities from mock samples with bulk reef sediment; both sample types were metabarcoded. The calibration of the data significantly improved the estimations of genus relative abundance, with a difference of ±5% on average, allowing for comparison of past morphological datasets with future molecular ones. Our results also highlight the application of our quantitative approach to support reef monitoring operations by capturing fine-scale processes, such as seasonal and pollution-driven dynamics, that require high-throughput sampling treatment.</p>","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":null,"pages":null},"PeriodicalIF":5.5,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/1755-0998.14000","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141746975","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The voice of the little giants: Arcellinida testate amoebae in environmental DNA-based bioindication, from taxonomy free to haplotypic level 小巨人的声音:从无分类到单倍型水平,基于环境 DNA 的生物鉴定中的 Arcellinida testate amoebae。
IF 5.5 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-07-23 DOI: 10.1111/1755-0998.13999
Rubén González-Miguéns, Emilio Cano, Mónica García-Gallo Pinto, Pablo G. Peña, Mario Rincón-Barrado, Guillermo Iglesias, Andrés Blanco-Rotea, María Isabel Carrasco-Braganza, David de Salvador-Velasco, Antonio Guillén-Oterino, Daniel Tenorio-Rodríguez, Ferry Siemensma, David Velázquez, Enrique Lara

Bioindication, evaluating biological responses to environmental disturbances, is crucial for assessing the ecological status of an ecosystem. While historical bioindication relied on macroscopic organisms, the introduction of environmental DNA (eDNA) techniques allows the application of protists without the necessity of morphological identification. In this study, we propose a novel bioindication methodology utilizing Arcellinida, a group of top predators among protists, as bioindicators of freshwater ecosystems. For that purpose, we first characterized the Arcellinida diversity over 1 year at three different points of Lake Sanabria, an ancient glacier lake known to be subjected to anthropogenic disturbances. We compared this diversity with an undisturbed control site. Second, we characterized the Arcellinida diversity in other ecosystems to generate the ecological background to test the connectivity between them. Results indicate limited connectivity between the different ecosystems and an edge effect between terrestrial and aquatic ecosystems. Disturbed freshwater ecosystems exhibited reduced Arcellinida diversity at both specific and infraspecific levels, providing valuable insight into recent disturbances. Arcellinida-based bioindication provides a sensitive, accurate and easy-to-interpret protocol for monitoring disturbances in freshwater ecosystems. It represents a valuable tool for environmental assessments and conservation strategies.

生物鉴定,即评估生物对环境干扰的反应,对于评估生态系统的生态状况至关重要。历史上的生物鉴别依赖于宏观生物,而环境 DNA(eDNA)技术的引入使得原生生物的应用无需进行形态鉴定。在本研究中,我们提出了一种新的生物指示方法,利用原生动物中的顶级捕食者--Arcellinida 作为淡水生态系统的生物指示剂。为此,我们首先描述了萨纳布里亚湖(一个已知受到人为干扰的古老冰川湖)三个不同点一年来的 Arcellinida 多样性。我们将这一多样性与未受干扰的对照地点进行了比较。其次,我们对其他生态系统中的弓形虫多样性进行了特征描述,以便为检验它们之间的联系提供生态背景。结果表明,不同生态系统之间的连通性有限,陆生和水生生态系统之间存在边缘效应。受干扰的淡水生态系统在特异性和次特异性水平上都表现出了较低的弓形虫多样性,为了解近期的干扰提供了宝贵的信息。基于弓形虫的生物指示为监测淡水生态系统的干扰提供了一个灵敏、准确和易于解释的方案。它是环境评估和保护战略的宝贵工具。
{"title":"The voice of the little giants: Arcellinida testate amoebae in environmental DNA-based bioindication, from taxonomy free to haplotypic level","authors":"Rubén González-Miguéns,&nbsp;Emilio Cano,&nbsp;Mónica García-Gallo Pinto,&nbsp;Pablo G. Peña,&nbsp;Mario Rincón-Barrado,&nbsp;Guillermo Iglesias,&nbsp;Andrés Blanco-Rotea,&nbsp;María Isabel Carrasco-Braganza,&nbsp;David de Salvador-Velasco,&nbsp;Antonio Guillén-Oterino,&nbsp;Daniel Tenorio-Rodríguez,&nbsp;Ferry Siemensma,&nbsp;David Velázquez,&nbsp;Enrique Lara","doi":"10.1111/1755-0998.13999","DOIUrl":"10.1111/1755-0998.13999","url":null,"abstract":"<p>Bioindication, evaluating biological responses to environmental disturbances, is crucial for assessing the ecological status of an ecosystem. While historical bioindication relied on macroscopic organisms, the introduction of environmental DNA (eDNA) techniques allows the application of protists without the necessity of morphological identification. In this study, we propose a novel bioindication methodology utilizing Arcellinida, a group of top predators among protists, as bioindicators of freshwater ecosystems. For that purpose, we first characterized the Arcellinida diversity over 1 year at three different points of Lake Sanabria, an ancient glacier lake known to be subjected to anthropogenic disturbances. We compared this diversity with an undisturbed control site. Second, we characterized the Arcellinida diversity in other ecosystems to generate the ecological background to test the connectivity between them. Results indicate limited connectivity between the different ecosystems and an edge effect between terrestrial and aquatic ecosystems. Disturbed freshwater ecosystems exhibited reduced Arcellinida diversity at both specific and infraspecific levels, providing valuable insight into recent disturbances. Arcellinida-based bioindication provides a sensitive, accurate and easy-to-interpret protocol for monitoring disturbances in freshwater ecosystems. It represents a valuable tool for environmental assessments and conservation strategies.</p>","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":null,"pages":null},"PeriodicalIF":5.5,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/1755-0998.13999","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141750681","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Benchmarking long-read sequencing strategies for obtaining ASV-resolved rRNA operons from environmental microeukaryotes 从环境微真核细胞中获取ASV解析rRNA操作子的长线程测序策略基准。
IF 5.5 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-07-09 DOI: 10.1111/1755-0998.13991
Christina Karmisholt Overgaard, Mahwash Jamy, Simona Radutoiu, Fabien Burki, Morten Kam Dahl Dueholm

The use of short-read metabarcoding for classifying microeukaryotes is challenged by the lack of comprehensive 18S rRNA reference databases. While recent advances in high-throughput long-read sequencing provide the potential to greatly increase the phylogenetic coverage of these databases, the performance of different sequencing technologies and subsequent bioinformatics processing remain to be evaluated, primarily because of the absence of well-defined eukaryotic mock communities. To address this challenge, we created a eukaryotic rRNA operon clone-library and turned it into a precisely defined synthetic eukaryotic mock community. This mock community was then used to evaluate the performance of three long-read sequencing strategies (PacBio circular consensus sequencing and two Nanopore approaches using unique molecular identifiers) and three tools for resolving amplicons sequence variants (ASVs) (USEARCH, VSEARCH, and DADA2). We investigated the sensitivity of the sequencing techniques based on the number of detected mock taxa, and the accuracy of the different ASV-calling tools with a specific focus on the presence of chimera among the final rRNA operon ASVs. Based on our findings, we provide recommendations and best practice protocols for how to cost-effectively obtain essentially error-free rRNA operons in high-throughput. An agricultural soil sample was used to demonstrate that the sequencing and bioinformatic results from the mock community also translates to highly diverse natural samples, which enables us to identify previously undescribed microeukaryotic lineages.

由于缺乏全面的 18S rRNA 参考数据库,使用短线程元条码对微核生物进行分类面临挑战。虽然高通量长读数测序技术的最新进展有可能大大增加这些数据库的系统发生学覆盖范围,但不同测序技术的性能以及后续的生物信息学处理仍有待评估,这主要是因为缺乏定义明确的真核模拟群落。为了应对这一挑战,我们创建了真核生物 rRNA 操作子克隆库,并将其转化为精确定义的合成真核生物模拟群落。这个模拟群落随后被用来评估三种长读数测序策略(PacBio 循环共识测序和两种使用独特分子标识符的 Nanopore 方法)和三种解决扩增子序列变异(ASV)的工具(USEARCH、VSEARCH 和 DADA2)的性能。我们根据检测到的模拟类群数量调查了测序技术的灵敏度,以及不同 ASV 调用工具的准确性,特别关注最终 rRNA 操作子 ASV 中是否存在嵌合体。基于我们的研究结果,我们为如何以高通量、经济有效的方式获得基本无误的 rRNA 操作子提供了建议和最佳实践方案。我们使用了一个农业土壤样本来证明,模拟群落的测序和生物信息学结果同样适用于高度多样化的自然样本,这使我们能够确定以前未曾描述过的微真核细胞系。
{"title":"Benchmarking long-read sequencing strategies for obtaining ASV-resolved rRNA operons from environmental microeukaryotes","authors":"Christina Karmisholt Overgaard,&nbsp;Mahwash Jamy,&nbsp;Simona Radutoiu,&nbsp;Fabien Burki,&nbsp;Morten Kam Dahl Dueholm","doi":"10.1111/1755-0998.13991","DOIUrl":"10.1111/1755-0998.13991","url":null,"abstract":"<p>The use of short-read metabarcoding for classifying microeukaryotes is challenged by the lack of comprehensive 18S rRNA reference databases. While recent advances in high-throughput long-read sequencing provide the potential to greatly increase the phylogenetic coverage of these databases, the performance of different sequencing technologies and subsequent bioinformatics processing remain to be evaluated, primarily because of the absence of well-defined eukaryotic mock communities. To address this challenge, we created a eukaryotic rRNA operon clone-library and turned it into a precisely defined synthetic eukaryotic mock community. This mock community was then used to evaluate the performance of three long-read sequencing strategies (PacBio circular consensus sequencing and two Nanopore approaches using unique molecular identifiers) and three tools for resolving amplicons sequence variants (ASVs) (USEARCH, VSEARCH, and DADA2). We investigated the sensitivity of the sequencing techniques based on the number of detected mock taxa, and the accuracy of the different ASV-calling tools with a specific focus on the presence of chimera among the final rRNA operon ASVs. Based on our findings, we provide recommendations and best practice protocols for how to cost-effectively obtain essentially error-free rRNA operons in high-throughput. An agricultural soil sample was used to demonstrate that the sequencing and bioinformatic results from the mock community also translates to highly diverse natural samples, which enables us to identify previously undescribed microeukaryotic lineages.</p>","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":null,"pages":null},"PeriodicalIF":5.5,"publicationDate":"2024-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/1755-0998.13991","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141557577","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Next-generation bioinformatics: An ultrafast and user-friendly tool for phylogenomic data exploration 下一代生物信息学:用于系统发生组数据探索的超快和用户友好型工具。
IF 5.5 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-07-06 DOI: 10.1111/1755-0998.13993
Kin Onn Chan

With increasingly large genomic datasets, even routine bioinformatic tasks can be arduous, computationally demanding, and time-consuming. Additionally, most bioinformatic programs do not have a graphical user interface (GUI) and thus, require users to be minimally competent in command-line. These impediments present significant economic and technological barriers for practitioners who do not have access to advanced computational resources and support. In this issue of Molecular Ecology Resources, Handika and Esselstyn (2024) developed an ultrafast and memory-efficient bioinformatic tool, SEGUL, that performs common manipulations and calculations of summary statistics on phylogenomic datasets. SEGUL has two main features that distinguish it from other bioinformatic programs: (1) it is based on the recently emergent, high-performance programming language Rust, and (2) it has a GUI written using Flutter, a cross-platform programming framework that also supports mobile operating systems (mobile iOS, iPadOS and Android). By leveraging and combining the power of Rust and Flutter, SEGUL achieves significantly faster computation times and lower memory usage across different platforms and CPU architectures compared to similar programs. The unique and innovative features of SEGUL pave the way for a new era of bioinformatics that can be more energy-efficient, inclusive, and available to a broader swathe of users.

随着基因组数据集越来越庞大,即使是常规的生物信息任务也会变得艰巨、计算要求高且耗时。此外,大多数生物信息程序都没有图形用户界面(GUI),因此要求用户具备最低限度的命令行能力。这些障碍为无法获得先进计算资源和支持的从业人员带来了巨大的经济和技术障碍。在本期的《分子生态学资源》(Molecular Ecology Resources)杂志上,Handika 和 Esselstyn 开发了一种超快、内存效率高的生物信息学工具 SEGUL,它可以对系统发生组数据集进行常见的操作和汇总统计计算。SEGUL 有两个区别于其他生物信息学程序的主要特点:(1)它基于最近出现的高性能编程语言 Rust,(2)它的图形用户界面是用 Flutter 编写的,Flutter 是一个跨平台编程框架,也支持移动操作系统(移动 iOS、iPadOS 和 Android)。通过利用并结合Rust和Flutter的强大功能,SEGUL在不同平台和CPU架构下的计算时间和内存使用量都比同类程序要快得多。SEGUL独特而创新的功能为生物信息学的新时代铺平了道路,使其更节能、更具包容性,并为更广泛的用户所使用。
{"title":"Next-generation bioinformatics: An ultrafast and user-friendly tool for phylogenomic data exploration","authors":"Kin Onn Chan","doi":"10.1111/1755-0998.13993","DOIUrl":"10.1111/1755-0998.13993","url":null,"abstract":"<p>With increasingly large genomic datasets, even routine bioinformatic tasks can be arduous, computationally demanding, and time-consuming. Additionally, most bioinformatic programs do not have a graphical user interface (GUI) and thus, require users to be minimally competent in command-line. These impediments present significant economic and technological barriers for practitioners who do not have access to advanced computational resources and support. In this issue of Molecular Ecology Resources, Handika and Esselstyn (2024) developed an ultrafast and memory-efficient bioinformatic tool, SEGUL, that performs common manipulations and calculations of summary statistics on phylogenomic datasets. SEGUL has two main features that distinguish it from other bioinformatic programs: (1) it is based on the recently emergent, high-performance programming language Rust, and (2) it has a GUI written using Flutter, a cross-platform programming framework that also supports mobile operating systems (mobile iOS, iPadOS and Android). By leveraging and combining the power of Rust and Flutter, SEGUL achieves significantly faster computation times and lower memory usage across different platforms and CPU architectures compared to similar programs. The unique and innovative features of SEGUL pave the way for a new era of bioinformatics that can be more energy-efficient, inclusive, and available to a broader swathe of users.</p>","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":null,"pages":null},"PeriodicalIF":5.5,"publicationDate":"2024-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141544221","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Missing genotype imputation in non-model species using self-organizing maps. 利用自组织图对非模式物种中的缺失基因型进行估算。
IF 5.5 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-07-06 DOI: 10.1111/1755-0998.13992
Fernando Mora-Márquez, Juan Carlos Nuño, Álvaro Soto, Unai López de Heredia

Current methodologies of genome-wide single-nucleotide polymorphism (SNP) genotyping produce large amounts of missing data that may affect statistical inference and bias the outcome of experiments. Genotype imputation is routinely used in well-studied species to buffer the impact in downstream analysis, and several algorithms are available to fill in missing genotypes. The lack of reference haplotype panels precludes the use of these methods in genomic studies on non-model organisms. As an alternative, machine learning algorithms are employed to explore the genotype data and to estimate the missing genotypes. Here, we propose an imputation method based on self-organizing maps (SOM), a widely used neural networks formed by spatially distributed neurons that cluster similar inputs into close neurons. The method explores genotype datasets to select SNP loci to build binary vectors from the genotypes, and initializes and trains neural networks for each query missing SNP genotype. The SOM-derived clustering is then used to impute the best genotype. To automate the imputation process, we have implemented gtImputation, an open-source application programmed in Python3 and with a user-friendly GUI to facilitate the whole process. The method performance was validated by comparing its accuracy, precision and sensitivity on several benchmark genotype datasets with other available imputation algorithms. Our approach produced highly accurate and precise genotype imputations even for SNPs with alleles at low frequency and outperformed other algorithms, especially for datasets from mixed populations with unrelated individuals.

目前的全基因组单核苷酸多态性(SNP)基因分型方法会产生大量缺失数据,可能会影响统计推断并使实验结果出现偏差。基因型估算通常用于研究充分的物种,以缓冲下游分析的影响,有几种算法可用于填补缺失的基因型。由于缺乏参考单倍型面板,这些方法无法用于非模式生物的基因组研究。作为一种替代方法,我们采用了机器学习算法来探索基因型数据并估计缺失的基因型。在这里,我们提出了一种基于自组织图(SOM)的估算方法,自组织图是一种广泛使用的神经网络,由空间分布的神经元组成,可将相似的输入聚类到相近的神经元中。该方法通过探索基因型数据集来选择 SNP 位点,从而从基因型中建立二进制向量,并为每个查询缺失的 SNP 基因型初始化和训练神经网络。然后利用 SOM 衍生的聚类来估算最佳基因型。为了实现归因过程的自动化,我们实现了 gtImputation,这是一个用 Python3 编程的开源应用程序,具有用户友好的图形用户界面,以方便整个过程。通过在几个基准基因型数据集上与其他可用的归因算法比较其准确性、精确性和灵敏度,验证了该方法的性能。即使是等位基因频率较低的 SNP,我们的方法也能产生高度准确和精确的基因型归约,而且性能优于其他算法,尤其是在非亲缘关系的混合人群数据集上。
{"title":"Missing genotype imputation in non-model species using self-organizing maps.","authors":"Fernando Mora-Márquez, Juan Carlos Nuño, Álvaro Soto, Unai López de Heredia","doi":"10.1111/1755-0998.13992","DOIUrl":"https://doi.org/10.1111/1755-0998.13992","url":null,"abstract":"<p><p>Current methodologies of genome-wide single-nucleotide polymorphism (SNP) genotyping produce large amounts of missing data that may affect statistical inference and bias the outcome of experiments. Genotype imputation is routinely used in well-studied species to buffer the impact in downstream analysis, and several algorithms are available to fill in missing genotypes. The lack of reference haplotype panels precludes the use of these methods in genomic studies on non-model organisms. As an alternative, machine learning algorithms are employed to explore the genotype data and to estimate the missing genotypes. Here, we propose an imputation method based on self-organizing maps (SOM), a widely used neural networks formed by spatially distributed neurons that cluster similar inputs into close neurons. The method explores genotype datasets to select SNP loci to build binary vectors from the genotypes, and initializes and trains neural networks for each query missing SNP genotype. The SOM-derived clustering is then used to impute the best genotype. To automate the imputation process, we have implemented gtImputation, an open-source application programmed in Python3 and with a user-friendly GUI to facilitate the whole process. The method performance was validated by comparing its accuracy, precision and sensitivity on several benchmark genotype datasets with other available imputation algorithms. Our approach produced highly accurate and precise genotype imputations even for SNPs with alleles at low frequency and outperformed other algorithms, especially for datasets from mixed populations with unrelated individuals.</p>","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":null,"pages":null},"PeriodicalIF":5.5,"publicationDate":"2024-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141544220","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards a better future for DNA barcoding: Evaluating monophyly- and distance-based species identification using COI gene fragments of Dacini fruit flies 迈向 DNA 条形码的美好未来:利用达西尼果蝇的 COI 基因片段评估基于单系和距离的物种鉴定。
IF 5.5 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-07-02 DOI: 10.1111/1755-0998.13987
Camiel Doorenweerd, Michael San Jose, Luc Leblanc, Norman Barr, Scott M. Geib, Arthur Y. C. Chung, Julian R. Dupuis, Arni Ekayanti, Elaida Fiegalan, Kennantudawage S. Hemachandra, Mohammad Aftab Hossain, Chia-Lung Huang, Yu-Feng Hsu, Kimberly Y. Morris, Andi Maryani A. Mustapeng, Jerome Niogret, Thai Hong Pham, Nhien Thi Nguyen, Uda G. A. I. Sirisena, Terrence Todd, Daniel Rubinoff

The utility of a universal DNA ‘barcode’ fragment (658 base pairs of the Cytochrome C Oxidase I [COI] gene) has been established as a useful tool for species identification, and widely criticized as one for understanding the evolutionary history of a group. Large amounts of COI sequence data have been produced that hold promise for rapid species identification, for example, for biosecurity. The fruit fly tribe Dacini holds about a thousand species, of which 80 are pests of economic concern. We generated a COI reference library for 265 species of Dacini containing 5601 sequences that span most of the COI gene using circular consensus sequencing. We compared distance metrics versus monophyly assessments for species identification and although we found a ‘soft’ barcode gap around 2% pairwise distance, the exceptions to this rule dictate that a monophyly assessment is the only reliable method for species identification. We found that all fragments regularly used for Dacini fruit fly identification >450 base pairs long provide similar resolution. 11.3% of the species in our dataset were non-monophyletic in a COI tree, which is mostly due to species complexes. We conclude with recommendations for the future generation and use of COI libraries. We revise the generic assignment of Dacus transversus stat. rev. Hardy 1982, and Dacus perpusillus stat. rev. Drew 1971 and we establish Dacus maculipterus White 1998 syn. nov. as a junior synonym of Dacus satanas Liang et al. 1993.

通用 DNA "条形码 "片段(细胞色素 C 氧化酶 I[COI]基因的 658 个碱基对)已被确定为物种鉴定的有用工具,并被广泛批评为了解一个群体进化史的工具。大量的 COI 序列数据已经产生,有望用于快速物种鉴定,例如用于生物安全。果蝇科 Dacini 约有一千个物种,其中 80 个是经济上令人担忧的害虫。我们利用循环共识测序法为 265 种 Dacini 生成了一个 COI 参考文献库,其中包含 5601 个序列,涵盖了 COI 基因的大部分。我们比较了用于物种鉴定的距离度量和单系评估,尽管我们发现在 2% 的成对距离左右存在 "软 "条形码差距,但这一规则的例外情况决定了单系评估是物种鉴定的唯一可靠方法。我们发现,通常用于达契尼果蝇鉴定的长度大于 450 碱基对的所有片段都具有相似的分辨率。在我们的数据集中,有 11.3% 的物种在 COI 树中是非单系的,这主要是由于物种复合造成的。最后,我们对未来 COI 文库的生成和使用提出了建议。我们修改了 Dacus transversus stat.Dacus perpusillus stat.我们将 Dacus maculipterus White 1998 syn.
{"title":"Towards a better future for DNA barcoding: Evaluating monophyly- and distance-based species identification using COI gene fragments of Dacini fruit flies","authors":"Camiel Doorenweerd,&nbsp;Michael San Jose,&nbsp;Luc Leblanc,&nbsp;Norman Barr,&nbsp;Scott M. Geib,&nbsp;Arthur Y. C. Chung,&nbsp;Julian R. Dupuis,&nbsp;Arni Ekayanti,&nbsp;Elaida Fiegalan,&nbsp;Kennantudawage S. Hemachandra,&nbsp;Mohammad Aftab Hossain,&nbsp;Chia-Lung Huang,&nbsp;Yu-Feng Hsu,&nbsp;Kimberly Y. Morris,&nbsp;Andi Maryani A. Mustapeng,&nbsp;Jerome Niogret,&nbsp;Thai Hong Pham,&nbsp;Nhien Thi Nguyen,&nbsp;Uda G. A. I. Sirisena,&nbsp;Terrence Todd,&nbsp;Daniel Rubinoff","doi":"10.1111/1755-0998.13987","DOIUrl":"10.1111/1755-0998.13987","url":null,"abstract":"<p>The utility of a universal DNA ‘barcode’ fragment (658 base pairs of the <i>Cytochrome C Oxidase I</i> [COI] gene) has been established as a useful tool for species identification, and widely criticized as one for understanding the evolutionary history of a group. Large amounts of COI sequence data have been produced that hold promise for rapid species identification, for example, for biosecurity. The fruit fly tribe Dacini holds about a thousand species, of which 80 are pests of economic concern. We generated a COI reference library for 265 species of Dacini containing 5601 sequences that span most of the COI gene using circular consensus sequencing. We compared distance metrics versus monophyly assessments for species identification and although we found a ‘soft’ barcode gap around 2% pairwise distance, the exceptions to this rule dictate that a monophyly assessment is the only reliable method for species identification. We found that all fragments regularly used for Dacini fruit fly identification &gt;450 base pairs long provide similar resolution. 11.3% of the species in our dataset were non-monophyletic in a COI tree, which is mostly due to species complexes. We conclude with recommendations for the future generation and use of COI libraries. We revise the generic assignment of <i>Dacus transversus</i> stat. rev. Hardy 1982, and <i>Dacus perpusillus</i> stat. rev. Drew 1971 and we establish <i>Dacus maculipterus</i> White 1998 syn. nov. as a junior synonym of <i>Dacus satanas</i> Liang et al. 1993.</p>","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":null,"pages":null},"PeriodicalIF":5.5,"publicationDate":"2024-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141490259","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Primula edelbergii S-locus is an example of a jumping supergene Primula edelbergii S-locus是跳跃超级基因的一个例子。
IF 5.5 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-06-30 DOI: 10.1111/1755-0998.13988
Giacomo Potente, Narjes Yousefi, Barbara Keller, Emiliano Mora-Carrera, Péter Szövényi, Elena Conti

Research on supergenes, non-recombining genomic regions housing tightly linked genes that control complex phenotypes, has recently gained prominence in genomics. Heterostyly, a floral heteromorphism promoting outcrossing in several angiosperm families, is controlled by the S-locus supergene. The S-locus has been studied primarily in closely related Primula species and, more recently, in other groups that independently evolved heterostyly. However, it remains unknown whether genetic architecture and composition of the S-locus are maintained among species that share a common origin of heterostyly and subsequently diverged across larger time scales. To address this research gap, we present a chromosome-scale genome assembly of Primula edelbergii, a species that shares the same origin of heterostyly with Primula veris (whose S-locus has been characterized) but diverged from it 18 million years ago. Comparative genomic analyses between these two species allowed us to show, for the first time, that the S-locus can ‘jump’ (i.e. translocate) between chromosomes maintaining its function in controlling heterostyly. Additionally, we found that four S-locus genes were conserved but reshuffled within the supergene, seemingly without affecting their expression, thus we could not detect changes explaining the lack of self-incompatibility in P. edelbergii. Furthermore, we confirmed that the S-locus is not undergoing genetic degeneration. Finally, we investigated P. edelbergii evolutionary history within Ericales in terms of whole genome duplications and transposable element accumulation. In summary, our work provides a valuable resource for comparative analyses aimed at investigating the genetics of heterostyly and the pivotal role of supergenes in shaping the evolution of complex phenotypes.

超级基因是指含有控制复杂表型的紧密相连基因的非重组基因组区域,对超级基因的研究近来在基因组学中日益突出。异株性(Heterostyly)是几种被子植物家族中促进外交的一种花异形,由 S-locus超级基因控制。对 S-locus的研究主要集中在亲缘关系较近的报春花物种上,最近还研究了其他独立进化出异雌雄同株现象的类群。然而,S-locus 的遗传结构和组成是否在具有共同的异型起源并随后在更大时间尺度上发生分化的物种中得以维持,目前仍是未知数。为了填补这一研究空白,我们对 Primula edelbergii 进行了染色体组规模的基因组组装,该物种与 Primula veris(其 S-locus 已被鉴定)具有相同的异株起源,但在 1800 万年前与 Primula veris 发生了分化。通过对这两个物种的基因组进行比较分析,我们首次发现 S-locus可以在染色体之间 "跳跃"(即易位),从而保持其控制异型的功能。此外,我们还发现有四个 S-locus基因是保守的,但在超级基因中被重新排列,似乎并不影响其表达,因此我们无法检测到可解释 P. edelbergii 缺乏自交不亲和性的变化。此外,我们还证实 S-焦点没有发生基因退化。最后,我们从全基因组复制和转座元件积累的角度研究了 P. edelbergii 在 Ericales 中的进化史。总之,我们的工作为进行比较分析提供了宝贵的资源,这些分析旨在研究异型的遗传学以及超级基因在塑造复杂表型的进化过程中的关键作用。
{"title":"The Primula edelbergii S-locus is an example of a jumping supergene","authors":"Giacomo Potente,&nbsp;Narjes Yousefi,&nbsp;Barbara Keller,&nbsp;Emiliano Mora-Carrera,&nbsp;Péter Szövényi,&nbsp;Elena Conti","doi":"10.1111/1755-0998.13988","DOIUrl":"10.1111/1755-0998.13988","url":null,"abstract":"<p>Research on supergenes, non-recombining genomic regions housing tightly linked genes that control complex phenotypes, has recently gained prominence in genomics. Heterostyly, a floral heteromorphism promoting outcrossing in several angiosperm families, is controlled by the <i>S</i>-locus supergene. The <i>S</i>-locus has been studied primarily in closely related <i>Primula</i> species and, more recently, in other groups that independently evolved heterostyly. However, it remains unknown whether genetic architecture and composition of the <i>S</i>-locus are maintained among species that share a common origin of heterostyly and subsequently diverged across larger time scales. To address this research gap, we present a chromosome-scale genome assembly of <i>Primula edelbergii</i>, a species that shares the same origin of heterostyly with <i>Primula veris</i> (whose <i>S</i>-locus has been characterized) but diverged from it 18 million years ago. Comparative genomic analyses between these two species allowed us to show, for the first time, that the <i>S</i>-locus can ‘jump’ (i.e. translocate) between chromosomes maintaining its function in controlling heterostyly. Additionally, we found that four <i>S</i>-locus genes were conserved but reshuffled within the supergene, seemingly without affecting their expression, thus we could not detect changes explaining the lack of self-incompatibility in <i>P. edelbergii</i>. Furthermore, we confirmed that the <i>S</i>-locus is not undergoing genetic degeneration. Finally, we investigated <i>P. edelbergii</i> evolutionary history within Ericales in terms of whole genome duplications and transposable element accumulation. In summary, our work provides a valuable resource for comparative analyses aimed at investigating the genetics of heterostyly and the pivotal role of supergenes in shaping the evolution of complex phenotypes.</p>","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":null,"pages":null},"PeriodicalIF":5.5,"publicationDate":"2024-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/1755-0998.13988","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141464666","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Genome sequencing of Coryphaenoides yaquinae reveals convergent and lineage-specific molecular evolution in deep-sea adaptation Coryphaenoides yaquinae 的基因组测序揭示了深海适应性的趋同和特定世系的分子进化。
IF 5.5 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-06-30 DOI: 10.1111/1755-0998.13989
Wenhao Li, Jie Song, Huaming Tu, Shouwen Jiang, Binbin Pan, Jiazhen Li, Yongpeng Zhao, Liangbiao Chen, Qianghua Xu

Abyssal (3501–6500 m) and hadal (>6500 m) fauna evolve under harsh abiotic stresses, characterized by high hydrostatic pressure, darkness and food shortage, providing unique opportunities to investigate mechanisms underlying environmental adaptation. Genomes of several hadal species have recently been reported. However, the genetic adaptation of deep sea species across a broad spectrum of ocean depths has yet to be thoroughly investigated, due to the challenges imposed by collecting the deep sea species. To elucidate the correlation between genetic innovation and vertical distribution, we generated a chromosome-level genome assembly of the macrourids Coryphaenoides yaquinae, which is widely distributed in the abyssal/hadal zone ranging from 3655 to 7259 m in depth. Genomic comparisons among shallow, abyssal and hadal-living species identified idiosyncratic and convergent genetic alterations underlying the extraordinary adaptations of deep-sea species including light perception, circadian regulation, hydrostatic pressure and hunger tolerance. The deep-sea fishes (Coryphaenoides Sp. and Pseudoliparis swirei) venturing into various ocean depths independently have undergone convergent amino acid substitutions in multiple proteins such as rhodopsin 1, pancreatic and duodenal homeobox 1 and melanocortin 4 receptor which are known or verified in zebrafish to be related with vision adaptation and energy expenditure. Convergent evolution events were also identified in heat shock protein 90 beta family member 1 and valosin-containing protein genes known to be related to hydrostatic pressure adaptation specifically in fishes found around the hadal range. The uncovering of the molecular convergence among the deep-sea species shed new light on the common genetic innovations required for deep-sea adaptation by the fishes.

深海(3501-6500 米)和黑鳕(大于 6500 米)动物群在严酷的非生物压力下进化,其特点是高静水压、黑暗和食物短缺,这为研究环境适应机制提供了独特的机会。最近报告了几个黑线鳕物种的基因组。然而,由于收集深海物种所面临的挑战,对深海物种在大洋深度范围内的遗传适应性还有待深入研究。为了阐明遗传创新与垂直分布之间的相关性,我们对广泛分布于水深3655米至7259米的深海/咸水区的巨型栉水母(Coryphaenoides yaquinae)进行了染色体组水平的基因组组装。通过对生活在浅海、深海和咸水层的物种进行基因组比较,发现了深海物种在光感、昼夜节律调节、静水压力和耐饥饿等非凡适应性背后的特异性和趋同性基因改变。进入不同海洋深度的深海鱼类(Coryphaenoides Sp.和 Pseudoliparis swirei)在多个蛋白质中经历了趋同的氨基酸替代,如犀牛蛋白 1、胰腺和十二指肠同工酶 1 和黑色素皮质素 4 受体,这些蛋白质在斑马鱼中已知或已验证与视觉适应和能量消耗有关。在热休克蛋白 90 beta 家族成员 1 和含缬氨酸蛋白基因中也发现了趋同进化事件,已知这些基因与静水压适应有关,特别是在黑线范围附近的鱼类中。深海物种之间分子趋同的发现为鱼类适应深海所需的共同遗传创新提供了新的线索。
{"title":"Genome sequencing of Coryphaenoides yaquinae reveals convergent and lineage-specific molecular evolution in deep-sea adaptation","authors":"Wenhao Li,&nbsp;Jie Song,&nbsp;Huaming Tu,&nbsp;Shouwen Jiang,&nbsp;Binbin Pan,&nbsp;Jiazhen Li,&nbsp;Yongpeng Zhao,&nbsp;Liangbiao Chen,&nbsp;Qianghua Xu","doi":"10.1111/1755-0998.13989","DOIUrl":"10.1111/1755-0998.13989","url":null,"abstract":"<p>Abyssal (3501–6500 m) and hadal (&gt;6500 m) fauna evolve under harsh abiotic stresses, characterized by high hydrostatic pressure, darkness and food shortage, providing unique opportunities to investigate mechanisms underlying environmental adaptation. Genomes of several hadal species have recently been reported. However, the genetic adaptation of deep sea species across a broad spectrum of ocean depths has yet to be thoroughly investigated, due to the challenges imposed by collecting the deep sea species. To elucidate the correlation between genetic innovation and vertical distribution, we generated a chromosome-level genome assembly of the macrourids <i>Coryphaenoides yaquinae</i>, which is widely distributed in the abyssal/hadal zone ranging from 3655 to 7259 m in depth. Genomic comparisons among shallow, abyssal and hadal-living species identified idiosyncratic and convergent genetic alterations underlying the extraordinary adaptations of deep-sea species including light perception, circadian regulation, hydrostatic pressure and hunger tolerance. The deep-sea fishes (<i>Coryphaenoides</i> Sp. and <i>Pseudoliparis swirei</i>) venturing into various ocean depths independently have undergone convergent amino acid substitutions in multiple proteins such as rhodopsin 1, pancreatic and duodenal homeobox 1 and melanocortin 4 receptor which are known or verified in zebrafish to be related with vision adaptation and energy expenditure. Convergent evolution events were also identified in heat shock protein 90 beta family member 1 and valosin-containing protein genes known to be related to hydrostatic pressure adaptation specifically in fishes found around the hadal range. The uncovering of the molecular convergence among the deep-sea species shed new light on the common genetic innovations required for deep-sea adaptation by the fishes.</p>","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":null,"pages":null},"PeriodicalIF":5.5,"publicationDate":"2024-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141464665","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Testing the applicability of environmental DNA metabarcoding to landscape genetics 测试环境 DNA 代谢编码对景观遗传学的适用性。
IF 5.5 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-06-24 DOI: 10.1111/1755-0998.13990
Souta Nakajima, Kenji Tsuri

Landscape genetics is a field dealing with local genetic differences and contributes to strategic conservation planning. Recently, environmental DNA (eDNA) metabarcoding has proven useful not only for detecting species but also for assessing genetic diversity and genetic structure on a large scale such as in phylogeography. However, it remains unclear whether eDNA analysis also has sufficient power to perform the landscape genetics, which focuses on a local scale. To reveal the applicability of eDNA to landscape genetics, we conducted an eDNA metabarcoding analysis of the mitochondrial DNA D-loop region of the fluvial sculpin Cottus nozawae in the upper Sorachi River in Japan and compared the results with inferences based on traditional tissue-based approaches by the same D-loop region and genome-wide SNP data. As a result, the spatial distribution of haplotypes was generally consistent between the eDNA- and tissue-based approaches. In addition, the genetic differentiation statistics calculated using eDNA and tissue samples were highly correlated when comparing both in the D-loop region. The removal of low-frequency reads or the conversion to semi-quantitative rankings of eDNA data did not alter the correlation of genetic diversity and differentiation statistics with tissue-based approaches much. Finally, we confirmed that analyses using eDNA data can reveal patterns such as isolation-by-distance shown in previous studies on this species, indicating the applicability of eDNA to basic landscape genetics. Even though some limitations remain, eDNA may have great potential for conducting basic landscape genetics.

景观遗传学是一个涉及地方遗传差异的领域,有助于制定战略性保护规划。近来,环境 DNA(eDNA)代谢编码已被证明不仅有助于检测物种,还有助于评估大规模的遗传多样性和遗传结构,如在系统地理学中。然而,eDNA 分析是否也有足够的能力来进行景观遗传学研究,这一点仍不清楚。为了揭示 eDNA 在景观遗传学中的适用性,我们对日本 Sorachi 河上游的河口鳞栉鱼 Cottus nozawae 的线粒体 DNA D 环区进行了 eDNA 代谢编码分析,并将结果与传统的基于组织的方法通过相同的 D 环区和全基因组 SNP 数据进行的推断进行了比较。结果显示,单倍型的空间分布在基于 eDNA 和基于组织的方法中基本一致。此外,使用 eDNA 和组织样本计算出的遗传分化统计量在 D 环区域进行比较时也高度相关。去除低频读数或将 eDNA 数据转换为半定量排序并不会对基于组织方法的遗传多样性和分化统计的相关性造成太大的改变。最后,我们证实,使用 eDNA 数据进行分析可以揭示一些模式,如以前对该物种的研究中显示的距离隔离模式,这表明 eDNA 适用于基本的景观遗传学。尽管还存在一些局限性,但 eDNA 在进行基础景观遗传学研究方面可能具有很大的潜力。
{"title":"Testing the applicability of environmental DNA metabarcoding to landscape genetics","authors":"Souta Nakajima,&nbsp;Kenji Tsuri","doi":"10.1111/1755-0998.13990","DOIUrl":"10.1111/1755-0998.13990","url":null,"abstract":"<p>Landscape genetics is a field dealing with local genetic differences and contributes to strategic conservation planning. Recently, environmental DNA (eDNA) metabarcoding has proven useful not only for detecting species but also for assessing genetic diversity and genetic structure on a large scale such as in phylogeography. However, it remains unclear whether eDNA analysis also has sufficient power to perform the landscape genetics, which focuses on a local scale. To reveal the applicability of eDNA to landscape genetics, we conducted an eDNA metabarcoding analysis of the mitochondrial DNA D-loop region of the fluvial sculpin <i>Cottus nozawae</i> in the upper Sorachi River in Japan and compared the results with inferences based on traditional tissue-based approaches by the same D-loop region and genome-wide SNP data. As a result, the spatial distribution of haplotypes was generally consistent between the eDNA- and tissue-based approaches. In addition, the genetic differentiation statistics calculated using eDNA and tissue samples were highly correlated when comparing both in the D-loop region. The removal of low-frequency reads or the conversion to semi-quantitative rankings of eDNA data did not alter the correlation of genetic diversity and differentiation statistics with tissue-based approaches much. Finally, we confirmed that analyses using eDNA data can reveal patterns such as isolation-by-distance shown in previous studies on this species, indicating the applicability of eDNA to basic landscape genetics. Even though some limitations remain, eDNA may have great potential for conducting basic landscape genetics.</p>","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":null,"pages":null},"PeriodicalIF":5.5,"publicationDate":"2024-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141454248","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Orchidinae-205: A new genome-wide custom bait set for studying the evolution, systematics, and trade of terrestrial orchids 兰科-205:用于研究陆生兰花的进化、系统学和贸易的新的全基因组定制诱饵集。
IF 5.5 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2024-06-20 DOI: 10.1111/1755-0998.13986
Margaretha A. Veltman, Bastien Anthoons, Audun Schrøder-Nielsen, Barbara Gravendeel, Hugo J. de Boer

Terrestrial orchids are a group of genetically understudied, yet culturally and economically important plants. The Orchidinae tribe contains many species that produce edible tubers that are used for the production of traditional delicacies collectively called ‘salep’. Overexploitation of wild orchids in the Eastern Mediterranean and Western Asia threatens to drive many of these species to extinction, but cost-effective tools for monitoring their trade are currently lacking. Here we present a custom bait kit for target enrichment and sequencing of 205 novel genetic markers that are tailored to phylogenomic applications in Orchidinae s.l. A subset of 31 markers capture genes putatively involved in the production of glucomannan, a water-soluble polysaccharide that gives salep its distinctive properties. We tested the kit on 73 taxa native to the area, demonstrating universally high locus recovery irrespective of species identity, that exceeds the total sequence length obtained with alternative kits currently available. Phylogenetic inference with concatenation and coalescent approaches was robust and showed high levels of support for most clades, including some which were previously unresolved. Resolution for hybridizing and recently radiated lineages remains difficult, but could be further improved by analysing multiple haplotypes and the non-exonic sequences captured by our kit, with the promise to shed new light on the evolution of enigmatic taxa with a complex speciation history. Offering a step-up from traditional barcoding and universal markers, the genome-wide custom loci targeted by Orchidinae-205 are a valuable new resource to study the evolution, systematics and trade of terrestrial orchids.

陆生兰花是一类基因研究不足,但在文化和经济上却非常重要的植物。兰科植物中有许多物种生产可食用的块茎,这些块茎可用于生产统称为 "沙利普 "的传统美食。东地中海和西亚地区对野生兰花的过度开发有可能导致其中许多物种濒临灭绝,但目前还缺乏具有成本效益的工具来监测它们的贸易。31 个标记子集捕获了可能参与生产葡甘聚糖的基因,葡甘聚糖是一种水溶性多糖,赋予了沙利叶菊与众不同的特性。我们对该地区的 73 个原生类群进行了测试,结果表明,无论物种特征如何,该试剂盒的基因座恢复率普遍很高,超过了目前其他试剂盒获得的总序列长度。使用连接和聚合方法进行的系统发育推断非常稳健,对大多数支系(包括一些以前未解决的支系)的支持率都很高。对杂交和新近辐射世系的解析仍然很困难,但通过分析多单倍型和我们的试剂盒捕获的非外显序列,可以进一步提高解析能力,有望为具有复杂物种演化历史的神秘类群的演化提供新的线索。与传统的条形码和通用标记相比,Orchidinae-205 所针对的全基因组定制位点是研究陆生兰花的进化、系统学和贸易的宝贵新资源。
{"title":"Orchidinae-205: A new genome-wide custom bait set for studying the evolution, systematics, and trade of terrestrial orchids","authors":"Margaretha A. Veltman,&nbsp;Bastien Anthoons,&nbsp;Audun Schrøder-Nielsen,&nbsp;Barbara Gravendeel,&nbsp;Hugo J. de Boer","doi":"10.1111/1755-0998.13986","DOIUrl":"10.1111/1755-0998.13986","url":null,"abstract":"<p>Terrestrial orchids are a group of genetically understudied, yet culturally and economically important plants. The Orchidinae tribe contains many species that produce edible tubers that are used for the production of traditional delicacies collectively called ‘salep’. Overexploitation of wild orchids in the Eastern Mediterranean and Western Asia threatens to drive many of these species to extinction, but cost-effective tools for monitoring their trade are currently lacking. Here we present a custom bait kit for target enrichment and sequencing of 205 novel genetic markers that are tailored to phylogenomic applications in Orchidinae s.l. A subset of 31 markers capture genes putatively involved in the production of glucomannan, a water-soluble polysaccharide that gives salep its distinctive properties. We tested the kit on 73 taxa native to the area, demonstrating universally high locus recovery irrespective of species identity, that exceeds the total sequence length obtained with alternative kits currently available. Phylogenetic inference with concatenation and coalescent approaches was robust and showed high levels of support for most clades, including some which were previously unresolved. Resolution for hybridizing and recently radiated lineages remains difficult, but could be further improved by analysing multiple haplotypes and the non-exonic sequences captured by our kit, with the promise to shed new light on the evolution of enigmatic taxa with a complex speciation history. Offering a step-up from traditional barcoding and universal markers, the genome-wide custom loci targeted by Orchidinae-205 are a valuable new resource to study the evolution, systematics and trade of terrestrial orchids.</p>","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":null,"pages":null},"PeriodicalIF":5.5,"publicationDate":"2024-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/1755-0998.13986","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141425907","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Molecular Ecology Resources
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1