Molecular Ecology Resources最新文献_第8页

Refining Zooplankton Diet Composition Studies Over Short and Long Time Scales by Combining 18S Metabarcoding With Fatty Acid Analyses 结合18S元条形码和脂肪酸分析，改进浮游动物饮食组成在短期和长期尺度上的研究。

IF 5.5 1区生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY

Molecular Ecology Resources

Pub Date : 2025-08-18 DOI: 10.1111/1755-0998.70030

Nora-Charlotte Pauli, Katja Metfies, Stefan Neuhaus, Martin Graeve, Alison C. Cleary, Morten H. Iversen, Bettina Meyer

Understanding diet composition is essential for unravelling trophic interactions in aquatic ecosystems. DNA metabarcoding, utilising various variable regions of the 18S rRNA gene, is increasingly employed to investigate zooplankton diet composition. However, accurate results depend on rapid inactivation of digestive enzymes and DNA nucleases through proper sample processing and preservation. In this study, we compare the prey communities of Antarctic krill retrieved from the 18S variable regions V4 and V7 and assess how different processing treatments affect the detected prey composition of both krill and salps. Our findings highlight the critical importance of prompt sample processing for species with highly efficient digestive enzymes, such as krill, to preserve rapidly digested prey, including gelatinous plankton. Comparative analyses of the V4 and V7 regions revealed significantly different prey communities within the same krill samples, indicating that these regions may not be suitable for direct comparisons within or across studies. To complement molecular approaches, we also analyse fatty acids (FA) as trophic markers which provide insights into dietary habits over both short and long time scales. By comparing FA signals from stomach and tissue samples of the same krill and salp individuals, we identified significant differences in trophic markers representing different plankton groups. These findings emphasise the necessity of separating digestive tract from tissue to distinguish between short- and long-term diet signals. Furthermore, integrating FA analysis with metabarcoding offers valuable insights into zooplankton digestion efficiency across taxonomic levels. This combined approach enhances our understanding of zooplankton feeding ecology and trophic interactions in marine ecosystems.

了解饮食组成对于揭示水生生态系统中的营养相互作用至关重要。DNA元条形码利用18S rRNA基因的各种可变区域，越来越多地用于研究浮游动物的饮食组成。然而，准确的结果取决于通过适当的样品处理和保存，使消化酶和DNA核酸酶迅速失活。本研究比较了从18S可变区V4和V7采集到的南极磷虾的猎物群落，评估了不同处理方式对磷虾和海带探测到的猎物组成的影响。我们的研究结果强调了对具有高效消化酶的物种（如磷虾）进行快速样品处理的重要性，以保存快速消化的猎物，包括凝胶状浮游生物。V4和V7区域的对比分析显示，同一磷虾样本中猎物群落存在显著差异，表明这些区域可能不适合在研究内部或跨研究进行直接比较。为了补充分子方法，我们还分析了脂肪酸（FA）作为营养标志物，它提供了短期和长期饮食习惯的见解。通过比较同一磷虾和salp个体的胃和组织样本的FA信号，我们发现了代表不同浮游生物群体的营养标志物的显著差异。这些发现强调了将消化道从组织中分离出来以区分短期和长期饮食信号的必要性。此外，将FA分析与元条形码相结合，可以跨分类水平对浮游动物消化效率提供有价值的见解。这种综合方法增强了我们对海洋生态系统中浮游动物摄食生态学和营养相互作用的理解。

{"title":"Refining Zooplankton Diet Composition Studies Over Short and Long Time Scales by Combining 18S Metabarcoding With Fatty Acid Analyses","authors":"Nora-Charlotte Pauli, Katja Metfies, Stefan Neuhaus, Martin Graeve, Alison C. Cleary, Morten H. Iversen, Bettina Meyer","doi":"10.1111/1755-0998.70030","DOIUrl":"10.1111/1755-0998.70030","url":null,"abstract":"Understanding diet composition is essential for unravelling trophic interactions in aquatic ecosystems. DNA metabarcoding, utilising various variable regions of the 18S rRNA gene, is increasingly employed to investigate zooplankton diet composition. However, accurate results depend on rapid inactivation of digestive enzymes and DNA nucleases through proper sample processing and preservation. In this study, we compare the prey communities of Antarctic krill retrieved from the 18S variable regions V4 and V7 and assess how different processing treatments affect the detected prey composition of both krill and salps. Our findings highlight the critical importance of prompt sample processing for species with highly efficient digestive enzymes, such as krill, to preserve rapidly digested prey, including gelatinous plankton. Comparative analyses of the V4 and V7 regions revealed significantly different prey communities within the same krill samples, indicating that these regions may not be suitable for direct comparisons within or across studies. To complement molecular approaches, we also analyse fatty acids (FA) as trophic markers which provide insights into dietary habits over both short and long time scales. By comparing FA signals from stomach and tissue samples of the same krill and salp individuals, we identified significant differences in trophic markers representing different plankton groups. These findings emphasise the necessity of separating digestive tract from tissue to distinguish between short- and long-term diet signals. Furthermore, integrating FA analysis with metabarcoding offers valuable insights into zooplankton digestion efficiency across taxonomic levels. This combined approach enhances our understanding of zooplankton feeding ecology and trophic interactions in marine ecosystems.","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":"25 8","pages":""},"PeriodicalIF":5.5,"publicationDate":"2025-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/1755-0998.70030","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144870589","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

DeepKin: Predicting Relatedness From Low-Coverage Genomes and Palaeogenomes With Convolutional Neural Networks DeepKin：用卷积神经网络预测低覆盖率基因组和古基因组的相关性。

IF 5.5 1区生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY

Molecular Ecology Resources

Pub Date : 2025-08-18 DOI: 10.1111/1755-0998.70032

Merve N. Güler, Ardan Yılmaz, Büşra Katırcıoğlu, Sarp Kantar, Tara Ekin Ünver, Kıvılcım Başak Vural, N. Ezgi Altınışık, Emre Akbas, Mehmet Somel

DeepKin is a novel tool designed to predict relatedness from genomic data using convolutional neural networks (CNNs). Traditional methods for estimating relatedness often struggle when genomic data is limited, as with palaeogenomes and degraded forensic samples. DeepKin addresses this challenge by leveraging two CNN models, which are trained solely on simulated genomic data, to classify relatedness up to the third degree and to identify parent–offspring and sibling pairs. Our benchmarking shows DeepKin performs comparably or better than the widely used tool READv2. We validated DeepKin, which uses PLINK's .map and .ped files as input, on empirical palaeogenomes from three archaeological sites, demonstrating its robustness and adaptability across different genetic backgrounds, with accuracy > 90% above 10 K shared SNPs. By capturing information across genomic segments, DeepKin offers a new methodological path for relatedness estimation in settings with highly degraded samples, with applications in ancient DNA, as well as forensic and conservation genetics.

DeepKin是一种新颖的工具，旨在使用卷积神经网络（cnn）从基因组数据中预测亲缘关系。当基因组数据有限时，传统的估计亲缘关系的方法往往会遇到困难，比如古基因组和退化的法医样本。DeepKin通过利用两个CNN模型来解决这一挑战，这两个模型仅在模拟基因组数据上进行训练，可以将亲缘关系分类到第三度，并识别父母-后代和兄弟姐妹对。我们的基准测试显示，DeepKin的性能与广泛使用的工具READv2相当，甚至更好。我们验证了使用PLINK的DeepKin。地图和。以三个考古遗址的经验古基因组为输入，证明了该方法在不同遗传背景下的鲁棒性和适应性，在10 K以上共享snp的准确率为90%。通过捕获跨基因组片段的信息，DeepKin提供了一种新的方法路径，可以在高度降解的样本设置中进行相关性估计，并应用于古代DNA，以及法医和保护遗传学。

{"title":"DeepKin: Predicting Relatedness From Low-Coverage Genomes and Palaeogenomes With Convolutional Neural Networks","authors":"Merve N. Güler, Ardan Yılmaz, Büşra Katırcıoğlu, Sarp Kantar, Tara Ekin Ünver, Kıvılcım Başak Vural, N. Ezgi Altınışık, Emre Akbas, Mehmet Somel","doi":"10.1111/1755-0998.70032","DOIUrl":"10.1111/1755-0998.70032","url":null,"abstract":"<div>\u0000 \u0000 DeepKin is a novel tool designed to predict relatedness from genomic data using convolutional neural networks (CNNs). Traditional methods for estimating relatedness often struggle when genomic data is limited, as with palaeogenomes and degraded forensic samples. DeepKin addresses this challenge by leveraging two CNN models, which are trained solely on simulated genomic data, to classify relatedness up to the third degree and to identify parent–offspring and sibling pairs. Our benchmarking shows DeepKin performs comparably or better than the widely used tool READv2. We validated DeepKin, which uses PLINK's .map and .ped files as input, on empirical palaeogenomes from three archaeological sites, demonstrating its robustness and adaptability across different genetic backgrounds, with accuracy > 90% above 10 K shared SNPs. By capturing information across genomic segments, DeepKin offers a new methodological path for relatedness estimation in settings with highly degraded samples, with applications in ancient DNA, as well as forensic and conservation genetics.\u0000 </div>","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":"25 8","pages":""},"PeriodicalIF":5.5,"publicationDate":"2025-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144870587","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Comparative Decay Dynamics and Detectability of eDNA and eRNA in Connected and Isolated Freshwater Mesocosms Using Digital PCR 利用数字PCR比较连接和分离的淡水中游生物中eDNA和eRNA的衰减动力学和可检测性。

IF 5.5 1区生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY

Molecular Ecology Resources

Pub Date : 2025-08-13 DOI: 10.1111/1755-0998.70028

Wendy B. Morgado-Gamero, Orianne Tournayre, Melania E. Cristescu

Efficient use of environmental nucleic acids (eNAs) in freshwater biodiversity monitoring requires understanding their degradation and detectability in interconnected ecosystems. We employed a novel field-scale assay to compare environmental DNA (eDNA) and environmental RNA (eRNA) decay rates and detectability across four genetic markers (16S, 18S, COI and LDHA) in connected and isolated 1000-L mesocosms containing natural planktonic assemblages. This design provides ecologically relevant and complex settings to assess how connectivity influences the detectability of eNA over time. Isolated and head mesocosms were spiked with eNAs from cultured Daphnia pulex, absent from the water source, while downstream mesocosms received eNAs via unidirectional water transfers. Using digital PCR (dPCR), we captured fine-scale temporal patterns across mitochondrial and nuclear markers and transcript types (mRNA and rRNA), an approach rarely combined in previous research. eRNA degraded significantly faster than eDNA across markers and mesocosm types. Among RNA types, mRNA (COI, LDHA) degraded faster than rRNA (16S, 18S). eRNA followed a uniform monophasic decay pattern, whereas eDNA displayed biphasic decay for nuclear markers and monophasic decay for mitochondrial markers. eNA decay rates in this field-relevant mesocosm network exceeded those from laboratory scale. While decay rates remained consistent across networks, detectability declined with dilution. Even after a 10,000-fold dilution, both eNAs were detected in terminal mesocosms, demonstrating effective transport across the network. Although RNA degrades rapidly, high detectability was achieved across diverse dilutions using dPCR, highlighting eRNA's potential for detecting active biological communities in freshwater systems.

环境核酸在淡水生物多样性监测中的有效利用需要了解其在相互关联的生态系统中的退化和可探测性。我们采用了一种新颖的野外规模试验，比较了含有天然浮游生物组合的连接和分离的1000 l中游生物中环境DNA （eDNA）和环境RNA （eRNA）在四种遗传标记（16S, 18S， COI和LDHA）上的衰减率和可检测性。该设计提供了生态相关和复杂的设置，以评估连接如何随时间影响eNA的可检测性。在没有水源的情况下，分离的和头部的水蚤中胚层注入来自培养水蚤的eNAs，而下游的中胚层则通过单向水转移接受eNAs。使用数字PCR (dPCR)，我们捕获了线粒体和核标记物以及转录物类型（mRNA和rRNA）的精细尺度时间模式，这在以前的研究中很少结合使用。eRNA的降解速度明显快于eDNA。在RNA类型中，mRNA （COI， LDHA）的降解速度快于rRNA （16S, 18S）。eRNA遵循统一的单相衰变模式，而eDNA在核标记上表现为双相衰变，在线粒体标记上表现为单相衰变。在这个与野外相关的中尺度网络中，eNA的衰减率超过了实验室尺度的衰减率。虽然衰减率在网络中保持一致，但可探测性随着稀释而下降。即使在稀释了10,000倍后，这两种eNAs都在终端中观细胞中被检测到，证明了在网络中的有效传输。尽管RNA降解迅速，但使用dPCR在不同稀释度下实现了高可检测性，这突出了eRNA在检测淡水系统中活跃生物群落方面的潜力。

{"title":"Comparative Decay Dynamics and Detectability of eDNA and eRNA in Connected and Isolated Freshwater Mesocosms Using Digital PCR","authors":"Wendy B. Morgado-Gamero, Orianne Tournayre, Melania E. Cristescu","doi":"10.1111/1755-0998.70028","DOIUrl":"10.1111/1755-0998.70028","url":null,"abstract":"Efficient use of environmental nucleic acids (eNAs) in freshwater biodiversity monitoring requires understanding their degradation and detectability in interconnected ecosystems. We employed a novel field-scale assay to compare environmental DNA (eDNA) and environmental RNA (eRNA) decay rates and detectability across four genetic markers (16S, 18S, COI and LDHA) in connected and isolated 1000-L mesocosms containing natural planktonic assemblages. This design provides ecologically relevant and complex settings to assess how connectivity influences the detectability of eNA over time. Isolated and head mesocosms were spiked with eNAs from cultured Daphnia pulex, absent from the water source, while downstream mesocosms received eNAs via unidirectional water transfers. Using digital PCR (dPCR), we captured fine-scale temporal patterns across mitochondrial and nuclear markers and transcript types (mRNA and rRNA), an approach rarely combined in previous research. eRNA degraded significantly faster than eDNA across markers and mesocosm types. Among RNA types, mRNA (COI, LDHA) degraded faster than rRNA (16S, 18S). eRNA followed a uniform monophasic decay pattern, whereas eDNA displayed biphasic decay for nuclear markers and monophasic decay for mitochondrial markers. eNA decay rates in this field-relevant mesocosm network exceeded those from laboratory scale. While decay rates remained consistent across networks, detectability declined with dilution. Even after a 10,000-fold dilution, both eNAs were detected in terminal mesocosms, demonstrating effective transport across the network. Although RNA degrades rapidly, high detectability was achieved across diverse dilutions using dPCR, highlighting eRNA's potential for detecting active biological communities in freshwater systems.","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":"25 8","pages":""},"PeriodicalIF":5.5,"publicationDate":"2025-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/1755-0998.70028","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144833545","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Unravelling the Web of Life: Incomplete Lineage Sorting and Hybridisation as Primary Mechanisms Over Polyploidisation in the Evolutionary Dynamics of Pear Species 揭开生命之网：不完全谱系分选和杂交是梨物种多倍体化进化动力学的主要机制。

IF 5.5 1区生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY

Molecular Ecology Resources

Pub Date : 2025-08-12 DOI: 10.1111/1755-0998.70029

Ze-Tao Jin, Xiao-Hua Lin, Dai-Kun Ma, Si-Yu Xie, Jia-Xin Huang, Chen Ren, Liang Zhao, Lei Duan, Chao Xu, Richard G. J. Hodel, Jun Wu, Bin-Bin Liu

The traditional Tree of Life (ToL) model is increasingly challenged by the Web of Life (WoL) paradigm, which offers a more accurate depiction of organismal phylogeny, particularly in light of the incongruences often observed between gene and species trees. However, the absence of a standardised method for resolving evolutionary mechanisms – such as Incomplete Lineage Sorting (ILS), hybridisation, introgression, polyploidisation, and whole-genome duplication – remains a significant obstacle in defining the WoL. Characterised by extensive hybridisation events, the pear genus Pyrus provides an ideal model for exploring these complexities. In this study, we present a Step-by-Step Exclusion (SSE) approach for investigating the evolutionary pathways of Pyrus, based on Whole Genome Sequencing (WGS) and Deep Genome Skimming (DGS) data, and our results demonstrate that: (1) ILS, rather than polyploidisation, plays a dominant role in the origination of Pyrus; (2) the two subgenera of Pyrus followed independent evolutionary paths, influenced by geographical barriers formed through the uplift of the Tibetan Plateau and increased aridity in Central Asia; (3) both ILS and hybridisation have driven the diversification of subg. Pashia, while hybridisation alone has shaped the reticulate evolution of subg. Pyrus; (4) the establishment of the Silk Road during the Han Dynasty facilitated genetic exchange between subg. Pyrus and subg. Pashia. The SSE approach offers a versatile framework for studying the evolutionary mechanisms underlying the WoL paradigm.

传统的生命之树（ToL）模型正日益受到生命之网（WoL）范式的挑战，后者提供了更准确的生物系统发育描述，特别是考虑到基因树和物种树之间经常观察到的不一致。然而，缺乏一种标准化的方法来解决进化机制——如不完全谱系分类（ILS）、杂交、渗入、多倍体化和全基因组复制——仍然是定义WoL的一个重大障碍。梨属的特点是广泛的杂交事件，为探索这些复杂性提供了理想的模型。在这项研究中，我们基于全基因组测序（WGS）和深度基因组撇读（DGS）数据，提出了一种逐步排除（SSE）方法来研究梨的进化途径，我们的结果表明：(1)ILS而不是多倍化在梨的起源中起主导作用；(2)受青藏高原隆升和中亚地区干旱加剧所形成的地理屏障的影响，梨属的两个亚属具有独立的进化路径；(3) ILS和杂交都推动了subg的多样化。Pashia，而杂交单独塑造了subg的网状进化。Pyrus;(4)汉代丝绸之路的建立促进了各民族之间的基因交流。梨和子。Pashia。SSE方法为研究WoL范式下的进化机制提供了一个通用的框架。

{"title":"Unravelling the Web of Life: Incomplete Lineage Sorting and Hybridisation as Primary Mechanisms Over Polyploidisation in the Evolutionary Dynamics of Pear Species","authors":"Ze-Tao Jin, Xiao-Hua Lin, Dai-Kun Ma, Si-Yu Xie, Jia-Xin Huang, Chen Ren, Liang Zhao, Lei Duan, Chao Xu, Richard G. J. Hodel, Jun Wu, Bin-Bin Liu","doi":"10.1111/1755-0998.70029","DOIUrl":"10.1111/1755-0998.70029","url":null,"abstract":"<div>\u0000 \u0000 The traditional Tree of Life (ToL) model is increasingly challenged by the Web of Life (WoL) paradigm, which offers a more accurate depiction of organismal phylogeny, particularly in light of the incongruences often observed between gene and species trees. However, the absence of a standardised method for resolving evolutionary mechanisms – such as Incomplete Lineage Sorting (ILS), hybridisation, introgression, polyploidisation, and whole-genome duplication – remains a significant obstacle in defining the WoL. Characterised by extensive hybridisation events, the pear genus Pyrus provides an ideal model for exploring these complexities. In this study, we present a Step-by-Step Exclusion (SSE) approach for investigating the evolutionary pathways of Pyrus, based on Whole Genome Sequencing (WGS) and Deep Genome Skimming (DGS) data, and our results demonstrate that: (1) ILS, rather than polyploidisation, plays a dominant role in the origination of Pyrus; (2) the two subgenera of Pyrus followed independent evolutionary paths, influenced by geographical barriers formed through the uplift of the Tibetan Plateau and increased aridity in Central Asia; (3) both ILS and hybridisation have driven the diversification of subg. Pashia, while hybridisation alone has shaped the reticulate evolution of subg. Pyrus; (4) the establishment of the Silk Road during the Han Dynasty facilitated genetic exchange between subg. Pyrus and subg. Pashia. The SSE approach offers a versatile framework for studying the evolutionary mechanisms underlying the WoL paradigm.\u0000 </div>","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":"25 8","pages":""},"PeriodicalIF":5.5,"publicationDate":"2025-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144833567","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

High Imputation Accuracy Can Be Achieved Using a Small Reference Panel in a Natural Population With Low Genetic Diversity 在遗传多样性较低的自然种群中，使用一个小的参考群体可以获得较高的输入精度。

IF 5.5 1区生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY

Molecular Ecology Resources

Pub Date : 2025-08-12 DOI: 10.1111/1755-0998.70024

Hui Zhen Tan, Katarina C. Stuart, Tram Vi, Annabel Whibley, Sarah Bailey, Patricia Brekke, Anna W. Santure

Genotype imputation, the inference of missing genotypes using a reference set of population haplotypes, is a cost-effective tool for improving the quality and quantity of genetic datasets. Imputation is usually applied to large and well-characterised datasets of humans and livestock, even though it could also benefit smaller natural populations. This study aims to understand the best practices and effectiveness of imputation with a small reference panel for species with low genetic diversity, using a case study of a population of the hihi/stitchbird (Notiomystis cincta). We used a leave-one-out method to test imputation on 30 high-coverage hihi individuals where SNPs were masked before being imputed with Beagle v5.4. Imputation accuracy was measured using r², the correlation between imputed and ground truth genotype dosages. We tested combinations of five imputation parameters, the inclusion of two linkage maps, reference panels of different sizes and compositions and targets of various SNP densities and sporadic missingness. We achieved mean r² exceeding 0.95 in most tests from a small reference panel of high-fecundity individuals. Imputation accuracy was not improved by including a linkage map and decreased at very low SNP densities. Imputed SNPs were filtered using r² to assess downstream heterozygosity calculations, the site frequency spectrum (SFS) and inference of runs of homozygosity (ROHs). We found that filtering and SNP density greatly affected heterozygosity and SFS at low SNP densities but that ROH inference was relatively robust to both. We provide a template for testing and optimising imputation in other wild populations.

基因型插入是利用参考群体单倍型推断缺失的基因型，是提高遗传数据集质量和数量的一种经济有效的工具。代入通常应用于人类和牲畜的大型和特征良好的数据集，即使它也可以使较小的自然种群受益。本研究旨在了解对低遗传多样性物种进行小型参考面板代入的最佳实践和有效性，并以一个高/缝鸟（Notiomystis cincta）种群为例进行研究。在使用Beagle v5.4进行估算之前，我们使用留一方法对30个高覆盖率的hi个体进行了估算，其中SNPs被掩盖。使用r2，即输入和真实基因型剂量之间的相关性来测量输入准确性。我们测试了五个输入参数的组合，包括两个连锁图，不同大小和组成的参考面板，以及不同SNP密度和零星缺失的目标。在一个小的高繁殖力个体参考小组中，我们在大多数测试中实现了平均r2超过0.95。包括连锁图谱并没有提高输入精度，并且在非常低的SNP密度下会降低。使用r2对输入的snp进行过滤，以评估下游杂合度计算、位点频谱（SFS）和纯合度推断（ROHs）。我们发现，在低SNP密度下，过滤和SNP密度对杂合度和SFS影响很大，但ROH推断对两者都相对稳健。我们为测试和优化其他野生种群的植入提供了模板。

{"title":"High Imputation Accuracy Can Be Achieved Using a Small Reference Panel in a Natural Population With Low Genetic Diversity","authors":"Hui Zhen Tan, Katarina C. Stuart, Tram Vi, Annabel Whibley, Sarah Bailey, Patricia Brekke, Anna W. Santure","doi":"10.1111/1755-0998.70024","DOIUrl":"10.1111/1755-0998.70024","url":null,"abstract":"Genotype imputation, the inference of missing genotypes using a reference set of population haplotypes, is a cost-effective tool for improving the quality and quantity of genetic datasets. Imputation is usually applied to large and well-characterised datasets of humans and livestock, even though it could also benefit smaller natural populations. This study aims to understand the best practices and effectiveness of imputation with a small reference panel for species with low genetic diversity, using a case study of a population of the hihi/stitchbird (Notiomystis cincta). We used a leave-one-out method to test imputation on 30 high-coverage hihi individuals where SNPs were masked before being imputed with Beagle v5.4. Imputation accuracy was measured using r2, the correlation between imputed and ground truth genotype dosages. We tested combinations of five imputation parameters, the inclusion of two linkage maps, reference panels of different sizes and compositions and targets of various SNP densities and sporadic missingness. We achieved mean r2 exceeding 0.95 in most tests from a small reference panel of high-fecundity individuals. Imputation accuracy was not improved by including a linkage map and decreased at very low SNP densities. Imputed SNPs were filtered using r2 to assess downstream heterozygosity calculations, the site frequency spectrum (SFS) and inference of runs of homozygosity (ROHs). We found that filtering and SNP density greatly affected heterozygosity and SFS at low SNP densities but that ROH inference was relatively robust to both. We provide a template for testing and optimising imputation in other wild populations.","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":"25 8","pages":""},"PeriodicalIF":5.5,"publicationDate":"2025-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/1755-0998.70024","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144833546","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

The CODEX Approach: High-Throughput Sequencing of the Cox-1 Barcode Fragment in Neogastropods (Mollusca, Gastropoda) CODEX方法：新腹足动物（软体动物，腹足动物）Cox-1条形码片段的高通量测序。

IF 5.5 1区生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY

Molecular Ecology Resources

Pub Date : 2025-08-11 DOI: 10.1111/1755-0998.70031

Nicolas Puillandre, Dario Zuccon, Jawad Abdelkrim, Thomas Lemarcis, Claudia Ratti, Mélanie Van Weddingen, Paul Zaharias, Sarah Farhat

DNA barcoding traditionally relies on Sanger sequencing but faces limitations with degraded samples. High-throughput sequencing (HTS) offers a cost-effective alternative, enabling rapid barcode generation for extensive datasets. The advantage of HTS is its ability to employ multiplexing strategies, allowing thousands of samples to be processed simultaneously in a single sequencing run. This study presents the CODEX approach, a double-indexed HTS method designed to sequence overlapping cox-1 barcode fragments, suitable for samples with degraded DNA. The approach was applied to neogastropods, a diverse lineage of marine molluscs, using specimens (both recently collected and relatively older) from the Muséum national d'Histoire naturelle (MNHN) collections. The pipeline was used to process 15,076 samples, yielding 10,905 cleaned and assembled sequences, achieving a success rate of 72.33%. The CODEX method demonstrated advantages over Sanger sequencing by enabling the recovery of barcodes from samples previously deemed unsuitable, with significantly reduced costs (€0.5 per sequence vs. €4.5). Notably, DNA quality and sequencing success were strongly correlated with collection date, emphasising the impact of preservation methods and storage conditions. Sequencing success rates varied among families but were not correlated with phylogenetic relationships or specimen size, indicating the robustness of the primers designed for neogastropods. This study highlights the efficiency of the CODEX approach for large-scale DNA barcoding projects, especially when handling degraded samples. The CODEX pipeline and associated resources are publicly accessible, offering a scalable solution for molecular systematics and beyond.

DNA条形码传统上依赖于桑格测序，但在降解样品方面存在局限性。高通量测序（HTS）提供了一种具有成本效益的替代方案，可以为广泛的数据集快速生成条形码。HTS的优势在于其采用多路复用策略的能力，允许在单次测序运行中同时处理数千个样品。本研究提出了CODEX方法，一种双索引HTS方法，旨在对重叠的cox-1条形码片段进行测序，适用于DNA降解的样品。该方法应用于新胃足类动物，这是一种不同的海洋软体动物谱系，使用的标本（包括最近收集的和相对较老的）来自mus national d’histoire naturelle （MNHN）收藏。该流水线共处理15076个样品，得到10,905个清洗组装序列，成功率为72.33%。CODEX方法通过从以前认为不合适的样品中恢复条形码，显着降低了成本（每个序列0.5欧元对4.5欧元），从而证明了Sanger测序的优势。值得注意的是，DNA质量和测序成功与采集日期密切相关，强调了保存方法和储存条件的影响。测序成功率因科而异，但与系统发育关系或标本大小无关，表明为新胃足类动物设计的引物具有稳健性。这项研究强调了CODEX方法在大规模DNA条形码项目中的效率，特别是在处理降解样品时。CODEX管道和相关资源可公开获取，为分子系统学及其他领域提供可扩展的解决方案。

{"title":"The CODEX Approach: High-Throughput Sequencing of the Cox-1 Barcode Fragment in Neogastropods (Mollusca, Gastropoda)","authors":"Nicolas Puillandre, Dario Zuccon, Jawad Abdelkrim, Thomas Lemarcis, Claudia Ratti, Mélanie Van Weddingen, Paul Zaharias, Sarah Farhat","doi":"10.1111/1755-0998.70031","DOIUrl":"10.1111/1755-0998.70031","url":null,"abstract":"DNA barcoding traditionally relies on Sanger sequencing but faces limitations with degraded samples. High-throughput sequencing (HTS) offers a cost-effective alternative, enabling rapid barcode generation for extensive datasets. The advantage of HTS is its ability to employ multiplexing strategies, allowing thousands of samples to be processed simultaneously in a single sequencing run. This study presents the CODEX approach, a double-indexed HTS method designed to sequence overlapping cox-1 barcode fragments, suitable for samples with degraded DNA. The approach was applied to neogastropods, a diverse lineage of marine molluscs, using specimens (both recently collected and relatively older) from the Muséum national d'Histoire naturelle (MNHN) collections. The pipeline was used to process 15,076 samples, yielding 10,905 cleaned and assembled sequences, achieving a success rate of 72.33%. The CODEX method demonstrated advantages over Sanger sequencing by enabling the recovery of barcodes from samples previously deemed unsuitable, with significantly reduced costs (€0.5 per sequence vs. €4.5). Notably, DNA quality and sequencing success were strongly correlated with collection date, emphasising the impact of preservation methods and storage conditions. Sequencing success rates varied among families but were not correlated with phylogenetic relationships or specimen size, indicating the robustness of the primers designed for neogastropods. This study highlights the efficiency of the CODEX approach for large-scale DNA barcoding projects, especially when handling degraded samples. The CODEX pipeline and associated resources are publicly accessible, offering a scalable solution for molecular systematics and beyond.","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":"25 8","pages":""},"PeriodicalIF":5.5,"publicationDate":"2025-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/1755-0998.70031","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144815423","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Test Species Discrimination in Codonopsis (Campanulaceae) Using Genome Skimming Data 利用基因组略读数据测试党参属植物的物种区分。

IF 5.5 1区生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY

Molecular Ecology Resources

Pub Date : 2025-08-08 DOI: 10.1111/1755-0998.70025

Chun-Jiao Li, Yin-Zi Jiang, De-Zhu Li, Qing-Zhe Wu

To overcome the limitations of conventional barcoding loci, plastid genome (plastome) and nuclear ribosomal DNA (nrDNA) sequences recovered from genome skimming, proposed as ‘super-barcodes’ have been suggested as candidates for delimitating recently diverged species or complex plant groups. DNA super-barcodes must be further assessed for their effectiveness in other diverse plant groups. This research focused on the genus Codonopsis, a medicinally significant yet taxonomically complex group characterised by morphological similarity and high phenotypic plasticity in response to environmental conditions. We analysed standard DNA barcodes and super-barcodes across 81 individuals from 36 of the 42 species of Codonopsis from Asia. Our work provides a comprehensive DNA barcode library for Codonopsis species identification. Our findings demonstrated that super-barcodes significantly improved the phylogenetic resolution and the discriminatory power compared to standard DNA barcodes. Since mitochondrial sequence variation is generally low in plant species, few studies have assessed its effectiveness as super-barcodes. We screened the mitochondrial protein-coding sequences (CDS) using genome skimming and evaluated the identification capacity of their combination. Unexpectedly, the discriminatory power of mitochondrial DNA with high nucleotide variation was comparable to that of the concatenated plastid CDS. However, the organelle genome cannot wholly determine the species boundaries of Codonopsis, which might be related to their rapid evolutionary radiation, ILS, hybridisation and strong natural selection. Future multi-locus nuclear markers will likely be developed in plants for additional discriminatory power. Our study provides new knowledge and insights into species discrimination of recently evolved Codonopsis taxa in a biodiversity hotspot.

为了克服传统条形码位点的局限性，从基因组撇取中恢复的质体基因组（plas质体）和核糖体DNA （nrDNA）序列被建议作为“超级条形码”，作为划分新近分化的物种或复杂植物类群的候选者。DNA超级条形码必须进一步评估其在其他不同植物群体中的有效性。本研究的重点是党参属，这是一个具有医学意义但分类复杂的类群，其特征是形态相似性和对环境条件的高表型可塑性。我们分析了来自亚洲42种党参中36种81个个体的标准DNA条形码和超级条形码。我们的工作为党参物种鉴定提供了一个全面的DNA条形码库。我们的研究结果表明，与标准DNA条形码相比，超级条形码显著提高了系统发育分辨率和区分能力。由于线粒体序列变异在植物物种中普遍较低，很少有研究评估其作为超级条形码的有效性。我们使用基因组略读筛选线粒体蛋白编码序列（CDS），并评估其组合的识别能力。出乎意料的是，具有高核苷酸变异的线粒体DNA的区分能力与连接的质体CDS相当。然而，党参的细胞器基因组并不能完全确定其物种边界，这可能与党参的快速进化辐射、ILS、杂交和强自然选择有关。未来可能会在植物中开发多位点核标记，以获得额外的区分能力。我们的研究为最近进化的党参类群在生物多样性热点地区的物种识别提供了新的认识和见解。

{"title":"Test Species Discrimination in Codonopsis (Campanulaceae) Using Genome Skimming Data","authors":"Chun-Jiao Li, Yin-Zi Jiang, De-Zhu Li, Qing-Zhe Wu","doi":"10.1111/1755-0998.70025","DOIUrl":"10.1111/1755-0998.70025","url":null,"abstract":"<div>\u0000 \u0000 To overcome the limitations of conventional barcoding loci, plastid genome (plastome) and nuclear ribosomal DNA (nrDNA) sequences recovered from genome skimming, proposed as ‘super-barcodes’ have been suggested as candidates for delimitating recently diverged species or complex plant groups. DNA super-barcodes must be further assessed for their effectiveness in other diverse plant groups. This research focused on the genus Codonopsis, a medicinally significant yet taxonomically complex group characterised by morphological similarity and high phenotypic plasticity in response to environmental conditions. We analysed standard DNA barcodes and super-barcodes across 81 individuals from 36 of the 42 species of Codonopsis from Asia. Our work provides a comprehensive DNA barcode library for Codonopsis species identification. Our findings demonstrated that super-barcodes significantly improved the phylogenetic resolution and the discriminatory power compared to standard DNA barcodes. Since mitochondrial sequence variation is generally low in plant species, few studies have assessed its effectiveness as super-barcodes. We screened the mitochondrial protein-coding sequences (CDS) using genome skimming and evaluated the identification capacity of their combination. Unexpectedly, the discriminatory power of mitochondrial DNA with high nucleotide variation was comparable to that of the concatenated plastid CDS. However, the organelle genome cannot wholly determine the species boundaries of Codonopsis, which might be related to their rapid evolutionary radiation, ILS, hybridisation and strong natural selection. Future multi-locus nuclear markers will likely be developed in plants for additional discriminatory power. Our study provides new knowledge and insights into species discrimination of recently evolved Codonopsis taxa in a biodiversity hotspot.\u0000 </div>","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":"25 8","pages":""},"PeriodicalIF":5.5,"publicationDate":"2025-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144803029","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

VoronaGasyCodes: A Public Database of Mitochondrial Barcodes for Malagasy Birds VoronaGasyCodes：马达加斯加鸟类线粒体条形码的公共数据库。

IF 5.5 1区生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY

Molecular Ecology Resources

Pub Date : 2025-08-07 DOI: 10.1111/1755-0998.70027

Sushma Reddy, Kristen Wacker, Mai Fahmy, Evon Hekkala, John M. Bates, Steven M. Goodman, Shannon J. Hackett, Marie J. Raherilalao, J. Dylan Maddox

Molecular tools are increasingly being used to survey the presence of biodiversity and their interactions within ecosystems. Indirect methods, like environmental DNA (eDNA) and invertebrate-derived DNA (iDNA), are dependent on sequence databases with accurate and sufficient taxonomic representation. These methods are increasingly being used in regions and habitats where direct detection or observations can be difficult for a variety of reasons. Madagascar is a biodiversity hotspot with a high proportion of endemic species, many of which are threatened or endangered. Here we describe a new resource, VoronaGasyCodes, a curated database of newly published genetic sequences from Malagasy birds. Our database is currently populated with six mitochondrial genes or DNA barcodes for 142 species including 70% of the birds endemic to the island and will be periodically updated as new data become available. We demonstrate the utility of our database with an iDNA study of leech blood meals where we successfully identified 77% of the hosts to species. These types of resources for characterising biodiversity are critical for insights into species distribution, discovery of new taxa, novel ecological connections and advancing conservation and restoration measures.

分子工具越来越多地被用于调查生物多样性的存在及其在生态系统中的相互作用。间接方法，如环境DNA （eDNA）和无脊椎动物来源的DNA (iDNA)，依赖于具有准确和充分分类代表性的序列数据库。这些方法正越来越多地用于由于各种原因难以直接探测或观测的地区和栖息地。马达加斯加是一个生物多样性热点地区，拥有高比例的特有物种，其中许多物种受到威胁或濒临灭绝。在这里，我们描述了一个新的资源，VoronaGasyCodes，这是一个精心策划的数据库，收录了马达加斯加鸟类新近发表的基因序列。我们的数据库目前包含142个物种的6个线粒体基因或DNA条形码，其中包括70%的岛上特有鸟类，并将定期更新新数据。我们通过对水蛭血餐的dna研究证明了我们的数据库的实用性，我们成功地识别了77%的宿主物种。这些类型的生物多样性特征资源对于了解物种分布、发现新的分类群、新的生态联系以及推进保护和恢复措施至关重要。

{"title":"VoronaGasyCodes: A Public Database of Mitochondrial Barcodes for Malagasy Birds","authors":"Sushma Reddy, Kristen Wacker, Mai Fahmy, Evon Hekkala, John M. Bates, Steven M. Goodman, Shannon J. Hackett, Marie J. Raherilalao, J. Dylan Maddox","doi":"10.1111/1755-0998.70027","DOIUrl":"10.1111/1755-0998.70027","url":null,"abstract":"Molecular tools are increasingly being used to survey the presence of biodiversity and their interactions within ecosystems. Indirect methods, like environmental DNA (eDNA) and invertebrate-derived DNA (iDNA), are dependent on sequence databases with accurate and sufficient taxonomic representation. These methods are increasingly being used in regions and habitats where direct detection or observations can be difficult for a variety of reasons. Madagascar is a biodiversity hotspot with a high proportion of endemic species, many of which are threatened or endangered. Here we describe a new resource, VoronaGasyCodes, a curated database of newly published genetic sequences from Malagasy birds. Our database is currently populated with six mitochondrial genes or DNA barcodes for 142 species including 70% of the birds endemic to the island and will be periodically updated as new data become available. We demonstrate the utility of our database with an iDNA study of leech blood meals where we successfully identified 77% of the hosts to species. These types of resources for characterising biodiversity are critical for insights into species distribution, discovery of new taxa, novel ecological connections and advancing conservation and restoration measures.","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":"25 8","pages":""},"PeriodicalIF":5.5,"publicationDate":"2025-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/1755-0998.70027","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144793103","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Methylome Profiling of a Deuterostome Invertebrate Using Oxford Nanopore Technology (ONT) 使用牛津纳米孔技术（ONT）的后口无脊椎动物的甲基组分析。

IF 5.5 1区生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY

Molecular Ecology Resources

Pub Date : 2025-08-05 DOI: 10.1111/1755-0998.70026

Sarah Lok Ting Kwong, Alyssa Maree Budd, Julia Yun-Hsuan Hung, Cecilia Villacorta-Rath, Sven Uthicke

DNA methylation is crucial for genome regulation and provides key insights into the interaction between genetics and environmental factors, offering valuable perspectives for ecological research. However, knowledge of DNA methylation patterns in nonmodel invertebrates remains limited. The present study addresses this knowledge gap by conducting the first methylome profiling of the Pacific crown-of-thorns seastar (CoTS; Acanthaster cf. solaris), a coral-eating species that aggravates the decline of Indo-Pacific coral reefs. Using Oxford Nanopore Technology (ONT) we generated long-read sequences, covering over 90% of CpG dinucleotides in the CoTS genome. Our analysis revealed a mosaic methylation landscape with moderate genome-wide methylation levels of 37.7%. Comparative analysis highlights the intermediate methylation state observed in other deuterostome invertebrates, positioning them between the hypomethylated genomes of protostomes and the hypermethylated genomes of vertebrates. Methylation in CoTS was predominantly localised within gene bodies, especially in intronic regions, enabling modulation of gene expression and potentially supporting fitness in dynamic marine environments. Additionally, elevated methylation in repetitive elements suggests a role in genome defence. This study demonstrates the effectiveness of ONT for comprehensive methylome analysis in ecologically important nonmodel species and deepens our understanding of the epigenetic landscape in deuterostome invertebrates. We also present a detailed laboratory and bioinformatics workflow, including modified phenol–chloroform protocols that address the challenge of extracting high molecular weight DNA from marine invertebrates. Together with the methylome profiles, these resources serve as a foundation for future research, enabling investigations into DNA methylation functions, applications for CoTS outbreak management and comparative studies across diverse lineages.

DNA甲基化对基因组调控至关重要，为了解遗传与环境因素之间的相互作用提供了关键的见解，为生态学研究提供了有价值的视角。然而，对非模式无脊椎动物DNA甲基化模式的了解仍然有限。本研究通过对太平洋棘冠海星（CoTS）进行首次甲基组分析来解决这一知识差距；一种以珊瑚为食的物种，加剧了印度太平洋珊瑚礁的衰退。使用牛津纳米孔技术（ONT），我们生成了长读序列，覆盖了CoTS基因组中超过90%的CpG二核苷酸。我们的分析揭示了一个马赛克甲基化景观，中度全基因组甲基化水平为37.7%。比较分析强调了在其他后口动物无脊椎动物中观察到的中间甲基化状态，将它们定位在原口动物的低甲基化基因组和脊椎动物的高甲基化基因组之间。CoTS的甲基化主要集中在基因体内，尤其是内含子区域，这使得基因表达得以调节，并可能支持在动态海洋环境中的适应性。此外，重复元件的甲基化升高表明其在基因组防御中起作用。这项研究证明了ONT在生态上重要的非模式物种中进行综合甲基组分析的有效性，并加深了我们对后口动物无脊椎动物表观遗传景观的理解。我们还提出了详细的实验室和生物信息学工作流程，包括改进的苯酚-氯仿方案，以解决从海洋无脊椎动物中提取高分子量DNA的挑战。与甲基组谱一起，这些资源为未来的研究奠定了基础，使人们能够对DNA甲基化功能进行调查，对CoTS爆发管理进行应用，并对不同谱系进行比较研究。

{"title":"Methylome Profiling of a Deuterostome Invertebrate Using Oxford Nanopore Technology (ONT)","authors":"Sarah Lok Ting Kwong, Alyssa Maree Budd, Julia Yun-Hsuan Hung, Cecilia Villacorta-Rath, Sven Uthicke","doi":"10.1111/1755-0998.70026","DOIUrl":"10.1111/1755-0998.70026","url":null,"abstract":"DNA methylation is crucial for genome regulation and provides key insights into the interaction between genetics and environmental factors, offering valuable perspectives for ecological research. However, knowledge of DNA methylation patterns in nonmodel invertebrates remains limited. The present study addresses this knowledge gap by conducting the first methylome profiling of the Pacific crown-of-thorns seastar (CoTS; Acanthaster cf. solaris), a coral-eating species that aggravates the decline of Indo-Pacific coral reefs. Using Oxford Nanopore Technology (ONT) we generated long-read sequences, covering over 90% of CpG dinucleotides in the CoTS genome. Our analysis revealed a mosaic methylation landscape with moderate genome-wide methylation levels of 37.7%. Comparative analysis highlights the intermediate methylation state observed in other deuterostome invertebrates, positioning them between the hypomethylated genomes of protostomes and the hypermethylated genomes of vertebrates. Methylation in CoTS was predominantly localised within gene bodies, especially in intronic regions, enabling modulation of gene expression and potentially supporting fitness in dynamic marine environments. Additionally, elevated methylation in repetitive elements suggests a role in genome defence. This study demonstrates the effectiveness of ONT for comprehensive methylome analysis in ecologically important nonmodel species and deepens our understanding of the epigenetic landscape in deuterostome invertebrates. We also present a detailed laboratory and bioinformatics workflow, including modified phenol–chloroform protocols that address the challenge of extracting high molecular weight DNA from marine invertebrates. Together with the methylome profiles, these resources serve as a foundation for future research, enabling investigations into DNA methylation functions, applications for CoTS outbreak management and comparative studies across diverse lineages.","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":"25 8","pages":""},"PeriodicalIF":5.5,"publicationDate":"2025-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/1755-0998.70026","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144783112","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

gmmDenoise: A New Method and R Package for High-Confidence Sequence Variant Filtering in Environmental DNA Amplicon Analysis gmm降噪：环境DNA扩增子分析中高置信度序列变异滤波的新方法和R包。

IF 5.5 1区生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY

Molecular Ecology Resources

Pub Date : 2025-08-04 DOI: 10.1111/1755-0998.70023

Yusuke Koseki, Hirohiko Takeshima, Ryuji Yoneda, Kaito Katayanagi, Gen Ito, Hiroki Yamanaka

Assessing and monitoring genetic diversity is vital for understanding the ecology and evolution of natural populations but is often challenging in animal and plant species due to technically and physically demanding tissue sampling. Although environmental DNA (eDNA) metabarcoding is a promising alternative to the traditional population genetic monitoring based on biological samples, its practical application remains challenging due to spurious sequences present in the amplicon data, even after data processing with the existing sequence filtering and denoising (error correction) methods. Here we developed a novel amplicon filtering approach that can effectively eliminate such spurious amplicon sequence variants (ASVs) in eDNA metabarcoding data. A simple simulation of eDNA metabarcoding processes was performed to understand the patterns of read count (abundance) distributions of true ASVs and their polymerase chain reaction (PCR)-generated artefacts (i.e., false-positive ASVs). Based on the simulation results, the approach was developed to estimate the abundance distributions of true and false-positive ASVs using Gaussian mixture models and to determine a statistically based threshold between them. The developed approach was implemented as an R package, gmmDenoise and evaluated using single-species metabarcoding datasets in which all or some true ASVs (i.e., haplotypes) were known. Example analyses using community (multi-species) metabarcoding datasets were also performed to demonstrate how gmmDenoise can be used to derive reliable intraspecific diversity estimates and population genetic inferences from noisy amplicon sequencing data. The gmmDenoise package is freely available in the GitHub repository (https://github.com/YSKoseki/gmmDenoise).

评估和监测遗传多样性对于了解自然种群的生态和进化至关重要，但由于在技术和物理上要求组织采样，在动植物物种中往往具有挑战性。虽然环境DNA （eDNA）元条形码是传统的基于生物样本的群体遗传监测的一种有前途的替代方案，但由于扩增子数据中存在假序列，即使使用现有的序列滤波和去噪（纠错）方法进行数据处理，其实际应用仍然具有挑战性。在这里，我们开发了一种新的扩增子滤波方法，可以有效地消除eDNA元条形码数据中的伪扩增子序列变异（asv）。对eDNA元条形码过程进行了简单模拟，以了解真asv及其聚合酶链反应（PCR）产生的伪产物（即假阳性asv）的读取计数（丰度）分布模式。基于仿真结果，利用高斯混合模型估计asv的真阳性和假阳性丰度分布，并确定它们之间的统计阈值。开发的方法以R包gmmnoise实现，并使用已知所有或部分真正asv（即单倍型）的单物种元条形码数据集进行评估。使用群落（多物种）元编码数据集的示例分析也进行了演示，以证明如何使用gmmDenoise从有噪声的扩增子测序数据中获得可靠的种内多样性估计和群体遗传推断。gmmnoise包可以在GitHub存储库（https://github.com/YSKoseki/gmmDenoise）中免费获得。

{"title":"gmmDenoise: A New Method and R Package for High-Confidence Sequence Variant Filtering in Environmental DNA Amplicon Analysis","authors":"Yusuke Koseki, Hirohiko Takeshima, Ryuji Yoneda, Kaito Katayanagi, Gen Ito, Hiroki Yamanaka","doi":"10.1111/1755-0998.70023","DOIUrl":"10.1111/1755-0998.70023","url":null,"abstract":"Assessing and monitoring genetic diversity is vital for understanding the ecology and evolution of natural populations but is often challenging in animal and plant species due to technically and physically demanding tissue sampling. Although environmental DNA (eDNA) metabarcoding is a promising alternative to the traditional population genetic monitoring based on biological samples, its practical application remains challenging due to spurious sequences present in the amplicon data, even after data processing with the existing sequence filtering and denoising (error correction) methods. Here we developed a novel amplicon filtering approach that can effectively eliminate such spurious amplicon sequence variants (ASVs) in eDNA metabarcoding data. A simple simulation of eDNA metabarcoding processes was performed to understand the patterns of read count (abundance) distributions of true ASVs and their polymerase chain reaction (PCR)-generated artefacts (i.e., false-positive ASVs). Based on the simulation results, the approach was developed to estimate the abundance distributions of true and false-positive ASVs using Gaussian mixture models and to determine a statistically based threshold between them. The developed approach was implemented as an R package, gmmDenoise and evaluated using single-species metabarcoding datasets in which all or some true ASVs (i.e., haplotypes) were known. Example analyses using community (multi-species) metabarcoding datasets were also performed to demonstrate how gmmDenoise can be used to derive reliable intraspecific diversity estimates and population genetic inferences from noisy amplicon sequencing data. The gmmDenoise package is freely available in the GitHub repository (https://github.com/YSKoseki/gmmDenoise).","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":"25 8","pages":""},"PeriodicalIF":5.5,"publicationDate":"2025-08-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/1755-0998.70023","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144774396","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0