首页 > 最新文献

Molecular Ecology Resources最新文献

英文 中文
GeneMiner2: Accurate and Automated Recovery of Genes From Genome Skimming Data GeneMiner2:准确和自动恢复基因从基因组略读数据。
IF 5.5 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2026-02-18 DOI: 10.1111/1755-0998.70111
Xinyi Yu, Zizhen Tang, Zhen Zhang, Yongxiu Song, Hao He, Yi Shi, Jiaqing Hou, Yan Yu

With the growing accessibility of low cost genome skimming, large-scale recovery of target genes in non-model species has become feasible, providing strong data for phylogenomic and evolutionary studies. However, existing tools for data assembly including Read2Tree, HybPiper, and GeneMiner still suffer from computational difficulty and assembly errors when processing genome skimming data, especially for low-level taxa that lack closely related genome references. Here, we present GeneMiner2, an updated version of GeneMiner, as an efficient and automated gene assembly and phylogenetic tool to overcome these limitations. Compared with the previous version, GeneMiner2 introduces three key optimizations by deploying a two-level hash table that speeds up k-mer filtering, applying fine-grained read selection with strand-orientation and structural-anomaly detection to handle complex heterozygous regions, and incorporating adaptive k-mer selection in the de Bruijn assembler. Together enabling accurate assembly of target genes without the need for closely related references. Using simulated datasets with low coverage and divergent references, we validated the robustness and improved accuracy of GeneMiner2 in the assembly of single-copy genes. Moreover, using genome skimming data from the subfamily Apioideae (Apiaceae), GeneMiner2 outperformed Read2Tree and HybPiper in accuracy, completeness, and speed. Equipped with a user-friendly graphical interface, GeneMiner2 integrates functions including assembly quality control, paralog detection, tree reconstruction, and divergence time calibration, offering a robust and efficient solution for phylogenetic inference using genomic level data. GeneMiner2 supports cross-platform operation. GeneMiner2 provides a user-friendly graphical interface for desktop users, and its high-performance command-line interface is specifically optimised for high-throughput analyses on Linux servers and computing clusters. Installation instructions, detailed documentation, and source code are available on GitHub (https://github.com/sculab/GeneMiner2).

随着低成本基因组脱脂技术的日益普及,在非模式物种中大规模恢复靶基因已成为可能,为系统基因组学和进化研究提供了强有力的数据。然而,包括Read2Tree、HybPiper和GeneMiner在内的现有数据组装工具在处理基因组浏览数据时仍然存在计算困难和组装错误,特别是对于缺乏密切相关基因组参考的低水平分类群。在这里,我们提出GeneMiner2, GeneMiner的更新版本,作为一种高效和自动化的基因组装和系统发育工具来克服这些局限性。与之前的版本相比,GeneMiner2引入了三个关键优化:部署一个两级哈希表来加速k-mer过滤,应用细粒度的读取选择与链取向和结构异常检测来处理复杂的杂合区域,并在de Bruijn汇编程序中结合自适应k-mer选择。在不需要密切相关的参考的情况下,可以精确地组装靶基因。利用低覆盖率和不同参考文献的模拟数据集,我们验证了GeneMiner2在单拷贝基因组装中的鲁棒性和提高的准确性。此外,使用来自蜂科(apiioideae)亚科的基因组略读数据,GeneMiner2在准确性、完整性和速度上优于Read2Tree和HybPiper。GeneMiner2具有用户友好的图形界面,集成了装配质量控制,平行检测,树重建和发散时间校准等功能,为使用基因组水平数据进行系统发育推断提供了强大而高效的解决方案。GeneMiner2支持跨平台操作。GeneMiner2为桌面用户提供了一个用户友好的图形界面,其高性能命令行界面专门针对Linux服务器和计算集群上的高吞吐量分析进行了优化。安装说明、详细文档和源代码可在GitHub (https://github.com/sculab/GeneMiner2)上获得。
{"title":"GeneMiner2: Accurate and Automated Recovery of Genes From Genome Skimming Data","authors":"Xinyi Yu,&nbsp;Zizhen Tang,&nbsp;Zhen Zhang,&nbsp;Yongxiu Song,&nbsp;Hao He,&nbsp;Yi Shi,&nbsp;Jiaqing Hou,&nbsp;Yan Yu","doi":"10.1111/1755-0998.70111","DOIUrl":"10.1111/1755-0998.70111","url":null,"abstract":"<p>With the growing accessibility of low cost genome skimming, large-scale recovery of target genes in non-model species has become feasible, providing strong data for phylogenomic and evolutionary studies. However, existing tools for data assembly including Read2Tree, HybPiper, and GeneMiner still suffer from computational difficulty and assembly errors when processing genome skimming data, especially for low-level taxa that lack closely related genome references. Here, we present GeneMiner2, an updated version of GeneMiner, as an efficient and automated gene assembly and phylogenetic tool to overcome these limitations. Compared with the previous version, GeneMiner2 introduces three key optimizations by deploying a two-level hash table that speeds up <i>k</i>-mer filtering, applying fine-grained read selection with strand-orientation and structural-anomaly detection to handle complex heterozygous regions, and incorporating adaptive <i>k</i>-mer selection in the de Bruijn assembler. Together enabling accurate assembly of target genes without the need for closely related references. Using simulated datasets with low coverage and divergent references, we validated the robustness and improved accuracy of GeneMiner2 in the assembly of single-copy genes. Moreover, using genome skimming data from the subfamily Apioideae (Apiaceae), GeneMiner2 outperformed Read2Tree and HybPiper in accuracy, completeness, and speed. Equipped with a user-friendly graphical interface, GeneMiner2 integrates functions including assembly quality control, paralog detection, tree reconstruction, and divergence time calibration, offering a robust and efficient solution for phylogenetic inference using genomic level data. GeneMiner2 supports cross-platform operation. GeneMiner2 provides a user-friendly graphical interface for desktop users, and its high-performance command-line interface is specifically optimised for high-throughput analyses on Linux servers and computing clusters. Installation instructions, detailed documentation, and source code are available on GitHub (https://github.com/sculab/GeneMiner2).</p>","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":"26 2","pages":""},"PeriodicalIF":5.5,"publicationDate":"2026-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12914762/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146218124","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Scoop That Poop: Optimising Faecal Sample Pre-Processing for Parasite Metabarcoding 舀起便便:寄生虫元条形码优化粪便样本预处理。
IF 5.5 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2026-02-17 DOI: 10.1111/1755-0998.70112
Madison E. Patch, Graham B. Goodman, Sara B. Weinstein

Molecular techniques such as DNA metabarcoding are increasingly used to characterise parasite communities. However, relatively few studies have examined how sample processing methods influence detection rates. We used faecal samples collected from free-roaming horses to evaluate how parasite density and pre-processing methods influenced quantification of parasite richness, community composition, and detection of different taxa. Methods that concentrated parasites substantially increased parasite detection, especially at the low infection levels often seen in wildlife. Specifically, we found that DNA extracts from larval coproculture and a newly developed egg concentration approach detected approximately twice as many species and genera as extracts made directly from faecal matter. Although parasite richness was consistently lower in these faecal subsamples, overall parasite communities were still similar between pre-processing methods. Ultimately, the optimal method depends on research constraints and goals. Working with parasite larvae is more time intensive, but lower cost as larval parasites can be extracted using a lysis buffer approach which performs similarly to commercial extraction kits. DNA extraction from faecal subsamples misses rare and common taxa but minimises field processing. Our novel egg concentration method offers a compromise between relatively rapid processing and high sensitivity.

诸如DNA元条形码之类的分子技术越来越多地用于表征寄生虫群落。然而,相对较少的研究考察了样品处理方法如何影响检出率。研究人员利用自由漫游马的粪便样本,评估了寄生虫密度和预处理方法对不同分类群中寄生虫丰富度、群落组成和检测结果的影响。集中寄生虫的方法大大增加了寄生虫的检出率,特别是在野生动物中常见的低感染水平。具体来说,我们发现从幼虫共育和新开发的卵浓缩方法中提取的DNA检测到的物种和属大约是直接从粪便中提取的两倍。虽然这些粪便亚样本中的寄生虫丰富度一直较低,但在预处理方法之间,总体寄生虫群落仍然相似。最终,最优方法取决于研究约束和目标。处理寄生虫幼虫更耗时,但成本更低,因为幼虫寄生虫可以使用裂解缓冲方法提取,其性能与商业提取试剂盒相似。从粪便亚样本中提取DNA错过了罕见和常见的分类群,但减少了现场处理。我们的新鸡蛋浓缩方法提供了相对快速处理和高灵敏度之间的折衷。
{"title":"Scoop That Poop: Optimising Faecal Sample Pre-Processing for Parasite Metabarcoding","authors":"Madison E. Patch,&nbsp;Graham B. Goodman,&nbsp;Sara B. Weinstein","doi":"10.1111/1755-0998.70112","DOIUrl":"10.1111/1755-0998.70112","url":null,"abstract":"<p>Molecular techniques such as DNA metabarcoding are increasingly used to characterise parasite communities. However, relatively few studies have examined how sample processing methods influence detection rates. We used faecal samples collected from free-roaming horses to evaluate how parasite density and pre-processing methods influenced quantification of parasite richness, community composition, and detection of different taxa. Methods that concentrated parasites substantially increased parasite detection, especially at the low infection levels often seen in wildlife. Specifically, we found that DNA extracts from larval coproculture and a newly developed egg concentration approach detected approximately twice as many species and genera as extracts made directly from faecal matter. Although parasite richness was consistently lower in these faecal subsamples, overall parasite communities were still similar between pre-processing methods. Ultimately, the optimal method depends on research constraints and goals. Working with parasite larvae is more time intensive, but lower cost as larval parasites can be extracted using a lysis buffer approach which performs similarly to commercial extraction kits. DNA extraction from faecal subsamples misses rare and common taxa but minimises field processing. Our novel egg concentration method offers a compromise between relatively rapid processing and high sensitivity.</p>","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":"26 2","pages":""},"PeriodicalIF":5.5,"publicationDate":"2026-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12911221/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146211782","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Genetic Diversity Unveiled: Cost-Effective Methods for Grassland Species 揭示遗传多样性:草原物种的成本效益方法。
IF 5.5 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2026-02-17 DOI: 10.1111/1755-0998.70108
Damian Käch, Miguel Loera-Sánchez, Beat Reidy, Bruno Studer, Roland Kölliker

Permanent grasslands are predominantly composed of allogamous plant species that exhibit high levels of plant genetic diversity (PGD) within their populations. Grasslands with high PGD are more resilient to environmental stress and constitute valuable reservoirs of genetic resources for plant breeding. Therefore, monitoring PGD is the basis for detecting changes in PGD and for intervening accordingly. However, PGD monitoring is often neglected in biodiversity reports due to difficulties in taking representative samples and in using standardised and affordable indicators of PGD. Here we successfully applied two common approaches, multispecies amplicon sequencing (MSAS) and genotyping-by-sequencing (GBS), to assess PGD of agronomically relevant grassland species. Using MSAS, we were able to taxonomically distinguish five species (Dactylis glomerata L., Festuca pratensis Huds., Lolium perenne L., Trifolium pratense L. and T. repens L.) from multispecies samples and differentiate accessions within species, with fixation index (FST) values ranging from 0.014 for T. repens to 0.089 for L. perenne. Based on an extended L. perenne sample set containing mixtures of two cultivars at different ratios, mixtures containing both cultivars at 50% separated from the corresponding cultivars according to this ratio using MSAS and GBS. Furthermore, GBS enabled separation of samples containing two cultivars at a 75:25 ratio from the corresponding cultivars and the 50:50-ratio samples. These results indicate complementing applications of the two approaches in PGD monitoring. While we anticipate that MSAS with its cost-effectiveness could be applied to large-scale PGD monitoring, GBS with its lower detection limit could be applied to studies where cultivar composition shifts are of interest.

永久草原主要由异种植物组成,在其种群内表现出高水平的植物遗传多样性。PGD值高的草地对环境胁迫的适应能力更强,是植物育种的宝贵遗传资源库。因此,监测PGD是检测PGD变化并进行相应干预的基础。然而,由于难以获取具有代表性的样本以及难以使用标准化和负担得起的PGD指标,PGD监测在生物多样性报告中经常被忽视。本研究成功应用多物种扩增子测序(MSAS)和基因分型测序(GBS)两种常用方法对农艺相关草原物种的PGD进行了评估。利用MSAS技术,我们能够区分出5个种(Dactylis glomerata L.)、羊茅(Festuca pratensis Huds.)。多年生黑麦草(Lolium perenne L.)、三叶草(Trifolium pratense L.)和黑麦草(T. perenenne L.)的固定指数(FST)为0.014 ~ 0.089。以含两种不同比例混交种的二年生黑麦草扩展样品集为基础,采用MSAS法和GBS法将两种混交种按50%的比例与相应品种分离。此外,GBS还可以将含有两个品种的样品以75:25的比例与对应品种和50:50的比例分离。这些结果表明这两种方法在PGD监测中的应用是互补的。我们预计,具有成本效益的MSAS可以应用于大规模的PGD监测,而具有较低检出限的GBS可以应用于对品种组成变化感兴趣的研究。
{"title":"Genetic Diversity Unveiled: Cost-Effective Methods for Grassland Species","authors":"Damian Käch,&nbsp;Miguel Loera-Sánchez,&nbsp;Beat Reidy,&nbsp;Bruno Studer,&nbsp;Roland Kölliker","doi":"10.1111/1755-0998.70108","DOIUrl":"10.1111/1755-0998.70108","url":null,"abstract":"<p>Permanent grasslands are predominantly composed of allogamous plant species that exhibit high levels of plant genetic diversity (PGD) within their populations. Grasslands with high PGD are more resilient to environmental stress and constitute valuable reservoirs of genetic resources for plant breeding. Therefore, monitoring PGD is the basis for detecting changes in PGD and for intervening accordingly. However, PGD monitoring is often neglected in biodiversity reports due to difficulties in taking representative samples and in using standardised and affordable indicators of PGD. Here we successfully applied two common approaches, multispecies amplicon sequencing (MSAS) and genotyping-by-sequencing (GBS), to assess PGD of agronomically relevant grassland species. Using MSAS, we were able to taxonomically distinguish five species (<i>Dactylis glomerata</i> L., <i>Festuca pratensis</i> <span>Huds.,</span> <i>Lolium perenne</i> L., <i>Trifolium pratense</i> L. and <i>T. repens</i> L.) from multispecies samples and differentiate accessions within species, with fixation index (<i>F</i><sub>ST</sub>) values ranging from 0.014 for <i>T. repens</i> to 0.089 for <i>L. perenne.</i> Based on an extended <i>L. perenne</i> sample set containing mixtures of two cultivars at different ratios, mixtures containing both cultivars at 50% separated from the corresponding cultivars according to this ratio using MSAS and GBS. Furthermore, GBS enabled separation of samples containing two cultivars at a 75:25 ratio from the corresponding cultivars and the 50:50-ratio samples. These results indicate complementing applications of the two approaches in PGD monitoring. While we anticipate that MSAS with its cost-effectiveness could be applied to large-scale PGD monitoring, GBS with its lower detection limit could be applied to studies where cultivar composition shifts are of interest.</p>","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":"26 2","pages":""},"PeriodicalIF":5.5,"publicationDate":"2026-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12914157/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146211738","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Estimating Effective Number of Breeders Under the Heterozygote-Excess Equilibrium 杂合子-过剩平衡下有效育种者数量的估计。
IF 5.5 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2026-02-15 DOI: 10.1111/1755-0998.70107
Bing Yang, Dan Wang, Yujia Shen, Wenkai Li, Haiya Wang, Ruqi Wang, Ruixin Peng, Song Zhang, Zhiming Xia, Derek W. Dunn, Kang Huang, Baoguo Li
<p>Estimating the effective number of breeders per reproductive cycle or cohort (<span></span><math> <semantics> <mrow> <msub> <mi>N</mi> <mi>b</mi> </msub> </mrow> <annotation>$$ {N}_b $$</annotation> </semantics></math>) can contribute to understanding the viability of wild populations. However, previous <span></span><math> <semantics> <mrow> <msub> <mi>N</mi> <mi>b</mi> </msub> </mrow> <annotation>$$ {N}_b $$</annotation> </semantics></math> estimators based on heterozygote-excess assume that parental genotypes are in accordance with <i>Hardy–Weinberg equilibrium</i> (HWE), and can only be applied to diploids. When these criteria are not fulfilled, estimates of expected heterozygosity may thus be biased. We extended a previous <span></span><math> <semantics> <mrow> <msub> <mi>N</mi> <mi>b</mi> </msub> </mrow> <annotation>$$ {N}_b $$</annotation> </semantics></math> estimator to autopolyploids and account for parental genotypes either in HWE or <i>heterozygote-excess equilibrium</i> (HEE). Results include the following: the distributions of genotypes converge to HEE within nine generations (with a relative error of <span></span><math> <semantics> <mrow> <msup> <mn>10</mn> <mrow> <mo>−</mo> <mn>3</mn> </mrow> </msup> </mrow> <annotation>$$ {10}^{-3} $$</annotation> </semantics></math>), the corrected estimators can asymptotically estimate <span></span><math> <semantics> <mrow> <msub> <mi>N</mi> <mi>b</mi> </msub> </mrow> <annotation>$$ {N}_b $$</annotation> </semantics></math> without bias, and estimations for autopolyploids are slightly better than those for diploids. The relationships between <span></span><math> <semantics> <mrow> <msub> <mover> <mi>N</mi> <mo>̂</mo> </mover> <mi>b</mi> </msub> </mrow> <annotation>$$ {hat{N}}_b $$</annotation> </semantics></math> and
估算每个繁殖周期或群体的有效育种者数量(N b $$ {N}_b $$)有助于了解野生种群的生存能力。然而,以往基于杂合子过量的N b $$ {N}_b $$估计假设亲本基因型符合Hardy-Weinberg平衡(HWE),并且只能应用于二倍体。当这些标准不满足时,预期杂合性的估计可能因此有偏差。我们将先前的N b $$ {N}_b $$估计器扩展到自多倍体,并在HWE或杂合子过量平衡(HEE)中解释亲本基因型。结果表明:基因型分布在9代内收敛于HEE(相对误差为10 - 3 $$ {10}^{-3} $$),校正后的估计值可以渐近估计N b $$ {N}_b $$,无偏倚,自多倍体的估计值略好于二倍体。阐明了N′b $$ {hat{N}}_b $$与N′b $$ {N}_b $$之间的关系以及不同世代有效育种数对N′b $$ {hat{N}}_b $$的影响。在二倍体中,N´b $$ {hat{N}}_b $$主要反映前代N´b $$ {N}_b $$,而在多倍体中,N´b $$ {N}_b $$是前代N´b 的加权调和平均值。我们的方法是在一个软件包polygene v1.7中实现的,该软件包可以在https://github.com/huangkang1987/polygene上免费获得。
{"title":"Estimating Effective Number of Breeders Under the Heterozygote-Excess Equilibrium","authors":"Bing Yang,&nbsp;Dan Wang,&nbsp;Yujia Shen,&nbsp;Wenkai Li,&nbsp;Haiya Wang,&nbsp;Ruqi Wang,&nbsp;Ruixin Peng,&nbsp;Song Zhang,&nbsp;Zhiming Xia,&nbsp;Derek W. Dunn,&nbsp;Kang Huang,&nbsp;Baoguo Li","doi":"10.1111/1755-0998.70107","DOIUrl":"10.1111/1755-0998.70107","url":null,"abstract":"&lt;p&gt;Estimating the effective number of breeders per reproductive cycle or cohort (&lt;span&gt;&lt;/span&gt;&lt;math&gt;\u0000 &lt;semantics&gt;\u0000 &lt;mrow&gt;\u0000 &lt;msub&gt;\u0000 &lt;mi&gt;N&lt;/mi&gt;\u0000 &lt;mi&gt;b&lt;/mi&gt;\u0000 &lt;/msub&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;annotation&gt;$$ {N}_b $$&lt;/annotation&gt;\u0000 &lt;/semantics&gt;&lt;/math&gt;) can contribute to understanding the viability of wild populations. However, previous &lt;span&gt;&lt;/span&gt;&lt;math&gt;\u0000 &lt;semantics&gt;\u0000 &lt;mrow&gt;\u0000 &lt;msub&gt;\u0000 &lt;mi&gt;N&lt;/mi&gt;\u0000 &lt;mi&gt;b&lt;/mi&gt;\u0000 &lt;/msub&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;annotation&gt;$$ {N}_b $$&lt;/annotation&gt;\u0000 &lt;/semantics&gt;&lt;/math&gt; estimators based on heterozygote-excess assume that parental genotypes are in accordance with &lt;i&gt;Hardy–Weinberg equilibrium&lt;/i&gt; (HWE), and can only be applied to diploids. When these criteria are not fulfilled, estimates of expected heterozygosity may thus be biased. We extended a previous &lt;span&gt;&lt;/span&gt;&lt;math&gt;\u0000 &lt;semantics&gt;\u0000 &lt;mrow&gt;\u0000 &lt;msub&gt;\u0000 &lt;mi&gt;N&lt;/mi&gt;\u0000 &lt;mi&gt;b&lt;/mi&gt;\u0000 &lt;/msub&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;annotation&gt;$$ {N}_b $$&lt;/annotation&gt;\u0000 &lt;/semantics&gt;&lt;/math&gt; estimator to autopolyploids and account for parental genotypes either in HWE or &lt;i&gt;heterozygote-excess equilibrium&lt;/i&gt; (HEE). Results include the following: the distributions of genotypes converge to HEE within nine generations (with a relative error of &lt;span&gt;&lt;/span&gt;&lt;math&gt;\u0000 &lt;semantics&gt;\u0000 &lt;mrow&gt;\u0000 &lt;msup&gt;\u0000 &lt;mn&gt;10&lt;/mn&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mo&gt;−&lt;/mo&gt;\u0000 &lt;mn&gt;3&lt;/mn&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;/msup&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;annotation&gt;$$ {10}^{-3} $$&lt;/annotation&gt;\u0000 &lt;/semantics&gt;&lt;/math&gt;), the corrected estimators can asymptotically estimate &lt;span&gt;&lt;/span&gt;&lt;math&gt;\u0000 &lt;semantics&gt;\u0000 &lt;mrow&gt;\u0000 &lt;msub&gt;\u0000 &lt;mi&gt;N&lt;/mi&gt;\u0000 &lt;mi&gt;b&lt;/mi&gt;\u0000 &lt;/msub&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;annotation&gt;$$ {N}_b $$&lt;/annotation&gt;\u0000 &lt;/semantics&gt;&lt;/math&gt; without bias, and estimations for autopolyploids are slightly better than those for diploids. The relationships between &lt;span&gt;&lt;/span&gt;&lt;math&gt;\u0000 &lt;semantics&gt;\u0000 &lt;mrow&gt;\u0000 &lt;msub&gt;\u0000 &lt;mover&gt;\u0000 &lt;mi&gt;N&lt;/mi&gt;\u0000 &lt;mo&gt;̂&lt;/mo&gt;\u0000 &lt;/mover&gt;\u0000 &lt;mi&gt;b&lt;/mi&gt;\u0000 &lt;/msub&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;annotation&gt;$$ {hat{N}}_b $$&lt;/annotation&gt;\u0000 &lt;/semantics&gt;&lt;/math&gt; and","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":"26 2","pages":""},"PeriodicalIF":5.5,"publicationDate":"2026-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12907470/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146199771","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Balancing Barcoding and Genomics: gDNA Quality in Insect Vouchers After HotSHOT DNA Extraction 平衡条形码和基因组学:DNA提取后昆虫凭证的gDNA质量。
IF 5.5 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2026-02-13 DOI: 10.1111/1755-0998.70103
Vivian Feng, Caroline Høegh-Guldberg, Rudolf Meier, Aleš Buček

DNA extraction with an alkaline buffer system called ‘HotSHOT’ is widely used for barcoding because it is rapid, inexpensive, and voucher preserving, but it remains unclear whether sufficient genomic DNA (gDNA) remains in small vouchers for downstream use in genomics. We here evaluate gDNA quality and quantity before and after HotSHOT treatment of 11 insect families representing six orders. Some specimens were flash frozen immediately after collection, while others were kept for 1 week at tropical temperatures in ethanol to mimic Malaise trap conditions. Encouragingly, we show that gDNA of sufficiently high quality and quantity for genomic sequencing remained in specimens treated with HotSHOT. We also show that DNA integrity was strongly influenced by field storage, with specimens exposed to Malaise trap conditions showing such pronounced degradation that the standard HotSHOT treatment no longer significantly altered DNA quality. For control material, HotSHOT treatments involving longer exposure to high temperature led to smaller fragment lengths, with the effect apparently being influenced by the degree of specimen sclerotization. Our results thus suggest that optimised HotSHOT treatments, together with carefully controlled pre-extraction storage, preserve voucher gDNA of sufficient quality for downstream genomic analyses with both short-read and possibly even some long-read sequencing technologies. We provide protocol selection guidelines that improve voucher gDNA preservation in HotSHOT-treated samples. This is particularly important for many species which are only known from one or few specimens discovered during barcoding projects.

使用碱性缓冲系统“HotSHOT”提取DNA被广泛用于条形码,因为它快速,廉价,并且可以保存凭证,但目前尚不清楚是否有足够的基因组DNA (gDNA)保留在小凭证中供下游基因组学使用。本文对6目11科昆虫进行了HotSHOT处理前后gDNA的质量和数量进行了评价。一些标本在收集后立即快速冷冻,而另一些标本在乙醇中在热带温度下保存1周,以模拟萎靡陷阱条件。令人鼓舞的是,我们发现在使用HotSHOT处理的标本中仍然保留了足够高质量和数量的基因组测序gDNA。我们还表明,DNA完整性受到野外储存的强烈影响,暴露在“不适陷阱”条件下的标本表现出如此明显的降解,以至于标准的HotSHOT处理不再显著改变DNA质量。对于对照材料,HotSHOT处理涉及更长时间的高温暴露导致碎片长度较小,其效果显然受到样品硬化程度的影响。因此,我们的研究结果表明,优化的HotSHOT处理方法,加上精心控制的预提取存储,可以保存足够质量的代金物gDNA,用于下游基因组分析,无论是短读还是长读测序技术。我们提供的方案选择指南,提高凭证gDNA保存在hotshot处理的样品。这对于许多在条形码项目中只从一个或几个标本中发现的物种来说尤其重要。
{"title":"Balancing Barcoding and Genomics: gDNA Quality in Insect Vouchers After HotSHOT DNA Extraction","authors":"Vivian Feng,&nbsp;Caroline Høegh-Guldberg,&nbsp;Rudolf Meier,&nbsp;Aleš Buček","doi":"10.1111/1755-0998.70103","DOIUrl":"10.1111/1755-0998.70103","url":null,"abstract":"<p>DNA extraction with an alkaline buffer system called ‘HotSHOT’ is widely used for barcoding because it is rapid, inexpensive, and voucher preserving, but it remains unclear whether sufficient genomic DNA (gDNA) remains in small vouchers for downstream use in genomics. We here evaluate gDNA quality and quantity before and after HotSHOT treatment of 11 insect families representing six orders. Some specimens were flash frozen immediately after collection, while others were kept for 1 week at tropical temperatures in ethanol to mimic Malaise trap conditions. Encouragingly, we show that gDNA of sufficiently high quality and quantity for genomic sequencing remained in specimens treated with HotSHOT. We also show that DNA integrity was strongly influenced by field storage, with specimens exposed to Malaise trap conditions showing such pronounced degradation that the standard HotSHOT treatment no longer significantly altered DNA quality. For control material, HotSHOT treatments involving longer exposure to high temperature led to smaller fragment lengths, with the effect apparently being influenced by the degree of specimen sclerotization. Our results thus suggest that optimised HotSHOT treatments, together with carefully controlled pre-extraction storage, preserve voucher gDNA of sufficient quality for downstream genomic analyses with both short-read and possibly even some long-read sequencing technologies. We provide protocol selection guidelines that improve voucher gDNA preservation in HotSHOT-treated samples. This is particularly important for many species which are only known from one or few specimens discovered during barcoding projects.</p>","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":"26 2","pages":""},"PeriodicalIF":5.5,"publicationDate":"2026-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12905518/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146193823","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Estimating Organism Abundance Using Within-Sample Haplotype Frequencies of eDNA Data 利用eDNA数据的样本内单倍型频率估计生物体丰度。
IF 5.5 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2026-02-13 DOI: 10.1111/1755-0998.70104
Pedro F. P. Brandão-Dias, Gledis Guri, Megan R. Shaffer, Elizabeth Andruszkiewicz Allan, Ryan P. Kelly

Environmental DNA (eDNA) provides powerful insights into species presence and community composition but remains limited in its capacity to infer species abundance or population structure. Here, we show that the deviation between within-sample haplotype frequencies and the overall population-level haplotype frequencies can be used to estimate the number of individual contributors to a given sample. We first establish the theoretical framework for approximating population haplotype frequencies directly from eDNA data, enabling application even in the absence of tissue-derived references. Building on this foundation, we introduce a maximum likelihood estimator to infer the number of contributors and assess its performance through simulations spanning a range of haplotype frequency distributions and noise scenarios. These approaches assume that all samples are drawn from a single, panmictic population. We find that accurate estimates are attainable when haplotypes are sufficiently variable, population frequencies are well-characterised, and samples are large enough to capture frequency deviations. By bridging population genetic theory and eDNA, our method complements existing molecular approaches and offers a novel path towards quantifying abundance from eDNA metabarcoding data.

环境DNA (Environmental DNA, eDNA)提供了对物种存在和群落组成的有力见解,但在推断物种丰度或种群结构方面仍然有限。在这里,我们展示了样本内单倍型频率与总体种群水平单倍型频率之间的偏差可以用来估计给定样本的个体贡献者数量。我们首先建立了直接从eDNA数据近似种群单倍型频率的理论框架,即使在缺乏组织来源参考的情况下也能应用。在此基础上,我们引入了一个最大似然估计器来推断贡献者的数量,并通过模拟一系列单倍型频率分布和噪声场景来评估其性能。这些方法假设所有的样本都是从一个单一的、泛疫的种群中抽取的。我们发现,当单倍型有足够的变化,种群频率有很好的特征,样本足够大以捕获频率偏差时,就可以实现准确的估计。通过连接群体遗传理论和eDNA,我们的方法补充了现有的分子方法,并为从eDNA元条形码数据定量丰度提供了一种新的途径。
{"title":"Estimating Organism Abundance Using Within-Sample Haplotype Frequencies of eDNA Data","authors":"Pedro F. P. Brandão-Dias,&nbsp;Gledis Guri,&nbsp;Megan R. Shaffer,&nbsp;Elizabeth Andruszkiewicz Allan,&nbsp;Ryan P. Kelly","doi":"10.1111/1755-0998.70104","DOIUrl":"10.1111/1755-0998.70104","url":null,"abstract":"<p>Environmental DNA (eDNA) provides powerful insights into species presence and community composition but remains limited in its capacity to infer species abundance or population structure. Here, we show that the deviation between within-sample haplotype frequencies and the overall population-level haplotype frequencies can be used to estimate the number of individual contributors to a given sample. We first establish the theoretical framework for approximating population haplotype frequencies directly from eDNA data, enabling application even in the absence of tissue-derived references. Building on this foundation, we introduce a maximum likelihood estimator to infer the number of contributors and assess its performance through simulations spanning a range of haplotype frequency distributions and noise scenarios. These approaches assume that all samples are drawn from a single, panmictic population. We find that accurate estimates are attainable when haplotypes are sufficiently variable, population frequencies are well-characterised, and samples are large enough to capture frequency deviations. By bridging population genetic theory and eDNA, our method complements existing molecular approaches and offers a novel path towards quantifying abundance from eDNA metabarcoding data.</p>","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":"26 2","pages":""},"PeriodicalIF":5.5,"publicationDate":"2026-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12902907/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146176846","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Polarising SNPs Without Outgroup 没有外群的极化snp。
IF 5.5 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2026-02-06 DOI: 10.1111/1755-0998.70105
Jinyang Liang, Julien Y. Dutheil

Asserting which allele is ancestral or derived, known as polarisation, is a prerequisite of many population and quantitative genetic methods. One important application is the inference of the unfolded site-frequency spectrum (uSFS). The most widely used approaches are based on outgroup data. However, for studies on species with only distantly related outgroups, large divergence between the ingroup and outgroup can result in alignment difficulties and substantial missing data, causing many sites of interest to be lost. Here, we present PolarBEAR (Polarisation By Estimation of the Ancestral Recombination graph), a method that uses the local genealogies from the ancestral recombination graph (ARG) to infer ancestral states. We show that PolarBEAR reaches high accuracy in polarisation and uSFS estimation using simulations under several scenarios. This accuracy, however, heavily depends on the ARG reconstruction method employed. We also applied our method to human population data and compared it with the outgroup-based method est-sfs. Although PolarBEAR could not infer the ancestral state with high confidence at certain positions, it obtained results for positions that est-sfs could not polarise due to missing outgroup data. The polarisation results of the two methods were highly consistent at positions inferred by both methods. The two methods inferred similar uSFS, with PolarBEAR estimating slightly fewer high-frequency derived alleles. Furthermore, we demonstrate that PolarBEAR is robust across different mutation models in our simulations, while est-sfs exhibits a bias in the presence of heterogeneous base composition. PolarBEAR can complement outgroup-based methods, or replace them when no appropriate outgroup sequence is available.

确定哪个等位基因是祖先或衍生的,称为极化,是许多种群和定量遗传方法的先决条件。一个重要的应用是对未展开的站点频谱(uSFS)的推断。最广泛使用的方法是基于外群数据。然而,对于只有远亲外群的物种研究,内群和外群之间的巨大差异可能导致比对困难和大量数据丢失,导致许多感兴趣的位点丢失。在这里,我们提出了PolarBEAR (Polarisation By Estimation of the ancestor Recombination graph),这是一种使用祖先重组图(ARG)的局部系谱来推断祖先状态的方法。我们通过几种场景下的模拟表明,PolarBEAR在极化和uSFS估计方面达到了很高的精度。然而,这种准确性在很大程度上取决于所采用的ARG重建方法。我们还将该方法应用于人口数据,并与基于外群的est-sfs方法进行了比较。虽然在某些位置上,PolarBEAR不能高置信度地推断出祖先状态,但对于est-sfs由于缺少外群数据而无法极化的位置,它得到了结果。两种方法的极化结果在两种方法推断的位置高度一致。两种方法推断出相似的uSFS,其中PolarBEAR估计的高频衍生等位基因略少。此外,我们在模拟中证明了PolarBEAR在不同的突变模型中都具有鲁棒性,而est-sfs在存在异质碱基组成时表现出偏见。PolarBEAR可以补充基于外群的方法,或者在没有合适的外群序列时替换它们。
{"title":"Polarising SNPs Without Outgroup","authors":"Jinyang Liang,&nbsp;Julien Y. Dutheil","doi":"10.1111/1755-0998.70105","DOIUrl":"10.1111/1755-0998.70105","url":null,"abstract":"<p>Asserting which allele is ancestral or derived, known as polarisation, is a prerequisite of many population and quantitative genetic methods. One important application is the inference of the unfolded site-frequency spectrum (uSFS). The most widely used approaches are based on outgroup data. However, for studies on species with only distantly related outgroups, large divergence between the ingroup and outgroup can result in alignment difficulties and substantial missing data, causing many sites of interest to be lost. Here, we present PolarBEAR (Polarisation By Estimation of the Ancestral Recombination graph), a method that uses the local genealogies from the ancestral recombination graph (ARG) to infer ancestral states. We show that PolarBEAR reaches high accuracy in polarisation and uSFS estimation using simulations under several scenarios. This accuracy, however, heavily depends on the ARG reconstruction method employed. We also applied our method to human population data and compared it with the outgroup-based method est-sfs. Although PolarBEAR could not infer the ancestral state with high confidence at certain positions, it obtained results for positions that est-sfs could not polarise due to missing outgroup data. The polarisation results of the two methods were highly consistent at positions inferred by both methods. The two methods inferred similar uSFS, with PolarBEAR estimating slightly fewer high-frequency derived alleles. Furthermore, we demonstrate that PolarBEAR is robust across different mutation models in our simulations, while est-sfs exhibits a bias in the presence of heterogeneous base composition. PolarBEAR can complement outgroup-based methods, or replace them when no appropriate outgroup sequence is available.</p>","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":"26 2","pages":""},"PeriodicalIF":5.5,"publicationDate":"2026-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12878804/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146123046","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correction to “Advancing Yeast Identification Using High-Throughput DNA Barcode Data From a Curated Culture Collection” 更正“利用高通量DNA条形码数据从策划的培养收集推进酵母鉴定”。
IF 5.5 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2026-02-01 DOI: 10.1111/1755-0998.70106
<p>Vu, D., M. de Vries, B. G. van den Ende, et al. 2026. “Advancing Yeast Identification Using High-Throughput DNA Barcode Data From a Curated Culture Collection.” <i>Molecular Ecology Resources</i> 26, no. 1: e70082. https://doi.org/10.1111/1755-0998.70082.</p><p>In the published article, one author's name was incorrectly presented. The name ‘Magarita Hernández-Restrepo’ should be corrected to ‘Margarita Hernández-Restrepo’.</p><p>In addition, errors occurred in the numbering of items in Sections 2.2 (‘Methods’) and 3 (‘Results’), where citation references were mistakenly used instead of numerical list indicators.</p><p>Section 2.2 Methods</p><p>Incorrect text:</p><p>We used dnabarcoder (Vu et al. 2022, available at https://github.com/vuthuyduong/dnabarcoder) to: (Abarenkov et al. 2016) analyse the CBS yeast barcode datasets including their taxonomic classification as well as the intra-specific variation within taxonomic groups (Abarenkov et al. 2022); predict global and local similarity cutoffs and resolving powers of the biomarkers ITS, ITS1, ITS2 and LSU for yeast identification; and (Abarenkov et al. 2024) reclassify the human microbiome dataset against the yeastITS2 and filfungalITS2 datasets using the similarity cutoffs predicted for yeast and filamentous fungal identification.</p><p>Correct text:</p><p>We used dnabarcoder (Vu et al. 2022, available at https://github.com/vuthuyduong/dnabarcoder) to: (1) analyse the CBS yeast barcode datasets including their taxonomic classification as well as the intra-specific variation within taxonomic groups; (2) predict global and local similarity cutoffs and resolving powers of the biomarkers ITS, ITS1, ITS2 and LSU for yeast identification; and (3) reclassify the human microbiome dataset against the yeastITS2 and filfungalITS2 datasets using the similarity cutoffs predicted for yeast and filamentous fungal identification.</p><p>Section 3 Results</p><p>Incorrect text:</p><p>We evaluated CBS yeast barcodes (yeastITS and yeastLSU) and their resolving power for yeast identification by: (Abarenkov et al. 2016) analysing their taxonomic classification, sequence distribution, and intra-specific variation within different yeast taxonomic groups (Abarenkov et al. 2022); calculating the number of indistinguishable yeast species using ITS and LSU (Abarenkov et al. 2024); predicting global and local similarity cutoffs and the resolving power of different biomarkers (viz. ITS, ITS1, ITS2 and LSU) based on the CBS yeast barcode datasets; and (Alper et al. 2011) examining whether recent name changes in 2024 (Liu, Hu, Yurkov, et al. 2024; Liu, Hu, Zhao, et al. 2024; Zhu et al. 2024) improved the resolving power of ITS and LSU for yeast identification. We highlighted the importance of well-curated databases with up-to-date taxonomy by reclassifying the human microbiome dataset using the yeastITS2 and filfungalITS2 datasets with predicted similarity cutoffs, and by comparing the results with those from Nash et al. (20
吴,D, M. de Vries, B. G. van den Ende等。2026。“利用高通量DNA条形码数据推进酵母鉴定。”分子生态资源,第26期。1: e70082。https://doi.org/10.1111/1755-0998.70082.In在发表的文章中,一个作者的名字写错了。“玛格丽塔Hernández-Restrepo”这个名字应该更正为“玛格丽塔Hernández-Restrepo”。此外,第2.2节(“方法”)和第3节(“结果”)的项目编号出现错误,其中错误地使用了引文参考而不是数字列表指标。正确文本:我们使用dnabarcoder (Vu et al. 2022,可在https://github.com/vuthuyduong/dnabarcoder获得):(Abarenkov et al. 2016)分析CBS酵母条形码数据集,包括其分类分类以及分类组内的种内变异(Abarenkov et al. 2022);预测生物标志物ITS、ITS1、ITS2和LSU在酵母鉴定中的全局和局部相似性截止点和分辨能力;和(Abarenkov et al. 2024)使用酵母和丝状真菌鉴定预测的相似性截止点,根据酵母its2和filfungalITS2数据集对人类微生物组数据集进行重新分类。我们使用dnabarcoder (Vu et al. 2022,可在https://github.com/vuthuyduong/dnabarcoder上获得):(1)分析CBS酵母条形码数据集,包括它们的分类分类以及分类类群内的种内变异;(2)预测生物标志物ITS、ITS1、ITS2和LSU在酵母鉴定中的全局和局部相似性截止点和分辨能力;(3)利用酵母和丝状真菌鉴定预测的相似性截断点,根据酵母its2和filfungalITS2数据集对人类微生物组数据集进行重新分类。我们通过(Abarenkov et al. 2016)分析了CBS酵母条形码(yeastITS和yeastLSU)的分类分类、序列分布和不同酵母分类类群的种内变异(Abarenkov et al. 2022),评估了它们对酵母鉴定的分辨能力(Abarenkov et al. 2016);使用ITS和LSU计算不可区分酵母菌种的数量(Abarenkov et al. 2024);基于CBS酵母条形码数据集预测不同生物标志物(ITS、ITS1、ITS2和LSU)的全局和局部相似性截止点和分辨能力;和(Alper et al. 2011)研究了2024年最近的名称变化(Liu, Hu, Yurkov, et al. 2024; Liu, Hu, Zhao, et al. 2024; Zhu et al. 2024)是否提高了ITS和LSU对酵母鉴定的分辨能力。我们通过使用具有预测相似性截止点的yeasits2和filfungalITS2数据集对人类微生物组数据集进行重新分类,并将结果与Nash等人(2017)的结果进行比较,强调了具有最新分类法的精心整理的数据库的重要性。我们对CBS酵母条形码(yeastITS和yeastLSU)及其酵母鉴定的分辨能力进行了评估:(1)分析了它们的分类分类、序列分布和不同酵母分类类群内的种内变异;(2)利用ITS和LSU计算不可区分酵母菌种数;(3)基于CBS酵母条形码数据集预测ITS、ITS1、ITS2和LSU等不同生物标志物的全局和局部相似性截止点及分辨能力;(4)检验2024年最近的名称变化(Liu, Hu, Yurkov, et . 2024; Liu, Hu, Zhao, et . 2024; Zhu et al. 2024)是否提高了ITS和LSU对酵母鉴定的分辨能力。我们通过使用具有预测相似性截止点的yeasits2和filfungalITS2数据集对人类微生物组数据集进行重新分类,并将结果与Nash等人(2017)的结果进行比较,强调了具有最新分类法的精心整理的数据库的重要性。我们为这些错误道歉。
{"title":"Correction to “Advancing Yeast Identification Using High-Throughput DNA Barcode Data From a Curated Culture Collection”","authors":"","doi":"10.1111/1755-0998.70106","DOIUrl":"10.1111/1755-0998.70106","url":null,"abstract":"&lt;p&gt;Vu, D., M. de Vries, B. G. van den Ende, et al. 2026. “Advancing Yeast Identification Using High-Throughput DNA Barcode Data From a Curated Culture Collection.” &lt;i&gt;Molecular Ecology Resources&lt;/i&gt; 26, no. 1: e70082. https://doi.org/10.1111/1755-0998.70082.&lt;/p&gt;&lt;p&gt;In the published article, one author's name was incorrectly presented. The name ‘Magarita Hernández-Restrepo’ should be corrected to ‘Margarita Hernández-Restrepo’.&lt;/p&gt;&lt;p&gt;In addition, errors occurred in the numbering of items in Sections 2.2 (‘Methods’) and 3 (‘Results’), where citation references were mistakenly used instead of numerical list indicators.&lt;/p&gt;&lt;p&gt;Section 2.2 Methods&lt;/p&gt;&lt;p&gt;Incorrect text:&lt;/p&gt;&lt;p&gt;We used dnabarcoder (Vu et al. 2022, available at https://github.com/vuthuyduong/dnabarcoder) to: (Abarenkov et al. 2016) analyse the CBS yeast barcode datasets including their taxonomic classification as well as the intra-specific variation within taxonomic groups (Abarenkov et al. 2022); predict global and local similarity cutoffs and resolving powers of the biomarkers ITS, ITS1, ITS2 and LSU for yeast identification; and (Abarenkov et al. 2024) reclassify the human microbiome dataset against the yeastITS2 and filfungalITS2 datasets using the similarity cutoffs predicted for yeast and filamentous fungal identification.&lt;/p&gt;&lt;p&gt;Correct text:&lt;/p&gt;&lt;p&gt;We used dnabarcoder (Vu et al. 2022, available at https://github.com/vuthuyduong/dnabarcoder) to: (1) analyse the CBS yeast barcode datasets including their taxonomic classification as well as the intra-specific variation within taxonomic groups; (2) predict global and local similarity cutoffs and resolving powers of the biomarkers ITS, ITS1, ITS2 and LSU for yeast identification; and (3) reclassify the human microbiome dataset against the yeastITS2 and filfungalITS2 datasets using the similarity cutoffs predicted for yeast and filamentous fungal identification.&lt;/p&gt;&lt;p&gt;Section 3 Results&lt;/p&gt;&lt;p&gt;Incorrect text:&lt;/p&gt;&lt;p&gt;We evaluated CBS yeast barcodes (yeastITS and yeastLSU) and their resolving power for yeast identification by: (Abarenkov et al. 2016) analysing their taxonomic classification, sequence distribution, and intra-specific variation within different yeast taxonomic groups (Abarenkov et al. 2022); calculating the number of indistinguishable yeast species using ITS and LSU (Abarenkov et al. 2024); predicting global and local similarity cutoffs and the resolving power of different biomarkers (viz. ITS, ITS1, ITS2 and LSU) based on the CBS yeast barcode datasets; and (Alper et al. 2011) examining whether recent name changes in 2024 (Liu, Hu, Yurkov, et al. 2024; Liu, Hu, Zhao, et al. 2024; Zhu et al. 2024) improved the resolving power of ITS and LSU for yeast identification. We highlighted the importance of well-curated databases with up-to-date taxonomy by reclassifying the human microbiome dataset using the yeastITS2 and filfungalITS2 datasets with predicted similarity cutoffs, and by comparing the results with those from Nash et al. (20","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":"26 2","pages":""},"PeriodicalIF":5.5,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12862242/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146099660","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Properties and Limitations of eDNA Substrates for Terrestrial Animal Monitoring 用于陆生动物监测的eDNA底物的性质和局限性。
IF 5.5 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2026-01-23 DOI: 10.1111/1755-0998.70096
Beilun Zhao, Tobias Andermann

Environmental DNA (eDNA) constitutes a valuable tool for monitoring terrestrial animal diversity, but outcomes are affected by multiple factors. Among these factors, the choice of sampling substrate and method is especially important and must be aligned with research objectives. We reviewed 245 published studies that utilise eDNA for terrestrial animal monitoring and compiled an overview of the most frequently used environmental substrates. Based on the reviewed literature, we provide a key description of each substrate, as well as its particular properties and limitations related to the detection of animal species across different spatial and temporal scales. We categorise these substrates into three groups: abiotic substrates (soil, water, air, sediment), biotic substrates (invertebrate samples, plant tissues, spiderwebs) and direct-evidence substrates (scat, footprints, shelters, feeding sites). In addition, we identify several key challenges with the interpretation of eDNA-based biodiversity monitoring, including false negatives and false positives, as well as the dynamics of spatial and temporal deviations. The latter concepts, which we propose and define in this review, describe the temporal and spatial discrepancies between the DNA source and its detection in a given sample. We reflect on how these temporal and spatial deviations are expected to affect eDNA data extracted from the different types of substrates and how knowledge of these dynamics can inform effective and accurate biomonitoring. In summary, this review provides a decision basis for designing terrestrial eDNA monitoring studies by summarising the properties and limitations of different substrates and contextualising the interpretation of results in light of substrate-specific challenges.

环境DNA (eDNA)是监测陆生动物多样性的重要工具,但其结果受到多种因素的影响。在这些因素中,取样基质和方法的选择尤为重要,必须与研究目标相一致。我们回顾了245项利用eDNA进行陆生动物监测的已发表研究,并对最常用的环境底物进行了概述。在回顾文献的基础上,我们提供了每种基质的关键描述,以及它们在不同时空尺度上与动物物种检测相关的特性和局限性。我们将这些基质分为三类:非生物基质(土壤、水、空气、沉积物)、生物基质(无脊椎动物样本、植物组织、蜘蛛网)和直接证据基质(粪便、脚印、庇护所、喂食地点)。此外,我们还确定了基于edna的生物多样性监测解释的几个关键挑战,包括假阴性和假阳性,以及时空偏差的动态。后一个概念,我们提出和定义在这篇综述中,描述的时间和空间差异的DNA来源和其检测在一个给定的样品。我们反思了这些时空偏差如何影响从不同类型的底物中提取的eDNA数据,以及这些动态的知识如何为有效和准确的生物监测提供信息。总之,本综述通过总结不同底物的特性和局限性,并根据底物特异性挑战将结果的解释背景化,为设计陆地eDNA监测研究提供决策基础。
{"title":"Properties and Limitations of eDNA Substrates for Terrestrial Animal Monitoring","authors":"Beilun Zhao,&nbsp;Tobias Andermann","doi":"10.1111/1755-0998.70096","DOIUrl":"10.1111/1755-0998.70096","url":null,"abstract":"<p>Environmental DNA (eDNA) constitutes a valuable tool for monitoring terrestrial animal diversity, but outcomes are affected by multiple factors. Among these factors, the choice of sampling substrate and method is especially important and must be aligned with research objectives. We reviewed 245 published studies that utilise eDNA for terrestrial animal monitoring and compiled an overview of the most frequently used environmental substrates. Based on the reviewed literature, we provide a key description of each substrate, as well as its particular properties and limitations related to the detection of animal species across different spatial and temporal scales. We categorise these substrates into three groups: abiotic substrates (soil, water, air, sediment), biotic substrates (invertebrate samples, plant tissues, spiderwebs) and direct-evidence substrates (scat, footprints, shelters, feeding sites). In addition, we identify several key challenges with the interpretation of eDNA-based biodiversity monitoring, including false negatives and false positives, as well as the dynamics of spatial and temporal deviations. The latter concepts, which we propose and define in this review, describe the temporal and spatial discrepancies between the DNA source and its detection in a given sample. We reflect on how these temporal and spatial deviations are expected to affect eDNA data extracted from the different types of substrates and how knowledge of these dynamics can inform effective and accurate biomonitoring. In summary, this review provides a decision basis for designing terrestrial eDNA monitoring studies by summarising the properties and limitations of different substrates and contextualising the interpretation of results in light of substrate-specific challenges.</p>","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":"26 2","pages":""},"PeriodicalIF":5.5,"publicationDate":"2026-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12831013/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146040187","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
From Permits to Samples: Addressing Key Challenges for High-Quality Reference Genome Generation in Europe 从许可证到样本:解决欧洲高质量参考基因组生成的关键挑战。
IF 5.5 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY Pub Date : 2026-01-20 DOI: 10.1111/1755-0998.70100
Katja Reichel, Jaakko Pohjoismäki, Jonas J. Astrin, Astrid Böhne, Chiara Bortoluzzi, Elena Bužan, Javier del Campo, Claudio Ciofi, Camilla B. Di-Nizo, Pradeep K. Divakar, Carola Greve, Vladimír Hampl, Leon Hilgers, Veronika N. Laine, Jennifer A. Leonard, Jesus Lozano-Fernandez, Lada Lukić Bilela, Camila J. Mazzoni, Ann M. McCartney, José Melo-Ferreira, Rita Monteiro, Rebekah A. Oomen, Martina Pavlek, João Pimenta, Michal Rindos, Ole Seehausen, Andrii Tarieiev, Salvatore Tomasello, Olga Vinnere Pettersson, Robert M. Waterhouse, Alexandra A.-T. Weber, Oleksandr Zinenko, Christian de Guttry

High-quality reference genome assemblies have become essential for deepening our understanding of biodiversity, yet obtaining them for many species remains surprisingly challenging. Drawing on experiences from the European Reference Genome Atlas (ERGA) community, we focus on permit and sample-handling procedures leading up to nucleic acid sequencing, covering tasks such as ensuring ethical and legal compliance, verifying accurate species identification, maintaining sample integrity during transport, and isolating high-quality DNA or nuclei. While many of the challenges and solutions we discuss are broadly relevant, our regulatory and logistical examples are primarily from Europe. By synthesising practical guidance, we highlight the crucial importance of taxonomic expertise, proper vouchering and biobanking, rigorous cold-chain management or alternative preservation methods, and emphasise adherence to packaging and shipping requirements for biological materials. We showcase examples spanning diverse regions, taxa and source materials, which underscore the importance of context-specific strategies and internationally harmonised protocols, particularly for metadata reporting. Our recommendations aim to support both small-scale projects and large initiatives, directing collective efforts to facilitate efficient sampling, vouchering and sample processing for future genomic studies.

高质量的参考基因组组装对于加深我们对生物多样性的理解至关重要,然而获得许多物种的参考基因组组装仍然具有惊人的挑战性。借鉴欧洲参考基因组图谱(ERGA)社区的经验,我们专注于导致核酸测序的许可和样品处理程序,涵盖诸如确保道德和法律合规,验证准确的物种鉴定,在运输过程中保持样品完整性以及分离高质量DNA或细胞核等任务。虽然我们讨论的许多挑战和解决方案具有广泛的相关性,但我们的监管和物流示例主要来自欧洲。通过综合实践指导,我们强调了分类学专业知识、适当的凭证和生物银行、严格的冷链管理或替代保存方法的重要性,并强调遵守生物材料的包装和运输要求。我们展示了跨越不同地区、分类群和来源材料的例子,这些例子强调了针对具体情况的策略和国际协调协议的重要性,特别是在元数据报告方面。我们的建议旨在支持小规模和大型项目,指导集体努力,以促进有效的采样,凭证和样品处理,为未来的基因组研究。
{"title":"From Permits to Samples: Addressing Key Challenges for High-Quality Reference Genome Generation in Europe","authors":"Katja Reichel,&nbsp;Jaakko Pohjoismäki,&nbsp;Jonas J. Astrin,&nbsp;Astrid Böhne,&nbsp;Chiara Bortoluzzi,&nbsp;Elena Bužan,&nbsp;Javier del Campo,&nbsp;Claudio Ciofi,&nbsp;Camilla B. Di-Nizo,&nbsp;Pradeep K. Divakar,&nbsp;Carola Greve,&nbsp;Vladimír Hampl,&nbsp;Leon Hilgers,&nbsp;Veronika N. Laine,&nbsp;Jennifer A. Leonard,&nbsp;Jesus Lozano-Fernandez,&nbsp;Lada Lukić Bilela,&nbsp;Camila J. Mazzoni,&nbsp;Ann M. McCartney,&nbsp;José Melo-Ferreira,&nbsp;Rita Monteiro,&nbsp;Rebekah A. Oomen,&nbsp;Martina Pavlek,&nbsp;João Pimenta,&nbsp;Michal Rindos,&nbsp;Ole Seehausen,&nbsp;Andrii Tarieiev,&nbsp;Salvatore Tomasello,&nbsp;Olga Vinnere Pettersson,&nbsp;Robert M. Waterhouse,&nbsp;Alexandra A.-T. Weber,&nbsp;Oleksandr Zinenko,&nbsp;Christian de Guttry","doi":"10.1111/1755-0998.70100","DOIUrl":"10.1111/1755-0998.70100","url":null,"abstract":"<p>High-quality reference genome assemblies have become essential for deepening our understanding of biodiversity, yet obtaining them for many species remains surprisingly challenging. Drawing on experiences from the European Reference Genome Atlas (ERGA) community, we focus on permit and sample-handling procedures leading up to nucleic acid sequencing, covering tasks such as ensuring ethical and legal compliance, verifying accurate species identification, maintaining sample integrity during transport, and isolating high-quality DNA or nuclei. While many of the challenges and solutions we discuss are broadly relevant, our regulatory and logistical examples are primarily from Europe. By synthesising practical guidance, we highlight the crucial importance of taxonomic expertise, proper vouchering and biobanking, rigorous cold-chain management or alternative preservation methods, and emphasise adherence to packaging and shipping requirements for biological materials. We showcase examples spanning diverse regions, taxa and source materials, which underscore the importance of context-specific strategies and internationally harmonised protocols, particularly for metadata reporting. Our recommendations aim to support both small-scale projects and large initiatives, directing collective efforts to facilitate efficient sampling, vouchering and sample processing for future genomic studies.</p>","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":"26 2","pages":""},"PeriodicalIF":5.5,"publicationDate":"2026-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12820446/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146007975","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Molecular Ecology Resources
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1