首页 > 最新文献

Forensic science international. Genetics最新文献

英文 中文
Dual-kingdom necrobiome succession extends postmortem interval estimation into skeletonization. 双界坏死群落演替将死后间隔估计扩展到骨骼化。
IF 3.1 Pub Date : 2026-02-04 DOI: 10.1016/j.fsigen.2026.103445
Elie Pascolo Tièche, Lara Indra, Alexandre Gouy, Gerald Heckel, Martin Zieger

Forensic estimation of the postmortem interval (PMI) becomes increasingly challenging when decomposition progresses beyond the initial weeks, as traditional medicolegal indicators lose their temporal precision. Here, we demonstrate that dual-kingdom microbial communities associated with decomposing remains form robust molecular clocks that maintain predictive power for PMI estimation well into skeletonization. Using full-length amplicon sequencing, we tracked the succession of bacteria and fungi in host-associated oral samples and the underlying gravesoil from decomposing pigs (N = 6) over nearly five months (2170.0 accumulated degree days; ADD). Microbial communities exhibited consistent three-phase succession patterns - initial disruption, intermediate colonization, and late-stage stabilization - that aligned with morphological decomposition scoring. Necrobiome succession dynamics continued long after morphological decomposition metrics reached a plateau, demonstrating the extended temporal resolution provided by microbial markers. Machine-learning models integrating microbial features with morphological data achieved robust predictive accuracy for both PMI in days and ADD, with performance varying systematically across decomposition stages. Bacterial models dominated early decomposition, dual-kingdom approaches optimized intermediate phases, and fungal models excelled during late-stage decomposition when conventional indicators fail. We identified specific microbial taxa that serve as reliable temporal indicators across sample types. These findings demonstrate that necrobiome succession extends capabilities to estimate time since death by months, offering a molecular framework for advanced decomposition cases where traditional methods lose precision.

由于传统的法医指标失去了时间上的精确性,当尸体分解超过最初的几周后,法医对死后时间间隔(PMI)的估计变得越来越具有挑战性。在这里,我们证明了与分解遗骸相关的双界微生物群落形成了强大的分子钟,在骨骼化过程中保持PMI估计的预测能力。利用全长扩增子测序,我们在近5个月(2170.0累积度天;ADD)的时间里,追踪了宿主相关口腔样本和分解猪(N = 6)的坟墓油中细菌和真菌的演代。微生物群落表现出一致的三阶段演替模式——初始破坏、中期定植和后期稳定——与形态分解评分一致。坏死生物群落演替动态在形态分解指标达到平台期后仍持续很长时间,这表明微生物标记提供了延长的时间分辨率。整合微生物特征和形态学数据的机器学习模型对PMI和ADD都实现了稳健的预测准确性,其性能在分解阶段有系统的变化。细菌模型主导了早期分解,双界方法优化了中间阶段,而真菌模型在常规指标失效的后期分解中表现出色。我们确定了特定的微生物分类群,作为可靠的跨样品类型的时间指标。这些发现表明,坏死性生物群落的演替将估计死亡时间的能力延长了几个月,为传统方法失去精度的高级分解病例提供了分子框架。
{"title":"Dual-kingdom necrobiome succession extends postmortem interval estimation into skeletonization.","authors":"Elie Pascolo Tièche, Lara Indra, Alexandre Gouy, Gerald Heckel, Martin Zieger","doi":"10.1016/j.fsigen.2026.103445","DOIUrl":"https://doi.org/10.1016/j.fsigen.2026.103445","url":null,"abstract":"<p><p>Forensic estimation of the postmortem interval (PMI) becomes increasingly challenging when decomposition progresses beyond the initial weeks, as traditional medicolegal indicators lose their temporal precision. Here, we demonstrate that dual-kingdom microbial communities associated with decomposing remains form robust molecular clocks that maintain predictive power for PMI estimation well into skeletonization. Using full-length amplicon sequencing, we tracked the succession of bacteria and fungi in host-associated oral samples and the underlying gravesoil from decomposing pigs (N = 6) over nearly five months (2170.0 accumulated degree days; ADD). Microbial communities exhibited consistent three-phase succession patterns - initial disruption, intermediate colonization, and late-stage stabilization - that aligned with morphological decomposition scoring. Necrobiome succession dynamics continued long after morphological decomposition metrics reached a plateau, demonstrating the extended temporal resolution provided by microbial markers. Machine-learning models integrating microbial features with morphological data achieved robust predictive accuracy for both PMI in days and ADD, with performance varying systematically across decomposition stages. Bacterial models dominated early decomposition, dual-kingdom approaches optimized intermediate phases, and fungal models excelled during late-stage decomposition when conventional indicators fail. We identified specific microbial taxa that serve as reliable temporal indicators across sample types. These findings demonstrate that necrobiome succession extends capabilities to estimate time since death by months, offering a molecular framework for advanced decomposition cases where traditional methods lose precision.</p>","PeriodicalId":94012,"journal":{"name":"Forensic science international. Genetics","volume":"83 ","pages":"103445"},"PeriodicalIF":3.1,"publicationDate":"2026-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146133698","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
EaglePlex: Three STR multiplex panels optimized and validated for forensic identification and sex determination of Bald Eagles (Haliaeetus leucocephalus) and Golden Eagles (Aquila chrysaetos ). EaglePlex:优化并验证了三种STR多重面板,用于白头鹰(halaeetus leucocephalus)和金雕(Aquila chrysaetos)的法医鉴定和性别确定。
IF 3.1 Pub Date : 2026-02-03 DOI: 10.1016/j.fsigen.2026.103442
Darren J Wostenberg, Mary K Burnham-Curtis

Bald and Golden eagles are the largest raptors in North America and species of great cultural and ecological importance. Bald and Golden eagles and their parts are still used by indigenous peoples in crafts and ceremonies. The sale of eagle feathers and body parts in illegal markets continues to threaten the recovery of Bald and Golden eagle populations after decades of protection under the Endangered Species Act (1973) and the Bald and Golden Eagle Protection Act (1940). The ability to identify eagle parts and match evidence items among eagle victims is an important part of prosecuting wildlife crimes that involve these species. The three short tandem repeat (STR) multiplex panels we developed target 19 STR loci and the sex-linked chromo helicase DNA binding (CHD) gene to identify species and sex of eagles as well as provide the ability to match and individualize eagle forensic evidence. A collated database of complete genotypes from 289 Bald Eagles and 222 Golden Eagles from North America was created to serve as population references for the purpose of calculating match probabilities and likelihood ratios. The Bald Eagle database has a median theta-corrected match probability of 5.99 × 10-8 and a median likelihood ratio of 1.67 × 107. The Golden Eagle database has a median theta-corrected match probability of 4.68 × 10-11 and a median likelihood ratio of 2.14 × 1010.

秃鹰和金雕是北美最大的猛禽,具有重要的文化和生态意义。秃鹰和金雕及其肢体仍然被土著人民用于手工艺和仪式。在受到《濒危物种法》(1973年)和《白头鹰和金鹰保护法》(1940年)几十年的保护后,在非法市场上出售鹰的羽毛和身体部位继续威胁着白头鹰和金鹰数量的恢复。识别鹰的身体部位和匹配鹰受害者的证据项目的能力是起诉涉及这些物种的野生动物犯罪的重要组成部分。我们开发了三个短串联重复序列(STR)多重序列,目标是19个STR位点和性别连锁的染色质解旋酶DNA结合(CHD)基因,用于鉴定鹰的种类和性别,并提供匹配和个性化鹰法医证据的能力。建立了来自北美的289只白头鹰和222只金雕的完整基因型数据库,作为计算匹配概率和似然比的种群参考。Bald Eagle数据库经theta校正后的中位数匹配概率为5.99 × 10-8,中位数似然比为1.67 × 107。金鹰数据库经theta校正后的中位数匹配概率为4.68 × 10-11,中位数似然比为2.14 × 1010。
{"title":"EaglePlex: Three STR multiplex panels optimized and validated for forensic identification and sex determination of Bald Eagles (Haliaeetus leucocephalus) and Golden Eagles (Aquila chrysaetos ).","authors":"Darren J Wostenberg, Mary K Burnham-Curtis","doi":"10.1016/j.fsigen.2026.103442","DOIUrl":"https://doi.org/10.1016/j.fsigen.2026.103442","url":null,"abstract":"<p><p>Bald and Golden eagles are the largest raptors in North America and species of great cultural and ecological importance. Bald and Golden eagles and their parts are still used by indigenous peoples in crafts and ceremonies. The sale of eagle feathers and body parts in illegal markets continues to threaten the recovery of Bald and Golden eagle populations after decades of protection under the Endangered Species Act (1973) and the Bald and Golden Eagle Protection Act (1940). The ability to identify eagle parts and match evidence items among eagle victims is an important part of prosecuting wildlife crimes that involve these species. The three short tandem repeat (STR) multiplex panels we developed target 19 STR loci and the sex-linked chromo helicase DNA binding (CHD) gene to identify species and sex of eagles as well as provide the ability to match and individualize eagle forensic evidence. A collated database of complete genotypes from 289 Bald Eagles and 222 Golden Eagles from North America was created to serve as population references for the purpose of calculating match probabilities and likelihood ratios. The Bald Eagle database has a median theta-corrected match probability of 5.99 × 10<sup>-8</sup> and a median likelihood ratio of 1.67 × 10<sup>7</sup>. The Golden Eagle database has a median theta-corrected match probability of 4.68 × 10<sup>-11</sup> and a median likelihood ratio of 2.14 × 10<sup>10</sup>.</p>","PeriodicalId":94012,"journal":{"name":"Forensic science international. Genetics","volume":"83 ","pages":"103442"},"PeriodicalIF":3.1,"publicationDate":"2026-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146133746","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A comprehensive evaluation of mutation rates and male differentiation using an 80-Y-STR panel. 综合评估突变率和男性分化使用80-Y-STR面板。
IF 3.1 Pub Date : 2026-01-30 DOI: 10.1016/j.fsigen.2026.103439
Xiaoyan Ma, Ran Li, Jiamin Xie, Tingjun Li, Zhiyong Liu, Xinxin Chen, Ziyue Zhong, Jiayang Li, Qing Li, Hongyu Sun

Y-chromosomal short tandem repeats (Y-STRs) are highly informative tools in forensic investigations for tracing paternal lineages and generating investigative leads when direct autosomal STR matches are not available. However, the discriminative power of expanded Y-STR panels and optimal strategies for database searching remain inadequately defined. Here, we first presented kinshipY, an interactive online platform that streamlines the analysis of Y-STR mutations, pedigree structure visualization, genetic distance calculation, and statistical power evaluation. Using this tool, we then empirically estimated the mutation rates of 80 Y-STRs (Forensic Analysis System Multiplecues SetB Kit) based on 488 father-son pairs. Finally, the differentiation rates among male relatives and between unrelated males were evaluated using three deep-rooted families spanning 1-27 meioses. The results showed that a total of 120 mutations were identified, yielding an overall mutation rate of 3.1 × 10⁻³ (95 % CIs: 2.5 × 10-3 - 3.7 × 10-3). Single-step mutations accounted for 95 % of events, with gains and losses occurring at nearly equal frequencies. The SetB panel distinguished 27.96 % of father-son pairs and 50 % of siblings, with differentiation rates reaching 100 % for relationships separated by ≥ 11 meioses. To differentiate males from distinct lineages, we established a threshold‑setting strategy that balances both the false positive rate (FPR) and false negative rate (FNR), demonstrating that step‑difference‑based thresholds outperform locus‑difference‑based thresholds. For the SetB panel, a step‑difference threshold of ≥ 15 enabled the differentiation of 99.59 % of unrelated males. In contrast, panels with fewer Y‑STRs-Yfiler (16 Y‑STRs), Class A (19 Y‑STRs), Yfiler Plus (25 Y‑STRs), and Class A+B (32 Y‑STRs)-exhibited significantly higher FPR and FNR. In summary, this study demonstrates the enhanced resolution offered by the SetB panel. For Y-STR database searching, we recommend the following: (1) use a powerful marker set whenever possible; (2) adopt a step-difference-based matching strategy; (3) apply dynamic, panel-specific thresholds; and (4) when integrating forensic investigative genetic genealogy, include more distant relatives for threshold estimation. These recommendations could provide valuable guidance for forensic practice.

y染色体短串联重复序列(y -STR)是法医调查中非常有用的工具,用于追踪父系和在没有直接常染色体STR匹配时产生调查线索。然而,扩展Y-STR面板的判别能力和数据库搜索的最佳策略仍然没有充分定义。在这里,我们首次提出了kinshipY,这是一个交互式在线平台,可以简化Y-STR突变分析,系谱结构可视化,遗传距离计算和统计功率评估。利用该工具,我们基于488对父子对,经验性地估计了80个Y-STRs (Forensic Analysis System multiecues SetB Kit)的突变率。最后,利用3个跨1-27个减数分裂的深根家族,对雄性亲缘间和非亲缘间的分化率进行了评价。结果显示,总共鉴定出120个突变,总突变率为3.1 × 10⁻³ (95 % CIs: 2.5 × 10-3 - 3.7 × 10-3)。单步突变占95% %的事件,增益和损失发生的频率几乎相等。SetB小组区分出27.96% %的父子对和50% %的兄弟姐妹,对于相隔≥ 11个减数分裂的关系,分化率达到100% %。为了区分不同谱系的雄性,我们建立了一个阈值设置策略,以平衡假阳性率(FPR)和假阴性率(FNR),证明基于步长差异的阈值优于基于位点差异的阈值。对于SetB小组,≥ 15的步差阈值使99.59 %的非亲属男性能够分化。相比之下,Y - STRs较少的面板-Yfiler(16个Y - STRs), A类(19个Y - STRs), Yfiler Plus(25个Y - STRs)和A+B类(32个Y - STRs)-表现出更高的FPR和FNR。总之,这项研究证明了SetB小组提供的增强分辨率。对于Y-STR数据库搜索,我们建议如下:(1)尽可能使用强大的标记集;(2)采用基于步长差分的匹配策略;(3)应用动态的、特定于面板的阈值;(4)在整合法医调查遗传谱系时,纳入更多的远亲进行阈值估计。这些建议可为法医实践提供有价值的指导。
{"title":"A comprehensive evaluation of mutation rates and male differentiation using an 80-Y-STR panel.","authors":"Xiaoyan Ma, Ran Li, Jiamin Xie, Tingjun Li, Zhiyong Liu, Xinxin Chen, Ziyue Zhong, Jiayang Li, Qing Li, Hongyu Sun","doi":"10.1016/j.fsigen.2026.103439","DOIUrl":"https://doi.org/10.1016/j.fsigen.2026.103439","url":null,"abstract":"<p><p>Y-chromosomal short tandem repeats (Y-STRs) are highly informative tools in forensic investigations for tracing paternal lineages and generating investigative leads when direct autosomal STR matches are not available. However, the discriminative power of expanded Y-STR panels and optimal strategies for database searching remain inadequately defined. Here, we first presented kinshipY, an interactive online platform that streamlines the analysis of Y-STR mutations, pedigree structure visualization, genetic distance calculation, and statistical power evaluation. Using this tool, we then empirically estimated the mutation rates of 80 Y-STRs (Forensic Analysis System Multiplecues SetB Kit) based on 488 father-son pairs. Finally, the differentiation rates among male relatives and between unrelated males were evaluated using three deep-rooted families spanning 1-27 meioses. The results showed that a total of 120 mutations were identified, yielding an overall mutation rate of 3.1 × 10⁻³ (95 % CIs: 2.5 × 10<sup>-3</sup> - 3.7 × 10<sup>-3</sup>). Single-step mutations accounted for 95 % of events, with gains and losses occurring at nearly equal frequencies. The SetB panel distinguished 27.96 % of father-son pairs and 50 % of siblings, with differentiation rates reaching 100 % for relationships separated by ≥ 11 meioses. To differentiate males from distinct lineages, we established a threshold‑setting strategy that balances both the false positive rate (FPR) and false negative rate (FNR), demonstrating that step‑difference‑based thresholds outperform locus‑difference‑based thresholds. For the SetB panel, a step‑difference threshold of ≥ 15 enabled the differentiation of 99.59 % of unrelated males. In contrast, panels with fewer Y‑STRs-Yfiler (16 Y‑STRs), Class A (19 Y‑STRs), Yfiler Plus (25 Y‑STRs), and Class A+B (32 Y‑STRs)-exhibited significantly higher FPR and FNR. In summary, this study demonstrates the enhanced resolution offered by the SetB panel. For Y-STR database searching, we recommend the following: (1) use a powerful marker set whenever possible; (2) adopt a step-difference-based matching strategy; (3) apply dynamic, panel-specific thresholds; and (4) when integrating forensic investigative genetic genealogy, include more distant relatives for threshold estimation. These recommendations could provide valuable guidance for forensic practice.</p>","PeriodicalId":94012,"journal":{"name":"Forensic science international. Genetics","volume":"83 ","pages":"103439"},"PeriodicalIF":3.1,"publicationDate":"2026-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146115134","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Age prediction of contributors to two-person DNA mixtures via deconvolution of autosomal DNA methylation profiles. 通过常染色体DNA甲基化谱的反褶积预测两人DNA混合物的年龄。
IF 3.1 Pub Date : 2026-01-29 DOI: 10.1016/j.fsigen.2026.103438
Brando Poggiali, Claus Børsting, Marie-Louise Kampmann, Morten Wiuf, Andreas Kelager, Alberte Honoré Jepsen, Athina Vidaki, Jacob Tfelt-Hansen, Jeppe Dyrberg Andersen

DNA methylation (DNAm) profiling has proved to be a reliable method for estimating the chronological age of an unknown sample donor, offering valuable leads in police investigations. However, a major limitation of current approaches using autosomal age-predictive CpGs is their inapplicability to multi-donor DNA samples (DNA mixtures), which are frequently encountered in forensic casework. Here, we present a novel approach, UnMixMe (UnMix DNA Methylation profiles), to deconvolute the DNAm profile of an unknown individual (the suspect) from a two-person DNA mixture and subsequently use this profile to predict the suspect's chronological age. Importantly, it relies on knowing 1) the DNAm profile of the other contributor (the victim) and 2) the DNA mixture ratio between the two contributors. The latter was estimated via the traditional STR profile for human identification. We tested our approach on mock trace DNA mixtures prepared from blood samples with varying suspect-to-victim ratios (10:1, 4:1, 2:1, 1:1, 1:2, 1:4, and 1:10) from two male-female pairs. We measured DNAm levels (β-values) using the Illumina EPIC v2.0 microarray and deconvoluted the DNAm profiles of the DNA mixtures to retrieve the DNAm profile of the suspect. The age of the suspect was predicted using four array-based epigenetic clocks (BLUP, EN, Horvath, and skinHorvath clock). We achieved age prediction accuracy comparable to that of the single-source suspect DNA sample, except for DNA mixtures with a low suspect-to-victim ratio (1:4 and 1:10). We identified three main factors affecting the age prediction accuracy: 1) precision of the DNAm technology, 2) accuracy of DNA mixture ratio estimation, and 3) accuracy of the prediction model used. Importantly, the age difference between DNA mixture contributors did not influence prediction accuracy. With this proof-of-concept study, we establish that autosomal DNAm profiles from two-person DNA mixtures can be successfully deconvoluted when one contributor is known and highlight the potential of this method for predicting chronological age in mixed DNA samples.

DNA甲基化(DNAm)分析已被证明是估计未知样本供体实足年龄的可靠方法,为警方调查提供了有价值的线索。然而,目前使用常染色体年龄预测CpGs的方法的一个主要限制是它们不适用于法医案件中经常遇到的多供体DNA样本(DNA混合物)。在这里,我们提出了一种新的方法,UnMixMe (UnMix DNA甲基化谱),从两人DNA混合物中解卷积未知个体(嫌疑人)的DNA谱,并随后使用该谱来预测嫌疑人的实际年龄。重要的是,它依赖于了解1)其他贡献者(受害者)的DNA档案和2)两个贡献者之间的DNA混合比例。后者是通过传统的STR配置文件估计的,用于人类识别。我们对两对男女的血液样本进行了模拟痕量DNA混合物测试,这些样本具有不同的嫌疑人与受害者比例(10:1、4:1、2:1、1:1、1:2、1:4和1:10)。我们使用Illumina EPIC v2.0微阵列测量DNAm水平(β值),并对DNA混合物的DNAm谱进行反卷积,以检索嫌疑人的DNAm谱。使用四种基于阵列的表观遗传时钟(BLUP, EN, Horvath和skinHorvath时钟)预测嫌疑人的年龄。我们实现了与单一来源可疑DNA样本相当的年龄预测精度,除了低嫌疑人与受害者比例的DNA混合物(1:4和1:10)。我们确定了影响年龄预测精度的三个主要因素:1)DNAm技术的精度,2)DNA混合比例估计的精度,以及3)所使用的预测模型的精度。重要的是,DNA混合物贡献者之间的年龄差异并不影响预测的准确性。通过这一概念验证研究,我们确定了当一个贡献者已知时,两个人DNA混合物的常染色体DNA谱可以成功地解卷积,并强调了这种方法在预测混合DNA样本的实足年龄方面的潜力。
{"title":"Age prediction of contributors to two-person DNA mixtures via deconvolution of autosomal DNA methylation profiles.","authors":"Brando Poggiali, Claus Børsting, Marie-Louise Kampmann, Morten Wiuf, Andreas Kelager, Alberte Honoré Jepsen, Athina Vidaki, Jacob Tfelt-Hansen, Jeppe Dyrberg Andersen","doi":"10.1016/j.fsigen.2026.103438","DOIUrl":"https://doi.org/10.1016/j.fsigen.2026.103438","url":null,"abstract":"<p><p>DNA methylation (DNAm) profiling has proved to be a reliable method for estimating the chronological age of an unknown sample donor, offering valuable leads in police investigations. However, a major limitation of current approaches using autosomal age-predictive CpGs is their inapplicability to multi-donor DNA samples (DNA mixtures), which are frequently encountered in forensic casework. Here, we present a novel approach, UnMixMe (UnMix DNA Methylation profiles), to deconvolute the DNAm profile of an unknown individual (the suspect) from a two-person DNA mixture and subsequently use this profile to predict the suspect's chronological age. Importantly, it relies on knowing 1) the DNAm profile of the other contributor (the victim) and 2) the DNA mixture ratio between the two contributors. The latter was estimated via the traditional STR profile for human identification. We tested our approach on mock trace DNA mixtures prepared from blood samples with varying suspect-to-victim ratios (10:1, 4:1, 2:1, 1:1, 1:2, 1:4, and 1:10) from two male-female pairs. We measured DNAm levels (β-values) using the Illumina EPIC v2.0 microarray and deconvoluted the DNAm profiles of the DNA mixtures to retrieve the DNAm profile of the suspect. The age of the suspect was predicted using four array-based epigenetic clocks (BLUP, EN, Horvath, and skinHorvath clock). We achieved age prediction accuracy comparable to that of the single-source suspect DNA sample, except for DNA mixtures with a low suspect-to-victim ratio (1:4 and 1:10). We identified three main factors affecting the age prediction accuracy: 1) precision of the DNAm technology, 2) accuracy of DNA mixture ratio estimation, and 3) accuracy of the prediction model used. Importantly, the age difference between DNA mixture contributors did not influence prediction accuracy. With this proof-of-concept study, we establish that autosomal DNAm profiles from two-person DNA mixtures can be successfully deconvoluted when one contributor is known and highlight the potential of this method for predicting chronological age in mixed DNA samples.</p>","PeriodicalId":94012,"journal":{"name":"Forensic science international. Genetics","volume":"83 ","pages":"103438"},"PeriodicalIF":3.1,"publicationDate":"2026-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146128134","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
deepNoC: A deep learning system to assign the number of contributors to a short tandem repeat DNA profile. deepNoC:一个深度学习系统,用于分配短串联重复DNA图谱的贡献者数量。
IF 3.1 Pub Date : 2026-01-27 DOI: 10.1016/j.fsigen.2026.103434
Duncan Taylor, Melissa A Humphries

A common task in forensic biology is to interpret and evaluate short tandem repeat DNA profiles. The first step in these interpretations is to assign a number of contributors to the profiles, a task that is most often performed manually by a scientist using their knowledge of DNA profile behaviour. Studies using constructed DNA profiles have shown that as DNA profiles become more complex, and the number of DNA-donating individuals increases, the ability for scientists to assign the true number decreases. There have been a number of machine learning algorithms developed that seek to assign the number of contributors to a DNA profile, however due to practical limitations in being able to generate DNA profiles in a laboratory, the algorithms have been based on summaries of the available information. In this work we develop an analysis pipeline that simulates the electrophoretic signal of an STR profile, allowing virtually unlimited, pre-labelled training material to be generated. We show that by simulating 100 000 profiles and training a number of contributors estimation tool using a deep neural network architecture (in an algorithm named deepNoC) that a high level of performance is achieved (89 % for 1-10 contributors). The trained network can then have fine-tuning training performed with only a few hundred laboratory-produced profiles in order to achieve the same accuracy within a specific laboratory. We also build into deepNoC secondary outputs that provide a level of explainability to a user of the algorithm and show how they can be displayed in an intuitive manner.

法医生物学的一个共同任务是解释和评估短串联重复DNA谱。这些解释的第一步是为这些图谱分配一些贡献者,这项任务通常是由科学家利用他们对DNA图谱行为的知识手动完成的。利用构建的DNA图谱进行的研究表明,随着DNA图谱变得越来越复杂,捐献DNA的个体数量增加,科学家确定真实数量的能力就会下降。已经开发了许多机器学习算法,试图为DNA图谱分配贡献者的数量,但是由于能够在实验室中生成DNA图谱的实际限制,这些算法一直基于可用信息的摘要。在这项工作中,我们开发了一个模拟STR谱的电泳信号的分析管道,允许几乎无限的,预先标记的训练材料生成。我们表明,通过模拟100,000个配置文件并使用深度神经网络架构(在名为deepNoC的算法中)训练许多贡献者估计工具,可以实现高水平的性能(1-10个贡献者的89% %)。然后,经过训练的网络可以仅使用几百个实验室生成的配置文件进行微调训练,以便在特定实验室中达到相同的精度。我们还在deepNoC中构建了二级输出,为算法用户提供一定程度的可解释性,并展示了如何以直观的方式显示它们。
{"title":"deepNoC: A deep learning system to assign the number of contributors to a short tandem repeat DNA profile.","authors":"Duncan Taylor, Melissa A Humphries","doi":"10.1016/j.fsigen.2026.103434","DOIUrl":"https://doi.org/10.1016/j.fsigen.2026.103434","url":null,"abstract":"<p><p>A common task in forensic biology is to interpret and evaluate short tandem repeat DNA profiles. The first step in these interpretations is to assign a number of contributors to the profiles, a task that is most often performed manually by a scientist using their knowledge of DNA profile behaviour. Studies using constructed DNA profiles have shown that as DNA profiles become more complex, and the number of DNA-donating individuals increases, the ability for scientists to assign the true number decreases. There have been a number of machine learning algorithms developed that seek to assign the number of contributors to a DNA profile, however due to practical limitations in being able to generate DNA profiles in a laboratory, the algorithms have been based on summaries of the available information. In this work we develop an analysis pipeline that simulates the electrophoretic signal of an STR profile, allowing virtually unlimited, pre-labelled training material to be generated. We show that by simulating 100 000 profiles and training a number of contributors estimation tool using a deep neural network architecture (in an algorithm named deepNoC) that a high level of performance is achieved (89 % for 1-10 contributors). The trained network can then have fine-tuning training performed with only a few hundred laboratory-produced profiles in order to achieve the same accuracy within a specific laboratory. We also build into deepNoC secondary outputs that provide a level of explainability to a user of the algorithm and show how they can be displayed in an intuitive manner.</p>","PeriodicalId":94012,"journal":{"name":"Forensic science international. Genetics","volume":"83 ","pages":"103434"},"PeriodicalIF":3.1,"publicationDate":"2026-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146133708","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Metagenomic profiling reveals lung multi-kingdom microbes as forensic markers for aquatic corpses investigation. 宏基因组分析揭示了肺多界微生物作为水生尸体调查的法医标记。
IF 3.1 Pub Date : 2026-01-27 DOI: 10.1016/j.fsigen.2026.103435
Fu-Yuan Zhang, Du Shu-Kui, Lin-Lin Wang, Yi-Tao Ma, Ming-Zhe Wu, Hao-Miao Yuan, Jin-Nong Yang, Yan Zhang, Guo-An Zhang, Jian Zhao, Chao Liu, Da-Wei Guan, Rui Zhao

The forensic investigation of corpses recovered from aquatic environments presents a major practical challenge. Recent studies have demonstrated that the bacterial community in the lung serves as a valuable indicator for diagnosing drowning, determining the drowning medium and estimating postmortem submersion interval (PMSI). However, the application and significance of lung multi-kingdom microbiome (archaea, eukaryota, and viruses) remains inadequately characterized. Meanwhile, the insufficient sequencing depth of commonly employed techniques, such as amplicon sequencing, restricts our understanding of microbial communities. In this study, we characterized the postmortem lung microbiome of mice submerged in water for up to 10 days using metagenomic sequencing, and subsequently validated the potential microbial biomarkers in both murine and human forensic specimens via qPCR. Integrated analyses were conducted followed by the confirmation of significant lung bacterial communities for drowning diagnosis, inference of drowning site, and estimation of the PMSI. Our findings revealed that bacteria constituted the predominant component of the lung microbiome in submerged murine carcasses, with eukaryota serving as the secondary dominant taxa. Seventeen bacterial and nine eukaryotic features at the species level were identified as potential biomarkers for drowning diagnosis. By detecting the specific molecular markers for Aeromonas species in both murine and human samples, the positive detection of Aeromonas species, particularly Aeromonas hydrophila, provides solid evidence for drowning diagnosis. Additionally, 14 and 17 bacterial species were identified as biomarkers for the inference of drowning site and estimation of PMSI, respectively. Based on the identified potential biomarkers, robust forensic models were constructed using the random forest (RF) algorithm. The accuracy of the bacterial model for drowning diagnosis was 89.29 %, while the accuracy of the eukaryotic model was 87.5 %. For the inference of the drowning site, the bacterial model achieved an accuracy of 100 %. Furthermore, the estimation of the PMSI yielded a mean absolute error of 0.66 ± 0.097 days. Collectively, our findings revealed that the selected 17 bacterial and 9 eukaryotic features in the lungs, particularly Aeromonas hydrophila, are beneficial for drowning diagnosis. Additionally, the other selected bacterial species contribute to the estimation of the drowning site and PMSI, thereby providing more comprehensive and refined information for accurate forensic investigations of corpses recovered from aquatic environments.

从水生环境中恢复的尸体的法医调查提出了一个重大的实际挑战。最近的研究表明,肺中的细菌群落是诊断溺水、确定溺水介质和估计死后溺水间隔(PMSI)的有价值的指标。然而,肺部多界微生物组(古生菌、真核生物和病毒)的应用和意义仍然没有得到充分的描述。同时,扩增子测序等常用测序技术的测序深度不足,限制了我们对微生物群落的认识。在这项研究中,我们使用宏基因组测序方法对浸泡在水中长达10天的小鼠的死后肺微生物组进行了表征,并随后通过qPCR验证了小鼠和人类法医标本中潜在的微生物生物标志物。随后进行综合分析,确认溺水诊断的重要肺部细菌群落,推断溺水地点,并估计PMSI。我们的研究结果表明,细菌构成了水下小鼠尸体肺部微生物群的主要组成部分,真核生物是次要优势分类群。在物种水平上鉴定出17种细菌和9种真核生物特征作为溺水诊断的潜在生物标志物。通过对小鼠和人标本中气单胞菌种类特异性分子标记的检测,气单胞菌种类特别是嗜水气单胞菌的阳性检测为溺水诊断提供了坚实的依据。此外,鉴定出14种和17种细菌分别作为推断溺水地点和估计PMSI的生物标志物。基于识别出的潜在生物标志物,采用随机森林(RF)算法构建鲁棒法医模型。细菌模型诊断溺水的准确率为89.29 %,真核模型诊断溺水的准确率为87.5 %。对于溺水地点的推断,细菌模型的准确率达到了100% %。此外,PMSI的估计产生了0.66 ± 0.097天的平均绝对误差。总的来说,我们的研究结果表明,肺中选定的17种细菌和9种真核生物特征,特别是嗜水气单胞菌,有利于溺水的诊断。此外,其他选定的细菌种类有助于估计溺水地点和PMSI,从而为从水生环境中恢复的尸体的准确法医调查提供更全面和精确的信息。
{"title":"Metagenomic profiling reveals lung multi-kingdom microbes as forensic markers for aquatic corpses investigation.","authors":"Fu-Yuan Zhang, Du Shu-Kui, Lin-Lin Wang, Yi-Tao Ma, Ming-Zhe Wu, Hao-Miao Yuan, Jin-Nong Yang, Yan Zhang, Guo-An Zhang, Jian Zhao, Chao Liu, Da-Wei Guan, Rui Zhao","doi":"10.1016/j.fsigen.2026.103435","DOIUrl":"https://doi.org/10.1016/j.fsigen.2026.103435","url":null,"abstract":"<p><p>The forensic investigation of corpses recovered from aquatic environments presents a major practical challenge. Recent studies have demonstrated that the bacterial community in the lung serves as a valuable indicator for diagnosing drowning, determining the drowning medium and estimating postmortem submersion interval (PMSI). However, the application and significance of lung multi-kingdom microbiome (archaea, eukaryota, and viruses) remains inadequately characterized. Meanwhile, the insufficient sequencing depth of commonly employed techniques, such as amplicon sequencing, restricts our understanding of microbial communities. In this study, we characterized the postmortem lung microbiome of mice submerged in water for up to 10 days using metagenomic sequencing, and subsequently validated the potential microbial biomarkers in both murine and human forensic specimens via qPCR. Integrated analyses were conducted followed by the confirmation of significant lung bacterial communities for drowning diagnosis, inference of drowning site, and estimation of the PMSI. Our findings revealed that bacteria constituted the predominant component of the lung microbiome in submerged murine carcasses, with eukaryota serving as the secondary dominant taxa. Seventeen bacterial and nine eukaryotic features at the species level were identified as potential biomarkers for drowning diagnosis. By detecting the specific molecular markers for Aeromonas species in both murine and human samples, the positive detection of Aeromonas species, particularly Aeromonas hydrophila, provides solid evidence for drowning diagnosis. Additionally, 14 and 17 bacterial species were identified as biomarkers for the inference of drowning site and estimation of PMSI, respectively. Based on the identified potential biomarkers, robust forensic models were constructed using the random forest (RF) algorithm. The accuracy of the bacterial model for drowning diagnosis was 89.29 %, while the accuracy of the eukaryotic model was 87.5 %. For the inference of the drowning site, the bacterial model achieved an accuracy of 100 %. Furthermore, the estimation of the PMSI yielded a mean absolute error of 0.66 ± 0.097 days. Collectively, our findings revealed that the selected 17 bacterial and 9 eukaryotic features in the lungs, particularly Aeromonas hydrophila, are beneficial for drowning diagnosis. Additionally, the other selected bacterial species contribute to the estimation of the drowning site and PMSI, thereby providing more comprehensive and refined information for accurate forensic investigations of corpses recovered from aquatic environments.</p>","PeriodicalId":94012,"journal":{"name":"Forensic science international. Genetics","volume":"83 ","pages":"103435"},"PeriodicalIF":3.1,"publicationDate":"2026-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146115255","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
From unsupervised selection of AIMs to likelihood-based BGA inference: Challenges revealed by OpenADMIXTURE and GENOGEOGRAPHER. 从目标的无监督选择到基于似然的BGA推断:openadmix和genogeography揭示的挑战。
IF 3.1 Pub Date : 2026-01-26 DOI: 10.1016/j.fsigen.2026.103433
Peter Resutik, Mario Gysi, Thomas Lemmin, Adelgunde Kratzer, Cordula Haas, Natasha Arora

Biogeographical ancestry (BGA) inference is an important tool in forensic genetics. However, typical approaches often rely on predefined population labels and limited marker sets, which constrain both resolution and flexibility. In this study, we evaluate the potential of unsupervised feature selection for BGA inference using Sparse K-means with Feature Ranking (SKFR), as implemented in OpenADMIXTURE. We leveraged the largest dataset used in a forensic context to date, comprising approximately 6500 individuals genotyped at ∼600,000 SNPs on the Human Origins (HO) array. Based on this dataset, we evaluated SKFR-selected ancestry-informative marker (AIM) panels ranging from 1500 to 2200 SNPs. Clustering performance was assessed using OpenADMIXTURE and quantified with G' similarity. Among the tested panels, a 1,900-SNP panel showed the most consistent clustering results and was selected for further evaluation. To examine forensic relevance, we compared this panel to a randomly selected SNP panel of the same size. Both panels produced broadly similar clustering patterns with OpenADMIXTURE, likely reflecting the marker composition of the HO array. The performance of the 1900 SKFR-selected SNPs was then evaluated using GENOGEOGRAPHER, a likelihood-based tool for BGA inference. Assignment analyses within the held-out test set provided a detailed overview of concordant and discordant assignments under the chosen reference metapopulations. While differences between the SKFR and random panels were modest, the SKFR panel showed consistently stronger and more stable assignment performance, demonstrating that unsupervised marker selection can add value even under the constraints of SNP arrays enriched for ancestry-informative variants. Overall, our study offers a systematic critical evaluation of unsupervised AIM selection and its limitations in practical settings. We show that panel size, array ascertainment, and reference dataset composition jointly shape ancestry-inference performance, and we encourage inference approaches that are not tied to fixed marker panels but instead make use of as many informative SNPs as feasible.

生物地理祖先(BGA)推断是法医遗传学的重要工具。然而,典型的方法通常依赖于预定义的种群标签和有限的标记集,这限制了分辨率和灵活性。在本研究中,我们评估了使用openadmix中实现的带有特征排序(SKFR)的稀疏K-means进行BGA推理的无监督特征选择的潜力。我们利用迄今为止在法医环境中使用的最大数据集,包括大约6500个个体,在人类起源(HO)阵列上以~ 600,000个snp进行基因分型。基于该数据集,我们评估了skfr选择的祖先信息标记(AIM)面板,范围从1500到2200个snp。使用openadmix评估聚类性能,并用G′相似度进行量化。在测试的面板中,1,900-SNP面板显示出最一致的聚类结果,并被选中进行进一步评估。为了检验法医相关性,我们将该面板与随机选择的相同大小的SNP面板进行了比较。两个面板都产生了与openadmix大致相似的聚类模式,可能反映了HO阵列的标记组成。然后使用基于似然的BGA推断工具genogeography对1900个skfr选择的snp的性能进行评估。保留测试集中的分配分析提供了在所选参考元人口下的一致和不一致分配的详细概述。虽然SKFR和随机面板之间的差异不大,但SKFR面板始终表现出更强、更稳定的分配性能,这表明即使在丰富了祖先信息变体的SNP阵列的约束下,无监督标记选择也可以增加价值。总的来说,我们的研究对无监督AIM选择及其在实际环境中的局限性进行了系统的批判性评估。我们表明,面板大小、阵列确定和参考数据集组成共同影响了祖先推断性能,并且我们鼓励不与固定标记面板绑定的推断方法,而是尽可能多地利用信息snp。
{"title":"From unsupervised selection of AIMs to likelihood-based BGA inference: Challenges revealed by OpenADMIXTURE and GENOGEOGRAPHER.","authors":"Peter Resutik, Mario Gysi, Thomas Lemmin, Adelgunde Kratzer, Cordula Haas, Natasha Arora","doi":"10.1016/j.fsigen.2026.103433","DOIUrl":"https://doi.org/10.1016/j.fsigen.2026.103433","url":null,"abstract":"<p><p>Biogeographical ancestry (BGA) inference is an important tool in forensic genetics. However, typical approaches often rely on predefined population labels and limited marker sets, which constrain both resolution and flexibility. In this study, we evaluate the potential of unsupervised feature selection for BGA inference using Sparse K-means with Feature Ranking (SKFR), as implemented in OpenADMIXTURE. We leveraged the largest dataset used in a forensic context to date, comprising approximately 6500 individuals genotyped at ∼600,000 SNPs on the Human Origins (HO) array. Based on this dataset, we evaluated SKFR-selected ancestry-informative marker (AIM) panels ranging from 1500 to 2200 SNPs. Clustering performance was assessed using OpenADMIXTURE and quantified with G' similarity. Among the tested panels, a 1,900-SNP panel showed the most consistent clustering results and was selected for further evaluation. To examine forensic relevance, we compared this panel to a randomly selected SNP panel of the same size. Both panels produced broadly similar clustering patterns with OpenADMIXTURE, likely reflecting the marker composition of the HO array. The performance of the 1900 SKFR-selected SNPs was then evaluated using GENOGEOGRAPHER, a likelihood-based tool for BGA inference. Assignment analyses within the held-out test set provided a detailed overview of concordant and discordant assignments under the chosen reference metapopulations. While differences between the SKFR and random panels were modest, the SKFR panel showed consistently stronger and more stable assignment performance, demonstrating that unsupervised marker selection can add value even under the constraints of SNP arrays enriched for ancestry-informative variants. Overall, our study offers a systematic critical evaluation of unsupervised AIM selection and its limitations in practical settings. We show that panel size, array ascertainment, and reference dataset composition jointly shape ancestry-inference performance, and we encourage inference approaches that are not tied to fixed marker panels but instead make use of as many informative SNPs as feasible.</p>","PeriodicalId":94012,"journal":{"name":"Forensic science international. Genetics","volume":"83 ","pages":"103433"},"PeriodicalIF":3.1,"publicationDate":"2026-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146138253","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
TrACE - Trace analysis collaborative exercise: A transparent, expert driven concept of proficiency tests. 跟踪-跟踪分析协作练习:一个透明的、专家驱动的能力测试概念。
IF 3.1 Pub Date : 2026-01-01 Epub Date: 2025-07-25 DOI: 10.1016/j.fsigen.2025.103333
Marielle Vennemann, Hannah Bauer, Katja Anslinger, Martin Eckert, Waldemar Spitz, Stefanie Grethe, Walther Parson, Petra Preikschat-Sachse, The TrACE Team

TrACE (Trace Analysis Collaborative Exercise) represents a novel, strictly expert driven and transparent concept of external quality control based on a combination of proficiency testing and interlaboratory comparisons. TrACE is an official proficiency test scheme of the German Stain Commission and acts in accordance with the recommendations for proficiency testing issued by this commission and outlined in DIN EN ISO 17025. TrACE offers modules on all aspects of forensic genetics that address challenges encountered in real casework. Basic modules represent proficiency tests that cover the physical examination of items, the identification of body fluids, DNA extraction, the analysis of autosomal and Y-chromosomal STRs and mitochondrial DNA (mtDNA), and the interpretation and verbalisation of results, including complex mixtures. Advanced and extended modules provide interlaboratory tests with more challenging items and novel methodology such as probabilistic genotyping and forensic DNA phenotyping (FDP). Each module is coordinated by an internationally recognised expert in the respective field. The members of the TrACE team are based in case work and/or academic forensic laboratories.

TrACE (TrACE Analysis Collaborative Exercise)代表了一种新颖的、严格由专家驱动的、透明的外部质量控制概念,它基于熟练程度测试和实验室间比较的结合。TrACE是德国污渍委员会的官方能力测试计划,并根据该委员会发布的能力测试建议和DIN EN ISO 17025概述。TrACE提供法医遗传学的各个方面的模块,以解决在实际案件工作中遇到的挑战。基本模块是熟练程度测试,包括对物品进行体检、体液鉴定、提取DNA、分析常染色体和y染色体str和线粒体DNA (mtDNA),以及对结果(包括复杂混合物)进行解释和口头说明。先进和扩展模块提供实验室间测试更具挑战性的项目和新颖的方法,如概率基因分型和法医DNA表型(FDP)。每个模块由各自领域的国际公认专家协调。TrACE小组的成员以案件工作和/或学术法医实验室为基础。
{"title":"TrACE - Trace analysis collaborative exercise: A transparent, expert driven concept of proficiency tests.","authors":"Marielle Vennemann, Hannah Bauer, Katja Anslinger, Martin Eckert, Waldemar Spitz, Stefanie Grethe, Walther Parson, Petra Preikschat-Sachse, The TrACE Team","doi":"10.1016/j.fsigen.2025.103333","DOIUrl":"10.1016/j.fsigen.2025.103333","url":null,"abstract":"<p><p>TrACE (Trace Analysis Collaborative Exercise) represents a novel, strictly expert driven and transparent concept of external quality control based on a combination of proficiency testing and interlaboratory comparisons. TrACE is an official proficiency test scheme of the German Stain Commission and acts in accordance with the recommendations for proficiency testing issued by this commission and outlined in DIN EN ISO 17025. TrACE offers modules on all aspects of forensic genetics that address challenges encountered in real casework. Basic modules represent proficiency tests that cover the physical examination of items, the identification of body fluids, DNA extraction, the analysis of autosomal and Y-chromosomal STRs and mitochondrial DNA (mtDNA), and the interpretation and verbalisation of results, including complex mixtures. Advanced and extended modules provide interlaboratory tests with more challenging items and novel methodology such as probabilistic genotyping and forensic DNA phenotyping (FDP). Each module is coordinated by an internationally recognised expert in the respective field. The members of the TrACE team are based in case work and/or academic forensic laboratories.</p>","PeriodicalId":94012,"journal":{"name":"Forensic science international. Genetics","volume":"80 ","pages":"103333"},"PeriodicalIF":3.1,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144812809","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Revisiting guidance on population sampling for highly polymorphic STR loci. 重新审视高多态性STR基因座的群体抽样指南。
IF 3.1 Pub Date : 2026-01-01 Epub Date: 2025-08-05 DOI: 10.1016/j.fsigen.2025.103336
Sanne E Aalbers, Katherine B Gettings

Population databases allow us to attach probabilities to DNA evidence by the estimation of genotype frequencies, which rely on accurate allele frequency estimates. As short tandem repeat (STR) marker sets for human identification have expanded to include more discriminating markers, and especially now that sequencing techniques allow us to distinguish between alleles based on variation in underlying base-pair structure, it is important to reevaluate existing guidance on population database sizes for the estimation of allele frequencies. In this paper, we revisit the topic of population sampling by focusing on the representation of alleles, i.e. whether alleles are observed or not, in a sample of individuals containing data for highly polymorphic autosomal STR loci. The effect of both length- and sequence-based STR data on population sample size implications are demonstrated, and differences between lesser and more polymorphic markers are discussed. The consequences of using a limited number of individuals are explored and the impact of increasing population sample sizes by combining different data sets is shown to help determine the point at which further sampling may no longer provide significant value. Finally, different approaches for accommodating previously unobserved alleles and their impact on DNA evidence evaluations are discussed.

人口数据库允许我们通过基因型频率的估计来附加DNA证据的概率,这依赖于精确的等位基因频率估计。由于用于人类鉴定的短串联重复(STR)标记集已经扩展到包括更多的鉴别标记,特别是现在测序技术使我们能够根据潜在碱基对结构的变化来区分等位基因,因此重新评估现有的用于估计等位基因频率的种群数据库大小指导是很重要的。在本文中,我们通过关注等位基因的代表性,即等位基因是否被观察到,在包含高度多态常染色体STR位点数据的个体样本中,重新审视群体抽样的主题。基于长度和序列的STR数据对种群样本量的影响得到了证明,并讨论了较少多态性和较多多态性标记之间的差异。探讨了使用有限数量的个体的后果,并表明了通过组合不同的数据集来增加人口样本量的影响,以帮助确定进一步抽样可能不再提供重要价值的点。最后,讨论了适应以前未观察到的等位基因的不同方法及其对DNA证据评估的影响。
{"title":"Revisiting guidance on population sampling for highly polymorphic STR loci.","authors":"Sanne E Aalbers, Katherine B Gettings","doi":"10.1016/j.fsigen.2025.103336","DOIUrl":"10.1016/j.fsigen.2025.103336","url":null,"abstract":"<p><p>Population databases allow us to attach probabilities to DNA evidence by the estimation of genotype frequencies, which rely on accurate allele frequency estimates. As short tandem repeat (STR) marker sets for human identification have expanded to include more discriminating markers, and especially now that sequencing techniques allow us to distinguish between alleles based on variation in underlying base-pair structure, it is important to reevaluate existing guidance on population database sizes for the estimation of allele frequencies. In this paper, we revisit the topic of population sampling by focusing on the representation of alleles, i.e. whether alleles are observed or not, in a sample of individuals containing data for highly polymorphic autosomal STR loci. The effect of both length- and sequence-based STR data on population sample size implications are demonstrated, and differences between lesser and more polymorphic markers are discussed. The consequences of using a limited number of individuals are explored and the impact of increasing population sample sizes by combining different data sets is shown to help determine the point at which further sampling may no longer provide significant value. Finally, different approaches for accommodating previously unobserved alleles and their impact on DNA evidence evaluations are discussed.</p>","PeriodicalId":94012,"journal":{"name":"Forensic science international. Genetics","volume":"80 ","pages":"103336"},"PeriodicalIF":3.1,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12382342/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144805494","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Expression of Concern "Population data of 17 Y-STR loci in Nanyang Han population from Henan Province, Central China" [Forensic Sci. Int. Gene. 13 (2014) 145-146]. 《河南省南阳汉族人群17个Y-STR基因座的种群数据》[法医学];Int。基因,13(2014)145-146。
Pub Date : 2025-02-01 Epub Date: 2024-08-05 DOI: 10.1016/j.fsigen.2024.103119
{"title":"Expression of Concern \"Population data of 17 Y-STR loci in Nanyang Han population from Henan Province, Central China\" [Forensic Sci. Int. Gene. 13 (2014) 145-146].","authors":"","doi":"10.1016/j.fsigen.2024.103119","DOIUrl":"https://doi.org/10.1016/j.fsigen.2024.103119","url":null,"abstract":"","PeriodicalId":94012,"journal":{"name":"Forensic science international. Genetics","volume":"75 ","pages":"103119"},"PeriodicalIF":0.0,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142804076","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Forensic science international. Genetics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1