通过逐步条件全表观基因组关联研究精确推断年龄的10个CpGs甲基化面板。

IF 2.2 3区 医学 Q1 MEDICINE, LEGAL International Journal of Legal Medicine Pub Date : 2024-12-05 DOI:10.1007/s00414-024-03365-2
Yu Qian, Qianqian Peng, Qili Qian, Xingjian Gao, Xinxuan Liu, Yi Li, Xiu Fan, Yuan Cheng, Na Yuan, Sibte Hadi, Li Jin, Sijia Wang, Fan Liu
{"title":"通过逐步条件全表观基因组关联研究精确推断年龄的10个CpGs甲基化面板。","authors":"Yu Qian, Qianqian Peng, Qili Qian, Xingjian Gao, Xinxuan Liu, Yi Li, Xiu Fan, Yuan Cheng, Na Yuan, Sibte Hadi, Li Jin, Sijia Wang, Fan Liu","doi":"10.1007/s00414-024-03365-2","DOIUrl":null,"url":null,"abstract":"<p><p>Estimating individual age from DNA methylation at age associated CpG sites may provide key information facilitating forensic investigations. Systematic marker screening and feature selection play a critical role in ensuring the performance of the final prediction model. In the discovery stage, we screened for 811876 CpGs from whole blood of 2664 Chinese individuals ranging from 18 to 83 years of age based on a stepwise conditional epigenome-wide association study (SCEWAS). The SCEWAS identified 28 CpGs showing genome-wide significant and independent effects. Further restricting this panel to 10 most informative CpGs showed a tolerable loss of information. A linear model consisting of these 10 CpGs could explain 93% of the age variance (R<sup>2</sup> = 0.93) in the training set (n = 2664). In an independent test set of Chinese individuals (n = 648), this model also provided highly accurate predictions (R<sup>2</sup> = 0.85, mean absolute deviation, MAD = 3.20 years). The model was additionally validated in a public dataset of multiple ancestral origins (86 Europeans, 14 Asians, and 273 Africans) and the prediction accuracy reduced significantly (R<sup>2</sup> = 0.85, MAD = 6.21 years), as might be expected due to different genomic backgrounds, sample sizes, and age ranges. Our 10 CpG model also outperformed the recently proposed 9-CpG model constructed in 390 Chinese males (R<sup>2</sup> = 0.79 in test set). We also demonstrated that our SCEWAS approach outperformed the traditional EWAS and the elastic net approach in obtaining a small set of most age informative CpGs. Overall, our systematic genome-wide feature selection identified a small panel of 10 CpGs for accurate age estimation with high potential in forensic applications.</p>","PeriodicalId":14071,"journal":{"name":"International Journal of Legal Medicine","volume":" ","pages":""},"PeriodicalIF":2.2000,"publicationDate":"2024-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A methylation panel of 10 CpGs for accurate age inference via stepwise conditional epigenome-wide association study.\",\"authors\":\"Yu Qian, Qianqian Peng, Qili Qian, Xingjian Gao, Xinxuan Liu, Yi Li, Xiu Fan, Yuan Cheng, Na Yuan, Sibte Hadi, Li Jin, Sijia Wang, Fan Liu\",\"doi\":\"10.1007/s00414-024-03365-2\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Estimating individual age from DNA methylation at age associated CpG sites may provide key information facilitating forensic investigations. Systematic marker screening and feature selection play a critical role in ensuring the performance of the final prediction model. In the discovery stage, we screened for 811876 CpGs from whole blood of 2664 Chinese individuals ranging from 18 to 83 years of age based on a stepwise conditional epigenome-wide association study (SCEWAS). The SCEWAS identified 28 CpGs showing genome-wide significant and independent effects. Further restricting this panel to 10 most informative CpGs showed a tolerable loss of information. A linear model consisting of these 10 CpGs could explain 93% of the age variance (R<sup>2</sup> = 0.93) in the training set (n = 2664). In an independent test set of Chinese individuals (n = 648), this model also provided highly accurate predictions (R<sup>2</sup> = 0.85, mean absolute deviation, MAD = 3.20 years). The model was additionally validated in a public dataset of multiple ancestral origins (86 Europeans, 14 Asians, and 273 Africans) and the prediction accuracy reduced significantly (R<sup>2</sup> = 0.85, MAD = 6.21 years), as might be expected due to different genomic backgrounds, sample sizes, and age ranges. Our 10 CpG model also outperformed the recently proposed 9-CpG model constructed in 390 Chinese males (R<sup>2</sup> = 0.79 in test set). We also demonstrated that our SCEWAS approach outperformed the traditional EWAS and the elastic net approach in obtaining a small set of most age informative CpGs. Overall, our systematic genome-wide feature selection identified a small panel of 10 CpGs for accurate age estimation with high potential in forensic applications.</p>\",\"PeriodicalId\":14071,\"journal\":{\"name\":\"International Journal of Legal Medicine\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2024-12-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Legal Medicine\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1007/s00414-024-03365-2\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MEDICINE, LEGAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Legal Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s00414-024-03365-2","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICINE, LEGAL","Score":null,"Total":0}
引用次数: 0

摘要

从年龄相关CpG位点的DNA甲基化估计个体年龄可能为法医调查提供关键信息。系统的标记筛选和特征选择在确保最终预测模型的性能方面起着至关重要的作用。在发现阶段,我们基于逐步条件全表观基因组关联研究(SCEWAS),从2664名年龄在18至83岁之间的中国人全血中筛选了811876个CpGs。SCEWAS鉴定出28个CpGs,显示出全基因组显著且独立的效应。进一步将该面板限制为10个最具信息量的cpg显示了可容忍的信息丢失。由这10个cpg组成的线性模型可以解释训练集(n = 2664)中93%的年龄方差(R2 = 0.93)。在中国个体(n = 648)的独立检验集中,该模型也提供了非常准确的预测(R2 = 0.85,平均绝对偏差,MAD = 3.20年)。该模型在多个祖先起源的公共数据集中(86名欧洲人,14名亚洲人和273名非洲人)进行了进一步验证,预测精度显着降低(R2 = 0.85, MAD = 6.21年),这可能是由于不同的基因组背景,样本量和年龄范围所导致的。我们的10 CpG模型也优于最近提出的390名中国男性构建的9-CpG模型(R2 = 0.79)。我们还证明了我们的SCEWAS方法在获得一小组最具年龄信息的cpg方面优于传统的EWAS和弹性网方法。总的来说,我们系统的全基因组特征选择确定了10个CpGs的小面板,用于准确的年龄估计,在法医应用中具有很高的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
A methylation panel of 10 CpGs for accurate age inference via stepwise conditional epigenome-wide association study.

Estimating individual age from DNA methylation at age associated CpG sites may provide key information facilitating forensic investigations. Systematic marker screening and feature selection play a critical role in ensuring the performance of the final prediction model. In the discovery stage, we screened for 811876 CpGs from whole blood of 2664 Chinese individuals ranging from 18 to 83 years of age based on a stepwise conditional epigenome-wide association study (SCEWAS). The SCEWAS identified 28 CpGs showing genome-wide significant and independent effects. Further restricting this panel to 10 most informative CpGs showed a tolerable loss of information. A linear model consisting of these 10 CpGs could explain 93% of the age variance (R2 = 0.93) in the training set (n = 2664). In an independent test set of Chinese individuals (n = 648), this model also provided highly accurate predictions (R2 = 0.85, mean absolute deviation, MAD = 3.20 years). The model was additionally validated in a public dataset of multiple ancestral origins (86 Europeans, 14 Asians, and 273 Africans) and the prediction accuracy reduced significantly (R2 = 0.85, MAD = 6.21 years), as might be expected due to different genomic backgrounds, sample sizes, and age ranges. Our 10 CpG model also outperformed the recently proposed 9-CpG model constructed in 390 Chinese males (R2 = 0.79 in test set). We also demonstrated that our SCEWAS approach outperformed the traditional EWAS and the elastic net approach in obtaining a small set of most age informative CpGs. Overall, our systematic genome-wide feature selection identified a small panel of 10 CpGs for accurate age estimation with high potential in forensic applications.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
5.80
自引率
9.50%
发文量
165
审稿时长
1 months
期刊介绍: The International Journal of Legal Medicine aims to improve the scientific resources used in the elucidation of crime and related forensic applications at a high level of evidential proof. The journal offers review articles tracing development in specific areas, with up-to-date analysis; original articles discussing significant recent research results; case reports describing interesting and exceptional examples; population data; letters to the editors; and technical notes, which appear in a section originally created for rapid publication of data in the dynamic field of DNA analysis.
期刊最新文献
Reply to the commentary of Burkhard Madea and Elke Doberentz on our article "An assessment of the Henssge method for forensic death time estimation in the early post-mortem interval". A critical review of medicolegal research and information asymmetries in investigating cases of extrajudicial executions and forced disappearances. Socio-demographic and toxicological findings from autoptic cases in a Northern Italy community (2017-2022). Conventional and machine learning-based analysis of age, body weight and body height significance in knot position-related thyrohyoid and cervical spine fractures in suicidal hangings. Integration of a high-resolution melt curve assay into a commercial quantification kit for preliminary identification of biological mixtures.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1