GraphAge: Unleashing the power of Graph Neural Network to Decode Epigenetic Aging

Saleh Sakib Ahmed, Nahian Shabab, Md. Abul Hassan Samee, M. Sohel Rahman
{"title":"GraphAge: Unleashing the power of Graph Neural Network to Decode Epigenetic Aging","authors":"Saleh Sakib Ahmed, Nahian Shabab, Md. Abul Hassan Samee, M. Sohel Rahman","doi":"arxiv-2408.00984","DOIUrl":null,"url":null,"abstract":"DNA methylation is a crucial epigenetic marker used in various clocks to\npredict epigenetic age. However, many existing clocks fail to account for\ncrucial information about CpG sites and their interrelationships, such as\nco-methylation patterns. We present a novel approach to represent methylation\ndata as a graph, using methylation values and relevant information about CpG\nsites as nodes, and relationships like co-methylation, same gene, and same\nchromosome as edges. We then use a Graph Neural Network (GNN) to predict age.\nThus our model, GraphAge, leverages both structural and positional information\nfor prediction as well as better interpretation. Although we had to train in a\nconstrained compute setting, GraphAge still showed competitive performance with\na Mean Absolute Error (MAE) of 3.207 and a Mean Squared Error (MSE) of 25.277,\nslightly outperforming the current state of the art. Perhaps more importantly,\nwe utilized GNN explainer for interpretation purposes and were able to unearth\ninteresting insights (e.g., key CpG sites, pathways, and their relationships\nthrough Methylation Regulated Networks in the context of aging), which were not\npossible to 'decode' without leveraging the unique capability of GraphAge to\n'encode' various structural relationships. GraphAge has the potential to\nconsume and utilize all relevant information (if available) about an individual\nthat relates to the complex process of aging. So, in that sense, it is one of\nits kind and can be seen as the first benchmark for a multimodal model that can\nincorporate all this information in order to close the gap in our understanding\nof the true nature of aging.","PeriodicalId":501266,"journal":{"name":"arXiv - QuanBio - Quantitative Methods","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuanBio - Quantitative Methods","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.00984","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

DNA methylation is a crucial epigenetic marker used in various clocks to predict epigenetic age. However, many existing clocks fail to account for crucial information about CpG sites and their interrelationships, such as co-methylation patterns. We present a novel approach to represent methylation data as a graph, using methylation values and relevant information about CpG sites as nodes, and relationships like co-methylation, same gene, and same chromosome as edges. We then use a Graph Neural Network (GNN) to predict age. Thus our model, GraphAge, leverages both structural and positional information for prediction as well as better interpretation. Although we had to train in a constrained compute setting, GraphAge still showed competitive performance with a Mean Absolute Error (MAE) of 3.207 and a Mean Squared Error (MSE) of 25.277, slightly outperforming the current state of the art. Perhaps more importantly, we utilized GNN explainer for interpretation purposes and were able to unearth interesting insights (e.g., key CpG sites, pathways, and their relationships through Methylation Regulated Networks in the context of aging), which were not possible to 'decode' without leveraging the unique capability of GraphAge to 'encode' various structural relationships. GraphAge has the potential to consume and utilize all relevant information (if available) about an individual that relates to the complex process of aging. So, in that sense, it is one of its kind and can be seen as the first benchmark for a multimodal model that can incorporate all this information in order to close the gap in our understanding of the true nature of aging.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
GraphAge:释放图神经网络的力量,解码表观遗传衰老
DNA 甲基化是一种重要的表观遗传标记,被各种时钟用来预测表观遗传年龄。然而,现有的许多时钟都没有考虑到 CpG 位点及其相互关系的重要信息,如共同甲基化模式。我们提出了一种将甲基化数据表示为图的新方法,将甲基化值和 CpG 位点的相关信息作为节点,将共甲基化、同一基因和同一染色体等关系作为边。因此,我们的模型 GraphAge 既能利用结构信息和位置信息进行预测,又能更好地进行解释。虽然我们必须在有限制的计算环境中进行训练,但 GraphAge 仍然表现出了极具竞争力的性能,其平均绝对误差(MAE)为 3.207,平均平方误差(MSE)为 25.277,略高于目前的技术水平。也许更重要的是,我们利用 GNN 解释器进行了解释,并挖掘出了有趣的见解(例如,关键 CpG 位点、通路及其在衰老背景下通过甲基化调控网络的关系),如果不利用 GraphAge 的独特能力来 "编码 "各种结构关系,就无法 "解码 "这些见解。GraphAge 有可能收集和利用与复杂的衰老过程有关的所有相关信息(如果有的话)。因此,从这个意义上说,它是独一无二的,可以被视为多模态模型的第一个基准,该模型可以整合所有这些信息,以缩小我们对衰老真实本质的理解差距。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
How to Build the Virtual Cell with Artificial Intelligence: Priorities and Opportunities Automating proton PBS treatment planning for head and neck cancers using policy gradient-based deep reinforcement learning A computational framework for optimal and Model Predictive Control of stochastic gene regulatory networks Active learning for energy-based antibody optimization and enhanced screening Comorbid anxiety symptoms predict lower odds of improvement in depression symptoms during smartphone-delivered psychotherapy
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1