Saleh Sakib Ahmed, Nahian Shabab, Md. Abul Hassan Samee, M. Sohel Rahman
{"title":"GraphAge: Unleashing the power of Graph Neural Network to Decode Epigenetic Aging","authors":"Saleh Sakib Ahmed, Nahian Shabab, Md. Abul Hassan Samee, M. Sohel Rahman","doi":"arxiv-2408.00984","DOIUrl":null,"url":null,"abstract":"DNA methylation is a crucial epigenetic marker used in various clocks to\npredict epigenetic age. However, many existing clocks fail to account for\ncrucial information about CpG sites and their interrelationships, such as\nco-methylation patterns. We present a novel approach to represent methylation\ndata as a graph, using methylation values and relevant information about CpG\nsites as nodes, and relationships like co-methylation, same gene, and same\nchromosome as edges. We then use a Graph Neural Network (GNN) to predict age.\nThus our model, GraphAge, leverages both structural and positional information\nfor prediction as well as better interpretation. Although we had to train in a\nconstrained compute setting, GraphAge still showed competitive performance with\na Mean Absolute Error (MAE) of 3.207 and a Mean Squared Error (MSE) of 25.277,\nslightly outperforming the current state of the art. Perhaps more importantly,\nwe utilized GNN explainer for interpretation purposes and were able to unearth\ninteresting insights (e.g., key CpG sites, pathways, and their relationships\nthrough Methylation Regulated Networks in the context of aging), which were not\npossible to 'decode' without leveraging the unique capability of GraphAge to\n'encode' various structural relationships. GraphAge has the potential to\nconsume and utilize all relevant information (if available) about an individual\nthat relates to the complex process of aging. So, in that sense, it is one of\nits kind and can be seen as the first benchmark for a multimodal model that can\nincorporate all this information in order to close the gap in our understanding\nof the true nature of aging.","PeriodicalId":501266,"journal":{"name":"arXiv - QuanBio - Quantitative Methods","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuanBio - Quantitative Methods","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.00984","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
DNA methylation is a crucial epigenetic marker used in various clocks to
predict epigenetic age. However, many existing clocks fail to account for
crucial information about CpG sites and their interrelationships, such as
co-methylation patterns. We present a novel approach to represent methylation
data as a graph, using methylation values and relevant information about CpG
sites as nodes, and relationships like co-methylation, same gene, and same
chromosome as edges. We then use a Graph Neural Network (GNN) to predict age.
Thus our model, GraphAge, leverages both structural and positional information
for prediction as well as better interpretation. Although we had to train in a
constrained compute setting, GraphAge still showed competitive performance with
a Mean Absolute Error (MAE) of 3.207 and a Mean Squared Error (MSE) of 25.277,
slightly outperforming the current state of the art. Perhaps more importantly,
we utilized GNN explainer for interpretation purposes and were able to unearth
interesting insights (e.g., key CpG sites, pathways, and their relationships
through Methylation Regulated Networks in the context of aging), which were not
possible to 'decode' without leveraging the unique capability of GraphAge to
'encode' various structural relationships. GraphAge has the potential to
consume and utilize all relevant information (if available) about an individual
that relates to the complex process of aging. So, in that sense, it is one of
its kind and can be seen as the first benchmark for a multimodal model that can
incorporate all this information in order to close the gap in our understanding
of the true nature of aging.