Ali Royat, Seyed Mohamad Moghadas, Lesley De Cruz, Adrian Munteanu
{"title":"GINTRIP: Interpretable Temporal Graph Regression using Information bottleneck and Prototype-based method","authors":"Ali Royat, Seyed Mohamad Moghadas, Lesley De Cruz, Adrian Munteanu","doi":"arxiv-2409.10996","DOIUrl":null,"url":null,"abstract":"Deep neural networks (DNNs) have demonstrated remarkable performance across\nvarious domains, yet their application to temporal graph regression tasks faces\nsignificant challenges regarding interpretability. This critical issue, rooted\nin the inherent complexity of both DNNs and underlying spatio-temporal patterns\nin the graph, calls for innovative solutions. While interpretability concerns\nin Graph Neural Networks (GNNs) mirror those of DNNs, to the best of our\nknowledge, no notable work has addressed the interpretability of temporal GNNs\nusing a combination of Information Bottleneck (IB) principles and\nprototype-based methods. Our research introduces a novel approach that uniquely\nintegrates these techniques to enhance the interpretability of temporal graph\nregression models. The key contributions of our work are threefold: We\nintroduce the \\underline{G}raph \\underline{IN}terpretability in\n\\underline{T}emporal \\underline{R}egression task using \\underline{I}nformation\nbottleneck and \\underline{P}rototype (GINTRIP) framework, the first combined\napplication of IB and prototype-based methods for interpretable temporal graph\ntasks. We derive a novel theoretical bound on mutual information (MI),\nextending the applicability of IB principles to graph regression tasks. We\nincorporate an unsupervised auxiliary classification head, fostering multi-task\nlearning and diverse concept representation, which enhances the model\nbottleneck's interpretability. Our model is evaluated on real-world traffic\ndatasets, outperforming existing methods in both forecasting accuracy and\ninterpretability-related metrics.","PeriodicalId":501301,"journal":{"name":"arXiv - CS - Machine Learning","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Machine Learning","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.10996","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Deep neural networks (DNNs) have demonstrated remarkable performance across
various domains, yet their application to temporal graph regression tasks faces
significant challenges regarding interpretability. This critical issue, rooted
in the inherent complexity of both DNNs and underlying spatio-temporal patterns
in the graph, calls for innovative solutions. While interpretability concerns
in Graph Neural Networks (GNNs) mirror those of DNNs, to the best of our
knowledge, no notable work has addressed the interpretability of temporal GNNs
using a combination of Information Bottleneck (IB) principles and
prototype-based methods. Our research introduces a novel approach that uniquely
integrates these techniques to enhance the interpretability of temporal graph
regression models. The key contributions of our work are threefold: We
introduce the \underline{G}raph \underline{IN}terpretability in
\underline{T}emporal \underline{R}egression task using \underline{I}nformation
bottleneck and \underline{P}rototype (GINTRIP) framework, the first combined
application of IB and prototype-based methods for interpretable temporal graph
tasks. We derive a novel theoretical bound on mutual information (MI),
extending the applicability of IB principles to graph regression tasks. We
incorporate an unsupervised auxiliary classification head, fostering multi-task
learning and diverse concept representation, which enhances the model
bottleneck's interpretability. Our model is evaluated on real-world traffic
datasets, outperforming existing methods in both forecasting accuracy and
interpretability-related metrics.