Lingqiang Chen;Qinglin Zhao;Guanghui Li;Mengchu Zhou;Chenglong Dai;Yiming Feng;Xiaowei Liu;Jinjiang Li
{"title":"A Sparse Cross Attention-Based Graph Convolution Network With Auxiliary Information Awareness for Traffic Flow Prediction","authors":"Lingqiang Chen;Qinglin Zhao;Guanghui Li;Mengchu Zhou;Chenglong Dai;Yiming Feng;Xiaowei Liu;Jinjiang Li","doi":"10.1109/TITS.2025.3533560","DOIUrl":null,"url":null,"abstract":"Deep graph convolutional networks (GCNs) have shown promising performance in traffic prediction tasks, but their practical deployment on resource-constrained devices faces challenges. First, few models consider the potential influence of historical and future auxiliary information, such as weather and holidays, on complex traffic patterns. Second, the computational complexity of dynamic graph convolution operations grows quadratically with the number of traffic nodes, limiting model scalability. To address these challenges, this study proposes a deep encoder-decoder model named AIMSAN, which comprises an auxiliary information-aware module (AIM) and a sparse cross-attention-based graph convolutional network (SAN). From historical or future perspectives, AIM prunes multi-attribute auxiliary data into diverse time frames, and embeds them into one tensor. SAN employs a cross-attention mechanism to merge traffic data with historical embedded data in each encoder layer, forming dynamic adjacency matrices. Subsequently, it applies diffusion GCN to capture rich spatial-temporal dynamics from the traffic data. Additionally, AIMSAN utilizes the spatial sparsity of traffic nodes as a mask to mitigate the quadratic computational complexity of SAN, thereby improving overall computational efficiency. In the decoder layer, future embedded data are fused with feed-forward traffic data to generate prediction results. Experimental evaluations on three public traffic datasets demonstrate that AIMSAN achieves competitive performance compared to state-of-the-art algorithms, while reducing GPU memory consumption by 41.24%, training time by 62.09%, and validation time by 65.17% on average.","PeriodicalId":13416,"journal":{"name":"IEEE Transactions on Intelligent Transportation Systems","volume":"26 3","pages":"3210-3222"},"PeriodicalIF":7.9000,"publicationDate":"2025-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Intelligent Transportation Systems","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10870873/","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
引用次数: 0
Abstract
Deep graph convolutional networks (GCNs) have shown promising performance in traffic prediction tasks, but their practical deployment on resource-constrained devices faces challenges. First, few models consider the potential influence of historical and future auxiliary information, such as weather and holidays, on complex traffic patterns. Second, the computational complexity of dynamic graph convolution operations grows quadratically with the number of traffic nodes, limiting model scalability. To address these challenges, this study proposes a deep encoder-decoder model named AIMSAN, which comprises an auxiliary information-aware module (AIM) and a sparse cross-attention-based graph convolutional network (SAN). From historical or future perspectives, AIM prunes multi-attribute auxiliary data into diverse time frames, and embeds them into one tensor. SAN employs a cross-attention mechanism to merge traffic data with historical embedded data in each encoder layer, forming dynamic adjacency matrices. Subsequently, it applies diffusion GCN to capture rich spatial-temporal dynamics from the traffic data. Additionally, AIMSAN utilizes the spatial sparsity of traffic nodes as a mask to mitigate the quadratic computational complexity of SAN, thereby improving overall computational efficiency. In the decoder layer, future embedded data are fused with feed-forward traffic data to generate prediction results. Experimental evaluations on three public traffic datasets demonstrate that AIMSAN achieves competitive performance compared to state-of-the-art algorithms, while reducing GPU memory consumption by 41.24%, training time by 62.09%, and validation time by 65.17% on average.
期刊介绍:
The theoretical, experimental and operational aspects of electrical and electronics engineering and information technologies as applied to Intelligent Transportation Systems (ITS). Intelligent Transportation Systems are defined as those systems utilizing synergistic technologies and systems engineering concepts to develop and improve transportation systems of all kinds. The scope of this interdisciplinary activity includes the promotion, consolidation and coordination of ITS technical activities among IEEE entities, and providing a focus for cooperative activities, both internally and externally.