{"title":"IGGCN: Individual-guided graph convolution network for pedestrian trajectory prediction","authors":"Wangxing Chen, Haifeng Sang, Jinyu Wang, Zishan Zhao","doi":"10.1016/j.dsp.2024.104862","DOIUrl":null,"url":null,"abstract":"<div><div>Accurately predicting the future trajectory of pedestrians is crucial for applications such as autonomous driving and robot navigation. Graph convolution is widely used in trajectory prediction tasks due to its scalability and adaptive feature-learning capabilities. However, there are two problems with pedestrian trajectory prediction methods based on graph convolution: 1. Previous methods struggled to adjust social interactions according to the attributes of different pedestrians, making it difficult to accurately model the relative importance between different pedestrians and others; 2. Previous methods lacked dynamic processing of pedestrian spatial-temporal interaction features to capture high-level spatial-temporal interaction features effectively. Therefore, we propose an Individual-Guided Graph Convolution Network (IGGCN) for pedestrian trajectory prediction. To tackle problem 1, we design an individual-guided interaction module that can adjust pedestrian social interaction modeling according to the pedestrian's attributes, thereby achieving an accurate description of the relative importance of pedestrians. We extend the module to temporal interaction modeling to further achieve an accurate description of the relative importance of time frames. To address problem 2, we design a deformable convolution module to dynamically process spatial-temporal interaction features through deformable convolution kernels, facilitating the capture of high-level spatial-temporal interaction features. We evaluate our method on the ETH, UCY, and SDD datasets. Quantitative analysis shows that our method has lower prediction errors than the current state-of-the-art methods. Qualitative analysis further reveals that our method effectively eliminates the influence of irrelevant pedestrians and accurately models the spatial-temporal interaction relationship of pedestrians.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"156 ","pages":"Article 104862"},"PeriodicalIF":2.9000,"publicationDate":"2024-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S105120042400486X","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Accurately predicting the future trajectory of pedestrians is crucial for applications such as autonomous driving and robot navigation. Graph convolution is widely used in trajectory prediction tasks due to its scalability and adaptive feature-learning capabilities. However, there are two problems with pedestrian trajectory prediction methods based on graph convolution: 1. Previous methods struggled to adjust social interactions according to the attributes of different pedestrians, making it difficult to accurately model the relative importance between different pedestrians and others; 2. Previous methods lacked dynamic processing of pedestrian spatial-temporal interaction features to capture high-level spatial-temporal interaction features effectively. Therefore, we propose an Individual-Guided Graph Convolution Network (IGGCN) for pedestrian trajectory prediction. To tackle problem 1, we design an individual-guided interaction module that can adjust pedestrian social interaction modeling according to the pedestrian's attributes, thereby achieving an accurate description of the relative importance of pedestrians. We extend the module to temporal interaction modeling to further achieve an accurate description of the relative importance of time frames. To address problem 2, we design a deformable convolution module to dynamically process spatial-temporal interaction features through deformable convolution kernels, facilitating the capture of high-level spatial-temporal interaction features. We evaluate our method on the ETH, UCY, and SDD datasets. Quantitative analysis shows that our method has lower prediction errors than the current state-of-the-art methods. Qualitative analysis further reveals that our method effectively eliminates the influence of irrelevant pedestrians and accurately models the spatial-temporal interaction relationship of pedestrians.
期刊介绍:
Digital Signal Processing: A Review Journal is one of the oldest and most established journals in the field of signal processing yet it aims to be the most innovative. The Journal invites top quality research articles at the frontiers of research in all aspects of signal processing. Our objective is to provide a platform for the publication of ground-breaking research in signal processing with both academic and industrial appeal.
The journal has a special emphasis on statistical signal processing methodology such as Bayesian signal processing, and encourages articles on emerging applications of signal processing such as:
• big data• machine learning• internet of things• information security• systems biology and computational biology,• financial time series analysis,• autonomous vehicles,• quantum computing,• neuromorphic engineering,• human-computer interaction and intelligent user interfaces,• environmental signal processing,• geophysical signal processing including seismic signal processing,• chemioinformatics and bioinformatics,• audio, visual and performance arts,• disaster management and prevention,• renewable energy,