Spatiotemporal Pyramid Aggregation and Graph Attention for Scene Perception and Tajectory Prediction

Jianhong Zou, Yihui Cui, Ting Zhao, Weihua Ouyang, Bei Luo, Qilie Liu
{"title":"Spatiotemporal Pyramid Aggregation and Graph Attention for Scene Perception and Tajectory Prediction","authors":"Jianhong Zou, Yihui Cui, Ting Zhao, Weihua Ouyang, Bei Luo, Qilie Liu","doi":"10.1109/ACAIT56212.2022.10137838","DOIUrl":null,"url":null,"abstract":"In the autonomous driving system, accurate scene perception and trajectory prediction are critical for collision avoidance and path planning of autonomous vehicles. This paper proposes a scene perception and trajectory prediction method based on graph attention mechanism to learn semantic and interaction information based on bird eye’s view (BEV) map. The method includes spatiotemporal pyramid network and graph attention network. The former uses spatiotemporal pyramid network to model the surrounding information to obtain scene features, and graph attention network models the interaction information of the surrounding traffic participants to obtain graph interactive features. Then, scene semantic features and graph interaction features are fused into a unified feature space to perform downstream pixel-level classification and trajectory prediction tasks. Compared with baseline method, the proposed method significantly improves the average classification accuracy and reduces the average error of trajectory prediction with high efficiency. Experimental results show that the proposed method has better performance and is more feasible for deployment in real-world automatic driving scenarios.","PeriodicalId":398228,"journal":{"name":"2022 6th Asian Conference on Artificial Intelligence Technology (ACAIT)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 6th Asian Conference on Artificial Intelligence Technology (ACAIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACAIT56212.2022.10137838","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In the autonomous driving system, accurate scene perception and trajectory prediction are critical for collision avoidance and path planning of autonomous vehicles. This paper proposes a scene perception and trajectory prediction method based on graph attention mechanism to learn semantic and interaction information based on bird eye’s view (BEV) map. The method includes spatiotemporal pyramid network and graph attention network. The former uses spatiotemporal pyramid network to model the surrounding information to obtain scene features, and graph attention network models the interaction information of the surrounding traffic participants to obtain graph interactive features. Then, scene semantic features and graph interaction features are fused into a unified feature space to perform downstream pixel-level classification and trajectory prediction tasks. Compared with baseline method, the proposed method significantly improves the average classification accuracy and reduces the average error of trajectory prediction with high efficiency. Experimental results show that the proposed method has better performance and is more feasible for deployment in real-world automatic driving scenarios.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
场景感知与轨迹预测的时空金字塔聚集与图注意
在自动驾驶系统中,准确的场景感知和轨迹预测对自动驾驶车辆的避碰和路径规划至关重要。本文提出了一种基于图注意机制的场景感知和轨迹预测方法,以学习基于鸟瞰图的语义和交互信息。该方法包括时空金字塔网络和图注意力网络。前者利用时空金字塔网络对周围信息进行建模,得到场景特征;图关注网络对周围交通参与者的交互信息进行建模,得到图交互特征。然后,将场景语义特征和图交互特征融合成一个统一的特征空间,完成下游像素级分类和轨迹预测任务。与基线方法相比,该方法显著提高了平均分类精度,有效地降低了轨迹预测的平均误差。实验结果表明,该方法具有更好的性能,在实际自动驾驶场景中部署更加可行。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Transformer with Global and Local Interaction for Pedestrian Trajectory Prediction The Use of Explainable Artificial Intelligence in Music—Take Professor Nick Bryan-Kinns’ “XAI+Music” Research as a Perspective Playing Fight the Landlord with Tree Search and Hidden Information Evaluation Evaluation Method of Innovative Economic Benefits of Enterprise Human Capital Based on Deep Learning An Attribute Contribution-Based K-Nearest Neighbor Classifier
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1