{"title":"基于激光雷达的自动驾驶车辆可穿越性分析的双支路变压器网络","authors":"Shiliang Shao;Xianyu Shi;Guangjie Han;Ting Wang;Chunhe Song;Qi Zhang","doi":"10.1109/TITS.2024.3508839","DOIUrl":null,"url":null,"abstract":"In this study, we address the challenge of traversability analysis for autonomous vehicles in diverse environments, leveraging LiDAR sensors. We propose the Transformer-Voxel-Bird’s eye view (BEV) Network (TVBNet), a novel dual-branch framework designed to increase the accuracy and versatility of such analyses in both urban and off-road conditions. TVBNet first preprocesses raw point cloud data through voxelization and the generation of a BEV. It incorporates a Transformer network with a rotational attention mechanism to aggregate features from multiple point cloud frames, capturing long-range correlations both within and between point clouds. Additionally, a Swin Transformer extracts the relative positional relationships in the BEV projection, facilitating a comprehensive understanding of the scene. The fusion of data from both branches via a multisource feature fusion module, which employs a context aggregation mechanism based on a residual structure, allows for robust local to global contextual understanding. This approach not only improves the extraction of correlation features between 2D BEV and 3D voxel data but also demonstrates superior performance on the challenging off-road dataset RELLIS-3D and the urban dataset SemanticKITTI.","PeriodicalId":13416,"journal":{"name":"IEEE Transactions on Intelligent Transportation Systems","volume":"26 2","pages":"2582-2595"},"PeriodicalIF":8.4000,"publicationDate":"2024-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Dual-Branch Transformer Network for Enhancing LiDAR-Based Traversability Analysis in Autonomous Vehicles\",\"authors\":\"Shiliang Shao;Xianyu Shi;Guangjie Han;Ting Wang;Chunhe Song;Qi Zhang\",\"doi\":\"10.1109/TITS.2024.3508839\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this study, we address the challenge of traversability analysis for autonomous vehicles in diverse environments, leveraging LiDAR sensors. We propose the Transformer-Voxel-Bird’s eye view (BEV) Network (TVBNet), a novel dual-branch framework designed to increase the accuracy and versatility of such analyses in both urban and off-road conditions. TVBNet first preprocesses raw point cloud data through voxelization and the generation of a BEV. It incorporates a Transformer network with a rotational attention mechanism to aggregate features from multiple point cloud frames, capturing long-range correlations both within and between point clouds. Additionally, a Swin Transformer extracts the relative positional relationships in the BEV projection, facilitating a comprehensive understanding of the scene. The fusion of data from both branches via a multisource feature fusion module, which employs a context aggregation mechanism based on a residual structure, allows for robust local to global contextual understanding. This approach not only improves the extraction of correlation features between 2D BEV and 3D voxel data but also demonstrates superior performance on the challenging off-road dataset RELLIS-3D and the urban dataset SemanticKITTI.\",\"PeriodicalId\":13416,\"journal\":{\"name\":\"IEEE Transactions on Intelligent Transportation Systems\",\"volume\":\"26 2\",\"pages\":\"2582-2595\"},\"PeriodicalIF\":8.4000,\"publicationDate\":\"2024-12-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Intelligent Transportation Systems\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10790548/\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, CIVIL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Intelligent Transportation Systems","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10790548/","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
Dual-Branch Transformer Network for Enhancing LiDAR-Based Traversability Analysis in Autonomous Vehicles
In this study, we address the challenge of traversability analysis for autonomous vehicles in diverse environments, leveraging LiDAR sensors. We propose the Transformer-Voxel-Bird’s eye view (BEV) Network (TVBNet), a novel dual-branch framework designed to increase the accuracy and versatility of such analyses in both urban and off-road conditions. TVBNet first preprocesses raw point cloud data through voxelization and the generation of a BEV. It incorporates a Transformer network with a rotational attention mechanism to aggregate features from multiple point cloud frames, capturing long-range correlations both within and between point clouds. Additionally, a Swin Transformer extracts the relative positional relationships in the BEV projection, facilitating a comprehensive understanding of the scene. The fusion of data from both branches via a multisource feature fusion module, which employs a context aggregation mechanism based on a residual structure, allows for robust local to global contextual understanding. This approach not only improves the extraction of correlation features between 2D BEV and 3D voxel data but also demonstrates superior performance on the challenging off-road dataset RELLIS-3D and the urban dataset SemanticKITTI.
期刊介绍:
The theoretical, experimental and operational aspects of electrical and electronics engineering and information technologies as applied to Intelligent Transportation Systems (ITS). Intelligent Transportation Systems are defined as those systems utilizing synergistic technologies and systems engineering concepts to develop and improve transportation systems of all kinds. The scope of this interdisciplinary activity includes the promotion, consolidation and coordination of ITS technical activities among IEEE entities, and providing a focus for cooperative activities, both internally and externally.