{"title":"A bilateral semantic guidance network for detection of off-road freespace with impairments based on joint semantic segmentation and edge detection","authors":"Jiyuan Qiu, Chen Jiang","doi":"10.1016/j.compeleceng.2024.110045","DOIUrl":null,"url":null,"abstract":"<div><div>Freespace detection is one of the key technologies for scene understanding and motion planning in autonomous vehicles. However, current research on freespace detection primarily focuses on obstacles provided by objects above the freespace, such as vehicles, pedestrians, and buildings, while less attention is given to impairments within the freespace, such as potholes, defects, and collapses. Moreover, there is a lack of research on the interpretability of artificial intelligence in freespace detection. In this study, we first construct a large-scale off-road freespace detection dataset with impairments (ORIFD). The dataset comprises a total of 24,000 images representing different weather conditions (day, night, snow, etc.) and terrains (concrete roads, dirt roads, rocky paths, etc.). The impairments include potholes, defects, water puddles, and collapses. In addition to adding semantic labels for freespace and impairments, we also create semantic edge labels to enhance the extraction of scene information. Subsequently, we train a novel semantic guidance network (BSGNet) on this dataset, designed to simultaneously perform freespace detection and semantic edge detection tasks. Our framework consists of a deep extended dual-branch encoder, where one branch aggregates multi-scale semantic features, and the other extracts semantic edge information. We also propose an interactive fusion block (IFB) and a global feature aggregation module (GFAM) to enhance the model's feature representation capabilities. Extensive experiments demonstrate that our model outperforms existing state-of-the-art models, achieving superior performance. Additionally, we employ explainable artificial intelligence (XAI) methods to enhance the trustworthiness of our model and implement a method that combines bird's-eye view with the hybrid A* algorithm for generating effective collision-free paths, further extending the application of our research in autonomous vehicles.</div></div>","PeriodicalId":50630,"journal":{"name":"Computers & Electrical Engineering","volume":"123 ","pages":"Article 110045"},"PeriodicalIF":4.0000,"publicationDate":"2025-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Electrical Engineering","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0045790624009704","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0
Abstract
Freespace detection is one of the key technologies for scene understanding and motion planning in autonomous vehicles. However, current research on freespace detection primarily focuses on obstacles provided by objects above the freespace, such as vehicles, pedestrians, and buildings, while less attention is given to impairments within the freespace, such as potholes, defects, and collapses. Moreover, there is a lack of research on the interpretability of artificial intelligence in freespace detection. In this study, we first construct a large-scale off-road freespace detection dataset with impairments (ORIFD). The dataset comprises a total of 24,000 images representing different weather conditions (day, night, snow, etc.) and terrains (concrete roads, dirt roads, rocky paths, etc.). The impairments include potholes, defects, water puddles, and collapses. In addition to adding semantic labels for freespace and impairments, we also create semantic edge labels to enhance the extraction of scene information. Subsequently, we train a novel semantic guidance network (BSGNet) on this dataset, designed to simultaneously perform freespace detection and semantic edge detection tasks. Our framework consists of a deep extended dual-branch encoder, where one branch aggregates multi-scale semantic features, and the other extracts semantic edge information. We also propose an interactive fusion block (IFB) and a global feature aggregation module (GFAM) to enhance the model's feature representation capabilities. Extensive experiments demonstrate that our model outperforms existing state-of-the-art models, achieving superior performance. Additionally, we employ explainable artificial intelligence (XAI) methods to enhance the trustworthiness of our model and implement a method that combines bird's-eye view with the hybrid A* algorithm for generating effective collision-free paths, further extending the application of our research in autonomous vehicles.
期刊介绍:
The impact of computers has nowhere been more revolutionary than in electrical engineering. The design, analysis, and operation of electrical and electronic systems are now dominated by computers, a transformation that has been motivated by the natural ease of interface between computers and electrical systems, and the promise of spectacular improvements in speed and efficiency.
Published since 1973, Computers & Electrical Engineering provides rapid publication of topical research into the integration of computer technology and computational techniques with electrical and electronic systems. The journal publishes papers featuring novel implementations of computers and computational techniques in areas like signal and image processing, high-performance computing, parallel processing, and communications. Special attention will be paid to papers describing innovative architectures, algorithms, and software tools.