Zhaodong Ding , Yifei Deng , Chenglong Li , Rui Ruan , Jin Tang
{"title":"Lane detection via disentangled representation network with slope consistency loss","authors":"Zhaodong Ding , Yifei Deng , Chenglong Li , Rui Ruan , Jin Tang","doi":"10.1016/j.engappai.2024.109449","DOIUrl":null,"url":null,"abstract":"<div><div>Existing works in lane detection focus on learning the general robust representation across different scenarios to overcome the impact of the lack of visual cues. However, factors leading to the absence of visual cues vary across different scenarios and the training data from challenging conditions is relatively small compared to common conditions. These problems result in the inability of existing methods to maintain robust lane detection in different scenarios for practical applications. To address these problems, this work presents a novel Disentangled Representation Network called DRNet, which disentangles the lane feature representations using a disentangled representation network to efficiently learn the lane representations corresponding to the specific condition. Meanwhile, DRNet also mitigates the adverse effects of data imbalance. Specifically, we disentangle lane representation via five branches, respectively to the common scenes, crowded objects, low light, dazzle light and other conditions. Due to the separated model of different conditions, each branch can be represented using a small number of parameters, which can be sufficiently learned using corresponding training subset. Moreover, existing works perform lane classification or regression using pixel-level losses, which neglect the important shape information. To this end, we design a novel slope consistency loss to take both global and local slope consistencies between prediction and ground truth into account for lane detection, which can adaptively adjust the lane shape and location. Extensive experiments on the CULane and TuSimple datasets show that our DRNet outperforms state-of-the-art methods, as it can reach 81.07% <em>F1</em> on CULane and 97.97% on TuSimple.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":7.5000,"publicationDate":"2024-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197624016075","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Existing works in lane detection focus on learning the general robust representation across different scenarios to overcome the impact of the lack of visual cues. However, factors leading to the absence of visual cues vary across different scenarios and the training data from challenging conditions is relatively small compared to common conditions. These problems result in the inability of existing methods to maintain robust lane detection in different scenarios for practical applications. To address these problems, this work presents a novel Disentangled Representation Network called DRNet, which disentangles the lane feature representations using a disentangled representation network to efficiently learn the lane representations corresponding to the specific condition. Meanwhile, DRNet also mitigates the adverse effects of data imbalance. Specifically, we disentangle lane representation via five branches, respectively to the common scenes, crowded objects, low light, dazzle light and other conditions. Due to the separated model of different conditions, each branch can be represented using a small number of parameters, which can be sufficiently learned using corresponding training subset. Moreover, existing works perform lane classification or regression using pixel-level losses, which neglect the important shape information. To this end, we design a novel slope consistency loss to take both global and local slope consistencies between prediction and ground truth into account for lane detection, which can adaptively adjust the lane shape and location. Extensive experiments on the CULane and TuSimple datasets show that our DRNet outperforms state-of-the-art methods, as it can reach 81.07% F1 on CULane and 97.97% on TuSimple.
期刊介绍:
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.