Lane detection via disentangled representation network with slope consistency loss

IF 7.5 2区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Engineering Applications of Artificial Intelligence Pub Date : 2024-10-20 DOI:10.1016/j.engappai.2024.109449

Zhaodong Ding , Yifei Deng , Chenglong Li , Rui Ruan , Jin Tang

{"title":"Lane detection via disentangled representation network with slope consistency loss","authors":"Zhaodong Ding , Yifei Deng , Chenglong Li , Rui Ruan , Jin Tang","doi":"10.1016/j.engappai.2024.109449","DOIUrl":null,"url":null,"abstract":"<div><div>Existing works in lane detection focus on learning the general robust representation across different scenarios to overcome the impact of the lack of visual cues. However, factors leading to the absence of visual cues vary across different scenarios and the training data from challenging conditions is relatively small compared to common conditions. These problems result in the inability of existing methods to maintain robust lane detection in different scenarios for practical applications. To address these problems, this work presents a novel Disentangled Representation Network called DRNet, which disentangles the lane feature representations using a disentangled representation network to efficiently learn the lane representations corresponding to the specific condition. Meanwhile, DRNet also mitigates the adverse effects of data imbalance. Specifically, we disentangle lane representation via five branches, respectively to the common scenes, crowded objects, low light, dazzle light and other conditions. Due to the separated model of different conditions, each branch can be represented using a small number of parameters, which can be sufficiently learned using corresponding training subset. Moreover, existing works perform lane classification or regression using pixel-level losses, which neglect the important shape information. To this end, we design a novel slope consistency loss to take both global and local slope consistencies between prediction and ground truth into account for lane detection, which can adaptively adjust the lane shape and location. Extensive experiments on the CULane and TuSimple datasets show that our DRNet outperforms state-of-the-art methods, as it can reach 81.07% <em>F1</em> on CULane and 97.97% on TuSimple.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":7.5000,"publicationDate":"2024-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197624016075","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Existing works in lane detection focus on learning the general robust representation across different scenarios to overcome the impact of the lack of visual cues. However, factors leading to the absence of visual cues vary across different scenarios and the training data from challenging conditions is relatively small compared to common conditions. These problems result in the inability of existing methods to maintain robust lane detection in different scenarios for practical applications. To address these problems, this work presents a novel Disentangled Representation Network called DRNet, which disentangles the lane feature representations using a disentangled representation network to efficiently learn the lane representations corresponding to the specific condition. Meanwhile, DRNet also mitigates the adverse effects of data imbalance. Specifically, we disentangle lane representation via five branches, respectively to the common scenes, crowded objects, low light, dazzle light and other conditions. Due to the separated model of different conditions, each branch can be represented using a small number of parameters, which can be sufficiently learned using corresponding training subset. Moreover, existing works perform lane classification or regression using pixel-level losses, which neglect the important shape information. To this end, we design a novel slope consistency loss to take both global and local slope consistencies between prediction and ground truth into account for lane detection, which can adaptively adjust the lane shape and location. Extensive experiments on the CULane and TuSimple datasets show that our DRNet outperforms state-of-the-art methods, as it can reach 81.07% F1 on CULane and 97.97% on TuSimple.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

通过具有斜率一致性损失的分解表示网络进行车道检测

车道检测方面的现有研究侧重于学习不同场景下的通用鲁棒表示法，以克服缺乏视觉线索的影响。然而，导致视觉线索缺失的因素在不同场景下各不相同，而且与普通情况相比，挑战性条件下的训练数据相对较少。这些问题导致现有方法无法在实际应用的不同场景中保持稳健的车道检测。为了解决这些问题，本研究提出了一种名为 DRNet 的新型分离表征网络，它利用分离表征网络对车道特征表征进行分离，从而有效地学习与特定条件相对应的车道表征。同时，DRNet 还能减轻数据不平衡带来的不利影响。具体来说，我们通过五个分支来分解车道表征，分别是常见场景、拥挤物体、弱光、眩光和其他条件。由于将不同条件的模型分离开来，每个分支都可以使用少量参数来表示，而这些参数可以通过相应的训练子集进行充分学习。此外，现有研究使用像素级损失进行车道分类或回归，忽略了重要的形状信息。为此，我们设计了一种新颖的斜率一致性损失，在车道检测中同时考虑预测值与地面实况之间的全局和局部斜率一致性，从而可以自适应地调整车道形状和位置。在 CULane 和 TuSimple 数据集上进行的大量实验表明，我们的 DRNet 优于最先进的方法，在 CULane 和 TuSimple 数据集上的 F1 分别达到 81.07% 和 97.97%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Engineering Applications of Artificial Intelligence 工程技术-工程：电子与电气

CiteScore

9.60

自引率

10.00%

发文量

505

审稿时长

68 days

期刊介绍： Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.