Real-Time Attribute Based Deep Learning Network for Traffic Sign Detection

H. M. Elhawary, U. Suddamalla, M. I. Shapiai, A. Wong, H. Zamzuri
{"title":"Real-Time Attribute Based Deep Learning Network for Traffic Sign Detection","authors":"H. M. Elhawary, U. Suddamalla, M. I. Shapiai, A. Wong, H. Zamzuri","doi":"10.1109/ICITEE56407.2022.9954110","DOIUrl":null,"url":null,"abstract":"Traffic sign detection is one of the key components of the Advanced Driving Assistant System (ADAS), which aims to detect and classify street signs in real time. However, traffic sign detection has challenges in real applications requiring high precision and real-time recall. Those challenges are due to the small object size and class imbalance. Recently, researchers have proposed several techniques to improve the detection quality by enriching the features through a multiscale network, introducing attention mechanisms and augmentation techniques to improve the features of tiny objects. To overcome class imbalance researchers proposed cascaded networks and various loss functions. However, those existing techniques and mechanisms added more complexity to the model. Meanwhile, the imbalance affects single-stage networks such as YOLO, which causes a lower recall for minor classes. We proposed a new training method for a single-stage detection network, known as Real Time Attribute Based Deep Learning Detection Network (Real Time-Attribute DL). We introduced new attributes to the loss and Non-Maximum Suppression (NMS) to reduce the class number by categorizing it based on the shape of the traffic sign while maintaining the same number of classes. Our proposed method extends the YOLO detection head to have four main parameters: objectiveness, regression, class, and attribute. We modify the loss function to train the network jointly between class and attribute. We validate our proposed technique with Tsinghua-Tencent 100K(TT100K) as a benchmark dataset. The results show that our proposed technique improves the recall index from 85.85% to 94.26% in yolov4-tiny-31 with a 0.8% improvement in precision and improves the recall index from 93.51% to 96.68% in yolov4 with a drop by 2% in precision without adding extra complexity to the main network. The proposed technique offers a better recall index than the baseline, especially for imbalanced datasets such as TT100K datasets.","PeriodicalId":246279,"journal":{"name":"2022 14th International Conference on Information Technology and Electrical Engineering (ICITEE)","volume":"399 4-6","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 14th International Conference on Information Technology and Electrical Engineering (ICITEE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICITEE56407.2022.9954110","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Traffic sign detection is one of the key components of the Advanced Driving Assistant System (ADAS), which aims to detect and classify street signs in real time. However, traffic sign detection has challenges in real applications requiring high precision and real-time recall. Those challenges are due to the small object size and class imbalance. Recently, researchers have proposed several techniques to improve the detection quality by enriching the features through a multiscale network, introducing attention mechanisms and augmentation techniques to improve the features of tiny objects. To overcome class imbalance researchers proposed cascaded networks and various loss functions. However, those existing techniques and mechanisms added more complexity to the model. Meanwhile, the imbalance affects single-stage networks such as YOLO, which causes a lower recall for minor classes. We proposed a new training method for a single-stage detection network, known as Real Time Attribute Based Deep Learning Detection Network (Real Time-Attribute DL). We introduced new attributes to the loss and Non-Maximum Suppression (NMS) to reduce the class number by categorizing it based on the shape of the traffic sign while maintaining the same number of classes. Our proposed method extends the YOLO detection head to have four main parameters: objectiveness, regression, class, and attribute. We modify the loss function to train the network jointly between class and attribute. We validate our proposed technique with Tsinghua-Tencent 100K(TT100K) as a benchmark dataset. The results show that our proposed technique improves the recall index from 85.85% to 94.26% in yolov4-tiny-31 with a 0.8% improvement in precision and improves the recall index from 93.51% to 96.68% in yolov4 with a drop by 2% in precision without adding extra complexity to the main network. The proposed technique offers a better recall index than the baseline, especially for imbalanced datasets such as TT100K datasets.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于实时属性的深度学习网络交通标志检测
交通标志检测是高级驾驶辅助系统(ADAS)的关键组成部分之一,旨在实时检测和分类道路标志。然而,交通标志检测在需要高精度和实时召回的实际应用中面临着挑战。这些挑战是由于小的对象大小和类的不平衡。近年来,研究人员提出了几种提高检测质量的技术,通过多尺度网络丰富特征,引入注意机制和增强技术来改善微小物体的特征。为了克服类不平衡,研究者提出了级联网络和各种损失函数。然而,这些现有的技术和机制增加了模型的复杂性。同时,这种不平衡影响了单阶段网络,如YOLO,这导致了小类别的召回率较低。我们提出了一种新的单阶段检测网络的训练方法,称为基于实时属性的深度学习检测网络(Real Time-Attribute DL)。我们为损失和非最大抑制(NMS)引入了新的属性,通过基于交通标志的形状对其进行分类来减少类数,同时保持相同的类数。我们提出的方法扩展了YOLO检测头,使其具有四个主要参数:客观性、回归性、类和属性。我们修改损失函数,使网络在类和属性之间联合训练。我们用清华-腾讯100K(TT100K)作为基准数据集验证了我们提出的技术。结果表明,在不增加主网络复杂度的情况下,将yolov4-tiny-31的查全指数从85.85%提高到94.26%,查全精度提高0.8%;将yolov4的查全指数从93.51%提高到96.68%,查全精度降低2%。该技术提供了比基线更好的召回指数,特别是对于TT100K数据集等不平衡数据集。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Measuring Trust Perception during Information Evaluation Comparative Study of Multi-channel and Omni-channel on Supply Chain Management Classifying Stress Mental State by using Power Spectral Density of Electroencephalography (EEG) Development of CNN Pruning Method for Earthquake Signal Imagery Classification Object Selection Using LSTM Networks for Spontaneous Gaze-Based Interaction
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1