Unleashing the power of generative adversarial networks: A novel machine learning approach for vehicle detection and localisation in the dark

Cognitive Computation and Systems | IF 1.2 | JCR Q4 (Computer Science, Artificial Intelligence) | Pub Date: 2023-09-02 | DOI: 10.1049/ccs2.12085
Md Saif Hassan Onim, Hussain Nyeem, Md. Wahiduzzaman Khan Arnob, Arunima Dey Pooja
Citations: 0

Abstract

Machine vision in low-light conditions is a critical requirement for object detection in road transportation, particularly in assisted and autonomous driving. Existing vision-based techniques are limited to daylight traffic scenarios because they rely on adequate lighting and high frame rates. This paper addresses the problem by investigating Vehicle Detection and Localisation (VDL) in extremely low-light conditions with a new machine learning model. Specifically, the proposed model employs two customised generative adversarial networks, based on Pix2PixGAN and CycleGAN, to enhance dark images before they are passed to a YOLOv4-based VDL algorithm. The model's performance is analysed and compared against prominent existing models. Our findings indicate that the proposed model detects and localises vehicles accurately in extremely dark images, with an additional run-time of approximately 11 ms and an accuracy improvement of 10%–50% compared to the other models. Moreover, our model achieves a 4%–8% increase in Intersection over Union (IoU) at a mean frame rate of 9 fps, which underscores its potential for broader application in road-object detection. The results position the proposed model as an early step towards overcoming the challenges of low-light vision in road-object detection and autonomous driving, and towards safer and more efficient transportation systems.
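
The abstract reports localisation quality as Intersection over Union (IoU). As a reminder of what that metric measures, the sketch below computes IoU for two axis-aligned bounding boxes. The `(x_min, y_min, x_max, y_max)` box layout and the function name `iou` are assumptions made for illustration; the abstract does not describe the authors' evaluation code.

```python
from typing import Tuple

# Assumed box layout (not taken from the paper): (x_min, y_min, x_max, y_max)
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Return the Intersection over Union of two axis-aligned boxes."""
    # Corners of the overlapping rectangle, if the boxes overlap at all.
    ix_min = max(a[0], b[0])
    iy_min = max(a[1], b[1])
    ix_max = min(a[2], b[2])
    iy_max = min(a[3], b[3])

    # Clamp to zero so disjoint boxes contribute no intersection area.
    inter_w = max(0.0, ix_max - ix_min)
    inter_h = max(0.0, iy_max - iy_min)
    intersection = inter_w * inter_h

    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - intersection

    # Degenerate (zero-area) inputs are treated as having no overlap.
    return intersection / union if union > 0 else 0.0


if __name__ == "__main__":
    predicted = (0.0, 0.0, 2.0, 2.0)
    ground_truth = (1.0, 1.0, 3.0, 3.0)
    print(iou(predicted, ground_truth))
```

A value of 1.0 means the predicted box matches the ground-truth box exactly, and 0.0 means the boxes do not overlap, so an increase in IoU corresponds to tighter localisation.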

Source journal
Cognitive Computation and Systems
Subject area: Computer Science, Computer Science Applications
CiteScore: 2.50
Self-citation rate: 0.00%
Articles published: 39
Review time: 10 weeks