P. Prajwal, D. Prajwal, D. H. Harish, R. Gajanana, B. Jayasri, S. Lokesh
{"title":"使用深度学习的自动驾驶汽车目标检测","authors":"P. Prajwal, D. Prajwal, D. H. Harish, R. Gajanana, B. Jayasri, S. Lokesh","doi":"10.1109/ICSES52305.2021.9633965","DOIUrl":null,"url":null,"abstract":"In the Computer Vision domain, there has been continuous growth and development with main focus so as to facilitate a smooth interaction between Machines and human. Perception, planning and control are the main aspects that make up the Self-driving system. Perception subsystem converts the raw data collected by sensors or other information capturing devices into a model of the environment surrounding us. Planning subsystem analyses this model of the surrounding environment and makes certain purposeful decisions based on the inferences obtained from the analysis. Finally, the Control Subsystem is responsible for execution of the actions or the decisions planned previously. The scope of this project is to study and analyze the problems faced in the Perception subsystem in the domain of detecting objects for autonomous cars. Previously, technologies like Radar, LiDAR, GPS and various other sensors had been employed for Driverless cars for mapping the surroundings of the car. However, in the recent past, some deep neural network (DNN) architectures like YOLO (You Only Look Once), and SSD (Single Shot MultiBox Detector) have been developed which are capable of detecting objects even when live video is considered as the input, thus having potential to be included as a part of the Driverless car systems. Selection of a model having considerable accuracy and producing results at a faster rate is very much essential so as to meet the requirements of object detection in driverless cars. In this project, we have used Caffe, which is developed by Berkeley AI Research and Community contributors as the deep learning framework. Keeping in mind the factors that contribute to the selection of a good model, we have chosen SSD model along-side MobileNet Neural network as the base architecture as it results in both faster rate of result production and has a moderate accuracy.","PeriodicalId":6777,"journal":{"name":"2021 International Conference on Innovative Computing, Intelligent Communication and Smart Electrical Systems (ICSES)","volume":"9 1","pages":"1-7"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Object Detection in Self Driving Cars Using Deep Learning\",\"authors\":\"P. Prajwal, D. Prajwal, D. H. Harish, R. Gajanana, B. Jayasri, S. Lokesh\",\"doi\":\"10.1109/ICSES52305.2021.9633965\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the Computer Vision domain, there has been continuous growth and development with main focus so as to facilitate a smooth interaction between Machines and human. Perception, planning and control are the main aspects that make up the Self-driving system. Perception subsystem converts the raw data collected by sensors or other information capturing devices into a model of the environment surrounding us. Planning subsystem analyses this model of the surrounding environment and makes certain purposeful decisions based on the inferences obtained from the analysis. Finally, the Control Subsystem is responsible for execution of the actions or the decisions planned previously. The scope of this project is to study and analyze the problems faced in the Perception subsystem in the domain of detecting objects for autonomous cars. Previously, technologies like Radar, LiDAR, GPS and various other sensors had been employed for Driverless cars for mapping the surroundings of the car. However, in the recent past, some deep neural network (DNN) architectures like YOLO (You Only Look Once), and SSD (Single Shot MultiBox Detector) have been developed which are capable of detecting objects even when live video is considered as the input, thus having potential to be included as a part of the Driverless car systems. Selection of a model having considerable accuracy and producing results at a faster rate is very much essential so as to meet the requirements of object detection in driverless cars. In this project, we have used Caffe, which is developed by Berkeley AI Research and Community contributors as the deep learning framework. Keeping in mind the factors that contribute to the selection of a good model, we have chosen SSD model along-side MobileNet Neural network as the base architecture as it results in both faster rate of result production and has a moderate accuracy.\",\"PeriodicalId\":6777,\"journal\":{\"name\":\"2021 International Conference on Innovative Computing, Intelligent Communication and Smart Electrical Systems (ICSES)\",\"volume\":\"9 1\",\"pages\":\"1-7\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-09-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 International Conference on Innovative Computing, Intelligent Communication and Smart Electrical Systems (ICSES)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICSES52305.2021.9633965\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Innovative Computing, Intelligent Communication and Smart Electrical Systems (ICSES)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSES52305.2021.9633965","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
摘要
在计算机视觉领域,一直在不断的成长和发展,其主要重点是促进机器与人之间的顺畅交互。感知、规划和控制是构成自动驾驶系统的主要方面。感知子系统将传感器或其他信息捕获设备收集的原始数据转换为我们周围环境的模型。规划子系统对该模型的周边环境进行分析,并根据分析得出的推论做出有针对性的决策。最后,控制子系统负责执行先前计划的操作或决策。本项目的范围是研究和分析自动驾驶汽车物体检测领域感知子系统所面临的问题。此前,无人驾驶汽车采用雷达、激光雷达、GPS和各种其他传感器等技术来绘制汽车周围的地图。然而,在最近的过去,一些深度神经网络(DNN)架构,如YOLO (You Only Look Once)和SSD (Single Shot MultiBox Detector)已经被开发出来,即使将实时视频视为输入,也能够检测到物体,因此有可能被纳入无人驾驶汽车系统的一部分。为了满足无人驾驶汽车中物体检测的要求,选择一个具有相当精度并以更快的速度产生结果的模型是非常必要的。在这个项目中,我们使用了由伯克利人工智能研究和社区贡献者开发的Caffe作为深度学习框架。考虑到有助于选择一个好的模型的因素,我们选择了SSD模型和MobileNet神经网络作为基础架构,因为它既可以更快地产生结果,又具有中等的准确性。
Object Detection in Self Driving Cars Using Deep Learning
In the Computer Vision domain, there has been continuous growth and development with main focus so as to facilitate a smooth interaction between Machines and human. Perception, planning and control are the main aspects that make up the Self-driving system. Perception subsystem converts the raw data collected by sensors or other information capturing devices into a model of the environment surrounding us. Planning subsystem analyses this model of the surrounding environment and makes certain purposeful decisions based on the inferences obtained from the analysis. Finally, the Control Subsystem is responsible for execution of the actions or the decisions planned previously. The scope of this project is to study and analyze the problems faced in the Perception subsystem in the domain of detecting objects for autonomous cars. Previously, technologies like Radar, LiDAR, GPS and various other sensors had been employed for Driverless cars for mapping the surroundings of the car. However, in the recent past, some deep neural network (DNN) architectures like YOLO (You Only Look Once), and SSD (Single Shot MultiBox Detector) have been developed which are capable of detecting objects even when live video is considered as the input, thus having potential to be included as a part of the Driverless car systems. Selection of a model having considerable accuracy and producing results at a faster rate is very much essential so as to meet the requirements of object detection in driverless cars. In this project, we have used Caffe, which is developed by Berkeley AI Research and Community contributors as the deep learning framework. Keeping in mind the factors that contribute to the selection of a good model, we have chosen SSD model along-side MobileNet Neural network as the base architecture as it results in both faster rate of result production and has a moderate accuracy.