Naman Jain, Shreesha Yerragolla, Tanuja Guha, Mohana
{"title":"基于神经网络的交通监控目标检测与跟踪算法性能分析","authors":"Naman Jain, Shreesha Yerragolla, Tanuja Guha, Mohana","doi":"10.1109/I-SMAC47947.2019.9032502","DOIUrl":null,"url":null,"abstract":"The single object detection has been performed by using the concepts of convolution layers. A neural network consists of several different layers such as the input layer, at least one hidden layer, and an output layer. The dataset used for single object detection is the on-road vehicle dataset. This dataset consists of three classes of images which are Heavy, Auto and Light. The dataset consists of images of varying illuminations. The performance metrics has been calculated for the day dataset, evening dataset and night dataset. Multiple object detection has been performed using the You Only Look Once (YOLOv3) algorithm. This approach encompasses a single deep convolution neural network dividing the input into a cell grid and each cell predicts a boundary box and classifies object directly. The dataset used for multiple object detection is the KITTI dataset. It consists of 80 classes out of which five classes has been considered for this project which are: car, bus, truck, and motorcycle and train. Using the Multiple Object Detection concepts, tracking of vehicles was further implemented. The first frame of the video was taken and Multiple object detection was performed and in the further frames of the video the object was tracked using its centroid position. This has been developed using OpenCV and Python using YOLOv3 algorithm for the object detection phase.","PeriodicalId":275791,"journal":{"name":"2019 Third International conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"19","resultStr":"{\"title\":\"Performance Analysis of Object Detection and Tracking Algorithms for Traffic Surveillance Applications using Neural Networks\",\"authors\":\"Naman Jain, Shreesha Yerragolla, Tanuja Guha, Mohana\",\"doi\":\"10.1109/I-SMAC47947.2019.9032502\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The single object detection has been performed by using the concepts of convolution layers. A neural network consists of several different layers such as the input layer, at least one hidden layer, and an output layer. The dataset used for single object detection is the on-road vehicle dataset. This dataset consists of three classes of images which are Heavy, Auto and Light. The dataset consists of images of varying illuminations. The performance metrics has been calculated for the day dataset, evening dataset and night dataset. Multiple object detection has been performed using the You Only Look Once (YOLOv3) algorithm. This approach encompasses a single deep convolution neural network dividing the input into a cell grid and each cell predicts a boundary box and classifies object directly. The dataset used for multiple object detection is the KITTI dataset. It consists of 80 classes out of which five classes has been considered for this project which are: car, bus, truck, and motorcycle and train. Using the Multiple Object Detection concepts, tracking of vehicles was further implemented. The first frame of the video was taken and Multiple object detection was performed and in the further frames of the video the object was tracked using its centroid position. This has been developed using OpenCV and Python using YOLOv3 algorithm for the object detection phase.\",\"PeriodicalId\":275791,\"journal\":{\"name\":\"2019 Third International conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC)\",\"volume\":\"25 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"19\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 Third International conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/I-SMAC47947.2019.9032502\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 Third International conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/I-SMAC47947.2019.9032502","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 19
摘要
利用卷积层的概念进行了单目标检测。神经网络由几个不同的层组成,如输入层、至少一个隐藏层和一个输出层。用于单目标检测的数据集是道路车辆数据集。该数据集由三类图像组成,分别是Heavy, Auto和Light。数据集由不同光照的图像组成。计算了白天数据集、晚上数据集和夜间数据集的性能指标。使用YOLOv3 (You Only Look Once)算法进行多目标检测。该方法包含一个深度卷积神经网络,将输入划分为一个单元格,每个单元格预测一个边界框并直接对对象进行分类。用于多目标检测的数据集是KITTI数据集。它由80个类别组成,其中五个类别已被考虑用于该项目:汽车,公共汽车,卡车,摩托车和火车。利用多目标检测的概念,进一步实现了车辆的跟踪。拍摄视频的第一帧并执行多目标检测,并在视频的其他帧中使用其质心位置跟踪目标。这是使用OpenCV和Python开发的,使用YOLOv3算法进行对象检测阶段。
Performance Analysis of Object Detection and Tracking Algorithms for Traffic Surveillance Applications using Neural Networks
The single object detection has been performed by using the concepts of convolution layers. A neural network consists of several different layers such as the input layer, at least one hidden layer, and an output layer. The dataset used for single object detection is the on-road vehicle dataset. This dataset consists of three classes of images which are Heavy, Auto and Light. The dataset consists of images of varying illuminations. The performance metrics has been calculated for the day dataset, evening dataset and night dataset. Multiple object detection has been performed using the You Only Look Once (YOLOv3) algorithm. This approach encompasses a single deep convolution neural network dividing the input into a cell grid and each cell predicts a boundary box and classifies object directly. The dataset used for multiple object detection is the KITTI dataset. It consists of 80 classes out of which five classes has been considered for this project which are: car, bus, truck, and motorcycle and train. Using the Multiple Object Detection concepts, tracking of vehicles was further implemented. The first frame of the video was taken and Multiple object detection was performed and in the further frames of the video the object was tracked using its centroid position. This has been developed using OpenCV and Python using YOLOv3 algorithm for the object detection phase.