Rahul B. Diwate, Atharva Zagade, M. Khodaskar, Varsha R. Dange
{"title":"Optimization in Object Detection Model using YOLO.v3","authors":"Rahul B. Diwate, Atharva Zagade, M. Khodaskar, Varsha R. Dange","doi":"10.1109/ESCI53509.2022.9758381","DOIUrl":null,"url":null,"abstract":"Object Detection is one of the important entities in the field of Computer Vision with a large number of applications. This project demonstrates Object detection using You Only Look Once (YOLO) Algorithm, version 3. YOLOv3 method is prominently used in object detection methods which are based on Deep Learning. It uses k-means cluster method for creating bounding boxes of specific height and width, which are used for predicting output. The model training is based on the Common Object in Context (COCO) Dataset. The dataset has around 164K images based on 80 categories, also called as classes. Thus, this object detection model takes an image from the user and then with the help of YOLO algorithm, predicts the types of objects present in that image and marks them accurately., the lower complex CNN model achieves an accuracy of 0.93.","PeriodicalId":436539,"journal":{"name":"2022 International Conference on Emerging Smart Computing and Informatics (ESCI)","volume":"70 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Emerging Smart Computing and Informatics (ESCI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ESCI53509.2022.9758381","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Object Detection is one of the important entities in the field of Computer Vision with a large number of applications. This project demonstrates Object detection using You Only Look Once (YOLO) Algorithm, version 3. YOLOv3 method is prominently used in object detection methods which are based on Deep Learning. It uses k-means cluster method for creating bounding boxes of specific height and width, which are used for predicting output. The model training is based on the Common Object in Context (COCO) Dataset. The dataset has around 164K images based on 80 categories, also called as classes. Thus, this object detection model takes an image from the user and then with the help of YOLO algorithm, predicts the types of objects present in that image and marks them accurately., the lower complex CNN model achieves an accuracy of 0.93.