Meygen D. Cruz, J. Keh, Ramiel G. Deticio, Carl Vincent T. Tan, John Anthony C. Jose, E. Dadios
{"title":"A People Counting System for Use in CCTV Cameras in Retail","authors":"Meygen D. Cruz, J. Keh, Ramiel G. Deticio, Carl Vincent T. Tan, John Anthony C. Jose, E. Dadios","doi":"10.1109/HNICEM51456.2020.9400048","DOIUrl":null,"url":null,"abstract":"This paper focuses on the feasibility of implementing a vision-based people counting system using footage from an existing surveillance camera in a restaurant establishment. The main challenge is to do so given the unique fixed viewpoint of the camera, which is optimized for security instead of data analytics. A three-step approach, namely people detection, tracking, and then people counting, is employed in creating the system. Neural networks such as YOLOv3 and Deep SORT are used. The proponents then partnered with a retail establishment in a high-traffic business district, to test the system. The results show that it is possible to achieve an accuracy of 82.76% for days when the restaurant waiting area is not crowded. The system also achieved an overall accuracy of 66.17% over five days of extensive testing, which includes extreme conditions wherein people in the video are densely packed and occluded. However, the system performance and accuracy can still be improved through downsizing the frames, retraining the models, and exploring other models.","PeriodicalId":230810,"journal":{"name":"2020 IEEE 12th International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment, and Management (HNICEM)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 12th International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment, and Management (HNICEM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HNICEM51456.2020.9400048","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This paper focuses on the feasibility of implementing a vision-based people counting system using footage from an existing surveillance camera in a restaurant establishment. The main challenge is to do so given the unique fixed viewpoint of the camera, which is optimized for security instead of data analytics. A three-step approach, namely people detection, tracking, and then people counting, is employed in creating the system. Neural networks such as YOLOv3 and Deep SORT are used. The proponents then partnered with a retail establishment in a high-traffic business district, to test the system. The results show that it is possible to achieve an accuracy of 82.76% for days when the restaurant waiting area is not crowded. The system also achieved an overall accuracy of 66.17% over five days of extensive testing, which includes extreme conditions wherein people in the video are densely packed and occluded. However, the system performance and accuracy can still be improved through downsizing the frames, retraining the models, and exploring other models.