{"title":"Fast Regression Convolutional Neural Network for Visual Crowd Counting","authors":"S. Teoh, Vooi Voon Yap, H. Nisar","doi":"10.1109/ICCOINS49721.2021.9497198","DOIUrl":null,"url":null,"abstract":"This paper presents an improved convolutional neural network (CNN) architecture for accurate visual crowd counting in crowd images. Comprehensive analysis on the performance and inference speed of the network model are also presented. Multi-column convolutional neural network (MCNN) for visual crowd counting through predicted density map is widely used in previous works, however this method has limitation in predicting a quality density map. Instead, the proposed network is constructed by using the powerful CNN layers, dense layers, and one regressor node with whole image-based inference. Therefore, it is less computationally intensive and inference speed can be increased. Experiments have been conducted on Mall dataset. Moreover, benchmarking on different CNN architectures have been conducted. The proposed network shows promising counting accuracy and reasonable inference speed against the existing state-of-art approaches.","PeriodicalId":245662,"journal":{"name":"2021 International Conference on Computer & Information Sciences (ICCOINS)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Computer & Information Sciences (ICCOINS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCOINS49721.2021.9497198","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
This paper presents an improved convolutional neural network (CNN) architecture for accurate visual crowd counting in crowd images. Comprehensive analysis on the performance and inference speed of the network model are also presented. Multi-column convolutional neural network (MCNN) for visual crowd counting through predicted density map is widely used in previous works, however this method has limitation in predicting a quality density map. Instead, the proposed network is constructed by using the powerful CNN layers, dense layers, and one regressor node with whole image-based inference. Therefore, it is less computationally intensive and inference speed can be increased. Experiments have been conducted on Mall dataset. Moreover, benchmarking on different CNN architectures have been conducted. The proposed network shows promising counting accuracy and reasonable inference speed against the existing state-of-art approaches.