{"title":"DCTNets: Deep crowd transfer networks for an approximate crowd counting","authors":"Arslan Ali , Weihua Ou , Saima Kanwal","doi":"10.1016/j.cogr.2022.03.004","DOIUrl":null,"url":null,"abstract":"<div><p>Due to the numerous real-world applications of the crowd counting job, it has become a popular research topic. Modern crowd counting systems have a sophisticated structure and employ a filter on a big image size, making them difficult to use. Because these technologies are computationally intensive and difficult to implement in small surveillance systems, they are not appropriate for use in small surveillance systems. They also function poorly in a variety of sizes and densities, as well. Transfer learning and deep convolutional neural network architecture are used to create a modest but efficient network, which we describe herein. We named the proposed crowd counting architecture deep crowd transfer network (DCTNets) since it incorporates both deep learning and transfer learning principles into a single system. Among DCTNets’ key components are a detection module that is based on mask R-CNNs and an estimate module that is based on deep convolutional neural networks. In the first step, we apply transfer learning to the Mask R-CNN model using the datasets ShanghaiTech, JHU-CROWD++, and UCF-QNRF. After that, we train and evaluate the complete architecture on these datasets using the transfer learning results. Input images are sent through a Mask R-CNN, which counts individuals and segments the counted region, then through an estimation network, which estimates the population size, and finally through a merge of the outputs from the two models. According to the findings of comparative tests, the proposed model outperforms existing state-of-the-art approaches on the ShanghaiTech, JHU-CROWD++, and UCF-QNRF datasets.</p></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"2 ","pages":"Pages 96-111"},"PeriodicalIF":0.0000,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667241322000076/pdfft?md5=9be2d6987eecd7631f37947f00a23f45&pid=1-s2.0-S2667241322000076-main.pdf","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cognitive Robotics","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2667241322000076","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Due to the numerous real-world applications of the crowd counting job, it has become a popular research topic. Modern crowd counting systems have a sophisticated structure and employ a filter on a big image size, making them difficult to use. Because these technologies are computationally intensive and difficult to implement in small surveillance systems, they are not appropriate for use in small surveillance systems. They also function poorly in a variety of sizes and densities, as well. Transfer learning and deep convolutional neural network architecture are used to create a modest but efficient network, which we describe herein. We named the proposed crowd counting architecture deep crowd transfer network (DCTNets) since it incorporates both deep learning and transfer learning principles into a single system. Among DCTNets’ key components are a detection module that is based on mask R-CNNs and an estimate module that is based on deep convolutional neural networks. In the first step, we apply transfer learning to the Mask R-CNN model using the datasets ShanghaiTech, JHU-CROWD++, and UCF-QNRF. After that, we train and evaluate the complete architecture on these datasets using the transfer learning results. Input images are sent through a Mask R-CNN, which counts individuals and segments the counted region, then through an estimation network, which estimates the population size, and finally through a merge of the outputs from the two models. According to the findings of comparative tests, the proposed model outperforms existing state-of-the-art approaches on the ShanghaiTech, JHU-CROWD++, and UCF-QNRF datasets.