DCTNets: Deep crowd transfer networks for an approximate crowd counting

Arslan Ali , Weihua Ou , Saima Kanwal
{"title":"DCTNets: Deep crowd transfer networks for an approximate crowd counting","authors":"Arslan Ali ,&nbsp;Weihua Ou ,&nbsp;Saima Kanwal","doi":"10.1016/j.cogr.2022.03.004","DOIUrl":null,"url":null,"abstract":"<div><p>Due to the numerous real-world applications of the crowd counting job, it has become a popular research topic. Modern crowd counting systems have a sophisticated structure and employ a filter on a big image size, making them difficult to use. Because these technologies are computationally intensive and difficult to implement in small surveillance systems, they are not appropriate for use in small surveillance systems. They also function poorly in a variety of sizes and densities, as well. Transfer learning and deep convolutional neural network architecture are used to create a modest but efficient network, which we describe herein. We named the proposed crowd counting architecture deep crowd transfer network (DCTNets) since it incorporates both deep learning and transfer learning principles into a single system. Among DCTNets’ key components are a detection module that is based on mask R-CNNs and an estimate module that is based on deep convolutional neural networks. In the first step, we apply transfer learning to the Mask R-CNN model using the datasets ShanghaiTech, JHU-CROWD++, and UCF-QNRF. After that, we train and evaluate the complete architecture on these datasets using the transfer learning results. Input images are sent through a Mask R-CNN, which counts individuals and segments the counted region, then through an estimation network, which estimates the population size, and finally through a merge of the outputs from the two models. According to the findings of comparative tests, the proposed model outperforms existing state-of-the-art approaches on the ShanghaiTech, JHU-CROWD++, and UCF-QNRF datasets.</p></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"2 ","pages":"Pages 96-111"},"PeriodicalIF":0.0000,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667241322000076/pdfft?md5=9be2d6987eecd7631f37947f00a23f45&pid=1-s2.0-S2667241322000076-main.pdf","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cognitive Robotics","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2667241322000076","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Due to the numerous real-world applications of the crowd counting job, it has become a popular research topic. Modern crowd counting systems have a sophisticated structure and employ a filter on a big image size, making them difficult to use. Because these technologies are computationally intensive and difficult to implement in small surveillance systems, they are not appropriate for use in small surveillance systems. They also function poorly in a variety of sizes and densities, as well. Transfer learning and deep convolutional neural network architecture are used to create a modest but efficient network, which we describe herein. We named the proposed crowd counting architecture deep crowd transfer network (DCTNets) since it incorporates both deep learning and transfer learning principles into a single system. Among DCTNets’ key components are a detection module that is based on mask R-CNNs and an estimate module that is based on deep convolutional neural networks. In the first step, we apply transfer learning to the Mask R-CNN model using the datasets ShanghaiTech, JHU-CROWD++, and UCF-QNRF. After that, we train and evaluate the complete architecture on these datasets using the transfer learning results. Input images are sent through a Mask R-CNN, which counts individuals and segments the counted region, then through an estimation network, which estimates the population size, and finally through a merge of the outputs from the two models. According to the findings of comparative tests, the proposed model outperforms existing state-of-the-art approaches on the ShanghaiTech, JHU-CROWD++, and UCF-QNRF datasets.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
DCTNets:用于近似人群计数的深度人群转移网络
由于人群计数工作在现实世界中的大量应用,它已成为一个热门的研究课题。现代人群计数系统具有复杂的结构,并且在大图像尺寸上使用过滤器,这使得它们难以使用。由于这些技术计算量大,难以在小型监控系统中实现,因此不适合在小型监控系统中使用。它们在各种大小和密度下的功能也很差。使用迁移学习和深度卷积神经网络架构来创建一个适度但高效的网络,我们在这里描述。我们将提出的人群计数架构命名为深度人群迁移网络(DCTNets),因为它将深度学习和迁移学习原理结合到一个系统中。DCTNets的关键组件包括基于掩码r - cnn的检测模块和基于深度卷积神经网络的估计模块。在第一步,我们将迁移学习应用于Mask R-CNN模型,使用数据集ShanghaiTech, JHU-CROWD++和UCF-QNRF。之后,我们使用迁移学习结果在这些数据集上训练和评估完整的架构。输入图像通过Mask R-CNN发送,该Mask R-CNN对个体进行计数,并对被计数的区域进行分割,然后通过估计网络发送,该网络估计种群大小,最后通过合并两个模型的输出。根据对比测试的结果,所提出的模型在上海科技、JHU-CROWD++和UCF-QNRF数据集上优于现有的最先进方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
8.40
自引率
0.00%
发文量
0
期刊最新文献
Optimizing Food Sample Handling and Placement Pattern Recognition with YOLO: Advanced Techniques in Robotic Object Detection Intelligent path planning for cognitive mobile robot based on Dhouib-Matrix-SPP method YOLOT: Multi-scale and diverse tire sidewall text region detection based on You-Only-Look-Once(YOLOv5) Scalable and cohesive swarm control based on reinforcement learning POMDP-based probabilistic decision making for path planning in wheeled mobile robot
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1