A Deep Learning-Based Method for Crowd Counting Using Shunting Inhibition Mechanism

Fok Hing Chi Tivive;Abdesselam Bouzerdoum;Son Lam Phung;Hoang Thanh Le;Hamza Baali
{"title":"A Deep Learning-Based Method for Crowd Counting Using Shunting Inhibition Mechanism","authors":"Fok Hing Chi Tivive;Abdesselam Bouzerdoum;Son Lam Phung;Hoang Thanh Le;Hamza Baali","doi":"10.1109/TAI.2024.3443789","DOIUrl":null,"url":null,"abstract":"Image-based crowd counting has gained significant attention due to its widespread applications in security and surveillance. Recent advancements in deep learning have led to the development of numerous methods that have achieved remarkable success in accurately counting crowds. However, many of the existing deep learning methods, which have large model sizes, are unsuitable for deployment on edge devices. This article introduces a novel network architecture and processing element designed to create an efficient and compact deep learning model for crowd counting. The processing element, referred to as the shunting inhibitory neuron, generates complex decision boundaries, making it more powerful than the traditional perceptron. It is employed in both the encoder and decoder modules of the proposed model for feature extraction. Furthermore, the decoder includes alternating convolutional and transformer layers, which provide local receptive fields and global self-attention, respectively. This design captures rich contextual information that is used for generating accurate segmentation and density maps. The self-attention mechanism is implemented using convolution modulation instead of matrix multiplication to reduce computational costs. Experiments conducted on three challenging crowd counting datasets demonstrate that the proposed deep learning network, which comprises a small model size, achieves crowd counting performance comparable to that of state-of-the-art techniques. Codes are available at \n<uri>https://github.com/ftivive/SINet</uri>\n.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"5 11","pages":"5733-5745"},"PeriodicalIF":0.0000,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on artificial intelligence","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10636204/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Image-based crowd counting has gained significant attention due to its widespread applications in security and surveillance. Recent advancements in deep learning have led to the development of numerous methods that have achieved remarkable success in accurately counting crowds. However, many of the existing deep learning methods, which have large model sizes, are unsuitable for deployment on edge devices. This article introduces a novel network architecture and processing element designed to create an efficient and compact deep learning model for crowd counting. The processing element, referred to as the shunting inhibitory neuron, generates complex decision boundaries, making it more powerful than the traditional perceptron. It is employed in both the encoder and decoder modules of the proposed model for feature extraction. Furthermore, the decoder includes alternating convolutional and transformer layers, which provide local receptive fields and global self-attention, respectively. This design captures rich contextual information that is used for generating accurate segmentation and density maps. The self-attention mechanism is implemented using convolution modulation instead of matrix multiplication to reduce computational costs. Experiments conducted on three challenging crowd counting datasets demonstrate that the proposed deep learning network, which comprises a small model size, achieves crowd counting performance comparable to that of state-of-the-art techniques. Codes are available at https://github.com/ftivive/SINet .
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用分流抑制机制的基于深度学习的人群计数方法
基于图像的人群计数因其在安防和监控领域的广泛应用而备受关注。近来,深度学习技术的进步推动了众多方法的发展,这些方法在精确计数人群方面取得了显著的成功。然而,许多现有的深度学习方法都具有较大的模型规模,不适合在边缘设备上部署。本文介绍了一种新型网络架构和处理元件,旨在为人群计数创建一个高效、紧凑的深度学习模型。该处理元件被称为分流抑制神经元,可生成复杂的决策边界,使其比传统的感知器更强大。该模型的编码器和解码器模块都采用了该神经元进行特征提取。此外,解码器还包括交替卷积层和变换层,分别提供局部感受野和全局自我注意。这种设计可以捕捉丰富的上下文信息,用于生成精确的分割和密度图。自我注意机制是通过卷积调制而不是矩阵乘法实现的,以降低计算成本。在三个具有挑战性的人群计数数据集上进行的实验表明,所提出的深度学习网络具有较小的模型规模,其人群计数性能可与最先进的技术相媲美。代码见 https://github.com/ftivive/SINet。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
7.70
自引率
0.00%
发文量
0
期刊最新文献
Table of Contents Front Cover IEEE Transactions on Artificial Intelligence Publication Information Table of Contents Front Cover
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1