A pedestrian detection algorithm based on deep learning

Jiangkun Lu, Hongyang Chen
{"title":"A pedestrian detection algorithm based on deep learning","authors":"Jiangkun Lu, Hongyang Chen","doi":"10.1145/3558819.3565151","DOIUrl":null,"url":null,"abstract":"In order to solve the problem of crowd counting and crowd density statistics in the security intelligent video surveillance system, this paper adopts the method based on deep learning to optimize the algorithm. This method mainly uses the VGG16 backbone network with 1*1 and 3*3 small convolution Classification and feature extraction of people information in a crowd. In order to reduce the sharp reduction in the number of positive samples after increasing the threshold, and to avoid the situation where using different thresholds during training and testing will cause the performance of the detector to degrade, this paper draws on the cascade structure of the Cascade R-CNN network for input video. The frame images are analyzed and processed, and different IoU thresholds are set at different stages to obtain enough positive samples to reduce over-fitting, and use the multi-task loss function and the Hadamard product to obtain the pedestrian detection network, and output the final number of people. The improved pedestrian counting algorithm in this paper is tested in the public dataset WorldExpo'10 Crowd Counting, and compared with other algorithms to verify the feasibility and effectiveness of this method.","PeriodicalId":373484,"journal":{"name":"Proceedings of the 7th International Conference on Cyber Security and Information Engineering","volume":"49 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 7th International Conference on Cyber Security and Information Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3558819.3565151","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In order to solve the problem of crowd counting and crowd density statistics in the security intelligent video surveillance system, this paper adopts the method based on deep learning to optimize the algorithm. This method mainly uses the VGG16 backbone network with 1*1 and 3*3 small convolution Classification and feature extraction of people information in a crowd. In order to reduce the sharp reduction in the number of positive samples after increasing the threshold, and to avoid the situation where using different thresholds during training and testing will cause the performance of the detector to degrade, this paper draws on the cascade structure of the Cascade R-CNN network for input video. The frame images are analyzed and processed, and different IoU thresholds are set at different stages to obtain enough positive samples to reduce over-fitting, and use the multi-task loss function and the Hadamard product to obtain the pedestrian detection network, and output the final number of people. The improved pedestrian counting algorithm in this paper is tested in the public dataset WorldExpo'10 Crowd Counting, and compared with other algorithms to verify the feasibility and effectiveness of this method.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
一种基于深度学习的行人检测算法
为了解决安防智能视频监控系统中的人群计数和人群密度统计问题,本文采用基于深度学习的方法对算法进行优化。该方法主要利用VGG16骨干网的1*1和3*3小卷积对人群中的人物信息进行分类和特征提取。为了减少增加阈值后阳性样本数量急剧减少的情况,避免在训练和测试过程中使用不同的阈值会导致检测器性能下降的情况,本文对输入视频借鉴了cascade R-CNN网络的级联结构。对帧图像进行分析和处理,在不同阶段设置不同的IoU阈值,以获得足够的正样本,减少过拟合,并利用多任务损失函数和Hadamard积得到行人检测网络,并输出最终的人数。本文改进的行人计数算法在公共数据集world dexpo’10 Crowd counting中进行了测试,并与其他算法进行了比较,验证了该方法的可行性和有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Development and Application of Portable Multi-Function Power Distribution Emergency Repair Standardized Equipment Research on Automatic Self-healing Control of Intelligent Feeder based on Multi-Agent Algorithm Research and implementation of IP address management in medium and large-scale local area networks Application of Compressive Sensing Technology and Image Processing in Space Exploration House Price Prediction Model Using Bridge Memristors Recurrent Neural Network
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1