Improving face recognition in surveillance video with judicious selection and fusion of representative frames

Proceedings of the 2nd ACM International Conference on Multimedia in Asia Pub Date : 2021-03-07 DOI:10.1145/3444685.3446259

Zhaozhen Ding, Qingfang Zheng, Chunhua Hou, Guang Shen

引用次数: 0

Abstract

Face recognition in unconstrained surveillance videos is challenging due to the different acquisition settings and face variations. We propose to utilize the complementary correlation between multi-frames to improve face recognition performance. We design an algorithm to build a representative frame set from the video sequence, selecting faces with high quality and large appearance diversity. We also devise a refined Deep Residual Equivariant Mapping (DREAM) block to improve the discriminative power of the extracted deep features. Extensive experiments on two relevant face recognition benchmarks, YouTube Face and IJB-A, show the effectiveness of the proposed method. Our work is also lightweight, and can be easily embedded into existing CNN based face recognition systems.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

通过代表性帧的选择和融合，提高监控视频中的人脸识别能力

由于采集设置和人脸变化的不同，无约束监控视频中的人脸识别具有挑战性。我们提出利用多帧之间的互补相关性来提高人脸识别性能。我们设计了一种算法，从视频序列中构建具有代表性的帧集，选择具有高质量和大外观多样性的人脸。我们还设计了一个改进的深度残差等变映射(DREAM)块，以提高提取的深度特征的判别能力。在YouTube face和IJB-A两个相关的人脸识别基准上进行的大量实验表明了所提出方法的有效性。我们的工作也是轻量级的，可以很容易地嵌入到现有的基于CNN的人脸识别系统中。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proceedings of the 2nd ACM International Conference on Multimedia in Asia

自引率

0.00%

发文量

期刊最新文献

Storyboard relational model for group activity recognition Objective object segmentation visual quality evaluation based on pixel-level and region-level characteristics Multiplicative angular margin loss for text-based person search Distilling knowledge in causal inference for unbiased visual question answering A large-scale image retrieval system for everyday scenes