多任务级联密集连接卷积网络在人脸检测和面部表情识别系统中的应用

2019 International Automatic Control Conference (CACS) Pub Date : 2019-11-01 DOI:10.1109/CACS47674.2019.9024357

Kuan-Yu Chou, Yi-Wen Cheng, Wei-Ren Chen, Yon-Ping Chen

{"title":"多任务级联密集连接卷积网络在人脸检测和面部表情识别系统中的应用","authors":"Kuan-Yu Chou, Yi-Wen Cheng, Wei-Ren Chen, Yon-Ping Chen","doi":"10.1109/CACS47674.2019.9024357","DOIUrl":null,"url":null,"abstract":"Face detection and recognition is an important issue and a difficult task in computer vision and human-computer interaction. Recently, with the development of deep learning, several related technologies have been proposed for face detection and facial expression recognition (FER), and the outstanding convolutional neural networks are the most common used in this field. This thesis applies the multi-task cascade convolutional neural network to face detection, and then designs the real-time FER system based on densely connected convolution network (DenseNet). The system first scales the input image to an image pyramid, and then uses the hierarchical network to determine whether a candidate window includes a human face. If a face exists, then send the candidate window to the FER system. Since DenseNet possesses the property of feature reuse, it can effectively reduce the amount of parameters and computation efforts, beneficial to develop the real-time system. In order to capture the variation of facial muscle in different expressions, this architecture adopts convolution operations with a stride 1 and tries different numbers of dense blocks. Through experiments, the proposed system can achieve real-time recognition in 30FPS and with recognition accuracy better than human eyes.","PeriodicalId":247039,"journal":{"name":"2019 International Automatic Control Conference (CACS)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Multi-task Cascaded and Densely Connected Convolutional Networks Applied to Human Face Detection and Facial Expression Recognition System\",\"authors\":\"Kuan-Yu Chou, Yi-Wen Cheng, Wei-Ren Chen, Yon-Ping Chen\",\"doi\":\"10.1109/CACS47674.2019.9024357\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Face detection and recognition is an important issue and a difficult task in computer vision and human-computer interaction. Recently, with the development of deep learning, several related technologies have been proposed for face detection and facial expression recognition (FER), and the outstanding convolutional neural networks are the most common used in this field. This thesis applies the multi-task cascade convolutional neural network to face detection, and then designs the real-time FER system based on densely connected convolution network (DenseNet). The system first scales the input image to an image pyramid, and then uses the hierarchical network to determine whether a candidate window includes a human face. If a face exists, then send the candidate window to the FER system. Since DenseNet possesses the property of feature reuse, it can effectively reduce the amount of parameters and computation efforts, beneficial to develop the real-time system. In order to capture the variation of facial muscle in different expressions, this architecture adopts convolution operations with a stride 1 and tries different numbers of dense blocks. Through experiments, the proposed system can achieve real-time recognition in 30FPS and with recognition accuracy better than human eyes.\",\"PeriodicalId\":247039,\"journal\":{\"name\":\"2019 International Automatic Control Conference (CACS)\",\"volume\":\"33 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 International Automatic Control Conference (CACS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CACS47674.2019.9024357\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Automatic Control Conference (CACS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CACS47674.2019.9024357","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

摘要

人脸检测与识别是计算机视觉和人机交互领域的一个重要课题和难点。近年来，随着深度学习的发展，人脸检测和面部表情识别(FER)的相关技术被提出，其中最突出的是卷积神经网络。本文将多任务级联卷积神经网络应用于人脸检测，设计了基于密集连接卷积网络(DenseNet)的实时人脸识别系统。该系统首先将输入图像缩放为图像金字塔，然后使用分层网络确定候选窗口是否包含人脸。如果存在，则将候选窗口发送到FER系统。由于DenseNet具有特征重用的特性，可以有效地减少参数量和计算量，有利于实时系统的开发。为了捕捉面部肌肉在不同表情下的变化，该架构采用步长为1的卷积运算，并尝试不同数量的密集块。通过实验，该系统可以在30FPS的速度下实现实时识别，识别精度优于人眼。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Multi-task Cascaded and Densely Connected Convolutional Networks Applied to Human Face Detection and Facial Expression Recognition System

Face detection and recognition is an important issue and a difficult task in computer vision and human-computer interaction. Recently, with the development of deep learning, several related technologies have been proposed for face detection and facial expression recognition (FER), and the outstanding convolutional neural networks are the most common used in this field. This thesis applies the multi-task cascade convolutional neural network to face detection, and then designs the real-time FER system based on densely connected convolution network (DenseNet). The system first scales the input image to an image pyramid, and then uses the hierarchical network to determine whether a candidate window includes a human face. If a face exists, then send the candidate window to the FER system. Since DenseNet possesses the property of feature reuse, it can effectively reduce the amount of parameters and computation efforts, beneficial to develop the real-time system. In order to capture the variation of facial muscle in different expressions, this architecture adopts convolution operations with a stride 1 and tries different numbers of dense blocks. Through experiments, the proposed system can achieve real-time recognition in 30FPS and with recognition accuracy better than human eyes.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2019 International Automatic Control Conference (CACS)

自引率

0.00%

发文量