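Multi-task Cascaded and Densely Connected Convolutional Networks Applied to Human Face Detection and Facial Expression Recognition System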

Kuan-Yu Chou, Yi-Wen Cheng, Wei-Ren Chen, Yon-Ping Chen
DOI: 10.1109/CACS47674.2019.9024357 (https://doi.org/10.1109/CACS47674.2019.9024357)
Published in: 2019 International Automatic Control Conference (CACS), 2019-11-01
Citations: 3

Abstract

Face detection and recognition are important and difficult problems in computer vision and human-computer interaction. With the recent development of deep learning, several techniques have been proposed for face detection and facial expression recognition (FER), and convolutional neural networks are the most commonly used approach in this field. This thesis applies the multi-task cascaded convolutional neural network to face detection and then designs a real-time FER system based on the densely connected convolutional network (DenseNet). The system first scales the input image into an image pyramid and then uses the hierarchical network to determine whether a candidate window contains a human face. If a face is found, the candidate window is sent to the FER system. Because DenseNet reuses features, it effectively reduces the number of parameters and the amount of computation, which helps in building a real-time system. To capture the variation of facial muscles across different expressions, the architecture adopts convolution operations with a stride of 1 and tries different numbers of dense blocks. In experiments, the proposed system achieves real-time recognition at 30 FPS with recognition accuracy better than that of the human eye.
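Two ideas in the abstract can be made concrete with a little arithmetic: how an image pyramid's scale factors might be chosen, and why feature reuse in a dense block keeps each layer small. The sketch below is illustrative only. The function names and all default values (a 12-pixel detector window, a 20-pixel minimum face, a 0.709 shrink factor, and the growth rate used in the example) are assumptions borrowed from common MTCNN and DenseNet configurations, not settings reported by the authors.

```python
"""Illustrative arithmetic only; every default value here is an assumption,
not a setting taken from the paper."""


def pyramid_scales(height, width, min_face_size=20, factor=0.709, net_input=12):
    """Return the scale factors for an image pyramid.

    The first scale maps a face of `min_face_size` pixels onto the
    `net_input`-pixel window of the first-stage detector. Each further
    level shrinks the image by `factor` until its shorter side would be
    smaller than the detector window.
    """
    if min_face_size <= 0 or not 0 < factor < 1:
        raise ValueError("min_face_size must be > 0 and factor in (0, 1)")
    scale = net_input / min_face_size
    min_side = min(height, width) * scale
    scales = []
    while min_side >= net_input:
        scales.append(scale)
        scale *= factor
        min_side *= factor
    return scales


def dense_block_channels(in_channels, growth_rate, num_layers):
    """Channel bookkeeping for one dense block.

    Layer i receives the block input concatenated with the outputs of all
    i earlier layers, and itself adds only `growth_rate` new feature maps.
    Returns (input channels seen by each layer, block output channels).
    """
    per_layer_inputs = [in_channels + i * growth_rate for i in range(num_layers)]
    out_channels = in_channels + num_layers * growth_rate
    return per_layer_inputs, out_channels


if __name__ == "__main__":
    print(pyramid_scales(480, 640))
    print(dense_block_channels(16, 12, 4))
```

With the hypothetical values in the last line (16 input channels, growth rate 12, 4 layers), the layers see 16, 28, 40, and 52 input channels and the block outputs 64. Each layer contributes only 12 new feature maps while still having access to everything computed before it, which is the feature-reuse property the abstract credits for the lower parameter count.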