Micro-expression recognition based on multi-scale 3D residual convolutional neural network.

IF 2.6 4区工程技术 Q1 Mathematics Mathematical Biosciences and Engineering Pub Date : 2024-03-01 DOI:10.3934/mbe.2024221

Hongmei Jin, Ning He, Zhanli Li, Pengcheng Yang

{"title":"Micro-expression recognition based on multi-scale 3D residual convolutional neural network.","authors":"Hongmei Jin, Ning He, Zhanli Li, Pengcheng Yang","doi":"10.3934/mbe.2024221","DOIUrl":null,"url":null,"abstract":"<p><p>In demanding application scenarios such as clinical psychotherapy and criminal interrogation, the accurate recognition of micro-expressions is of utmost importance but poses significant challenges. One of the main difficulties lies in effectively capturing weak and fleeting facial features and improving recognition performance. To address this fundamental issue, this paper proposed a novel architecture based on a multi-scale 3D residual convolutional neural network. The algorithm leveraged a deep 3D-ResNet50 as the skeleton model and utilized the micro-expression optical flow feature map as the input for the network model. Drawing upon the complex spatial and temporal features inherent in micro-expressions, the network incorporated multi-scale convolutional modules of varying sizes to integrate both global and local information. Furthermore, an attention mechanism feature fusion module was introduced to enhance the model's contextual awareness. Finally, to optimize the model's prediction of the optimal solution, a discriminative network structure with multiple output channels was constructed. The algorithm's performance was evaluated using the public datasets SMIC, SAMM, and CASME Ⅱ. The experimental results demonstrated that the proposed algorithm achieves recognition accuracies of 74.6, 84.77 and 91.35% on these datasets, respectively. This substantial improvement in efficiency compared to existing mainstream methods for extracting micro-expression subtle features effectively enhanced micro-expression recognition performance and increased the accuracy of high-precision micro-expression recognition. Consequently, this paper served as an important reference for researchers working on high-precision micro-expression recognition.</p>","PeriodicalId":49870,"journal":{"name":"Mathematical Biosciences and Engineering","volume":"21 4","pages":"5007-5031"},"PeriodicalIF":2.6000,"publicationDate":"2024-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mathematical Biosciences and Engineering","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.3934/mbe.2024221","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Mathematics","Score":null,"Total":0}

引用次数: 0

Abstract

In demanding application scenarios such as clinical psychotherapy and criminal interrogation, the accurate recognition of micro-expressions is of utmost importance but poses significant challenges. One of the main difficulties lies in effectively capturing weak and fleeting facial features and improving recognition performance. To address this fundamental issue, this paper proposed a novel architecture based on a multi-scale 3D residual convolutional neural network. The algorithm leveraged a deep 3D-ResNet50 as the skeleton model and utilized the micro-expression optical flow feature map as the input for the network model. Drawing upon the complex spatial and temporal features inherent in micro-expressions, the network incorporated multi-scale convolutional modules of varying sizes to integrate both global and local information. Furthermore, an attention mechanism feature fusion module was introduced to enhance the model's contextual awareness. Finally, to optimize the model's prediction of the optimal solution, a discriminative network structure with multiple output channels was constructed. The algorithm's performance was evaluated using the public datasets SMIC, SAMM, and CASME Ⅱ. The experimental results demonstrated that the proposed algorithm achieves recognition accuracies of 74.6, 84.77 and 91.35% on these datasets, respectively. This substantial improvement in efficiency compared to existing mainstream methods for extracting micro-expression subtle features effectively enhanced micro-expression recognition performance and increased the accuracy of high-precision micro-expression recognition. Consequently, this paper served as an important reference for researchers working on high-precision micro-expression recognition.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于多尺度三维残差卷积神经网络的微表情识别。

在临床心理治疗和刑事审讯等要求苛刻的应用场景中，准确识别微表情至关重要，但也带来了巨大挑战。其中一个主要困难在于如何有效捕捉微弱和短暂的面部特征并提高识别性能。为解决这一根本问题，本文提出了一种基于多尺度三维残差卷积神经网络的新型架构。该算法利用深度 3D-ResNet50 作为骨架模型，并将微表情光流特征图作为网络模型的输入。利用微表情固有的复杂空间和时间特征，该网络纳入了不同规模的多尺度卷积模块，以整合全局和局部信息。此外，还引入了注意力机制特征融合模块，以增强模型的情境意识。最后，为了优化模型对最优解的预测，构建了一个具有多个输出通道的判别网络结构。利用公共数据集 SMIC、SAMM 和 CASME Ⅱ 评估了算法的性能。实验结果表明，所提出的算法在这些数据集上的识别准确率分别达到了 74.6%、84.77% 和 91.35%。与现有的提取微表情细微特征的主流方法相比，该算法的效率有了大幅提高，有效地增强了微表情识别性能，提高了高精度微表情识别的准确率。因此，本文对研究人员进行高精度微表情识别具有重要的参考价值。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Mathematical Biosciences and Engineering 工程技术-数学跨学科应用

CiteScore

3.90

自引率

7.70%

发文量

586

审稿时长

>12 weeks

期刊介绍： Mathematical Biosciences and Engineering (MBE) is an interdisciplinary Open Access journal promoting cutting-edge research, technology transfer and knowledge translation about complex data and information processing. MBE publishes Research articles (long and original research); Communications (short and novel research); Expository papers; Technology Transfer and Knowledge Translation reports (description of new technologies and products); Announcements and Industrial Progress and News (announcements and even advertisement, including major conferences).