Micro-expression recognition based on multi-scale 3D residual convolutional neural network.

IF 2.6 4区 工程技术 Q1 Mathematics Mathematical Biosciences and Engineering Pub Date : 2024-03-01 DOI:10.3934/mbe.2024221
Hongmei Jin, Ning He, Zhanli Li, Pengcheng Yang
{"title":"Micro-expression recognition based on multi-scale 3D residual convolutional neural network.","authors":"Hongmei Jin, Ning He, Zhanli Li, Pengcheng Yang","doi":"10.3934/mbe.2024221","DOIUrl":null,"url":null,"abstract":"<p><p>In demanding application scenarios such as clinical psychotherapy and criminal interrogation, the accurate recognition of micro-expressions is of utmost importance but poses significant challenges. One of the main difficulties lies in effectively capturing weak and fleeting facial features and improving recognition performance. To address this fundamental issue, this paper proposed a novel architecture based on a multi-scale 3D residual convolutional neural network. The algorithm leveraged a deep 3D-ResNet50 as the skeleton model and utilized the micro-expression optical flow feature map as the input for the network model. Drawing upon the complex spatial and temporal features inherent in micro-expressions, the network incorporated multi-scale convolutional modules of varying sizes to integrate both global and local information. Furthermore, an attention mechanism feature fusion module was introduced to enhance the model's contextual awareness. Finally, to optimize the model's prediction of the optimal solution, a discriminative network structure with multiple output channels was constructed. The algorithm's performance was evaluated using the public datasets SMIC, SAMM, and CASME Ⅱ. The experimental results demonstrated that the proposed algorithm achieves recognition accuracies of 74.6, 84.77 and 91.35% on these datasets, respectively. This substantial improvement in efficiency compared to existing mainstream methods for extracting micro-expression subtle features effectively enhanced micro-expression recognition performance and increased the accuracy of high-precision micro-expression recognition. Consequently, this paper served as an important reference for researchers working on high-precision micro-expression recognition.</p>","PeriodicalId":49870,"journal":{"name":"Mathematical Biosciences and Engineering","volume":null,"pages":null},"PeriodicalIF":2.6000,"publicationDate":"2024-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mathematical Biosciences and Engineering","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.3934/mbe.2024221","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Mathematics","Score":null,"Total":0}
引用次数: 0

Abstract

In demanding application scenarios such as clinical psychotherapy and criminal interrogation, the accurate recognition of micro-expressions is of utmost importance but poses significant challenges. One of the main difficulties lies in effectively capturing weak and fleeting facial features and improving recognition performance. To address this fundamental issue, this paper proposed a novel architecture based on a multi-scale 3D residual convolutional neural network. The algorithm leveraged a deep 3D-ResNet50 as the skeleton model and utilized the micro-expression optical flow feature map as the input for the network model. Drawing upon the complex spatial and temporal features inherent in micro-expressions, the network incorporated multi-scale convolutional modules of varying sizes to integrate both global and local information. Furthermore, an attention mechanism feature fusion module was introduced to enhance the model's contextual awareness. Finally, to optimize the model's prediction of the optimal solution, a discriminative network structure with multiple output channels was constructed. The algorithm's performance was evaluated using the public datasets SMIC, SAMM, and CASME Ⅱ. The experimental results demonstrated that the proposed algorithm achieves recognition accuracies of 74.6, 84.77 and 91.35% on these datasets, respectively. This substantial improvement in efficiency compared to existing mainstream methods for extracting micro-expression subtle features effectively enhanced micro-expression recognition performance and increased the accuracy of high-precision micro-expression recognition. Consequently, this paper served as an important reference for researchers working on high-precision micro-expression recognition.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于多尺度三维残差卷积神经网络的微表情识别。
在临床心理治疗和刑事审讯等要求苛刻的应用场景中,准确识别微表情至关重要,但也带来了巨大挑战。其中一个主要困难在于如何有效捕捉微弱和短暂的面部特征并提高识别性能。为解决这一根本问题,本文提出了一种基于多尺度三维残差卷积神经网络的新型架构。该算法利用深度 3D-ResNet50 作为骨架模型,并将微表情光流特征图作为网络模型的输入。利用微表情固有的复杂空间和时间特征,该网络纳入了不同规模的多尺度卷积模块,以整合全局和局部信息。此外,还引入了注意力机制特征融合模块,以增强模型的情境意识。最后,为了优化模型对最优解的预测,构建了一个具有多个输出通道的判别网络结构。利用公共数据集 SMIC、SAMM 和 CASME Ⅱ 评估了算法的性能。实验结果表明,所提出的算法在这些数据集上的识别准确率分别达到了 74.6%、84.77% 和 91.35%。与现有的提取微表情细微特征的主流方法相比,该算法的效率有了大幅提高,有效地增强了微表情识别性能,提高了高精度微表情识别的准确率。因此,本文对研究人员进行高精度微表情识别具有重要的参考价值。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Mathematical Biosciences and Engineering
Mathematical Biosciences and Engineering 工程技术-数学跨学科应用
CiteScore
3.90
自引率
7.70%
发文量
586
审稿时长
>12 weeks
期刊介绍: Mathematical Biosciences and Engineering (MBE) is an interdisciplinary Open Access journal promoting cutting-edge research, technology transfer and knowledge translation about complex data and information processing. MBE publishes Research articles (long and original research); Communications (short and novel research); Expository papers; Technology Transfer and Knowledge Translation reports (description of new technologies and products); Announcements and Industrial Progress and News (announcements and even advertisement, including major conferences).
期刊最新文献
CTFusion: CNN-transformer-based self-supervised learning for infrared and visible image fusion. Video-based person re-identification with complementary local and global features using a graph transformer. Modeling free tumor growth: Discrete, continuum, and hybrid approaches to interpreting cancer development. Retraction notice to "A video images-aware knowledge extraction method for intelligent healthcare management of basketball players" [Mathematical Biosciences and Engineering 20(2) (2023) 1919-1937]. Improved optimizer with deep learning model for emotion detection and classification.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1