Fuli Zhang, Yu Liu, Xiaoling Yu, Zhichen Wang, Qi Zhang, Jing Wang, Qionghua Zhang
Title: Towards facial micro-expression detection and classification using modified multimodal ensemble learning approach
Journal: Information Fusion, volume 115, Article 102735 (impact factor 14.7; JCR Q1, Computer Science, Artificial Intelligence)
DOI: 10.1016/j.inffus.2024.102735
Published: 2024-10-10
URL: https://www.sciencedirect.com/science/article/pii/S156625352400513X
Citations: 0
Abstract
A micro-expression (ME) is a fleeting, subtle, and localized facial gesture. It can expose the true feelings that someone is trying to hide and is regarded as a crucial indicator for spotting lies. Because of its potential applications in a variety of sectors, micro-expression research has attracted considerable attention. The accuracy of micro-expression recognition (MER) still needs to be improved, however, because the motions that make up micro-expressions are brief and weak. In recent years, deep convolutional neural methods have shown a high degree of effectiveness on the complex challenge of face detection. Although several attempts have been made at MER, the problem is far from resolved, as reflected in the low accuracy rates of existing models. This study presents a Facial Micro-Expression Detection and Classification using Modified Multimodal Ensemble Learning (FMEDC-MMEL) approach. The main aim of the FMEDC-MMEL technique is the accurate identification of MEs in facial images. As a pre-processing phase, the technique applies histogram equalization (HE) to improve image contrast. An improved densely connected network (DenseNet) model then learns feature patterns from the pre-processed images. To enhance the performance of the improved DenseNet model, a stochastic gradient descent (SGD) approach is used for hyperparameter selection. For facial ME detection, the FMEDC-MMEL technique uses an ensemble of three classifiers: bi-directional gated recurrent unit (Bi-GRU), long short-term memory (LSTM), and extreme learning machine (ELM). This tailored ensemble learning approach combines multiple machine learning models to improve classification performance and detection accuracy.
Sophisticated feature extraction methods are used to capture the subtle aspects of micro-expressions, and precision is maintained by optimizations that minimize computing cost. Empirical findings indicate that this methodology notably surpasses conventional techniques, providing enhanced precision and robustness on a variety of complex and demanding datasets. In addition to advancing micro-expression analysis research, the proposed strategy has potential real-world uses in fields including security, psychological testing, and human-computer interaction.
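The abstract names the pipeline stages but gives no implementation details. Purely as an illustration, the NumPy sketch below covers three of the simpler pieces: histogram equalization of an 8-bit grayscale image, an extreme learning machine classifier, and a way to combine classifier outputs. The names `equalize_histogram`, `ELMClassifier`, and `soft_vote`, the tanh activation, the hidden-layer size, and the softmax over ELM scores are all assumptions made for this example. In particular, averaging class probabilities (soft voting) is only one plausible fusion rule; the abstract does not say how the Bi-GRU, LSTM, and ELM outputs are combined. The DenseNet, Bi-GRU, and LSTM components are not shown.

```python
import numpy as np


def equalize_histogram(img: np.ndarray) -> np.ndarray:
    """Histogram-equalize an 8-bit grayscale image (2-D uint8 array)."""
    hist = np.bincount(img.ravel(), minlength=256)
    cdf = hist.cumsum()
    cdf_min = cdf[cdf > 0][0]            # CDF value at the darkest occupied level
    denom = img.size - cdf_min
    if denom == 0:                       # constant image: nothing to stretch
        return img.copy()
    # Rescale the CDF so that the occupied gray levels span 0..255.
    lut = np.clip(np.round((cdf - cdf_min) / denom * 255.0), 0, 255)
    return lut.astype(np.uint8)[img]


class ELMClassifier:
    """Extreme learning machine: random fixed hidden layer, least-squares output layer."""

    def __init__(self, n_hidden: int = 64, seed: int = 0):
        self.n_hidden = n_hidden         # assumed size, not taken from the paper
        self.rng = np.random.default_rng(seed)

    def _hidden(self, X: np.ndarray) -> np.ndarray:
        return np.tanh(X @ self.W + self.b)

    def fit(self, X: np.ndarray, y: np.ndarray, n_classes: int) -> "ELMClassifier":
        # Hidden weights are drawn once at random and never trained.
        self.W = self.rng.normal(size=(X.shape[1], self.n_hidden))
        self.b = self.rng.normal(size=self.n_hidden)
        targets = np.eye(n_classes)[y]   # one-hot targets
        # Output weights come from the Moore-Penrose pseudoinverse (least squares).
        self.beta = np.linalg.pinv(self._hidden(X)) @ targets
        return self

    def predict_proba(self, X: np.ndarray) -> np.ndarray:
        scores = self._hidden(X) @ self.beta
        # Softmax is an assumption here, used so scores can be averaged with other models.
        e = np.exp(scores - scores.max(axis=1, keepdims=True))
        return e / e.sum(axis=1, keepdims=True)


def soft_vote(probas: list) -> np.ndarray:
    """Average per-model class probabilities and return the argmax class per sample."""
    return np.mean(np.stack(probas), axis=0).argmax(axis=1)
```

In a fuller version, each of the three classifiers would return an array of shape `(n_samples, n_classes)` for the same batch of DenseNet features, and `soft_vote` would take those three arrays as a list. A weighted average or majority vote would be equally reasonable choices under the same interface.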
About the journal:
Information Fusion serves as a central platform for showcasing advancements in multi-sensor, multi-source, multi-process information fusion, fostering collaboration among diverse disciplines driving its progress. It is the leading outlet for sharing research and development in this field, focusing on architectures, algorithms, and applications. Papers dealing with fundamental theoretical analyses as well as those demonstrating their application to real-world problems will be welcome.