Facial Expression Recognition based on Convolutional Neural Network with Sparse Representation
Authors: Xuan Liu, Jiachen Ma, Qianqian Wang
Published in: 2022 8th International Conference on Systems and Informatics (ICSAI), 2022-12-10
DOI: 10.1109/ICSAI57119.2022.10005481 (https://doi.org/10.1109/ICSAI57119.2022.10005481)
Citations: 1
Abstract
Facial Expression Recognition (FER) in the wild using Convolutional Neural Networks (CNNs) has been a challenge for years because of significant intra-class variance and inter-class similarity. Nevertheless, FER in the wild is vital for human-computer interaction and has numerous applications. One approach to this problem is to strengthen the network's ability to extract discriminative features. In this work, a sparse transform is used to improve a CNN's feature extraction ability without adding to the network's computational load. We place a sparse representation layer, built from the Haar wavelet transform or the shearlet transform, before the convolutional layers of a standard CNN. We build VGGNet and AlexNet architectures with the proposed sparse representation layers and conduct experiments on the FER2013 dataset without using additional training data. The experimental results show that the wavelet-based sparse representation layer can improve FER performance without imposing an excessive computational burden. Using VGGNet paired with a sparse representation layer built with the wavelet transform, we achieved a test accuracy of 73.25 percent on the FER2013 dataset, which is among the best results for a single network.
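The abstract does not say how the sparse representation layer is constructed, so the following is only a minimal sketch of one plausible reading: a fixed, single-level 2D Haar transform applied to a grayscale face image, with the four resulting sub-bands stacked as input channels for the first convolutional layer. The function name `haar_dwt2`, the single decomposition level, and the orthonormal scaling are all assumptions made for illustration, not details taken from the paper.

```python
import numpy as np


def haar_dwt2(image):
    """Single-level 2D Haar transform (hypothetical helper, not from the paper).

    Returns an array of shape (4, H/2, W/2): one approximation sub-band
    followed by three detail sub-bands.
    """
    x = np.asarray(image, dtype=np.float64)
    if x.ndim != 2 or x.shape[0] % 2 or x.shape[1] % 2:
        raise ValueError("expected a 2D array with even height and width")

    # The four pixels of every non-overlapping 2x2 block.
    a = x[0::2, 0::2]  # top-left
    b = x[0::2, 1::2]  # top-right
    c = x[1::2, 0::2]  # bottom-left
    d = x[1::2, 1::2]  # bottom-right

    # Orthonormal Haar combinations: each output is a +/- 1/2 weighted sum.
    approx = (a + b + c + d) / 2.0       # local average (low-pass in both directions)
    col_detail = (a - b + c - d) / 2.0   # difference between left and right columns
    row_detail = (a + b - c - d) / 2.0   # difference between top and bottom rows
    diag_detail = (a - b - c + d) / 2.0  # diagonal difference

    return np.stack([approx, col_detail, row_detail, diag_detail])


if __name__ == "__main__":
    # FER2013 images are 48x48 grayscale; random data stands in for a face here.
    rng = np.random.default_rng(0)
    face = rng.random((48, 48))
    channels = haar_dwt2(face)
    print(channels.shape)
```

Because the transform has fixed weights and halves the spatial resolution, a layer like this adds no trainable parameters and hands the convolutional layers a smaller input, which is in line with the abstract's claim of limited computational overhead. The shearlet variant mentioned in the abstract is not sketched here.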