Tensor Nuclear Norm-Based Multi-Channel Atomic Representation for Robust Face Recognition

IF 13.7 IEEE transactions on image processing : a publication of the IEEE Signal Processing Society Pub Date : 2025-02-17 DOI:10.1109/TIP.2025.3539472

Yutao Hu;Yulong Wang;Libin Wang;Han Li;Hong Chen;Yuan Yan Tang

{"title":"Tensor Nuclear Norm-Based Multi-Channel Atomic Representation for Robust Face Recognition","authors":"Yutao Hu;Yulong Wang;Libin Wang;Han Li;Hong Chen;Yuan Yan Tang","doi":"10.1109/TIP.2025.3539472","DOIUrl":null,"url":null,"abstract":"Numerous representation-based classification (RC) methods have been developed for face recognition due to their decent model interpretability and robustness against noise. Most existing RC methods primarily characterize the gray-scale reconstruction error image (single-channel data) in two ways: the one-dimensional (1D) pixel-based error model and the two-dimensional (2D) gray-scale image-matrix-based error model. The former measures the reconstruction error pixel by pixel, while the latter leverages 2D structural information of the gray-scale error image, such as the low-rank property. However, when applying these methods to different color channels of a test color face image (multi-channel data) separately and independently, they neglect the three-dimensional (3D) structural correlations among distinct color channels. In real-world scenarios, face images are often contaminated with complex noise, including contiguous occlusion and random pixel corruption, which pose significant challenges to these approaches and can lead to a decline in performance. In this paper, we propose a Tensor Nuclear Norm based Robust Multi-channel Atomic Representation (TNN-RMAR) framework with application to color face recognition. The proposed method has the following three critical ingredients: 1) We propose a 3D color image-tensor-based error model, which can take full advantage of the 3D structural information of the color error image. 2) To leverage the 3D structural information of the color error image, we model it as a 3-order tensor <inline-formula> <tex-math>${\\mathcal {E}}$ </tex-math></inline-formula> and exploit its low-rank property with the tensor nuclear norm. Given that multiple color channels in a color image are generally corrupted at the same positions, we design a tube-wise tailored loss function to further leverage its tube-wise structure. 3) We devise the multi-channel atomic norm (MAN) regularization for the representation coefficient matrix, which allows us to jointly harness the correlation information of coefficients in different color channels. In addition, we also devise an efficient algorithm to solve the TNN-RMAR framework based on the alternating direction method of multipliers (ADMM) framework. By leveraging TNN-RMAR as a general platform, we also develop several novel robust multi-channel RC methods. Experimental results on benchmark real-world databases validate the effectiveness and robustness of the proposed framework for robust color face recognition.","PeriodicalId":94032,"journal":{"name":"IEEE transactions on image processing : a publication of the IEEE Signal Processing Society","volume":"34 ","pages":"1311-1325"},"PeriodicalIF":13.7000,"publicationDate":"2025-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on image processing : a publication of the IEEE Signal Processing Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10891337/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Numerous representation-based classification (RC) methods have been developed for face recognition due to their decent model interpretability and robustness against noise. Most existing RC methods primarily characterize the gray-scale reconstruction error image (single-channel data) in two ways: the one-dimensional (1D) pixel-based error model and the two-dimensional (2D) gray-scale image-matrix-based error model. The former measures the reconstruction error pixel by pixel, while the latter leverages 2D structural information of the gray-scale error image, such as the low-rank property. However, when applying these methods to different color channels of a test color face image (multi-channel data) separately and independently, they neglect the three-dimensional (3D) structural correlations among distinct color channels. In real-world scenarios, face images are often contaminated with complex noise, including contiguous occlusion and random pixel corruption, which pose significant challenges to these approaches and can lead to a decline in performance. In this paper, we propose a Tensor Nuclear Norm based Robust Multi-channel Atomic Representation (TNN-RMAR) framework with application to color face recognition. The proposed method has the following three critical ingredients: 1) We propose a 3D color image-tensor-based error model, which can take full advantage of the 3D structural information of the color error image. 2) To leverage the 3D structural information of the color error image, we model it as a 3-order tensor

${\mathcal {E}}$

and exploit its low-rank property with the tensor nuclear norm. Given that multiple color channels in a color image are generally corrupted at the same positions, we design a tube-wise tailored loss function to further leverage its tube-wise structure. 3) We devise the multi-channel atomic norm (MAN) regularization for the representation coefficient matrix, which allows us to jointly harness the correlation information of coefficients in different color channels. In addition, we also devise an efficient algorithm to solve the TNN-RMAR framework based on the alternating direction method of multipliers (ADMM) framework. By leveraging TNN-RMAR as a general platform, we also develop several novel robust multi-channel RC methods. Experimental results on benchmark real-world databases validate the effectiveness and robustness of the proposed framework for robust color face recognition.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于张量核范数的多通道原子表示鲁棒人脸识别

许多基于表示的分类（RC）方法由于其良好的模型可解释性和抗噪声的鲁棒性而被开发用于人脸识别。大多数现有的RC方法主要以两种方式表征灰度重建误差图像（单通道数据）：基于一维（1D）像素的误差模型和基于二维（2D）灰度图像矩阵的误差模型。前者逐像素测量重建误差，后者利用灰度误差图像的二维结构信息，如低秩特性。然而，当将这些方法分别独立应用于测试彩色人脸图像（多通道数据）的不同颜色通道时，它们忽略了不同颜色通道之间的三维（3D）结构相关性。在现实场景中，人脸图像经常受到复杂噪声的污染，包括连续遮挡和随机像素损坏，这对这些方法构成了重大挑战，并可能导致性能下降。提出了一种基于张量核范数的鲁棒多通道原子表示（tnn - rar）框架，并将其应用于彩色人脸识别。提出了一种基于三维彩色图像张量的误差模型，该模型可以充分利用彩色误差图像的三维结构信息。2)为了利用彩色误差图像的三维结构信息，我们将其建模为3阶张量${\mathcal {E}}$，并利用张量核范数利用其低秩特性。考虑到彩色图像中的多个颜色通道通常在相同位置被破坏，我们设计了一个管型定制损失函数来进一步利用其管型结构。3)设计了表示系数矩阵的多通道原子范数（MAN）正则化，使我们能够共同利用不同颜色通道中系数的相关信息。此外，我们还设计了一种基于交替方向乘法器（ADMM）框架的求解TNN-RMAR框架的高效算法。通过利用TNN-RMAR作为通用平台，我们还开发了几种新的鲁棒多通道RC方法。在基准数据库上的实验结果验证了该框架的有效性和鲁棒性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society

自引率

0.00%

发文量