首页 > 最新文献

Pattern Recognition最新文献

英文 中文
Quaternion adaptive approximation normalization graph guided implicit low rank for robust matrix completion 四元数自适应逼近归一化图导隐式低秩鲁棒矩阵补全
IF 7.6 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-30 DOI: 10.1016/j.patcog.2026.113210
Yu Guo , Yi Liu , Guoqing Chen , Tieyong Zeng , Qiyu Jin , Michael Kwok-Po Ng
Graph structures are effective for capturing low-dimensional manifolds within high-dimensional data spaces and are frequently utilized as regularization terms to smooth graph signals. A crucial element in this process is the construction of the graph Laplacian. However, the normalization of this Laplacian often necessitates computationally expensive inverse operations. To address this limitation, this paper introduces quaternion graph regularity and proposes the quaternion adaptive approximation normalization graph (QAANG). QAANG offers a computationally efficient solution by requiring only a single adaptive scalar for approximate normalization, thereby circumventing the need for inverse operations. To promote the low rank of the graph, we implicitly embed the low rank into the data fidelity term. This approach not only avoids the significant costs associated with the explicit computation of the low-rank of quaternion matrices, but also eliminates the need to balance multiple regularization terms and adjust hyperparameters. Experimental results demonstrate that QAANG surpasses current state-of-the-art quaternion methods in both completion performance and robustness.
图结构对于在高维数据空间中捕获低维流形是有效的,并且经常被用作平滑图信号的正则化项。这个过程中的一个关键因素是图拉普拉斯的构造。然而,这个拉普拉斯函数的归一化常常需要计算代价昂贵的逆操作。为了解决这一缺陷,本文引入了四元数图的正则性,提出了四元数自适应逼近归一化图(QAANG)。QAANG提供了一种计算效率高的解决方案,它只需要一个自适应标量进行近似归一化,从而避免了对逆操作的需要。为了提高图的低秩,我们隐式地将低秩嵌入到数据保真度项中。这种方法不仅避免了显式计算低秩四元数矩阵所带来的巨大开销,而且消除了平衡多个正则化项和调整超参数的需要。实验结果表明,QAANG算法在完井性能和鲁棒性方面都优于当前最先进的四元数算法。
{"title":"Quaternion adaptive approximation normalization graph guided implicit low rank for robust matrix completion","authors":"Yu Guo ,&nbsp;Yi Liu ,&nbsp;Guoqing Chen ,&nbsp;Tieyong Zeng ,&nbsp;Qiyu Jin ,&nbsp;Michael Kwok-Po Ng","doi":"10.1016/j.patcog.2026.113210","DOIUrl":"10.1016/j.patcog.2026.113210","url":null,"abstract":"<div><div>Graph structures are effective for capturing low-dimensional manifolds within high-dimensional data spaces and are frequently utilized as regularization terms to smooth graph signals. A crucial element in this process is the construction of the graph Laplacian. However, the normalization of this Laplacian often necessitates computationally expensive inverse operations. To address this limitation, this paper introduces quaternion graph regularity and proposes the quaternion adaptive approximation normalization graph (QAANG). QAANG offers a computationally efficient solution by requiring only a single adaptive scalar for approximate normalization, thereby circumventing the need for inverse operations. To promote the low rank of the graph, we implicitly embed the low rank into the data fidelity term. This approach not only avoids the significant costs associated with the explicit computation of the low-rank of quaternion matrices, but also eliminates the need to balance multiple regularization terms and adjust hyperparameters. Experimental results demonstrate that QAANG surpasses current state-of-the-art quaternion methods in both completion performance and robustness.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"176 ","pages":"Article 113210"},"PeriodicalIF":7.6,"publicationDate":"2026-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146174373","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SOFP: Capturing subtle facial dynamics with symmetric optical flow perception for micro-expression recognition SOFP:用对称光流感知捕捉细微的面部动态,用于微表情识别
IF 7.6 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-30 DOI: 10.1016/j.patcog.2026.113199
Kejian Yu, Zhaohui Zhang, Chaochao Hu, Jiehao Luo
Facial micro-expressions (MEs) are fleeting, involuntary facial movements that reveal genuine emotions and play a key role in lie detection and affective computing. However, existing micro-expression recognition (MER) methods often rely on imprecise priors, such as facial landmark localization and action unit annotations, resulting in incomplete or noisy motion representations. To address these limitations, we propose the Symmetric Optical Flow Perception (SOFP) framework, which partitions the optical flow (OF) into four bilaterally symmetric facial regions to capture both local and global motion cues through attention-guided encoders. Furthermore, we introduce the Symmetric Region-Aware Attention Fusion (SRAF) module, which (i) enforces semantic consistency between symmetric facial regions, (ii) models inter-region dependencies via structure-guided self-attention, and (iii) integrates global-to-local information through adaptive cross-attention fusion. Extensive experiments are conducted under both composite database evaluation (CDE) with leave-one-subject-out (LOSO) cross-validation and single dataset evaluation (SDE) settings. On the Composite (Full) dataset, SOFP achieves state-of-the-art performance with a UF1 of 92.84% and a UAR of 92.93%. In SDE, SOFP consistently outperforms existing methods across the SAMM, CASME II, CAS(ME)3, and DFME datasets. These results demonstrate that explicitly modeling facial symmetry and region-level motion information enables robust and accurate MER. The source code is publicly available at https://github.com/Healer-ML/MER.
面部微表情(MEs)是一种短暂的、无意识的面部运动,它揭示了真实的情绪,在测谎和情感计算中起着关键作用。然而,现有的微表情识别方法往往依赖于不精确的先验,如面部地标定位和动作单元注释,导致运动表示不完整或有噪声。为了解决这些限制,我们提出了对称光流感知(SOFP)框架,该框架将光流(OF)划分为四个双边对称的面部区域,通过注意引导编码器捕获局部和全局运动线索。此外,我们引入了对称区域感知注意力融合(SRAF)模块,该模块(i)增强对称面部区域之间的语义一致性,(ii)通过结构引导的自注意建模区域间依赖关系,以及(iii)通过自适应交叉注意融合集成全局到局部的信息。在丢下一个受试者(LOSO)交叉验证的复合数据库评估(CDE)和单数据集评估(SDE)设置下进行了大量实验。在Composite (Full)数据集上,SOFP实现了最先进的性能,UF1为92.84%,UAR为92.93%。在SDE中,SOFP在SAMM、CASME II、CAS(ME)3和DFME数据集上始终优于现有方法。这些结果表明,明确建模面部对称性和区域级运动信息可以实现鲁棒和准确的MER。源代码可在https://github.com/Healer-ML/MER上公开获得。
{"title":"SOFP: Capturing subtle facial dynamics with symmetric optical flow perception for micro-expression recognition","authors":"Kejian Yu,&nbsp;Zhaohui Zhang,&nbsp;Chaochao Hu,&nbsp;Jiehao Luo","doi":"10.1016/j.patcog.2026.113199","DOIUrl":"10.1016/j.patcog.2026.113199","url":null,"abstract":"<div><div>Facial micro-expressions (MEs) are fleeting, involuntary facial movements that reveal genuine emotions and play a key role in lie detection and affective computing. However, existing micro-expression recognition (MER) methods often rely on imprecise priors, such as facial landmark localization and action unit annotations, resulting in incomplete or noisy motion representations. To address these limitations, we propose the Symmetric Optical Flow Perception (SOFP) framework, which partitions the optical flow (OF) into four bilaterally symmetric facial regions to capture both local and global motion cues through attention-guided encoders. Furthermore, we introduce the Symmetric Region-Aware Attention Fusion (SRAF) module, which (i) enforces semantic consistency between symmetric facial regions, (ii) models inter-region dependencies via structure-guided self-attention, and (iii) integrates global-to-local information through adaptive cross-attention fusion. Extensive experiments are conducted under both composite database evaluation (CDE) with leave-one-subject-out (LOSO) cross-validation and single dataset evaluation (SDE) settings. On the Composite (Full) dataset, SOFP achieves state-of-the-art performance with a UF1 of 92.84% and a UAR of 92.93%. In SDE, SOFP consistently outperforms existing methods across the SAMM, CASME II, CAS(ME)<sup>3</sup>, and DFME datasets. These results demonstrate that explicitly modeling facial symmetry and region-level motion information enables robust and accurate MER. The source code is publicly available at <span><span>https://github.com/Healer-ML/MER</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"176 ","pages":"Article 113199"},"PeriodicalIF":7.6,"publicationDate":"2026-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146174574","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Manifold regularized non-negative PCA with robust ℓ2,p-norm enhancement 具有鲁棒l2,p模增强的流形正则化非负主成分分析
IF 7.6 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-30 DOI: 10.1016/j.patcog.2026.113195
Minghua Wan , Taotao Chen , Hai Tan , Mingwei Tang , Guowei Yang
Facing the challenges of ubiquitous noise in high-dimensional datasets and the embedding of data samples in low-dimensional manifolds, traditional robust NMF algorithms have limitations in noise reduction and preserving the geometric structure of data. This paper proposes a novel algorithm, Manifold Regularized Non-negative Principal Component Analysis (ℓ2,p-MRNPCA), which enhances the model’s robustness to noise by introducing ℓ2,p norm constraints and maintains the intrinsic geometric structure of the data. The algorithm further incorporates a Laplacian graph regularization term to preserve local manifold structure, and additionally imposes an independent ℓ2,1-norm penalty on the residual matrix to enhance robustness. Compared to ℓ2,p-PCA, ℓ2,p-MRNPCA demonstrates stronger local learning ability in image data processing, more effectively recognizing image details and patterns. The main contribution of this study is the proposal of a new method that integrates ℓ2,p regularization, NMF, and manifold learning, enhancing the model’s robustness and recognition capabilities. During the optimization of the projection matrix, this method effectively reduces the impact of noise and maintains the geometric integrity of the original data, thus obtaining superior part-based representations. Finally, we designed a Lagrangian–KKT multiplicative update framework to solve ℓ2,p-MRNPCA and conducted experiments on three common datasets and the handwritten MNIST dataset, demonstrating optimal performance.
面对高维数据集中无处不在的噪声和数据样本嵌入到低维流形中的挑战,传统的鲁棒NMF算法在降噪和保持数据几何结构方面存在局限性。本文提出了一种新的算法——流形正则化非负主成分分析(流形正则化非负主成分分析,p- mrnpca),该算法通过引入l2,p范数约束来增强模型对噪声的鲁棒性,并保持了数据固有的几何结构。该算法进一步引入拉普拉斯图正则化项以保持局部流形结构,并在残差矩阵上施加独立的1,1,2范数惩罚以增强鲁棒性。与l2 - p-PCA相比,l2 - p-MRNPCA在图像数据处理中表现出更强的局部学习能力,能更有效地识别图像细节和模式。本研究的主要贡献是提出了一种集成了l2、p正则化、NMF和流形学习的新方法,增强了模型的鲁棒性和识别能力。在投影矩阵优化过程中,该方法有效地降低了噪声的影响,保持了原始数据的几何完整性,从而获得了更好的基于部分的表示。最后,我们设计了一个拉格朗日- kkt乘法更新框架来求解l2,p-MRNPCA,并在三个常用数据集和手写MNIST数据集上进行了实验,证明了最优的性能。
{"title":"Manifold regularized non-negative PCA with robust ℓ2,p-norm enhancement","authors":"Minghua Wan ,&nbsp;Taotao Chen ,&nbsp;Hai Tan ,&nbsp;Mingwei Tang ,&nbsp;Guowei Yang","doi":"10.1016/j.patcog.2026.113195","DOIUrl":"10.1016/j.patcog.2026.113195","url":null,"abstract":"<div><div>Facing the challenges of ubiquitous noise in high-dimensional datasets and the embedding of data samples in low-dimensional manifolds, traditional robust NMF algorithms have limitations in noise reduction and preserving the geometric structure of data. This paper proposes a novel algorithm, Manifold Regularized Non-negative Principal Component Analysis (ℓ<sub>2,<em>p</em></sub>-MRNPCA), which enhances the model’s robustness to noise by introducing ℓ<sub>2,<em>p</em></sub> norm constraints and maintains the intrinsic geometric structure of the data. The algorithm further incorporates a Laplacian graph regularization term to preserve local manifold structure, and additionally imposes an independent ℓ<sub>2,1</sub>-norm penalty on the residual matrix to enhance robustness. Compared to ℓ<sub>2,<em>p</em></sub>-PCA, ℓ<sub>2,<em>p</em></sub>-MRNPCA demonstrates stronger local learning ability in image data processing, more effectively recognizing image details and patterns. The main contribution of this study is the proposal of a new method that integrates ℓ<sub>2,<em>p</em></sub> regularization, NMF, and manifold learning, enhancing the model’s robustness and recognition capabilities. During the optimization of the projection matrix, this method effectively reduces the impact of noise and maintains the geometric integrity of the original data, thus obtaining superior part-based representations. Finally, we designed a Lagrangian–KKT multiplicative update framework to solve ℓ<sub>2,<em>p</em></sub>-MRNPCA and conducted experiments on three common datasets and the handwritten MNIST dataset, demonstrating optimal performance.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"176 ","pages":"Article 113195"},"PeriodicalIF":7.6,"publicationDate":"2026-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146174869","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Semantic change detection of roads and bridges: A fine-grained dataset and multimodal frequency-driven detector 道路和桥梁的语义变化检测:一个细粒度数据集和多模态频率驱动检测器
IF 7.6 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-29 DOI: 10.1016/j.patcog.2026.113191
Qing-Ling Shu , Si-Bao Chen , Xiao Wang , Zhi-Hui You , Wei Lu , Jin Tang , Bin Luo
Accurate detection of road and bridge changes is crucial for urban planning and transportation management, yet presents unique challenges for general change detection (CD). Key difficulties arise from maintaining the continuity of roads and bridges as linear structures and disambiguating visually similar land covers (e.g., road construction vs. bare land). Existing spatial-domain models struggle with these issues, further hindered by the lack of specialized, semantically rich datasets. To fill these gaps, we introduce the Road and Bridge Semantic Change Detection (RB-SCD) dataset. Unlike existing benchmarks that primarily focus on general land cover changes, RB-SCD is the first to systematically target 11 specific semantic change transition types (e.g., water → bridge) anchored to traffic infrastructure. This enables a detailed analysis of traffic infrastructure evolution. Building on this, we propose a novel framework, the Multimodal Frequency-Driven Change Detector (MFDCD). MFDCD integrates multimodal features in the frequency domain through two key components: (1) the Dynamic Frequency Coupler (DFC), which leverages wavelet transform to decompose visual features, enabling it to robustly model the continuity of linear transitions; and (2) the Textual Frequency Filter (TFF), which encodes semantic priors into frequency-domain graphs and applies filter banks to align them with visual features, resolving semantic ambiguities. Experiments demonstrate the state-of-the-art performance of MFDCD on RB-SCD and three public CD datasets. The code will be available at https://github.com/DaGuangDaGuang/RB-SCD.
道路和桥梁变化的准确检测对城市规划和交通管理至关重要,但对一般变化检测(CD)提出了独特的挑战。关键的困难来自保持道路和桥梁作为线性结构的连续性,以及消除视觉上相似的土地覆盖(例如,道路建设与裸地)的歧义。现有的空间域模型与这些问题作斗争,进一步受到缺乏专门的、语义丰富的数据集的阻碍。为了填补这些空白,我们引入了道路和桥梁语义变化检测(RB-SCD)数据集。与现有的主要关注一般土地覆盖变化的基准不同,RB-SCD是第一个系统地针对11种特定的语义变化转换类型(例如,水 → 桥梁)锚定在交通基础设施上。这使得对交通基础设施演变的详细分析成为可能。在此基础上,我们提出了一个新的框架,即多模态频率驱动变化检测器(MFDCD)。MFDCD通过两个关键组件在频域中集成多模态特征:(1)动态频率耦合器(DFC),它利用小波变换对视觉特征进行分解,使其能够鲁棒地模拟线性过渡的连续性;(2)文本频率滤波器(TFF),它将语义先验编码到频域图中,并应用滤波器组将它们与视觉特征对齐,从而解决语义歧义。实验证明了MFDCD在RB-SCD和三个公共CD数据集上的性能。代码可在https://github.com/DaGuangDaGuang/RB-SCD上获得。
{"title":"Semantic change detection of roads and bridges: A fine-grained dataset and multimodal frequency-driven detector","authors":"Qing-Ling Shu ,&nbsp;Si-Bao Chen ,&nbsp;Xiao Wang ,&nbsp;Zhi-Hui You ,&nbsp;Wei Lu ,&nbsp;Jin Tang ,&nbsp;Bin Luo","doi":"10.1016/j.patcog.2026.113191","DOIUrl":"10.1016/j.patcog.2026.113191","url":null,"abstract":"<div><div>Accurate detection of road and bridge changes is crucial for urban planning and transportation management, yet presents unique challenges for general change detection (CD). Key difficulties arise from maintaining the continuity of roads and bridges as linear structures and disambiguating visually similar land covers (e.g., road construction vs. bare land). Existing spatial-domain models struggle with these issues, further hindered by the lack of specialized, semantically rich datasets. To fill these gaps, we introduce the Road and Bridge Semantic Change Detection (RB-SCD) dataset. Unlike existing benchmarks that primarily focus on general land cover changes, RB-SCD is the first to systematically target 11 specific semantic change transition types (e.g., water → bridge) anchored to traffic infrastructure. This enables a detailed analysis of traffic infrastructure evolution. Building on this, we propose a novel framework, the Multimodal Frequency-Driven Change Detector (MFDCD). MFDCD integrates multimodal features in the frequency domain through two key components: (1) the Dynamic Frequency Coupler (DFC), which leverages wavelet transform to decompose visual features, enabling it to robustly model the continuity of linear transitions; and (2) the Textual Frequency Filter (TFF), which encodes semantic priors into frequency-domain graphs and applies filter banks to align them with visual features, resolving semantic ambiguities. Experiments demonstrate the state-of-the-art performance of MFDCD on RB-SCD and three public CD datasets. The code will be available at <span><span>https://github.com/DaGuangDaGuang/RB-SCD</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"176 ","pages":"Article 113191"},"PeriodicalIF":7.6,"publicationDate":"2026-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146174257","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Continual relation extraction with wake-sleep memory consolidation 连续关系提取与清醒-睡眠记忆巩固
IF 7.6 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-29 DOI: 10.1016/j.patcog.2026.113192
Tingting Hang , Ya Guo , Jun Huang , Yirui Wu , Umapada Pal , Shivakumara Palaiahnakote
Continual Relation Extraction (CRE) has achieved significant success due to its ability to adapt to new relations without frequent retraining. However, existing methods still face challenges such as overfitting and representation bias. Inspired by the wake-sleep memory consolidation process of the human brain, this paper proposes a Wake-Sleep Memory Consolidation (WSMC) framework to address these issues systematically. During the wake phase, the model simulates the brain’s information processing mechanism, quickly encoding new relations and storing them in short-term memory. We also introduce the Experience Iterative Learning (EIL) approach, which dynamically adjusts the distribution of relation samples. This approach corrects the model’s representation bias and enhances memory stability through experience replay. During the sleep phase, the model consolidates existing knowledge by replaying long-term memory. Moreover, the framework generates diverse dream data from existing memory sets, thereby increasing the diversity of the training data and improving the model’s generalization capability. Experimental results show that WSMC significantly outperforms other CRE baseline methods on FewRel and TACRED datasets, demonstrating its superior performance compared to baseline methods. Our source code is available at https://github.com/Gyanis9/WSMC.git.
连续关系提取(CRE)由于能够适应新的关系而无需频繁的再培训而取得了显著的成功。然而,现有的方法仍然面临着过拟合和代表性偏差等挑战。受人类大脑清醒-睡眠记忆巩固过程的启发,本文提出了一个清醒-睡眠记忆巩固(WSMC)框架来系统地解决这些问题。在清醒阶段,该模型模拟大脑的信息处理机制,快速编码新的关系并将其存储在短期记忆中。我们还介绍了经验迭代学习(EIL)方法,该方法可以动态调整关系样本的分布。这种方法纠正了模型的表征偏差,并通过经验重放增强了记忆的稳定性。在睡眠阶段,该模型通过重放长期记忆来巩固现有知识。此外,该框架从现有的记忆集中生成多样化的梦数据,从而增加了训练数据的多样性,提高了模型的泛化能力。实验结果表明,在FewRel和TACRED数据集上,WSMC显著优于其他CRE基线方法,显示了其优于基线方法的性能。我们的源代码可从https://github.com/Gyanis9/WSMC.git获得。
{"title":"Continual relation extraction with wake-sleep memory consolidation","authors":"Tingting Hang ,&nbsp;Ya Guo ,&nbsp;Jun Huang ,&nbsp;Yirui Wu ,&nbsp;Umapada Pal ,&nbsp;Shivakumara Palaiahnakote","doi":"10.1016/j.patcog.2026.113192","DOIUrl":"10.1016/j.patcog.2026.113192","url":null,"abstract":"<div><div>Continual Relation Extraction (CRE) has achieved significant success due to its ability to adapt to new relations without frequent retraining. However, existing methods still face challenges such as overfitting and representation bias. Inspired by the wake-sleep memory consolidation process of the human brain, this paper proposes a <strong>W</strong>ake-<strong>S</strong>leep <strong>M</strong>emory <strong>C</strong>onsolidation (WSMC) framework to address these issues systematically. During the wake phase, the model simulates the brain’s information processing mechanism, quickly encoding new relations and storing them in short-term memory. We also introduce the Experience Iterative Learning (EIL) approach, which dynamically adjusts the distribution of relation samples. This approach corrects the model’s representation bias and enhances memory stability through experience replay. During the sleep phase, the model consolidates existing knowledge by replaying long-term memory. Moreover, the framework generates diverse dream data from existing memory sets, thereby increasing the diversity of the training data and improving the model’s generalization capability. Experimental results show that WSMC significantly outperforms other CRE baseline methods on FewRel and TACRED datasets, demonstrating its superior performance compared to baseline methods. Our source code is available at <span><span>https://github.com/Gyanis9/WSMC.git</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"176 ","pages":"Article 113192"},"PeriodicalIF":7.6,"publicationDate":"2026-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146174535","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A novel incremental Gaussian mixture model based on fuzzy three-way decision for concept drift adaptation 一种新的基于模糊三向决策的增量高斯混合模型用于概念漂移自适应
IF 7.6 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-29 DOI: 10.1016/j.patcog.2026.113181
Wenxin Shen , Zhixuan Deng , Tianrui Li , Keyu Liu , Deyou Xia , Dayong Deng
As a clustering model based on probability distribution, the Gaussian mixture model (GMM) is extensively implemented in data stream learning. Most GMMs rely on historical instances to adapt to concept drift, but determining how to distinguish drift instances to reduce drift’s negative impact on GMM remains difficult. In addition, owing to the uncertainty of drift ranges, GMM may incorrectly adapt to instances on the drift boundary, resulting in the distribution of sub-clusters being inconsistent with the current data distribution. To address the two issues, this study proposes an incremental Gaussian mixture model based on fuzzy three-way decision (IGMMFTWD). In contrast to existing GMMs for data streams, IGMMFTWD adapts to concept drift based on drift risk and updates drifting sub-clusters locally. First, a fuzzy nearest neighbour method is proposed to construct the region that is suitable for the current drift range. Subsequently, a novel drift risk estimation based on three-way decisions is proposed. This method can reduce the misjudgement costs of drift instances. Finally, the Gaussian mixture model completes incremental adaptation with the local update method. In the experiment, the proposed model is compared and verified in terms of classification accuracy and G-mean. The results show that IGMMFTWD outperforms six state-of-the-art methods.
高斯混合模型作为一种基于概率分布的聚类模型,在数据流学习中得到了广泛的应用。大多数GMM依赖历史实例来适应概念漂移,但如何区分漂移实例以减少漂移对GMM的负面影响仍然是困难的。此外,由于漂移范围的不确定性,GMM可能不正确地适应漂移边界上的实例,导致子聚类的分布与当前数据分布不一致。为了解决这两个问题,本文提出了一种基于模糊三向决策的增量高斯混合模型(IGMMFTWD)。与现有的数据流GMMs相比,IGMMFTWD适应基于漂移风险的概念漂移,并在局部更新漂移子簇。首先,提出了一种模糊最近邻法来构造适合电流漂移范围的区域;在此基础上,提出了一种基于三向决策的漂移风险估计方法。该方法可以降低漂移实例的误判代价。最后,利用局部更新方法对高斯混合模型进行增量自适应。在实验中,从分类精度和g均值两个方面对所提出的模型进行了比较和验证。结果表明,IGMMFTWD优于六种最先进的方法。
{"title":"A novel incremental Gaussian mixture model based on fuzzy three-way decision for concept drift adaptation","authors":"Wenxin Shen ,&nbsp;Zhixuan Deng ,&nbsp;Tianrui Li ,&nbsp;Keyu Liu ,&nbsp;Deyou Xia ,&nbsp;Dayong Deng","doi":"10.1016/j.patcog.2026.113181","DOIUrl":"10.1016/j.patcog.2026.113181","url":null,"abstract":"<div><div>As a clustering model based on probability distribution, the Gaussian mixture model (GMM) is extensively implemented in data stream learning. Most GMMs rely on historical instances to adapt to concept drift, but determining how to distinguish drift instances to reduce drift’s negative impact on GMM remains difficult. In addition, owing to the uncertainty of drift ranges, GMM may incorrectly adapt to instances on the drift boundary, resulting in the distribution of sub-clusters being inconsistent with the current data distribution. To address the two issues, this study proposes an incremental Gaussian mixture model based on fuzzy three-way decision (IGMMFTWD). In contrast to existing GMMs for data streams, IGMMFTWD adapts to concept drift based on drift risk and updates drifting sub-clusters locally. First, a fuzzy nearest neighbour method is proposed to construct the region that is suitable for the current drift range. Subsequently, a novel drift risk estimation based on three-way decisions is proposed. This method can reduce the misjudgement costs of drift instances. Finally, the Gaussian mixture model completes incremental adaptation with the local update method. In the experiment, the proposed model is compared and verified in terms of classification accuracy and G-mean. The results show that IGMMFTWD outperforms six state-of-the-art methods.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"176 ","pages":"Article 113181"},"PeriodicalIF":7.6,"publicationDate":"2026-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146174495","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Tab2Visual: Deep learning for limited tabular data via visual representations and augmentation Tab2Visual:通过可视化表示和增强对有限表格数据进行深度学习
IF 7.6 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-29 DOI: 10.1016/j.patcog.2026.113173
Ahmed Mamdouh , Moumen El-Melegy , Samia Ali , Ron Kikinis
This research addresses the challenge of limited data in tabular data classification, particularly prevalent in domains with constraints like healthcare. We propose Tab2Visual, a novel approach that transforms heterogeneous tabular data into visual representations, enabling the application of powerful deep learning models. Tab2Visual effectively addresses data scarcity by incorporating novel image augmentation techniques and facilitating transfer learning. We extensively evaluate the proposed approach on diverse tabular datasets, comparing its performance against a wide range of machine learning algorithms, including classical methods, tree-based ensembles, and state-of-the-art deep learning models specifically designed for tabular data. We also perform an in-depth analysis of factors influencing Tab2Visual’s performance. Our experimental results demonstrate that Tab2Visual outperforms other methods in classification problems with limited tabular data.
本研究解决了表格数据分类中有限数据的挑战,特别是在医疗保健等有限制的领域。我们提出了Tab2Visual,这是一种将异构表格数据转换为视觉表示的新方法,使强大的深度学习模型得以应用。Tab2Visual通过结合新的图像增强技术和促进迁移学习,有效地解决了数据稀缺问题。我们在不同的表格数据集上广泛评估了所提出的方法,并将其与各种机器学习算法(包括经典方法、基于树的集成和专门为表格数据设计的最先进的深度学习模型)的性能进行了比较。我们还对影响Tab2Visual性能的因素进行了深入的分析。我们的实验结果表明,Tab2Visual在有限表格数据的分类问题上优于其他方法。
{"title":"Tab2Visual: Deep learning for limited tabular data via visual representations and augmentation","authors":"Ahmed Mamdouh ,&nbsp;Moumen El-Melegy ,&nbsp;Samia Ali ,&nbsp;Ron Kikinis","doi":"10.1016/j.patcog.2026.113173","DOIUrl":"10.1016/j.patcog.2026.113173","url":null,"abstract":"<div><div>This research addresses the challenge of limited data in tabular data classification, particularly prevalent in domains with constraints like healthcare. We propose Tab2Visual, a novel approach that transforms heterogeneous tabular data into visual representations, enabling the application of powerful deep learning models. Tab2Visual effectively addresses data scarcity by incorporating novel image augmentation techniques and facilitating transfer learning. We extensively evaluate the proposed approach on diverse tabular datasets, comparing its performance against a wide range of machine learning algorithms, including classical methods, tree-based ensembles, and state-of-the-art deep learning models specifically designed for tabular data. We also perform an in-depth analysis of factors influencing Tab2Visual’s performance. Our experimental results demonstrate that Tab2Visual outperforms other methods in classification problems with limited tabular data.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"176 ","pages":"Article 113173"},"PeriodicalIF":7.6,"publicationDate":"2026-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146174865","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
INSERTION: From traditional incremental learning to open-world stream learning 插入:从传统的增量学习到开放世界流学习
IF 7.6 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-29 DOI: 10.1016/j.patcog.2026.113163
Yanchao Li , Hongwei Dou , Guanxiao Li , Guangwei Gao , Huiyu Zhou
In plenty of real-world applications, data are generated/collected in a streaming way, and it is hard to obtain their accurate labels of known (seen) classes. Moreover, there are several unknown (unseen/novel) classes would emerge with evolving stream data. In the literatures, existing approaches suffer from three limitations: (1) a gap in intra-class variance arises when seen classes are learned more faster than novel classes; (2) a significant issue arises regarding the imbalance in feature weighting among the learning procedures for both new and old classes; (3) a catastrophic forgetting can occur if we exclusively update the model with new data, resulting in the loss of knowledge acquired from known classes when integrating information related to the current novel classes. This paper investigates the problem of learning with unseen classes detection over a non-stationary data stream. Particularly, we introduce uncertainty adaptive margin mechanism from open-world semi-supervised learning to address the bias stemming from the faster learning of discriminative features for seen classes compared to novel classes. We also develop adaptive weighting scheme to dynamically balance the usage of seen and novel classes data by updating their aggregation weights. In addition, we propose a model updating scheme to gradually incorporated the stored memory and novel class information, thereby reducing the risk of forgetting distinctive attributes associated with known classes. Finally, we formulate the objective in a bi-level optimization that enables our model to maintain consistent performance under class distribution shifts, detect unseen classes with minimal supervision, and achieve robust continual learning in open-world streaming scenarios. Our empirical evaluation of this framework using real-world datasets highlights its superior performance when compared to existing methods.
在许多现实世界的应用程序中,数据以流方式生成/收集,很难获得已知(已见)类的准确标签。此外,随着流数据的发展,还会出现一些未知(未见/新颖)的类。在文献中,现有的方法存在三个局限性:(1)当已知类的学习速度比新类快时,会出现类内方差的差距;(2)在新旧类的学习过程中出现了特征权重不平衡的显著问题;(3)如果只使用新数据更新模型,则会导致灾难性遗忘,导致在整合与当前新类相关的信息时,从已知类获得的知识丢失。本文研究了非平稳数据流上不可见类检测的学习问题。特别地,我们从开放世界半监督学习中引入了不确定性自适应边际机制,以解决由于与新类别相比,已见类别的判别特征学习速度更快而产生的偏差。我们还开发了自适应加权方案,通过更新聚合权值来动态平衡已见类和新类数据的使用。此外,我们提出了一种模型更新方案,将存储的记忆和新类信息逐步融合,从而降低遗忘已知类相关的独特属性的风险。最后,我们在双级优化中制定目标,使我们的模型能够在类分布变化下保持一致的性能,以最小的监督检测未见过的类,并在开放世界流场景中实现鲁棒的持续学习。我们使用真实世界数据集对该框架进行了实证评估,与现有方法相比,突出了其优越的性能。
{"title":"INSERTION: From traditional incremental learning to open-world stream learning","authors":"Yanchao Li ,&nbsp;Hongwei Dou ,&nbsp;Guanxiao Li ,&nbsp;Guangwei Gao ,&nbsp;Huiyu Zhou","doi":"10.1016/j.patcog.2026.113163","DOIUrl":"10.1016/j.patcog.2026.113163","url":null,"abstract":"<div><div>In plenty of real-world applications, data are generated/collected in a streaming way, and it is hard to obtain their accurate labels of known (seen) classes. Moreover, there are several unknown (unseen/novel) classes would emerge with evolving stream data. In the literatures, existing approaches suffer from three limitations: (1) a gap in intra-class variance arises when seen classes are learned more faster than novel classes; (2) a significant issue arises regarding the imbalance in feature weighting among the learning procedures for both new and old classes; (3) a catastrophic forgetting can occur if we exclusively update the model with new data, resulting in the loss of knowledge acquired from known classes when integrating information related to the current novel classes. This paper investigates the problem of learning with unseen classes detection over a non-stationary data stream. Particularly, we introduce uncertainty adaptive margin mechanism from open-world semi-supervised learning to address the bias stemming from the faster learning of discriminative features for seen classes compared to novel classes. We also develop adaptive weighting scheme to dynamically balance the usage of seen and novel classes data by updating their aggregation weights. In addition, we propose a model updating scheme to gradually incorporated the stored memory and novel class information, thereby reducing the risk of forgetting distinctive attributes associated with known classes. Finally, we formulate the objective in a bi-level optimization that enables our model to maintain consistent performance under class distribution shifts, detect unseen classes with minimal supervision, and achieve robust continual learning in open-world streaming scenarios. Our empirical evaluation of this framework using real-world datasets highlights its superior performance when compared to existing methods.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"176 ","pages":"Article 113163"},"PeriodicalIF":7.6,"publicationDate":"2026-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146174603","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IRDFusion: Iterative relation-map difference guided feature fusion for multispectral object detection IRDFusion:用于多光谱目标检测的迭代关系图差分制导特征融合
IF 7.6 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-29 DOI: 10.1016/j.patcog.2026.113189
Jifeng Shen , Haibo Zhan , Xin Zuo , Heng Fan , Xiaohui Yuan , Jun Li , Wankou Yang
Current multispectral object detection methods often retain extraneous background or noise during feature fusion, limiting perceptual performance. To address this, we propose a feature fusion framework based on cross-modal feature contrastive and screening strategy, diverging from conventional approaches. The proposed method adaptively enhances salient structures by fusing object-aware complementary cross-modal features while suppressing shared background interference. Our solution centers on two novel, specially designed modules: the Mutual Feature Refinement Module (MFRM) and the Differential Feature Feedback Module (DFFM). The MFRM enhances intra- and inter-modal feature representations by modeling their relationships, thereby improving cross-modal alignment and discriminative power. Inspired by feedback differential amplifiers, the DFFM dynamically computes inter-modal differential features as guidance signals and feeds them back to the MFRM, enabling adaptive fusion of complementary information while suppressing common-mode noise across modalities. To enable robust feature learning, the MFRM and DFFM are integrated into a unified framework, which is formally formulated as an Iterative Relation-Map Differential Guided Feature Fusion mechanism, termed IRDFusion. IRDFusion enables high-quality cross-modal fusion by progressively amplifying salient relational signals through iterative feedback, while suppressing feature noise, leading to significant performance gains. In extensive experiments on FLIR, LLVIP and M3FD datasets, IRDFusion achieves state-of-the-art performance and consistently outperforms existing methods across diverse challenging scenarios, demonstrating its robustness and effectiveness. Code will be available at https://github.com/61s61min/IRDFusion.git.
当前的多光谱目标检测方法在特征融合过程中往往会保留无关的背景或噪声,从而限制了感知性能。为了解决这个问题,我们提出了一种基于跨模态特征对比和筛选策略的特征融合框架,与传统方法不同。该方法通过融合目标感知互补的跨模态特征自适应增强显著结构,同时抑制共享背景干扰。我们的解决方案集中在两个新颖的,特别设计的模块:互特征细化模块(MFRM)和差分特征反馈模块(DFFM)。MFRM通过建模模式内和模式间的关系来增强模式内和模式间的特征表示,从而提高跨模式对齐和判别能力。受反馈差分放大器的启发,DFFM动态计算模态间差分特征作为制导信号,并将其反馈给MFRM,实现互补信息的自适应融合,同时抑制模态间的共模噪声。为了实现稳健的特征学习,MFRM和DFFM被集成到一个统一的框架中,该框架被正式表述为迭代关系映射差分引导特征融合机制,称为IRDFusion。IRDFusion通过迭代反馈逐步放大显著的相关信号,同时抑制特征噪声,从而实现高质量的跨模态融合,从而显著提高性能。在FLIR、LLVIP和M3FD数据集的大量实验中,IRDFusion实现了最先进的性能,并在各种具有挑战性的场景中始终优于现有方法,证明了其鲁棒性和有效性。代码将在https://github.com/61s61min/IRDFusion.git上提供。
{"title":"IRDFusion: Iterative relation-map difference guided feature fusion for multispectral object detection","authors":"Jifeng Shen ,&nbsp;Haibo Zhan ,&nbsp;Xin Zuo ,&nbsp;Heng Fan ,&nbsp;Xiaohui Yuan ,&nbsp;Jun Li ,&nbsp;Wankou Yang","doi":"10.1016/j.patcog.2026.113189","DOIUrl":"10.1016/j.patcog.2026.113189","url":null,"abstract":"<div><div>Current multispectral object detection methods often retain extraneous background or noise during feature fusion, limiting perceptual performance. To address this, we propose a feature fusion framework based on cross-modal feature contrastive and screening strategy, diverging from conventional approaches. The proposed method adaptively enhances salient structures by fusing object-aware complementary cross-modal features while suppressing shared background interference. Our solution centers on two novel, specially designed modules: the Mutual Feature Refinement Module (MFRM) and the Differential Feature Feedback Module (DFFM). The MFRM enhances intra- and inter-modal feature representations by modeling their relationships, thereby improving cross-modal alignment and discriminative power. Inspired by feedback differential amplifiers, the DFFM dynamically computes inter-modal differential features as guidance signals and feeds them back to the MFRM, enabling adaptive fusion of complementary information while suppressing common-mode noise across modalities. To enable robust feature learning, the MFRM and DFFM are integrated into a unified framework, which is formally formulated as an Iterative Relation-Map Differential Guided Feature Fusion mechanism, termed IRDFusion. IRDFusion enables high-quality cross-modal fusion by progressively amplifying salient relational signals through iterative feedback, while suppressing feature noise, leading to significant performance gains. In extensive experiments on FLIR, LLVIP and M<sup>3</sup>FD datasets, IRDFusion achieves state-of-the-art performance and consistently outperforms existing methods across diverse challenging scenarios, demonstrating its robustness and effectiveness. Code will be available at <span><span>https://github.com/61s61min/IRDFusion.git</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"176 ","pages":"Article 113189"},"PeriodicalIF":7.6,"publicationDate":"2026-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146174888","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
TranSAC: An unsupervised transferability metric based on task speciality and domain commonality TranSAC:基于任务特殊性和领域共性的无监督可转移性度量
IF 7.6 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-29 DOI: 10.1016/j.patcog.2026.113137
Qianshan Zhan , Xiao-Jun Zeng , Qian Wang
In transfer learning, one fundamental problem is transferability estimation, where a metric measures transfer performance without training. Existing metrics face two issues: 1) requiring target domain labels, and 2) only focusing on task speciality but ignoring equally important domain commonality. To overcome these limitations, we propose TranSAC, a Transferability metric based on task Speciality And domain Commonality, capturing the separation between classes and the similarity between domains. Its main advantages are: 1) unsupervised, 2) fine-tuning free, and 3) applicable to source-dependent and source-free transfer scenarios. To achieve this, we investigate the upper and lower bounds of transfer performance based on fixed representations extracted from the pre-trained model. Theoretical results reveal that unsupervised transfer performance is characterized by entropy-based quantities, naturally reflecting task specificity and domain commonality. These insights motivate the design of TranSAC, which integrates both factors to enhance transferability. Extensive experiments are performed across 12 target datasets with 36 pre-trained models, including supervised CNNs, self-supervised CNNs, and ViTs. Results demonstrate the importance of domain commonality and task speciality, allowing TranSAC as superior to state-of-the-art metrics for pre-trained model ranking, target domain ranking, and source domain ranking.
在迁移学习中,一个基本问题是可迁移性估计,其中度量是在未经训练的情况下度量迁移性能。现有的度量标准面临两个问题:1)需要目标领域标签;2)只关注任务的特殊性,而忽略了同样重要的领域共性。为了克服这些限制,我们提出了TranSAC,一种基于任务特殊性和领域共性的可转移性度量,捕获类之间的分离和领域之间的相似性。它的主要优点是:1)无监督,2)无微调,以及3)适用于依赖源和无源的传输场景。为了实现这一点,我们研究了基于从预训练模型中提取的固定表示的传输性能的上界和下界。理论结果表明,无监督迁移性能具有基于熵的特征量,自然地反映了任务的特殊性和领域的共性。这些见解激发了TranSAC的设计,它整合了这两个因素以增强可转移性。在12个目标数据集和36个预训练模型上进行了广泛的实验,包括监督cnn、自监督cnn和vit。结果证明了领域共性和任务特殊性的重要性,使得TranSAC在预训练模型排名、目标领域排名和源领域排名方面优于最先进的指标。
{"title":"TranSAC: An unsupervised transferability metric based on task speciality and domain commonality","authors":"Qianshan Zhan ,&nbsp;Xiao-Jun Zeng ,&nbsp;Qian Wang","doi":"10.1016/j.patcog.2026.113137","DOIUrl":"10.1016/j.patcog.2026.113137","url":null,"abstract":"<div><div>In transfer learning, one fundamental problem is transferability estimation, where a metric measures transfer performance without training. Existing metrics face two issues: 1) requiring target domain labels, and 2) only focusing on task speciality but ignoring equally important domain commonality. To overcome these limitations, we propose TranSAC, a <strong>Tran</strong>sferability metric based on task <strong>S</strong>peciality <strong>A</strong>nd domain <strong>C</strong>ommonality, capturing the separation between classes and the similarity between domains. Its main advantages are: 1) unsupervised, 2) fine-tuning free, and 3) applicable to source-dependent and source-free transfer scenarios. To achieve this, we investigate the upper and lower bounds of transfer performance based on fixed representations extracted from the pre-trained model. Theoretical results reveal that unsupervised transfer performance is characterized by entropy-based quantities, naturally reflecting task specificity and domain commonality. These insights motivate the design of TranSAC, which integrates both factors to enhance transferability. Extensive experiments are performed across 12 target datasets with 36 pre-trained models, including supervised CNNs, self-supervised CNNs, and ViTs. Results demonstrate the importance of domain commonality and task speciality, allowing TranSAC as superior to state-of-the-art metrics for pre-trained model ranking, target domain ranking, and source domain ranking.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"176 ","pages":"Article 113137"},"PeriodicalIF":7.6,"publicationDate":"2026-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146081493","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Pattern Recognition
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1