Applied Intelligence最新文献_第2页

HiProIBM: unsupervised continual learning through hierarchical prototypical cross-level discrimination along with information bottleneck subnetwork masking

IF 3.4 2区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Applied Intelligence

Pub Date : 2025-02-25 DOI: 10.1007/s10489-025-06362-z

Ankit Malviya, Chandresh Kumar Maurya

Catastrophic Forgetting (CF) occurs when a machine learning model forgets the experience of previous tasks while learning new tasks due to inadequate retention mechanisms. Unsupervised continual learning (UCL) addresses this by enabling the model to adapt to new tasks using unlabeled data while retaining past knowledge. To mitigate CF in UCL, we use a parameter isolation technique to mask sub-networks dedicated to each task, thus preventing interference with previous tasks. However, relying solely on weight magnitude for constructing these sub-networks can result in the retention of irrelevant weights and the creation of redundant sub-networks. This approach also risks capacity saturation and information suppression for tasks encountered later in the sequence. To overcome this, we use masked sub-networks, inspired by the information bottleneck (IB) concept. It accumulates valuable information into essential weights to construct redundancy-free sub-networks which effectively mitigates CF and enables the new task training. The IB subnetwork masking faces challenges in balancing input compression with meaningful pattern preservation without labels. It risks overcompression and loss of crucial latent structures, which degrades model performance. We address this by learning multiple semantic hierarchies present in the data using unsupervised contrastive learning. However traditional contrastive learning techniques learn meaningful representations by contrasting similar and dissimilar data points. These approaches lack adequate representational power for modeling datasets with multiple semantic hierarchies. The inherent hierarchical semantic structures in datasets are necessary to integrate semantically related clusters into larger, coarser-grained clusters, but existing contrastive learning methods often overlook this and limit semantic understanding. We address this by constructing and updating hierarchical prototypes with cross-level group discrimination to represent semantic structures in the latent space. Our experiments on four standard datasets show performance improvements over SOTA baselines for varying task-sequences from 5 to 100, with nearly-zero forgetting.

灾难性遗忘（CF）是指机器学习模型在学习新任务时，由于保留机制不足而遗忘了之前任务的经验。无监督持续学习（UCL）通过使模型在保留过去知识的同时使用无标记数据适应新任务，从而解决了这一问题。为了减轻 UCL 中的 CF，我们使用了参数隔离技术来屏蔽专用于每个任务的子网络，从而防止干扰以前的任务。然而，仅仅依靠权重大小来构建这些子网络可能会导致无关权重的保留和冗余子网络的创建。这种方法还存在容量饱和和信息抑制的风险，不利于后面的任务。为了克服这一问题，我们受信息瓶颈（IB）概念的启发，使用了屏蔽子网络。它将有价值的信息积累到基本权重中，构建出无冗余子网络，从而有效缓解 CF 问题，实现新任务训练。IB 子网络屏蔽在平衡输入压缩和无标签有意义模式保存方面面临挑战。它存在过度压缩和丢失关键潜在结构的风险，从而降低了模型性能。我们通过使用无监督对比学习来学习数据中存在的多个语义层次来解决这个问题。然而，传统的对比学习技术是通过对比相似和不相似的数据点来学习有意义的表征。这些方法缺乏足够的表征能力，无法对具有多种语义层次的数据集进行建模。数据集固有的分层语义结构是将语义相关的聚类整合到更大、更粗粒度的聚类中所必需的，但现有的对比学习方法往往忽略了这一点，从而限制了对语义的理解。为了解决这个问题，我们构建并更新了具有跨级群组辨别能力的分层原型，以表示潜空间中的语义结构。我们在四个标准数据集上进行的实验表明，与 SOTA 基线相比，在 5 到 100 个不同任务序列中的性能均有所提高，遗忘几乎为零。

{"title":"HiProIBM: unsupervised continual learning through hierarchical prototypical cross-level discrimination along with information bottleneck subnetwork masking","authors":"Ankit Malviya, Chandresh Kumar Maurya","doi":"10.1007/s10489-025-06362-z","DOIUrl":"10.1007/s10489-025-06362-z","url":null,"abstract":"<p>Catastrophic Forgetting (CF) occurs when a machine learning model forgets the experience of previous tasks while learning new tasks due to inadequate retention mechanisms. Unsupervised continual learning (UCL) addresses this by enabling the model to adapt to new tasks using unlabeled data while retaining past knowledge. To mitigate CF in UCL, we use a parameter isolation technique to mask sub-networks dedicated to each task, thus preventing interference with previous tasks. However, relying solely on weight magnitude for constructing these sub-networks can result in the retention of irrelevant weights and the creation of redundant sub-networks. This approach also risks capacity saturation and information suppression for tasks encountered later in the sequence. To overcome this, we use masked sub-networks, inspired by the information bottleneck (IB) concept. It accumulates valuable information into essential weights to construct redundancy-free sub-networks which effectively mitigates CF and enables the new task training. The IB subnetwork masking faces challenges in balancing input compression with meaningful pattern preservation without labels. It risks overcompression and loss of crucial latent structures, which degrades model performance. We address this by learning multiple semantic hierarchies present in the data using unsupervised contrastive learning. However traditional contrastive learning techniques learn meaningful representations by contrasting similar and dissimilar data points. These approaches lack adequate representational power for modeling datasets with multiple semantic hierarchies. The inherent hierarchical semantic structures in datasets are necessary to integrate semantically related clusters into larger, coarser-grained clusters, but existing contrastive learning methods often overlook this and limit semantic understanding. We address this by constructing and updating hierarchical prototypes with cross-level group discrimination to represent semantic structures in the latent space. Our experiments on four standard datasets show performance improvements over SOTA baselines for varying task-sequences from 5 to 100, with nearly-zero forgetting.</p>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 6","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143489491","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A framework for solving bias in graph-based recommender systems with a causal perspective 从因果角度解决图式推荐系统偏差的框架

IF 3.4 2区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Applied Intelligence

Pub Date : 2025-02-25 DOI: 10.1007/s10489-025-06388-3

Kewu Yang, Guogang Li, Linjia Wang, Jianrong Xie

Recommendation systems founded on graph neural networks (GNN) have been extensively employed because of their exceptional recommendation efficiency. Nevertheless, numerous recommendation biases also crop up, We have observed that delicate details such as gender and age are frequently implicitly apprehended by recommendation systems, culminating in unfair recommendations, and the associated algorithms of GNN will magnify this bias. To tackle these difficulties, this paper puts forth a method of introducing the notion of causal fairness into the issue of fairness in GNN-based recommendation systems, to accomplish counterfactual fairness of user-sensitive information and thereby attain unbiased recommendations. Specifically, given a GNN-based recommendation system model, which is implemented in our devised fairness framework, chiefly obtaining equitable effects through two facets: (1) attaining user embedding fairness through the counterfactual fairness technique; (2) mitigating the prejudiced impact caused by the GNN algorithm using the proposed central association subgraph method. The amalgamation of these two facets ultimately delivers unbiased recommendations. The effectiveness and sophistication of our proposed method for mitigating partiality problems in GNN recommendation systems from a causal perspective (MGRC) have been proven via experiments on four real-world datasets.

{"title":"A framework for solving bias in graph-based recommender systems with a causal perspective","authors":"Kewu Yang, Guogang Li, Linjia Wang, Jianrong Xie","doi":"10.1007/s10489-025-06388-3","DOIUrl":"10.1007/s10489-025-06388-3","url":null,"abstract":"<div><p>Recommendation systems founded on graph neural networks (GNN) have been extensively employed because of their exceptional recommendation efficiency. Nevertheless, numerous recommendation biases also crop up, We have observed that delicate details such as gender and age are frequently implicitly apprehended by recommendation systems, culminating in unfair recommendations, and the associated algorithms of GNN will magnify this bias. To tackle these difficulties, this paper puts forth a method of introducing the notion of causal fairness into the issue of fairness in GNN-based recommendation systems, to accomplish counterfactual fairness of user-sensitive information and thereby attain unbiased recommendations. Specifically, given a GNN-based recommendation system model, which is implemented in our devised fairness framework, chiefly obtaining equitable effects through two facets: (1) attaining user embedding fairness through the counterfactual fairness technique; (2) mitigating the prejudiced impact caused by the GNN algorithm using the proposed central association subgraph method. The amalgamation of these two facets ultimately delivers unbiased recommendations. The effectiveness and sophistication of our proposed method for mitigating partiality problems in GNN recommendation systems from a causal perspective (MGRC) have been proven via experiments on four real-world datasets.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 6","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143481134","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

DSFL: a blockchain-based data sharing and federated learning framework DSFL：基于区块链的数据共享和联合学习框架

IF 3.4 2区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Applied Intelligence

Pub Date : 2025-02-25 DOI: 10.1007/s10489-025-06400-w

Haiqian Niu, Xing Zhang, Zhiguang Chu, Wei Shi

The massive amount of data generated by the proliferation of Internet of Things (IoT) devices has become one of the key factors driving the advancement of artificial intelligence (AI) technology. However, the lack of storage space and limited computational power of edge devices make it difficult to directly process large data volumes or run complex machine learning algorithms on these devices. At the same time, existing Federated Learning (FL) schemes still face a number of shortcomings, including a single point of failure, vulnerability to poisoning attacks, and a lack of incentives. To address the above issues, we propose DSFL, a blockchain-based framework for fair data sharing and FL. Specifically, we combine digital envelope technology and one-way accumulator with smart contracts to design fair, secure, and trustworthy data sharing protocols that facilitate edge devices to share data proactively, realize the value of data and reduce storage pressure. In addition, we propose blockchain extension schemes suitable for coupling with FL to improve training efficiency. Importantly, the node management mechanism and incentive algorithms are designed to effectively monitor and trace the behavior of nodes, and promote the virtuous cycle of model training and the motivation of participants. Experimental results show that DSFL is able to ensure fair data sharing and efficient model training without the involvement of trusted third parties. In particular, it is able to achieve model accuracy close to that of existing popular schemes even when 40% of the nodes are lazy, providing an excellent defense against malicious nodes.

{"title":"DSFL: a blockchain-based data sharing and federated learning framework","authors":"Haiqian Niu, Xing Zhang, Zhiguang Chu, Wei Shi","doi":"10.1007/s10489-025-06400-w","DOIUrl":"10.1007/s10489-025-06400-w","url":null,"abstract":"<div><p>The massive amount of data generated by the proliferation of Internet of Things (IoT) devices has become one of the key factors driving the advancement of artificial intelligence (AI) technology. However, the lack of storage space and limited computational power of edge devices make it difficult to directly process large data volumes or run complex machine learning algorithms on these devices. At the same time, existing Federated Learning (FL) schemes still face a number of shortcomings, including a single point of failure, vulnerability to poisoning attacks, and a lack of incentives. To address the above issues, we propose DSFL, a blockchain-based framework for fair data sharing and FL. Specifically, we combine digital envelope technology and one-way accumulator with smart contracts to design fair, secure, and trustworthy data sharing protocols that facilitate edge devices to share data proactively, realize the value of data and reduce storage pressure. In addition, we propose blockchain extension schemes suitable for coupling with FL to improve training efficiency. Importantly, the node management mechanism and incentive algorithms are designed to effectively monitor and trace the behavior of nodes, and promote the virtuous cycle of model training and the motivation of participants. Experimental results show that DSFL is able to ensure fair data sharing and efficient model training without the involvement of trusted third parties. In particular, it is able to achieve model accuracy close to that of existing popular schemes even when 40% of the nodes are lazy, providing an excellent defense against malicious nodes.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 6","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143481135","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

FreqFaceNet: an enhanced transformer architecture with dual-order frequency attention for deepfake detection

IF 3.4 2区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Applied Intelligence

Pub Date : 2025-02-25 DOI: 10.1007/s10489-024-06168-5

Varun Gupta, Vaibhav Srivastava, Ankit Yadav, Dinesh Kumar Vishwakarma, Narendra Kumar

With the advent of AI-based image synthesis tools and techniques, Deepfakes have become a serious problem as they pose a massive threat to one’s information security and personal privacy. Several architectures have been proposed to achieve robust Deep Fake detection. However, these methods suffer a drastic drop in performance if the images are visually degraded or have low resolution. To resolve these two issues, a novel FreqFaceNet model has been proposed that employs two novel attentions namely, Wavelet Attention and Fourier Attention, for extracting important frequency-based features from low-resolution images. The extraction of frequency-based features ensures minimal interference of noise due to image compression or low resolution. The proposed model excels on two public benchmark datasets—the DFDC and CelebDF. On the DFDC dataset, FreqFaceNet achieves 98.041% accuracy, an AUC value of 99.748, and a Mathews Correlation Coefficient (MCC) value of 93.857, while on the CelebDF dataset, it obtains an accuracy of 98.325%, an AUC value of 99.81, and an MCC value of 92.819. Qualitative analysis of the proposed model indicates strong classification capabilities. An ablation study has also been conducted to verify the complementary contributions of both Wavelet and Fourier Attention mechanisms.

{"title":"FreqFaceNet: an enhanced transformer architecture with dual-order frequency attention for deepfake detection","authors":"Varun Gupta, Vaibhav Srivastava, Ankit Yadav, Dinesh Kumar Vishwakarma, Narendra Kumar","doi":"10.1007/s10489-024-06168-5","DOIUrl":"10.1007/s10489-024-06168-5","url":null,"abstract":"<div><p>With the advent of AI-based image synthesis tools and techniques, Deepfakes have become a serious problem as they pose a massive threat to one’s information security and personal privacy. Several architectures have been proposed to achieve robust Deep Fake detection. However, these methods suffer a drastic drop in performance if the images are visually degraded or have low resolution. To resolve these two issues, a novel <i>FreqFaceNet</i> model has been proposed that employs two novel attentions namely, <i>Wavelet Attention</i> and <i>Fourier Attention</i>, for extracting important frequency-based features from low-resolution images. The extraction of frequency-based features ensures minimal interference of noise due to image compression or low resolution. The proposed model excels on two public benchmark datasets—the DFDC and CelebDF. On the DFDC dataset, FreqFaceNet achieves 98.041% accuracy, an AUC value of 99.748, and a Mathews Correlation Coefficient (MCC) value of 93.857, while on the CelebDF dataset, it obtains an accuracy of 98.325%, an AUC value of 99.81, and an MCC value of 92.819. Qualitative analysis of the proposed model indicates strong classification capabilities. An ablation study has also been conducted to verify the complementary contributions of both Wavelet and Fourier Attention mechanisms.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 6","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143489487","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

PAG-Unet: multi-task dense scene understanding with pixel-attention-guided Unet

IF 3.4 2区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Applied Intelligence

Pub Date : 2025-02-24 DOI: 10.1007/s10489-025-06389-2

Yi Xu, Changhao Li

Multi-task dense scene understanding is a fundamental research area in computer vision (CV). By predicting pixels, perceiving, and reasoning about multiple related tasks, it improves both accuracy and data efficiency. However, it faces the challenge that some tasks may require more independent feature representations, and excessive sharing can lead to interference between tasks. To address this issue, we propose a novel Pixel-Attention-Guided Unet (PAG-Unet). PAG-Unet incorporates a Pixel-Attention-Guided Fusion module (PAG Fusion) and a Multi-Task Self-Attention module (MTSA) to enhance task-specific feature extraction and reduce task interference. PAG Fusion leverages the relationship between shallow and deep features by using task-specific deep features to calibrate the distribution of shared shallow features. This suppresses background noise and enhances semantic features, thereby fully extracting task-specific features for different tasks and achieving feature enhancement. MTSA considers both global and local spatial interactions for each task during task interactions, capturing task-specific information and compensating for the loss of crucial details, thus improving prediction accuracy for each task. Our method achieves superior multi-task performance on the New York University Depth v2(NYUD-v2) and PASCAL Visual Object Classes Context(PASCAL-Context) datasets, with most metrics significantly outperforming previous state-of-the-art methods. The code is available at https://github.com/UPLI-123/Pag-Unet.

多任务密集场景理解是计算机视觉（CV）的一个基础研究领域。通过预测像素、感知和推理多个相关任务，它可以提高准确性和数据效率。然而，它面临的挑战是，有些任务可能需要更多独立的特征表征，而过度共享会导致任务间的干扰。为了解决这个问题，我们提出了一种新颖的像素注意力引导式 Unet（PAG-Unet）。PAG-Unet 融合了像素注意力引导融合模块（PAG Fusion）和多任务自我注意力模块（MTSA），以加强特定任务的特征提取并减少任务干扰。PAG 融合利用浅层和深层特征之间的关系，使用特定任务的深层特征来校准共享浅层特征的分布。这可以抑制背景噪声，增强语义特征，从而充分提取不同任务的特定特征，实现特征增强。在任务交互过程中，MTSA 会考虑每个任务的全局和局部空间交互，从而捕捉特定任务信息，弥补关键细节的损失，从而提高每个任务的预测准确性。我们的方法在纽约大学深度 v2（NYUD-v2）和 PASCAL Visual Object Classes Context（PASCAL-Context）数据集上实现了卓越的多任务性能，大多数指标都明显优于以前的先进方法。代码可在 https://github.com/UPLI-123/Pag-Unet 上获取。

{"title":"PAG-Unet: multi-task dense scene understanding with pixel-attention-guided Unet","authors":"Yi Xu, Changhao Li","doi":"10.1007/s10489-025-06389-2","DOIUrl":"10.1007/s10489-025-06389-2","url":null,"abstract":"<div><p>Multi-task dense scene understanding is a fundamental research area in computer vision (CV). By predicting pixels, perceiving, and reasoning about multiple related tasks, it improves both accuracy and data efficiency. However, it faces the challenge that some tasks may require more independent feature representations, and excessive sharing can lead to interference between tasks. To address this issue, we propose a novel Pixel-Attention-Guided Unet (PAG-Unet). PAG-Unet incorporates a Pixel-Attention-Guided Fusion module (PAG Fusion) and a Multi-Task Self-Attention module (MTSA) to enhance task-specific feature extraction and reduce task interference. PAG Fusion leverages the relationship between shallow and deep features by using task-specific deep features to calibrate the distribution of shared shallow features. This suppresses background noise and enhances semantic features, thereby fully extracting task-specific features for different tasks and achieving feature enhancement. MTSA considers both global and local spatial interactions for each task during task interactions, capturing task-specific information and compensating for the loss of crucial details, thus improving prediction accuracy for each task. Our method achieves superior multi-task performance on the New York University Depth v2(NYUD-v2) and PASCAL Visual Object Classes Context(PASCAL-Context) datasets, with most metrics significantly outperforming previous state-of-the-art methods. The code is available at https://github.com/UPLI-123/Pag-Unet.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 6","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-02-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143475209","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

DropMismatch: removing mismatched UI elements for better pixel to code generation

IF 3.4 2区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Applied Intelligence

Pub Date : 2025-02-24 DOI: 10.1007/s10489-025-06384-7

Ming Li, Tao Lin

Automating the generation of user interface (UI) code from design images has gained significant attention due to its potential to streamline application development. However, the effectiveness of deep learning models in this domain is often hindered by mismatches between UI images and their corresponding layout code, a common issue in image-text datasets. In this paper, we introduce a framework that locates and removes these mismatches, thereby improving the accuracy of UI code generation models. Our approach leverages a convolutional neural network to predict the alignment between UI components and layout code nodes, coupled with a tree-based heuristic algorithm to localize mismatches. Through extensive evaluation, we demonstrate that our method enhances the accuracy of UI code generation by approximately 15%, while significantly reducing the need for costly manual annotations. The proposed framework not only advances the state of automated UI code generation but also lays the foundation for creating high-quality, large-scale UI datasets, essential for future research and development in this field.

引用次数: 0

Analysis of deep non-smooth symmetric nonnegative matrix factorization on hierarchical clustering

IF 3.4 2区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Applied Intelligence

Pub Date : 2025-02-24 DOI: 10.1007/s10489-025-06367-8

Shunli Li, Linzhang Lu, Qilong Liu, Zhen Chen

Deep matrix factorization (deep MF) is an increasingly popular unsupervised data-mining technique that operates as a deep decomposition rooted in traditional nonnegative matrix factorization (NMF). Compared with standard NMF, deep MF has shown excellent performance in the extraction of hierarchical information from complex datasets. For cases in which the data matrices corresponding to the dataset are symmetric—such as the adjacency matrix of an undirected graph in network analysis—this paper proposes a deep MF variant called deep non-smooth nonnegative symmetric matrix factorization (DNSSNMF). The aim of this work is to enhance the extraction of complex hierarchical structures in high-dimensional datasets and achieve the clustering of structures inherent in graphical representations by improving the goodness-of-fit of the factor matrix product. Accordingly, we successfully applied DNSSNMF to post-traumatic-stress-disorder (PTSD) datasets and synthetic datasets to extract several hierarchical communities. In particular, we extracted non-disjoint communities in the partial correlation network of psychiatric symptoms in PTSD, revealing correlations between different symptoms and leading to meaningful clinical interpretations. The results of our numerical experiments indicated promising applications of DNSSNMF in fields including network analysis and medicine.

{"title":"Analysis of deep non-smooth symmetric nonnegative matrix factorization on hierarchical clustering","authors":"Shunli Li, Linzhang Lu, Qilong Liu, Zhen Chen","doi":"10.1007/s10489-025-06367-8","DOIUrl":"10.1007/s10489-025-06367-8","url":null,"abstract":"<div><p>Deep matrix factorization (deep MF) is an increasingly popular unsupervised data-mining technique that operates as a deep decomposition rooted in traditional nonnegative matrix factorization (NMF). Compared with standard NMF, deep MF has shown excellent performance in the extraction of hierarchical information from complex datasets. For cases in which the data matrices corresponding to the dataset are symmetric—such as the adjacency matrix of an undirected graph in network analysis—this paper proposes a deep MF variant called deep non-smooth nonnegative symmetric matrix factorization (DNSSNMF). The aim of this work is to enhance the extraction of complex hierarchical structures in high-dimensional datasets and achieve the clustering of structures inherent in graphical representations by improving the goodness-of-fit of the factor matrix product. Accordingly, we successfully applied DNSSNMF to post-traumatic-stress-disorder (PTSD) datasets and synthetic datasets to extract several hierarchical communities. In particular, we extracted non-disjoint communities in the partial correlation network of psychiatric symptoms in PTSD, revealing correlations between different symptoms and leading to meaningful clinical interpretations. The results of our numerical experiments indicated promising applications of DNSSNMF in fields including network analysis and medicine.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 6","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-02-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143475228","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Self-supervised heterogeneous graph neural network based on deep and broad neighborhood encoding

IF 3.4 2区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Applied Intelligence

Pub Date : 2025-02-22 DOI: 10.1007/s10489-025-06348-x

Qianyu Song, Chao Li, Jinhu Fu, Qingtian Zeng, Nengfu Xie

Self-supervised heterogeneous graph neural networks have shown remarkable effectiveness in addressing the challenge of limited labeled data. However, current contrastive learning methods face limitations in leveraging neighborhood information for each node. Some approaches utilize the local information of the target node, ignoring useful signals from deeper neighborhoods. On the other hand, simply stacking convolutional layers to expand the neighborhood inevitably leads to over-smoothing. To address the problems, we propose HGNN-DB, a Self-supervised Heterogeneous Graph Neural Network Based on Deep and Broad Neighborhood Encoding to tackle the over-smoothing problem within heterogeneous graphs. Specifically, HGNN-DB aims to learn informative node representations by incorporating both deep and broad neighborhoods. We introduce a deep neighborhood encoder with a distance-weighted strategy to capture deep features of target nodes. Additionally, a single-layer graph convolutional network is employed for the broad neighborhood encoder to aggregate broad features of target nodes. Furthermore, we introduce a collaborative contrastive mechanism to learn the complementarity and potential invariance between the two views of neighborhood information. Experimental results on four real-world datasets and seven baselines demonstrate that our model significantly outperforms the current state-of-the-art techniques on multiple downstream tasks. The codes and datasets for this work are available at https://github.com/SSQiana/HGNN-DB.

{"title":"Self-supervised heterogeneous graph neural network based on deep and broad neighborhood encoding","authors":"Qianyu Song, Chao Li, Jinhu Fu, Qingtian Zeng, Nengfu Xie","doi":"10.1007/s10489-025-06348-x","DOIUrl":"10.1007/s10489-025-06348-x","url":null,"abstract":"<div><p>Self-supervised heterogeneous graph neural networks have shown remarkable effectiveness in addressing the challenge of limited labeled data. However, current contrastive learning methods face limitations in leveraging neighborhood information for each node. Some approaches utilize the local information of the target node, ignoring useful signals from deeper neighborhoods. On the other hand, simply stacking convolutional layers to expand the neighborhood inevitably leads to over-smoothing. To address the problems, we propose HGNN-DB, a Self-supervised <u>H</u>eterogeneous <u>G</u>raph <u>N</u>eural <u>N</u>etwork Based on <u>D</u>eep and <u>B</u>road Neighborhood Encoding to tackle the over-smoothing problem within heterogeneous graphs. Specifically, HGNN-DB aims to learn informative node representations by incorporating both deep and broad neighborhoods. We introduce a deep neighborhood encoder with a distance-weighted strategy to capture deep features of target nodes. Additionally, a single-layer graph convolutional network is employed for the broad neighborhood encoder to aggregate broad features of target nodes. Furthermore, we introduce a collaborative contrastive mechanism to learn the complementarity and potential invariance between the two views of neighborhood information. Experimental results on four real-world datasets and seven baselines demonstrate that our model significantly outperforms the current state-of-the-art techniques on multiple downstream tasks. The codes and datasets for this work are available at https://github.com/SSQiana/HGNN-DB.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 6","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143465956","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Learning discriminative features for multi-hop knowledge graph reasoning

IF 3.4 2区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Applied Intelligence

Pub Date : 2025-02-22 DOI: 10.1007/s10489-025-06327-2

Hao Liu, Dong Li, Bing Zeng, Yang Xu

Reinforcement learning (RL)-based multi-hop knowledge graph reasoning has achieved remarkable success in real-world applications and can effectively handle knowledge graph completion tasks. This approach involves a policy-based agent navigating the graph environment to extend reasoning paths and identify the target entity. However, most existing multi-hop reasoning models are typically constrained to stepwise inference, which inherently disrupts the global information of multi-hop paths. To overcome this limitation, we introduce discriminative features between valid and invalid paths as global information. Here, we propose a multi-hop path encoder specifically designed to extract these discriminative features. Firstly, a multi-hop path encoding module is employed to derive each path’s hidden features, using cross-attention mechanisms to strengthen the interaction between triple context and path features. Secondly, a discriminative feature extraction module is used to capture the differences between valid and invalid paths. Thirdly, an attention-enhanced gated fusion mechanism is implemented to integrate these discriminative features into the multi-hop inference decoder. We further evaluate our method on five standard datasets. Our method outperforms the baseline models, demonstrating the effectiveness of discriminative features in improving prediction performance, learning speed, and path interpretability.

{"title":"Learning discriminative features for multi-hop knowledge graph reasoning","authors":"Hao Liu, Dong Li, Bing Zeng, Yang Xu","doi":"10.1007/s10489-025-06327-2","DOIUrl":"10.1007/s10489-025-06327-2","url":null,"abstract":"<div><p>Reinforcement learning (RL)-based multi-hop knowledge graph reasoning has achieved remarkable success in real-world applications and can effectively handle knowledge graph completion tasks. This approach involves a policy-based agent navigating the graph environment to extend reasoning paths and identify the target entity. However, most existing multi-hop reasoning models are typically constrained to stepwise inference, which inherently disrupts the global information of multi-hop paths. To overcome this limitation, we introduce discriminative features between valid and invalid paths as global information. Here, we propose a multi-hop path encoder specifically designed to extract these discriminative features. Firstly, a multi-hop path encoding module is employed to derive each path’s hidden features, using cross-attention mechanisms to strengthen the interaction between triple context and path features. Secondly, a discriminative feature extraction module is used to capture the differences between valid and invalid paths. Thirdly, an attention-enhanced gated fusion mechanism is implemented to integrate these discriminative features into the multi-hop inference decoder. We further evaluate our method on five standard datasets. Our method outperforms the baseline models, demonstrating the effectiveness of discriminative features in improving prediction performance, learning speed, and path interpretability.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 6","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143465957","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Feature optimization based on multi-order fusion and adaptive recursive elimination for motion classification in doppler radar 基于多阶融合和自适应递归消除的特征优化，用于多普勒雷达的运动分类

IF 3.4 2区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Applied Intelligence

Pub Date : 2025-02-22 DOI: 10.1007/s10489-025-06342-3

Tong Sun, Yipeng Ding, Yuxin Chen, Lv Ping

Radar-based human motion recognition (HMR) technology has gained substantial importance across diverse domains such as security surveillance, post-disaster search and rescue operations, and the development of smart home environments. The intricate nature of human movements generates radar echo signals with pronounced non-stationary attributes, which encapsulate a wealth of target feature data. However, striking a balance between the precision of motion recognition and the requirement for real-time processing, especially in the context of extracting meaningful features from radar signals, remains a formidable challenge. This research paper introduces a novel approach to tackle this challenge. Firstly,we apply the multi-order fractional Fourier transform (m-FRFT) to radar echo signals, facilitating the extraction of micro-Doppler (m-D) frequency information. Secondly, we have developed an optimized feature selection model named MPG, which stands for m-D parameter screening based on genetic algorithm (GA) and adaptive weight particle swarm optimization (AWPSO). Thirdly, we apply the MPG model to the recursive feature elimination (RFE) algorithm to refine the representation of m-D frequency information, allowing for adaptive parameter adjustment and effective feature dimensionality reduction. The proposed method has been tested using human motion echo data collected from a Doppler radar prototype. The experimental outcomes demonstrate that our approach outperforms traditional feature extraction methods in terms of reducing feature dimensionality, computational efficiency, and classification accuracy.

{"title":"Feature optimization based on multi-order fusion and adaptive recursive elimination for motion classification in doppler radar","authors":"Tong Sun, Yipeng Ding, Yuxin Chen, Lv Ping","doi":"10.1007/s10489-025-06342-3","DOIUrl":"10.1007/s10489-025-06342-3","url":null,"abstract":"<div><p>Radar-based human motion recognition (HMR) technology has gained substantial importance across diverse domains such as security surveillance, post-disaster search and rescue operations, and the development of smart home environments. The intricate nature of human movements generates radar echo signals with pronounced non-stationary attributes, which encapsulate a wealth of target feature data. However, striking a balance between the precision of motion recognition and the requirement for real-time processing, especially in the context of extracting meaningful features from radar signals, remains a formidable challenge. This research paper introduces a novel approach to tackle this challenge. Firstly,we apply the multi-order fractional Fourier transform (m-FRFT) to radar echo signals, facilitating the extraction of micro-Doppler (m-D) frequency information. Secondly, we have developed an optimized feature selection model named MPG, which stands for m-D parameter screening based on genetic algorithm (GA) and adaptive weight particle swarm optimization (AWPSO). Thirdly, we apply the MPG model to the recursive feature elimination (RFE) algorithm to refine the representation of m-D frequency information, allowing for adaptive parameter adjustment and effective feature dimensionality reduction. The proposed method has been tested using human motion echo data collected from a Doppler radar prototype. The experimental outcomes demonstrate that our approach outperforms traditional feature extraction methods in terms of reducing feature dimensionality, computational efficiency, and classification accuracy.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 6","pages":""},"PeriodicalIF":3.4,"publicationDate":"2025-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143471959","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0