首页 > 最新文献

Applied Intelligence最新文献

英文 中文
An adaptive dynamic multi-objective evolutionary algorithm using two-stage correlation-based decision variable analysis 基于两阶段相关决策变量分析的自适应动态多目标进化算法
IF 3.5 2区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-09 DOI: 10.1007/s10489-025-07069-x
Tianyu Liu, Siyue Xu, Xiangfei Wu

In recent years, dynamic multi-objective optimization problems (DMOPs) have received increasing attention, and numerous strategies have emerged to solve such problems. However, in most existing algorithms, the specific roles of each decision variable are often overlooked. This article proposes an adaptive dynamic multi-objective optimization algorithm utilizing two-stage correlation-based decision variable analysis named ADMOEA-TSCDVA. In the proposed decision variable analysis strategy, the decision variables are initially divided into convergence-related and diversity-related variables based on correlation analysis in the first stage. Subsequently, in the second stage, diversity-related variables are further classified into simple diversity-related and complex diversity-related variables. The results of the decision variable analysis will be used in a dual-model initialization strategy to guide the algorithm in generating a high-quality initial population when changes occur, and in an adaptive strategy to select a suitable static optimization algorithm when the environment remains unchanged. Comprehensive comparative experiments were conducted on two DMOP benchmark test suites against five state-of-the-art algorithms. The results show that the proposed ADMOEA-TSCDVA achieved the best MIGD and MHV results in 63% and 64% of the 57 test cases, demonstrating its excellent performance in dynamic multi-objective optimization scenarios.

近年来,动态多目标优化问题越来越受到人们的关注,并出现了许多解决动态多目标优化问题的策略。然而,在大多数现有算法中,每个决策变量的具体作用往往被忽视。本文提出了一种基于两阶段相关决策变量分析的自适应动态多目标优化算法ADMOEA-TSCDVA。在本文提出的决策变量分析策略中,第一阶段基于相关性分析将决策变量初步划分为收敛性相关变量和多样性相关变量。随后,在第二阶段,将多样性相关变量进一步划分为简单多样性相关变量和复杂多样性相关变量。决策变量分析的结果将用于双模型初始化策略,指导算法在发生变化时生成高质量的初始种群;用于自适应策略,在环境保持不变时选择合适的静态优化算法。在两个DMOP基准测试套件上对五种最先进的算法进行了全面的对比实验。结果表明,在57个测试用例中,提出的ADMOEA-TSCDVA在63%和64%的测试用例中获得了最佳的MIGD和MHV结果,证明了其在动态多目标优化场景中的优异性能。
{"title":"An adaptive dynamic multi-objective evolutionary algorithm using two-stage correlation-based decision variable analysis","authors":"Tianyu Liu,&nbsp;Siyue Xu,&nbsp;Xiangfei Wu","doi":"10.1007/s10489-025-07069-x","DOIUrl":"10.1007/s10489-025-07069-x","url":null,"abstract":"<div>\u0000 \u0000 <p>In recent years, dynamic multi-objective optimization problems (DMOPs) have received increasing attention, and numerous strategies have emerged to solve such problems. However, in most existing algorithms, the specific roles of each decision variable are often overlooked. This article proposes an adaptive dynamic multi-objective optimization algorithm utilizing two-stage correlation-based decision variable analysis named ADMOEA-TSCDVA. In the proposed decision variable analysis strategy, the decision variables are initially divided into convergence-related and diversity-related variables based on correlation analysis in the first stage. Subsequently, in the second stage, diversity-related variables are further classified into simple diversity-related and complex diversity-related variables. The results of the decision variable analysis will be used in a dual-model initialization strategy to guide the algorithm in generating a high-quality initial population when changes occur, and in an adaptive strategy to select a suitable static optimization algorithm when the environment remains unchanged. Comprehensive comparative experiments were conducted on two DMOP benchmark test suites against five state-of-the-art algorithms. The results show that the proposed ADMOEA-TSCDVA achieved the best MIGD and MHV results in 63% and 64% of the 57 test cases, demonstrating its excellent performance in dynamic multi-objective optimization scenarios.</p>\u0000 </div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"56 1","pages":""},"PeriodicalIF":3.5,"publicationDate":"2026-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145930735","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Spatiotemporal decoupling-gated transformer: modeling high-dimensional coupling for traffic flow prediction 时空解耦门控变压器:用于交通流预测的高维耦合建模
IF 3.5 2区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-09 DOI: 10.1007/s10489-025-07056-2
Gong Wang, Wei Sun, Junbo Gao, Chunyu Wang, Lai Wei

Traffic flow prediction is a core task in intelligent transportation systems (ITS), challenged by complex spatiotemporal dynamics. Nodes in traffic networks exhibit diverse temporal patterns, including abrupt fluctuations during peak hours or sudden weather changes, while maintaining spatial correlations that evolve over time. Accurate prediction thus requires effectively capturing high-dimensional spatiotemporal dependencies. In this paper, we propose the Spatiotemporal Decoupling-Gated Transformer (STDGformer), whose key principle is decoupling for coupling. Instead of directly modeling entangled spatiotemporal features, STDGformer first decouples temporal and spatial representations using a Gated Temporal Transformer and a Gated Spatial Transformer. This allows the model to capture high- and low-frequency temporal patterns, as well as dynamic and static spatial dependencies, more effectively. Subsequently, a Cross Attention Transformer integrates the decoupled features, reconstructing their joint interactions and enabling precise modeling of multi-dimensional spatiotemporal dependencies. Extensive experiments on four real-world traffic datasets demonstrate that STDGformer consistently outperforms representative baselines, achieving superior accuracy and robustness while providing interpretable insights into temporal and spatial feature contributions. Specifically, STDGformer reduces MAE by 1.1% on PEMS04 and decreases MAPE by 1.2% on PEMS03, respectively.

交通流预测是智能交通系统的核心任务之一,具有复杂的时空动态特征。交通网络中的节点表现出不同的时间模式,包括高峰时段的突然波动或突然的天气变化,同时保持随时间演变的空间相关性。因此,准确的预测需要有效地捕获高维时空依赖关系。本文提出了一种时空解耦门控变压器(STDGformer),其关键原理是为了耦合而解耦。与直接建模纠缠的时空特征不同,STDGformer首先使用门控时间变压器和门控空间变压器对时空表示进行解耦。这使得模型能够更有效地捕获高频和低频的时间模式,以及动态和静态的空间依赖关系。随后,交叉注意转换器集成解耦特征,重建它们的联合交互,从而实现多维时空依赖的精确建模。在四个真实世界交通数据集上进行的大量实验表明,STDGformer始终优于代表性基线,在提供对时间和空间特征贡献的可解释性见解的同时,实现了卓越的准确性和鲁棒性。具体来说,STDGformer降低了PEMS04上1.1%的MAE和PEMS03上1.2%的MAPE。
{"title":"Spatiotemporal decoupling-gated transformer: modeling high-dimensional coupling for traffic flow prediction","authors":"Gong Wang,&nbsp;Wei Sun,&nbsp;Junbo Gao,&nbsp;Chunyu Wang,&nbsp;Lai Wei","doi":"10.1007/s10489-025-07056-2","DOIUrl":"10.1007/s10489-025-07056-2","url":null,"abstract":"<div><p>Traffic flow prediction is a core task in intelligent transportation systems (ITS), challenged by complex spatiotemporal dynamics. Nodes in traffic networks exhibit diverse temporal patterns, including abrupt fluctuations during peak hours or sudden weather changes, while maintaining spatial correlations that evolve over time. Accurate prediction thus requires effectively capturing high-dimensional spatiotemporal dependencies. In this paper, we propose the Spatiotemporal Decoupling-Gated Transformer (STDGformer), whose key principle is decoupling for coupling. Instead of directly modeling entangled spatiotemporal features, STDGformer first decouples temporal and spatial representations using a Gated Temporal Transformer and a Gated Spatial Transformer. This allows the model to capture high- and low-frequency temporal patterns, as well as dynamic and static spatial dependencies, more effectively. Subsequently, a Cross Attention Transformer integrates the decoupled features, reconstructing their joint interactions and enabling precise modeling of multi-dimensional spatiotemporal dependencies. Extensive experiments on four real-world traffic datasets demonstrate that STDGformer consistently outperforms representative baselines, achieving superior accuracy and robustness while providing interpretable insights into temporal and spatial feature contributions. Specifically, STDGformer reduces MAE by 1.1% on PEMS04 and decreases MAPE by 1.2% on PEMS03, respectively.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"56 1","pages":""},"PeriodicalIF":3.5,"publicationDate":"2026-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145930349","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Revisiting multi-view semi-supervised classification: a reinforcement learning perspective 回顾多视图半监督分类:强化学习的视角
IF 3.5 2区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-09 DOI: 10.1007/s10489-025-06929-w
Zhicheng Wei, Zhiling Cai, Zhibin Shi, Zihan Fang, Mingjian Fu, Shiping Wang

Multi-view learning has been capturing widespread attention from a variety of areas, while very limited labeling information poses a great challenge for multi-view semi-supervised classification. Especially, rare labels often require multi-hop label propagation, which leads to over-smoothing and inaccurate prediction. In contrast, reinforcement learning can capture long-term policy series by a reward function. In this paper, we address the semi-supervised multi-view classification problem from a reinforcement learning perspective. The proposed approach frames the classification problem as a process where an agent, guided by reinforcement learning, starts from the state representing the unlabeled sample, transitions to the state corresponding to the label, and completes the label assignment. The reward matrix’s core is defined as the affinity matrix between samples, complemented by one-hot encoding in the label space and an identity matrix. The Q-table is then derived from the rewards in Q-learning, enabling the use of limited label information to effectively mine multi-view data. In the label assignment phase, the agent utilizes the table to eventually reach the state corresponding to the label and assign that label to the sample represented by the initial state. Our method’s effectiveness is confirmed through extensive experiments on multi-view semi-supervised classification tasks compared with several advanced approaches.

多视图学习已经引起了各个领域的广泛关注,然而有限的标注信息给多视图半监督分类带来了很大的挑战。特别是稀有标签往往需要多跳标签传播,导致过度平滑和不准确的预测。相比之下,强化学习可以通过奖励函数捕获长期策略序列。在本文中,我们从强化学习的角度解决了半监督多视图分类问题。提出的方法将分类问题框架为一个过程,其中智能体在强化学习的指导下,从表示未标记样本的状态开始,过渡到与标签对应的状态,并完成标签分配。奖励矩阵的核心定义为样本间的亲和矩阵,并辅以标签空间中的单热编码和单位矩阵。然后从q学习中的奖励中导出q表,从而可以使用有限的标签信息来有效地挖掘多视图数据。在标签分配阶段,代理利用表最终达到标签对应的状态,并将该标签分配给由初始状态表示的样本。通过对多视图半监督分类任务的大量实验,与几种先进方法进行了比较,验证了该方法的有效性。
{"title":"Revisiting multi-view semi-supervised classification: a reinforcement learning perspective","authors":"Zhicheng Wei,&nbsp;Zhiling Cai,&nbsp;Zhibin Shi,&nbsp;Zihan Fang,&nbsp;Mingjian Fu,&nbsp;Shiping Wang","doi":"10.1007/s10489-025-06929-w","DOIUrl":"10.1007/s10489-025-06929-w","url":null,"abstract":"<div><p>Multi-view learning has been capturing widespread attention from a variety of areas, while very limited labeling information poses a great challenge for multi-view semi-supervised classification. Especially, rare labels often require multi-hop label propagation, which leads to over-smoothing and inaccurate prediction. In contrast, reinforcement learning can capture long-term policy series by a reward function. In this paper, we address the semi-supervised multi-view classification problem from a reinforcement learning perspective. The proposed approach frames the classification problem as a process where an agent, guided by reinforcement learning, starts from the state representing the unlabeled sample, transitions to the state corresponding to the label, and completes the label assignment. The reward matrix’s core is defined as the affinity matrix between samples, complemented by one-hot encoding in the label space and an identity matrix. The Q-table is then derived from the rewards in Q-learning, enabling the use of limited label information to effectively mine multi-view data. In the label assignment phase, the agent utilizes the table to eventually reach the state corresponding to the label and assign that label to the sample represented by the initial state. Our method’s effectiveness is confirmed through extensive experiments on multi-view semi-supervised classification tasks compared with several advanced approaches.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"56 1","pages":""},"PeriodicalIF":3.5,"publicationDate":"2026-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145930350","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Attentive Q-matrix learning for knowledge tracing 关注q矩阵学习知识追踪
IF 3.5 2区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-09 DOI: 10.1007/s10489-025-07064-2
Zhongfeng Jia, Wei Su, Jiamin Liu, Wenli Yue

With the rapid development of intelligent tutoring systems (ITSs) in the past decade, tracing students’ knowledge state has become increasingly important for providing individualized learning guidance; this is the main idea of knowledge tracing (KT), which models students’ mastery of knowledge concepts (KCs, skills needed to solve a question) based on their past interactions on platforms. Many KT models have been proposed and have recently shown remarkable performance. However, the majority of these models use concepts to index questions, which implies that the predefined skill tags for each question are required in advance to indicate the specific KCs needed for answering the question correctly; this makes it difficult to apply to large-scale online education platforms where questions are often not well organized by skill tags. In this paper, we propose Q-matrix-based attentive knowledge tracing (QAKT), an end-to-end KT model that uses the attentive approach in situations where predefined skill tags are not available. With a novel hybrid embedding method based on the Q-matrix and Rasch model, the QAKT model is capable of modeling problems hierarchically and learning the Q-matrix efficiently based on students’ sequences. Moreover, the architecture of the QAKT ensures that it is friendly to questions associated with multiple skills and has outstanding interpretability. After conducting experiments on a variety of open datasets, we empirically verified that even without predefined skill tags, our model performs similarly to or even better than the state-of-the-art KT methods, by up to 2% in AUC in some cases. Moreover, our model outperforms existing models that do not require skill tags (by up to 7% in AUC) in predicting future learner responses. The results of further experiments suggest that the Q-matrix learned by the QAKT is highly model-agnostic and more information-sufficient than the one labeled by human experts, which could help with the data mining tasks in existing ITSs.

近十年来,随着智能辅导系统的快速发展,跟踪学生的知识状态对于提供个性化的学习指导变得越来越重要;这是知识追踪(KT)的主要思想,它基于学生过去在平台上的互动来模拟他们对知识概念(KCs,解决问题所需的技能)的掌握。许多KT模型已经被提出,并且最近显示出显著的性能。然而,这些模型中的大多数使用概念来索引问题,这意味着需要预先为每个问题预定义技能标签,以指示正确回答问题所需的特定KCs;这使得它很难应用于大规模的在线教育平台,因为这些平台的问题往往没有按照技能标签很好地组织起来。在本文中,我们提出了基于q矩阵的关注知识跟踪(QAKT),这是一种端到端的KT模型,在预定义的技能标签不可用的情况下使用关注方法。QAKT模型采用了一种基于q矩阵和Rasch模型的新型混合嵌入方法,能够分层建模问题,并根据学生的序列高效地学习q矩阵。此外,QAKT的架构确保了它对与多种技能相关的问题是友好的,并且具有出色的可解释性。在对各种开放数据集进行实验后,我们通过经验验证,即使没有预定义的技能标签,我们的模型的性能与最先进的KT方法相似甚至更好,在某些情况下,AUC高达2%。此外,我们的模型在预测未来学习者的反应方面优于不需要技能标签的现有模型(AUC高达7%)。进一步的实验结果表明,QAKT学习的q矩阵是高度模型不可知的,并且比人类专家标记的q矩阵信息更充足,这可以帮助现有ITSs中的数据挖掘任务。
{"title":"Attentive Q-matrix learning for knowledge tracing","authors":"Zhongfeng Jia,&nbsp;Wei Su,&nbsp;Jiamin Liu,&nbsp;Wenli Yue","doi":"10.1007/s10489-025-07064-2","DOIUrl":"10.1007/s10489-025-07064-2","url":null,"abstract":"<div><p>With the rapid development of intelligent tutoring systems (ITSs) in the past decade, tracing students’ knowledge state has become increasingly important for providing individualized learning guidance; this is the main idea of <i>knowledge tracing (KT)</i>, which models students’ mastery of knowledge concepts (KCs, skills needed to solve a question) based on their past interactions on platforms. Many KT models have been proposed and have recently shown remarkable performance. However, the majority of these models use concepts to index questions, which implies that the predefined skill tags for each question are required in advance to indicate the specific KCs needed for answering the question correctly; this makes it difficult to apply to large-scale online education platforms where questions are often not well organized by skill tags. In this paper, we propose Q-matrix-based attentive knowledge tracing (QAKT), an end-to-end KT model that uses the attentive approach in situations where predefined skill tags are not available. With a novel hybrid embedding method based on the Q-matrix and Rasch model, the QAKT model is capable of modeling problems hierarchically and learning the Q-matrix efficiently based on students’ sequences. Moreover, the architecture of the QAKT ensures that it is friendly to questions associated with multiple skills and has outstanding interpretability. After conducting experiments on a variety of open datasets, we empirically verified that even without predefined skill tags, our model performs similarly to or even better than the state-of-the-art KT methods, by up to 2% in AUC in some cases. Moreover, our model outperforms existing models that do not require skill tags (by up to 7% in AUC) in predicting future learner responses. The results of further experiments suggest that the Q-matrix learned by the QAKT is highly model-agnostic and more information-sufficient than the one labeled by human experts, which could help with the data mining tasks in existing ITSs.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"56 1","pages":""},"PeriodicalIF":3.5,"publicationDate":"2026-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145930352","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DEA statistical analysis and Shannon entropy-driven interval multiplicative probabilistic linguistic group decision-making DEA统计分析与Shannon熵驱动区间乘法概率语言群体决策
IF 3.5 2区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-08 DOI: 10.1007/s10489-025-06977-2
Feifei Jin, Yiping Cao, Jinpei Liu

The aim of this paper is to propose a novel group decision-making (GDM) method that addresses expert weight uncertainty and efficiency interval estimation. Based on interval multiplicative probabilistic linguistic preference relations, this study combines entropy weight method, multiplicative data envelopment analysis (DEA) cross-efficiency and Bootstrap analysis to provide a more accurate and reliable GDM framework. First, logarithmic function is introduced into the entropy method to measure the fuzziness and hesitancy of preference matrix sufficiently and quantify the weight of expert information. Then, a multiplicative DEA cross-efficiency model is adopted for efficiency evaluation of the decision-making units. Logarithmic function is designed in this model to intuitively display efficiency differences. We further develop Bootstrap-DEA for efficiency correction, leveraging its statistical advantages to enhance the robustness and reliability of our GDM method. Finally, a numerical example and comparative analysis are provided to highlight the rationality and effectiveness of the proposed GDM method.

提出了一种解决专家权重不确定性和效率区间估计问题的群体决策方法。本文基于区间乘性概率语言偏好关系,结合熵权法、乘性数据包络分析(DEA)交叉效率和Bootstrap分析,提供了一个更准确、更可靠的GDM框架。首先,在熵值法中引入对数函数,充分度量偏好矩阵的模糊性和犹豫性,量化专家信息的权重;然后,采用乘法DEA交叉效率模型对决策单元进行效率评价。该模型设计了对数函数,直观地显示效率差异。我们进一步发展了Bootstrap-DEA进行效率校正,利用其统计优势来增强我们的GDM方法的鲁棒性和可靠性。最后,通过数值算例和对比分析,验证了GDM方法的合理性和有效性。
{"title":"DEA statistical analysis and Shannon entropy-driven interval multiplicative probabilistic linguistic group decision-making","authors":"Feifei Jin,&nbsp;Yiping Cao,&nbsp;Jinpei Liu","doi":"10.1007/s10489-025-06977-2","DOIUrl":"10.1007/s10489-025-06977-2","url":null,"abstract":"<div><p>The aim of this paper is to propose a novel group decision-making (GDM) method that addresses expert weight uncertainty and efficiency interval estimation. Based on interval multiplicative probabilistic linguistic preference relations, this study combines entropy weight method, multiplicative data envelopment analysis (DEA) cross-efficiency and Bootstrap analysis to provide a more accurate and reliable GDM framework. First, logarithmic function is introduced into the entropy method to measure the fuzziness and hesitancy of preference matrix sufficiently and quantify the weight of expert information. Then, a multiplicative DEA cross-efficiency model is adopted for efficiency evaluation of the decision-making units. Logarithmic function is designed in this model to intuitively display efficiency differences. We further develop Bootstrap-DEA for efficiency correction, leveraging its statistical advantages to enhance the robustness and reliability of our GDM method. Finally, a numerical example and comparative analysis are provided to highlight the rationality and effectiveness of the proposed GDM method.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"56 1","pages":""},"PeriodicalIF":3.5,"publicationDate":"2026-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145930698","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
KA-AttLSTMnet: a Kolmogorov-Arnold attentional architecture for egocentric navigation prediction from hippocampal CA1 spikes KA-AttLSTMnet:一种基于海马CA1峰的自我中心导航预测的Kolmogorov-Arnold注意结构
IF 3.5 2区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-08 DOI: 10.1007/s10489-025-07070-4
Xiaolong Wu, Jianhong Yang, Zhanhong Du

In the field of neuroscience, it has been confirmed that the hippocampal CA1 plays a crucial role in spatial navigation. Establishing an appropriate deep learning architecture to replicate this process can not only more accurately predict behavior but also better understand the working mechanisms of CA1. This study utilizes an open-source dataset to predict rat immediate behavior by constructing a hybrid network capable of processing spatial-temporal information and dynamically activating over input. Specifically, the KA-AttLSTMnet architecture is proposed to replicate the CA1’s navigation function. First, this study employs spike-related methods to encode CA1 activity and represents a total of eight open-field behavioral states under allocentric/egocentric strategies. Then, spike encoding and behavioral states are used as inputs and outputs, respectively, to establish eight different immediate behavior prediction models. Finally, in terms of model structure, a self-attention mechanism architecture based on recurrent neural network (RNN) is built, and Kolmogorov-Arnold network (KAN) is employed to further dynamically adjust the architecture, which already possesses spatial-temporal processing capabilities, to enhance the extraction of CA1 neural activity information and improve the prediction of the rat’s immediate behavior. By comparing the prediction results of the eight behavioral states, only two behaviors, direction of turn ({{{varvec{b}}}_{{varvec{t}}}}^{{varvec{i}}}) and speed ({{b}_{s}}^{i}), achieved relatively stable predictions, indicating that the CA1 internal circuit is more inclined to fully reflect an egocentric strategy rather than an allocentric one. Ablation experiments demonstrated that the unrolled-LSTM network is more effective in processing spike encoding over time and organizing it over space. Additionally, the mean squared error (MSE) for predicting ({{{varvec{b}}}_{{varvec{t}}}}^{{varvec{i}}}) and ({{b}_{s}}^{i}) decreased from 0.4203/0.0435 to 0.3687/0.0150, and eventually to 0.3255/0.0110. This reduction highlights the positive impact of the multi-head self-attention mechanism in RNN (AttLSTMnet) for extracting contextual information, as well as the dynamic regulation capability of neurons in the KAN (KA-AttLSTMnet), which differs from the traditional weight-activation function mechanism, both of which contribute significantly to improving the immediate behavior prediction model.

在神经科学领域,已经证实海马体CA1在空间导航中起着至关重要的作用。建立适当的深度学习架构来复制这一过程,不仅可以更准确地预测行为,还可以更好地理解CA1的工作机制。本研究利用一个开源数据集,通过构建一个能够处理时空信息和动态激活输入的混合网络来预测大鼠的即时行为。具体来说,提出了KA-AttLSTMnet体系结构来复制CA1的导航功能。首先,本研究采用与峰值相关的方法编码CA1活性,表征了异中心/自我中心策略下的8种开放场行为状态。然后,以尖峰编码和行为状态分别作为输入和输出,建立了8种不同的即时行为预测模型。最后,在模型结构方面,构建了基于递归神经网络(RNN)的自注意机制架构,并利用Kolmogorov-Arnold网络(KAN)对已经具备时空处理能力的自注意机制架构进行进一步动态调整,增强了CA1神经活动信息的提取,提高了对大鼠即时行为的预测能力。通过对比8种行为状态的预测结果,只有转弯方向({{{varvec{b}}}_{{varvec{t}}}}^{{varvec{i}}})和速度({{b}_{s}}^{i})两种行为的预测相对稳定,说明CA1内部电路更倾向于充分反映自我中心策略,而不是非中心策略。消融实验表明,展开lstm网络在处理脉冲编码和空间组织方面更有效。此外,预测({{{varvec{b}}}_{{varvec{t}}}}^{{varvec{i}}})和({{b}_{s}}^{i})的均方误差(MSE)从0.4203/0.0435下降到0.3687/0.0150,最终下降到0.3255/0.0110。这一约简突出了RNN中的多头自注意机制(AttLSTMnet)在提取上下文信息方面的积极作用,以及KAN中神经元的动态调节能力(KA-AttLSTMnet),它不同于传统的权重激活函数机制,两者都对改进即时行为预测模型有重要贡献。
{"title":"KA-AttLSTMnet: a Kolmogorov-Arnold attentional architecture for egocentric navigation prediction from hippocampal CA1 spikes","authors":"Xiaolong Wu,&nbsp;Jianhong Yang,&nbsp;Zhanhong Du","doi":"10.1007/s10489-025-07070-4","DOIUrl":"10.1007/s10489-025-07070-4","url":null,"abstract":"<div><p>In the field of neuroscience, it has been confirmed that the hippocampal CA1 plays a crucial role in spatial navigation. Establishing an appropriate deep learning architecture to replicate this process can not only more accurately predict behavior but also better understand the working mechanisms of CA1. This study utilizes an open-source dataset to predict rat immediate behavior by constructing a hybrid network capable of processing spatial-temporal information and dynamically activating over input. Specifically, the KA-AttLSTMnet architecture is proposed to replicate the CA1’s navigation function. First, this study employs spike-related methods to encode CA1 activity and represents a total of eight open-field behavioral states under allocentric/egocentric strategies. Then, spike encoding and behavioral states are used as inputs and outputs, respectively, to establish eight different immediate behavior prediction models. Finally, in terms of model structure, a self-attention mechanism architecture based on recurrent neural network (RNN) is built, and Kolmogorov-Arnold network (KAN) is employed to further dynamically adjust the architecture, which already possesses spatial-temporal processing capabilities, to enhance the extraction of CA1 neural activity information and improve the prediction of the rat’s immediate behavior. By comparing the prediction results of the eight behavioral states, only two behaviors, direction of turn <span>({{{varvec{b}}}_{{varvec{t}}}}^{{varvec{i}}})</span> and speed <span>({{b}_{s}}^{i})</span>, achieved relatively stable predictions, indicating that the CA1 internal circuit is more inclined to fully reflect an egocentric strategy rather than an allocentric one. Ablation experiments demonstrated that the unrolled-LSTM network is more effective in processing spike encoding over time and organizing it over space. Additionally, the mean squared error (MSE) for predicting <span>({{{varvec{b}}}_{{varvec{t}}}}^{{varvec{i}}})</span> and <span>({{b}_{s}}^{i})</span> decreased from 0.4203/0.0435 to 0.3687/0.0150, and eventually to 0.3255/0.0110. This reduction highlights the positive impact of the multi-head self-attention mechanism in RNN (AttLSTMnet) for extracting contextual information, as well as the dynamic regulation capability of neurons in the KAN (KA-AttLSTMnet), which differs from the traditional weight-activation function mechanism, both of which contribute significantly to improving the immediate behavior prediction model.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"56 1","pages":""},"PeriodicalIF":3.5,"publicationDate":"2026-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145930451","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DA-NGF: A Domain-Adaptive Neurogenerative Framework for Few-Shot Railway Fastener Defect Synthesis and Transferable Representation Learning 基于DA-NGF的区域自适应神经生成框架,用于铁路扣件缺陷综合和可转移表征学习
IF 3.5 2区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-07 DOI: 10.1007/s10489-025-07054-4
Qasim Zaheer, Haleema Ehsan, Weidong Wang, Syed Faizan Hussain Shah, Chengbo Ai, Jin Wang, Shi Qiu

Monitoring railway fasteners is essential for track safety. However, the development of intelligent inspection systems is hindered by the scarcity of defective fastener images and the pronounced imbalance between defective and non-defective classes. Conventional generative approaches, such as GAN- and diffusion-based models, can increase data volume but often incur high computational cost, unstable convergence, or produce samples lacking structural fidelity, limiting their suitability for engineering deployment. This work proposes the Domain-Adaptive NeuroGenerative Framework (DA-NGF), a lightweight few-shot generative model tailored to rail fastener inspection. The framework leverages compact latent modeling, orientation-invariant self-supervision, and defect-aware attention to synthesize structurally consistent fastener images from extremely limited real samples. Synthetic samples are evaluated using a separate downstream classifier (FastenerNet + +), enabling quality validation without coupling generation and classification pipelines. Experimental results show that augmenting real data with DA-NGF synthetic samples improves downstream defect recognition accuracy by 21.13% and precision by 22.53%, with corresponding gains in recall and F1-score. The model executes efficiently on CPU hardware, underscoring its suitability for low-resource deployment.

监控铁路紧固件对轨道安全至关重要。然而,智能检测系统的发展受到缺陷紧固件图像稀缺和缺陷与非缺陷类别之间明显不平衡的阻碍。传统的生成方法,如基于GAN和扩散的模型,可以增加数据量,但通常会产生高计算成本,不稳定的收敛性,或者产生缺乏结构保真度的样本,限制了它们在工程部署中的适用性。这项工作提出了域自适应神经生成框架(DA-NGF),这是一种专为轨道扣件检测量身定制的轻量化少镜头生成模型。该框架利用紧凑的潜在建模、方向不变的自我监督和缺陷感知注意力,从极其有限的真实样本中合成结构一致的紧固件图像。合成样品使用单独的下游分类器(fastenernet++)进行评估,无需耦合生成和分类管道即可进行质量验证。实验结果表明,用DA-NGF合成样本增强真实数据后,下游缺陷识别正确率提高了21.13%,精度提高了22.53%,召回率和f1分数也相应提高。该模型在CPU硬件上有效地执行,强调其适合低资源部署。
{"title":"DA-NGF: A Domain-Adaptive Neurogenerative Framework for Few-Shot Railway Fastener Defect Synthesis and Transferable Representation Learning","authors":"Qasim Zaheer,&nbsp;Haleema Ehsan,&nbsp;Weidong Wang,&nbsp;Syed Faizan Hussain Shah,&nbsp;Chengbo Ai,&nbsp;Jin Wang,&nbsp;Shi Qiu","doi":"10.1007/s10489-025-07054-4","DOIUrl":"10.1007/s10489-025-07054-4","url":null,"abstract":"<div><p>Monitoring railway fasteners is essential for track safety. However, the development of intelligent inspection systems is hindered by the scarcity of defective fastener images and the pronounced imbalance between defective and non-defective classes. Conventional generative approaches, such as GAN- and diffusion-based models, can increase data volume but often incur high computational cost, unstable convergence, or produce samples lacking structural fidelity, limiting their suitability for engineering deployment. This work proposes the Domain-Adaptive NeuroGenerative Framework (DA-NGF), a lightweight few-shot generative model tailored to rail fastener inspection. The framework leverages compact latent modeling, orientation-invariant self-supervision, and defect-aware attention to synthesize structurally consistent fastener images from extremely limited real samples. Synthetic samples are evaluated using a separate downstream classifier (FastenerNet + +), enabling quality validation without coupling generation and classification pipelines. Experimental results show that augmenting real data with DA-NGF synthetic samples improves downstream defect recognition accuracy by 21.13% and precision by 22.53%, with corresponding gains in recall and F1-score. The model executes efficiently on CPU hardware, underscoring its suitability for low-resource deployment.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"56 1","pages":""},"PeriodicalIF":3.5,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145930566","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Gender biases in online communication: A case study of soccer 在线交流中的性别偏见:以足球为例
IF 3.5 2区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-07 DOI: 10.1007/s10489-025-06988-z
Mariana Macedo, Akrati Saxena

Social media and digital platforms allow us to freely and easily express our opinions to a wide audience. In this study, we investigate gender-based differences in online communication, specifically on Twitter (now X), in the context of soccer, by analyzing patterns of participation, sentiment, and engagement across male and female users. As one of the most popular sports, soccer engages a diverse audience on social media, regardless of expertise. We collected 9.5 million tweets related to soccer in English and Portuguese during three months (March - June 2022). Only 18.38% tweets were identified as written by women, highlighting a possible gender gap already in the number of people who participated actively in this topic. We analyze tweets in Portuguese and English, as these languages represent different perspectives on soccer. English serves as a proxy for global discussions, while Portuguese reflects more localized but deeply engaged communities. We observe that women and men communicate more between each other in Portuguese than in English, exhibiting lower homophily within their social networks. However, this difference in homophily does not appear to influence how women and men express emotions and sentiments, suggesting that these aspects may be shaped by other factors such as societal expectations, gender role socialization, online community dynamics, visibility constraints, or other gender-related norms and characteristics. Women express their emotions more intensely in response to events than men, regardless of the differences in the number of tweets, and men tend to be more negative in their tweets than women. Our study reveals persistent gender gaps across both weeks and hours through qualitative and quantitative analyses, including detailed text-level and network-level examinations. These findings underscore the importance of identifying and reporting gender disparities in online communication to foster more inclusive spaces where individuals can freely express their opinions.

社交媒体和数字平台使我们能够自由、轻松地向广大受众表达自己的观点。在这项研究中,我们通过分析男性和女性用户的参与、情绪和参与模式,调查了在线交流中基于性别的差异,特别是在Twitter(现在的X)上,在足球的背景下。作为最受欢迎的运动之一,足球在社交媒体上吸引了各种各样的观众,无论他们的专业是什么。我们在三个月内(2022年3月至6月)用英语和葡萄牙语收集了950万条与足球相关的推文。只有18.38%的推文是由女性撰写的,这表明积极参与这一话题的人数可能已经存在性别差距。我们分析葡萄牙语和英语的推文,因为这两种语言代表了对足球的不同看法。英语是全球讨论的代表,而葡萄牙语反映了更本地化但深入参与的社区。我们观察到,女性和男性之间用葡萄牙语交流比用英语交流更多,在他们的社会网络中表现出更低的同质性。然而,同性性的这种差异似乎并不影响男女表达情感和情绪的方式,这表明这些方面可能受到其他因素的影响,如社会期望、性别角色社会化、在线社区动态、可见性限制或其他与性别相关的规范和特征。无论推特数量的差异如何,女性在对事件的反应中表达的情绪比男性更强烈,而且男性在推特中往往比女性更消极。我们的研究通过定性和定量分析,包括详细的文本级和网络级考试,揭示了持续存在的性别差距。这些发现强调了识别和报告在线交流中的性别差异的重要性,以促进个人可以自由表达意见的更具包容性的空间。
{"title":"Gender biases in online communication: A case study of soccer","authors":"Mariana Macedo,&nbsp;Akrati Saxena","doi":"10.1007/s10489-025-06988-z","DOIUrl":"10.1007/s10489-025-06988-z","url":null,"abstract":"<div><p>Social media and digital platforms allow us to freely and easily express our opinions to a wide audience. In this study, we investigate gender-based differences in online communication, specifically on Twitter (now X), in the context of soccer, by analyzing patterns of participation, sentiment, and engagement across male and female users. As one of the most popular sports, soccer engages a diverse audience on social media, regardless of expertise. We collected 9.5 million tweets related to soccer in English and Portuguese during three months (March - June 2022). Only 18.38% tweets were identified as written by women, highlighting a possible gender gap already in the number of people who participated actively in this topic. We analyze tweets in Portuguese and English, as these languages represent different perspectives on soccer. English serves as a proxy for global discussions, while Portuguese reflects more localized but deeply engaged communities. We observe that women and men communicate more between each other in Portuguese than in English, exhibiting lower homophily within their social networks. However, this difference in homophily does not appear to influence how women and men express emotions and sentiments, suggesting that these aspects may be shaped by other factors such as societal expectations, gender role socialization, online community dynamics, visibility constraints, or other gender-related norms and characteristics. Women express their emotions more intensely in response to events than men, regardless of the differences in the number of tweets, and men tend to be more negative in their tweets than women. Our study reveals persistent gender gaps across both weeks and hours through qualitative and quantitative analyses, including detailed text-level and network-level examinations. These findings underscore the importance of identifying and reporting gender disparities in online communication to foster more inclusive spaces where individuals can freely express their opinions.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"56 1","pages":""},"PeriodicalIF":3.5,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145930567","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SDLK-Net: Enhanced squeezed directional large kernel multi-scale multi-modal fusion network for salient object detection SDLK-Net:用于显著目标检测的增强压缩定向大核多尺度多模态融合网络
IF 3.5 2区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-07 DOI: 10.1007/s10489-025-07060-6
Lingyu Yan, Ting Zhou, Rong Gao, Zengmao Wang, Zhiwei Ye, Xinyun Wu

In recent years, multi-modal salient object detection methods (e.g., RGB-D or RGB-T) have made significant progress in complex scene applications. However, existing methods often neglect edge feature enhancement and detail preservation, which are crucial for the clarity and accuracy of foreground-background separation. Additionally, effectively highlighting spatially related features, balancing local and global information, and suppressing background noise remain major challenges. To address these issues, we propose a new enhanced squeezed directional large kernel multi-scale multi-modal fusion network, named SDLK-Net, to optimize the separation of foreground and background and enhance detail capture ability. Specifically, we introduce a Squeezed Large Kernel Edge-Enhanced Fusion (SLKEF) method, which combines the squeeze operation with bidirectional large kernel convolutions to effectively enhance the separation between foreground and background, achieving the regional integrity and edge clarity of salient object segmentation results. In addition, we develop an Adaptive Extraction Multi-path Matching (AEMM) method, which enhances the fine-grained feature representation of the ROI region through path decoupling and fusion to help better capture the detailed information. Lastly, we design a Multi-Scale Filter Gate Integration (MSFGI) model, which flexibly balances local and global contextual information through dynamic filtering and gating mechanisms, ensuring the highlighting of spatially related features while effectively suppressing background noise. Experimental results demonstrate that SDLK-Net significantly improves detection accuracy in RGB-D and RGB-T salient object detection tasks, outperforming existing methods and achieving state-of-the-art performance across multiple datasets.

近年来,多模态显著目标检测方法(如RGB-D或RGB-T)在复杂场景应用中取得了重大进展。然而,现有方法往往忽略了边缘特征增强和细节保留,而边缘特征增强和细节保留对前景背景分离的清晰度和准确性至关重要。此外,有效地突出空间相关特征、平衡局部和全局信息以及抑制背景噪声仍然是主要的挑战。针对这些问题,本文提出了一种新的增强的压缩定向大核多尺度多模态融合网络SDLK-Net,以优化前景与背景的分离,增强细节捕获能力。具体来说,我们引入了一种压缩大核边缘增强融合(SLKEF)方法,该方法将压缩操作与双向大核卷积相结合,有效地增强了前景和背景的分离,实现了显著目标分割结果的区域完整性和边缘清晰度。此外,我们开发了一种自适应提取多路径匹配(AEMM)方法,该方法通过路径解耦和融合来增强ROI区域的细粒度特征表示,以帮助更好地捕获细节信息。最后,我们设计了一个多尺度滤波门集成(MSFGI)模型,该模型通过动态滤波和门控机制灵活地平衡了局部和全局上下文信息,在有效抑制背景噪声的同时保证了空间相关特征的突出。实验结果表明,SDLK-Net显著提高了RGB-D和RGB-T显著目标检测任务的检测精度,优于现有方法,在多数据集上实现了最先进的性能。
{"title":"SDLK-Net: Enhanced squeezed directional large kernel multi-scale multi-modal fusion network for salient object detection","authors":"Lingyu Yan,&nbsp;Ting Zhou,&nbsp;Rong Gao,&nbsp;Zengmao Wang,&nbsp;Zhiwei Ye,&nbsp;Xinyun Wu","doi":"10.1007/s10489-025-07060-6","DOIUrl":"10.1007/s10489-025-07060-6","url":null,"abstract":"<div><p>In recent years, multi-modal salient object detection methods (e.g., RGB-D or RGB-T) have made significant progress in complex scene applications. However, existing methods often neglect edge feature enhancement and detail preservation, which are crucial for the clarity and accuracy of foreground-background separation. Additionally, effectively highlighting spatially related features, balancing local and global information, and suppressing background noise remain major challenges. To address these issues, we propose a new enhanced squeezed directional large kernel multi-scale multi-modal fusion network, named SDLK-Net, to optimize the separation of foreground and background and enhance detail capture ability. Specifically, we introduce a Squeezed Large Kernel Edge-Enhanced Fusion (SLKEF) method, which combines the squeeze operation with bidirectional large kernel convolutions to effectively enhance the separation between foreground and background, achieving the regional integrity and edge clarity of salient object segmentation results. In addition, we develop an Adaptive Extraction Multi-path Matching (AEMM) method, which enhances the fine-grained feature representation of the ROI region through path decoupling and fusion to help better capture the detailed information. Lastly, we design a Multi-Scale Filter Gate Integration (MSFGI) model, which flexibly balances local and global contextual information through dynamic filtering and gating mechanisms, ensuring the highlighting of spatially related features while effectively suppressing background noise. Experimental results demonstrate that SDLK-Net significantly improves detection accuracy in RGB-D and RGB-T salient object detection tasks, outperforming existing methods and achieving state-of-the-art performance across multiple datasets.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"56 1","pages":""},"PeriodicalIF":3.5,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145930568","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Metric learning based weighted linear discriminant analysis for imbalanced multi-label classification 基于度量学习的不平衡多标签分类加权线性判别分析
IF 3.5 2区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-06 DOI: 10.1007/s10489-025-07045-5
Tingquan Deng, Jintian Huang, Yiying Chen

Imbalanced multi-label learning (IMLL) aims to learn a multi-label classifier from instances with imbalanced label distribution. Most existing IMLL models either use resampling techniques for preprocessing or transform multi-label learning problems into single-label problems, and traditional imbalanced learning methods are applied. The resampling methods may distort data distribution, whereas the latter overlook dependency and correlation between labels. To address those challenges, this paper proposes an approach to imbalanced multi-label learning, nominated as MWLDAIML. In the proposed model, a label enhancement matrix is designed according to the imbalance rates of positive and negative instances to enlarge the influence of instances in minority classes. A linear mapping is learned to function as a classifier from the feature space to the enhanced label space, and simultaneously acts as a projection from a high-dimensional space to a low-dimensional space, ensuring a large divergence between inter-class instances and a tight distribution of intra-class instances via introducing a weighted linear discriminant analysis (WLDA). Moreover, the metric learning is embedded into WLDA to discern intra-class instances and inter-class instances to capture a more complex nonlinear relationship between instances and their multiple labels. Additionally, the graph Laplacian regularization is imposed to ensure that predicted labels inherit the topological structure of instances in the feature space. An efficient algorithm for implementing MWLDAIML is developed to implement the proposed model, and extensive experiments on real-world benchmark datasets demonstrate that the proposed model outperforms existing methods for imbalanced multi-label classification.

不平衡多标签学习(IMLL)旨在从标签分布不平衡的实例中学习一个多标签分类器。现有的IMLL模型要么采用重采样技术进行预处理,要么将多标签学习问题转化为单标签问题,并采用传统的不平衡学习方法。重采样方法可能会扭曲数据分布,而后者忽略了标签之间的依赖性和相关性。为了解决这些挑战,本文提出了一种不平衡多标签学习方法,称为MWLDAIML。在该模型中,根据正面和负面实例的不平衡率设计了标签增强矩阵,以扩大实例在少数类中的影响。学习线性映射作为从特征空间到增强标签空间的分类器,同时作为从高维空间到低维空间的投影,通过引入加权线性判别分析(WLDA)确保类间实例之间的大分歧和类内实例的紧密分布。此外,度量学习被嵌入到WLDA中,以识别类内实例和类间实例,以捕获实例及其多个标签之间更复杂的非线性关系。此外,采用图拉普拉斯正则化来确保预测标签继承特征空间中实例的拓扑结构。开发了一种高效的实现MWLDAIML算法来实现所提出的模型,并在现实世界的基准数据集上进行了大量实验,结果表明所提出的模型优于现有的不平衡多标签分类方法。
{"title":"Metric learning based weighted linear discriminant analysis for imbalanced multi-label classification","authors":"Tingquan Deng,&nbsp;Jintian Huang,&nbsp;Yiying Chen","doi":"10.1007/s10489-025-07045-5","DOIUrl":"10.1007/s10489-025-07045-5","url":null,"abstract":"<div><p>Imbalanced multi-label learning (IMLL) aims to learn a multi-label classifier from instances with imbalanced label distribution. Most existing IMLL models either use resampling techniques for preprocessing or transform multi-label learning problems into single-label problems, and traditional imbalanced learning methods are applied. The resampling methods may distort data distribution, whereas the latter overlook dependency and correlation between labels. To address those challenges, this paper proposes an approach to imbalanced multi-label learning, nominated as MWLDAIML. In the proposed model, a label enhancement matrix is designed according to the imbalance rates of positive and negative instances to enlarge the influence of instances in minority classes. A linear mapping is learned to function as a classifier from the feature space to the enhanced label space, and simultaneously acts as a projection from a high-dimensional space to a low-dimensional space, ensuring a large divergence between inter-class instances and a tight distribution of intra-class instances via introducing a weighted linear discriminant analysis (WLDA). Moreover, the metric learning is embedded into WLDA to discern intra-class instances and inter-class instances to capture a more complex nonlinear relationship between instances and their multiple labels. Additionally, the graph Laplacian regularization is imposed to ensure that predicted labels inherit the topological structure of instances in the feature space. An efficient algorithm for implementing MWLDAIML is developed to implement the proposed model, and extensive experiments on real-world benchmark datasets demonstrate that the proposed model outperforms existing methods for imbalanced multi-label classification.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"56 1","pages":""},"PeriodicalIF":3.5,"publicationDate":"2026-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145930030","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Applied Intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1