首页 > 最新文献

Applied Intelligence最新文献

英文 中文
Fuzzy reinforced hyperbox neural network: analysis and design 模糊增强超盒神经网络:分析与设计
IF 3.5 2区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-15 DOI: 10.1007/s10489-025-06869-5
Mingjie Gao, Wei Huang, Mingxi Sun

Fuzzy min-max neural network (FMNN) realized based on hyperbox has been widely used in the field of pattern classification. However, the parameter dependency coming from hyperboxes still remains open. Specifically, the construction process of the hyperbox is influenced by the expansion coefficient, different coefficients result in varying final configurations of the hyperbox. To address this issue, this paper proposes a fuzzy reinforced hyperbox neural network(FRHNN). FRHNN employs an innovative multilayer structure to generate hyperboxes, and its construction process is no longer constrained by traditional expansion coefficients. Compared to traditional hyperbox layers, the model is divided into two phases: hyperbox initialization and hyperbox enhancement. In the initialization phase, a clustering algorithm is used to form the initial mixed hyperboxes. Subsequently, during the hyperbox enhancement phase, the multilayer structure will progressively segment the mixed hyperboxes until each hyperbox is transformed into a pure hyperbox. The hyperbox enhancement process of this model encompasses three key steps: judgement hyperbox types, determining hyperbox categories, and executing hyperbox segmentation. A comparative study also indicates that the proposed FRHNN leads to higher classification accuracy in comparison with some state-of-the-art fuzzy min-max neural networks in tackling data classification.

基于hyperbox实现的模糊最小-最大神经网络(FMNN)在模式分类领域得到了广泛的应用。但是,来自超框的参数依赖性仍然是开放的。具体来说,膨胀系数会影响超箱的构造过程,不同的膨胀系数会导致超箱的最终构型不同。为了解决这一问题,本文提出了一种模糊增强超盒神经网络(FRHNN)。FRHNN采用创新的多层结构生成超盒,其构造过程不再受传统膨胀系数的约束。与传统的hyperbox层相比,该模型分为两个阶段:hyperbox初始化和hyperbox增强。在初始化阶段,使用聚类算法形成初始混合超盒。随后,在超盒增强阶段,多层结构将逐步分割混合超盒,直到每个超盒都转化为纯超盒。该模型的超盒增强过程包括三个关键步骤:判断超盒类型、确定超盒类别和执行超盒分割。对比研究还表明,在处理数据分类时,与目前一些最先进的模糊最小-最大神经网络相比,所提出的FRHNN具有更高的分类精度。
{"title":"Fuzzy reinforced hyperbox neural network: analysis and design","authors":"Mingjie Gao,&nbsp;Wei Huang,&nbsp;Mingxi Sun","doi":"10.1007/s10489-025-06869-5","DOIUrl":"10.1007/s10489-025-06869-5","url":null,"abstract":"<div><p>Fuzzy min-max neural network (FMNN) realized based on hyperbox has been widely used in the field of pattern classification. However, the parameter dependency coming from hyperboxes still remains open. Specifically, the construction process of the hyperbox is influenced by the expansion coefficient, different coefficients result in varying final configurations of the hyperbox. To address this issue, this paper proposes a fuzzy reinforced hyperbox neural network(FRHNN). FRHNN employs an innovative multilayer structure to generate hyperboxes, and its construction process is no longer constrained by traditional expansion coefficients. Compared to traditional hyperbox layers, the model is divided into two phases: hyperbox initialization and hyperbox enhancement. In the initialization phase, a clustering algorithm is used to form the initial mixed hyperboxes. Subsequently, during the hyperbox enhancement phase, the multilayer structure will progressively segment the mixed hyperboxes until each hyperbox is transformed into a pure hyperbox. The hyperbox enhancement process of this model encompasses three key steps: judgement hyperbox types, determining hyperbox categories, and executing hyperbox segmentation. A comparative study also indicates that the proposed FRHNN leads to higher classification accuracy in comparison with some state-of-the-art fuzzy min-max neural networks in tackling data classification.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 18","pages":""},"PeriodicalIF":3.5,"publicationDate":"2025-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145778866","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multimodal contextual transformer augmented fusion for emotion recognition 多模态语境变换增强融合情感识别
IF 3.5 2区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-13 DOI: 10.1007/s10489-025-07027-7
Wesagn Dawit Chemma, Adane Letta Mamuye, Marco Piangerelli

Accurate emotion recognition in dialogue depends on making effective use of conversational context, yet many multimodal systems under-utilize this contextual information and misclassify nuanced speech. We introduce Multimodal Contextual Transformer Augmented Fusion (MCTAF), a lightweight context-sensitive transformer-based multimodal model that encodes speech and transcripts with Bi-GRUs and treats the last K utterances as a separate modality for contextual information. The context module summarizes the most recent K preceding utterances into a single context vector. This vector is fed to the fusion transformer as its own information stream, so cross-attention can relate the current text and audio to dialogue history without simply concatenating features. All modality vectors operate within a shared 128-dimensional space and are integrated through twelve directed cross-modal attention over text, audio, context modality, and an early-fusion feature. Trained end-to-end with Adam, MCTAF attains 89.9% accuracy on IEMOCAP and 88.3% on MELD, boosting weighted F1 by up to 3% and accuracy by up to 4% points over strong state-of-the-art methods. Ablation studies show that removing the context branch lowers weighted F1 by 3 to 4 points, demonstrating the importance of modeling explicit contextual module for improved emotion recognition. These results on benchmark datasets show that the inclusion of a dedicated contextual module yields consistent gains in both dyadic and multi-party conversations with modest computational cost.

对话中准确的情感识别依赖于对对话语境的有效利用,然而许多多模态系统没有充分利用这一语境信息,并对细微差别的语音进行了错误分类。我们介绍了多模态上下文转换器增强融合(MCTAF),这是一种轻量级的基于上下文敏感转换器的多模态模型,它使用bi - gru编码语音和转录本,并将最后K个话语作为上下文信息的单独模态处理。上下文模块将最近的K个前面的话语总结成一个上下文向量。这个向量作为自己的信息流被馈送到融合转换器中,因此交叉注意可以将当前的文本和音频与对话历史联系起来,而不需要简单地连接特征。所有模态向量在共享的128维空间内运行,并通过文本、音频、上下文模态和早期融合特征的12个定向跨模态注意力进行整合。与Adam进行端到端训练后,MCTAF在IEMOCAP上的准确率达到89.9%,在MELD上的准确率达到88.3%,与最先进的方法相比,加权F1提高了3%,准确率提高了4%。消融研究表明,去除上下文分支会使权重F1降低3到4点,这表明了为显式上下文模块建模对于改善情绪识别的重要性。在基准数据集上的这些结果表明,包含一个专用的上下文模块在二元和多方对话中产生一致的收益,并且计算成本适中。
{"title":"Multimodal contextual transformer augmented fusion for emotion recognition","authors":"Wesagn Dawit Chemma,&nbsp;Adane Letta Mamuye,&nbsp;Marco Piangerelli","doi":"10.1007/s10489-025-07027-7","DOIUrl":"10.1007/s10489-025-07027-7","url":null,"abstract":"<div><p>Accurate emotion recognition in dialogue depends on making effective use of conversational context, yet many multimodal systems under-utilize this contextual information and misclassify nuanced speech. We introduce <i>Multimodal Contextual Transformer Augmented Fusion</i> (MCTAF), a lightweight context-sensitive transformer-based multimodal model that encodes speech and transcripts with Bi-GRUs and treats the last <i>K</i> utterances as a separate modality for contextual information. The context module summarizes the most recent <i>K</i> preceding <i>utterances</i> into a single <i>context vector</i>. This vector is fed to the fusion transformer as its own information stream, so cross-attention can relate the current text and audio to dialogue history without simply concatenating features. All modality vectors operate within a shared 128-dimensional space and are integrated through twelve directed cross-modal attention over text, audio, context modality, and an early-fusion feature. Trained end-to-end with Adam, MCTAF attains 89.9% accuracy on IEMOCAP and 88.3% on MELD, boosting weighted F1 by up to 3% and accuracy by up to 4% points over strong state-of-the-art methods. Ablation studies show that removing the context branch lowers weighted F1 by 3 to 4 points, demonstrating the importance of modeling explicit contextual module for improved emotion recognition. These results on benchmark datasets show that the inclusion of a dedicated contextual module yields consistent gains in both dyadic and multi-party conversations with modest computational cost.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 18","pages":""},"PeriodicalIF":3.5,"publicationDate":"2025-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145730317","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A brain information decomposition mechanism inspired evolutionary algorithm for large-scale multi-objective optimization 基于大脑信息分解机制的大规模多目标优化进化算法
IF 3.5 2区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-13 DOI: 10.1007/s10489-025-07001-3
Tongxuan Wu, Junzhong Ji, Cuicui Yang

The high-dimensional decision space of large-scale multi-objective optimization poses challenges to evolutionary algorithms, which leads to potential trapping in local optima. This paper proposes a brain information decomposition mechanism inspired large-scale multi-objective evolutionary algorithm (IDLMEA). Three strategies are developed. 1) The redundant information subpopulation division strategy selects high-quality solutions using shift-based density estimation, which determines the global optimal information. 2) The synergistic information subpopulation division strategy identifies exploration direction information by dividing solutions using local sensitive hashing. 3) The redundant and synergistic information-based reproduction strategy generates offspring for effective exploration of the search space. The Friedman test values of IDLMEA for inverted generational distance on LSMOP and IMF benchmarks outperform ten competitors by at least 25.64% and 4.04%, respectively.

大规模多目标优化的高维决策空间给进化算法带来了挑战,导致算法可能陷入局部最优。提出了一种基于大规模多目标进化算法(IDLMEA)的脑信息分解机制。制定了三种策略。1)冗余信息子种群划分策略利用基于位移的密度估计选择高质量解,确定全局最优信息。2)协同信息子种群划分策略通过局部敏感哈希划分解来识别探索方向信息。3)冗余协同的信息复制策略产生子代,对搜索空间进行有效的探索。在LSMOP和IMF基准上,IDLMEA对倒代际距离的Friedman检验值分别优于10个竞争对手至少25.64%和4.04%。
{"title":"A brain information decomposition mechanism inspired evolutionary algorithm for large-scale multi-objective optimization","authors":"Tongxuan Wu,&nbsp;Junzhong Ji,&nbsp;Cuicui Yang","doi":"10.1007/s10489-025-07001-3","DOIUrl":"10.1007/s10489-025-07001-3","url":null,"abstract":"<div><p>The high-dimensional decision space of large-scale multi-objective optimization poses challenges to evolutionary algorithms, which leads to potential trapping in local optima. This paper proposes a brain information decomposition mechanism inspired large-scale multi-objective evolutionary algorithm (IDLMEA). Three strategies are developed. 1) The redundant information subpopulation division strategy selects high-quality solutions using shift-based density estimation, which determines the global optimal information. 2) The synergistic information subpopulation division strategy identifies exploration direction information by dividing solutions using local sensitive hashing. 3) The redundant and synergistic information-based reproduction strategy generates offspring for effective exploration of the search space. The Friedman test values of IDLMEA for inverted generational distance on LSMOP and IMF benchmarks outperform ten competitors by at least 25.64% and 4.04%, respectively.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 18","pages":""},"PeriodicalIF":3.5,"publicationDate":"2025-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145730321","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A stability analysis for the online retailing cyber security situation piecewise variable weight rating method 基于分段变权评级法的网上零售网络安全形势稳定性分析
IF 3.5 2区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-13 DOI: 10.1007/s10489-025-06860-0
Gaofeng Yu, Zhen Zhang, Jian Wu

The cyber security situation rating (CSSR) method and its stability analysis is key problems in nowadays cyber security. However, the existing CSSR methods do not take into account to the issues such as the fuzziness and hesitation of the grades of attributes, the non-compensation of information among attributes, the internal differences of attributes and the stability of CSSR method. A new piecewise variable weight rating (PVWR) method for CSS is proposed, and then the stability analysis for the PVWR method is researched. Based on the description of the CSS rating problem, linear membership functions (LMFs) and linear non-membership functions (LNMFs) of attributes with four categories grade threshold are constructed. The definition of PVWR function is proposed to more accurately reflect the characteristics of CSS, which is constructed by continuous piecewise function. A novel stability analysis of the PVWR method is proposed, and the attributes’ weight ranges under the condition of the current grade assessment is given when the CSSRM results is stable. The proposed method not only offers a novel method to identify the key impact attributes, but also provides theory basic for judging whether CSSR results have changed.

网络安全态势评级方法及其稳定性分析是当今网络安全研究的关键问题。然而,现有的CSSR方法没有考虑到属性等级的模糊性和犹豫性、属性间信息的不补补性、属性的内部差异性以及CSSR方法的稳定性等问题。提出了一种新的分段变权评级(PVWR)方法,并对该方法的稳定性进行了研究。在描述CSS等级问题的基础上,构造了具有四类等级阈值的属性的线性隶属函数和线性非隶属函数。为了更准确地反映CSS的特性,提出了PVWR函数的定义,该函数由连续分段函数构造。提出了一种新的PVWR方法的稳定性分析方法,在CSSRM结果稳定的情况下给出了当前等级评价条件下属性的权重范围。该方法不仅提供了一种识别关键影响属性的新方法,而且为判断CSSR结果是否发生变化提供了理论基础。
{"title":"A stability analysis for the online retailing cyber security situation piecewise variable weight rating method","authors":"Gaofeng Yu,&nbsp;Zhen Zhang,&nbsp;Jian Wu","doi":"10.1007/s10489-025-06860-0","DOIUrl":"10.1007/s10489-025-06860-0","url":null,"abstract":"<div><p>The cyber security situation rating (CSSR) method and its stability analysis is key problems in nowadays cyber security. However, the existing CSSR methods do not take into account to the issues such as the fuzziness and hesitation of the grades of attributes, the non-compensation of information among attributes, the internal differences of attributes and the stability of CSSR method. A new piecewise variable weight rating (PVWR) method for CSS is proposed, and then the stability analysis for the PVWR method is researched. Based on the description of the CSS rating problem, linear membership functions (LMFs) and linear non-membership functions (LNMFs) of attributes with four categories grade threshold are constructed. The definition of PVWR function is proposed to more accurately reflect the characteristics of CSS, which is constructed by continuous piecewise function. A novel stability analysis of the PVWR method is proposed, and the attributes’ weight ranges under the condition of the current grade assessment is given when the CSSRM results is stable. The proposed method not only offers a novel method to identify the key impact attributes, but also provides theory basic for judging whether CSSR results have changed.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 18","pages":""},"PeriodicalIF":3.5,"publicationDate":"2025-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145730290","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Attributes reduction for incomplete data by bidirectional fuzzy similarity discriminability 基于双向模糊相似判别的不完全数据属性约简
IF 3.5 2区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-12 DOI: 10.1007/s10489-025-07025-9
Xiangjian Chen, Jiabao Tang, Jianhua Dai

In the real world, there is a significant amount of incomplete data, making attribute reduction for incomplete data a critical issue. Many existing reduction methods mainly study the relationship between conditional attributes and decision attributes, and often lack the correlation between conditional attributes. In this paper, first, incomplete Euclidean distance and global similarity are proposed, and correspondingly, fused fuzzy similarity relations are constructed to characterize the fuzzy similarity relations between objects with missing values, and to generate an object relation matrix. Second, incomplete cosine similarity is proposed to characterize the similarity between conditional attributes, and an attribute relation matrix is generated to describe the correlation between conditional attributes. Finally, we construct the bidirectional fuzzy similarity discriminability (BFD) and the corresponding reduction algorithm on this basis. Under the KNN classifier, the average lead compared to other state-of-the-art algorithms is 6.44%, and this value reaches 6.67% under the CART classifier and demonstrate the effectiveness of the algorithm through experiments.

在现实世界中,存在大量的不完整数据,这使得不完整数据的属性约简成为一个关键问题。现有的许多约简方法主要研究条件属性与决策属性之间的关系,往往缺乏条件属性之间的相关性。本文首先提出了不完全欧氏距离和全局相似度,并相应构造了融合模糊相似关系来表征缺失值对象之间的模糊相似关系,生成对象关系矩阵。其次,提出不完全余弦相似度来表征条件属性之间的相似度,并生成属性关系矩阵来描述条件属性之间的相关性;最后,在此基础上构造了双向模糊相似判别性(BFD)和相应的约简算法。在KNN分类器下,与其他先进算法相比,平均领先率为6.44%,在CART分类器下达到6.67%,通过实验证明了算法的有效性。
{"title":"Attributes reduction for incomplete data by bidirectional fuzzy similarity discriminability","authors":"Xiangjian Chen,&nbsp;Jiabao Tang,&nbsp;Jianhua Dai","doi":"10.1007/s10489-025-07025-9","DOIUrl":"10.1007/s10489-025-07025-9","url":null,"abstract":"<div><p>In the real world, there is a significant amount of incomplete data, making attribute reduction for incomplete data a critical issue. Many existing reduction methods mainly study the relationship between conditional attributes and decision attributes, and often lack the correlation between conditional attributes. In this paper, first, incomplete Euclidean distance and global similarity are proposed, and correspondingly, fused fuzzy similarity relations are constructed to characterize the fuzzy similarity relations between objects with missing values, and to generate an object relation matrix. Second, incomplete cosine similarity is proposed to characterize the similarity between conditional attributes, and an attribute relation matrix is generated to describe the correlation between conditional attributes. Finally, we construct the bidirectional fuzzy similarity discriminability (BFD) and the corresponding reduction algorithm on this basis. Under the KNN classifier, the average lead compared to other state-of-the-art algorithms is 6.44%, and this value reaches 6.67% under the CART classifier and demonstrate the effectiveness of the algorithm through experiments.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 18","pages":""},"PeriodicalIF":3.5,"publicationDate":"2025-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145730282","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A remaining useful life prediction method for rotating machinery based on trend encoding and multi-scale spatio-temporal feature fusion 基于趋势编码和多尺度时空特征融合的旋转机械剩余使用寿命预测方法
IF 3.5 2区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-12 DOI: 10.1007/s10489-025-07007-x
Yurong Guo, Jie Zhang, Junliang Wang, Jian Mao, Feifan Lu

In response to the issues of incomplete local data degradation characterization and multi-scale, non-stationary characteristics exhibited during the degradation process of rotary machinery vibration signals, a remaining useful life (RUL) prediction method based on trend encoding and multi-scale spatio-temporal feature fusion is proposed. Firstly, a trend encoding method is introduced to compensate for the missing temporal information and long-term degradation information in vibration signal samples, enhancing the degradation characterization ability of local data. Subsequently, a soft threshold self-attention mechanism is proposed for the adaptive fusion of trend-encoded features and vibration signal features, to prevent imbalanced weight distribution. Finally, a spatio-temporal feature fusion network, MACNN-Informer, is designed. It possesses multi-scale spatial feature extraction capabilities and can effectively capture long-distance dependencies in sequential features, thereby better revealing the degradation characteristics of vibration signals at different degradation stages. Experimental results show that, compared with methods such as MSCNN, the proposed method–despite being slightly slower in inference speed–achieves the lowest prediction error and oscillation amplitude, making it well-suited for RUL prediction of critical rotating machinery components such as rolling bearings.

针对旋转机械振动信号在退化过程中表现出的局部数据退化表征不完整、多尺度非平稳等问题,提出了一种基于趋势编码和多尺度时空特征融合的剩余使用寿命预测方法。首先,引入趋势编码方法,补偿振动信号样本中缺失的时间信息和长期退化信息,增强局部数据的退化表征能力;随后,提出了一种软阈值自关注机制,用于趋势编码特征与振动信号特征的自适应融合,以防止权重分布不平衡。最后,设计了一个时空特征融合网络MACNN-Informer。该方法具有多尺度空间特征提取能力,能够有效捕获序列特征中的长距离依赖关系,从而更好地揭示振动信号在不同退化阶段的退化特征。实验结果表明,与MSCNN等方法相比,尽管推理速度稍慢,但该方法具有最低的预测误差和振荡幅度,非常适合于滚动轴承等关键旋转机械部件的RUL预测。
{"title":"A remaining useful life prediction method for rotating machinery based on trend encoding and multi-scale spatio-temporal feature fusion","authors":"Yurong Guo,&nbsp;Jie Zhang,&nbsp;Junliang Wang,&nbsp;Jian Mao,&nbsp;Feifan Lu","doi":"10.1007/s10489-025-07007-x","DOIUrl":"10.1007/s10489-025-07007-x","url":null,"abstract":"<div><p>In response to the issues of incomplete local data degradation characterization and multi-scale, non-stationary characteristics exhibited during the degradation process of rotary machinery vibration signals, a remaining useful life (RUL) prediction method based on trend encoding and multi-scale spatio-temporal feature fusion is proposed. Firstly, a trend encoding method is introduced to compensate for the missing temporal information and long-term degradation information in vibration signal samples, enhancing the degradation characterization ability of local data. Subsequently, a soft threshold self-attention mechanism is proposed for the adaptive fusion of trend-encoded features and vibration signal features, to prevent imbalanced weight distribution. Finally, a spatio-temporal feature fusion network, MACNN-Informer, is designed. It possesses multi-scale spatial feature extraction capabilities and can effectively capture long-distance dependencies in sequential features, thereby better revealing the degradation characteristics of vibration signals at different degradation stages. Experimental results show that, compared with methods such as MSCNN, the proposed method–despite being slightly slower in inference speed–achieves the lowest prediction error and oscillation amplitude, making it well-suited for RUL prediction of critical rotating machinery components such as rolling bearings.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 18","pages":""},"PeriodicalIF":3.5,"publicationDate":"2025-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145730281","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dependency-driven spectral embedding based multi-view clustering 基于依赖驱动谱嵌入的多视图聚类
IF 3.5 2区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-09 DOI: 10.1007/s10489-025-07026-8
Zien Liang, Zhuojie Huang, Shuping Zhao, Jigang Wu

Multi-view clustering has important applications in machine learning and computer vision, but existing methods face three major limitations: insufficient consideration of the differences among views, failure to capture the complex relationships among views, and neglect of the interdependence of embeddings and clustering labels. To overcome these limitations, this paper proposes a novel dependency-driven spectral embedding based multi-view clustering (DDSE), which introduces a dual-layer learning system. On the global layer, DDSE highlights the differences among views by selectively retaining the discriminative features of each view and adjusting their weights; on the local layer, Grassmann manifolds are used to maintain the topological information among views and improve clustering adaptability. By learning a unified embedding from the Reproducing Kernel Hilbert Spaces, DDSE can capture high-order nonlinear dependencies among views and avoid information loss by generating a discrete indicator matrix. In addition, this paper derives an efficient optimization scheme to improve the performance of the proposed method. Multiple rounds of experiments on ten datasets verify the advantages of this method over other state-of-the-art methods.

多视图聚类在机器学习和计算机视觉中有着重要的应用,但现有的方法面临三个主要的局限性:没有充分考虑视图之间的差异,未能捕捉视图之间的复杂关系,以及忽视嵌入和聚类标签之间的相互依存关系。为了克服这些限制,本文提出了一种新的基于依赖驱动谱嵌入的多视图聚类方法(DDSE),该方法引入了一个双层学习系统。在全局层,DDSE通过选择性地保留每个视图的判别特征并调整其权重来突出视图之间的差异;在局部层,使用格拉斯曼流形来维护视图间的拓扑信息,提高聚类的适应性。DDSE通过从再现核希尔伯特空间中学习统一嵌入,捕获视图之间的高阶非线性依赖关系,并通过生成离散指示矩阵来避免信息丢失。此外,本文还推导了一种有效的优化方案,以提高所提方法的性能。在10个数据集上进行的多轮实验验证了该方法优于其他最先进的方法。
{"title":"Dependency-driven spectral embedding based multi-view clustering","authors":"Zien Liang,&nbsp;Zhuojie Huang,&nbsp;Shuping Zhao,&nbsp;Jigang Wu","doi":"10.1007/s10489-025-07026-8","DOIUrl":"10.1007/s10489-025-07026-8","url":null,"abstract":"<div><p>Multi-view clustering has important applications in machine learning and computer vision, but existing methods face three major limitations: insufficient consideration of the differences among views, failure to capture the complex relationships among views, and neglect of the interdependence of embeddings and clustering labels. To overcome these limitations, this paper proposes a novel dependency-driven spectral embedding based multi-view clustering (DDSE), which introduces a dual-layer learning system. On the global layer, DDSE highlights the differences among views by selectively retaining the discriminative features of each view and adjusting their weights; on the local layer, Grassmann manifolds are used to maintain the topological information among views and improve clustering adaptability. By learning a unified embedding from the Reproducing Kernel Hilbert Spaces, DDSE can capture high-order nonlinear dependencies among views and avoid information loss by generating a discrete indicator matrix. In addition, this paper derives an efficient optimization scheme to improve the performance of the proposed method. Multiple rounds of experiments on ten datasets verify the advantages of this method over other state-of-the-art methods.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 18","pages":""},"PeriodicalIF":3.5,"publicationDate":"2025-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145729980","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Industrial anomaly detection via curriculum-based deep learning 基于课程的深度学习的工业异常检测
IF 3.5 2区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-09 DOI: 10.1007/s10489-025-07032-w
Devilliers Dube, Mehmet Akar

Anomaly detection in industries is critical to ensure safety, reliability, and product quality. Industrial processes often produce high-dimensional multivariate time series (MVTS) data from sensors and actuators, making the detection of subtle or complex anomalies increasingly challenging. In this work, we introduce the novel application of curriculum learning (CL) to deep MVTS anomaly detection, a strategy inspired by the human learning process that begins training with easier examples before progressing to more difficult ones. We propose two CL frameworks: a data-based curriculum, which ranks training samples by signal complexity (e.g. noise level, window size), and a model-based curriculum, in which the knowledge learned from a simple long short-term memory encoder-decoder model is used to initialize a more advanced multi-scale convolutional recurrent encoder-decoder model. Furthermore, we present the first use of system signature matrices as an MVTS representation that captures spatial-temporal dependencies in textile process data for collective anomaly detection. We perform comprehensive evaluations on the SWaT benchmark dataset and a dataset collected from real-world textile processes. Our empirical results demonstrate the possibility of designing curriculum-trained models that outperform standard training baselines, achieving higher F1 scores while offering more structured training dynamics. Notably, the data-based CL strategy consistently yields better performance than non-curriculum baselines. This study represents the first systematic adaptation of CL to industrial anomaly detection and establishes a foundation for more structured training paradigms in MVTS anomaly detection.

在工业中,异常检测对于确保安全性、可靠性和产品质量至关重要。工业过程通常会产生来自传感器和执行器的高维多元时间序列(MVTS)数据,这使得检测细微或复杂的异常变得越来越具有挑战性。在这项工作中,我们介绍了课程学习(CL)在深度MVTS异常检测中的新应用,这是一种受人类学习过程启发的策略,从简单的例子开始训练,然后再进入更困难的例子。我们提出了两个CL框架:一个基于数据的课程,它根据信号复杂性(例如噪声水平,窗口大小)对训练样本进行排名;一个基于模型的课程,其中从简单的长短期记忆编码器-解码器模型中学习的知识用于初始化更高级的多尺度卷积循环编码器-解码器模型。此外,我们首次使用系统签名矩阵作为MVTS表示,捕获纺织过程数据中的时空依赖关系,用于集体异常检测。我们对SWaT基准数据集和从真实纺织过程中收集的数据集进行了全面的评估。我们的实证结果表明,设计课程训练模型的可能性优于标准训练基线,在提供更结构化的训练动态的同时获得更高的F1分数。值得注意的是,基于数据的CL策略始终比非课程基线产生更好的性能。本研究首次将CL系统地应用于工业异常检测,并为MVTS异常检测中更结构化的训练范式奠定了基础。
{"title":"Industrial anomaly detection via curriculum-based deep learning","authors":"Devilliers Dube,&nbsp;Mehmet Akar","doi":"10.1007/s10489-025-07032-w","DOIUrl":"10.1007/s10489-025-07032-w","url":null,"abstract":"<div><p>Anomaly detection in industries is critical to ensure safety, reliability, and product quality. Industrial processes often produce high-dimensional multivariate time series (MVTS) data from sensors and actuators, making the detection of subtle or complex anomalies increasingly challenging. In this work, we introduce the novel application of curriculum learning (CL) to deep MVTS anomaly detection, a strategy inspired by the human learning process that begins training with easier examples before progressing to more difficult ones. We propose two CL frameworks: a <i>data-based curriculum</i>, which ranks training samples by signal complexity (e.g. noise level, window size), and a <i>model-based curriculum</i>, in which the knowledge learned from a simple long short-term memory encoder-decoder model is used to initialize a more advanced multi-scale convolutional recurrent encoder-decoder model. Furthermore, we present the first use of <i>system signature matrices</i> as an MVTS representation that captures spatial-temporal dependencies in textile process data for collective anomaly detection. We perform comprehensive evaluations on the SWaT benchmark dataset and a dataset collected from real-world textile processes. Our empirical results demonstrate the possibility of designing curriculum-trained models that outperform standard training baselines, achieving higher F1 scores while offering more structured training dynamics. Notably, the data-based CL strategy consistently yields better performance than non-curriculum baselines. This study represents the first systematic adaptation of CL to industrial anomaly detection and establishes a foundation for more structured training paradigms in MVTS anomaly detection.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 18","pages":""},"PeriodicalIF":3.5,"publicationDate":"2025-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145729981","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Three-way experience replay for the prediction under concept drift 三方经验回放为预测下的概念漂移
IF 3.5 2区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-09 DOI: 10.1007/s10489-025-07024-w
Jing Wang, Yanbing Ju, Peiwu Dong, Tian Ju

In the context of online time series forecasting, experience replay (ER) is a crucial technology to mitigate catastrophic forgetting arising from concept drift. ER uses historical data sampled from a buffer to update model parameters along with current data incrementally. However, existing ER methods overlook buffer management, which is certainly important due to the space constraints and dynamic data distributions. To fill this gap, we propose the three-way experience replay (TWER). In contrast to existing ER methods, TWER is a comprehensive buffer management mechanism that involves admission, sampling, and eviction to improve the accuracy of online prediction. To capture samples indicative of concept drift, we design the admission mechanism based on three-way decision (TWD) rules deduced from historical and future data. By integrating Bayesian decision theory and rough set theory, TWD can provide the option of deferment action, enabling a more informed judgment on concept drift. To preserve the historical data with rare patterns, we enhance the sampling and eviction mechanisms by introducing three-way clustering (TWC), which can identify fringe samples typically missed by existing methods. Leveraging these mechanisms, TWER can adaptively manage data within the buffer, thus enhancing the prediction under concept drift. Empirical results on five real-world datasets show that TWER reduces online prediction error by more than 12% in terms of MSE compared with the existing ER methods such as DER++, GDumb, and memory select.

在在线时间序列预测的背景下,经验重放(ER)是减轻概念漂移引起的灾难性遗忘的关键技术。ER使用从缓冲区中采样的历史数据来增量地更新模型参数和当前数据。然而,现有的ER方法忽略了缓冲区管理,由于空间限制和动态数据分布,缓冲区管理当然很重要。为了填补这一空白,我们提出了三向体验回放(TWER)。与现有的ER方法相比,TWER是一种综合的缓冲管理机制,包括入场、抽样和驱逐,以提高在线预测的准确性。为了捕获指示概念漂移的样本,我们设计了基于从历史和未来数据推断出的三向决策(TWD)规则的准入机制。通过整合贝叶斯决策理论和粗糙集理论,TWD可以提供延迟行为的选择,使概念漂移的判断更加明智。为了保留具有罕见模式的历史数据,我们通过引入三向聚类(TWC)来增强采样和剔除机制,该机制可以识别现有方法通常错过的边缘样本。利用这些机制,TWER可以自适应地管理缓冲区内的数据,从而增强概念漂移下的预测。在5个真实数据集上的实证结果表明,与现有的ER方法(如der++、GDumb和memory select)相比,TWER在MSE方面降低了12%以上的在线预测误差。
{"title":"Three-way experience replay for the prediction under concept drift","authors":"Jing Wang,&nbsp;Yanbing Ju,&nbsp;Peiwu Dong,&nbsp;Tian Ju","doi":"10.1007/s10489-025-07024-w","DOIUrl":"10.1007/s10489-025-07024-w","url":null,"abstract":"<div><p>In the context of online time series forecasting, experience replay (ER) is a crucial technology to mitigate catastrophic forgetting arising from concept drift. ER uses historical data sampled from a buffer to update model parameters along with current data incrementally. However, existing ER methods overlook buffer management, which is certainly important due to the space constraints and dynamic data distributions. To fill this gap, we propose the three-way experience replay (TWER). In contrast to existing ER methods, TWER is a comprehensive buffer management mechanism that involves admission, sampling, and eviction to improve the accuracy of online prediction. To capture samples indicative of concept drift, we design the admission mechanism based on three-way decision (TWD) rules deduced from historical and future data. By integrating Bayesian decision theory and rough set theory, TWD can provide the option of deferment action, enabling a more informed judgment on concept drift. To preserve the historical data with rare patterns, we enhance the sampling and eviction mechanisms by introducing three-way clustering (TWC), which can identify fringe samples typically missed by existing methods. Leveraging these mechanisms, TWER can adaptively manage data within the buffer, thus enhancing the prediction under concept drift. Empirical results on five real-world datasets show that TWER reduces online prediction error by more than 12% in terms of MSE compared with the existing ER methods such as DER++, GDumb, and memory select.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 18","pages":""},"PeriodicalIF":3.5,"publicationDate":"2025-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145729796","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sample-efficient multi-agent reinforcement learning with high update-to-data ratio and state-action embedding 具有高更新数据比和状态-动作嵌入的样本高效多智能体强化学习
IF 3.5 2区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-12-09 DOI: 10.1007/s10489-025-07035-7
Chenyang Miao, Yingzhuo Jiang, Yunduan Cui, Yidong Chen, Tianfu Sun

A novel Reinforcement Learning (RL) approach Multi-agent Joint Control with State-Action Embedding (MASAE) is proposed in this paper to address the sample-efficiency issue of RL in robot control. It combines the relative entropy regularization and high update-to-data (UTD) ratios in one multi-agent framework to accelerate the learning process while naturally mitigating the overestimation of value functions caused by high UTD ratios by multiple agents. The state-action embeddings are employed to adaptively abstract the hidden features behind the state-action space for enhanced learning efficiency. Evaluated by several simulated benchmark control tasks and a real-world Unitree Go1 quadruped robot system, MASAE demonstrates significant advantages in learning capability and sampling efficiency compared to various related RL baselines, indicating its potential in learning challenging real-world robot systems with a limited number of samples. The open-source code of MASAE is available at https://github.com/AdrienLin1/MASAE.

针对强化学习在机器人控制中的样本效率问题,提出了一种基于状态-动作嵌入的多智能体联合控制方法。它在一个多智能体框架中结合了相对熵正则化和高数据更新(UTD)比率,以加速学习过程,同时自然地减轻了由多个智能体的高UTD比率引起的价值函数的高估。采用状态-动作嵌入自适应抽象状态-动作空间背后的隐藏特征,提高学习效率。通过几个模拟基准控制任务和一个真实世界的Unitree Go1四足机器人系统进行评估,与各种相关的强化学习基线相比,MASAE在学习能力和采样效率方面表现出显著优势,表明它在具有有限样本数量的具有挑战性的真实世界机器人系统中具有潜力。MASAE的开源代码可在https://github.com/AdrienLin1/MASAE上获得。
{"title":"Sample-efficient multi-agent reinforcement learning with high update-to-data ratio and state-action embedding","authors":"Chenyang Miao,&nbsp;Yingzhuo Jiang,&nbsp;Yunduan Cui,&nbsp;Yidong Chen,&nbsp;Tianfu Sun","doi":"10.1007/s10489-025-07035-7","DOIUrl":"10.1007/s10489-025-07035-7","url":null,"abstract":"<div><p>A novel Reinforcement Learning (RL) approach Multi-agent Joint Control with State-Action Embedding (MASAE) is proposed in this paper to address the sample-efficiency issue of RL in robot control. It combines the relative entropy regularization and high update-to-data (UTD) ratios in one multi-agent framework to accelerate the learning process while naturally mitigating the overestimation of value functions caused by high UTD ratios by multiple agents. The state-action embeddings are employed to adaptively abstract the hidden features behind the state-action space for enhanced learning efficiency. Evaluated by several simulated benchmark control tasks and a real-world Unitree Go1 quadruped robot system, MASAE demonstrates significant advantages in learning capability and sampling efficiency compared to various related RL baselines, indicating its potential in learning challenging real-world robot systems with a limited number of samples. The open-source code of MASAE is available at https://github.com/AdrienLin1/MASAE.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 18","pages":""},"PeriodicalIF":3.5,"publicationDate":"2025-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145729908","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Applied Intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1