Pub Date : 2025-02-19DOI: 10.1016/j.ins.2025.121989
Zhen Fang, Jie Lu, Guangquan Zhang
Out-of-distribution (OOD) detection is crucial in modern deep learning applications, as it can identify OOD data drawn from distributions differing from those of the in-distribution (ID) data. Advanced OOD detection methods primarily rely on post-hoc strategies, which identify OOD data by analyzing the predictions of a model well-trained on ID data. However, deep models are known to be impacted by spurious features such as backgrounds, causing existing OOD detection methods to fail in identifying OOD data that share the same spurious features as ID data. Therefore, this paper studies how to mitigate spurious features to improve OOD detection. To address this challenge, we propose a novel method called Non-semantic Exploration OOD Detection (NsED), which focuses on exploring and exploiting non-semantic features. In particular, NsED first explores non-semantic features in an OOD generalization manner. These non-semantic features are then used to train deep models to be more robust against spurious features. Through extensive experiments on representative benchmarks, we show that NsED significantly and consistently improves the detection performance of many representative post-hoc OOD detection methods.
{"title":"Out-of-distribution detection with non-semantic exploration","authors":"Zhen Fang, Jie Lu, Guangquan Zhang","doi":"10.1016/j.ins.2025.121989","DOIUrl":"10.1016/j.ins.2025.121989","url":null,"abstract":"<div><div>Out-of-distribution (OOD) detection is crucial in modern deep learning applications, as it can identify OOD data drawn from distributions differing from those of the in-distribution (ID) data. Advanced OOD detection methods primarily rely on post-hoc strategies, which identify OOD data by analyzing the predictions of a model well-trained on ID data. However, deep models are known to be impacted by spurious features such as backgrounds, causing existing OOD detection methods to fail in identifying OOD data that share the same spurious features as ID data. Therefore, this paper studies how to mitigate spurious features to improve OOD detection. To address this challenge, we propose a novel method called <u>N</u>on-<u>s</u>emantic <u>E</u>xploration OOD <u>D</u>etection (NsED), which focuses on exploring and exploiting non-semantic features. In particular, NsED first explores non-semantic features in an OOD generalization manner. These non-semantic features are then used to train deep models to be more robust against spurious features. Through extensive experiments on representative benchmarks, we show that NsED significantly and consistently improves the detection performance of many representative post-hoc OOD detection methods.</div></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":"705 ","pages":"Article 121989"},"PeriodicalIF":8.1,"publicationDate":"2025-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143454856","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-02-19DOI: 10.1016/j.ins.2025.121992
Youchao Liu, Dingjiang Huang
Recently federated learning (FL) has attracted growing attention by performing data-private collaborative training on decentralized clients. However, the majority of existing FL methods concentrate on single-task scenarios with static data. In real-world scenarios, local clients usually continuously collect new classes from the data stream and have just a small amount of memory to store training samples of old classes. Using single-task models directly will lead to significant catastrophic forgetting in old classes. In addition, there are some typical challenges in FL scenarios, such as computation and communication overhead, data heterogeneity, etc. To comprehensively describe these challenges, we propose a new Personalized Federated Class-Incremental Learning (PFCIL) problem. Furthermore, we propose an innovative Sparse Personalized Federated Class-Incremental Learning (SpaPFCIL) framework that learns a personalized class-incremental model for each client through sparse training to solve this problem. Unlike most knowledge distillation-based methods, our framework does not require additional data to assist. Specifically, to tackle catastrophic forgetting brought by class-incremental tasks, we utilize expandable class-incremental models instead of single-task models. For typical challenges in FL, we use dynamic sparse training to customize sparse local models on clients. It alleviates the negative effects of data heterogeneity and over-parameterization. Our framework outperforms state-of-the-art methods in terms of average accuracy on representative benchmark datasets by 3.3% to 43.6%.
{"title":"Sparse personalized federated class-incremental learning","authors":"Youchao Liu, Dingjiang Huang","doi":"10.1016/j.ins.2025.121992","DOIUrl":"10.1016/j.ins.2025.121992","url":null,"abstract":"<div><div>Recently federated learning (FL) has attracted growing attention by performing data-private collaborative training on decentralized clients. However, the majority of existing FL methods concentrate on single-task scenarios with static data. In real-world scenarios, local clients usually continuously collect new classes from the data stream and have just a small amount of memory to store training samples of old classes. Using single-task models directly will lead to significant catastrophic forgetting in old classes. In addition, there are some typical challenges in FL scenarios, such as computation and communication overhead, data heterogeneity, etc. To comprehensively describe these challenges, we propose a new Personalized Federated Class-Incremental Learning (PFCIL) problem. Furthermore, we propose an innovative Sparse Personalized Federated Class-Incremental Learning (SpaPFCIL) framework that learns a personalized class-incremental model for each client through sparse training to solve this problem. Unlike most knowledge distillation-based methods, our framework does not require additional data to assist. Specifically, to tackle catastrophic forgetting brought by class-incremental tasks, we utilize expandable class-incremental models instead of single-task models. For typical challenges in FL, we use dynamic sparse training to customize sparse local models on clients. It alleviates the negative effects of data heterogeneity and over-parameterization. Our framework outperforms state-of-the-art methods in terms of average accuracy on representative benchmark datasets by 3.3% to 43.6%.</div></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":"706 ","pages":"Article 121992"},"PeriodicalIF":8.1,"publicationDate":"2025-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143454153","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-02-19DOI: 10.1016/j.ins.2025.121996
Pei Lai , Fan Zhang , Tianrui Li , Jin Guo , Fei Teng
Fault diagnosis has long been a topic of great interest, owing to a disaster that can result from the faults of safety-critical systems. In recent years, researchers have realized that fault diagnosis of real equipment, and more precisely the fault identification task, is not simply a pattern recognition problem but instead, a few-shot classification problem. Despite valuable publications on few-shot fault diagnosis (FSFD), these surveys have primarily focused on a methodological perspective. Furthermore, few articles have been published to provide a comprehensive summary of FSFD methods from a knowledge perspective. This paper proposes a comprehensive taxonomy for FSFD methods that classifies them into data-based and knowledge-based approaches, as knowledge and data represent different levels in the knowledge perspective. The paper focuses on the knowledge-based approaches, which include knowledge embedding and knowledge discovery. These approaches aim to leverage the knowledge available in limited datasets and auxiliary datasets. The paper examines various knowledge representations such as predefined rules, learning biases, network parameters, and feature representations. Furthermore, the study assesses potential challenges and future research directions from a knowledge perspective. Finally, some public datasets and code repositories are summarized. This paper can serve as a useful reference for advancing FSFD research.
{"title":"Unlocking the power of knowledge for few-shot fault diagnosis: A review from a knowledge perspective","authors":"Pei Lai , Fan Zhang , Tianrui Li , Jin Guo , Fei Teng","doi":"10.1016/j.ins.2025.121996","DOIUrl":"10.1016/j.ins.2025.121996","url":null,"abstract":"<div><div>Fault diagnosis has long been a topic of great interest, owing to a disaster that can result from the faults of safety-critical systems. In recent years, researchers have realized that fault diagnosis of real equipment, and more precisely the fault identification task, is not simply a pattern recognition problem but instead, a few-shot classification problem. Despite valuable publications on few-shot fault diagnosis (FSFD), these surveys have primarily focused on a methodological perspective. Furthermore, few articles have been published to provide a comprehensive summary of FSFD methods from a knowledge perspective. This paper proposes a comprehensive taxonomy for FSFD methods that classifies them into data-based and knowledge-based approaches, as knowledge and data represent different levels in the knowledge perspective. The paper focuses on the knowledge-based approaches, which include knowledge embedding and knowledge discovery. These approaches aim to leverage the knowledge available in limited datasets and auxiliary datasets. The paper examines various knowledge representations such as predefined rules, learning biases, network parameters, and feature representations. Furthermore, the study assesses potential challenges and future research directions from a knowledge perspective. Finally, some public datasets and code repositories are summarized. This paper can serve as a useful reference for advancing FSFD research.</div></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":"706 ","pages":"Article 121996"},"PeriodicalIF":8.1,"publicationDate":"2025-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143473967","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-02-19DOI: 10.1016/j.ins.2025.121993
Junhong Zhao , Yunliu Li , Ting Liu , Peng Liu , Junwei Sun
This paper investigates the cluster output synchronization of coupled fractional-order uncertain neural networks. By utilizing Lyapunov's theorem and effective inequalities applicable to fractional-order systems, sufficient criteria are established to achieve the cluster output synchronization of coupled fractional-order uncertain neural networks for two different communication topologies, namely strongly connected topology and topology with a spanning tree. Unlike previous works that have focused on the output synchronization of neural networks within the confines of integer order systems or strongly connected topologies, this paper extends the exploration to the output synchronization of coupled fractional-order uncertain neural networks with a spanning tree. Additionally, the conclusions of this paper include the complete synchronization of both fractional-order and integer-order neural networks as special cases. Numerical examples are shown to substantiate the obtained results.
{"title":"Cluster output synchronization analysis of coupled fractional-order uncertain neural networks","authors":"Junhong Zhao , Yunliu Li , Ting Liu , Peng Liu , Junwei Sun","doi":"10.1016/j.ins.2025.121993","DOIUrl":"10.1016/j.ins.2025.121993","url":null,"abstract":"<div><div>This paper investigates the cluster output synchronization of coupled fractional-order uncertain neural networks. By utilizing Lyapunov's theorem and effective inequalities applicable to fractional-order systems, sufficient criteria are established to achieve the cluster output synchronization of coupled fractional-order uncertain neural networks for two different communication topologies, namely strongly connected topology and topology with a spanning tree. Unlike previous works that have focused on the output synchronization of neural networks within the confines of integer order systems or strongly connected topologies, this paper extends the exploration to the output synchronization of coupled fractional-order uncertain neural networks with a spanning tree. Additionally, the conclusions of this paper include the complete synchronization of both fractional-order and integer-order neural networks as special cases. Numerical examples are shown to substantiate the obtained results.</div></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":"705 ","pages":"Article 121993"},"PeriodicalIF":8.1,"publicationDate":"2025-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143444224","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-02-18DOI: 10.1016/j.ins.2025.121986
Jinrong Sheng, Jiaruo Yu, Ziqiang Li, Ao Li, Yongxin Ge
Weakly-supervised temporal action localization (WTAL) identifies and localizes actions in untrimmed videos with only video-level labels. Most methods prioritize discriminative snippets, often neglecting of hard action snippets while focusing on class-specific background. Although recent methods have tackled this issue through temporal modeling, they overlook the local temporal structure of actions. To model such temporal structure effectively, we propose a novel self-supervised temporal adaptive learning (STAL) framework, which is composed of two core parts, i.e. self-supervised temporal learning (STL) network and the adaptive learning unit (ALU). Specifically, STL constructs a self-supervised task by performing an erasure and reconstruction process. This pseudo-label-based method relies on a classification task to perceive continuous temporal information for action localization task. To avoid the disturbance of un-confident pseudo labels during self-supervised learning process, two adaptive learning strategies of ALU are designed from two perspectives. In detail, a task-adaptive learning strategy is used to train the proposed tasks to the best for more reliable pseudo labels. Meanwhile, a score-adaptive learning strategy is designed to balance class activation and attention scores. Experiments on two classical datasets, namely, THUMOS14 and ActivityNet datasets, verify the effectiveness of our method.
{"title":"Self-supervised temporal adaptive learning for weakly-supervised temporal action localization","authors":"Jinrong Sheng, Jiaruo Yu, Ziqiang Li, Ao Li, Yongxin Ge","doi":"10.1016/j.ins.2025.121986","DOIUrl":"10.1016/j.ins.2025.121986","url":null,"abstract":"<div><div>Weakly-supervised temporal action localization (WTAL) identifies and localizes actions in untrimmed videos with only video-level labels. Most methods prioritize discriminative snippets, often neglecting of hard action snippets while focusing on class-specific background. Although recent methods have tackled this issue through temporal modeling, they overlook the local temporal structure of actions. To model such temporal structure effectively, we propose a novel self-supervised temporal adaptive learning (STAL) framework, which is composed of two core parts, i.e. self-supervised temporal learning (STL) network and the adaptive learning unit (ALU). Specifically, STL constructs a self-supervised task by performing an erasure and reconstruction process. This pseudo-label-based method relies on a classification task to perceive continuous temporal information for action localization task. To avoid the disturbance of un-confident pseudo labels during self-supervised learning process, two adaptive learning strategies of ALU are designed from two perspectives. In detail, a task-adaptive learning strategy is used to train the proposed tasks to the best for more reliable pseudo labels. Meanwhile, a score-adaptive learning strategy is designed to balance class activation and attention scores. Experiments on two classical datasets, namely, THUMOS14 and ActivityNet datasets, verify the effectiveness of our method.</div></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":"705 ","pages":"Article 121986"},"PeriodicalIF":8.1,"publicationDate":"2025-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143437279","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-02-18DOI: 10.1016/j.ins.2025.121988
Hector Zenil , Luan Carlos de Sena Monteiro Ozelim
In communication systems, the circumstances and capabilities of senders and receivers cannot be known/assumed beforehand so as to design optimal semantic transference strategies. Regardless of the recipient (plants, insects, or even life forms unknown on Earth), the spatio-temporal scale of a message could be inappropriate and may never be decoded due to incompatibilities at both ends. We devise a new method to encode messages that is agnostic vis-a-vis space and time scales. We propose the use of fractal functions as self-executable carriers for sending messages, given their properties of structural self-similarity and scale invariance. We call this ‘fractal messaging’. Starting from a spatial embedding, we introduce a framework for a space-time scale-free messaging approach. In creating a space and time agnostic framework for message transmission, encoding a message that could be decoded at several spatio-temporal scales is the objective. Our core idea is to encode a binary message as waves along infinitely many frequencies (in power-like distributions) and amplitudes, transmit such a message, and then decode and reproduce it. To do so, the components/cycles of the Weierstrass function, a known fractal, are used as carriers of the message. Each component will have its amplitude modulated to embed the binary stream, allowing for a space-time agnostic approach to messaging.
{"title":"Fractal spatio-temporal scale-free messaging: Amplitude modulation of self-executable carriers given by the Weierstrass function's components","authors":"Hector Zenil , Luan Carlos de Sena Monteiro Ozelim","doi":"10.1016/j.ins.2025.121988","DOIUrl":"10.1016/j.ins.2025.121988","url":null,"abstract":"<div><div>In communication systems, the circumstances and capabilities of senders and receivers cannot be known/assumed beforehand so as to design optimal semantic transference strategies. Regardless of the recipient (plants, insects, or even life forms unknown on Earth), the spatio-temporal scale of a message could be inappropriate and may never be decoded due to incompatibilities at both ends. We devise a new method to encode messages that is agnostic vis-a-vis space and time scales. We propose the use of fractal functions as self-executable carriers for sending messages, given their properties of structural self-similarity and scale invariance. We call this ‘fractal messaging’. Starting from a spatial embedding, we introduce a framework for a space-time scale-free messaging approach. In creating a space and time agnostic framework for message transmission, encoding a message that could be decoded at several spatio-temporal scales is the objective. Our core idea is to encode a binary message as waves along infinitely many frequencies (in power-like distributions) and amplitudes, transmit such a message, and then decode and reproduce it. To do so, the components/cycles of the Weierstrass function, a known fractal, are used as carriers of the message. Each component will have its amplitude modulated to embed the binary stream, allowing for a space-time agnostic approach to messaging.</div></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":"706 ","pages":"Article 121988"},"PeriodicalIF":8.1,"publicationDate":"2025-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143479517","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-02-17DOI: 10.1016/j.ins.2025.121977
Mamadou Ben Hamidou Cissoko , Vincent Castelain , Nicolas Lachiche
In personalized predictive medicine, accurately modeling a patient's illness and care processes is essential, given their inherent long-term temporal dependencies. However, Electronic Health Records (EHRs) often contain episodic and irregularly timed data, resulting from patients' sporadic hospital admissions, leading to unique patterns for each hospital stay. Consequently, constructing a personalized predictive model requires careful consideration of these factors to accurately capture the patient's health journey and assist in clinical decision-making.
Long Short-Term Memory (LSTM) is an effective model for handling sequential data, such as EHRs, but it encounters two major limitations when applied to EHRs: it is unable to interpret the prediction results and it ignores the irregular time intervals between consecutive events. To tackle these limitations, we present a novel deep dynamic memory neural network called Adaptive Multi-Way Interpretable Time-Aware LSTM for irregularly collected sequential data “AMITA”. The primary objective of AMITA is to leverage medical records, memorize illness trajectories and care processes, estimate current illness states, and predict future risks, thereby providing a high level of precision and predictive power.
To enhance its capabilities, AMITA extends the standard LSTM model in two key ways. Firstly, it incorporates frequency measurement and the most recent observation to enhance personalized predictive modeling of patient illnesses, enabling a more accurate understanding of the patient's condition. Secondly, it parameterizes the cell state to handle irregular timing effectively, utilizing both elapsed times and a frequency-based decay factor, which considers both measurement frequency and contextual information. Furthermore, the model capitalizes on both to comprehend the impact of interventions on the course of illness on the cell state, facilitating the memorization of illness courses and improving its ability to capture the temporal dynamics of healthcare data, accommodating variations and irregularities in event and observation timing.
The effectiveness of our proposed model is validated through empirical experiments conducted on two real-world clinical datasets. The results demonstrate the superiority of AMITA over current state-of-the-art models and other robust baselines, showcasing its potential in advancing personalized predictive medicine by offering a more accurate and comprehensive approach to modeling patient health trajectories.
{"title":"Predicting and interpreting healthcare trajectories from irregularly collected sequential patient data using AMITA","authors":"Mamadou Ben Hamidou Cissoko , Vincent Castelain , Nicolas Lachiche","doi":"10.1016/j.ins.2025.121977","DOIUrl":"10.1016/j.ins.2025.121977","url":null,"abstract":"<div><div>In personalized predictive medicine, accurately modeling a patient's illness and care processes is essential, given their inherent long-term temporal dependencies. However, Electronic Health Records (EHRs) often contain episodic and irregularly timed data, resulting from patients' sporadic hospital admissions, leading to unique patterns for each hospital stay. Consequently, constructing a personalized predictive model requires careful consideration of these factors to accurately capture the patient's health journey and assist in clinical decision-making.</div><div>Long Short-Term Memory (LSTM) is an effective model for handling sequential data, such as EHRs, but it encounters two major limitations when applied to EHRs: it is unable to interpret the prediction results and it ignores the irregular time intervals between consecutive events. To tackle these limitations, we present a novel deep dynamic memory neural network called Adaptive Multi-Way Interpretable Time-Aware LSTM for irregularly collected sequential data “AMITA”. The primary objective of AMITA is to leverage medical records, memorize illness trajectories and care processes, estimate current illness states, and predict future risks, thereby providing a high level of precision and predictive power.</div><div>To enhance its capabilities, AMITA extends the standard LSTM model in two key ways. Firstly, it incorporates frequency measurement and the most recent observation to enhance personalized predictive modeling of patient illnesses, enabling a more accurate understanding of the patient's condition. Secondly, it parameterizes the cell state to handle irregular timing effectively, utilizing both elapsed times and a frequency-based decay factor, which considers both measurement frequency and contextual information. Furthermore, the model capitalizes on both to comprehend the impact of interventions on the course of illness on the cell state, facilitating the memorization of illness courses and improving its ability to capture the temporal dynamics of healthcare data, accommodating variations and irregularities in event and observation timing.</div><div>The effectiveness of our proposed model is validated through empirical experiments conducted on two real-world clinical datasets. The results demonstrate the superiority of AMITA over current state-of-the-art models and other robust baselines, showcasing its potential in advancing personalized predictive medicine by offering a more accurate and comprehensive approach to modeling patient health trajectories.</div></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":"705 ","pages":"Article 121977"},"PeriodicalIF":8.1,"publicationDate":"2025-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143428158","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-02-17DOI: 10.1016/j.ins.2025.121985
Yuhang Hu , Yuelin Qu , Wei Li , Ying Huang
Constraint multiobjective algorithms are the most widely applied direction in intelligent optimization, with excellent research value. Currently, most multiobjective multi-constraints algorithms are designed based on the relationship between feasible and infeasible solutions, but they ignore the complex associations between constraints. At the same time, the most significant difficulty in constraint problems lies in the irregularity of constrained Pareto front (CPF) and the lack of a powerful strategy for exploration. The paper proposes a Pareto front searching based on reinforcement learning (PFRL) for multi-constraints multiobjective optimization problems. The algorithm employs reinforcement learning to guide the evolution process through interaction with the environment and adaptively learns the shape and characteristics of CPF to cover the structure of CPF effectively. The environment and CPF information gained by reinforcement learning are utilized for CPF translation and extension to deal with various irregular feasible regions. In addition, the paper also designs a constraint priority evaluation mechanism based on the correlation distance (CD) metric to process constraint relationships. It allows the algorithm to effectively cross over Pareto front (PF) of a single constraint that is unrelated to CPF, improving algorithm efficiency. The introduced algorithm implemented the above strategy using only one population. The effectiveness of the introduced algorithm was verified and compared with nine state-of-the-art algorithms and four real-world constrained multiobjective optimization problems (CMOPs). Experimental results show that the algorithm provides a low-resource and efficient method for solving CMOPs.
{"title":"A Pareto Front searching algorithm based on reinforcement learning for constrained multiobjective optimization","authors":"Yuhang Hu , Yuelin Qu , Wei Li , Ying Huang","doi":"10.1016/j.ins.2025.121985","DOIUrl":"10.1016/j.ins.2025.121985","url":null,"abstract":"<div><div>Constraint multiobjective algorithms are the most widely applied direction in intelligent optimization, with excellent research value. Currently, most multiobjective multi-constraints algorithms are designed based on the relationship between feasible and infeasible solutions, but they ignore the complex associations between constraints. At the same time, the most significant difficulty in constraint problems lies in the irregularity of constrained Pareto front (CPF) and the lack of a powerful strategy for exploration. The paper proposes a Pareto front searching based on reinforcement learning (PFRL) for multi-constraints multiobjective optimization problems. The algorithm employs reinforcement learning to guide the evolution process through interaction with the environment and adaptively learns the shape and characteristics of CPF to cover the structure of CPF effectively. The environment and CPF information gained by reinforcement learning are utilized for CPF translation and extension to deal with various irregular feasible regions. In addition, the paper also designs a constraint priority evaluation mechanism based on the correlation distance (CD) metric to process constraint relationships. It allows the algorithm to effectively cross over Pareto front (PF) of a single constraint that is unrelated to CPF, improving algorithm efficiency. The introduced algorithm implemented the above strategy using only one population. The effectiveness of the introduced algorithm was verified and compared with nine state-of-the-art algorithms and four real-world constrained multiobjective optimization problems (CMOPs). Experimental results show that the algorithm provides a low-resource and efficient method for solving CMOPs.</div></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":"705 ","pages":"Article 121985"},"PeriodicalIF":8.1,"publicationDate":"2025-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143437278","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-02-17DOI: 10.1016/j.ins.2025.121969
Jiazhen Liang , Wai Li , Qingshan Zhong , Jun Huang , Dazhi Jiang , Erik Cambria
Emotion recognition in conversation (ERC) is essential for enhancing human-computer interaction and improving intelligent systems' emotional intelligence. Despite advancements, ERC still struggles to capture the complexity and nuances of emotions in dialogues. Traditional approaches rely on context modeling to improve text representations but often fail to fully capture emotions' continuous and intricate nature. While contrastive learning has emerged as a promising technique to enhance the discriminative power of text features, many methods still rely on discrete emotion labels, limiting their ability to model emotions' inherent continuity and relational structure. To address these challenges, we introduce the Chain of Triplex Contrastive Learning (CoTCL) framework, which progressively refines utterance representations, strengthens emotional distinctions, and incorporates contextual information from dialogues. CoTCL employs instance contrastive learning using dropout-based positive samples to improve the richness and separation of utterance features. Additionally, it introduces the Pleasure-Arousal-Dominance (PAD) space as a continuous representation framework, embedding utterances in a way that reduces confusion between similar emotions. This allows for more nuanced, relation-aware emotion modeling. Furthermore, CoTCL enhances contextual understanding by constructing an utterance graph, where nodes represent utterances and edges denote relationships. By integrating external knowledge from the COMET model and applying graph-based contrastive learning with edge perturbation and node masking, CoTCL improves contextual transmission, making the model more robust and adaptable to real-world dialogues. Experimental results demonstrate that CoTCL achieves state-of-the-art performance across multiple ERC benchmarks, highlighting the importance of continuous emotion modeling and context integration. By refining text representation, introducing a continuous emotion space, and leveraging external knowledge, CoTCL provides a strong foundation for advancing ERC research and improving real-world dialogue systems.
{"title":"Learning chain for clause awareness: Triplex-contrastive learning for emotion recognition in conversations","authors":"Jiazhen Liang , Wai Li , Qingshan Zhong , Jun Huang , Dazhi Jiang , Erik Cambria","doi":"10.1016/j.ins.2025.121969","DOIUrl":"10.1016/j.ins.2025.121969","url":null,"abstract":"<div><div>Emotion recognition in conversation (ERC) is essential for enhancing human-computer interaction and improving intelligent systems' emotional intelligence. Despite advancements, ERC still struggles to capture the complexity and nuances of emotions in dialogues. Traditional approaches rely on context modeling to improve text representations but often fail to fully capture emotions' continuous and intricate nature. While contrastive learning has emerged as a promising technique to enhance the discriminative power of text features, many methods still rely on discrete emotion labels, limiting their ability to model emotions' inherent continuity and relational structure. To address these challenges, we introduce the Chain of Triplex Contrastive Learning (CoTCL) framework, which progressively refines utterance representations, strengthens emotional distinctions, and incorporates contextual information from dialogues. CoTCL employs instance contrastive learning using dropout-based positive samples to improve the richness and separation of utterance features. Additionally, it introduces the Pleasure-Arousal-Dominance (PAD) space as a continuous representation framework, embedding utterances in a way that reduces confusion between similar emotions. This allows for more nuanced, relation-aware emotion modeling. Furthermore, CoTCL enhances contextual understanding by constructing an utterance graph, where nodes represent utterances and edges denote relationships. By integrating external knowledge from the COMET model and applying graph-based contrastive learning with edge perturbation and node masking, CoTCL improves contextual transmission, making the model more robust and adaptable to real-world dialogues. Experimental results demonstrate that CoTCL achieves state-of-the-art performance across multiple ERC benchmarks, highlighting the importance of continuous emotion modeling and context integration. By refining text representation, introducing a continuous emotion space, and leveraging external knowledge, CoTCL provides a strong foundation for advancing ERC research and improving real-world dialogue systems.</div></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":"705 ","pages":"Article 121969"},"PeriodicalIF":8.1,"publicationDate":"2025-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143454852","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
In this paper, a data-driven fault-tolerant control (FTC) method is proposed to solve the consensus problem of constrained multiagent systems (MASs) with denial-of-service attacks. First, a resilient distributed observer is introduced to extract the leader's state in real-time for each follower, even in the presence of attacks. A nonlinear mapping is employed to transform the original system with state constraints into an equivalent constraint-free system, ensuring that the original system's states remain within prescribed limits. Then, an adaptive dynamic programming (ADP)-based FTC scheme is designed for the system to mitigate the effects of actuator faults, enabling the nominal system to balance cost and performance. The ADP algorithm is implemented using an actor-critic structure to solve the Hamilton-Jacobi-Bellman equation based on system data collected via the least-squares method. In this framework, the designed controller is data-driven rather than reliant on precise system information, which broadens the controller's applicability to systems with unknown dynamics. Finally, the effectiveness of the established controller is validated through two examples.
{"title":"Data-driven fault-tolerant consensus control for constrained nonlinear multiagent systems via adaptive dynamic programming","authors":"Lulu Zhang , Huaguang Zhang , Tianbiao Wang , Xiaohui Yue","doi":"10.1016/j.ins.2025.121976","DOIUrl":"10.1016/j.ins.2025.121976","url":null,"abstract":"<div><div>In this paper, a data-driven fault-tolerant control (FTC) method is proposed to solve the consensus problem of constrained multiagent systems (MASs) with denial-of-service attacks. First, a resilient distributed observer is introduced to extract the leader's state in real-time for each follower, even in the presence of attacks. A nonlinear mapping is employed to transform the original system with state constraints into an equivalent constraint-free system, ensuring that the original system's states remain within prescribed limits. Then, an adaptive dynamic programming (ADP)-based FTC scheme is designed for the system to mitigate the effects of actuator faults, enabling the nominal system to balance cost and performance. The ADP algorithm is implemented using an actor-critic structure to solve the Hamilton-Jacobi-Bellman equation based on system data collected via the least-squares method. In this framework, the designed controller is data-driven rather than reliant on precise system information, which broadens the controller's applicability to systems with unknown dynamics. Finally, the effectiveness of the established controller is validated through two examples.</div></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":"705 ","pages":"Article 121976"},"PeriodicalIF":8.1,"publicationDate":"2025-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143428694","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}