Pub Date : 2025-01-15DOI: 10.1109/TSMC.2024.3523708
Youqiang Chen;Ridong Zhang
In industrial fault diagnosis, traditional methods grapple with challenges, such as nonstationarity, nonlinearity, high dimensionality, and strong coupling. To address these issues, we propose an end-to-end fusion model based on multiscale residual convolutional channel attention and transformer model (MRCC-Transformer). This approach initially leverages a multiscale residual convolutional neural network (CNN) to extract data features across various scales, thereby preventing model degradation and autonomously learning and integrating abundant fault information from multiple monitoring variables. Subsequently, a channel attention mechanism (CAM) is introduced to prioritize focus on pertinent convolutional channels to enhance the network’s effectiveness and discriminative capacity. Furthermore, the Transformer is employed to establish dependencies among distinct features to enhance fault diagnosis accuracy. Lastly, the input data is classified for fault diagnosis. The efficacy of the proposed method was validated through simulation experiments on the Tennessee-Eastman (TE) process and an industrial coking furnace. Comparative results demonstrate that the proposed method significantly improves the accuracy of fault diagnosis.
{"title":"Deep Multiscale Convolutional Model With Multihead Self-Attention for Industrial Process Fault Diagnosis","authors":"Youqiang Chen;Ridong Zhang","doi":"10.1109/TSMC.2024.3523708","DOIUrl":"https://doi.org/10.1109/TSMC.2024.3523708","url":null,"abstract":"In industrial fault diagnosis, traditional methods grapple with challenges, such as nonstationarity, nonlinearity, high dimensionality, and strong coupling. To address these issues, we propose an end-to-end fusion model based on multiscale residual convolutional channel attention and transformer model (MRCC-Transformer). This approach initially leverages a multiscale residual convolutional neural network (CNN) to extract data features across various scales, thereby preventing model degradation and autonomously learning and integrating abundant fault information from multiple monitoring variables. Subsequently, a channel attention mechanism (CAM) is introduced to prioritize focus on pertinent convolutional channels to enhance the network’s effectiveness and discriminative capacity. Furthermore, the Transformer is employed to establish dependencies among distinct features to enhance fault diagnosis accuracy. Lastly, the input data is classified for fault diagnosis. The efficacy of the proposed method was validated through simulation experiments on the Tennessee-Eastman (TE) process and an industrial coking furnace. Comparative results demonstrate that the proposed method significantly improves the accuracy of fault diagnosis.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":"55 4","pages":"2503-2512"},"PeriodicalIF":8.6,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143654910","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-10DOI: 10.1109/TSMC.2024.3516857
Yuchen Li;Erhao Zhou;Chi-Man Vong;Shitong Wang
To overcome the inappropriateness of the recently-developed fully interpretable Takagi-Sugeno–Kang fuzzy systems (FIMG-TSK) for high-dimensional classification tasks, which is caused by their unreliable Gaussian mixture models and their very lengthy fuzzy rules on all the original features, this study attempts to develop a stacked ensemble of extremely interpretable first-order TSK fuzzy classifiers (SEXI-TSK-FC) comprising extremely interpretable FIMG-TSK-based classifiers. SEXI-TSK-FC has structural and algorithmic novelties. In the structural sense, to guarantee enhanced generalizability and short fuzzy rules, the proposed XI-TSK is created as each subclassifier on a subset of the original features. Then it stacks each successive subclassifier on both the outputs and the important features selected, which are from the incorrectly classified dataset by the previous subclassifier. After that, SEXI-TSK-FC linearly aggregates all the outputs of its subclassifiers with a one-step calculation to enhance classification accuracy while preserving extreme interpretability. In the algorithmic sense, each short fuzzy rule of the XI-TSK subclassifier is determined using the proposed fuzzy feature selection and clustering algorithm to select the subset of all the original features and simultaneously fix the antecedent and consequent of each rule. After that, the rule weights in each subclassifier are trained quickly with strong generalizability using the proposed Vapnik-Chervonenkis dimension minimization-based learning. Experimental results on 12 benchmark datasets demonstrate the power of the proposed classifier SEXI-TSK-FC on high-dimensional data in testing accuracy, training time, and extreme interpretability.
{"title":"Stacked Ensemble of Extremely Interpretable Takagi-Sugeno–Kang Fuzzy Classifiers for High-Dimensional Data","authors":"Yuchen Li;Erhao Zhou;Chi-Man Vong;Shitong Wang","doi":"10.1109/TSMC.2024.3516857","DOIUrl":"https://doi.org/10.1109/TSMC.2024.3516857","url":null,"abstract":"To overcome the inappropriateness of the recently-developed fully interpretable Takagi-Sugeno–Kang fuzzy systems (FIMG-TSK) for high-dimensional classification tasks, which is caused by their unreliable Gaussian mixture models and their very lengthy fuzzy rules on all the original features, this study attempts to develop a stacked ensemble of extremely interpretable first-order TSK fuzzy classifiers (SEXI-TSK-FC) comprising extremely interpretable FIMG-TSK-based classifiers. SEXI-TSK-FC has structural and algorithmic novelties. In the structural sense, to guarantee enhanced generalizability and short fuzzy rules, the proposed XI-TSK is created as each subclassifier on a subset of the original features. Then it stacks each successive subclassifier on both the outputs and the important features selected, which are from the incorrectly classified dataset by the previous subclassifier. After that, SEXI-TSK-FC linearly aggregates all the outputs of its subclassifiers with a one-step calculation to enhance classification accuracy while preserving extreme interpretability. In the algorithmic sense, each short fuzzy rule of the XI-TSK subclassifier is determined using the proposed fuzzy feature selection and clustering algorithm to select the subset of all the original features and simultaneously fix the antecedent and consequent of each rule. After that, the rule weights in each subclassifier are trained quickly with strong generalizability using the proposed Vapnik-Chervonenkis dimension minimization-based learning. Experimental results on 12 benchmark datasets demonstrate the power of the proposed classifier SEXI-TSK-FC on high-dimensional data in testing accuracy, training time, and extreme interpretability.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":"55 4","pages":"2414-2425"},"PeriodicalIF":8.6,"publicationDate":"2025-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143654909","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
In our daily lives, emotions are extremely important. However, predicting and regulating emotion is still a critical problem to be solved in the research of the human-computer interaction (HCI). In this study, we explore this problem using a regulation method. A novel multimodal affective computing algorithm is proposed and implemented in an emotion regulation system. The selection of music stimuli and the modeling of dynamic emotions are done using emotional Markov chains. This regulation system can monitor the user’s emotion and play music, selected by the regulation policy until the user can maintain the desired emotion. Our system was verified by two experiments. In the first experiment, by predicting the participants’ affective states, we tested the precision of our multimodal affective computing system. In the second experiment, we tested the regulation algorithm embedded in a closed-loop regulation system by comparing it with playing music without feedback. The results suggest that participants can regulate and maintain the desired affective state by using the emotion regulation system.
{"title":"Design and Analysis of a Closed-Loop Emotion Regulation System Based on Multimodal Affective Computing and Emotional Markov Chain","authors":"Xingchao Wang;Chen-Zhong Li;Zhenglong Sun;Yangsheng Xu","doi":"10.1109/TSMC.2024.3523342","DOIUrl":"https://doi.org/10.1109/TSMC.2024.3523342","url":null,"abstract":"In our daily lives, emotions are extremely important. However, predicting and regulating emotion is still a critical problem to be solved in the research of the human-computer interaction (HCI). In this study, we explore this problem using a regulation method. A novel multimodal affective computing algorithm is proposed and implemented in an emotion regulation system. The selection of music stimuli and the modeling of dynamic emotions are done using emotional Markov chains. This regulation system can monitor the user’s emotion and play music, selected by the regulation policy until the user can maintain the desired emotion. Our system was verified by two experiments. In the first experiment, by predicting the participants’ affective states, we tested the precision of our multimodal affective computing system. In the second experiment, we tested the regulation algorithm embedded in a closed-loop regulation system by comparing it with playing music without feedback. The results suggest that participants can regulate and maintain the desired affective state by using the emotion regulation system.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":"55 4","pages":"2426-2437"},"PeriodicalIF":8.6,"publicationDate":"2025-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143654908","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
This article is focused on the stabilization and control of variable fractional-order (VFO) neutral delay systems with time-varying structured uncertainties and delays. Using the Lyapunov theorem, the delay- and order-dependent stability criterion of both the nominal and uncertain VFO systems of neutral-type with time-varying delays are derived using a set of linear matrix inequalities (LMIs). An LMI-based output-feedback proportional-derivative (PD) control law that meets the delay-dependent and order-dependent criteria is then derived to guarantee the asymptotic stability of such systems. The performance of the proposed control structure is assessed using a simulation study of two VFO systems.
{"title":"Robust Delay-Dependent Output-Feedback PD Controller Design for Variable Fractional-Order Uncertain Neutral Systems With Time-Varying Delays","authors":"Zahra Sadat Aghayan;Alireza Alfi;Yashar Mousavi;Afef Fekih","doi":"10.1109/TSMC.2024.3508573","DOIUrl":"https://doi.org/10.1109/TSMC.2024.3508573","url":null,"abstract":"This article is focused on the stabilization and control of variable fractional-order (VFO) neutral delay systems with time-varying structured uncertainties and delays. Using the Lyapunov theorem, the delay- and order-dependent stability criterion of both the nominal and uncertain VFO systems of neutral-type with time-varying delays are derived using a set of linear matrix inequalities (LMIs). An LMI-based output-feedback proportional-derivative (PD) control law that meets the delay-dependent and order-dependent criteria is then derived to guarantee the asymptotic stability of such systems. The performance of the proposed control structure is assessed using a simulation study of two VFO systems.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":"55 3","pages":"1986-1996"},"PeriodicalIF":8.6,"publicationDate":"2025-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143438465","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-09DOI: 10.1109/TSMC.2024.3522980
Jianghong Ma;Rong Wang;Tianjun Wei;Kangzhe Liu;Haijun Zhang;Xiaolei Lu
Recommender systems are essential in the ever-evolving landscape of e-commerce and social media platforms, delivering personalized recommendations by predicting user preferences. However, the growing need for explainable recommendation has arisen to enhance transparency and persuasiveness. In response, we present correlation-driven explainable recommendation with aspect and rating boosted representation learning (CER-ARRL), a unified joint-ranking framework that capitalizes on the robust capabilities of neural collaborative filtering to model the intricate dynamics among users, items, and explanations. By extracting information from explicit and implicit user emotional reviews, our framework enriches the representations of users and items. This integration yields simultaneous improvements in both item recommendation and explanation ranking tasks. In addition, CER-ARRL effectively exploits the structural correlation between phrases as well as the structural and semantic correlations between emojis to facilitate explanation ranking. This work represents the pioneering work to address the item-explanation joint recommendation task by integrating both interpretative phrases and illustrative emojis. Through extensive experiments on various datasets, including our collected dataset, we demonstrate the superiority of the proposed method over existing baselines.
{"title":"Correlation-Driven Explainable Recommendation With Aspect and Rating Boosted Representation Learning: A Unified Joint-Ranking Framework","authors":"Jianghong Ma;Rong Wang;Tianjun Wei;Kangzhe Liu;Haijun Zhang;Xiaolei Lu","doi":"10.1109/TSMC.2024.3522980","DOIUrl":"https://doi.org/10.1109/TSMC.2024.3522980","url":null,"abstract":"Recommender systems are essential in the ever-evolving landscape of e-commerce and social media platforms, delivering personalized recommendations by predicting user preferences. However, the growing need for explainable recommendation has arisen to enhance transparency and persuasiveness. In response, we present correlation-driven explainable recommendation with aspect and rating boosted representation learning (CER-ARRL), a unified joint-ranking framework that capitalizes on the robust capabilities of neural collaborative filtering to model the intricate dynamics among users, items, and explanations. By extracting information from explicit and implicit user emotional reviews, our framework enriches the representations of users and items. This integration yields simultaneous improvements in both item recommendation and explanation ranking tasks. In addition, CER-ARRL effectively exploits the structural correlation between phrases as well as the structural and semantic correlations between emojis to facilitate explanation ranking. This work represents the pioneering work to address the item-explanation joint recommendation task by integrating both interpretative phrases and illustrative emojis. Through extensive experiments on various datasets, including our collected dataset, we demonstrate the superiority of the proposed method over existing baselines.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":"55 4","pages":"2489-2502"},"PeriodicalIF":8.6,"publicationDate":"2025-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143654932","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-08DOI: 10.1109/TSMC.2024.3523369
Baoyu Wen;Jiangshuai Huang;Xiaojie Su;Yue Yang
This article investigates the distributed control of a group of stochastic high-order nonlinear systems in which the subsystems are with unknown and time-varying control coefficients of unknown signs, inherent nonlinear drift and diffusion terms. To solve the control problem with unknown control directions, where traditional available Nussbaum functions are not applicable for the consensus of stochastic nonlinear systems with unknown and time-varying coefficients of unknown signs, a novel type of Nussbaum function is proposed with a new paradigm of stability analysis in probability. Global consensus control of stochastic multiagent systems is achieved by designing distributed controllers which integrate designed distributed filters and novel Nussbaum functions. In addition, it can be proved that all signals in the closed-loop system are bound in probability, and the transient consensus errors of the followers are bounded by positive constants which can be adjusted arbitrarily small. The effectiveness of the proposed control scheme is demonstrated by simulation results.
{"title":"Consensus Control of Nonlinear Stochastic Multiagent Systems With Unknown and Time-Varying Control Coefficients Based on Novel Nussbaum Functions","authors":"Baoyu Wen;Jiangshuai Huang;Xiaojie Su;Yue Yang","doi":"10.1109/TSMC.2024.3523369","DOIUrl":"https://doi.org/10.1109/TSMC.2024.3523369","url":null,"abstract":"This article investigates the distributed control of a group of stochastic high-order nonlinear systems in which the subsystems are with unknown and time-varying control coefficients of unknown signs, inherent nonlinear drift and diffusion terms. To solve the control problem with unknown control directions, where traditional available Nussbaum functions are not applicable for the consensus of stochastic nonlinear systems with unknown and time-varying coefficients of unknown signs, a novel type of Nussbaum function is proposed with a new paradigm of stability analysis in probability. Global consensus control of stochastic multiagent systems is achieved by designing distributed controllers which integrate designed distributed filters and novel Nussbaum functions. In addition, it can be proved that all signals in the closed-loop system are bound in probability, and the transient consensus errors of the followers are bounded by positive constants which can be adjusted arbitrarily small. The effectiveness of the proposed control scheme is demonstrated by simulation results.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":"55 3","pages":"2264-2276"},"PeriodicalIF":8.6,"publicationDate":"2025-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143438464","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-08DOI: 10.1109/TSMC.2024.3520526
Yanchi Li;Dongcheng Li;Wenyin Gong;Qiong Gu
Multiobjective multitask optimization (MO-MTO) aims to exploit the similarities among different multiobjective optimization tasks through knowledge transfer (KT), facilitating their simultaneous resolution. The effective design of KT techniques embedded in multiobjective evolutionary optimizers is crucial for enhancing the performance of multiobjective multitask evolutionary algorithms (MO-MTEAs). However, a significant limitation of existing KT techniques in MO-MTEAs is their equal treatment of particles/individuals for transferred knowledge reception, which can negatively impact the balance of diversity and convergence in population evolution. To remedy this limitation, this article proposes a new MO-MTEA, named MTEA-DCK, which incorporates diversity-oriented KT (DKT) and convergence-oriented KT (CKT) techniques tailored for different particles in the population. MTEA-DCK utilizes a strength-Pareto-based competitive mechanism to divide particles into winners and losers: 1) for winners, DKT is conducted via an intertask domain alignment approach to enhance population diversity and 2) for losers, CKT is executed within the unified search space to improve convergence. Additionally, to ensure robust performance on complex task combinations, we introduce two automatic parameter control strategies specifically designed for these KT techniques. MTEA-DCK was performed on 39 benchmark MO-MTO problems and demonstrated superior performance compared to eight state-of-the-art MO-MTEAs and six multiobjective evolutionary algorithms. Finally, we present three real-world MO-MTO application cases, where our approach also yielded better results than other algorithms.
{"title":"Multiobjective Multitask Optimization via Diversity- and Convergence-Oriented Knowledge Transfer","authors":"Yanchi Li;Dongcheng Li;Wenyin Gong;Qiong Gu","doi":"10.1109/TSMC.2024.3520526","DOIUrl":"https://doi.org/10.1109/TSMC.2024.3520526","url":null,"abstract":"Multiobjective multitask optimization (MO-MTO) aims to exploit the similarities among different multiobjective optimization tasks through knowledge transfer (KT), facilitating their simultaneous resolution. The effective design of KT techniques embedded in multiobjective evolutionary optimizers is crucial for enhancing the performance of multiobjective multitask evolutionary algorithms (MO-MTEAs). However, a significant limitation of existing KT techniques in MO-MTEAs is their equal treatment of particles/individuals for transferred knowledge reception, which can negatively impact the balance of diversity and convergence in population evolution. To remedy this limitation, this article proposes a new MO-MTEA, named MTEA-DCK, which incorporates diversity-oriented KT (DKT) and convergence-oriented KT (CKT) techniques tailored for different particles in the population. MTEA-DCK utilizes a strength-Pareto-based competitive mechanism to divide particles into winners and losers: 1) for winners, DKT is conducted via an intertask domain alignment approach to enhance population diversity and 2) for losers, CKT is executed within the unified search space to improve convergence. Additionally, to ensure robust performance on complex task combinations, we introduce two automatic parameter control strategies specifically designed for these KT techniques. MTEA-DCK was performed on 39 benchmark MO-MTO problems and demonstrated superior performance compared to eight state-of-the-art MO-MTEAs and six multiobjective evolutionary algorithms. Finally, we present three real-world MO-MTO application cases, where our approach also yielded better results than other algorithms.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":"55 3","pages":"2367-2379"},"PeriodicalIF":8.6,"publicationDate":"2025-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143465569","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The application of automated guided vehicle (AGV) greatly improves the production efficiency of workshop. However, machine flexibility and limited logistics equipment increase the complexity of collaborative scheduling, and frequent dynamic events bring uncertainty. Therefore, this article proposes a real-time scheduling method for dynamic flexible job shop scheduling problem with AGVs using multiagent reinforcement learning (MARL). Specifically, a real-time scheduling framework is proposed in which a multiagent scheduling architecture is designed for achieving task selection, machine allocation and AGV allocation. Then, an action space and an efficient action decoding algorithm are proposed, which enable agents to explore in the high-quality solution space and improve the learning efficiency. In addition, a state space with generalization, a reward function considering machine idle time and a strategy for handling four disturbance events are designed to minimize the total tardiness cost. Comparison experiments show that the proposed method outperforms the priority dispatching rules, genetic programming and four popular reinforcement learning (RL)-based methods, with performance improvements mostly exceeding 10%. Furthermore, experiments considering four disturbance events demonstrate that the proposed method has strong robustness, and it can provide appropriate scheme for uncertain manufacturing system.
{"title":"Real-Time Scheduling for Flexible Job Shop With AGVs Using Multiagent Reinforcement Learning and Efficient Action Decoding","authors":"Yuxin Li;Qingzheng Wang;Xinyu Li;Liang Gao;Ling Fu;Yanbin Yu;Wei Zhou","doi":"10.1109/TSMC.2024.3520381","DOIUrl":"https://doi.org/10.1109/TSMC.2024.3520381","url":null,"abstract":"The application of automated guided vehicle (AGV) greatly improves the production efficiency of workshop. However, machine flexibility and limited logistics equipment increase the complexity of collaborative scheduling, and frequent dynamic events bring uncertainty. Therefore, this article proposes a real-time scheduling method for dynamic flexible job shop scheduling problem with AGVs using multiagent reinforcement learning (MARL). Specifically, a real-time scheduling framework is proposed in which a multiagent scheduling architecture is designed for achieving task selection, machine allocation and AGV allocation. Then, an action space and an efficient action decoding algorithm are proposed, which enable agents to explore in the high-quality solution space and improve the learning efficiency. In addition, a state space with generalization, a reward function considering machine idle time and a strategy for handling four disturbance events are designed to minimize the total tardiness cost. Comparison experiments show that the proposed method outperforms the priority dispatching rules, genetic programming and four popular reinforcement learning (RL)-based methods, with performance improvements mostly exceeding 10%. Furthermore, experiments considering four disturbance events demonstrate that the proposed method has strong robustness, and it can provide appropriate scheme for uncertain manufacturing system.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":"55 3","pages":"2120-2132"},"PeriodicalIF":8.6,"publicationDate":"2025-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143438448","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-07DOI: 10.1109/TSMC.2024.3522968
Yuhan Wang;Zhuping Wang;Hao Zhang;Huaicheng Yan
This article investigates the optimal group consensus problem (GCP) in multiagent systems (MASs). To address this problem, a novel distributed optimal control policy is designed in the framework of off-policy reinforcement learning (RL). First, a framework for multiagent differential graphical games is formulated. Second, a min-max strategy is then introduced to ensure the achievement of group consensus through a data-driven value iteration (VI) approach. Finally, the presented consensus control policy is extended to address the group formation tracking problem (GFTP) of nonholonomic mobile robots, with a numerical example to illustrate the efficacy of the proposed results. Compared with the existing literature, this article has the following contributions: 1) A group of agents are decomposed into multiple subgroups to accomplish different consensus objectives; 2) the prior knowledge of agents’ dynamics and initial stabilizing control gains can be eliminated; and 3) the performance index function (PIF) for each agent is designed to integrate not only its individual control policy but also that of its neighboring agents.
{"title":"Optimal Group Consensus of Multiagent Systems in Graphical Games Using Reinforcement Learning","authors":"Yuhan Wang;Zhuping Wang;Hao Zhang;Huaicheng Yan","doi":"10.1109/TSMC.2024.3522968","DOIUrl":"https://doi.org/10.1109/TSMC.2024.3522968","url":null,"abstract":"This article investigates the optimal group consensus problem (GCP) in multiagent systems (MASs). To address this problem, a novel distributed optimal control policy is designed in the framework of off-policy reinforcement learning (RL). First, a framework for multiagent differential graphical games is formulated. Second, a min-max strategy is then introduced to ensure the achievement of group consensus through a data-driven value iteration (VI) approach. Finally, the presented consensus control policy is extended to address the group formation tracking problem (GFTP) of nonholonomic mobile robots, with a numerical example to illustrate the efficacy of the proposed results. Compared with the existing literature, this article has the following contributions: 1) A group of agents are decomposed into multiple subgroups to accomplish different consensus objectives; 2) the prior knowledge of agents’ dynamics and initial stabilizing control gains can be eliminated; and 3) the performance index function (PIF) for each agent is designed to integrate not only its individual control policy but also that of its neighboring agents.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":"55 3","pages":"2343-2353"},"PeriodicalIF":8.6,"publicationDate":"2025-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143465577","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The Epileptor model is a mathematical framework utilized for simulating the transition from interictal to ictal local field potential (LFP) activity in the brain, with the aim of predicting and preventing epileptic seizures. This article introduces a novel approach integrating Lyapunov and Poincaré–Bendixson methods to analyze the stability of limit cycles in nonlinear systems, specifically focusing on Epileptors with a two-state dynamic. Our method accurately delineates the limit cycle boundary through eigenvalue-based analysis, facilitating precise assessment of stability properties and identification of critical regions linked to seizure initiation and termination. Through the investigation of the two-state dynamics of Epileptors, we gain deeper insights into the transition between low activity and seizure states, consequently improving our understanding of epileptic seizures. Our approach can be employed to establish stability conditions and determine the existence of limit cycles in Epileptor models, which can further aid in predicting and preventing epileptic seizures by identifying critical regions associated with seizure initiation and termination. The simulations conducted in this study demonstrate that the model under investigation exhibits stable limit cycle behavior and manifests bifurcation, with significant implications for the development of targeted interventions and more effective prediction and treatments for epilepsy. The findings indicate that the suggested approach establishes that external stimulation should not surpass 10.8 mA. Moreover, the initial normal state lies within the range of −1.6 to −0.1 ictal LFP. On the other hand, the LaSalle and eigenvalue methods individually cannot precisely determine the limit cycle region.
{"title":"New Eigenvalue-Based Analysis for Precise Limit Cycle Stability Assessment in a Two-State Epileptor Model","authors":"Samaneh-Alsadat Saeedinia;Mohammad-Reza Jahed-Motlagh;Nikola Kirilov Kasabov;Abbas Tafakhori","doi":"10.1109/TSMC.2024.3517620","DOIUrl":"https://doi.org/10.1109/TSMC.2024.3517620","url":null,"abstract":"The Epileptor model is a mathematical framework utilized for simulating the transition from interictal to ictal local field potential (LFP) activity in the brain, with the aim of predicting and preventing epileptic seizures. This article introduces a novel approach integrating Lyapunov and Poincaré–Bendixson methods to analyze the stability of limit cycles in nonlinear systems, specifically focusing on Epileptors with a two-state dynamic. Our method accurately delineates the limit cycle boundary through eigenvalue-based analysis, facilitating precise assessment of stability properties and identification of critical regions linked to seizure initiation and termination. Through the investigation of the two-state dynamics of Epileptors, we gain deeper insights into the transition between low activity and seizure states, consequently improving our understanding of epileptic seizures. Our approach can be employed to establish stability conditions and determine the existence of limit cycles in Epileptor models, which can further aid in predicting and preventing epileptic seizures by identifying critical regions associated with seizure initiation and termination. The simulations conducted in this study demonstrate that the model under investigation exhibits stable limit cycle behavior and manifests bifurcation, with significant implications for the development of targeted interventions and more effective prediction and treatments for epilepsy. The findings indicate that the suggested approach establishes that external stimulation should not surpass 10.8 mA. Moreover, the initial normal state lies within the range of −1.6 to −0.1 ictal LFP. On the other hand, the LaSalle and eigenvalue methods individually cannot precisely determine the limit cycle region.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":"55 3","pages":"2062-2072"},"PeriodicalIF":8.6,"publicationDate":"2025-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143438339","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}