首页 > 最新文献

IEEE Transactions on Systems Man Cybernetics-Systems最新文献

英文 中文
Approximate Optimized Backstepping Control of Uncertain Fractional-Order Nonlinear Systems Based on Reinforcement Learning 基于强化学习的不确定分数阶非线性系统的近似优化反步法控制
IF 8.6 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-08-13 DOI: 10.1109/TSMC.2024.3426923
Dongdong Li;Jiuxiang Dong
In this article, a feasible reinforcement learning (RL) scheme is proposed for partially unknown fractional-order nonlinear systems (FONSs). First, the fractional Hamilton-Jacobi–Bellman (HJB) equation containing the dynamics of FONSs is proposed by constructing an auxiliary system and equivalent transformation. Then, the optimal solution of FONSs optimal control under a performance constraint is obtained. It is proved that the optimal cost function and optimal control policy can be approximated gradually by the policy iteration. By using the backstepping control, RL, and identifier-actor-critic neural networks (NNs), the unknown dynamics functions are approximated and the approximate optimal controllers are obtained. A Lyapunov function based on optimality error is constructed, then the fractional-order update laws of NNs weights are designed to ensure that the weights converge to the optimum. Thus, the use of the gradient descent algorithm in the context of the fractional-order calculus to train the NNs is avoided. Finally, the error signals are proved to be bounded and the effectiveness of the proposed algorithm is verified by the simulation of two practical examples.
本文针对部分未知分数阶非线性系统(FONS)提出了一种可行的强化学习(RL)方案。首先,通过构建辅助系统和等价变换,提出了包含 FONS 动力学的分数汉密尔顿-雅各比-贝尔曼(HJB)方程。然后,得到了性能约束下 FONSs 优化控制的最优解。研究证明,最优成本函数和最优控制策略可以通过策略迭代逐步逼近。通过使用反步态控制、RL 和识别器-动作批判神经网络(NN),对未知动力学函数进行近似,并得到近似最优控制器。构建了基于最优误差的 Lyapunov 函数,然后设计了神经网络权重的分数阶更新规律,以确保权重收敛到最优值。这样,就避免了使用分数阶微积分背景下的梯度下降算法来训练 NN。最后,通过对两个实际例子的仿真,证明了误差信号是有界的,并验证了所提算法的有效性。
{"title":"Approximate Optimized Backstepping Control of Uncertain Fractional-Order Nonlinear Systems Based on Reinforcement Learning","authors":"Dongdong Li;Jiuxiang Dong","doi":"10.1109/TSMC.2024.3426923","DOIUrl":"https://doi.org/10.1109/TSMC.2024.3426923","url":null,"abstract":"In this article, a feasible reinforcement learning (RL) scheme is proposed for partially unknown fractional-order nonlinear systems (FONSs). First, the fractional Hamilton-Jacobi–Bellman (HJB) equation containing the dynamics of FONSs is proposed by constructing an auxiliary system and equivalent transformation. Then, the optimal solution of FONSs optimal control under a performance constraint is obtained. It is proved that the optimal cost function and optimal control policy can be approximated gradually by the policy iteration. By using the backstepping control, RL, and identifier-actor-critic neural networks (NNs), the unknown dynamics functions are approximated and the approximate optimal controllers are obtained. A Lyapunov function based on optimality error is constructed, then the fractional-order update laws of NNs weights are designed to ensure that the weights converge to the optimum. Thus, the use of the gradient descent algorithm in the context of the fractional-order calculus to train the NNs is avoided. Finally, the error signals are proved to be bounded and the effectiveness of the proposed algorithm is verified by the simulation of two practical examples.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":null,"pages":null},"PeriodicalIF":8.6,"publicationDate":"2024-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142443082","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dynamic Event-Triggered Fuzzy Adaptive Resilient Consensus Control for Nonlinear MASs Under DoS Attacks DoS 攻击下非线性 MAS 的动态事件触发模糊自适应弹性共识控制
IF 8.6 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-08-09 DOI: 10.1109/TSMC.2024.3426931
Jun Zhang;Yi Zuo;Shaocheng Tong
In this article, the adaptive fuzzy dynamic event-triggered output feedback resilient consensus control issue is investigated for nonlinear multiagent systems (MASs) subject to denial-of-service (DoS) attacks. Fuzzy logic systems (FLSs) are employed to model uncertain agents, and a state observer is constructed to estimate unmeasurable states. An event-triggered distributed resilient observer is designed to save the communication resources between agents, and estimate the unknown leader and its high-order derivatives in case of the communication topology being interrupted by DoS attacks. By the designed state observer and distributed resilient observer, a dynamic event-triggered resilient consensus control method is presented. It is proved that the controlled MASs are stable, and the followers can track the leader under DoS attacks. Moreover, the Zeno behavior can be excluded. Finally, we apply the developed resilient consensus control algorithm to multiple unmanned surface vehicles (USVs), the simulation results verify its effectiveness.
本文研究了受拒绝服务(DoS)攻击的非线性多代理系统(MAS)的自适应模糊动态事件触发输出反馈弹性共识控制问题。研究采用模糊逻辑系统(FLS)来模拟不确定的代理,并构建了一个状态观测器来估计不可测量的状态。设计了一个事件触发的分布式弹性观测器,以节省代理之间的通信资源,并在通信拓扑被 DoS 攻击中断时估计未知领导者及其高阶导数。通过所设计的状态观测器和分布式弹性观测器,提出了一种动态事件触发弹性共识控制方法。实验证明,受控的 MAS 是稳定的,在 DoS 攻击下,跟随者可以跟踪领导者。此外,还可以排除 Zeno 行为。最后,我们将所开发的弹性共识控制算法应用于多个无人水面飞行器(USV),仿真结果验证了该算法的有效性。
{"title":"Dynamic Event-Triggered Fuzzy Adaptive Resilient Consensus Control for Nonlinear MASs Under DoS Attacks","authors":"Jun Zhang;Yi Zuo;Shaocheng Tong","doi":"10.1109/TSMC.2024.3426931","DOIUrl":"https://doi.org/10.1109/TSMC.2024.3426931","url":null,"abstract":"In this article, the adaptive fuzzy dynamic event-triggered output feedback resilient consensus control issue is investigated for nonlinear multiagent systems (MASs) subject to denial-of-service (DoS) attacks. Fuzzy logic systems (FLSs) are employed to model uncertain agents, and a state observer is constructed to estimate unmeasurable states. An event-triggered distributed resilient observer is designed to save the communication resources between agents, and estimate the unknown leader and its high-order derivatives in case of the communication topology being interrupted by DoS attacks. By the designed state observer and distributed resilient observer, a dynamic event-triggered resilient consensus control method is presented. It is proved that the controlled MASs are stable, and the followers can track the leader under DoS attacks. Moreover, the Zeno behavior can be excluded. Finally, we apply the developed resilient consensus control algorithm to multiple unmanned surface vehicles (USVs), the simulation results verify its effectiveness.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":null,"pages":null},"PeriodicalIF":8.6,"publicationDate":"2024-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142443057","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Fully Distributed Robust MPC Approach for Frequency and Voltage Regulation in Smart Grids With Active and Reactive Power Constraints 有功和无功功率约束下智能电网中频率和电压调节的全分布式鲁棒 MPC 方法
IF 8.6 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-08-09 DOI: 10.1109/TSMC.2024.3426083
Giulio Ferro;Michela Robba;Roberto Sacile
Shortly, power distribution grids will incorporate large amounts of distributed energy resources and flexible loads, allowing the operation of a portion of the network in islanded mode to increase the reliability and resilience of the whole power system. A fully distributed robust model predictive control (MPC) strategy for voltage and frequency regulation in interconnected distribution grids is stated. Each grid node represents a collection of prosumers with a large active and reactive power regulation capacity. The advantages of this approach rely on the capability to afford any type of uncertainties, without making any assumption on the probability density function, on distributed generation and load nowcasting. We propose a two-stage architecture: at the first stage, an MPC approach, based on the distributed alternating direction method of multipliers (dADMM), is performed, considering the data nowcasting; instead, the second stage (based on robust distributed team decision theory) takes as input the trajectory of the first stage to compensate the noise that affects the system. The developed architecture has been tested on a modified IEEE5 bus system, considering multiple loads and renewable generation.
不久的将来,配电网将纳入大量分布式能源资源和灵活负载,允许部分网络以孤岛模式运行,以提高整个电力系统的可靠性和恢复能力。本文阐述了一种用于互联配电网电压和频率调节的全分布式鲁棒模型预测控制(MPC)策略。每个电网节点都代表了具有较大有功和无功功率调节能力的用户集合。这种方法的优势在于能够承受任何类型的不确定性,而无需对分布式发电和负荷预测的概率密度函数做任何假设。我们提出了一种两阶段架构:在第一阶段,基于分布式交替乘法(dADMM)的 MPC 方法将考虑数据预报;而第二阶段(基于稳健分布式团队决策理论)将第一阶段的轨迹作为输入,以补偿影响系统的噪声。所开发的架构已在改进的 IEEE5 总线系统上进行了测试,并考虑了多负载和可再生能源发电。
{"title":"A Fully Distributed Robust MPC Approach for Frequency and Voltage Regulation in Smart Grids With Active and Reactive Power Constraints","authors":"Giulio Ferro;Michela Robba;Roberto Sacile","doi":"10.1109/TSMC.2024.3426083","DOIUrl":"https://doi.org/10.1109/TSMC.2024.3426083","url":null,"abstract":"Shortly, power distribution grids will incorporate large amounts of distributed energy resources and flexible loads, allowing the operation of a portion of the network in islanded mode to increase the reliability and resilience of the whole power system. A fully distributed robust model predictive control (MPC) strategy for voltage and frequency regulation in interconnected distribution grids is stated. Each grid node represents a collection of prosumers with a large active and reactive power regulation capacity. The advantages of this approach rely on the capability to afford any type of uncertainties, without making any assumption on the probability density function, on distributed generation and load nowcasting. We propose a two-stage architecture: at the first stage, an MPC approach, based on the distributed alternating direction method of multipliers (dADMM), is performed, considering the data nowcasting; instead, the second stage (based on robust distributed team decision theory) takes as input the trajectory of the first stage to compensate the noise that affects the system. The developed architecture has been tested on a modified IEEE5 bus system, considering multiple loads and renewable generation.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":null,"pages":null},"PeriodicalIF":8.6,"publicationDate":"2024-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142443017","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Centimeter-Level Indoor Positioning With Facing Direction Detection for Microlocation-Aware Services 利用面朝方向检测进行厘米级室内定位,实现微定位感知服务
IF 8.6 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-08-06 DOI: 10.1109/TSMC.2024.3432615
Lien-Wu Chen;Chi-Ren Chen
This study proposes a centimeter-level indoor positioning (CLIP) framework to achieve highly accurate localization with facing direction detection for the microlocation-aware Internet of Things (IoT). The CLIP framework can provide accurate centimeter-level positioning information to people indoors by integrating installed surveillance cameras with the IoT, where the efficient operation of microlocation-aware IoT applications and services can be enabled for smart spaces. CLIP can be used to accurately determine the position and facing direction of an individual. According to our review of relevant research, CLIP is the first indoor positioning framework that includes the following features: 1) centimeter-level positioning accuracy for the microlocation-aware IoT that can detect the facing direction of individuals; 2) employment of existing surveillance cameras with low-additional installation cost; and 3) innovative infrastructure for microlocation-aware IoT applications that can enable accurate centimeter-level path planning for individuals, emergency evacuation for groups of people, and geofencing with microlocation awareness. An Android-based system was implemented to verify the feasibility and effectiveness of the CLIP framework, and experimental results indicate that CLIP outperforms existing indoor positioning methods and can achieve centimeter-level accuracy with the improvement ratio of 94.6% over Sextant.
本研究提出了一种厘米级室内定位(CLIP)框架,为微定位感知物联网(IoT)实现高精度定位和朝向检测。CLIP 框架可通过将已安装的监控摄像头与物联网集成,为室内人员提供精确的厘米级定位信息,从而实现智能空间中微定位感知物联网应用和服务的高效运行。CLIP 可用于准确确定个人的位置和朝向。根据我们对相关研究的回顾,CLIP 是首个室内定位框架,具有以下特点:1) 微定位感知物联网的厘米级定位精度,可检测个人的朝向;2) 利用现有监控摄像头,安装成本低;3) 微定位感知物联网应用的创新基础设施,可实现精确的厘米级个人路径规划、群体紧急疏散以及具有微定位感知功能的地理围栏。为了验证 CLIP 框架的可行性和有效性,我们实施了一个基于 Android 的系统,实验结果表明 CLIP 优于现有的室内定位方法,可以达到厘米级的精度,与 Sextant 相比提高了 94.6%。
{"title":"Centimeter-Level Indoor Positioning With Facing Direction Detection for Microlocation-Aware Services","authors":"Lien-Wu Chen;Chi-Ren Chen","doi":"10.1109/TSMC.2024.3432615","DOIUrl":"https://doi.org/10.1109/TSMC.2024.3432615","url":null,"abstract":"This study proposes a centimeter-level indoor positioning (CLIP) framework to achieve highly accurate localization with facing direction detection for the microlocation-aware Internet of Things (IoT). The CLIP framework can provide accurate centimeter-level positioning information to people indoors by integrating installed surveillance cameras with the IoT, where the efficient operation of microlocation-aware IoT applications and services can be enabled for smart spaces. CLIP can be used to accurately determine the position and facing direction of an individual. According to our review of relevant research, CLIP is the first indoor positioning framework that includes the following features: 1) centimeter-level positioning accuracy for the microlocation-aware IoT that can detect the facing direction of individuals; 2) employment of existing surveillance cameras with low-additional installation cost; and 3) innovative infrastructure for microlocation-aware IoT applications that can enable accurate centimeter-level path planning for individuals, emergency evacuation for groups of people, and geofencing with microlocation awareness. An Android-based system was implemented to verify the feasibility and effectiveness of the CLIP framework, and experimental results indicate that CLIP outperforms existing indoor positioning methods and can achieve centimeter-level accuracy with the improvement ratio of 94.6% over Sextant.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":null,"pages":null},"PeriodicalIF":8.6,"publicationDate":"2024-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142275005","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Predictive Path Coordination of Collaborative Transportation Multirobot System in a Smart Factory 智能工厂中协作运输多机器人系统的预测路径协调
IF 8.6 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-08-06 DOI: 10.1109/TSMC.2024.3431222
Zixiang Nie;Kwang-Cheng Chen
Smart factories employ intelligent transportaton systems such as autonomous mobile robots (AMRs) to support real-time adjusted production flows for agile and flexible production. While decentralized transportation task execution provides a scalable multirobot system (MRS) for a smart factory, new coordination challenges arise in implementing such a system. Transportation-MRS collaborates with production-MRS to accommodate just-in-time (JIT) production, leading to nonstationary transportation tasks that transportation-MRS must learn and adapt to. Also, decentralized operation on a shared shop floor means that one robot cannot factor in peer robots’ task execution planning, leading to competitive collisions. Meanwhile, predictive coordination with communication among multiple learning and adapting intelligent robots is still an open problem. On top of identifying the aforementioned challenges, this article first proposes a multifloor transportation graph model to discretize transportation task execution and allow real-time adjustment of transportation paths toward collision-free. We introduce a unique collaborative multi-intelligent robot system approach taking each robot as a cyber–physical agent with automated artificial intelligence (AI) workflow. First, it includes a novel multiagent reinforcement learning (MARL) algorithm, where each robot predictively plans collision-avoidant paths. Second, we introduce a token-passing mechanism to resolve inevitable competitive collisions due to nonstationary tasks. The proposed approach innovatively uses the multifloor model as a domain model for planning. By allowing competitive collision to occur and resolve, a robot only needs to learn and adapt to uncertain parts of the environment—nonstationary tasks and peer robots’ paths. Computational experiments show that our approach is both sample-efficient and computationally efficient. The transportation-MRS quickly reaches near-optimal performance levels, which are empirically shown to scale with the number of robots involved.
智能工厂采用自主移动机器人(AMR)等智能运输系统,支持实时调整生产流程,以实现灵活敏捷的生产。虽然分散式运输任务执行为智能工厂提供了一个可扩展的多机器人系统(MRS),但在实施这样一个系统的过程中也出现了新的协调挑战。运输-多机器人系统与生产-多机器人系统协作,以适应准时制(JIT)生产,从而导致运输-多机器人系统必须学习和适应非稳定的运输任务。此外,共享车间的分散操作意味着一个机器人无法考虑到同行机器人的任务执行规划,从而导致竞争性碰撞。同时,多个正在学习和适应的智能机器人之间的预测性协调与沟通仍是一个有待解决的问题。在确定上述挑战的基础上,本文首先提出了一个多楼层运输图模型,以离散化运输任务的执行,并允许实时调整运输路径以实现无碰撞。我们引入了一种独特的多智能机器人协作系统方法,将每个机器人视为具有自动人工智能(AI)工作流程的网络物理代理。首先,它包括一种新颖的多代理强化学习(MARL)算法,每个机器人都能预测性地规划避免碰撞的路径。其次,我们引入了令牌传递机制,以解决由于非稳态任务而不可避免的竞争性碰撞。所提出的方法创新性地使用多楼层模型作为规划的领域模型。通过允许竞争碰撞的发生和解决,机器人只需学习和适应环境中不确定的部分--非静态任务和同伴机器人的路径。计算实验表明,我们的方法既具有样本效率,又具有计算效率。运输-MRS很快就能达到接近最优的性能水平,而且经验表明,它还能随着参与机器人数量的增加而扩展。
{"title":"Predictive Path Coordination of Collaborative Transportation Multirobot System in a Smart Factory","authors":"Zixiang Nie;Kwang-Cheng Chen","doi":"10.1109/TSMC.2024.3431222","DOIUrl":"https://doi.org/10.1109/TSMC.2024.3431222","url":null,"abstract":"Smart factories employ intelligent transportaton systems such as autonomous mobile robots (AMRs) to support real-time adjusted production flows for agile and flexible production. While decentralized transportation task execution provides a scalable multirobot system (MRS) for a smart factory, new coordination challenges arise in implementing such a system. Transportation-MRS collaborates with production-MRS to accommodate just-in-time (JIT) production, leading to nonstationary transportation tasks that transportation-MRS must learn and adapt to. Also, decentralized operation on a shared shop floor means that one robot cannot factor in peer robots’ task execution planning, leading to competitive collisions. Meanwhile, predictive coordination with communication among multiple learning and adapting intelligent robots is still an open problem. On top of identifying the aforementioned challenges, this article first proposes a multifloor transportation graph model to discretize transportation task execution and allow real-time adjustment of transportation paths toward collision-free. We introduce a unique collaborative multi-intelligent robot system approach taking each robot as a cyber–physical agent with automated artificial intelligence (AI) workflow. First, it includes a novel multiagent reinforcement learning (MARL) algorithm, where each robot predictively plans collision-avoidant paths. Second, we introduce a token-passing mechanism to resolve inevitable competitive collisions due to nonstationary tasks. The proposed approach innovatively uses the multifloor model as a domain model for planning. By allowing competitive collision to occur and resolve, a robot only needs to learn and adapt to uncertain parts of the environment—nonstationary tasks and peer robots’ paths. Computational experiments show that our approach is both sample-efficient and computationally efficient. The transportation-MRS quickly reaches near-optimal performance levels, which are empirically shown to scale with the number of robots involved.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":null,"pages":null},"PeriodicalIF":8.6,"publicationDate":"2024-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142274908","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fault Detection for Switched Positive Systems With Application to Traffic Signal Systems 应用于交通信号系统的开关正向系统故障检测
IF 8.6 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-08-06 DOI: 10.1109/TSMC.2024.3430550
Shuo Li;Jinfeng Zhang;Yu-Ping Tian
This article is concerned with $L_{-}$ fault detection functional observer design for the switched positive systems (SPSs) with unknown external input under mode-dependent minimum dwell time (MDMDT) constraint. First, a necessary and sufficient condition is presented, such that the designed p-order functional observer provides a positive estimation with an $L_{-}$ fault sensitivity performance. Then, a new discretized linear co-positive Lyapunov function method is proposed to derive the asymptotic tracking condition of the $L_{-}$ fault detection functional observer under the MDMDT constraint. Further, the matrix decomposition technology is applied to solve the bilinear inequality problem in the observer gain matrix design, and an effective mode-dependent piece-wise design scheme of the $L_{-}$ fault detection functional observer in the linear programming form is proposed. The designed $L_{-}$ functional observer is not only sensitive to fault but also robust against any form of unknown input. The results are also extended to the cases of mode-dependent constant dwell time (MDCDT) constraint, mode-independent dwell time constraint, and non-SPSs. Finally, the proposed method is illustrated in detail by constructing a fault detection observer for a traffic signal system. Simulation results verify the effectiveness of the theoretical results.
本文关注的是在与模式相关的最短停留时间(MDMDT)约束下,为具有未知外部输入的开关正系统(SPSs)设计 L_{-}$ 故障检测功能观测器。首先,提出了一个必要条件和充分条件,即所设计的 p 阶功能观测器能提供具有 $L_{-}$ 故障灵敏度性能的正估计。然后,提出了一种新的离散化线性共正 Lyapunov 函数方法,以推导出 MDMDT 约束下 $L_{-}$ 故障检测功能观测器的渐近跟踪条件。此外,还应用矩阵分解技术解决了观测器增益矩阵设计中的双线性不等式问题,并提出了线性规划形式的 $L_{-}$ 故障检测功能观测器的有效模态分片设计方案。所设计的 $L_{-}$ 功能观测器不仅对故障敏感,而且对任何形式的未知输入都具有鲁棒性。研究结果还扩展到了与模式相关的恒定停留时间 (MDCDT) 约束、与模式无关的停留时间约束和非 SPS 的情况。最后,通过构建交通信号系统的故障检测观测器详细说明了所提出的方法。仿真结果验证了理论结果的有效性。
{"title":"Fault Detection for Switched Positive Systems With Application to Traffic Signal Systems","authors":"Shuo Li;Jinfeng Zhang;Yu-Ping Tian","doi":"10.1109/TSMC.2024.3430550","DOIUrl":"https://doi.org/10.1109/TSMC.2024.3430550","url":null,"abstract":"This article is concerned with \u0000<inline-formula> <tex-math>$L_{-}$ </tex-math></inline-formula>\u0000 fault detection functional observer design for the switched positive systems (SPSs) with unknown external input under mode-dependent minimum dwell time (MDMDT) constraint. First, a necessary and sufficient condition is presented, such that the designed p-order functional observer provides a positive estimation with an \u0000<inline-formula> <tex-math>$L_{-}$ </tex-math></inline-formula>\u0000 fault sensitivity performance. Then, a new discretized linear co-positive Lyapunov function method is proposed to derive the asymptotic tracking condition of the \u0000<inline-formula> <tex-math>$L_{-}$ </tex-math></inline-formula>\u0000 fault detection functional observer under the MDMDT constraint. Further, the matrix decomposition technology is applied to solve the bilinear inequality problem in the observer gain matrix design, and an effective mode-dependent piece-wise design scheme of the \u0000<inline-formula> <tex-math>$L_{-}$ </tex-math></inline-formula>\u0000 fault detection functional observer in the linear programming form is proposed. The designed \u0000<inline-formula> <tex-math>$L_{-}$ </tex-math></inline-formula>\u0000 functional observer is not only sensitive to fault but also robust against any form of unknown input. The results are also extended to the cases of mode-dependent constant dwell time (MDCDT) constraint, mode-independent dwell time constraint, and non-SPSs. Finally, the proposed method is illustrated in detail by constructing a fault detection observer for a traffic signal system. Simulation results verify the effectiveness of the theoretical results.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":null,"pages":null},"PeriodicalIF":8.6,"publicationDate":"2024-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142235878","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reinforcement Learning for Finite-Horizon H∞ Tracking Control of Unknown Discrete Linear Time-Varying System 未知离散线性时变系统有限边界 H∞ 跟踪控制的强化学习
IF 8.6 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-08-05 DOI: 10.1109/TSMC.2024.3431453
Linwei Ye;Zhonggai Zhao;Fei Liu
This article considers the finite-horizon H $_{infty }$ tracking problem for a class of discrete linear time-varying systems. Two reinforcement learning (RL) methods—policy iteration (PI) and Q-learning—are proposed to solve this problem. The latter can obtain the H $_{infty }$ controller without system dynamics. In the field of RL control, most studies focus on infinite-horizon control and time-invariant systems, and few studies have investigated finite-horizon control or time-varying systems. In contrast to infinite-horizon H $_{infty }$ tracking control, finite-horizon H $_{infty }$ tracking control involves a time-varying value function. While this introduces challenges, it empowers the algorithm to effectively handle time-varying problems. Within the finite-horizon framework, the value function is bounded, allowing the removal of the discount factor, thereby enhancing control performance. Additionally, there is no longer a need for an admissible control law for initialization, providing the proposed algorithms with the combined advantages of both PI and value iteration (VI). Two simulation examples are used to verify the effectiveness of the proposed algorithms.
本文研究了一类离散线性时变系统的有限视距 H $_{infty }$ 跟踪问题。提出了两种强化学习(RL)方法--策略迭代(PI)和 Q 学习--来解决这个问题。后者可以在不考虑系统动态的情况下获得 H $_{infty }$ 控制器。在 RL 控制领域,大多数研究集中于无限视距控制和时变系统,很少有研究涉及有限视距控制或时变系统。与无限视距 H $_{infty }$ 跟踪控制不同,有限视距 H $_{infty }$ 跟踪控制涉及时变值函数。虽然这带来了挑战,但却使算法能够有效处理时变问题。在有限视距框架内,价值函数是有界的,因此可以去除贴现因子,从而提高控制性能。此外,初始化时不再需要可接受的控制律,从而使所提出的算法兼具 PI 和值迭代 (VI) 的优点。两个仿真实例用于验证所提算法的有效性。
{"title":"Reinforcement Learning for Finite-Horizon H∞ Tracking Control of Unknown Discrete Linear Time-Varying System","authors":"Linwei Ye;Zhonggai Zhao;Fei Liu","doi":"10.1109/TSMC.2024.3431453","DOIUrl":"https://doi.org/10.1109/TSMC.2024.3431453","url":null,"abstract":"This article considers the finite-horizon H\u0000<inline-formula> <tex-math>$_{infty }$ </tex-math></inline-formula>\u0000 tracking problem for a class of discrete linear time-varying systems. Two reinforcement learning (RL) methods—policy iteration (PI) and Q-learning—are proposed to solve this problem. The latter can obtain the H\u0000<inline-formula> <tex-math>$_{infty }$ </tex-math></inline-formula>\u0000 controller without system dynamics. In the field of RL control, most studies focus on infinite-horizon control and time-invariant systems, and few studies have investigated finite-horizon control or time-varying systems. In contrast to infinite-horizon H\u0000<inline-formula> <tex-math>$_{infty }$ </tex-math></inline-formula>\u0000 tracking control, finite-horizon H\u0000<inline-formula> <tex-math>$_{infty }$ </tex-math></inline-formula>\u0000 tracking control involves a time-varying value function. While this introduces challenges, it empowers the algorithm to effectively handle time-varying problems. Within the finite-horizon framework, the value function is bounded, allowing the removal of the discount factor, thereby enhancing control performance. Additionally, there is no longer a need for an admissible control law for initialization, providing the proposed algorithms with the combined advantages of both PI and value iteration (VI). Two simulation examples are used to verify the effectiveness of the proposed algorithms.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":null,"pages":null},"PeriodicalIF":8.6,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142274917","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Beyond the Surface of Digital Contact Tracing: Delving into the Interconnected World of Technology, Individuals, and Society 超越数字联络追踪的表象:深入探索技术、个人和社会的互联世界
IF 8.6 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-08-05 DOI: 10.1109/TSMC.2024.3428393
Jiejun Hu-Bolz;Katayoun Farrahi;Manuel Cebrian
Information-enabled technology, such as digital contact tracing (DCT), is designed to assist decision-making. However, during the pandemic, DCT also contributes additional factors to consider for individuals and policymakers. In this article, we conceptualise the interactions between individuals and policymakers and propose a leader-followers mean-field Game model to analyse their decision-making processes by integrating key factors, such as individuals’ health state, self-control effort, privacy, and social activities, under the influence of interventions and subsidies from policymakers. The simulation demonstrates that moderate subsidies are sufficient to induce early control efforts and higher collective efforts even with privacy concerns and the negative influence of engaging activities; Equilibrium and system stability can be reached when healthy or the summation of healthy and mediocre healthy populations dominates; The model highlights the crucial elements for future data analysis and collection, such as DCT is vital to indicate population health states.
数字接触追踪技术(DCT)等信息化技术旨在协助决策。然而,在大流行期间,DCT 也为个人和决策者提供了额外的考虑因素。在本文中,我们将个人与政策制定者之间的互动概念化,并提出了一个领导者-追随者均值场博弈模型,在政策制定者的干预和补贴影响下,综合个人的健康状况、自我控制努力、隐私和社交活动等关键因素,分析他们的决策过程。模拟结果表明,即使存在隐私问题和参与活动的负面影响,适度的补贴也足以诱导早期的控制努力和更高的集体努力;当健康人群或健康人群与平庸健康人群的总和占主导地位时,就能达到平衡和系统稳定;该模型强调了未来数据分析和收集的关键要素,例如 DCT 对于显示人群健康状况至关重要。
{"title":"Beyond the Surface of Digital Contact Tracing: Delving into the Interconnected World of Technology, Individuals, and Society","authors":"Jiejun Hu-Bolz;Katayoun Farrahi;Manuel Cebrian","doi":"10.1109/TSMC.2024.3428393","DOIUrl":"https://doi.org/10.1109/TSMC.2024.3428393","url":null,"abstract":"Information-enabled technology, such as digital contact tracing (DCT), is designed to assist decision-making. However, during the pandemic, DCT also contributes additional factors to consider for individuals and policymakers. In this article, we conceptualise the interactions between individuals and policymakers and propose a leader-followers mean-field Game model to analyse their decision-making processes by integrating key factors, such as individuals’ health state, self-control effort, privacy, and social activities, under the influence of interventions and subsidies from policymakers. The simulation demonstrates that moderate subsidies are sufficient to induce early control efforts and higher collective efforts even with privacy concerns and the negative influence of engaging activities; Equilibrium and system stability can be reached when healthy or the summation of healthy and mediocre healthy populations dominates; The model highlights the crucial elements for future data analysis and collection, such as DCT is vital to indicate population health states.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":null,"pages":null},"PeriodicalIF":8.6,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142275011","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Heterogeneous Window Transformer for Image Denoising 用于图像去噪的异构窗变换器
IF 8.6 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-08-05 DOI: 10.1109/TSMC.2024.3429345
Chunwei Tian;Menghua Zheng;Chia-Wen Lin;Zhiwu Li;David Zhang
Deep networks can usually depend on extracting more structural information to improve denoising results. However, they may ignore correlation between pixels from an image to pursue better-denoising performance. Window Transformer can use long- and short-distance modeling to interact pixels to address mentioned problem. To make a tradeoff between distance modeling and denoising time, we propose a heterogeneous window Transformer (HWformer) for image denoising. HWformer first designs heterogeneous global windows to capture global context information for improving denoising effects. To build a bridge between long and short-distance modeling, global windows are horizontally and vertically shifted to facilitate diversified information without increasing denoising time. To prevent the information loss phenomenon of independent patches, sparse idea is guided a feed-forward network to extract local information of neighboring patches. The proposed HWformer only takes 30% of popular restoration Transformer in terms of denoising time. Its codes can be obtained at https://github.com/hellloxiaotian/HWformer.
深度网络通常可以依靠提取更多的结构信息来改善去噪效果。但是,为了追求更好的去噪性能,它们可能会忽略图像中像素之间的相关性。Window Transformer 可以使用长短距离建模来交互像素,从而解决上述问题。为了在距离建模和去噪时间之间做出权衡,我们提出了一种用于图像去噪的异构窗口变换器(HWformer)。HWformer 首先设计了异构全局窗口来捕捉全局上下文信息,从而提高去噪效果。为了在长距离建模和短距离建模之间架起一座桥梁,全局窗口被水平和垂直移动,以在不增加去噪时间的情况下促进信息多样化。为防止独立斑块的信息丢失现象,稀疏思想通过前馈网络来提取相邻斑块的局部信息。所提出的 HWformer 的去噪时间仅为流行的复原变换器的 30%。其代码可通过 https://github.com/hellloxiaotian/HWformer 获取。
{"title":"Heterogeneous Window Transformer for Image Denoising","authors":"Chunwei Tian;Menghua Zheng;Chia-Wen Lin;Zhiwu Li;David Zhang","doi":"10.1109/TSMC.2024.3429345","DOIUrl":"https://doi.org/10.1109/TSMC.2024.3429345","url":null,"abstract":"Deep networks can usually depend on extracting more structural information to improve denoising results. However, they may ignore correlation between pixels from an image to pursue better-denoising performance. Window Transformer can use long- and short-distance modeling to interact pixels to address mentioned problem. To make a tradeoff between distance modeling and denoising time, we propose a heterogeneous window Transformer (HWformer) for image denoising. HWformer first designs heterogeneous global windows to capture global context information for improving denoising effects. To build a bridge between long and short-distance modeling, global windows are horizontally and vertically shifted to facilitate diversified information without increasing denoising time. To prevent the information loss phenomenon of independent patches, sparse idea is guided a feed-forward network to extract local information of neighboring patches. The proposed HWformer only takes 30% of popular restoration Transformer in terms of denoising time. Its codes can be obtained at \u0000<uri>https://github.com/hellloxiaotian/HWformer</uri>\u0000.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":null,"pages":null},"PeriodicalIF":8.6,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142443058","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Stackelberg Game-Based Control Design for Fuzzy Underactuated Mechanical Systems With Inequality Constraints 基于堆栈博弈的不等式约束模糊欠动机械系统控制设计
IF 8.6 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-08-02 DOI: 10.1109/TSMC.2024.3428314
Zicheng Zhu;Han Zhao;Yuanjie Xian;Ye-Hwa Chen;Hao Sun;Jun Ma
A Stackelberg game-based design for an adaptive robust control for the fuzzy uncertain underactuated mechanical systems (UMSs) is proposed. The emphasis is on fuzzy-based uncertainty and inequality constraint. The uncertainty is time varying and bounded within a prescribed fuzzy set. For the inequality constraint, we creatively have it merge into constraint-following performance by a diffeomorphism technique. An adaptive robust control strategy is then proposed. Deterministic performance is guaranteed provided the control design parameters are within feasible regions. To further enhance the performance, we introduce a two-player Stackelberg game setting. The optimal choice of design parameters can be solved. The feasibility of this design is demonstrated on an autonomous wheeled mobile robot (AWMR), which is confined in a bounded space.
本文提出了一种基于 Stackelberg 博弈的模糊不确定欠动机械系统(UMS)自适应鲁棒控制设计。重点在于基于模糊的不确定性和不平等约束。不确定性是随时间变化的,并且在规定的模糊集合内有边界。对于不平等约束,我们创造性地通过差分同构技术将其合并为约束跟随性能。然后提出了一种自适应鲁棒控制策略。只要控制设计参数在可行区域内,就能保证确定性能。为了进一步提高性能,我们引入了双人斯塔克尔伯格博弈设置。可以求解设计参数的最优选择。这种设计的可行性在一个自主轮式移动机器人(AWMR)上得到了验证,该机器人被限制在一个有界的空间内。
{"title":"Stackelberg Game-Based Control Design for Fuzzy Underactuated Mechanical Systems With Inequality Constraints","authors":"Zicheng Zhu;Han Zhao;Yuanjie Xian;Ye-Hwa Chen;Hao Sun;Jun Ma","doi":"10.1109/TSMC.2024.3428314","DOIUrl":"https://doi.org/10.1109/TSMC.2024.3428314","url":null,"abstract":"A Stackelberg game-based design for an adaptive robust control for the fuzzy uncertain underactuated mechanical systems (UMSs) is proposed. The emphasis is on fuzzy-based uncertainty and inequality constraint. The uncertainty is time varying and bounded within a prescribed fuzzy set. For the inequality constraint, we creatively have it merge into constraint-following performance by a diffeomorphism technique. An adaptive robust control strategy is then proposed. Deterministic performance is guaranteed provided the control design parameters are within feasible regions. To further enhance the performance, we introduce a two-player Stackelberg game setting. The optimal choice of design parameters can be solved. The feasibility of this design is demonstrated on an autonomous wheeled mobile robot (AWMR), which is confined in a bounded space.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":null,"pages":null},"PeriodicalIF":8.6,"publicationDate":"2024-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142274971","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Transactions on Systems Man Cybernetics-Systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1