首页 > 最新文献

Ieee-Caa Journal of Automatica Sinica最新文献

英文 中文
A Double Sensitive Fault Detection Filter for Positive Markovian Jump Systems with A Hybrid Event-Triggered Mechanism 采用混合事件触发机制的正马尔可夫跃迁系统双敏故障检测滤波器
IF 15.3 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-10-08 DOI: 10.1109/JAS.2024.124677
Junfeng Zhang;Baozhu Du;Suhuan Zhang;Shihong Ding
This paper is concerned with the double sensitive fault detection filter for positive Markovian jump systems. A new hybrid adaptive event-triggered mechanism is proposed by introducing a non-monotonic adaptive law. A linear adaptive event-triggered threshold is established by virtue of 1-norm inequality. Under such a triggering strategy, the original system can be transformed into an interval uncertain system. By using a stochastic copositive Lyapunov function, an asynchronous fault detection filter is designed for positive Markovian jump systems (PMJSs) in terms of linear programming. The presented filter satisfies both $L_{-}$-gain ($ell_{-}$-gain) fault sensitivity and $L_{1} (ell_{1})$ internal differential privacy sensitivity. The proposed approach is also extended to the discrete-time case. Finally, two examples are provided to illustrate the effectiveness of the proposed design.
本文关注正马尔可夫跃迁系统的双敏故障检测滤波器。通过引入非单调自适应定律,提出了一种新的混合自适应事件触发机制。通过 1-norm 不等式建立了线性自适应事件触发阈值。在这种触发策略下,原始系统可以转化为区间不确定系统。利用随机共正 Lyapunov 函数,从线性规划的角度为正马尔可夫跃迁系统(PMJS)设计了一种异步故障检测滤波器。所提出的滤波器同时满足 $L_{-}$-gain (ell_{-}$-gain) 故障灵敏度和 $L_{1} (ell_{1})$ 内部差分隐私灵敏度。所提出的方法还扩展到了离散时间情况。最后,提供了两个示例来说明所提设计的有效性。
{"title":"A Double Sensitive Fault Detection Filter for Positive Markovian Jump Systems with A Hybrid Event-Triggered Mechanism","authors":"Junfeng Zhang;Baozhu Du;Suhuan Zhang;Shihong Ding","doi":"10.1109/JAS.2024.124677","DOIUrl":"https://doi.org/10.1109/JAS.2024.124677","url":null,"abstract":"This paper is concerned with the double sensitive fault detection filter for positive Markovian jump systems. A new hybrid adaptive event-triggered mechanism is proposed by introducing a non-monotonic adaptive law. A linear adaptive event-triggered threshold is established by virtue of 1-norm inequality. Under such a triggering strategy, the original system can be transformed into an interval uncertain system. By using a stochastic copositive Lyapunov function, an asynchronous fault detection filter is designed for positive Markovian jump systems (PMJSs) in terms of linear programming. The presented filter satisfies both \u0000<tex>$L_{-}$</tex>\u0000-gain (\u0000<tex>$ell_{-}$</tex>\u0000-gain) fault sensitivity and \u0000<tex>$L_{1} (ell_{1})$</tex>\u0000 internal differential privacy sensitivity. The proposed approach is also extended to the discrete-time case. Finally, two examples are provided to illustrate the effectiveness of the proposed design.","PeriodicalId":54230,"journal":{"name":"Ieee-Caa Journal of Automatica Sinica","volume":null,"pages":null},"PeriodicalIF":15.3,"publicationDate":"2024-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142397485","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
General Lyapunov Stability and its Application to Time-Varying Convex Optimization 一般李雅普诺夫稳定性及其在时变凸优化中的应用
IF 15.3 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-10-08 DOI: 10.1109/JAS.2024.124374
Zhibao Song;Ping Li
In this article, a general Lyapunov stability theory of nonlinear systems is put forward and it contains asymptotic/finite-time/fast finite-time/fixed-time stability. Especially, a more accurate estimate of the settling-time function is exhibited for fixed-time stability, and it is still extraneous to the initial conditions. This can be applied to obtain less conservative convergence time of the practical systems without the information of the initial conditions. As an application, the given fixed-time stability theorem is used to resolve time-varying (TV) convex optimization problem. By the Newton's method, two classes of new dynamical systems are constructed to guarantee that the solution of the dynamic system can track to the optimal trajectory of the unconstrained and equality constrained TV convex optimization problems in fixed time, respectively. Without the exact knowledge of the time derivative of the cost function gradient, a fixed-time dynamical non-smooth system is established to overcome the issue of robust TV convex optimization. Two examples are provided to illustrate the effectiveness of the proposed TV convex optimization algorithms. Subsequently, the fixed-time stability theory is extended to the theories of predefined-time/practical predefined-time stability whose bound of convergence time can be arbitrarily given in advance, without tuning the system parameters. Under which, TV convex optimization problem is solved. The previous two examples are used to demonstrate the validity of the predefined-time TV convex optimization algorithms.
本文提出了非线性系统的一般李雅普诺夫稳定性理论,它包含渐近/有限时间/快速有限时间/固定时间稳定性。特别是在定时稳定性方面,展示了对沉降时间函数更精确的估计,而且它仍然与初始条件无关。这可以用于在不考虑初始条件信息的情况下,获得实际系统的较少保守收敛时间。在应用中,给出的定时稳定性定理被用于解决时变(TV)凸优化问题。通过牛顿法,构建了两类新的动力系统,分别保证动力系统的解能在固定时间内跟踪到无约束和等约束 TV 凸优化问题的最优轨迹。在不确切知道代价函数梯度时间导数的情况下,建立了固定时间动态非光滑系统,从而克服了鲁棒 TV 凸优化问题。本文举了两个例子来说明所提出的 TV 凸优化算法的有效性。随后,固定时间稳定性理论被扩展到预定时间/实用预定时间稳定性理论,其收敛时间的边界可以事先任意给出,无需调整系统参数。在此基础上,TV 凸优化问题得以求解。前面两个例子用来证明预定义时间 TV 凸优化算法的有效性。
{"title":"General Lyapunov Stability and its Application to Time-Varying Convex Optimization","authors":"Zhibao Song;Ping Li","doi":"10.1109/JAS.2024.124374","DOIUrl":"https://doi.org/10.1109/JAS.2024.124374","url":null,"abstract":"In this article, a general Lyapunov stability theory of nonlinear systems is put forward and it contains asymptotic/finite-time/fast finite-time/fixed-time stability. Especially, a more accurate estimate of the settling-time function is exhibited for fixed-time stability, and it is still extraneous to the initial conditions. This can be applied to obtain less conservative convergence time of the practical systems without the information of the initial conditions. As an application, the given fixed-time stability theorem is used to resolve time-varying (TV) convex optimization problem. By the Newton's method, two classes of new dynamical systems are constructed to guarantee that the solution of the dynamic system can track to the optimal trajectory of the unconstrained and equality constrained TV convex optimization problems in fixed time, respectively. Without the exact knowledge of the time derivative of the cost function gradient, a fixed-time dynamical non-smooth system is established to overcome the issue of robust TV convex optimization. Two examples are provided to illustrate the effectiveness of the proposed TV convex optimization algorithms. Subsequently, the fixed-time stability theory is extended to the theories of predefined-time/practical predefined-time stability whose bound of convergence time can be arbitrarily given in advance, without tuning the system parameters. Under which, TV convex optimization problem is solved. The previous two examples are used to demonstrate the validity of the predefined-time TV convex optimization algorithms.","PeriodicalId":54230,"journal":{"name":"Ieee-Caa Journal of Automatica Sinica","volume":null,"pages":null},"PeriodicalIF":15.3,"publicationDate":"2024-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142397486","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Transfer Learning Framework for Deep Multi-Agent Reinforcement Learning 深度多代理强化学习的迁移学习框架
IF 15.3 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-10-08 DOI: 10.1109/JAS.2023.124173
Yi Liu;Xiang Wu;Yuming Bo;Jiacun Wang;Lifeng Ma
Dear Editor, This letter presents a new transfer learning framework for the deep multi-agent reinforcement learning (DMARL) to reduce the convergence difficulty and training time when applying DMARL to a new scenario [1], [2]. The proposed transfer learning framework includes the design of neural network architecture, curriculum transfer learning (CTL) and strategy distillation. Experimental results demonstrate that our framework enables DMARL models to converge faster while improving the final performance.
亲爱的编辑,这封信为深度多代理强化学习(DMARL)提出了一个新的迁移学习框架,以减少将DMARL应用于新场景时的收敛难度和训练时间[1], [2]。所提出的迁移学习框架包括神经网络架构设计、课程迁移学习(CTL)和策略提炼。实验结果表明,我们的框架能使 DMARL 模型更快地收敛,同时提高最终性能。
{"title":"A Transfer Learning Framework for Deep Multi-Agent Reinforcement Learning","authors":"Yi Liu;Xiang Wu;Yuming Bo;Jiacun Wang;Lifeng Ma","doi":"10.1109/JAS.2023.124173","DOIUrl":"https://doi.org/10.1109/JAS.2023.124173","url":null,"abstract":"Dear Editor, This letter presents a new transfer learning framework for the deep multi-agent reinforcement learning (DMARL) to reduce the convergence difficulty and training time when applying DMARL to a new scenario [1], [2]. The proposed transfer learning framework includes the design of neural network architecture, curriculum transfer learning (CTL) and strategy distillation. Experimental results demonstrate that our framework enables DMARL models to converge faster while improving the final performance.","PeriodicalId":54230,"journal":{"name":"Ieee-Caa Journal of Automatica Sinica","volume":null,"pages":null},"PeriodicalIF":15.3,"publicationDate":"2024-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10707687","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142397461","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prediction-Based State Estimation and Compensation Control for Networked Systems with Communication Constraints and DoS Attacks 具有通信限制和 DoS 攻击的网络系统的基于预测的状态估计和补偿控制
IF 15.3 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-10-08 DOI: 10.1109/JAS.2024.124605
Zhong-Hua Pang;Qian Cao;Haibin Guo;Zhe Dong
Dear Editor, This letter investigates the output tracking control issue of networked control systems (NCSs) with communication constraints and denial-of-service (DoS) attacks in the sensor-to-controller channel, both of which would induce random network delays. A dual-prediction-based compensation control (DPCC) scheme, consisting of a predictive observer and a predictive controller, is proposed to actively compensate for the adverse effect of network delays on NCSs. Compared with existing networked predictive control (NPC) methods, the DPCC scheme only requires the sensor to send a single measurement output to the controller at each sampling instant, and also does not need to know the upper bound of random network delays in advance. The stability condition of the closed-loop system is derived. Finally, numerical simulations are carried out to validate the effectiveness of the proposed scheme.
亲爱的编辑,这封信研究了网络控制系统(NCS)的输出跟踪控制问题,该系统具有通信限制和传感器到控制器信道中的拒绝服务(DoS)攻击,这两种攻击都会引起随机网络延迟。本文提出了一种由预测观察器和预测控制器组成的基于双预测的补偿控制(DPCC)方案,以积极补偿网络延迟对 NCS 的不利影响。与现有的网络预测控制(NPC)方法相比,DPCC 方案只需要传感器在每个采样瞬间向控制器发送一个测量输出,而且不需要提前知道随机网络延迟的上限。本文推导了闭环系统的稳定性条件。最后,通过数值模拟验证了所提方案的有效性。
{"title":"Prediction-Based State Estimation and Compensation Control for Networked Systems with Communication Constraints and DoS Attacks","authors":"Zhong-Hua Pang;Qian Cao;Haibin Guo;Zhe Dong","doi":"10.1109/JAS.2024.124605","DOIUrl":"https://doi.org/10.1109/JAS.2024.124605","url":null,"abstract":"Dear Editor, This letter investigates the output tracking control issue of networked control systems (NCSs) with communication constraints and denial-of-service (DoS) attacks in the sensor-to-controller channel, both of which would induce random network delays. A dual-prediction-based compensation control (DPCC) scheme, consisting of a predictive observer and a predictive controller, is proposed to actively compensate for the adverse effect of network delays on NCSs. Compared with existing networked predictive control (NPC) methods, the DPCC scheme only requires the sensor to send a single measurement output to the controller at each sampling instant, and also does not need to know the upper bound of random network delays in advance. The stability condition of the closed-loop system is derived. Finally, numerical simulations are carried out to validate the effectiveness of the proposed scheme.","PeriodicalId":54230,"journal":{"name":"Ieee-Caa Journal of Automatica Sinica","volume":null,"pages":null},"PeriodicalIF":15.3,"publicationDate":"2024-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10707690","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142397480","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Image Enhancement via Associated Perturbation Removal and Texture Reconstruction Learning 通过相关扰动消除和纹理重构学习增强图像效果
IF 15.3 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-10-08 DOI: 10.1109/JAS.2024.124521
Kui Jiang;Ruoxi Wang;Yi Xiao;Junjun Jiang;Xin Xu;Tao Lu
Degradation under challenging conditions such as rain, haze, and low light not only diminishes content visibility, but also results in additional degradation side effects, including detail occlusion and color distortion. However, current technologies have barely explored the correlation between perturbation removal and background restoration, consequently struggling to generate high-naturalness content in challenging scenarios. In this paper, we rethink the image enhancement task from the perspective of joint optimization: Perturbation removal and texture reconstruction. To this end, we advise an efficient yet effective image enhancement model, termed the perturbation-guided texture reconstruction network (PerTeRNet). It contains two sub-networks designed for the perturbation elimination and texture reconstruction tasks, respectively. To facilitate texture recovery, we develop a novel perturbation-guided texture enhancement module (PerTEM) to connect these two tasks, where informative background features are extracted from the input with the guidance of predicted perturbation priors. To alleviate the learning burden and computational cost, we suggest performing perturbation removal in a sub-space and exploiting super-resolution to infer high-frequency background details. Our PerTeRNet has demonstrated significant superiority over typical methods in both quantitative and qualitative measures, as evidenced by extensive experimental results on popular image enhancement and joint detection tasks. The source code is available at https://github.com/kuijiang94/PerTeRNet.
在雨天、雾霾和弱光等恶劣条件下的降级不仅会降低内容的可视性,还会导致额外的降级副作用,包括细节遮挡和色彩失真。然而,目前的技术几乎没有探索扰动消除与背景还原之间的关联,因此很难在具有挑战性的场景中生成高自然度的内容。在本文中,我们从联合优化的角度重新思考图像增强任务:去除扰动和纹理重建。为此,我们提出了一种高效且有效的图像增强模型,即扰动引导纹理重建网络(PerTeRNet)。它包含两个子网络,分别用于消除扰动和纹理重建任务。为了促进纹理恢复,我们开发了一个新颖的扰动引导纹理增强模块(PerTEM)来连接这两个任务,在扰动预测前验的引导下,从输入中提取信息背景特征。为了减轻学习负担和计算成本,我们建议在子空间中执行扰动去除,并利用超分辨率来推断高频背景细节。我们的 PerTeRNet 在定量和定性指标上都明显优于典型方法,在流行的图像增强和联合检测任务上的大量实验结果就是证明。源代码见 https://github.com/kuijiang94/PerTeRNet。
{"title":"Image Enhancement via Associated Perturbation Removal and Texture Reconstruction Learning","authors":"Kui Jiang;Ruoxi Wang;Yi Xiao;Junjun Jiang;Xin Xu;Tao Lu","doi":"10.1109/JAS.2024.124521","DOIUrl":"https://doi.org/10.1109/JAS.2024.124521","url":null,"abstract":"Degradation under challenging conditions such as rain, haze, and low light not only diminishes content visibility, but also results in additional degradation side effects, including detail occlusion and color distortion. However, current technologies have barely explored the correlation between perturbation removal and background restoration, consequently struggling to generate high-naturalness content in challenging scenarios. In this paper, we rethink the image enhancement task from the perspective of joint optimization: Perturbation removal and texture reconstruction. To this end, we advise an efficient yet effective image enhancement model, termed the perturbation-guided texture reconstruction network (PerTeRNet). It contains two sub-networks designed for the perturbation elimination and texture reconstruction tasks, respectively. To facilitate texture recovery, we develop a novel perturbation-guided texture enhancement module (PerTEM) to connect these two tasks, where informative background features are extracted from the input with the guidance of predicted perturbation priors. To alleviate the learning burden and computational cost, we suggest performing perturbation removal in a sub-space and exploiting super-resolution to infer high-frequency background details. Our PerTeRNet has demonstrated significant superiority over typical methods in both quantitative and qualitative measures, as evidenced by extensive experimental results on popular image enhancement and joint detection tasks. The source code is available at https://github.com/kuijiang94/PerTeRNet.","PeriodicalId":54230,"journal":{"name":"Ieee-Caa Journal of Automatica Sinica","volume":null,"pages":null},"PeriodicalIF":15.3,"publicationDate":"2024-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142397482","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Revisiting the LQR Problem of Singular Systems 重新审视奇异系统的 LQR 问题
IF 15.3 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-10-08 DOI: 10.1109/JAS.2024.124665
Komeil Nosrati;Juri Belikov;Aleksei Tepljakov;Eduard Petlenkov
In the development of linear quadratic regulator (LQR) algorithms, the Riccati equation approach offers two important characteristics —it is recursive and readily meets the existence condition. However, these attributes are applicable only to transformed singular systems, and the efficiency of the regulator may be undermined if constraints are violated in nonsingular versions. To address this gap, we introduce a direct approach to the LQR problem for linear singular systems, avoiding the need for any transformations and eliminating the need for regularity assumptions. To achieve this goal, we begin by formulating a quadratic cost function to derive the LQR algorithm through a penalized and weighted regression framework and then connect it to a constrained minimization problem using the Bellman's criterion. Then, we employ a dynamic programming strategy in a backward approach within a finite horizon to develop an LQR algorithm for the original system. To accomplish this, we address the stability and convergence analysis under the reachability and observability assumptions of a hypothetical system constructed by the pencil of augmented matrices and connected using the Hamiltonian diagonalization technique.
在开发线性二次调节器(LQR)算法的过程中,里卡提方程方法具有两个重要特点--递归性和易于满足存在条件。然而,这些特性只适用于变换后的奇异系统,如果在非奇异系统中违反了约束条件,调节器的效率可能会受到影响。为了弥补这一缺陷,我们引入了一种直接解决线性奇异系统 LQR 问题的方法,无需任何变换,也无需正则性假设。为了实现这一目标,我们首先提出了一个二次成本函数,通过惩罚和加权回归框架推导出 LQR 算法,然后利用贝尔曼准则将其与受约束最小化问题联系起来。然后,我们采用动态编程策略,在有限视界内以逆向方式为原始系统开发 LQR 算法。为了实现这一目标,我们在可达到性和可观测性假设条件下,对由增强矩阵的铅笔构建的假定系统进行稳定性和收敛性分析,并利用哈密尔顿对角化技术将其连接起来。
{"title":"Revisiting the LQR Problem of Singular Systems","authors":"Komeil Nosrati;Juri Belikov;Aleksei Tepljakov;Eduard Petlenkov","doi":"10.1109/JAS.2024.124665","DOIUrl":"https://doi.org/10.1109/JAS.2024.124665","url":null,"abstract":"In the development of linear quadratic regulator (LQR) algorithms, the Riccati equation approach offers two important characteristics —it is recursive and readily meets the existence condition. However, these attributes are applicable only to transformed singular systems, and the efficiency of the regulator may be undermined if constraints are violated in nonsingular versions. To address this gap, we introduce a direct approach to the LQR problem for linear singular systems, avoiding the need for any transformations and eliminating the need for regularity assumptions. To achieve this goal, we begin by formulating a quadratic cost function to derive the LQR algorithm through a penalized and weighted regression framework and then connect it to a constrained minimization problem using the Bellman's criterion. Then, we employ a dynamic programming strategy in a backward approach within a finite horizon to develop an LQR algorithm for the original system. To accomplish this, we address the stability and convergence analysis under the reachability and observability assumptions of a hypothetical system constructed by the pencil of augmented matrices and connected using the Hamiltonian diagonalization technique.","PeriodicalId":54230,"journal":{"name":"Ieee-Caa Journal of Automatica Sinica","volume":null,"pages":null},"PeriodicalIF":15.3,"publicationDate":"2024-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142397481","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Probabilistic Automata-Based Method for Enhancing Performance of Deep Reinforcement Learning Systems 基于概率自动机的深度强化学习系统性能提升方法
IF 15.3 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-10-08 DOI: 10.1109/JAS.2024.124818
Min Yang;Guanjun Liu;Ziyuan Zhou;Jiacun Wang
Deep reinforcement learning (DRL) has demonstrated significant potential in industrial manufacturing domains such as workshop scheduling and energy system management. However, due to the model's inherent uncertainty, rigorous validation is requisite for its application in real-world tasks. Specific tests may reveal inadequacies in the performance of pre-trained DRL models, while the “black-box” nature of DRL poses a challenge for testing model behavior. We propose a novel performance improvement framework based on probabilistic automata, which aims to proactively identify and correct critical vulnerabilities of DRL systems, so that the performance of DRL models in real tasks can be improved with minimal model modifications. First, a probabilistic automaton is constructed from the historical trajectory of the DRL system by abstracting the state to generate probabilistic decision-making units (PDMUs), and a reverse breadth-first search (BFS) method is used to identify the key PDMU-action pairs that have the greatest impact on adverse outcomes. This process relies only on the state-action sequence and final result of each trajectory. Then, under the key PDMU, we search for the new action that has the greatest impact on favorable results. Finally, the key PDMU, undesirable action and new action are encapsulated as monitors to guide the DRL system to obtain more favorable results through real-time monitoring and correction mechanisms. Evaluations in two standard reinforcement learning environments and three actual job scheduling scenarios confirmed the effectiveness of the method, providing certain guarantees for the deployment of DRL models in real-world applications.
深度强化学习(DRL)已在车间调度和能源系统管理等工业制造领域展现出巨大潜力。然而,由于模型本身的不确定性,要将其应用于实际任务,就必须进行严格的验证。特定的测试可能会暴露出预训练 DRL 模型性能的不足,而 DRL 的 "黑箱 "性质又给模型行为测试带来了挑战。我们提出了一种基于概率自动机的新型性能改进框架,旨在主动识别和纠正 DRL 系统的关键漏洞,从而以最小的模型修改提高 DRL 模型在实际任务中的性能。首先,通过抽象状态生成概率决策单元(PDMU),根据 DRL 系统的历史轨迹构建概率自动机,然后使用反向广度优先搜索(BFS)方法识别对不利结果影响最大的关键 PDMU-行动对。这一过程只依赖于每个轨迹的状态-行动序列和最终结果。然后,在关键 PDMU 下,我们搜索对有利结果影响最大的新行动。最后,将关键 PDMU、不良行动和新行动封装为监控器,通过实时监控和修正机制,引导 DRL 系统获得更有利的结果。在两个标准强化学习环境和三个实际工作调度场景中进行的评估证实了该方法的有效性,为 DRL 模型在实际应用中的部署提供了一定的保障。
{"title":"Probabilistic Automata-Based Method for Enhancing Performance of Deep Reinforcement Learning Systems","authors":"Min Yang;Guanjun Liu;Ziyuan Zhou;Jiacun Wang","doi":"10.1109/JAS.2024.124818","DOIUrl":"https://doi.org/10.1109/JAS.2024.124818","url":null,"abstract":"Deep reinforcement learning (DRL) has demonstrated significant potential in industrial manufacturing domains such as workshop scheduling and energy system management. However, due to the model's inherent uncertainty, rigorous validation is requisite for its application in real-world tasks. Specific tests may reveal inadequacies in the performance of pre-trained DRL models, while the “black-box” nature of DRL poses a challenge for testing model behavior. We propose a novel performance improvement framework based on probabilistic automata, which aims to proactively identify and correct critical vulnerabilities of DRL systems, so that the performance of DRL models in real tasks can be improved with minimal model modifications. First, a probabilistic automaton is constructed from the historical trajectory of the DRL system by abstracting the state to generate probabilistic decision-making units (PDMUs), and a reverse breadth-first search (BFS) method is used to identify the key PDMU-action pairs that have the greatest impact on adverse outcomes. This process relies only on the state-action sequence and final result of each trajectory. Then, under the key PDMU, we search for the new action that has the greatest impact on favorable results. Finally, the key PDMU, undesirable action and new action are encapsulated as monitors to guide the DRL system to obtain more favorable results through real-time monitoring and correction mechanisms. Evaluations in two standard reinforcement learning environments and three actual job scheduling scenarios confirmed the effectiveness of the method, providing certain guarantees for the deployment of DRL models in real-world applications.","PeriodicalId":54230,"journal":{"name":"Ieee-Caa Journal of Automatica Sinica","volume":null,"pages":null},"PeriodicalIF":15.3,"publicationDate":"2024-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142397369","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Boosting Adaptive Weighted Broad Learning System for Multi-Label Learning 用于多标签学习的提升式自适应加权广义学习系统
IF 15.3 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-10-08 DOI: 10.1109/JAS.2024.124557
Yuanxin Lin;Zhiwen Yu;Kaixiang Yang;Ziwei Fan;C. L. Philip Chen
Multi-label classification is a challenging problem that has attracted significant attention from researchers, particularly in the domain of image and text attribute annotation. However, multi-label datasets are prone to serious intra-class and inter-class imbalance problems, which can significantly degrade the classification performance. To address the above issues, we propose the multi-label weighted broad learning system (MLW-BLS) from the perspective of label imbalance weighting and label correlation mining. Further, we propose the multi-label adaptive weighted broad learning system (MLAW-BLS) to adaptively adjust the specific weights and values of labels of MLW-BLS and construct an efficient imbalanced classifier set. Extensive experiments are conducted on various datasets to evaluate the effectiveness of the proposed model, and the results demonstrate its superiority over other advanced approaches.
多标签分类是一个极具挑战性的问题,尤其是在图像和文本属性标注领域,已经引起了研究人员的极大关注。然而,多标签数据集容易出现严重的类内和类间不平衡问题,这会大大降低分类性能。针对上述问题,我们从标签不平衡加权和标签相关性挖掘的角度出发,提出了多标签加权广泛学习系统(MLW-BLS)。此外,我们还提出了多标签自适应加权广义学习系统(MLAW-BLS),以自适应地调整 MLW-BLS 的具体权重和标签值,构建高效的不平衡分类器集。我们在各种数据集上进行了广泛的实验,以评估所提出模型的有效性,结果表明它优于其他先进方法。
{"title":"Boosting Adaptive Weighted Broad Learning System for Multi-Label Learning","authors":"Yuanxin Lin;Zhiwen Yu;Kaixiang Yang;Ziwei Fan;C. L. Philip Chen","doi":"10.1109/JAS.2024.124557","DOIUrl":"https://doi.org/10.1109/JAS.2024.124557","url":null,"abstract":"Multi-label classification is a challenging problem that has attracted significant attention from researchers, particularly in the domain of image and text attribute annotation. However, multi-label datasets are prone to serious intra-class and inter-class imbalance problems, which can significantly degrade the classification performance. To address the above issues, we propose the multi-label weighted broad learning system (MLW-BLS) from the perspective of label imbalance weighting and label correlation mining. Further, we propose the multi-label adaptive weighted broad learning system (MLAW-BLS) to adaptively adjust the specific weights and values of labels of MLW-BLS and construct an efficient imbalanced classifier set. Extensive experiments are conducted on various datasets to evaluate the effectiveness of the proposed model, and the results demonstrate its superiority over other advanced approaches.","PeriodicalId":54230,"journal":{"name":"Ieee-Caa Journal of Automatica Sinica","volume":null,"pages":null},"PeriodicalIF":15.3,"publicationDate":"2024-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142397484","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A State-Migration Particle Swarm Optimizer for Adaptive Latent Factor Analysis of High-Dimensional and Incomplete Data 用于高维不完整数据自适应潜因分析的状态迁移粒子群优化器
IF 15.3 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-10-08 DOI: 10.1109/JAS.2024.124575
Jiufang Chen;Kechen Liu;Xin Luo;Ye Yuan;Khaled Sedraoui;Yusuf Al-Turki;MengChu Zhou
High-dimensional and incomplete (HDI) matrices are primarily generated in all kinds of big-data-related practical applications. A latent factor analysis (LFA) model is capable of conducting efficient representation learning to an HDI matrix, whose hyper-parameter adaptation can be implemented through a particle swarm optimizer (PSO) to meet scalable requirements. However, conventional PSO is limited by its premature issues, which leads to the accuracy loss of a resultant LFA model. To address this thorny issue, this study merges the information of each particle's state migration into its evolution process following the principle of a generalized momentum method for improving its search ability, thereby building a state-migration particle swarm optimizer (SPSO), whose theoretical convergence is rigorously proved in this study. It is then incorporated into an LFA model for implementing efficient hyper-parameter adaptation without accuracy loss. Experiments on six HDI matrices indicate that an SPSO-incorporated LFA model outperforms state-of-the-art LFA models in terms of prediction accuracy for missing data of an HDI matrix with competitive computational efficiency. Hence, SPSO's use ensures efficient and reliable hyper-parameter adaptation in an LFA model, thus ensuring practicality and accurate representation learning for HDI matrices.
高维不完整(HDI)矩阵主要产生于各种与大数据相关的实际应用中。潜在因素分析(LFA)模型能够对 HDI 矩阵进行有效的表征学习,其超参数适应可以通过粒子群优化器(PSO)来实现,以满足可扩展的要求。然而,传统的 PSO 受限于其不成熟的问题,导致生成的 LFA 模型精度下降。为了解决这个棘手的问题,本研究根据广义动量法的原理,将每个粒子的状态迁移信息融合到粒子群的进化过程中,以提高粒子群的搜索能力,从而建立了状态迁移粒子群优化器(SPSO)。然后将其纳入 LFA 模型,在不损失精度的情况下实现高效的超参数适应。对六个人类发展指数矩阵的实验表明,就人类发展指数矩阵缺失数据的预测准确性而言,SPSO-incorporated LFA 模型优于最先进的 LFA 模型,且计算效率极具竞争力。因此,SPSO 的使用确保了 LFA 模型中超参数适应的高效性和可靠性,从而保证了 HDI 矩阵的实用性和准确表示学习。
{"title":"A State-Migration Particle Swarm Optimizer for Adaptive Latent Factor Analysis of High-Dimensional and Incomplete Data","authors":"Jiufang Chen;Kechen Liu;Xin Luo;Ye Yuan;Khaled Sedraoui;Yusuf Al-Turki;MengChu Zhou","doi":"10.1109/JAS.2024.124575","DOIUrl":"https://doi.org/10.1109/JAS.2024.124575","url":null,"abstract":"High-dimensional and incomplete (HDI) matrices are primarily generated in all kinds of big-data-related practical applications. A latent factor analysis (LFA) model is capable of conducting efficient representation learning to an HDI matrix, whose hyper-parameter adaptation can be implemented through a particle swarm optimizer (PSO) to meet scalable requirements. However, conventional PSO is limited by its premature issues, which leads to the accuracy loss of a resultant LFA model. To address this thorny issue, this study merges the information of each particle's state migration into its evolution process following the principle of a generalized momentum method for improving its search ability, thereby building a state-migration particle swarm optimizer (SPSO), whose theoretical convergence is rigorously proved in this study. It is then incorporated into an LFA model for implementing efficient hyper-parameter adaptation without accuracy loss. Experiments on six HDI matrices indicate that an SPSO-incorporated LFA model outperforms state-of-the-art LFA models in terms of prediction accuracy for missing data of an HDI matrix with competitive computational efficiency. Hence, SPSO's use ensures efficient and reliable hyper-parameter adaptation in an LFA model, thus ensuring practicality and accurate representation learning for HDI matrices.","PeriodicalId":54230,"journal":{"name":"Ieee-Caa Journal of Automatica Sinica","volume":null,"pages":null},"PeriodicalIF":15.3,"publicationDate":"2024-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142397487","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Two-Stage Approach for Targeted Knowledge Transfer in Self-Knowledge Distillation 自我知识提炼中的两阶段定向知识转移法
IF 15.3 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Pub Date : 2024-10-08 DOI: 10.1109/JAS.2024.124629
Zimo Yin;Jian Pu;Yijie Zhou;Xiangyang Xue
Knowledge distillation (KD) enhances student network generalization by transferring dark knowledge from a complex teacher network. To optimize computational expenditure and memory utilization, self-knowledge distillation (SKD) extracts dark knowledge from the model itself rather than an external teacher network. However, previous SKD methods performed distillation indiscriminately on full datasets, overlooking the analysis of representative samples. In this work, we present a novel two-stage approach to providing targeted knowledge on specific samples, named two-stage approach self-knowledge distillation (TOAST). We first soften the hard targets using class medoids generated based on logit vectors per class. Then, we iteratively distill the under-trained data with past predictions of half the batch size. The two-stage knowledge is linearly combined, efficiently enhancing model performance. Extensive experiments conducted on five backbone architectures show our method is model-agnostic and achieves the best generalization performance. Besides, TOAST is strongly compatible with existing augmentation-based regularization methods. Our method also obtains a speedup of up to 2.95x compared with a recent state-of-the-art method.
知识蒸馏(KD)通过从复杂的教师网络中转移暗知识来增强学生网络的泛化能力。为了优化计算支出和内存利用率,自知识蒸馏(SKD)从模型本身而非外部教师网络中提取暗知识。然而,以往的自知蒸馏方法都是不加区分地对完整数据集进行蒸馏,忽略了对代表性样本的分析。在这项工作中,我们提出了一种新颖的两阶段方法,为特定样本提供有针对性的知识,命名为两阶段方法自我知识蒸馏(TOAST)。我们首先使用基于每类对数向量生成的类 medoids 来软化硬目标。然后,我们用过去预测的一半批量数据对训练不足的数据进行迭代蒸馏。两阶段知识线性组合,有效提高了模型性能。在五种骨干架构上进行的广泛实验表明,我们的方法与模型无关,并实现了最佳的泛化性能。此外,TOAST 与现有的基于增强的正则化方法具有很强的兼容性。与最近最先进的方法相比,我们的方法的速度提高了 2.95 倍。
{"title":"Two-Stage Approach for Targeted Knowledge Transfer in Self-Knowledge Distillation","authors":"Zimo Yin;Jian Pu;Yijie Zhou;Xiangyang Xue","doi":"10.1109/JAS.2024.124629","DOIUrl":"https://doi.org/10.1109/JAS.2024.124629","url":null,"abstract":"Knowledge distillation (KD) enhances student network generalization by transferring dark knowledge from a complex teacher network. To optimize computational expenditure and memory utilization, self-knowledge distillation (SKD) extracts dark knowledge from the model itself rather than an external teacher network. However, previous SKD methods performed distillation indiscriminately on full datasets, overlooking the analysis of representative samples. In this work, we present a novel two-stage approach to providing targeted knowledge on specific samples, named two-stage approach self-knowledge distillation (TOAST). We first soften the hard targets using class medoids generated based on logit vectors per class. Then, we iteratively distill the under-trained data with past predictions of half the batch size. The two-stage knowledge is linearly combined, efficiently enhancing model performance. Extensive experiments conducted on five backbone architectures show our method is model-agnostic and achieves the best generalization performance. Besides, TOAST is strongly compatible with existing augmentation-based regularization methods. Our method also obtains a speedup of up to 2.95x compared with a recent state-of-the-art method.","PeriodicalId":54230,"journal":{"name":"Ieee-Caa Journal of Automatica Sinica","volume":null,"pages":null},"PeriodicalIF":15.3,"publicationDate":"2024-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142397459","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Ieee-Caa Journal of Automatica Sinica
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1