首页 > 最新文献

IEEE open journal of control systems最新文献

英文 中文
DeepSplit: Scalable Verification of Deep Neural Networks via Operator Splitting DeepSplit:通过算子分裂对深度神经网络进行可扩展验证
Pub Date : 2022-06-30 DOI: 10.1109/OJCSYS.2022.3187429
Shaoru Chen;Eric Wong;J. Zico Kolter;Mahyar Fazlyab
Analyzing the worst-case performance of deep neural networks against input perturbations amounts to solving a large-scale non-convex optimization problem, for which several past works have proposed convex relaxations as a promising alternative. However, even for reasonably-sized neural networks, these relaxations are not tractable, and so must be replaced by even weaker relaxations in practice. In this work, we propose a novel operator splitting method that can directly solve a convex relaxation of the problem to high accuracy, by splitting it into smaller sub-problems that often have analytical solutions. The method is modular, scales to very large problem instances, and compromises of operations that are amenable to fast parallelization with GPU acceleration. We demonstrate our method in bounding the worst-case performance of large convolutional networks in image classification and reinforcement learning settings, and in reachability analysis of neural network dynamical systems.
分析深度神经网络对输入扰动的最坏情况性能相当于解决一个大规模的非凸优化问题,过去的几项工作已经提出了凸松弛作为一种有前途的替代方案。然而,即使对于大小合理的神经网络,这些松弛也是不可处理的,因此在实践中必须用更弱的松弛来取代。在这项工作中,我们提出了一种新的算子分裂方法,通过将问题分裂成较小的子问题,通常具有解析解,可以直接高精度地解决问题的凸松弛。该方法是模块化的,可以扩展到非常大的问题实例,并且可以折衷操作,以适应GPU加速的快速并行化。我们在图像分类和强化学习设置中,以及在神经网络动态系统的可达性分析中,演示了我们的方法来限制大型卷积网络的最坏情况性能。
{"title":"DeepSplit: Scalable Verification of Deep Neural Networks via Operator Splitting","authors":"Shaoru Chen;Eric Wong;J. Zico Kolter;Mahyar Fazlyab","doi":"10.1109/OJCSYS.2022.3187429","DOIUrl":"https://doi.org/10.1109/OJCSYS.2022.3187429","url":null,"abstract":"Analyzing the worst-case performance of deep neural networks against input perturbations amounts to solving a large-scale non-convex optimization problem, for which several past works have proposed convex relaxations as a promising alternative. However, even for reasonably-sized neural networks, these relaxations are not tractable, and so must be replaced by even weaker relaxations in practice. In this work, we propose a novel operator splitting method that can directly solve a convex relaxation of the problem to high accuracy, by splitting it into smaller sub-problems that often have analytical solutions. The method is modular, scales to very large problem instances, and compromises of operations that are amenable to fast parallelization with GPU acceleration. We demonstrate our method in bounding the worst-case performance of large convolutional networks in image classification and reinforcement learning settings, and in reachability analysis of neural network dynamical systems.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"1 ","pages":"126-140"},"PeriodicalIF":0.0,"publicationDate":"2022-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/9552933/9683993/09811356.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50349024","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Dissipative Deep Neural Dynamical Systems 耗散深度神经动力学系统
Pub Date : 2022-06-28 DOI: 10.1109/OJCSYS.2022.3186838
Ján Drgoňa;Aaron Tuor;Soumya Vasisht;Draguna Vrabie
In this paper, we provide sufficient conditions for dissipativity and local asymptotic stability of discrete-time dynamical systems parametrized by deep neural networks. We leverage the representation of neural networks as pointwise affine maps, thus exposing their local linear operators and making them accessible to classical system analytic and design methods. This allows us to “crack open the black box” of the neural dynamical system’s behavior by evaluating their dissipativity, and estimating their stationary points and state-space partitioning. We relate the norms of these local linear operators to the energy stored in the dissipative system with supply rates represented by their aggregate bias terms. Empirically, we analyze the variance in dynamical behavior and eigenvalue spectra of these local linear operators with varying weight factorizations, activation functions, bias terms, and depths.
本文给出了由深度神经网络参数化的离散时间动力系统的耗散性和局部渐近稳定性的充分条件。我们利用神经网络作为逐点仿射映射的表示,从而暴露其局部线性算子,并使其可用于经典的系统分析和设计方法。这使我们能够通过评估神经动力系统的耗散性、估计其驻点和状态空间划分来“打开”神经动力系统行为的黑匣子。我们将这些局部线性算子的范数与耗散系统中存储的能量联系起来,耗散系统的供应率由它们的总偏差项表示。根据经验,我们分析了这些局部线性算子的动力学行为和特征值谱的方差,这些算子具有不同的权重因子分解、激活函数、偏差项和深度。
{"title":"Dissipative Deep Neural Dynamical Systems","authors":"Ján Drgoňa;Aaron Tuor;Soumya Vasisht;Draguna Vrabie","doi":"10.1109/OJCSYS.2022.3186838","DOIUrl":"https://doi.org/10.1109/OJCSYS.2022.3186838","url":null,"abstract":"In this paper, we provide sufficient conditions for dissipativity and local asymptotic stability of discrete-time dynamical systems parametrized by deep neural networks. We leverage the representation of neural networks as pointwise affine maps, thus exposing their local linear operators and making them accessible to classical system analytic and design methods. This allows us to “crack open the black box” of the neural dynamical system’s behavior by evaluating their dissipativity, and estimating their stationary points and state-space partitioning. We relate the norms of these local linear operators to the energy stored in the dissipative system with supply rates represented by their aggregate bias terms. Empirically, we analyze the variance in dynamical behavior and eigenvalue spectra of these local linear operators with varying weight factorizations, activation functions, bias terms, and depths.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"1 ","pages":"100-112"},"PeriodicalIF":0.0,"publicationDate":"2022-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/9552933/9683993/09809789.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50349022","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A Discrete Fractional Order Adaptive Law for Parameter Estimation and Adaptive Control 用于参数估计和自适应控制的离散分数阶自适应律
Pub Date : 2022-06-21 DOI: 10.1109/OJCSYS.2022.3185002
Mohamed Aburakhis;Raúl Ordóñez;Ouboti Djaneye-Boundjou
In this article, a discrete fractional order adaptive law (DFOAL) is designed based on the Caputo fractional difference to perform parameter estimation of structured uncertainties. The paper provides a rigorous stability analysis of the DFOAL parameter estimation method. The DFOAL is then modified in order to improve parameter estimator performance to show that, under certain conditions, it provides asymptotic convergence to the true parameter values even when the regressor is not persistently exciting. A method to allow for practical implementation of the DFOAL and the modified DFOAL is developed. Finally, the modified DFOAL is used to identify the plant parameters in an indirect adaptive control law for a class of nonlinear discrete-time systems with structured uncertainty.
本文设计了一种基于Caputo分数差分的离散分数阶自适应律(DFOAL),用于对结构不确定性进行参数估计。本文对DFOAL参数估计方法进行了严格的稳定性分析。然后对DFOAL进行了修改,以提高参数估计器的性能,从而表明在某些条件下,即使回归器不是持续激励的,它也能提供对真实参数值的渐近收敛性。开发了一种允许DFOAL和修改后的DFOAL的实际实现的方法。最后,使用改进的DFOAL来识别一类具有结构不确定性的非线性离散时间系统的间接自适应控制律中的对象参数。
{"title":"A Discrete Fractional Order Adaptive Law for Parameter Estimation and Adaptive Control","authors":"Mohamed Aburakhis;Raúl Ordóñez;Ouboti Djaneye-Boundjou","doi":"10.1109/OJCSYS.2022.3185002","DOIUrl":"https://doi.org/10.1109/OJCSYS.2022.3185002","url":null,"abstract":"In this article, a discrete fractional order adaptive law (DFOAL) is designed based on the Caputo fractional difference to perform parameter estimation of structured uncertainties. The paper provides a rigorous stability analysis of the DFOAL parameter estimation method. The DFOAL is then modified in order to improve parameter estimator performance to show that, under certain conditions, it provides asymptotic convergence to the true parameter values even when the regressor is not persistently exciting. A method to allow for practical implementation of the DFOAL and the modified DFOAL is developed. Finally, the modified DFOAL is used to identify the plant parameters in an indirect adaptive control law for a class of nonlinear discrete-time systems with structured uncertainty.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"1 ","pages":"113-125"},"PeriodicalIF":0.0,"publicationDate":"2022-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/9552933/9683993/09802697.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50349023","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learning Lipschitz Feedback Policies From Expert Demonstrations: Closed-Loop Guarantees, Robustness and Generalization 从专家演示中学习Lipschitz反馈策略:闭环保证、鲁棒性和泛化
Pub Date : 2022-06-17 DOI: 10.1109/OJCSYS.2022.3181584
Abed AlRahman Al Makdah;Vishaal Krishnan;Fabio Pasqualetti
In this work, we propose a framework in which we use a Lipschitz-constrained loss minimization scheme to learn feedback control policies with guarantees on closed-loop stability, adversarial robustness, and generalization. These policies are learned directly from expert demonstrations, contained in a dataset of state-control input pairs, without any prior knowledge of the task and system model. Our analysis exploits the Lipschitz property of the learned policies to obtain closed-loop guarantees on stability, adversarial robustness, and generalization over scenarios unexplored by the expert. In particular, first, we establish robust closed-loop stability under the learned control policy, where we provide guarantees that the closed-loop trajectory under the learned policy stays within a bounded region around the expert trajectory and converges asymptotically to a bounded region around the origin. Second, we derive bounds on the closed-loop regret with respect to the expert policy and on the deterioration of the closed-loop performance under bounded (adversarial) disturbances to the state measurements. These bounds provide certificates for closed-loop performance and adversarial robustness for learned policies. Third, we derive a (probabilistic) bound on generalization error for the learned policies. Numerical results validate our analysis and demonstrate the effectiveness of our robust feedback policy learning framework. Finally, our results support the existence of a potential tradeoff between nominal closed-loop performance and adversarial robustness, and that improvements in nominal closed-loop performance can only be made at the expense of robustness to adversarial perturbations.
在这项工作中,我们提出了一个框架,在该框架中,我们使用Lipschitz约束的损失最小化方案来学习具有闭环稳定性、对抗性鲁棒性和泛化保证的反馈控制策略。这些策略直接从专家演示中学习,包含在状态控制输入对的数据集中,而不需要任何任务和系统模型的先验知识。我们的分析利用学习策略的Lipschitz特性,在专家未探索的场景中获得稳定性、对抗性鲁棒性和泛化的闭环保证。特别地,首先,我们在学习控制策略下建立了鲁棒闭环稳定性,其中我们保证学习策略下的闭环轨迹保持在专家轨迹周围的有界区域内,并渐近收敛到原点周围的有边界区域。其次,我们推导了关于专家策略的闭环遗憾的边界,以及在状态测量的有界(对抗性)扰动下闭环性能恶化的边界。这些边界为闭环性能和学习策略的对抗性鲁棒性提供了证书。第三,我们推导了学习策略的泛化误差的(概率)界。数值结果验证了我们的分析,并证明了我们稳健的反馈政策学习框架的有效性。最后,我们的结果支持名义闭环性能和对抗性鲁棒性之间存在潜在的折衷,并且名义闭环性能的改进只能以牺牲对抗性扰动的鲁棒性为代价。
{"title":"Learning Lipschitz Feedback Policies From Expert Demonstrations: Closed-Loop Guarantees, Robustness and Generalization","authors":"Abed AlRahman Al Makdah;Vishaal Krishnan;Fabio Pasqualetti","doi":"10.1109/OJCSYS.2022.3181584","DOIUrl":"https://doi.org/10.1109/OJCSYS.2022.3181584","url":null,"abstract":"In this work, we propose a framework in which we use a Lipschitz-constrained loss minimization scheme to learn feedback control policies with guarantees on closed-loop stability, adversarial robustness, and generalization. These policies are learned directly from expert demonstrations, contained in a dataset of state-control input pairs, without any prior knowledge of the task and system model. Our analysis exploits the Lipschitz property of the learned policies to obtain closed-loop guarantees on stability, adversarial robustness, and generalization over scenarios unexplored by the expert. In particular, first, we establish robust closed-loop stability under the learned control policy, where we provide guarantees that the closed-loop trajectory under the learned policy stays within a bounded region around the expert trajectory and converges asymptotically to a bounded region around the origin. Second, we derive bounds on the closed-loop regret with respect to the expert policy and on the deterioration of the closed-loop performance under bounded (adversarial) disturbances to the state measurements. These bounds provide certificates for closed-loop performance and adversarial robustness for learned policies. Third, we derive a (probabilistic) bound on generalization error for the learned policies. Numerical results validate our analysis and demonstrate the effectiveness of our robust feedback policy learning framework. Finally, our results support the existence of a potential tradeoff between nominal closed-loop performance and adversarial robustness, and that improvements in nominal closed-loop performance can only be made at the expense of robustness to adversarial perturbations.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"1 ","pages":"85-99"},"PeriodicalIF":0.0,"publicationDate":"2022-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/9552933/9683993/09798865.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50349021","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Generalized Lyapunov Demodulator: High-Bandwidth, Low-Noise Amplitude and Phase Estimation 广义李雅普诺夫解调器:高带宽、低噪声幅度和相位估计
Pub Date : 2022-06-08 DOI: 10.1109/OJCSYS.2022.3181111
Michael R. P. Ragazzon;Saverio Messineo;Jan Tommy Gravdahl;David M. Harcombe;Michael G. Ruppert
Effective demodulation of amplitude and phase is a requirement in a wide array of applications. Recent efforts have increased the demodulation performance, in particular, the Lyapunov demodulator allows bandwidths up to the carrier frequency of the signal. However, being inherently restricted to first-order filtering of the input signal, it is highly sensitive to frequency components outside its passband region. This makes it unsuitable for certain applications such as multifrequency atomic force microscopy (AFM). In this article, the structure of the Lyapunov demodulator is transformed to an equivalent form and generalized by exploiting the internal model principle. The resulting generalized Lyapunov demodulator structure allows for arbitrary filtering order and is easy to implement, requiring only a bandpass filter, a single integrator, and two nonlinear transformations. The generalized Lyapunov demodulator is implemented experimentally on a field-programmable gate array (FPGA). Then it is used for imaging in an AFM and benchmarked against the standard Lyapunov demodulator and the widely used lock-in amplifier. The lock-in amplifier achieves great noise attenuation capabilities and off-mode rejection at low bandwidths, whereas the standard Lyapunov demodulator is shown to be effective at high bandwidths. We demonstrate that the proposed demodulator combines the best from the two state-of-the-art demodulators, demonstrating high bandwidths, large off-mode rejection, and excellent noise attenuation simultaneously.
振幅和相位的有效解调是广泛应用中的要求。最近的努力提高了解调性能,特别是Lyapunov解调器允许高达信号载波频率的带宽。然而,由于固有地局限于输入信号的一阶滤波,它对其通带区域之外的频率分量高度敏感。这使得它不适合某些应用,例如多频原子力显微镜(AFM)。本文将李雅普诺夫解调器的结构转化为等效形式,并利用内模原理对其进行了推广。由此产生的广义李雅普诺夫解调器结构允许任意滤波阶数,并且易于实现,只需要一个带通滤波器、一个积分器和两个非线性变换。在现场可编程门阵列(FPGA)上实验实现了广义李雅普诺夫解调器。然后将其用于AFM中的成像,并与标准李雅普诺夫解调器和广泛使用的锁定放大器进行对比。锁定放大器在低带宽下实现了良好的噪声衰减能力和关模抑制,而标准李雅普诺夫解调器在高带宽下被证明是有效的。我们证明,所提出的解调器结合了两个最先进的解调器中的最佳解调器,同时展示了高带宽、大的离模抑制和出色的噪声衰减。
{"title":"The Generalized Lyapunov Demodulator: High-Bandwidth, Low-Noise Amplitude and Phase Estimation","authors":"Michael R. P. Ragazzon;Saverio Messineo;Jan Tommy Gravdahl;David M. Harcombe;Michael G. Ruppert","doi":"10.1109/OJCSYS.2022.3181111","DOIUrl":"https://doi.org/10.1109/OJCSYS.2022.3181111","url":null,"abstract":"Effective demodulation of amplitude and phase is a requirement in a wide array of applications. Recent efforts have increased the demodulation performance, in particular, the Lyapunov demodulator allows bandwidths up to the carrier frequency of the signal. However, being inherently restricted to first-order filtering of the input signal, it is highly sensitive to frequency components outside its passband region. This makes it unsuitable for certain applications such as multifrequency atomic force microscopy (AFM). In this article, the structure of the Lyapunov demodulator is transformed to an equivalent form and generalized by exploiting the internal model principle. The resulting generalized Lyapunov demodulator structure allows for arbitrary filtering order and is easy to implement, requiring only a bandpass filter, a single integrator, and two nonlinear transformations. The generalized Lyapunov demodulator is implemented experimentally on a field-programmable gate array (FPGA). Then it is used for imaging in an AFM and benchmarked against the standard Lyapunov demodulator and the widely used lock-in amplifier. The lock-in amplifier achieves great noise attenuation capabilities and off-mode rejection at low bandwidths, whereas the standard Lyapunov demodulator is shown to be effective at high bandwidths. We demonstrate that the proposed demodulator combines the best from the two state-of-the-art demodulators, demonstrating high bandwidths, large off-mode rejection, and excellent noise attenuation simultaneously.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"1 ","pages":"69-84"},"PeriodicalIF":0.0,"publicationDate":"2022-06-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/9552933/9683993/09790310.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50237536","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Noninvasive Breathing Effort Estimation of Mechanically Ventilated Patients Using Sparse Optimization 稀疏优化法估计机械通气患者的无创呼吸力
Pub Date : 2022-06-03 DOI: 10.1109/OJCSYS.2022.3180002
Joey Reinders;Bram Hunnekens;Nathan van de Wouw;Tom Oomen
Mechanical ventilators facilitate breathing for patients who cannot breathe (sufficiently) on their own. The aim of this paper is to estimate relevant lung parameters and the spontaneous breathing effort of a ventilated patient that help keeping track of the patient’s clinical condition. A key challenge is that estimation using the available sensors for typical model structures results in a non-identifiable parametrization. A sparse optimization algorithm to estimate the lung parameters and the patient effort, without interfering with the patient’s treatment, using an $ell _{1}$-regularization approach is presented. It is confirmed that accurate estimates of the lung parameters and the patient effort can be retrieved through a simulation case study and an experimental case study.
机械通气机有助于无法自主(充分)呼吸的患者的呼吸。本文的目的是估计相关的肺部参数和通气患者的自主呼吸力,以帮助跟踪患者的临床状况。一个关键的挑战是,使用典型模型结构的可用传感器进行估计会导致不可识别的参数化。提出了一种稀疏优化算法,在不干扰患者治疗的情况下,使用$ell_{1}$正则化方法来估计肺部参数和患者工作量。已经证实,可以通过模拟案例研究和实验案例研究来检索对肺部参数和患者努力的准确估计。
{"title":"Noninvasive Breathing Effort Estimation of Mechanically Ventilated Patients Using Sparse Optimization","authors":"Joey Reinders;Bram Hunnekens;Nathan van de Wouw;Tom Oomen","doi":"10.1109/OJCSYS.2022.3180002","DOIUrl":"https://doi.org/10.1109/OJCSYS.2022.3180002","url":null,"abstract":"Mechanical ventilators facilitate breathing for patients who cannot breathe (sufficiently) on their own. The aim of this paper is to estimate relevant lung parameters and the spontaneous breathing effort of a ventilated patient that help keeping track of the patient’s clinical condition. A key challenge is that estimation using the available sensors for typical model structures results in a non-identifiable parametrization. A sparse optimization algorithm to estimate the lung parameters and the patient effort, without interfering with the patient’s treatment, using an \u0000<inline-formula><tex-math>$ell _{1}$</tex-math></inline-formula>\u0000-regularization approach is presented. It is confirmed that accurate estimates of the lung parameters and the patient effort can be retrieved through a simulation case study and an experimental case study.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"1 ","pages":"57-68"},"PeriodicalIF":0.0,"publicationDate":"2022-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/9552933/9683993/09787788.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50255796","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Optimally Biomimetic Passivity-Based Control of a Lower-Limb Exoskeleton Over the Primary Activities of Daily Life 基于最佳仿生被动性的下肢外骨骼对日常生活主要活动的控制
Pub Date : 2022-04-12 DOI: 10.1109/OJCSYS.2022.3165733
Jianping Lin;Nikhil V. Divekar;Gray C. Thomas;Robert D. Gregg
Task-specific, trajectory-based control methods commonly used in exoskeletons may be appropriate for individuals with paraplegia, but they overly constrain the volitional motion of individuals with remnant voluntary ability (representing a far larger population). Human-exoskeleton systems can be represented in the form of the Euler-Lagrange equations or, equivalently, the port-controlled Hamiltonian equations to design control laws that provide task-invariant assistance across a continuum of activities/environments by altering energetic properties of the human body. We previously introduced a port-controlled Hamiltonian framework that parameterizes the control law through basis functions related to gravitational and gyroscopic terms, which are optimized to fit normalized able-bodied joint torques across multiple walking gaits on different ground inclines. However, this approach did not have the flexibility to reproduce joint torques for a broader set of activities, including stair climbing and stand-to-sit, due to strict assumptions related to input-output passivity, which ensures the human remains in control of energy growth in the closed-loop dynamics. To provide biomimetic assistance across all primary activities of daily life, this paper generalizes this energy shaping framework by incorporating vertical ground reaction forces and global planar orientation into the basis set, while preserving passivity between the human joint torques and human joint velocities. We present an experimental implementation on a powered knee-ankle exoskeleton used by three able-bodied human subjects during walking on various inclines, ramp ascent/descent, and stand-to-sit, demonstrating the versatility of this control approach and its effect on muscular effort.
外骨骼中常用的任务特异性、基于轨迹的控制方法可能适用于截瘫患者,但它们过度限制了具有残余自愿能力的患者(代表了更大的人群)的意志运动。人类外骨骼系统可以用欧拉-拉格朗日方程或等效的端口控制哈密顿方程的形式来表示,以设计控制律,该控制律通过改变人体的能量特性来在连续的活动/环境中提供任务不变的帮助。我们之前介绍了一种端口控制的哈密顿框架,该框架通过与重力项和陀螺项相关的基函数来参数化控制律,这些基函数经过优化,以适应不同地面坡度上多个步态上的标准化健全关节力矩。然而,由于与输入-输出被动性相关的严格假设,这种方法不具有为更广泛的活动(包括爬楼梯和站-坐)复制关节力矩的灵活性,这确保了人类在闭环动力学中保持对能量增长的控制。为了在日常生活的所有主要活动中提供仿生辅助,本文通过将垂直地面反作用力和全局平面方向纳入基集中来推广这种能量塑造框架,同时保持人体关节力矩和人体关节速度之间的被动性。我们介绍了三名身体健全的人类受试者在不同斜坡行走、坡道上升/下降和站到坐过程中使用的动力膝踝外骨骼的实验实现,展示了这种控制方法的多功能性及其对肌肉力量的影响。
{"title":"Optimally Biomimetic Passivity-Based Control of a Lower-Limb Exoskeleton Over the Primary Activities of Daily Life","authors":"Jianping Lin;Nikhil V. Divekar;Gray C. Thomas;Robert D. Gregg","doi":"10.1109/OJCSYS.2022.3165733","DOIUrl":"https://doi.org/10.1109/OJCSYS.2022.3165733","url":null,"abstract":"Task-specific, trajectory-based control methods commonly used in exoskeletons may be appropriate for individuals with paraplegia, but they overly constrain the volitional motion of individuals with remnant voluntary ability (representing a far larger population). Human-exoskeleton systems can be represented in the form of the Euler-Lagrange equations or, equivalently, the port-controlled Hamiltonian equations to design control laws that provide \u0000<italic>task-invariant</i>\u0000 assistance across a continuum of activities/environments by altering energetic properties of the human body. We previously introduced a port-controlled Hamiltonian framework that parameterizes the control law through basis functions related to gravitational and gyroscopic terms, which are optimized to fit normalized able-bodied joint torques across multiple walking gaits on different ground inclines. However, this approach did not have the flexibility to reproduce joint torques for a broader set of activities, including stair climbing and stand-to-sit, due to strict assumptions related to input-output passivity, which ensures the human remains in control of energy growth in the closed-loop dynamics. To provide biomimetic assistance across all primary activities of daily life, this paper generalizes this energy shaping framework by incorporating vertical ground reaction forces and global planar orientation into the basis set, while preserving passivity between the human joint torques and human joint velocities. We present an experimental implementation on a powered knee-ankle exoskeleton used by three able-bodied human subjects during walking on various inclines, ramp ascent/descent, and stand-to-sit, demonstrating the versatility of this control approach and its effect on muscular effort.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"1 ","pages":"15-28"},"PeriodicalIF":0.0,"publicationDate":"2022-04-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/9552933/9683993/09756252.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50237549","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Non-Stationary Representation Learning in Sequential Linear Bandits 序列线性带中的非平稳表示学习
Pub Date : 2022-03-27 DOI: 10.1109/OJCSYS.2022.3178540
Yuzhen Qin;Tommaso Menara;Samet Oymak;ShiNung Ching;Fabio Pasqualetti
In this paper, we study representation learning for multi-task decision-making in non-stationary environments. We consider the framework of sequential linear bandits, where the agent performs a series of tasks drawn from different environments. The embeddings of tasks in each environment share a low-dimensional feature extractor called representation, and representations are different across environments. We propose an online algorithm that facilitates efficient decision-making by learning and transferring non-stationary representations in an adaptive fashion. We prove that our algorithm significantly outperforms the existing ones that treat tasks independently. We also conduct experiments using both synthetic and real data to validate our theoretical insights and demonstrate the efficacy of our algorithm.
在本文中,我们研究了非平稳环境中多任务决策的表示学习。我们考虑序列线性土匪的框架,其中代理执行从不同环境中提取的一系列任务。每个环境中的任务嵌入共享一个称为表示的低维特征提取器,并且表示在不同环境中是不同的。我们提出了一种在线算法,通过以自适应方式学习和转移非平稳表示来促进高效决策。我们证明了我们的算法显著优于现有的独立处理任务的算法。我们还使用合成数据和真实数据进行了实验,以验证我们的理论见解,并证明我们算法的有效性。
{"title":"Non-Stationary Representation Learning in Sequential Linear Bandits","authors":"Yuzhen Qin;Tommaso Menara;Samet Oymak;ShiNung Ching;Fabio Pasqualetti","doi":"10.1109/OJCSYS.2022.3178540","DOIUrl":"https://doi.org/10.1109/OJCSYS.2022.3178540","url":null,"abstract":"In this paper, we study representation learning for multi-task decision-making in non-stationary environments. We consider the framework of sequential linear bandits, where the agent performs a series of tasks drawn from different environments. The embeddings of tasks in each environment share a low-dimensional feature extractor called \u0000<italic>representation</i>\u0000, and representations are different across environments. We propose an online algorithm that facilitates efficient decision-making by learning and transferring non-stationary representations in an adaptive fashion. We prove that our algorithm significantly outperforms the existing ones that treat tasks independently. We also conduct experiments using both synthetic and real data to validate our theoretical insights and demonstrate the efficacy of our algorithm.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"1 ","pages":"41-56"},"PeriodicalIF":0.0,"publicationDate":"2022-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/9552933/9683993/09783063.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50237535","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Data-Driven Distributed and Localized Model Predictive Control 数据驱动的分布式局部模型预测控制
Pub Date : 2022-03-11 DOI: 10.1109/OJCSYS.2022.3171787
Carmen Amo Alonso;Fengjun Yang;Nikolai Matni
Motivated by large-scale but computationally constrained settings, e.g., the Internet of Things, we present a novel data-driven distributed control algorithm that is synthesized directly from trajectory data. Our method, data-driven Distributed and Localized Model Predictive Control (D$^{3}$LMPC), builds upon the data-driven System Level Synthesis (SLS) framework, which allows one to parameterize closed-loop system responses directly from collected open-loop trajectories. The resulting model-predictive controller can be implemented with distributed computation and only local information sharing. By imposing locality constraints on the system response, we show that the amount of data needed for our synthesis problem is independent of the size of the global system. Moreover, we show that our algorithm enjoys theoretical guarantees for recursive feasibility and asymptotic stability. Finally, we also demonstrate the optimality and scalability of our algorithm in a simulation experiment.
受大规模但计算受限的环境(例如物联网)的启发,我们提出了一种直接从轨迹数据合成的新型数据驱动分布式控制算法。我们的方法,数据驱动的分布式和局部模型预测控制(D$^{3}$LMPC),建立在数据驱动的系统级综合(SLS)框架之上,该框架允许直接从收集的开环轨迹中参数化闭环系统响应。所得到的模型预测控制器可以通过分布式计算和仅局部信息共享来实现。通过对系统响应施加局部约束,我们表明我们的综合问题所需的数据量与全局系统的大小无关。此外,我们还证明了我们的算法具有递归可行性和渐近稳定性的理论保证。最后,我们还通过仿真实验证明了算法的最优性和可扩展性。
{"title":"Data-Driven Distributed and Localized Model Predictive Control","authors":"Carmen Amo Alonso;Fengjun Yang;Nikolai Matni","doi":"10.1109/OJCSYS.2022.3171787","DOIUrl":"https://doi.org/10.1109/OJCSYS.2022.3171787","url":null,"abstract":"Motivated by large-scale but computationally constrained settings, e.g., the Internet of Things, we present a novel data-driven distributed control algorithm that is synthesized directly from trajectory data. Our method, data-driven Distributed and Localized Model Predictive Control (D\u0000<inline-formula><tex-math>$^{3}$</tex-math></inline-formula>\u0000LMPC), builds upon the data-driven System Level Synthesis (SLS) framework, which allows one to parameterize \u0000<italic>closed-loop</i>\u0000 system responses directly from collected open-loop trajectories. The resulting model-predictive controller can be implemented with distributed computation and only local information sharing. By imposing locality constraints on the system response, we show that the amount of data needed for our synthesis problem is independent of the size of the global system. Moreover, we show that our algorithm enjoys theoretical guarantees for recursive feasibility and asymptotic stability. Finally, we also demonstrate the optimality and scalability of our algorithm in a simulation experiment.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"1 ","pages":"29-40"},"PeriodicalIF":0.0,"publicationDate":"2022-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/9552933/9683993/09772975.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50237550","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
An Algorithm to Warm Start Perturbed (WASP) Constrained Dynamic Programs 热启动扰动(WASP)约束动态程序的一种算法
Pub Date : 2022-02-14 DOI: 10.1109/OJCSYS.2022.3150535
Abhishek Gupta;Shreshta Rajakumar Deshpande;Marcello Canova
Receding horizon optimal control problems compute the solution at each time step to operate the system on a near-optimal path. However, in many practical cases, the boundary conditions, such as external inputs, constraint equations, or the objective function, vary only marginally from one time step to the next. In this case, recomputing the optimal solution at each time represents a significant burden for real-time applications. This paper proposes a novel algorithm to approximately solve a perturbed constrained dynamic program that significantly improves the computational burden when the objective function and the constraints are perturbed slightly. The method hinges on determining closed-form expressions for first-order perturbations in the optimal strategy and the Lagrange multipliers of the perturbed constrained dynamic programming problem. This information can be used to initialize any algorithm (such as the method of Lagrange multipliers, or the augmented Lagrangian method) to solve the perturbed dynamic programming problem with minimal computational resources.
递归时域最优控制问题在每个时间步长计算解,以使系统运行在接近最优的路径上。然而,在许多实际情况下,边界条件,如外部输入、约束方程或目标函数,在一个时间步长与下一个时间阶之间仅略有变化。在这种情况下,每次重新计算最优解对实时应用程序来说都是一个巨大的负担。本文提出了一种近似求解扰动约束动态程序的新算法,当目标函数和约束受到轻微扰动时,该算法显著提高了计算负担。该方法依赖于确定最优策略中一阶扰动的闭式表达式和扰动约束动态规划问题的拉格朗日乘子。该信息可用于初始化任何算法(如拉格朗日乘子法或增广拉格朗日法),以最小的计算资源解决扰动动态规划问题。
{"title":"An Algorithm to Warm Start Perturbed (WASP) Constrained Dynamic Programs","authors":"Abhishek Gupta;Shreshta Rajakumar Deshpande;Marcello Canova","doi":"10.1109/OJCSYS.2022.3150535","DOIUrl":"https://doi.org/10.1109/OJCSYS.2022.3150535","url":null,"abstract":"Receding horizon optimal control problems compute the solution at each time step to operate the system on a near-optimal path. However, in many practical cases, the boundary conditions, such as external inputs, constraint equations, or the objective function, vary only marginally from one time step to the next. In this case, recomputing the optimal solution at each time represents a significant burden for real-time applications. This paper proposes a novel algorithm to approximately solve a perturbed constrained dynamic program that significantly improves the computational burden when the objective function and the constraints are perturbed slightly. The method hinges on determining closed-form expressions for first-order perturbations in the optimal strategy and the Lagrange multipliers of the perturbed constrained dynamic programming problem. This information can be used to initialize any algorithm (such as the method of Lagrange multipliers, or the augmented Lagrangian method) to solve the perturbed dynamic programming problem with minimal computational resources.","PeriodicalId":73299,"journal":{"name":"IEEE open journal of control systems","volume":"1 ","pages":"1-14"},"PeriodicalIF":0.0,"publicationDate":"2022-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/9552933/9683993/09712345.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50237548","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
IEEE open journal of control systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1