首页 > 最新文献

arXiv - MATH - Optimization and Control最新文献

英文 中文
A kernel-based PEM estimator for forward model 基于核的前向模型 PEM 估计器
Pub Date : 2024-09-15 DOI: arxiv-2409.09679
Giulio Fattore, Marco Peruzzo, Giacomo Sartori, Mattia Zorzi
This paper addresses the problem of learning the impulse responsescharacterizing forward models by means of a regularized kernel-based PredictionError Method (PEM). The common approach to accomplish that is to approximatethe system with a high-order stable ARX model. However, such choice induces acertain undesired prior information in the system that we want to estimate. Toovercome this issue, we propose a new kernel-based paradigm which is formulateddirectly in terms of the impulse responses of the forward model and leading tothe identification of a high-order MAX model. The most challenging step is theestimation of the kernel hyperparameters optimizing the marginal likelihood.The latter, indeed, does not admit a closed form expression. We propose amethod for evaluating the marginal likelihood which makes possible thehyperparameters estimation. Finally, some numerical results showing theeffectiveness of the method are presented.
本文通过基于正则化核的预测误差法(PEM)来解决学习前向模型脉冲响应特征的问题。常用的方法是用高阶稳定 ARX 模型来逼近系统。然而,这种选择会在我们想要估计的系统中引起某些不想要的先验信息。为了克服这个问题,我们提出了一种基于核的新范式,它直接根据前向模型的脉冲响应进行表述,从而识别出高阶 MAX 模型。最具挑战性的步骤是优化边际似然的核超参数估计。我们提出了一种评估边际似然的方法,这使得超参数估计成为可能。最后,一些数值结果显示了该方法的有效性。
{"title":"A kernel-based PEM estimator for forward model","authors":"Giulio Fattore, Marco Peruzzo, Giacomo Sartori, Mattia Zorzi","doi":"arxiv-2409.09679","DOIUrl":"https://doi.org/arxiv-2409.09679","url":null,"abstract":"This paper addresses the problem of learning the impulse responses\u0000characterizing forward models by means of a regularized kernel-based Prediction\u0000Error Method (PEM). The common approach to accomplish that is to approximate\u0000the system with a high-order stable ARX model. However, such choice induces a\u0000certain undesired prior information in the system that we want to estimate. To\u0000overcome this issue, we propose a new kernel-based paradigm which is formulated\u0000directly in terms of the impulse responses of the forward model and leading to\u0000the identification of a high-order MAX model. The most challenging step is the\u0000estimation of the kernel hyperparameters optimizing the marginal likelihood.\u0000The latter, indeed, does not admit a closed form expression. We propose a\u0000method for evaluating the marginal likelihood which makes possible the\u0000hyperparameters estimation. Finally, some numerical results showing the\u0000effectiveness of the method are presented.","PeriodicalId":501286,"journal":{"name":"arXiv - MATH - Optimization and Control","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142254545","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Lossy Image Compression with Stochastic Quantization 利用随机量化技术压缩有损图像
Pub Date : 2024-09-14 DOI: arxiv-2409.09488
Anton Kozyriev, Vladimir Norkin
Lossy image compression algorithms play a crucial role in various domains,including graphics, and image processing. As image information densityincreases, so do the resources required for processing and transmission. One ofthe most prominent approaches to address this challenge is color quantization,proposed by Orchard et al. (1991). This technique optimally maps each pixel ofan image to a color from a limited palette, maintaining image resolution whilesignificantly reducing information content. Color quantization can beinterpreted as a clustering problem (Krishna et al. (1997), Wan (2019)), whereimage pixels are represented in a three-dimensional space, with each axiscorresponding to the intensity of an RGB channel. However, scaling oftraditional algorithms like K-Means can be challenging for large data, such asmodern images with millions of colors. This paper reframes color quantizationas a three-dimensional stochastic transportation problem between the set ofimage pixels and an optimal color palette, where the number of colors is apredefined hyperparameter. We employ Stochastic Quantization (SQ) with aseeding technique proposed by Arthur et al. (2007) to enhance the scalabilityof color quantization. This method introduces a probabilistic element to thequantization process, potentially improving efficiency and adaptability todiverse image characteristics. To demonstrate the efficiency of our approach,we present experimental results using images from the ImageNet dataset. Theseexperiments illustrate the performance of our Stochastic Quantization method interms of compression quality, computational efficiency, and scalabilitycompared to traditional color quantization techniques.
有损图像压缩算法在图形和图像处理等多个领域发挥着重要作用。随着图像信息密度的增加,处理和传输所需的资源也在增加。Orchard 等人(1991 年)提出的色彩量化技术是应对这一挑战的最主要方法之一。这种技术将图像的每个像素从有限的调色板中最优化地映射成一种颜色,在保持图像分辨率的同时大大减少了信息含量。颜色量化可以被解释为一个聚类问题(Krishna 等人(1997),Wan(2019)),其中图像像素在三维空间中表示,每个轴对应一个 RGB 通道的强度。然而,像 K-Means 这样的传统算法的扩展对于大型数据(如具有数百万种颜色的现代图像)来说具有挑战性。本文将颜色量化重构为图像像素集与最优调色板之间的三维随机运输问题,其中颜色的数量是一个已定义的超参数。我们采用亚瑟等人(2007 年)提出的播种技术随机量化(SQ)来增强色彩量化的可扩展性。这种方法在量化过程中引入了概率元素,可能会提高效率和对不同图像特征的适应性。为了证明我们方法的效率,我们使用 ImageNet 数据集中的图像给出了实验结果。与传统的颜色量化技术相比,这些实验说明了我们的随机量化方法在压缩质量、计算效率和可扩展性方面的性能。
{"title":"Lossy Image Compression with Stochastic Quantization","authors":"Anton Kozyriev, Vladimir Norkin","doi":"arxiv-2409.09488","DOIUrl":"https://doi.org/arxiv-2409.09488","url":null,"abstract":"Lossy image compression algorithms play a crucial role in various domains,\u0000including graphics, and image processing. As image information density\u0000increases, so do the resources required for processing and transmission. One of\u0000the most prominent approaches to address this challenge is color quantization,\u0000proposed by Orchard et al. (1991). This technique optimally maps each pixel of\u0000an image to a color from a limited palette, maintaining image resolution while\u0000significantly reducing information content. Color quantization can be\u0000interpreted as a clustering problem (Krishna et al. (1997), Wan (2019)), where\u0000image pixels are represented in a three-dimensional space, with each axis\u0000corresponding to the intensity of an RGB channel. However, scaling of\u0000traditional algorithms like K-Means can be challenging for large data, such as\u0000modern images with millions of colors. This paper reframes color quantization\u0000as a three-dimensional stochastic transportation problem between the set of\u0000image pixels and an optimal color palette, where the number of colors is a\u0000predefined hyperparameter. We employ Stochastic Quantization (SQ) with a\u0000seeding technique proposed by Arthur et al. (2007) to enhance the scalability\u0000of color quantization. This method introduces a probabilistic element to the\u0000quantization process, potentially improving efficiency and adaptability to\u0000diverse image characteristics. To demonstrate the efficiency of our approach,\u0000we present experimental results using images from the ImageNet dataset. These\u0000experiments illustrate the performance of our Stochastic Quantization method in\u0000terms of compression quality, computational efficiency, and scalability\u0000compared to traditional color quantization techniques.","PeriodicalId":501286,"journal":{"name":"arXiv - MATH - Optimization and Control","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142254546","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Initial Error Affection and Error Correction in Linear Quadratic Mean Field Games under Erroneous Initial Information 错误初始信息下线性二次均场博弈中的初始错误情感和错误纠正
Pub Date : 2024-09-14 DOI: arxiv-2409.09375
Yuxin Jin, Lu Ren, Wang Yao, Xiao Zhang
In this paper, the initial error affection and error correction in linearquadratic mean field games (MPLQMFGs) under erroneous initial distributioninformation are investigated. First, a LQMFG model is developed where agentsare coupled by dynamics and cost functions. Next, by studying the evolutionaryof LQMFGs under erroneous initial distributions information, the affection ofinitial error on the game and agents' strategies are given. Furthermore, underdeterministic situation, we provide a sufficient condition for agents tocorrect initial error and give their optimal strategies when agents are allowedto change their strategies at a intermediate time. Besides, the situation whereagents are allowed to predict MF and adjust their strategies in real-time isconsidered. Finally, simulations are performed to verify above conclusions.
本文研究了线性二次均值场博弈(MPLQMFGs)在错误的初始分布信息下的初始误差影响和误差修正问题。首先,建立了一个 LQMFG 模型,该模型中的代理由动力学和成本函数耦合。接着,通过研究错误初始分布信息下 LQMFG 的演化,给出了初始误差对博弈和代理策略的影响。此外,在非确定性情况下,我们提供了代理纠正初始错误的充分条件,并给出了允许代理在中间时间改变策略时的最优策略。此外,我们还考虑了允许代理预测 MF 并实时调整策略的情况。最后,通过模拟验证了上述结论。
{"title":"Initial Error Affection and Error Correction in Linear Quadratic Mean Field Games under Erroneous Initial Information","authors":"Yuxin Jin, Lu Ren, Wang Yao, Xiao Zhang","doi":"arxiv-2409.09375","DOIUrl":"https://doi.org/arxiv-2409.09375","url":null,"abstract":"In this paper, the initial error affection and error correction in linear\u0000quadratic mean field games (MPLQMFGs) under erroneous initial distribution\u0000information are investigated. First, a LQMFG model is developed where agents\u0000are coupled by dynamics and cost functions. Next, by studying the evolutionary\u0000of LQMFGs under erroneous initial distributions information, the affection of\u0000initial error on the game and agents' strategies are given. Furthermore, under\u0000deterministic situation, we provide a sufficient condition for agents to\u0000correct initial error and give their optimal strategies when agents are allowed\u0000to change their strategies at a intermediate time. Besides, the situation where\u0000agents are allowed to predict MF and adjust their strategies in real-time is\u0000considered. Finally, simulations are performed to verify above conclusions.","PeriodicalId":501286,"journal":{"name":"arXiv - MATH - Optimization and Control","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142254549","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Observability inequalities for heat equations with potentials 有势热方程的可观测性不等式
Pub Date : 2024-09-14 DOI: arxiv-2409.09476
Jiuyi Zhu, Jinping Zhuge
This paper is mainly concerned with the observability inequalities for heatequations with time-dependent Lipschtiz potentials. The observabilityinequality for heat equations asserts that the total energy of a solution isbounded above by the energy localized in a subdomain with an observabilityconstant. For a bounded measurable potential $V = V(x,t)$, the factor in theobservability constant arising from the Carleman estimate is best known to be$exp(C|V|_{infty}^{2/3})$ (even for time-independent potentials). In thispaper, we show that, for Lipschtiz potentials, this factor can be replaced by$exp(C(|nabla V|_{infty}^{1/2} +|partial_tV|_{infty}^{1/3} ))$, whichimproves the previous bound $exp(C|V|_{infty}^{2/3})$ in some typicalscenarios. As a consequence, with such a Lipschitz potential, we obtain aquantitative regular control in a null controllability problem. In addition,for the one-dimensional heat equation with some time-independent boundedmeasurable potential $V = V(x)$, we obtain the optimal observability constant.
本文主要研究具有时变 Lipschtiz 势的热方程的可观测性不等式。热方程的可观测性不等式断言,解的总能量由局部子域中具有可观测性常数的能量限定。对于有界可测的势 $V = V(x,t)$,卡勒曼估计所产生的可观测性常数的因子已知为$exp(C|V|_{infty}^{2/3})$(即使对于与时间无关的势)。在本文中,我们证明了对于利普西奇兹电势,这个系数可以被$exp(C(|nabla V|_{infty}^{1/2} +|partial_tV|_{infty}^{1/3} ))$ 取代,这在某些典型情况下改进了之前的约束$exp(C|V|_{infty}^{2/3})$。因此,有了这样一个 Lipschitz 势,我们就能在空可控性问题中获得定量正则控制。此外,对于一维热方程与某种时间无关的有界可测量势 $V = V(x)$,我们得到了最优可观测常数。
{"title":"Observability inequalities for heat equations with potentials","authors":"Jiuyi Zhu, Jinping Zhuge","doi":"arxiv-2409.09476","DOIUrl":"https://doi.org/arxiv-2409.09476","url":null,"abstract":"This paper is mainly concerned with the observability inequalities for heat\u0000equations with time-dependent Lipschtiz potentials. The observability\u0000inequality for heat equations asserts that the total energy of a solution is\u0000bounded above by the energy localized in a subdomain with an observability\u0000constant. For a bounded measurable potential $V = V(x,t)$, the factor in the\u0000observability constant arising from the Carleman estimate is best known to be\u0000$exp(C|V|_{infty}^{2/3})$ (even for time-independent potentials). In this\u0000paper, we show that, for Lipschtiz potentials, this factor can be replaced by\u0000$exp(C(|nabla V|_{infty}^{1/2} +|partial_tV|_{infty}^{1/3} ))$, which\u0000improves the previous bound $exp(C|V|_{infty}^{2/3})$ in some typical\u0000scenarios. As a consequence, with such a Lipschitz potential, we obtain a\u0000quantitative regular control in a null controllability problem. In addition,\u0000for the one-dimensional heat equation with some time-independent bounded\u0000measurable potential $V = V(x)$, we obtain the optimal observability constant.","PeriodicalId":501286,"journal":{"name":"arXiv - MATH - Optimization and Control","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142254548","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Addressing Discrete Dynamic Optimization via a Logic-Based Discrete-Steepest Descent Algorithm 通过基于逻辑的离散陡坡下降算法解决离散动态优化问题
Pub Date : 2024-09-14 DOI: arxiv-2409.09237
Zedong Peng, Albert Lee, David E. Bernal Neira
Dynamic optimization problems involving discrete decisions have severalapplications, yet lead to challenging optimization problems that must beaddressed efficiently. Combining discrete variables with potentially nonlinearconstraints stemming from dynamics within an optimization model results inmathematical programs for which off-the-shelf techniques might be insufficient.This work uses a novel approach, the Logic-based Discrete-Steepest DescentAlgorithm (LD-SDA), to solve Discrete Dynamic Optimization problems. Theproblems are formulated using Boolean variables that enforce differentialsystems of constraints and encode logic constraints that the optimizationproblem needs to satisfy. By posing the problem as a generalized disjunctiveprogram with dynamic equations within the disjunctions, the LD-SDA takesadvantage of the problem's inherent structure to efficiently explore thecombinatorial space of the Boolean variables and selectively include relevantdifferential equations to mitigate the computational complexity inherent indynamic optimization scenarios. We rigorously evaluate the LD-SDA withbenchmark problems from the literature that include dynamic transitioning modesand find it to outperform traditional methods, i.e., mixed-integer nonlinearand generalized disjunctive programming solvers, in terms of efficiency andcapability to handle dynamic scenarios. This work presents a systematic methodand provides an open-source software implementation to address these discretedynamic optimization problems by harnessing the information within itslogical-differential structure.
涉及离散决策的动态优化问题有多种应用,但却导致了必须有效解决的具有挑战性的优化问题。将离散变量与优化模型中动态产生的潜在非线性约束相结合,会产生现成技术可能无法满足要求的数学程序。这项研究采用了一种新方法--基于逻辑的离散-陡坡下降算法(LD-SDA)来解决离散动态优化问题。这些问题使用布尔变量来表述,布尔变量强制执行差分约束系统,并对优化问题需要满足的逻辑约束进行编码。LD-SDA 将问题假设为一个广义的带动态方程的分节式程序,利用问题固有的结构优势,高效地探索布尔变量的组合空间,并有选择性地包含相关的微分方程,以减轻动态优化方案固有的计算复杂性。我们利用文献中包含动态转换模式的基准问题对 LD-SDA 进行了严格评估,发现它在处理动态场景的效率和能力方面优于传统方法,即混合整数非线性和广义互断编程求解器。这项工作提出了一种系统方法,并提供了一个开源软件实现,通过利用其逻辑差分结构中的信息来解决这些离散动态优化问题。
{"title":"Addressing Discrete Dynamic Optimization via a Logic-Based Discrete-Steepest Descent Algorithm","authors":"Zedong Peng, Albert Lee, David E. Bernal Neira","doi":"arxiv-2409.09237","DOIUrl":"https://doi.org/arxiv-2409.09237","url":null,"abstract":"Dynamic optimization problems involving discrete decisions have several\u0000applications, yet lead to challenging optimization problems that must be\u0000addressed efficiently. Combining discrete variables with potentially nonlinear\u0000constraints stemming from dynamics within an optimization model results in\u0000mathematical programs for which off-the-shelf techniques might be insufficient.\u0000This work uses a novel approach, the Logic-based Discrete-Steepest Descent\u0000Algorithm (LD-SDA), to solve Discrete Dynamic Optimization problems. The\u0000problems are formulated using Boolean variables that enforce differential\u0000systems of constraints and encode logic constraints that the optimization\u0000problem needs to satisfy. By posing the problem as a generalized disjunctive\u0000program with dynamic equations within the disjunctions, the LD-SDA takes\u0000advantage of the problem's inherent structure to efficiently explore the\u0000combinatorial space of the Boolean variables and selectively include relevant\u0000differential equations to mitigate the computational complexity inherent in\u0000dynamic optimization scenarios. We rigorously evaluate the LD-SDA with\u0000benchmark problems from the literature that include dynamic transitioning modes\u0000and find it to outperform traditional methods, i.e., mixed-integer nonlinear\u0000and generalized disjunctive programming solvers, in terms of efficiency and\u0000capability to handle dynamic scenarios. This work presents a systematic method\u0000and provides an open-source software implementation to address these discrete\u0000dynamic optimization problems by harnessing the information within its\u0000logical-differential structure.","PeriodicalId":501286,"journal":{"name":"arXiv - MATH - Optimization and Control","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142254552","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimal Adaptive Control of Linear Stochastic Systems with Quadratic Cost Function 具有二次成本函数的线性随机系统的最优自适应控制
Pub Date : 2024-09-14 DOI: arxiv-2409.09250
Nian Liu, Cheng Zhao, Shaolin Tan, Jinhu Lü
In this paper, we consider the adaptive linear quadratic Gaussian controlproblem, where both the linear transformation matrix of the state $A$ and thecontrol gain matrix $B$ are unknown. The proposed adaptive optimal control onlyassumes that $(A, B)$ is stabilizable and $(A, Q^{1/2})$ is detectable, where$Q$ is the weighting matrix of the state in the quadratic cost function. Thiscondition significantly weakens the classic assumptions used in the literature.To tackle this problem, a weighted least squares algorithm is modified by usingrandom regularization method, which can ensure uniform stabilizability anduniform detectability of the family of estimated models. At the same time, adiminishing excitation is incorporated into the design of the proposed adaptivecontrol to guarantee strong consistency of the desired components of theestimates. Finally, by utilizing this family of estimates, even if not allcomponents of them converge to the true values, it is demonstrated that acertainty equivalence control with such a diminishing excitation is optimal foran ergodic quadratic cost function.
本文考虑的是自适应线性二次高斯控制问题,其中状态的线性变换矩阵 $A$ 和控制增益矩阵 $B$ 都是未知的。本文提出的自适应最优控制只假定 $(A, B)$ 是可稳定的,且 $(A, Q^{1/2})$ 是可检测的,其中 $Q$ 是二次成本函数中的状态加权矩阵。为了解决这个问题,我们采用随机正则化方法对加权最小二乘法进行了改进,从而确保估计模型族的均匀可稳定和均匀可检测性。同时,在所提出的自适应控制设计中加入了最小激励,以保证估计值的理想成分具有很强的一致性。最后,通过利用这个估计值系列,即使不是所有的估计值都收敛到真实值,也证明了具有这种递减激励的确定性等价控制对于遍历二次成本函数是最优的。
{"title":"Optimal Adaptive Control of Linear Stochastic Systems with Quadratic Cost Function","authors":"Nian Liu, Cheng Zhao, Shaolin Tan, Jinhu Lü","doi":"arxiv-2409.09250","DOIUrl":"https://doi.org/arxiv-2409.09250","url":null,"abstract":"In this paper, we consider the adaptive linear quadratic Gaussian control\u0000problem, where both the linear transformation matrix of the state $A$ and the\u0000control gain matrix $B$ are unknown. The proposed adaptive optimal control only\u0000assumes that $(A, B)$ is stabilizable and $(A, Q^{1/2})$ is detectable, where\u0000$Q$ is the weighting matrix of the state in the quadratic cost function. This\u0000condition significantly weakens the classic assumptions used in the literature.\u0000To tackle this problem, a weighted least squares algorithm is modified by using\u0000random regularization method, which can ensure uniform stabilizability and\u0000uniform detectability of the family of estimated models. At the same time, a\u0000diminishing excitation is incorporated into the design of the proposed adaptive\u0000control to guarantee strong consistency of the desired components of the\u0000estimates. Finally, by utilizing this family of estimates, even if not all\u0000components of them converge to the true values, it is demonstrated that a\u0000certainty equivalence control with such a diminishing excitation is optimal for\u0000an ergodic quadratic cost function.","PeriodicalId":501286,"journal":{"name":"arXiv - MATH - Optimization and Control","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142254550","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Constraints-Informed Neural-Laguerre Approximation of Nonlinear MPC with Application in Power Electronics 非线性 MPC 的约束条件神经-拉盖尔逼近法在电力电子技术中的应用
Pub Date : 2024-09-14 DOI: arxiv-2409.09436
Duo Xu, Rody Aerts, Petros Karamanakos, Mircea Lazar
This paper considers learning online (implicit) nonlinear model predictivecontrol (MPC) laws using neural networks and Laguerre functions. Firstly, weparameterize the control sequence of nonlinear MPC using Laguerre functions,which typically yields a smoother control law compared to the originalnonlinear MPC law. Secondly, we employ neural networks to learn thecoefficients of the Laguerre nonlinear MPC solution, which comes with severalbenefits, namely the dimension of the learning space is dictated by the numberof Laguerre functions and the complete predicted input sequence can be used tolearn the coefficients. To mitigate constraints violation for neuralapproximations of nonlinear MPC, we develop a constraints-informed lossfunction that penalizes the violation of polytopic state constraints duringlearning. Box input constraints are handled by using a clamp function in theoutput layer of the neural network. We demonstrate the effectiveness of thedeveloped framework on a nonlinear buck-boost converter model with samplingrates in the sub-millisecond range, where online nonlinear MPC would not beable to run in real time. The developed constraints-informed neural-Laguerreapproximation yields similar performance with long-horizon online nonlinearMPC, but with execution times of a few microseconds, as validated on afield-programmable gate array (FPGA) platform.
本文考虑利用神经网络和拉盖尔函数学习在线(隐式)非线性模型预测控制(MPC)法则。首先,我们使用拉盖尔函数对非线性 MPC 的控制序列进行参数化,与原始的非线性 MPC 规律相比,这种方法通常能得到更平滑的控制规律。其次,我们利用神经网络来学习拉盖尔非线性 MPC 解决方案的系数,这样做有几个好处,即学习空间的维度由拉盖尔函数的数量决定,而且可以使用完整的预测输入序列来学习系数。为了减轻非线性 MPC 神经逼近的约束违反情况,我们开发了一种约束信息损失函数,对学习过程中违反多点状态约束的情况进行惩罚。在神经网络的输出层中使用钳位函数来处理盒式输入约束。我们在一个非线性降压-升压转换器模型上演示了所开发框架的有效性,该模型的采样率在亚毫秒范围内,在线非线性 MPC 无法实时运行。经现场可编程门阵列 (FPGA) 平台验证,所开发的约束信息神经-拉盖尔逼近方法与长视距在线非线性 MPC 性能相似,但执行时间仅为几微秒。
{"title":"Constraints-Informed Neural-Laguerre Approximation of Nonlinear MPC with Application in Power Electronics","authors":"Duo Xu, Rody Aerts, Petros Karamanakos, Mircea Lazar","doi":"arxiv-2409.09436","DOIUrl":"https://doi.org/arxiv-2409.09436","url":null,"abstract":"This paper considers learning online (implicit) nonlinear model predictive\u0000control (MPC) laws using neural networks and Laguerre functions. Firstly, we\u0000parameterize the control sequence of nonlinear MPC using Laguerre functions,\u0000which typically yields a smoother control law compared to the original\u0000nonlinear MPC law. Secondly, we employ neural networks to learn the\u0000coefficients of the Laguerre nonlinear MPC solution, which comes with several\u0000benefits, namely the dimension of the learning space is dictated by the number\u0000of Laguerre functions and the complete predicted input sequence can be used to\u0000learn the coefficients. To mitigate constraints violation for neural\u0000approximations of nonlinear MPC, we develop a constraints-informed loss\u0000function that penalizes the violation of polytopic state constraints during\u0000learning. Box input constraints are handled by using a clamp function in the\u0000output layer of the neural network. We demonstrate the effectiveness of the\u0000developed framework on a nonlinear buck-boost converter model with sampling\u0000rates in the sub-millisecond range, where online nonlinear MPC would not be\u0000able to run in real time. The developed constraints-informed neural-Laguerre\u0000approximation yields similar performance with long-horizon online nonlinear\u0000MPC, but with execution times of a few microseconds, as validated on a\u0000field-programmable gate array (FPGA) platform.","PeriodicalId":501286,"journal":{"name":"arXiv - MATH - Optimization and Control","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142254596","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
New Heuristics for the Operation of an Ambulance Fleet under Uncertainty 不确定情况下救护车队运行的新启发式方法
Pub Date : 2024-09-13 DOI: arxiv-2409.09158
Vincent Guigues, Anton J. Kleywegt, Victor Hugo Nascimento
The operation of an ambulance fleet involves ambulance selection decisionsabout which ambulance to dispatch to each emergency, and ambulance reassignmentdecisions about what each ambulance should do after it has finished the serviceassociated with an emergency. For ambulance selection decisions, we proposefour new heuristics: the Best Myopic (BM) heuristic, a NonMyopic (NM)heuristic, and two greedy heuristics (GHP1 and GHP2). Two variants of thegreedy heuristics are also considered. We also propose an optimization problemfor an extension of the BM heuristic, useful when a call for several patientsarrives. For ambulance reassignment decisions, we propose several strategies tochoose which emergency in queue to send an ambulance to or which ambulancestation to send an ambulance to when it finishes service. These heuristics arealso used in a rollout approach: each time a new decision has to be made (whena call arrives or when an ambulance finishes service), a two-stage stochasticprogram is solved. The proposed heuristics are used to efficiently compute thesecond stage cost of these problems. We apply the rollout approach with ourheuristics to data of the Emergency Medical Service (EMS) of a large city, andshow that these methods outperform other heuristics that have been proposed forambulance dispatch decisions. We also show that better response times can beobtained using the rollout approach instead of using the heuristics withoutrollout. Moreover, each decision is computed in a few seconds, which allowsthese methods to be used for the real-time management of a fleet of ambulances.
救护车队的运行包括救护车选择决策和救护车重新分派决策,前者涉及为每起突发事件派遣哪辆救护车,后者涉及每辆救护车在完成与突发事件相关的服务后应该做什么。针对救护车选择决策,我们提出了四种新的启发式方法:最佳近视(BM)启发式、非近视(NM)启发式和两种贪婪启发式(GHP1 和 GHP2)。我们还考虑了贪婪启发式的两种变体。我们还为 BM 启发式的扩展提出了一个优化问题,该问题在呼叫多名患者时非常有用。在救护车重新分配决策方面,我们提出了几种策略来选择将救护车送往队列中的哪个急救站,或在救护车结束服务后将其送往哪个救护车站。这些启发式方法也被应用于滚动方法中:每次需要做出新的决策时(当呼叫到达或救护车结束服务时),都需要求解一个两阶段的随机程序。所提出的启发式方法可用于有效计算这些问题的第二阶段成本。我们在一个大城市的紧急医疗服务(EMS)数据中应用了我们的启发式推出方法,结果表明这些方法优于其他已提出的救护车调度决策启发式方法。我们还表明,使用滚动方法比使用不滚动的启发式方法能获得更好的响应时间。此外,每次决策的计算只需几秒钟,这使得这些方法可以用于救护车车队的实时管理。
{"title":"New Heuristics for the Operation of an Ambulance Fleet under Uncertainty","authors":"Vincent Guigues, Anton J. Kleywegt, Victor Hugo Nascimento","doi":"arxiv-2409.09158","DOIUrl":"https://doi.org/arxiv-2409.09158","url":null,"abstract":"The operation of an ambulance fleet involves ambulance selection decisions\u0000about which ambulance to dispatch to each emergency, and ambulance reassignment\u0000decisions about what each ambulance should do after it has finished the service\u0000associated with an emergency. For ambulance selection decisions, we propose\u0000four new heuristics: the Best Myopic (BM) heuristic, a NonMyopic (NM)\u0000heuristic, and two greedy heuristics (GHP1 and GHP2). Two variants of the\u0000greedy heuristics are also considered. We also propose an optimization problem\u0000for an extension of the BM heuristic, useful when a call for several patients\u0000arrives. For ambulance reassignment decisions, we propose several strategies to\u0000choose which emergency in queue to send an ambulance to or which ambulance\u0000station to send an ambulance to when it finishes service. These heuristics are\u0000also used in a rollout approach: each time a new decision has to be made (when\u0000a call arrives or when an ambulance finishes service), a two-stage stochastic\u0000program is solved. The proposed heuristics are used to efficiently compute the\u0000second stage cost of these problems. We apply the rollout approach with our\u0000heuristics to data of the Emergency Medical Service (EMS) of a large city, and\u0000show that these methods outperform other heuristics that have been proposed for\u0000ambulance dispatch decisions. We also show that better response times can be\u0000obtained using the rollout approach instead of using the heuristics without\u0000rollout. Moreover, each decision is computed in a few seconds, which allows\u0000these methods to be used for the real-time management of a fleet of ambulances.","PeriodicalId":501286,"journal":{"name":"arXiv - MATH - Optimization and Control","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142254553","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Unified Funnel Restoration SQP Algorithm 统一的漏斗恢复 SQP 算法
Pub Date : 2024-09-13 DOI: arxiv-2409.09208
David Kiessling, Sven Leyffer, Charlie Vanaret
We consider nonlinearly constrained optimization problems and discuss ageneric double-loop framework consisting of four algorithmic ingredients thatunifies a broad range of nonlinear optimization solvers. This framework hasbeen implemented in the open-source solver Uno, a Swiss Army knife-like C++optimization framework that unifies many nonlinearly constrained nonconvexoptimization solvers. We illustrate the framework with a sequential quadraticprogramming (SQP) algorithm that maintains an acceptable upper bound on theconstraint violation, called a funnel, that is monotonically decreased tocontrol the feasibility of the iterates. Infeasible quadratic subproblems arehandled by a feasibility restoration strategy. Globalization is controlled by aline search or a trust-region method. We prove global convergence of thetrust-region funnel SQP method, building on known results from filter methods.We implement the algorithm in Uno, and we provide extensive test results forthe trust-region line-search funnel SQP on small CUTEst instances.
我们考虑了非线性约束优化问题,并讨论了由四种算法成分组成的通用双环框架,该框架统一了广泛的非线性优化求解器。该框架已在开源求解器 Uno 中实现,Uno 是一个类似瑞士军刀的 C++ 优化框架,它统一了许多非线性约束非凸优化求解器。我们用一种顺序二次编程(SQP)算法来说明该框架,该算法对违反约束的情况保持一个可接受的上限,称为漏斗,该漏斗单调递减,以控制迭代的可行性。不可行的二次子问题由可行性恢复策略处理。全局化由直线搜索或信任区域方法控制。我们以滤波方法的已知结果为基础,证明了信任区域漏斗 SQP 方法的全局收敛性。我们在 Uno 中实现了该算法,并在小型 CUTEst 实例上提供了信任区域线性搜索漏斗 SQP 的大量测试结果。
{"title":"A Unified Funnel Restoration SQP Algorithm","authors":"David Kiessling, Sven Leyffer, Charlie Vanaret","doi":"arxiv-2409.09208","DOIUrl":"https://doi.org/arxiv-2409.09208","url":null,"abstract":"We consider nonlinearly constrained optimization problems and discuss a\u0000generic double-loop framework consisting of four algorithmic ingredients that\u0000unifies a broad range of nonlinear optimization solvers. This framework has\u0000been implemented in the open-source solver Uno, a Swiss Army knife-like C++\u0000optimization framework that unifies many nonlinearly constrained nonconvex\u0000optimization solvers. We illustrate the framework with a sequential quadratic\u0000programming (SQP) algorithm that maintains an acceptable upper bound on the\u0000constraint violation, called a funnel, that is monotonically decreased to\u0000control the feasibility of the iterates. Infeasible quadratic subproblems are\u0000handled by a feasibility restoration strategy. Globalization is controlled by a\u0000line search or a trust-region method. We prove global convergence of the\u0000trust-region funnel SQP method, building on known results from filter methods.\u0000We implement the algorithm in Uno, and we provide extensive test results for\u0000the trust-region line-search funnel SQP on small CUTEst instances.","PeriodicalId":501286,"journal":{"name":"arXiv - MATH - Optimization and Control","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142254551","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Convergence rate of opinion dynamics with complex interaction types 具有复杂互动类型的舆论动态收敛率
Pub Date : 2024-09-13 DOI: arxiv-2409.09100
Lingling Yao, Aming Li
The convergence rate is a crucial issue in opinion dynamics, whichcharacterizes how quickly opinions reach a consensus and tells when thecollective behavior can be formed. However, the key factors that determine theconvergence rate of opinions are elusive, especially when individuals interactwith complex interaction types such as friend/foe, ally/adversary, ortrust/mistrust. In this paper, using random matrix theory and low-rankperturbation theory, we present a new body of theory to comprehensively studythe convergence rate of opinion dynamics. First, we divide the complexinteraction types into five typical scenarios: mutual trust $(+/+)$, mutualmistrust $(-/-)$, trust$/$mistrust $(+/-)$, unilateral trust $(+/0)$, andunilateral mistrust $(-/0)$. For diverse interaction types, we derive themathematical expression of the convergence rate, and further establish thedirect connection between the convergence rate and population size, the densityof interactions (network connectivity), and individuals' self-confidence level.Second, taking advantage of these connections, we prove that for the $(+/+)$,$(+/-)$, $(+/0)$, and random mixture of different interaction types, theconvergence rate is proportional to the population size and networkconnectivity, while it is inversely proportional to the individuals'self-confidence level. However, for the $(-/-)$ and $(-/0)$ scenarios, we drawthe exact opposite conclusions. Third, for the $(+/+,-/-)$ and $(-/-,-/0)$scenarios, we derive the optimal proportion of different interaction types toensure the fast convergence of opinions. Finally, simulation examples areprovided to illustrate the effectiveness and robustness of our theoreticalfindings.
意见趋同率是意见动力学中的一个关键问题,它描述了意见达成共识的速度,并告诉人们何时可以形成集体行为。然而,决定意见收敛速度的关键因素却难以捉摸,尤其是当个体与复杂的互动类型(如朋友/敌人、盟友/对手或信任/不信任)发生互动时。本文利用随机矩阵理论和低秩扰动理论,提出了一套新的理论体系来全面研究意见动态的趋同率。首先,我们把复杂的互动类型分为五种典型情况:相互信任 $(+/+)$、相互不信任 $(-/-)$、信任 $/$ 不信任 $(+/-)$、单边信任 $(+/0)$、单边不信任 $(-/0)$。其次,利用这些联系,我们证明了对于$(+/+)$、$(+/-)$、$(+/0)$和不同互动类型的随机混合,收敛率与种群规模和网络连接成正比,而与个体的自信水平成反比。然而,对于$(-/-)$和$(-/0)$情景,我们得出了完全相反的结论。第三,对于$(+/+,-/-)$和$(-/-,-/0)$情景,我们推导出了不同互动类型的最佳比例,以确保意见的快速收敛。最后,我们提供了一些模拟实例,以说明我们理论发现的有效性和稳健性。
{"title":"Convergence rate of opinion dynamics with complex interaction types","authors":"Lingling Yao, Aming Li","doi":"arxiv-2409.09100","DOIUrl":"https://doi.org/arxiv-2409.09100","url":null,"abstract":"The convergence rate is a crucial issue in opinion dynamics, which\u0000characterizes how quickly opinions reach a consensus and tells when the\u0000collective behavior can be formed. However, the key factors that determine the\u0000convergence rate of opinions are elusive, especially when individuals interact\u0000with complex interaction types such as friend/foe, ally/adversary, or\u0000trust/mistrust. In this paper, using random matrix theory and low-rank\u0000perturbation theory, we present a new body of theory to comprehensively study\u0000the convergence rate of opinion dynamics. First, we divide the complex\u0000interaction types into five typical scenarios: mutual trust $(+/+)$, mutual\u0000mistrust $(-/-)$, trust$/$mistrust $(+/-)$, unilateral trust $(+/0)$, and\u0000unilateral mistrust $(-/0)$. For diverse interaction types, we derive the\u0000mathematical expression of the convergence rate, and further establish the\u0000direct connection between the convergence rate and population size, the density\u0000of interactions (network connectivity), and individuals' self-confidence level.\u0000Second, taking advantage of these connections, we prove that for the $(+/+)$,\u0000$(+/-)$, $(+/0)$, and random mixture of different interaction types, the\u0000convergence rate is proportional to the population size and network\u0000connectivity, while it is inversely proportional to the individuals'\u0000self-confidence level. However, for the $(-/-)$ and $(-/0)$ scenarios, we draw\u0000the exact opposite conclusions. Third, for the $(+/+,-/-)$ and $(-/-,-/0)$\u0000scenarios, we derive the optimal proportion of different interaction types to\u0000ensure the fast convergence of opinions. Finally, simulation examples are\u0000provided to illustrate the effectiveness and robustness of our theoretical\u0000findings.","PeriodicalId":501286,"journal":{"name":"arXiv - MATH - Optimization and Control","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142254592","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
arXiv - MATH - Optimization and Control
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1