首页 > 最新文献

Journal of Optimization Theory and Applications最新文献

英文 中文
Stochastic Augmented Lagrangian Method in Riemannian Shape Manifolds 黎曼形状芒模中的随机增量拉格朗日法
IF 1.9 3区 数学 Q2 MATHEMATICS, APPLIED Pub Date : 2024-08-21 DOI: 10.1007/s10957-024-02488-1
Caroline Geiersbach, Tim Suchan, Kathrin Welker

In this paper, we present a stochastic augmented Lagrangian approach on (possibly infinite-dimensional) Riemannian manifolds to solve stochastic optimization problems with a finite number of deterministic constraints. We investigate the convergence of the method, which is based on a stochastic approximation approach with random stopping combined with an iterative procedure for updating Lagrange multipliers. The algorithm is applied to a multi-shape optimization problem with geometric constraints and demonstrated numerically.

在本文中,我们提出了一种(可能是无限维的)黎曼流形上的随机增强拉格朗日方法,用于解决带有有限数量确定性约束的随机优化问题。我们对该方法的收敛性进行了研究,该方法基于随机逼近方法,并将随机停止与更新拉格朗日乘数的迭代程序相结合。我们将该算法应用于具有几何约束的多形状优化问题,并进行了数值演示。
{"title":"Stochastic Augmented Lagrangian Method in Riemannian Shape Manifolds","authors":"Caroline Geiersbach, Tim Suchan, Kathrin Welker","doi":"10.1007/s10957-024-02488-1","DOIUrl":"https://doi.org/10.1007/s10957-024-02488-1","url":null,"abstract":"<p>In this paper, we present a stochastic augmented Lagrangian approach on (possibly infinite-dimensional) Riemannian manifolds to solve stochastic optimization problems with a finite number of deterministic constraints. We investigate the convergence of the method, which is based on a stochastic approximation approach with random stopping combined with an iterative procedure for updating Lagrange multipliers. The algorithm is applied to a multi-shape optimization problem with geometric constraints and demonstrated numerically.</p>","PeriodicalId":50100,"journal":{"name":"Journal of Optimization Theory and Applications","volume":"159 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142186671","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Regularized and Structured Tensor Total Least Squares Methods with Applications 正规化和结构化张量最小二乘法及其应用
IF 1.9 3区 数学 Q2 MATHEMATICS, APPLIED Pub Date : 2024-08-12 DOI: 10.1007/s10957-024-02507-1
Feiyang Han, Yimin Wei, Pengpeng Xie

Total least squares (TLS), also named as errors in variables in statistical analysis, is an effective method for solving linear equations with the situations, when noise is not just in observation data but also in mapping operations. Besides, the Tikhonov regularization is widely considered in plenty of ill-posed problems. Moreover, the structure of mapping operator plays a crucial role in solving the TLS problem. Tensor operators have some advantages over the characterization of models, which requires us to build the corresponding theory on the tensor TLS. This paper proposes tensor regularized TLS and structured tensor TLS methods for solving ill-conditioned and structured tensor equations, respectively, adopting a tensor-tensor-product. Properties and algorithms for the solution of these approaches are also presented and proved. Based on this method, some applications in image and video deblurring are explored. Numerical examples illustrate the effectiveness of our methods, compared with some existing methods.

全最小二乘法(TLS)在统计分析中也被称为变量误差法,是解决线性方程的一种有效方法。此外,Tikhonov 正则化还被广泛应用于许多问题的求解。此外,映射算子的结构在解决 TLS 问题中起着至关重要的作用。张量算子在模型表征方面具有一定的优势,这就要求我们建立相应的张量 TLS 理论。本文提出了张量正则化 TLS 和结构化张量 TLS 方法,分别采用张量-张量乘积求解非条件张量方程和结构化张量方程。此外,还提出并证明了这些方法的性质和求解算法。基于这种方法,探讨了图像和视频去模糊的一些应用。数值示例说明了我们的方法与一些现有方法相比的有效性。
{"title":"Regularized and Structured Tensor Total Least Squares Methods with Applications","authors":"Feiyang Han, Yimin Wei, Pengpeng Xie","doi":"10.1007/s10957-024-02507-1","DOIUrl":"https://doi.org/10.1007/s10957-024-02507-1","url":null,"abstract":"<p>Total least squares (TLS), also named as errors in variables in statistical analysis, is an effective method for solving linear equations with the situations, when noise is not just in observation data but also in mapping operations. Besides, the Tikhonov regularization is widely considered in plenty of ill-posed problems. Moreover, the structure of mapping operator plays a crucial role in solving the TLS problem. Tensor operators have some advantages over the characterization of models, which requires us to build the corresponding theory on the tensor TLS. This paper proposes tensor regularized TLS and structured tensor TLS methods for solving ill-conditioned and structured tensor equations, respectively, adopting a tensor-tensor-product. Properties and algorithms for the solution of these approaches are also presented and proved. Based on this method, some applications in image and video deblurring are explored. Numerical examples illustrate the effectiveness of our methods, compared with some existing methods.</p>","PeriodicalId":50100,"journal":{"name":"Journal of Optimization Theory and Applications","volume":"2 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142186672","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Continuous Equality Knapsack with Probit-Style Objectives 带有 Probit 类型目标的连续等价包
IF 1.9 3区 数学 Q2 MATHEMATICS, APPLIED Pub Date : 2024-08-12 DOI: 10.1007/s10957-024-02503-5
Jamie Fravel, Robert Hildebrand, Laurel Travis

We study continuous, equality knapsack problems with uniform separable, non-convex objective functions that are continuous, antisymmetric about a point, and have concave and convex regions. For example, this model captures a simple allocation problem with the goal of optimizing an expected value where the objective is a sum of cumulative distribution functions of identically distributed normal distributions (i.e., a sum of inverse probit functions). We prove structural results of this model under general assumptions and provide two algorithms for efficient optimization: (1) running in linear time and (2) running in a constant number of operations given preprocessing of the objective function.

我们研究的是连续、相等、具有均匀可分、非凸目标函数的knapsack问题,这些目标函数是连续的、关于一点的非对称的,并且有凹和凸区域。例如,该模型捕捉了一个简单的分配问题,其目标是优化预期值,目标函数是同分布正态分布的累积分布函数之和(即逆概率函数之和)。我们证明了该模型在一般假设下的结构性结果,并提供了两种高效优化算法:(1) 在线性时间内运行;(2) 在对目标函数进行预处理的情况下,以恒定的运算次数运行。
{"title":"Continuous Equality Knapsack with Probit-Style Objectives","authors":"Jamie Fravel, Robert Hildebrand, Laurel Travis","doi":"10.1007/s10957-024-02503-5","DOIUrl":"https://doi.org/10.1007/s10957-024-02503-5","url":null,"abstract":"<p>We study continuous, equality knapsack problems with uniform separable, non-convex objective functions that are continuous, antisymmetric about a point, and have concave and convex regions. For example, this model captures a simple allocation problem with the goal of optimizing an expected value where the objective is a sum of cumulative distribution functions of identically distributed normal distributions (i.e., a sum of inverse probit functions). We prove structural results of this model under general assumptions and provide two algorithms for efficient optimization: (1) running in linear time and (2) running in a constant number of operations given preprocessing of the objective function.</p>","PeriodicalId":50100,"journal":{"name":"Journal of Optimization Theory and Applications","volume":"160 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141935095","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimal Actuator Location of the Norm Optimal Controls for Degenerate Parabolic Equations 畸变抛物方程的规范最优控制的最佳执行器位置
IF 1.9 3区 数学 Q2 MATHEMATICS, APPLIED Pub Date : 2024-08-06 DOI: 10.1007/s10957-024-02498-z
Yuanhang Liu, Weijia Wu, Donghui Yang

This paper focuses on investigating the optimal actuator location for achieving minimum norm controls in the context of approximate controllability for degenerate parabolic equations. We propose a formulation of the optimization problem that encompasses both the actuator location and its associated minimum norm control. Specifically, we transform the problem into a two-person zero-sum game problem, resulting in the development of four equivalent formulations. Finally, we establish the crucial result that the solution to the relaxed optimization problem serves as an optimal actuator location for the classical problem.

本文重点研究在退化抛物方程近似可控性的背景下,实现最小规范控制的最优致动器位置。我们提出了一种优化问题的表述方式,既包括致动器位置,也包括相关的最小规范控制。具体来说,我们将该问题转化为两人零和博弈问题,从而提出了四种等效公式。最后,我们得出了一个重要结果,即松弛优化问题的解决方案可以作为经典问题的最优致动器位置。
{"title":"Optimal Actuator Location of the Norm Optimal Controls for Degenerate Parabolic Equations","authors":"Yuanhang Liu, Weijia Wu, Donghui Yang","doi":"10.1007/s10957-024-02498-z","DOIUrl":"https://doi.org/10.1007/s10957-024-02498-z","url":null,"abstract":"<p>This paper focuses on investigating the optimal actuator location for achieving minimum norm controls in the context of approximate controllability for degenerate parabolic equations. We propose a formulation of the optimization problem that encompasses both the actuator location and its associated minimum norm control. Specifically, we transform the problem into a two-person zero-sum game problem, resulting in the development of four equivalent formulations. Finally, we establish the crucial result that the solution to the relaxed optimization problem serves as an optimal actuator location for the classical problem.</p>","PeriodicalId":50100,"journal":{"name":"Journal of Optimization Theory and Applications","volume":"160 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141935096","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Most Iterations of Projections Converge 大多数迭代预测趋于一致
IF 1.9 3区 数学 Q2 MATHEMATICS, APPLIED Pub Date : 2024-08-05 DOI: 10.1007/s10957-024-02504-4
Daylen K. Thimm

Consider three closed linear subspaces (C_1, C_2,) and (C_3) of a Hilbert space H and the orthogonal projections (P_1, P_2) and (P_3) onto them. Halperin showed that a point in (C_1cap C_2 cap C_3) can be found by iteratively projecting any point (x_0 in H) onto all the sets in a periodic fashion. The limit point is then the projection of (x_0) onto (C_1cap C_2 cap C_3). Nevertheless, a non-periodic projection order may lead to a non-convergent projection series, as shown by Kopecká, Müller, and Paszkiewicz. This raises the question how many projection orders in ({1,2,3}^{mathbb {N}}) are “well behaved” in the sense that they lead to a convergent projection series. Melo, da Cruz Neto, and de Brito provided a necessary and sufficient condition under which the projection series converges and showed that the “well behaved” projection orders form a large subset in the sense of having full product measure. We show that also from a topological viewpoint the set of “well behaved” projection orders is a large subset: it contains a dense (G_delta ) subset with respect to the product topology. Furthermore, we analyze why the proof of the measure theoretic case cannot be directly adapted to the topological setting.

考虑希尔伯特空间 H 的三个封闭线性子空间 (C_1, C_2, )和 (C_3),以及到它们的正交投影 (P_1, P_2) 和 (P_3)。哈尔佩林(Halperin)证明,通过以周期性的方式将任意点(x_0in H )迭代投影到所有集合上,就可以找到(C_1cap C_2 cap C_3)中的一个点。极限点就是 (x_0) 在 (C_1cap C_2cap C_3) 上的投影。然而,正如 Kopecká、Müller 和 Paszkiewicz 所证明的那样,非周期性投影阶可能会导致投影序列的非收敛性。这就提出了一个问题:在 ({1,2,3}^{mathbb {N}}) 中,有多少投影阶是 "表现良好 "的,即它们会导致收敛投影序列。梅洛、达-克鲁兹-内托和德-布里托提供了投影序列收敛的必要条件和充分条件,并证明 "表现良好 "的投影阶在具有全积度量的意义上形成了一个大子集。我们证明,从拓扑学的角度来看,"表现良好 "的投影阶的集合也是一个大子集:它包含一个关于乘积拓扑学的密集(G_delta )子集。此外,我们还分析了为什么度量理论情形的证明不能直接应用于拓扑情形。
{"title":"Most Iterations of Projections Converge","authors":"Daylen K. Thimm","doi":"10.1007/s10957-024-02504-4","DOIUrl":"https://doi.org/10.1007/s10957-024-02504-4","url":null,"abstract":"<p>Consider three closed linear subspaces <span>(C_1, C_2,)</span> and <span>(C_3)</span> of a Hilbert space <i>H</i> and the orthogonal projections <span>(P_1, P_2)</span> and <span>(P_3)</span> onto them. Halperin showed that a point in <span>(C_1cap C_2 cap C_3)</span> can be found by iteratively projecting any point <span>(x_0 in H)</span> onto all the sets in a periodic fashion. The limit point is then the projection of <span>(x_0)</span> onto <span>(C_1cap C_2 cap C_3)</span>. Nevertheless, a non-periodic projection order may lead to a non-convergent projection series, as shown by Kopecká, Müller, and Paszkiewicz. This raises the question how many projection orders in <span>({1,2,3}^{mathbb {N}})</span> are “well behaved” in the sense that they lead to a convergent projection series. Melo, da Cruz Neto, and de Brito provided a necessary and sufficient condition under which the projection series converges and showed that the “well behaved” projection orders form a large subset in the sense of having full product measure. We show that also from a topological viewpoint the set of “well behaved” projection orders is a large subset: it contains a dense <span>(G_delta )</span> subset with respect to the product topology. Furthermore, we analyze why the proof of the measure theoretic case cannot be directly adapted to the topological setting.</p>","PeriodicalId":50100,"journal":{"name":"Journal of Optimization Theory and Applications","volume":"9 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141935102","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Specifying and Solving Robust Empirical Risk Minimization Problems Using CVXPY 使用 CVXPY 指定和解决稳健经验风险最小化问题
IF 1.9 3区 数学 Q2 MATHEMATICS, APPLIED Pub Date : 2024-08-04 DOI: 10.1007/s10957-024-02491-6
Eric Luxenberg, Dhruv Malik, Yuanzhi Li, Aarti Singh, Stephen Boyd

We consider robust empirical risk minimization (ERM), where model parameters are chosen to minimize the worst-case empirical loss when each data point varies over a given convex uncertainty set. In some simple cases, such problems can be expressed in an analytical form. In general the problem can be made tractable via dualization, which turns a min-max problem into a min-min problem. Dualization requires expertise and is tedious and error-prone. We demonstrate how CVXPY can be used to automate this dualization procedure in a user-friendly manner. Our framework allows practitioners to specify and solve robust ERM problems with a general class of convex losses, capturing many standard regression and classification problems. Users can easily specify any complex uncertainty set that is representable via disciplined convex programming (DCP) constraints.

我们考虑的是稳健经验风险最小化(ERM)问题,即当每个数据点在给定的凸不确定性集合上变化时,选择模型参数以最小化最坏情况下的经验损失。在某些简单的情况下,这类问题可以用分析形式表达。一般情况下,可以通过二元化使问题变得简单,即把最小-最大问题转化为最小-最小问题。二元化需要专业知识,既繁琐又容易出错。我们展示了如何利用 CVXPY 以用户友好的方式自动完成这种二元化过程。我们的框架允许从业人员指定和解决具有一般类凸损失的稳健 ERM 问题,包括许多标准回归和分类问题。用户可以轻松指定任何复杂的不确定性集,这些不确定性集可以通过约束凸编程(DCP)约束来表示。
{"title":"Specifying and Solving Robust Empirical Risk Minimization Problems Using CVXPY","authors":"Eric Luxenberg, Dhruv Malik, Yuanzhi Li, Aarti Singh, Stephen Boyd","doi":"10.1007/s10957-024-02491-6","DOIUrl":"https://doi.org/10.1007/s10957-024-02491-6","url":null,"abstract":"<p>We consider robust empirical risk minimization (ERM), where model parameters are chosen to minimize the worst-case empirical loss when each data point varies over a given convex uncertainty set. In some simple cases, such problems can be expressed in an analytical form. In general the problem can be made tractable via dualization, which turns a min-max problem into a min-min problem. Dualization requires expertise and is tedious and error-prone. We demonstrate how CVXPY can be used to automate this dualization procedure in a user-friendly manner. Our framework allows practitioners to specify and solve robust ERM problems with a general class of convex losses, capturing many standard regression and classification problems. Users can easily specify any complex uncertainty set that is representable via disciplined convex programming (DCP) constraints.</p>","PeriodicalId":50100,"journal":{"name":"Journal of Optimization Theory and Applications","volume":"9 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-08-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141935014","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Convergence Analysis of a New Forward-Reflected-Backward Algorithm for Four Operators Without Cocoercivity 四算子无矫顽力新正向-反射-后向算法的收敛性分析
IF 1.9 3区 数学 Q2 MATHEMATICS, APPLIED Pub Date : 2024-08-02 DOI: 10.1007/s10957-024-02501-7
Yu Cao, Yuanheng Wang, Habib ur Rehman, Yekini Shehu, Jen-Chih Yao

In this paper, we propose a new splitting algorithm to find the zero of a monotone inclusion problem that features the sum of three maximal monotone operators and a Lipschitz continuous monotone operator in Hilbert spaces. We prove that the sequence of iterates generated by our proposed splitting algorithm converges weakly to the zero of the considered inclusion problem under mild conditions on the iterative parameters. Several splitting algorithms in the literature are recovered as special cases of our proposed algorithm. Another interesting feature of our algorithm is that one forward evaluation of the Lipschitz continuous monotone operator is utilized at each iteration. Numerical results are given to support the theoretical analysis.

在本文中,我们提出了一种新的分裂算法,用于寻找单调包含问题的零点,该问题的特征是希尔伯特空间中三个最大单调算子与一个利普希兹连续单调算子之和。我们证明,在迭代参数的温和条件下,我们提出的分裂算法产生的迭代序列弱收敛于所考虑的包含问题的零点。文献中的几种分裂算法都是我们提出的算法的特例。我们算法的另一个有趣特点是,每次迭代都会对 Lipschitz 连续单调算子进行一次前向评估。我们给出了数值结果来支持理论分析。
{"title":"Convergence Analysis of a New Forward-Reflected-Backward Algorithm for Four Operators Without Cocoercivity","authors":"Yu Cao, Yuanheng Wang, Habib ur Rehman, Yekini Shehu, Jen-Chih Yao","doi":"10.1007/s10957-024-02501-7","DOIUrl":"https://doi.org/10.1007/s10957-024-02501-7","url":null,"abstract":"<p>In this paper, we propose a new splitting algorithm to find the zero of a monotone inclusion problem that features the sum of three maximal monotone operators and a Lipschitz continuous monotone operator in Hilbert spaces. We prove that the sequence of iterates generated by our proposed splitting algorithm converges weakly to the zero of the considered inclusion problem under mild conditions on the iterative parameters. Several splitting algorithms in the literature are recovered as special cases of our proposed algorithm. Another interesting feature of our algorithm is that one forward evaluation of the Lipschitz continuous monotone operator is utilized at each iteration. Numerical results are given to support the theoretical analysis.</p>","PeriodicalId":50100,"journal":{"name":"Journal of Optimization Theory and Applications","volume":"26 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141882507","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Variance Reduction Techniques for Stochastic Proximal Point Algorithms 随机近点算法的方差缩小技术
IF 1.9 3区 数学 Q2 MATHEMATICS, APPLIED Pub Date : 2024-08-02 DOI: 10.1007/s10957-024-02502-6
Cheik Traoré, Vassilis Apidopoulos, Saverio Salzo, Silvia Villa

In the context of finite sums minimization, variance reduction techniques are widely used to improve the performance of state-of-the-art stochastic gradient methods. Their practical impact is clear, as well as their theoretical properties. Stochastic proximal point algorithms have been studied as an alternative to stochastic gradient algorithms since they are more stable with respect to the choice of the step size. However, their variance-reduced versions are not as well studied as the gradient ones. In this work, we propose the first unified study of variance reduction techniques for stochastic proximal point algorithms. We introduce a generic stochastic proximal-based algorithm that can be specified to give the proximal version of SVRG, SAGA, and some of their variants. For this algorithm, in the smooth setting, we provide several convergence rates for the iterates and the objective function values, which are faster than those of the vanilla stochastic proximal point algorithm. More specifically, for convex functions, we prove a sublinear convergence rate of O(1/k). In addition, under the Polyak-łojasiewicz condition, we obtain linear convergence rates. Finally, our numerical experiments demonstrate the advantages of the proximal variance reduction methods over their gradient counterparts in terms of the stability with respect to the choice of the step size in most cases, especially for difficult problems.

在有限和最小化的背景下,方差缩小技术被广泛用于改善最先进的随机梯度方法的性能。它们的实际影响和理论特性都是显而易见的。随机近点算法作为随机梯度算法的一种替代方法被研究,因为它们在步长的选择上更加稳定。然而,对其方差缩小版本的研究却不如梯度算法。在这项工作中,我们首次提出了随机近点算法方差缩小技术的统一研究。我们介绍了一种基于随机近点的通用算法,它可以指定为 SVRG、SAGA 及其一些变体的近点版本。对于这种算法,在平滑设置中,我们提供了迭代和目标函数值的几种收敛速率,它们比普通随机近似点算法更快。更具体地说,对于凸函数,我们证明了 O(1/k)的亚线性收敛率。此外,在 Polyak-łojasiewicz 条件下,我们获得了线性收敛速率。最后,我们的数值实验证明了近似方差缩小方法在大多数情况下,尤其是在困难问题上,与梯度方法相比,在步长选择的稳定性方面具有优势。
{"title":"Variance Reduction Techniques for Stochastic Proximal Point Algorithms","authors":"Cheik Traoré, Vassilis Apidopoulos, Saverio Salzo, Silvia Villa","doi":"10.1007/s10957-024-02502-6","DOIUrl":"https://doi.org/10.1007/s10957-024-02502-6","url":null,"abstract":"<p>In the context of finite sums minimization, variance reduction techniques are widely used to improve the performance of state-of-the-art stochastic gradient methods. Their practical impact is clear, as well as their theoretical properties. Stochastic proximal point algorithms have been studied as an alternative to stochastic gradient algorithms since they are more stable with respect to the choice of the step size. However, their variance-reduced versions are not as well studied as the gradient ones. In this work, we propose the first unified study of variance reduction techniques for stochastic proximal point algorithms. We introduce a generic stochastic proximal-based algorithm that can be specified to give the proximal version of SVRG, SAGA, and some of their variants. For this algorithm, in the smooth setting, we provide several convergence rates for the iterates and the objective function values, which are faster than those of the vanilla stochastic proximal point algorithm. More specifically, for convex functions, we prove a sublinear convergence rate of <i>O</i>(1/<i>k</i>). In addition, under the Polyak-łojasiewicz condition, we obtain linear convergence rates. Finally, our numerical experiments demonstrate the advantages of the proximal variance reduction methods over their gradient counterparts in terms of the stability with respect to the choice of the step size in most cases, especially for difficult problems.</p>","PeriodicalId":50100,"journal":{"name":"Journal of Optimization Theory and Applications","volume":"5 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141882639","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Method for Multi-Leader–Multi-Follower Games by Smoothing the Followers’ Response Function 通过平滑追随者的响应函数实现多领导者与多追随者博弈的方法
IF 1.9 3区 数学 Q2 MATHEMATICS, APPLIED Pub Date : 2024-08-02 DOI: 10.1007/s10957-024-02506-2
Atsushi Hori, Daisuke Tsuyuguchi, Ellen H. Fukuda

The multi-leader–multi-follower game (MLMFG) involves two or more leaders and followers and serves as a generalization of the Stackelberg game and the single-leader–multi-follower game. Although MLMFG covers wide range of real-world applications, its research is still sparse. Notably, fundamental solution methods for this class of problems remain insufficiently established. A prevailing approach is to recast the MLMFG as an equilibrium problem with equilibrium constraints (EPEC) and solve it using a solver. Meanwhile, interpreting the solution to the EPEC in the context of MLMFG may be complex due to shared decision variables among all leaders, followers’ strategies that each leader can unilaterally change, but the variables are essentially controlled by followers. To address this issue, we introduce a response function of followers’ noncooperative game that is a function with leaders’ strategies as a variable. Employing this approach allows the MLMFG to be solved as a single-level differentiable variational inequality using a smoothing scheme for the followers’ response function. We also demonstrate that the sequence of solutions to the smoothed variational inequality converges to a stationary equilibrium of the MLMFG. Finally, we illustrate the behavior of the smoothing method by numerical experiments.

多领导者-多追随者博弈(MLMFG)涉及两个或两个以上的领导者和追随者,是对斯泰尔伯格博弈和单领导者-多追随者博弈的概括。虽然 MLMFG 在现实世界中应用广泛,但对它的研究仍然很少。值得注意的是,该类问题的基本求解方法仍然不够成熟。目前流行的一种方法是将 MLMFG 重塑为带均衡约束的均衡问题(EPEC),并使用求解器进行求解。同时,在 MLMFG 的背景下解释 EPEC 的解可能很复杂,因为所有领导者共享决策变量,每个领导者可以单方面改变追随者的策略,但变量基本上由追随者控制。为了解决这个问题,我们引入了追随者非合作博弈的响应函数,它是一个以领导者策略为变量的函数。采用这种方法,就可以利用追随者响应函数的平滑方案,将 MLMFG 作为单级可变不等式求解。我们还证明,平滑变分不等式的解序列收敛于 MLMFG 的静态均衡。最后,我们通过数值实验说明了平滑方法的行为。
{"title":"A Method for Multi-Leader–Multi-Follower Games by Smoothing the Followers’ Response Function","authors":"Atsushi Hori, Daisuke Tsuyuguchi, Ellen H. Fukuda","doi":"10.1007/s10957-024-02506-2","DOIUrl":"https://doi.org/10.1007/s10957-024-02506-2","url":null,"abstract":"<p>The multi-leader–multi-follower game (MLMFG) involves two or more leaders and followers and serves as a generalization of the Stackelberg game and the single-leader–multi-follower game. Although MLMFG covers wide range of real-world applications, its research is still sparse. Notably, fundamental solution methods for this class of problems remain insufficiently established. A prevailing approach is to recast the MLMFG as an equilibrium problem with equilibrium constraints (EPEC) and solve it using a solver. Meanwhile, interpreting the solution to the EPEC in the context of MLMFG may be complex due to shared decision variables among all leaders, followers’ strategies that each leader can unilaterally change, but the variables are essentially controlled by followers. To address this issue, we introduce a response function of followers’ noncooperative game that is a function with leaders’ strategies as a variable. Employing this approach allows the MLMFG to be solved as a single-level differentiable variational inequality using a smoothing scheme for the followers’ response function. We also demonstrate that the sequence of solutions to the smoothed variational inequality converges to a stationary equilibrium of the MLMFG. Finally, we illustrate the behavior of the smoothing method by numerical experiments.</p>","PeriodicalId":50100,"journal":{"name":"Journal of Optimization Theory and Applications","volume":"22 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141882506","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Pontryagin’s Maximum Principle for a State-Constrained System of Douglis-Nirenberg Type 道格里斯-尼伦伯格类型状态受限系统的庞特里亚金最大原则
IF 1.9 3区 数学 Q2 MATHEMATICS, APPLIED Pub Date : 2024-08-01 DOI: 10.1007/s10957-024-02499-y
Alexey S. Matveev, Dmitrii V. Sugak

This article is concerned with optimal control problems for plants described by systems of high order nonlinear PDE’s (whose linear approximation is elliptic in the sense of Douglis-Nirenberg), with a special attention being given to their particular case: the standard stationary system of non-linear Navier–Stokes equations. The objective is to establish an analog of the Pontryagin’s maximum principle. The major challenge stems from the presence of infinitely many point-wise constraints on the system’s state, which are imposed at any point from a given continuum set of independent variables. Necessary conditions for optimality in the form of an “abstract” maximum principle are first obtained for a general optimal control problem couched in the language of functional analysis. This result is targeted at a wide class of problems, with an idea to absorb, in its proof, a great deal of technical work needed for derivation of optimality conditions so that only an interpretation of the discussed result would be basically needed to handle a particular problem. The applicability of this approach is demonstrated via obtaining the afore-mentioned analog of the Pontryagin’s maximum principle for a state-constrained system of high-order elliptic equations and the Navier–Stokes equations.

本文主要研究高阶非线性 PDE 系统(其线性近似为 Douglis-Nirenberg 意义上的椭圆)所描述的植物最优控制问题,特别关注其特殊情况:非线性 Navier-Stokes 方程的标准静态系统。我们的目标是建立庞特里亚金最大原理的类似模型。主要挑战来自于对系统状态的无穷多个点向约束,这些约束是在给定自变量连续集的任意点上施加的。我们首先以 "抽象 "最大值原理的形式,为以函数分析语言表述的一般最优控制问题获得了最优性的必要条件。这一结果针对的是一类广泛的问题,其想法是在证明过程中吸收推导最优性条件所需的大量技术工作,这样,在处理特定问题时,基本上只需要对所讨论的结果进行解释。通过对高阶椭圆方程和纳维-斯托克斯方程的状态约束系统进行上述庞特里亚金最大原理的模拟,证明了这种方法的适用性。
{"title":"Pontryagin’s Maximum Principle for a State-Constrained System of Douglis-Nirenberg Type","authors":"Alexey S. Matveev, Dmitrii V. Sugak","doi":"10.1007/s10957-024-02499-y","DOIUrl":"https://doi.org/10.1007/s10957-024-02499-y","url":null,"abstract":"<p>This article is concerned with optimal control problems for plants described by systems of high order nonlinear PDE’s (whose linear approximation is elliptic in the sense of Douglis-Nirenberg), with a special attention being given to their particular case: the standard stationary system of non-linear Navier–Stokes equations. The objective is to establish an analog of the Pontryagin’s maximum principle. The major challenge stems from the presence of infinitely many point-wise constraints on the system’s state, which are imposed at any point from a given continuum set of independent variables. Necessary conditions for optimality in the form of an “abstract” maximum principle are first obtained for a general optimal control problem couched in the language of functional analysis. This result is targeted at a wide class of problems, with an idea to absorb, in its proof, a great deal of technical work needed for derivation of optimality conditions so that only an interpretation of the discussed result would be basically needed to handle a particular problem. The applicability of this approach is demonstrated via obtaining the afore-mentioned analog of the Pontryagin’s maximum principle for a state-constrained system of high-order elliptic equations and the Navier–Stokes equations.\u0000</p>","PeriodicalId":50100,"journal":{"name":"Journal of Optimization Theory and Applications","volume":"4 1","pages":""},"PeriodicalIF":1.9,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141882505","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Optimization Theory and Applications
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1