Journal of Optimization Theory and Applications: Latest Articles

Simultaneous Diagonalization Under Weak Regularity and a Characterization
IF 1.9 Zone 3 Mathematics Q2 MATHEMATICS, APPLIED Pub Date: 2024-09-17 DOI: 10.1007/s10957-024-02526-y
Fabián Flores-Bazán, Felipe Opazo

We analyze the simultaneous diagonalization (SD) via congruence property for any two real matrices, and develop sufficient conditions expressed differently from those that have appeared in the last few years. These conditions are established from a different perspective and, in any case, supplement and clarify similar results published elsewhere. Following the point of view of our previous work, we offer some necessary and sufficient conditions for SD that differ in nature from those in Jiang and Li (SIAM J Optim 26:1649–1668, 2016): roughly speaking, our approach is more geometric and requires computing images and kernels of matrices, whereas that of Jiang and Li (SIAM J Optim 26:1649–1668, 2016) requires computing determinants and canonical forms. The two-dimensional case is analyzed in particular, yielding new characterizations that are more precise than those in higher dimensions and that join those given earlier by the authors. In addition, we establish the connection between our characterization of SD and that provided in Jiang and Li (SIAM J Optim 26:1649–1668, 2016).
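
For orientation, the usual notion of SD via congruence can be written as below. This is the textbook definition, stated under the assumption that the two matrices $A$ and $B$ are symmetric; the symbols $P$, $D_A$ and $D_B$ are illustrative and are not taken from the paper.

```latex
A, B \in \mathbb{R}^{n \times n} \text{ symmetric are SD via congruence}
\iff
\exists\, P \in \mathbb{R}^{n \times n} \text{ nonsingular such that }
P^{\top} A P = D_A, \quad P^{\top} B P = D_B, \quad D_A, D_B \text{ diagonal}.
```

A classical sufficient case is $A$ positive definite, for which such a $P$ always exists; the conditions in the abstract concern weaker regularity assumptions.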

Citations: 0
Seeking Consensus on Subspaces in Federated Principal Component Analysis
IF 1.9 Zone 3 Mathematics Q2 MATHEMATICS, APPLIED Pub Date: 2024-09-14 DOI: 10.1007/s10957-024-02523-1
Lei Wang, Xin Liu, Yin Zhang

In this paper, we develop an algorithm for federated principal component analysis (PCA) with emphasis on both communication efficiency and data privacy. Generally speaking, federated PCA algorithms based on direct adaptations of classic iterative methods, such as simultaneous subspace iterations, are unable to preserve data privacy, while algorithms based on variable splitting and consensus seeking, such as the alternating direction method of multipliers (ADMM), lack communication efficiency. In this work, we propose a novel consensus-seeking formulation that equalizes the subspaces spanned by the splitting variables instead of equalizing the variables themselves, thus greatly relaxing feasibility restrictions and allowing much faster convergence. We then develop an ADMM-like algorithm with several special features that make it practically efficient, including a low-rank multiplier formula and techniques for treating subproblems. We establish that the proposed algorithm can better protect data privacy than classic methods adapted to the federated PCA setting. We derive convergence results, including a worst-case complexity estimate, for the proposed ADMM-like algorithm in the presence of the nonlinear equality constraints. Extensive empirical results are presented to show that the new algorithm, while enhancing data privacy, requires far fewer rounds of communication than existing peer algorithms for federated PCA.
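
The difference between the two kinds of consensus can be sketched as follows. The notation is assumed for illustration only: $X_i$ denotes a local matrix with orthonormal columns held by client $i$, $Z$ a shared matrix with orthonormal columns, and $d$ the number of clients. Subspace equality is written here as equality of orthogonal projectors, which may differ from the exact constraint used in the paper.

```latex
\text{Variable consensus:} \quad X_i = Z, \quad i = 1, \dots, d,
\qquad
\text{Subspace consensus:} \quad X_i X_i^{\top} = Z Z^{\top}, \quad i = 1, \dots, d.
```

Under these assumptions the second constraint is satisfied by every $X_i = Z Q_i$ with $Q_i$ orthogonal, so its feasible set is much larger than that of the first.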

Citations: 0
A Multilevel Method for Self-Concordant Minimization
IF 1.9 Zone 3 Mathematics Q2 MATHEMATICS, APPLIED Pub Date: 2024-09-14 DOI: 10.1007/s10957-024-02509-z
Nick Tsipinakis, Panos Parpas

The analysis of second-order optimization methods based on sub-sampling, randomization or sketching has two serious shortcomings compared to the conventional Newton method. The first shortcoming is that the iterates have been shown to be scale-invariant only under specific assumptions on the problem structure. The second is that the fast convergence rates of second-order methods have only been established by making assumptions regarding the input data. In this paper, we propose a randomized Newton method for self-concordant functions to address both shortcomings. We propose a Self-concordant Iterative-minimization-Galerkin-based Multilevel Algorithm (SIGMA) and establish its super-linear convergence rate using the theory of self-concordant functions. Our analysis is based on the connections between multigrid optimization methods and the role of coarse-grained or reduced-order models in the computation of search directions. We take advantage of the insights from the analysis to significantly improve the performance of second-order methods in machine learning applications. We report encouraging initial experiments that suggest SIGMA outperforms other state-of-the-art sub-sampled/sketched Newton methods for both medium and large-scale problems.

Citations: 0
A Descent Method for Nonsmooth Multiobjective Optimization in Hilbert Spaces
IF 1.9 Zone 3 Mathematics Q2 MATHEMATICS, APPLIED Pub Date: 2024-09-12 DOI: 10.1007/s10957-024-02520-4
Konstantin Sonntag, Bennet Gebken, Georg Müller, Sebastian Peitz, Stefan Volkwein

The efficient optimization method for locally Lipschitz continuous multiobjective optimization problems from Gebken and Peitz (J Optim Theory Appl 188:696–723, 2021) is extended from finite-dimensional problems to general Hilbert spaces. The method iteratively computes Pareto critical points, where in each iteration, an approximation of the Clarke subdifferential is computed in an efficient manner and then used to compute a common descent direction for all objective functions. To prove convergence, we present some new optimality results for nonsmooth multiobjective optimization problems in Hilbert spaces. Using these, we can show that every accumulation point of the sequence generated by our algorithm is Pareto critical under common assumptions. Computational efficiency for finding Pareto critical points is numerically demonstrated for multiobjective optimal control of an obstacle problem.
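
The idea of a common descent direction can be illustrated in the simplest setting: two smooth objectives in finite dimension, where the direction is the negative of the minimum-norm point on the segment between the two gradients. This is a minimal sketch of that classical construction, not the method of the paper, which works with an approximation of the Clarke subdifferential in Hilbert spaces; the function names and sample gradients are hypothetical.

```python
def common_descent_direction(g1, g2):
    """Negative of the minimum-norm point on the segment between g1 and g2."""
    diff = [a - b for a, b in zip(g1, g2)]
    denom = sum(d * d for d in diff)
    if denom == 0.0:
        lam = 0.5  # gradients coincide; any weight gives the same point
    else:
        # Unconstrained minimizer of ||lam*g1 + (1-lam)*g2||^2, clipped to [0, 1].
        lam = sum(b * (b - a) for a, b in zip(g1, g2)) / denom
        lam = min(1.0, max(0.0, lam))
    return [-(lam * a + (1.0 - lam) * b) for a, b in zip(g1, g2)]


def dot(u, v):
    """Euclidean inner product of two equally long lists."""
    return sum(a * b for a, b in zip(u, v))


# Hypothetical gradients of two objectives at the current point.
g1 = [1.0, 0.0]
g2 = [0.0, 1.0]
d = common_descent_direction(g1, g2)
print(d, dot(g1, d), dot(g2, d))
```

When the returned direction is nonzero, it has a negative inner product with both gradients, so a small step along it decreases both objectives; a zero direction signals a Pareto critical point.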

Citations: 0
Expected Residual Minimization Formulation for Stochastic Absolute Value Equations
IF 1.9 Zone 3 Mathematics Q2 MATHEMATICS, APPLIED Pub Date: 2024-09-12 DOI: 10.1007/s10957-024-02527-x
Jingyong Tang, Jinchuan Zhou

In this paper we investigate a class of stochastic absolute value equations (SAVE). After establishing the relationship between the stochastic linear complementarity problem and SAVE, we study the expected residual minimization (ERM) formulation for SAVE and its Monte Carlo sample average approximation. In particular, we show that the ERM problem and its sample average approximation have optimal solutions under the condition of an $R_0$ pair, and that the optimal value of the sample average approximation converges uniformly and exponentially. Furthermore, we prove that the solutions to the ERM problem are robust for SAVE. For a class of SAVE problems, we use its special structure to construct a smooth residual and further study the convergence of the stationary points. Finally, a smoothing gradient method that combines sampling and smoothing techniques is proposed for solving SAVE. Numerical experiments demonstrate the effectiveness of the method.
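
In generic form, an ERM problem and its sample average approximation read as below. Here $F(x,\omega)$ stands for the residual of the stochastic equation, $\omega^1,\dots,\omega^N$ for independent samples of the random parameter, and the squared Euclidean norm is assumed as the residual measure; these symbols are placeholders rather than the paper's notation.

```latex
\min_{x \in \mathbb{R}^n} \; \mathbb{E}\big[ \| F(x, \omega) \|^2 \big]
\qquad \text{and} \qquad
\min_{x \in \mathbb{R}^n} \; \frac{1}{N} \sum_{i=1}^{N} \| F(x, \omega^i) \|^2 .
```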

Citations: 0
A Weak Maximum Principle for Discrete Optimal Control Problems with Mixed Constraints
IF 1.9 Zone 3 Mathematics Q2 MATHEMATICS, APPLIED Pub Date: 2024-09-10 DOI: 10.1007/s10957-024-02524-0
Roberto Andreani, John Frank Matos Ascona, Valeriano Antunes de Oliveira

In this study, first-order necessary optimality conditions, in the form of a weak maximum principle, are derived for discrete optimal control problems with mixed equality and inequality constraints. Such conditions are achieved by using the Dubovitskii–Milyutin formalism approach. Nondegenerate conditions are obtained under the constant rank of the subspace component (CRSC) constraint qualification, which is an important generalization of both the Mangasarian–Fromovitz and constant rank constraint qualifications. Beyond its theoretical significance, CRSC has practical importance because it is closely related to the formulation of optimization algorithms. In addition, an instance of a discrete optimal control problem is presented in which CRSC holds while other stronger regularity conditions do not.

Citations: 0
Gradient Descent Provably Escapes Saddle Points in the Training of Shallow ReLU Networks
IF 1.9 Zone 3 Mathematics Q2 MATHEMATICS, APPLIED Pub Date: 2024-09-10 DOI: 10.1007/s10957-024-02513-3
Patrick Cheridito, Arnulf Jentzen, Florian Rossmannek

Dynamical systems theory has recently been applied in optimization to prove that gradient descent algorithms bypass so-called strict saddle points of the loss function. However, in many modern machine learning applications, the required regularity conditions are not satisfied. In this paper, we prove a variant of the relevant dynamical systems result, a center-stable manifold theorem, in which we relax some of the regularity requirements. We explore its relevance for various machine learning tasks, with a particular focus on shallow rectified linear unit (ReLU) and leaky ReLU networks with scalar input. Building on a detailed examination of critical points of the square integral loss function for shallow ReLU and leaky ReLU networks relative to an affine target function, we show that gradient descent circumvents most saddle points. Furthermore, we prove convergence to global minima under favourable initialization conditions, quantified by an explicit threshold on the limiting loss.
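
The phenomenon of bypassing a strict saddle point can be seen on a toy example. The sketch below runs plain gradient descent on the smooth function f(x, y) = 0.5 x^2 + 0.25 y^4 - 0.5 y^2, which has a strict saddle at the origin and minima at (0, 1) and (0, -1). The function, step size and starting point are assumptions chosen for illustration; the paper itself treats the less regular losses of ReLU networks.

```python
def grad(x, y):
    """Gradient of f(x, y) = 0.5*x**2 + 0.25*y**4 - 0.5*y**2."""
    return x, y**3 - y


def gradient_descent(x, y, step=0.1, iters=500):
    """Run a fixed number of plain gradient descent steps from (x, y)."""
    for _ in range(iters):
        gx, gy = grad(x, y)
        x, y = x - step * gx, y - step * gy
    return x, y


# Start very close to the x-axis, which is attracted to the saddle at the origin.
x_final, y_final = gradient_descent(1.0, 1e-6)
print(x_final, y_final)
```

Only starting points exactly on the x-axis converge to the saddle; a tiny perturbation in y is amplified at every step until the iterates settle at the minimum (0, 1).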

Citations: 0
Optimized Fractional-Order Type-2 Fuzzy PID Attitude Controller for Fixed-Wing Aircraft
IF 1.9 Zone 3 Mathematics Q2 MATHEMATICS, APPLIED Pub Date: 2024-09-09 DOI: 10.1007/s10957-024-02512-4
Wenfan Wang, Jun Zhang, Ruili Jiao

This paper addresses the design of an attitude controller for a fixed-wing unmanned aerial vehicle. To handle the complexity of the coupled nonlinear model of a fixed-wing aircraft, it introduces a Fractional-Order Type-2 Fuzzy PID (FOTFPID) controller. The adoption of interval-valued type-2 fuzzy sets, an extension of conventional fuzzy sets, gives decision makers the ability to assign membership and non-membership values as intervals. This enhanced capability facilitates more resilient decision-making processes. The Bat optimization algorithm is also employed to fine-tune the membership functions, scaling factors, and primary controller parameters, aiming to minimize the integrated absolute error index. Numerical simulations are conducted to demonstrate the effectiveness of the proposed controllers in comparison to classical PID controllers, while subjecting the aircraft system to various disturbance conditions.

Citations: 0
Convergence-Accelerated Fixed-Time Dynamical Methods for Absolute Value Equations
IF 1.9 Zone 3 Mathematics Q2 MATHEMATICS, APPLIED Pub Date: 2024-09-09 DOI: 10.1007/s10957-024-02525-z
Xu Zhang, Cailian Li, Longcheng Zhang, Yaling Hu, Zheng Peng

Two new accelerated fixed-time stable dynamic systems are proposed for solving absolute value equations (AVEs): $Ax-|x|-b=0$. Under some mild conditions, the equilibrium point of the proposed dynamic systems is completely equivalent to the solution of the AVEs under consideration. Meanwhile, we have introduced a new relatively tighter global error bound for the AVEs. Leveraging this finding, we have separately established the globally fixed-time stability of the proposed methods, along with providing the conservative settling-time for each method. Compared with some existing state-of-the-art dynamical methods, preliminary numerical experiments show the effectiveness of our methods in solving the AVEs.
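
To make the equation concrete, the sketch below evaluates the AVE residual $Ax-|x|-b$ and solves a small instance with the classical fixed-point iteration x <- A^{-1}(|x| + b), which is a contraction when the spectral norm of A^{-1} is below 1. This is a swapped-in textbook scheme, not the fixed-time dynamical systems of the paper; the 2x2 matrix, the helper names and the iteration count are assumptions made for illustration.

```python
def ave_residual(A, x, b):
    """Return the vector A x - |x| - b for a dense matrix A given as a list of rows."""
    return [sum(a_ij * x_j for a_ij, x_j in zip(row, x)) - abs(x_i) - b_i
            for row, x_i, b_i in zip(A, x, b)]


def solve_2x2(A, r):
    """Solve the 2x2 linear system A y = r by Cramer's rule."""
    (a, b_), (c, d) = A
    det = a * d - b_ * c
    return [(r[0] * d - b_ * r[1]) / det, (a * r[1] - c * r[0]) / det]


def picard_ave(A, b, iters=100):
    """Fixed-point iteration x <- A^{-1}(|x| + b) for a 2x2 AVE."""
    x = [0.0, 0.0]
    for _ in range(iters):
        x = solve_2x2(A, [abs(x_i) + b_i for x_i, b_i in zip(x, b)])
    return x


# Hypothetical instance: build b from a known solution so the answer can be checked.
A = [[4.0, 1.0], [1.0, 3.0]]
x_true = [1.0, -2.0]
b = [sum(a_ij * x_j for a_ij, x_j in zip(row, x_true)) - abs(x_i)
     for row, x_i in zip(A, x_true)]

x = picard_ave(A, b)
print(x, ave_residual(A, x, b))
```

For this matrix both eigenvalues exceed 2, so the iteration contracts by a factor below 0.5 per step and recovers the known solution.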

Citations: 0
Metric Subregularity and $\omega(\cdot)$-Normal Regularity Properties
IF 1.9 Zone 3 Mathematics Q2 MATHEMATICS, APPLIED Pub Date: 2024-09-04 DOI: 10.1007/s10957-024-02476-5
Florent Nacry, Vo Anh Thuong Nguyen, Juliette Venel

In this paper, we establish through an openness condition the metric subregularity of a multimapping with normal $\omega(\cdot)$-regularity of either the graph or values. Various preservation results for prox-regular and subsmooth sets are also provided.

Citations: 0