{"title":"p正则化最小二乘(0","authors":"Masahiro Yukawa, S. Amari","doi":"10.1109/TIT.2015.2501362","DOIUrl":null,"url":null,"abstract":"This paper elucidates the underlying structures of ℓ<sub>p</sub>-regularized least squares problems in the nonconvex case of 0 <; p <; 1. The difference between two formulations is highlighted (which does not occur in the convex case of p = 1): 1) an ℓ<sub>p</sub>-constrained optimization (P<sub>c</sub><sup>p</sup>) and 2) an ℓ<sub>p</sub>-penalized (unconstrained) optimization (L<sub>λ</sub><sup>p</sup>). It is shown that the solution path of (L<sub>λ</sub><sup>p</sup>) is discontinuous and also a part of the solution path of (P<sub>c</sub><sup>p</sup>). As an alternative to the solution path, a critical path is considered, which is a maximal continuous curve consisting of critical points. Critical paths are piecewise smooth, as can be seen from the viewpoint of the variational method, and generally contain non-optimal points, such as saddle points and local maxima as well as global/local minima. Our study reveals multiplicity (non-monotonicity) in the correspondence between the regularization parameters of (P<sub>c</sub><sup>p</sup>) and (L<sub>λ</sub><sup>p</sup> ). Two particular paths of critical points connecting the origin and an ordinary least squares (OLS) solution are studied further. One is a main path starting at an OLS solution, and the other is a greedy path starting at the origin. Part of the greedy path can be constructed with a generalized Minkowskian gradient. This paper of greedy path leads to a nontrivial close-link between the optimization problem of ℓ<sub>p</sub>-regularized least squares and the greedy method of orthogonal matching pursuit.","PeriodicalId":13250,"journal":{"name":"IEEE Trans. Inf. Theory","volume":"186 1","pages":"488-502"},"PeriodicalIF":0.0000,"publicationDate":"2016-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"17","resultStr":"{\"title\":\"ℓp-Regularized Least Squares (0\",\"authors\":\"Masahiro Yukawa, S. Amari\",\"doi\":\"10.1109/TIT.2015.2501362\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper elucidates the underlying structures of ℓ<sub>p</sub>-regularized least squares problems in the nonconvex case of 0 <; p <; 1. The difference between two formulations is highlighted (which does not occur in the convex case of p = 1): 1) an ℓ<sub>p</sub>-constrained optimization (P<sub>c</sub><sup>p</sup>) and 2) an ℓ<sub>p</sub>-penalized (unconstrained) optimization (L<sub>λ</sub><sup>p</sup>). It is shown that the solution path of (L<sub>λ</sub><sup>p</sup>) is discontinuous and also a part of the solution path of (P<sub>c</sub><sup>p</sup>). As an alternative to the solution path, a critical path is considered, which is a maximal continuous curve consisting of critical points. Critical paths are piecewise smooth, as can be seen from the viewpoint of the variational method, and generally contain non-optimal points, such as saddle points and local maxima as well as global/local minima. Our study reveals multiplicity (non-monotonicity) in the correspondence between the regularization parameters of (P<sub>c</sub><sup>p</sup>) and (L<sub>λ</sub><sup>p</sup> ). Two particular paths of critical points connecting the origin and an ordinary least squares (OLS) solution are studied further. One is a main path starting at an OLS solution, and the other is a greedy path starting at the origin. Part of the greedy path can be constructed with a generalized Minkowskian gradient. This paper of greedy path leads to a nontrivial close-link between the optimization problem of ℓ<sub>p</sub>-regularized least squares and the greedy method of orthogonal matching pursuit.\",\"PeriodicalId\":13250,\"journal\":{\"name\":\"IEEE Trans. Inf. Theory\",\"volume\":\"186 1\",\"pages\":\"488-502\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"17\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Trans. Inf. Theory\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/TIT.2015.2501362\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Trans. Inf. Theory","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TIT.2015.2501362","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 17
摘要
本文讨论了$\ well _{p}$ -正则化最小二乘问题在$0非凸情况下的基本结构。突出显示了两个公式之间的区别(这在$p=1$的凸情况下不会出现):1)$\ well _{p}$约束优化($\mathcal {p} _{c}^{p}$)和2)$\ well _{p}$惩罚(无约束)优化$ (\mathcal {L}_{\lambda}^{p})$。证明了$ (\mathcal {L}_{\lambda}^{p})$的解路径是不连续的,也是$\mathcal {p} _{c}^{p}$的解路径的一部分。作为解路径的备选,我们考虑了一条关键路径,它是由临界点组成的最大连续曲线。从变分方法的角度来看,关键路径是分段光滑的,通常包含非最优点,如鞍点和局部最大值以及全局/局部最小值。我们的研究揭示了$ (\mathcal {P}_{c}^{P})$和$ (\mathcal {L}_{\lambda}^{P})$正则化参数之间对应关系的多重性(非单调性)。进一步研究了连接原点的两个临界点的特殊路径和一个普通最小二乘解。一条是从OLS解开始的主路径,另一条是从原点开始的贪心路径。部分贪心路径可以用广义闵可夫斯基梯度来构造。本文的贪心路径将正则化最小二乘优化问题与正交匹配寻优的贪心方法联系在了一起。
This paper elucidates the underlying structures of ℓp-regularized least squares problems in the nonconvex case of 0 <; p <; 1. The difference between two formulations is highlighted (which does not occur in the convex case of p = 1): 1) an ℓp-constrained optimization (Pcp) and 2) an ℓp-penalized (unconstrained) optimization (Lλp). It is shown that the solution path of (Lλp) is discontinuous and also a part of the solution path of (Pcp). As an alternative to the solution path, a critical path is considered, which is a maximal continuous curve consisting of critical points. Critical paths are piecewise smooth, as can be seen from the viewpoint of the variational method, and generally contain non-optimal points, such as saddle points and local maxima as well as global/local minima. Our study reveals multiplicity (non-monotonicity) in the correspondence between the regularization parameters of (Pcp) and (Lλp ). Two particular paths of critical points connecting the origin and an ordinary least squares (OLS) solution are studied further. One is a main path starting at an OLS solution, and the other is a greedy path starting at the origin. Part of the greedy path can be constructed with a generalized Minkowskian gradient. This paper of greedy path leads to a nontrivial close-link between the optimization problem of ℓp-regularized least squares and the greedy method of orthogonal matching pursuit.