Numerical properties of solutions of LASSO regression

IF 2.2 2区 数学 Q1 MATHEMATICS, APPLIED Applied Numerical Mathematics Pub Date : 2025-02-01 DOI:10.1016/j.apnum.2024.03.010
Mayur V. Lakshmi , Joab R. Winkler
{"title":"Numerical properties of solutions of LASSO regression","authors":"Mayur V. Lakshmi ,&nbsp;Joab R. Winkler","doi":"10.1016/j.apnum.2024.03.010","DOIUrl":null,"url":null,"abstract":"<div><div>The determination of a concise model of a linear system when there are fewer samples <em>m</em> than predictors <em>n</em> requires the solution of the equation <span><math><mi>A</mi><mi>x</mi><mo>=</mo><mi>b</mi></math></span>, where <span><math><mi>A</mi><mo>∈</mo><msup><mrow><mi>R</mi></mrow><mrow><mi>m</mi><mo>×</mo><mi>n</mi></mrow></msup></math></span> and <span><math><mtext>rank</mtext><mspace></mspace><mi>A</mi><mo>=</mo><mi>m</mi></math></span>, such that the selected solution from the infinite number of solutions is sparse, that is, many of its components are zero. This leads to the minimisation with respect to <em>x</em> of <span><math><mi>f</mi><mo>(</mo><mi>x</mi><mo>,</mo><mi>λ</mi><mo>)</mo><mo>=</mo><msubsup><mrow><mo>‖</mo><mi>A</mi><mi>x</mi><mo>−</mo><mi>b</mi><mo>‖</mo></mrow><mrow><mn>2</mn></mrow><mrow><mn>2</mn></mrow></msubsup><mo>+</mo><mi>λ</mi><msub><mrow><mo>‖</mo><mi>x</mi><mo>‖</mo></mrow><mrow><mn>1</mn></mrow></msub></math></span>, where <em>λ</em> is the regularisation parameter. This problem, which is called LASSO regression, yields a family of functions <span><math><msub><mrow><mi>x</mi></mrow><mrow><mtext>lasso</mtext></mrow></msub><mo>(</mo><mi>λ</mi><mo>)</mo></math></span> and it is necessary to determine the optimal value of <em>λ</em>, that is, the value of <em>λ</em> that balances the fidelity of the model, <span><math><mrow><mo>‖</mo><mi>A</mi><msub><mrow><mi>x</mi></mrow><mrow><mtext>lasso</mtext></mrow></msub><mo>(</mo><mi>λ</mi><mo>)</mo><mo>−</mo><mi>b</mi><mo>‖</mo></mrow><mo>≈</mo><mn>0</mn></math></span>, and the satisfaction of the constraint that <span><math><msub><mrow><mi>x</mi></mrow><mrow><mtext>lasso</mtext></mrow></msub><mo>(</mo><mi>λ</mi><mo>)</mo></math></span> be sparse. The aim of this paper is an investigation of the numerical properties of <span><math><msub><mrow><mi>x</mi></mrow><mrow><mtext>lasso</mtext></mrow></msub><mo>(</mo><mi>λ</mi><mo>)</mo></math></span>, and the main conclusion of this investigation is the incompatibility of sparsity and stability, that is, a sparse solution <span><math><msub><mrow><mi>x</mi></mrow><mrow><mtext>lasso</mtext></mrow></msub><mo>(</mo><mi>λ</mi><mo>)</mo></math></span> that preserves the fidelity of the model exists if the least squares (LS) solution <span><math><msub><mrow><mi>x</mi></mrow><mrow><mtext>ls</mtext></mrow></msub><mo>=</mo><msup><mrow><mi>A</mi></mrow><mrow><mi>†</mi></mrow></msup><mi>b</mi></math></span> is unstable. Two methods, cross validation and the L-curve, for the computation of the optimal value of <em>λ</em> are compared and it is shown that the L-curve yields significantly better results. This difference between stable and unstable solutions <span><math><msub><mrow><mi>x</mi></mrow><mrow><mtext>ls</mtext></mrow></msub></math></span> of the LS problem manifests itself in the very different forms of the L-curve for these two solutions. The paper includes examples of stable and unstable solutions <span><math><msub><mrow><mi>x</mi></mrow><mrow><mtext>ls</mtext></mrow></msub></math></span> that demonstrate the theory.</div></div>","PeriodicalId":8199,"journal":{"name":"Applied Numerical Mathematics","volume":"208 ","pages":"Pages 297-309"},"PeriodicalIF":2.2000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Numerical Mathematics","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0168927424000576","RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MATHEMATICS, APPLIED","Score":null,"Total":0}
引用次数: 0

Abstract

The determination of a concise model of a linear system when there are fewer samples m than predictors n requires the solution of the equation Ax=b, where ARm×n and rankA=m, such that the selected solution from the infinite number of solutions is sparse, that is, many of its components are zero. This leads to the minimisation with respect to x of f(x,λ)=Axb22+λx1, where λ is the regularisation parameter. This problem, which is called LASSO regression, yields a family of functions xlasso(λ) and it is necessary to determine the optimal value of λ, that is, the value of λ that balances the fidelity of the model, Axlasso(λ)b0, and the satisfaction of the constraint that xlasso(λ) be sparse. The aim of this paper is an investigation of the numerical properties of xlasso(λ), and the main conclusion of this investigation is the incompatibility of sparsity and stability, that is, a sparse solution xlasso(λ) that preserves the fidelity of the model exists if the least squares (LS) solution xls=Ab is unstable. Two methods, cross validation and the L-curve, for the computation of the optimal value of λ are compared and it is shown that the L-curve yields significantly better results. This difference between stable and unstable solutions xls of the LS problem manifests itself in the very different forms of the L-curve for these two solutions. The paper includes examples of stable and unstable solutions xls that demonstrate the theory.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
LASSO 回归解的数值特性
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Applied Numerical Mathematics
Applied Numerical Mathematics 数学-应用数学
CiteScore
5.60
自引率
7.10%
发文量
225
审稿时长
7.2 months
期刊介绍: The purpose of the journal is to provide a forum for the publication of high quality research and tutorial papers in computational mathematics. In addition to the traditional issues and problems in numerical analysis, the journal also publishes papers describing relevant applications in such fields as physics, fluid dynamics, engineering and other branches of applied science with a computational mathematics component. The journal strives to be flexible in the type of papers it publishes and their format. Equally desirable are: (i) Full papers, which should be complete and relatively self-contained original contributions with an introduction that can be understood by the broad computational mathematics community. Both rigorous and heuristic styles are acceptable. Of particular interest are papers about new areas of research, in which other than strictly mathematical arguments may be important in establishing a basis for further developments. (ii) Tutorial review papers, covering some of the important issues in Numerical Mathematics, Scientific Computing and their Applications. The journal will occasionally publish contributions which are larger than the usual format for regular papers. (iii) Short notes, which present specific new results and techniques in a brief communication.
期刊最新文献
Parametric finite element method for a nonlocal curvature flow Orthogonal wavelet method for multi-stage expansion and contraction options under stochastic volatility Finite element error estimation for parabolic optimal control problems with time delay Editorial Board Editorial Board
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1