Nonconvex Zeroth-Order Stochastic ADMM Methods with Lower Function Query Complexity.

Feihu Huang, Shangqian Gao, Jian Pei, Heng Huang
IEEE Transactions on Pattern Analysis and Machine Intelligence. DOI: 10.1109/TPAMI.2023.3347082. Published 2024-08-14.

Abstract

Zeroth-order (a.k.a. derivative-free) methods are a class of effective optimization methods for solving complex machine learning problems where gradients of the objective functions are unavailable or computationally prohibitive. Although many zeroth-order methods have been developed recently, these approaches still have two main drawbacks: 1) high function query complexity; 2) unsuitability for problems with complex penalties and constraints. To address these drawbacks, in this paper we propose a class of faster zeroth-order stochastic alternating direction method of multipliers (ADMM) methods (ZO-SPIDER-ADMM) to solve nonconvex finite-sum problems with multiple nonsmooth penalties. Moreover, we prove that the ZO-SPIDER-ADMM methods achieve a lower function query complexity of [Formula: see text] for finding an ϵ-stationary point, which improves on the best existing nonconvex zeroth-order ADMM methods by a factor of [Formula: see text], where n and d denote the sample size and data dimension, respectively. At the same time, we propose a class of faster zeroth-order online ADMM methods (ZOO-ADMM+) to solve nonconvex online problems with multiple nonsmooth penalties. We also prove that the proposed ZOO-ADMM+ methods achieve a lower function query complexity of [Formula: see text], which improves on the best existing result by a factor of [Formula: see text]. Extensive experimental results on structured adversarial attacks on black-box deep neural networks demonstrate the efficiency of our new algorithms.
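The abstract rests on zeroth-order gradient estimation: approximating a gradient from function evaluations alone, so that each estimate costs a few "function queries." As an illustrative sketch only (a standard two-point Gaussian-smoothing estimator, not the paper's SPIDER-based construction; the function `zo_gradient_estimate` and its parameters are hypothetical names for this example):

```python
import numpy as np

def zo_gradient_estimate(f, x, mu=1e-4, num_dirs=10, rng=None):
    """Two-point zeroth-order gradient estimator.

    Averages (f(x + mu*u) - f(x - mu*u)) / (2*mu) * u over random
    Gaussian directions u; each direction costs 2 function queries.
    """
    rng = np.random.default_rng(0) if rng is None else rng
    d = x.shape[0]
    g = np.zeros(d)
    for _ in range(num_dirs):
        u = rng.standard_normal(d)          # random probe direction
        delta = f(x + mu * u) - f(x - mu * u)
        g += (delta / (2.0 * mu)) * u       # directional-derivative estimate
    return g / num_dirs

# For the quadratic f(x) = x.x (true gradient 2x), the estimate
# concentrates around the true gradient as num_dirs grows.
est = zo_gradient_estimate(lambda x: x @ x, np.array([1.0, 2.0]), num_dirs=5000)
```

Under this framing, the function query complexities quoted above count exactly these `f` evaluations; the paper's contribution is driving that count down via variance reduction inside an ADMM loop, not the basic estimator shown here.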
