On Gradient-Based Optimization: Accelerated, Distributed, Asynchronous and Stochastic

Proceedings of the 2017 ACM SIGMETRICS / International Conference on Measurement and Modeling of Computer Systems Pub Date : 2017-06-05 DOI:10.1145/3143314.3078506

Michael I. Jordan

引用次数: 3

Abstract

Many new theoretical challenges have arisen in the area of gradient-based optimization for large-scale statistical data analysis, driven by the needs of applications and the opportunities provided by new hardware and software platforms. I discuss several recent results in this area, including: (1) a new framework for understanding Nesterov acceleration, obtained by taking a continuous-time, Lagrangian/Hamiltonian perspective, (2) a general theory of asynchronous optimization in multi-processor systems, (3) a computationally-efficient approach to stochastic variance reduction, (4) a primal-dual methodology for gradient-based optimization that targets communication bottlenecks in distributed systems, and (5) a discussion of how to avoid saddle-points in nonconvex optimization.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于梯度的优化:加速、分布式、异步和随机

在应用需求和新软硬件平台机遇的推动下，基于梯度的大规模统计数据分析优化领域出现了许多新的理论挑战。我将讨论这一领域的几个最新成果，包括:(1)从连续时间、拉格朗日/哈密顿角度获得了理解Nesterov加速的新框架;(2)多处理器系统中异步优化的一般理论;(3)随机方差减少的高效计算方法;(4)针对分布式系统中通信瓶颈的基于梯度的优化的原始对偶方法;(5)讨论如何避免非凸优化中的鞍点。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proceedings of the 2017 ACM SIGMETRICS / International Conference on Measurement and Modeling of Computer Systems

自引率

0.00%

发文量

期刊最新文献

Session details: Session 5: Towards Efficient and Durable Storage Routing Money, Not Packets: A Tutorial on Internet Economics Accelerating Performance Inference over Closed Systems by Asymptotic Methods Session details: Session 3: Assessing Vulnerability of Large Networks Exploiting Data Longevity for Enhancing the Lifetime of Flash-based Storage Class Memory