Compute-Efficient Deep Learning: Algorithmic Trends and Opportunities

J. Mach. Learn. Res. Pub Date : 2022-10-13 DOI:10.48550/arXiv.2210.06640

Brian Bartoldson, B. Kailkhura, Davis W. Blalock

{"title":"Compute-Efficient Deep Learning: Algorithmic Trends and Opportunities","authors":"Brian Bartoldson, B. Kailkhura, Davis W. Blalock","doi":"10.48550/arXiv.2210.06640","DOIUrl":null,"url":null,"abstract":"Although deep learning has made great progress in recent years, the exploding economic and environmental costs of training neural networks are becoming unsustainable. To address this problem, there has been a great deal of research on *algorithmically-efficient deep learning*, which seeks to reduce training costs not at the hardware or implementation level, but through changes in the semantics of the training program. In this paper, we present a structured and comprehensive overview of the research in this field. First, we formalize the *algorithmic speedup* problem, then we use fundamental building blocks of algorithmically efficient training to develop a taxonomy. Our taxonomy highlights commonalities of seemingly disparate methods and reveals current research gaps. Next, we present evaluation best practices to enable comprehensive, fair, and reliable comparisons of speedup techniques. To further aid research and applications, we discuss common bottlenecks in the training pipeline (illustrated via experiments) and offer taxonomic mitigation strategies for them. Finally, we highlight some unsolved research challenges and present promising future directions.","PeriodicalId":14794,"journal":{"name":"J. Mach. Learn. Res.","volume":"36 1","pages":"122:1-122:77"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"18","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"J. Mach. Learn. Res.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2210.06640","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 18

Abstract

Although deep learning has made great progress in recent years, the exploding economic and environmental costs of training neural networks are becoming unsustainable. To address this problem, there has been a great deal of research on *algorithmically-efficient deep learning*, which seeks to reduce training costs not at the hardware or implementation level, but through changes in the semantics of the training program. In this paper, we present a structured and comprehensive overview of the research in this field. First, we formalize the *algorithmic speedup* problem, then we use fundamental building blocks of algorithmically efficient training to develop a taxonomy. Our taxonomy highlights commonalities of seemingly disparate methods and reveals current research gaps. Next, we present evaluation best practices to enable comprehensive, fair, and reliable comparisons of speedup techniques. To further aid research and applications, we discuss common bottlenecks in the training pipeline (illustrated via experiments) and offer taxonomic mitigation strategies for them. Finally, we highlight some unsolved research challenges and present promising future directions.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

高效计算深度学习:算法趋势和机遇

为了解决这个问题，已经有大量关于“算法高效深度学习”的研究，它寻求减少训练成本，而不是在硬件或实现层面，而是通过改变训练程序的语义。在本文中，我们对这一领域的研究进行了结构化和全面的概述。首先，我们将“算法加速”问题形式化，然后使用算法高效训练的基本构建块来开发分类法。我们的分类法突出了看似不同的方法的共性，并揭示了当前的研究差距。接下来，我们将介绍评估最佳实践，以便对加速技术进行全面、公平和可靠的比较。为了进一步帮助研究和应用，我们讨论了培训管道中的常见瓶颈(通过实验说明)，并提供了分类缓解策略。最后，我们强调了一些尚未解决的研究挑战，并提出了有希望的未来方向。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

J. Mach. Learn. Res.

自引率

0.00%

发文量

期刊最新文献

Scalable Computation of Causal Bounds A Unified Framework for Factorizing Distributional Value Functions for Multi-Agent Reinforcement Learning Adaptive False Discovery Rate Control with Privacy Guarantee Fairlearn: Assessing and Improving Fairness of AI Systems Generalization Bounds for Adversarial Contrastive Learning