Deep Learning in the Enhanced Cloud

Eric S. Chung
{"title":"Deep Learning in the Enhanced Cloud","authors":"Eric S. Chung","doi":"10.1145/3036669.3038243","DOIUrl":null,"url":null,"abstract":"Deep Learning has emerged as a singularly critical technology for enabling human-like intelligence in online services such as Azure, Office 365, Bing, Cortana, Skype, and other high-valued scenarios at Microsoft. While Deep Neural Networks (DNNs) have enabled state-of-the-art accuracy in many intelligence tasks, they are notoriously expensive and difficult to deploy in hyperscale datacenters constrained by power, cost, and latency. Furthermore, the escalating (and insatiable) demand for DNNs comes at an inopportune time as ideal silicon scaling (Moore's Law) comes to a diminishing end. At Microsoft, we have developed a new cloud architecture that's enhanced using FPGA (Field Programmable Gate Array). FPGAs can be viewed as programmable silicon and are being deployed into each and every new server in Microsoft's hyperscale infrastructure. The flexibility of FPGAs combined with a novel Hardware-as-a-Service (HaaS) architecture unlocks the full potential of a completely programmable hardware and software acceleration plane. In this talk, I'll give a history and overview of the project, discuss the key enabling technologies behind our enhanced cloud, present opportunities to harness this technology for accelerated deep learning, and conclude with directions for future work.","PeriodicalId":269197,"journal":{"name":"Proceedings of the 2017 ACM on International Symposium on Physical Design","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2017-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2017 ACM on International Symposium on Physical Design","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3036669.3038243","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Deep Learning has emerged as a singularly critical technology for enabling human-like intelligence in online services such as Azure, Office 365, Bing, Cortana, Skype, and other high-value scenarios at Microsoft. While Deep Neural Networks (DNNs) have enabled state-of-the-art accuracy in many intelligence tasks, they are notoriously expensive and difficult to deploy in hyperscale datacenters constrained by power, cost, and latency. Furthermore, the escalating (and insatiable) demand for DNNs comes at an inopportune time, just as ideal silicon scaling (Moore's Law) draws to an end. At Microsoft, we have developed a new cloud architecture enhanced with FPGAs (Field Programmable Gate Arrays). FPGAs can be viewed as programmable silicon and are being deployed in every new server in Microsoft's hyperscale infrastructure. The flexibility of FPGAs, combined with a novel Hardware-as-a-Service (HaaS) architecture, unlocks the full potential of a completely programmable hardware and software acceleration plane. In this talk, I'll give a history and overview of the project, discuss the key enabling technologies behind our enhanced cloud, present opportunities to harness this technology for accelerated deep learning, and conclude with directions for future work.
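The abstract gives no implementation details, so the following is only a minimal, illustrative sketch of the latency argument it makes: at batch size 1, serving latency is dominated by effective (not peak) throughput plus fixed overheads such as batching delay and interconnect hops, which is why a persistent, pipelined accelerator in every server can help. All hardware figures, platform names, and the `inference_latency_ms` helper below are assumptions for illustration, not numbers or APIs from the talk.

```python
# Illustrative back-of-envelope model of batch-1 DNN inference latency.
# None of the numbers below come from the talk; they are round figures chosen
# only to show why latency-constrained serving favors accelerators that avoid
# batching delay and keep models resident in fast on-chip memory.

def inference_latency_ms(gflops_per_query: float,
                         effective_tflops: float,
                         fixed_overhead_ms: float) -> float:
    """Latency = compute time at *effective* throughput + fixed overhead
    (request batching, PCIe/network hops, scheduling, etc.)."""
    # GFLOP / (TFLOP/s) works out to milliseconds.
    compute_ms = gflops_per_query / effective_tflops
    return compute_ms + fixed_overhead_ms

# Hypothetical workload: ~20 GFLOPs per query (roughly a mid-sized model).
workload_gflops = 20.0

# Assumed platforms: a batch-oriented device reaches high utilization only by
# queueing requests into batches, while a pinned-model accelerator serves each
# request as it arrives with low fixed overhead.
platforms = {
    "CPU (batch 1)":              dict(effective_tflops=0.2, fixed_overhead_ms=0.5),
    "Batch-oriented accelerator": dict(effective_tflops=8.0, fixed_overhead_ms=5.0),
    "Pinned-model accelerator":   dict(effective_tflops=4.0, fixed_overhead_ms=0.2),
}

if __name__ == "__main__":
    for name, params in platforms.items():
        latency = inference_latency_ms(workload_gflops, **params)
        print(f"{name:28s} ~{latency:6.2f} ms per query")
```

Under these assumed numbers the CPU lands around 100 ms per query, the batch-oriented device around 7.5 ms, and the pinned-model device around 5 ms, which is the qualitative trade-off the abstract attributes to its FPGA-based acceleration plane.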