Multi-Armed Bandits for Autonomous Timing-driven Design Optimization

2019 29th International Symposium on Power and Timing Modeling, Optimization and Simulation (PATMOS) Pub Date : 2019-07-01 DOI:10.1109/PATMOS.2019.8862056

A. Stefanidis, Dimitrios Mangiras, C. Nicopoulos, G. Dimitrakopoulos

{"title":"Multi-Armed Bandits for Autonomous Timing-driven Design Optimization","authors":"A. Stefanidis, Dimitrios Mangiras, C. Nicopoulos, G. Dimitrakopoulos","doi":"10.1109/PATMOS.2019.8862056","DOIUrl":null,"url":null,"abstract":"Timing closure is a complex process that involves many iterative optimization steps applied in various phases of the physical design flow. Cell sizing and transistor threshold selection, as well as datapath and clock buffering, are some of the tools available for design optimization. At the moment, design optimization methods are integrated into EDA tools and applied incrementally in various parts of the flow, while the optimal order of their application is yet to be determined. In this work, we rely on reinforcement learning – through the use of the Multi-Armed Bandit model for decision making under uncertainty – to automatically suggest online which optimization heuristic should be applied to the design. The goal is to improve the performance metrics based on the rewards learned from the previous applications of each heuristic. Experimental results show that automating the process of design optimization with machine learning not only results in designs that are close to the best-published results derived from deterministic approaches, but it also allows for the execution of the optimization flow without any human in the loop, and without any need for offline training of the heuristic-orchestration algorithm.","PeriodicalId":430458,"journal":{"name":"2019 29th International Symposium on Power and Timing Modeling, Optimization and Simulation (PATMOS)","volume":"119 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 29th International Symposium on Power and Timing Modeling, Optimization and Simulation (PATMOS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PATMOS.2019.8862056","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

Abstract

Timing closure is a complex process that involves many iterative optimization steps applied in various phases of the physical design flow. Cell sizing and transistor threshold selection, as well as datapath and clock buffering, are some of the tools available for design optimization. At the moment, design optimization methods are integrated into EDA tools and applied incrementally in various parts of the flow, while the optimal order of their application is yet to be determined. In this work, we rely on reinforcement learning – through the use of the Multi-Armed Bandit model for decision making under uncertainty – to automatically suggest online which optimization heuristic should be applied to the design. The goal is to improve the performance metrics based on the rewards learned from the previous applications of each heuristic. Experimental results show that automating the process of design optimization with machine learning not only results in designs that are close to the best-published results derived from deterministic approaches, but it also allows for the execution of the optimization flow without any human in the loop, and without any need for offline training of the heuristic-orchestration algorithm.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

自主时序驱动设计优化的多武装土匪

时序关闭是一个复杂的过程，涉及到在物理设计流程的各个阶段应用的许多迭代优化步骤。单元尺寸和晶体管阈值选择，以及数据路径和时钟缓冲，是一些可用于设计优化的工具。目前，设计优化方法被集成到EDA工具中，并逐步应用于流程的各个部分，其应用的最优顺序尚未确定。在这项工作中，我们依靠强化学习——通过使用Multi-Armed Bandit模型进行不确定性下的决策——在线自动建议应该将哪种优化启发式应用于设计。目标是基于从每个启发式的先前应用程序中学习到的奖励来改进性能指标。实验结果表明，使用机器学习自动化设计优化过程不仅可以使设计接近于由确定性方法得出的最佳发表结果，而且还允许在没有任何人工参与的情况下执行优化流程，并且不需要启发式编排算法的离线训练。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2019 29th International Symposium on Power and Timing Modeling, Optimization and Simulation (PATMOS)

自引率

0.00%

发文量

期刊最新文献

On the Static CMOS Implementation of Magnitude Comparators [PATMOS 2019 Title Page] UVM-based Verification of a Digital PLL Using SystemVerilog Minimizing Power for Neural Network Training with Logarithm-Approximate Floating-Point Multiplier Stochastic Radial Basis Neural Networks