Multiagent Reinforcement Learning for Hyperparameter Optimization of Convolutional Neural Networks

IF 2.9 3区计算机科学 Q2 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems Pub Date : 2021-03-03 DOI:10.1109/TCAD.2021.3077193

Arman Iranfar;Marina Zapater;David Atienza

{"title":"Multiagent Reinforcement Learning for Hyperparameter Optimization of Convolutional Neural Networks","authors":"Arman Iranfar;Marina Zapater;David Atienza","doi":"10.1109/TCAD.2021.3077193","DOIUrl":null,"url":null,"abstract":"Nowadays, deep convolutional neural networks (DCNNs) play a significant role in many application domains, such as computer vision, medical imaging, and image processing. Nonetheless, designing a DCNN, able to defeat the state of the art, is a manual, challenging, and time-consuming task, due to the extremely large design space, as a consequence of a large number of layers and their corresponding hyperparameters. In this work, we address the challenge of performing hyperparameter optimization of DCNNs through a novel multiagent reinforcement learning (MARL)-based approach, eliminating the human effort. In particular, we adapt \n<inline-formula> <tex-math>$Q$ </tex-math></inline-formula>\n-learning and define learning agents per layer to split the design space into independent smaller design subspaces such that each agent fine tunes the hyperparameters of the assigned layer concerning a global reward. Moreover, we provide a novel formation of \n<inline-formula> <tex-math>$Q$ </tex-math></inline-formula>\n-tables along with a new update rule that facilitates agents’ communication. Our MARL-based approach is data driven and able to consider an arbitrary set of design objectives and constraints. We apply our MARL-based solution to different well-known DCNNs, including GoogLeNet, VGG, and U-Net, and various datasets for image classification and semantic segmentation. Our results have shown that compared to the original CNNs, the MARL-based approach can reduce the model size, training time, and inference time by up to, respectively, \n<inline-formula> <tex-math>$83\\times $ </tex-math></inline-formula>\n, 52%, and 54% without any degradation in accuracy. Moreover, our approach is very competitive to state-of-the-art neural architecture search methods in terms of the designed CNN accuracy and its number of parameters while significantly reducing the optimization cost.","PeriodicalId":13251,"journal":{"name":"IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems","volume":"41 4","pages":"1034-1047"},"PeriodicalIF":2.9000,"publicationDate":"2021-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/TCAD.2021.3077193","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/9420739/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}

引用次数: 5

Abstract

Nowadays, deep convolutional neural networks (DCNNs) play a significant role in many application domains, such as computer vision, medical imaging, and image processing. Nonetheless, designing a DCNN, able to defeat the state of the art, is a manual, challenging, and time-consuming task, due to the extremely large design space, as a consequence of a large number of layers and their corresponding hyperparameters. In this work, we address the challenge of performing hyperparameter optimization of DCNNs through a novel multiagent reinforcement learning (MARL)-based approach, eliminating the human effort. In particular, we adapt

$Q$

-learning and define learning agents per layer to split the design space into independent smaller design subspaces such that each agent fine tunes the hyperparameters of the assigned layer concerning a global reward. Moreover, we provide a novel formation of

$Q$

-tables along with a new update rule that facilitates agents’ communication. Our MARL-based approach is data driven and able to consider an arbitrary set of design objectives and constraints. We apply our MARL-based solution to different well-known DCNNs, including GoogLeNet, VGG, and U-Net, and various datasets for image classification and semantic segmentation. Our results have shown that compared to the original CNNs, the MARL-based approach can reduce the model size, training time, and inference time by up to, respectively,

$83\times $

, 52%, and 54% without any degradation in accuracy. Moreover, our approach is very competitive to state-of-the-art neural architecture search methods in terms of the designed CNN accuracy and its number of parameters while significantly reducing the optimization cost.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

用于卷积神经网络超参数优化的多智能体强化学习

如今，深度卷积神经网络在计算机视觉、医学成像和图像处理等许多应用领域发挥着重要作用。尽管如此，设计能够击败现有技术的DCNN是一项手动、具有挑战性和耗时的任务，因为由于大量的层及其相应的超参数，设计空间非常大。在这项工作中，我们通过一种新的基于多智能体强化学习（MARL）的方法来解决对DCNN进行超参数优化的挑战，消除了人为的努力。特别地，我们调整$Q$-学习，并定义每层的学习代理，以将设计空间划分为独立的较小设计子空间，使得每个代理微调与全局奖励有关的指定层的超参数。此外，我们还提供了一种新的$Q$表形式，以及一个新的更新规则，以促进代理的通信。我们基于MARL的方法是数据驱动的，能够考虑任意一组设计目标和约束。我们将基于MARL的解决方案应用于不同的知名DCNN，包括GoogLeNet、VGG和U-Net，以及用于图像分类和语义分割的各种数据集。我们的结果表明，与原始细胞神经网络相比，基于MARL的方法可以将模型大小、训练时间和推理时间分别减少83美元、52%和54%，而不会降低准确性。此外，就设计的CNN精度及其参数数量而言，我们的方法与最先进的神经结构搜索方法相比具有很强的竞争力，同时显著降低了优化成本。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 工程技术-工程：电子与电气

CiteScore

5.60

自引率

13.80%

发文量

500

审稿时长

7 months

期刊介绍： The purpose of this Transactions is to publish papers of interest to individuals in the area of computer-aided design of integrated circuits and systems composed of analog, digital, mixed-signal, optical, or microwave components. The aids include methods, models, algorithms, and man-machine interfaces for system-level, physical and logical design including: planning, synthesis, partitioning, modeling, simulation, layout, verification, testing, hardware-software co-design and documentation of integrated circuit and system designs of all complexities. Design tools and techniques for evaluating and designing integrated circuits and systems for metrics such as performance, power, reliability, testability, and security are a focus.