{"title":"Push-sum Distributed Dual Averaging Online Convex Optimization With Bandit Feedback","authors":"Ju Yang, Mengli Wei, Yan Wang, Zhongyuan Zhao","doi":"10.1007/s12555-023-0211-3","DOIUrl":null,"url":null,"abstract":"<p>This paper investigates the distributed online convex optimization problem in multi-agent systems, where each node cannot directly access the gradient information of its own cost function. The communication topology is formed by the strongly connected time-varying directed graphs with the column stochastic weight matrices, where each node updates its own decisions by exchanging information with neighbouring nodes. It is not feasible to sample objective function values at several consecutive points simultaneously since the online setting is time-varying. To solve this problem over directed graphs, a push-sum one-point bandit distributed dual averaging (PS-OBDDA) algorithm is proposed, where the one-point gradient estimator is employed to estimate the true gradient information, to guide the updating of the decision variables. Moreover, by selecting the appropriate exploration parameter <i>δ</i> and step sizes <i>α</i>(<i>t</i>), the algorithm is shown to achieve the sublinear regret bound with the convergence rate <span>\\(O({T^{{5 \\over 6}}})\\)</span>. Furthermore, the effect of one-point estimation parameters on the regret of the algorithm in online settings is explored. Finally, the performance of the algorithm is evaluated through simulation.</p>","PeriodicalId":54965,"journal":{"name":"International Journal of Control Automation and Systems","volume":"25 1","pages":""},"PeriodicalIF":2.5000,"publicationDate":"2024-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Control Automation and Systems","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s12555-023-0211-3","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
This paper investigates the distributed online convex optimization problem in multi-agent systems, where each node cannot directly access the gradient information of its own cost function. The communication topology is formed by the strongly connected time-varying directed graphs with the column stochastic weight matrices, where each node updates its own decisions by exchanging information with neighbouring nodes. It is not feasible to sample objective function values at several consecutive points simultaneously since the online setting is time-varying. To solve this problem over directed graphs, a push-sum one-point bandit distributed dual averaging (PS-OBDDA) algorithm is proposed, where the one-point gradient estimator is employed to estimate the true gradient information, to guide the updating of the decision variables. Moreover, by selecting the appropriate exploration parameter δ and step sizes α(t), the algorithm is shown to achieve the sublinear regret bound with the convergence rate \(O({T^{{5 \over 6}}})\). Furthermore, the effect of one-point estimation parameters on the regret of the algorithm in online settings is explored. Finally, the performance of the algorithm is evaluated through simulation.
期刊介绍:
International Journal of Control, Automation and Systems is a joint publication of the Institute of Control, Robotics and Systems (ICROS) and the Korean Institute of Electrical Engineers (KIEE).
The journal covers three closly-related research areas including control, automation, and systems.
The technical areas include
Control Theory
Control Applications
Robotics and Automation
Intelligent and Information Systems
The Journal addresses research areas focused on control, automation, and systems in electrical, mechanical, aerospace, chemical, and industrial engineering in order to create a strong synergy effect throughout the interdisciplinary research areas.