具有预测和反馈延迟的切换约束在线凸优化

Q4 Computer Science Performance Evaluation Review Pub Date : 2023-09-28 DOI:10.1145/3626570.3626573

Weici Pan, Zhenhua Liu

{"title":"具有预测和反馈延迟的切换约束在线凸优化","authors":"Weici Pan, Zhenhua Liu","doi":"10.1145/3626570.3626573","DOIUrl":null,"url":null,"abstract":"In various applications such as smart grids, the online player is allowed a limited number of switches among decisions. Additionally, real-world scenarios often involve feedback delays or access to near-future predictions. Motivated by this, we study Online Convex Optimization with a switching limit, incorporating feedback delays and predictions. In this extended abstract, we established a near-optimal regret of O(T/S) for delayed feedbacks and a bound of O(T/S - t ) for predictions of t rounds even though the player is only allowed to move at most S times, in expectation, across T rounds. We developed an algorithm which achieves the bounds in both cases and still works when there are both delays and predictions.","PeriodicalId":35745,"journal":{"name":"Performance Evaluation Review","volume":"71 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Switching Constrained Online Convex Optimization with Predictions and Feedback Delays\",\"authors\":\"Weici Pan, Zhenhua Liu\",\"doi\":\"10.1145/3626570.3626573\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In various applications such as smart grids, the online player is allowed a limited number of switches among decisions. Additionally, real-world scenarios often involve feedback delays or access to near-future predictions. Motivated by this, we study Online Convex Optimization with a switching limit, incorporating feedback delays and predictions. In this extended abstract, we established a near-optimal regret of O(T/S) for delayed feedbacks and a bound of O(T/S - t ) for predictions of t rounds even though the player is only allowed to move at most S times, in expectation, across T rounds. We developed an algorithm which achieves the bounds in both cases and still works when there are both delays and predictions.\",\"PeriodicalId\":35745,\"journal\":{\"name\":\"Performance Evaluation Review\",\"volume\":\"71 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-09-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Performance Evaluation Review\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3626570.3626573\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"Computer Science\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Performance Evaluation Review","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3626570.3626573","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Computer Science","Score":null,"Total":0}

引用次数: 0

摘要

在智能电网等各种应用中，在线玩家被允许在决策之间进行有限数量的切换。此外，现实世界的场景通常涉及反馈延迟或对近期预测的访问。受此启发，我们研究了带有切换限制的在线凸优化，结合了反馈延迟和预测。在这个扩展的抽象中，我们为延迟反馈建立了一个接近最优的后悔0 (T/S)，为T轮的预测建立了一个0 (T/S - T)的界限，即使玩家在T轮中最多只被允许移动S次。我们开发了一种算法，它在两种情况下都能达到边界，并且在有延迟和预测的情况下仍然有效。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Switching Constrained Online Convex Optimization with Predictions and Feedback Delays

In various applications such as smart grids, the online player is allowed a limited number of switches among decisions. Additionally, real-world scenarios often involve feedback delays or access to near-future predictions. Motivated by this, we study Online Convex Optimization with a switching limit, incorporating feedback delays and predictions. In this extended abstract, we established a near-optimal regret of O(T/S) for delayed feedbacks and a bound of O(T/S - t ) for predictions of t rounds even though the player is only allowed to move at most S times, in expectation, across T rounds. We developed an algorithm which achieves the bounds in both cases and still works when there are both delays and predictions.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊