Optimal Caching for Partial-Observation Regime and Beyond

IF 5.8 2区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS IEEE Transactions on Services Computing Pub Date : 2024-08-28 DOI:10.1109/TSC.2024.3451163

Zifan Jia;Qingsong Liu;Jiang Zhou;Xiaoyan Gu;Yaoyu Zhang;Bo Li;Weiping Wang

{"title":"Optimal Caching for Partial-Observation Regime and Beyond","authors":"Zifan Jia;Qingsong Liu;Jiang Zhou;Xiaoyan Gu;Yaoyu Zhang;Bo Li;Weiping Wang","doi":"10.1109/TSC.2024.3451163","DOIUrl":null,"url":null,"abstract":"We study the caching problem from an online learning point-of-view, i.e., no model assumptions and prior knowledge for the file request sequence. Our goal is to design an efficient online caching policy with minimal regret, i.e., minimizing the total number of cache miss with respect to the best static configuration in hindsight. Previous studies, such as Follow-The-Perturbed-Leader (FTPL) and Follow-The-Regularized-Leader (FTRL) caching policies, have provided some near-optimal results, but their theoretical performance guarantees only valid for the regime wherein all arrival requests could be seen by the cache, which is not the case in some practical scenarios. Hence our work closes this gap by considering the partial-observation regime wherein only requests for currently cached files are seen by the cache, which is more challenging and has not been studied before. We propose an online caching policy integrating the FTPL with a popularity estimation procedure called Geometric Resampling (GR), which is the first no-regret policy in this regime (achieve sublinear regret guarantee). Moreover, in the partial-observation regime, we also consider the caching problem with additional operational requirements of real-world systems, i.e., long-term constraints, and proposed a modified version of FTRL combining with GR to address this challenge setting. The theoretical analysis shows that this caching policy is able to achieve no-regret guarantee while satisfying the operational long-term constraints in expectation. Finally, we conduct numerical experiments to validate the theoretical guarantees of our proposed caching policies.","PeriodicalId":13255,"journal":{"name":"IEEE Transactions on Services Computing","volume":"17 6","pages":"4041-4054"},"PeriodicalIF":5.8000,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Services Computing","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10654576/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

We study the caching problem from an online learning point-of-view, i.e., no model assumptions and prior knowledge for the file request sequence. Our goal is to design an efficient online caching policy with minimal regret, i.e., minimizing the total number of cache miss with respect to the best static configuration in hindsight. Previous studies, such as Follow-The-Perturbed-Leader (FTPL) and Follow-The-Regularized-Leader (FTRL) caching policies, have provided some near-optimal results, but their theoretical performance guarantees only valid for the regime wherein all arrival requests could be seen by the cache, which is not the case in some practical scenarios. Hence our work closes this gap by considering the partial-observation regime wherein only requests for currently cached files are seen by the cache, which is more challenging and has not been studied before. We propose an online caching policy integrating the FTPL with a popularity estimation procedure called Geometric Resampling (GR), which is the first no-regret policy in this regime (achieve sublinear regret guarantee). Moreover, in the partial-observation regime, we also consider the caching problem with additional operational requirements of real-world systems, i.e., long-term constraints, and proposed a modified version of FTRL combining with GR to address this challenge setting. The theoretical analysis shows that this caching policy is able to achieve no-regret guarantee while satisfying the operational long-term constraints in expectation. Finally, we conduct numerical experiments to validate the theoretical guarantees of our proposed caching policies.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

部分观测时段及其他时段的优化缓存

我们从在线学习的角度研究缓存问题，即没有模型假设和文件请求序列的先验知识。我们的目标是设计一个具有最小遗憾的高效在线缓存策略，也就是说，最小化缓存丢失的总数，相对于事后的最佳静态配置。以前的研究，如跟随扰流领导者（FTPL）和跟随正则化领导者（FTRL）缓存策略，已经提供了一些接近最优的结果，但它们的理论性能保证只适用于所有到达请求都可以被缓存看到的机制，而在一些实际场景中并非如此。因此，我们的工作通过考虑部分观察机制来缩小这一差距，其中只有对当前缓存文件的请求才会被缓存看到，这更具挑战性，以前没有研究过。我们提出了一种将FTPL与称为几何重采样（GR）的流行度估计过程相结合的在线缓存策略，这是该机制中第一个无后悔策略（实现次线性后悔保证）。此外，在部分观测机制中，我们还考虑了现实系统的额外操作需求（即长期约束）的缓存问题，并提出了一个结合GR的修改版本的FTRL来解决这一挑战设置。理论分析表明，该缓存策略能够在满足预期操作长期约束的前提下实现无遗憾保证。最后，我们进行了数值实验来验证我们提出的缓存策略的理论保证。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Transactions on Services Computing COMPUTER SCIENCE, INFORMATION SYSTEMS-COMPUTER SCIENCE, SOFTWARE ENGINEERING

CiteScore

11.50

自引率

6.20%

发文量

278

审稿时长

>12 weeks

期刊介绍： IEEE Transactions on Services Computing encompasses the computing and software aspects of the science and technology of services innovation research and development. It places emphasis on algorithmic, mathematical, statistical, and computational methods central to services computing. Topics covered include Service Oriented Architecture, Web Services, Business Process Integration, Solution Performance Management, and Services Operations and Management. The transactions address mathematical foundations, security, privacy, agreement, contract, discovery, negotiation, collaboration, and quality of service for web services. It also covers areas like composite web service creation, business and scientific applications, standards, utility models, business process modeling, integration, collaboration, and more in the realm of Services Computing.