Optimal Caching for Partial-Observation Regime and Beyond

IF 5.8 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS IEEE Transactions on Services Computing Pub Date : 2024-08-28 DOI:10.1109/TSC.2024.3451163
Zifan Jia;Qingsong Liu;Jiang Zhou;Xiaoyan Gu;Yaoyu Zhang;Bo Li;Weiping Wang
{"title":"Optimal Caching for Partial-Observation Regime and Beyond","authors":"Zifan Jia;Qingsong Liu;Jiang Zhou;Xiaoyan Gu;Yaoyu Zhang;Bo Li;Weiping Wang","doi":"10.1109/TSC.2024.3451163","DOIUrl":null,"url":null,"abstract":"We study the caching problem from an online learning point-of-view, i.e., no model assumptions and prior knowledge for the file request sequence. Our goal is to design an efficient online caching policy with minimal regret, i.e., minimizing the total number of cache miss with respect to the best static configuration in hindsight. Previous studies, such as Follow-The-Perturbed-Leader (FTPL) and Follow-The-Regularized-Leader (FTRL) caching policies, have provided some near-optimal results, but their theoretical performance guarantees only valid for the regime wherein all arrival requests could be seen by the cache, which is not the case in some practical scenarios. Hence our work closes this gap by considering the partial-observation regime wherein only requests for currently cached files are seen by the cache, which is more challenging and has not been studied before. We propose an online caching policy integrating the FTPL with a popularity estimation procedure called Geometric Resampling (GR), which is the first no-regret policy in this regime (achieve sublinear regret guarantee). Moreover, in the partial-observation regime, we also consider the caching problem with additional operational requirements of real-world systems, i.e., long-term constraints, and proposed a modified version of FTRL combining with GR to address this challenge setting. The theoretical analysis shows that this caching policy is able to achieve no-regret guarantee while satisfying the operational long-term constraints in expectation. Finally, we conduct numerical experiments to validate the theoretical guarantees of our proposed caching policies.","PeriodicalId":13255,"journal":{"name":"IEEE Transactions on Services Computing","volume":"17 6","pages":"4041-4054"},"PeriodicalIF":5.8000,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Services Computing","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10654576/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

We study the caching problem from an online learning point-of-view, i.e., no model assumptions and prior knowledge for the file request sequence. Our goal is to design an efficient online caching policy with minimal regret, i.e., minimizing the total number of cache miss with respect to the best static configuration in hindsight. Previous studies, such as Follow-The-Perturbed-Leader (FTPL) and Follow-The-Regularized-Leader (FTRL) caching policies, have provided some near-optimal results, but their theoretical performance guarantees only valid for the regime wherein all arrival requests could be seen by the cache, which is not the case in some practical scenarios. Hence our work closes this gap by considering the partial-observation regime wherein only requests for currently cached files are seen by the cache, which is more challenging and has not been studied before. We propose an online caching policy integrating the FTPL with a popularity estimation procedure called Geometric Resampling (GR), which is the first no-regret policy in this regime (achieve sublinear regret guarantee). Moreover, in the partial-observation regime, we also consider the caching problem with additional operational requirements of real-world systems, i.e., long-term constraints, and proposed a modified version of FTRL combining with GR to address this challenge setting. The theoretical analysis shows that this caching policy is able to achieve no-regret guarantee while satisfying the operational long-term constraints in expectation. Finally, we conduct numerical experiments to validate the theoretical guarantees of our proposed caching policies.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
部分观测时段及其他时段的优化缓存
我们从在线学习的角度研究缓存问题,即没有模型假设和文件请求序列的先验知识。我们的目标是设计一个具有最小遗憾的高效在线缓存策略,也就是说,最小化缓存丢失的总数,相对于事后的最佳静态配置。以前的研究,如跟随扰流领导者(FTPL)和跟随正则化领导者(FTRL)缓存策略,已经提供了一些接近最优的结果,但它们的理论性能保证只适用于所有到达请求都可以被缓存看到的机制,而在一些实际场景中并非如此。因此,我们的工作通过考虑部分观察机制来缩小这一差距,其中只有对当前缓存文件的请求才会被缓存看到,这更具挑战性,以前没有研究过。我们提出了一种将FTPL与称为几何重采样(GR)的流行度估计过程相结合的在线缓存策略,这是该机制中第一个无后悔策略(实现次线性后悔保证)。此外,在部分观测机制中,我们还考虑了现实系统的额外操作需求(即长期约束)的缓存问题,并提出了一个结合GR的修改版本的FTRL来解决这一挑战设置。理论分析表明,该缓存策略能够在满足预期操作长期约束的前提下实现无遗憾保证。最后,我们进行了数值实验来验证我们提出的缓存策略的理论保证。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Transactions on Services Computing
IEEE Transactions on Services Computing COMPUTER SCIENCE, INFORMATION SYSTEMS-COMPUTER SCIENCE, SOFTWARE ENGINEERING
CiteScore
11.50
自引率
6.20%
发文量
278
审稿时长
>12 weeks
期刊介绍: IEEE Transactions on Services Computing encompasses the computing and software aspects of the science and technology of services innovation research and development. It places emphasis on algorithmic, mathematical, statistical, and computational methods central to services computing. Topics covered include Service Oriented Architecture, Web Services, Business Process Integration, Solution Performance Management, and Services Operations and Management. The transactions address mathematical foundations, security, privacy, agreement, contract, discovery, negotiation, collaboration, and quality of service for web services. It also covers areas like composite web service creation, business and scientific applications, standards, utility models, business process modeling, integration, collaboration, and more in the realm of Services Computing.
期刊最新文献
Two-Phase Account Group Migration Service with Dynamic Load Awareness for Optimizing Sharding Blockchain Privacy-Protected Joint Service placement and Task Offloading for Knowledge-Defined Cloud-Edge Networking QoS-Aware Deep Reinforcement Learning for Dynamic CPU Pinning of Co-located Cloud Workloads Did I Vet You Before? Assessing the Chrome Web Store Vetting Process through Browser Extension Similarity MARS: A Multi-Agent Collaborative Reasoning Framework for Service Recommendation
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1