跟踪片断静止序列的平均值

Indian Journal of Pure and Applied Mathematics Pub Date : 2024-07-03 DOI:10.1007/s13226-024-00641-0

Ghurumuruhan Ganesan

{"title":"跟踪片断静止序列的平均值","authors":"Ghurumuruhan Ganesan","doi":"10.1007/s13226-024-00641-0","DOIUrl":null,"url":null,"abstract":"<p>In this paper we study the problem of tracking the mean of a piecewise stationary sequence of independent random variables. First we consider the case where the transition times are known and show that a direct running average performs the tracking in short time and with high accuracy. We then use a single valued weighted running average with a tunable parameter for the case when transition times are unknown and establish deviation bounds for the tracking accuracy. Our result has applications in choosing the optimal rewards for the multiarmed bandit scenario.</p>","PeriodicalId":501427,"journal":{"name":"Indian Journal of Pure and Applied Mathematics","volume":"745 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Tracking the mean of a piecewise stationary sequence\",\"authors\":\"Ghurumuruhan Ganesan\",\"doi\":\"10.1007/s13226-024-00641-0\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>In this paper we study the problem of tracking the mean of a piecewise stationary sequence of independent random variables. First we consider the case where the transition times are known and show that a direct running average performs the tracking in short time and with high accuracy. We then use a single valued weighted running average with a tunable parameter for the case when transition times are unknown and establish deviation bounds for the tracking accuracy. Our result has applications in choosing the optimal rewards for the multiarmed bandit scenario.</p>\",\"PeriodicalId\":501427,\"journal\":{\"name\":\"Indian Journal of Pure and Applied Mathematics\",\"volume\":\"745 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-07-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Indian Journal of Pure and Applied Mathematics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1007/s13226-024-00641-0\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Indian Journal of Pure and Applied Mathematics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s13226-024-00641-0","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

本文研究了独立随机变量片断静止序列均值的跟踪问题。首先，我们考虑了过渡时间已知的情况，并证明直接运行平均法能在短时间内高精度地完成跟踪。然后，我们在过渡时间未知的情况下使用带有可调参数的单值加权运行平均法，并建立了跟踪精度的偏差边界。我们的结果可应用于选择多臂强盗方案的最优奖励。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Tracking the mean of a piecewise stationary sequence

In this paper we study the problem of tracking the mean of a piecewise stationary sequence of independent random variables. First we consider the case where the transition times are known and show that a direct running average performs the tracking in short time and with high accuracy. We then use a single valued weighted running average with a tunable parameter for the case when transition times are unknown and establish deviation bounds for the tracking accuracy. Our result has applications in choosing the optimal rewards for the multiarmed bandit scenario.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Indian Journal of Pure and Applied Mathematics

自引率

0.00%

发文量