{"title":"具有未知延迟统计的目标导向状态更新在线学习","authors":"Fuzhou Peng;Xijun Wang;Xiang Chen","doi":"10.1109/JSAC.2024.3431522","DOIUrl":null,"url":null,"abstract":"With the proliferation of communication demand, goal-oriented communication goes beyond traditional bit-level approaches by emphasizing the significance of information and its relevance to specific goals. This paper addresses the goal-oriented status updating problem, where detecting status changes is crucial. We employ the Age of Changed Information (AoCI) as a metric, which considers both the timeliness and content of the update. Our goal is to minimize the weighted sum of AoCI and transmission cost without channel delay statistics. The investigated problem is formulated as a semi-Markov decision process (SMDP) and is tackled by converting it into a multi-variable optimization problem. We prove that the optimal updating policy is of threshold type, and derive a nearly closed-form expression for the optimal threshold. When delay statistics are available, the optimal threshold can be obtained by a bisection searching algorithm. In the absence of prior delay statistics, we develop an online learning policy. We demonstrate that the optimality gap decays at a rate of \n<inline-formula> <tex-math>$\\mathcal {O}(\\log K / K)$ </tex-math></inline-formula>\n, where K is the number of samples. Simulation results are presented to compare the performance of various policies under different statistical conditions, showcasing the superiority of our proposed algorithm.","PeriodicalId":73294,"journal":{"name":"IEEE journal on selected areas in communications : a publication of the IEEE Communications Society","volume":"42 11","pages":"3293-3305"},"PeriodicalIF":0.0000,"publicationDate":"2024-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Online Learning of Goal-Oriented Status Updating With Unknown Delay Statistics\",\"authors\":\"Fuzhou Peng;Xijun Wang;Xiang Chen\",\"doi\":\"10.1109/JSAC.2024.3431522\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the proliferation of communication demand, goal-oriented communication goes beyond traditional bit-level approaches by emphasizing the significance of information and its relevance to specific goals. This paper addresses the goal-oriented status updating problem, where detecting status changes is crucial. We employ the Age of Changed Information (AoCI) as a metric, which considers both the timeliness and content of the update. Our goal is to minimize the weighted sum of AoCI and transmission cost without channel delay statistics. The investigated problem is formulated as a semi-Markov decision process (SMDP) and is tackled by converting it into a multi-variable optimization problem. We prove that the optimal updating policy is of threshold type, and derive a nearly closed-form expression for the optimal threshold. When delay statistics are available, the optimal threshold can be obtained by a bisection searching algorithm. In the absence of prior delay statistics, we develop an online learning policy. We demonstrate that the optimality gap decays at a rate of \\n<inline-formula> <tex-math>$\\\\mathcal {O}(\\\\log K / K)$ </tex-math></inline-formula>\\n, where K is the number of samples. Simulation results are presented to compare the performance of various policies under different statistical conditions, showcasing the superiority of our proposed algorithm.\",\"PeriodicalId\":73294,\"journal\":{\"name\":\"IEEE journal on selected areas in communications : a publication of the IEEE Communications Society\",\"volume\":\"42 11\",\"pages\":\"3293-3305\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-07-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE journal on selected areas in communications : a publication of the IEEE Communications Society\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10605803/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE journal on selected areas in communications : a publication of the IEEE Communications Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10605803/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
随着通信需求的激增,以目标为导向的通信超越了传统的比特级方法,强调信息的重要性及其与特定目标的相关性。本文探讨了以目标为导向的状态更新问题,其中检测状态变化至关重要。我们采用 "变化信息年龄"(AoCI)作为衡量标准,它同时考虑了更新的及时性和内容。我们的目标是在不统计信道延迟的情况下,使 AoCI 和传输成本的加权和最小化。所研究的问题被表述为半马尔可夫决策过程(SMDP),并通过将其转换为多变量优化问题来解决。我们证明了最优更新策略属于阈值类型,并推导出最优阈值的近似闭式表达式。在有延迟统计数据的情况下,可以通过分段搜索算法获得最佳阈值。在没有先验延迟统计数据的情况下,我们开发了一种在线学习策略。我们证明,最优性差距的衰减率为 $\mathcal {O}(\log K / K)$ ,其中 K 是样本数。仿真结果比较了不同统计条件下各种策略的性能,展示了我们提出的算法的优越性。
Online Learning of Goal-Oriented Status Updating With Unknown Delay Statistics
With the proliferation of communication demand, goal-oriented communication goes beyond traditional bit-level approaches by emphasizing the significance of information and its relevance to specific goals. This paper addresses the goal-oriented status updating problem, where detecting status changes is crucial. We employ the Age of Changed Information (AoCI) as a metric, which considers both the timeliness and content of the update. Our goal is to minimize the weighted sum of AoCI and transmission cost without channel delay statistics. The investigated problem is formulated as a semi-Markov decision process (SMDP) and is tackled by converting it into a multi-variable optimization problem. We prove that the optimal updating policy is of threshold type, and derive a nearly closed-form expression for the optimal threshold. When delay statistics are available, the optimal threshold can be obtained by a bisection searching algorithm. In the absence of prior delay statistics, we develop an online learning policy. We demonstrate that the optimality gap decays at a rate of
$\mathcal {O}(\log K / K)$
, where K is the number of samples. Simulation results are presented to compare the performance of various policies under different statistical conditions, showcasing the superiority of our proposed algorithm.