随机到达下基于harq的状态更新系统的平均AoI最小化

2022 IEEE International Conference on Internet of Things and Intelligence Systems (IoTaIS) Pub Date : 2022-11-24 DOI:10.1109/IoTaIS56727.2022.9975894

Saeid Sadeghi Vilni, Mohammad Moltafet, Markus Leinonen, M. Codreanu

{"title":"随机到达下基于harq的状态更新系统的平均AoI最小化","authors":"Saeid Sadeghi Vilni, Mohammad Moltafet, Markus Leinonen, M. Codreanu","doi":"10.1109/IoTaIS56727.2022.9975894","DOIUrl":null,"url":null,"abstract":"We consider a status update system consisting of one source, one butter-aided transmitter, and one receiver. The source randomly generates status update packets and the transmitter sends the packets to the receiver over an unreliable channel using a hybrid automatic repeat request (HARQ) protocol. The system holds two packets: one packet in the butter, which stores the last generated packet, and one packet currently under service in the transmitter. At each time slot, the transmitter decides whether to stay idle, transmit the last generated packet, or retransmit the packet currently under service. We aim to find the optimal actions at each slot to minimize the average age of information (AoI) of the source under a constraint on the average number of transmissions. We model the problem as a constrained Markov decision process (CMDP) problem and solve it for the known and unknown learning environment as follows. First, we use the Lagrangian approach to transform the CMDP problem to an MDP problem which is solved with the relative value iteration (RVI) for the known environment and with deep Q-learning (DQL) algorithm for the unknown environment. Second, we use the Lyapunov method to transform the CMDP problem to an MDP problem which is solved with DQL algorithm for the unknown environment. Simulation results assess the effectiveness of the proposed approaches.","PeriodicalId":138894,"journal":{"name":"2022 IEEE International Conference on Internet of Things and Intelligence Systems (IoTaIS)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Average AoI Minimization in an HARQ-based Status Update System under Random Arrivals\",\"authors\":\"Saeid Sadeghi Vilni, Mohammad Moltafet, Markus Leinonen, M. Codreanu\",\"doi\":\"10.1109/IoTaIS56727.2022.9975894\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We consider a status update system consisting of one source, one butter-aided transmitter, and one receiver. The source randomly generates status update packets and the transmitter sends the packets to the receiver over an unreliable channel using a hybrid automatic repeat request (HARQ) protocol. The system holds two packets: one packet in the butter, which stores the last generated packet, and one packet currently under service in the transmitter. At each time slot, the transmitter decides whether to stay idle, transmit the last generated packet, or retransmit the packet currently under service. We aim to find the optimal actions at each slot to minimize the average age of information (AoI) of the source under a constraint on the average number of transmissions. We model the problem as a constrained Markov decision process (CMDP) problem and solve it for the known and unknown learning environment as follows. First, we use the Lagrangian approach to transform the CMDP problem to an MDP problem which is solved with the relative value iteration (RVI) for the known environment and with deep Q-learning (DQL) algorithm for the unknown environment. Second, we use the Lyapunov method to transform the CMDP problem to an MDP problem which is solved with DQL algorithm for the unknown environment. Simulation results assess the effectiveness of the proposed approaches.\",\"PeriodicalId\":138894,\"journal\":{\"name\":\"2022 IEEE International Conference on Internet of Things and Intelligence Systems (IoTaIS)\",\"volume\":\"5 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE International Conference on Internet of Things and Intelligence Systems (IoTaIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IoTaIS56727.2022.9975894\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Internet of Things and Intelligence Systems (IoTaIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IoTaIS56727.2022.9975894","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

我们考虑一个由一个源、一个黄油辅助发送器和一个接收器组成的状态更新系统。源随机生成状态更新数据包，发送方使用混合自动重复请求(HARQ)协议通过不可靠的信道将数据包发送给接收方。系统保存两个包:一个包在黄油中，它存储最后生成的包，另一个包目前在发射机中工作。在每个时隙，发送器决定是否保持空闲，传输最后生成的数据包，或者重传当前正在使用的数据包。我们的目标是在平均传输次数的约束下，找到每个时隙的最优操作，以最小化源的平均信息年龄(AoI)。我们将该问题建模为约束马尔可夫决策过程(CMDP)问题，并对已知和未知的学习环境进行如下求解。首先，我们利用拉格朗日方法将CMDP问题转化为MDP问题，在已知环境下使用相对值迭代(RVI)，在未知环境下使用深度q -学习(DQL)算法。其次，我们利用Lyapunov方法将CMDP问题转化为MDP问题，并利用DQL算法对未知环境进行求解。仿真结果验证了所提方法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Average AoI Minimization in an HARQ-based Status Update System under Random Arrivals

We consider a status update system consisting of one source, one butter-aided transmitter, and one receiver. The source randomly generates status update packets and the transmitter sends the packets to the receiver over an unreliable channel using a hybrid automatic repeat request (HARQ) protocol. The system holds two packets: one packet in the butter, which stores the last generated packet, and one packet currently under service in the transmitter. At each time slot, the transmitter decides whether to stay idle, transmit the last generated packet, or retransmit the packet currently under service. We aim to find the optimal actions at each slot to minimize the average age of information (AoI) of the source under a constraint on the average number of transmissions. We model the problem as a constrained Markov decision process (CMDP) problem and solve it for the known and unknown learning environment as follows. First, we use the Lagrangian approach to transform the CMDP problem to an MDP problem which is solved with the relative value iteration (RVI) for the known environment and with deep Q-learning (DQL) algorithm for the unknown environment. Second, we use the Lyapunov method to transform the CMDP problem to an MDP problem which is solved with DQL algorithm for the unknown environment. Simulation results assess the effectiveness of the proposed approaches.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 IEEE International Conference on Internet of Things and Intelligence Systems (IoTaIS)

自引率

0.00%

发文量