A non-asymptotic analysis of adaptive TD(λ) learning in wireless sensor networks

IF 1.9 4区计算机科学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS International Journal of Distributed Sensor Networks Pub Date : 2022-07-01 DOI:10.1177/15501329221114546

Bing Li, Tao Li, Muhua Liu, Junlong Zhu, Mingchuan Zhang, Qingtao Wu

{"title":"A non-asymptotic analysis of adaptive TD(λ) learning in wireless sensor networks","authors":"Bing Li, Tao Li, Muhua Liu, Junlong Zhu, Mingchuan Zhang, Qingtao Wu","doi":"10.1177/15501329221114546","DOIUrl":null,"url":null,"abstract":"Wireless sensor network has been widely used in different fields, such as structural health monitoring and artificial intelligence technology. The routing planning, an important part of wireless sensor network, can be formalized as an optimization problem needing to be solved. In this article, a reinforcement learning algorithm is proposed to solve the problem of optimal routing in wireless sensor networks, namely, adaptive TD( λ ) learning algorithm referred to as ADTD( λ ) under Markovian noise, which is more practical than i.i.d. (identically and independently distributed) noise in reinforcement learning. Moreover, we also present non-asymptotic analysis of ADTD( λ ) with both constant and diminishing step-sizes. Specifically, when the step-size is constant, the convergence rate of O ( 1 / T ) is achieved, where T is the number of iterations; when the step-size is diminishing, the convergence rate of O ~ ( 1 / T ) is also obtained. In addition, the performance of the algorithm is verified by simulation.","PeriodicalId":50327,"journal":{"name":"International Journal of Distributed Sensor Networks","volume":" ","pages":""},"PeriodicalIF":1.9000,"publicationDate":"2022-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Distributed Sensor Networks","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1177/15501329221114546","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Wireless sensor network has been widely used in different fields, such as structural health monitoring and artificial intelligence technology. The routing planning, an important part of wireless sensor network, can be formalized as an optimization problem needing to be solved. In this article, a reinforcement learning algorithm is proposed to solve the problem of optimal routing in wireless sensor networks, namely, adaptive TD( λ ) learning algorithm referred to as ADTD( λ ) under Markovian noise, which is more practical than i.i.d. (identically and independently distributed) noise in reinforcement learning. Moreover, we also present non-asymptotic analysis of ADTD( λ ) with both constant and diminishing step-sizes. Specifically, when the step-size is constant, the convergence rate of O ( 1 / T ) is achieved, where T is the number of iterations; when the step-size is diminishing, the convergence rate of O ~ ( 1 / T ) is also obtained. In addition, the performance of the algorithm is verified by simulation.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

无线传感器网络中自适应TD（λ）学习的非渐近分析

无线传感器网络已被广泛应用于结构健康监测和人工智能技术等不同领域。路由规划是无线传感器网络的重要组成部分，它可以形式化为一个需要解决的优化问题。本文提出了一种增强学习算法来解决无线传感器网络中的最优路由问题，即马尔可夫噪声下的自适应TD（λ）学习算法ADTD（λ。此外，我们还给出了具有恒定步长和递减步长的ADTD（λ）的非渐近分析。具体地，当步长为常数时，实现了O（1/T）的收敛速度，其中T是迭代次数；当步长减小时，也得到了O~（1/T）的收敛速度。此外，通过仿真验证了算法的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

International Journal of Distributed Sensor Networks 工程技术-电信学

CiteScore

6.50

自引率

4.30%

发文量

审稿时长

3.6 months

期刊介绍： International Journal of Distributed Sensor Networks (IJDSN) is a JCR ranked, peer-reviewed, open access journal that focuses on applied research and applications of sensor networks. The goal of this journal is to provide a forum for the publication of important research contributions in developing high performance computing solutions to problems arising from the complexities of these sensor network systems. Articles highlight advances in uses of sensor network systems for solving computational tasks in manufacturing, engineering and environmental systems.