Automatic Noise Filtering with Dynamic Sparse Training in Deep Reinforcement Learning

Adaptive Agents and Multi-Agent Systems Pub Date : 2023-02-13 DOI:10.48550/arXiv.2302.06548

Bram Grooten, Ghada Sokar, Shibhansh Dohare, Elena Mocanu, Matthew E. Taylor, Mykola Pechenizkiy, D. Mocanu

{"title":"Automatic Noise Filtering with Dynamic Sparse Training in Deep Reinforcement Learning","authors":"Bram Grooten, Ghada Sokar, Shibhansh Dohare, Elena Mocanu, Matthew E. Taylor, Mykola Pechenizkiy, D. Mocanu","doi":"10.48550/arXiv.2302.06548","DOIUrl":null,"url":null,"abstract":"Tomorrow's robots will need to distinguish useful information from noise when performing different tasks. A household robot for instance may continuously receive a plethora of information about the home, but needs to focus on just a small subset to successfully execute its current chore. Filtering distracting inputs that contain irrelevant data has received little attention in the reinforcement learning literature. To start resolving this, we formulate a problem setting in reinforcement learning called the $\\textit{extremely noisy environment}$ (ENE), where up to $99\\%$ of the input features are pure noise. Agents need to detect which features provide task-relevant information about the state of the environment. Consequently, we propose a new method termed $\\textit{Automatic Noise Filtering}$ (ANF), which uses the principles of dynamic sparse training in synergy with various deep reinforcement learning algorithms. The sparse input layer learns to focus its connectivity on task-relevant features, such that ANF-SAC and ANF-TD3 outperform standard SAC and TD3 by a large margin, while using up to $95\\%$ fewer weights. Furthermore, we devise a transfer learning setting for ENEs, by permuting all features of the environment after 1M timesteps to simulate the fact that other information sources can become relevant as the world evolves. Again, ANF surpasses the baselines in final performance and sample complexity. Our code is available at https://github.com/bramgrooten/automatic-noise-filtering","PeriodicalId":326727,"journal":{"name":"Adaptive Agents and Multi-Agent Systems","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Adaptive Agents and Multi-Agent Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2302.06548","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

Tomorrow's robots will need to distinguish useful information from noise when performing different tasks. A household robot for instance may continuously receive a plethora of information about the home, but needs to focus on just a small subset to successfully execute its current chore. Filtering distracting inputs that contain irrelevant data has received little attention in the reinforcement learning literature. To start resolving this, we formulate a problem setting in reinforcement learning called the $\textit{extremely noisy environment}$ (ENE), where up to $99\%$ of the input features are pure noise. Agents need to detect which features provide task-relevant information about the state of the environment. Consequently, we propose a new method termed $\textit{Automatic Noise Filtering}$ (ANF), which uses the principles of dynamic sparse training in synergy with various deep reinforcement learning algorithms. The sparse input layer learns to focus its connectivity on task-relevant features, such that ANF-SAC and ANF-TD3 outperform standard SAC and TD3 by a large margin, while using up to $95\%$ fewer weights. Furthermore, we devise a transfer learning setting for ENEs, by permuting all features of the environment after 1M timesteps to simulate the fact that other information sources can become relevant as the world evolves. Again, ANF surpasses the baselines in final performance and sample complexity. Our code is available at https://github.com/bramgrooten/automatic-noise-filtering

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

深度强化学习中基于动态稀疏训练的自动噪声滤波

未来的机器人在执行不同任务时需要从噪音中区分有用的信息。例如，一个家用机器人可能会不断地接收关于家庭的大量信息，但只需要关注一小部分就能成功地完成当前的家务。过滤包含不相关数据的分散输入在强化学习文献中很少受到关注。为了开始解决这个问题，我们在强化学习中制定了一个名为$\textit{extremely noisy environment}$ (ENE)的问题设置，其中高达$99\%$的输入特征是纯噪声。代理需要检测哪些特性提供了关于环境状态的任务相关信息。因此，我们提出了一种称为$\textit{Automatic Noise Filtering}$ (ANF)的新方法，该方法将动态稀疏训练原理与各种深度强化学习算法协同使用。稀疏输入层学习将其连通性集中在与任务相关的特征上，因此，ANF-SAC和ANF-TD3的性能大大优于标准SAC和TD3，同时使用的权重最多减少$95\%$。此外，我们为ENEs设计了一个迁移学习设置，通过在1M时间步后排列环境的所有特征来模拟其他信息源随着世界的发展而变得相关的事实。同样，ANF在最终性能和样本复杂性方面超过了基线。我们的代码可在https://github.com/bramgrooten/automatic-noise-filtering上获得

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Adaptive Agents and Multi-Agent Systems

自引率

0.00%

发文量

期刊最新文献

Discovering Consistent Subelections Strategic Cost Selection in Participatory Budgeting Minimizing State Exploration While Searching Graphs with Unknown Obstacles vMFER: von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement of Actor-Critic Algorithms Reinforcement Nash Equilibrium Solver