基于层次结构的强化学习算法集成方法

Daniil Kozlov, V. Myasnikov
{"title":"基于层次结构的强化学习算法集成方法","authors":"Daniil Kozlov, V. Myasnikov","doi":"10.1109/ITNT57377.2023.10139122","DOIUrl":null,"url":null,"abstract":"The article proposes an ensemble method for reinforcement learning algorithms. The proposed approach is on average more efficient than each of the algorithms in the ensemble separately. The article discusses the implementation of the method, which includes an ensemble of REDQ and SAC algorithms. The output from the ensemble is the output of the algorithm selected following the output of the DQN acting as the control algorithm. It is possible to ensemble other algorithms in a different quantity. Reinforcement learning is a promising area in machine learning. An important unsolved problem of reinforcement learning is the generalization of complex problems, and their solution using meta-algorithms. The proposed method can be used in complex problems consisting of many subtasks, effective solutions for which can be offered by various algorithms from the ensemble.","PeriodicalId":296438,"journal":{"name":"2023 IX International Conference on Information Technology and Nanotechnology (ITNT)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Ensemble Method for Reinforcement Learning Algorithms Based on Hierarchy\",\"authors\":\"Daniil Kozlov, V. Myasnikov\",\"doi\":\"10.1109/ITNT57377.2023.10139122\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The article proposes an ensemble method for reinforcement learning algorithms. The proposed approach is on average more efficient than each of the algorithms in the ensemble separately. The article discusses the implementation of the method, which includes an ensemble of REDQ and SAC algorithms. The output from the ensemble is the output of the algorithm selected following the output of the DQN acting as the control algorithm. It is possible to ensemble other algorithms in a different quantity. Reinforcement learning is a promising area in machine learning. An important unsolved problem of reinforcement learning is the generalization of complex problems, and their solution using meta-algorithms. The proposed method can be used in complex problems consisting of many subtasks, effective solutions for which can be offered by various algorithms from the ensemble.\",\"PeriodicalId\":296438,\"journal\":{\"name\":\"2023 IX International Conference on Information Technology and Nanotechnology (ITNT)\",\"volume\":\"24 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-04-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IX International Conference on Information Technology and Nanotechnology (ITNT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ITNT57377.2023.10139122\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IX International Conference on Information Technology and Nanotechnology (ITNT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITNT57377.2023.10139122","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

本文提出了一种强化学习算法的集成方法。所提出的方法平均比集成中的每一种算法都要高效。本文讨论了该方法的实现,其中包括REDQ和SAC算法的集成。集成的输出是在DQN作为控制算法的输出之后选择的算法的输出。以不同的数量集成其他算法是可能的。强化学习是机器学习中一个很有前途的领域。强化学习尚未解决的一个重要问题是复杂问题的泛化及其使用元算法的解决方案。该方法可用于由多个子任务组成的复杂问题,并可通过集成中的各种算法提供有效的解决方案。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Ensemble Method for Reinforcement Learning Algorithms Based on Hierarchy
The article proposes an ensemble method for reinforcement learning algorithms. The proposed approach is on average more efficient than each of the algorithms in the ensemble separately. The article discusses the implementation of the method, which includes an ensemble of REDQ and SAC algorithms. The output from the ensemble is the output of the algorithm selected following the output of the DQN acting as the control algorithm. It is possible to ensemble other algorithms in a different quantity. Reinforcement learning is a promising area in machine learning. An important unsolved problem of reinforcement learning is the generalization of complex problems, and their solution using meta-algorithms. The proposed method can be used in complex problems consisting of many subtasks, effective solutions for which can be offered by various algorithms from the ensemble.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Cooperative Application of Vehicular Traffic Rerouting Method and Adaptive Traffic Signal Control Method Analysis of the Influence of Space Weather Factors on the Telemetry Parameters of Small Spacecraft in Low Earth Orbit Correlations and Statistical Memory Effects as Markers of Age-related Changes in Complex Systems of Living Nature Visualization of feature spaces based on spectral and texture characteristics Electrically controlled optical spectral filters for WDM communication networks based on multilayer inhomogeneous holographic diffraction structures
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1