连接随机优化控制和强化学习

IF 1.2 3区 物理与天体物理 Q3 PHYSICS, MATHEMATICAL Journal of Mathematical Physics Pub Date : 2024-08-19 DOI:10.1063/5.0140665
J. Quer, Enric Ribera Borrell
{"title":"连接随机优化控制和强化学习","authors":"J. Quer, Enric Ribera Borrell","doi":"10.1063/5.0140665","DOIUrl":null,"url":null,"abstract":"In this paper the connection between stochastic optimal control and reinforcement learning is investigated. Our main motivation is to apply importance sampling to sampling rare events which can be reformulated as an optimal control problem. By using a parameterised approach the optimal control problem becomes a stochastic optimization problem which still raises some open questions regarding how to tackle the scalability to high-dimensional problems and how to deal with the intrinsic metastability of the system. To explore new methods we link the optimal control problem to reinforcement learning since both share the same underlying framework, namely a Markov Decision Process (MDP). For the optimal control problem we show how the MDP can be formulated. In addition we discuss how the stochastic optimal control problem can be interpreted in the framework of reinforcement learning. At the end of the article we present the application of two different reinforcement learning algorithms to the optimal control problem and a comparison of the advantages and disadvantages of the two algorithms.","PeriodicalId":16174,"journal":{"name":"Journal of Mathematical Physics","volume":"1 1","pages":""},"PeriodicalIF":1.2000,"publicationDate":"2024-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Connecting stochastic optimal control and reinforcement learning\",\"authors\":\"J. Quer, Enric Ribera Borrell\",\"doi\":\"10.1063/5.0140665\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper the connection between stochastic optimal control and reinforcement learning is investigated. Our main motivation is to apply importance sampling to sampling rare events which can be reformulated as an optimal control problem. By using a parameterised approach the optimal control problem becomes a stochastic optimization problem which still raises some open questions regarding how to tackle the scalability to high-dimensional problems and how to deal with the intrinsic metastability of the system. To explore new methods we link the optimal control problem to reinforcement learning since both share the same underlying framework, namely a Markov Decision Process (MDP). For the optimal control problem we show how the MDP can be formulated. In addition we discuss how the stochastic optimal control problem can be interpreted in the framework of reinforcement learning. At the end of the article we present the application of two different reinforcement learning algorithms to the optimal control problem and a comparison of the advantages and disadvantages of the two algorithms.\",\"PeriodicalId\":16174,\"journal\":{\"name\":\"Journal of Mathematical Physics\",\"volume\":\"1 1\",\"pages\":\"\"},\"PeriodicalIF\":1.2000,\"publicationDate\":\"2024-08-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Mathematical Physics\",\"FirstCategoryId\":\"101\",\"ListUrlMain\":\"https://doi.org/10.1063/5.0140665\",\"RegionNum\":3,\"RegionCategory\":\"物理与天体物理\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"PHYSICS, MATHEMATICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Mathematical Physics","FirstCategoryId":"101","ListUrlMain":"https://doi.org/10.1063/5.0140665","RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"PHYSICS, MATHEMATICAL","Score":null,"Total":0}
引用次数: 0

摘要

本文研究了随机最优控制与强化学习之间的联系。我们的主要动机是将重要性采样应用于罕见事件的采样,这可以重新表述为一个最优控制问题。通过使用参数化方法,最优控制问题变成了一个随机优化问题,而这一问题在如何解决高维问题的可扩展性以及如何处理系统内在的不稳定性方面仍存在一些悬而未决的问题。为了探索新方法,我们将最优控制问题与强化学习联系起来,因为二者具有相同的基础框架,即马尔可夫决策过程(MDP)。对于最优控制问题,我们展示了如何制定马尔可夫决策过程。此外,我们还讨论了如何在强化学习框架下解释随机最优控制问题。文章最后,我们介绍了两种不同的强化学习算法在最优控制问题中的应用,并对两种算法的优缺点进行了比较。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Connecting stochastic optimal control and reinforcement learning
In this paper the connection between stochastic optimal control and reinforcement learning is investigated. Our main motivation is to apply importance sampling to sampling rare events which can be reformulated as an optimal control problem. By using a parameterised approach the optimal control problem becomes a stochastic optimization problem which still raises some open questions regarding how to tackle the scalability to high-dimensional problems and how to deal with the intrinsic metastability of the system. To explore new methods we link the optimal control problem to reinforcement learning since both share the same underlying framework, namely a Markov Decision Process (MDP). For the optimal control problem we show how the MDP can be formulated. In addition we discuss how the stochastic optimal control problem can be interpreted in the framework of reinforcement learning. At the end of the article we present the application of two different reinforcement learning algorithms to the optimal control problem and a comparison of the advantages and disadvantages of the two algorithms.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Mathematical Physics
Journal of Mathematical Physics 物理-物理:数学物理
CiteScore
2.20
自引率
15.40%
发文量
396
审稿时长
4.3 months
期刊介绍: Since 1960, the Journal of Mathematical Physics (JMP) has published some of the best papers from outstanding mathematicians and physicists. JMP was the first journal in the field of mathematical physics and publishes research that connects the application of mathematics to problems in physics, as well as illustrates the development of mathematical methods for such applications and for the formulation of physical theories. The Journal of Mathematical Physics (JMP) features content in all areas of mathematical physics. Specifically, the articles focus on areas of research that illustrate the application of mathematics to problems in physics, the development of mathematical methods for such applications, and for the formulation of physical theories. The mathematics featured in the articles are written so that theoretical physicists can understand them. JMP also publishes review articles on mathematical subjects relevant to physics as well as special issues that combine manuscripts on a topic of current interest to the mathematical physics community. JMP welcomes original research of the highest quality in all active areas of mathematical physics, including the following: Partial Differential Equations Representation Theory and Algebraic Methods Many Body and Condensed Matter Physics Quantum Mechanics - General and Nonrelativistic Quantum Information and Computation Relativistic Quantum Mechanics, Quantum Field Theory, Quantum Gravity, and String Theory General Relativity and Gravitation Dynamical Systems Classical Mechanics and Classical Fields Fluids Statistical Physics Methods of Mathematical Physics.
期刊最新文献
Extreme eigenvalues of random matrices from Jacobi ensembles How the non-metricity of the connection arises naturally in the classical theory of gravity Vertex coupling interpolation in quantum chain graphs Rescaling transformations and the Grothendieck bound formalism in a single quantum system Integrable decompositions for the (2 + 1)-dimensional multi-component Ablowitz–Kaup–Newell–Segur hierarchy and their applications
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1