连接随机优化控制和强化学习

IF 1.2 3区物理与天体物理 Q3 PHYSICS, MATHEMATICAL Journal of Mathematical Physics Pub Date : 2024-08-19 DOI:10.1063/5.0140665

J. Quer, Enric Ribera Borrell

{"title":"连接随机优化控制和强化学习","authors":"J. Quer, Enric Ribera Borrell","doi":"10.1063/5.0140665","DOIUrl":null,"url":null,"abstract":"In this paper the connection between stochastic optimal control and reinforcement learning is investigated. Our main motivation is to apply importance sampling to sampling rare events which can be reformulated as an optimal control problem. By using a parameterised approach the optimal control problem becomes a stochastic optimization problem which still raises some open questions regarding how to tackle the scalability to high-dimensional problems and how to deal with the intrinsic metastability of the system. To explore new methods we link the optimal control problem to reinforcement learning since both share the same underlying framework, namely a Markov Decision Process (MDP). For the optimal control problem we show how the MDP can be formulated. In addition we discuss how the stochastic optimal control problem can be interpreted in the framework of reinforcement learning. At the end of the article we present the application of two different reinforcement learning algorithms to the optimal control problem and a comparison of the advantages and disadvantages of the two algorithms.","PeriodicalId":16174,"journal":{"name":"Journal of Mathematical Physics","volume":"1 1","pages":""},"PeriodicalIF":1.2000,"publicationDate":"2024-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Connecting stochastic optimal control and reinforcement learning\",\"authors\":\"J. Quer, Enric Ribera Borrell\",\"doi\":\"10.1063/5.0140665\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper the connection between stochastic optimal control and reinforcement learning is investigated. Our main motivation is to apply importance sampling to sampling rare events which can be reformulated as an optimal control problem. By using a parameterised approach the optimal control problem becomes a stochastic optimization problem which still raises some open questions regarding how to tackle the scalability to high-dimensional problems and how to deal with the intrinsic metastability of the system. To explore new methods we link the optimal control problem to reinforcement learning since both share the same underlying framework, namely a Markov Decision Process (MDP). For the optimal control problem we show how the MDP can be formulated. In addition we discuss how the stochastic optimal control problem can be interpreted in the framework of reinforcement learning. At the end of the article we present the application of two different reinforcement learning algorithms to the optimal control problem and a comparison of the advantages and disadvantages of the two algorithms.\",\"PeriodicalId\":16174,\"journal\":{\"name\":\"Journal of Mathematical Physics\",\"volume\":\"1 1\",\"pages\":\"\"},\"PeriodicalIF\":1.2000,\"publicationDate\":\"2024-08-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Mathematical Physics\",\"FirstCategoryId\":\"101\",\"ListUrlMain\":\"https://doi.org/10.1063/5.0140665\",\"RegionNum\":3,\"RegionCategory\":\"物理与天体物理\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"PHYSICS, MATHEMATICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Mathematical Physics","FirstCategoryId":"101","ListUrlMain":"https://doi.org/10.1063/5.0140665","RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"PHYSICS, MATHEMATICAL","Score":null,"Total":0}

引用次数: 0

摘要

本文研究了随机最优控制与强化学习之间的联系。我们的主要动机是将重要性采样应用于罕见事件的采样，这可以重新表述为一个最优控制问题。通过使用参数化方法，最优控制问题变成了一个随机优化问题，而这一问题在如何解决高维问题的可扩展性以及如何处理系统内在的不稳定性方面仍存在一些悬而未决的问题。为了探索新方法，我们将最优控制问题与强化学习联系起来，因为二者具有相同的基础框架，即马尔可夫决策过程（MDP）。对于最优控制问题，我们展示了如何制定马尔可夫决策过程。此外，我们还讨论了如何在强化学习框架下解释随机最优控制问题。文章最后，我们介绍了两种不同的强化学习算法在最优控制问题中的应用，并对两种算法的优缺点进行了比较。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Connecting stochastic optimal control and reinforcement learning

In this paper the connection between stochastic optimal control and reinforcement learning is investigated. Our main motivation is to apply importance sampling to sampling rare events which can be reformulated as an optimal control problem. By using a parameterised approach the optimal control problem becomes a stochastic optimization problem which still raises some open questions regarding how to tackle the scalability to high-dimensional problems and how to deal with the intrinsic metastability of the system. To explore new methods we link the optimal control problem to reinforcement learning since both share the same underlying framework, namely a Markov Decision Process (MDP). For the optimal control problem we show how the MDP can be formulated. In addition we discuss how the stochastic optimal control problem can be interpreted in the framework of reinforcement learning. At the end of the article we present the application of two different reinforcement learning algorithms to the optimal control problem and a comparison of the advantages and disadvantages of the two algorithms.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of Mathematical Physics 物理-物理：数学物理

CiteScore

2.20

自引率

15.40%

发文量

396

审稿时长

4.3 months

期刊介绍： Since 1960, the Journal of Mathematical Physics (JMP) has published some of the best papers from outstanding mathematicians and physicists. JMP was the first journal in the field of mathematical physics and publishes research that connects the application of mathematics to problems in physics, as well as illustrates the development of mathematical methods for such applications and for the formulation of physical theories. The Journal of Mathematical Physics (JMP) features content in all areas of mathematical physics. Specifically, the articles focus on areas of research that illustrate the application of mathematics to problems in physics, the development of mathematical methods for such applications, and for the formulation of physical theories. The mathematics featured in the articles are written so that theoretical physicists can understand them. JMP also publishes review articles on mathematical subjects relevant to physics as well as special issues that combine manuscripts on a topic of current interest to the mathematical physics community. JMP welcomes original research of the highest quality in all active areas of mathematical physics, including the following: Partial Differential Equations Representation Theory and Algebraic Methods Many Body and Condensed Matter Physics Quantum Mechanics - General and Nonrelativistic Quantum Information and Computation Relativistic Quantum Mechanics, Quantum Field Theory, Quantum Gravity, and String Theory General Relativity and Gravitation Dynamical Systems Classical Mechanics and Classical Fields Fluids Statistical Physics Methods of Mathematical Physics.