基于模型的深度强化学习，从流动模拟中加速学习

IF 1.9 3区工程技术 Q3 MECHANICS Meccanica Pub Date : 2024-05-14 DOI:10.1007/s11012-024-01808-z

Andre Weiner, Janis Geise

{"title":"基于模型的深度强化学习，从流动模拟中加速学习","authors":"Andre Weiner, Janis Geise","doi":"10.1007/s11012-024-01808-z","DOIUrl":null,"url":null,"abstract":"In recent years, deep reinforcement learning has emerged as a technique to solve closed-loop flow control problems. Employing simulation-based environments in reinforcement learning enables a priori end-to-end optimization of the control system, provides a virtual testbed for safety-critical control applications, and allows to gain a deep understanding of the control mechanisms. While reinforcement learning has been applied successfully in a number of rather simple flow control benchmarks, a major bottleneck toward real-world applications is the high computational cost and turnaround time of flow simulations. In this contribution, we demonstrate the benefits of model-based reinforcement learning for flow control applications. Specifically, we optimize the policy by alternating between trajectories sampled from flow simulations and trajectories sampled from an ensemble of environment models. The model-based learning reduces the overall training time by up to \\(85\\%\\) for the fluidic pinball test case. Even larger savings are expected for more demanding flow simulations.","PeriodicalId":695,"journal":{"name":"Meccanica","volume":"23 1","pages":""},"PeriodicalIF":1.9000,"publicationDate":"2024-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Model-based deep reinforcement learning for accelerated learning from flow simulations\",\"authors\":\"Andre Weiner, Janis Geise\",\"doi\":\"10.1007/s11012-024-01808-z\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In recent years, deep reinforcement learning has emerged as a technique to solve closed-loop flow control problems. Employing simulation-based environments in reinforcement learning enables a priori end-to-end optimization of the control system, provides a virtual testbed for safety-critical control applications, and allows to gain a deep understanding of the control mechanisms. While reinforcement learning has been applied successfully in a number of rather simple flow control benchmarks, a major bottleneck toward real-world applications is the high computational cost and turnaround time of flow simulations. In this contribution, we demonstrate the benefits of model-based reinforcement learning for flow control applications. Specifically, we optimize the policy by alternating between trajectories sampled from flow simulations and trajectories sampled from an ensemble of environment models. The model-based learning reduces the overall training time by up to \\\\(85\\\\%\\\\) for the fluidic pinball test case. Even larger savings are expected for more demanding flow simulations.\",\"PeriodicalId\":695,\"journal\":{\"name\":\"Meccanica\",\"volume\":\"23 1\",\"pages\":\"\"},\"PeriodicalIF\":1.9000,\"publicationDate\":\"2024-05-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Meccanica\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1007/s11012-024-01808-z\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"MECHANICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Meccanica","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s11012-024-01808-z","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MECHANICS","Score":null,"Total":0}

引用次数: 0

摘要

近年来，深度强化学习已成为解决闭环流量控制问题的一种技术。在强化学习中采用基于仿真的环境，可以事先对控制系统进行端到端优化，为安全关键型控制应用提供虚拟试验台，并能深入了解控制机制。虽然强化学习已成功应用于许多相当简单的流量控制基准，但现实世界应用的一个主要瓶颈是流量模拟的高计算成本和周转时间。在本文中，我们展示了基于模型的强化学习在流量控制应用中的优势。具体来说，我们通过交替使用从流量模拟中采样的轨迹和从环境模型集合中采样的轨迹来优化策略。在流体弹球测试案例中，基于模型的学习将整体训练时间减少了（85%）。对于要求更高的流动模拟，预计还能节省更多时间。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

摘要图片

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Model-based deep reinforcement learning for accelerated learning from flow simulations

In recent years, deep reinforcement learning has emerged as a technique to solve closed-loop flow control problems. Employing simulation-based environments in reinforcement learning enables a priori end-to-end optimization of the control system, provides a virtual testbed for safety-critical control applications, and allows to gain a deep understanding of the control mechanisms. While reinforcement learning has been applied successfully in a number of rather simple flow control benchmarks, a major bottleneck toward real-world applications is the high computational cost and turnaround time of flow simulations. In this contribution, we demonstrate the benefits of model-based reinforcement learning for flow control applications. Specifically, we optimize the policy by alternating between trajectories sampled from flow simulations and trajectories sampled from an ensemble of environment models. The model-based learning reduces the overall training time by up to \(85\%\) for the fluidic pinball test case. Even larger savings are expected for more demanding flow simulations.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Meccanica 物理-力学

CiteScore

4.70

自引率

3.70%

发文量

151

审稿时长

7 months

期刊介绍： Meccanica focuses on the methodological framework shared by mechanical scientists when addressing theoretical or applied problems. Original papers address various aspects of mechanical and mathematical modeling, of solution, as well as of analysis of system behavior. The journal explores fundamental and applications issues in established areas of mechanics research as well as in emerging fields; contemporary research on general mechanics, solid and structural mechanics, fluid mechanics, and mechanics of machines; interdisciplinary fields between mechanics and other mathematical and engineering sciences; interaction of mechanics with dynamical systems, advanced materials, control and computation; electromechanics; biomechanics. Articles include full length papers; topical overviews; brief notes; discussions and comments on published papers; book reviews; and an international calendar of conferences. Meccanica, the official journal of the Italian Association of Theoretical and Applied Mechanics, was established in 1966.