基于模型的深度强化学习,从流动模拟中加速学习

IF 1.9 3区 工程技术 Q3 MECHANICS Meccanica Pub Date : 2024-05-14 DOI:10.1007/s11012-024-01808-z
Andre Weiner, Janis Geise
{"title":"基于模型的深度强化学习,从流动模拟中加速学习","authors":"Andre Weiner, Janis Geise","doi":"10.1007/s11012-024-01808-z","DOIUrl":null,"url":null,"abstract":"<p>In recent years, deep reinforcement learning has emerged as a technique to solve closed-loop flow control problems. Employing simulation-based environments in reinforcement learning enables a priori end-to-end optimization of the control system, provides a virtual testbed for safety-critical control applications, and allows to gain a deep understanding of the control mechanisms. While reinforcement learning has been applied successfully in a number of rather simple flow control benchmarks, a major bottleneck toward real-world applications is the high computational cost and turnaround time of flow simulations. In this contribution, we demonstrate the benefits of model-based reinforcement learning for flow control applications. Specifically, we optimize the policy by alternating between trajectories sampled from flow simulations and trajectories sampled from an ensemble of environment models. The model-based learning reduces the overall training time by up to <span>\\(85\\%\\)</span> for the fluidic pinball test case. Even larger savings are expected for more demanding flow simulations.</p>","PeriodicalId":695,"journal":{"name":"Meccanica","volume":"23 1","pages":""},"PeriodicalIF":1.9000,"publicationDate":"2024-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Model-based deep reinforcement learning for accelerated learning from flow simulations\",\"authors\":\"Andre Weiner, Janis Geise\",\"doi\":\"10.1007/s11012-024-01808-z\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>In recent years, deep reinforcement learning has emerged as a technique to solve closed-loop flow control problems. Employing simulation-based environments in reinforcement learning enables a priori end-to-end optimization of the control system, provides a virtual testbed for safety-critical control applications, and allows to gain a deep understanding of the control mechanisms. While reinforcement learning has been applied successfully in a number of rather simple flow control benchmarks, a major bottleneck toward real-world applications is the high computational cost and turnaround time of flow simulations. In this contribution, we demonstrate the benefits of model-based reinforcement learning for flow control applications. Specifically, we optimize the policy by alternating between trajectories sampled from flow simulations and trajectories sampled from an ensemble of environment models. The model-based learning reduces the overall training time by up to <span>\\\\(85\\\\%\\\\)</span> for the fluidic pinball test case. Even larger savings are expected for more demanding flow simulations.</p>\",\"PeriodicalId\":695,\"journal\":{\"name\":\"Meccanica\",\"volume\":\"23 1\",\"pages\":\"\"},\"PeriodicalIF\":1.9000,\"publicationDate\":\"2024-05-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Meccanica\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1007/s11012-024-01808-z\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"MECHANICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Meccanica","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s11012-024-01808-z","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MECHANICS","Score":null,"Total":0}
引用次数: 0

摘要

近年来,深度强化学习已成为解决闭环流量控制问题的一种技术。在强化学习中采用基于仿真的环境,可以事先对控制系统进行端到端优化,为安全关键型控制应用提供虚拟试验台,并能深入了解控制机制。虽然强化学习已成功应用于许多相当简单的流量控制基准,但现实世界应用的一个主要瓶颈是流量模拟的高计算成本和周转时间。在本文中,我们展示了基于模型的强化学习在流量控制应用中的优势。具体来说,我们通过交替使用从流量模拟中采样的轨迹和从环境模型集合中采样的轨迹来优化策略。在流体弹球测试案例中,基于模型的学习将整体训练时间减少了(85%)。对于要求更高的流动模拟,预计还能节省更多时间。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Model-based deep reinforcement learning for accelerated learning from flow simulations

In recent years, deep reinforcement learning has emerged as a technique to solve closed-loop flow control problems. Employing simulation-based environments in reinforcement learning enables a priori end-to-end optimization of the control system, provides a virtual testbed for safety-critical control applications, and allows to gain a deep understanding of the control mechanisms. While reinforcement learning has been applied successfully in a number of rather simple flow control benchmarks, a major bottleneck toward real-world applications is the high computational cost and turnaround time of flow simulations. In this contribution, we demonstrate the benefits of model-based reinforcement learning for flow control applications. Specifically, we optimize the policy by alternating between trajectories sampled from flow simulations and trajectories sampled from an ensemble of environment models. The model-based learning reduces the overall training time by up to \(85\%\) for the fluidic pinball test case. Even larger savings are expected for more demanding flow simulations.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Meccanica
Meccanica 物理-力学
CiteScore
4.70
自引率
3.70%
发文量
151
审稿时长
7 months
期刊介绍: Meccanica focuses on the methodological framework shared by mechanical scientists when addressing theoretical or applied problems. Original papers address various aspects of mechanical and mathematical modeling, of solution, as well as of analysis of system behavior. The journal explores fundamental and applications issues in established areas of mechanics research as well as in emerging fields; contemporary research on general mechanics, solid and structural mechanics, fluid mechanics, and mechanics of machines; interdisciplinary fields between mechanics and other mathematical and engineering sciences; interaction of mechanics with dynamical systems, advanced materials, control and computation; electromechanics; biomechanics. Articles include full length papers; topical overviews; brief notes; discussions and comments on published papers; book reviews; and an international calendar of conferences. Meccanica, the official journal of the Italian Association of Theoretical and Applied Mechanics, was established in 1966.
期刊最新文献
Investigation of droplet collision characteristics with moving film and its comparison with stationary film: unsteady and 3D CLSVOF method Compound control method for reliability of the robotic arms with clearance joint Multiscale topology optimization of anisotropic multilayer periodic structures based on the isogeometric analysis method CFD and ray tracing analysis of a discrete nozzle for laser metal deposition Design and performance investigation of a sliding-mode adaptive proportional–integral–derivative control for cable-breakage scenario
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1