基于 RL 的智能基地隔离系统控制(使用 Unity ML-Agents

IF 1.1 4区 工程技术 Q3 CONSTRUCTION & BUILDING TECHNOLOGY International Journal of Steel Structures Pub Date : 2024-07-19 DOI:10.1007/s13296-024-00862-3
Hyun-Su Kim, Joo-Won Kang
{"title":"基于 RL 的智能基地隔离系统控制(使用 Unity ML-Agents","authors":"Hyun-Su Kim,&nbsp;Joo-Won Kang","doi":"10.1007/s13296-024-00862-3","DOIUrl":null,"url":null,"abstract":"<div><p>Reinforcement learning (RL) has been used in the development of various control systems presenting desirable control performances. There have been many studies examining the development of structural control algorithms using conventional methods and soft computing algorithms. However, research investigating RL-based structural control techniques in particular is still in an early stage. In RL algorithms, the agent interacts with the environment by taking the appropriate action under the specific state. In the RL-based structural control problem, the environment usually includes the structure, control system, external loads, etc., and it is generally presented by the finite element model. In the present study, the Unity game engine—which has recently come to be used in various engineering simulations because of its accurate physics calculations—was used to construct a reinforcement learning environment for structural control systems. A smart base isolation system (SBIS) that was composed of a magnetorheological damper and four friction pendulum systems was used as an example structural control system, and it was modeled using the Unity physics engine for RL environment. Among various RL algorithms, a Deep Q-Network (DQN) was used to make the control algorithm for the SBIS. The command voltage for the smart base isolation was mapped into the agent’s action. The reward of the DQN algorithm was designed to be a higher value when the agent takes a better action resulting in reduced seismic responses. Three artificial ground motions were used to train the DQN-based control algorithm, and another artificial earthquake was used to investigate the control efficiency of the trained DQN-based control algorithm. The passive-on case with the maximum damper force was used for comparative study. This study shows that the DQN-based algorithm can successfully control the SBIS. The findings show that the unity game engine can accurately present the dynamic responses of the SBIS, showing that it can be effectively used for the construction of a RL environment for structural dynamic systems.</p></div>","PeriodicalId":596,"journal":{"name":"International Journal of Steel Structures","volume":"24 4","pages":"908 - 917"},"PeriodicalIF":1.1000,"publicationDate":"2024-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"RL-based Control of Smart Base Isolation System Using Unity ML-Agents\",\"authors\":\"Hyun-Su Kim,&nbsp;Joo-Won Kang\",\"doi\":\"10.1007/s13296-024-00862-3\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Reinforcement learning (RL) has been used in the development of various control systems presenting desirable control performances. There have been many studies examining the development of structural control algorithms using conventional methods and soft computing algorithms. However, research investigating RL-based structural control techniques in particular is still in an early stage. In RL algorithms, the agent interacts with the environment by taking the appropriate action under the specific state. In the RL-based structural control problem, the environment usually includes the structure, control system, external loads, etc., and it is generally presented by the finite element model. In the present study, the Unity game engine—which has recently come to be used in various engineering simulations because of its accurate physics calculations—was used to construct a reinforcement learning environment for structural control systems. A smart base isolation system (SBIS) that was composed of a magnetorheological damper and four friction pendulum systems was used as an example structural control system, and it was modeled using the Unity physics engine for RL environment. Among various RL algorithms, a Deep Q-Network (DQN) was used to make the control algorithm for the SBIS. The command voltage for the smart base isolation was mapped into the agent’s action. The reward of the DQN algorithm was designed to be a higher value when the agent takes a better action resulting in reduced seismic responses. Three artificial ground motions were used to train the DQN-based control algorithm, and another artificial earthquake was used to investigate the control efficiency of the trained DQN-based control algorithm. The passive-on case with the maximum damper force was used for comparative study. This study shows that the DQN-based algorithm can successfully control the SBIS. The findings show that the unity game engine can accurately present the dynamic responses of the SBIS, showing that it can be effectively used for the construction of a RL environment for structural dynamic systems.</p></div>\",\"PeriodicalId\":596,\"journal\":{\"name\":\"International Journal of Steel Structures\",\"volume\":\"24 4\",\"pages\":\"908 - 917\"},\"PeriodicalIF\":1.1000,\"publicationDate\":\"2024-07-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Steel Structures\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s13296-024-00862-3\",\"RegionNum\":4,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"CONSTRUCTION & BUILDING TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Steel Structures","FirstCategoryId":"5","ListUrlMain":"https://link.springer.com/article/10.1007/s13296-024-00862-3","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"CONSTRUCTION & BUILDING TECHNOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

强化学习(RL)已被用于开发各种具有理想控制性能的控制系统。许多研究都在探讨如何利用传统方法和软计算算法开发结构控制算法。然而,对基于 RL 的结构控制技术的研究仍处于早期阶段。在 RL 算法中,代理通过在特定状态下采取适当的行动与环境进行交互。在基于 RL 的结构控制问题中,环境通常包括结构、控制系统、外部载荷等,一般由有限元模型呈现。在本研究中,Unity 游戏引擎因其精确的物理计算而被广泛应用于各种工程模拟中,本研究利用该引擎构建了结构控制系统的强化学习环境。我们以一个由磁流变阻尼器和四个摩擦摆系统组成的智能基座隔离系统(SBIS)为例,使用 Unity 物理引擎为 RL 环境建模。在各种 RL 算法中,SBIS 的控制算法采用了深度 Q 网络(DQN)。智能基座隔离的指令电压被映射为代理的动作。DQN 算法的奖励设计为,当代理采取更好的行动时,奖励值越高,地震反应越小。利用三次人工地面运动来训练基于 DQN 的控制算法,并利用另一次人工地震来研究训练后的基于 DQN 的控制算法的控制效率。比较研究采用了具有最大阻尼力的被动开启情况。这项研究表明,基于 DQN 的算法可以成功控制 SBIS。研究结果表明,unity 游戏引擎可以准确地呈现 SBIS 的动态响应,表明它可以有效地用于构建结构动态系统的 RL 环境。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
RL-based Control of Smart Base Isolation System Using Unity ML-Agents

Reinforcement learning (RL) has been used in the development of various control systems presenting desirable control performances. There have been many studies examining the development of structural control algorithms using conventional methods and soft computing algorithms. However, research investigating RL-based structural control techniques in particular is still in an early stage. In RL algorithms, the agent interacts with the environment by taking the appropriate action under the specific state. In the RL-based structural control problem, the environment usually includes the structure, control system, external loads, etc., and it is generally presented by the finite element model. In the present study, the Unity game engine—which has recently come to be used in various engineering simulations because of its accurate physics calculations—was used to construct a reinforcement learning environment for structural control systems. A smart base isolation system (SBIS) that was composed of a magnetorheological damper and four friction pendulum systems was used as an example structural control system, and it was modeled using the Unity physics engine for RL environment. Among various RL algorithms, a Deep Q-Network (DQN) was used to make the control algorithm for the SBIS. The command voltage for the smart base isolation was mapped into the agent’s action. The reward of the DQN algorithm was designed to be a higher value when the agent takes a better action resulting in reduced seismic responses. Three artificial ground motions were used to train the DQN-based control algorithm, and another artificial earthquake was used to investigate the control efficiency of the trained DQN-based control algorithm. The passive-on case with the maximum damper force was used for comparative study. This study shows that the DQN-based algorithm can successfully control the SBIS. The findings show that the unity game engine can accurately present the dynamic responses of the SBIS, showing that it can be effectively used for the construction of a RL environment for structural dynamic systems.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
International Journal of Steel Structures
International Journal of Steel Structures 工程技术-工程:土木
CiteScore
2.70
自引率
13.30%
发文量
122
审稿时长
12 months
期刊介绍: The International Journal of Steel Structures provides an international forum for a broad classification of technical papers in steel structural research and its applications. The journal aims to reach not only researchers, but also practicing engineers. Coverage encompasses such topics as stability, fatigue, non-linear behavior, dynamics, reliability, fire, design codes, computer-aided analysis and design, optimization, expert systems, connections, fabrications, maintenance, bridges, off-shore structures, jetties, stadiums, transmission towers, marine vessels, storage tanks, pressure vessels, aerospace, and pipelines and more.
期刊最新文献
Numerical Investigation and Design of Cold-Formed Steel Channel and Z-Sections Undergoing Local and Global Interactive Buckling Stochastic Robustness of Cable Dome Structures Under Impact Loads Fire Behaviour of Rectangular Steel Tubed-Reinforced-Concrete Columns with End Restraints Finite Element Modeling for Concrete-Filled Steel Tube Stub Columns Under Axial Compression Experimental and Analytical Study on Fire Resistance Performance of Mid-High Rise Modular Rectangular Steel Tube Columns Using a 3 h Fireproof Cladding Method
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1