{"title":"利用时间差学习实现海浪能量转换器中提取能量的在线最大化","authors":"Sadegh Khaleghi, R. Moghaddam, M. Sistani","doi":"10.1177/14750902221149468","DOIUrl":null,"url":null,"abstract":"This article presents temporal-difference (TD) learning which is a combination of Monte Carlo and dynamic programing (DP) as a Method for controlling single-body wave energy converters (WECs). Since TD methods are designed to solve the prediction problems, we use this feature to maximize the energy captured from the sea waves. The entered force to the buoy system is addressed implicitly in the state matrix to design the problem into a TD framework. In order to enhance the captured power by the WEC, the control method is built to have an online active control. This will help the device to predict the best controller based on its previous experiences in the same situations. Two methods of TD, Q-Learning and SARSA, are used and the features are analyzed and several testing functions are carried out in simulation part. To perform on-line optimal control, a force control has acted as a controller and TD coefficients are tuned at a proper rate significantly after specific number of episodes. The power of suggested TD methods is compared to PGM, IPOPT and with other learning control strategies. Several computer simulations were carried out to evaluate the controller effectiveness by applying different sea-states and analyzing the resultant WEC dynamics.","PeriodicalId":20667,"journal":{"name":"Proceedings of the Institution of Mechanical Engineers, Part M: Journal of Engineering for the Maritime Environment","volume":"33 1","pages":"565 - 578"},"PeriodicalIF":1.5000,"publicationDate":"2023-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Online maximization of extracted energy in sea wave energy converters using temporal-difference learning\",\"authors\":\"Sadegh Khaleghi, R. Moghaddam, M. Sistani\",\"doi\":\"10.1177/14750902221149468\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This article presents temporal-difference (TD) learning which is a combination of Monte Carlo and dynamic programing (DP) as a Method for controlling single-body wave energy converters (WECs). Since TD methods are designed to solve the prediction problems, we use this feature to maximize the energy captured from the sea waves. The entered force to the buoy system is addressed implicitly in the state matrix to design the problem into a TD framework. In order to enhance the captured power by the WEC, the control method is built to have an online active control. This will help the device to predict the best controller based on its previous experiences in the same situations. Two methods of TD, Q-Learning and SARSA, are used and the features are analyzed and several testing functions are carried out in simulation part. To perform on-line optimal control, a force control has acted as a controller and TD coefficients are tuned at a proper rate significantly after specific number of episodes. The power of suggested TD methods is compared to PGM, IPOPT and with other learning control strategies. Several computer simulations were carried out to evaluate the controller effectiveness by applying different sea-states and analyzing the resultant WEC dynamics.\",\"PeriodicalId\":20667,\"journal\":{\"name\":\"Proceedings of the Institution of Mechanical Engineers, Part M: Journal of Engineering for the Maritime Environment\",\"volume\":\"33 1\",\"pages\":\"565 - 578\"},\"PeriodicalIF\":1.5000,\"publicationDate\":\"2023-01-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the Institution of Mechanical Engineers, Part M: Journal of Engineering for the Maritime Environment\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1177/14750902221149468\",\"RegionNum\":4,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENGINEERING, MARINE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Institution of Mechanical Engineers, Part M: Journal of Engineering for the Maritime Environment","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1177/14750902221149468","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, MARINE","Score":null,"Total":0}
Online maximization of extracted energy in sea wave energy converters using temporal-difference learning
This article presents temporal-difference (TD) learning which is a combination of Monte Carlo and dynamic programing (DP) as a Method for controlling single-body wave energy converters (WECs). Since TD methods are designed to solve the prediction problems, we use this feature to maximize the energy captured from the sea waves. The entered force to the buoy system is addressed implicitly in the state matrix to design the problem into a TD framework. In order to enhance the captured power by the WEC, the control method is built to have an online active control. This will help the device to predict the best controller based on its previous experiences in the same situations. Two methods of TD, Q-Learning and SARSA, are used and the features are analyzed and several testing functions are carried out in simulation part. To perform on-line optimal control, a force control has acted as a controller and TD coefficients are tuned at a proper rate significantly after specific number of episodes. The power of suggested TD methods is compared to PGM, IPOPT and with other learning control strategies. Several computer simulations were carried out to evaluate the controller effectiveness by applying different sea-states and analyzing the resultant WEC dynamics.
期刊介绍:
The Journal of Engineering for the Maritime Environment is concerned with the design, production and operation of engineering artefacts for the maritime environment. The journal straddles the traditional boundaries of naval architecture, marine engineering, offshore/ocean engineering, coastal engineering and port engineering.