{"title":"考虑严重干扰下推进器失效的自主水下航行器动力学控制算法","authors":"H. Kawano, T. Ura","doi":"10.1109/IROS.2001.976295","DOIUrl":null,"url":null,"abstract":"A training algorithm for dynamics control of nonholonomic AUV (autonomous underwater vehicle) is proposed in this paper which can recover from thruster failure during cruising mission. It is based on Q-learning and teaching method. The back up data that represents dynamics model expressed in the form of Bayesian net can be used effectively in this case. In order to overcome difficulties due to, making discrete expression of continuous state space of AUV, the algorithm uses multiresolution Q-value tables which is combined in the form of subsumption architecture. Simulation results show high performance of the proposed algorithm for a vertical ascent mission in a severe current condition. It is shown that AUV users can conveniently and quickly train the control algorithm of the AUV by using simulation of dynamics of the vehicle.","PeriodicalId":319679,"journal":{"name":"Proceedings 2001 IEEE/RSJ International Conference on Intelligent Robots and Systems. Expanding the Societal Role of Robotics in the the Next Millennium (Cat. No.01CH37180)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2001-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Dynamics control algorithm of autonomous underwater vehicle by reinforcement learning and teaching method considering thruster failure under severe disturbance\",\"authors\":\"H. Kawano, T. Ura\",\"doi\":\"10.1109/IROS.2001.976295\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A training algorithm for dynamics control of nonholonomic AUV (autonomous underwater vehicle) is proposed in this paper which can recover from thruster failure during cruising mission. It is based on Q-learning and teaching method. The back up data that represents dynamics model expressed in the form of Bayesian net can be used effectively in this case. In order to overcome difficulties due to, making discrete expression of continuous state space of AUV, the algorithm uses multiresolution Q-value tables which is combined in the form of subsumption architecture. Simulation results show high performance of the proposed algorithm for a vertical ascent mission in a severe current condition. It is shown that AUV users can conveniently and quickly train the control algorithm of the AUV by using simulation of dynamics of the vehicle.\",\"PeriodicalId\":319679,\"journal\":{\"name\":\"Proceedings 2001 IEEE/RSJ International Conference on Intelligent Robots and Systems. Expanding the Societal Role of Robotics in the the Next Millennium (Cat. No.01CH37180)\",\"volume\":\"26 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2001-10-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings 2001 IEEE/RSJ International Conference on Intelligent Robots and Systems. Expanding the Societal Role of Robotics in the the Next Millennium (Cat. No.01CH37180)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IROS.2001.976295\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings 2001 IEEE/RSJ International Conference on Intelligent Robots and Systems. Expanding the Societal Role of Robotics in the the Next Millennium (Cat. No.01CH37180)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IROS.2001.976295","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Dynamics control algorithm of autonomous underwater vehicle by reinforcement learning and teaching method considering thruster failure under severe disturbance
A training algorithm for dynamics control of nonholonomic AUV (autonomous underwater vehicle) is proposed in this paper which can recover from thruster failure during cruising mission. It is based on Q-learning and teaching method. The back up data that represents dynamics model expressed in the form of Bayesian net can be used effectively in this case. In order to overcome difficulties due to, making discrete expression of continuous state space of AUV, the algorithm uses multiresolution Q-value tables which is combined in the form of subsumption architecture. Simulation results show high performance of the proposed algorithm for a vertical ascent mission in a severe current condition. It is shown that AUV users can conveniently and quickly train the control algorithm of the AUV by using simulation of dynamics of the vehicle.