{"title":"基于强化学习策略的仿生极化组合导航系统自适应姿态确定","authors":"HuiYi Bao, Tao Du, Luyue Sun","doi":"10.3934/mfc.2022014","DOIUrl":null,"url":null,"abstract":"The bionic polarization integrated navigation system includes three-axis gyroscopes, three-axis accelerometers, three-axis magnetometers, and polarization sensors, which provide pitch, roll, and yaw. When the magnetometers are interfered or the polarization sensors are obscured, the accuracy of attitude will be decreased due to abnormal measurement. To improve the accuracy of attitude of the integrated navigation system under these complex environments, an adaptive complementary filter based on DQN (Deep Q-learning Network) is proposed. The complementary filter is first designed to fuse the measurements from the gyroscopes, accelerometers, magnetometers, and polarization sensors. Then, a reward function of the bionic polarization integrated navigation system is defined as the function of the absolute value of the attitude angle error. The action-value function is introduced by a fully-connected network obtained by historical sensor data training. The strategy can be calculated by the deep Q-learning network and the action that optimal action-value function is obtained. Based on the optimized action, three types of integration are switched automatically to adapt to the different environments. Three cases of simulations are conducted to validate the effectiveness of the proposed algorithm. The results show that the adaptive attitude determination of bionic polarization integrated navigation system based on DQN can improve the accuracy of the attitude estimation.","PeriodicalId":93334,"journal":{"name":"Mathematical foundations of computing","volume":"96 1","pages":"161-177"},"PeriodicalIF":1.3000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Adaptive attitude determination of bionic polarization integrated navigation system based on reinforcement learning strategy\",\"authors\":\"HuiYi Bao, Tao Du, Luyue Sun\",\"doi\":\"10.3934/mfc.2022014\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The bionic polarization integrated navigation system includes three-axis gyroscopes, three-axis accelerometers, three-axis magnetometers, and polarization sensors, which provide pitch, roll, and yaw. When the magnetometers are interfered or the polarization sensors are obscured, the accuracy of attitude will be decreased due to abnormal measurement. To improve the accuracy of attitude of the integrated navigation system under these complex environments, an adaptive complementary filter based on DQN (Deep Q-learning Network) is proposed. The complementary filter is first designed to fuse the measurements from the gyroscopes, accelerometers, magnetometers, and polarization sensors. Then, a reward function of the bionic polarization integrated navigation system is defined as the function of the absolute value of the attitude angle error. The action-value function is introduced by a fully-connected network obtained by historical sensor data training. The strategy can be calculated by the deep Q-learning network and the action that optimal action-value function is obtained. Based on the optimized action, three types of integration are switched automatically to adapt to the different environments. Three cases of simulations are conducted to validate the effectiveness of the proposed algorithm. The results show that the adaptive attitude determination of bionic polarization integrated navigation system based on DQN can improve the accuracy of the attitude estimation.\",\"PeriodicalId\":93334,\"journal\":{\"name\":\"Mathematical foundations of computing\",\"volume\":\"96 1\",\"pages\":\"161-177\"},\"PeriodicalIF\":1.3000,\"publicationDate\":\"2023-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Mathematical foundations of computing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3934/mfc.2022014\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, THEORY & METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mathematical foundations of computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3934/mfc.2022014","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
Adaptive attitude determination of bionic polarization integrated navigation system based on reinforcement learning strategy
The bionic polarization integrated navigation system includes three-axis gyroscopes, three-axis accelerometers, three-axis magnetometers, and polarization sensors, which provide pitch, roll, and yaw. When the magnetometers are interfered or the polarization sensors are obscured, the accuracy of attitude will be decreased due to abnormal measurement. To improve the accuracy of attitude of the integrated navigation system under these complex environments, an adaptive complementary filter based on DQN (Deep Q-learning Network) is proposed. The complementary filter is first designed to fuse the measurements from the gyroscopes, accelerometers, magnetometers, and polarization sensors. Then, a reward function of the bionic polarization integrated navigation system is defined as the function of the absolute value of the attitude angle error. The action-value function is introduced by a fully-connected network obtained by historical sensor data training. The strategy can be calculated by the deep Q-learning network and the action that optimal action-value function is obtained. Based on the optimized action, three types of integration are switched automatically to adapt to the different environments. Three cases of simulations are conducted to validate the effectiveness of the proposed algorithm. The results show that the adaptive attitude determination of bionic polarization integrated navigation system based on DQN can improve the accuracy of the attitude estimation.