Takayuki Murooka, Masashi Hamaya, Felix von Drigalski, Kazutoshi Tanaka, Yoshihisa Ijiri
{"title":"具有前向动力学模型的迭代反向传播扰动观测器","authors":"Takayuki Murooka, Masashi Hamaya, Felix von Drigalski, Kazutoshi Tanaka, Yoshihisa Ijiri","doi":"10.1109/CASE49439.2021.9551455","DOIUrl":null,"url":null,"abstract":"Disturbance Observer (DOB) has been widely used for robotic applications to eliminate various kinds of disturbances. Recently, learning-based DOB has attracted significant attention as it can deal with complex robotic systems. In this study, we propose the Iterative Backpropagation Disturbance Observer (IB-DOB) method. IB-DOB learns the forward model with a neural network, and calculates disturbances via iterative backpropagations, which behaves like the inverse model. Our method can not only improve estimation performances owing to the iterative calculation but also be applied to both model-free and -based learning control. We conducted experiments for two manipulation tasks: the cart pole with Deep Deterministic Policy Gradient (DDPG) and the pushing object task with Deep Model Predictive Control (DeepMPC). Our method demonstrated better task performances than the baselines without DOB and with DOB using a learned inverse model even though disturbances of external forces and model errors were provided.","PeriodicalId":232083,"journal":{"name":"2021 IEEE 17th International Conference on Automation Science and Engineering (CASE)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Iterative Backpropagation Disturbance Observer with Forward Dynamics Model\",\"authors\":\"Takayuki Murooka, Masashi Hamaya, Felix von Drigalski, Kazutoshi Tanaka, Yoshihisa Ijiri\",\"doi\":\"10.1109/CASE49439.2021.9551455\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Disturbance Observer (DOB) has been widely used for robotic applications to eliminate various kinds of disturbances. Recently, learning-based DOB has attracted significant attention as it can deal with complex robotic systems. In this study, we propose the Iterative Backpropagation Disturbance Observer (IB-DOB) method. IB-DOB learns the forward model with a neural network, and calculates disturbances via iterative backpropagations, which behaves like the inverse model. Our method can not only improve estimation performances owing to the iterative calculation but also be applied to both model-free and -based learning control. We conducted experiments for two manipulation tasks: the cart pole with Deep Deterministic Policy Gradient (DDPG) and the pushing object task with Deep Model Predictive Control (DeepMPC). Our method demonstrated better task performances than the baselines without DOB and with DOB using a learned inverse model even though disturbances of external forces and model errors were provided.\",\"PeriodicalId\":232083,\"journal\":{\"name\":\"2021 IEEE 17th International Conference on Automation Science and Engineering (CASE)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-08-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE 17th International Conference on Automation Science and Engineering (CASE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CASE49439.2021.9551455\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 17th International Conference on Automation Science and Engineering (CASE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CASE49439.2021.9551455","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Iterative Backpropagation Disturbance Observer with Forward Dynamics Model
Disturbance Observer (DOB) has been widely used for robotic applications to eliminate various kinds of disturbances. Recently, learning-based DOB has attracted significant attention as it can deal with complex robotic systems. In this study, we propose the Iterative Backpropagation Disturbance Observer (IB-DOB) method. IB-DOB learns the forward model with a neural network, and calculates disturbances via iterative backpropagations, which behaves like the inverse model. Our method can not only improve estimation performances owing to the iterative calculation but also be applied to both model-free and -based learning control. We conducted experiments for two manipulation tasks: the cart pole with Deep Deterministic Policy Gradient (DDPG) and the pushing object task with Deep Model Predictive Control (DeepMPC). Our method demonstrated better task performances than the baselines without DOB and with DOB using a learned inverse model even though disturbances of external forces and model errors were provided.