{"title":"Variable Impedance Control for Force Tracking Based on PILCO in Uncertain Environment","authors":"Zicheng Dong, Hui Shao, H. Huang","doi":"10.1109/ICMA57826.2023.10216082","DOIUrl":null,"url":null,"abstract":"Traditional impedance control is a simple and valid way for robot force tracking, but the uncertainty of the contact environment can seriously interfere with tracking accuracy. In this paper, we present a novel reinforcement learning variable impedance scheme based on PILCO algorithm, which trains a RBF policy network that dynamically adjusts the damping coefficient to compensate for environment uncertainty. Considering the randomness of environment and learning efficiency, a contact state transition model is established by Gaussian process regression, which can be used for state prediction and policy evaluation. The policy is then updated by a gradient-based approach. The simulation study indicates that our robot only takes 18 interactions with an unknown environment to learn an optimal variable impedance policy, which can be applied to various unknown contact environments and has better control accuracy than traditional methods.","PeriodicalId":151364,"journal":{"name":"2023 IEEE International Conference on Mechatronics and Automation (ICMA)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE International Conference on Mechatronics and Automation (ICMA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMA57826.2023.10216082","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Traditional impedance control is a simple and valid way for robot force tracking, but the uncertainty of the contact environment can seriously interfere with tracking accuracy. In this paper, we present a novel reinforcement learning variable impedance scheme based on PILCO algorithm, which trains a RBF policy network that dynamically adjusts the damping coefficient to compensate for environment uncertainty. Considering the randomness of environment and learning efficiency, a contact state transition model is established by Gaussian process regression, which can be used for state prediction and policy evaluation. The policy is then updated by a gradient-based approach. The simulation study indicates that our robot only takes 18 interactions with an unknown environment to learn an optimal variable impedance policy, which can be applied to various unknown contact environments and has better control accuracy than traditional methods.