{"title":"A modular reinforcement-based neural controller for a three-link manipulator","authors":"P. Martín, J. Millán","doi":"10.1109/IROS.1997.655100","DOIUrl":null,"url":null,"abstract":"This paper presents a modular neural controller that learns goal-oriented obstacle-avoiding motion strategies for a sensor-based three-link planar robot arm. It acquires these strategies through reinforcement learning from local sensory data. The controller has two reinforcement-based modules: a module for negotiating obstacles and a module for moving to the goal. Both modules generate actions that are interpreted with regard to a goal vector in the robot joint space. A differential inverse kinematics (DIV) module is used to obtain such a goal vector. The DIV module is based on the inversion of a neural network that has been previously trained to approximate the manipulator forward kinematics in polar coordinates. The controller achieves a satisfactory performance quite rapidly and shows good generalization capabilities in the face of new environments.","PeriodicalId":408848,"journal":{"name":"Proceedings of the 1997 IEEE/RSJ International Conference on Intelligent Robot and Systems. Innovative Robotics for Real-World Applications. IROS '97","volume":"49 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1997-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 1997 IEEE/RSJ International Conference on Intelligent Robot and Systems. Innovative Robotics for Real-World Applications. IROS '97","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IROS.1997.655100","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
This paper presents a modular neural controller that learns goal-oriented obstacle-avoiding motion strategies for a sensor-based three-link planar robot arm. It acquires these strategies through reinforcement learning from local sensory data. The controller has two reinforcement-based modules: a module for negotiating obstacles and a module for moving to the goal. Both modules generate actions that are interpreted with regard to a goal vector in the robot joint space. A differential inverse kinematics (DIV) module is used to obtain such a goal vector. The DIV module is based on the inversion of a neural network that has been previously trained to approximate the manipulator forward kinematics in polar coordinates. The controller achieves a satisfactory performance quite rapidly and shows good generalization capabilities in the face of new environments.