{"title":"Learning to Improve Performance During Non-Repetitive Tasks Performed by Robots","authors":"Yeo Jung Yoon, Satyandra K. Gupta","doi":"10.1115/detc2021-67627","DOIUrl":null,"url":null,"abstract":"\n To execute non-repetitive tasks, robots need to learn on the tasks to improve task performance. The performance model cannot be built in advance for such non-repetitive tasks. The robot can execute a small portion of the task with certain process parameters and attractively update the process parameters based on the observations of the task performance. To make the learning process efficient, the trade-off between exploration and exploitation should be explicitly considered. Too much exploration may lead to the waste of time without significant improvement on task performance. On the other hand, stopping exploration prematurely may lead to suboptimal task performance. This paper describes a sequential decision making approach to select the set of parameters to improve task performance. The overall learning approach uses feasibility biased sampling, surrogate model construction and greedy optimization. We implement our approach in the simulation of robotic sanding. We also compare our method with other design of experiments methods.","PeriodicalId":23602,"journal":{"name":"Volume 2: 41st Computers and Information in Engineering Conference (CIE)","volume":"34 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-08-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Volume 2: 41st Computers and Information in Engineering Conference (CIE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1115/detc2021-67627","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
To execute non-repetitive tasks, robots need to learn on the tasks to improve task performance. The performance model cannot be built in advance for such non-repetitive tasks. The robot can execute a small portion of the task with certain process parameters and attractively update the process parameters based on the observations of the task performance. To make the learning process efficient, the trade-off between exploration and exploitation should be explicitly considered. Too much exploration may lead to the waste of time without significant improvement on task performance. On the other hand, stopping exploration prematurely may lead to suboptimal task performance. This paper describes a sequential decision making approach to select the set of parameters to improve task performance. The overall learning approach uses feasibility biased sampling, surrogate model construction and greedy optimization. We implement our approach in the simulation of robotic sanding. We also compare our method with other design of experiments methods.