S. Manko, S. Diane, Aleksey E. Krivoshatskiy, I. D. Margolin, Evgeniya A. Slepynina
{"title":"基于强化学习的大型物体运输多机器人系统自适应控制","authors":"S. Manko, S. Diane, Aleksey E. Krivoshatskiy, I. D. Margolin, Evgeniya A. Slepynina","doi":"10.1109/EICONRUS.2018.8317240","DOIUrl":null,"url":null,"abstract":"This paper describes models and algorithms for intelligent control of a group of autonomous mobile robots, which perform large-sized object transportation in a complex environment. The proposed models allow the multi-robot system to reach its target position while avoiding obstacles and maintaining object orientation with coordinated motion of several robots. We use neural based Q-learning to provide robots adaptability to unknown environments. The inputs of the learning subsystem are 2d-map data collected during system operation and target misalignments of multi-robot system. The primary output is a control decision with a maximum value of estimated efficiency. Experimental results presented in the paper fully confirm the reliability of the proposed approach.","PeriodicalId":6562,"journal":{"name":"2018 IEEE Conference of Russian Young Researchers in Electrical and Electronic Engineering (EIConRus)","volume":"15 1","pages":"923-927"},"PeriodicalIF":0.0000,"publicationDate":"2018-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":"{\"title\":\"Adaptive control of a multi-robot system for transportation of large-sized objects based on reinforcement learning\",\"authors\":\"S. Manko, S. Diane, Aleksey E. Krivoshatskiy, I. D. Margolin, Evgeniya A. Slepynina\",\"doi\":\"10.1109/EICONRUS.2018.8317240\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper describes models and algorithms for intelligent control of a group of autonomous mobile robots, which perform large-sized object transportation in a complex environment. The proposed models allow the multi-robot system to reach its target position while avoiding obstacles and maintaining object orientation with coordinated motion of several robots. We use neural based Q-learning to provide robots adaptability to unknown environments. The inputs of the learning subsystem are 2d-map data collected during system operation and target misalignments of multi-robot system. The primary output is a control decision with a maximum value of estimated efficiency. Experimental results presented in the paper fully confirm the reliability of the proposed approach.\",\"PeriodicalId\":6562,\"journal\":{\"name\":\"2018 IEEE Conference of Russian Young Researchers in Electrical and Electronic Engineering (EIConRus)\",\"volume\":\"15 1\",\"pages\":\"923-927\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"12\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 IEEE Conference of Russian Young Researchers in Electrical and Electronic Engineering (EIConRus)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/EICONRUS.2018.8317240\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE Conference of Russian Young Researchers in Electrical and Electronic Engineering (EIConRus)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EICONRUS.2018.8317240","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Adaptive control of a multi-robot system for transportation of large-sized objects based on reinforcement learning
This paper describes models and algorithms for intelligent control of a group of autonomous mobile robots, which perform large-sized object transportation in a complex environment. The proposed models allow the multi-robot system to reach its target position while avoiding obstacles and maintaining object orientation with coordinated motion of several robots. We use neural based Q-learning to provide robots adaptability to unknown environments. The inputs of the learning subsystem are 2d-map data collected during system operation and target misalignments of multi-robot system. The primary output is a control decision with a maximum value of estimated efficiency. Experimental results presented in the paper fully confirm the reliability of the proposed approach.