Autonomous construction of structures in a dynamic environment using Reinforcement Learning

S. Santos, S. Givigi, C. Nascimento
{"title":"Autonomous construction of structures in a dynamic environment using Reinforcement Learning","authors":"S. Santos, S. Givigi, C. Nascimento","doi":"10.1109/SysCon.2013.6549922","DOIUrl":null,"url":null,"abstract":"This paper presents an adaptive approach based on the Reinforcement Learning (RL) method to manipulate and transport parts and also assemble 3-D structures in a moderately constrained and dynamic environment using a quad-rotor. Nowadays, complex construction tasks using mobile robots are characterized by two fundamental problems such as task planning and motion planning. However, to obtain the task and path planning that define a specific sequence of operations for construction of a given structure is generally very complex. In this context, we propose and investigate a system in which an aerial robot learns the assembly and construction tasks of multiple 3-D structures. This process involves the learning of the sequence of maneuvers of a vehicle, the assembly sequence of the parts and also the correct types of structural elements for each assembly point of the structure. A heuristic search algorithm is used in the learning process to find the optimal path for the quad-rotor so that its navigation through the dynamic environment is performed. The experimental results show that a 3-D structure can be built using the task planning approach derived from a learning algorithm combined with a heuristic search method.","PeriodicalId":218073,"journal":{"name":"2013 IEEE International Systems Conference (SysCon)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 IEEE International Systems Conference (SysCon)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SysCon.2013.6549922","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 13

Abstract

This paper presents an adaptive approach based on the Reinforcement Learning (RL) method to manipulate and transport parts and also assemble 3-D structures in a moderately constrained and dynamic environment using a quad-rotor. Nowadays, complex construction tasks using mobile robots are characterized by two fundamental problems such as task planning and motion planning. However, to obtain the task and path planning that define a specific sequence of operations for construction of a given structure is generally very complex. In this context, we propose and investigate a system in which an aerial robot learns the assembly and construction tasks of multiple 3-D structures. This process involves the learning of the sequence of maneuvers of a vehicle, the assembly sequence of the parts and also the correct types of structural elements for each assembly point of the structure. A heuristic search algorithm is used in the learning process to find the optimal path for the quad-rotor so that its navigation through the dynamic environment is performed. The experimental results show that a 3-D structure can be built using the task planning approach derived from a learning algorithm combined with a heuristic search method.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用强化学习在动态环境中自主构建结构
本文提出了一种基于强化学习(RL)方法的自适应方法,用于在中等约束和动态环境中使用四旋翼操纵和运输零件以及组装三维结构。目前,移动机器人的复杂施工任务有两个基本问题:任务规划和运动规划。然而,要获得任务和路径规划,为给定结构的构建定义特定的操作序列通常是非常复杂的。在此背景下,我们提出并研究了一个空中机器人学习多个三维结构的组装和建造任务的系统。这个过程包括学习车辆的动作顺序,零件的装配顺序以及结构的每个装配点的结构元素的正确类型。在学习过程中,采用启发式搜索算法为四旋翼飞行器寻找最优路径,实现四旋翼飞行器在动态环境中的导航。实验结果表明,采用启发式搜索和学习算法相结合的任务规划方法可以构建三维结构。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Methodology for designing highly reliable Fault Tolerance Space Systems based on COTS devices Quantitative metrics for improving software performance for an integrated tool platform A method for analyzing architectural drivers when engineering a system architecture Intelligent and defensive framework for decision-making systems-of-systems with applications to healthcare Improving decision-making and management by thinking about the enterprise through multiple dimensions
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1