Self-Reflective Learning Strategy for Persistent Autonomy of Aerial Manipulators

IF 1 Q4 AUTOMATION & CONTROL SYSTEMS Mechatronic Systems and Control Pub Date : 2019-11-26 DOI:10.1115/dscc2019-9086

Xu Zhou, Jiucai Zhang, Xiaoli Zhang

{"title":"Self-Reflective Learning Strategy for Persistent Autonomy of Aerial Manipulators","authors":"Xu Zhou, Jiucai Zhang, Xiaoli Zhang","doi":"10.1115/dscc2019-9086","DOIUrl":null,"url":null,"abstract":"\n Autonomous aerial manipulators have great potentials to assist humans or even fully automate manual labor-intensive tasks such as aerial cleaning, aerial transportation, infrastructure repair, and agricultural inspection and sampling. Reinforcement learning holds the promise of enabling persistent autonomy of aerial manipulators because it can adapt to different situations by automatically learning optimal policies from the interactions between the aerial manipulator and environments. However, the learning process itself could experience failures that can practically endanger the safety of aerial manipulators and hence hinder persistent autonomy. In order to solve this problem, we propose for the aerial manipulator a self-reflective learning strategy that can smartly and safely finding optimal policies for different new situations. This self-reflective manner consists of three steps: identifying the appearance of new situations, re-seeking the optimal policy with reinforcement learning, and evaluating the termination of self-reflection. Numerical simulations demonstrate, compared with conventional learning-based autonomy, our strategy can significantly reduce failures while still can finish the given task.","PeriodicalId":41412,"journal":{"name":"Mechatronic Systems and Control","volume":"21 1","pages":""},"PeriodicalIF":1.0000,"publicationDate":"2019-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mechatronic Systems and Control","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1115/dscc2019-9086","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Autonomous aerial manipulators have great potentials to assist humans or even fully automate manual labor-intensive tasks such as aerial cleaning, aerial transportation, infrastructure repair, and agricultural inspection and sampling. Reinforcement learning holds the promise of enabling persistent autonomy of aerial manipulators because it can adapt to different situations by automatically learning optimal policies from the interactions between the aerial manipulator and environments. However, the learning process itself could experience failures that can practically endanger the safety of aerial manipulators and hence hinder persistent autonomy. In order to solve this problem, we propose for the aerial manipulator a self-reflective learning strategy that can smartly and safely finding optimal policies for different new situations. This self-reflective manner consists of three steps: identifying the appearance of new situations, re-seeking the optimal policy with reinforcement learning, and evaluating the termination of self-reflection. Numerical simulations demonstrate, compared with conventional learning-based autonomy, our strategy can significantly reduce failures while still can finish the given task.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

空中机械臂持续自主的自反思学习策略

自主空中机械手具有很大的潜力，可以辅助人类，甚至完全自动化人工劳动密集型任务，如空中清洁、空中运输、基础设施维修、农业检验和抽样。强化学习有望实现空中机械臂的持久自治，因为它可以通过从空中机械臂与环境之间的相互作用中自动学习最佳策略来适应不同的情况。然而，学习过程本身可能会出现故障，这实际上会危及空中操纵器的安全，从而阻碍持续的自主性。为了解决这一问题，我们提出了一种能够针对不同新情况智能安全地寻找最优策略的空中机械臂自反射学习策略。这种自我反思方式包括三个步骤:识别新情况的出现，通过强化学习重新寻求最优策略，以及评估自我反思的终止。数值模拟表明，与传统的基于学习的自主学习相比，我们的策略在完成给定任务的同时显著减少了失败。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Mechatronic Systems and Control AUTOMATION & CONTROL SYSTEMS-

CiteScore

1.40

自引率

66.70%

发文量

期刊介绍： This international journal publishes both theoretical and application-oriented papers on various aspects of mechatronic systems, modelling, design, conventional and intelligent control, and intelligent systems. Application areas of mechatronics may include robotics, transportation, energy systems, manufacturing, sensors, actuators, and automation. Techniques of artificial intelligence may include soft computing (fuzzy logic, neural networks, genetic algorithms/evolutionary computing, probabilistic methods, etc.). Techniques may cover frequency and time domains, linear and nonlinear systems, and deterministic and stochastic processes. Hybrid techniques of mechatronics that combine conventional and intelligent methods are also included. First published in 1972, this journal originated with an emphasis on conventional control systems and computer-based applications. Subsequently, with rapid advances in the field and in view of the widespread interest and application of soft computing in control systems, this latter aspect was integrated into the journal. Now the area of mechatronics is included as the main focus. A unique feature of the journal is its pioneering role in bridging the gap between conventional systems and intelligent systems, with an equal emphasis on theory and practical applications, including system modelling, design and instrumentation. It appears four times per year.