Lateral Motion Control of a Maneuverable Aircraft Using Reinforcement Learning

IF 0.8 Q4 OPTICS Optical Memory and Neural Networks Pub Date : 2024-03-25 DOI:10.3103/S1060992X2401003X

Yu. V. Tiumentsev, R. A. Zarubin

{"title":"Lateral Motion Control of a Maneuverable Aircraft Using Reinforcement Learning","authors":"Yu. V. Tiumentsev, R. A. Zarubin","doi":"10.3103/S1060992X2401003X","DOIUrl":null,"url":null,"abstract":"<p>Machine learning is currently one of the most actively developing research areas. Considerable attention in the ongoing research is paid to problems related to dynamical systems. One of the areas in which the application of machine learning technologies is being actively explored is aircraft of various types and purposes. This state of the art is due to the complexity and variety of tasks that are assigned to aircraft. The complicating factor in this case is incomplete and inaccurate knowledge of the properties of the object under study and the conditions in which it operates. In particular, a variety of abnormal situations may occur during flight, such as equipment failures and structural damage, which must be counteracted by reconfiguring the aircraft’s control system and controls. The aircraft control system must be able to operate effectively under these conditions by promptly changing the parameters and/or structure of the control laws used. Adaptive control methods allow to satisfy this requirement. One of the ways to synthesize control laws for dynamic systems, widely used nowadays, is LQR approach. A significant limitation of this approach is the lack of adaptability of the resulting control law, which prevents its use in conditions of incomplete and inaccurate knowledge of the properties of the control object and the environment in which it operates. To overcome this limitation, it was proposed to modify the standard variant of LQR (Linear Quadratic Regulator) based on approximate dynamic programming, a special case of which is the adaptive critic design (ACD) method. For the ACD-LQR combination, the problem of controlling the lateral motion of a maneuvering aircraft is solved. The results obtained demonstrate the promising potential of this approach to controlling the airplane motion under uncertainty conditions.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"33 1","pages":"1 - 12"},"PeriodicalIF":0.8000,"publicationDate":"2024-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Optical Memory and Neural Networks","FirstCategoryId":"1085","ListUrlMain":"https://link.springer.com/article/10.3103/S1060992X2401003X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"OPTICS","Score":null,"Total":0}

引用次数: 0

Abstract

Machine learning is currently one of the most actively developing research areas. Considerable attention in the ongoing research is paid to problems related to dynamical systems. One of the areas in which the application of machine learning technologies is being actively explored is aircraft of various types and purposes. This state of the art is due to the complexity and variety of tasks that are assigned to aircraft. The complicating factor in this case is incomplete and inaccurate knowledge of the properties of the object under study and the conditions in which it operates. In particular, a variety of abnormal situations may occur during flight, such as equipment failures and structural damage, which must be counteracted by reconfiguring the aircraft’s control system and controls. The aircraft control system must be able to operate effectively under these conditions by promptly changing the parameters and/or structure of the control laws used. Adaptive control methods allow to satisfy this requirement. One of the ways to synthesize control laws for dynamic systems, widely used nowadays, is LQR approach. A significant limitation of this approach is the lack of adaptability of the resulting control law, which prevents its use in conditions of incomplete and inaccurate knowledge of the properties of the control object and the environment in which it operates. To overcome this limitation, it was proposed to modify the standard variant of LQR (Linear Quadratic Regulator) based on approximate dynamic programming, a special case of which is the adaptive critic design (ACD) method. For the ACD-LQR combination, the problem of controlling the lateral motion of a maneuvering aircraft is solved. The results obtained demonstrate the promising potential of this approach to controlling the airplane motion under uncertainty conditions.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用强化学习实现可操控飞机的横向运动控制

摘要机器学习是当前发展最活跃的研究领域之一。正在进行的研究相当关注与动力系统有关的问题。正在积极探索机器学习技术应用的领域之一是各种类型和用途的飞机。这种技术现状是由于分配给飞机的任务复杂多样。在这种情况下，复杂的因素是对所研究对象的属性及其运行条件的了解不全面、不准确。特别是在飞行过程中可能会出现各种异常情况，如设备故障和结构损坏，必须通过重新配置飞机的控制系统和控制装置来应对。飞机控制系统必须能够通过及时改变所使用的控制法则的参数和/或结构，在这些情况下有效运行。自适应控制方法可以满足这一要求。目前广泛使用的动态系统控制法则合成方法之一是 LQR 方法。这种方法的一个重要局限是所产生的控制法则缺乏适应性，因此无法在对控制对象及其运行环境的属性了解不全面和不准确的情况下使用。为了克服这一局限性，有人建议在近似动态编程的基础上修改 LQR（线性二次调节器）的标准变体，其特例就是自适应批判设计（ACD）方法。针对 ACD-LQR 组合，解决了控制机动飞机横向运动的问题。结果表明，这种方法在不确定条件下控制飞机运动的潜力巨大。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Optical Memory and Neural Networks OPTICS-

CiteScore

1.50

自引率

11.10%

发文量

期刊介绍： The journal covers a wide range of issues in information optics such as optical memory, mechanisms for optical data recording and processing, photosensitive materials, optical, optoelectronic and holographic nanostructures, and many other related topics. Papers on memory systems using holographic and biological structures and concepts of brain operation are also included. The journal pays particular attention to research in the field of neural net systems that may lead to a new generation of computional technologies by endowing them with intelligence.