Hierarchical reinforcement learning method for long-horizon path planning of stratospheric airship

IF 5 1区工程技术 Q1 ENGINEERING, AEROSPACE Aerospace Science and Technology Pub Date : 2025-02-18 DOI:10.1016/j.ast.2025.110075

Chao Lv , Ming Zhu , Xiao Guo , Jiajun Ou , Wenjie Lou

{"title":"Hierarchical reinforcement learning method for long-horizon path planning of stratospheric airship","authors":"Chao Lv , Ming Zhu , Xiao Guo , Jiajun Ou , Wenjie Lou","doi":"10.1016/j.ast.2025.110075","DOIUrl":null,"url":null,"abstract":"<div><div>The rapid development of stratospheric airships has shown excellent application prospects, such as meteorological research, remote sensing, communication, and so on. The path planning of stratospheric airships has become the focus of research. Traditional methods have already implemented the path planning problem for simple scenarios. However, long-horizon path planning in a dynamic environment, causing problems like state explosion and time abstraction, is difficult to solve by traditional algorithms. This paper presents a hierarchical TD3 algorithm (H-TD3), a long-horizon path planning with a hierarchical framework operating on different temporal scales. It consists of two layers: the high-level controller and the low-level controller. The high-level controller decomposes the long-horizon path planning task into short-horizon navigation tasks, completed by the low-level controller for short-horizon path planning. In addition, we introduce an execution reward to promote cooperation between the high-level controller and the low-level controller to complete the task. Finally, the model is trained and tested in forecast wind fields and compared with other algorithms based on deep reinforcement learning. The effectiveness of the proposed method in long-horizon path planning is verified.</div></div>","PeriodicalId":50955,"journal":{"name":"Aerospace Science and Technology","volume":"160 ","pages":"Article 110075"},"PeriodicalIF":5.0000,"publicationDate":"2025-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Aerospace Science and Technology","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1270963825001464","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, AEROSPACE","Score":null,"Total":0}

引用次数: 0

Abstract

The rapid development of stratospheric airships has shown excellent application prospects, such as meteorological research, remote sensing, communication, and so on. The path planning of stratospheric airships has become the focus of research. Traditional methods have already implemented the path planning problem for simple scenarios. However, long-horizon path planning in a dynamic environment, causing problems like state explosion and time abstraction, is difficult to solve by traditional algorithms. This paper presents a hierarchical TD3 algorithm (H-TD3), a long-horizon path planning with a hierarchical framework operating on different temporal scales. It consists of two layers: the high-level controller and the low-level controller. The high-level controller decomposes the long-horizon path planning task into short-horizon navigation tasks, completed by the low-level controller for short-horizon path planning. In addition, we introduce an execution reward to promote cooperation between the high-level controller and the low-level controller to complete the task. Finally, the model is trained and tested in forecast wind fields and compared with other algorithms based on deep reinforcement learning. The effectiveness of the proposed method in long-horizon path planning is verified.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

求助全文

约1分钟内获得全文去求助

来源期刊

Aerospace Science and Technology 工程技术-工程：宇航

CiteScore

10.30

自引率

28.60%

发文量

654

审稿时长

54 days

期刊介绍： Aerospace Science and Technology publishes articles of outstanding scientific quality. Each article is reviewed by two referees. The journal welcomes papers from a wide range of countries. This journal publishes original papers, review articles and short communications related to all fields of aerospace research, fundamental and applied, potential applications of which are clearly related to: • The design and the manufacture of aircraft, helicopters, missiles, launchers and satellites • The control of their environment • The study of various systems they are involved in, as supports or as targets. Authors are invited to submit papers on new advances in the following topics to aerospace applications: • Fluid dynamics • Energetics and propulsion • Materials and structures • Flight mechanics • Navigation, guidance and control • Acoustics • Optics • Electromagnetism and radar • Signal and image processing • Information processing • Data fusion • Decision aid • Human behaviour • Robotics and intelligent systems • Complex system engineering. Etc.