{"title":"Many-Versus-Many AUV Attack-Defense Game in 3-D Scenarios Using Hierarchical Multiagent Reinforcement Learning","authors":"Wenhao Gan;Lei Qiao","doi":"10.1109/JIOT.2025.3552116","DOIUrl":null,"url":null,"abstract":"This article proposes a deep reinforcement learning (DRL)-based method for many-versus-many attack-defense games involving autonomous underwater vehicles (AUVs) in 3-D space, focusing on training a defense team to counter attackers. The attackers benefit from speed and unpredictability, while defenders leverage numerical superiority. The scenario includes irregular terrain, and AUVs are limited by low-frequency communication. First, a constrained Apollonius model considering AUV 3-D motion characteristics is developed to evaluate the repulsive effect of defenders on attackers. Second, a hybrid 3-D AUV maneuvering framework integrating end-to-velocity and hierarchical approaches is proposed to reduce the complexity of decision-making strategy learning, enabling AUVs to counter multiattacker threats and learn repulsion strategies across subteams. Third, a scalable learning architecture is designed to adapt to different game scales, with an improved update method to enhance advantage and credit estimation efficiency while ensuring convergence. The combination of population expansion-curriculum training and asynchronous parallel training strengthens the generalization of strategies across various environments. Finally, through comparative analysis with mainstream multiagent deep reinforcement learning-based methods, as well as ablation studies on the framework and rewards, our scheme demonstrates superior learning efficiency and generalization ability. Adversarial experiments across different game scales, along with specialized performance tests, indicate that the defense group exhibits strong robustness and adaptive characteristics.","PeriodicalId":54347,"journal":{"name":"IEEE Internet of Things Journal","volume":"12 13","pages":"23479-23494"},"PeriodicalIF":8.9000,"publicationDate":"2025-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Internet of Things Journal","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10930426/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
This article proposes a deep reinforcement learning (DRL)-based method for many-versus-many attack-defense games involving autonomous underwater vehicles (AUVs) in 3-D space, focusing on training a defense team to counter attackers. The attackers benefit from speed and unpredictability, while defenders leverage numerical superiority. The scenario includes irregular terrain, and AUVs are limited by low-frequency communication. First, a constrained Apollonius model considering AUV 3-D motion characteristics is developed to evaluate the repulsive effect of defenders on attackers. Second, a hybrid 3-D AUV maneuvering framework integrating end-to-velocity and hierarchical approaches is proposed to reduce the complexity of decision-making strategy learning, enabling AUVs to counter multiattacker threats and learn repulsion strategies across subteams. Third, a scalable learning architecture is designed to adapt to different game scales, with an improved update method to enhance advantage and credit estimation efficiency while ensuring convergence. The combination of population expansion-curriculum training and asynchronous parallel training strengthens the generalization of strategies across various environments. Finally, through comparative analysis with mainstream multiagent deep reinforcement learning-based methods, as well as ablation studies on the framework and rewards, our scheme demonstrates superior learning efficiency and generalization ability. Adversarial experiments across different game scales, along with specialized performance tests, indicate that the defense group exhibits strong robustness and adaptive characteristics.
期刊介绍:
The EEE Internet of Things (IoT) Journal publishes articles and review articles covering various aspects of IoT, including IoT system architecture, IoT enabling technologies, IoT communication and networking protocols such as network coding, and IoT services and applications. Topics encompass IoT's impacts on sensor technologies, big data management, and future internet design for applications like smart cities and smart homes. Fields of interest include IoT architecture such as things-centric, data-centric, service-oriented IoT architecture; IoT enabling technologies and systematic integration such as sensor technologies, big sensor data management, and future Internet design for IoT; IoT services, applications, and test-beds such as IoT service middleware, IoT application programming interface (API), IoT application design, and IoT trials/experiments; IoT standardization activities and technology development in different standard development organizations (SDO) such as IEEE, IETF, ITU, 3GPP, ETSI, etc.