Pub Date : 2024-07-25DOI: 10.1007/s11432-023-4087-4
Longhao Qian, Yi Lok Lo, Hugh Hong-tao Liu
With the growing demand for automation in agriculture, industries increasingly rely on drones to perform crop monitoring and surveillance. In this regard, fixed-wing unmanned aerial systems (UASs) are viable platforms for scanning a large crop field, given their payload capacity and range. To achieve maximum coverage without landing for battery replacement, an algorithm for producing a minimal required energy survey path is essential. Hence, an energy-aware coverage path planning algorithm is proposed herein. The constraints for a fixed-wing UAS to fly at low altitudes while achieving full coverage of the crop field are first analyzed. Then, the full path is decomposed into straight-line and U-turn primitives. Finally, an algorithm to calculate a combination of straight-line segments and U-turns is proposed to obtain the path with minimum required energy consumption. The genetic algorithm is used to efficiently determine the order of the straight-line paths to traverse. Case studies show that the proposed algorithm can produce planning results for a convex-polygon-shaped crop field.
随着农业自动化需求的不断增长,各行各业越来越依赖无人机来进行作物监测和监控。在这方面,鉴于其有效载荷能力和航程,固定翼无人机系统(UAS)是扫描大片作物田的可行平台。为了在不着陆更换电池的情况下实现最大覆盖范围,必须有一种算法来生成所需的最小能量勘测路径。因此,本文提出了一种能量感知覆盖路径规划算法。首先分析了固定翼无人机在低空飞行的同时实现作物田全覆盖的约束条件。然后,将完整路径分解为直线和 U 形转弯基元。最后,提出了一种计算直线段和 U 形转弯组合的算法,以获得所需能耗最小的路径。遗传算法用于有效地确定要穿越的直线路径的顺序。案例研究表明,所提出的算法可以为凸多边形作物田提供规划结果。
{"title":"A path planning algorithm for a crop monitoring fixed-wing unmanned aerial system","authors":"Longhao Qian, Yi Lok Lo, Hugh Hong-tao Liu","doi":"10.1007/s11432-023-4087-4","DOIUrl":"https://doi.org/10.1007/s11432-023-4087-4","url":null,"abstract":"<p>With the growing demand for automation in agriculture, industries increasingly rely on drones to perform crop monitoring and surveillance. In this regard, fixed-wing unmanned aerial systems (UASs) are viable platforms for scanning a large crop field, given their payload capacity and range. To achieve maximum coverage without landing for battery replacement, an algorithm for producing a minimal required energy survey path is essential. Hence, an energy-aware coverage path planning algorithm is proposed herein. The constraints for a fixed-wing UAS to fly at low altitudes while achieving full coverage of the crop field are first analyzed. Then, the full path is decomposed into straight-line and U-turn primitives. Finally, an algorithm to calculate a combination of straight-line segments and U-turns is proposed to obtain the path with minimum required energy consumption. The genetic algorithm is used to efficiently determine the order of the straight-line paths to traverse. Case studies show that the proposed algorithm can produce planning results for a convex-polygon-shaped crop field.</p>","PeriodicalId":21618,"journal":{"name":"Science China Information Sciences","volume":"2021 1","pages":""},"PeriodicalIF":8.8,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141786258","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-25DOI: 10.1007/s11432-024-4056-8
Chen Wang, Xian Li, Yanfeng Gu, Zixu Wang
A multispectral imaging system often cannot capture 3D spatial information owing to hardware limitations, which diminishes the effectiveness across various domains. To address this problem, we have developed a multispectral stereo imaging system along with an adaptive 3D reconstruction algorithm. Unlike existing unmanned aerial vehicle stereo imaging systems, our multispectral stereo imaging system uses two multispectral cameras with asymmetric spectral bands positioned at different angles. This design enables the acquisition of a higher number of bands and lateral spatial information while maintaining a lightweight structure. This system introduces challenges such as large geometric distortions and intensity differences between multiple bands. To accurately recover 3D spatial information, we propose an adaptive 3D reconstruction method. This method employs a position and orientation system-assisted projection transformation and a normalized threshold adjustment strategy. Finally, mutual information is used to reconstruct the multispectral images densely, effectively addressing nonlinear differences and generating a comprehensive multispectral point cloud. Our stereo system was used for two real data collections in different regions, and the efficacy of the proposed 3D reconstruction method was validated by comparing it with existing methods and commercial software.
{"title":"An adaptive 3D reconstruction method for asymmetric dual-angle multispectral stereo imaging system on UAV platform","authors":"Chen Wang, Xian Li, Yanfeng Gu, Zixu Wang","doi":"10.1007/s11432-024-4056-8","DOIUrl":"https://doi.org/10.1007/s11432-024-4056-8","url":null,"abstract":"<p>A multispectral imaging system often cannot capture 3D spatial information owing to hardware limitations, which diminishes the effectiveness across various domains. To address this problem, we have developed a multispectral stereo imaging system along with an adaptive 3D reconstruction algorithm. Unlike existing unmanned aerial vehicle stereo imaging systems, our multispectral stereo imaging system uses two multispectral cameras with asymmetric spectral bands positioned at different angles. This design enables the acquisition of a higher number of bands and lateral spatial information while maintaining a lightweight structure. This system introduces challenges such as large geometric distortions and intensity differences between multiple bands. To accurately recover 3D spatial information, we propose an adaptive 3D reconstruction method. This method employs a position and orientation system-assisted projection transformation and a normalized threshold adjustment strategy. Finally, mutual information is used to reconstruct the multispectral images densely, effectively addressing nonlinear differences and generating a comprehensive multispectral point cloud. Our stereo system was used for two real data collections in different regions, and the efficacy of the proposed 3D reconstruction method was validated by comparing it with existing methods and commercial software.</p>","PeriodicalId":21618,"journal":{"name":"Science China Information Sciences","volume":"86 1","pages":""},"PeriodicalIF":8.8,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141784751","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-25DOI: 10.1007/s11432-023-4029-9
Yu Kang, Jian Di, Ming Li, Yunbo Zhao, Yuhui Wang
Racing drones have attracted increasing attention due to their remarkable high speed and excellent maneuverability. However, autonomous multi-drone racing is quite difficult since it requires quick and agile flight in intricate surroundings and rich drone interaction. To address these issues, we propose a novel autonomous multi-drone racing method based on deep reinforcement learning. A new set of reward functions is proposed to make racing drones learn the racing skills of human experts. Unlike previous methods that required global information about tracks and track boundary constraints, the proposed method requires only limited localized track information within the range of its own onboard sensors. Further, the dynamic response characteristics of racing drones are incorporated into the training environment, so that the proposed method is more in line with the requirements of real drone racing scenarios. In addition, our method has a low computational cost and can meet the requirements of real-time racing. Finally, the effectiveness and superiority of the proposed method are verified by extensive comparison with the state-of-the-art methods in a series of simulations and real-world experiments.
{"title":"Autonomous multi-drone racing method based on deep reinforcement learning","authors":"Yu Kang, Jian Di, Ming Li, Yunbo Zhao, Yuhui Wang","doi":"10.1007/s11432-023-4029-9","DOIUrl":"https://doi.org/10.1007/s11432-023-4029-9","url":null,"abstract":"<p>Racing drones have attracted increasing attention due to their remarkable high speed and excellent maneuverability. However, autonomous multi-drone racing is quite difficult since it requires quick and agile flight in intricate surroundings and rich drone interaction. To address these issues, we propose a novel autonomous multi-drone racing method based on deep reinforcement learning. A new set of reward functions is proposed to make racing drones learn the racing skills of human experts. Unlike previous methods that required global information about tracks and track boundary constraints, the proposed method requires only limited localized track information within the range of its own onboard sensors. Further, the dynamic response characteristics of racing drones are incorporated into the training environment, so that the proposed method is more in line with the requirements of real drone racing scenarios. In addition, our method has a low computational cost and can meet the requirements of real-time racing. Finally, the effectiveness and superiority of the proposed method are verified by extensive comparison with the state-of-the-art methods in a series of simulations and real-world experiments.</p>","PeriodicalId":21618,"journal":{"name":"Science China Information Sciences","volume":"104 1","pages":""},"PeriodicalIF":8.8,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141784830","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Emotion recognition in conversation (ERC) has attracted growing attention in recent years as a result of the advancement and implementation of human-computer interface technologies. In this paper, we propose an emotional inertia and contagion-driven dependency modeling approach (EmotionIC) for ERC tasks. Our EmotionIC consists of three main components, i.e., identity masked multi-head attention (IM-MHA), dialogue-based gated recurrent unit (DiaGRU), and skip-chain conditional random field (SkipCRF). Compared to previous ERC models, EmotionIC can model a conversation more thoroughly at both the feature-extraction and classification levels. The proposed model attempts to integrate the advantages of attention- and recurrence-based methods at the feature-extraction level. Specifically, IMMHA is applied to capture identity-based global contextual dependencies, while DiaGRU is utilized to extract speaker- and temporal-aware local contextual information. At the classification level, SkipCRF can explicitly mine complex emotional flows from higher-order neighboring utterances in the conversation. Experimental results show that our method can significantly outperform the state-of-the-art models on four benchmark datasets. The ablation studies confirm that our modules can effectively model emotional inertia and contagion.
{"title":"EmotionIC: emotional inertia and contagion-driven dependency modeling for emotion recognition in conversation","authors":"Yingjian Liu, Jiang Li, Xiaoping Wang, Zhigang Zeng","doi":"10.1007/s11432-023-3908-6","DOIUrl":"https://doi.org/10.1007/s11432-023-3908-6","url":null,"abstract":"<p>Emotion recognition in conversation (ERC) has attracted growing attention in recent years as a result of the advancement and implementation of human-computer interface technologies. In this paper, we propose an emotional inertia and contagion-driven dependency modeling approach (EmotionIC) for ERC tasks. Our EmotionIC consists of three main components, i.e., identity masked multi-head attention (IM-MHA), dialogue-based gated recurrent unit (DiaGRU), and skip-chain conditional random field (SkipCRF). Compared to previous ERC models, EmotionIC can model a conversation more thoroughly at both the feature-extraction and classification levels. The proposed model attempts to integrate the advantages of attention- and recurrence-based methods at the feature-extraction level. Specifically, IMMHA is applied to capture identity-based global contextual dependencies, while DiaGRU is utilized to extract speaker- and temporal-aware local contextual information. At the classification level, SkipCRF can explicitly mine complex emotional flows from higher-order neighboring utterances in the conversation. Experimental results show that our method can significantly outperform the state-of-the-art models on four benchmark datasets. The ablation studies confirm that our modules can effectively model emotional inertia and contagion.</p>","PeriodicalId":21618,"journal":{"name":"Science China Information Sciences","volume":"71 1","pages":""},"PeriodicalIF":8.8,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141784836","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-25DOI: 10.1007/s11432-024-4057-0
Junzhe Lin, Dan Guo, Tianrui Zhai
High-energy radiation detection and imaging technology has significant applications in high-energy physics research, medical imaging, and industrial monitoring. Lead-free metal halides exhibit exceptional potential for conducting indirect detection of high-energy radiation due to their characteristics of low toxicity, strong stability, high light yield, and large Stokes shift. This paper reviews the most recent advances in lead-free metal halide scintillator materials for X-ray imaging. Subsequently, it lists the most important parameters of scintillator performance and introduces the production procedures for single crystal, powder, and nanocrystal scintillators. Furthermore, it discusses the manufacturing of scintillator films with improved performance, focusing on large-area flexible scintillator films and the coupling with microstructures. Finally, it discusses current challenges and opportunities for enhancing X-ray imaging using lead-free metal halide scintillator materials.
高能辐射探测和成像技术在高能物理研究、医学成像和工业监测方面有着重要的应用。无铅金属卤化物具有毒性低、稳定性强、光产率高、斯托克斯位移大等特点,在进行高能辐射间接探测方面具有非凡的潜力。本文回顾了用于 X 射线成像的无铅金属卤化物闪烁体材料的最新进展。随后,它列出了闪烁体性能的最重要参数,并介绍了单晶、粉末和纳米晶闪烁体的生产程序。此外,它还讨论了如何制造性能更好的闪烁体薄膜,重点是大面积柔性闪烁体薄膜以及与微结构的耦合。最后,它还讨论了当前使用无铅金属卤化物闪烁体材料增强 X 射线成像所面临的挑战和机遇。
{"title":"Lead-free metal halide scintillator materials for imaging applications","authors":"Junzhe Lin, Dan Guo, Tianrui Zhai","doi":"10.1007/s11432-024-4057-0","DOIUrl":"https://doi.org/10.1007/s11432-024-4057-0","url":null,"abstract":"<p>High-energy radiation detection and imaging technology has significant applications in high-energy physics research, medical imaging, and industrial monitoring. Lead-free metal halides exhibit exceptional potential for conducting indirect detection of high-energy radiation due to their characteristics of low toxicity, strong stability, high light yield, and large Stokes shift. This paper reviews the most recent advances in lead-free metal halide scintillator materials for X-ray imaging. Subsequently, it lists the most important parameters of scintillator performance and introduces the production procedures for single crystal, powder, and nanocrystal scintillators. Furthermore, it discusses the manufacturing of scintillator films with improved performance, focusing on large-area flexible scintillator films and the coupling with microstructures. Finally, it discusses current challenges and opportunities for enhancing X-ray imaging using lead-free metal halide scintillator materials.</p>","PeriodicalId":21618,"journal":{"name":"Science China Information Sciences","volume":"140 1","pages":""},"PeriodicalIF":8.8,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141784831","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-25DOI: 10.1007/s11432-023-3974-2
Jingxuan Wen, Huafeng Liu, Liping Jing, Jian Yu
Counterfactual explanations provide explanations by exploring the changes in effect caused by changes in cause. They have attracted significant attention in recommender system research to explore the impact of changes in certain properties on the recommendation mechanism. Among several counterfactual recommendation methods, item-based counterfactual explanation methods have attracted considerable attention because of their flexibility. The core idea of item-based counterfactual explanation methods is to find a minimal subset of interacted items (i.e., short length) such that the recommended item would topple out of the top-K recommendation list once these items have been removed from user interactions (i.e., good quality). Usually, explanations are generated by ranking the precomputed importance of items, which fails to characterize the true importance of interacted items due to separation from the explanation generation. Additionally, the final explanations are generated according to a certain search strategy given the precomputed importance. This indicates that the quality and length of counterfactual explanations are deterministic; therefore, they cannot be balanced once the search strategy is fixed. To overcome these obstacles, this study proposes learning-based counterfactual explanations for recommendation (LCER) to provide counterfactual explanations based on personalized recommendations by jointly modeling the factual and counterfactual preference. To achieve consistency between the computation of importance and generation of counterfactual explanations, the proposed LCER endows an optimizable importance for each interacted item, which is supervised by the goal of counterfactual explanations to guarantee its credibility. Because of the model’s flexibility, the trade-off between quality and length can be customized by setting different proportions. The experimental results on four real-world datasets demonstrate the effectiveness of the proposed LCER over several state-of-the-art baselines, both quantitatively and qualitatively.
{"title":"Learning-based counterfactual explanations for recommendation","authors":"Jingxuan Wen, Huafeng Liu, Liping Jing, Jian Yu","doi":"10.1007/s11432-023-3974-2","DOIUrl":"https://doi.org/10.1007/s11432-023-3974-2","url":null,"abstract":"<p>Counterfactual explanations provide explanations by exploring the changes in effect caused by changes in cause. They have attracted significant attention in recommender system research to explore the impact of changes in certain properties on the recommendation mechanism. Among several counterfactual recommendation methods, item-based counterfactual explanation methods have attracted considerable attention because of their flexibility. The core idea of item-based counterfactual explanation methods is to find a minimal subset of interacted items (i.e., short length) such that the recommended item would topple out of the top-<i>K</i> recommendation list once these items have been removed from user interactions (i.e., good quality). Usually, explanations are generated by ranking the precomputed importance of items, which fails to characterize the true importance of interacted items due to separation from the explanation generation. Additionally, the final explanations are generated according to a certain search strategy given the precomputed importance. This indicates that the quality and length of counterfactual explanations are deterministic; therefore, they cannot be balanced once the search strategy is fixed. To overcome these obstacles, this study proposes learning-based counterfactual explanations for recommendation (LCER) to provide counterfactual explanations based on personalized recommendations by jointly modeling the factual and counterfactual preference. To achieve consistency between the computation of importance and generation of counterfactual explanations, the proposed LCER endows an optimizable importance for each interacted item, which is supervised by the goal of counterfactual explanations to guarantee its credibility. Because of the model’s flexibility, the trade-off between quality and length can be customized by setting different proportions. The experimental results on four real-world datasets demonstrate the effectiveness of the proposed LCER over several state-of-the-art baselines, both quantitatively and qualitatively.</p>","PeriodicalId":21618,"journal":{"name":"Science China Information Sciences","volume":"84 1","pages":""},"PeriodicalIF":8.8,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141784834","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-25DOI: 10.1007/s11432-024-4049-5
Zhao-Min Chen, Jun-Lin Zhan, Hao Chen, Ya Li, Hongjun He, Wu Yang, Zhen-Guo Liu, Wei-Bing Lu
Securing a comfortable, wearable compact frequency beam scanning antenna (FBSA) with robustness to deformation, low specific absorption rate (SAR), and good coverage of the surrounding environment for Internet of Things (IoT) applications, such as on-body navigation and wireless communication is an emerging challenge. In this work, a robust textile-based spoof plasmonic frequency scanning antenna utilizing higher-order modes is presented, which is also robust to deformation caused by the activities of the human body. The innovative design of the element ensures the high-efficiency transmission of the fundamental mode of spoof surface plasmon polaritons (SSPP) structure, providing the potential of being a multifunctional composite device in the compact on-body network. Besides, an artificial magnetic conductor (AMC) is designed underneath the SSPP structure, obtaining a low SAR value (0.113 W/kg), which ensures the safety of users. As a practical realization of this concept, a textile-based spoof plasmonic antenna was fabricated in the microwave regime and the performed experimental results show the proposed antenna has a single-beam radiation characteristic with a 70° beam scanning angle range when the frequency is 4.7–6.0 GHz with a high average realized gain of 13.15 dBi. And it still maintains a steady performance when faced with structure deformation, which proves its robustness. Wireless communication quality experiments are performed to demonstrate the proposed antenna can measure the angles of targets and realize wireless signal transmission to specific targets as the frequency varies, it may find great potential in the field of on-body IoT applications.
{"title":"Robust textile-based spoof plasmonic frequency scanning antenna for on-body IoT applications","authors":"Zhao-Min Chen, Jun-Lin Zhan, Hao Chen, Ya Li, Hongjun He, Wu Yang, Zhen-Guo Liu, Wei-Bing Lu","doi":"10.1007/s11432-024-4049-5","DOIUrl":"https://doi.org/10.1007/s11432-024-4049-5","url":null,"abstract":"<p>Securing a comfortable, wearable compact frequency beam scanning antenna (FBSA) with robustness to deformation, low specific absorption rate (SAR), and good coverage of the surrounding environment for Internet of Things (IoT) applications, such as on-body navigation and wireless communication is an emerging challenge. In this work, a robust textile-based spoof plasmonic frequency scanning antenna utilizing higher-order modes is presented, which is also robust to deformation caused by the activities of the human body. The innovative design of the element ensures the high-efficiency transmission of the fundamental mode of spoof surface plasmon polaritons (SSPP) structure, providing the potential of being a multifunctional composite device in the compact on-body network. Besides, an artificial magnetic conductor (AMC) is designed underneath the SSPP structure, obtaining a low SAR value (0.113 W/kg), which ensures the safety of users. As a practical realization of this concept, a textile-based spoof plasmonic antenna was fabricated in the microwave regime and the performed experimental results show the proposed antenna has a single-beam radiation characteristic with a 70° beam scanning angle range when the frequency is 4.7–6.0 GHz with a high average realized gain of 13.15 dBi. And it still maintains a steady performance when faced with structure deformation, which proves its robustness. Wireless communication quality experiments are performed to demonstrate the proposed antenna can measure the angles of targets and realize wireless signal transmission to specific targets as the frequency varies, it may find great potential in the field of on-body IoT applications.</p>","PeriodicalId":21618,"journal":{"name":"Science China Information Sciences","volume":"20 1","pages":""},"PeriodicalIF":8.8,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141786254","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-24DOI: 10.1007/s11432-023-4088-2
Zhiqiang Zheng, Chen Wei, Haibin Duan
During short-range air combat involving unmanned aircraft vehicle (UAV) swarms, UAVs must make accurate maneuver decisions based on information from both enemy and friendly UAVs. This dual requirement of competition and cooperation presents a significant challenge in the field of unmanned air combat. In this paper, a method based on multi-agent reinforcement learning (MARL) is proposed to address this issue. An actor network containing three subnetworks that can handle different types of situational information is designed. Hence, the results from simpler one-on-one scenarios are leveraged to enhance the complex swarm air combat training process. Separate state spaces for local and global information are designed for the actor and critic networks. A detailed reward function is proposed to encourage participation. To prevent lazy participants in air combat, a reward assignment operation is applied to distribute these dense rewards. Simulation testing and ablation experiments demonstrate that both the transfer operation and reward assignment operation can effectively deal with the swarm air combat scenario, and reflect the effectiveness of the proposed method.
{"title":"UAV swarm air combat maneuver decision-making method based on multi-agent reinforcement learning and transferring","authors":"Zhiqiang Zheng, Chen Wei, Haibin Duan","doi":"10.1007/s11432-023-4088-2","DOIUrl":"https://doi.org/10.1007/s11432-023-4088-2","url":null,"abstract":"<p>During short-range air combat involving unmanned aircraft vehicle (UAV) swarms, UAVs must make accurate maneuver decisions based on information from both enemy and friendly UAVs. This dual requirement of competition and cooperation presents a significant challenge in the field of unmanned air combat. In this paper, a method based on multi-agent reinforcement learning (MARL) is proposed to address this issue. An actor network containing three subnetworks that can handle different types of situational information is designed. Hence, the results from simpler one-on-one scenarios are leveraged to enhance the complex swarm air combat training process. Separate state spaces for local and global information are designed for the actor and critic networks. A detailed reward function is proposed to encourage participation. To prevent lazy participants in air combat, a reward assignment operation is applied to distribute these dense rewards. Simulation testing and ablation experiments demonstrate that both the transfer operation and reward assignment operation can effectively deal with the swarm air combat scenario, and reflect the effectiveness of the proposed method.</p>","PeriodicalId":21618,"journal":{"name":"Science China Information Sciences","volume":"3 1","pages":""},"PeriodicalIF":8.8,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141784828","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-24DOI: 10.1007/s11432-023-4042-1
Siyuan Wang, Andrey Polyakov, Min Li, Gang Zheng, Driss Boutat
The objective of the invariant ellipsoid method is to minimize the smallest invariant and attractive set of a linear control system operating under the influence of bounded external disturbances. This study extends the application of this method to address the leader-following consensus problem. Initially, a linear control protocol is designed for the multi-agent system in the absence of disturbances. Subsequently, in the presence of bounded disturbances, by employing a similar linear control protocol, a necessary and sufficient condition is introduced to derive the optimal control parameters for the multi-agent system such that the state of followers converges to and remains in a minimal invariant ellipsoid around the state of the leader.
{"title":"Optimal rejection of bounded perturbations in linear leader-following consensus protocol: invariant ellipsoid method","authors":"Siyuan Wang, Andrey Polyakov, Min Li, Gang Zheng, Driss Boutat","doi":"10.1007/s11432-023-4042-1","DOIUrl":"https://doi.org/10.1007/s11432-023-4042-1","url":null,"abstract":"<p>The objective of the invariant ellipsoid method is to minimize the smallest invariant and attractive set of a linear control system operating under the influence of bounded external disturbances. This study extends the application of this method to address the leader-following consensus problem. Initially, a linear control protocol is designed for the multi-agent system in the absence of disturbances. Subsequently, in the presence of bounded disturbances, by employing a similar linear control protocol, a necessary and sufficient condition is introduced to derive the optimal control parameters for the multi-agent system such that the state of followers converges to and remains in a minimal invariant ellipsoid around the state of the leader.</p>","PeriodicalId":21618,"journal":{"name":"Science China Information Sciences","volume":"35 1","pages":""},"PeriodicalIF":8.8,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141784826","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-24DOI: 10.1007/s11432-022-4097-y
Yu-Lei Li, Yan Yan, Yang Lu, Hanzi Wang
We present a wavelet-domain feature-decoupling Transformer-based tracking network for the weakly supervised MOT task (FDMOT). Our FDMOT has two improvements over the previous weakly supervised methods. First, FDMOT decouples noisy intermediate features caused by noisy pseudo identity labels in the wavelet domain, extracting discriminative features for accurately detecting and identifying multiple targets. Second, FDMOT further improves the noise-decoupled embedding features into the well-refined ones with the cooperation of the three feature-decoupling Transformer-based branches, which can accurately identify and track heavily occluded targets in crowded scenes. Experimental results show the superiority of FDMOT compared with several state-of-the-art supervised and weakly supervised MOT methods.
我们提出了一种基于小波域特征解耦变换器的跟踪网络,用于弱监督 MOT 任务(FDMOT)。与之前的弱监督方法相比,我们的 FDMOT 有两点改进。首先,FDMOT 在小波域中解耦了由噪声伪身份标签引起的噪声中间特征,提取了用于准确检测和识别多个目标的判别特征。其次,在基于变换器的三个特征解耦分支的配合下,FDMOT 进一步将噪声解耦嵌入特征改进为精炼特征,可在拥挤场景中准确识别和跟踪重度遮挡目标。实验结果表明,与几种最先进的监督式和弱监督式 MOT 方法相比,FDMOT 更具优势。
{"title":"Wavelet-domain feature decoupling for weakly supervised multi-object tracking","authors":"Yu-Lei Li, Yan Yan, Yang Lu, Hanzi Wang","doi":"10.1007/s11432-022-4097-y","DOIUrl":"https://doi.org/10.1007/s11432-022-4097-y","url":null,"abstract":"<p>We present a wavelet-domain feature-decoupling Transformer-based tracking network for the weakly supervised MOT task (FDMOT). Our FDMOT has two improvements over the previous weakly supervised methods. First, FDMOT decouples noisy intermediate features caused by noisy pseudo identity labels in the wavelet domain, extracting discriminative features for accurately detecting and identifying multiple targets. Second, FDMOT further improves the noise-decoupled embedding features into the well-refined ones with the cooperation of the three feature-decoupling Transformer-based branches, which can accurately identify and track heavily occluded targets in crowded scenes. Experimental results show the superiority of FDMOT compared with several state-of-the-art supervised and weakly supervised MOT methods.</p>","PeriodicalId":21618,"journal":{"name":"Science China Information Sciences","volume":"35 1","pages":""},"PeriodicalIF":8.8,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141784829","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}