{"title":"基于空间自适应特征聚合的多代理检测系统","authors":"Hongbo Wang;He Wang;Xin Zhang;Runze Ruan;Yueyun Wang;Yuyu Yin","doi":"10.1109/JSYST.2024.3423752","DOIUrl":null,"url":null,"abstract":"Detection systems based on computer vision play important roles in Large-Scale Multiagent Systems. In particular, it can automatically locate and identify key objects and enhance intelligent collaboration and coordination among multiple agents. However, classification and localization in object detection may produce inconsistent prediction results due to different learning focus. Therefore, we propose a Spatial Decoupling and Boundary Feature Aggregation Network (SDBA-Net) to achieve spatial decoupling and task alignment. SDBA-Net includes a spatially sensitive region-aware module (SSRM) and a boundary feature aggregation module (BFAM). SSRM predicts sensitive regions for each task while minimizing computational cost. BFAM extracts valuable boundary features within sensitive regions and aligns them with corresponding anchors. These two modules are combined to spatially decouple and align the features of two tasks. In addition, a significance dependency complementary module (SDCM) is introduced. It enables SSRM to quickly adjust the sensitive region of the classification task to the significant feature region. Experiments are conducted on a large-scale complex real-world dataset MS COCO (Lin et al., 2014). The results show that SDBA-Net achieves better results than the baselines. Using the ResNet-50 backbone, our method improves the average precision (AP) of the single-stage detector VFNet by 1.0 point (from 41.3 to 42.3). In particular, when using the Res2Net-101-DCN backbone, SDBA-Net achieves an AP of 51.8 on the MS COCO test-dev.","PeriodicalId":55017,"journal":{"name":"IEEE Systems Journal","volume":"18 4","pages":"1849-1859"},"PeriodicalIF":4.0000,"publicationDate":"2024-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multiagent Detection System Based on Spatial Adaptive Feature Aggregation\",\"authors\":\"Hongbo Wang;He Wang;Xin Zhang;Runze Ruan;Yueyun Wang;Yuyu Yin\",\"doi\":\"10.1109/JSYST.2024.3423752\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Detection systems based on computer vision play important roles in Large-Scale Multiagent Systems. In particular, it can automatically locate and identify key objects and enhance intelligent collaboration and coordination among multiple agents. However, classification and localization in object detection may produce inconsistent prediction results due to different learning focus. Therefore, we propose a Spatial Decoupling and Boundary Feature Aggregation Network (SDBA-Net) to achieve spatial decoupling and task alignment. SDBA-Net includes a spatially sensitive region-aware module (SSRM) and a boundary feature aggregation module (BFAM). SSRM predicts sensitive regions for each task while minimizing computational cost. BFAM extracts valuable boundary features within sensitive regions and aligns them with corresponding anchors. These two modules are combined to spatially decouple and align the features of two tasks. In addition, a significance dependency complementary module (SDCM) is introduced. It enables SSRM to quickly adjust the sensitive region of the classification task to the significant feature region. Experiments are conducted on a large-scale complex real-world dataset MS COCO (Lin et al., 2014). The results show that SDBA-Net achieves better results than the baselines. Using the ResNet-50 backbone, our method improves the average precision (AP) of the single-stage detector VFNet by 1.0 point (from 41.3 to 42.3). In particular, when using the Res2Net-101-DCN backbone, SDBA-Net achieves an AP of 51.8 on the MS COCO test-dev.\",\"PeriodicalId\":55017,\"journal\":{\"name\":\"IEEE Systems Journal\",\"volume\":\"18 4\",\"pages\":\"1849-1859\"},\"PeriodicalIF\":4.0000,\"publicationDate\":\"2024-07-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Systems Journal\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10598835/\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Systems Journal","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10598835/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
摘要
基于计算机视觉的检测系统在大规模多智能体系统中占有重要地位。特别是,它可以自动定位和识别关键对象,增强多个agent之间的智能协作和协调。然而,在目标检测中的分类和定位,由于学习重点的不同,可能会产生不一致的预测结果。为此,我们提出了一种空间解耦和边界特征聚合网络(SDBA-Net)来实现空间解耦和任务对齐。SDBA-Net包括一个空间敏感区域感知模块(SSRM)和一个边界特征聚合模块(BFAM)。SSRM预测每个任务的敏感区域,同时最小化计算成本。BFAM提取敏感区域内有价值的边界特征,并将其与相应的锚点对齐。将这两个模块结合起来,对两个任务的特征进行空间解耦和对齐。此外,还引入了显著性依赖互补模块(SDCM)。它使SSRM能够快速地将分类任务的敏感区域调整到显著特征区域。实验是在大规模复杂的现实世界数据集MS COCO上进行的(Lin et al., 2014)。结果表明,SDBA-Net取得了比基线更好的效果。利用ResNet-50骨干网,将单级探测器VFNet的平均精度(AP)提高了1.0点(从41.3提高到42.3)。特别是,当使用Res2Net-101-DCN骨干网时,SDBA-Net在MS COCO测试开发上实现了51.8的AP。
Multiagent Detection System Based on Spatial Adaptive Feature Aggregation
Detection systems based on computer vision play important roles in Large-Scale Multiagent Systems. In particular, it can automatically locate and identify key objects and enhance intelligent collaboration and coordination among multiple agents. However, classification and localization in object detection may produce inconsistent prediction results due to different learning focus. Therefore, we propose a Spatial Decoupling and Boundary Feature Aggregation Network (SDBA-Net) to achieve spatial decoupling and task alignment. SDBA-Net includes a spatially sensitive region-aware module (SSRM) and a boundary feature aggregation module (BFAM). SSRM predicts sensitive regions for each task while minimizing computational cost. BFAM extracts valuable boundary features within sensitive regions and aligns them with corresponding anchors. These two modules are combined to spatially decouple and align the features of two tasks. In addition, a significance dependency complementary module (SDCM) is introduced. It enables SSRM to quickly adjust the sensitive region of the classification task to the significant feature region. Experiments are conducted on a large-scale complex real-world dataset MS COCO (Lin et al., 2014). The results show that SDBA-Net achieves better results than the baselines. Using the ResNet-50 backbone, our method improves the average precision (AP) of the single-stage detector VFNet by 1.0 point (from 41.3 to 42.3). In particular, when using the Res2Net-101-DCN backbone, SDBA-Net achieves an AP of 51.8 on the MS COCO test-dev.
期刊介绍:
This publication provides a systems-level, focused forum for application-oriented manuscripts that address complex systems and system-of-systems of national and global significance. It intends to encourage and facilitate cooperation and interaction among IEEE Societies with systems-level and systems engineering interest, and to attract non-IEEE contributors and readers from around the globe. Our IEEE Systems Council job is to address issues in new ways that are not solvable in the domains of the existing IEEE or other societies or global organizations. These problems do not fit within traditional hierarchical boundaries. For example, disaster response such as that triggered by Hurricane Katrina, tsunamis, or current volcanic eruptions is not solvable by pure engineering solutions. We need to think about changing and enlarging the paradigm to include systems issues.