{"title":"A Progressive-Assisted Object Detection Method Based on Instance Attention","authors":"Ziwen Sun;Zhizhong Xi;Hao Li;Chong Ling;Dong Chen;Xiaoyan Qin","doi":"10.1109/ACCESS.2024.3459941","DOIUrl":null,"url":null,"abstract":"Overcoming the high cost of self-attention operation in Transformer-based object detection methods and improving the detection accuracy of small objects is one of the difficulties in the field of object detection research. This paper designs a progressive assisted object detection method PaoDet based on Transformer, which uses common feature extraction backbone such as Resnet and ViT to extract multi-scale features of the input image, and uses RPN(Region Proposal Network) to extract proposals of different scales; Subsequently, a progressive modeling approach was adopted to perform self-attention and cross-attention operations on proposals of different scales from large to small, achieving feature interaction between instances, ensuring high detection efficiency and low computational complexity. During the training process, each layer of the network has certain generalization ability for detecting adjacent scale objects under the supervision of a dynamic scale division method. Compared with state-of-the-art object detection methods on COCO and UAVDT datasets, the effectiveness and superiority of the proposed method were demonstrated.","PeriodicalId":3,"journal":{"name":"ACS Applied Electronic Materials","volume":null,"pages":null},"PeriodicalIF":4.3000,"publicationDate":"2024-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10713239","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Electronic Materials","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10713239/","RegionNum":3,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Overcoming the high cost of self-attention operation in Transformer-based object detection methods and improving the detection accuracy of small objects is one of the difficulties in the field of object detection research. This paper designs a progressive assisted object detection method PaoDet based on Transformer, which uses common feature extraction backbone such as Resnet and ViT to extract multi-scale features of the input image, and uses RPN(Region Proposal Network) to extract proposals of different scales; Subsequently, a progressive modeling approach was adopted to perform self-attention and cross-attention operations on proposals of different scales from large to small, achieving feature interaction between instances, ensuring high detection efficiency and low computational complexity. During the training process, each layer of the network has certain generalization ability for detecting adjacent scale objects under the supervision of a dynamic scale division method. Compared with state-of-the-art object detection methods on COCO and UAVDT datasets, the effectiveness and superiority of the proposed method were demonstrated.