{"title":"无监督域自适应目标检测:一种基于UDA-DETR的有效方法","authors":"Hanguang Xiao, Tingting Zhou, Shidong Xiong, Jinlan Li, Zhuhan Li, Xin Liu, Tianhao Deng","doi":"10.1016/j.neucom.2025.129711","DOIUrl":null,"url":null,"abstract":"<div><div>Object detection based on deep learning has a wide range of applications in everyday life. However, when a domain gap exists between the training data (source domain) and real-world data (target domain), the performance of many object detectors significantly deteriorates. To address this issue, numerous Unsupervised Domain Adaptation (UDA) detectors attempt to reduce domain discrepancies and align cross-domain features. While these methods have achieved some success, they often align global features in a class-agnostic manner, neglecting the differences in feature distributions across categories within each domain. Our approach emphasizes the importance of both global image features and local instance features across domains, as well as category-specific information. Specifically, we propose an Encoder Feature Alignment (EFA) module, which introduces domain queries to adversarially align encoder-generated features, enabling the detector to extract more domain-invariant features. Additionally, we design an Instance-level Feature Alignment (IFA) module that extracts class-specific central features from the decoder for category-aware cross-domain feature alignment. During each training iteration, local class features progressively converge to global class features, guided by contrastive learning and adversarial loss to achieve Global Feature Alignment (GFA). Our method achieves 46.0% mean accuracy (mAP) in the Weather Adaptation scenario. Compared to the baseline model, a 19.1% mAP gain is achieved (26.9% <span><math><mo>→</mo></math></span> 46.0%). Extensive experimental results show that our proposed model achieves excellent detection performance and strong generalization ability on multiple domain adaptation benchmark datasets.</div></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":"631 ","pages":"Article 129711"},"PeriodicalIF":6.5000,"publicationDate":"2025-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Unsupervised domain-adaptive object detection: An efficient method based on UDA-DETR\",\"authors\":\"Hanguang Xiao, Tingting Zhou, Shidong Xiong, Jinlan Li, Zhuhan Li, Xin Liu, Tianhao Deng\",\"doi\":\"10.1016/j.neucom.2025.129711\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Object detection based on deep learning has a wide range of applications in everyday life. However, when a domain gap exists between the training data (source domain) and real-world data (target domain), the performance of many object detectors significantly deteriorates. To address this issue, numerous Unsupervised Domain Adaptation (UDA) detectors attempt to reduce domain discrepancies and align cross-domain features. While these methods have achieved some success, they often align global features in a class-agnostic manner, neglecting the differences in feature distributions across categories within each domain. Our approach emphasizes the importance of both global image features and local instance features across domains, as well as category-specific information. Specifically, we propose an Encoder Feature Alignment (EFA) module, which introduces domain queries to adversarially align encoder-generated features, enabling the detector to extract more domain-invariant features. Additionally, we design an Instance-level Feature Alignment (IFA) module that extracts class-specific central features from the decoder for category-aware cross-domain feature alignment. During each training iteration, local class features progressively converge to global class features, guided by contrastive learning and adversarial loss to achieve Global Feature Alignment (GFA). Our method achieves 46.0% mean accuracy (mAP) in the Weather Adaptation scenario. Compared to the baseline model, a 19.1% mAP gain is achieved (26.9% <span><math><mo>→</mo></math></span> 46.0%). Extensive experimental results show that our proposed model achieves excellent detection performance and strong generalization ability on multiple domain adaptation benchmark datasets.</div></div>\",\"PeriodicalId\":19268,\"journal\":{\"name\":\"Neurocomputing\",\"volume\":\"631 \",\"pages\":\"Article 129711\"},\"PeriodicalIF\":6.5000,\"publicationDate\":\"2025-05-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Neurocomputing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0925231225003832\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/2/24 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurocomputing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0925231225003832","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/24 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Unsupervised domain-adaptive object detection: An efficient method based on UDA-DETR
Object detection based on deep learning has a wide range of applications in everyday life. However, when a domain gap exists between the training data (source domain) and real-world data (target domain), the performance of many object detectors significantly deteriorates. To address this issue, numerous Unsupervised Domain Adaptation (UDA) detectors attempt to reduce domain discrepancies and align cross-domain features. While these methods have achieved some success, they often align global features in a class-agnostic manner, neglecting the differences in feature distributions across categories within each domain. Our approach emphasizes the importance of both global image features and local instance features across domains, as well as category-specific information. Specifically, we propose an Encoder Feature Alignment (EFA) module, which introduces domain queries to adversarially align encoder-generated features, enabling the detector to extract more domain-invariant features. Additionally, we design an Instance-level Feature Alignment (IFA) module that extracts class-specific central features from the decoder for category-aware cross-domain feature alignment. During each training iteration, local class features progressively converge to global class features, guided by contrastive learning and adversarial loss to achieve Global Feature Alignment (GFA). Our method achieves 46.0% mean accuracy (mAP) in the Weather Adaptation scenario. Compared to the baseline model, a 19.1% mAP gain is achieved (26.9% 46.0%). Extensive experimental results show that our proposed model achieves excellent detection performance and strong generalization ability on multiple domain adaptation benchmark datasets.
期刊介绍:
Neurocomputing publishes articles describing recent fundamental contributions in the field of neurocomputing. Neurocomputing theory, practice and applications are the essential topics being covered.