{"title":"Bag of Tricks for “Vision Meet Alage” Object Detection Challenge","authors":"Xiaode Fu, Fei Shen, Xiaoyu Du, Zechao Li","doi":"10.1109/UV56588.2022.10185500","DOIUrl":null,"url":null,"abstract":"In this paper, we introduce our solution to the “Vision Meets Algae” Workshop and Challenge (VisAlgae) in details. Since a large number of small objects and similar categories, the location and classification of algae are challenging. For that, we propose a bag of tricks for VisAlgae, including data augmentation, model architecture, and pipeline. For data augmentation, we introduce bounding-box jitter, mix-up, multi-scale, albu, and test time augmentation to increase sample diversity and randomness. We learn a better region of interest (RoI) features by adding global semantic information to RoI features. Especially a novelty double head is devised to enhance final features via reserving spatial and channel information. For the pipeline, We introduce the detector framework, backbone, stochastic weights averaging, pseudo labels, and weighted boxes fusion. Experimental results demonstrate that our approach can achieve an excellent mean average precision (mAP) performance of object detection.","PeriodicalId":211011,"journal":{"name":"2022 6th International Conference on Universal Village (UV)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 6th International Conference on Universal Village (UV)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/UV56588.2022.10185500","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
In this paper, we introduce our solution to the “Vision Meets Algae” Workshop and Challenge (VisAlgae) in details. Since a large number of small objects and similar categories, the location and classification of algae are challenging. For that, we propose a bag of tricks for VisAlgae, including data augmentation, model architecture, and pipeline. For data augmentation, we introduce bounding-box jitter, mix-up, multi-scale, albu, and test time augmentation to increase sample diversity and randomness. We learn a better region of interest (RoI) features by adding global semantic information to RoI features. Especially a novelty double head is devised to enhance final features via reserving spatial and channel information. For the pipeline, We introduce the detector framework, backbone, stochastic weights averaging, pseudo labels, and weighted boxes fusion. Experimental results demonstrate that our approach can achieve an excellent mean average precision (mAP) performance of object detection.