{"title":"Long-Tailed Object Mining Based on CLIP Model for Autonomous Driving","authors":"Guorun Yang, Y. Qiao, Jianping Shit, Zhe Wang","doi":"10.1109/ICCR55715.2022.10053861","DOIUrl":null,"url":null,"abstract":"The long-tailed object distribution poses great challenges for autonomous driving. And the field collection of long-tailed objects is difficult and high-cost. In this paper, we propose a novel data mining approach for those long-tailed objects. The softmax distribution produced by CLIP model is adopted as the representation of cropped objects in the image. Then for each long-tailed classification, the category grouping is performed to divide the text concepts into three sets. Finally, combining the softmax representation with the grouped categories, we develop an effective softmax mining algorithm to search and identify the long-tailed objects from the large database. Experiments demonstrate that the proposed method outperforms the baseline results and accurately finds the long-tailed data.","PeriodicalId":441511,"journal":{"name":"2022 4th International Conference on Control and Robotics (ICCR)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 4th International Conference on Control and Robotics (ICCR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCR55715.2022.10053861","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The long-tailed object distribution poses great challenges for autonomous driving. And the field collection of long-tailed objects is difficult and high-cost. In this paper, we propose a novel data mining approach for those long-tailed objects. The softmax distribution produced by CLIP model is adopted as the representation of cropped objects in the image. Then for each long-tailed classification, the category grouping is performed to divide the text concepts into three sets. Finally, combining the softmax representation with the grouped categories, we develop an effective softmax mining algorithm to search and identify the long-tailed objects from the large database. Experiments demonstrate that the proposed method outperforms the baseline results and accurately finds the long-tailed data.