{"title":"Cot-DCN-YOLO: Self-attention-enhancing YOLOv8s for detecting garbage bins in urban street view images","authors":"Shan Dong , Wenhao Xu , Huihui Zhang , Litao Gong","doi":"10.1016/j.ejrs.2025.01.002","DOIUrl":null,"url":null,"abstract":"<div><div>Accurately and quickly obtaining information from garbage bins has great application value in smart city construction and urban environmental management. However, existing deep learning methods are affected by factors such as occlusion, large geometric appearance differences, and multi-scale, leading to missed detections in garbage bin detection results. We propose a Cot-DCN-YOLO model for garbage bin detection, which is designed to effectively extract contextual information with the Double Convolutions Semantic Transformation (DCST) module, which addresses the vulnerability of garbage bins to occlusion. According to the large geometric appearance differences when garbage bins are damaged, we propose the C2f embedded with DCNv2 (DC2f) module, which can adaptively adjust the target shape with a flexible receptive field. Furthermore, considering the multi-scale characteristics of garbage bins in images, we introduce the SPPCSPC module. Experimental results show that compared with other methods, Cot-DCN-YOLO achieves the best results on our self-made garbage bin dataset, with Precision, Recall, and mAP reaching 77.1%, 69.4%, and 74.0%, respectively, outperforming existing SOTA methods.</div></div>","PeriodicalId":48539,"journal":{"name":"Egyptian Journal of Remote Sensing and Space Sciences","volume":"28 1","pages":"Pages 89-98"},"PeriodicalIF":3.7000,"publicationDate":"2025-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Egyptian Journal of Remote Sensing and Space Sciences","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S111098232500002X","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Accurately and quickly obtaining information from garbage bins has great application value in smart city construction and urban environmental management. However, existing deep learning methods are affected by factors such as occlusion, large geometric appearance differences, and multi-scale, leading to missed detections in garbage bin detection results. We propose a Cot-DCN-YOLO model for garbage bin detection, which is designed to effectively extract contextual information with the Double Convolutions Semantic Transformation (DCST) module, which addresses the vulnerability of garbage bins to occlusion. According to the large geometric appearance differences when garbage bins are damaged, we propose the C2f embedded with DCNv2 (DC2f) module, which can adaptively adjust the target shape with a flexible receptive field. Furthermore, considering the multi-scale characteristics of garbage bins in images, we introduce the SPPCSPC module. Experimental results show that compared with other methods, Cot-DCN-YOLO achieves the best results on our self-made garbage bin dataset, with Precision, Recall, and mAP reaching 77.1%, 69.4%, and 74.0%, respectively, outperforming existing SOTA methods.
期刊介绍:
The Egyptian Journal of Remote Sensing and Space Sciences (EJRS) encompasses a comprehensive range of topics within Remote Sensing, Geographic Information Systems (GIS), planetary geology, and space technology development, including theories, applications, and modeling. EJRS aims to disseminate high-quality, peer-reviewed research focusing on the advancement of remote sensing and GIS technologies and their practical applications for effective planning, sustainable development, and environmental resource conservation. The journal particularly welcomes innovative papers with broad scientific appeal.