{"title":"Adaptive Depth Enhancement Network for RGB-D Salient Object Detection","authors":"Kang Yi;Yumeng Li;Haoran Tang;Jing Xu","doi":"10.1109/LSP.2024.3506863","DOIUrl":null,"url":null,"abstract":"RGB-D Salient Object Detection (SOD) aims to identify and highlight the most visually prominent objects from complex backgrounds by leveraging both RGB and depth information. However, depth maps often suffer from noise and inconsistencies due to the imaging modalities and sensor limitations. Additionally, the low-level spatial details and high-level semantic information from multiple levels pose another complexity layer. These issues result in depth maps that may not align well with the corresponding RGB images, causing incorrect foreground and background segmentation. To address these issues, we propose a novel adaptive depth enhancement network (ADENet), which adopts the Depth Feature Refinement (DFR) module to mitigate the negative impact of low-quality depth data and improve the synergy between multi-modal features. We also design a simple yet effective Cross Modality Fusion (CMF) module that combines the spatial and channel attention mechanisms to calibrate single modality features and boost the fusion. The Progressive Multiscale Aggregation (PMA) decoder has also been introduced to integrate multiscale features, promoting more globally retained information. Extensive experiments illustrate that our proposed ADENet is superior to the other 10 state-of-the-art methods on four benchmark datasets.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"32 ","pages":"176-180"},"PeriodicalIF":3.2000,"publicationDate":"2024-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Signal Processing Letters","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10767761/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
RGB-D Salient Object Detection (SOD) aims to identify and highlight the most visually prominent objects from complex backgrounds by leveraging both RGB and depth information. However, depth maps often suffer from noise and inconsistencies due to the imaging modalities and sensor limitations. Additionally, the low-level spatial details and high-level semantic information from multiple levels pose another complexity layer. These issues result in depth maps that may not align well with the corresponding RGB images, causing incorrect foreground and background segmentation. To address these issues, we propose a novel adaptive depth enhancement network (ADENet), which adopts the Depth Feature Refinement (DFR) module to mitigate the negative impact of low-quality depth data and improve the synergy between multi-modal features. We also design a simple yet effective Cross Modality Fusion (CMF) module that combines the spatial and channel attention mechanisms to calibrate single modality features and boost the fusion. The Progressive Multiscale Aggregation (PMA) decoder has also been introduced to integrate multiscale features, promoting more globally retained information. Extensive experiments illustrate that our proposed ADENet is superior to the other 10 state-of-the-art methods on four benchmark datasets.
期刊介绍:
The IEEE Signal Processing Letters is a monthly, archival publication designed to provide rapid dissemination of original, cutting-edge ideas and timely, significant contributions in signal, image, speech, language and audio processing. Papers published in the Letters can be presented within one year of their appearance in signal processing conferences such as ICASSP, GlobalSIP and ICIP, and also in several workshop organized by the Signal Processing Society.