{"title":"Attention-assisted dual-branch interactive face super-resolution network","authors":"Xujie Wan , Siyu Xu , Guangwei Gao","doi":"10.1016/j.cogr.2025.01.001","DOIUrl":null,"url":null,"abstract":"<div><div>We propose a deep learning-based Attention-Assisted Dual-Branch Interactive Network (ADBINet) to improve facial super-resolution by addressing key challenges like inadequate feature extraction and poor multi-scale information handling. ADBINet features a multi-scale encoder-decoder architecture that captures and integrates features across scales, enhancing detail and reconstruction quality. The key to our approach is the Transformer and CNN Interaction Module (TCIM), which includes a Dual Attention Collaboration Module (DACM) for improved local and spatial feature extraction. The Channel Attention Guidance Module (CAGM) refines CNN and Transformer fusion, ensuring precise facial detail restoration. Additionally, the Attention Feature Fusion Unit (AFFM) optimizes multi-scale feature integration. Experimental results demonstrate that ADBINet outperforms existing methods in both quantitative and qualitative facial super-resolution metrics.</div></div>","PeriodicalId":100288,"journal":{"name":"Cognitive Robotics","volume":"5 ","pages":"Pages 77-85"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cognitive Robotics","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2667241325000023","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
We propose a deep learning-based Attention-Assisted Dual-Branch Interactive Network (ADBINet) to improve facial super-resolution by addressing key challenges like inadequate feature extraction and poor multi-scale information handling. ADBINet features a multi-scale encoder-decoder architecture that captures and integrates features across scales, enhancing detail and reconstruction quality. The key to our approach is the Transformer and CNN Interaction Module (TCIM), which includes a Dual Attention Collaboration Module (DACM) for improved local and spatial feature extraction. The Channel Attention Guidance Module (CAGM) refines CNN and Transformer fusion, ensuring precise facial detail restoration. Additionally, the Attention Feature Fusion Unit (AFFM) optimizes multi-scale feature integration. Experimental results demonstrate that ADBINet outperforms existing methods in both quantitative and qualitative facial super-resolution metrics.