{"title":"基于双分支特征提取的少镜头语义分割","authors":"Hongjie Zhou","doi":"10.1109/PRMVIA58252.2023.00053","DOIUrl":null,"url":null,"abstract":"Few-shot semantic segmentation (FSS) requires only few labeled samples to achieve good segmentation performance and thus has received extensive attention. However, existing FFS methods usually adopt a simple convolutional structure as the backbone, which suffers from poor feature extraction ability. In order to address this issue, a novel few-shot segmentation network based on dual-branch feature extraction (DFESN) is proposed. First, an attention-enhanced ResNet is used as the local feature extraction branch. Specifically, we in-corporate channel attention operations into each building block of ResNet to model the importance among channels, which enables DFESN to learn important class information for the segmentation task. Besides, we introduce a Vision Transformer as the global feature extraction branch. This branch leverages the multi-head self-attention mechanism in Vision Transformer to model the global dependencies of support and query image features, further enhancing the feature extraction capabilities of DFESN. We conduct experiments on the PASCAL-5i dataset and demonstrate the superiority of our DFESN.","PeriodicalId":221346,"journal":{"name":"2023 International Conference on Pattern Recognition, Machine Vision and Intelligent Algorithms (PRMVIA)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Few-Shot Semantic Segmentation Based on Dual-Branch Feature Extraction\",\"authors\":\"Hongjie Zhou\",\"doi\":\"10.1109/PRMVIA58252.2023.00053\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Few-shot semantic segmentation (FSS) requires only few labeled samples to achieve good segmentation performance and thus has received extensive attention. However, existing FFS methods usually adopt a simple convolutional structure as the backbone, which suffers from poor feature extraction ability. In order to address this issue, a novel few-shot segmentation network based on dual-branch feature extraction (DFESN) is proposed. First, an attention-enhanced ResNet is used as the local feature extraction branch. Specifically, we in-corporate channel attention operations into each building block of ResNet to model the importance among channels, which enables DFESN to learn important class information for the segmentation task. Besides, we introduce a Vision Transformer as the global feature extraction branch. This branch leverages the multi-head self-attention mechanism in Vision Transformer to model the global dependencies of support and query image features, further enhancing the feature extraction capabilities of DFESN. We conduct experiments on the PASCAL-5i dataset and demonstrate the superiority of our DFESN.\",\"PeriodicalId\":221346,\"journal\":{\"name\":\"2023 International Conference on Pattern Recognition, Machine Vision and Intelligent Algorithms (PRMVIA)\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 International Conference on Pattern Recognition, Machine Vision and Intelligent Algorithms (PRMVIA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/PRMVIA58252.2023.00053\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Pattern Recognition, Machine Vision and Intelligent Algorithms (PRMVIA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PRMVIA58252.2023.00053","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Few-Shot Semantic Segmentation Based on Dual-Branch Feature Extraction
Few-shot semantic segmentation (FSS) requires only few labeled samples to achieve good segmentation performance and thus has received extensive attention. However, existing FFS methods usually adopt a simple convolutional structure as the backbone, which suffers from poor feature extraction ability. In order to address this issue, a novel few-shot segmentation network based on dual-branch feature extraction (DFESN) is proposed. First, an attention-enhanced ResNet is used as the local feature extraction branch. Specifically, we in-corporate channel attention operations into each building block of ResNet to model the importance among channels, which enables DFESN to learn important class information for the segmentation task. Besides, we introduce a Vision Transformer as the global feature extraction branch. This branch leverages the multi-head self-attention mechanism in Vision Transformer to model the global dependencies of support and query image features, further enhancing the feature extraction capabilities of DFESN. We conduct experiments on the PASCAL-5i dataset and demonstrate the superiority of our DFESN.