基于多相似度的小镜头分割超关系网络

IET Image Process. Pub Date : 2022-03-17 DOI:10.48550/arXiv.2203.09550

Xian Shi, Zhe Cui, Shaobing Zhang, Miao Cheng, L. He, Xianghong Tang

{"title":"基于多相似度的小镜头分割超关系网络","authors":"Xian Shi, Zhe Cui, Shaobing Zhang, Miao Cheng, L. He, Xianghong Tang","doi":"10.48550/arXiv.2203.09550","DOIUrl":null,"url":null,"abstract":"Few-shot semantic segmentation aims at recognizing the object regions of unseen categories with only a few annotated examples as supervision. The key to few-shot segmentation is to establish a robust semantic relationship between the support and query images and to prevent overfitting. In this paper, we propose an effective Multi-similarity Hyperrelation Network (MSHNet) to tackle the few-shot semantic segmentation problem. In MSHNet, we propose a new Generative Prototype Similarity (GPS), which together with cosine similarity can establish a strong semantic relation between the support and query images. The locally generated prototype similarity based on global feature is logically complementary to the global cosine similarity based on local feature, and the relationship between the query image and the supported image can be expressed more comprehensively by using the two similarities simultaneously. In addition, we propose a Symmetric Merging Block (SMB) in MSHNet to efficiently merge multi-layer, multi-shot and multi-similarity hyperrelational features. MSHNet is built on the basis of similarity rather than specific category features, which can achieve more general unity and effectively reduce overfitting. On two benchmark semantic segmentation datasets Pascal-5i and COCO-20i, MSHNet achieves new state-of-the-art performances on 1-shot and 5-shot semantic segmentation tasks.","PeriodicalId":13486,"journal":{"name":"IET Image Process.","volume":"9 1","pages":"204-214"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Multi-similarity based Hyperrelation Network for few-shot segmentation\",\"authors\":\"Xian Shi, Zhe Cui, Shaobing Zhang, Miao Cheng, L. He, Xianghong Tang\",\"doi\":\"10.48550/arXiv.2203.09550\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Few-shot semantic segmentation aims at recognizing the object regions of unseen categories with only a few annotated examples as supervision. The key to few-shot segmentation is to establish a robust semantic relationship between the support and query images and to prevent overfitting. In this paper, we propose an effective Multi-similarity Hyperrelation Network (MSHNet) to tackle the few-shot semantic segmentation problem. In MSHNet, we propose a new Generative Prototype Similarity (GPS), which together with cosine similarity can establish a strong semantic relation between the support and query images. The locally generated prototype similarity based on global feature is logically complementary to the global cosine similarity based on local feature, and the relationship between the query image and the supported image can be expressed more comprehensively by using the two similarities simultaneously. In addition, we propose a Symmetric Merging Block (SMB) in MSHNet to efficiently merge multi-layer, multi-shot and multi-similarity hyperrelational features. MSHNet is built on the basis of similarity rather than specific category features, which can achieve more general unity and effectively reduce overfitting. On two benchmark semantic segmentation datasets Pascal-5i and COCO-20i, MSHNet achieves new state-of-the-art performances on 1-shot and 5-shot semantic segmentation tasks.\",\"PeriodicalId\":13486,\"journal\":{\"name\":\"IET Image Process.\",\"volume\":\"9 1\",\"pages\":\"204-214\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-03-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IET Image Process.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.48550/arXiv.2203.09550\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IET Image Process.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2203.09550","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

少量语义分割旨在识别未见类别的对象区域，仅使用少量注释示例作为监督。少镜头分割的关键是在支持图像和查询图像之间建立鲁棒的语义关系，防止过拟合。本文提出了一种有效的多相似度超关系网络(MSHNet)来解决少镜头语义分割问题。在MSHNet中，我们提出了一种新的生成原型相似度(GPS)，它与余弦相似度一起可以在支持图像和查询图像之间建立强大的语义关系。基于全局特征的局部生成的原型相似度与基于局部特征的全局余弦相似度在逻辑上是互补的，同时使用这两种相似度可以更全面地表达查询图像与支持图像之间的关系。此外，我们在MSHNet中提出了一种对称合并块(SMB)来有效地合并多层、多镜头和多相似的超关系特征。MSHNet是建立在相似度的基础上，而不是基于特定的类别特征，可以达到更一般的统一性，有效地减少过拟合。在Pascal-5i和COCO-20i两个基准语义分割数据集上，MSHNet在1次和5次语义分割任务上取得了新的最先进的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Multi-similarity based Hyperrelation Network for few-shot segmentation

Few-shot semantic segmentation aims at recognizing the object regions of unseen categories with only a few annotated examples as supervision. The key to few-shot segmentation is to establish a robust semantic relationship between the support and query images and to prevent overfitting. In this paper, we propose an effective Multi-similarity Hyperrelation Network (MSHNet) to tackle the few-shot semantic segmentation problem. In MSHNet, we propose a new Generative Prototype Similarity (GPS), which together with cosine similarity can establish a strong semantic relation between the support and query images. The locally generated prototype similarity based on global feature is logically complementary to the global cosine similarity based on local feature, and the relationship between the query image and the supported image can be expressed more comprehensively by using the two similarities simultaneously. In addition, we propose a Symmetric Merging Block (SMB) in MSHNet to efficiently merge multi-layer, multi-shot and multi-similarity hyperrelational features. MSHNet is built on the basis of similarity rather than specific category features, which can achieve more general unity and effectively reduce overfitting. On two benchmark semantic segmentation datasets Pascal-5i and COCO-20i, MSHNet achieves new state-of-the-art performances on 1-shot and 5-shot semantic segmentation tasks.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助