Sravan Kumar Jagadeesh, René Schuster, D. Stricker
{"title":"Multi-task Fusion for Efficient Panoptic-Part Segmentation","authors":"Sravan Kumar Jagadeesh, René Schuster, D. Stricker","doi":"10.48550/arXiv.2212.07671","DOIUrl":null,"url":null,"abstract":"In this paper, we introduce a novel network that generates semantic, instance, and part segmentation using a shared encoder and effectively fuses them to achieve panoptic-part segmentation. Unifying these three segmentation problems allows for mutually improved and consistent representation learning. To fuse the predictions of all three heads efficiently, we introduce a parameter-free joint fusion module that dynamically balances the logits and fuses them to create panoptic-part segmentation. Our method is evaluated on the Cityscapes Panoptic Parts (CPP) and Pascal Panoptic Parts (PPP) datasets. For CPP, the PartPQ of our proposed model with joint fusion surpasses the previous state-of-the-art by 1.6 and 4.7 percentage points for all areas and segments with parts, respectively. On PPP, our joint fusion outperforms a model using the previous top-down merging strategy by 3.3 percentage points in PartPQ and 10.5 percentage points in PartPQ for partitionable classes.","PeriodicalId":410036,"journal":{"name":"International Conference on Pattern Recognition Applications and Methods","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Pattern Recognition Applications and Methods","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2212.07671","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
In this paper, we introduce a novel network that generates semantic, instance, and part segmentation using a shared encoder and effectively fuses them to achieve panoptic-part segmentation. Unifying these three segmentation problems allows for mutually improved and consistent representation learning. To fuse the predictions of all three heads efficiently, we introduce a parameter-free joint fusion module that dynamically balances the logits and fuses them to create panoptic-part segmentation. Our method is evaluated on the Cityscapes Panoptic Parts (CPP) and Pascal Panoptic Parts (PPP) datasets. For CPP, the PartPQ of our proposed model with joint fusion surpasses the previous state-of-the-art by 1.6 and 4.7 percentage points for all areas and segments with parts, respectively. On PPP, our joint fusion outperforms a model using the previous top-down merging strategy by 3.3 percentage points in PartPQ and 10.5 percentage points in PartPQ for partitionable classes.
在本文中,我们介绍了一种新的网络,该网络使用共享编码器生成语义、实例和部分分割,并有效地融合它们以实现全景部分分割。统一这三个分割问题允许相互改进和一致的表示学习。为了有效地融合所有三个头部的预测,我们引入了一个无参数的关节融合模块,该模块动态平衡逻辑并融合它们以创建全景部分分割。我们的方法在cityscape Panoptic Parts (CPP)和Pascal Panoptic Parts (PPP)数据集上进行了评估。对于CPP,我们提出的具有关节融合的模型的PartPQ在所有有零件的区域和部分上分别超过了以前最先进的1.6和4.7个百分点。在PPP上,我们的联合融合在PartPQ上比使用以前的自顶向下合并策略的模型高出3.3个百分点,在可分区类上比PartPQ高出10.5个百分点。