{"title":"Semantic segmentation with step-by-step upsampling of the fusion context","authors":"Yanzhao Lu, Huiyi Liu","doi":"10.1109/ICAICA52286.2021.9497923","DOIUrl":null,"url":null,"abstract":"The existing semantic segmentation network deelabv3+ has the problem of weak segmentation ability to small-scale target objects and rough edge segmentation. The method of parallel connection of multiple resolution subnets in HRNet network is introduced. After deeplabv3+ down sampling, the network layers of different sizes were fused with features, and the decode side was fused with up sampling step by step to improve the edge segmentation accuracy. Attention mechanism is added before feature fusion to improve the recognition ability of small object. At the end, the edge is refined again by using CRF random vector field. The test is carried out on Pascal VOC 2012, the experimental results show that: the segmentation is more detailed from the image edge details, the recognition of small objects is more accurate, the Pixel Accuracy (PA) and Mean Intersection over Union (MIOU) are improved compared with deeplabv3+.","PeriodicalId":121979,"journal":{"name":"2021 IEEE International Conference on Artificial Intelligence and Computer Applications (ICAICA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Artificial Intelligence and Computer Applications (ICAICA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAICA52286.2021.9497923","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
The existing semantic segmentation network deelabv3+ has the problem of weak segmentation ability to small-scale target objects and rough edge segmentation. The method of parallel connection of multiple resolution subnets in HRNet network is introduced. After deeplabv3+ down sampling, the network layers of different sizes were fused with features, and the decode side was fused with up sampling step by step to improve the edge segmentation accuracy. Attention mechanism is added before feature fusion to improve the recognition ability of small object. At the end, the edge is refined again by using CRF random vector field. The test is carried out on Pascal VOC 2012, the experimental results show that: the segmentation is more detailed from the image edge details, the recognition of small objects is more accurate, the Pixel Accuracy (PA) and Mean Intersection over Union (MIOU) are improved compared with deeplabv3+.