DCNet: diffusion convolutional networks for semantic image segmentation
Authors: Lan Yang, Zhixiong Jiang, Hongbo Zhou, Jun Guo
Journal: Int. J. Embed. Syst.
Published: 2021-06-29
DOI: 10.1504/IJES.2021.116135 (https://doi.org/10.1504/IJES.2021.116135)
Citations: 0
Abstract
Semantic image segmentation, which assigns a class label to every pixel, plays an essential role in scene understanding. Most recent approaches use deep neural networks, especially convolutional neural networks (CNNs), to tackle this task. These CNN-based methods share two common problems: spatial features are lost during representation learning, and the capacity to capture contextual information over a large receptive field is limited. This paper proposes a diffusion convolutional network (DCNet) that combines a CNN with a graph convolutional neural network (GCNN) for semantic image segmentation. In the proposed model, diffusion convolution is formulated as a graph convolutional layer that aggregates structural and contextual information without losing spatial features. Segmentation results on the PASCAL VOC 2012 and Cityscapes datasets show better performance than baseline approaches and are competitive with state-of-the-art methods.
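The abstract does not give the exact formulation of the diffusion convolution, so the following is only an illustrative sketch, not the authors' DCNet layer. It assumes one common form of diffusion convolution, in which features are propagated over K random-walk hops and each hop has its own weight matrix, i.e. the output is the sum over k of P^k X W_k with P = D^-1 A. It also assumes that the graph is a 4-connected grid over the pixels of a CNN feature map. The names `grid_adjacency` and `diffusion_conv`, the tensor sizes, and the random weights are all hypothetical.

```python
import numpy as np


def grid_adjacency(height, width):
    """Adjacency matrix of a 4-connected pixel grid (assumed graph structure)."""
    n = height * width
    adj = np.zeros((n, n), dtype=np.float64)
    for r in range(height):
        for c in range(width):
            i = r * width + c
            if c + 1 < width:       # edge to the right neighbour
                adj[i, i + 1] = adj[i + 1, i] = 1.0
            if r + 1 < height:      # edge to the bottom neighbour
                adj[i, i + width] = adj[i + width, i] = 1.0
    return adj


def diffusion_conv(features, adj, weights):
    """K-hop diffusion convolution: sum_k P^k X W_k, with P = D^-1 A.

    features: (n_nodes, c_in) node features, one row per pixel
    adj:      (n_nodes, n_nodes) adjacency; every node needs at least one edge
    weights:  (K + 1, c_in, c_out), one weight matrix per hop (hop 0 = no diffusion)
    """
    degree = adj.sum(axis=1, keepdims=True)
    transition = adj / degree             # row-stochastic random-walk matrix P
    diffused = features                   # P^0 X
    out = diffused @ weights[0]
    for w_k in weights[1:]:
        diffused = transition @ diffused  # P^k X, reusing the previous hop
        out = out + diffused @ w_k
    return out


# Hypothetical sizes: a 4x5 feature map with 8 channels, 3 output classes, 2 hops.
rng = np.random.default_rng(0)
height, width, c_in, c_out, hops = 4, 5, 8, 3, 2
feature_map = rng.standard_normal((height, width, c_in))
weights = rng.standard_normal((hops + 1, c_in, c_out)) * 0.1

adj = grid_adjacency(height, width)
nodes = feature_map.reshape(height * width, c_in)
out_map = diffusion_conv(nodes, adj, weights).reshape(height, width, c_out)
```

In this sketch every pixel stays a node, so the output keeps the spatial resolution of the input feature map, while each additional hop widens the neighbourhood from which context is aggregated. In a trained model the per-hop weights would be learned rather than sampled at random.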