A Novel Structure of Convolutional Layers with a Higher Performance-Complexity Ratio for Semantic Segmentation

2018 15th International Conference on Control, Automation, Robotics and Vision (ICARCV) Pub Date : 2018-11-01 DOI:10.1109/ICARCV.2018.8580632

Yalong Jiang, Z. Chi

{"title":"A Novel Structure of Convolutional Layers with a Higher Performance-Complexity Ratio for Semantic Segmentation","authors":"Yalong Jiang, Z. Chi","doi":"10.1109/ICARCV.2018.8580632","DOIUrl":null,"url":null,"abstract":"In this paper, we study an important factor that determines the capacity of a CNN model and propose a novel structure of convolutional layers with a higher performance-complexity ratio. Firstly, the relationship of the model capacity and the number of parameters versus segmentation performance is explored. Secondly, a mechanism is proposed to optimize the structure of a CNN model for a specific task. The mechanism also provides better convergence than current state-of-the-art methods for factorizing convolutional layers, such as MobileNet. Thirdly, we propose a measure based on the mutual information between hidden activations and inputs/outputs to compute the capacity of a CNN model. This measure is highly correlated with segmentation performance. Experimental results on the segmentation of the PASCAL Person Parts Dataset show that the linear dependency among convolutional kernels is an important factor determining the capacity of a CNN model. It is also demonstrated that our approach can successfully adjust the model capacity to best match to the complexity of a dataset. The optimized CNN model achieves the similar performance to Deeplab-V2 on the segmentation task with 100 × less parameters, resulting in a significantly improved performance-complexity ratio.","PeriodicalId":395380,"journal":{"name":"2018 15th International Conference on Control, Automation, Robotics and Vision (ICARCV)","volume":"42 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 15th International Conference on Control, Automation, Robotics and Vision (ICARCV)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICARCV.2018.8580632","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

In this paper, we study an important factor that determines the capacity of a CNN model and propose a novel structure of convolutional layers with a higher performance-complexity ratio. Firstly, the relationship of the model capacity and the number of parameters versus segmentation performance is explored. Secondly, a mechanism is proposed to optimize the structure of a CNN model for a specific task. The mechanism also provides better convergence than current state-of-the-art methods for factorizing convolutional layers, such as MobileNet. Thirdly, we propose a measure based on the mutual information between hidden activations and inputs/outputs to compute the capacity of a CNN model. This measure is highly correlated with segmentation performance. Experimental results on the segmentation of the PASCAL Person Parts Dataset show that the linear dependency among convolutional kernels is an important factor determining the capacity of a CNN model. It is also demonstrated that our approach can successfully adjust the model capacity to best match to the complexity of a dataset. The optimized CNN model achieves the similar performance to Deeplab-V2 on the segmentation task with 100 × less parameters, resulting in a significantly improved performance-complexity ratio.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

一种具有更高性能复杂度比的新型卷积层结构用于语义分割

本文研究了决定CNN模型容量的一个重要因素，提出了一种具有更高性能复杂度比的新颖卷积层结构。首先，探讨了模型容量、参数个数与分割性能的关系。其次，提出了一种针对特定任务优化CNN模型结构的机制。该机制还提供了比当前最先进的卷积层分解方法(如MobileNet)更好的收敛性。第三，我们提出了一种基于隐藏激活和输入/输出之间互信息的度量来计算CNN模型的容量。该度量与分割性能高度相关。对PASCAL人体部位数据集的分割实验结果表明，卷积核之间的线性相关性是决定CNN模型容量的重要因素。实验还表明，我们的方法可以成功地调整模型容量，以最佳地匹配数据集的复杂性。优化后的CNN模型在参数减少100倍的分割任务上达到了与Deeplab-V2相似的性能，性能复杂度比显著提高。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2018 15th International Conference on Control, Automation, Robotics and Vision (ICARCV)

自引率

0.00%

发文量