A Novel Structure of Convolutional Layers with a Higher Performance-Complexity Ratio for Semantic Segmentation

Yalong Jiang, Z. Chi
Published in: 2018 15th International Conference on Control, Automation, Robotics and Vision (ICARCV), 2018-11-01
DOI: 10.1109/ICARCV.2018.8580632 (https://doi.org/10.1109/ICARCV.2018.8580632)

Abstract

In this paper, we study an important factor that determines the capacity of a CNN model and propose a novel structure of convolutional layers with a higher performance-complexity ratio. First, we explore how model capacity and the number of parameters relate to segmentation performance. Second, we propose a mechanism for optimizing the structure of a CNN model for a specific task. The mechanism also converges better than current state-of-the-art methods for factorizing convolutional layers, such as MobileNet. Third, we propose a measure of the capacity of a CNN model based on the mutual information between hidden activations and inputs/outputs. This measure is highly correlated with segmentation performance. Experimental results on the segmentation of the PASCAL Person Parts Dataset show that the linear dependency among convolutional kernels is an important factor determining the capacity of a CNN model. The results also demonstrate that our approach can adjust the model capacity to best match the complexity of a dataset. The optimized CNN model achieves performance similar to Deeplab-V2 on the segmentation task with 100× fewer parameters, resulting in a significantly improved performance-complexity ratio.
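The abstract does not spell out the proposed layer structure or the mutual-information measure, so the following is only a minimal sketch of two ideas it names, not the authors' method. It counts the weights of a standard convolution against a MobileNet-style depthwise separable factorization, and it uses the numerical rank of the flattened kernels as one possible way to expose linear dependency among convolutional kernels. The layer sizes, the function names, and the choice of rank as the dependency check are all assumptions made for illustration.

```python
import numpy as np


def standard_conv_params(c_in: int, c_out: int, k: int) -> int:
    """Weights in a standard k x k convolution (bias omitted)."""
    return c_in * c_out * k * k


def separable_conv_params(c_in: int, c_out: int, k: int) -> int:
    """Weights in a depthwise k x k convolution plus a 1 x 1 pointwise convolution."""
    return c_in * k * k + c_in * c_out


def kernel_rank(weights: np.ndarray) -> int:
    """Numerical rank after flattening each output kernel into one row."""
    c_out = weights.shape[0]
    return int(np.linalg.matrix_rank(weights.reshape(c_out, -1)))


rng = np.random.default_rng(0)

# Hypothetical layer: 64 input channels, 128 output channels, 3 x 3 kernels.
c_in, c_out, k = 64, 128, 3
print(standard_conv_params(c_in, c_out, k))   # 73728
print(separable_conv_params(c_in, c_out, k))  # 8768

# 128 kernels that are all linear combinations of only 16 basis kernels.
basis = rng.standard_normal((16, c_in, k, k))
mixing = rng.standard_normal((c_out, 16))
dependent = np.tensordot(mixing, basis, axes=1)  # shape (128, 64, 3, 3)

# 128 kernels drawn independently, for comparison.
independent = rng.standard_normal((c_out, c_in, k, k))

print(kernel_rank(dependent))
print(kernel_rank(independent))
```

In this toy setting the dependent layer stores as many weights as the independent one but spans a much smaller space of filters, which is the intuition behind treating linear dependency, rather than raw parameter count, as a factor in model capacity.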