具有专家分支的测试代价敏感卷积神经网络

Signal and image processing : an international journal Pub Date : 2019-10-31 DOI:10.5121/sipij.2019.10502

Mahdi Naghibi, R. Anvari, A. Forghani, B. Minaei

{"title":"具有专家分支的测试代价敏感卷积神经网络","authors":"Mahdi Naghibi, R. Anvari, A. Forghani, B. Minaei","doi":"10.5121/sipij.2019.10502","DOIUrl":null,"url":null,"abstract":"It has been proven that deeper convolutional neural networks (CNN) can result in better accuracy in many problems, but this accuracy comes with a high computational cost. Also, input instances have not the same difficulty. As a solution for accuracy vs. computational cost dilemma, we introduce a new test-cost-sensitive method for convolutional neural networks. This method trains a CNN with a set of auxiliary outputs and expert branches in some middle layers of the network. The expert branches decide to use a shallower part of the network or going deeper to the end, based on the difficulty of input instance. The expert branches learn to determine: is the current network prediction is wrong and if the given instance passed to deeper layers of the network it will generate right output; If not, then the expert branches stop the computation process. The experimental results on standard dataset CIFAR-10 show that the proposed method can train models with lower test-cost and competitive accuracy in comparison with basic models.","PeriodicalId":90726,"journal":{"name":"Signal and image processing : an international journal","volume":"72 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2019-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Test-cost-sensitive Convolutional Neural Networks with Expert Branches\",\"authors\":\"Mahdi Naghibi, R. Anvari, A. Forghani, B. Minaei\",\"doi\":\"10.5121/sipij.2019.10502\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"It has been proven that deeper convolutional neural networks (CNN) can result in better accuracy in many problems, but this accuracy comes with a high computational cost. Also, input instances have not the same difficulty. As a solution for accuracy vs. computational cost dilemma, we introduce a new test-cost-sensitive method for convolutional neural networks. This method trains a CNN with a set of auxiliary outputs and expert branches in some middle layers of the network. The expert branches decide to use a shallower part of the network or going deeper to the end, based on the difficulty of input instance. The expert branches learn to determine: is the current network prediction is wrong and if the given instance passed to deeper layers of the network it will generate right output; If not, then the expert branches stop the computation process. The experimental results on standard dataset CIFAR-10 show that the proposed method can train models with lower test-cost and competitive accuracy in comparison with basic models.\",\"PeriodicalId\":90726,\"journal\":{\"name\":\"Signal and image processing : an international journal\",\"volume\":\"72 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-10-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Signal and image processing : an international journal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.5121/sipij.2019.10502\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Signal and image processing : an international journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5121/sipij.2019.10502","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

已经证明，深度卷积神经网络(CNN)可以在许多问题上产生更好的准确性，但这种准确性伴随着较高的计算成本。此外，输入实例也没有相同的难度。为了解决准确率与计算成本的矛盾，我们提出了一种新的卷积神经网络测试成本敏感方法。该方法训练一个具有一组辅助输出和一些网络中间层专家分支的CNN。专家分支根据输入实例的难度决定使用网络的较浅部分或深入到最后。专家分支学会判断:当前的网络预测是错误的，如果给定的实例传递给网络的更深层，它将产生正确的输出;如果不是，那么专家分支停止计算过程。在标准数据集CIFAR-10上的实验结果表明，与基本模型相比，该方法能够以更低的测试成本和具有竞争力的准确率训练模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Test-cost-sensitive Convolutional Neural Networks with Expert Branches

It has been proven that deeper convolutional neural networks (CNN) can result in better accuracy in many problems, but this accuracy comes with a high computational cost. Also, input instances have not the same difficulty. As a solution for accuracy vs. computational cost dilemma, we introduce a new test-cost-sensitive method for convolutional neural networks. This method trains a CNN with a set of auxiliary outputs and expert branches in some middle layers of the network. The expert branches decide to use a shallower part of the network or going deeper to the end, based on the difficulty of input instance. The expert branches learn to determine: is the current network prediction is wrong and if the given instance passed to deeper layers of the network it will generate right output; If not, then the expert branches stop the computation process. The experimental results on standard dataset CIFAR-10 show that the proposed method can train models with lower test-cost and competitive accuracy in comparison with basic models.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Signal and image processing : an international journal

自引率

0.00%

发文量