Fusion based Heterogeneous Convolutional Neural Networks Architecture
David Kornish, Soundararajan Ezekiel, Maria Scalzo-Cornacchia
2018 IEEE Applied Imagery Pattern Recognition Workshop (AIPR), October 2018
DOI: 10.1109/AIPR.2018.8707371
Citations: 4
Abstract
In recent years, Deep Convolutional Neural Networks (DCNNs) have attracted considerable attention and won many competitions in machine learning, object detection, image classification, and pattern recognition. Breakthroughs in graphics processing units (GPUs) have made it possible to train DCNNs quickly for state-of-the-art tasks such as image classification and speech recognition. However, to solve complex problems, these multilayered convolutional neural networks have become increasingly large, complex, and abstract. We propose methods to improve the performance of neural networks while reducing their dimensionality, enabling a better understanding of the learning process. To leverage the extensive training, as well as the strengths, of several pretrained models, we explore new approaches for combining features from the fully connected layers of models with heterogeneous architectures. The proposed approach concatenates features extracted from the penultimate fully connected layers of three different DCNNs and applies principal component analysis (PCA) or linear discriminant analysis (LDA) to the merged feature vector. Our goal is to reduce the dimensionality of the feature vector and find the smallest dimension that maintains classifier performance; for this task we use a linear Support Vector Machine (SVM) as the classifier. We also investigate whether it is advantageous to fuse only the penultimate fully connected layers, or to fuse other fully connected layers, using multiple homogeneous or heterogeneous networks. The results show that the fusion method outperformed the individual networks in both accuracy and computational time across all of our trial sizes. Overall, our fusion methods are faster and more accurate than individual networks in both training and testing. Finally, we compared heterogeneous with homogeneous fusion methods, and the results show that heterogeneous methods outperform homogeneous methods.
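To make the described pipeline concrete, below is a minimal sketch of the fusion scheme: penultimate-layer features from three heterogeneous DCNNs are concatenated, reduced with PCA (or LDA), and classified with a linear SVM. The paper does not publish code, so everything here is an assumption for illustration: the random arrays stand in for real DCNN features, the feature dimensions (4096, 4096, 1024) and the choice of 128 principal components are illustrative, and scikit-learn is used for PCA/LDA/SVM.

```python
# Hypothetical sketch of the fusion approach described in the abstract.
# The random arrays below are placeholders for features extracted from
# the penultimate fully connected layers of three pretrained DCNNs.

import numpy as np
from sklearn.decomposition import PCA
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import LinearSVC

rng = np.random.default_rng(0)
n_samples, n_classes = 600, 10
labels = rng.integers(0, n_classes, size=n_samples)

# Placeholder penultimate-layer features from three heterogeneous DCNNs
# (dimensions are assumptions, e.g. VGG-like, AlexNet-like, GoogLeNet-like).
feats_a = rng.normal(size=(n_samples, 4096))
feats_b = rng.normal(size=(n_samples, 4096))
feats_c = rng.normal(size=(n_samples, 1024))

# Fusion step: concatenate the three feature vectors per sample.
fused = np.hstack([feats_a, feats_b, feats_c])

X_train, X_test, y_train, y_test = train_test_split(
    fused, labels, test_size=0.25, random_state=0)

# Dimensionality reduction followed by a linear SVM classifier. Replace
# PCA(...) with LinearDiscriminantAnalysis(n_components=n_classes - 1)
# to try the LDA variant mentioned in the abstract.
clf = make_pipeline(StandardScaler(), PCA(n_components=128), LinearSVC())
clf.fit(X_train, y_train)
print("test accuracy:", clf.score(X_test, y_test))
```

The search for "the smallest feature vector dimension that can maintain the classifier performance" would then amount to sweeping `n_components` over a range of values and keeping the smallest one whose test accuracy stays within tolerance of the full fused vector.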