The Analysis and Optimization of CNN Hyperparameters with Fuzzy Tree Model for Image Classification

IF 1.2 4区计算机科学 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Turkish Journal of Electrical Engineering and Computer Sciences Pub Date : 2021-01-01 DOI:10.3906/elk-2107-130

K. Uyar, Sakir Tasdemir, Ilker Ali Özkan

{"title":"The Analysis and Optimization of CNN Hyperparameters with Fuzzy Tree Model for Image Classification","authors":"K. Uyar, Sakir Tasdemir, Ilker Ali Özkan","doi":"10.3906/elk-2107-130","DOIUrl":null,"url":null,"abstract":"The meaningful performance of convolutional neural network (CNN) has enabled the solution of various state-of-the-art problems. Although CNNs achieve satisfactory results in computer-vision problems, they still have some difficulties. As the designed CNN models are deepened to achieve much better accuracy, computational cost and complexity increase. It is significant to train CNNs with suitable topology and training hyperparameters that include initial learning rate, minibatch size, epoch number, filter size, number of filters, etc. because the initialization of hyperparameters affects classification results. On the other hand, it is not possible to make a definite inference for the hyperparameter initialization and there is uncertainty. This study is carried out to model uncertainty using fuzzy inference system (FIS). The designed fuzzy model provides estimation of classification result depending on CNN topology and training hyperparameters. GoogleNet and Inceptionv3 that contain inception-modules, ShuffleNet that contains shuffleblocks, DenseNet201 that contains dense-blocks, EfficientNet, ResNet18, ResNet50, ResNet101, and MobileNetv2 that contain residual-blocks, and InceptionResNetv2 that includes both inception-modules and residual-blocks were evaluated as CNN models. Test sample dataset was obtained by training CNN models with various training hyperparameter combinations. CNN models were trained on Animal Diagnostics Lab (ADL) which is a histopathological dataset includes healthy and inflamed kidney, lung, and spleen images. A new FIS tree model that is more computationally efficient and easier to understand than a single FIS was designed and classification accuracy prediction of CNN models depending on hyperparameter combinations was performed. The best, the worst, and the average classification accuracies obtained with CNN models that use best training hyperparameter set are 97.70%, 93.60%, and 96.30%, respectively. Moreover, Cifar10 and Cifar100 benchmark datasets were experimented to reveal true capability and limitations of the proposed approach. Experimental results indicate that the designed FIS tree model provides a successful hyperparameter evaluation mechanism with an average RMSE value of 1.2652.","PeriodicalId":49410,"journal":{"name":"Turkish Journal of Electrical Engineering and Computer Sciences","volume":"62 1","pages":""},"PeriodicalIF":1.2000,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Turkish Journal of Electrical Engineering and Computer Sciences","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.3906/elk-2107-130","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 1

Abstract

The meaningful performance of convolutional neural network (CNN) has enabled the solution of various state-of-the-art problems. Although CNNs achieve satisfactory results in computer-vision problems, they still have some difficulties. As the designed CNN models are deepened to achieve much better accuracy, computational cost and complexity increase. It is significant to train CNNs with suitable topology and training hyperparameters that include initial learning rate, minibatch size, epoch number, filter size, number of filters, etc. because the initialization of hyperparameters affects classification results. On the other hand, it is not possible to make a definite inference for the hyperparameter initialization and there is uncertainty. This study is carried out to model uncertainty using fuzzy inference system (FIS). The designed fuzzy model provides estimation of classification result depending on CNN topology and training hyperparameters. GoogleNet and Inceptionv3 that contain inception-modules, ShuffleNet that contains shuffleblocks, DenseNet201 that contains dense-blocks, EfficientNet, ResNet18, ResNet50, ResNet101, and MobileNetv2 that contain residual-blocks, and InceptionResNetv2 that includes both inception-modules and residual-blocks were evaluated as CNN models. Test sample dataset was obtained by training CNN models with various training hyperparameter combinations. CNN models were trained on Animal Diagnostics Lab (ADL) which is a histopathological dataset includes healthy and inflamed kidney, lung, and spleen images. A new FIS tree model that is more computationally efficient and easier to understand than a single FIS was designed and classification accuracy prediction of CNN models depending on hyperparameter combinations was performed. The best, the worst, and the average classification accuracies obtained with CNN models that use best training hyperparameter set are 97.70%, 93.60%, and 96.30%, respectively. Moreover, Cifar10 and Cifar100 benchmark datasets were experimented to reveal true capability and limitations of the proposed approach. Experimental results indicate that the designed FIS tree model provides a successful hyperparameter evaluation mechanism with an average RMSE value of 1.2652.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于模糊树模型的CNN超参数图像分类分析与优化

卷积神经网络(CNN)有意义的性能使各种尖端问题的解决成为可能。尽管cnn在计算机视觉问题上取得了令人满意的结果，但仍然存在一些困难。随着所设计的CNN模型不断深化以达到更高的精度，计算成本和复杂度也随之增加。由于超参数的初始化会影响分类结果，所以用合适的拓扑和训练超参数(包括初始学习率、小批量大小、epoch数、滤波器大小、滤波器数量等)训练cnn是非常重要的。另一方面，对于超参数初始化不能做出明确的推断，存在不确定性。本研究采用模糊推理系统(FIS)对不确定性进行建模。设计的模糊模型根据CNN拓扑和训练超参数对分类结果进行估计。包含inception-modules的GoogleNet和Inceptionv3，包含shuffleblocks的ShuffleNet，包含dense-blocks的DenseNet201，包含残块的EfficientNet、ResNet18、ResNet50、ResNet101和MobileNetv2，以及同时包含inception-modules和残块的InceptionResNetv2被评估为CNN模型。测试样本数据集是通过训练不同训练超参数组合的CNN模型得到的。CNN模型在动物诊断实验室(ADL)上进行训练，该实验室是一个组织病理学数据集，包括健康和发炎的肾脏、肺和脾脏图像。设计了一种比单个FIS更高效、更易于理解的新的FIS树模型，并进行了基于超参数组合的CNN模型分类精度预测。使用最佳训练超参数集的CNN模型得到的最佳分类准确率为97.70%，最差分类准确率为93.60%，平均分类准确率为96.30%。此外，对Cifar10和Cifar100基准数据集进行了实验，以揭示所提出方法的真实能力和局限性。实验结果表明，所设计的FIS树模型提供了一种成功的超参数评价机制，平均RMSE值为1.2652。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Turkish Journal of Electrical Engineering and Computer Sciences COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-ENGINEERING, ELECTRICAL & ELECTRONIC

CiteScore

2.90

自引率

9.10%

发文量

审稿时长

6.9 months

期刊介绍： The Turkish Journal of Electrical Engineering & Computer Sciences is published electronically 6 times a year by the Scientific and Technological Research Council of Turkey (TÜBİTAK) Accepts English-language manuscripts in the areas of power and energy, environmental sustainability and energy efficiency, electronics, industry applications, control systems, information and systems, applied electromagnetics, communications, signal and image processing, tomographic image reconstruction, face recognition, biometrics, speech processing, video processing and analysis, object recognition, classification, feature extraction, parallel and distributed computing, cognitive systems, interaction, robotics, digital libraries and content, personalized healthcare, ICT for mobility, sensors, and artificial intelligence. Contribution is open to researchers of all nationalities.