The Analysis and Optimization of CNN Hyperparameters with Fuzzy Tree Model for Image Classification

IF 1.2 4区 计算机科学 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Turkish Journal of Electrical Engineering and Computer Sciences Pub Date : 2021-01-01 DOI:10.3906/elk-2107-130
K. Uyar, Sakir Tasdemir, Ilker Ali Özkan
{"title":"The Analysis and Optimization of CNN Hyperparameters with Fuzzy Tree Model for Image Classification","authors":"K. Uyar, Sakir Tasdemir, Ilker Ali Özkan","doi":"10.3906/elk-2107-130","DOIUrl":null,"url":null,"abstract":"The meaningful performance of convolutional neural network (CNN) has enabled the solution of various state-of-the-art problems. Although CNNs achieve satisfactory results in computer-vision problems, they still have some difficulties. As the designed CNN models are deepened to achieve much better accuracy, computational cost and complexity increase. It is significant to train CNNs with suitable topology and training hyperparameters that include initial learning rate, minibatch size, epoch number, filter size, number of filters, etc. because the initialization of hyperparameters affects classification results. On the other hand, it is not possible to make a definite inference for the hyperparameter initialization and there is uncertainty. This study is carried out to model uncertainty using fuzzy inference system (FIS). The designed fuzzy model provides estimation of classification result depending on CNN topology and training hyperparameters. GoogleNet and Inceptionv3 that contain inception-modules, ShuffleNet that contains shuffleblocks, DenseNet201 that contains dense-blocks, EfficientNet, ResNet18, ResNet50, ResNet101, and MobileNetv2 that contain residual-blocks, and InceptionResNetv2 that includes both inception-modules and residual-blocks were evaluated as CNN models. Test sample dataset was obtained by training CNN models with various training hyperparameter combinations. CNN models were trained on Animal Diagnostics Lab (ADL) which is a histopathological dataset includes healthy and inflamed kidney, lung, and spleen images. A new FIS tree model that is more computationally efficient and easier to understand than a single FIS was designed and classification accuracy prediction of CNN models depending on hyperparameter combinations was performed. The best, the worst, and the average classification accuracies obtained with CNN models that use best training hyperparameter set are 97.70%, 93.60%, and 96.30%, respectively. Moreover, Cifar10 and Cifar100 benchmark datasets were experimented to reveal true capability and limitations of the proposed approach. Experimental results indicate that the designed FIS tree model provides a successful hyperparameter evaluation mechanism with an average RMSE value of 1.2652.","PeriodicalId":49410,"journal":{"name":"Turkish Journal of Electrical Engineering and Computer Sciences","volume":"62 1","pages":""},"PeriodicalIF":1.2000,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Turkish Journal of Electrical Engineering and Computer Sciences","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.3906/elk-2107-130","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 1

Abstract

The meaningful performance of convolutional neural network (CNN) has enabled the solution of various state-of-the-art problems. Although CNNs achieve satisfactory results in computer-vision problems, they still have some difficulties. As the designed CNN models are deepened to achieve much better accuracy, computational cost and complexity increase. It is significant to train CNNs with suitable topology and training hyperparameters that include initial learning rate, minibatch size, epoch number, filter size, number of filters, etc. because the initialization of hyperparameters affects classification results. On the other hand, it is not possible to make a definite inference for the hyperparameter initialization and there is uncertainty. This study is carried out to model uncertainty using fuzzy inference system (FIS). The designed fuzzy model provides estimation of classification result depending on CNN topology and training hyperparameters. GoogleNet and Inceptionv3 that contain inception-modules, ShuffleNet that contains shuffleblocks, DenseNet201 that contains dense-blocks, EfficientNet, ResNet18, ResNet50, ResNet101, and MobileNetv2 that contain residual-blocks, and InceptionResNetv2 that includes both inception-modules and residual-blocks were evaluated as CNN models. Test sample dataset was obtained by training CNN models with various training hyperparameter combinations. CNN models were trained on Animal Diagnostics Lab (ADL) which is a histopathological dataset includes healthy and inflamed kidney, lung, and spleen images. A new FIS tree model that is more computationally efficient and easier to understand than a single FIS was designed and classification accuracy prediction of CNN models depending on hyperparameter combinations was performed. The best, the worst, and the average classification accuracies obtained with CNN models that use best training hyperparameter set are 97.70%, 93.60%, and 96.30%, respectively. Moreover, Cifar10 and Cifar100 benchmark datasets were experimented to reveal true capability and limitations of the proposed approach. Experimental results indicate that the designed FIS tree model provides a successful hyperparameter evaluation mechanism with an average RMSE value of 1.2652.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于模糊树模型的CNN超参数图像分类分析与优化
卷积神经网络(CNN)有意义的性能使各种尖端问题的解决成为可能。尽管cnn在计算机视觉问题上取得了令人满意的结果,但仍然存在一些困难。随着所设计的CNN模型不断深化以达到更高的精度,计算成本和复杂度也随之增加。由于超参数的初始化会影响分类结果,所以用合适的拓扑和训练超参数(包括初始学习率、小批量大小、epoch数、滤波器大小、滤波器数量等)训练cnn是非常重要的。另一方面,对于超参数初始化不能做出明确的推断,存在不确定性。本研究采用模糊推理系统(FIS)对不确定性进行建模。设计的模糊模型根据CNN拓扑和训练超参数对分类结果进行估计。包含inception-modules的GoogleNet和Inceptionv3,包含shuffleblocks的ShuffleNet,包含dense-blocks的DenseNet201,包含残块的EfficientNet、ResNet18、ResNet50、ResNet101和MobileNetv2,以及同时包含inception-modules和残块的InceptionResNetv2被评估为CNN模型。测试样本数据集是通过训练不同训练超参数组合的CNN模型得到的。CNN模型在动物诊断实验室(ADL)上进行训练,该实验室是一个组织病理学数据集,包括健康和发炎的肾脏、肺和脾脏图像。设计了一种比单个FIS更高效、更易于理解的新的FIS树模型,并进行了基于超参数组合的CNN模型分类精度预测。使用最佳训练超参数集的CNN模型得到的最佳分类准确率为97.70%,最差分类准确率为93.60%,平均分类准确率为96.30%。此外,对Cifar10和Cifar100基准数据集进行了实验,以揭示所提出方法的真实能力和局限性。实验结果表明,所设计的FIS树模型提供了一种成功的超参数评价机制,平均RMSE值为1.2652。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Turkish Journal of Electrical Engineering and Computer Sciences
Turkish Journal of Electrical Engineering and Computer Sciences COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-ENGINEERING, ELECTRICAL & ELECTRONIC
CiteScore
2.90
自引率
9.10%
发文量
95
审稿时长
6.9 months
期刊介绍: The Turkish Journal of Electrical Engineering & Computer Sciences is published electronically 6 times a year by the Scientific and Technological Research Council of Turkey (TÜBİTAK) Accepts English-language manuscripts in the areas of power and energy, environmental sustainability and energy efficiency, electronics, industry applications, control systems, information and systems, applied electromagnetics, communications, signal and image processing, tomographic image reconstruction, face recognition, biometrics, speech processing, video processing and analysis, object recognition, classification, feature extraction, parallel and distributed computing, cognitive systems, interaction, robotics, digital libraries and content, personalized healthcare, ICT for mobility, sensors, and artificial intelligence. Contribution is open to researchers of all nationalities.
期刊最新文献
A comparative study of YOLO models and a transformer-based YOLOv5 model for mass detection in mammograms Feature selection optimization with filtering and wrapper methods: two disease classification cases New modified carrier-based level-shifted PWM control for NPC rectifiers considered for implementation in EV fast chargers FuzzyCSampling: A Hybrid fuzzy c-means clustering sampling strategy for imbalanced datasets A practical low-dimensional feature vector generation method based on wavelet transform for psychophysiological signals
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1