Distributed Deep Learning Approach for Optimal Hyper-Parameter Values

Ziya Tan, M. Karakose
{"title":"Distributed Deep Learning Approach for Optimal Hyper-Parameter Values","authors":"Ziya Tan, M. Karakose","doi":"10.1109/IT54280.2022.9743528","DOIUrl":null,"url":null,"abstract":"With the development of artificial intelligence, there are great changes especially in technology and industry sectors. The fact that deep learning and reinforcement learning studies are popular topics by researchers accelerates this change. In this article, a distributed system is presented to determine the hyper-parameters of the deep learning algorithm used for object detection at the most accurate value. One of the most important factors affecting the accuracy rate in object recognition approaches using deep learning algorithms is the determination of hyper-parameters with correct values. It may be necessary to carry out very long experiments to determine the optimum of these parameters. To solve this problem, a deep learning network used for object detection has been trained by combining the RAY distributed architecture with a deep learning algorithm. The accuracy rate is observed by changing the parameters in each iteration. For object detection, the training of the neural network we created with the CIFAR-10 dataset was carried out using CPU. In addition, thanks to the distributed architecture, each process is trained by 4 different workers. The training results and the properties of the artificial neural network are given in detail in the following sections. Accordingly, we can highlight the main contributions of this article in three points. Firstly; to show that long processes are completed in a short time, thanks to the integration of deep learning algorithms with the distributed system; training the model used to determine the optimal hyper-parameter values and the third is the presentation of the distributed deep learning approach.","PeriodicalId":335678,"journal":{"name":"2022 26th International Conference on Information Technology (IT)","volume":"118 6","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 26th International Conference on Information Technology (IT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IT54280.2022.9743528","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

With the development of artificial intelligence, there are great changes especially in technology and industry sectors. The fact that deep learning and reinforcement learning studies are popular topics by researchers accelerates this change. In this article, a distributed system is presented to determine the hyper-parameters of the deep learning algorithm used for object detection at the most accurate value. One of the most important factors affecting the accuracy rate in object recognition approaches using deep learning algorithms is the determination of hyper-parameters with correct values. It may be necessary to carry out very long experiments to determine the optimum of these parameters. To solve this problem, a deep learning network used for object detection has been trained by combining the RAY distributed architecture with a deep learning algorithm. The accuracy rate is observed by changing the parameters in each iteration. For object detection, the training of the neural network we created with the CIFAR-10 dataset was carried out using CPU. In addition, thanks to the distributed architecture, each process is trained by 4 different workers. The training results and the properties of the artificial neural network are given in detail in the following sections. Accordingly, we can highlight the main contributions of this article in three points. Firstly; to show that long processes are completed in a short time, thanks to the integration of deep learning algorithms with the distributed system; training the model used to determine the optimal hyper-parameter values and the third is the presentation of the distributed deep learning approach.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
最优超参数值的分布式深度学习方法
随着人工智能的发展,特别是在技术和工业领域发生了巨大的变化。事实上,深度学习和强化学习研究是研究人员的热门话题,加速了这一变化。在本文中,提出了一个分布式系统来确定用于对象检测的深度学习算法的超参数的最准确值。在使用深度学习算法的目标识别方法中,影响准确率的最重要因素之一是确定具有正确值的超参数。为了确定这些参数的最佳值,可能需要进行很长时间的实验。为了解决这个问题,我们将RAY分布式架构与深度学习算法相结合,训练了一个用于对象检测的深度学习网络。通过在每次迭代中改变参数来观察准确率。对于目标检测,使用CPU对我们使用CIFAR-10数据集创建的神经网络进行训练。此外,由于分布式体系结构,每个流程由4个不同的工作人员进行培训。下面将详细介绍训练结果和人工神经网络的性质。因此,我们可以通过三点来突出本文的主要贡献。首先;由于深度学习算法与分布式系统的集成,长过程可以在短时间内完成;训练用于确定最优超参数值的模型,第三是分布式深度学习方法的介绍。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A New Framework for Quantum Image Processing and Application of Binary Template Matching Some IT Tools for Virtual Exchange in Higher Education Audio Signal Denoising Based on Laplacian Filter and Sparse Signal Reconstruction 360-degree Video Technology with Potential Use in Educational Applications [Copyright notice]
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1