Distributed Deep Learning Approach for Optimal Hyper-Parameter Values

2022 26th International Conference on Information Technology (IT) Pub Date : 2022-02-16 DOI:10.1109/IT54280.2022.9743528

Ziya Tan, M. Karakose

{"title":"Distributed Deep Learning Approach for Optimal Hyper-Parameter Values","authors":"Ziya Tan, M. Karakose","doi":"10.1109/IT54280.2022.9743528","DOIUrl":null,"url":null,"abstract":"With the development of artificial intelligence, there are great changes especially in technology and industry sectors. The fact that deep learning and reinforcement learning studies are popular topics by researchers accelerates this change. In this article, a distributed system is presented to determine the hyper-parameters of the deep learning algorithm used for object detection at the most accurate value. One of the most important factors affecting the accuracy rate in object recognition approaches using deep learning algorithms is the determination of hyper-parameters with correct values. It may be necessary to carry out very long experiments to determine the optimum of these parameters. To solve this problem, a deep learning network used for object detection has been trained by combining the RAY distributed architecture with a deep learning algorithm. The accuracy rate is observed by changing the parameters in each iteration. For object detection, the training of the neural network we created with the CIFAR-10 dataset was carried out using CPU. In addition, thanks to the distributed architecture, each process is trained by 4 different workers. The training results and the properties of the artificial neural network are given in detail in the following sections. Accordingly, we can highlight the main contributions of this article in three points. Firstly; to show that long processes are completed in a short time, thanks to the integration of deep learning algorithms with the distributed system; training the model used to determine the optimal hyper-parameter values and the third is the presentation of the distributed deep learning approach.","PeriodicalId":335678,"journal":{"name":"2022 26th International Conference on Information Technology (IT)","volume":"118 6","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 26th International Conference on Information Technology (IT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IT54280.2022.9743528","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

With the development of artificial intelligence, there are great changes especially in technology and industry sectors. The fact that deep learning and reinforcement learning studies are popular topics by researchers accelerates this change. In this article, a distributed system is presented to determine the hyper-parameters of the deep learning algorithm used for object detection at the most accurate value. One of the most important factors affecting the accuracy rate in object recognition approaches using deep learning algorithms is the determination of hyper-parameters with correct values. It may be necessary to carry out very long experiments to determine the optimum of these parameters. To solve this problem, a deep learning network used for object detection has been trained by combining the RAY distributed architecture with a deep learning algorithm. The accuracy rate is observed by changing the parameters in each iteration. For object detection, the training of the neural network we created with the CIFAR-10 dataset was carried out using CPU. In addition, thanks to the distributed architecture, each process is trained by 4 different workers. The training results and the properties of the artificial neural network are given in detail in the following sections. Accordingly, we can highlight the main contributions of this article in three points. Firstly; to show that long processes are completed in a short time, thanks to the integration of deep learning algorithms with the distributed system; training the model used to determine the optimal hyper-parameter values and the third is the presentation of the distributed deep learning approach.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

最优超参数值的分布式深度学习方法

随着人工智能的发展，特别是在技术和工业领域发生了巨大的变化。事实上，深度学习和强化学习研究是研究人员的热门话题，加速了这一变化。在本文中，提出了一个分布式系统来确定用于对象检测的深度学习算法的超参数的最准确值。在使用深度学习算法的目标识别方法中，影响准确率的最重要因素之一是确定具有正确值的超参数。为了确定这些参数的最佳值，可能需要进行很长时间的实验。为了解决这个问题，我们将RAY分布式架构与深度学习算法相结合，训练了一个用于对象检测的深度学习网络。通过在每次迭代中改变参数来观察准确率。对于目标检测，使用CPU对我们使用CIFAR-10数据集创建的神经网络进行训练。此外，由于分布式体系结构，每个流程由4个不同的工作人员进行培训。下面将详细介绍训练结果和人工神经网络的性质。因此，我们可以通过三点来突出本文的主要贡献。首先;由于深度学习算法与分布式系统的集成，长过程可以在短时间内完成;训练用于确定最优超参数值的模型，第三是分布式深度学习方法的介绍。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2022 26th International Conference on Information Technology (IT)

自引率

0.00%

发文量

期刊最新文献

A New Framework for Quantum Image Processing and Application of Binary Template Matching Some IT Tools for Virtual Exchange in Higher Education Audio Signal Denoising Based on Laplacian Filter and Sparse Signal Reconstruction 360-degree Video Technology with Potential Use in Educational Applications [Copyright notice]