{"title":"Distributed Deep Learning Approach for Optimal Hyper-Parameter Values","authors":"Ziya Tan, M. Karakose","doi":"10.1109/IT54280.2022.9743528","DOIUrl":null,"url":null,"abstract":"With the development of artificial intelligence, there are great changes especially in technology and industry sectors. The fact that deep learning and reinforcement learning studies are popular topics by researchers accelerates this change. In this article, a distributed system is presented to determine the hyper-parameters of the deep learning algorithm used for object detection at the most accurate value. One of the most important factors affecting the accuracy rate in object recognition approaches using deep learning algorithms is the determination of hyper-parameters with correct values. It may be necessary to carry out very long experiments to determine the optimum of these parameters. To solve this problem, a deep learning network used for object detection has been trained by combining the RAY distributed architecture with a deep learning algorithm. The accuracy rate is observed by changing the parameters in each iteration. For object detection, the training of the neural network we created with the CIFAR-10 dataset was carried out using CPU. In addition, thanks to the distributed architecture, each process is trained by 4 different workers. The training results and the properties of the artificial neural network are given in detail in the following sections. Accordingly, we can highlight the main contributions of this article in three points. Firstly; to show that long processes are completed in a short time, thanks to the integration of deep learning algorithms with the distributed system; training the model used to determine the optimal hyper-parameter values and the third is the presentation of the distributed deep learning approach.","PeriodicalId":335678,"journal":{"name":"2022 26th International Conference on Information Technology (IT)","volume":"118 6","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 26th International Conference on Information Technology (IT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IT54280.2022.9743528","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
With the development of artificial intelligence, there are great changes especially in technology and industry sectors. The fact that deep learning and reinforcement learning studies are popular topics by researchers accelerates this change. In this article, a distributed system is presented to determine the hyper-parameters of the deep learning algorithm used for object detection at the most accurate value. One of the most important factors affecting the accuracy rate in object recognition approaches using deep learning algorithms is the determination of hyper-parameters with correct values. It may be necessary to carry out very long experiments to determine the optimum of these parameters. To solve this problem, a deep learning network used for object detection has been trained by combining the RAY distributed architecture with a deep learning algorithm. The accuracy rate is observed by changing the parameters in each iteration. For object detection, the training of the neural network we created with the CIFAR-10 dataset was carried out using CPU. In addition, thanks to the distributed architecture, each process is trained by 4 different workers. The training results and the properties of the artificial neural network are given in detail in the following sections. Accordingly, we can highlight the main contributions of this article in three points. Firstly; to show that long processes are completed in a short time, thanks to the integration of deep learning algorithms with the distributed system; training the model used to determine the optimal hyper-parameter values and the third is the presentation of the distributed deep learning approach.