{"title":"Kohonen self-organizing map application to representative sample formation in the training of the multilayer perceptron","authors":"Aleksey A. Pastukhov, Alexander A. Prokofiev","doi":"10.1016/j.spjpm.2016.05.012","DOIUrl":null,"url":null,"abstract":"<div><p>In this paper, we have considered the issue of effectively forming a representative sample for training the neural network of the multilayer perceptron (MLP) type. The main problems arising in the process of the factor space division into the test, validation and training sets were formulated. An approach based on the use of clustering that allowed to increase the entropy of the training set was put forward. Kohonen's self-organizing maps (SOM) were examined as an effective clustering procedure. Based on such maps, the clustering of factor spaces of different dimensions was carried out, and a representative sample was formed. To verify our approach we synthesized the MLP neural network and trained it. The training technique was performed with the sets formed both with and without clustering. The approach under consideration was concluded to have an influence on the increase in the entropy of the training set and (as a result) to lead to the quality improvement of MLP training with the small dimension of the factor space.</p></div>","PeriodicalId":41808,"journal":{"name":"St Petersburg Polytechnic University Journal-Physics and Mathematics","volume":"2 2","pages":"Pages 134-143"},"PeriodicalIF":0.2000,"publicationDate":"2016-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.spjpm.2016.05.012","citationCount":"16","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"St Petersburg Polytechnic University Journal-Physics and Mathematics","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2405722316300809","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"PHYSICS, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 16
Abstract
In this paper, we have considered the issue of effectively forming a representative sample for training the neural network of the multilayer perceptron (MLP) type. The main problems arising in the process of the factor space division into the test, validation and training sets were formulated. An approach based on the use of clustering that allowed to increase the entropy of the training set was put forward. Kohonen's self-organizing maps (SOM) were examined as an effective clustering procedure. Based on such maps, the clustering of factor spaces of different dimensions was carried out, and a representative sample was formed. To verify our approach we synthesized the MLP neural network and trained it. The training technique was performed with the sets formed both with and without clustering. The approach under consideration was concluded to have an influence on the increase in the entropy of the training set and (as a result) to lead to the quality improvement of MLP training with the small dimension of the factor space.