{"title":"Selecting the variables that train a self-organizing map (SOM) which best separates predefined clusters","authors":"S. Laine","doi":"10.1109/ICONIP.2002.1199016","DOIUrl":null,"url":null,"abstract":"The paper presents how to find the variables that best illustrate a problem of interest when visualizing with the self-organizing map (SOM). The user defines what is interesting by labeling data points, e.g. with alphabets. These labels assign the data points into clusters. An optimization algorithm looks for the set of variables that best separates the clusters. These variables reflect the knowledge the user applied when labeling the data points. The paper measures the separability, not in the variable space, but on a SOM trained into this space. The found variables contain interesting information, and are well suited for the SOM. The trained SOM can comprehensively visualize the problem of interest, which supports discussion and learning from data. The approach is illustrated using the case of the Hitura mine; and compared with a standard statistical visualization algorithm, the Fisher discriminant analysis.","PeriodicalId":146553,"journal":{"name":"Proceedings of the 9th International Conference on Neural Information Processing, 2002. ICONIP '02.","volume":"223 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2002-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 9th International Conference on Neural Information Processing, 2002. ICONIP '02.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICONIP.2002.1199016","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
The paper presents how to find the variables that best illustrate a problem of interest when visualizing with the self-organizing map (SOM). The user defines what is interesting by labeling data points, e.g. with alphabets. These labels assign the data points into clusters. An optimization algorithm looks for the set of variables that best separates the clusters. These variables reflect the knowledge the user applied when labeling the data points. The paper measures the separability, not in the variable space, but on a SOM trained into this space. The found variables contain interesting information, and are well suited for the SOM. The trained SOM can comprehensively visualize the problem of interest, which supports discussion and learning from data. The approach is illustrated using the case of the Hitura mine; and compared with a standard statistical visualization algorithm, the Fisher discriminant analysis.