首页 > 最新文献

2017 12th International Workshop on Self-Organizing Maps and Learning Vector Quantization, Clustering and Data Visualization (WSOM)最新文献

英文 中文
Empirical evaluation of gradient methods for matrix learning vector quantization 梯度方法在矩阵学习向量量化中的经验评价
Michael LeKander, Michael Biehl, Harm de Vries
Generalized Matrix Learning Vector Quantization (GMLVQ) critically relies on the use of an optimization algorithm to train its model parameters. We test various schemes for automated control of learning rates in gradient-based training. We evaluate these algorithms in terms of their achieved performance and their practical feasibility. We find that some algorithms do indeed perform better than others across multiple benchmark datasets. These algorithms produce GMLVQ models which not only better fit the training data, but also perform better upon validation. In particular, we find that the Variance-based Stochastic Gradient Descent algorithm consistently performs best across all experiments.
广义矩阵学习向量量化(GMLVQ)主要依赖于使用优化算法来训练其模型参数。在基于梯度的训练中,我们测试了各种自动控制学习率的方案。我们对这些算法的性能和实际可行性进行了评估。我们发现,在多个基准数据集上,一些算法确实比其他算法表现得更好。这些算法产生的GMLVQ模型不仅能更好地拟合训练数据,而且在验证时也表现得更好。特别是,我们发现基于方差的随机梯度下降算法在所有实验中始终表现最好。
{"title":"Empirical evaluation of gradient methods for matrix learning vector quantization","authors":"Michael LeKander, Michael Biehl, Harm de Vries","doi":"10.1109/WSOM.2017.8020027","DOIUrl":"https://doi.org/10.1109/WSOM.2017.8020027","url":null,"abstract":"Generalized Matrix Learning Vector Quantization (GMLVQ) critically relies on the use of an optimization algorithm to train its model parameters. We test various schemes for automated control of learning rates in gradient-based training. We evaluate these algorithms in terms of their achieved performance and their practical feasibility. We find that some algorithms do indeed perform better than others across multiple benchmark datasets. These algorithms produce GMLVQ models which not only better fit the training data, but also perform better upon validation. In particular, we find that the Variance-based Stochastic Gradient Descent algorithm consistently performs best across all experiments.","PeriodicalId":130086,"journal":{"name":"2017 12th International Workshop on Self-Organizing Maps and Learning Vector Quantization, Clustering and Data Visualization (WSOM)","volume":"101 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115223803","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Fusion of deep learning architectures, multilayer feedforward networks and learning vector quantizers for deep classification learning 融合深度学习架构、多层前馈网络和深度分类学习的学习向量量化器
T. Villmann, Michael Biehl, A. Villmann, S. Saralajew
The advantage of prototype based learning vector quantizers are the intuitive and simple model adaptation as well as the easy interpretability of the prototypes as class representatives for the class distribution to be learned. Although they frequently yield competitive performance and show robust behavior nowadays powerful alternatives have increasing attraction. Particularly, deep architectures of multilayer networks achieve frequently very high accuracies and are, thanks to modern graphic processor units use for calculation, trainable in acceptable time. In this conceptual paper we show, how we can combine both network architectures to benefit from their advantages. For this purpose, we consider learning vector quantizers in terms of feedforward network architectures and explain how it can be combined effectively with multilayer or single-layer feedforward network architectures. This approach includes deep and flat architectures as well as the popular extreme learning machines. For the resulting networks, the multi-/single-layer networks act as adaptive filters like in signal processing while the interpretability of the prototype-based learning vector quantizers is kept for the resulting filtered feature space. In this way a powerful combination of two successful architectures is obtained.
基于原型的学习向量量化器的优点是直观和简单的模型适应,以及原型作为待学习的类分布的类代表的易解释性。虽然它们经常产生有竞争力的表现,并表现出稳健的行为,但如今强大的替代品越来越有吸引力。特别是,多层网络的深度架构通常实现非常高的精度,并且由于使用现代图形处理器单元进行计算,可以在可接受的时间内进行训练。在这篇概念性的论文中,我们展示了如何结合这两种网络架构以从它们的优势中获益。为此,我们从前馈网络架构的角度考虑学习向量量化器,并解释如何将其与多层或单层前馈网络架构有效地结合起来。这种方法包括深度和平面架构以及流行的极限学习机。对于所得到的网络,多层/单层网络充当自适应滤波器,就像信号处理一样,而基于原型的学习向量量化器对所得到的过滤特征空间保持可解释性。通过这种方式,可以获得两个成功架构的强大组合。
{"title":"Fusion of deep learning architectures, multilayer feedforward networks and learning vector quantizers for deep classification learning","authors":"T. Villmann, Michael Biehl, A. Villmann, S. Saralajew","doi":"10.1109/WSOM.2017.8020009","DOIUrl":"https://doi.org/10.1109/WSOM.2017.8020009","url":null,"abstract":"The advantage of prototype based learning vector quantizers are the intuitive and simple model adaptation as well as the easy interpretability of the prototypes as class representatives for the class distribution to be learned. Although they frequently yield competitive performance and show robust behavior nowadays powerful alternatives have increasing attraction. Particularly, deep architectures of multilayer networks achieve frequently very high accuracies and are, thanks to modern graphic processor units use for calculation, trainable in acceptable time. In this conceptual paper we show, how we can combine both network architectures to benefit from their advantages. For this purpose, we consider learning vector quantizers in terms of feedforward network architectures and explain how it can be combined effectively with multilayer or single-layer feedforward network architectures. This approach includes deep and flat architectures as well as the popular extreme learning machines. For the resulting networks, the multi-/single-layer networks act as adaptive filters like in signal processing while the interpretability of the prototype-based learning vector quantizers is kept for the resulting filtered feature space. In this way a powerful combination of two successful architectures is obtained.","PeriodicalId":130086,"journal":{"name":"2017 12th International Workshop on Self-Organizing Maps and Learning Vector Quantization, Clustering and Data Visualization (WSOM)","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115557307","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 21
Prototypes and matrix relevance learning in complex fourier space 复傅立叶空间中的原型与矩阵相关学习
M. Straat, M. Kaden, M. Gay, T. Villmann, A. Lampe, U. Seiffert, Michael Biehl, F. Melchert
In this contribution, we consider the classification of time-series and similar functional data which can be represented in complex Fourier coefficient space. We apply versions of Learning Vector Quantization (LVQ) which are suitable for complex-valued data, based on the so-called Wirtinger calculus. It makes possible the formulation of gradient based update rules in the framework of cost-function based Generalized Matrix Relevance LVQ (GMLVQ). Alternatively, we consider the concatenation of real and imaginary parts of Fourier coefficients in a real-valued feature vector and the classification of time domain representations by means of conventional GMLVQ.
在这篇贡献中,我们考虑了可以在复傅立叶系数空间中表示的时间序列和类似函数数据的分类。我们基于所谓的Wirtinger微积分,应用了适合于复值数据的学习向量量化(LVQ)版本。这使得在基于成本函数的广义矩阵关联LVQ (GMLVQ)框架下建立基于梯度的更新规则成为可能。或者,我们考虑在实值特征向量中傅里叶系数的实部和虚部的串联,并通过传统的GMLVQ方法对时域表示进行分类。
{"title":"Prototypes and matrix relevance learning in complex fourier space","authors":"M. Straat, M. Kaden, M. Gay, T. Villmann, A. Lampe, U. Seiffert, Michael Biehl, F. Melchert","doi":"10.1109/WSOM.2017.8020019","DOIUrl":"https://doi.org/10.1109/WSOM.2017.8020019","url":null,"abstract":"In this contribution, we consider the classification of time-series and similar functional data which can be represented in complex Fourier coefficient space. We apply versions of Learning Vector Quantization (LVQ) which are suitable for complex-valued data, based on the so-called Wirtinger calculus. It makes possible the formulation of gradient based update rules in the framework of cost-function based Generalized Matrix Relevance LVQ (GMLVQ). Alternatively, we consider the concatenation of real and imaginary parts of Fourier coefficients in a real-valued feature vector and the classification of time domain representations by means of conventional GMLVQ.","PeriodicalId":130086,"journal":{"name":"2017 12th International Workshop on Self-Organizing Maps and Learning Vector Quantization, Clustering and Data Visualization (WSOM)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128030404","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Motivated self-organization 动机的自我组织
N. Rougier, Y. Boniface
We present in this paper a variation of the self-organizing map algorithm where the original time-dependent (learning rate and neighborhood) learning function is replaced by a time-invariant one. The resulting self-organization does not fit the magnification law and the final vector density is not directly proportional to the density of the distribution. This lead us to introduce the notion of motivated self-organization where the self-organization is biased toward some data thanks to a supplementary signal. From a behavioral point of view, this signal may be understood as a motivational signal allowing a finer tuning of the final self-organization where needed. We illustrate this behavior through a simple robotic arm setup. Open access version of this article is available at https://hal.inria.fr/hal-01513519.
本文提出了一种自组织映射算法的变体,将原始的时变(学习率和邻域)学习函数替换为时不变的学习函数。由此产生的自组织不符合放大定律,最终矢量密度与分布密度不成正比。这导致我们引入了激励自组织的概念,其中自组织由于补充信号而偏向于某些数据。从行为的角度来看,这个信号可以被理解为一种动机信号,允许在需要的地方对最终的自组织进行更精细的调整。我们通过一个简单的机械臂设置来说明这种行为。本文的开放获取版本可在https://hal.inria.fr/hal-01513519上获得。
{"title":"Motivated self-organization","authors":"N. Rougier, Y. Boniface","doi":"10.1109/WSOM.2017.8020000","DOIUrl":"https://doi.org/10.1109/WSOM.2017.8020000","url":null,"abstract":"We present in this paper a variation of the self-organizing map algorithm where the original time-dependent (learning rate and neighborhood) learning function is replaced by a time-invariant one. The resulting self-organization does not fit the magnification law and the final vector density is not directly proportional to the density of the distribution. This lead us to introduce the notion of motivated self-organization where the self-organization is biased toward some data thanks to a supplementary signal. From a behavioral point of view, this signal may be understood as a motivational signal allowing a finer tuning of the final self-organization where needed. We illustrate this behavior through a simple robotic arm setup. Open access version of this article is available at https://hal.inria.fr/hal-01513519.","PeriodicalId":130086,"journal":{"name":"2017 12th International Workshop on Self-Organizing Maps and Learning Vector Quantization, Clustering and Data Visualization (WSOM)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126063138","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Credible visualizations for planar projections 可信的平面投影可视化
A. Ultsch, Michael C. Thrun
Planar projections, i.e. projections from a high dimensional data space onto a two dimensional plane, are still in use to detect structures, such as clusters, in multivariate data. It can be shown that only the subclass of focusing projections such as CCA, NeRV and the ESOM are able to disentangle linear non separable data. However, even these projections are sometimes erroneous. U-matrix methods are able to visualize these errors for SOM based projections. This paper extends the U-matrix methods to other projections in form of a so called generalized U-matrix. Based on previous work, an algorithm for the construction of generalized U-matrix is introduced, that is more efficient and free of parameters which may be hard to determine. Results are presented on a difficult artificial data set and a real word multivariate data set from cancer research.
平面投影,即从高维数据空间到二维平面的投影,仍然用于检测多元数据中的结构,如簇。结果表明,只有聚焦投影的子类,如CCA、NeRV和ESOM能够解纠缠线性不可分数据。然而,即使是这些预测有时也是错误的。对于基于SOM的投影,u矩阵方法能够将这些误差可视化。本文以广义u矩阵的形式将u矩阵方法推广到其他投影。在前人工作的基础上,提出了一种构造广义u矩阵的算法,该算法不仅效率高,而且不存在难以确定的参数。在癌症研究的一个复杂的人工数据集和一个真实的多变量数据集上给出了结果。
{"title":"Credible visualizations for planar projections","authors":"A. Ultsch, Michael C. Thrun","doi":"10.1109/WSOM.2017.8020010","DOIUrl":"https://doi.org/10.1109/WSOM.2017.8020010","url":null,"abstract":"Planar projections, i.e. projections from a high dimensional data space onto a two dimensional plane, are still in use to detect structures, such as clusters, in multivariate data. It can be shown that only the subclass of focusing projections such as CCA, NeRV and the ESOM are able to disentangle linear non separable data. However, even these projections are sometimes erroneous. U-matrix methods are able to visualize these errors for SOM based projections. This paper extends the U-matrix methods to other projections in form of a so called generalized U-matrix. Based on previous work, an algorithm for the construction of generalized U-matrix is introduced, that is more efficient and free of parameters which may be hard to determine. Results are presented on a difficult artificial data set and a real word multivariate data set from cancer research.","PeriodicalId":130086,"journal":{"name":"2017 12th International Workshop on Self-Organizing Maps and Learning Vector Quantization, Clustering and Data Visualization (WSOM)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133295832","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
Improving individual predictions using social networks assortativity 利用社交网络分类性改进个人预测
D. Mulders, Cyril de Bodt, Johannes Bjelland, A. Pentland, M. Verleysen, Yves-Alexandre de Montjoye
Social networks are known to be assortative with respect to many attributes, such as age, weight, wealth, level of education, ethnicity and gender. This can be explained by influences and homophilies. Independently of its origin, this assortativity gives us information about each node given its neighbors. Assortativity can thus be used to improve individual predictions in a broad range of situations, when data are missing or inaccurate. This paper presents a general framework based on probabilistic graphical models to exploit social network structures for improving individual predictions of node attributes. Using this framework, we quantify the assortativity range leading to an accuracy gain in several situations. We finally show how specific characteristics of the network can improve performances further. For instance, the gender assortativity in real-world mobile phone data changes significantly according to some communication attributes. In this case, individual predictions with 75% accuracy are improved by up to 3%.
众所周知,社交网络在许多属性方面都是分类的,比如年龄、体重、财富、教育水平、种族和性别。这可以用影响和同质性来解释。与它的起源无关,这种分类性为我们提供了每个节点在给定其邻居的情况下的信息。因此,当数据缺失或不准确时,选型性可用于在广泛的情况下改进个人预测。本文提出了一个基于概率图模型的通用框架,利用社会网络结构来改进节点属性的个体预测。使用这个框架,我们量化了在几种情况下导致准确性增加的分类范围。我们最后展示了网络的特定特征如何进一步提高性能。例如,现实世界手机数据中的性别分类性根据某些通信属性发生了显著变化。在这种情况下,准确率为75%的个体预测提高了3%。
{"title":"Improving individual predictions using social networks assortativity","authors":"D. Mulders, Cyril de Bodt, Johannes Bjelland, A. Pentland, M. Verleysen, Yves-Alexandre de Montjoye","doi":"10.1109/WSOM.2017.8020023","DOIUrl":"https://doi.org/10.1109/WSOM.2017.8020023","url":null,"abstract":"Social networks are known to be assortative with respect to many attributes, such as age, weight, wealth, level of education, ethnicity and gender. This can be explained by influences and homophilies. Independently of its origin, this assortativity gives us information about each node given its neighbors. Assortativity can thus be used to improve individual predictions in a broad range of situations, when data are missing or inaccurate. This paper presents a general framework based on probabilistic graphical models to exploit social network structures for improving individual predictions of node attributes. Using this framework, we quantify the assortativity range leading to an accuracy gain in several situations. We finally show how specific characteristics of the network can improve performances further. For instance, the gender assortativity in real-world mobile phone data changes significantly according to some communication attributes. In this case, individual predictions with 75% accuracy are improved by up to 3%.","PeriodicalId":130086,"journal":{"name":"2017 12th International Workshop on Self-Organizing Maps and Learning Vector Quantization, Clustering and Data Visualization (WSOM)","volume":"56 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124306036","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
An evolutionary building algorithm for Deep Neural Networks 深度神经网络的进化构建算法
R. Zemouri
The increase of the computer power has contributed significantly to the development of the Deep Neural Networks. However, the training phase is more difficult since there are many hidden layers with many connections. The aim of this paper is to improve the learning procedure for Deep Neural Networks. A new method for building an evolutionary DNN is presented. With our method, the user does not have to arbitrary specify the number of hidden layers nor the number of neurons per layer. Illustrative examples are provided to support the theoretical analysis.
计算机性能的提高为深度神经网络的发展做出了重要贡献。然而,训练阶段更加困难,因为有许多隐藏层和许多连接。本文的目的是改进深度神经网络的学习过程。提出了一种构建进化深度神经网络的新方法。使用我们的方法,用户不必任意指定隐藏层的数量或每层神经元的数量。并提供了实例来支持理论分析。
{"title":"An evolutionary building algorithm for Deep Neural Networks","authors":"R. Zemouri","doi":"10.1109/WSOM.2017.8020002","DOIUrl":"https://doi.org/10.1109/WSOM.2017.8020002","url":null,"abstract":"The increase of the computer power has contributed significantly to the development of the Deep Neural Networks. However, the training phase is more difficult since there are many hidden layers with many connections. The aim of this paper is to improve the learning procedure for Deep Neural Networks. A new method for building an evolutionary DNN is presented. With our method, the user does not have to arbitrary specify the number of hidden layers nor the number of neurons per layer. Illustrative examples are provided to support the theoretical analysis.","PeriodicalId":130086,"journal":{"name":"2017 12th International Workshop on Self-Organizing Maps and Learning Vector Quantization, Clustering and Data Visualization (WSOM)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115138726","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Imputation of reactive silica and available alumina in bauxites by self-organizing maps 用自组织图法测定铝土矿中活性二氧化硅和有效氧化铝
C. C. Carneiro, Dayana Niazabeth Del Valle Silva Yanez, C. Ulsen, S. Fraser, Juliana Livi Antoniassi, S. Paz, R. Angélica, H. Kahn
Geochemical analyses can provide multiple analytical variables. Accordingly, the generation of large geochemical databases enables imputation studies or analytical estimates of missing values or complex measuring. The processing of bauxite is a key step in the production of aluminum, in which the determination of Reactive Silica (RxSiO2) and Available Alumina (AvAl2O3) are very relevant. The traditional analytical method for achieving RxSiO2 has limitations associated with poor repeatability and reproducibility of results. Based on the values from the unsupervised Self-Organizing Maps technique, this study aims to develop, systematically, the imputation of missing grades of the geochemical composition of bauxite samples of a database from three trial projects, for the variables: total Al2O3; total SiO2; total Fe2O3; and total TiO2. Each project was submitted to partial exclusion of AvAl2O3 and RxSiO2 values, in proportion of 20%, 30%, 40% and 50%, to investigate the SOM technique as imputation method for RxSiO2 and AvAl2O3. By comparing the imputed values from the SOM analysis with the original values, SOM technique demonstrated to be an imputation tool capable of obtaining analytical results with up to 50% of missing data. Specifically, the best results demonstrate that AvAl2O3 can be obtained by imputation with a higher correlation than RxSiO2, based on the parameters and variables involved in the study. Similarity in the nature of samples and an increase in the number of embedded analytical variables are factors that provided better imputation results.
地球化学分析可以提供多个分析变量。因此,大型地球化学数据库的产生使对缺失值或复杂测量的归算研究或分析估计成为可能。铝土矿的加工是铝生产的关键步骤,其中活性二氧化硅(RxSiO2)和有效氧化铝(AvAl2O3)的测定是非常重要的。获得RxSiO2的传统分析方法存在重复性差和结果重现性差的局限性。基于无监督自组织图(unsupervised Self-Organizing Maps)技术的值,本研究旨在系统地建立三个试验项目数据库中铝土矿样品地球化学成分缺失品位的归算方法,变量为:Al2O3总量;总二氧化硅;总Fe2O3;和总TiO2。每个项目按20%、30%、40%和50%的比例对AvAl2O3和RxSiO2值进行部分排除,研究SOM技术作为RxSiO2和AvAl2O3的imputation方法。通过将SOM分析的输入值与原始值进行比较,SOM技术证明是一种能够在缺失数据高达50%的情况下获得分析结果的输入工具。具体而言,最佳结果表明,基于研究中涉及的参数和变量,通过代入得到的AvAl2O3比RxSiO2具有更高的相关性。样品性质的相似性和嵌入分析变量数量的增加是提供更好的imputation结果的因素。
{"title":"Imputation of reactive silica and available alumina in bauxites by self-organizing maps","authors":"C. C. Carneiro, Dayana Niazabeth Del Valle Silva Yanez, C. Ulsen, S. Fraser, Juliana Livi Antoniassi, S. Paz, R. Angélica, H. Kahn","doi":"10.1109/WSOM.2017.8020008","DOIUrl":"https://doi.org/10.1109/WSOM.2017.8020008","url":null,"abstract":"Geochemical analyses can provide multiple analytical variables. Accordingly, the generation of large geochemical databases enables imputation studies or analytical estimates of missing values or complex measuring. The processing of bauxite is a key step in the production of aluminum, in which the determination of Reactive Silica (RxSiO<inf>2</inf>) and Available Alumina (AvAl<inf>2</inf>O<inf>3</inf>) are very relevant. The traditional analytical method for achieving RxSiO<inf>2</inf> has limitations associated with poor repeatability and reproducibility of results. Based on the values from the unsupervised Self-Organizing Maps technique, this study aims to develop, systematically, the imputation of missing grades of the geochemical composition of bauxite samples of a database from three trial projects, for the variables: total Al<inf>2</inf>O<inf>3</inf>; total SiO<inf>2</inf>; total Fe<inf>2</inf>O<inf>3</inf>; and total TiO<inf>2</inf>. Each project was submitted to partial exclusion of AvAl<inf>2</inf>O<inf>3</inf> and RxSiO<inf>2</inf> values, in proportion of 20%, 30%, 40% and 50%, to investigate the SOM technique as imputation method for RxSiO<inf>2</inf> and AvAl<inf>2</inf>O<inf>3</inf>. By comparing the imputed values from the SOM analysis with the original values, SOM technique demonstrated to be an imputation tool capable of obtaining analytical results with up to 50% of missing data. Specifically, the best results demonstrate that AvAl<inf>2</inf>O<inf>3</inf> can be obtained by imputation with a higher correlation than RxSiO<inf>2</inf>, based on the parameters and variables involved in the study. Similarity in the nature of samples and an increase in the number of embedded analytical variables are factors that provided better imputation results.","PeriodicalId":130086,"journal":{"name":"2017 12th International Workshop on Self-Organizing Maps and Learning Vector Quantization, Clustering and Data Visualization (WSOM)","volume":"50 15","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"113934161","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
SOM-empowered graph segmentation for fast automatic clustering of large and complex data 基于som的图形分割,用于大型复杂数据的快速自动聚类
E. Merényi, Joshua Taylor
Many clustering methods, including modern graph segmentation algorithms, run into limitations when encountering “Big Data”, data with high feature dimensions, large volume, and complex structure. SOM-based clustering has been demonstrated to accurately capture many clusters of widely varying statistical properties in such data. While a number of automated SOM segmentations have been put forward, the best identifications of complex cluster structures to date are those performed interactively from informative visualizations of the learned SOM's knowledge. This does not scale for Big Data, large archives or near-real time analyses for fast decision-making. We present a new automated approach to SOM-segmentation which closely approximates the precision of the interactive method for complicated data, and at the same time is very fast and memory-efficient. We achieve this by infusing SOM knowledge into leading graph segmentation algorithms which, by themselves, produce extremely poor results segmenting the SOM prototypes. We use the SOM prototypes as input vectors and CONN similarity measure, derived from the SOM's knowledge of the data connectivity, as edge weighting to the graph segmentation algorithms. We demonstrate the effectiveness on synthetic data and on real spectral imagery.
许多聚类方法,包括现代的图分割算法,在面对“大数据”这种特征维数高、体积大、结构复杂的数据时,都存在局限性。基于som的聚类已被证明可以准确地捕获此类数据中具有广泛不同统计属性的许多聚类。虽然已经提出了许多自动化的SOM分割方法,但迄今为止,对复杂簇结构的最佳识别是那些从学习的SOM知识的信息可视化中交互式执行的识别。这并不适用于大数据、大型档案或用于快速决策的近实时分析。本文提出了一种新的自动分割方法,该方法既接近复杂数据的交互式分割方法的精度,又具有快速和节省内存的特点。我们通过将SOM知识注入到领先的图分割算法中来实现这一点,这些算法本身会产生非常差的分割SOM原型的结果。我们使用SOM原型作为输入向量和CONN相似度度量,从SOM对数据连通性的了解中获得,作为图分割算法的边缘加权。我们证明了在合成数据和真实光谱图像上的有效性。
{"title":"SOM-empowered graph segmentation for fast automatic clustering of large and complex data","authors":"E. Merényi, Joshua Taylor","doi":"10.1109/WSOM.2017.8020004","DOIUrl":"https://doi.org/10.1109/WSOM.2017.8020004","url":null,"abstract":"Many clustering methods, including modern graph segmentation algorithms, run into limitations when encountering “Big Data”, data with high feature dimensions, large volume, and complex structure. SOM-based clustering has been demonstrated to accurately capture many clusters of widely varying statistical properties in such data. While a number of automated SOM segmentations have been put forward, the best identifications of complex cluster structures to date are those performed interactively from informative visualizations of the learned SOM's knowledge. This does not scale for Big Data, large archives or near-real time analyses for fast decision-making. We present a new automated approach to SOM-segmentation which closely approximates the precision of the interactive method for complicated data, and at the same time is very fast and memory-efficient. We achieve this by infusing SOM knowledge into leading graph segmentation algorithms which, by themselves, produce extremely poor results segmenting the SOM prototypes. We use the SOM prototypes as input vectors and CONN similarity measure, derived from the SOM's knowledge of the data connectivity, as edge weighting to the graph segmentation algorithms. We demonstrate the effectiveness on synthetic data and on real spectral imagery.","PeriodicalId":130086,"journal":{"name":"2017 12th International Workshop on Self-Organizing Maps and Learning Vector Quantization, Clustering and Data Visualization (WSOM)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134153102","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Self-organizing map for orienteering problem with dubins vehicle 杜宾车辆定向问题的自组织地图
J. Faigl
This paper reports on the application of the self-organizing map (SOM) to solve a novel generalization of the Orienteering Problem (OP) for curvature-constrained vehicles that is called the Dubins Orienteering Problem (DOP). Having a set of target locations, each with associated reward, and a given travel budget, the problem is to find the most valuable curvature-constrained path connecting the target locations such that the path does not exceed the travel budget. The proposed approach is based on two existing SOM-based approaches to solving the OP and Dubins Traveling Salesman Problem (Dubins TSP) that are further generalized to provide a solution of the more computational challenging DOP. DOP combines challenges of the combinatorial optimization of the OP and TSP to determine a subset of the most valuable targets and the optimal sequence of the waypoints to collect rewards of the targets together with the continuous optimization of determining headings of Dubins vehicle at the waypoints such that the total length of the curvature-constrained path is shorter than the given travel budget and the total sum of the collected rewards is maximized.
本文报道了应用自组织映射(SOM)求解曲率受限车辆定向问题(OP)的一种新推广,即Dubins定向问题(DOP)。有了一组目标地点,每个地点都有相应的奖励和给定的旅行预算,问题是找到连接目标地点的最有价值的曲率约束路径,使路径不超过旅行预算。提出的方法是基于现有的两种基于som的方法来解决OP和Dubins旅行推销员问题(Dubins TSP),这两种方法进一步推广到提供更具计算挑战性的DOP的解决方案。DOP结合了OP和TSP组合优化的挑战,确定最有价值的目标子集和收集目标奖励的最优路径点序列,并在路径点上连续优化确定Dubins车辆的航向,使曲率约束路径的总长度小于给定的旅行预算,并使收集的奖励总额最大化。
{"title":"Self-organizing map for orienteering problem with dubins vehicle","authors":"J. Faigl","doi":"10.1109/WSOM.2017.8020017","DOIUrl":"https://doi.org/10.1109/WSOM.2017.8020017","url":null,"abstract":"This paper reports on the application of the self-organizing map (SOM) to solve a novel generalization of the Orienteering Problem (OP) for curvature-constrained vehicles that is called the Dubins Orienteering Problem (DOP). Having a set of target locations, each with associated reward, and a given travel budget, the problem is to find the most valuable curvature-constrained path connecting the target locations such that the path does not exceed the travel budget. The proposed approach is based on two existing SOM-based approaches to solving the OP and Dubins Traveling Salesman Problem (Dubins TSP) that are further generalized to provide a solution of the more computational challenging DOP. DOP combines challenges of the combinatorial optimization of the OP and TSP to determine a subset of the most valuable targets and the optimal sequence of the waypoints to collect rewards of the targets together with the continuous optimization of determining headings of Dubins vehicle at the waypoints such that the total length of the curvature-constrained path is shorter than the given travel budget and the total sum of the collected rewards is maximized.","PeriodicalId":130086,"journal":{"name":"2017 12th International Workshop on Self-Organizing Maps and Learning Vector Quantization, Clustering and Data Visualization (WSOM)","volume":"124 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115449697","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
期刊
2017 12th International Workshop on Self-Organizing Maps and Learning Vector Quantization, Clustering and Data Visualization (WSOM)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1