Robust neural predictor for noisy chaotic time series prediction
Pub Date: 2013-08-01 | DOI: 10.1109/IJCNN.2013.6706996
Min Han, Xinying Wang
A robust neural predictor is designed for noisy chaotic time series prediction in this paper. The main idea is to account for the bounded uncertainty in the predictor input, which makes this a typical errors-in-variables problem. The robust design is based on the linear-in-parameters ESN (Echo State Network) model. By minimizing the worst-case residual induced by the bounded perturbations in the echo state variables, the robust predictor copes with the uncertainty in the noisy time series. In the experiments, the classical Mackey-Glass 84-step benchmark prediction task is investigated, and the prediction performance of the nominal and robust ESN predictor designs is compared.
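For Frobenius-norm-bounded perturbations of the regressor matrix, the worst-case residual admits a well-known closed form, min_w ||Hw - y|| + rho*||w|| (El Ghaoui and Lebret, 1997), which a robust readout can minimize directly. A minimal numpy/scipy sketch on synthetic data; the echo state matrix H, the bound rho, and the norm choice are illustrative assumptions, not the paper's exact formulation:

```python
import numpy as np
from scipy.optimize import minimize

rng = np.random.default_rng(0)
H = rng.standard_normal((200, 30))                  # hypothetical echo state matrix
y = H @ rng.standard_normal(30) + 0.1 * rng.standard_normal(200)
rho = 0.5                                           # assumed perturbation bound

def worst_case_residual(w):
    # max over ||E||_F <= rho of ||(H + E) w - y|| collapses to this closed form
    return np.linalg.norm(H @ w - y) + rho * np.linalg.norm(w)

w_nominal = np.linalg.lstsq(H, y, rcond=None)[0]    # ordinary least-squares readout
w_robust = minimize(worst_case_residual, w_nominal).x
```

The robust solution shrinks the readout weights relative to the nominal one, which is what buys insensitivity to perturbed echo states.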
{"title":"Robust neural predictor for noisy chaotic time series prediction","authors":"Min Han, Xinying Wang","doi":"10.1109/IJCNN.2013.6706996","DOIUrl":"https://doi.org/10.1109/IJCNN.2013.6706996","url":null,"abstract":"A robust neural predictor is designed for noisy chaotic time series prediction in this paper. The main idea is based on the consideration of the bounded uncertainty in predictor input, and it is a typical Errors-in-Variables problem. The robust design is based on the linear-in-parameters ESN (Echo State Network) model. By minimizing the worst-case residual induced by the bounded perturbations in the echo state variables, the robust predictor is obtained in coping with the uncertainty in the noisy time series. In the experiment, the classical Mackey-Glass 84-step benchmark prediction task is investigated. The prediction performance is studied for the nominal and robust design of ESN predictors.","PeriodicalId":376975,"journal":{"name":"The 2013 International Joint Conference on Neural Networks (IJCNN)","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122650832","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Cellular neural network based situational awareness system for power grids
Pub Date: 2013-08-01 | DOI: 10.1109/IJCNN.2013.6707108
K. Balasubramaniam, G. Venayagamoorthy, N. Watson
Situational awareness (SA), in simple terms, is understanding the current state of a system and, based on that understanding, predicting how the system states will evolve over time. Predictive modeling of power systems using conventional methods is time consuming and hence not well suited for real-time operation. In this study, a neural network (NN) based nonlinear predictor is used to predict the states of a power system at a future time instant. The required control signals are computed from the predicted state variables and the control set points. To reduce computation, the problem is decoupled and solved in a cellular array of NNs. The cellular neural network (CNN) framework allows for accurate prediction with only minimal information exchange between neighboring predictors. The predicted states are then used to compute stability metrics that give the proximity to the point of instability. The situational awareness platform developed with the CNN framework extracts information from data for the next time instance, i.e., a step ahead of time, and maps these data onto the geographical coordinates of power system components. The geographic information system (GIS) provides a visual indication of the operating status of individual components as well as of the entire system.
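To make the decoupling concrete, here is a toy sketch of a cellular array of one-step-ahead predictors in which each cell sees only its own state and its immediate neighbors' states. The ring topology, the linear per-cell predictors, and the synthetic state trajectories are all assumptions standing in for the paper's power-system NNs:

```python
import numpy as np

rng = np.random.default_rng(1)
T, n_cells = 500, 10
states = np.cumsum(rng.standard_normal((T, n_cells)), axis=0)  # synthetic bus states

def neighbor_inputs(x, i):
    # each cell sees only itself and its immediate neighbors (ring topology assumed)
    return x[:, [(i - 1) % n_cells, i, (i + 1) % n_cells]]

predictors = []
for i in range(n_cells):
    X = neighbor_inputs(states[:-1], i)          # local inputs at time t
    y = states[1:, i]                            # this cell's state at t+1
    w, *_ = np.linalg.lstsq(X, y, rcond=None)
    predictors.append(w)

# one-step-ahead prediction for the whole grid, one small predictor per cell
x_t = states[-1:]
x_next = np.array([neighbor_inputs(x_t, i) @ predictors[i]
                   for i in range(n_cells)]).ravel()
```

The point of the structure is that each predictor's input dimension stays fixed as the grid grows, which is what keeps the array cheap to evaluate in real time.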
{"title":"Cellular neural network based situational awareness system for power grids","authors":"K. Balasubramaniam, G. Venayagamoorthy, N. Watson","doi":"10.1109/IJCNN.2013.6707108","DOIUrl":"https://doi.org/10.1109/IJCNN.2013.6707108","url":null,"abstract":"Situational awareness (SA) in simple terms is to understand the current state of the system and based on that understanding predict how system states are to evolve over time. Predictive modeling of power systems using conventional methods is time consuming and hence not well suited for real-time operation. In this study, neural network (NN) based non-linear predictor is used to predict states of power system for future time instance. Required control signals are computed based on predicted state variables and control set points. In order to reduce computation the problem is decoupled and solved in a cellular array of NNs. The cellular neural network (CNN) framework allows for accurate prediction with only minimal information exchange between neighboring predictors. The predicted states are then used in computing stability metrics that give proximity to point of instability. The situational awareness platform developed using CNN framework extracts information from data for the next time instance i.e. a step ahead of time and maps this data with geographical coordinates of power system components. The geographic information system (GIS) provides a visual indication of operating status of individual components as well as that of the entire system.","PeriodicalId":376975,"journal":{"name":"The 2013 International Joint Conference on Neural Networks (IJCNN)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122685482","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Chunk incremental IDR/QR LDA learning
Pub Date: 2013-08-01 | DOI: 10.1109/IJCNN.2013.6707018
Yiming Peng, Shaoning Pang, Gang Chen, A. Sarrafzadeh, Tao Ban, D. Inoue
Training data in the real world are often presented in random chunks, yet the existing sequential incremental IDR/QR LDA (s-QR/IncLDA) can only process data one sample at a time. This paper proposes a constructive chunk incremental IDR/QR LDA (c-QR/IncLDA) for incremental learning from multiple data samples. Given a chunk of s samples for incremental learning, the proposed c-QR/IncLDA updates the current discriminant model Ω by computing on the compressed residue matrix Δ ∈ R^(d×η) instead of the entire incoming data chunk X ∈ R^(d×s), where η ≤ s holds. Meanwhile, we derive a more accurate reduced within-class scatter matrix W to minimize the loss of discriminative information at every incremental learning cycle. The computational complexity of c-QR/IncLDA can be higher than that of s-QR/IncLDA for single-sample processing; for multiple-sample processing, however, the computational efficiency of c-QR/IncLDA deterministically surpasses s-QR/IncLDA when the chunk size is large, i.e., when s ≫ η holds. Moreover, the experimental evaluation shows that the proposed c-QR/IncLDA achieves an accuracy level that is competitive with batch QR/LDA and consistently higher than that of s-QR/IncLDA.
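The cost saving comes from factorizing only the part of the incoming chunk that is new relative to the current subspace. A minimal numpy sketch of that residue compression, assuming an orthonormal basis Q for the current model; an SVD is used here for reliable rank detection, whereas the paper updates a QR factorization:

```python
import numpy as np

def chunk_basis_update(Q, X, tol=1e-10):
    """Extend an orthonormal basis Q (d x k) with the novel directions of chunk X (d x s)."""
    # Residue of the chunk w.r.t. the current subspace span(Q)
    Delta = X - Q @ (Q.T @ X)
    # Factorize only the residue; its rank eta <= s drives the cost,
    # not the full chunk size s.
    U, sv, _ = np.linalg.svd(Delta, full_matrices=False)
    return np.hstack([Q, U[:, sv > tol]])          # k + eta columns
```

When the chunk lies mostly inside the current subspace (eta much smaller than s), almost nothing new has to be factorized, which is exactly the regime where the chunk method wins.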
{"title":"Chunk incremental IDR/QR LDA learning","authors":"Yiming Peng, Shaoning Pang, Gang Chen, A. Sarrafzadeh, Tao Ban, D. Inoue","doi":"10.1109/IJCNN.2013.6707018","DOIUrl":"https://doi.org/10.1109/IJCNN.2013.6707018","url":null,"abstract":"Training data in real world is often presented in random chunks. Yet existing sequential Incremental IDR/QR LDA (s-QR/IncLDA) can only process data one sample after another. This paper proposes a constructive chunk Incremental IDR/QR LDA (c-QR/IncLDA) for multiple data samples incremental learning. Given a chunk of s samples for incremental learning, the proposed c-QR/IncLDA increments current discriminant model Ω, by implementing computation on the compressed the residue matrix Δ ϵ Rd×n, instead of the entire incoming data chunk X ϵ Rd×s, where η ≤ s holds. Meanwhile, we derive a more accurate reduced within-class scatter matrix W to minimize the discriminative information loss at every incremental learning cycle. It is noted that the computational complexity of c-QR/IncLDA can be more expensive than s-QR/IncLDA for single sample processing. However, for multiple samples processing, the computational efficiency of c-QR/IncLDA deterministically surpasses s-QR/IncLDA when the chunk size is large, i.e., s ≫ η holds. Moreover, experiments evaluation shows that the proposed c-QR/IncLDA can achieve an accuracy level that is competitive to batch QR/LDA and is consistently higher than s-QR/IncLDA.","PeriodicalId":376975,"journal":{"name":"The 2013 International Joint Conference on Neural Networks (IJCNN)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122920618","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A comparative analysis of dissimilarity measures for clustering categorical data
Pub Date: 2013-08-01 | DOI: 10.1109/IJCNN.2013.6707039
J. C. Xavier, A. Canuto, N. D. Almeida, L. Gonçalves
Similarity and dissimilarity (distance) between objects is an important aspect that must be considered when clustering data. When clustering categorical data, in particular, these distance (similarity or dissimilarity) measures need to properly address the real particularities of categorical data. In this paper, we perform a comparative analysis of four different dissimilarity measures used as distance metrics for clustering categorical data. The first is the Simple Matching Dissimilarity Measure (SMDM), one of the simplest and most widely used metrics for categorical attributes. The next two are context-based approaches (DIstance Learning in Categorical Attributes, DILCA, and Domain Value Dissimilarity, DVD), and the last is an extension of SMDM proposed in this paper. All four dissimilarities are applied as distance metrics in two well-known clustering algorithms: k-means and agglomerative hierarchical clustering. In this analysis, we also use internal and external cluster validity measures, aiming to compare the effectiveness of all four distance measures in both clustering algorithms.
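For reference, the simple matching dissimilarity between two categorical objects is just the count of attributes on which they disagree, often normalized by the number of attributes. A minimal sketch of the normalized variant:

```python
def simple_matching_dissimilarity(x, y):
    """Fraction of categorical attributes on which two objects disagree."""
    assert len(x) == len(y)
    return sum(a != b for a, b in zip(x, y)) / len(x)

# Example: the objects differ on one of three attributes -> 1/3
simple_matching_dissimilarity(["red", "small", "round"],
                              ["red", "large", "round"])
```

Its weakness, which motivates context-based measures such as DILCA and DVD, is that it treats every mismatch as equally severe regardless of how the values are distributed in the data.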
{"title":"A comparative analysis of dissimilarity measures for clustering categorical data","authors":"J. C. Xavier, A. Canuto, N. D. Almeida, L. Gonçalves","doi":"10.1109/IJCNN.2013.6707039","DOIUrl":"https://doi.org/10.1109/IJCNN.2013.6707039","url":null,"abstract":"Similarity and dissimilarity (distance) between objects is an important aspect that must be considered when clustering data. When clustering categorical data, for instance, these distance (similarity or dissimilarity) measures need to address properly the real particularities of categorical data. In this paper, we perform a comparative analysis with four different dissimilarity measures used as a distance metric for clustering categorical data. The first one is the Simple Matching Dissimilarity Measure (SMDM), which is one of the simplest and the most used metric for categorical attribute. The other two are context-based approaches (DIstance Learning in Categorical Attributes - DILCA and Domain Value Dissimilarity-DVD), and the last one is an extension of the SMDM, which is proposed in this paper. All four dissimilarities are applied as distance metrics in two well known clustering algorithms, k-means and agglomerative hierarchical clustering algorithms. In this analysis, we also use internal and external cluster validity measures, aiming to compare the effectiveness of all four distance measures in both clustering algorithms.","PeriodicalId":376975,"journal":{"name":"The 2013 International Joint Conference on Neural Networks (IJCNN)","volume":"63 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122438294","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A first analysis of the effect of local and global optimization weights methods in the cooperative-competitive design of RBFN for imbalanced environments
Pub Date: 2013-08-01 | DOI: 10.1109/IJCNN.2013.6706973
M. D. Pérez-Godoy, A. J. Rivera, M. J. Jesús, F. Martínez
Many real applications involve data sets in which the distribution of the classes is significantly different; such data sets are commonly known as imbalanced data sets. Approaches that address this problem can be categorized into two types: data-based approaches, which resample the problem data in a preprocessing phase, and algorithm-based approaches, which modify or create new methods to address the imbalance. In this paper, CO2RBFN, a cooperative-competitive design method for Radial Basis Function Networks that has previously demonstrated good behaviour on imbalanced data sets, is tested with two different weight-training algorithms, one local and one global, in order to gain knowledge about this problem. We conclude that the more global weight-optimization algorithm obtains worse results.
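One plausible reading of the local/global distinction, sketched below on toy data: the global algorithm optimizes all RBFN output weights jointly by least squares, while the local one fits each weight from its own neuron's activations alone. The data, centers, and width are illustrative, not the paper's CO2RBFN setup:

```python
import numpy as np

def rbf_design(X, centers, width):
    # Gaussian activations of every basis function for every sample
    d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(-1)
    return np.exp(-d2 / (2 * width ** 2))

rng = np.random.default_rng(2)
X = rng.uniform(-1, 1, (300, 2))
y = np.sign(X[:, 0] * X[:, 1])                       # toy two-class targets
centers = X[rng.choice(len(X), 10, replace=False)]
Phi = rbf_design(X, centers, width=0.5)

# global: all output weights optimized jointly over the whole design matrix
w_global, *_ = np.linalg.lstsq(Phi, y, rcond=None)

# local: each weight fitted from its own neuron's activations, ignoring the rest
w_local = (Phi * y[:, None]).sum(0) / (Phi ** 2).sum(0)
```

The local rule keeps each neuron's weight interpretable in terms of the samples it covers, which fits the cooperative-competitive design philosophy better than a fully coupled global fit.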
{"title":"A first analysis of the effect of local and global optimization weights methods in the cooperative-competitive design of RBFN for imbalanced environments","authors":"M. D. Pérez-Godoy, A. J. Rivera, M. J. Jesús, F. Martínez","doi":"10.1109/IJCNN.2013.6706973","DOIUrl":"https://doi.org/10.1109/IJCNN.2013.6706973","url":null,"abstract":"Many real applications are composed of data sets where the distribution of the classes is significantly different. These data sets are commonly known as imbalanced data sets. Proposed approaches that address this problem can be categorized into two types: data-based, which resample problem data in a preprocessing phase and algorithm-based which modify or create new methods to address the imbalance problem. In this paper, CO2 RBFN a cooperative-competitive design method for Radial Basis Function Networks that has previously demonstrated a good behaviour tackling imbalanced data sets, is tested using two different training weights algorithms, local and global, in order to gain knowledge about this problem. As conclusions we can outline that a more global optimizer training algorithm obtains worse results.","PeriodicalId":376975,"journal":{"name":"The 2013 International Joint Conference on Neural Networks (IJCNN)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122487957","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A robot on-line area coverage approach based on the probabilistic Lloyd method
Pub Date: 2013-08-01 | DOI: 10.1109/IJCNN.2013.6707007
M. R. Batista, R. Calvo, R. Romero
Area coverage is a standard problem to which robotics techniques can be applied. One approach solves it with techniques based on Centroidal Voronoi Tessellations (CVT), where each robot is a generator used to build the Voronoi polygons. In this work, a new approach named the Sample Lloyd Area Coverage System (SLACS) is proposed which, building on the probabilistic Lloyd method, estimates a Voronoi polygon's centroid without explicitly constructing the diagram. In addition, a method is proposed for closing Voronoi diagrams so that a classic Lloyd CVT procedure can be applied. Both approaches are compared in empty and room-like environments in simulated tests using the Player interface and the Stage simulator. The results show that the proposed approach is well suited to solving the area coverage problem via mobile sensor deployment and is a simple and effective substitute for a Lloyd CVT method.
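The probabilistic Lloyd idea fits in a few lines: sample random points in the environment, assign each to its nearest robot, and move every robot toward the sample mean of the points it owns, so the centroid is estimated without ever constructing the Voronoi diagram. A minimal sketch, assuming a unit-square environment and Euclidean assignment:

```python
import numpy as np

def sampled_lloyd_step(generators, n_samples=10_000, rng=np.random.default_rng(3)):
    """One Lloyd iteration with Monte Carlo centroid estimation."""
    pts = rng.uniform(0.0, 1.0, (n_samples, 2))      # unit-square environment assumed
    # nearest-generator assignment plays the role of the Voronoi partition
    owner = np.argmin(((pts[:, None, :] - generators[None, :, :]) ** 2).sum(-1), axis=1)
    new = generators.copy()
    for i in range(len(generators)):
        cell = pts[owner == i]
        if len(cell):
            new[i] = cell.mean(axis=0)               # centroid estimate of cell i
    return new

robots = np.random.default_rng(4).uniform(0, 1, (5, 2))
for _ in range(20):                                  # robots spread out over the square
    robots = sampled_lloyd_step(robots)
```

Obstacles or room walls would be handled by rejecting samples that fall outside the free space, which is what makes the sampling formulation attractive for on-line coverage.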
{"title":"A robot on-line area coverage approach based on the probabilistic Lloyd method","authors":"M. R. Batista, R. Calvo, R. Romero","doi":"10.1109/IJCNN.2013.6707007","DOIUrl":"https://doi.org/10.1109/IJCNN.2013.6707007","url":null,"abstract":"Area Coverage is a standard problem in which Robotics techniques can be applied. An approach to solve this problem is through techniques based on Centroidal Voronoi Tesselations (CVT), considering that each robot is a generator used to build Voronoi polygons. In this work, a new approach named by Sample Lloyd Area Coverage System (SLACS), is proposed that does not need of the explicit building of the diagram based in the Probabilistic Lloyd method to estimate a Voronoi polygon's centroid. In addition, it is proposed a method to close Voronoi diagrams to apply in a classic Lloyd CVT procedure. Both approaches are compared in empty and roomlike environments done in simulated tests using both Player interface and Stage simulator. Results obtained show that the proposed approach is well suited to solve the area coverage problem via mobile sensor deployment and it is a simple and effective substitute to a Lloyd CVT method.","PeriodicalId":376975,"journal":{"name":"The 2013 International Joint Conference on Neural Networks (IJCNN)","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114186720","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Some results about the Vapnik-Chervonenkis entropy and the Rademacher complexity
Pub Date: 2013-08-01 | DOI: 10.1109/IJCNN.2013.6706943
D. Anguita, A. Ghio, L. Oneto, S. Ridella
This paper deals with the problem of identifying a connection between the Vapnik-Chervonenkis (VC) Entropy, a notion of complexity introduced by Vapnik in his seminal work, and the Rademacher Complexity, a more powerful notion of complexity that has been in the limelight of several recent works in the Machine Learning literature. In order to establish this connection, we refine some previously known relationships and derive a new result. Our proposal allows computing an admissible range for the Rademacher Complexity given a value of the VC-Entropy, and vice versa, thereby opening appealing new research perspectives in the field of assessing the complexity of a hypothesis space.
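As a reminder of the quantity involved, the empirical Rademacher complexity of a hypothesis class H on a sample of size n is R_n(H) = E_sigma sup_{h in H} (1/n) sum_i sigma_i h(x_i), with i.i.d. uniform sigma_i in {-1, +1}. A Monte Carlo sketch for a finite class given as a matrix of its predictions; the finite-class setting is an illustrative simplification:

```python
import numpy as np

def empirical_rademacher(predictions, n_draws=2000, rng=np.random.default_rng(5)):
    """Monte Carlo estimate of the empirical Rademacher complexity.

    `predictions` is an (|H|, n) array holding h(x_i) in {-1, +1}
    for every hypothesis h in a finite class H.
    """
    H, n = predictions.shape
    sigma = rng.choice([-1.0, 1.0], size=(n_draws, n))   # random Rademacher signs
    # sup over h of the average correlation with the signs, averaged over draws
    return (sigma @ predictions.T / n).max(axis=1).mean()
```

The richer the class, the better some hypothesis can correlate with pure noise, which is why this quantity upper-bounds the gap between training and test error.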
{"title":"Some results about the Vapnik-Chervonenkis entropy and the rademacher complexity","authors":"D. Anguita, A. Ghio, L. Oneto, S. Ridella","doi":"10.1109/IJCNN.2013.6706943","DOIUrl":"https://doi.org/10.1109/IJCNN.2013.6706943","url":null,"abstract":"This paper deals with the problem of identifying a connection between the Vapnik-Chervonenkis (VC) Entropy, a notion of complexity introduced by Vapnik in his seminal work, and the Rademacher Complexity, a more powerful notion of complexity, which has been in the limelight of several works in the recent Machine Learning literature. In order to establish this connection, we refine some previously known relationships and derive a new result. Our proposal allows computing an admissible range for the Rademacher Complexity, given a value of the VC-Entropy, and vice versa, therefore opening new appealing research perspectives in the field of assessing the complexity of an hypothesis space.","PeriodicalId":376975,"journal":{"name":"The 2013 International Joint Conference on Neural Networks (IJCNN)","volume":"102 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114264424","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Learning population of spiking neural networks with perturbation of conductances
Pub Date: 2013-08-01 | DOI: 10.1109/IJCNN.2013.6706756
Piotr Suszynski, Pawel Wawrzynski
In this paper a method is presented for learning in spiking neural networks. It is based on perturbation of synaptic conductances. While this approach is known to be model-free, it is also known to be slow, because it relies on improvement-direction estimates with large variance. Two ideas are analysed to alleviate this problem: first, learning many networks at the same time instead of one; second, autocorrelation of the perturbations in time. In the experimental study the method is validated on three learning tasks in which information is conveyed by firing frequency and by spike timing.
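A generic sketch of the population idea: perturb the parameter vector of many copies of the network at once and move against the loss-weighted perturbations. This is plain weight perturbation on an abstract parameter vector, not the paper's conductance-level spiking formulation, and sigma, lr, and n_networks are illustrative:

```python
import numpy as np

def perturbation_step(theta, loss, sigma=0.01, lr=0.1, n_networks=32,
                      rng=np.random.default_rng(6)):
    """One update of parallel perturbation learning.

    theta : flat 1-D parameter vector (e.g. synaptic conductances)
    loss  : any scalar function of such a vector
    """
    eps = sigma * rng.standard_normal((n_networks, theta.size))
    losses = np.array([loss(theta + e) for e in eps])    # evaluate the population
    baseline = losses.mean()                             # variance reduction
    # average the perturbations weighted by how much they changed the loss
    grad_est = ((losses - baseline)[:, None] * eps).mean(0) / sigma ** 2
    return theta - lr * grad_est
```

Averaging over a population of n_networks perturbed copies divides the variance of the direction estimate by roughly n_networks, which is the speed-up the first idea targets; the second idea (temporally autocorrelated perturbations) would replace the i.i.d. draws of eps with a correlated noise process.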
{"title":"Learning population of spiking neural networks with perturbation of conductances","authors":"Piotr Suszynski, Pawel Wawrzynski","doi":"10.1109/IJCNN.2013.6706756","DOIUrl":"https://doi.org/10.1109/IJCNN.2013.6706756","url":null,"abstract":"In this paper a method is presented for learning of spiking neural networks. It is based on perturbation of synaptic conductances. While this approach is known to be model-free, it is also known to be slow, because it applies improvement direction estimates with large variance. Two ideas are analysed to alleviate this problem: First, learning of many networks at the same time instead of one. Second, autocorrelation of perturbations in time. In the experimental study the method is validated on three learning tasks in which information is conveyed with frequency and spike timing.","PeriodicalId":376975,"journal":{"name":"The 2013 International Joint Conference on Neural Networks (IJCNN)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114601319","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Cognitive computing programming paradigm: A Corelet Language for composing networks of neurosynaptic cores
Pub Date: 2013-08-01 | DOI: 10.1109/IJCNN.2013.6707078
A. Amir, Pallab Datta, W. Risk, A. Cassidy, J. Kusnitz, Steven K. Esser, Alexander Andreopoulos, T. Wong, M. Flickner, Rodrigo Alvarez-Icaza, E. McQuinn, Ben Shaw, Norm Pass, D. Modha
Marching along the DARPA SyNAPSE roadmap, IBM unveils a trilogy of innovations towards the TrueNorth cognitive computing system inspired by the brain's function and efficiency. The sequential programming paradigm of the von Neumann architecture is wholly unsuited for TrueNorth. Therefore, as our main contribution, we develop a new programming paradigm that permits construction of complex cognitive algorithms and applications while being efficient for TrueNorth and effective for programmer productivity. The programming paradigm consists of (a) an abstraction for a TrueNorth program, named Corelet, for representing a network of neurosynaptic cores that encapsulates all details except external inputs and outputs; (b) an object-oriented Corelet Language for creating, composing, and decomposing corelets; (c) a Corelet Library that acts as an ever-growing repository of reusable corelets from which programmers compose new corelets; and (d) an end-to-end Corelet Laboratory that is a programming environment which integrates with the TrueNorth architectural simulator, Compass, to support all aspects of the programming cycle from design, through development, debugging, and up to deployment. The new paradigm seamlessly scales from a handful of synapses and neurons to networks of neurosynaptic cores of progressively increasing size and complexity. The utility of the new programming paradigm is underscored by the fact that we have designed and implemented more than 100 algorithms as corelets for TrueNorth in a very short time span.
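To illustrate the encapsulation-and-composition idea only, here is a toy Python sketch of corelet-like composition. This is emphatically not IBM's Corelet Language or API; every class, method, and name below is hypothetical:

```python
class Corelet:
    """Toy stand-in for a corelet: a named block of neurosynaptic cores that
    hides its internals and exposes only input and output connectors."""

    def __init__(self, name, inputs, outputs, cores=1):
        self.name, self.inputs, self.outputs, self.cores = name, inputs, outputs, cores

    def compose(self, other, wiring):
        """Wire this corelet's outputs to another's inputs.

        Both corelets stay encapsulated; only the pin mapping is exposed,
        and the result is itself a corelet that can be composed further.
        """
        assert all(o in self.outputs and i in other.inputs
                   for o, i in wiring.items())
        return Corelet(f"{self.name}->{other.name}",
                       self.inputs, other.outputs,
                       cores=self.cores + other.cores)

# hypothetical library corelets composed into a larger one
edge = Corelet("edge_detect", inputs=["pixels"], outputs=["edges"])
motion = Corelet("motion", inputs=["edges"], outputs=["flow"])
pipeline = edge.compose(motion, {"edges": "edges"})
```

The key property the sketch tries to capture is that composition is closed: composing corelets yields another corelet, which is what lets the paradigm scale from single cores to large networks.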
{"title":"Cognitive computing programming paradigm: A Corelet Language for composing networks of neurosynaptic cores","authors":"A. Amir, Pallab Datta, W. Risk, A. Cassidy, J. Kusnitz, Steven K. Esser, Alexander Andreopoulos, T. Wong, M. Flickner, Rodrigo Alvarez-Icaza, E. McQuinn, Ben Shaw, Norm Pass, D. Modha","doi":"10.1109/IJCNN.2013.6707078","DOIUrl":"https://doi.org/10.1109/IJCNN.2013.6707078","url":null,"abstract":"Marching along the DARPA SyNAPSE roadmap, IBM unveils a trilogy of innovations towards the TrueNorth cognitive computing system inspired by the brain's function and efficiency. The sequential programming paradigm of the von Neumann architecture is wholly unsuited for TrueNorth. Therefore, as our main contribution, we develop a new programming paradigm that permits construction of complex cognitive algorithms and applications while being efficient for TrueNorth and effective for programmer productivity. The programming paradigm consists of (a) an abstraction for a TrueNorth program, named Corelet, for representing a network of neurosynaptic cores that encapsulates all details except external inputs and outputs; (b) an object-oriented Corelet Language for creating, composing, and decomposing corelets; (c) a Corelet Library that acts as an ever-growing repository of reusable corelets from which programmers compose new corelets; and (d) an end-to-end Corelet Laboratory that is a programming environment which integrates with the TrueNorth architectural simulator, Compass, to support all aspects of the programming cycle from design, through development, debugging, and up to deployment. The new paradigm seamlessly scales from a handful of synapses and neurons to networks of neurosynaptic cores of progressively increasing size and complexity. The utility of the new programming paradigm is underscored by the fact that we have designed and implemented more than 100 algorithms as corelets for TrueNorth in a very short time span.","PeriodicalId":376975,"journal":{"name":"The 2013 International Joint Conference on Neural Networks (IJCNN)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122079403","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Gaussian-Bernoulli deep Boltzmann machine
Pub Date: 2013-08-01 | DOI: 10.1109/IJCNN.2013.6706831
Kyunghyun Cho, T. Raiko, A. Ilin
In this paper, we study a model that we call the Gaussian-Bernoulli deep Boltzmann machine (GDBM) and discuss potential improvements in training it. The GDBM is designed to be applicable to continuous data and is constructed from the Gaussian-Bernoulli restricted Boltzmann machine (GRBM) by adding multiple layers of binary hidden neurons. The studied improvements to the GDBM learning algorithm include parallel tempering, the enhanced gradient, an adaptive learning rate, and layer-wise pretraining. We show empirically that they help avoid some of the common difficulties in training deep Boltzmann machines, such as divergence of learning, the difficulty of choosing the right learning-rate schedule, and the existence of meaningless higher layers.
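For context, the commonly used energy of a two-hidden-layer Gaussian-Bernoulli DBM, with Gaussian visible units v, binary hidden layers h^(1) and h^(2), and per-visible variances sigma_i^2; the paper's exact parameterization may differ in details:

```latex
E(\mathbf{v},\mathbf{h}^{(1)},\mathbf{h}^{(2)}) =
    \sum_i \frac{(v_i - b_i)^2}{2\sigma_i^2}
  - \sum_{i,j} \frac{v_i}{\sigma_i^2}\, W^{(1)}_{ij} h^{(1)}_j
  - \sum_{j,k} h^{(1)}_j W^{(2)}_{jk} h^{(2)}_k
  - \sum_j c^{(1)}_j h^{(1)}_j
  - \sum_k c^{(2)}_k h^{(2)}_k
```

The first term is what distinguishes the model from a purely binary DBM: it lets the visible layer represent continuous data, while the remaining bilinear terms are the standard layer-to-layer couplings that the deeper binary layers add.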
{"title":"Gaussian-Bernoulli deep Boltzmann machine","authors":"Kyunghyun Cho, T. Raiko, A. Ilin","doi":"10.1109/IJCNN.2013.6706831","DOIUrl":"https://doi.org/10.1109/IJCNN.2013.6706831","url":null,"abstract":"In this paper, we study a model that we call Gaussian-Bernoulli deep Boltzmann machine (GDBM) and discuss potential improvements in training the model. GDBM is designed to be applicable to continuous data and it is constructed from Gaussian-Bernoulli restricted Boltzmann machine (GRBM) by adding multiple layers of binary hidden neurons. The studied improvements of the learning algorithm for GDBM include parallel tempering, enhanced gradient, adaptive learning rate and layer-wise pretraining. We empirically show that they help avoid some of the common difficulties found in training deep Boltzmann machines such as divergence of learning, the difficulty in choosing right learning rate scheduling, and the existence of meaningless higher layers.","PeriodicalId":376975,"journal":{"name":"The 2013 International Joint Conference on Neural Networks (IJCNN)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122120981","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}