首页 > 最新文献

IEEE transactions on neural networks最新文献

英文 中文
Unified development of multiplicative algorithms for linear and quadratic nonnegative matrix factorization. 线性和二次非负矩阵分解的乘法算法的统一发展。
Pub Date : 2011-12-01 Epub Date: 2011-10-17 DOI: 10.1109/TNN.2011.2170094
Zhirong Yang, Erkki Oja

Multiplicative updates have been widely used in approximative nonnegative matrix factorization (NMF) optimization because they are convenient to deploy. Their convergence proof is usually based on the minimization of an auxiliary upper-bounding function, the construction of which however remains specific and only available for limited types of dissimilarity measures. Here we make significant progress in developing convergent multiplicative algorithms for NMF. First, we propose a general approach to derive the auxiliary function for a wide variety of NMF problems, as long as the approximation objective can be expressed as a finite sum of monomials with real exponents. Multiplicative algorithms with theoretical guarantee of monotonically decreasing objective function sequence can thus be obtained. The solutions of NMF based on most commonly used dissimilarity measures such as α- and β-divergence as well as many other more comprehensive divergences can be derived by the new unified principle. Second, our method is extended to a nonseparable case that includes e.g., γ-divergence and Rényi divergence. Third, we develop multiplicative algorithms for NMF using second-order approximative factorizations, in which each factorizing matrix may appear twice. Preliminary numerical experiments demonstrate that the multiplicative algorithms developed using the proposed procedure can achieve satisfactory Karush-Kuhn-Tucker optimality. We also demonstrate NMF problems where algorithms by the conventional method fail to guarantee descent at each iteration but those by our principle are immune to such violation.

乘法更新由于便于部署,在近似非负矩阵分解优化中得到了广泛的应用。它们的收敛性证明通常是基于一个辅助上限函数的最小化,然而它的构造仍然是特定的,并且只适用于有限类型的不相似测度。在这里,我们在NMF的收敛乘法算法的开发方面取得了重大进展。首先,我们提出了一种推导辅助函数的一般方法,适用于各种各样的NMF问题,只要近似目标可以表示为具有实数指数的单项式的有限和。由此可以得到具有目标函数序列单调递减理论保证的乘法算法。利用新的统一原理,可以推导出基于α-散度和β-散度等最常用的非相似性测度以及许多其他更全面的散度的NMF解。其次,我们的方法被推广到不可分离的情况下,包括例如,γ-散度和r逍遥散。第三,我们使用二阶近似分解开发了NMF的乘法算法,其中每个分解矩阵可以出现两次。初步的数值实验表明,利用该方法开发的乘法算法可以达到满意的Karush-Kuhn-Tucker最优性。我们还演示了NMF问题,其中传统方法的算法不能保证每次迭代的下降,但我们的原理的算法不受这种破坏。
{"title":"Unified development of multiplicative algorithms for linear and quadratic nonnegative matrix factorization.","authors":"Zhirong Yang,&nbsp;Erkki Oja","doi":"10.1109/TNN.2011.2170094","DOIUrl":"https://doi.org/10.1109/TNN.2011.2170094","url":null,"abstract":"<p><p>Multiplicative updates have been widely used in approximative nonnegative matrix factorization (NMF) optimization because they are convenient to deploy. Their convergence proof is usually based on the minimization of an auxiliary upper-bounding function, the construction of which however remains specific and only available for limited types of dissimilarity measures. Here we make significant progress in developing convergent multiplicative algorithms for NMF. First, we propose a general approach to derive the auxiliary function for a wide variety of NMF problems, as long as the approximation objective can be expressed as a finite sum of monomials with real exponents. Multiplicative algorithms with theoretical guarantee of monotonically decreasing objective function sequence can thus be obtained. The solutions of NMF based on most commonly used dissimilarity measures such as α- and β-divergence as well as many other more comprehensive divergences can be derived by the new unified principle. Second, our method is extended to a nonseparable case that includes e.g., γ-divergence and Rényi divergence. Third, we develop multiplicative algorithms for NMF using second-order approximative factorizations, in which each factorizing matrix may appear twice. Preliminary numerical experiments demonstrate that the multiplicative algorithms developed using the proposed procedure can achieve satisfactory Karush-Kuhn-Tucker optimality. We also demonstrate NMF problems where algorithms by the conventional method fail to guarantee descent at each iteration but those by our principle are immune to such violation.</p>","PeriodicalId":13434,"journal":{"name":"IEEE transactions on neural networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/TNN.2011.2170094","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"30072528","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 72
Data-based identification and control of nonlinear systems via piecewise affine approximation. 基于数据的非线性系统分段仿射逼近辨识与控制。
Pub Date : 2011-12-01 Epub Date: 2011-11-30 DOI: 10.1109/TNN.2011.2175946
Chow Yin Lai, Cheng Xiang, Tong Heng Lee

The piecewise affine (PWA) model represents an attractive model structure for approximating nonlinear systems. In this paper, a procedure for obtaining the PWA autoregressive exogenous (ARX) (autoregressive systems with exogenous inputs) models of nonlinear systems is proposed. Two key parameters defining a PWARX model, namely, the parameters of locally affine subsystems and the partition of the regressor space, are estimated, the former through a least-squares-based identification method using multiple models, and the latter using standard procedures such as neural network classifier or support vector machine classifier. Having obtained the PWARX model of the nonlinear system, a controller is then derived to control the system for reference tracking. Both simulation and experimental studies show that the proposed algorithm can indeed provide accurate PWA approximation of nonlinear systems, and the designed controller provides good tracking performance.

分段仿射(PWA)模型是一种有吸引力的模型结构,用于逼近非线性系统。本文提出了一种获取非线性系统的PWA自回归外生(ARX)模型的方法。定义PWARX模型的两个关键参数,即局部仿射子系统的参数和回归量空间的划分,前者通过使用多模型的基于最小二乘的识别方法进行估计,后者使用神经网络分类器或支持向量机分类器等标准程序进行估计。在得到非线性系统的PWARX模型后,推导出控制器对系统进行参考跟踪控制。仿真和实验研究均表明,该算法确实能够对非线性系统提供精确的PWA逼近,所设计的控制器具有良好的跟踪性能。
{"title":"Data-based identification and control of nonlinear systems via piecewise affine approximation.","authors":"Chow Yin Lai,&nbsp;Cheng Xiang,&nbsp;Tong Heng Lee","doi":"10.1109/TNN.2011.2175946","DOIUrl":"https://doi.org/10.1109/TNN.2011.2175946","url":null,"abstract":"<p><p>The piecewise affine (PWA) model represents an attractive model structure for approximating nonlinear systems. In this paper, a procedure for obtaining the PWA autoregressive exogenous (ARX) (autoregressive systems with exogenous inputs) models of nonlinear systems is proposed. Two key parameters defining a PWARX model, namely, the parameters of locally affine subsystems and the partition of the regressor space, are estimated, the former through a least-squares-based identification method using multiple models, and the latter using standard procedures such as neural network classifier or support vector machine classifier. Having obtained the PWARX model of the nonlinear system, a controller is then derived to control the system for reference tracking. Both simulation and experimental studies show that the proposed algorithm can indeed provide accurate PWA approximation of nonlinear systems, and the designed controller provides good tracking performance.</p>","PeriodicalId":13434,"journal":{"name":"IEEE transactions on neural networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/TNN.2011.2175946","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"30307609","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 23
Synchronization of continuous dynamical networks with discrete-time communications. 离散时间通信连续动态网络的同步。
Pub Date : 2011-12-01 Epub Date: 2011-10-25 DOI: 10.1109/TNN.2011.2171501
Yan-Wu Wang, Jiang-Wen Xiao, Changyun Wen, Zhi-Hong Guan

In this paper, synchronization of continuous dynamical networks with discrete-time communications is studied. Though the dynamical behavior of each node is continuous-time, the communications between every two different nodes are discrete-time, i.e., they are active only at some discrete time instants. Moreover, the communication intervals between every two communication instants can be uncertain and variable. By choosing a piecewise Lyapunov-Krasovskii functional to govern the characteristics of the discrete communication instants and by utilizing a convex combination technique, a synchronization criterion is derived in terms of linear matrix inequalities with an upper bound for the communication intervals obtained. The results extend and improve upon earlier work. Simulation results show the effectiveness of the proposed communication scheme. Some relationships between the allowable upper bound of communication intervals and the coupling strength of the network are illustrated through simulations on a fully connected network, a star-like network, and a nearest neighbor network.

研究了具有离散时间通信的连续动态网络的同步问题。虽然每个节点的动态行为是连续时间的,但每两个不同节点之间的通信是离散时间的,即它们只在某些离散的时间瞬间是活动的。此外,每两个通信瞬间之间的通信间隔可能是不确定的和可变的。通过选择分段Lyapunov-Krasovskii泛函来控制离散通信瞬间的特征,并利用凸组合技术,导出了基于线性矩阵不等式的同步准则,该准则具有所获得的通信区间的上界。结果扩展和改进了早期的工作。仿真结果表明了该通信方案的有效性。通过对全连通网络、星形网络和最近邻网络的仿真,说明了通信间隔允许上界与网络耦合强度之间的关系。
{"title":"Synchronization of continuous dynamical networks with discrete-time communications.","authors":"Yan-Wu Wang,&nbsp;Jiang-Wen Xiao,&nbsp;Changyun Wen,&nbsp;Zhi-Hong Guan","doi":"10.1109/TNN.2011.2171501","DOIUrl":"https://doi.org/10.1109/TNN.2011.2171501","url":null,"abstract":"<p><p>In this paper, synchronization of continuous dynamical networks with discrete-time communications is studied. Though the dynamical behavior of each node is continuous-time, the communications between every two different nodes are discrete-time, i.e., they are active only at some discrete time instants. Moreover, the communication intervals between every two communication instants can be uncertain and variable. By choosing a piecewise Lyapunov-Krasovskii functional to govern the characteristics of the discrete communication instants and by utilizing a convex combination technique, a synchronization criterion is derived in terms of linear matrix inequalities with an upper bound for the communication intervals obtained. The results extend and improve upon earlier work. Simulation results show the effectiveness of the proposed communication scheme. Some relationships between the allowable upper bound of communication intervals and the coupling strength of the network are illustrated through simulations on a fully connected network, a star-like network, and a nearest neighbor network.</p>","PeriodicalId":13434,"journal":{"name":"IEEE transactions on neural networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/TNN.2011.2171501","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"40117954","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 52
Real-time vector quantization and clustering based on ordinary differential equations. 基于常微分方程的实时矢量量化和聚类。
Pub Date : 2011-12-01 Epub Date: 2011-10-31 DOI: 10.1109/TNN.2011.2172627
Jie Cheng, Mohammad R Sayeh, Mehdi R Zargham, Qiang Cheng

This brief presents a dynamical system approach to vector quantization or clustering based on ordinary differential equations with the potential for real-time implementation. Two examples of different pattern clusters demonstrate that the model can successfully quantize different types of input patterns. Furthermore, we analyze and study the stability of our dynamical system. By discovering the equilibrium points for certain input patterns and analyzing their stability, we have shown the quantizing behavior of the system with respect to its vigilance parameter. The proposed system is applied to two real-world problems, providing comparable results to the best reported findings. This validates the effectiveness of our proposed approach.

本文简要介绍了一种基于常微分方程的矢量量化或聚类的动态系统方法,具有实时实现的潜力。两个不同模式聚类的实例表明,该模型可以成功地量化不同类型的输入模式。此外,我们还分析和研究了动力系统的稳定性。通过发现某些输入模式的平衡点并分析其稳定性,我们给出了系统对其警戒参数的量化行为。提出的系统应用于两个现实世界的问题,提供了可比较的结果,最好的报告结果。这验证了我们提出的方法的有效性。
{"title":"Real-time vector quantization and clustering based on ordinary differential equations.","authors":"Jie Cheng,&nbsp;Mohammad R Sayeh,&nbsp;Mehdi R Zargham,&nbsp;Qiang Cheng","doi":"10.1109/TNN.2011.2172627","DOIUrl":"https://doi.org/10.1109/TNN.2011.2172627","url":null,"abstract":"<p><p>This brief presents a dynamical system approach to vector quantization or clustering based on ordinary differential equations with the potential for real-time implementation. Two examples of different pattern clusters demonstrate that the model can successfully quantize different types of input patterns. Furthermore, we analyze and study the stability of our dynamical system. By discovering the equilibrium points for certain input patterns and analyzing their stability, we have shown the quantizing behavior of the system with respect to its vigilance parameter. The proposed system is applied to two real-world problems, providing comparable results to the best reported findings. This validates the effectiveness of our proposed approach.</p>","PeriodicalId":13434,"journal":{"name":"IEEE transactions on neural networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/TNN.2011.2172627","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"40131704","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
A one-layer recurrent neural network for pseudoconvex optimization subject to linear equality constraints. 基于线性等式约束的一层递归神经网络拟凸优化。
Pub Date : 2011-12-01 Epub Date: 2011-10-31 DOI: 10.1109/TNN.2011.2169682
Zhishan Guo, Qingshan Liu, Jun Wang

In this paper, a one-layer recurrent neural network is presented for solving pseudoconvex optimization problems subject to linear equality constraints. The global convergence of the neural network can be guaranteed even though the objective function is pseudoconvex. The finite-time state convergence to the feasible region defined by the equality constraints is also proved. In addition, global exponential convergence is proved when the objective function is strongly pseudoconvex on the feasible region. Simulation results on illustrative examples and application on chemical process data reconciliation are provided to demonstrate the effectiveness and characteristics of the neural network.

本文提出了一种单层递归神经网络,用于求解线性等式约束下的伪凸优化问题。即使目标函数为伪凸,也能保证神经网络的全局收敛性。并证明了由等式约束定义的可行域的有限时间状态收敛性。此外,还证明了当目标函数在可行域上为强伪凸时,算法具有全局指数收敛性。通过实例仿真和在化工过程数据协调中的应用,验证了该神经网络的有效性和特点。
{"title":"A one-layer recurrent neural network for pseudoconvex optimization subject to linear equality constraints.","authors":"Zhishan Guo,&nbsp;Qingshan Liu,&nbsp;Jun Wang","doi":"10.1109/TNN.2011.2169682","DOIUrl":"https://doi.org/10.1109/TNN.2011.2169682","url":null,"abstract":"<p><p>In this paper, a one-layer recurrent neural network is presented for solving pseudoconvex optimization problems subject to linear equality constraints. The global convergence of the neural network can be guaranteed even though the objective function is pseudoconvex. The finite-time state convergence to the feasible region defined by the equality constraints is also proved. In addition, global exponential convergence is proved when the objective function is strongly pseudoconvex on the feasible region. Simulation results on illustrative examples and application on chemical process data reconciliation are provided to demonstrate the effectiveness and characteristics of the neural network.</p>","PeriodicalId":13434,"journal":{"name":"IEEE transactions on neural networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/TNN.2011.2169682","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"40132306","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 93
Classifiability-based discriminatory projection pursuit. 基于可分类性的歧视性投射追踪。
Pub Date : 2011-12-01 Epub Date: 2011-10-20 DOI: 10.1109/TNN.2011.2170220
Yu Su, Shiguang Shan, Xilin Chen, Wen Gao

Fisher's linear discriminant (FLD) is one of the most widely used linear feature extraction method, especially in many visual computation tasks. Based on the analysis on several limitations of the traditional FLD, this paper attempts to propose a new computational paradigm for discriminative linear feature extraction, named "classifiability-based discriminatory projection pursuit" (CDPP), which is different from the traditional FLD and its variants. There are two steps in the proposed CDPP: one is the construction of a candidate projection set (CPS), and the other is the pursuit of discriminatory projections. Specifically, in the former step, candidate projections are generated by using the nearest between-class boundary samples, while the latter is efficiently achieved by classifiability-based AdaBoost learning from the CPS. We show that the new "projection pursuit" paradigm not only does not suffer from the limitations of the traditional FLD but also inherits good generalizability from the boundary attribute of candidate projections. Extensive experiments on both synthetic and real datasets validate the effectiveness of CDPP for discriminative linear feature extraction.

Fisher线性判别法(FLD)是一种应用最广泛的线性特征提取方法,特别是在许多视觉计算任务中。本文在分析传统FLD的局限性的基础上,提出了一种区别于传统FLD及其变体的判别性线性特征提取计算范式,即基于可分类性的判别性投影寻踪(CDPP)。该方法分为两个步骤:一是构建候选投影集(CPS),二是追求歧视性投影。具体来说,在前一步中,候选预测是通过使用最接近的类间边界样本生成的,而后一步是通过基于可分类性的AdaBoost从CPS中学习有效地实现的。研究表明,新的“投影寻踪”范式不仅克服了传统FLD的局限性,而且继承了候选投影边界属性的良好泛化性。在合成数据集和真实数据集上的大量实验验证了CDPP在判别线性特征提取方面的有效性。
{"title":"Classifiability-based discriminatory projection pursuit.","authors":"Yu Su,&nbsp;Shiguang Shan,&nbsp;Xilin Chen,&nbsp;Wen Gao","doi":"10.1109/TNN.2011.2170220","DOIUrl":"https://doi.org/10.1109/TNN.2011.2170220","url":null,"abstract":"<p><p>Fisher's linear discriminant (FLD) is one of the most widely used linear feature extraction method, especially in many visual computation tasks. Based on the analysis on several limitations of the traditional FLD, this paper attempts to propose a new computational paradigm for discriminative linear feature extraction, named \"classifiability-based discriminatory projection pursuit\" (CDPP), which is different from the traditional FLD and its variants. There are two steps in the proposed CDPP: one is the construction of a candidate projection set (CPS), and the other is the pursuit of discriminatory projections. Specifically, in the former step, candidate projections are generated by using the nearest between-class boundary samples, while the latter is efficiently achieved by classifiability-based AdaBoost learning from the CPS. We show that the new \"projection pursuit\" paradigm not only does not suffer from the limitations of the traditional FLD but also inherits good generalizability from the boundary attribute of candidate projections. Extensive experiments on both synthetic and real datasets validate the effectiveness of CDPP for discriminative linear feature extraction.</p>","PeriodicalId":13434,"journal":{"name":"IEEE transactions on neural networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/TNN.2011.2170220","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"30233684","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Fast independent component analysis algorithm for quaternion valued signals. 四元数值信号的快速独立分量分析算法。
Pub Date : 2011-12-01 Epub Date: 2011-10-20 DOI: 10.1109/TNN.2011.2171362
Soroush Javidi, Clive Cheong Took, Danilo P Mandic

An extension of the fast independent component analysis algorithm is proposed for the blind separation of both Q-proper and Q-improper quaternion-valued signals. This is achieved by maximizing a negentropy-based cost function, and is derived rigorously using the recently developed HR calculus in order to implement Newton optimization in the augmented quaternion statistics framework. It is shown that the use of augmented statistics and the associated widely linear modeling provides theoretical and practical advantages when dealing with general quaternion signals with noncircular (rotation-dependent) distributions. Simulations using both benchmark and real-world quaternion-valued signals support the approach.

提出了一种快速独立分量分析算法的扩展,用于盲分离q -固有和q -非固有四元数值信号。这是通过最大化基于负熵的成本函数来实现的,并且是使用最近开发的HR演算严格推导出来的,以便在增广四元数统计框架中实现牛顿优化。结果表明,在处理具有非圆(旋转相关)分布的一般四元数信号时,增广统计和相关的广泛线性建模的使用提供了理论和实践优势。使用基准和现实世界四元数值信号的模拟都支持这种方法。
{"title":"Fast independent component analysis algorithm for quaternion valued signals.","authors":"Soroush Javidi,&nbsp;Clive Cheong Took,&nbsp;Danilo P Mandic","doi":"10.1109/TNN.2011.2171362","DOIUrl":"https://doi.org/10.1109/TNN.2011.2171362","url":null,"abstract":"<p><p>An extension of the fast independent component analysis algorithm is proposed for the blind separation of both Q-proper and Q-improper quaternion-valued signals. This is achieved by maximizing a negentropy-based cost function, and is derived rigorously using the recently developed HR calculus in order to implement Newton optimization in the augmented quaternion statistics framework. It is shown that the use of augmented statistics and the associated widely linear modeling provides theoretical and practical advantages when dealing with general quaternion signals with noncircular (rotation-dependent) distributions. Simulations using both benchmark and real-world quaternion-valued signals support the approach.</p>","PeriodicalId":13434,"journal":{"name":"IEEE transactions on neural networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/TNN.2011.2171362","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"30233686","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 54
Learning and representing temporal knowledge in recurrent networks. 在循环网络中学习和表示时间知识。
Pub Date : 2011-12-01 Epub Date: 2011-10-17 DOI: 10.1109/TNN.2011.2170180
Rafael V Borges, Artur d'Avila Garcez, Luis C Lamb

The effective integration of knowledge representation, reasoning, and learning in a robust computational model is one of the key challenges of computer science and artificial intelligence. In particular, temporal knowledge and models have been fundamental in describing the behavior of computational systems. However, knowledge acquisition of correct descriptions of a system's desired behavior is a complex task. In this paper, we present a novel neural-computation model capable of representing and learning temporal knowledge in recurrent networks. The model works in an integrated fashion. It enables the effective representation of temporal knowledge, the adaptation of temporal models given a set of desirable system properties, and effective learning from examples, which in turn can lead to temporal knowledge extraction from the corresponding trained networks. The model is sound from a theoretical standpoint, but it has also been tested on a case study in the area of model verification and adaptation. The results contained in this paper indicate that model verification and learning can be integrated within the neural computation paradigm, contributing to the development of predictive temporal knowledge-based systems and offering interpretable results that allow system researchers and engineers to improve their models and specifications. The model has been implemented and is available as part of a neural-symbolic computational toolkit.

在一个健壮的计算模型中有效地集成知识表示、推理和学习是计算机科学和人工智能的关键挑战之一。特别是,时间知识和模型已经成为描述计算系统行为的基础。然而,获取系统期望行为的正确描述是一项复杂的任务。在本文中,我们提出了一种新的神经计算模型,能够表示和学习递归网络中的时间知识。该模型以集成的方式工作。它能够有效地表示时间知识,在给定一组理想的系统属性的情况下对时间模型进行适应,并从示例中有效地学习,这反过来又可以从相应的训练网络中提取时间知识。该模型从理论角度来看是合理的,但在模型验证和适应方面也经过了案例研究的检验。本文所包含的结果表明,模型验证和学习可以集成在神经计算范式中,有助于基于预测时间知识的系统的发展,并提供可解释的结果,使系统研究人员和工程师能够改进他们的模型和规范。该模型已经实现,并可作为神经符号计算工具包的一部分。
{"title":"Learning and representing temporal knowledge in recurrent networks.","authors":"Rafael V Borges,&nbsp;Artur d'Avila Garcez,&nbsp;Luis C Lamb","doi":"10.1109/TNN.2011.2170180","DOIUrl":"https://doi.org/10.1109/TNN.2011.2170180","url":null,"abstract":"<p><p>The effective integration of knowledge representation, reasoning, and learning in a robust computational model is one of the key challenges of computer science and artificial intelligence. In particular, temporal knowledge and models have been fundamental in describing the behavior of computational systems. However, knowledge acquisition of correct descriptions of a system's desired behavior is a complex task. In this paper, we present a novel neural-computation model capable of representing and learning temporal knowledge in recurrent networks. The model works in an integrated fashion. It enables the effective representation of temporal knowledge, the adaptation of temporal models given a set of desirable system properties, and effective learning from examples, which in turn can lead to temporal knowledge extraction from the corresponding trained networks. The model is sound from a theoretical standpoint, but it has also been tested on a case study in the area of model verification and adaptation. The results contained in this paper indicate that model verification and learning can be integrated within the neural computation paradigm, contributing to the development of predictive temporal knowledge-based systems and offering interpretable results that allow system researchers and engineers to improve their models and specifications. The model has been implemented and is available as part of a neural-symbolic computational toolkit.</p>","PeriodicalId":13434,"journal":{"name":"IEEE transactions on neural networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/TNN.2011.2170180","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"30072532","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 46
Silicon modeling of the Mihalaş-Niebur neuron. mihala<e:1> - niebur神经元的硅建模。
Pub Date : 2011-12-01 Epub Date: 2011-10-10 DOI: 10.1109/TNN.2011.2167020
Fopefolu Folowosele, Tara Julia Hamilton, Ralph Etienne-Cummings

There are a number of spiking and bursting neuron models with varying levels of complexity, ranging from the simple integrate-and-fire model to the more complex Hodgkin-Huxley model. The simpler models tend to be easily implemented in silicon but yet not biologically plausible. Conversely, the more complex models tend to occupy a large area although they are more biologically plausible. In this paper, we present the 0.5 μm complementary metal-oxide-semiconductor (CMOS) implementation of the Mihalaş-Niebur neuron model--a generalized model of the leaky integrate-and-fire neuron with adaptive threshold--that is able to produce most of the known spiking and bursting patterns that have been observed in biology. Our implementation modifies the original proposed model, making it more amenable to CMOS implementation and more biologically plausible. All but one of the spiking properties--tonic spiking, class 1 spiking, phasic spiking, hyperpolarized spiking, rebound spiking, spike frequency adaptation, accommodation, threshold variability, integrator and input bistability--are demonstrated in this model.

有许多复杂程度不同的尖峰和破裂神经元模型,从简单的集成-发射模型到更复杂的霍奇金-赫胥黎模型。更简单的模型往往很容易在硅中实现,但在生物学上却不可信。相反,更复杂的模型往往占据更大的区域,尽管它们在生物学上更合理。在本文中,我们提出了0.5 μm互补金属氧化物半导体(CMOS)实现的mihala - niebur神经元模型-具有自适应阈值的泄漏集成-点火神经元的广义模型-能够产生生物学中观察到的大多数已知的spike和burst模式。我们的实现修改了最初提出的模型,使其更适合CMOS实现,并且在生物学上更合理。除一种特性外,其他所有特性——强直脉冲、第一类脉冲、相位脉冲、超极化脉冲、反弹脉冲、脉冲频率自适应、调节、阈值可变性、积分器和输入双稳性——都在该模型中得到了证明。
{"title":"Silicon modeling of the Mihalaş-Niebur neuron.","authors":"Fopefolu Folowosele,&nbsp;Tara Julia Hamilton,&nbsp;Ralph Etienne-Cummings","doi":"10.1109/TNN.2011.2167020","DOIUrl":"https://doi.org/10.1109/TNN.2011.2167020","url":null,"abstract":"<p><p>There are a number of spiking and bursting neuron models with varying levels of complexity, ranging from the simple integrate-and-fire model to the more complex Hodgkin-Huxley model. The simpler models tend to be easily implemented in silicon but yet not biologically plausible. Conversely, the more complex models tend to occupy a large area although they are more biologically plausible. In this paper, we present the 0.5 μm complementary metal-oxide-semiconductor (CMOS) implementation of the Mihalaş-Niebur neuron model--a generalized model of the leaky integrate-and-fire neuron with adaptive threshold--that is able to produce most of the known spiking and bursting patterns that have been observed in biology. Our implementation modifies the original proposed model, making it more amenable to CMOS implementation and more biologically plausible. All but one of the spiking properties--tonic spiking, class 1 spiking, phasic spiking, hyperpolarized spiking, rebound spiking, spike frequency adaptation, accommodation, threshold variability, integrator and input bistability--are demonstrated in this model.</p>","PeriodicalId":13434,"journal":{"name":"IEEE transactions on neural networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/TNN.2011.2167020","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"30201401","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 24
Hierarchical approximate policy iteration with binary-tree state space decomposition. 基于二叉树状态空间分解的分层近似策略迭代。
Pub Date : 2011-12-01 Epub Date: 2011-10-10 DOI: 10.1109/TNN.2011.2168422
Xin Xu, Chunming Liu, Simon X Yang, Dewen Hu

In recent years, approximate policy iteration (API) has attracted increasing attention in reinforcement learning (RL), e.g., least-squares policy iteration (LSPI) and its kernelized version, the kernel-based LSPI algorithm. However, it remains difficult for API algorithms to obtain near-optimal policies for Markov decision processes (MDPs) with large or continuous state spaces. To address this problem, this paper presents a hierarchical API (HAPI) method with binary-tree state space decomposition for RL in a class of absorbing MDPs, which can be formulated as time-optimal learning control tasks. In the proposed method, after collecting samples adaptively in the state space of the original MDP, a learning-based decomposition strategy of sample sets was designed to implement the binary-tree state space decomposition process. Then, API algorithms were used on the sample subsets to approximate local optimal policies of sub-MDPs. The original MDP was decomposed into a binary-tree structure of absorbing sub-MDPs, constructed during the learning process, thus, local near-optimal policies were approximated by API algorithms with reduced complexity and higher precision. Furthermore, because of the improved quality of local policies, the combined global policy performed better than the near-optimal policy obtained by a single API algorithm in the original MDP. Three learning control problems, including path-tracking control of a real mobile robot, were studied to evaluate the performance of the HAPI method. With the same setting for basis function selection and sample collection, the proposed HAPI obtained better near-optimal policies than previous API methods such as LSPI and KLSPI.

近年来,近似策略迭代(API)在强化学习(RL)中引起了越来越多的关注,例如最小二乘策略迭代(LSPI)及其核化版本,即基于核的LSPI算法。然而,对于具有大或连续状态空间的马尔可夫决策过程(mdp), API算法仍然难以获得接近最优的策略。为了解决这一问题,本文提出了一种具有二叉树状态空间分解的分层API (HAPI)方法,用于一类吸收MDPs中的RL,该方法可表述为时间最优学习控制任务。该方法在原MDP状态空间中自适应采集样本后,设计基于学习的样本集分解策略,实现二叉树状态空间分解过程。然后,在样本子集上使用API算法来近似子mdp的局部最优策略。将原始MDP分解为吸收子MDP的二叉树结构,并在学习过程中构造,从而通过API算法逼近局部近最优策略,降低了复杂度,提高了精度。此外,由于改进了局部策略的质量,组合全局策略的性能优于原始MDP中单个API算法获得的近最优策略。以实际移动机器人的路径跟踪控制为例,研究了HAPI方法的学习控制性能。在基函数选择和样本收集设置相同的情况下,所提出的HAPI比以前的API方法(如LSPI和KLSPI)获得了更好的近最优策略。
{"title":"Hierarchical approximate policy iteration with binary-tree state space decomposition.","authors":"Xin Xu,&nbsp;Chunming Liu,&nbsp;Simon X Yang,&nbsp;Dewen Hu","doi":"10.1109/TNN.2011.2168422","DOIUrl":"https://doi.org/10.1109/TNN.2011.2168422","url":null,"abstract":"<p><p>In recent years, approximate policy iteration (API) has attracted increasing attention in reinforcement learning (RL), e.g., least-squares policy iteration (LSPI) and its kernelized version, the kernel-based LSPI algorithm. However, it remains difficult for API algorithms to obtain near-optimal policies for Markov decision processes (MDPs) with large or continuous state spaces. To address this problem, this paper presents a hierarchical API (HAPI) method with binary-tree state space decomposition for RL in a class of absorbing MDPs, which can be formulated as time-optimal learning control tasks. In the proposed method, after collecting samples adaptively in the state space of the original MDP, a learning-based decomposition strategy of sample sets was designed to implement the binary-tree state space decomposition process. Then, API algorithms were used on the sample subsets to approximate local optimal policies of sub-MDPs. The original MDP was decomposed into a binary-tree structure of absorbing sub-MDPs, constructed during the learning process, thus, local near-optimal policies were approximated by API algorithms with reduced complexity and higher precision. Furthermore, because of the improved quality of local policies, the combined global policy performed better than the near-optimal policy obtained by a single API algorithm in the original MDP. Three learning control problems, including path-tracking control of a real mobile robot, were studied to evaluate the performance of the HAPI method. With the same setting for basis function selection and sample collection, the proposed HAPI obtained better near-optimal policies than previous API methods such as LSPI and KLSPI.</p>","PeriodicalId":13434,"journal":{"name":"IEEE transactions on neural networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/TNN.2011.2168422","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"30201403","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 37
期刊
IEEE transactions on neural networks
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1