Improving classification accuracy by identifying and removing instances that should be misclassified
Pub Date: 2011-10-03 | DOI: 10.1109/IJCNN.2011.6033571
Michael R. Smith, T. Martinez
Appropriately handling noise and outliers is an important issue in data mining. In this paper we examine how noise and outliers are handled by learning algorithms. We introduce a filtering method called PRISM that identifies and removes instances that should be misclassified. We refer to the set of removed instances as ISMs (instances that should be misclassified). We examine PRISM and compare it against 3 existing outlier detection methods and 1 noise reduction technique on 48 data sets using 9 learning algorithms. Using PRISM, the classification accuracy increases from 78.5% to 79.8% on a set of 53 data sets, a statistically significant increase. In addition, the accuracy on the non-outlier instances increases from 82.8% to 84.7%. PRISM achieves a higher classification accuracy than the outlier detection methods and compares favorably with the noise reduction method.
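The exact criteria PRISM uses to identify ISMs are defined in the paper; as a rough illustration of the overall filter-then-train workflow only, the Python sketch below removes instances that a small pool of cross-validated learners unanimously misclassifies. The learner pool, data set, and unanimity rule are illustrative assumptions, not PRISM itself.

```python
# Hypothetical stand-in for a misclassification-based filter; not PRISM.
import numpy as np
from sklearn.datasets import load_iris
from sklearn.model_selection import cross_val_predict
from sklearn.naive_bayes import GaussianNB
from sklearn.neighbors import KNeighborsClassifier
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)

# Flag an instance when every cross-validated learner in a small,
# diverse pool misclassifies it.
learners = [GaussianNB(), KNeighborsClassifier(5), DecisionTreeClassifier()]
preds = np.stack([cross_val_predict(clf, X, y, cv=5) for clf in learners])
suspected = np.all(preds != y, axis=0)

# Train downstream models on the filtered set.
X_kept, y_kept = X[~suspected], y[~suspected]
print(f"Removed {suspected.sum()} of {len(y)} suspected ISMs")
```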
{"title":"Improving classification accuracy by identifying and removing instances that should be misclassified","authors":"Michael R. Smith, T. Martinez","doi":"10.1109/IJCNN.2011.6033571","DOIUrl":"https://doi.org/10.1109/IJCNN.2011.6033571","url":null,"abstract":"Appropriately handling noise and outliers is an important issue in data mining. In this paper we examine how noise and outliers are handled by learning algorithms. We introduce a filtering method called PRISM that identifies and removes instances that should be misclassified. We refer to the set of removed instances as ISMs (instances that should be misclassified). We examine PRISM and compare it against 3 existing outlier detection methods and 1 noise reduction technique on 48 data sets using 9 learning algorithms. Using PRISM, the classification accuracy increases from 78.5% to 79.8% on a set of 53 data sets and is statistically significant. In addition, the accuracy on the non-outlier instances increases from 82.8% to 84.7%. PRISM achieves a higher classification accuracy than the outlier detection methods and compares favorably with the noise reduction method.","PeriodicalId":415833,"journal":{"name":"The 2011 International Joint Conference on Neural Networks","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128530441","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Where-What Network 5: Dealing with scales for objects in complex backgrounds
Pub Date: 2011-10-03 | DOI: 10.1109/IJCNN.2011.6033587
Xiaoying Song, Wenqiang Zhang, J. Weng
The biologically inspired developmental Where-What Networks (WWN) are general-purpose visuomotor networks for detecting and recognizing objects in complex backgrounds, modeling the dorsal and ventral streams of the biological visual cortex. The networks are designed for the attention and recognition problem. The architectures of previous versions were meant for a single scale of foreground. This paper focuses on Where-What Network-5 (WWN-5), the extension to multiple scales. WWN-5 can learn three concepts of an object: type, location and scale.
{"title":"Where-What Network 5: Dealing with scales for objects in complex backgrounds","authors":"Xiaoying Song, Wenqiang Zhang, J. Weng","doi":"10.1109/IJCNN.2011.6033587","DOIUrl":"https://doi.org/10.1109/IJCNN.2011.6033587","url":null,"abstract":"The biologically-inspired developmental Where-What Networks (WWN) are general purpose visuomotor networks for detecting and recognizing objects from complex backgrounds, modeling the dorsal and ventral streams of the biological visual cortex. The networks are designed for the attention and recognition problem. The architecture in previous versions were meant for a single scale of foreground. This paper focuses on Where-What Network-5 (WWN-5), the extension for multiple scales. WWN-5 can learn three concepts of an object: type, location and scale.","PeriodicalId":415833,"journal":{"name":"The 2011 International Joint Conference on Neural Networks","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116199754","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
An event-driven model for the SpiNNaker virtual synaptic channel
Pub Date: 2011-10-03 | DOI: 10.1109/IJCNN.2011.6033466
Alexander D. Rast, F. Galluppi, Sergio Davies, L. Plana, T. Sharp, S. Furber
Neural networks present a fundamentally different model of computation from conventional sequential hardware, which makes conventional hardware inefficient for very-large-scale neural models. Current neuromorphic devices do not yet offer a fully satisfactory solution even though they have improved simulation performance, in part because of fixed hardware, in part because of poor software support. SpiNNaker introduces a different approach, the “neuromimetic” architecture, that maintains the neural optimisation of dedicated chips while offering FPGA-like universal configurability. Central to this parallel multiprocessor is an asynchronous event-driven model that uses interrupt-generating dedicated hardware on the chip to support real-time neural simulation. In turn this requires an event-driven software model: a rethink as fundamental as that of the hardware. We examine this event-driven software model for an important hardware subsystem, the previously introduced virtual synaptic channel. Using a scheduler-based system service architecture, the software can “hide” low-level processes and events from models so that the only event the model sees is “spike received”. Results from on-chip simulation demonstrate the robustness of the system even in the presence of extremely bursty, unpredictable traffic, but also expose important model-level tradeoffs that are a consequence of the physical nature of the SpiNNaker chip. This event-driven subsystem is the first component of a library-based development system that allows the user to describe a model in a high-level neural description environment and rely on a lower layer of system services to execute the model efficiently on SpiNNaker. Such a system realises a general-purpose platform that can generate an arbitrary neural network and run it with hardware speed and scale.
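To make the scheduler-based service layer concrete, here is a minimal, hypothetical Python sketch in which a low-level event (a completed DMA fetch of a synaptic row) is hidden behind the single model-facing "spike received" callback. None of these names are SpiNNaker APIs; this only illustrates the layering described above.

```python
import heapq

class EventScheduler:
    """Toy system-service layer: queues events, hides low-level ones."""
    def __init__(self):
        self._queue = []   # (time, seq, event_type, payload)
        self._seq = 0      # tie-breaker so payloads are never compared

    def post(self, time, event_type, payload):
        heapq.heappush(self._queue, (time, self._seq, event_type, payload))
        self._seq += 1

    def run(self, on_spike):
        while self._queue:
            time, _, event_type, payload = heapq.heappop(self._queue)
            if event_type == "dma_done":
                # Low-level event: synaptic data fetched; stays hidden.
                self.post(time, "spike", payload)
            elif event_type == "spike":
                on_spike(time, payload)   # the only event the model sees

sched = EventScheduler()
sched.post(0.1, "dma_done", {"source": 42})
sched.run(lambda t, p: print(f"t={t}: spike received from neuron {p['source']}"))
```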
{"title":"An event-driven model for the SpiNNaker virtual synaptic channel","authors":"Alexander D. Rast, F. Galluppi, Sergio Davies, L. Plana, T. Sharp, S. Furber","doi":"10.1109/IJCNN.2011.6033466","DOIUrl":"https://doi.org/10.1109/IJCNN.2011.6033466","url":null,"abstract":"Neural networks present a fundamentally different model of computation from conventional sequential hardware, making it inefficient for very-large-scale models. Current neuromorphic devices do not yet offer a fully satisfactory solution even though they have improved simulation performance, in part because of fixed hardware, in part because of poor software support. SpiNNaker introduces a different approach, the “neuromimetic” architecture, that maintains the neural optimisation of dedicated chips while offering FPGA-like universal configurability. Central to this parallel multiprocessor is an asynchronous event-driven model that uses interrupt-generating dedicated hardware on the chip to support real-time neural simulation. In turn this requires an event-driven software model: a rethink as fundamental as that of the hardware. We examine this event-driven software model for an important hardware subsystem, the previously-introduced virtual synaptic channel. Using a scheduler-based system service architecture, the software can “hide” low-level processes and events from models so that the only event the model sees is “spike received”. Results from simulation on-chip demonstrate the robustness of the system even in the presence of extremely bursty, unpredictable traffic, but also expose important model-evel tradeoffs that are a consequence of the physical nature of the SpiNNaker chip. This event-driven subsystem is the first component of a library-based development system that allows the user to describe a model in a high-level neural description environment and be able to rely on a lower layer of system services to execute the model efficiently on SpiNNaker. Such a system realises a general-purpose platform that can generate an arbitrary neural network and run it with hardware speed and scale.","PeriodicalId":415833,"journal":{"name":"The 2011 International Joint Conference on Neural Networks","volume":"56 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117001208","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Non-Gaussian component analysis using Density Gradient Covariance matrix
Pub Date: 2011-10-03 | DOI: 10.1109/IJCNN.2011.6033327
N. Reyhani, E. Oja
High dimensional data are often modeled as signal plus noise, where the signal belongs to a low dimensional manifold contaminated with high dimensional noise. Estimating the signal subspace when the noise is Gaussian and the signal is non-Gaussian is the main focus of this paper. We assume that the Gaussian noise variance can be high, so standard denoising approaches like Principal Component Analysis fail. The approach also differs from standard Independent Component Analysis in that no independent signal factors are assumed. This model is called non-Gaussian subspace/component analysis (NGCA). The previous approaches proposed for this subspace analysis use the fourth cumulant matrix or the Hessian of the logarithm of characteristic functions, both of which have some practical and theoretical issues. We propose to use sample Density Gradient Covariances (DGC), which are similar to the Fisher information matrix, for estimating the non-Gaussian subspace. Here, we use a nonparametric kernel density estimator to estimate the gradients of density functions. Moreover, we extend the notion of non-Gaussian subspace analysis to a supervised version where the label or response information is present. For the supervised non-Gaussian subspace analysis, we propose to use conditional density gradient covariances, which are computed by conditioning on the discretized response variable. A non-asymptotic analysis of density gradient covariance is also provided, which relates the error of estimating the population DGC matrix using the sample DGC to the number of dimensions and the number of samples.
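As a hedged sketch of the estimator's overall shape, the code below whitens the data, estimates density gradients with a Gaussian KDE, forms a sample DGC matrix, and returns its top eigenvectors as a candidate non-Gaussian subspace. The whitening step and rule-of-thumb bandwidth are assumptions for illustration; the paper's exact estimator and its supervised, conditional variant are not reproduced.

```python
import numpy as np

def dgc_subspace(X, dim, h=None):
    """Illustrative sample-DGC subspace estimate; not the paper's exact recipe."""
    n, d = X.shape
    # Whiten so that, for Gaussian data, density gradients are isotropic.
    L = np.linalg.cholesky(np.cov(X.T))
    Xw = (X - X.mean(axis=0)) @ np.linalg.inv(L).T
    h = h if h is not None else n ** (-1.0 / (d + 4))     # rule-of-thumb bandwidth
    diff = Xw[:, None, :] - Xw[None, :, :]                # pairwise differences
    w = np.exp(-0.5 * (diff ** 2).sum(axis=-1) / h ** 2)  # Gaussian kernel weights
    grads = -(w[:, :, None] * diff).sum(axis=1) / (n * h ** 2)  # KDE gradients
    C = grads.T @ grads / n                               # sample DGC matrix
    evals, evecs = np.linalg.eigh(C)
    return evecs[:, np.argsort(evals)[::-1][:dim]]        # strongest directions
```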
{"title":"Non-Gaussian component analysis using Density Gradient Covariance matrix","authors":"N. Reyhani, E. Oja","doi":"10.1109/IJCNN.2011.6033327","DOIUrl":"https://doi.org/10.1109/IJCNN.2011.6033327","url":null,"abstract":"High dimensional data are often modeled by signal plus noise where the signal belongs to a low dimensional manifold contaminated with high dimensional noise. Estimating the signal subspace when the noise is Gaussian and the signal is non-Gaussian is the main focus of this paper. We assume that the Gaussian noise variance can be high, so standard denoising approaches like Principal Component Analysis fail. The approach also differs from standard Independent Component Analysis in that no independent signal factors are assumed. This model is called non-Gaussian subspace/component analysis (NGCA). The previous approaches proposed for this subspace analysis use the fourth cumulant matrix or the Hessian of the logarithm of characteristic functions, which both have some practical and theoretical issues. We propose to use sample Density Gradient Covariances, which are similar to the Fisher information matrix for estimating the non-Gaussian subspace. Here, we use nonparametric kernel density estimator to estimate the gradients of density functions. Moreover, we extend the notion of non-Gaussian subspace analysis to a supervised version where the label or response information is present. For the supervised non-Gaussian subspace analysis, we propose to use conditional density gradient covariances which are computed by conditioning on the discretized response variable. A non-asymptotic analysis of density gradient covariance is also provided which relates the error of estimating the population DGC matrix using sample DGC to the number of dimensions and the number of samples.","PeriodicalId":415833,"journal":{"name":"The 2011 International Joint Conference on Neural Networks","volume":"102 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117295256","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
B-spline neural network based digital baseband predistorter solution using the inverse of De Boor algorithm
Pub Date: 2011-10-03 | DOI: 10.1109/IJCNN.2011.6033196
X. Hong, Yu Gong, Sheng Chen
In this paper a new nonlinear digital baseband predistorter design based on direct learning is introduced, together with a new Wiener system modeling approach for high power amplifiers (HPAs) based on the B-spline neural network. The contribution is twofold. Firstly, by assuming that the nonlinearity in the HPA depends mainly on the input signal amplitude, the complex-valued nonlinear static function is represented by two real-valued B-spline neural networks, one for the amplitude distortion and another for the phase shift. The Gauss-Newton algorithm is applied for the parameter estimation, in which the De Boor recursion is employed to calculate both the B-spline curve and the first order derivatives. Secondly, we derive the predistorter algorithm, calculating the inverse of the complex-valued nonlinear static function according to the B-spline neural network based Wiener models. The inverses of the amplitude and phase shift distortions are then computed and compensated using the identified phase shift model. Numerical examples demonstrate the efficacy of the proposed approaches.
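The De Boor recursion itself is standard; as a hedged sketch, the function below evaluates a degree-p B-spline curve with it (the clamped knot vector and coefficients in the usage lines are made-up examples, and the paper's derivative computation and inversion step are not reproduced).

```python
import numpy as np

def de_boor(x, t, c, p):
    """Evaluate a degree-p B-spline with knot vector t and coefficients c at x.
    Assumes x lies in a valid knot span, i.e. t[p] <= x < t[len(t) - p - 1]."""
    k = np.searchsorted(t, x, side="right") - 1        # knot span containing x
    d = [c[j + k - p] for j in range(p + 1)]           # working coefficients
    for r in range(1, p + 1):
        for j in range(p, r - 1, -1):
            i = j + k - p
            alpha = (x - t[i]) / (t[i + p - r + 1] - t[i])
            d[j] = (1.0 - alpha) * d[j - 1] + alpha * d[j]
    return d[p]

t = np.array([0, 0, 0, 0, 1, 2, 3, 3, 3, 3], dtype=float)  # clamped cubic knots
c = np.array([0.0, 1.0, 3.0, 2.0, 4.0, 5.0])               # made-up coefficients
print(de_boor(1.5, t, c, p=3))
```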
{"title":"B-spline neural network based digital baseband predistorter solution using the inverse of De Boor algorithm","authors":"X. Hong, Yu Gong, Sheng Chen","doi":"10.1109/IJCNN.2011.6033196","DOIUrl":"https://doi.org/10.1109/IJCNN.2011.6033196","url":null,"abstract":"In this paper a new nonlinear digital baseband predistorter design is introduced based on direct learning, together with a new Wiener system modeling approach for the high power amplifiers (HPA) based on the B-spline neural network. The contribution is twofold. Firstly, by assuming that the nonlinearity in the HPA is mainly dependent on the input signal amplitude the complex valued nonlinear static function is represented by two real valued B-spline neural networks, one for the amplitude distortion and another for the phase shift. The Gauss-Newton algorithm is applied for the parameter estimation, in which the De Boor recursion is employed to calculate both the B-spline curve and the first order derivatives. Secondly, we derive the predistorter algorithm calculating the inverse of the complex valued nonlinear static function according to B-spline neural network based Wiener models. The inverse of the amplitude and phase shift distortion are then computed and compensated using the identified phase shift model. Numerical examples have been employed to demonstrate the efficacy of the proposed approaches.","PeriodicalId":415833,"journal":{"name":"The 2011 International Joint Conference on Neural Networks","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114531322","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Beyond probabilistic record linkage: Using neural networks and complex features to improve genealogical record linkage
Pub Date: 2011-10-03 | DOI: 10.1109/IJCNN.2011.6033192
D. Wilson
Probabilistic record linkage has been used for many years in a variety of settings, including medical, government, private-sector, and research organizations. The formulas used for probabilistic record linkage have been recognized by some as being equivalent to the naïve Bayes classifier. While this method can produce useful results, it is not difficult to improve accuracy by using one of a host of other machine learning or neural network algorithms. Even a simple single-layer perceptron tends to outperform the naïve Bayes classifier, and thus traditional probabilistic record linkage methods, by a substantial margin. Furthermore, many record linkage systems use simple field comparisons rather than more complex features, partially due to the limits of the probabilistic formulas they use. This paper presents an overview of probabilistic record linkage, shows how to cast it in machine learning terms, and then shows that it is equivalent to a naïve Bayes classifier. It then discusses how to use more complex features than simple field comparisons, and shows how probabilistic record linkage formulas can be modified to handle this. Finally, it demonstrates a substantial improvement in accuracy through the use of neural networks and higher-level matching features, compared to traditional probabilistic record linkage, on a large (80,000-pair) set of labeled pairs of genealogical records used by FamilySearch.org.
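The equivalence rests on the classical Fellegi-Sunter log-odds score, which sums per-field agreement weights exactly as a naïve Bayes classifier sums log-likelihood ratios. The sketch below computes that score for one record pair; the m and u probabilities are made up for illustration.

```python
import math

def match_weight(agreements, m, u):
    """Fellegi-Sunter score: m[i] = P(field i agrees | match),
    u[i] = P(field i agrees | non-match)."""
    w = 0.0
    for agrees, m_i, u_i in zip(agreements, m, u):
        if agrees:
            w += math.log2(m_i / u_i)                  # agreement weight
        else:
            w += math.log2((1 - m_i) / (1 - u_i))      # disagreement weight
    return w

# Three fields: surname, given name, birth year (illustrative probabilities).
m = [0.95, 0.90, 0.85]
u = [0.01, 0.05, 0.02]
print(match_weight([True, True, False], m, u))  # high score => likely match
```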
{"title":"Beyond probabilistic record linkage: Using neural networks and complex features to improve genealogical record linkage","authors":"D. Wilson","doi":"10.1109/IJCNN.2011.6033192","DOIUrl":"https://doi.org/10.1109/IJCNN.2011.6033192","url":null,"abstract":"Probabilistic record linkage has been used for many years in a variety of industries, including medical, government, private sector and research groups. The formulas used for probabilistic record linkage have been recognized by some as being equivalent to the naïve Bayes classifier. While this method can produce useful results, it is not difficult to improve accuracy by using one of a host of other machine learning or neural network algorithms. Even a simple single-layer perceptron tends to outperform the naïve Bayes classifier—and thus traditional probabilistic record linkage methods—by a substantial margin. Furthermore, many record linkage system use simple field comparisons rather than more complex features, partially due to the limits of the probabilistic formulas they use. This paper presents an overview of probabilistic record linkage, shows how to cast it in machine learning terms, and then shows that it is equivalent to a naïve Bayes classifier. It then discusses how to use more complex features than simple field comparisons, and shows how probabilistic record linkage formulas can be modified to handle this. Finally, it demonstrates a huge improvement in accuracy through the use of neural networks and higher-level matching features, compared to traditional probabilistic record linkage on a large (80,000 pair) set of labeled pairs of genealogical records used by FamilySearch.org.","PeriodicalId":415833,"journal":{"name":"The 2011 International Joint Conference on Neural Networks","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115146046","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Communicated somatic markers benefit both the individual and the species
Pub Date: 2011-10-03 | DOI: 10.1109/IJCNN.2011.6033655
K. Harrington, Megan M. Olsen, H. Siegelmann
We use emotional communication within a predator-prey game to evaluate the tradeoffs of socio-emotional behavior at the individual and species scales. In this predator-prey game, individual predators and prey use emotion in their decision making, and communicate their emotional state to neighboring conspecifics. The model of emotion is based upon the somatic marker hypothesis. In comparing individual utility and population dynamics, we find that emotion is capable of supporting both species and individual gain. We suggest this type of dynamic may provide a mechanism for the emergence of altruistic behavior within a species under individual and/or group selection.
{"title":"Communicated somatic markers benefit both the individual and the species","authors":"K. Harrington, Megan M. Olsen, H. Siegelmann","doi":"10.1109/IJCNN.2011.6033655","DOIUrl":"https://doi.org/10.1109/IJCNN.2011.6033655","url":null,"abstract":"We use emotional communication within a predator-prey game to evaluate the tradeoff between socio-emotional behavior at individual- and species- scales. In this predator-prey game, individual predators and prey use emotion in their decision making, and communicate their emotional state with neighboring conspecifics. The model of emotion is based upon the somatic marker hypothesis. In comparing individual utility and population dynamics we find emotion is capable of both supporting species and individual gain. We suggest this type of dynamic may provide a mechanism for the emergence of altruistic behavior within a species under individual and/or group selection.","PeriodicalId":415833,"journal":{"name":"The 2011 International Joint Conference on Neural Networks","volume":"75 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115154802","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Kernel adaptive filtering with maximum correntropy criterion
Pub Date: 2011-10-03 | DOI: 10.1109/IJCNN.2011.6033473
Songlin Zhao, Badong Chen, J. Príncipe
Kernel adaptive filters have drawn increasing attention due to their advantages, such as universal nonlinear approximation with universal kernels and linearity and convexity in a Reproducing Kernel Hilbert Space (RKHS). Among them, the kernel least mean square (KLMS) algorithm deserves particular attention because of its simplicity and sequential learning approach. Like most conventional adaptive filtering algorithms, the KLMS adopts the mean square error (MSE) as the adaptation cost. However, second-order statistics alone are often not suitable for nonlinear and non-Gaussian situations. Therefore, various non-MSE criteria, which involve higher-order statistics, have received increasing interest. Recently, correntropy, as an alternative to MSE, has been successfully used in nonlinear and non-Gaussian signal processing and machine learning. This motivates us to develop a new kernel adaptive algorithm, called kernel maximum correntropy (KMC), which combines the advantages of the KLMS and the maximum correntropy criterion (MCC). We also study its convergence and self-regularization properties using the energy conservation relation. The superior performance of the new algorithm is demonstrated by simulation experiments on the noisy frequency doubling problem.
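A minimal sketch of the KMC idea as described above: a KLMS-style online update whose gain is scaled by the Gaussian correntropy factor exp(-e^2 / (2*sigma_c^2)), so small errors adapt the filter roughly as in KLMS while large (outlier) errors barely move it. Kernel widths and stepsize below are illustrative assumptions, not values from the paper.

```python
import numpy as np

def kmc_train(U, d, eta=0.5, sigma_k=1.0, sigma_c=1.0):
    """Online KMC sketch: growing radial-basis expansion, correntropy gain."""
    centers, coeffs = [], []
    for u, target in zip(U, d):
        u = np.asarray(u, dtype=float)
        if centers:
            k = np.exp(-np.sum((np.asarray(centers) - u) ** 2, axis=1)
                       / (2 * sigma_k ** 2))
            y = float(np.dot(coeffs, k))    # current filter output
        else:
            y = 0.0
        e = target - y
        # Correntropy-induced gain: ~LMS for small e, attenuated for large e.
        coeffs.append(eta * np.exp(-e ** 2 / (2 * sigma_c ** 2)) * e)
        centers.append(u)
    return centers, coeffs
```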
{"title":"Kernel adaptive filtering with maximum correntropy criterion","authors":"Songlin Zhao, Badong Chen, J. Príncipe","doi":"10.1109/IJCNN.2011.6033473","DOIUrl":"https://doi.org/10.1109/IJCNN.2011.6033473","url":null,"abstract":"Kernel adaptive filters have drawn increasing attention due to their advantages such as universal nonlinear approximation with universal kernels, linearity and convexity in Reproducing Kernel Hilbert Space (RKHS). Among them, the kernel least mean square (KLMS) algorithm deserves particular attention because of its simplicity and sequential learning approach. Similar to most conventional adaptive filtering algorithms, the KLMS adopts the mean square error (MSE) as the adaptation cost. However, the mere second-order statistics is often not suitable for nonlinear and non-Gaussian situations. Therefore, various non-MSE criteria, which involve higher-order statistics, have received an increasing interest. Recently, the correntropy, as an alternative of MSE, has been successfully used in nonlinear and non-Gaussian signal processing and machine learning domains. This fact motivates us in this paper to develop a new kernel adaptive algorithm, called the kernel maximum correntropy (KMC), which combines the advantages of the KLMS and maximum correntropy criterion (MCC). We also study its convergence and self-regularization properties by using the energy conservation relation. The superior performance of the new algorithm has been demonstrated by simulation experiments in the noisy frequency doubling problem.","PeriodicalId":415833,"journal":{"name":"The 2011 International Joint Conference on Neural Networks","volume":"359 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115200658","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A class of fast quaternion valued variable stepsize stochastic gradient learning algorithms for vector sensor processes
Pub Date: 2011-10-03 | DOI: 10.1109/IJCNN.2011.6033585
Mingxuan Wang, C. C. Took, D. Mandic
We introduce a class of gradient adaptive stepsize algorithms for quaternion valued adaptive filtering based on three- and four-dimensional vector sensors. This equips the recently introduced quaternion least mean square (QLMS) algorithm with enhanced tracking ability and enables it to be more responsive to dynamically changing environments, while maintaining its desired characteristics of catering for large dynamical differences and coupling between signal components. For generality, the analysis is performed for the widely linear signal model, which, by virtue of accounting for signal noncircularity, is optimal in the mean squared error (MSE) sense for both second order circular (proper) and noncircular (improper) processes. The widely linear QLMS (WL-QLMS) employing the proposed adaptive stepsize modifications is shown to provide enhanced performance on both synthetic and real-world quaternion valued signals. Simulations include signals with drastically different component dynamics, such as a four-dimensional quaternion signal comprising three-dimensional turbulent wind and air temperature, for renewable energy applications.
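For intuition only, here is a real-valued, Mathews-style gradient-adaptive stepsize LMS sketch: the stepsize is nudged along the gradient of the squared error with respect to itself. The paper's WL-QLMS operates on quaternion valued, widely linear signals, which this simplified illustration does not attempt to reproduce; the bounds and constants are assumptions.

```python
import numpy as np

def adaptive_stepsize_lms(X, d, mu0=0.01, rho=1e-4, mu_min=1e-5, mu_max=0.1):
    """LMS with a Mathews-style gradient-adaptive stepsize (real-valued sketch)."""
    n, L = X.shape
    w = np.zeros(L)
    mu, e_prev, x_prev = mu0, 0.0, np.zeros(L)
    for x, target in zip(X, d):
        e = target - w @ x
        # Adapt mu along the gradient of the instantaneous squared error.
        mu = float(np.clip(mu + rho * e * e_prev * (x @ x_prev), mu_min, mu_max))
        w = w + mu * e * x
        e_prev, x_prev = e, x
    return w, mu
```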
{"title":"A class of fast quaternion valued variable stepsize stochastic gradient learning algorithms for vector sensor processes","authors":"Mingxuan Wang, C. C. Took, D. Mandic","doi":"10.1109/IJCNN.2011.6033585","DOIUrl":"https://doi.org/10.1109/IJCNN.2011.6033585","url":null,"abstract":"We introduce a class of gradient adaptive stepsize algorithms for quaternion valued adaptive filtering based on three- and four-dimensional vector sensors. This equips the recently introduced quaternion least mean square (QLMS) algorithm with enhanced tracking ability and enables it to be more responsive to dynamically changing environments, while maintaining its desired characteristics of catering for large dynamical differences and coupling between signal components. For generality, the analysis is performed for the widely linear signal model, which by virtue of accounting for signal noncircularity, is optimal in the mean squared error (MSE) sense for both second order circular (proper) and noncircular (improper) processes. The widely linear QLMS (WL-QLMS) employing the proposed adaptive stepsize modifications is shown to provide enhanced performance for both synthetic and real world quaternion valued signals. Simulations include signals with drastically different component dynamics, such as four dimensional quaternion comprising three dimensional turbulent wind and air temperature for renewable energy applications.","PeriodicalId":415833,"journal":{"name":"The 2011 International Joint Conference on Neural Networks","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114100017","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A new efficient SVM and its application to real-time accurate eye localization
Pub Date: 2011-10-03 | DOI: 10.1109/IJCNN.2011.6033547
Shuo Chen, Chengjun Liu
For complicated classification problems, the standard Support Vector Machine (SVM) is likely to produce a complex model, and thus its classification efficiency is low. In this paper, we propose a new efficient SVM (eSVM) based on the idea of minimizing the margin of misclassified samples. Compared with the conventional SVM, the eSVM is defined on fewer support vectors and thus can achieve much faster classification speed with comparable or even higher classification accuracy. We then present a real-time, accurate eye localization system using the eSVM together with color information and 2D Haar wavelet features. Experiments on public data sets show that (i) the eSVM significantly improves the efficiency of the standard SVM without sacrificing its accuracy and (ii) the eye localization system achieves real-time speed and higher detection accuracy than some state-of-the-art approaches.
{"title":"A new efficient SVM and its application to real-time accurate eye localization","authors":"Shuo Chen, Chengjun Liu","doi":"10.1109/IJCNN.2011.6033547","DOIUrl":"https://doi.org/10.1109/IJCNN.2011.6033547","url":null,"abstract":"For complicated classification problems, the standard Support Vector Machine (SVM) is likely to be complex and thus the classification efficiency is low. In this paper, we propose a new efficient SVM (eSVM), which is based on the idea of minimizing the margin of misclassified samples. Compared with the conventional SVM, the eSVM is defined on fewer support vectors and thus can achieve much faster classification speed and comparable or even higher classification accuracy. We then present a real-time accurate eye localization system using the eSVM together with color information and 2D Haar wavelet features. Experiments on some public data sets show that (i) the eSVM significantly improves the efficiency of the standard SVM without sacrificing its accuracy and (ii) the eye localization system has real-time speed and higher detection accuracy than some state-of-the-art approaches.","PeriodicalId":415833,"journal":{"name":"The 2011 International Joint Conference on Neural Networks","volume":"71 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127397424","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}