首页 > 最新文献

信号处理最新文献

英文 中文
Sparse image coding using learned overcomplete dictionaries 使用学习过完全字典的稀疏图像编码
Pub Date : 2004-09-29 DOI: 10.1109/MLSP.2004.1423021
Joseph F. Murray, K. Kreutz-Delgado
Images can be coded accurately using a sparse set of vectors from an overcomplete dictionary, with potential applications in image compression and feature selection for pattern recognition. We discuss algorithms that perform sparse coding and make three contributions. First, we compare our overcomplete dictionary learning algorithm (FOCUSS-CNDL) with overcomplete independent component analysis (ICA). Second, noting that once a dictionary has been learned in a given domain the problem becomes one of choosing the vectors to form an accurate, sparse representation, we compare a recently developed algorithm (sparse Bayesian learning with adjustable variance Gaussians) to well known methods of subset selection: matching pursuit and FOCUSS. Third, noting that in some cases it may be necessary to find a non-negative sparse coding, we present a modified version of the FOCUSS algorithm that can find such non-negative codings
使用来自过完备字典的稀疏向量集可以准确地对图像进行编码,这在图像压缩和模式识别的特征选择方面具有潜在的应用。我们讨论了执行稀疏编码的算法,并做出了三个贡献。首先,我们比较了我们的过完备字典学习算法(FOCUSS-CNDL)和过完备独立成分分析(ICA)。其次,注意到一旦在给定领域中学习了字典,问题就变成了选择向量以形成准确的稀疏表示的问题之一,我们将最近开发的算法(具有可调方差高斯的稀疏贝叶斯学习)与众所周知的子集选择方法进行比较:匹配追踪和焦点。第三,注意到在某些情况下可能需要找到非负稀疏编码,我们提出了一个改进版本的focus算法,可以找到这种非负编码
{"title":"Sparse image coding using learned overcomplete dictionaries","authors":"Joseph F. Murray, K. Kreutz-Delgado","doi":"10.1109/MLSP.2004.1423021","DOIUrl":"https://doi.org/10.1109/MLSP.2004.1423021","url":null,"abstract":"Images can be coded accurately using a sparse set of vectors from an overcomplete dictionary, with potential applications in image compression and feature selection for pattern recognition. We discuss algorithms that perform sparse coding and make three contributions. First, we compare our overcomplete dictionary learning algorithm (FOCUSS-CNDL) with overcomplete independent component analysis (ICA). Second, noting that once a dictionary has been learned in a given domain the problem becomes one of choosing the vectors to form an accurate, sparse representation, we compare a recently developed algorithm (sparse Bayesian learning with adjustable variance Gaussians) to well known methods of subset selection: matching pursuit and FOCUSS. Third, noting that in some cases it may be necessary to find a non-negative sparse coding, we present a modified version of the FOCUSS algorithm that can find such non-negative codings","PeriodicalId":70952,"journal":{"name":"信号处理","volume":"13 1","pages":"579-588"},"PeriodicalIF":0.0,"publicationDate":"2004-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88594865","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A 4/sup N/-QAM adaptive decision device to mitigate I/Q imbalance and impairments caused by time-varying flat fading channels 一种4/sup N/-QAM自适应决策装置,用于缓解时变平坦衰落信道造成的I/Q不平衡和损伤
Pub Date : 2004-09-29 DOI: 10.1109/MLSP.2004.1423031
C. Panazio, R.R. de F Attux
In this work, we propose an adaptive decision device based on a Kohonen network that can automatically generate the classes associated with each symbol of a 4n-QAM in the presence of non-linearities caused by the I/Q imbalance and additive Gaussian white noise, being also capable of compensating phase and gain variations produced by a time-varying flat-fading channel. Our proposal can achieve optimality in the maximum-likelihood sense with a small computational cost. Furthermore, due to the tracking ability inherent to the devised scheme, there is no need for an automatic gain controller or a phase-locked loop
在这项工作中,我们提出了一种基于Kohonen网络的自适应决策装置,该装置可以在存在由I/Q不平衡和加性高斯白噪声引起的非线性的情况下自动生成与4n-QAM的每个符号相关的类,还能够补偿由时变平坦衰落信道产生的相位和增益变化。我们的方案能够以较小的计算成本实现最大似然意义上的最优性。此外,由于所设计方案固有的跟踪能力,不需要自动增益控制器或锁相环
{"title":"A 4/sup N/-QAM adaptive decision device to mitigate I/Q imbalance and impairments caused by time-varying flat fading channels","authors":"C. Panazio, R.R. de F Attux","doi":"10.1109/MLSP.2004.1423031","DOIUrl":"https://doi.org/10.1109/MLSP.2004.1423031","url":null,"abstract":"In this work, we propose an adaptive decision device based on a Kohonen network that can automatically generate the classes associated with each symbol of a 4n-QAM in the presence of non-linearities caused by the I/Q imbalance and additive Gaussian white noise, being also capable of compensating phase and gain variations produced by a time-varying flat-fading channel. Our proposal can achieve optimality in the maximum-likelihood sense with a small computational cost. Furthermore, due to the tracking ability inherent to the devised scheme, there is no need for an automatic gain controller or a phase-locked loop","PeriodicalId":70952,"journal":{"name":"信号处理","volume":"5 1","pages":"665-674"},"PeriodicalIF":0.0,"publicationDate":"2004-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73224680","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
GMM and kernel-based speaker recognition with the ISIP toolkit 基于GMM和内核的说话人识别与ISIP工具包
Pub Date : 2004-09-29 DOI: 10.1109/MLSP.2004.1422996
T. Imbiriba, A. Klautau, N. Parihar, S. Raghavan, J. Picone
This paper describes an open source framework for developing speaker recognition systems. Among other features, it supports kernel classifiers, such as the support and relevance vector machines. The paper also presents results for the IME corpus using Gaussian mixture models, which outperforms previously published ones, and discusses strategies for applying discriminative classifiers to speaker recognition
本文描述了一个开发说话人识别系统的开源框架。除其他特性外,它还支持内核分类器,例如支持向量机和相关向量机。本文还介绍了使用高斯混合模型对IME语料库的结果,该结果优于先前发表的结果,并讨论了将判别分类器应用于说话人识别的策略
{"title":"GMM and kernel-based speaker recognition with the ISIP toolkit","authors":"T. Imbiriba, A. Klautau, N. Parihar, S. Raghavan, J. Picone","doi":"10.1109/MLSP.2004.1422996","DOIUrl":"https://doi.org/10.1109/MLSP.2004.1422996","url":null,"abstract":"This paper describes an open source framework for developing speaker recognition systems. Among other features, it supports kernel classifiers, such as the support and relevance vector machines. The paper also presents results for the IME corpus using Gaussian mixture models, which outperforms previously published ones, and discusses strategies for applying discriminative classifiers to speaker recognition","PeriodicalId":70952,"journal":{"name":"信号处理","volume":"106 1","pages":"371-380"},"PeriodicalIF":0.0,"publicationDate":"2004-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74789711","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Numerical stability of nystrom extension for image segmentation 图像分割中nystrom扩展的数值稳定性
Pub Date : 2004-09-29 DOI: 10.1109/MLSP.2004.1423024
E. Izquierdo, V. Guerra-Ones
A stability analysis of the approximate solution for spectral partitioning in image segmentation based on the Nystrom extension is presented. Algorithmic modifications are introduced to improve the stability of the original technique reported in (C. Fowlkes et al., 2004). The proposed improvement includes a criterion for the selection of the initial sample and more stable estimations of inverse matrices. The proposed algorithm is validated by several computer experiments
给出了基于Nystrom扩展的图像分割中光谱划分近似解的稳定性分析。在C. Fowlkes等人,2004年的报道中,引入了算法修改来提高原始技术的稳定性。提出的改进包括初始样本选择准则和更稳定的逆矩阵估计。通过计算机实验验证了该算法的有效性
{"title":"Numerical stability of nystrom extension for image segmentation","authors":"E. Izquierdo, V. Guerra-Ones","doi":"10.1109/MLSP.2004.1423024","DOIUrl":"https://doi.org/10.1109/MLSP.2004.1423024","url":null,"abstract":"A stability analysis of the approximate solution for spectral partitioning in image segmentation based on the Nystrom extension is presented. Algorithmic modifications are introduced to improve the stability of the original technique reported in (C. Fowlkes et al., 2004). The proposed improvement includes a criterion for the selection of the initial sample and more stable estimations of inverse matrices. The proposed algorithm is validated by several computer experiments","PeriodicalId":70952,"journal":{"name":"信号处理","volume":"13 1","pages":"609-614"},"PeriodicalIF":0.0,"publicationDate":"2004-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87920565","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Speech enhancement by lateral inhibition and binaural masking 侧抑制和双耳掩蔽的语音增强
Pub Date : 2004-09-29 DOI: 10.1109/MLSP.2004.1422995
E. James, A. Barros, T. Yoshinori, D. Mandic, N. Ohnishi
In this paper we propose a simple algorithm to enhance a speech signal with larger energy from a mixture of two sound sources. We used two microphones for acquisition of the sound signals and assume that either one of the speakers is closer of one of the microphones. In this algorithm, we use the concept of auditory filter banks with two psychoacoustic concepts: lateral inhibition and binaural masking. Preliminary computer simulations experiments confirm the validity of the proposed algorithm by objective and subjective measures
在本文中,我们提出了一种简单的算法来增强来自两个声源混合的具有较大能量的语音信号。我们使用两个麦克风来获取声音信号,并假设其中一个扬声器离其中一个麦克风更近。在该算法中,我们将听觉滤波器组的概念与两个心理声学概念相结合:侧抑制和双耳掩蔽。初步的计算机仿真实验从客观和主观两方面验证了该算法的有效性
{"title":"Speech enhancement by lateral inhibition and binaural masking","authors":"E. James, A. Barros, T. Yoshinori, D. Mandic, N. Ohnishi","doi":"10.1109/MLSP.2004.1422995","DOIUrl":"https://doi.org/10.1109/MLSP.2004.1422995","url":null,"abstract":"In this paper we propose a simple algorithm to enhance a speech signal with larger energy from a mixture of two sound sources. We used two microphones for acquisition of the sound signals and assume that either one of the speakers is closer of one of the microphones. In this algorithm, we use the concept of auditory filter banks with two psychoacoustic concepts: lateral inhibition and binaural masking. Preliminary computer simulations experiments confirm the validity of the proposed algorithm by objective and subjective measures","PeriodicalId":70952,"journal":{"name":"信号处理","volume":"21 1","pages":"365-370"},"PeriodicalIF":0.0,"publicationDate":"2004-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75305686","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Approximate leave-one-out error estimation for learning with smooth, strictly convex margin loss functions 近似留一误差估计学习光滑,严格凸边缘损失函数
Pub Date : 2004-09-29 DOI: 10.1109/MLSP.2004.1422960
Christopher P. Diehl
Leave-one-out (LOO) error estimation is an important statistical tool for assessing generalization performance. A number of papers have focused on LOO error estimation for support vector machines, but little work has focused on LOO error estimation when learning with smooth, convex margin loss functions. We consider the problem of approximating the LOO error estimate in the context of sparse kernel machine learning. We first motivate a general framework for learning sparse kernel machines that involves minimizing a regularized, smooth, strictly convex margin loss. Then we present an approximation of the LOO error for the family of learning algorithms admissible in the general framework. We examine the implications of the approximation and review preliminary experimental results demonstrating the utility of the approach
留一误差估计是评估泛化性能的重要统计工具。许多论文关注的是支持向量机的LOO误差估计,但很少有研究关注平滑凸边损失函数学习时的LOO误差估计。我们考虑了稀疏核机器学习中LOO误差估计的逼近问题。我们首先激发了一个学习稀疏核机的通用框架,该框架涉及最小化正则化的、光滑的、严格凸的边缘损失。然后,我们给出了在一般框架中允许的学习算法族的LOO误差的近似值。我们研究了近似的含义,并回顾了初步的实验结果,证明了该方法的实用性
{"title":"Approximate leave-one-out error estimation for learning with smooth, strictly convex margin loss functions","authors":"Christopher P. Diehl","doi":"10.1109/MLSP.2004.1422960","DOIUrl":"https://doi.org/10.1109/MLSP.2004.1422960","url":null,"abstract":"Leave-one-out (LOO) error estimation is an important statistical tool for assessing generalization performance. A number of papers have focused on LOO error estimation for support vector machines, but little work has focused on LOO error estimation when learning with smooth, convex margin loss functions. We consider the problem of approximating the LOO error estimate in the context of sparse kernel machine learning. We first motivate a general framework for learning sparse kernel machines that involves minimizing a regularized, smooth, strictly convex margin loss. Then we present an approximation of the LOO error for the family of learning algorithms admissible in the general framework. We examine the implications of the approximation and review preliminary experimental results demonstrating the utility of the approach","PeriodicalId":70952,"journal":{"name":"信号处理","volume":"17 1","pages":"63-72"},"PeriodicalIF":0.0,"publicationDate":"2004-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75139206","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A hybrid neural network/rule based system for bilingual text-to-phoneme mapping 基于混合神经网络/规则的双语文本到音素映射系统
Pub Date : 2004-09-29 DOI: 10.1109/MLSP.2004.1422992
E. B. Bilcu, J. Astola, J. Saarinen
Text-to-phoneme (TTP) mapping is a preliminary step in text-to-speech synthesis and it affects the naturalness and understandability of synthetic speech. In this paper, we propose a hybrid neural network/rule based system for bilingual text-to-phoneme mapping. Our system uses three neural networks and a simple rule to perform the phoneme transcription. The first network is trained to convert the letters from the first language into their corresponding phonemes, the second one is used to obtain the phonemes for the second language whereas the third neural network together with a simple rule is responsible of the language recognition. The proposed approach can be easily extended for multilingual applications when more neural networks are introduced. Simulations performed on a bilingual dictionary (English+French) show the improvements in terms of phoneme accuracy of our method against the approach that uses a single neural network for multilingual TTP
文本-音素映射是文本-语音合成的第一步,它直接影响到合成语音的自然度和可理解性。本文提出了一种基于神经网络/规则的双语文本-音素映射混合系统。我们的系统使用三个神经网络和一个简单的规则来执行音素转录。第一个神经网络用于将第一语言中的字母转换为对应的音素,第二个神经网络用于获取第二语言的音素,第三个神经网络与一个简单的规则一起负责语言识别。当引入更多的神经网络时,该方法可以很容易地扩展到多语言应用中。在双语词典(英语+法语)上进行的模拟表明,与使用单一神经网络进行多语言TTP的方法相比,我们的方法在音素准确性方面有所提高
{"title":"A hybrid neural network/rule based system for bilingual text-to-phoneme mapping","authors":"E. B. Bilcu, J. Astola, J. Saarinen","doi":"10.1109/MLSP.2004.1422992","DOIUrl":"https://doi.org/10.1109/MLSP.2004.1422992","url":null,"abstract":"Text-to-phoneme (TTP) mapping is a preliminary step in text-to-speech synthesis and it affects the naturalness and understandability of synthetic speech. In this paper, we propose a hybrid neural network/rule based system for bilingual text-to-phoneme mapping. Our system uses three neural networks and a simple rule to perform the phoneme transcription. The first network is trained to convert the letters from the first language into their corresponding phonemes, the second one is used to obtain the phonemes for the second language whereas the third neural network together with a simple rule is responsible of the language recognition. The proposed approach can be easily extended for multilingual applications when more neural networks are introduced. Simulations performed on a bilingual dictionary (English+French) show the improvements in terms of phoneme accuracy of our method against the approach that uses a single neural network for multilingual TTP","PeriodicalId":70952,"journal":{"name":"信号处理","volume":"109 1","pages":"345-354"},"PeriodicalIF":0.0,"publicationDate":"2004-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77266517","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Automated feature extraction using genetic programming for bearing condition monitoring 自动特征提取使用遗传编程轴承状态监测
Pub Date : 2004-09-29 DOI: 10.1109/MLSP.2004.1423015
Hong Guo, L. B. Jack, A. Nandi
The feature extraction is one of the major challenges for the pattern recognition. This helps to maximise the useful information from the raw data in order to make the classification effective and simple. In this paper, one of the machine learning approaches, genetic programming (GP), is employed to extract features from the raw vibration data taken from a rotating machine with several different conditions. The created features are then used as the input to a simple ANN for the identification of different bearing conditions, in comparison with the other classical machine learning methods. Experimental results demonstrate the capability of GP to discover automatically the functional relationships among the raw vibration data, to give improved performance
特征提取是模式识别的主要挑战之一。这有助于最大限度地从原始数据中获取有用信息,从而使分类有效和简单。本文采用遗传规划(GP)作为机器学习的一种方法,从不同条件下的旋转机器的原始振动数据中提取特征。然后将创建的特征用作简单人工神经网络的输入,用于识别不同的轴承条件,并与其他经典机器学习方法进行比较。实验结果表明,GP能够自动发现原始振动数据之间的函数关系,从而提高了性能
{"title":"Automated feature extraction using genetic programming for bearing condition monitoring","authors":"Hong Guo, L. B. Jack, A. Nandi","doi":"10.1109/MLSP.2004.1423015","DOIUrl":"https://doi.org/10.1109/MLSP.2004.1423015","url":null,"abstract":"The feature extraction is one of the major challenges for the pattern recognition. This helps to maximise the useful information from the raw data in order to make the classification effective and simple. In this paper, one of the machine learning approaches, genetic programming (GP), is employed to extract features from the raw vibration data taken from a rotating machine with several different conditions. The created features are then used as the input to a simple ANN for the identification of different bearing conditions, in comparison with the other classical machine learning methods. Experimental results demonstrate the capability of GP to discover automatically the functional relationships among the raw vibration data, to give improved performance","PeriodicalId":70952,"journal":{"name":"信号处理","volume":"33 1","pages":"519-528"},"PeriodicalIF":0.0,"publicationDate":"2004-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80065611","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Tracking and visualization of changes in high-dimensional non-parametric distributions 高维非参数分布变化的跟踪和可视化
Pub Date : 2004-09-29 DOI: 10.1109/MLSP.2004.1422975
J. Kohlmorgen
Most real-world systems exhibit a non-stationary behavior, e.g., slow drifts due to wear or fast changes due to external influences. Extracting and quantifying these phenomena is often difficult due to the lack of a precise mathematical model of the underlying system. We here propose to model such high-level changes of a dynamical system solely on the basis of the observed measurements rather than by modeling the underlying system itself. In particular, we present a method to track and visualize changes in general data distributions. We approach the problem of how to represent continuous changes in high-dimensional non-parametric distributions by identifying anchor distributions and we model the transitions between those anchor distributions by defining a suitable similarity measure. Applications to a high-dimensional chaotic system and to a sleep-onset detection task in EEG demonstrate the efficiency of this approach
大多数现实世界的系统表现出非平稳的行为,例如,由于磨损而缓慢漂移或由于外部影响而快速变化。由于缺乏底层系统的精确数学模型,提取和量化这些现象往往是困难的。在这里,我们建议仅根据观测到的测量来模拟动力系统的这种高层变化,而不是通过对底层系统本身进行建模。特别是,我们提出了一种方法来跟踪和可视化一般数据分布的变化。我们通过识别锚点分布来解决如何表示高维非参数分布的连续变化的问题,并通过定义合适的相似性度量来模拟这些锚点分布之间的转换。应用于高维混沌系统和脑电睡眠检测任务,证明了该方法的有效性
{"title":"Tracking and visualization of changes in high-dimensional non-parametric distributions","authors":"J. Kohlmorgen","doi":"10.1109/MLSP.2004.1422975","DOIUrl":"https://doi.org/10.1109/MLSP.2004.1422975","url":null,"abstract":"Most real-world systems exhibit a non-stationary behavior, e.g., slow drifts due to wear or fast changes due to external influences. Extracting and quantifying these phenomena is often difficult due to the lack of a precise mathematical model of the underlying system. We here propose to model such high-level changes of a dynamical system solely on the basis of the observed measurements rather than by modeling the underlying system itself. In particular, we present a method to track and visualize changes in general data distributions. We approach the problem of how to represent continuous changes in high-dimensional non-parametric distributions by identifying anchor distributions and we model the transitions between those anchor distributions by defining a suitable similarity measure. Applications to a high-dimensional chaotic system and to a sleep-onset detection task in EEG demonstrate the efficiency of this approach","PeriodicalId":70952,"journal":{"name":"信号处理","volume":"24 1","pages":"203-212"},"PeriodicalIF":0.0,"publicationDate":"2004-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84762875","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A new sparse image representation algorithm applied to facial expression recognition 一种新的稀疏图像表示算法应用于面部表情识别
Pub Date : 2004-09-29 DOI: 10.1109/MLSP.2004.1423017
I. Buciu, Ioannis Pitas
In this paper, we present a novel algorithm for learning facial expressions in a supervised manner. This algorithm is derived from the local non-negative matrix factorization (LNMF) algorithm, which is an extension of non-negative matrix factorization (NMF) method. We call this newly proposed algorithm discriminant non-negative matrix factorization (DNMF). Given an image database, all these three algorithms decompose the database into basis images and their corresponding coefficients. This decomposition is computed differently for each method. The decomposition results are applied on facial images for the recognition of the six basic facial expressions. We found that our algorithm shows superior performance by achieving a higher recognition rate, when compared to NMF and LNMF
在本文中,我们提出了一种新的基于监督的面部表情学习算法。该算法由局部非负矩阵分解(LNMF)算法衍生而来,是对非负矩阵分解(NMF)方法的扩展。我们将这种新提出的算法称为判别非负矩阵分解(DNMF)。给定一个图像数据库,这三种算法都将数据库分解为基图像及其对应的系数。对于每种方法,这种分解的计算方式是不同的。将分解结果应用于人脸图像,对六种基本面部表情进行识别。我们发现,与NMF和LNMF相比,我们的算法在实现更高的识别率方面表现出了卓越的性能
{"title":"A new sparse image representation algorithm applied to facial expression recognition","authors":"I. Buciu, Ioannis Pitas","doi":"10.1109/MLSP.2004.1423017","DOIUrl":"https://doi.org/10.1109/MLSP.2004.1423017","url":null,"abstract":"In this paper, we present a novel algorithm for learning facial expressions in a supervised manner. This algorithm is derived from the local non-negative matrix factorization (LNMF) algorithm, which is an extension of non-negative matrix factorization (NMF) method. We call this newly proposed algorithm discriminant non-negative matrix factorization (DNMF). Given an image database, all these three algorithms decompose the database into basis images and their corresponding coefficients. This decomposition is computed differently for each method. The decomposition results are applied on facial images for the recognition of the six basic facial expressions. We found that our algorithm shows superior performance by achieving a higher recognition rate, when compared to NMF and LNMF","PeriodicalId":70952,"journal":{"name":"信号处理","volume":"1 1","pages":"539-548"},"PeriodicalIF":0.0,"publicationDate":"2004-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78350701","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 80
期刊
信号处理
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1