首页 > 最新文献

2018 26th European Signal Processing Conference (EUSIPCO)最新文献

英文 中文
Multi-task Feature Learning for EEG-based Emotion Recognition Using Group Nonnegative Matrix Factorization 基于组非负矩阵分解的基于脑电图的情感识别多任务特征学习
Pub Date : 2018-09-01 DOI: 10.23919/EUSIPCO.2018.8553390
Ayoub Hajlaoui, M. Chetouani, S. Essid
Electroencephalographic sensors have proven to be promising for emotion recognition. Our study focuses on the recognition of valence and arousal levels using such sensors. Usually, ad hoc features are extracted for such recognition tasks. In this paper, we rely on automatic feature learning techniques instead. Our main contribution is the use of Group Nonnegative Matrix Factorization in a multi-task fashion, where we exploit both valence and arousal labels to control valence-related and arousal-related feature learning. Applying this method on HCI MAHNOB and EMOEEG, two databases where emotions are elicited by means of audiovisual stimuli and performing binary inter-session classification of valence labels, we obtain significant improvement of valence classification Fl scores in comparison to baseline frequency-band power features computed on predefined frequency bands. The valence classification F1 score is improved from 0.56 to 0.69 in the case of HCI MAHNOB, and from 0.56 to 0.59 in the case of EMOEEG.
脑电图传感器已被证明在情绪识别方面很有前途。我们的研究重点是利用这些传感器来识别效价和唤醒水平。通常,针对这类识别任务,需要提取特别的特征。在本文中,我们依赖于自动特征学习技术。我们的主要贡献是在多任务方式中使用组非负矩阵分解,其中我们利用价和唤醒标签来控制价相关和唤醒相关的特征学习。将该方法应用于HCI MAHNOB和EMOEEG这两个通过视听刺激引发情绪的数据库,并对价标签进行二元会话间分类,与在预定义频段上计算的基线频段功率特征相比,我们获得了价分类Fl分数的显着提高。HCI MAHNOB的价态分类F1评分从0.56提高到0.69,EMOEEG的价态分类F1评分从0.56提高到0.59。
{"title":"Multi-task Feature Learning for EEG-based Emotion Recognition Using Group Nonnegative Matrix Factorization","authors":"Ayoub Hajlaoui, M. Chetouani, S. Essid","doi":"10.23919/EUSIPCO.2018.8553390","DOIUrl":"https://doi.org/10.23919/EUSIPCO.2018.8553390","url":null,"abstract":"Electroencephalographic sensors have proven to be promising for emotion recognition. Our study focuses on the recognition of valence and arousal levels using such sensors. Usually, ad hoc features are extracted for such recognition tasks. In this paper, we rely on automatic feature learning techniques instead. Our main contribution is the use of Group Nonnegative Matrix Factorization in a multi-task fashion, where we exploit both valence and arousal labels to control valence-related and arousal-related feature learning. Applying this method on HCI MAHNOB and EMOEEG, two databases where emotions are elicited by means of audiovisual stimuli and performing binary inter-session classification of valence labels, we obtain significant improvement of valence classification Fl scores in comparison to baseline frequency-band power features computed on predefined frequency bands. The valence classification F1 score is improved from 0.56 to 0.69 in the case of HCI MAHNOB, and from 0.56 to 0.59 in the case of EMOEEG.","PeriodicalId":303069,"journal":{"name":"2018 26th European Signal Processing Conference (EUSIPCO)","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126652542","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Sparse Method for Tip-Timing Signals Analysis with Non Stationary Engine Rotation Frequency 发动机旋转频率非平稳时尖信号分析的稀疏方法
Pub Date : 2018-09-01 DOI: 10.23919/EUSIPCO.2018.8553254
A. Bouchain, A. Vercoutter, J. Picheral, A. Talon
Blades vibrations must be measured in operations to validate blade design. Tip-timing is one of the classical measurement methods but its main drawback is the generation of sub-sampled and non-uniform sampled signals. This paper presents a new sparse method for tip-timing spectral analysis that makes use of engine rotation variations. Assuming that blade vibration signals yield to line spectra, a sparse signal model is introduced as a linear system. The solution to the problem is obtained by ADMM (Alternating Direction Method of Multipliers) with a $p^{1}$ -regularization. Results for simulated and real signals are given to illustrate the efficiency of this method. The main advantages of the proposed method are to provide a fast solution and to take into account the variations of the rotation speed. Results show that this approach reduces frequency aliasings caused by the low sampling frequency of the measured signals.
在作业中必须测量叶片振动以验证叶片设计。Tip-timing是一种经典的测量方法,但其主要缺点是会产生次采样和非均匀采样信号。本文提出了一种利用发动机转速变化进行叶尖正时谱分析的稀疏方法。假设叶片振动信号服从线谱,将稀疏信号模型作为线性系统引入。利用乘法器的交替方向法(ADMM)进行p^{1}$ -正则化,得到了该问题的解。仿真结果和实际信号表明了该方法的有效性。该方法的主要优点是提供了一个快速的解决方案,并考虑了转速的变化。结果表明,该方法有效地降低了因被测信号采样频率过低而引起的频率混叠现象。
{"title":"Sparse Method for Tip-Timing Signals Analysis with Non Stationary Engine Rotation Frequency","authors":"A. Bouchain, A. Vercoutter, J. Picheral, A. Talon","doi":"10.23919/EUSIPCO.2018.8553254","DOIUrl":"https://doi.org/10.23919/EUSIPCO.2018.8553254","url":null,"abstract":"Blades vibrations must be measured in operations to validate blade design. Tip-timing is one of the classical measurement methods but its main drawback is the generation of sub-sampled and non-uniform sampled signals. This paper presents a new sparse method for tip-timing spectral analysis that makes use of engine rotation variations. Assuming that blade vibration signals yield to line spectra, a sparse signal model is introduced as a linear system. The solution to the problem is obtained by ADMM (Alternating Direction Method of Multipliers) with a $p^{1}$ -regularization. Results for simulated and real signals are given to illustrate the efficiency of this method. The main advantages of the proposed method are to provide a fast solution and to take into account the variations of the rotation speed. Results show that this approach reduces frequency aliasings caused by the low sampling frequency of the measured signals.","PeriodicalId":303069,"journal":{"name":"2018 26th European Signal Processing Conference (EUSIPCO)","volume":"490 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115535039","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Efficient Variance-Reduced Learning Over Multi-Agent Networks 基于多智能体网络的高效减方差学习
Pub Date : 2018-09-01 DOI: 10.23919/EUSIPCO.2018.8553100
K. Yuan, Bicheng Ying, A. H. Sayed
This work develops a fully decentralized variance-reduced learning algorithm for multi-agent networks where nodes store and process the data locally and are only allowed to communicate with their immediate neighbors. In the proposed algorithm, there is no need for a central or master unit while the objective is to enable the dispersed nodes to learn the exact global model despite their limited localized interactions. The resulting algorithm is shown to have low memory requirement, guaranteed linear convergence, robustness to failure of links or nodes and scalability to the network size. Moreover, the decentralized nature of the solution makes large-scale machine learning problems more tractable and also scalable since data is stored and processed locally at the nodes.
这项工作为多代理网络开发了一种完全分散的方差减少学习算法,其中节点在本地存储和处理数据,并且只允许与其近邻通信。在该算法中,不需要中心或主单元,而目标是使分散的节点能够在有限的局部相互作用下学习精确的全局模型。结果表明,该算法具有低内存需求,保证线性收敛,对链路或节点故障的鲁棒性以及对网络规模的可扩展性。此外,该解决方案的分散性使得大规模机器学习问题更易于处理和扩展,因为数据是在节点本地存储和处理的。
{"title":"Efficient Variance-Reduced Learning Over Multi-Agent Networks","authors":"K. Yuan, Bicheng Ying, A. H. Sayed","doi":"10.23919/EUSIPCO.2018.8553100","DOIUrl":"https://doi.org/10.23919/EUSIPCO.2018.8553100","url":null,"abstract":"This work develops a fully decentralized variance-reduced learning algorithm for multi-agent networks where nodes store and process the data locally and are only allowed to communicate with their immediate neighbors. In the proposed algorithm, there is no need for a central or master unit while the objective is to enable the dispersed nodes to learn the exact global model despite their limited localized interactions. The resulting algorithm is shown to have low memory requirement, guaranteed linear convergence, robustness to failure of links or nodes and scalability to the network size. Moreover, the decentralized nature of the solution makes large-scale machine learning problems more tractable and also scalable since data is stored and processed locally at the nodes.","PeriodicalId":303069,"journal":{"name":"2018 26th European Signal Processing Conference (EUSIPCO)","volume":"112 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115985980","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Multiscale DCNN Ensemble Applied to Human Activity Recognition Based on Wearable Sensors 多尺度DCNN集成在可穿戴传感器人体活动识别中的应用
Pub Date : 2018-09-01 DOI: 10.23919/EUSIPCO.2018.8553354
Jessica Sena, Jesimon Barreto Santos, W. R. Schwartz
Sensor-based Human Activity Recognition (HAR) provides valuable knowledge to many areas. Recently, wearable devices have gained space as a relevant source of data. However, there are two issues: large number of heterogeneous sensors available and the temporal nature of the sensor data. To handle those issues, we propose a multimodal approach that processes each sensor separately and, through an ensemble of Deep Convolution Neural Networks (DCNN), extracts information from multiple temporal scales of the sensor data. In this ensemble, we use a convolutional kernel with a different height for each DCNN. Considering that the number of rows in the sensor data reflects the data captured over time, each kernel height reflects a temporal scale from which we can extract patterns. Consequently, our approach is able to extract from simple movement patterns such as a wrist twist when picking up a spoon to complex movements such as the human gait. This multimodal and multitemporal approach outperforms previous state-of-the-art works in seven important datasets using two different protocols. In addition, we demonstrate that the use of our proposed set of kernels improves sensor-based HAR in another multi-kernel approach, the widely employed inception network.
基于传感器的人类活动识别(HAR)为许多领域提供了有价值的知识。最近,可穿戴设备作为一个相关的数据来源已经获得了空间。然而,有两个问题:大量的异构传感器和传感器数据的时间性质。为了解决这些问题,我们提出了一种多模态方法,该方法分别处理每个传感器,并通过深度卷积神经网络(DCNN)的集合,从传感器数据的多个时间尺度中提取信息。在这个集合中,我们为每个DCNN使用具有不同高度的卷积核。考虑到传感器数据中的行数反映了随时间捕获的数据,每个核高度反映了我们可以从中提取模式的时间尺度。因此,我们的方法能够从简单的运动模式(如拿起勺子时的手腕扭动)提取到复杂的运动(如人类的步态)。这种多模式和多时间的方法在使用两种不同协议的七个重要数据集中优于以前最先进的工作。此外,我们证明了使用我们提出的核集可以在另一种多核方法中改进基于传感器的HAR,即广泛使用的初始网络。
{"title":"Multiscale DCNN Ensemble Applied to Human Activity Recognition Based on Wearable Sensors","authors":"Jessica Sena, Jesimon Barreto Santos, W. R. Schwartz","doi":"10.23919/EUSIPCO.2018.8553354","DOIUrl":"https://doi.org/10.23919/EUSIPCO.2018.8553354","url":null,"abstract":"Sensor-based Human Activity Recognition (HAR) provides valuable knowledge to many areas. Recently, wearable devices have gained space as a relevant source of data. However, there are two issues: large number of heterogeneous sensors available and the temporal nature of the sensor data. To handle those issues, we propose a multimodal approach that processes each sensor separately and, through an ensemble of Deep Convolution Neural Networks (DCNN), extracts information from multiple temporal scales of the sensor data. In this ensemble, we use a convolutional kernel with a different height for each DCNN. Considering that the number of rows in the sensor data reflects the data captured over time, each kernel height reflects a temporal scale from which we can extract patterns. Consequently, our approach is able to extract from simple movement patterns such as a wrist twist when picking up a spoon to complex movements such as the human gait. This multimodal and multitemporal approach outperforms previous state-of-the-art works in seven important datasets using two different protocols. In addition, we demonstrate that the use of our proposed set of kernels improves sensor-based HAR in another multi-kernel approach, the widely employed inception network.","PeriodicalId":303069,"journal":{"name":"2018 26th European Signal Processing Conference (EUSIPCO)","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116068641","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Consistent Spectral Methods for Dimensionality Reduction 降维的一致光谱方法
Pub Date : 2018-09-01 DOI: 10.23919/EUSIPCO.2018.8553295
M. Kharouf, Tabea Rebafka, Nataliya Sokolovska
This paper addresses the problem of dimension reduction of noisy data, more precisely the challenge to determine the dimension of the subspace where the observed signal lives in. Based on results from random matrix theory, two novel estimators of the signal dimension are proposed in this paper. Consistency of the estimators is proved in the modern asymptotic regime, where the number of parameters grows proportionally with the sample size. Experimental results show that the novel estimators are robust to noise and, moreover, they give highly accurate results in settings where standard methods fail. We apply the novel dimension estimators to several life sciences benchmarks in the context of classification, and illustrate the improvements achieved by the new methods compared to the state-of-the-art approaches.
本文解决了噪声数据的降维问题,更准确地说,是确定观测信号所在子空间的维数的挑战。基于随机矩阵理论的结果,提出了两种新的信号维数估计方法。在现代渐近状态下,参数数量随样本容量成比例增长,证明了估计量的相合性。实验结果表明,新的估计器对噪声具有较强的鲁棒性,而且在标准方法无法实现的情况下给出了较高的精度。我们将新的维估计器应用于分类背景下的几个生命科学基准,并说明了与最先进的方法相比,新方法所取得的改进。
{"title":"Consistent Spectral Methods for Dimensionality Reduction","authors":"M. Kharouf, Tabea Rebafka, Nataliya Sokolovska","doi":"10.23919/EUSIPCO.2018.8553295","DOIUrl":"https://doi.org/10.23919/EUSIPCO.2018.8553295","url":null,"abstract":"This paper addresses the problem of dimension reduction of noisy data, more precisely the challenge to determine the dimension of the subspace where the observed signal lives in. Based on results from random matrix theory, two novel estimators of the signal dimension are proposed in this paper. Consistency of the estimators is proved in the modern asymptotic regime, where the number of parameters grows proportionally with the sample size. Experimental results show that the novel estimators are robust to noise and, moreover, they give highly accurate results in settings where standard methods fail. We apply the novel dimension estimators to several life sciences benchmarks in the context of classification, and illustrate the improvements achieved by the new methods compared to the state-of-the-art approaches.","PeriodicalId":303069,"journal":{"name":"2018 26th European Signal Processing Conference (EUSIPCO)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122380613","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Novel Method for Topological Embedding of Time-Series Data 一种新的时间序列数据拓扑嵌入方法
Pub Date : 2018-09-01 DOI: 10.23919/EUSIPCO.2018.8553502
Sean M. Kennedy, J. Roth, James W. Scrofani
In this paper, we propose a novel method for embedding one-dimensional, periodic time-series data into higher-dimensional topological spaces to support robust recovery of signal features via topological data analysis under noisy sampling conditions. Our method can be considered an extension of the popular time delay embedding method to a larger class of linear operators. To provide evidence for the viability of this method, we analyze the simple case of sinusoidal data in three steps. First, we discuss some of the drawbacks of the time delay embedding framework in the context of periodic, sinusoidal data. Next, we show analytically that using the Hilbert transform as an alternative embedding function for sinusoidal data overcomes these drawbacks. Finally, we provide empirical evidence of the viability of the Hilbert transform as an embedding function when the parameters of the sinusoidal data vary over time.
在本文中,我们提出了一种新的方法,将一维周期时间序列数据嵌入到高维拓扑空间中,以支持在噪声采样条件下通过拓扑数据分析对信号特征进行鲁棒恢复。我们的方法可以看作是流行的时间延迟嵌入方法的扩展,适用于更大的线性算子类。为了证明这种方法的可行性,我们分三步分析了正弦数据的简单情况。首先,我们讨论了时间延迟嵌入框架在周期性正弦数据背景下的一些缺点。接下来,我们分析地表明,使用希尔伯特变换作为正弦数据的替代嵌入函数克服了这些缺点。最后,我们提供了经验证据,证明当正弦数据的参数随时间变化时,希尔伯特变换作为嵌入函数的可行性。
{"title":"A Novel Method for Topological Embedding of Time-Series Data","authors":"Sean M. Kennedy, J. Roth, James W. Scrofani","doi":"10.23919/EUSIPCO.2018.8553502","DOIUrl":"https://doi.org/10.23919/EUSIPCO.2018.8553502","url":null,"abstract":"In this paper, we propose a novel method for embedding one-dimensional, periodic time-series data into higher-dimensional topological spaces to support robust recovery of signal features via topological data analysis under noisy sampling conditions. Our method can be considered an extension of the popular time delay embedding method to a larger class of linear operators. To provide evidence for the viability of this method, we analyze the simple case of sinusoidal data in three steps. First, we discuss some of the drawbacks of the time delay embedding framework in the context of periodic, sinusoidal data. Next, we show analytically that using the Hilbert transform as an alternative embedding function for sinusoidal data overcomes these drawbacks. Finally, we provide empirical evidence of the viability of the Hilbert transform as an embedding function when the parameters of the sinusoidal data vary over time.","PeriodicalId":303069,"journal":{"name":"2018 26th European Signal Processing Conference (EUSIPCO)","volume":"61 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122999181","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
High-Order CPD Estimation with Dimensionality Reduction Using a Tensor Train Model 基于张量序列模型的高阶CPD降维估计
Pub Date : 2018-09-01 DOI: 10.23919/EUSIPCO.2018.8553466
Yassine Zniyed, R. Boyer, A. Almeida, G. Favier
The canonical polyadic decomposition (CPD) is one of the most popular tensor-based analysis tools due to its usefulness in numerous fields of application. The Q-order CPD is parametrized by $Q$ matrices also called factors which have to be recovered. The factors estimation is usually carried out by means of the alternating least squares (ALS) algorithm. In the context of multi-modal big data analysis, i.e., large order $(Q)$ and dimensions, the ALS algorithm has two main drawbacks. Firstly, its convergence is generally slow and may fail, in particular for large values of $Q$, and secondly it is highly time consuming. In this paper, it is proved that a Q-order CPD of rank-R is equivalent to a train of $Q$ 3-order CPD(s) of rank-R. In other words, each tensor train (TT)-core admits a 3-order CPD of rank-R. Based on the structure of the TT-cores, a new dimensionality reduction and factor retrieval scheme is derived. The proposed method has a better robustness to noise with a smaller computational cost than the ALS algorithm.
典型多进分解(CPD)由于其在许多领域的应用而成为最流行的基于张量的分析工具之一。Q阶CPD由$Q$矩阵参数化,也称为必须恢复的因子。因子估计通常采用交替最小二乘(ALS)算法进行。在多模态大数据分析的背景下,即大阶$(Q)$和维度,ALS算法有两个主要的缺点。首先,它的收敛速度通常很慢,可能会失败,特别是对于较大的$Q$,其次,它非常耗时。本文证明了秩为- r的Q阶CPD等价于秩为- r的$Q$ 3阶CPD序列。也就是说,每个张量列(TT)核都有一个秩为r的3阶CPD。基于tt -核的结构,提出了一种新的降维和因子检索方案。该方法对噪声具有更好的鲁棒性,且计算量比ALS算法小。
{"title":"High-Order CPD Estimation with Dimensionality Reduction Using a Tensor Train Model","authors":"Yassine Zniyed, R. Boyer, A. Almeida, G. Favier","doi":"10.23919/EUSIPCO.2018.8553466","DOIUrl":"https://doi.org/10.23919/EUSIPCO.2018.8553466","url":null,"abstract":"The canonical polyadic decomposition (CPD) is one of the most popular tensor-based analysis tools due to its usefulness in numerous fields of application. The Q-order CPD is parametrized by $Q$ matrices also called factors which have to be recovered. The factors estimation is usually carried out by means of the alternating least squares (ALS) algorithm. In the context of multi-modal big data analysis, i.e., large order $(Q)$ and dimensions, the ALS algorithm has two main drawbacks. Firstly, its convergence is generally slow and may fail, in particular for large values of $Q$, and secondly it is highly time consuming. In this paper, it is proved that a Q-order CPD of rank-R is equivalent to a train of $Q$ 3-order CPD(s) of rank-R. In other words, each tensor train (TT)-core admits a 3-order CPD of rank-R. Based on the structure of the TT-cores, a new dimensionality reduction and factor retrieval scheme is derived. The proposed method has a better robustness to noise with a smaller computational cost than the ALS algorithm.","PeriodicalId":303069,"journal":{"name":"2018 26th European Signal Processing Conference (EUSIPCO)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123020158","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Online Parametric NMF for Speech Enhancement 语音增强的在线参数NMF
Pub Date : 2018-09-01 DOI: 10.23919/EUSIPCO.2018.8553039
Mathew Shaji Kavalekalam, J. Nielsen, Liming Shi, M. G. Christensen, J. Boldt
In this paper, we propose a speech enhancement method based on non-negative matrix factorization (NMF) techniques. NMF techniques allow us to approximate the power spectral density (PSD) of the noisy signal as a weighted linear combination of trained speech and noise basis vectors arranged as the columns of a matrix. In this work, we propose to use basis vectors that are parameterised by autoregressive (AR) coefficients. Parametric representation of the spectral basis is beneficial as it can encompass the signal characteristics like, e.g. the speech production model. It is observed that the parametric representation of basis vectors is beneficial while performing online speech enhancement in low delay scenarios.
本文提出一种基于非负矩阵分解(NMF)技术的语音增强方法。NMF技术允许我们将噪声信号的功率谱密度(PSD)近似为经过训练的语音和噪声基向量的加权线性组合,这些基向量排列为矩阵的列。在这项工作中,我们建议使用由自回归(AR)系数参数化的基向量。谱基的参数化表示是有益的,因为它可以包含信号特征,例如语音产生模型。观察到,在低延迟场景下,基向量的参数化表示有利于在线语音增强。
{"title":"Online Parametric NMF for Speech Enhancement","authors":"Mathew Shaji Kavalekalam, J. Nielsen, Liming Shi, M. G. Christensen, J. Boldt","doi":"10.23919/EUSIPCO.2018.8553039","DOIUrl":"https://doi.org/10.23919/EUSIPCO.2018.8553039","url":null,"abstract":"In this paper, we propose a speech enhancement method based on non-negative matrix factorization (NMF) techniques. NMF techniques allow us to approximate the power spectral density (PSD) of the noisy signal as a weighted linear combination of trained speech and noise basis vectors arranged as the columns of a matrix. In this work, we propose to use basis vectors that are parameterised by autoregressive (AR) coefficients. Parametric representation of the spectral basis is beneficial as it can encompass the signal characteristics like, e.g. the speech production model. It is observed that the parametric representation of basis vectors is beneficial while performing online speech enhancement in low delay scenarios.","PeriodicalId":303069,"journal":{"name":"2018 26th European Signal Processing Conference (EUSIPCO)","volume":"12 7","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114128467","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Collaborative Speech Dereverberation: Regularized Tensor Factorization for Crowdsourced Multi-Channel Recordings 协同语音去噪:众包多声道录音的正则化张量分解
Pub Date : 2018-09-01 DOI: 10.23919/EUSIPCO.2018.8553565
Sanna Wager, Minje Kim
We propose a regularized nonnegative tensor factorization (NTF) model for multi-channel speech derestriction that incorporates prior knowledge about clean speech. The approach models the problem as recovering a signal convolved with different room impulse responses, allowing the dereverberation problem to benefit from microphone arrays. The factorization learns both individual reverberation filters and channel-specific delays, which makes it possible to employ an ad-hoc microphone array with heterogeneous sensors (such as multi-channel recordings by a crowd) even if they are not synchronized. We integrate two prior-knowledge regularization schemes to increase the stability of dereverberation performance. First, a Nonnegative Matrix Factorization (NMF) inner routine is introduced to inform the original NTF problem of the pre-trained clean speech basis vectors, so that the optimization process can focus on estimating their activations rather than the whole clean speech spectra. Second, the NMF activation matrix is further regularized to take on characteristics of dry signals using sparsity and smoothness constraints. Empirical dereverberation results on different simulated reverberation setups show that the prior-knowledge regularization schemes improve both recovered sound quality and speech intelligibility compared to a baseline NTF approach.
我们提出了一种正则化非负张量分解(NTF)模型用于多通道语音去约束,该模型结合了关于干净语音的先验知识。该方法将问题建模为恢复与不同房间脉冲响应卷积的信号,从而使去混响问题受益于麦克风阵列。分解学习了单独的混响滤波器和通道特定的延迟,这使得使用具有异构传感器的特设麦克风阵列(例如人群的多通道录音)成为可能,即使它们不同步。我们整合了两种先验知识正则化方案来提高去噪性能的稳定性。首先,引入非负矩阵分解(non - negative Matrix Factorization, NMF)内部例程,将预先训练好的干净语音基向量告知原始NTF问题,使优化过程可以专注于估计它们的激活,而不是整个干净语音谱。其次,利用稀疏性和平滑性约束进一步正则化NMF激活矩阵,使其具有干信号的特征。在不同模拟混响设置下的经验去噪结果表明,与基线NTF方法相比,先验知识正则化方案提高了恢复的音质和语音可理解性。
{"title":"Collaborative Speech Dereverberation: Regularized Tensor Factorization for Crowdsourced Multi-Channel Recordings","authors":"Sanna Wager, Minje Kim","doi":"10.23919/EUSIPCO.2018.8553565","DOIUrl":"https://doi.org/10.23919/EUSIPCO.2018.8553565","url":null,"abstract":"We propose a regularized nonnegative tensor factorization (NTF) model for multi-channel speech derestriction that incorporates prior knowledge about clean speech. The approach models the problem as recovering a signal convolved with different room impulse responses, allowing the dereverberation problem to benefit from microphone arrays. The factorization learns both individual reverberation filters and channel-specific delays, which makes it possible to employ an ad-hoc microphone array with heterogeneous sensors (such as multi-channel recordings by a crowd) even if they are not synchronized. We integrate two prior-knowledge regularization schemes to increase the stability of dereverberation performance. First, a Nonnegative Matrix Factorization (NMF) inner routine is introduced to inform the original NTF problem of the pre-trained clean speech basis vectors, so that the optimization process can focus on estimating their activations rather than the whole clean speech spectra. Second, the NMF activation matrix is further regularized to take on characteristics of dry signals using sparsity and smoothness constraints. Empirical dereverberation results on different simulated reverberation setups show that the prior-knowledge regularization schemes improve both recovered sound quality and speech intelligibility compared to a baseline NTF approach.","PeriodicalId":303069,"journal":{"name":"2018 26th European Signal Processing Conference (EUSIPCO)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114132204","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Classification Asymptotics in the Random Matrix Regime 随机矩阵域的分类渐近性
Pub Date : 2018-09-01 DOI: 10.23919/EUSIPCO.2018.8553034
Romain Couillet, Zhenyu Liao, Xiaoyi Mai
This article discusses the asymptotic performance of classical machine learning classification methods (from discriminant analysis to neural networks) for simultaneously large and numerous Gaussian mixture modelled data. We first provide theoretical bounds on the minimally discriminable class means and covariances under an oracle setting, which are then compared to recent theoretical findings on the performance of machine learning. Non-obvious phenomena are discussed, among which surprising phase transitions in the optimal performance rates for specific hyperparameter settings.
本文讨论了经典机器学习分类方法(从判别分析到神经网络)对同时大量和大量高斯混合建模数据的渐近性能。我们首先提供了在oracle设置下的最小可判别类均值和协方差的理论界限,然后将其与最近关于机器学习性能的理论发现进行比较。讨论了非明显的现象,其中在特定的超参数设置下,最优性能率的惊人相变。
{"title":"Classification Asymptotics in the Random Matrix Regime","authors":"Romain Couillet, Zhenyu Liao, Xiaoyi Mai","doi":"10.23919/EUSIPCO.2018.8553034","DOIUrl":"https://doi.org/10.23919/EUSIPCO.2018.8553034","url":null,"abstract":"This article discusses the asymptotic performance of classical machine learning classification methods (from discriminant analysis to neural networks) for simultaneously large and numerous Gaussian mixture modelled data. We first provide theoretical bounds on the minimally discriminable class means and covariances under an oracle setting, which are then compared to recent theoretical findings on the performance of machine learning. Non-obvious phenomena are discussed, among which surprising phase transitions in the optimal performance rates for specific hyperparameter settings.","PeriodicalId":303069,"journal":{"name":"2018 26th European Signal Processing Conference (EUSIPCO)","volume":"93 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114599429","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
期刊
2018 26th European Signal Processing Conference (EUSIPCO)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1