首页 > 最新文献

2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)最新文献

英文 中文
Opportunistic spectrum access with temporal-spatial reuse in cognitive radio networks 认知无线电网络中时空复用的机会频谱接入
Yi Zhang, Wee Peng Tay, K. H. Li, M. Esseghir, D. Gaïti
We formulate and study a multi-user multi-armed bandit (MAB) problem that exploits the temporal-spatial reuse of primary user (PU) channels so that secondary users (SUs) who do not interfere with each other can make use of the same PU channel. We first propose a centralized channel allocation policy that has logarithmic regret, but requires a central processor to solve a NP-complete optimization problem at exponentially increasing time intervals. To avoid the high computation complexity at the central processor and the need for SU synchronization, we propose a heuristic distributed policy that incorporates channel access rank learning in a local procedure at each SU at the cost of a higher regret. We compare the performance of our proposed policies with other distributed policies recently proposed for opportunistic spectrum access. Simulations suggest that our proposed policies significantly outperform the benchmark algorithms when spectrum temporal-spatial reuse is allowed.
本文提出并研究了一种多用户多臂盗匪(MAB)问题,该问题利用主用户信道的时空复用性,使互不干扰的辅助用户能够利用同一主用户信道。我们首先提出了一个具有对数遗憾的集中式通道分配策略,但需要一个中央处理器以指数增长的时间间隔解决np完全优化问题。为了避免中央处理器的高计算复杂度和对SU同步的需求,我们提出了一种启发式分布式策略,该策略以较高的遗憾为代价,在每个SU的本地过程中结合通道访问排名学习。我们将我们提出的策略的性能与最近为机会性频谱接入提出的其他分布式策略进行了比较。仿真结果表明,在允许频谱时空复用的情况下,我们提出的策略明显优于基准算法。
{"title":"Opportunistic spectrum access with temporal-spatial reuse in cognitive radio networks","authors":"Yi Zhang, Wee Peng Tay, K. H. Li, M. Esseghir, D. Gaïti","doi":"10.1109/ICASSP.2016.7472360","DOIUrl":"https://doi.org/10.1109/ICASSP.2016.7472360","url":null,"abstract":"We formulate and study a multi-user multi-armed bandit (MAB) problem that exploits the temporal-spatial reuse of primary user (PU) channels so that secondary users (SUs) who do not interfere with each other can make use of the same PU channel. We first propose a centralized channel allocation policy that has logarithmic regret, but requires a central processor to solve a NP-complete optimization problem at exponentially increasing time intervals. To avoid the high computation complexity at the central processor and the need for SU synchronization, we propose a heuristic distributed policy that incorporates channel access rank learning in a local procedure at each SU at the cost of a higher regret. We compare the performance of our proposed policies with other distributed policies recently proposed for opportunistic spectrum access. Simulations suggest that our proposed policies significantly outperform the benchmark algorithms when spectrum temporal-spatial reuse is allowed.","PeriodicalId":165321,"journal":{"name":"2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"8 1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115275663","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A unified approach to the design of IIR and FIR notch filters IIR和FIR陷波滤波器的统一设计方法
Yi Jiang, Cong Shen, J. Dai
This paper presents a unified, optimization-driven solution for designing IIR and FIR notch filters with prescribed, possibly varying notch levels in the given stop-bands, and near unit magnitude frequency response at the pass-bands. Although the original IIR notch filter optimization problem is non-convex, we show that it can be well approximated by a convex problem, by replacing a non-positive semi-definite 2 × 2 Hermitian matrix with its nearest positive semi-definite counterpart. With this approach, the IIR filter design can be efficiently solved via Newton iteration. The same approach can be directly applied to the FIR filter design since it is a degenerated case of the IIR filter. Moreover, we show that the FIR design problem is convex and therefore can be solved optimally. Numerical examples are presented to verify the effectiveness of the proposed design.
本文提出了一种统一的、优化驱动的方案来设计IIR和FIR陷波滤波器,这些滤波器在给定的阻带中具有规定的、可能变化的陷波电平,并且在通带中具有接近单位幅度的频率响应。虽然原始的IIR陷波滤波器优化问题是非凸的,但我们证明了它可以很好地近似为凸问题,通过用其最接近的正半定对立物替换非正半定2 × 2厄米特矩阵。该方法可以通过牛顿迭代有效地解决IIR滤波器的设计问题。同样的方法可以直接应用于FIR滤波器设计,因为它是IIR滤波器的退化情况。此外,我们还证明了FIR设计问题是凸的,因此可以最优地解决。数值算例验证了所提设计的有效性。
{"title":"A unified approach to the design of IIR and FIR notch filters","authors":"Yi Jiang, Cong Shen, J. Dai","doi":"10.1109/ICASSP.2016.7472587","DOIUrl":"https://doi.org/10.1109/ICASSP.2016.7472587","url":null,"abstract":"This paper presents a unified, optimization-driven solution for designing IIR and FIR notch filters with prescribed, possibly varying notch levels in the given stop-bands, and near unit magnitude frequency response at the pass-bands. Although the original IIR notch filter optimization problem is non-convex, we show that it can be well approximated by a convex problem, by replacing a non-positive semi-definite 2 × 2 Hermitian matrix with its nearest positive semi-definite counterpart. With this approach, the IIR filter design can be efficiently solved via Newton iteration. The same approach can be directly applied to the FIR filter design since it is a degenerated case of the IIR filter. Moreover, we show that the FIR design problem is convex and therefore can be solved optimally. Numerical examples are presented to verify the effectiveness of the proposed design.","PeriodicalId":165321,"journal":{"name":"2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115405850","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Improving speech privacy in personal sound zones 改善个人声音区域的语音隐私
Jacob Donley, C. Ritz, W. Kleijn
This paper proposes two methods for providing speech privacy between spatial zones in anechoic and reverberant environments. The methods are based on masking the content leaked between regions. The masking is optimised to maximise the speech intelligibility contrast (SIC) between the zones. The first method uses a uniform masker signal that is combined with desired multizone loudspeaker signals and requires acoustic contrast between zones. The second method computes a space-time domain masker signal in parallel with the loudspeaker signals so that the combination of the two emphasises the spectral masking in the targeted quiet zone. Simulations show that it is possible to achieve a significant SIC in anechoic environments whilst maintaining speech quality in the bright zone.
本文提出了在消声和混响环境中提供空间区域间语音隐私的两种方法。这些方法是基于屏蔽区域间泄漏的内容。掩蔽优化,以最大限度地提高区域之间的语音清晰度对比度(SIC)。第一种方法使用均匀掩蔽信号,该信号与所需的多区域扬声器信号相结合,并要求区域之间的声学对比度。第二种方法与扬声器信号并行计算空时域掩模信号,使两者的组合强调目标安静区的频谱掩模。仿真结果表明,在消声环境中实现显著的SIC是可能的,同时在明亮区保持语音质量。
{"title":"Improving speech privacy in personal sound zones","authors":"Jacob Donley, C. Ritz, W. Kleijn","doi":"10.1109/ICASSP.2016.7471687","DOIUrl":"https://doi.org/10.1109/ICASSP.2016.7471687","url":null,"abstract":"This paper proposes two methods for providing speech privacy between spatial zones in anechoic and reverberant environments. The methods are based on masking the content leaked between regions. The masking is optimised to maximise the speech intelligibility contrast (SIC) between the zones. The first method uses a uniform masker signal that is combined with desired multizone loudspeaker signals and requires acoustic contrast between zones. The second method computes a space-time domain masker signal in parallel with the loudspeaker signals so that the combination of the two emphasises the spectral masking in the targeted quiet zone. Simulations show that it is possible to achieve a significant SIC in anechoic environments whilst maintaining speech quality in the bright zone.","PeriodicalId":165321,"journal":{"name":"2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115599814","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Exploring the role of phonetic bottleneck features for speaker and language recognition 探讨语音瓶颈特征在说话人和语言识别中的作用
Mitchell McLaren, L. Ferrer, A. Lawson
Using bottleneck features extracted from a deep neural network (DNN) trained to predict senone posteriors has resulted in new, state-of-the-art technology for language and speaker identification. For language identification, the features' dense phonetic information is believed to enable improved performance by better representing language-dependent phone distributions. For speaker recognition, the role of these features is less clear, given that a bottleneck layer near the DNN output layer is thought to contain limited speaker information. In this article, we analyze the role of bottleneck features in these identification tasks by varying the DNN layer from which they are extracted, under the hypothesis that speaker information is traded for dense phonetic information as the layer moves toward the DNN output layer. Experiments support this hypothesis under certain conditions, and highlight the benefit of using a bottleneck layer close to the DNN output layer when DNN training data is matched to the evaluation conditions, and a layer more central to the DNN otherwise.
使用从深度神经网络(DNN)中提取的瓶颈特征来预测senone后验,已经产生了新的,最先进的语言和说话者识别技术。对于语言识别,特征的密集语音信息被认为可以通过更好地表示依赖于语言的电话分布来提高性能。对于说话人识别,考虑到DNN输出层附近的瓶颈层被认为包含有限的说话人信息,这些特征的作用不太清楚。在本文中,我们通过改变从其提取的DNN层来分析瓶颈特征在这些识别任务中的作用,假设随着层向DNN输出层移动,说话人信息被交换为密集的语音信息。实验在一定条件下支持这一假设,并强调了当DNN训练数据与评估条件相匹配时,使用靠近DNN输出层的瓶颈层,而在其他情况下使用更靠近DNN中心的层的好处。
{"title":"Exploring the role of phonetic bottleneck features for speaker and language recognition","authors":"Mitchell McLaren, L. Ferrer, A. Lawson","doi":"10.1109/ICASSP.2016.7472744","DOIUrl":"https://doi.org/10.1109/ICASSP.2016.7472744","url":null,"abstract":"Using bottleneck features extracted from a deep neural network (DNN) trained to predict senone posteriors has resulted in new, state-of-the-art technology for language and speaker identification. For language identification, the features' dense phonetic information is believed to enable improved performance by better representing language-dependent phone distributions. For speaker recognition, the role of these features is less clear, given that a bottleneck layer near the DNN output layer is thought to contain limited speaker information. In this article, we analyze the role of bottleneck features in these identification tasks by varying the DNN layer from which they are extracted, under the hypothesis that speaker information is traded for dense phonetic information as the layer moves toward the DNN output layer. Experiments support this hypothesis under certain conditions, and highlight the benefit of using a bottleneck layer close to the DNN output layer when DNN training data is matched to the evaluation conditions, and a layer more central to the DNN otherwise.","PeriodicalId":165321,"journal":{"name":"2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115702089","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 44
Bayesian quickest detection with unknown post-change parameter 未知后变参数的贝叶斯最快检测
Jun Geng, L. Lai
In this paper, Bayesian quickest change-point detection problem with incomplete post-change information is considered. In particular, the observer knows that the post-change distribution belongs to a parametric distribution family, but he does not know the true value of the post-change parameter. Two problem formulations are considered in this paper. In the first formulation, we assume no additional prior information about the post-change parameter. In this case, the observer aims to design a detection algorithm to minimize the average (over the change-point) detection delay for all possible post-change parameters simultaneously subject to a worst case false alarm constraint. In the second formulation, we assume that there is a prior distribution on the possible value of the unknown parameter. For this case, we propose another formulation that minimizes the average (over both the change-point and the post-change parameter) detection delay subject to an average false alarm constraint. We propose a noval algorithm, which is termed as M-Shiryaev procedure, and show that the proposed algorithm is first order asymptotically optimal for both formulations considered in this paper.
研究了不完全变化后信息下的贝叶斯最快变化点检测问题。具体来说,观察者知道变化后的分布属于参数分布族,但他不知道变化后参数的真实值。本文考虑了两种问题的表述。在第一个公式中,我们假设没有关于变化后参数的额外先验信息。在这种情况下,观测器的目标是设计一种检测算法,使所有可能的后变化参数同时受到最坏情况虚警约束的平均(在变化点上)检测延迟最小化。在第二个公式中,我们假设未知参数的可能值有一个先验分布。对于这种情况,我们提出了另一种公式,该公式最小化受平均虚警约束的平均(在变化点和变化后参数上)检测延迟。我们提出了一种称为M-Shiryaev过程的新算法,并证明了该算法对于本文所考虑的两种公式都是一阶渐近最优的。
{"title":"Bayesian quickest detection with unknown post-change parameter","authors":"Jun Geng, L. Lai","doi":"10.1109/ICASSP.2016.7472462","DOIUrl":"https://doi.org/10.1109/ICASSP.2016.7472462","url":null,"abstract":"In this paper, Bayesian quickest change-point detection problem with incomplete post-change information is considered. In particular, the observer knows that the post-change distribution belongs to a parametric distribution family, but he does not know the true value of the post-change parameter. Two problem formulations are considered in this paper. In the first formulation, we assume no additional prior information about the post-change parameter. In this case, the observer aims to design a detection algorithm to minimize the average (over the change-point) detection delay for all possible post-change parameters simultaneously subject to a worst case false alarm constraint. In the second formulation, we assume that there is a prior distribution on the possible value of the unknown parameter. For this case, we propose another formulation that minimizes the average (over both the change-point and the post-change parameter) detection delay subject to an average false alarm constraint. We propose a noval algorithm, which is termed as M-Shiryaev procedure, and show that the proposed algorithm is first order asymptotically optimal for both formulations considered in this paper.","PeriodicalId":165321,"journal":{"name":"2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123094748","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Semi-autonomous data enrichment based on cross-task labelling of missing targets for holistic speech analysis 基于缺失目标跨任务标记的半自主数据充实,用于整体语音分析
Yue Zhang, Yuxiang Zhou, Jie Shen, Björn Schuller
In this work, we propose a novel approach for large-scale data enrichment, with the aim to address a major shortcoming of current research in computational paralinguistics, namely, looking at speaker attributes in isolation although strong interdependencies between them exist. The scarcity of multi-target databases, in which instances are labelled for different kinds of speaker characteristics, compounds this problem. The core idea of our work is to join existing data resources into one single holistic database with a multi-dimensional label space by using semi-supervised learning techniques to predict missing labels. In the proposed new Cross-Task Labelling (CTL) method, a model is first trained on the labelled training set of the selected databases for each individual task. Then, the trained classifiers are used for the crosslabelling of databases among each other. To exemplify the effectiveness of the `CTL' method, we evaluated it for likability, personality, and emotion recognition as representative tasks from the INTERSPEECH Computational Paralinguistics ChallengE (ComParE) series. The results show that `CTL' lays the foundation for holistic speech analysis by semi-autonomously annotating the existing databases, and expanding the multi-target label space at the same time, while achieving higher accuracy as the baseline performance of the challenges.
在这项工作中,我们提出了一种大规模数据丰富的新方法,旨在解决当前计算副语言学研究的一个主要缺点,即尽管它们之间存在很强的相互依赖性,但仍然孤立地研究说话人属性。多目标数据库的稀缺性使得这个问题更加复杂。在多目标数据库中,每个实例都被标记为不同类型的说话人特征。我们工作的核心思想是通过使用半监督学习技术来预测缺失的标签,将现有的数据资源加入到一个具有多维标签空间的单一整体数据库中。在提出的新的跨任务标记(CTL)方法中,首先对每个单独任务的选定数据库的标记训练集进行模型训练。然后,将训练好的分类器用于数据库之间的交叉标记。为了证明“CTL”方法的有效性,我们将其作为INTERSPEECH计算副语言学挑战(ComParE)系列的代表性任务,对其进行了可爱性、个性和情感识别的评估。结果表明,“CTL”通过对现有数据库进行半自主标注,同时扩展多目标标签空间,为整体语音分析奠定了基础,同时实现了更高的准确率作为挑战的基准性能。
{"title":"Semi-autonomous data enrichment based on cross-task labelling of missing targets for holistic speech analysis","authors":"Yue Zhang, Yuxiang Zhou, Jie Shen, Björn Schuller","doi":"10.1109/ICASSP.2016.7472847","DOIUrl":"https://doi.org/10.1109/ICASSP.2016.7472847","url":null,"abstract":"In this work, we propose a novel approach for large-scale data enrichment, with the aim to address a major shortcoming of current research in computational paralinguistics, namely, looking at speaker attributes in isolation although strong interdependencies between them exist. The scarcity of multi-target databases, in which instances are labelled for different kinds of speaker characteristics, compounds this problem. The core idea of our work is to join existing data resources into one single holistic database with a multi-dimensional label space by using semi-supervised learning techniques to predict missing labels. In the proposed new Cross-Task Labelling (CTL) method, a model is first trained on the labelled training set of the selected databases for each individual task. Then, the trained classifiers are used for the crosslabelling of databases among each other. To exemplify the effectiveness of the `CTL' method, we evaluated it for likability, personality, and emotion recognition as representative tasks from the INTERSPEECH Computational Paralinguistics ChallengE (ComParE) series. The results show that `CTL' lays the foundation for holistic speech analysis by semi-autonomously annotating the existing databases, and expanding the multi-target label space at the same time, while achieving higher accuracy as the baseline performance of the challenges.","PeriodicalId":165321,"journal":{"name":"2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"139 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114673256","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Stochastic thermodynamic integration: Efficient Bayesian model selection via stochastic gradient MCMC 随机热力学集成:基于随机梯度MCMC的高效贝叶斯模型选择
Umut Simsekli, R. Badeau, G. Richard, A. Cemgil
Model selection is a central topic in Bayesian machine learning, which requires the estimation of the marginal likelihood of the data under the models to be compared. During the last decade, conventional model selection methods have lost their charm as they have high computational requirements. In this study, we propose a computationally efficient model selection method by integrating ideas from Stochastic Gradient Markov Chain Monte Carlo (SG-MCMC) literature and statistical physics. As opposed to conventional methods, the proposed method has very low computational needs and can be implemented almost without modifying existing SG-MCMC code. We provide an upper-bound for the bias of the proposed method. Our experiments show that, our method is 40 times as fast as the baseline method on finding the optimal model order in a matrix factorization problem.
模型选择是贝叶斯机器学习的核心问题,它需要估计待比较模型下数据的边际似然。在过去的十年中,传统的模型选择方法由于计算量大而失去了吸引力。在这项研究中,我们提出了一种计算效率高的模型选择方法,该方法结合了随机梯度马尔可夫链蒙特卡罗(SG-MCMC)文献和统计物理的思想。与传统方法相比,该方法的计算量非常低,几乎不需要修改现有的SG-MCMC代码即可实现。我们为所提出的方法的偏差提供了一个上界。我们的实验表明,我们的方法在矩阵分解问题中找到最优模型阶数的速度是基线方法的40倍。
{"title":"Stochastic thermodynamic integration: Efficient Bayesian model selection via stochastic gradient MCMC","authors":"Umut Simsekli, R. Badeau, G. Richard, A. Cemgil","doi":"10.1109/ICASSP.2016.7472142","DOIUrl":"https://doi.org/10.1109/ICASSP.2016.7472142","url":null,"abstract":"Model selection is a central topic in Bayesian machine learning, which requires the estimation of the marginal likelihood of the data under the models to be compared. During the last decade, conventional model selection methods have lost their charm as they have high computational requirements. In this study, we propose a computationally efficient model selection method by integrating ideas from Stochastic Gradient Markov Chain Monte Carlo (SG-MCMC) literature and statistical physics. As opposed to conventional methods, the proposed method has very low computational needs and can be implemented almost without modifying existing SG-MCMC code. We provide an upper-bound for the bias of the proposed method. Our experiments show that, our method is 40 times as fast as the baseline method on finding the optimal model order in a matrix factorization problem.","PeriodicalId":165321,"journal":{"name":"2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116977596","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Fast adaptive PARAFAC decomposition algorithm with linear complexity 具有线性复杂度的快速自适应PARAFAC分解算法
V. Nguyen, K. Abed-Meraim, N. Linh-Trung
We present a fast adaptive PARAFAC decomposition algorithm with low computational complexity. The proposed algorithm generalizes the Orthonormal Projection Approximation Subspace Tracking (OPAST) approach for tracking a class of third-order tensors which have one dimension growing with time. It has linear complexity, good convergence rate and good estimation accuracy. To deal with large-scale problems, a parallel implementation can be applied to reduce both computational complexity and storage. We illustrate the effectiveness of our algorithm in comparison with the state-of-the-art algorithms through simulation experiments.
提出了一种计算复杂度低的快速自适应PARAFAC分解算法。针对一类一维随时间增长的三阶张量,推广了正交投影逼近子空间跟踪(OPAST)方法。它具有线性复杂度、良好的收敛速度和较好的估计精度。为了处理大规模问题,可以采用并行实现来降低计算复杂度和存储空间。我们通过仿真实验说明了我们的算法与最先进的算法的有效性。
{"title":"Fast adaptive PARAFAC decomposition algorithm with linear complexity","authors":"V. Nguyen, K. Abed-Meraim, N. Linh-Trung","doi":"10.1109/ICASSP.2016.7472876","DOIUrl":"https://doi.org/10.1109/ICASSP.2016.7472876","url":null,"abstract":"We present a fast adaptive PARAFAC decomposition algorithm with low computational complexity. The proposed algorithm generalizes the Orthonormal Projection Approximation Subspace Tracking (OPAST) approach for tracking a class of third-order tensors which have one dimension growing with time. It has linear complexity, good convergence rate and good estimation accuracy. To deal with large-scale problems, a parallel implementation can be applied to reduce both computational complexity and storage. We illustrate the effectiveness of our algorithm in comparison with the state-of-the-art algorithms through simulation experiments.","PeriodicalId":165321,"journal":{"name":"2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"720 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116980463","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Practical considerations on the use of preference learning for ranking emotional speech 使用偏好学习对情绪言语排序的实际考虑
Reza Lotfian, C. Busso
A speech emotion retrieval system aims to detect a subset of data with specific expressive content. Preference learning represents an appealing framework to rank speech samples in terms of continuous attributes such as arousal and valence. The training of ranking classifiers usually requires pairwise samples where one is preferred over the other according to a specific criterion. For emotional databases, these relative labels are not available and are very difficult to collect. As an alternative, they can be derived from existing absolute emotional labels. For continuous attributes, we can create relative rankings by forming pairs with high and low values of a specific attribute which are separated by a predefined margin. This approach raises questions about efficient approaches for building such a training set, which is important to improve the performance of the emotional retrieval system. This paper analyzes practical considerations in training ranking classifiers including optimum number of pairs used during training, and the margin used to define the relative labels. We compare the preference learning approach to binary classifier and regression models. The experimental results on a spontaneous emotional database indicate that a rank-based classifier with fine-tuned parameters outperforms the other two approaches in both arousal and valence dimensions.
语音情感检索系统旨在检测具有特定表达内容的数据子集。偏好学习代表了一种有吸引力的框架,可以根据唤醒和效价等连续属性对语音样本进行排序。排序分类器的训练通常需要成对样本,其中一个根据特定的标准优于另一个。对于情感数据库,这些相对标签是不可用的,并且很难收集。作为一种选择,它们可以从现有的绝对情感标签中衍生出来。对于连续属性,我们可以通过形成特定属性的高值和低值对来创建相对排名,这些值由预定义的边距分隔。这种方法提出了建立这种训练集的有效方法的问题,这对提高情感检索系统的性能很重要。本文分析了训练排序分类器的实际考虑因素,包括训练中使用的最优对数,以及用于定义相对标签的余量。我们将偏好学习方法与二元分类器和回归模型进行比较。在一个自发情绪数据库上的实验结果表明,具有微调参数的基于等级的分类器在唤醒和效价维度上都优于其他两种方法。
{"title":"Practical considerations on the use of preference learning for ranking emotional speech","authors":"Reza Lotfian, C. Busso","doi":"10.1109/ICASSP.2016.7472670","DOIUrl":"https://doi.org/10.1109/ICASSP.2016.7472670","url":null,"abstract":"A speech emotion retrieval system aims to detect a subset of data with specific expressive content. Preference learning represents an appealing framework to rank speech samples in terms of continuous attributes such as arousal and valence. The training of ranking classifiers usually requires pairwise samples where one is preferred over the other according to a specific criterion. For emotional databases, these relative labels are not available and are very difficult to collect. As an alternative, they can be derived from existing absolute emotional labels. For continuous attributes, we can create relative rankings by forming pairs with high and low values of a specific attribute which are separated by a predefined margin. This approach raises questions about efficient approaches for building such a training set, which is important to improve the performance of the emotional retrieval system. This paper analyzes practical considerations in training ranking classifiers including optimum number of pairs used during training, and the margin used to define the relative labels. We compare the preference learning approach to binary classifier and regression models. The experimental results on a spontaneous emotional database indicate that a rank-based classifier with fine-tuned parameters outperforms the other two approaches in both arousal and valence dimensions.","PeriodicalId":165321,"journal":{"name":"2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117129408","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 30
Random matrix based method for joint DOD and DOA estimation for large scale MIMO radar in non-Gaussian noise 基于随机矩阵的非高斯噪声条件下大规模MIMO雷达DOD和DOA联合估计方法
Hong Jiang, Yiwei Lu, Shunyou Yao
Traditional methods of target parameter estimation in MIMO radar are carried out under the assumption that the number of observations is much larger than the number of array elements. However, their estimation performance will decline for the MIMO radar with large arrays and insufficient observations. In this paper, we investigate the situation in bistatic MIMO radar that the product of the numbers of the transmit and receive elements and the number of observations grow at the same rate. We propose a robust method for joint direction-of-departure (DOD) and direction-of-arrival (DOA) estimation in non-Gaussian noise environment. The method uses robust M-estimator to form an estimate of the covariance matrix, and then applies random matrix theory (RMT) and polynomial rooting algorithm to receive accurate DOD and DOA estimates for large scale MIMO radar. The simulation results demonstrate the robustness and improvement in accuracy.
传统的MIMO雷达目标参数估计方法是在观测数远大于阵元数的前提下进行的。然而,对于阵列较大且观测量不足的MIMO雷达,其估计性能会下降。本文研究了双基地MIMO雷达中收发元数与观测数乘积以相同速率增长的情况。提出了一种非高斯噪声环境下出发方向和到达方向联合估计的鲁棒方法。该方法利用鲁棒m估计量形成协方差矩阵估计,然后利用随机矩阵理论(RMT)和多项式生根算法对大规模MIMO雷达进行精确的DOD和DOA估计。仿真结果证明了该方法的鲁棒性和精度的提高。
{"title":"Random matrix based method for joint DOD and DOA estimation for large scale MIMO radar in non-Gaussian noise","authors":"Hong Jiang, Yiwei Lu, Shunyou Yao","doi":"10.1109/ICASSP.2016.7472234","DOIUrl":"https://doi.org/10.1109/ICASSP.2016.7472234","url":null,"abstract":"Traditional methods of target parameter estimation in MIMO radar are carried out under the assumption that the number of observations is much larger than the number of array elements. However, their estimation performance will decline for the MIMO radar with large arrays and insufficient observations. In this paper, we investigate the situation in bistatic MIMO radar that the product of the numbers of the transmit and receive elements and the number of observations grow at the same rate. We propose a robust method for joint direction-of-departure (DOD) and direction-of-arrival (DOA) estimation in non-Gaussian noise environment. The method uses robust M-estimator to form an estimate of the covariance matrix, and then applies random matrix theory (RMT) and polynomial rooting algorithm to receive accurate DOD and DOA estimates for large scale MIMO radar. The simulation results demonstrate the robustness and improvement in accuracy.","PeriodicalId":165321,"journal":{"name":"2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117261281","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
期刊
2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1