首页 > 最新文献

2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)最新文献

英文 中文
Cooperative joint synchronization and localization using time delay measurements 基于时延测量的协同联合同步与定位
H. Naseri, V. Koivunen
In this paper a novel algorithm is proposed for joint synchronization and localization in ad hoc networks. The proposed algorithm is based on broadcast messaging, with number of messages linear to the number of nodes, versus quadratic for techniques based on two-way message exchange. The identifiability of network synchronization problem is improved by introducing localization constraints. Hence, the proposed algorithm does not require a full set of measurements. Numerical results are provided using a model based on wireless LAN specifications. In scenarios with missing data, the proposed algorithm significantly improves synchronization and localization performance compared to commonly used techniques.
本文提出了一种新的自组网联合同步与定位算法。所提出的算法基于广播消息传递,消息数量与节点数量成线性关系,而基于双向消息交换的技术是二次型的。通过引入定位约束,提高了网络同步问题的可识别性。因此,该算法不需要完整的测量集。利用基于无线局域网规范的模型给出了数值结果。在数据缺失的情况下,与常用的同步和定位技术相比,该算法显著提高了同步和定位性能。
{"title":"Cooperative joint synchronization and localization using time delay measurements","authors":"H. Naseri, V. Koivunen","doi":"10.1109/ICASSP.2016.7472257","DOIUrl":"https://doi.org/10.1109/ICASSP.2016.7472257","url":null,"abstract":"In this paper a novel algorithm is proposed for joint synchronization and localization in ad hoc networks. The proposed algorithm is based on broadcast messaging, with number of messages linear to the number of nodes, versus quadratic for techniques based on two-way message exchange. The identifiability of network synchronization problem is improved by introducing localization constraints. Hence, the proposed algorithm does not require a full set of measurements. Numerical results are provided using a model based on wireless LAN specifications. In scenarios with missing data, the proposed algorithm significantly improves synchronization and localization performance compared to commonly used techniques.","PeriodicalId":165321,"journal":{"name":"2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125636596","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Graph signal recovery from incomplete and noisy information using approximate message passing 利用近似消息传递从不完全和噪声信息中恢复图形信号
Gita Babazadeh Eslamlou, A. Jung, N. Goertz, M. Fereydooni
We consider the problem of recovering a graph signal from noisy and incomplete information. In particular, we propose an approximate message passing based iterative method for graph signal recovery. The recovery of the graph signal is based on noisy signal values at a small number of randomly selected nodes. Our approach exploits the smoothness of typical graph signals occurring in many applications, such as wireless sensor networks or social network analysis. The graph signals are smooth in the sense that neighboring nodes have similar signal values. Methodologically, our algorithm is a new instance of the denoising based approximate message passing framework introduced recently by Metzler et. al. We validate the performance of the proposed recovery method via numerical experiments. In certain scenarios our algorithm outperforms existing methods.
研究了从噪声和不完全信息中恢复图信号的问题。特别地,我们提出了一种基于近似消息传递的迭代图信号恢复方法。图信号的恢复是基于少量随机选择的节点上的噪声信号值。我们的方法利用了许多应用中出现的典型图形信号的平滑性,例如无线传感器网络或社交网络分析。图信号是平滑的,因为相邻节点具有相似的信号值。在方法上,我们的算法是Metzler等人最近引入的基于去噪的近似消息传递框架的新实例。我们通过数值实验验证了所提出的恢复方法的性能。在某些情况下,我们的算法优于现有的方法。
{"title":"Graph signal recovery from incomplete and noisy information using approximate message passing","authors":"Gita Babazadeh Eslamlou, A. Jung, N. Goertz, M. Fereydooni","doi":"10.1109/ICASSP.2016.7472863","DOIUrl":"https://doi.org/10.1109/ICASSP.2016.7472863","url":null,"abstract":"We consider the problem of recovering a graph signal from noisy and incomplete information. In particular, we propose an approximate message passing based iterative method for graph signal recovery. The recovery of the graph signal is based on noisy signal values at a small number of randomly selected nodes. Our approach exploits the smoothness of typical graph signals occurring in many applications, such as wireless sensor networks or social network analysis. The graph signals are smooth in the sense that neighboring nodes have similar signal values. Methodologically, our algorithm is a new instance of the denoising based approximate message passing framework introduced recently by Metzler et. al. We validate the performance of the proposed recovery method via numerical experiments. In certain scenarios our algorithm outperforms existing methods.","PeriodicalId":165321,"journal":{"name":"2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131955095","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Linearly augmented deep neural network 线性增强深度神经网络
Pegah Ghahremani, J. Droppo, M. Seltzer
Deep neural networks (DNN) are a powerful tool for many large vocabulary continuous speech recognition (LVCSR) tasks. Training a very deep network is a challenging problem and pre-training techniques are needed in order to achieve the best results. In this paper, we propose a new type of network architecture, Linear Augmented Deep Neural Network (LA-DNN). This type of network augments each non-linear layer with a linear connection from layer input to layer output. The resulting LA-DNN model eliminates the need for pre-training, addresses the gradient vanishing problem for deep networks, has higher capacity in modeling linear transformations, trains significantly faster than normal DNN, and produces better acoustic models. The proposed model has been evaluated on TIMIT phoneme recognition and AMI speech recognition tasks. Experimental results show that the LA-DNN models can have 70% fewer parameters than a DNN, while still improving accuracy. On the TIMIT phoneme recognition task, the smaller LA-DNN model improves TIMIT phone accuracy by 2% absolute, and AMI word accuracy by 1.7% absolute.
深度神经网络(DNN)是解决大词汇量连续语音识别(LVCSR)任务的有力工具。训练一个非常深的网络是一个具有挑战性的问题,为了达到最佳效果,需要预训练技术。在本文中,我们提出了一种新的网络结构,线性增强深度神经网络(LA-DNN)。这种类型的网络通过从层输入到层输出的线性连接来增强每个非线性层。得到的LA-DNN模型消除了预训练的需要,解决了深度网络的梯度消失问题,具有更高的线性变换建模能力,训练速度明显快于普通DNN,并且产生更好的声学模型。在TIMIT音素识别和AMI语音识别任务中对该模型进行了评价。实验结果表明,与DNN模型相比,LA-DNN模型的参数可以减少70%,同时仍能提高准确率。在TIMIT音素识别任务上,较小的LA-DNN模型将TIMIT电话的绝对准确率提高了2%,AMI单词的绝对准确率提高了1.7%。
{"title":"Linearly augmented deep neural network","authors":"Pegah Ghahremani, J. Droppo, M. Seltzer","doi":"10.1109/ICASSP.2016.7472646","DOIUrl":"https://doi.org/10.1109/ICASSP.2016.7472646","url":null,"abstract":"Deep neural networks (DNN) are a powerful tool for many large vocabulary continuous speech recognition (LVCSR) tasks. Training a very deep network is a challenging problem and pre-training techniques are needed in order to achieve the best results. In this paper, we propose a new type of network architecture, Linear Augmented Deep Neural Network (LA-DNN). This type of network augments each non-linear layer with a linear connection from layer input to layer output. The resulting LA-DNN model eliminates the need for pre-training, addresses the gradient vanishing problem for deep networks, has higher capacity in modeling linear transformations, trains significantly faster than normal DNN, and produces better acoustic models. The proposed model has been evaluated on TIMIT phoneme recognition and AMI speech recognition tasks. Experimental results show that the LA-DNN models can have 70% fewer parameters than a DNN, while still improving accuracy. On the TIMIT phoneme recognition task, the smaller LA-DNN model improves TIMIT phone accuracy by 2% absolute, and AMI word accuracy by 1.7% absolute.","PeriodicalId":165321,"journal":{"name":"2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"64 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121585491","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Language model adaptation for ASR of spoken translations using phrase-based translation models and named entity models 使用基于短语的翻译模型和命名实体模型对口语翻译进行语言模型适配
Joris Pelemans, Tom Vanallemeersch, Kris Demuynck, Lyan Verwimp, H. V. hamme, P. Wambacq
Language model adaptation based on Machine Translation (MT) is a recently proposed approach to improve the Automatic Speech Recognition (ASR) of spoken translations that does not suffer from a common problem in approaches based on rescoring i.e. errors made during recognition cannot be recovered by the MT system. In previous work we presented an efficient implementation for MT-based language model adaptation using a word-based translation model. By omitting renormalization and employing weighted updates, the implementation exhibited virtually no adaptation overhead, enabling its use in a real-time setting. In this paper we investigate whether we can improve recognition accuracy without sacrificing the achieved efficiency. More precisely, we investigate the effect of both state-of-the-art phrase-based translation models and named entity probability estimation. We report relative WER reductions of 6.2% over a word-based LM adaptation technique and 25.3% over an unadapted 3-gram baseline on an English-to-Dutch dataset.
基于机器翻译(MT)的语言模型自适应是最近提出的一种改进口语翻译自动语音识别(ASR)的方法,该方法不存在基于评分方法的常见问题,即在识别过程中出现的错误不能被机器翻译系统恢复。在之前的工作中,我们提出了一种使用基于词的翻译模型的基于mt的语言模型自适应的有效实现。通过省略重新规范化和使用加权更新,该实现几乎没有适应性开销,使其能够在实时设置中使用。本文主要研究如何在不牺牲现有效率的前提下提高识别精度。更准确地说,我们研究了最先进的基于短语的翻译模型和命名实体概率估计的效果。我们报告说,在英语到荷兰语数据集上,与基于单词的LM适应技术相比,相对降低了6.2%,与未适应的3克基线相比,降低了25.3%。
{"title":"Language model adaptation for ASR of spoken translations using phrase-based translation models and named entity models","authors":"Joris Pelemans, Tom Vanallemeersch, Kris Demuynck, Lyan Verwimp, H. V. hamme, P. Wambacq","doi":"10.1109/ICASSP.2016.7472826","DOIUrl":"https://doi.org/10.1109/ICASSP.2016.7472826","url":null,"abstract":"Language model adaptation based on Machine Translation (MT) is a recently proposed approach to improve the Automatic Speech Recognition (ASR) of spoken translations that does not suffer from a common problem in approaches based on rescoring i.e. errors made during recognition cannot be recovered by the MT system. In previous work we presented an efficient implementation for MT-based language model adaptation using a word-based translation model. By omitting renormalization and employing weighted updates, the implementation exhibited virtually no adaptation overhead, enabling its use in a real-time setting. In this paper we investigate whether we can improve recognition accuracy without sacrificing the achieved efficiency. More precisely, we investigate the effect of both state-of-the-art phrase-based translation models and named entity probability estimation. We report relative WER reductions of 6.2% over a word-based LM adaptation technique and 25.3% over an unadapted 3-gram baseline on an English-to-Dutch dataset.","PeriodicalId":165321,"journal":{"name":"2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123429588","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Graph filter banks with M-channels, maximal decimation, and perfect reconstruction 图滤波器组与m通道,最大抽取,和完美的重建
Oguzhan Teke, P. Vaidyanathan
Signal processing on graphs finds applications in many areas. Motivated by recent developments, this paper studies the concept of spectrum folding (aliasing) for graph signals under the downsample-then-upsample operation. In this development, we use a special eigenvector structure that is unique to the adjacency matrix of M-block cyclic matrices. We then introduce M-channel maximally decimated filter banks. Manipulating the characteristics of the aliasing effect, we construct polynomial filter banks with perfect reconstruction property. Later we describe how we can remove the eigenvector condition by using a generalized decimator. In this study graphs are assumed to be general with a possibly non-symmetric and complex adjacency matrix.
图形上的信号处理在许多领域都有应用。受最新研究进展的启发,本文研究了下采样-上采样操作下图信号的频谱折叠(混叠)概念。在这个发展中,我们使用了一个特殊的特征向量结构,它是m块循环矩阵邻接矩阵所独有的。然后我们引入m通道最大抽取滤波器组。利用混叠效应的特点,构造了具有良好重构性能的多项式滤波器组。稍后我们将描述如何使用广义十进制数来去除特征向量条件。在本研究中,假设图具有可能非对称的复杂邻接矩阵。
{"title":"Graph filter banks with M-channels, maximal decimation, and perfect reconstruction","authors":"Oguzhan Teke, P. Vaidyanathan","doi":"10.1109/ICASSP.2016.7472446","DOIUrl":"https://doi.org/10.1109/ICASSP.2016.7472446","url":null,"abstract":"Signal processing on graphs finds applications in many areas. Motivated by recent developments, this paper studies the concept of spectrum folding (aliasing) for graph signals under the downsample-then-upsample operation. In this development, we use a special eigenvector structure that is unique to the adjacency matrix of M-block cyclic matrices. We then introduce M-channel maximally decimated filter banks. Manipulating the characteristics of the aliasing effect, we construct polynomial filter banks with perfect reconstruction property. Later we describe how we can remove the eigenvector condition by using a generalized decimator. In this study graphs are assumed to be general with a possibly non-symmetric and complex adjacency matrix.","PeriodicalId":165321,"journal":{"name":"2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"80 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-03-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128216746","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Filterbank learning using Convolutional Restricted Boltzmann Machine for speech recognition 使用卷积受限玻尔兹曼机进行语音识别的滤波器组学习
Hardik B. Sailor, H. Patil
Convolutional Restricted Boltzmann Machine (ConvRBM) as a model for speech signal is presented in this paper. We have developed ConvRBM with sampling from noisy rectified linear units (NReLUs). ConvRBM is trained in an unsupervised way to model speech signal of arbitrary lengths. Weights of the model can represent an auditory-like filterbank. Our proposed learned filterbank is also nonlinear with respect to center frequencies of subband filters similar to standard filterbanks (such as Mel, Bark, ERB, etc.). We have used our proposed model as a front-end to learn features and applied to speech recognition task. Performance of ConvRBM features is improved compared to MFCC with relative improvement of 5% on TIMIT test set and 7% on WSJ0 database for both Nov'92 test sets using GMM-HMM systems. With DNN-HMM systems, we achieved relative improvement of 3% on TIMIT test set over MFCC and Mel filterbank (FBANK). On WSJ0 Nov'92 test sets, we achieved relative improvement of 4-14% using ConvRBM features over MFCC features and 3.6-5.6% using ConvRBM filterbank over FBANK features.
本文提出了卷积受限玻尔兹曼机(ConvRBM)作为语音信号的模型。我们开发了从噪声整流线性单元(NReLUs)采样的卷积rbm。采用无监督的方法训练卷积rbm,对任意长度的语音信号进行建模。模型的权重可以表示一个类似听觉的滤波器组。我们提出的学习滤波器组对于子带滤波器的中心频率也是非线性的,类似于标准滤波器组(如Mel, Bark, ERB等)。我们将所提出的模型作为学习特征的前端,并应用于语音识别任务。与MFCC相比,使用GMM-HMM系统的ConvRBM特征的性能得到了提高,在TIMIT测试集上的相对提高了5%,在WSJ0数据库上的相对提高了7%。对于DNN-HMM系统,我们在TIMIT测试集上比MFCC和Mel滤波器组(FBANK)实现了3%的相对改进。在WSJ0 11月92日的测试集上,我们使用ConvRBM特征比MFCC特征获得了4-14%的相对改进,使用ConvRBM滤波器组比FBANK特征获得了3.6-5.6%的相对改进。
{"title":"Filterbank learning using Convolutional Restricted Boltzmann Machine for speech recognition","authors":"Hardik B. Sailor, H. Patil","doi":"10.1109/ICASSP.2016.7472808","DOIUrl":"https://doi.org/10.1109/ICASSP.2016.7472808","url":null,"abstract":"Convolutional Restricted Boltzmann Machine (ConvRBM) as a model for speech signal is presented in this paper. We have developed ConvRBM with sampling from noisy rectified linear units (NReLUs). ConvRBM is trained in an unsupervised way to model speech signal of arbitrary lengths. Weights of the model can represent an auditory-like filterbank. Our proposed learned filterbank is also nonlinear with respect to center frequencies of subband filters similar to standard filterbanks (such as Mel, Bark, ERB, etc.). We have used our proposed model as a front-end to learn features and applied to speech recognition task. Performance of ConvRBM features is improved compared to MFCC with relative improvement of 5% on TIMIT test set and 7% on WSJ0 database for both Nov'92 test sets using GMM-HMM systems. With DNN-HMM systems, we achieved relative improvement of 3% on TIMIT test set over MFCC and Mel filterbank (FBANK). On WSJ0 Nov'92 test sets, we achieved relative improvement of 4-14% using ConvRBM features over MFCC features and 3.6-5.6% using ConvRBM filterbank over FBANK features.","PeriodicalId":165321,"journal":{"name":"2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-03-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121111659","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 28
The recursive hessian sketch for adaptive filtering 自适应滤波的递归hessian草图
Robin Scheibler, M. Vetterli
We introduce in this paper the recursive Hessian sketch, a new adaptive filtering algorithm based on sketching the same exponentially weighted least squares problem solved by the recursive least squares algorithm. The algorithm maintains a number of sketches of the inverse autocorrelation matrix and recursively updates them at random intervals. These are in turn used to update the unknown filter estimate. The complexity of the proposed algorithm compares favorably to that of recursive least squares. The convergence properties of this algorithm are studied through extensive numerical experiments. With an appropriate choice or parameters, its convergence speed falls between that of least mean squares and recursive least squares adaptive filters, with less computations than the latter.
递归Hessian sketch是一种新的自适应滤波算法,它基于对递归最小二乘算法解决的指数加权最小二乘问题进行速写。该算法保留了一些逆自相关矩阵的草图,并以随机间隔递归地更新它们。这些依次用于更新未知的过滤器估计。该算法的复杂度优于递推最小二乘算法。通过大量的数值实验研究了该算法的收敛性。在适当选择参数的情况下,其收敛速度介于最小均二乘和递推最小二乘自适应滤波器之间,计算量小于递推最小二乘自适应滤波器。
{"title":"The recursive hessian sketch for adaptive filtering","authors":"Robin Scheibler, M. Vetterli","doi":"10.1109/ICASSP.2016.7471659","DOIUrl":"https://doi.org/10.1109/ICASSP.2016.7471659","url":null,"abstract":"We introduce in this paper the recursive Hessian sketch, a new adaptive filtering algorithm based on sketching the same exponentially weighted least squares problem solved by the recursive least squares algorithm. The algorithm maintains a number of sketches of the inverse autocorrelation matrix and recursively updates them at random intervals. These are in turn used to update the unknown filter estimate. The complexity of the proposed algorithm compares favorably to that of recursive least squares. The convergence properties of this algorithm are studied through extensive numerical experiments. With an appropriate choice or parameters, its convergence speed falls between that of least mean squares and recursive least squares adaptive filters, with less computations than the latter.","PeriodicalId":165321,"journal":{"name":"2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"74 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-03-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126160470","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A largest matching area approach to image denoising 一种最大匹配面积图像去噪方法
Jack Gaston, J. Ming, D. Crookes
Given the success of patch-based approaches to image denoising, this paper addresses the ill-posed problem of patch size selection. Large patch sizes improve noise robustness in the presence of good matches, but can also lead to artefacts in textured regions due to the rare patch effect; smaller patch sizes reconstruct details more accurately but risk over-fitting to the noise in uniform regions. We propose to jointly optimize each matching patch's identity and size for grayscale image denoising, and present several implementations. The new approach effectively selects the largest matching areas, subject to the constraints of the available data and noise level, to improve noise robustness. Experiments on standard test images demonstrate our approach's ability to improve on fixed-size reconstruction, particularly at high noise levels, on smoother image regions.
鉴于基于补丁的图像去噪方法的成功,本文解决了补丁大小选择的不适定问题。在存在良好匹配的情况下,大的斑块大小可以提高噪声的鲁棒性,但由于罕见的斑块效应,也可能导致纹理区域出现伪影;较小的补丁尺寸更准确地重建细节,但有可能过度拟合均匀区域的噪声。针对灰度图像去噪问题,提出了联合优化每个匹配patch的身份和大小,并给出了几种实现方法。该方法在不受可用数据和噪声水平限制的情况下,有效地选择最大的匹配区域,提高了噪声的鲁棒性。在标准测试图像上的实验表明,我们的方法能够在更平滑的图像区域上改进固定大小的重建,特别是在高噪声水平下。
{"title":"A largest matching area approach to image denoising","authors":"Jack Gaston, J. Ming, D. Crookes","doi":"10.1109/ICASSP.2016.7471865","DOIUrl":"https://doi.org/10.1109/ICASSP.2016.7471865","url":null,"abstract":"Given the success of patch-based approaches to image denoising, this paper addresses the ill-posed problem of patch size selection. Large patch sizes improve noise robustness in the presence of good matches, but can also lead to artefacts in textured regions due to the rare patch effect; smaller patch sizes reconstruct details more accurately but risk over-fitting to the noise in uniform regions. We propose to jointly optimize each matching patch's identity and size for grayscale image denoising, and present several implementations. The new approach effectively selects the largest matching areas, subject to the constraints of the available data and noise level, to improve noise robustness. Experiments on standard test images demonstrate our approach's ability to improve on fixed-size reconstruction, particularly at high noise levels, on smoother image regions.","PeriodicalId":165321,"journal":{"name":"2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"158 5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128868046","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Delay estimation between EEG and EMG via coherence with time lag 基于相干时滞的脑电与肌电延迟估计
Yuhang Xu, V. McClelland, Z. Cvetković, K. Mills
The traditional way to estimate the time delay between the motor cortex and the periphery is based on the estimation of the slope of the phase of the cross spectral density between motor cortex electroencephalogram (EEG) and electromyography (EMG) signals recorded synchronously during a motor control task. There are several issues that could make the delay estimation using this method subject to errors, leading frequently to estimates which are in disagreement with underlying physiology. This study introduces cortico-muscular coherence with time lag (CMCTL) function and proposes a method for estimating the delay based on finding its local maxima. We further address the issue of the interpretation of such time delay in multi-path propagation systems. Delay estimates obtained using the proposed method are more consistent compared with results obtained using the phase method and in a better agreement with physiological facts.
估计运动皮层与外周之间的时间延迟的传统方法是基于估计运动控制任务中同步记录的运动皮层脑电图(EEG)和肌电图(EMG)信号之间交叉谱密度的相位斜率。有几个问题可能使使用这种方法的延迟估计受到误差的影响,导致经常与潜在生理学不一致的估计。本文引入了带时滞的皮质肌相干(CMCTL)函数,提出了一种基于局部最大值估计延迟的方法。我们进一步解决了多路径传播系统中这种时间延迟的解释问题。用该方法得到的延迟估计与用相位法得到的结果更一致,更符合生理事实。
{"title":"Delay estimation between EEG and EMG via coherence with time lag","authors":"Yuhang Xu, V. McClelland, Z. Cvetković, K. Mills","doi":"10.1109/ICASSP.2016.7471772","DOIUrl":"https://doi.org/10.1109/ICASSP.2016.7471772","url":null,"abstract":"The traditional way to estimate the time delay between the motor cortex and the periphery is based on the estimation of the slope of the phase of the cross spectral density between motor cortex electroencephalogram (EEG) and electromyography (EMG) signals recorded synchronously during a motor control task. There are several issues that could make the delay estimation using this method subject to errors, leading frequently to estimates which are in disagreement with underlying physiology. This study introduces cortico-muscular coherence with time lag (CMCTL) function and proposes a method for estimating the delay based on finding its local maxima. We further address the issue of the interpretation of such time delay in multi-path propagation systems. Delay estimates obtained using the proposed method are more consistent compared with results obtained using the phase method and in a better agreement with physiological facts.","PeriodicalId":165321,"journal":{"name":"2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131197532","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Compact kernel models for acoustic modeling via random feature selection 基于随机特征选择的声学建模的紧凑核模型
Avner May, Michael Collins, Daniel J. Hsu, Brian Kingsbury
A simple but effective method is proposed for learning compact random feature models that approximate non-linear kernel methods, in the context of acoustic modeling. The method is able to explore a large number of non-linear features while maintaining a compact model via feature selection more efficiently than existing approaches. For certain kernels, this random feature selection may be regarded as a means of non-linear feature selection at the level of the raw input features, which motivates additional methods for computational improvements. An empirical evaluation demonstrates the effectiveness of the proposed method relative to the natural baseline method for kernel approximation.
在声学建模的背景下,提出了一种简单而有效的方法来学习近似非线性核方法的紧凑随机特征模型。该方法能够探索大量的非线性特征,同时通过特征选择保持一个紧凑的模型,比现有方法更有效。对于某些核,这种随机特征选择可以看作是原始输入特征级别的非线性特征选择的一种手段,这激发了其他计算改进的方法。经验评价表明,相对于核近似的自然基线方法,所提出的方法是有效的。
{"title":"Compact kernel models for acoustic modeling via random feature selection","authors":"Avner May, Michael Collins, Daniel J. Hsu, Brian Kingsbury","doi":"10.1109/ICASSP.2016.7472112","DOIUrl":"https://doi.org/10.1109/ICASSP.2016.7472112","url":null,"abstract":"A simple but effective method is proposed for learning compact random feature models that approximate non-linear kernel methods, in the context of acoustic modeling. The method is able to explore a large number of non-linear features while maintaining a compact model via feature selection more efficiently than existing approaches. For certain kernels, this random feature selection may be regarded as a means of non-linear feature selection at the level of the raw input features, which motivates additional methods for computational improvements. An empirical evaluation demonstrates the effectiveness of the proposed method relative to the natural baseline method for kernel approximation.","PeriodicalId":165321,"journal":{"name":"2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"118 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-03-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115854983","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
期刊
2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1