首页 > 最新文献

2014 22nd European Signal Processing Conference (EUSIPCO)最新文献

英文 中文
An improved chirp group delay based algorithm for estimating the vocal tract response 基于改进啁啾群延迟的声道响应估计算法
Pub Date : 2014-11-13 DOI: 10.5281/ZENODO.54522
M. Jayesh, C. S. Ramalingam
We propose a method for vocal tract estimation that is better than Bozkurt's chirp group delay method [1] and its zero-phase variant [2]. The chirp group delay method works only for voiced speech, is critically dependent on finding the glottal closure instants (GCI), deteriorates in performance when more than two pitch cycles are included for analysis, and does not work for unvoiced speech. The zero-phase variant eliminates these drawbacks but works poorly for nasal sounds. In our proposed method all outside-unit-circle zeros are reflected inside before computing the chirp group delay. The advantages are: (a) GCI knowledge not required, (b) the vocal tract estimate is far less sensitive to the location and duration of the analysis window, (c) works for unvoiced sounds, and (d) captures the spectral valleys well for nasals, which in turn leads to better recognition accuracy.
我们提出了一种优于Bozkurt啁啾群延迟法[1]及其零相位变体[2]的声道估计方法。啁啾群延迟方法仅适用于发声语音,严重依赖于找到声门关闭瞬间(GCI),当包括两个以上的音高周期进行分析时,性能会恶化,并且不适用于非发声语音。零相位变体消除了这些缺点,但对于鼻音效果不佳。在我们提出的方法中,在计算啁啾群延迟之前,所有单位圆外的零都在内部反射。其优点是:(a)不需要GCI知识,(b)声道估计对分析窗口的位置和持续时间的敏感性要低得多,(c)适用于不发音的声音,(d)可以很好地捕获鼻音的频谱谷,从而提高识别精度。
{"title":"An improved chirp group delay based algorithm for estimating the vocal tract response","authors":"M. Jayesh, C. S. Ramalingam","doi":"10.5281/ZENODO.54522","DOIUrl":"https://doi.org/10.5281/ZENODO.54522","url":null,"abstract":"We propose a method for vocal tract estimation that is better than Bozkurt's chirp group delay method [1] and its zero-phase variant [2]. The chirp group delay method works only for voiced speech, is critically dependent on finding the glottal closure instants (GCI), deteriorates in performance when more than two pitch cycles are included for analysis, and does not work for unvoiced speech. The zero-phase variant eliminates these drawbacks but works poorly for nasal sounds. In our proposed method all outside-unit-circle zeros are reflected inside before computing the chirp group delay. The advantages are: (a) GCI knowledge not required, (b) the vocal tract estimate is far less sensitive to the location and duration of the analysis window, (c) works for unvoiced sounds, and (d) captures the spectral valleys well for nasals, which in turn leads to better recognition accuracy.","PeriodicalId":198408,"journal":{"name":"2014 22nd European Signal Processing Conference (EUSIPCO)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114069042","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Joint sic and multi-relay selection algorithms for cooperative DS-CDMA systems 协同DS-CDMA系统的联合sic和多中继选择算法
Pub Date : 2014-11-13 DOI: 10.5281/ZENODO.43825
Jiaqi Gu, R. D. Lamare
In this work, we propose a cross-layer design strategy based on a joint successive interference cancellation (SIC) detection technique and a multi-relay selection algorithm for the uplink of cooperative direct-sequence code-division multiple access (DS-CDMA) systems. We devise a low-cost greedy list-based SIC (GL-SIC) strategy with RAKE receivers as the front-end that can approach the maximum likelihood detector performance. We also present a low-complexity multi-relay selection algorithm based on greedy techniques that can approach the performance of an exhaustive search. Simulations show an excellent bit error rate performance of the proposed detection and relay selection algorithms as compared to existing techniques.
在这项工作中,我们提出了一种基于联合连续干扰消除(SIC)检测技术和多中继选择算法的跨层设计策略,用于合作直接顺序码分多址(DS-CDMA)系统的上行链路。我们设计了一种低成本的基于贪婪列表的SIC (GL-SIC)策略,以RAKE接收器作为前端,可以接近最大似然检测器的性能。我们还提出了一种基于贪婪技术的低复杂度多中继选择算法,该算法可以接近穷举搜索的性能。仿真结果表明,与现有技术相比,所提出的检测和中继选择算法具有较好的误码率性能。
{"title":"Joint sic and multi-relay selection algorithms for cooperative DS-CDMA systems","authors":"Jiaqi Gu, R. D. Lamare","doi":"10.5281/ZENODO.43825","DOIUrl":"https://doi.org/10.5281/ZENODO.43825","url":null,"abstract":"In this work, we propose a cross-layer design strategy based on a joint successive interference cancellation (SIC) detection technique and a multi-relay selection algorithm for the uplink of cooperative direct-sequence code-division multiple access (DS-CDMA) systems. We devise a low-cost greedy list-based SIC (GL-SIC) strategy with RAKE receivers as the front-end that can approach the maximum likelihood detector performance. We also present a low-complexity multi-relay selection algorithm based on greedy techniques that can approach the performance of an exhaustive search. Simulations show an excellent bit error rate performance of the proposed detection and relay selection algorithms as compared to existing techniques.","PeriodicalId":198408,"journal":{"name":"2014 22nd European Signal Processing Conference (EUSIPCO)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125985433","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Automatic WH-based edge detector in Weibull clutter 威布尔杂波中基于wh的自动边缘检测
Pub Date : 2014-11-13 DOI: 10.5281/ZENODO.43834
Souad Chabbi, T. Laroussi, A. Mezache
Assuming a non-stationary Weibull background with no prior knowledge about the presence or not of a clutter edge, we propose and analyze the censoring and detection performances of the automatic censoring Weber-Haykin constant false censoring and alarm rates (ACWH-CFCAR) detector in homogeneous clutter and in the presence of a clutter edge within the reference window. The cfcarness property is assured by use of the Weber-Haykin (WH) adaptive thresholding which bypasses the estimation of the distribution parameters. The censoring algorithm starts up by considering the two most left ranked cells and proceeds forward. The selected homogeneous set is used to estimate the unknown background level. Extensive Monte Carlo simulations show that the performances of the proposed detector are similar to those exhibited by the corresponding fixed-point censoring WH-CFAR detector.
假设在不知道杂波边缘存在与否的非平稳威布尔背景下,提出并分析了在均匀杂波和参考窗内杂波边缘存在的情况下自动滤波Weber-Haykin常数误检报警率检测器(acwhc - cfcar)的滤波和检测性能。采用webber - haykin (WH)自适应阈值法,避免了对分布参数的估计,从而保证了系统的可靠性。审查算法首先考虑两个排名最左的细胞,然后继续进行。选取的齐次集用于估计未知背景水平。大量的蒙特卡罗模拟表明,所提出的探测器的性能与相应的定点滤波WH-CFAR探测器的性能相似。
{"title":"Automatic WH-based edge detector in Weibull clutter","authors":"Souad Chabbi, T. Laroussi, A. Mezache","doi":"10.5281/ZENODO.43834","DOIUrl":"https://doi.org/10.5281/ZENODO.43834","url":null,"abstract":"Assuming a non-stationary Weibull background with no prior knowledge about the presence or not of a clutter edge, we propose and analyze the censoring and detection performances of the automatic censoring Weber-Haykin constant false censoring and alarm rates (ACWH-CFCAR) detector in homogeneous clutter and in the presence of a clutter edge within the reference window. The cfcarness property is assured by use of the Weber-Haykin (WH) adaptive thresholding which bypasses the estimation of the distribution parameters. The censoring algorithm starts up by considering the two most left ranked cells and proceeds forward. The selected homogeneous set is used to estimate the unknown background level. Extensive Monte Carlo simulations show that the performances of the proposed detector are similar to those exhibited by the corresponding fixed-point censoring WH-CFAR detector.","PeriodicalId":198408,"journal":{"name":"2014 22nd European Signal Processing Conference (EUSIPCO)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126617602","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An empirical eigenvalue-threshold test for sparsity level estimation from compressed measurements 压缩测量稀疏度估计的经验特征值阈值检验
Pub Date : 2014-11-13 DOI: 10.5281/ZENODO.44108
A. Lavrenko, F. Roemer, G. D. Galdo, R. Thomä, O. Arikan
Compressed sensing allows for a significant reduction of the number of measurements when the signal of interest is of a sparse nature. Most computationally efficient algorithms for signal recovery rely on some knowledge of the sparsity level, i.e., the number of non-zero elements. However, the sparsity level is often not known a priori and can even vary with time. In this contribution we show that it is possible to estimate the sparsity level directly in the compressed domain, provided that multiple independent observations are available. In fact, one can use classical model order selection algorithms for this purpose. Nevertheless, due to the influence of the measurement process they may not perform satisfactorily in the compressed sensing setup. To overcome this drawback, we propose an approach which exploits the empirical distributions of the noise eigenvalues. We demonstrate its superior performance compared to state-of-the-art model order estimation algorithms numerically.
当感兴趣的信号具有稀疏性质时,压缩感知允许显著减少测量次数。大多数计算效率高的信号恢复算法依赖于对稀疏度水平的一些了解,即非零元素的数量。然而,稀疏度级别通常不是先验的,甚至可能随时间而变化。在这一贡献中,我们表明,只要有多个独立的观测值可用,就可以直接在压缩域中估计稀疏度水平。事实上,我们可以使用经典的模型顺序选择算法来实现这个目的。然而,由于测量过程的影响,它们在压缩感知设置中可能无法令人满意地执行。为了克服这一缺点,我们提出了一种利用噪声特征值的经验分布的方法。我们在数值上证明了与最先进的模型阶估计算法相比,其优越的性能。
{"title":"An empirical eigenvalue-threshold test for sparsity level estimation from compressed measurements","authors":"A. Lavrenko, F. Roemer, G. D. Galdo, R. Thomä, O. Arikan","doi":"10.5281/ZENODO.44108","DOIUrl":"https://doi.org/10.5281/ZENODO.44108","url":null,"abstract":"Compressed sensing allows for a significant reduction of the number of measurements when the signal of interest is of a sparse nature. Most computationally efficient algorithms for signal recovery rely on some knowledge of the sparsity level, i.e., the number of non-zero elements. However, the sparsity level is often not known a priori and can even vary with time. In this contribution we show that it is possible to estimate the sparsity level directly in the compressed domain, provided that multiple independent observations are available. In fact, one can use classical model order selection algorithms for this purpose. Nevertheless, due to the influence of the measurement process they may not perform satisfactorily in the compressed sensing setup. To overcome this drawback, we propose an approach which exploits the empirical distributions of the noise eigenvalues. We demonstrate its superior performance compared to state-of-the-art model order estimation algorithms numerically.","PeriodicalId":198408,"journal":{"name":"2014 22nd European Signal Processing Conference (EUSIPCO)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129111742","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Relevance of Dirichlet process mixtures for modeling interferences in underlay cognitive radio 狄利克雷过程混合物与底层认知无线电建模干扰的相关性
Pub Date : 2014-11-13 DOI: 10.5281/ZENODO.44128
V. Pereira, G. Ferré, A. Giremus, É. Grivel
In the field of underlay cognitive radio communications, the signal transmitted by the secondary user is disturbed by incoming signals from primary users. Thus, it is necessary to compensate for this secondary-link degradation at the receiver level. In this paper we use Dirichlet process mixtures (DPM) to relax a priori assumptions on the characteristics of the primary user-induced interference. DPM allow us to model the probability density function of the interference. The latter is estimated jointly with the symbols and the channel of the secondary link by using marginalized particle filtering. Our approach makes it possible to improve the symbol error rate compared with an algorithm that simply models the interference as a Gaussian noise.
在底层认知无线电通信领域中,次要用户发送的信号会受到主要用户输入信号的干扰。因此,有必要在接收端级别补偿这种次级链路的退化。本文利用狄利克雷过程混合(DPM)放宽了对原始用户诱导干扰特性的先验假设。DPM允许我们对干扰的概率密度函数进行建模。利用边缘粒子滤波的方法,结合二次链路的符号和信道进行估计。与简单地将干扰建模为高斯噪声的算法相比,我们的方法可以提高符号错误率。
{"title":"Relevance of Dirichlet process mixtures for modeling interferences in underlay cognitive radio","authors":"V. Pereira, G. Ferré, A. Giremus, É. Grivel","doi":"10.5281/ZENODO.44128","DOIUrl":"https://doi.org/10.5281/ZENODO.44128","url":null,"abstract":"In the field of underlay cognitive radio communications, the signal transmitted by the secondary user is disturbed by incoming signals from primary users. Thus, it is necessary to compensate for this secondary-link degradation at the receiver level. In this paper we use Dirichlet process mixtures (DPM) to relax a priori assumptions on the characteristics of the primary user-induced interference. DPM allow us to model the probability density function of the interference. The latter is estimated jointly with the symbols and the channel of the secondary link by using marginalized particle filtering. Our approach makes it possible to improve the symbol error rate compared with an algorithm that simply models the interference as a Gaussian noise.","PeriodicalId":198408,"journal":{"name":"2014 22nd European Signal Processing Conference (EUSIPCO)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127009163","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Direction-of-arrival estimation using multi-frequency co-prime arrays 基于多频共素数阵列的到达方向估计
Pub Date : 2014-11-13 DOI: 10.5281/ZENODO.43952
Elie BouDaher, Yong Jia, F. Ahmad, M. Amin
In this paper, we present a new method for increasing the number of resolvable sources in direction-of-arrival estimation using co-prime arrays. This is achieved by utilizing multiple frequencies to fill in the missing elements in the difference coarray of the co-prime array corresponding to the reference frequency. For high signal-to-noise ratio, the multi-frequency approach effectively utilizes all of the degrees-of-freedom offered by the coarray, provided that the sources have proportional spectra. The performance of the proposed method is evaluated through numerical simulations.
本文提出了一种利用共素数阵列增加到达方向估计中可分辨源数目的新方法。这是通过利用多个频率来填补与参考频率相对应的协素数阵列的差阵中的缺失元素来实现的。对于高信噪比,多频方法有效地利用了同轴阵列提供的所有自由度,只要源具有比例谱。通过数值模拟对该方法的性能进行了评价。
{"title":"Direction-of-arrival estimation using multi-frequency co-prime arrays","authors":"Elie BouDaher, Yong Jia, F. Ahmad, M. Amin","doi":"10.5281/ZENODO.43952","DOIUrl":"https://doi.org/10.5281/ZENODO.43952","url":null,"abstract":"In this paper, we present a new method for increasing the number of resolvable sources in direction-of-arrival estimation using co-prime arrays. This is achieved by utilizing multiple frequencies to fill in the missing elements in the difference coarray of the co-prime array corresponding to the reference frequency. For high signal-to-noise ratio, the multi-frequency approach effectively utilizes all of the degrees-of-freedom offered by the coarray, provided that the sources have proportional spectra. The performance of the proposed method is evaluated through numerical simulations.","PeriodicalId":198408,"journal":{"name":"2014 22nd European Signal Processing Conference (EUSIPCO)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116985530","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 24
Advanced interference reduction in NC-OFDM based Cognitive Radio with Cancellation Carriers 基于消去载波的NC-OFDM认知无线电的高级抗干扰技术
Pub Date : 2014-11-13 DOI: 10.5281/ZENODO.43848
P. Kryszkiewicz, H. Bogucka
Reduction of the out-of-band (OOB) emission is essential for Cognitive Radio (CR) systems to enable coexistence with licensed (primary) systems operating in the adjacent frequency bands. This paper proposes an algorithm for the Non Contiguous Orthogonal Frequency Division Multiplexing (NC-OFDM)-based CR, to reduce the interference caused by both OOB radiation and by non-ideal frequency selectivity of a primary user (PU) receiver. It is based on a concept to use a set of subcarriers called Cancellation Carriers (CCs). By being aware of the PU's carrier frequency, the observed interference power can by decreased by about 10 dB in comparison with the standard OOB-power minimizing algorithms.
减少带外(OOB)发射对于认知无线电(CR)系统来说至关重要,以使其能够与相邻频带中运行的许可(主)系统共存。本文提出了一种基于非连续正交频分复用(NC-OFDM)的CR算法,以减少OOB辐射和主用户(PU)接收机的非理想频率选择性所造成的干扰。它基于使用一组称为取消载波(cc)的子载波的概念。通过了解PU的载波频率,与标准的oob功率最小化算法相比,观察到的干扰功率可以降低约10 dB。
{"title":"Advanced interference reduction in NC-OFDM based Cognitive Radio with Cancellation Carriers","authors":"P. Kryszkiewicz, H. Bogucka","doi":"10.5281/ZENODO.43848","DOIUrl":"https://doi.org/10.5281/ZENODO.43848","url":null,"abstract":"Reduction of the out-of-band (OOB) emission is essential for Cognitive Radio (CR) systems to enable coexistence with licensed (primary) systems operating in the adjacent frequency bands. This paper proposes an algorithm for the Non Contiguous Orthogonal Frequency Division Multiplexing (NC-OFDM)-based CR, to reduce the interference caused by both OOB radiation and by non-ideal frequency selectivity of a primary user (PU) receiver. It is based on a concept to use a set of subcarriers called Cancellation Carriers (CCs). By being aware of the PU's carrier frequency, the observed interference power can by decreased by about 10 dB in comparison with the standard OOB-power minimizing algorithms.","PeriodicalId":198408,"journal":{"name":"2014 22nd European Signal Processing Conference (EUSIPCO)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132218872","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Parameter estimation in Bayesian Blind Deconvolution with super Gaussian image priors 超高斯图像先验贝叶斯盲反卷积参数估计
Pub Date : 2014-11-13 DOI: 10.5281/ZENODO.43886
M. Vega, R. Molina, A. Katsaggelos
Super Gaussian (SG) distributions have proven to be very powerful prior models to induce sparsity in Bayesian Blind Deconvolution (BD) problems. Their conjugate based representations make them specially attractive when Variational Bayes (VB) inference is used since their variational parameters can be calculated in closed form with the sole knowledge of the energy function of the prior model. In this work we show how the introduction in the SG distribution of a global strength (not necessary scale) parameter can be used to improve the quality of the obtained restorations as well as to introduce additional information on the global weight of the prior. A model to estimate the new unknown parameter within the Bayesian framework is provided. Experimental results, on both synthetic and real images, demonstrate the effectiveness of the proposed approach.
在贝叶斯盲反卷积(BD)问题中,超高斯(SG)分布已被证明是非常强大的先验模型。当使用变分贝叶斯(VB)推理时,它们的共轭表示使它们特别有吸引力,因为它们的变分参数可以用先验模型的能量函数的唯一知识以封闭形式计算。在这项工作中,我们展示了如何在SG分布中引入全局强度(非必要尺度)参数来提高获得的恢复质量,以及引入关于先验全局权重的附加信息。给出了一个在贝叶斯框架下估计新的未知参数的模型。在合成图像和真实图像上的实验结果都证明了该方法的有效性。
{"title":"Parameter estimation in Bayesian Blind Deconvolution with super Gaussian image priors","authors":"M. Vega, R. Molina, A. Katsaggelos","doi":"10.5281/ZENODO.43886","DOIUrl":"https://doi.org/10.5281/ZENODO.43886","url":null,"abstract":"Super Gaussian (SG) distributions have proven to be very powerful prior models to induce sparsity in Bayesian Blind Deconvolution (BD) problems. Their conjugate based representations make them specially attractive when Variational Bayes (VB) inference is used since their variational parameters can be calculated in closed form with the sole knowledge of the energy function of the prior model. In this work we show how the introduction in the SG distribution of a global strength (not necessary scale) parameter can be used to improve the quality of the obtained restorations as well as to introduce additional information on the global weight of the prior. A model to estimate the new unknown parameter within the Bayesian framework is provided. Experimental results, on both synthetic and real images, demonstrate the effectiveness of the proposed approach.","PeriodicalId":198408,"journal":{"name":"2014 22nd European Signal Processing Conference (EUSIPCO)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134470351","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Binaural localization of speech sources in the median plane using cepstral hrtf extraction 用倒谱hrtf提取中位面语音源的双耳定位
Pub Date : 2014-11-13 DOI: 10.5281/ZENODO.44021
Dumidu S. Talagala, Xiang Wu, Wen Zhang, T. Abhayapala
In binaural systems, source localization in the median plane is challenging due to the difficulty of exploring the spectral cues of the head-related transfer function (HRTF) independently of the source spectra. This paper presents a method of extracting the HRTF spectral cues using cepstral analysis for speech source localization in the median plane. Binaural signals are preprocessed in the cepstral domain so that the fine spectral structure of speech and the HRTF spectral envelope can be easily separated. We introduce (i) a truncated cepstral transformation to extract the relevant localization cues, and (ii) a mechanism to normalize the effects of the time varying speech spectra. The proposed method is evaluated and compared with a convolution based localization method using a speech corpus of multiple speakers. The results suggest that the proposed method fully exploits the available spectral cues for robust speaker independent binaural source localization in the median plane.
在双耳系统中,由于难以独立于源光谱探索头部相关传递函数(HRTF)的光谱线索,因此在中位面定位源具有挑战性。本文提出了一种利用倒谱分析提取HRTF频谱线索的方法,用于中位面声源定位。在倒谱域对双耳信号进行预处理,使语音的精细频谱结构和HRTF频谱包络容易分离。我们引入了(i)截断倒谱变换来提取相关的定位线索,以及(ii)一种机制来标准化时变语音频谱的影响。利用多说话人的语音语料库对该方法进行了评价,并与基于卷积的定位方法进行了比较。结果表明,该方法充分利用了可用的频谱线索,实现了与说话人无关的双耳源在中位面上的鲁棒定位。
{"title":"Binaural localization of speech sources in the median plane using cepstral hrtf extraction","authors":"Dumidu S. Talagala, Xiang Wu, Wen Zhang, T. Abhayapala","doi":"10.5281/ZENODO.44021","DOIUrl":"https://doi.org/10.5281/ZENODO.44021","url":null,"abstract":"In binaural systems, source localization in the median plane is challenging due to the difficulty of exploring the spectral cues of the head-related transfer function (HRTF) independently of the source spectra. This paper presents a method of extracting the HRTF spectral cues using cepstral analysis for speech source localization in the median plane. Binaural signals are preprocessed in the cepstral domain so that the fine spectral structure of speech and the HRTF spectral envelope can be easily separated. We introduce (i) a truncated cepstral transformation to extract the relevant localization cues, and (ii) a mechanism to normalize the effects of the time varying speech spectra. The proposed method is evaluated and compared with a convolution based localization method using a speech corpus of multiple speakers. The results suggest that the proposed method fully exploits the available spectral cues for robust speaker independent binaural source localization in the median plane.","PeriodicalId":198408,"journal":{"name":"2014 22nd European Signal Processing Conference (EUSIPCO)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131946007","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
A homography-based CDVS pipeline for image matchingwith improved resilience to viewpoint changes 基于同形图的cdv图像匹配管道,提高了视点变化的弹性
Pub Date : 2014-11-13 DOI: 10.5281/ZENODO.44189
Biao Zhao, E. Magli
Compact Descriptors for Visual Search (CDVS) is MPEG proposed standard that will enable efficient and interoperable design of visual search applications using local descriptors. Such descriptors are invariant to rotation and scaling, but are not very robust towards viewpoint changes. In this paper, we address this problem and propose a modified version of the CDVS pipeline that employs image back-projection to compensate for perspective distortion. The proposed technique is based on the homography derived from the correspondence extracted from pairs of matching keypoints. Extensive results show that it improves the CDVS matching accuracy under viewpoint changes while having low complexity.
紧凑的描述符对视觉搜索(CDVS)是MPEG建议标准,使效率和可互操作的设计,使用当地的视觉搜索应用程序描述符。这样的描述符对旋转和缩放是不变的,但对视点变化不是很健壮。在本文中,我们解决了这个问题,并提出了一种改进版本的cdv管道,该管道使用图像反向投影来补偿透视失真。该技术基于从匹配关键点对中提取的对应关系衍生出的单应性。大量的实验结果表明,该方法提高了视点变化下的cdv匹配精度,同时具有较低的复杂度。
{"title":"A homography-based CDVS pipeline for image matchingwith improved resilience to viewpoint changes","authors":"Biao Zhao, E. Magli","doi":"10.5281/ZENODO.44189","DOIUrl":"https://doi.org/10.5281/ZENODO.44189","url":null,"abstract":"Compact Descriptors for Visual Search (CDVS) is MPEG proposed standard that will enable efficient and interoperable design of visual search applications using local descriptors. Such descriptors are invariant to rotation and scaling, but are not very robust towards viewpoint changes. In this paper, we address this problem and propose a modified version of the CDVS pipeline that employs image back-projection to compensate for perspective distortion. The proposed technique is based on the homography derived from the correspondence extracted from pairs of matching keypoints. Extensive results show that it improves the CDVS matching accuracy under viewpoint changes while having low complexity.","PeriodicalId":198408,"journal":{"name":"2014 22nd European Signal Processing Conference (EUSIPCO)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2014-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123022221","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
2014 22nd European Signal Processing Conference (EUSIPCO)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1