首页 > 最新文献

2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100)最新文献

英文 中文
Effective speaker adaptations for speaker verification 有效的说话人适应说话人验证
Sungjoo Ahn, Sunmee Kang, Hanseok Ko
This paper concerns effective speaker adaptation methods to solve the over-training problem in speaker verification, which frequently occurs when modeling a speaker with sparse training data. While various speaker adaptations have already been applied to speech recognition, these methods have not yet been formally considered in speaker verification. This paper proposes speaker adaptation methods using a combination of maximum a posteriori (MAP) and maximum likelihood linear regression (MLLR) adaptations, which are successfully used in speech recognition, and applies to speaker verification. Our aim is to remedy the small training data problem by investigating effective speaker adaptations for speaker modeling. Experimental results show that the speaker verification system using a weighted MAP and MLLR adaptation outperforms that of the conventional speaker models without adaptation by a factor of up to 5 times. From these results, we show that the speaker adaptation method achieves significantly better performance even when only small training data is available for speaker verification.
本文研究了有效的说话人自适应方法,以解决使用稀疏训练数据建模说话人时经常出现的说话人验证中的过度训练问题。虽然各种说话人适应已经应用于语音识别,但这些方法尚未正式考虑在说话人验证。本文提出了最大后验(MAP)和最大似然线性回归(MLLR)相结合的说话人自适应方法,该方法已成功应用于语音识别,并应用于说话人验证。我们的目标是通过研究说话人对说话人建模的有效适应来纠正小的训练数据问题。实验结果表明,使用加权MAP和MLLR自适应的说话人验证系统比不自适应的传统说话人模型性能提高了5倍。从这些结果中,我们发现说话人自适应方法即使在只有少量训练数据可用于说话人验证时也能取得明显更好的性能。
{"title":"Effective speaker adaptations for speaker verification","authors":"Sungjoo Ahn, Sunmee Kang, Hanseok Ko","doi":"10.1109/ICASSP.2000.859151","DOIUrl":"https://doi.org/10.1109/ICASSP.2000.859151","url":null,"abstract":"This paper concerns effective speaker adaptation methods to solve the over-training problem in speaker verification, which frequently occurs when modeling a speaker with sparse training data. While various speaker adaptations have already been applied to speech recognition, these methods have not yet been formally considered in speaker verification. This paper proposes speaker adaptation methods using a combination of maximum a posteriori (MAP) and maximum likelihood linear regression (MLLR) adaptations, which are successfully used in speech recognition, and applies to speaker verification. Our aim is to remedy the small training data problem by investigating effective speaker adaptations for speaker modeling. Experimental results show that the speaker verification system using a weighted MAP and MLLR adaptation outperforms that of the conventional speaker models without adaptation by a factor of up to 5 times. From these results, we show that the speaker adaptation method achieves significantly better performance even when only small training data is available for speaker verification.","PeriodicalId":164817,"journal":{"name":"2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128792123","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Blind LPTV joint equalization and interference suppression 盲LPTV联合均衡与干扰抑制
G. Gelli, F. Verde
The problem of jointly equalizing a digital communication signal distorted by a linear time-invariant channel and rejecting co-channel or adjacent-channel digital interference is tackled. Owing to the presence of the interfering signal, the proposed optimum linear MMSE equalizer turns out to be periodically time-varying (LPTV). Moreover, new simple and effective blind channel identification procedures are presented, which can be applied as long as the desired and interfering signal exhibit different circularity and/or cyclo-stationarity properties. Simulation results confirm the effectiveness of the proposed techniques.
解决了由线性时不变信道畸变的数字通信信号联合均衡和抑制同信道或邻接信道数字干扰的问题。由于干扰信号的存在,本文提出的最佳线性MMSE均衡器为周期性时变均衡器(LPTV)。此外,提出了一种新的简单有效的盲信道识别方法,只要期望信号和干扰信号具有不同的圆度和/或循环平稳特性,该方法就可以应用。仿真结果验证了所提方法的有效性。
{"title":"Blind LPTV joint equalization and interference suppression","authors":"G. Gelli, F. Verde","doi":"10.1109/ICASSP.2000.861068","DOIUrl":"https://doi.org/10.1109/ICASSP.2000.861068","url":null,"abstract":"The problem of jointly equalizing a digital communication signal distorted by a linear time-invariant channel and rejecting co-channel or adjacent-channel digital interference is tackled. Owing to the presence of the interfering signal, the proposed optimum linear MMSE equalizer turns out to be periodically time-varying (LPTV). Moreover, new simple and effective blind channel identification procedures are presented, which can be applied as long as the desired and interfering signal exhibit different circularity and/or cyclo-stationarity properties. Simulation results confirm the effectiveness of the proposed techniques.","PeriodicalId":164817,"journal":{"name":"2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100)","volume":"155 6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128700980","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Nontrivial analytic signals with positive instantaneous frequency and band-limited amplitude 具有正瞬时频率和带限幅值的非平凡解析信号
M. Doroslovački
Questions have previously been raised about the existence of an analytic signal with positive instantaneous frequency when the form of the analytic signal is prescribed. Here, it is shown that the complex function a(t)exp[j(/spl omega//sub 0/t+m(t))] is an analytic signal when m(t) is a real periodic function and a(t) is a band-limited real function with the maximum bandwidth depending on /spl omega//sub 0/ and the fundamental frequency of m(t). That implies as a special case m(t) which is simultaneously a periodic and piecewise polynomial. Positivity of the instantaneous frequency is simply obtained by requiring that the absolute value of the first derivative of m(t) is smaller than /spl omega//sub 0/.
先前已经提出了当解析信号的形式被规定时,是否存在具有正瞬时频率的解析信号的问题。在这里,证明了复函数a(t)exp[j(/spl ω //下标0/t+m(t))]是解析信号,当m(t)是实周期函数,a(t)是带限实函数,其最大带宽取决于/spl ω //下标0/和m(t)的基频。这意味着作为一个特例m(t)它同时是一个周期分段多项式。通过要求m(t)的一阶导数的绝对值小于/spl //下标0/,可以简单地获得瞬时频率的正性。
{"title":"Nontrivial analytic signals with positive instantaneous frequency and band-limited amplitude","authors":"M. Doroslovački","doi":"10.1109/ICASSP.2000.859048","DOIUrl":"https://doi.org/10.1109/ICASSP.2000.859048","url":null,"abstract":"Questions have previously been raised about the existence of an analytic signal with positive instantaneous frequency when the form of the analytic signal is prescribed. Here, it is shown that the complex function a(t)exp[j(/spl omega//sub 0/t+m(t))] is an analytic signal when m(t) is a real periodic function and a(t) is a band-limited real function with the maximum bandwidth depending on /spl omega//sub 0/ and the fundamental frequency of m(t). That implies as a special case m(t) which is simultaneously a periodic and piecewise polynomial. Positivity of the instantaneous frequency is simply obtained by requiring that the absolute value of the first derivative of m(t) is smaller than /spl omega//sub 0/.","PeriodicalId":164817,"journal":{"name":"2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100)","volume":"163 6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129269795","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Neural network for solving generalized eigenvalues of matrix pair 求解矩阵对广义特征值的神经网络
H. B. Yang, L. Jiao
Neural networks for solving a class of generalized eigenvalue problems of matrix pair are proposed, in which a universal function satisfying several conditions is introduced by replacing some ones. For its simplicity in structure and excellence in performance, it can be widely used in many areas including array signal processing, blind equalization and identification. Both the theoretical analysis and the experimental results show that the proposed network can gives the extreme eigenvalue and its corresponding eigenvector of the matrix pair (A, B) in real time.
提出了用于求解一类矩阵对的广义特征值问题的神经网络,其中引入了一个满足多个条件的通用函数,替换了一些条件。由于其结构简单、性能卓越,可广泛应用于阵列信号处理、盲均衡和识别等多个领域。理论分析和实验结果都表明,所提出的网络能实时给出矩阵对(A,B)的极值特征值及其相应的特征向量。
{"title":"Neural network for solving generalized eigenvalues of matrix pair","authors":"H. B. Yang, L. Jiao","doi":"10.1109/ICASSP.2000.860150","DOIUrl":"https://doi.org/10.1109/ICASSP.2000.860150","url":null,"abstract":"Neural networks for solving a class of generalized eigenvalue problems of matrix pair are proposed, in which a universal function satisfying several conditions is introduced by replacing some ones. For its simplicity in structure and excellence in performance, it can be widely used in many areas including array signal processing, blind equalization and identification. Both the theoretical analysis and the experimental results show that the proposed network can gives the extreme eigenvalue and its corresponding eigenvector of the matrix pair (A, B) in real time.","PeriodicalId":164817,"journal":{"name":"2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129305989","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A blind multiuser receiver for the CDMA downlink 用于CDMA下行链路的盲多用户接收机
S. Mudulodu, A. Paulraj
We propose a new blind space-time linear multiuser receiver for the CDMA downlink. The structure in the multiple access interference (MAI) for the downlink makes it possible for a receiver to suppress it without first obtaining the matched filter outputs for all the users (which comprise the sufficient statistics for optimum detection of the transmitted symbols); a 2D-RAKE receiver with appropriately chosen taps can suppress MAI reasonably well. Knowledge of all the users' codes is assumed and the taps of the 2D-RAKE receiver are estimated by making use of the subspace structure in the transmitted signal. Our approach does not require training symbols or channel estimation in order to estimate the receiver taps.
提出了一种用于CDMA下行链路的盲空线性多用户接收机。下行链路的多址干扰(MAI)中的结构使得接收器可以在不首先获得所有用户的匹配滤波器输出(其包含用于最佳检测所传输符号的足够统计量)的情况下抑制它;适当选择轻拍的2D-RAKE接收器可以很好地抑制MAI。假设知道所有用户的编码,并利用发射信号中的子空间结构估计2D-RAKE接收机的抽头。我们的方法不需要训练符号或信道估计来估计接收机的抽头。
{"title":"A blind multiuser receiver for the CDMA downlink","authors":"S. Mudulodu, A. Paulraj","doi":"10.1109/ICASSP.2000.861148","DOIUrl":"https://doi.org/10.1109/ICASSP.2000.861148","url":null,"abstract":"We propose a new blind space-time linear multiuser receiver for the CDMA downlink. The structure in the multiple access interference (MAI) for the downlink makes it possible for a receiver to suppress it without first obtaining the matched filter outputs for all the users (which comprise the sufficient statistics for optimum detection of the transmitted symbols); a 2D-RAKE receiver with appropriately chosen taps can suppress MAI reasonably well. Knowledge of all the users' codes is assumed and the taps of the 2D-RAKE receiver are estimated by making use of the subspace structure in the transmitted signal. Our approach does not require training symbols or channel estimation in order to estimate the receiver taps.","PeriodicalId":164817,"journal":{"name":"2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129733491","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Codeword quantization for interference avoidance 避免干扰的码字量化
D. Popescu, C. Rose
Programmable radios offer a new perspective on wireless communications since the modulation method is no longer fixed. Adaptive methods where user signatures and corresponding receiver filters are iteratively adapted can be used to improve performance. However, since codeword adjustments must be fed back to the transmitter, compact representation of codewords is extremely important. This issue is important for systems which employ interference avoidance since as opposed to current CDMA systems where uniform-amplitude codeword chips are used, interference avoidance employs real-valued "chips"-real-valued coefficients for a set of orthonormal basis functions of the signal space used by the transmitter and receiver. The paper represents a simple investigation of how codeword quantization affects the performance of interference avoidance algorithms. Results indicate that using 4-5 bits per chip for codeword representation is sufficient to maintain performance close to optimal values.
可编程无线电为无线通信提供了一个新的视角,因为调制方法不再是固定的。可使用自适应方法改进性能,其中迭代地调整用户签名和相应的接收方过滤器。然而,由于码字的调整必须反馈给发射机,码字的紧凑表示是极其重要的。这个问题对于采用干扰避免的系统很重要,因为与使用等幅码字芯片的当前CDMA系统相反,干扰避免使用实值“芯片”-发射机和接收机使用的信号空间的一组正交基函数的实值系数。本文对码字量化如何影响干扰避免算法的性能进行了简单的研究。结果表明,每个芯片使用4-5位用于码字表示足以保持接近最佳值的性能。
{"title":"Codeword quantization for interference avoidance","authors":"D. Popescu, C. Rose","doi":"10.1109/ICASSP.2000.860198","DOIUrl":"https://doi.org/10.1109/ICASSP.2000.860198","url":null,"abstract":"Programmable radios offer a new perspective on wireless communications since the modulation method is no longer fixed. Adaptive methods where user signatures and corresponding receiver filters are iteratively adapted can be used to improve performance. However, since codeword adjustments must be fed back to the transmitter, compact representation of codewords is extremely important. This issue is important for systems which employ interference avoidance since as opposed to current CDMA systems where uniform-amplitude codeword chips are used, interference avoidance employs real-valued \"chips\"-real-valued coefficients for a set of orthonormal basis functions of the signal space used by the transmitter and receiver. The paper represents a simple investigation of how codeword quantization affects the performance of interference avoidance algorithms. Results indicate that using 4-5 bits per chip for codeword representation is sufficient to maintain performance close to optimal values.","PeriodicalId":164817,"journal":{"name":"2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126672391","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
A single-carrier frequency-domain SDMA basestation 一种单载波频域SDMA基站
P. Vandenameele, L. Perre, B. Gyselinckx, M. Engels, M. Moonen, H. Man
Two major technical challenges in the design of future broadband wireless networks are the impairments of the propagation channel and the need for spectral efficiency. We previously proposed a combined OFDM/SDMA approach that mitigates the channel impairments by orthogonal frequency division multiplexing (OFDM) with cyclic prefix insertion and that achieves a high spectral efficiency by space division multiple access (SDMA). However, because of the multicarrier modulation, this approach requires high-backoff power amplifiers in the analog frontend. We present a SC-FD-SDMA basestation, which avoids these expensive amplifiers by using constant-envelope single-carrier (SC) modulation and still features the advantages of frequency-domain (FD) multipath mitigation and SDMA. We pay special attention to the initialization of such basestation and its fixed point requirements, since they are critical aspects of any realistic implementation. A case-study shows how SC-FD-SDMA enables a 100 Mbps wireless LAN with a bandwidth efficiency of 8 bps/Hz and an uncoded BER of 10/sup -3/ at 13.5 dB.
未来宽带无线网络设计面临的两大技术挑战是传播信道的缺陷和对频谱效率的要求。我们之前提出了一种结合OFDM/SDMA的方法,该方法通过带循环前缀插入的正交频分复用(OFDM)减轻了信道损伤,并通过空分多址(SDMA)实现了高频谱效率。然而,由于多载波调制,这种方法需要在模拟前端使用高回退功率放大器。我们提出了一种SC-FD-SDMA基站,它通过使用恒定包络单载波(SC)调制避免了这些昂贵的放大器,并且仍然具有频域(FD)多径缓解和SDMA的优点。我们特别注意这种基站的初始化及其定点需求,因为它们是任何实际实现的关键方面。一个案例研究展示了SC-FD-SDMA如何在13.5 dB下实现带宽效率为8 bps/Hz的100 Mbps无线局域网和10/sup -3/的非编码误码率。
{"title":"A single-carrier frequency-domain SDMA basestation","authors":"P. Vandenameele, L. Perre, B. Gyselinckx, M. Engels, M. Moonen, H. Man","doi":"10.1109/ICASSP.2000.860209","DOIUrl":"https://doi.org/10.1109/ICASSP.2000.860209","url":null,"abstract":"Two major technical challenges in the design of future broadband wireless networks are the impairments of the propagation channel and the need for spectral efficiency. We previously proposed a combined OFDM/SDMA approach that mitigates the channel impairments by orthogonal frequency division multiplexing (OFDM) with cyclic prefix insertion and that achieves a high spectral efficiency by space division multiple access (SDMA). However, because of the multicarrier modulation, this approach requires high-backoff power amplifiers in the analog frontend. We present a SC-FD-SDMA basestation, which avoids these expensive amplifiers by using constant-envelope single-carrier (SC) modulation and still features the advantages of frequency-domain (FD) multipath mitigation and SDMA. We pay special attention to the initialization of such basestation and its fixed point requirements, since they are critical aspects of any realistic implementation. A case-study shows how SC-FD-SDMA enables a 100 Mbps wireless LAN with a bandwidth efficiency of 8 bps/Hz and an uncoded BER of 10/sup -3/ at 13.5 dB.","PeriodicalId":164817,"journal":{"name":"2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100)","volume":"103 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126701805","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 39
Inference in symmetric alpha-stable noise using MCMC and the slice sampler 基于MCMC和切片采样器的对称稳定噪声推断
S. Godsill
We have previously shown how to perform inference about symmetric stable processes using Monte Carlo EM (MCEM) and Markov chain Monte Carlo (MCMC) techniques. Simulation based methods such as these are an excellent tool for inference with stable law distributions, since they do not require any direct evaluation of the stable density function, which is unavailable analytically in the general case. We review the existing methods for inference with MCMC and propose new methods based on the slice sampler, a very simple sampling algorithm which draws points from a uniform distribution over the area under the required density function. There is some evidence in the literature that the slice sampler has better convergence properties than the independence Metropolis samplers and rejection samplers previously proposed. We investigate this in the context of alpha-stable noise distributions.
我们之前已经展示了如何使用蒙特卡罗EM (MCEM)和马尔可夫链蒙特卡罗(MCMC)技术对对称稳定过程进行推理。诸如此类的基于模拟的方法是对稳定定律分布进行推理的极好工具,因为它们不需要对稳定密度函数进行任何直接评估,这在一般情况下是不可用的。我们回顾了现有的MCMC推理方法,并提出了基于切片采样器的新方法,切片采样器是一种非常简单的采样算法,它在所需密度函数下从面积上的均匀分布中提取点。文献中有一些证据表明,切片采样器比先前提出的独立Metropolis采样器和拒绝采样器具有更好的收敛性能。我们在α稳定噪声分布的背景下对此进行了研究。
{"title":"Inference in symmetric alpha-stable noise using MCMC and the slice sampler","authors":"S. Godsill","doi":"10.1109/ICASSP.2000.860232","DOIUrl":"https://doi.org/10.1109/ICASSP.2000.860232","url":null,"abstract":"We have previously shown how to perform inference about symmetric stable processes using Monte Carlo EM (MCEM) and Markov chain Monte Carlo (MCMC) techniques. Simulation based methods such as these are an excellent tool for inference with stable law distributions, since they do not require any direct evaluation of the stable density function, which is unavailable analytically in the general case. We review the existing methods for inference with MCMC and propose new methods based on the slice sampler, a very simple sampling algorithm which draws points from a uniform distribution over the area under the required density function. There is some evidence in the literature that the slice sampler has better convergence properties than the independence Metropolis samplers and rejection samplers previously proposed. We investigate this in the context of alpha-stable noise distributions.","PeriodicalId":164817,"journal":{"name":"2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100)","volume":"111 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126903367","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 27
Robust speech enhancement using amplitude spectral estimator 基于幅度谱估计的鲁棒语音增强
A. Wahab, E. Tan, H. Abut
This paper focuses on a class of speech enhancement systems using the amplitude spectral estimates of noisy speech and noise to drive a Wiener filter to suppress simultaneously the ensemble of degradations picked up by microphones. A simple stereo microphone set, with left and right channels can be used to provide enough separation that unwanted signals can be reduced significantly to yield an acceptable quality speech signal for hands-free telephony applications in a vehicular environment. A generalized transform approach was introduced and experimental results show great potential when using the DCT as alternatives to the traditional Fourier transform approach to derive the amplitude spectral estimates of the corrupted speech signals and the noise process.
本文重点研究了一类语音增强系统,该系统利用噪声语音和噪声的振幅谱估计来驱动维纳滤波器来同时抑制麦克风拾取的退化集合。一套简单的立体声麦克风,带有左右声道,可以提供足够的分离,从而大大减少不必要的信号,从而产生可接受的质量语音信号,用于车载环境中的免提电话应用。介绍了一种广义变换方法,实验结果表明,将DCT作为传统傅立叶变换方法的替代方法,可以得到被破坏语音信号和噪声过程的幅度谱估计。
{"title":"Robust speech enhancement using amplitude spectral estimator","authors":"A. Wahab, E. Tan, H. Abut","doi":"10.1109/ICASSP.2000.860170","DOIUrl":"https://doi.org/10.1109/ICASSP.2000.860170","url":null,"abstract":"This paper focuses on a class of speech enhancement systems using the amplitude spectral estimates of noisy speech and noise to drive a Wiener filter to suppress simultaneously the ensemble of degradations picked up by microphones. A simple stereo microphone set, with left and right channels can be used to provide enough separation that unwanted signals can be reduced significantly to yield an acceptable quality speech signal for hands-free telephony applications in a vehicular environment. A generalized transform approach was introduced and experimental results show great potential when using the DCT as alternatives to the traditional Fourier transform approach to derive the amplitude spectral estimates of the corrupted speech signals and the noise process.","PeriodicalId":164817,"journal":{"name":"2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129207191","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Trellis-based parallel stereo matching 基于网格的平行立体匹配
Yuns Oh, Hong Jeong
We present a center-referenced basis for discrete representation of stereo correspondence that includes new occlusion nodes. This basis improves the inclusion of constraints and the parallelism of the final algorithm. Disparity estimation is formulated in a MAP context and natural constraints are incorporated, resulting in an optimal path problem in a sparsely connected trellis. Like other dynamic programming methods, the computational complexity is low at O(MN/sup 2/) for M/spl times/N pixel images. However, this method is better suited to parallel solution, scaling up to O(MN) processors. Experimental results confirm the performance of this method.
我们提出了一个中心参考基础的立体对应的离散表示,包括新的遮挡节点。这一基础提高了约束的包含和最终算法的并行性。在MAP环境下建立了视差估计,并引入了自然约束条件,得到了稀疏连接网格中的最优路径问题。与其他动态规划方法一样,对于M/spl次/N像素图像,计算复杂度低至0 (MN/sup 2/)。然而,这种方法更适合并行解决方案,扩展到O(MN)处理器。实验结果证实了该方法的有效性。
{"title":"Trellis-based parallel stereo matching","authors":"Yuns Oh, Hong Jeong","doi":"10.1109/ICASSP.2000.859260","DOIUrl":"https://doi.org/10.1109/ICASSP.2000.859260","url":null,"abstract":"We present a center-referenced basis for discrete representation of stereo correspondence that includes new occlusion nodes. This basis improves the inclusion of constraints and the parallelism of the final algorithm. Disparity estimation is formulated in a MAP context and natural constraints are incorporated, resulting in an optimal path problem in a sparsely connected trellis. Like other dynamic programming methods, the computational complexity is low at O(MN/sup 2/) for M/spl times/N pixel images. However, this method is better suited to parallel solution, scaling up to O(MN) processors. Experimental results confirm the performance of this method.","PeriodicalId":164817,"journal":{"name":"2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129290748","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
期刊
2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1