2008 IEEE International Conference on Acoustics, Speech and Signal Processing最新文献

英文中文

Through wall radar imaging using UWB noise waveforms 采用超宽带噪声波形的穿壁雷达成像

2008 IEEE International Conference on Acoustics, Speech and Signal Processing

Pub Date : 2008-05-12 DOI: 10.1109/ICASSP.2008.4518827

R. Narayanan

This paper examines the results of our research on the use of ultra-wideband noise waveforms for imaging objects behind walls. The advantages of using thermally generated noise as a probing signal are introduced. The technique of heterodyne correlation, used to inject coherence in the random noise probing signal and to collapse the wideband reflected signal into a single frequency, is presented. We address issues related to locating, detection, and tracking humans behind walls using the Hilbert-Huang transform approach for human activity characterization. The results indicate that noise radar technology combined with modern signal processing approaches is indeed a viable technique for covert high-resolution imaging of obscured stationary and moving targets.

本文介绍了利用超宽带噪声波形对墙后物体成像的研究结果。介绍了利用热噪声作为探测信号的优点。提出了一种外差相关技术，用于在随机噪声探测信号中注入相干性，并将宽带反射信号压缩为单频。我们使用希尔伯特-黄变换方法来描述人类活动特征，解决与定位、检测和跟踪墙后人类相关的问题。结果表明，噪声雷达技术与现代信号处理方法相结合，确实是一种可行的高分辨率隐蔽成像技术。

引用次数: 95

SURE-LET multichannel image denoising: undecimated wavelet thresholding SURE-LET多通道图像去噪:无噪小波阈值

2008 IEEE International Conference on Acoustics, Speech and Signal Processing

Pub Date : 2008-05-12 DOI: 10.1109/ICASSP.2008.4517723

F. Luisier, T. Blu

We propose an extension of the recently devised SURE-LET grayscale denoising approach for multichannel images. Assuming additive Gaussian white noise, the unknown linear parameters of a transform-domain/wwfwwe multichannel thresholding are globally optimized by minimizing Stein's unbiased MSE estimate (SURE) in the image-domain. Using the undecimated wavelet transform, we demonstrate the efficiency of this approach for denoising color images by comparing our results with two other state-of-the-art denoising algorithms.

我们提出了最近设计的多通道图像的SURE-LET灰度去噪方法的扩展。假设加性高斯白噪声，通过最小化图像域的Stein's无偏MSE估计(SURE)，对transform-domain/wwfwwe多通道阈值的未知线性参数进行全局优化。使用未消差小波变换，我们通过将我们的结果与其他两种最先进的去噪算法进行比较，证明了这种方法对彩色图像去噪的效率。

引用次数: 3

A Bayesian hierarchical detection framework for parking space detection 车位检测的贝叶斯分层检测框架

2008 IEEE International Conference on Acoustics, Speech and Signal Processing

Pub Date : 2008-05-12 DOI: 10.1109/ICASSP.2008.4518055

Chingchun Huang, Sheng-Jyh Wang, Yao-Jen Chang, Tsuhan Chen

In this paper, a 3-layer Bayesian hierarchical detection framework (BHDF) is proposed for robust parking space detection. In practice, the challenges of the parking space detection problem come from luminance variations, inter- occlusions among cars, and occlusions caused by environmental obstacles. Instead of determining the status of parking spaces one by one, the proposed BHDF framework models the inter-occluded patterns as semantic knowledge and couple local classifiers with adjacency constraints to determine the status of parking spaces in a row-by-row manner. By applying the BHDF to the parking space detection problem, the available parking spaces and the labeling of parked cars can be achieved in a robust and efficient manner. Furthermore, this BHDF framework is generic enough to be used for various kinds of detection and segmentation applications.

本文提出了一种三层贝叶斯分层检测框架(BHDF)，用于鲁棒车位检测。在实践中，停车位检测问题的挑战来自于亮度变化、车辆间的遮挡以及环境障碍物引起的遮挡。提出的BHDF框架不是逐个确定车位的状态，而是将互遮挡模式建模为语义知识，并将局部分类器与邻接约束耦合，以逐行确定车位的状态。将BHDF应用于车位检测问题，可以鲁棒高效地实现可用车位和停放车辆的标注。此外，这个BHDF框架足够通用，可以用于各种类型的检测和分割应用程序。

引用次数: 32

Hands-free system with low-delay subband acoustic echo control and noise reduction 免提系统，低延迟子带回声控制和降噪

2008 IEEE International Conference on Acoustics, Speech and Signal Processing

Pub Date : 2008-05-12 DOI: 10.1109/ICASSP.2008.4517911

Kai Steinert, M. Schönle, C. Beaugeant, T. Fingscheidt

Echo cancellation and noise reduction for hands-free systems are challenging tasks in speech signal processing. The presence of strong local speech and noise and a changing acoustical enclosing may severely impair the performance of the algorithms. Usually additional constraints such as a low signal delay are also requested for real time implementation. We present a hands-free system consisting of a delayless sub- band adaptive filter with a low-delay echo and noise suppression postfilter. All parameters are estimated in the subband domain, whereas the filtering takes place in the time domain. Thus, our system has a significantly lower processing delay than similar proposals. We compare its performance with respect to echo and noise attenuation and speech distortion with a state-of-the-art hands-free system in a simulated car environment.

免提系统的回声消除和降噪是语音信号处理中具有挑战性的课题。强局部语音和噪声的存在以及声封闭的变化会严重影响算法的性能。通常，实时实现还需要额外的约束，例如低信号延迟。我们提出了一个由无延迟子带自适应滤波器、低延迟回波和噪声抑制后滤波器组成的免提系统。所有参数在子带域估计，而滤波在时域进行。因此，我们的系统比类似的提案具有更低的处理延迟。我们将其在模拟汽车环境中的回声和噪音衰减以及语音失真方面的性能与最先进的免提系统进行了比较。

引用次数: 13

Speaker indexing and speech enhancement in real meetings / conversations 真实会议/对话中的演讲者索引和语音增强

2008 IEEE International Conference on Acoustics, Speech and Signal Processing

Pub Date : 2008-05-12 DOI: 10.1109/ICASSP.2008.4517554

S. Araki, M. Fujimoto, K. Ishizuka, H. Sawada, S. Makino

This paper presents a speaker indexing method that uses a small number of microphones to estimate who spoke when. Our proposed speaker indexing is realized by using a noise robust voice activity detector (VAD), a QCC-PHAT based direction of arrival (DOA) estimator, and a DOA classifier. Using the estimated speaker indexing information, we can also enhance the utterances of each speaker with a maximum signal-to-noise-ratio (MaxSNR) beamformer. This paper applies our system to real recorded meetings / conversations recorded in a room with a reverberation time of 350 ms, and evaluates the performance by a standard measure: the diarization error rate (DER). Even for the real conversations, which have many speaker turn-takings and overlaps, the speaker error time was very small with our proposed system. We are planning to demonstrate a real-time speaker indexing system at ICASSP2008.

本文提出了一种利用少量麦克风来估计发言时间的说话人索引方法。我们提出的说话人索引是通过使用噪声鲁棒语音活动检测器(VAD)、基于QCC-PHAT的到达方向(DOA)估计器和DOA分类器实现的。利用估计的说话人索引信息，我们还可以使用最大信噪比波束形成器来增强每个说话人的话语。本文将该系统应用于混响时间为350 ms的室内会议/对话的真实录音，并通过拨号错误率(DER)的标准度量来评估其性能。即使是在真实的对话中，在有很多说话人轮流和重叠的情况下，我们提出的系统也能使说话人的错误时间非常小。我们计划在ICASSP2008上演示一个实时说话人索引系统。

引用次数: 24

Empirical capacity of a biometric channel under the constraint of global PCA and ICA encoding 全局PCA和ICA编码约束下生物识别信道的经验容量

2008 IEEE International Conference on Acoustics, Speech and Signal Processing

Pub Date : 2008-05-12 DOI: 10.1109/ICASSP.2008.4518840

Francesco Nicolo, N. Schmid

The ability of practical biometric systems to recognize a large number of subjects is constrained by a variety of factors that include a choice of a source encoding technique, quality of images, complexity and variability of underlying patterns and of collected data. Given a source encoding technique, the remaining factors can be attributed to distortions due to a biometric recognition channel. In this work, we define empirical mutual information and recognition rate and evaluate empirical recognition capacity of biometric systems under the constraint of two global encoding techniques: principal component analysis (PCA) and independent component analysis (ICA). The empirical capacity of biometric systems is numerically evaluated as a point of intersection of the empirical mutual information rate curve plotted as a function of the recognition rate and the diagonal line bisecting the first quadrant. The developed methodology is applied to find the empirical capacity of different recognition channels formed during acquisition of different iris and face databases.

实际生物识别系统识别大量受试者的能力受到多种因素的限制，这些因素包括源编码技术的选择、图像质量、基础模式和收集数据的复杂性和可变性。给定源编码技术，剩余的因素可以归因于由于生物识别通道的扭曲。在这项工作中，我们定义了经验互信息和识别率，并在两种全局编码技术:主成分分析(PCA)和独立成分分析(ICA)的约束下评估了生物识别系统的经验识别能力。生物识别系统的经验能力被数值评价为经验互信息率曲线的交点，该曲线作为识别率的函数与平分第一象限的对角线绘制。将所开发的方法应用于不同虹膜和人脸数据库采集过程中形成的不同识别通道的经验容量。

引用次数: 1

A unified interpretation of adaptation approaches based on a macroscopic time evolution system and indirect/direct adaptation approaches 基于宏观时间演化系统的适应方法与间接/直接适应方法的统一解释

2008 IEEE International Conference on Acoustics, Speech and Signal Processing

Pub Date : 2008-05-12 DOI: 10.1109/ICASSP.2008.4518602

Shinji Watanabe, Atsushi Nakamura

Incremental adaptation techniques for speech recognition are aimed at adjusting acoustic models quickly and stably to time-variant acoustic characteristics due to temporal changes of speaker, speaking style, noise source, etc. We proposed a novel incremental adaptation framework based on a macroscopic time evolution system, which models the time-variant characteristics by successively updating posterior distributions of acoustic model parameters. In this paper, we provide a unified interpretation of the proposal and the two major conventional approaches of indirect adaptation via transformation parameters (e.g. maximum likelihood linear regression (MLLR)) and direct adaptation of classifier parameters (e.g. maximum a posteriori (MAP)). We reveal analytically and experimentally that the proposed incremental adaptation involves both the conventional and their combinatorial approaches, and simultaneously possesses their quick and stable adaptation characteristics.

语音识别的增量自适应技术旨在快速、稳定地调整声学模型以适应由于说话人、说话方式、噪声源等时间变化而产生的时变声学特征。提出了一种基于宏观时间演化系统的增量自适应框架，通过连续更新声学模型参数的后验分布来模拟时变特征。在本文中，我们对该提议和两种主要的传统方法进行了统一的解释，即通过转换参数间接自适应(例如最大似然线性回归(MLLR))和直接自适应分类器参数(例如最大后验(MAP))。分析和实验结果表明，增量自适应既包括常规自适应方法，也包括组合自适应方法，并同时具有快速稳定的自适应特点。

引用次数: 4

Extracting age information from local spatially flexible patches 从局部空间柔性斑块中提取年龄信息

2008 IEEE International Conference on Acoustics, Speech and Signal Processing

Pub Date : 2008-05-12 DOI: 10.1109/ICASSP.2008.4517715

Shuicheng Yan, Ming Liu, Thomas S. Huang

Motivated by the fact that age information can often be observed from local evidence on the human face, we contribute to the age estimation problem in two aspects. On the one hand, we present a new feature descriptor, called spatially flexible patch (SFP), which encodes the local appearance and position information simultaneously. SFP has the potential to alleviate the problem of insufficient samples owing to that SFPs similar in appearance yet slightly different in position can still provide similar confidence for age estimation. One the other hand, the SFP associated with age label is modeled with Gaussian Mixture Model, and then age estimation is conducted by maximizing the sum of likelihoods from all the SFPs associated with the hypothetic age. Experiments are conducted on the YAMAHA database with 8,000 face images and ages ranging from 0 to 93. Compared with the latest reported results, our new algorithm brings encouraging reduction in mean absolute error for age estimation.

由于年龄信息通常可以从人脸的局部证据中观察到，我们从两个方面对年龄估计问题做出了贡献。一方面，我们提出了一种新的特征描述子，称为空间柔性贴片(SFP)，它可以同时编码局部的外观和位置信息。SFP有可能缓解样品不足的问题，因为外观相似但位置略有不同的SFP仍然可以为年龄估计提供类似的置信度。另一方面，用高斯混合模型对与年龄标签相关的SFP进行建模，然后通过最大化与假设年龄相关的所有SFP的似然和来进行年龄估计。实验是在YAMAHA数据库中进行的，该数据库有8000张人脸图像，年龄从0岁到93岁不等。与最新报道的结果相比，我们的新算法在年龄估计的平均绝对误差上有了令人鼓舞的降低。

引用次数: 82

Stereo matching with asymmetric occlusion handling in weighted least square framework 加权最小二乘框架下非对称遮挡处理的立体匹配

2008 IEEE International Conference on Acoustics, Speech and Signal Processing

Pub Date : 2008-05-12 DOI: 10.1109/ICASSP.2008.4517796

Dongbo Min, K. Sohn

This paper presents a novel method for stereo matching with occlusion handling. In order to estimate optimal cost, we define an energy function and solve the iterative equation with the numerical method. We improve performance and convergence rate by using several acceleration techniques. The proposed method is computationally efficient since it does not use color segmentation or any global optimization techniques. For occlusion handling, which has not been performed effectively by any conventional cost aggregation approaches, we combine the occlusion problem with the proposed minimization scheme. Asymmetric information is used so that few additional computational loads are necessary. Experimental results show that performance is comparable to that of many state-of-the-art methods.

提出了一种基于遮挡处理的立体匹配方法。为了估计最优成本，我们定义了能量函数，并用数值方法求解迭代方程。我们通过使用几种加速技术来提高性能和收敛速度。该方法不使用颜色分割或任何全局优化技术，计算效率高。对于任何传统的成本聚合方法都无法有效执行的遮挡处理，我们将遮挡问题与提出的最小化方案结合起来。使用非对称信息，因此很少需要额外的计算负载。实验结果表明，该方法的性能可与许多最先进的方法相媲美。

引用次数: 2

Classification of GMSK signals with different bandwidths 不同带宽GMSK信号的分类

2008 IEEE International Conference on Acoustics, Speech and Signal Processing

Pub Date : 2008-05-12 DOI: 10.1109/ICASSP.2008.4518034

Anchalee Puengnim, N. Thomas, J. Tourneret, Herve Guillon

This paper studies a Bayesian classifier which recognizes Gaussian minimum shift keying (GMSK) modulated signals with different bandwiths. We focus on identifying two different GMSK signals with BT = 0.25 and BT = 0.5 standardized by the consultative committee for space data system (CCSDS) for future space missions. The main idea of the proposed classifier is to compute the posterior probability of the observation sequence given each possible model by a modified Baum-Welch (BW) algorithm. The received GMSK signals are then classified according to the maximum a posteriori (MAP) rule.

研究了一种贝叶斯分类器，用于识别不同带宽的高斯最小移位键控(GMSK)调制信号。我们着重于识别由空间数据系统协商委员会(CCSDS)为未来空间任务标准化的BT = 0.25和BT = 0.5两种不同的GMSK信号。该分类器的主要思想是通过改进的Baum-Welch (BW)算法计算给定每种可能模型的观测序列的后验概率。然后根据最大后验规则对接收到的GMSK信号进行分类。

引用次数: 2

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2008 IEEE International Conference on Acoustics, Speech and Signal Processing

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀