Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181)最新文献

英文中文

Application of basis pursuit in spectrum estimation 基追踪在频谱估计中的应用

Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181)

Pub Date : 1998-05-15 DOI: 10.1109/ICASSP.1998.681827

S. Chen, D. Donoho

We apply basis pursuit, an atomic decomposition technique, for spectrum estimation. Compared with several modern time series methods, our approach can greatly reduce the problem of power leakage; it is able to superresolve; moreover, it works well with noisy and unevenly sampled signals. We present experiments on bizarrely spaced radial velocity data from one of the newly-discovered extrasolar planetary systems.

我们将原子分解技术——基追踪技术应用于光谱估计。与几种现代时间序列方法相比，我们的方法可以大大减少功率泄漏问题;它能够超分辨;此外，它还能很好地处理噪声和不均匀采样信号。我们展示了一个新发现的太阳系外行星系统的奇怪间隔径向速度数据的实验。

引用次数: 63

Extraction of detailed image regions for content-based image retrieval 基于内容的图像检索中详细图像区域的提取

Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181)

Pub Date : 1998-05-15 DOI: 10.1109/ICASSP.1998.679690

D. Androutsos, K. Plataniotis, A. Venetsanopoulos

We present a technique for coarsely extracting the regions of natural color images which contain directional detail, e.g., edges, texture, etc., which we then use for image database indexing. As a measure of color activity, we use a perceptually modified distance measure based on the sum-of-angles criterion. We then apply histogram thresholding techniques to separate the image into smooth color regions and busy regions where edge, texture and colour activity exists. Database indices are then created from the busy regions using the directional detail histogram technique and retrieval is performed using these.

我们提出了一种粗略提取包含方向细节(如边缘、纹理等)的自然彩色图像区域的技术，然后将其用于图像数据库索引。作为颜色活动的度量，我们使用基于角度和准则的感知修改距离度量。然后，我们应用直方图阈值分割技术将图像分为平滑颜色区域和存在边缘、纹理和颜色活动的繁忙区域。然后使用定向细节直方图技术从繁忙区域创建数据库索引，并使用这些索引执行检索。

引用次数: 6

Improving vocabulary independent HMM decoding results by using the dynamically expanding context 利用动态扩展上下文改进词汇无关HMM解码结果

Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181)

Pub Date : 1998-05-12 DOI: 10.1109/ICASSP.1998.675394

M. Kurimo

A method is presented to correct phoneme strings produced by a vocabulary independent speech recognizer. The method first extracts the N best matching result strings using mixture density hidden Markov models (HMMs) trained by neural networks. Then the strings are corrected by the rules generated automatically by the dynamically expanding context (DEC). Finally, the corrected string candidates and the extra alternatives proposed by the DEC are ranked according to the likelihood score of the best HMM path to generate the obtained string. The experiments show that N need not be very large and the method is able to decrease recognition errors from a test data that even has no common words with the training data of the speech recognizer.

提出了一种对词汇无关语音识别器产生的音素字符串进行校正的方法。该方法首先利用神经网络训练的混合密度隐马尔可夫模型(hmm)提取N个最佳匹配结果串;然后根据动态扩展上下文(DEC)自动生成的规则对字符串进行校正。最后，根据最佳HMM路径的似然得分，对修正后的候选字符串和DEC提出的额外备选字符串进行排序，生成得到的字符串。实验表明，该方法不需要很大的N，并且可以在没有常用词的测试数据中减少与语音识别器训练数据的识别误差。

引用次数: 2

Improved robustness for speech recognition under noisy conditions using correlated parallel model combination 利用相关并行模型组合提高噪声条件下语音识别的鲁棒性

Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181)

Pub Date : 1998-05-12 DOI: 10.1109/ICASSP.1998.674490

J. Hung, Jia-Lin Shen, Lin-Shan Lee

The parallel model combination (PMC) technique has been shown to achieve very good performance for speech recognition under noisy conditions. In this approach, the speech signal and the noise are assumed uncorrelated during modeling. A new correlated PMC is proposed by properly estimating and modeling the nonzero correlation between the speech signal and the noise. Preliminary experimental results show that this correlated PMC can provide significant improvements over the original PMC in terms of both the model differences and the recognition accuracies. Error rate reduction on the order of 14% can be achieved.

并行模型组合(PMC)技术在噪声条件下具有很好的语音识别性能。该方法在建模过程中假设语音信号和噪声不相关。通过对语音信号与噪声之间的非零相关性进行适当的估计和建模，提出了一种新的相关PMC。初步的实验结果表明，这种相关PMC在模型差异和识别精度方面都比原始PMC有显著改善。错误率可以降低14%左右。

引用次数: 12

Topic adaptation for language modeling using unnormalized exponential models 基于非归一化指数模型的主题自适应语言建模

Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181)

Pub Date : 1998-05-12 DOI: 10.1109/ICASSP.1998.675356

Stanley F. Chen, K. Seymore, R. Rosenfeld

We present novel techniques for performing topic adaptation on an n-gram language model. Given training text labeled with topic information, we automatically identify the most relevant topics for new text. We adapt our language model toward these topics using an exponential model, by adjusting the probabilities in our model to agree with those found in the topical subset of the training data. For efficiency, we do not normalize the model; that is, we do not require that the "probabilities" in the language model sum to 1. With these techniques, we were able to achieve a modest reduction in speech recognition word-error rate in the broadcast news domain.

我们提出了在n-gram语言模型上执行主题自适应的新技术。给定标有主题信息的训练文本，我们自动为新文本识别最相关的主题。我们使用指数模型调整我们的语言模型来适应这些主题，通过调整我们模型中的概率使其与训练数据的主题子集中的概率一致。为了提高效率，我们没有将模型归一化;也就是说，我们不要求语言模型中的“概率”之和为1。利用这些技术，我们能够在广播新闻领域实现语音识别错误率的适度降低。

引用次数: 65

A comparison of a priori threshold setting procedures for speaker verification in the CAVE project CAVE项目中说话人验证先验阈值设置程序的比较

Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181)

Pub Date : 1998-05-12 DOI: 10.1109/ICASSP.1998.674383

Jean-Benoît Pierrot, J. Lindberg, J. Koolwaaij, H. Hutter, D. Genoud, M. Blomberg, F. Bimbot

The issue of a priori threshold setting in speaker verification is a key problem for field applications. In the context of the Caller Verification in Banking and Telecommunications (CAVE) project, we compared several methods for estimating speaker-independent and speaker-dependent decision thresholds. Relevant parameters are estimated from development data only, i.e. without resorting to additional client data. The various approaches are tested on the Dutch SESP database.

说话人验证中的先验阈值设置问题是现场应用中的一个关键问题。在银行和电信呼叫验证(CAVE)项目的背景下，我们比较了几种估计说话人独立和说话人依赖决策阈值的方法。相关参数仅根据开发数据估计，即无需诉诸额外的客户数据。各种方法在荷兰SESP数据库中进行了测试。

引用次数: 36

Improvements in children's speech recognition performance 儿童语音识别能力的提高

Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181)

Pub Date : 1998-05-12 DOI: 10.1109/ICASSP.1998.674460

S. Das, D. Nix, M. Picheny

There are several reasons why conventional speech recognition systems modeled on adult data fail to perform satisfactorily on children's speech input. For instance, children's vocal characteristics differ significantly from those of adults. In addition, their choices of vocabulary and sentence construction modalities usually do not conform to adult patterns. We describe comparative studies demonstrating the performance gain realized by adopting to children's acoustic and language model data to construct a children's speech recognition system.

基于成人数据的传统语音识别系统在处理儿童语音输入时表现不佳，有几个原因。例如，儿童的声音特征与成人有很大的不同。此外，他们的词汇选择和造句方式往往不符合成人的模式。我们描述了通过比较研究来证明采用儿童声学和语言模型数据来构建儿童语音识别系统所实现的性能增益。

引用次数: 88

A new general distance measure for quantization of LSF and their transformed coefficients 一种量化LSF及其变换系数的通用距离度量方法

Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181)

Pub Date : 1998-05-12 DOI: 10.1109/ICASSP.1998.674363

H. Vu, László Lois

We have developed a new general distance measure that not only can be used in a vector quantization (VQ) of the line spectrum frequency (LSF) parameters but performs well in the LSF transformed domain. The new distance is based on the spectral sensitivity of the LSF and their transformed coefficients. In addition, the fixed scaling factor is used to decrease the sensitivity of the spectral error at higher frequencies. Experimental results have shown that the proposed distance measure leads to as good as or better performance of VQ compared to other methods in the field of LSF coding. The use of this distance as the weighting function of the LSFs' transformed parameters is also suggested.

我们开发了一种新的通用距离度量，它不仅可以用于线谱频率(LSF)参数的矢量量化(VQ)，而且在LSF变换域具有良好的性能。新的距离是基于LSF的光谱灵敏度及其转换系数。此外，采用固定的比例因子降低了高频谱误差的灵敏度。实验结果表明，与LSF编码领域的其他方法相比，所提出的距离度量可以获得与VQ相同或更好的性能。本文还建议使用这个距离作为lsf变换参数的加权函数。

引用次数: 7

A hybrid equalizer merging the advantages of Baud spaced and fractionally spaced equalizers 融合波特间隔均衡器和分数间隔均衡器优点的混合均衡器

Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181)

Pub Date : 1998-05-12 DOI: 10.1109/ICASSP.1998.679584

Christian Lütkemeyer, Hans-Martin Blüthgen, T. Noll

A transversal equalizer with half Baud spaced taps in the center and extended with Baud spaced taps on both sides is presented. This hybrid equalizer combines the benefits of Baud spaced equalizers-like superior equalization of notches in the middle of the transmission band-and fractionally spaced equalizers, which have a superior performance when equalizing asymmetric notches in the slope of the transmission band, when the same number of coefficients are used. The hybrid equalizer offers a reduced sensitivity to sampling time changes and the ability to model the matched filter in the receiver as the fractionally spaced equalizer. The problem of tap-wandering, which is present in fractionally spaced equalizers, is reduced due to the reduced degree of freedom in the coefficient adjustment.

提出了一种中间有半波特间隔抽头，两侧有波特间隔抽头的横向均衡器。这种混合均衡器结合了波特间隔均衡器的优点，比如在传输频带中间的凹痕的优越均衡，以及分数间隔均衡器，当使用相同数量的系数时，分数间隔均衡器在均衡传输频带斜坡上的不对称凹痕时具有优越的性能。混合均衡器降低了对采样时间变化的灵敏度，并且能够将接收器中的匹配滤波器建模为分数间隔均衡器。分数间隔均衡器中存在的抽头偏移问题，由于系数调整的自由度降低而得到了减少。

引用次数: 0

Vector-sensor array processing for estimating angles and times of arrival of multipath communication signals 矢量传感器阵列处理估计多径通信信号的角度和到达时间

Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181)

Pub Date : 1998-05-12 DOI: 10.1109/ICASSP.1998.679576

Peng-Huat Chua, C. See, A. Nehorai

We develop vector-sensor array processing to estimate the angles-of-arrival (AOAs) and time delays of multipath channels in the space-time-polarization domain. A MUSIC-type algorithm for joint angle and delay estimation with a vector-sensor array is derived. Potential applications include multipath channel estimation and mobile localization. Simulation results show that the space-time-polarization parameterization of the multipath channels results in improved accuracy and resolution performance.

我们发展了矢量传感器阵列处理来估计多径信道在时空极化域中的到达角和时延。提出了一种基于矢量传感器阵列的关节角和时延估计music型算法。潜在的应用包括多径信道估计和移动定位。仿真结果表明，对多径信道进行时空极化参数化可以提高精度和分辨率。

引用次数: 17

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀