ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing最新文献

英文中文

A high resolution data-adaptive time-frequency representation 一种高分辨率数据自适应时频表示

ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing

Pub Date : 1990-12-01 DOI: 10.1109/ICASSP.1987.1169580

Douglas L. Jones, T. Parks

We present a data-adaptive time-frequency representation that obtains high resolution of signal components in time-frequency. This representation overcomes the often poor resolution of the traditional short-time Fourier transform, while avoiding the nonlinearities that make the Wigner distribution and other bilinear representations difficult to interpret and use. The new method uses adaptive Gaussian windows, with the window parameters varying at different time-frequency locations to maximize the local signal concentration in time-frequency. Two methods for selecting the Gaussian parameters are presented: a parameter estimation approach, and a method that maximizes a measure of local signal concentration.

提出了一种数据自适应时频表示方法，可获得高时频分辨率的信号分量。这种表示克服了传统短时傅里叶变换的低分辨率，同时避免了使维格纳分布和其他双线性表示难以解释和使用的非线性。该方法采用自适应高斯窗，在不同时频位置设置不同的窗参数，最大限度地提高时频局部信号的集中程度。提出了两种选择高斯参数的方法:一种是参数估计法，另一种是最大化局部信号浓度的方法。

引用次数: 286

A fast prediction-error detector for estimating sparse-spike sequences 稀疏脉冲序列估计的快速预测误差检测器

ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing

Pub Date : 1989-05-01 DOI: 10.1109/ICASSP.1987.1169788

G. Giannakis, J. Mendel, Xiaofeng Zhao

Based on the Maximum-Likelihood principle, we develop a locally optimal method for detecting the location and estimating the amplitude of spikes in a sequence, which are considered the random input of a known ARMA model. A Bernoulli-Gaussian product model is adopted for the sparse-spike sequence, and the available data consist of a single, noisy, output record. By employing a Prediction-Error formulation our iterative algorithm guarantees the increase of a unique likelihood function used for the combined estimation/detection problem. Amplitude estimation is carried out with Kalman smoothing techniques, and event detection is performed in two ways, as an event adder and as an event remover. Synthetic examples verify that our algorithm is self-initialized, consistent, and fast.

基于极大似然原理，我们开发了一种局部最优方法来检测序列中峰值的位置和估计峰值的幅度，该序列被认为是已知ARMA模型的随机输入。稀疏尖峰序列采用伯努利-高斯积模型，可用数据由单个、有噪声的输出记录组成。通过采用预测误差公式，我们的迭代算法保证了用于组合估计/检测问题的唯一似然函数的增加。使用卡尔曼平滑技术进行幅度估计，并以两种方式进行事件检测，作为事件加法器和事件去除器。综合示例验证了我们的算法是自初始化的、一致的和快速的。

引用次数: 14

Some applications of mathematical morphology to range imagery 数学形态学在距离图像中的一些应用

ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing

Pub Date : 1987-12-23 DOI: 10.1109/ICASSP.1987.1169668

T. R. Esselman, J. Verly

Although little known, mathematical morphology (MM) offers great potential in the areas of image enhancement, feature extraction, and object recognition. MM has the intrinsic ability to quantitatively analyze object shapes in both 2 and 3 dimensions. Using MM to extract features and recognize objects in range imagery seems particularly appropriate since range data is a natural source of shape information. We present several experimental results of applying MM techniques to real and synthetic range imagery, both for noise removal and feature extraction.

尽管鲜为人知，数学形态学(MM)在图像增强、特征提取和对象识别领域提供了巨大的潜力。MM具有定量分析二维和三维物体形状的内在能力。使用MM提取特征和识别距离图像中的物体似乎特别合适，因为距离数据是形状信息的自然来源。我们介绍了将MM技术应用于真实和合成距离图像的几个实验结果，包括噪声去除和特征提取。

引用次数: 17

Almost unique specification of discrete finite length signal: From its end point and Fourier transform magnitude 几乎唯一的离散有限长度信号的规范:从它的端点和傅里叶变换幅度

ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing

Pub Date : 1987-04-06 DOI: 10.1109/ICASSP.1987.1169328

Lei Xu, P. Yan, Tong Chang

In this paper, the reconstruction of discrete signal with finite time duration from its end point and Fourier Transform (FT) magnitude is considered. Based on one result of [1] that a class of discrete signal can be reconstructed from its FT magnitude and one end sample point, with the help of Measure Theory, furtherly we point out that a correspondence between RN+1space and discrete signals with duration of N+1 points can be set up, and the signals that can't be reconstructed from its end point and FT magnitude correspond to a subset of RN+1with measure zero. In other words, discrete signal with finite time duration can almost be uniquely reconstructed from its end point and FT magnitude.

本文研究了有限时长的离散信号从其端点和傅里叶变换(FT)幅度出发的重构问题。基于文献[1]中一类离散信号可以由其FT幅值和一个端点采样点重构的结果，利用测度理论进一步指出，可以建立RN+1空间与持续时间为N+1点的离散信号的对应关系，并且不能由其端点和FT幅值重构的信号对应于一个测度为0的RN+1子集。换句话说，具有有限持续时间的离散信号几乎可以唯一地从其端点和FT幅值重建。

引用次数: 9

Parameter estimation using the autocorrelation of the discrete Fourier transform 离散傅里叶变换的自相关参数估计

ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing

Pub Date : 1987-04-06 DOI: 10.1109/ICASSP.1987.1169901

M. Manry, C. T. Huddleston

Optimal parameter estimation algorithms are developed using the maximum likelihood technique, when no statistics are available for the parameter. Sub-optimal parameter estimates, using one sample of the autocorrelation of the DFT, have been developed previously. In this paper, maximum likelihood estimates are derived, given the auto-correlation function of the received signal's DFT. These estimates sometimes require less computation time than conventional estimates, and frequently have a closed form or simple iterative implementation.

当没有可用的参数统计时，使用最大似然技术开发了最优参数估计算法。次优参数估计，使用一个样本的自相关的DFT，已经开发了以前。在本文中，给出了接收信号的DFT的自相关函数，导出了极大似然估计。这些评估有时需要比传统评估更少的计算时间，并且经常具有封闭的形式或简单的迭代实现。

引用次数: 1

Integration of acoustic information in a large vocabulary word recognizer 大词汇量词识别器中声学信息的整合

ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing

Pub Date : 1987-04-06 DOI: 10.1109/ICASSP.1987.1169578

Vishwa Gupta, Matthew Lennig, P. Mermelstein

This paper proposes a new way of using vector quantization for improving recognition performance for a 60,000 word vocabulary speaker-trained isolated word recognizer using a phonemic Markov model approach to speech recognition. We show that we can effectively increase the codebook size by dividing the feature vector into two vectors of lower dimensionality, and then quantizing and training each vector separately. For a small codebook size, integration of the results of the two parameter vectors provides significant improvement in recognition performance as compared to the quantizing and training of the entire feature set together. Even for a codebook size as small as 64, the results obtained when using the new quantization procedure are quite close to those obtained when using Gaussian distribution of the parameter vectors.

本文提出了一种基于音位马尔可夫模型的语音识别方法，利用向量量化来提高6万词汇的说话者训练的孤立词识别器的识别性能。通过将特征向量分成两个较低维数的向量，然后分别对每个向量进行量化和训练，可以有效地增加码本的大小。对于较小的码本大小，与对整个特征集进行量化和训练相比，将两个参数向量的结果集成在一起可以显著提高识别性能。即使对于码本大小为64的码本，使用新的量化方法得到的结果与使用参数向量的高斯分布得到的结果非常接近。

引用次数: 93

Binary search trees for vector quantisation 矢量量化的二叉搜索树

ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing

Pub Date : 1987-04-06 DOI: 10.1109/ICASSP.1987.1169380

A. Lowry, Sqama Hossain, W. Millar

This paper presents a data structure based on the k-d binary tree which substantially reduces the search complexity of a full search vector quantiser with negligible degradation in signal-to-noise ratio. The search complexity isk + O(logN)rather than N for a codebook of dimension k and size N. Special features of the structure are (1) the use of a rotational transform prior to encoding and (2) the computational efficiency of the design algorithm due to the simple structure of the k-d tree.

本文提出了一种基于k-d二叉树的数据结构，大大降低了全搜索矢量量化器的搜索复杂度，而信噪比的下降可以忽略不计。对于维度为k，大小为N的码本，搜索复杂度风险为+ O(logN)而不是N。该结构的特殊特征是(1)在编码之前使用旋转变换，(2)由于k-d树的简单结构，设计算法的计算效率很高。

引用次数: 25

Extraction of phonemic variation rules in continuous speech spoken by multiple speakers 多说话人连续讲话中音位变化规律的提取

ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing

Pub Date : 1987-04-06 DOI: 10.1109/ICASSP.1987.1169598

S. Kimura, Y. Nara

This paper describes an interactive extraction of phonemic variation rules in continuous speech spoken by multiple speakers. To realize a continuous speech recognizer, we must first develop a highly accurate phoneme recognizer. The major problem related to phoneme recognizers is the phonemic variations in continuous speech. Our work focuses on the interactive analysis of phonemic variations in continuous speech and the extraction of the phonemic variation rules for many speakers. We extracted 317 rules related to 21 kinds of phonemic variation phenomena from 10,000 Japanese-language phrases spoken by 10 male speakers. With these rules, 97.6% of 36,000 Japanese-language phrases spoken by 36 test speakers (30 males and 6 females) were correctly segmented by our top-down phoneme segmentation system. Furthermore, a subset of the rules for each speaker was automatically obtained. On average, each subset contains 53.2% of the rules.

本文描述了一种交互式的多说话人连续语音音位变化规则提取方法。要实现连续语音识别器，首先要开发高精度的音素识别器。音位识别的主要问题是连续语音中的音位变化。我们的工作重点是对连续语音中的音位变化进行交互分析，并提取多个说话者的音位变化规律。我们从10位男性的1万个日语短语中提取了21种音位变异现象的317条规则。根据这些规则，36名测试者(30名男性和6名女性)所讲的36000个日语短语中，97.6%的短语被我们的自上而下的音素切分系统正确切分。此外，自动获得每个说话人的规则子集。平均而言，每个子集包含53.2%的规则。

引用次数: 4

A low distortion adaptive noise cancellation structure for real time applications 一种适用于实时应用的低失真自适应噪声消除结构

ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing

Pub Date : 1987-04-06 DOI: 10.1109/ICASSP.1987.1169423

M. Al-Kindi, J. Dunlop

This paper describes an adaptive noise cancelling structure suitable for situations where the noise reference transducer is closely spaced relative to the primary transducer. The structure is based on two LMS delay line cancellers with cross coupled feedback. This structure is shown, under certain circumstances, to cancel noise with low signal distortion when the transmission paths between primary and secondary sensors have low attenuation and the primary signal is continuous. The system is shown to have an enhanced performance when the primary signal is intermittent and a signal energy detector is used.

本文介绍了一种自适应降噪结构，适用于噪声参考换能器相对于主换能器距离较近的情况。该结构基于两个具有交叉耦合反馈的LMS延迟线消除器。在一定情况下，当主、次传感器之间的传输路径衰减较低，且主信号连续时，这种结构能够以较低的信号失真抵消噪声。结果表明，当主信号为间歇性信号时，采用信号能量检测器时，系统的性能得到了提高。

引用次数: 26

Fast image segmentation for some machine vision applications 一些机器视觉应用的快速图像分割

ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing

Pub Date : 1987-04-06 DOI: 10.1109/ICASSP.1987.1169665

E. B. Hinkle, J. Sanz

This paper describes the use of an image contrast measure for producing binary segmentations of images in a certain class of applications. This method is well-suited for fast pipeline implementations, because the contrast measure uses only two local features in the image. To eliminate segmentation noise, we post-process the segmentations using binary morphological operations. This method has been applied to three different microelectronics inspection problems, with consistently good results, and experimental results from each of these applications are presented here. Also, we discuss this technique in terms of the theory of polynomial classifiers.

本文描述了在某一类应用中使用图像对比度度量来产生图像的二值分割。这种方法非常适合快速流水线实现，因为对比度测量只使用图像中的两个局部特征。为了消除分割噪声，我们使用二值形态学运算对分割进行后处理。该方法已应用于三种不同的微电子检测问题，具有一致的良好结果，并从这些应用的实验结果在这里提出。此外，我们还从多项式分类器理论的角度讨论了这种技术。

引用次数: 1

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀