Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing最新文献

英文中文

High-resolution direction of arrival estimation using minimum-norm method without eigendecomposition 无特征分解的最小范数法高分辨率到达方向估计

Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing

Pub Date : 1994-04-19 DOI: 10.1109/ICASSP.1994.389831

A. Shaw, W. Xia

The minimum-norm method (MNM) for high-resolution directions-of-arrival (DOA) estimation relies on special purpose hardware or software for obtaining the signal and noise subspace eigenvectors of autocorrelation (AC) matrices. It is shown in this paper that the DFT of the AC matrix (DFT-of-AC) essentially performs an equivalent task of separating the signal and noise subspaces. Furthermore, when the signal-subspace part of the DFT-of-AC vectors are used in the minimum-norm framework, almost identical high-resolution DOA estimates are produced. When compared with eigendecomposition-based MNM, the computational load of the proposed DFT-based approach (D-MNM) is lower but the bias, mean-squared error and the root locations are almost similar. The simulations further show that at low SNR the performance of D-MNM is more robust and it also has superior dynamic range.<>

用于高分辨率到达方向(DOA)估计的最小范数方法(MNM)依赖于专用的硬件或软件来获取自相关(AC)矩阵的信号和噪声子空间特征向量。本文表明，交流矩阵的DFT (DFT-of-AC)本质上完成了分离信号和噪声子空间的等效任务。此外，当在最小范数框架中使用交流向量的dft的信号子空间部分时，产生几乎相同的高分辨率DOA估计。与基于特征分解的MNM相比，本文提出的DFT-based方法(D-MNM)的计算量更小，但偏差、均方误差和根位置几乎相同。仿真结果进一步表明，在低信噪比条件下，D-MNM具有更好的鲁棒性和动态范围。

引用次数: 1

Development of a text-to-speech system for Japanese based on waveform splicing 基于波形拼接的日语文本转语音系统的开发

Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing

Pub Date : 1994-04-19 DOI: 10.1109/ICASSP.1994.389230

H. Kawai, N. Higuchi, Tohru Shimizu, Seiichi Yamamoto

A text-to-speech system for Japanese was developed based on waveform splicing. A stored unit is a sequence of phonemes segmented at vowel-consonant boundaries. Four and eight phoneme groups are distinguished for the preceding and succeeding phonemic environment, respectively. An inventory of waveform segments including frequently used 1020 units was constructed based on a statistical analysis of a text database consisting of 20 million phonemes. Each stored unit has, on average, 2.5 waveform segments with different fundamental frequency (F/sub 0/) and phoneme duration. The F/sub 0/ and phoneme duration are modified by a pitch synchronous overlap add (PSOLA) method. A time window which has a flat portion at its center (Tukey window) was adopted in place of an ordinary Hanning window. A preference test indicated that the Tukey window gives better quality when the F/sub 0/ is lowered. The articulation score of an intelligibility test was 89.2%.<>

提出了一种基于波形拼接的日语文本转语音系统。存储单元是按元音-辅音边界分割的音素序列。四个音素组和八个音素组分别为前音素环境和后音素环境。在对包含2000万个音素的文本数据库进行统计分析的基础上，构建了包含1020个常用单元的波形段清单。每个存储单元平均有2.5个不同基频(F/sub 0/)和音素持续时间的波形段。通过音高同步重叠添加(PSOLA)方法修改F/sub / 0/和音素持续时间。在其中心有一个平坦部分的时间窗(土基窗)被采用来代替普通的汉宁窗。偏好测试表明，当F/sub 0/降低时，Tukey窗口的质量更好。可理解性测验的发音得分为89.2%。

引用次数: 6

Low-complexity encoding of speech LSF parameters using constrained-storage TSVQ 基于约束存储TSVQ的低复杂度语音LSF参数编码

Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing

Pub Date : 1994-04-19 DOI: 10.1109/ICASSP.1994.389241

W. Chan, David Chemla

Tree structured vector quantization (TSVQ) is employed as a low-complexity approach to performing vector quantization of speech linear prediction coefficients, expressed for the purpose of quantization as line spectral frequency (LSF) parameters. Good tradeoffs between search complexity and distortion-rate performance are obtained using multiple-survivor encoding. The exponential storage-complexity of conventional TSVQ is circumvented by using multiple stages, where one or more tree codebooks may be used in each stage. Experimental results show that for rates between 23-25 bits/frame,the encoding complexity required to achieve "transparent coding" quality ranges from below two hundred to several hundred weighted-squared-error distortion computations per frame.<>

树形结构矢量量化(TSVQ)是一种低复杂度的方法，用于对语音线性预测系数进行矢量量化，用于量化的线性预测系数表示为线谱频率(LSF)参数。多幸存者编码在搜索复杂度和失真率性能之间取得了很好的平衡。通过使用多个阶段，每个阶段可以使用一个或多个树码本，可以避免传统TSVQ的指数级存储复杂性。实验结果表明，对于23-25比特/帧的速率，实现“透明编码”质量所需的编码复杂度在每帧200到数百次加权平方误差失真计算之间。

引用次数: 9

Far field array processing with neural networks 基于神经网络的远场阵列处理

Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing

Pub Date : 1994-04-19 DOI: 10.1109/ICASSP.1994.389665

Brigitte Colnet, J. Haton

Source localisation is among the most important steps in array processing. The authors present a neuromimetic approach in signal processing. A set of neural networks is used to find the azimuth of one or several sources impinging on a linear array of equally spaced sensors. Each network in this set is specialised to determine if there is an emitter in a given angular sector. Thus a neural network has a specific architecture suited to detect and enhance the signal coming from the angular sector it is associated with. The performances of this method on real underwater signals confirm the encouraging results obtained on simulation tests.<>

源定位是数组处理中最重要的步骤之一。作者提出了一种神经模拟信号处理方法。一组神经网络被用来寻找一个或几个源的方位冲击到一个等间距的线性阵列的传感器。该集合中的每个网络都专门用于确定给定角扇区中是否存在发射器。因此，神经网络具有适合于检测和增强来自与之相关联的角扇区的信号的特定结构。该方法对真实水下信号的处理效果与仿真试验结果一致。

引用次数: 5

Approaches to topic identification on the switchboard corpus 总机语料库的主题识别方法

Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing

Pub Date : 1994-04-19 DOI: 10.1109/ICASSP.1994.389275

J. McDonough, Kenney Ng, P. Jeanrenaud, H. Gish, J. R. Rohlicek

Topic identification (TID) is the automatic classification of speech messages into one of a known set of possible topics. The TID task can be view as having three principal components: 1) event generation, 2) keyword event selection, and 3) topic modeling. Using data from the Switchboard corpus, the authors present experimental results for various approaches to the TID problem and compare the relative effectiveness of each. In addition, they examine the effect of keyword set size on identification accuracy and gauge the loss in performance when mismatched topic modeling and keyword selection schemes are used.<>

主题识别(TID)是将语音信息自动分类为一组已知的可能主题之一。可以将TID任务视为具有三个主要组件:1)事件生成、2)关键字事件选择和3)主题建模。利用交换机语料库中的数据，作者给出了处理TID问题的各种方法的实验结果，并比较了每种方法的相对有效性。此外，他们还研究了关键字集大小对识别准确性的影响，并测量了使用不匹配的主题建模和关键字选择方案时的性能损失。

引用次数: 75

Cluster-based probability model applied to image restoration and compression 聚类概率模型在图像恢复与压缩中的应用

Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing

Pub Date : 1994-04-19 DOI: 10.1109/ICASSP.1994.389408

Ashok Popat, Rosalind W. Picard

The performance of a statistical signal processing system is determined in large part by the accuracy of the probabilistic model it employs. Accurate modeling often requires working in several dimensions, but doing so can introduce dimensionality-related difficulties. A previously introduced model circumvents some of these difficulties while maintaining accuracy sufficient to account for much of the high-order, nonlinear statistical interdependence of samples. Properties of this model are reviewed, and its power demonstrated by application to image restoration and compression. Also described is a vector quantization (VQ) scheme which employs the model in entropy coding a Z/sup N/-lattice. The scheme has the advantage over standard VQ of bounding maximum instantaneous errors.<>

统计信号处理系统的性能在很大程度上取决于它所采用的概率模型的准确性。准确的建模通常需要在多个维度上工作，但是这样做会引入与维度相关的困难。先前介绍的模型绕过了这些困难，同时保持足够的精度来解释样本的高阶非线性统计相互依赖。回顾了该模型的性质，并通过对图像恢复和压缩的应用证明了该模型的有效性。本文还介绍了一种利用该模型对Z/sup N/-晶格进行熵编码的矢量量化(VQ)方案。该方案优于限定最大瞬时误差的标准VQ。

引用次数: 25

An overview of multiple-window and quadratic-inverse spectrum estimation methods 多窗口和二次逆谱估计方法综述

Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing

Pub Date : 1994-04-19 DOI: 10.1109/ICASSP.1994.389911

D. Thomson

Presents examples, history, and a brief review of the theory of multiple-window and quadratic-inverse spectrum estimation methods for mixed harmonizable processes. In addition to the standard uses of making consistent non-parametric auto- and cross-spectrum estimates with jackknife confidence intervals and estimating periodic components in coloured noise, quadratic-inverse theory gives a time-frequency decomposition for stochastic processes. This leads to new estimates of both common and less-familiar functions such as the "time-derivative" of a spectrum.<>

介绍了混合调和过程的多窗口和二次逆谱估计方法的实例、历史和简要回顾。除了使用叠刀置信区间进行一致的非参数自估计和交叉谱估计以及估计彩色噪声中的周期分量的标准用途外，二次逆理论还给出了随机过程的时频分解。这导致了对常见和不太熟悉的函数(如频谱的“时间导数”)的新估计。

引用次数: 29

A new efficient subspace tracking algorithm based on singular value decomposition 一种新的基于奇异值分解的子空间跟踪算法

Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing

Pub Date : 1994-04-19 DOI: 10.1109/ICASSP.1994.389774

A. Kavcic, Bin Yang

A new algorithm for signal subspace tracking is presented. It is based on an approximated singular value decomposition using interlaced QR-updating and Jacobi plane rotations. By forcing the noise subspace to be spherical, the computational complexity of the algorithm is brought down to O(nr), where n is the problem dimension and r is the desired number of signal components. The algorithm lends itself for a very efficient systolic array implementation, resulting in a throughput of O(n/sup 0/). Simulations show that the frequency tracking capabilities of the new method are at least as good as those of the computationally much more expensive exact singular value decomposition.<>

提出了一种新的信号子空间跟踪算法。它是基于一个近似的奇异值分解，使用交错qr更新和雅可比平面旋转。通过将噪声子空间强制为球形，将算法的计算复杂度降至O(nr)，其中n为问题维数，r为期望的信号分量数。该算法适合于非常有效的收缩数组实现，导致吞吐量为0 (n/sup 0/)。仿真结果表明，新方法的频率跟踪能力至少与计算成本高得多的精确奇异值分解方法相当

引用次数: 16

Monitoring the stage of diagonalization in Jacobi-type methods 雅可比型方法对角化阶段的监测

Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing

Pub Date : 1994-04-19 DOI: 10.1109/ICASSP.1994.389995

J. Gotze

Since the stage of diagonalization of Jacobi-type methods is difficult to monitor in a parallel environment, it is usually proposed to execute a predetermined number of sweeps (iterations) on a parallel processor array. A possibility for monitoring the stage of diagonalization is essential in order to avoid the execution of a significant number of unnecessary sweeps. Based on a Lemma used for a generalized proof of the quadratic convergence of the Jacobi EVD and SVD methods a new criteria for monitoring the stage of diagonalization is derived. Using this criteria it can easily be monitored when the stage of quadratic convergence is reached (only one bit yields this information). Therefore, only the (small) number of quadratically convergent sweeps must be predetermined. A further similar criteria particularly useful for Jacobi-type methods using CORDIC-based approximate rotations is also given.<>

由于jacobi型方法的对角化阶段在并行环境中难以监控，因此通常建议在并行处理器阵列上执行预定数量的扫描(迭代)。为了避免执行大量不必要的扫井，监测对角化阶段的可能性是必不可少的。基于Jacobi EVD和SVD方法二次收敛性的一个广义证明引理，导出了监测对角化阶段的新准则。使用这个准则可以很容易地监测到何时达到二次收敛阶段(只有一个比特产生这个信息)。因此，只有(少量)次收敛扫描必须预先确定。还给出了一个类似的准则，特别适用于使用基于cordic的近似旋转的jacobi型方法。

引用次数: 10

SLHMM: a continuous speech recognition system based on Alphanet-HMM SLHMM:基于alpha - hmm的连续语音识别系统

Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing

Pub Date : 1994-04-19 DOI: 10.1109/ICASSP.1994.389317

J. D. Verdejo, J. C. Segura, P. García-Teodoro, A. Rubio

This paper presents a new framework developed to apply Alphanets to CSR. For this purpose, a modular system is proposed. This system is made up by three different modules: LVQ module, SLHMM module and DP module. The SLHMM module is an expansion of an Alphanet, and therefore, can be interpreted as a HMM. The system can be trained globally applying backpropagation techniques. The used pruning procedure is based upon recognized units instead of observations, which reduces the number of nodes needed to recognize a sentence, compared to HMM-based systems using the same parameters for the models in both systems. Besides, the training procedure re-adapts the weights according to the new architecture in a few iterations since the initial parameters can be estimated from a classical HMM CSR system.<>

本文提出了一个新的框架，将字母表应用于企业社会责任。为此，提出了一种模块化系统。该系统由三个不同的模块组成:LVQ模块、SLHMM模块和DP模块。SLHMM模块是alpha模块的扩展，因此可以理解为HMM模块。应用反向传播技术可以对系统进行全局训练。所使用的修剪过程基于已识别的单元而不是观察值，与基于hmm的系统在两个系统中使用相同的模型参数相比，这减少了识别句子所需的节点数量。此外，由于初始参数可以从经典HMM CSR系统中估计出来，因此训练过程可以在几次迭代中根据新的体系结构重新调整权重。

引用次数: 1

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀