ICSP '98. 1998 Fourth International Conference on Signal Processing (Cat. No.98TH8344): Latest Papers

Fast VQ of multi-tap pitch predictor coefficients

ICSP '98. 1998 Fourth International Conference on Signal Processing (Cat. No.98TH8344)

Pub Date : 1998-10-12 DOI: 10.1109/ICOSP.1998.770278

Zeng Zhihua, Xiao Zimei

Pitch predictors are important in achieving high-quality speech for linear prediction-based analysis-by-synthesis (LPAS) coders. Multi-tap pitch predictors with vector quantization (VQ) of the predictor coefficients have been adopted in a growing number of LPAS coders because they provide high prediction gain. Predictors with more than three taps perform better, but they are seldom used because vector quantizing their coefficients is computationally expensive. This paper proposes a new VQ scheme for multi-tap pitch predictors, the likelihood error criterion-based vector quantization (LEC-VQ). Experiments show that the proposed VQ method can efficiently reduce the computational complexity while maintaining high speech quality.

Citations: 1

A novel lip localization method based on shiftable wavelets transform

ICSP '98. 1998 Fourth International Conference on Signal Processing (Cat. No.98TH8344)

Pub Date : 1998-10-12 DOI: 10.1109/ICOSP.1998.770790

Xu Yanjun, Du Limin, Hou Ziqiang

Visual feature extraction is one of the most important techniques in audiovisual bimodal speech recognition, and it remains a very challenging area in image understanding. A shiftable multiscale transform is introduced into the construction of an active shape model. It uses pyramidal data to describe the structure of an image; this description is invariant to illumination and perspective variability and thus substantially improves the robustness of the model. A segmental downhill simplex method is also put forward to improve the minimization procedure of lip localization. It employs a coarse-to-fine strategy to speed up convergence and improve the robustness of lip localization. Experiments support the validity of the new method and show better robustness and higher efficiency.

Citations: 0

Hierarchical object-oriented video segmentation and representation algorithm

ICSP '98. 1998 Fourth International Conference on Signal Processing (Cat. No.98TH8344)

Pub Date : 1998-10-12 DOI: 10.1109/ICOSP.1998.770771

Jianping Fan, G. Fujita, Jun Yu, Koji Miyanohana, T. Onoye, N. Ishiura, Lide Wu, I. Shirakawa

In this paper, a novel object-oriented hierarchical video segmentation and representation algorithm is proposed based on a four-component video model, where the local variance contrast and the frame difference contrast are selected for generating the 2D spatiotemporal entropy. The extracted object is first represented by a group of (4×4) blocks coarsely, then the intra-block edge extraction on edge blocks and the joint spatiotemporal similarity test among neighboring blocks are further performed for determining meaningful real objects. This proposed hierarchical segmentation algorithm may be very useful for MPEG-4 applications. A novel fast algorithm is also introduced for reducing the search burden. Moreover, this unsupervised algorithm also makes automatic image and video segmentation possible.

Citations: 2

An initialization method for multi-type prototype fuzzy clustering

ICSP '98. 1998 Fourth International Conference on Signal Processing (Cat. No.98TH8344)

Pub Date : 1998-10-12 DOI: 10.1109/ICOSP.1998.770834

G. Xinbo, Xue Zhong, Li Jie, Xie Weixin

Fuzzy clustering is an important branch of unsupervised classification and has been widely used in pattern recognition and image processing. However, most existing fuzzy clustering algorithms are sensitive to initialization and depend strongly on the number of clusters, which limits their applications. Moreover, in multi-type prototype fuzzy clustering these algorithms also need to know the type and number of prototypes in advance. To overcome these limitations, this paper proposes a method for acquiring a priori knowledge about the clustering prototypes, which obtains better performance in initializing multi-type prototype fuzzy clustering.
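To make the initialization dependence concrete, here is a minimal sketch of the standard fuzzy c-means iteration on one-dimensional data. It is not the initialization method proposed in the paper; the function name `fuzzy_c_means_1d`, the fuzzifier default `m=2.0`, and the fixed iteration count are assumptions chosen for illustration.

```python
def fuzzy_c_means_1d(data, init_centers, m=2.0, iterations=100):
    """Standard fuzzy c-means on scalar data (illustrative sketch only).

    The number of clusters and the starting centers must both be supplied
    by the caller, which is the dependence the abstract refers to.
    """
    centers = list(init_centers)
    memberships = []
    power = 2.0 / (m - 1.0)
    for _ in range(iterations):
        # Membership update: u_ik = 1 / sum_j (d_ik / d_jk)^(2/(m-1)).
        memberships = []
        for x in data:
            dists = [abs(x - c) for c in centers]
            if 0.0 in dists:
                # The point coincides with one or more centers: share full
                # membership among those centers and give none to the rest.
                zeros = dists.count(0.0)
                row = [1.0 / zeros if d == 0.0 else 0.0 for d in dists]
            else:
                row = [1.0 / sum((di / dj) ** power for dj in dists)
                       for di in dists]
            memberships.append(row)
        # Center update: mean of the data weighted by u_ik^m.
        new_centers = []
        for i, old in enumerate(centers):
            weights = [row[i] ** m for row in memberships]
            total = sum(weights)
            if total > 0.0:
                new_centers.append(
                    sum(w * x for w, x in zip(weights, data)) / total)
            else:
                new_centers.append(old)
        centers = new_centers
    # memberships are those computed in the final iteration.
    return centers, memberships
```

Calling `fuzzy_c_means_1d(data, [0.0, 1.0])` on two well-separated groups lets the centers move apart, whereas two identical starting centers receive identical updates and never separate, a simple case of a poor initialization.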

Citations: 3

A novel technique for quasar recognition

ICSP '98. 1998 Fourth International Conference on Signal Processing (Cat. No.98TH8344)

Pub Date : 1998-10-12 DOI: 10.1109/ICOSP.1998.770152

H. Zhou, M.L. Luo

We present a Hough transform (HT)-based technique for quasar recognition. The main purpose of quasar recognition is to identify redshifts. There are two main approaches to calculating redshifts: a direct approach that first recognizes emission peaks, and a statistical approach. We apply the HT as a statistical way to compute redshifts. In addition, we employ two post-processing techniques for the HT: one uses a linear kernel to recover the weakened true peak in the accumulator array; the other is a voting technique to eliminate sharp false peaks. Moreover, we use a weighted HT to further improve the recognition results. Experimental results are satisfactory.
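The accumulator idea can be sketched as follows. This is a generic Hough-style voting scheme, not the authors' implementation: it relies on the relation observed wavelength = rest wavelength × (1 + z), and the function name `vote_redshift`, the bin width, the redshift range, and the approximate rest wavelengths are all assumptions made for illustration. The linear-kernel recovery, false-peak voting, and weighting steps from the abstract are not modeled.

```python
from collections import Counter

# Approximate rest-frame wavelengths in angstroms; illustrative values only.
REST_LINES = [1215.67, 1549.0, 1909.0, 2798.0]


def vote_redshift(observed_peaks, rest_lines=REST_LINES,
                  z_max=5.0, bin_width=0.01):
    """Return (redshift, votes) for the fullest accumulator bin.

    Every pairing of an observed peak with a candidate rest line implies
    one redshift z = observed / rest - 1 and casts one vote for the nearest
    bin. Pairings that match the true redshift pile up in a single bin,
    while wrong pairings scatter across the accumulator.
    """
    accumulator = Counter()
    for obs in observed_peaks:
        for rest in rest_lines:
            z = obs / rest - 1.0
            if 0.0 <= z <= z_max:
                accumulator[round(z / bin_width)] += 1
    if not accumulator:
        return None, 0
    best_bin, votes = max(accumulator.items(), key=lambda kv: kv[1])
    return best_bin * bin_width, votes
```

No peak has to be identified as a specific line beforehand, which is what distinguishes this statistical route from the direct approach.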

Citations: 1

On the computation of wavelet series transform

ICSP '98. 1998 Fourth International Conference on Signal Processing (Cat. No.98TH8344)

Pub Date : 1998-10-12 DOI: 10.1109/ICOSP.1998.770214

Yu Yue, Zhou Jian, Wang Yiliang, L. Fengting, Ge Chenghui

Because the discrete wavelet transform (DWT) can be computed efficiently with a fast algorithm, the DWT is often used to approximate the continuous wavelet transform (CWT) and the wavelet series transform (WST). The accuracy of this approximation is considered an open problem in wavelet theory. In this paper, we first identify three factors that affect the approximation accuracy. Based on sampling theory for wavelet subspaces, two kinds of prefilters are given: one can exactly compute the WST for any signal in the wavelet subspace, and the other can effectively approximate the true WST. Finally, numerical examples are given to show that our algorithms are effective.

Citations: 2

Including a parametric model in Tikhonov-Miller image restoration

ICSP '98. 1998 Fourth International Conference on Signal Processing (Cat. No.98TH8344)

Pub Date : 1998-10-12 DOI: 10.1109/ICOSP.1998.770766

V. Barakat, R. Goutte, R. Prost

We propose a new Tikhonov-Miller restoration method where an a priori model of the solution is included. In sharp contrast with the classical method, this approach allows local information to be incorporated. The main difficulty is to express this local information in a model. We show that this new method can lead to better results than the usual Tikhonov-Miller approach, if a parametric a priori model is used.

Citations: 0

Speaker identification using hidden Markov models

ICSP '98. 1998 Fourth International Conference on Signal Processing (Cat. No.98TH8344)

Pub Date : 1998-10-12 DOI: 10.1109/ICOSP.1998.770285

M. Inman, D. Danforth, S. Hangai, K. Sato

In this study, we show that the use of hidden Markov models (HMMs) significantly enhances the success rate of speaker identification over time. The segment boundary information derived from HMMs provides a means of normalizing the formant patterns obtained from a digital cochlear filter, which we also describe. The use of the digital cochlear filter and HMMs in our study was motivated by two well-known problems in speech recognition generally, i.e. phonetic tempo variability and variability over temporal units of a given length, typically days. We show how these problems can be minimized to achieve more robust speaker identification.

Citations: 10

A maximum entropy algorithm based on the aperiodic model of deconvolution for image restoration

ICSP '98. 1998 Fourth International Conference on Signal Processing (Cat. No.98TH8344)

Pub Date : 1998-10-12 DOI: 10.1109/ICOSP.1998.770791

Shi Dong-cheng, Han Liqiang, W. Hongzhi

We describe the aperiodic matrix model of deconvolution. Its kernel matrix is of high order, so this large matrix cannot be computed directly in a practical image restoration problem, but it can be handled by the DFT. We present a maximum entropy algorithm for image restoration based on the aperiodic matrix model, the FFT algorithm, and the conjugate gradient algorithm (CGA). In the experiments, its total computational burden and memory requirement are moderate, and it can run on an ordinary PC.
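The standard way a DFT handles an aperiodic (linear) convolution is to zero-pad both sequences to the full output length, so that circular wrap-around cannot occur. The one-dimensional sketch below illustrates only that fact; it is not the paper's restoration algorithm. The function names are hypothetical, and a naive O(n²) DFT stands in for an FFT to keep the example dependency-free.

```python
import cmath


def dft(x, inverse=False):
    """Naive discrete Fourier transform; the inverse includes the 1/n factor."""
    n = len(x)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        s = sum(x[t] * cmath.exp(sign * 2j * cmath.pi * k * t / n)
                for t in range(n))
        out.append(s / n if inverse else s)
    return out


def linear_convolve(x, h):
    """Direct aperiodic convolution, equivalent to the full matrix product."""
    y = [0.0] * (len(x) + len(h) - 1)
    for i, xi in enumerate(x):
        for j, hj in enumerate(h):
            y[i + j] += xi * hj
    return y


def dft_convolve(x, h):
    """Same result via the DFT, after zero-padding to len(x) + len(h) - 1."""
    n = len(x) + len(h) - 1
    xp = list(x) + [0.0] * (n - len(x))
    hp = list(h) + [0.0] * (n - len(h))
    spectrum = [a * b for a, b in zip(dft(xp), dft(hp))]
    return [v.real for v in dft(spectrum, inverse=True)]
```

With an FFT in place of the naive transform, the product costs O(n log n) and the large kernel matrix never has to be formed explicitly.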

Citations: 0

The upper limit of the stability of delay-type cellular neural networks

ICSP '98. 1998 Fourth International Conference on Signal Processing (Cat. No.98TH8344)

Pub Date : 1998-10-12 DOI: 10.1109/ICOSP.1998.770859

Zhuang Daming, Ma Shao-han, Jiang Mingyan

This paper gives an improved sufficient condition for ensuring the stability of delayed cellular neural networks (DCNN) by analyzing a corresponding Lyapunov function. This result improves on the limit given by Civalleri and gives an optional upper limit of the stability of DCNN.

Citations: 0
