Phoneme recognition based on distinctive phonetic features (DPFs) incorporating a syllable based language model

2009 12th International Conference on Computers and Information Technology Pub Date : 2009-12-01 DOI:10.1109/ICCIT.2009.5407123

M. N. Huda, Manoj Banik, G. Muhammad, Bernd J. Kroger

引用次数: 1

Abstract

This paper presents a phoneme recognition method based on distinctive phonetic features (DPFs). The method comprises three stages. The first stage extracts 3 DPF vectors of 15 dimensions each from local features (LFs) of an input speech signal using three multilayer neural networks (MLNs). The second stage incorporates an Inhibition/Enhancement (In/En) network to obtain more categorical DPF movement and decorrelates the DPF vectors using the Gram-Schmidt orthogonalization procedure. Then, the third stage embeds acoustic models (AMs) and language models (LMs) of syllable-based subwords to output more precise phoneme strings. The proposed method provides a higher phoneme correct rate as well as phoneme accuracy with fewer mixture components in hidden Markov models (HMMs).

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于独特语音特征的音素识别，并结合基于音节的语言模型

提出了一种基于显著语音特征的音素识别方法。该方法包括三个阶段。第一阶段使用三个多层神经网络(mln)从输入语音信号的局部特征(LFs)中提取3个各为15维的DPF向量。第二阶段采用抑制/增强(In/En)网络来获得更分类的DPF运动，并使用Gram-Schmidt正交化过程解除DPF向量的关联。然后，第三阶段嵌入基于音节的子词的声学模型(AMs)和语言模型(lm)，以输出更精确的音素字符串。该方法在隐马尔可夫模型(hmm)中具有较高的音素正确率和较少的混合成分的音素准确率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2009 12th International Conference on Computers and Information Technology

自引率

0.00%

发文量