Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks

Proceedings of the 23rd International Conference on Machine Learning. Pub Date: 2006-06-25. DOI: 10.1145/1143844.1143891

Alex Graves, Santiago Fernández, F. Gomez, J. Schmidhuber

Citations: 4767

Abstract

Many real-world sequence learning tasks require the prediction of sequences of labels from noisy, unsegmented input data. In speech recognition, for example, an acoustic signal is transcribed into words or sub-word units. Recurrent neural networks (RNNs) are powerful sequence learners that would seem well suited to such tasks. However, because they require pre-segmented training data, and post-processing to transform their outputs into label sequences, their applicability has so far been limited. This paper presents a novel method for training RNNs to label unsegmented sequences directly, thereby solving both problems. An experiment on the TIMIT speech corpus demonstrates its advantages over both a baseline HMM and a hybrid HMM-RNN.
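The abstract does not spell out the method itself. As a rough illustration only, the sketch below assumes the usual formulation of connectionist temporal classification: the network has one extra "blank" output, a frame-level path is mapped to a label sequence by merging repeated symbols and then deleting blanks, and the probability of a label sequence is the sum over all paths that map to it, computed with a forward recursion. The names `BLANK`, `collapse`, and `ctc_prob` are hypothetical, and `probs` stands in for per-frame softmax outputs of an RNN.

```python
from itertools import groupby

BLANK = 0  # assumed index of the extra "blank" output


def collapse(path):
    """Map a frame-level path to labels: merge repeats, then drop blanks."""
    return tuple(k for k, _ in groupby(path) if k != BLANK)


def ctc_prob(probs, labels):
    """Probability of `labels` given per-frame distributions.

    probs[t][k] is the probability of symbol k at frame t. The sum over
    all paths that collapse to `labels` is computed by a forward
    recursion over the label sequence with blanks interleaved.
    """
    ext = [BLANK]
    for label in labels:
        ext += [label, BLANK]
    size = len(ext)

    # A path may start with a blank or with the first label.
    alpha = [0.0] * size
    alpha[0] = probs[0][ext[0]]
    if size > 1:
        alpha[1] = probs[0][ext[1]]

    for t in range(1, len(probs)):
        new = [0.0] * size
        for s in range(size):
            total = alpha[s]                      # stay on the same symbol
            if s >= 1:
                total += alpha[s - 1]             # advance by one position
            if s >= 2 and ext[s] != BLANK and ext[s] != ext[s - 2]:
                total += alpha[s - 2]             # skip the blank between distinct labels
            new[s] = total * probs[t][ext[s]]
        alpha = new

    # A path may end on the last label or on the trailing blank.
    return alpha[-1] + (alpha[-2] if size > 1 else 0.0)


# Example: three symbols (blank, 1, 2) over four frames.
probs = [
    [0.1, 0.7, 0.2],
    [0.6, 0.3, 0.1],
    [0.2, 0.2, 0.6],
    [0.5, 0.1, 0.4],
]
print(collapse((1, 1, 0, 1, 2, 2)))  # (1, 1, 2)
print(ctc_prob(probs, [1, 2]))
```

Training would then maximise the log of this probability for the target labelling, which is why no frame-level segmentation of the training data is needed under this formulation.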


