A prosody inspired RNN approach for punctuation of machine produced speech transcripts to improve human readability

A. Moro, György Szaszák
{"title":"A prosody inspired RNN approach for punctuation of machine produced speech transcripts to improve human readability","authors":"A. Moro, György Szaszák","doi":"10.1109/COGINFOCOM.2017.8268246","DOIUrl":null,"url":null,"abstract":"Speech communication human-machine interfaces exploit automatic speech recognition to implement speech-to-text conversion. Unfortunately, in the past, not much effort has been devoted to add punctuation marks to the recognized word chain after speech recognition. This affects human readability and makes interpretation hard. This paper presents an effort to restore punctuation marks by keeping low the latency resulting from this post-processing step. The approach exploits the prosodic structure and proposes a sequential modelling paradigm based on recurrent neural networks. Results show satisfying punctuation restoration abilities, especially taking into account that sentence boundaries are reliably detected. Even if the predicted punctuation sequence is not error free w.r.t. writing standards, human perception is expected to “repair” these errors more easily compared to the case when no punctuation is given at all and the reader is left in confusion regarding the basic segmentation of the word chain.","PeriodicalId":212559,"journal":{"name":"2017 8th IEEE International Conference on Cognitive Infocommunications (CogInfoCom)","volume":"47 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 8th IEEE International Conference on Cognitive Infocommunications (CogInfoCom)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/COGINFOCOM.2017.8268246","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9

Abstract

Speech communication human-machine interfaces exploit automatic speech recognition to implement speech-to-text conversion. Unfortunately, in the past, not much effort has been devoted to add punctuation marks to the recognized word chain after speech recognition. This affects human readability and makes interpretation hard. This paper presents an effort to restore punctuation marks by keeping low the latency resulting from this post-processing step. The approach exploits the prosodic structure and proposes a sequential modelling paradigm based on recurrent neural networks. Results show satisfying punctuation restoration abilities, especially taking into account that sentence boundaries are reliably detected. Even if the predicted punctuation sequence is not error free w.r.t. writing standards, human perception is expected to “repair” these errors more easily compared to the case when no punctuation is given at all and the reader is left in confusion regarding the basic segmentation of the word chain.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
一种韵律启发的RNN方法用于机器生成的语音文本的标点符号,以提高人类的可读性
语音通信人机界面利用语音自动识别实现语音到文本的转换。遗憾的是,在过去,在语音识别后的识别词链中添加标点符号并没有付出太多的努力。这影响了人类的可读性,并使解释变得困难。本文提出了一种通过降低这一后处理步骤产生的延迟来恢复标点符号的方法。该方法利用韵律结构,提出了一种基于递归神经网络的序列建模范式。结果表明,该系统具有令人满意的标点恢复能力,特别是考虑到句子边界的可靠检测。即使预测的标点顺序不是没有错误的w.r.t.写作标准,与根本没有标点符号的情况相比,人类的感知也更容易“修复”这些错误,而读者则对单词链的基本分割感到困惑。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Classification of cognitive load using voice features: A preliminary investigation A case study on time-interval fuzzy cognitive maps in a complex organization Á bilingual comparison of MaxEnt-and RNN-based punctuation restoration in speech transcripts Presentation of a medieval church in MaxWhere Blocklino: A graphical language for Arduino
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1