击败混响:先进的去混响和识别技术,免提语音识别

Marc Delcroix, Takuya Yoshioka, A. Ogawa, Yotaro Kubo, M. Fujimoto, N. Ito, K. Kinoshita, Miquel Espi, S. Araki, Takaaki Hori, T. Nakatani
{"title":"击败混响:先进的去混响和识别技术,免提语音识别","authors":"Marc Delcroix, Takuya Yoshioka, A. Ogawa, Yotaro Kubo, M. Fujimoto, N. Ito, K. Kinoshita, Miquel Espi, S. Araki, Takaaki Hori, T. Nakatani","doi":"10.1109/GlobalSIP.2014.7032172","DOIUrl":null,"url":null,"abstract":"Automatic speech recognition is being used successfully in more and more products. However, current recognition systems usually require the use of close-talking microphones. This constraint limits the deployment of speech recognition for new applications. In hands-free situations, noise and reverberation cause a severe degradation of the recognition performance. The problem of noise robustness has attracted a great deal of attention and practical solutions have been proposed and evaluated with common benchmarks. In contrast, reverberation has long been considered an unsolvable problem. Recently, significant progress has been made in the field of reverberant speech recognition and this progress has been evaluated with the REVERB challenge 2014. In this paper, we describe the reverberant speech recognition system we proposed for the REVERB challenge that exhibited high recognition performance even under severe reverberation conditions. We compare our system with other proposed approaches to suggest potential future research directions in the field.","PeriodicalId":362306,"journal":{"name":"2014 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","volume":"195 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Defeating reverberation: Advanced dereverberation and recognition techniques for hands-free speech recognition\",\"authors\":\"Marc Delcroix, Takuya Yoshioka, A. Ogawa, Yotaro Kubo, M. Fujimoto, N. Ito, K. Kinoshita, Miquel Espi, S. Araki, Takaaki Hori, T. Nakatani\",\"doi\":\"10.1109/GlobalSIP.2014.7032172\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Automatic speech recognition is being used successfully in more and more products. However, current recognition systems usually require the use of close-talking microphones. This constraint limits the deployment of speech recognition for new applications. In hands-free situations, noise and reverberation cause a severe degradation of the recognition performance. The problem of noise robustness has attracted a great deal of attention and practical solutions have been proposed and evaluated with common benchmarks. In contrast, reverberation has long been considered an unsolvable problem. Recently, significant progress has been made in the field of reverberant speech recognition and this progress has been evaluated with the REVERB challenge 2014. In this paper, we describe the reverberant speech recognition system we proposed for the REVERB challenge that exhibited high recognition performance even under severe reverberation conditions. We compare our system with other proposed approaches to suggest potential future research directions in the field.\",\"PeriodicalId\":362306,\"journal\":{\"name\":\"2014 IEEE Global Conference on Signal and Information Processing (GlobalSIP)\",\"volume\":\"195 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 IEEE Global Conference on Signal and Information Processing (GlobalSIP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/GlobalSIP.2014.7032172\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/GlobalSIP.2014.7032172","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

摘要

自动语音识别在越来越多的产品中得到了成功的应用。然而,目前的识别系统通常需要使用近距离通话麦克风。这种约束限制了语音识别在新应用程序中的部署。在免提情况下,噪声和混响会严重降低识别性能。噪声鲁棒性问题引起了广泛的关注,并提出了实用的解决方案,并使用通用基准进行了评估。相比之下,混响一直被认为是一个无法解决的问题。最近,混响语音识别领域取得了重大进展,这一进展已经通过2014年的REVERB挑战进行了评估。在本文中,我们描述了我们针对REVERB挑战提出的混响语音识别系统,即使在严重混响条件下也能表现出很高的识别性能。我们将我们的系统与其他提出的方法进行比较,以提出该领域潜在的未来研究方向。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Defeating reverberation: Advanced dereverberation and recognition techniques for hands-free speech recognition
Automatic speech recognition is being used successfully in more and more products. However, current recognition systems usually require the use of close-talking microphones. This constraint limits the deployment of speech recognition for new applications. In hands-free situations, noise and reverberation cause a severe degradation of the recognition performance. The problem of noise robustness has attracted a great deal of attention and practical solutions have been proposed and evaluated with common benchmarks. In contrast, reverberation has long been considered an unsolvable problem. Recently, significant progress has been made in the field of reverberant speech recognition and this progress has been evaluated with the REVERB challenge 2014. In this paper, we describe the reverberant speech recognition system we proposed for the REVERB challenge that exhibited high recognition performance even under severe reverberation conditions. We compare our system with other proposed approaches to suggest potential future research directions in the field.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Competitive design of power allocation strategies for energy harvesting wireless communication systems Correction of over-exposure using color channel correlations Communications meets copula modeling: Non-standard dependence features in wireless fading channels Energy efficient and low complex wireless communication Feasibility of positive secrecy rate in wiretap interference channels
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1