英语、德语和希伯来语语音识别系统的跨语言研究

Vered Silber Varod, Ingo Siegert, O. Jokisch, Yamini Sinha, N. Geri
{"title":"英语、德语和希伯来语语音识别系统的跨语言研究","authors":"Vered Silber Varod, Ingo Siegert, O. Jokisch, Yamini Sinha, N. Geri","doi":"10.36965/ojakm.2021.9(1)1-15","DOIUrl":null,"url":null,"abstract":"Despite the growing importance of Automatic Speech Recognition (ASR), its application is still challenging, limited, language-dependent, and requires considerable resources. The resources required for ASR are not only technical, they also need to reflect technological trends and cultural diversity. The purpose of this research is to explore ASR performance gaps by a comparative study of American English, German, and Hebrew. Apart from different languages, we also investigate different speaking styles – utterances from spontaneous dialogues and utterances from frontal lectures (TED-like genre). The analysis includes a comparison of the performance of four ASR engines (Google Cloud, Google Search, IBM Watson, and WIT.ai) using four commonly used metrics: Word Error Rate (WER); Character Error Rate (CER); Word Information Lost (WIL); and Match Error Rate (MER). As expected, findings suggest that English ASR systems provide the best results. Contrary to our hypothesis regarding ASR’s low performance for under-resourced languages, we found that the Hebrew and German ASR systems have similar performance. Overall, our findings suggest that ASR performance is language-dependent and system-dependent. Furthermore, ASR may be genre-sensitive, as our results showed for German. This research contributes a valuable insight for improving ubiquitous global consumption and management of knowledge and calls for corporate social responsibility of commercial companies, to develop ASR under Fair, Reasonable, and Non-Discriminatory (FRAND) terms","PeriodicalId":325473,"journal":{"name":"Online Journal of Applied Knowledge Management","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"A cross-language study of speech recognition systems for English, German, and Hebrew\",\"authors\":\"Vered Silber Varod, Ingo Siegert, O. Jokisch, Yamini Sinha, N. Geri\",\"doi\":\"10.36965/ojakm.2021.9(1)1-15\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Despite the growing importance of Automatic Speech Recognition (ASR), its application is still challenging, limited, language-dependent, and requires considerable resources. The resources required for ASR are not only technical, they also need to reflect technological trends and cultural diversity. The purpose of this research is to explore ASR performance gaps by a comparative study of American English, German, and Hebrew. Apart from different languages, we also investigate different speaking styles – utterances from spontaneous dialogues and utterances from frontal lectures (TED-like genre). The analysis includes a comparison of the performance of four ASR engines (Google Cloud, Google Search, IBM Watson, and WIT.ai) using four commonly used metrics: Word Error Rate (WER); Character Error Rate (CER); Word Information Lost (WIL); and Match Error Rate (MER). As expected, findings suggest that English ASR systems provide the best results. Contrary to our hypothesis regarding ASR’s low performance for under-resourced languages, we found that the Hebrew and German ASR systems have similar performance. Overall, our findings suggest that ASR performance is language-dependent and system-dependent. Furthermore, ASR may be genre-sensitive, as our results showed for German. This research contributes a valuable insight for improving ubiquitous global consumption and management of knowledge and calls for corporate social responsibility of commercial companies, to develop ASR under Fair, Reasonable, and Non-Discriminatory (FRAND) terms\",\"PeriodicalId\":325473,\"journal\":{\"name\":\"Online Journal of Applied Knowledge Management\",\"volume\":\"19 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-07-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Online Journal of Applied Knowledge Management\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.36965/ojakm.2021.9(1)1-15\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Online Journal of Applied Knowledge Management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.36965/ojakm.2021.9(1)1-15","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

尽管自动语音识别(ASR)越来越重要,但其应用仍然具有挑战性,局限性,语言依赖性,并且需要大量资源。ASR所需的资源不仅是技术资源,还需要反映技术趋势和文化多样性。本研究的目的是通过对美国英语、德语和希伯来语的比较研究来探讨ASR的表现差距。除了不同的语言,我们还研究了不同的说话风格——来自自发对话的话语和来自正面演讲的话语(类似ted的类型)。该分析包括使用四个常用指标对四个自动语音识别引擎(Google Cloud, Google Search, IBM Watson和WIT.ai)的性能进行比较:单词错误率(WER);字符错误率;单词信息丢失;和匹配错误率(MER)。正如预期的那样,研究结果表明英语ASR系统提供了最好的结果。与我们关于资源不足语言的ASR低性能的假设相反,我们发现希伯来语和德语ASR系统具有相似的性能。总的来说,我们的研究结果表明,ASR的表现依赖于语言和系统。此外,ASR可能是体裁敏感的,正如我们对德语的研究结果所显示的那样。本研究为改善全球无处不在的知识消费和管理提供了有价值的见解,并呼吁商业公司履行企业社会责任,在公平、合理和非歧视(FRAND)的条件下发展ASR
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
A cross-language study of speech recognition systems for English, German, and Hebrew
Despite the growing importance of Automatic Speech Recognition (ASR), its application is still challenging, limited, language-dependent, and requires considerable resources. The resources required for ASR are not only technical, they also need to reflect technological trends and cultural diversity. The purpose of this research is to explore ASR performance gaps by a comparative study of American English, German, and Hebrew. Apart from different languages, we also investigate different speaking styles – utterances from spontaneous dialogues and utterances from frontal lectures (TED-like genre). The analysis includes a comparison of the performance of four ASR engines (Google Cloud, Google Search, IBM Watson, and WIT.ai) using four commonly used metrics: Word Error Rate (WER); Character Error Rate (CER); Word Information Lost (WIL); and Match Error Rate (MER). As expected, findings suggest that English ASR systems provide the best results. Contrary to our hypothesis regarding ASR’s low performance for under-resourced languages, we found that the Hebrew and German ASR systems have similar performance. Overall, our findings suggest that ASR performance is language-dependent and system-dependent. Furthermore, ASR may be genre-sensitive, as our results showed for German. This research contributes a valuable insight for improving ubiquitous global consumption and management of knowledge and calls for corporate social responsibility of commercial companies, to develop ASR under Fair, Reasonable, and Non-Discriminatory (FRAND) terms
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Understanding knowledge hiding behaviors in the workplace using a serious game data collection approach Special issue editorial: Knowledge hiding and knowledge hoarding in different environments Knowledge hiding and knowledge hoarding: Using grounded theory for conceptual development The impact of knowledge hiding and toxic leadership on knowledge worker productivity – Evidence from IT sector of Pakistan Pilot testing of experimental procedures to measure user's judgment errors in simulated social engineering attacks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1