面向新加坡闽南语的交互式语音代理

Vanessa Lim, Hui Shan Ang, Estelle Lee, Boon Pang Lim
{"title":"面向新加坡闽南语的交互式语音代理","authors":"Vanessa Lim, Hui Shan Ang, Estelle Lee, Boon Pang Lim","doi":"10.1145/2974804.2980495","DOIUrl":null,"url":null,"abstract":"Singapore Hokkien (SH) is the most commonly spoken non-Mandarin Chinese dialect in Singapore. It is an important language for many members of Singapore's pioneer generation, but much less so for the younger generation who prefer English. In recent years, the greying of this demographic has placed an increasing demand on for assistive devices to support them. We report ongoing efforts to build limited-vocabulary speech recognition, with the eventual goal of a conversational voice agent in SH that can support applications in home-automation or in-hospital use case scenarios. This process is challenging as sizeable SH speech corpora do not yet exist, and SH is sufficiently different from existing Mandarin or Minnan such that other corpora cannot be directly used. We document our efforts at building language resources -- audio corpora, pronunciation lexicons -- and present some preliminary findings on multilingual training.","PeriodicalId":185756,"journal":{"name":"Proceedings of the Fourth International Conference on Human Agent Interaction","volume":"141 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Towards an Interactive Voice Agent for Singapore Hokkien\",\"authors\":\"Vanessa Lim, Hui Shan Ang, Estelle Lee, Boon Pang Lim\",\"doi\":\"10.1145/2974804.2980495\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Singapore Hokkien (SH) is the most commonly spoken non-Mandarin Chinese dialect in Singapore. It is an important language for many members of Singapore's pioneer generation, but much less so for the younger generation who prefer English. In recent years, the greying of this demographic has placed an increasing demand on for assistive devices to support them. We report ongoing efforts to build limited-vocabulary speech recognition, with the eventual goal of a conversational voice agent in SH that can support applications in home-automation or in-hospital use case scenarios. This process is challenging as sizeable SH speech corpora do not yet exist, and SH is sufficiently different from existing Mandarin or Minnan such that other corpora cannot be directly used. We document our efforts at building language resources -- audio corpora, pronunciation lexicons -- and present some preliminary findings on multilingual training.\",\"PeriodicalId\":185756,\"journal\":{\"name\":\"Proceedings of the Fourth International Conference on Human Agent Interaction\",\"volume\":\"141 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-10-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the Fourth International Conference on Human Agent Interaction\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/2974804.2980495\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Fourth International Conference on Human Agent Interaction","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2974804.2980495","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

摘要

新加坡闽南语(SH)是新加坡最常用的非普通话方言。对于新加坡的许多先驱一代来说,英语是一门重要的语言,但对于更喜欢英语的年轻一代来说,它就不那么重要了。近年来,这一人口老龄化对辅助设备的需求日益增加。我们报告了正在努力构建有限词汇的语音识别,最终目标是在SH中建立一个会话语音代理,可以支持家庭自动化或医院用例场景中的应用。这个过程是具有挑战性的,因为目前还没有大规模的SH语料库,并且SH与现有的普通话或闽南语有很大的不同,因此其他语料库无法直接使用。我们记录了我们在建立语言资源方面的努力——音频语料库、发音词典——并提出了一些关于多语言训练的初步发现。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Towards an Interactive Voice Agent for Singapore Hokkien
Singapore Hokkien (SH) is the most commonly spoken non-Mandarin Chinese dialect in Singapore. It is an important language for many members of Singapore's pioneer generation, but much less so for the younger generation who prefer English. In recent years, the greying of this demographic has placed an increasing demand on for assistive devices to support them. We report ongoing efforts to build limited-vocabulary speech recognition, with the eventual goal of a conversational voice agent in SH that can support applications in home-automation or in-hospital use case scenarios. This process is challenging as sizeable SH speech corpora do not yet exist, and SH is sufficiently different from existing Mandarin or Minnan such that other corpora cannot be directly used. We document our efforts at building language resources -- audio corpora, pronunciation lexicons -- and present some preliminary findings on multilingual training.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Investigation on Effects of Color, Sound, and Vibration on Human's Emotional Perception Humotion: A Human Inspired Gaze Control Framework for Anthropomorphic Robot Heads User Generated Agent: Designable Book Recommendation Robot Programmed by Children LAP: A Human-in-the-loop Adaptation Approach for Industrial Robots Human Posture Detection using H-ELM Body Part and Whole Person Detectors for Human-Robot Interaction
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1