一种用于定制切换语音通信辅助系统的自动输入协议推荐方法

Fuming Fang, T. Shinozaki, Takao Kobayashi
{"title":"一种用于定制切换语音通信辅助系统的自动输入协议推荐方法","authors":"Fuming Fang, T. Shinozaki, Takao Kobayashi","doi":"10.1109/APSIPA.2014.7041638","DOIUrl":null,"url":null,"abstract":"A switch-to-speech interface can provide a means of interactive communication as a support system for people with disabilities with voluntary movements. Any motion of a part of the body, such as eye movements, can be used for the switch input. The number of possible switch operations varies from person to person, but the bandwidth is generally quite limited. Therefore, efficient input protocols are needed to map the switch operations to pronunciations. Meanwhile, the protocol must be easily learnable so that anyone can use it. To this end, we propose a protocol recommendation method that can accept individual requirements in switch operations. This method suggests a customized protocol for each user of the interface that is both speedy to enter and easy to remember. The two main ideas in the protocol design are utilizing the knowledge about the alphabet table that everyone already knows and improving the input speed and learnability by allowing ambiguity in the switch to pronunciation conversion. The conversion errors due to the ambiguity are offset by an N-gram language model. The performance of the protocols was evaluated through simulations and the measured values obtained from research participants, and the advantage of the proposed method is shown.","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"An automatic input protocol recommendation method for tailored switch-to-speech communication aid systems\",\"authors\":\"Fuming Fang, T. Shinozaki, Takao Kobayashi\",\"doi\":\"10.1109/APSIPA.2014.7041638\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A switch-to-speech interface can provide a means of interactive communication as a support system for people with disabilities with voluntary movements. Any motion of a part of the body, such as eye movements, can be used for the switch input. The number of possible switch operations varies from person to person, but the bandwidth is generally quite limited. Therefore, efficient input protocols are needed to map the switch operations to pronunciations. Meanwhile, the protocol must be easily learnable so that anyone can use it. To this end, we propose a protocol recommendation method that can accept individual requirements in switch operations. This method suggests a customized protocol for each user of the interface that is both speedy to enter and easy to remember. The two main ideas in the protocol design are utilizing the knowledge about the alphabet table that everyone already knows and improving the input speed and learnability by allowing ambiguity in the switch to pronunciation conversion. The conversion errors due to the ambiguity are offset by an N-gram language model. The performance of the protocols was evaluated through simulations and the measured values obtained from research participants, and the advantage of the proposed method is shown.\",\"PeriodicalId\":231382,\"journal\":{\"name\":\"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/APSIPA.2014.7041638\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/APSIPA.2014.7041638","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

语音转换界面可作为辅助系统,为残障人士提供互动沟通的手段。任何身体部位的运动,比如眼睛的运动,都可以作为开关的输入。可能的交换操作的数量因人而异,但带宽通常是相当有限的。因此,需要有效的输入协议来将切换操作映射到发音。同时,该协议必须易于学习,以便任何人都可以使用它。为此,我们提出了一种能够接受交换机操作中个性化需求的协议推荐方法。该方法为界面的每个用户提供了一个定制的协议,该协议既快速输入又易于记忆。协议设计的两个主要思想是利用每个人都知道的字母表知识,以及通过允许发音转换中的歧义来提高输入速度和可学习性。由歧义引起的转换误差由N-gram语言模型抵消。通过仿真和研究参与者的实测值对方案的性能进行了评价,表明了所提方法的优越性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
An automatic input protocol recommendation method for tailored switch-to-speech communication aid systems
A switch-to-speech interface can provide a means of interactive communication as a support system for people with disabilities with voluntary movements. Any motion of a part of the body, such as eye movements, can be used for the switch input. The number of possible switch operations varies from person to person, but the bandwidth is generally quite limited. Therefore, efficient input protocols are needed to map the switch operations to pronunciations. Meanwhile, the protocol must be easily learnable so that anyone can use it. To this end, we propose a protocol recommendation method that can accept individual requirements in switch operations. This method suggests a customized protocol for each user of the interface that is both speedy to enter and easy to remember. The two main ideas in the protocol design are utilizing the knowledge about the alphabet table that everyone already knows and improving the input speed and learnability by allowing ambiguity in the switch to pronunciation conversion. The conversion errors due to the ambiguity are offset by an N-gram language model. The performance of the protocols was evaluated through simulations and the measured values obtained from research participants, and the advantage of the proposed method is shown.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Smoothing of spatial filter by graph Fourier transform for EEG signals Intra line copy for HEVC screen content coding Design of FPGA-based rapid prototype spectral subtraction for hands-free speech applications Fetal ECG extraction using adaptive functional link artificial neural network Opened Pins Recommendation System to promote tourism sector in Chiang Rai Thailand
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1