一种用于定制切换语音通信辅助系统的自动输入协议推荐方法

Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific Pub Date : 2014-12-01 DOI:10.1109/APSIPA.2014.7041638

Fuming Fang, T. Shinozaki, Takao Kobayashi

{"title":"一种用于定制切换语音通信辅助系统的自动输入协议推荐方法","authors":"Fuming Fang, T. Shinozaki, Takao Kobayashi","doi":"10.1109/APSIPA.2014.7041638","DOIUrl":null,"url":null,"abstract":"A switch-to-speech interface can provide a means of interactive communication as a support system for people with disabilities with voluntary movements. Any motion of a part of the body, such as eye movements, can be used for the switch input. The number of possible switch operations varies from person to person, but the bandwidth is generally quite limited. Therefore, efficient input protocols are needed to map the switch operations to pronunciations. Meanwhile, the protocol must be easily learnable so that anyone can use it. To this end, we propose a protocol recommendation method that can accept individual requirements in switch operations. This method suggests a customized protocol for each user of the interface that is both speedy to enter and easy to remember. The two main ideas in the protocol design are utilizing the knowledge about the alphabet table that everyone already knows and improving the input speed and learnability by allowing ambiguity in the switch to pronunciation conversion. The conversion errors due to the ambiguity are offset by an N-gram language model. The performance of the protocols was evaluated through simulations and the measured values obtained from research participants, and the advantage of the proposed method is shown.","PeriodicalId":231382,"journal":{"name":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","volume":"78 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"An automatic input protocol recommendation method for tailored switch-to-speech communication aid systems\",\"authors\":\"Fuming Fang, T. Shinozaki, Takao Kobayashi\",\"doi\":\"10.1109/APSIPA.2014.7041638\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A switch-to-speech interface can provide a means of interactive communication as a support system for people with disabilities with voluntary movements. Any motion of a part of the body, such as eye movements, can be used for the switch input. The number of possible switch operations varies from person to person, but the bandwidth is generally quite limited. Therefore, efficient input protocols are needed to map the switch operations to pronunciations. Meanwhile, the protocol must be easily learnable so that anyone can use it. To this end, we propose a protocol recommendation method that can accept individual requirements in switch operations. This method suggests a customized protocol for each user of the interface that is both speedy to enter and easy to remember. The two main ideas in the protocol design are utilizing the knowledge about the alphabet table that everyone already knows and improving the input speed and learnability by allowing ambiguity in the switch to pronunciation conversion. The conversion errors due to the ambiguity are offset by an N-gram language model. The performance of the protocols was evaluated through simulations and the measured values obtained from research participants, and the advantage of the proposed method is shown.\",\"PeriodicalId\":231382,\"journal\":{\"name\":\"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific\",\"volume\":\"78 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/APSIPA.2014.7041638\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/APSIPA.2014.7041638","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

语音转换界面可作为辅助系统，为残障人士提供互动沟通的手段。任何身体部位的运动，比如眼睛的运动，都可以作为开关的输入。可能的交换操作的数量因人而异，但带宽通常是相当有限的。因此，需要有效的输入协议来将切换操作映射到发音。同时，该协议必须易于学习，以便任何人都可以使用它。为此，我们提出了一种能够接受交换机操作中个性化需求的协议推荐方法。该方法为界面的每个用户提供了一个定制的协议，该协议既快速输入又易于记忆。协议设计的两个主要思想是利用每个人都知道的字母表知识，以及通过允许发音转换中的歧义来提高输入速度和可学习性。由歧义引起的转换误差由N-gram语言模型抵消。通过仿真和研究参与者的实测值对方案的性能进行了评价，表明了所提方法的优越性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

An automatic input protocol recommendation method for tailored switch-to-speech communication aid systems

A switch-to-speech interface can provide a means of interactive communication as a support system for people with disabilities with voluntary movements. Any motion of a part of the body, such as eye movements, can be used for the switch input. The number of possible switch operations varies from person to person, but the bandwidth is generally quite limited. Therefore, efficient input protocols are needed to map the switch operations to pronunciations. Meanwhile, the protocol must be easily learnable so that anyone can use it. To this end, we propose a protocol recommendation method that can accept individual requirements in switch operations. This method suggests a customized protocol for each user of the interface that is both speedy to enter and easy to remember. The two main ideas in the protocol design are utilizing the knowledge about the alphabet table that everyone already knows and improving the input speed and learnability by allowing ambiguity in the switch to pronunciation conversion. The conversion errors due to the ambiguity are offset by an N-gram language model. The performance of the protocols was evaluated through simulations and the measured values obtained from research participants, and the advantage of the proposed method is shown.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific

自引率

0.00%

发文量

期刊最新文献

Smoothing of spatial filter by graph Fourier transform for EEG signals Intra line copy for HEVC screen content coding Design of FPGA-based rapid prototype spectral subtraction for hands-free speech applications Fetal ECG extraction using adaptive functional link artificial neural network Opened Pins Recommendation System to promote tourism sector in Chiang Rai Thailand