舌板:用于细微输入的口头界面

Proceedings of the 10th Augmented Human International Conference 2019 Pub Date : 2019-03-11 DOI:10.1145/3311823.3311831

Richard Li, Jason Wu, Thad Starner

{"title":"舌板:用于细微输入的口头界面","authors":"Richard Li, Jason Wu, Thad Starner","doi":"10.1145/3311823.3311831","DOIUrl":null,"url":null,"abstract":"We present TongueBoard, a retainer form-factor device for recognizing non-vocalized speech. TongueBoard enables absolute position tracking of the tongue by placing capacitive touch sensors on the roof of the mouth. We collect a dataset of 21 common words from four user study participants (two native American English speakers and two non-native speakers with severe hearing loss). We train a classifier that is able to recognize the words with 91.01% accuracy for the native speakers and 77.76% accuracy for the non-native speakers in a user dependent, offline setting. The native English speakers then participate in a user study involving operating a calculator application with 15 non-vocalized words and two tongue gestures at a desktop and with a mobile phone while walking. TongueBoard consistently maintains an information transfer rate of 3.78 bits per decision (number of choices = 17, accuracy = 97.1%) and 2.18 bits per second across stationary and mobile contexts, which is comparable to our control conditions of mouse (desktop) and touchpad (mobile) input.","PeriodicalId":433578,"journal":{"name":"Proceedings of the 10th Augmented Human International Conference 2019","volume":"73 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"53","resultStr":"{\"title\":\"TongueBoard: An Oral Interface for Subtle Input\",\"authors\":\"Richard Li, Jason Wu, Thad Starner\",\"doi\":\"10.1145/3311823.3311831\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We present TongueBoard, a retainer form-factor device for recognizing non-vocalized speech. TongueBoard enables absolute position tracking of the tongue by placing capacitive touch sensors on the roof of the mouth. We collect a dataset of 21 common words from four user study participants (two native American English speakers and two non-native speakers with severe hearing loss). We train a classifier that is able to recognize the words with 91.01% accuracy for the native speakers and 77.76% accuracy for the non-native speakers in a user dependent, offline setting. The native English speakers then participate in a user study involving operating a calculator application with 15 non-vocalized words and two tongue gestures at a desktop and with a mobile phone while walking. TongueBoard consistently maintains an information transfer rate of 3.78 bits per decision (number of choices = 17, accuracy = 97.1%) and 2.18 bits per second across stationary and mobile contexts, which is comparable to our control conditions of mouse (desktop) and touchpad (mobile) input.\",\"PeriodicalId\":433578,\"journal\":{\"name\":\"Proceedings of the 10th Augmented Human International Conference 2019\",\"volume\":\"73 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-03-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"53\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 10th Augmented Human International Conference 2019\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3311823.3311831\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 10th Augmented Human International Conference 2019","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3311823.3311831","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 53

摘要

我们介绍舌板，一个保留形状因素的设备，用于识别非发声语音。TongueBoard通过在口腔上颚放置电容式触摸传感器来实现舌头的绝对位置跟踪。我们从四个用户研究参与者(两个美国英语母语者和两个听力严重受损的非英语母语者)中收集了21个常用词的数据集。我们训练了一个分类器，在依赖用户的离线设置下，对母语使用者的单词识别准确率为91.01%，对非母语使用者的单词识别准确率为77.76%。然后，以英语为母语的人参加了一项用户研究，包括在桌面操作一个计算器应用程序，使用15个不发音的单词和两个舌头手势，并在走路时使用手机。TongueBoard始终保持每次决策3.78比特的信息传输速率(选择数= 17，准确性= 97.1%)，在固定和移动环境下每秒2.18比特，这与我们的鼠标(桌面)和触摸板(移动)输入的控制条件相当。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

TongueBoard: An Oral Interface for Subtle Input

We present TongueBoard, a retainer form-factor device for recognizing non-vocalized speech. TongueBoard enables absolute position tracking of the tongue by placing capacitive touch sensors on the roof of the mouth. We collect a dataset of 21 common words from four user study participants (two native American English speakers and two non-native speakers with severe hearing loss). We train a classifier that is able to recognize the words with 91.01% accuracy for the native speakers and 77.76% accuracy for the non-native speakers in a user dependent, offline setting. The native English speakers then participate in a user study involving operating a calculator application with 15 non-vocalized words and two tongue gestures at a desktop and with a mobile phone while walking. TongueBoard consistently maintains an information transfer rate of 3.78 bits per decision (number of choices = 17, accuracy = 97.1%) and 2.18 bits per second across stationary and mobile contexts, which is comparable to our control conditions of mouse (desktop) and touchpad (mobile) input.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the 10th Augmented Human International Conference 2019

自引率

0.00%

发文量

期刊最新文献

CapMat Augmented taste of wine by artificial climate room: Influence of temperature and humidity on taste evaluation Let Your World Open: CAVE-based Visualization Methods of Public Virtual Reality towards a Shareable VR Experience Naviarm SubMe