Lipwatch: Enabling Silent Speech Recognition on Smartwatches using Acoustic Sensing

IF 4.7 3区材料科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC ACS Applied Electronic Materials Pub Date : 2024-05-13 DOI:10.1145/3659614

Qian Zhang, Yubin Lan, Kaiyi Guo, Dong Wang

{"title":"Lipwatch: Enabling Silent Speech Recognition on Smartwatches using Acoustic Sensing","authors":"Qian Zhang, Yubin Lan, Kaiyi Guo, Dong Wang","doi":"10.1145/3659614","DOIUrl":null,"url":null,"abstract":"Silent Speech Interfaces (SSI) on mobile devices offer a privacy-friendly alternative to conventional voice input methods. Previous research has primarily focused on smartphones. In this paper, we introduce Lipwatch, a novel system that utilizes acoustic sensing techniques to enable SSI on smartwatches. Lipwatch leverages the inaudible waves emitted by the watch's speaker to capture lip movements and then analyzes the echo to enable SSI. In contrast to acoustic sensing-based SSI on smartphones, our development of Lipwatch takes into full consideration the specific scenarios and requirements associated with smartwatches. Firstly, we elaborate a wake-up-free mechanism, allowing users to interact without the need for a wake-up phrase or button presses. The mechanism utilizes the inertial sensors on the smartwatch to detect gestures, in combination with acoustic signals that detecting lip movements to determine whether SSI should be activated. Secondly, we design a flexible silent speech recognition mechanism that explores limited vocabulary recognition to comprehend a broader range of user commands, even those not present in the training dataset, relieving users from strict adherence to predefined commands. We evaluate Lipwatch on 15 individuals using a set of the 80 most common interaction commands on smartwatches. The system achieves a Word Error Rate (WER) of 13.7% in user-independent test. Even when users utter commands containing words absent in the training set, Lipwatch still demonstrates a remarkable 88.7% top-3 accuracy. We implement a real-time version of Lipwatch on a commercial smartwatch. The user study shows that Lipwatch can be a practical and promising option to enable SSI on smartwatches.","PeriodicalId":3,"journal":{"name":"ACS Applied Electronic Materials","volume":"80 8","pages":""},"PeriodicalIF":4.7000,"publicationDate":"2024-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Electronic Materials","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3659614","RegionNum":3,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 1

Abstract

Silent Speech Interfaces (SSI) on mobile devices offer a privacy-friendly alternative to conventional voice input methods. Previous research has primarily focused on smartphones. In this paper, we introduce Lipwatch, a novel system that utilizes acoustic sensing techniques to enable SSI on smartwatches. Lipwatch leverages the inaudible waves emitted by the watch's speaker to capture lip movements and then analyzes the echo to enable SSI. In contrast to acoustic sensing-based SSI on smartphones, our development of Lipwatch takes into full consideration the specific scenarios and requirements associated with smartwatches. Firstly, we elaborate a wake-up-free mechanism, allowing users to interact without the need for a wake-up phrase or button presses. The mechanism utilizes the inertial sensors on the smartwatch to detect gestures, in combination with acoustic signals that detecting lip movements to determine whether SSI should be activated. Secondly, we design a flexible silent speech recognition mechanism that explores limited vocabulary recognition to comprehend a broader range of user commands, even those not present in the training dataset, relieving users from strict adherence to predefined commands. We evaluate Lipwatch on 15 individuals using a set of the 80 most common interaction commands on smartwatches. The system achieves a Word Error Rate (WER) of 13.7% in user-independent test. Even when users utter commands containing words absent in the training set, Lipwatch still demonstrates a remarkable 88.7% top-3 accuracy. We implement a real-time version of Lipwatch on a commercial smartwatch. The user study shows that Lipwatch can be a practical and promising option to enable SSI on smartwatches.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Lipwatch：利用声学传感在智能手表上实现无声语音识别

移动设备上的无声语音接口（SSI）为传统语音输入法提供了一种隐私友好型替代方案。以往的研究主要集中在智能手机上。在本文中，我们介绍了 Lipwatch，这是一种利用声学传感技术在智能手表上实现 SSI 的新型系统。Lipwatch 利用手表扬声器发出的不可听波来捕捉嘴唇动作，然后分析回声，从而实现 SSI。与智能手机上基于声学传感的唇部识别相比，我们在开发 Lipwatch 时充分考虑了与智能手表相关的特定场景和要求。首先，我们精心设计了免唤醒机制，让用户无需唤醒词或按键即可进行交互。该机制利用智能手表上的惯性传感器检测手势，并结合检测嘴唇动作的声学信号来确定是否应激活 SSI。其次，我们设计了一种灵活的无声语音识别机制，利用有限的词汇识别能力来理解更广泛的用户指令，甚至是训练数据集中没有的指令，从而使用户不必严格遵守预定义的指令。我们使用智能手表上最常见的 80 个交互命令集对 15 名用户进行了 Lipwatch 评估。在独立于用户的测试中，该系统的词错误率（WER）为 13.7%。即使用户发出的命令中包含了训练集中没有的单词，Lipwatch 仍能以 88.7% 的准确率名列前三。我们在商用智能手表上实现了实时版 Lipwatch。用户研究表明，Lipwatch 是在智能手表上实现 SSI 的一个实用而有前途的选择。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

ACS Applied Electronic Materials Multiple-

CiteScore

7.20

自引率

4.30%

发文量

567

期刊介绍： ACS Applied Electronic Materials is an interdisciplinary journal publishing original research covering all aspects of electronic materials. The journal is devoted to reports of new and original experimental and theoretical research of an applied nature that integrate knowledge in the areas of materials science, engineering, optics, physics, and chemistry into important applications of electronic materials. Sample research topics that span the journal's scope are inorganic, organic, ionic and polymeric materials with properties that include conducting, semiconducting, superconducting, insulating, dielectric, magnetic, optoelectronic, piezoelectric, ferroelectric and thermoelectric. Indexed/Abstracted： Web of Science SCIE Scopus CAS INSPEC Portico

期刊最新文献

Issue Editorial Masthead Issue Publication Information Marking the 100th Issue of ACS Applied Electronic Materials Pushing down the Limit of Ammonia Detection of ZnO-Based Chemiresistive Sensors with Exposed Hexagonal Facets at Room Temperature Direct-Printed Mn–Ni–Cu–O/Poly(vinyl butyral) Composites for Sintering-Free, Flexible Thermistors with High Sensitivity