{"title":"NLify: lightweight spoken natural language interfaces via exhaustive paraphrasing","authors":"Seungyeop Han, Matthai Philipose, Y. Ju","doi":"10.1145/2493432.2493458","DOIUrl":null,"url":null,"abstract":"This paper presents the design and implementation of a programming system that enables third-party developers to add spoken natural language (SNL) interfaces to standalone mobile applications. The central challenge is to create statistical recognition models that are accurate and resource-efficient in the face of the variety of natural language, while requiring little specialized knowledge from developers. We show that given a few examples from the developer, it is possible to elicit comprehensive sets of paraphrases of the examples using internet crowds. The exhaustive nature of these paraphrases allows us to use relatively simple, automatically derived statistical models for speech and language understanding that perform well without per-application tuning. We have realized our design fully as an extension to the Visual Studio IDE. Based on a new benchmark dataset with 3500 spoken instances of 27 commands from 20 subjects and a small developer study, we establish the promise of our approach and the impact of various design choices.","PeriodicalId":262104,"journal":{"name":"Proceedings of the 2013 ACM international joint conference on Pervasive and ubiquitous computing","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2013 ACM international joint conference on Pervasive and ubiquitous computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2493432.2493458","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10
Abstract
This paper presents the design and implementation of a programming system that enables third-party developers to add spoken natural language (SNL) interfaces to standalone mobile applications. The central challenge is to create statistical recognition models that are accurate and resource-efficient in the face of the variety of natural language, while requiring little specialized knowledge from developers. We show that given a few examples from the developer, it is possible to elicit comprehensive sets of paraphrases of the examples using internet crowds. The exhaustive nature of these paraphrases allows us to use relatively simple, automatically derived statistical models for speech and language understanding that perform well without per-application tuning. We have realized our design fully as an extension to the Visual Studio IDE. Based on a new benchmark dataset with 3500 spoken instances of 27 commands from 20 subjects and a small developer study, we establish the promise of our approach and the impact of various design choices.
本文介绍了一个编程系统的设计和实现,该系统使第三方开发人员能够向独立的移动应用程序添加语音自然语言(SNL)接口。核心挑战是创建统计识别模型,该模型在面对各种自然语言时准确且资源高效,同时对开发人员的专业知识要求很少。我们表明,给出一些来自开发人员的例子,有可能利用互联网人群引出对这些例子的综合解释。这些解释的详尽性使我们能够使用相对简单的、自动派生的语音和语言理解统计模型,这些模型无需每个应用程序调优就能很好地执行。我们已经将我们的设计完全实现为Visual Studio IDE的扩展。基于一个新的基准数据集,其中包含来自20个主题的27个命令的3500个口头实例和一个小型开发人员研究,我们建立了我们的方法的承诺和各种设计选择的影响。