Weighted finite state transducer based statistical dialog management

2009 IEEE Workshop on Automatic Speech Recognition & Understanding Pub Date : 1900-01-01 DOI:10.1109/ASRU.2009.5373350

Chiori Hori, Kiyonori Ohtake, Teruhisa Misu, H. Kashioka, Satoshi Nakamura

{"title":"Weighted finite state transducer based statistical dialog management","authors":"Chiori Hori, Kiyonori Ohtake, Teruhisa Misu, H. Kashioka, Satoshi Nakamura","doi":"10.1109/ASRU.2009.5373350","DOIUrl":null,"url":null,"abstract":"We proposed a dialog system using a weighted finite-state transducer (WFST) in which user concept and system action tags are input and output of the transducer, respectively. The WFST-based platform for dialog management enables us to combine various statistical models for dialog management (DM), user input understanding and system action generation, and then search the best system action in response to user inputs among multiple hypotheses. To test the potential of the WFST-based DM platform using statistical models, we constructed a dialog system using a human-to-human spoken dialog corpus for hotel reservation, which is annotated with Interchange Format (IF). A scenario WFST and a spoken language understanding (SLU) WFST were obtained from the corpus and then composed together and optimized. We evaluated the detection accuracy of the system next action tags using Mean Reciprocal Ranking (MRR). Finally, we constructed a full WFST-based dialog system by composing SLU, scenario and sentence generation (SG) WFSTs. Humans read the system responses in natural language and judged the quality of the responses. We confirmed that the WFST-based DM platform was capable of handling various spoken language and scenarios when the user concept and system action tags are consistent and distinguishable.","PeriodicalId":292194,"journal":{"name":"2009 IEEE Workshop on Automatic Speech Recognition & Understanding","volume":"90 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"14","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 IEEE Workshop on Automatic Speech Recognition & Understanding","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ASRU.2009.5373350","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 14

Abstract

We proposed a dialog system using a weighted finite-state transducer (WFST) in which user concept and system action tags are input and output of the transducer, respectively. The WFST-based platform for dialog management enables us to combine various statistical models for dialog management (DM), user input understanding and system action generation, and then search the best system action in response to user inputs among multiple hypotheses. To test the potential of the WFST-based DM platform using statistical models, we constructed a dialog system using a human-to-human spoken dialog corpus for hotel reservation, which is annotated with Interchange Format (IF). A scenario WFST and a spoken language understanding (SLU) WFST were obtained from the corpus and then composed together and optimized. We evaluated the detection accuracy of the system next action tags using Mean Reciprocal Ranking (MRR). Finally, we constructed a full WFST-based dialog system by composing SLU, scenario and sentence generation (SG) WFSTs. Humans read the system responses in natural language and judged the quality of the responses. We confirmed that the WFST-based DM platform was capable of handling various spoken language and scenarios when the user concept and system action tags are consistent and distinguishable.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于加权有限状态传感器的统计对话框管理

我们提出了一个使用加权有限状态传感器(WFST)的对话系统，其中用户概念和系统动作标签分别是传感器的输入和输出。基于wfst的对话管理平台使我们能够结合各种统计模型进行对话管理(DM)、用户输入理解和系统动作生成，然后在多个假设中搜索响应用户输入的最佳系统动作。为了使用统计模型测试基于wfst的DM平台的潜力，我们使用一个用于酒店预订的人对人口语对话语料库构建了一个对话系统，该对话语料库使用交换格式(IF)进行注释。从语料库中得到情景化WFST和口语理解WFST，并对其进行组合和优化。我们使用平均倒数排名(MRR)来评估系统下一个动作标签的检测精度。最后，我们通过组合SLU、场景生成和句子生成(SG) wfst，构建了一个完整的基于wfst的对话系统。人类用自然语言阅读系统的反应，并判断反应的质量。我们确认基于wfst的DM平台能够在用户概念和系统操作标签一致且可区分的情况下处理各种口语和场景。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2009 IEEE Workshop on Automatic Speech Recognition & Understanding

自引率

0.00%

发文量