End-To-End Named Entity And Semantic Concept Extraction From Speech

Sahar Ghannay, Antoine Caubrière, Y. Estève, Nathalie Camelin, E. Simonnet, Antoine Laurent, E. Morin
{"title":"End-To-End Named Entity And Semantic Concept Extraction From Speech","authors":"Sahar Ghannay, Antoine Caubrière, Y. Estève, Nathalie Camelin, E. Simonnet, Antoine Laurent, E. Morin","doi":"10.1109/SLT.2018.8639513","DOIUrl":null,"url":null,"abstract":"Named entity recognition (NER) is among SLU tasks that usually extract semantic information from textual documents. Until now, NER from speech is made through a pipeline process that consists in processing first an automatic speech recognition (ASR) on the audio and then processing a NER on the ASR outputs. Such approach has some disadvantages (error propagation, metric to tune ASR systems sub-optimal in regards to the final task, reduced space search at the ASR output level,...) and it is known that more integrated approaches outperform sequential ones, when they can be applied. In this paper, we explore an end-to-end approach that directly extracts named entities from speech, though a unique neural architecture. On a such way, a joint optimization is possible for both ASR and NER. Experiments are carried on French data easily accessible, composed of data distributed in several evaluation campaigns. The results are promising since this end-to-end approach provides similar results (F-measure= 0.66 on test data) than a classical pipeline approach to detect named entity categories (F-measure=0.64). Last, we also explore this approach applied to semantic concept extraction, through a slot filling task known as a spoken language understanding problem, and also observe an improvement in comparison to a pipeline approach.","PeriodicalId":377307,"journal":{"name":"2018 IEEE Spoken Language Technology Workshop (SLT)","volume":"138 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"75","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE Spoken Language Technology Workshop (SLT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SLT.2018.8639513","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 75

Abstract

Named entity recognition (NER) is among SLU tasks that usually extract semantic information from textual documents. Until now, NER from speech is made through a pipeline process that consists in processing first an automatic speech recognition (ASR) on the audio and then processing a NER on the ASR outputs. Such approach has some disadvantages (error propagation, metric to tune ASR systems sub-optimal in regards to the final task, reduced space search at the ASR output level,...) and it is known that more integrated approaches outperform sequential ones, when they can be applied. In this paper, we explore an end-to-end approach that directly extracts named entities from speech, though a unique neural architecture. On a such way, a joint optimization is possible for both ASR and NER. Experiments are carried on French data easily accessible, composed of data distributed in several evaluation campaigns. The results are promising since this end-to-end approach provides similar results (F-measure= 0.66 on test data) than a classical pipeline approach to detect named entity categories (F-measure=0.64). Last, we also explore this approach applied to semantic concept extraction, through a slot filling task known as a spoken language understanding problem, and also observe an improvement in comparison to a pipeline approach.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
端到端命名实体与语音语义概念提取
命名实体识别(NER)是通常从文本文档中提取语义信息的SLU任务之一。到目前为止,语音识别是通过一个流水线过程进行的,该过程包括首先在音频上处理自动语音识别(ASR),然后在ASR输出上处理NER。这种方法有一些缺点(误差传播,调整ASR系统的度量在最终任务方面不是最优的,在ASR输出级别减少空间搜索,……),并且众所周知,当可以应用时,更集成的方法优于顺序方法。在本文中,我们探索了一种端到端方法,通过独特的神经结构直接从语音中提取命名实体。通过这种方式,ASR和NER的联合优化是可能的。实验是在法国容易获得的数据上进行的,这些数据由在几个评价活动中分发的数据组成。结果是有希望的,因为这种端到端方法提供了与传统管道方法相似的结果(测试数据上的F-measure= 0.66)来检测命名实体类别(F-measure=0.64)。最后,我们还探索了这种方法在语义概念提取中的应用,通过一个被称为口语理解问题的槽填充任务,并观察到与管道方法相比的改进。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Sequence Teacher-Student Training of Acoustic Models for Automatic Free Speaking Language Assessment Leveraging Sequence-to-Sequence Speech Synthesis for Enhancing Acoustic-to-Word Speech Recognition Dynamic Extension of ASR Lexicon Using Wikipedia Data Detection and Calibration of Whisper for Speaker Recognition Out-of-Domain Slot Value Detection for Spoken Dialogue Systems with Context Information
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1