COMEX: A Multi-task Benchmark for Knowledge-grounded COnversational Media EXploration

Proceedings of the 4th Conference on Conversational User Interfaces Pub Date : 2022-07-26 DOI:10.1145/3543829.3543830

Zay Yar Tun, Alessandro Speggiorin, Jeffrey Dalton, Megan Stamper

{"title":"COMEX: A Multi-task Benchmark for Knowledge-grounded COnversational Media EXploration","authors":"Zay Yar Tun, Alessandro Speggiorin, Jeffrey Dalton, Megan Stamper","doi":"10.1145/3543829.3543830","DOIUrl":null,"url":null,"abstract":"Open-domain conversational interaction with news, podcasts, and other types of heterogeneous content remains an open challenge. Interactive agents must support information access in a way that is fair, impartial, and true to the content and knowledge discussed. To facilitate this, systems building on interactive retrieval from knowledge-grounded media are a controllable and known base for experimentation. A conversational media agent should retrieve relevant content, understand key concepts in the content through grounding to a knowledge base, and enable exploration by offering to discuss a topic further or progress to describe related topics. In this work, we release a new multi-task benchmark on COnversational Media EXploration (COMEX) to measure knowledge-grounded conversational content exploration. It consists of a heterogeneous semantically annotated media corpus and topic-specific data for 1) entity Wikification and salience, 2) conversational content ranking on heterogeneous media content, 3) background link ranking, and 4) background linking explanation. We develop COMEX with judgments and conversational interactions developed in partnership with professional editorial staff from the BBC. We study the behavior of state-of-the-art systems, with the results demonstrating significant headroom on all tasks.","PeriodicalId":138046,"journal":{"name":"Proceedings of the 4th Conference on Conversational User Interfaces","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 4th Conference on Conversational User Interfaces","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3543829.3543830","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Open-domain conversational interaction with news, podcasts, and other types of heterogeneous content remains an open challenge. Interactive agents must support information access in a way that is fair, impartial, and true to the content and knowledge discussed. To facilitate this, systems building on interactive retrieval from knowledge-grounded media are a controllable and known base for experimentation. A conversational media agent should retrieve relevant content, understand key concepts in the content through grounding to a knowledge base, and enable exploration by offering to discuss a topic further or progress to describe related topics. In this work, we release a new multi-task benchmark on COnversational Media EXploration (COMEX) to measure knowledge-grounded conversational content exploration. It consists of a heterogeneous semantically annotated media corpus and topic-specific data for 1) entity Wikification and salience, 2) conversational content ranking on heterogeneous media content, 3) background link ranking, and 4) background linking explanation. We develop COMEX with judgments and conversational interactions developed in partnership with professional editorial staff from the BBC. We study the behavior of state-of-the-art systems, with the results demonstrating significant headroom on all tasks.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

COMEX:基于知识的对话媒体探索的多任务基准

与新闻、播客和其他类型的异构内容的开放域会话交互仍然是一个开放的挑战。交互式代理必须以公平、公正和忠实于所讨论的内容和知识的方式支持信息访问。为了促进这一点，建立在基于知识的媒体的交互式检索上的系统是一个可控的和已知的实验基础。会话媒体代理应该检索相关内容，通过建立知识库来理解内容中的关键概念，并通过提供进一步讨论主题或进展来描述相关主题来进行探索。在这项工作中，我们发布了一个新的会话媒体探索(COMEX)多任务基准来衡量基于知识的会话内容探索。它由异构语义注释的媒体语料库和特定主题的数据组成，用于1)实体维基化和显著性，2)异构媒体内容的会话内容排名，3)背景链接排名，以及4)背景链接解释。我们与英国广播公司的专业编辑人员合作开发了COMEX的判断和对话互动。我们研究了最先进系统的行为，结果表明在所有任务上都有显著的空间。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proceedings of the 4th Conference on Conversational User Interfaces

自引率

0.00%

发文量