首页 > 最新文献

Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining最新文献

英文 中文
Session details: Session 9: Recommendation 会议详情:第9部分:建议
M. Zhang
{"title":"Session details: Session 9: Recommendation","authors":"M. Zhang","doi":"10.1145/3310349","DOIUrl":"https://doi.org/10.1145/3310349","url":null,"abstract":"","PeriodicalId":143253,"journal":{"name":"Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining","volume":"55 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115406590","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
More Than Just Words: Modeling Non-Textual Characteristics of Podcasts 不仅仅是单词:播客的非文本特征建模
Longqi Yang, Yu Wang, D. Dunne, Michael Sobolev, Mor Naaman, D. Estrin
Recent years have witnessed the flourishing of podcasts, a unique type of audio medium. Prior work on podcast content modeling focused on analyzing Automatic Speech Recognition outputs, which ignored vocal, musical, and conversational properties (e.g., energy, humor, and creativity) that uniquely characterize this medium. In this paper, we present an Adversarial Learning-based Podcast Representation (ALPR) that captures non-textual aspects of podcasts. Through extensive experiments on a large-scale podcast dataset (88,728 episodes from 18,433 channels), we show that (1) ALPR significantly outperforms the state-of-the-art features developed for music and speech in predicting theseriousness andenergy of podcasts, and (2) incorporating ALPR significantly improves the performance of topic-based podcast-popularity prediction. Our experiments also reveal factors that correlate with podcast popularity.
播客是一种独特的音频媒体,近年来蓬勃发展。之前关于播客内容建模的工作主要集中在分析自动语音识别输出,而忽略了这种媒体特有的声乐、音乐和会话属性(例如,能量、幽默和创造力)。在本文中,我们提出了一种基于对抗性学习的播客表示(ALPR),它可以捕获播客的非文本方面。通过大规模播客数据集(来自18433个频道的88,728集)的广泛实验,我们表明:(1)ALPR在预测播客的严重性和能量方面显著优于为音乐和语音开发的最先进特征,(2)结合ALPR显著提高了基于主题的播客流行度预测的性能。我们的实验还揭示了与播客受欢迎程度相关的因素。
{"title":"More Than Just Words: Modeling Non-Textual Characteristics of Podcasts","authors":"Longqi Yang, Yu Wang, D. Dunne, Michael Sobolev, Mor Naaman, D. Estrin","doi":"10.1145/3289600.3290993","DOIUrl":"https://doi.org/10.1145/3289600.3290993","url":null,"abstract":"Recent years have witnessed the flourishing of podcasts, a unique type of audio medium. Prior work on podcast content modeling focused on analyzing Automatic Speech Recognition outputs, which ignored vocal, musical, and conversational properties (e.g., energy, humor, and creativity) that uniquely characterize this medium. In this paper, we present an Adversarial Learning-based Podcast Representation (ALPR) that captures non-textual aspects of podcasts. Through extensive experiments on a large-scale podcast dataset (88,728 episodes from 18,433 channels), we show that (1) ALPR significantly outperforms the state-of-the-art features developed for music and speech in predicting theseriousness andenergy of podcasts, and (2) incorporating ALPR significantly improves the performance of topic-based podcast-popularity prediction. Our experiments also reveal factors that correlate with podcast popularity.","PeriodicalId":143253,"journal":{"name":"Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114394241","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Fairness-Aware Machine Learning: Practical Challenges and Lessons Learned 公平感知机器学习:实践挑战和经验教训
Sarah Bird, K. Kenthapadi, Emre Kıcıman, Margaret Mitchell
Researchers and practitioners from different disciplines have highlighted the ethical and legal challenges posed by the use of machine learned models and data-driven systems, and the potential for such systems to discriminate against certain population groups, due to biases in algorithmic decision-making systems. This tutorial aims to present an overview of algorithmic bias / discrimination issues observed over the last few years and the lessons learned, key regulations and laws, and evolution of techniques for achieving fairness in machine learning systems. We will motivate the need for adopting a "fairness-first" approach (as opposed to viewing algorithmic bias / fairness considerations as an afterthought), when developing machine learning based models and systems for different consumer and enterprise applications. Then, we will focus on the application of fairness-aware machine learning techniques in practice, by presenting case studies from different technology companies. Based on our experiences in industry, we will identify open problems and research challenges for the data mining / machine learning community.
来自不同学科的研究人员和从业人员强调了使用机器学习模型和数据驱动系统所带来的伦理和法律挑战,以及由于算法决策系统中的偏见,这些系统可能会歧视某些人群。本教程旨在概述过去几年观察到的算法偏见/歧视问题,吸取的教训,关键法规和法律,以及在机器学习系统中实现公平的技术演变。在为不同的消费者和企业应用程序开发基于机器学习的模型和系统时,我们将激发采用“公平优先”方法的需求(而不是将算法偏见/公平考虑视为事后考虑)。然后,我们将通过介绍来自不同技术公司的案例研究,专注于公平感知机器学习技术在实践中的应用。基于我们的行业经验,我们将为数据挖掘/机器学习社区确定开放的问题和研究挑战。
{"title":"Fairness-Aware Machine Learning: Practical Challenges and Lessons Learned","authors":"Sarah Bird, K. Kenthapadi, Emre Kıcıman, Margaret Mitchell","doi":"10.1145/3289600.3291383","DOIUrl":"https://doi.org/10.1145/3289600.3291383","url":null,"abstract":"Researchers and practitioners from different disciplines have highlighted the ethical and legal challenges posed by the use of machine learned models and data-driven systems, and the potential for such systems to discriminate against certain population groups, due to biases in algorithmic decision-making systems. This tutorial aims to present an overview of algorithmic bias / discrimination issues observed over the last few years and the lessons learned, key regulations and laws, and evolution of techniques for achieving fairness in machine learning systems. We will motivate the need for adopting a \"fairness-first\" approach (as opposed to viewing algorithmic bias / fairness considerations as an afterthought), when developing machine learning based models and systems for different consumer and enterprise applications. Then, we will focus on the application of fairness-aware machine learning techniques in practice, by presenting case studies from different technology companies. Based on our experiences in industry, we will identify open problems and research challenges for the data mining / machine learning community.","PeriodicalId":143253,"journal":{"name":"Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125607010","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 51
Task Intelligence Workshop @ WSDM 2019 任务智能研讨会@ WSDM 2019
Ahmed Hassan Awadallah, C. Gurrin, M. Sanderson, Ryen W. White
The task intelligence workshop at the 2019 ACM Web Search and Data Mining (WSDM) conference comprised a mixture of research paper presentations, reports from data challenge participants, invited keynote(s) on broad topics related to tasks, and a workshop-wide discussion about task intelligence and its implications for system development.
2019年ACM Web搜索和数据挖掘(WSDM)会议上的任务智能研讨会包括研究论文演讲、数据挑战参与者的报告、关于与任务相关的广泛主题的受邀主题演讲,以及关于任务智能及其对系统开发的影响的研讨会范围内的讨论。
{"title":"Task Intelligence Workshop @ WSDM 2019","authors":"Ahmed Hassan Awadallah, C. Gurrin, M. Sanderson, Ryen W. White","doi":"10.1145/3289600.3291374","DOIUrl":"https://doi.org/10.1145/3289600.3291374","url":null,"abstract":"The task intelligence workshop at the 2019 ACM Web Search and Data Mining (WSDM) conference comprised a mixture of research paper presentations, reports from data challenge participants, invited keynote(s) on broad topics related to tasks, and a workshop-wide discussion about task intelligence and its implications for system development.","PeriodicalId":143253,"journal":{"name":"Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining","volume":"68 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122015362","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
FaceOff: Assisting the Manifestation Design of Web Graphical User Interface FaceOff:协助Web图形用户界面的表现设计
Shuyu Zheng, Ziniu Hu, Yun Ma
Designing desirable and aesthetical manifestation of web graphic user interfaces (GUI) is a challenging task for web developers. After determining a web page's content, developers usually refer to existing pages, and adapt the styles from desired pages into the target one. However, it is not only difficult to find appropriate pages to exhibit the target page's content, but also tedious to incorporate styles from different pages harmoniously in the target page. To tackle these two issues, we propose FaceOff, a data-driven automation system that assists the manifestation design of web GUI. FaceOff constructs a repository of web GUI templates based on 15,491 web pages from popular websites and professional design examples. Given a web page for designing manifestation, FaceOff first segments it into multiple blocks, and retrieves GUI templates in the repository for each block. Subsequently, FaceOff recommends multiple combinations of templates according to a Convolutional Neural Network (CNN) based style-embedding model, which makes the recommended style combinations diverse and accordant. We demonstrate that FaceOff can retrieve suitable GUI templates with well-designed and harmonious style, and thus alleviate the developer efforts.
设计令人满意的、美观的web图形用户界面(GUI)对web开发人员来说是一项具有挑战性的任务。在确定了网页的内容后,开发人员通常会参考现有的页面,并将期望页面的样式调整为目标页面。然而,不仅很难找到合适的页面来展示目标页面的内容,而且在目标页面中和谐地融入不同页面的样式也很繁琐。为了解决这两个问题,我们提出了FaceOff,一个数据驱动的自动化系统,帮助web GUI的显示设计。FaceOff基于来自流行网站和专业设计示例的15,491个网页构建了一个web GUI模板库。给定一个用于设计显示的网页,FaceOff首先将其分成多个块,并在每个块的存储库中检索GUI模板。随后,FaceOff根据基于卷积神经网络(CNN)的样式嵌入模型推荐多种模板组合,使推荐的样式组合多样化和一致性。我们证明了FaceOff可以检索合适的GUI模板,设计良好,风格和谐,从而减轻了开发人员的工作量。
{"title":"FaceOff: Assisting the Manifestation Design of Web Graphical User Interface","authors":"Shuyu Zheng, Ziniu Hu, Yun Ma","doi":"10.1145/3289600.3290610","DOIUrl":"https://doi.org/10.1145/3289600.3290610","url":null,"abstract":"Designing desirable and aesthetical manifestation of web graphic user interfaces (GUI) is a challenging task for web developers. After determining a web page's content, developers usually refer to existing pages, and adapt the styles from desired pages into the target one. However, it is not only difficult to find appropriate pages to exhibit the target page's content, but also tedious to incorporate styles from different pages harmoniously in the target page. To tackle these two issues, we propose FaceOff, a data-driven automation system that assists the manifestation design of web GUI. FaceOff constructs a repository of web GUI templates based on 15,491 web pages from popular websites and professional design examples. Given a web page for designing manifestation, FaceOff first segments it into multiple blocks, and retrieves GUI templates in the repository for each block. Subsequently, FaceOff recommends multiple combinations of templates according to a Convolutional Neural Network (CNN) based style-embedding model, which makes the recommended style combinations diverse and accordant. We demonstrate that FaceOff can retrieve suitable GUI templates with well-designed and harmonious style, and thus alleviate the developer efforts.","PeriodicalId":143253,"journal":{"name":"Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131277420","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Session details: Session 4: FATE & Privacy 会议详情:会议4:FATE & Privacy
Fernando Diaz
{"title":"Session details: Session 4: FATE & Privacy","authors":"Fernando Diaz","doi":"10.1145/3310344","DOIUrl":"https://doi.org/10.1145/3310344","url":null,"abstract":"","PeriodicalId":143253,"journal":{"name":"Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131875490","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Interaction Embeddings for Prediction and Explanation in Knowledge Graphs 知识图中预测与解释的交互嵌入
Wen Zhang, B. Paudel, Wei Zhang, A. Bernstein, Huajun Chen
Knowledge graph embedding aims to learn distributed representations for entities and relations, and is proven to be effective in many applications. Crossover interactions -- bi-directional effects between entities and relations --- help select related information when predicting a new triple, but haven't been formally discussed before. In this paper, we propose CrossE, a novel knowledge graph embedding which explicitly simulates crossover interactions. It not only learns one general embedding for each entity and relation as most previous methods do, but also generates multiple triple specific embeddings for both of them, named interaction embeddings. We evaluate embeddings on typical link prediction tasks and find that CrossE achieves state-of-the-art results on complex and more challenging datasets. Furthermore, we evaluate embeddings from a new perspective -- giving explanations for predicted triples, which is important for real applications. In this work, an explanation for a triple is regarded as a reliable closed-path between the head and the tail entity. Compared to other baselines, we show experimentally that CrossE, benefiting from interaction embeddings, is more capable of generating reliable explanations to support its predictions.
知识图嵌入旨在学习实体和关系的分布式表示,并在许多应用中被证明是有效的。跨界互动——实体和关系之间的双向效应——有助于在预测新的三联体时选择相关信息,但之前还没有正式讨论过。在本文中,我们提出了一种新的知识图嵌入CrossE,它显式地模拟了交叉交互。它不仅像大多数以前的方法那样为每个实体和关系学习一个通用的嵌入,而且还为它们都生成多个三重特定的嵌入,称为交互嵌入。我们评估了典型链接预测任务中的嵌入,发现CrossE在复杂和更具挑战性的数据集上取得了最先进的结果。此外,我们从一个新的角度来评估嵌入——给出预测三元组的解释,这对实际应用很重要。在这项工作中,对三重的解释被视为头部和尾部实体之间的可靠闭合路径。与其他基线相比,我们通过实验表明,受益于交互嵌入的CrossE更有能力生成可靠的解释来支持其预测。
{"title":"Interaction Embeddings for Prediction and Explanation in Knowledge Graphs","authors":"Wen Zhang, B. Paudel, Wei Zhang, A. Bernstein, Huajun Chen","doi":"10.1145/3289600.3291014","DOIUrl":"https://doi.org/10.1145/3289600.3291014","url":null,"abstract":"Knowledge graph embedding aims to learn distributed representations for entities and relations, and is proven to be effective in many applications. Crossover interactions -- bi-directional effects between entities and relations --- help select related information when predicting a new triple, but haven't been formally discussed before. In this paper, we propose CrossE, a novel knowledge graph embedding which explicitly simulates crossover interactions. It not only learns one general embedding for each entity and relation as most previous methods do, but also generates multiple triple specific embeddings for both of them, named interaction embeddings. We evaluate embeddings on typical link prediction tasks and find that CrossE achieves state-of-the-art results on complex and more challenging datasets. Furthermore, we evaluate embeddings from a new perspective -- giving explanations for predicted triples, which is important for real applications. In this work, an explanation for a triple is regarded as a reliable closed-path between the head and the tail entity. Compared to other baselines, we show experimentally that CrossE, benefiting from interaction embeddings, is more capable of generating reliable explanations to support its predictions.","PeriodicalId":143253,"journal":{"name":"Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130679556","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 144
The Local Closure Coefficient: A New Perspective On Network Clustering 局部闭合系数:研究网络聚类的新视角
Hao Yin, Austin R. Benson, J. Leskovec
The phenomenon of edge clustering in real-world networks is a fundamental property underlying many ideas and techniques in network science. Clustering is typically quantified by the clustering coefficient, which measures the fraction of pairs of neighbors of a given center node that are connected. However, many common explanations of edge clustering attribute the triadic closure to a head node instead of the center node of a length-2 path; for example, a friend of my friend is also my friend. While such explanations are common in network analysis, there is no measurement for edge clustering that can be attributed to the head node. Here we develop local closure coefficients as a metric quantifying head-node-based edge clustering. We define the local closure coefficient as the fraction of length-2 paths emanating from the head node that induce a triangle. This subtle difference in definition leads to remarkably different properties from traditional clustering coefficients. We analyze correlations with node degree, connect the closure coefficient to community detection, and show that closure coefficients as a feature can improve link prediction.
现实网络中的边缘聚类现象是网络科学中许多思想和技术的基本属性。聚类通常通过聚类系数来量化,聚类系数测量给定中心节点连接的邻居对的比例。然而,许多常见的边聚类解释将三元闭包归为一个头节点,而不是长度为2的路径的中心节点;例如,我朋友的朋友也是我的朋友。虽然这种解释在网络分析中很常见,但没有可以归因于头部节点的边缘聚类测量。在这里,我们开发了局部闭合系数作为量化基于头节点的边缘聚类的度量。我们将局部闭合系数定义为从头节点发出的长度为2的路径的分数,这些路径会产生一个三角形。这种定义上的细微差别导致了与传统聚类系数显著不同的性质。我们分析了节点度的相关性,将闭合系数与社区检测联系起来,并证明闭合系数作为特征可以改善链接预测。
{"title":"The Local Closure Coefficient: A New Perspective On Network Clustering","authors":"Hao Yin, Austin R. Benson, J. Leskovec","doi":"10.1145/3289600.3290991","DOIUrl":"https://doi.org/10.1145/3289600.3290991","url":null,"abstract":"The phenomenon of edge clustering in real-world networks is a fundamental property underlying many ideas and techniques in network science. Clustering is typically quantified by the clustering coefficient, which measures the fraction of pairs of neighbors of a given center node that are connected. However, many common explanations of edge clustering attribute the triadic closure to a head node instead of the center node of a length-2 path; for example, a friend of my friend is also my friend. While such explanations are common in network analysis, there is no measurement for edge clustering that can be attributed to the head node. Here we develop local closure coefficients as a metric quantifying head-node-based edge clustering. We define the local closure coefficient as the fraction of length-2 paths emanating from the head node that induce a triangle. This subtle difference in definition leads to remarkably different properties from traditional clustering coefficients. We analyze correlations with node degree, connect the closure coefficient to community detection, and show that closure coefficients as a feature can improve link prediction.","PeriodicalId":143253,"journal":{"name":"Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134444017","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 49
Across-Time Comparative Summarization of News Articles 新闻文章的跨时间比较摘要
Yijun Duan, A. Jatowt
Comparative summarization is an effective strategy to discover important similarities and differences in collections of documents biased to users' interests. A natural method of this task is to find important and corresponding content. In this paper, we propose a novel research task of automatic query-based across-time summarization in news archives as well as we introduce an effective method to solve this task. The proposed model first learns an orthogonal transformation between temporally distant news collections. Then, it generates a set of corresponding sentence pairs based on a concise integer linear programming framework. We experimentally demonstrate the effectiveness of our method on the New York Times Annotated Corpus.
比较摘要是一种有效的策略,可以发现偏向于用户兴趣的文档集合中的重要异同。这项任务的自然方法是找到重要的和相应的内容。本文提出了一种新的基于查询的新闻档案跨时间自动摘要研究任务,并给出了一种有效的方法来解决这一问题。该模型首先学习时间间隔较远的新闻集合之间的正交变换。然后,基于简洁的整数线性规划框架生成一组相应的句子对。通过实验验证了该方法在《纽约时报》标注语料库上的有效性。
{"title":"Across-Time Comparative Summarization of News Articles","authors":"Yijun Duan, A. Jatowt","doi":"10.1145/3289600.3291008","DOIUrl":"https://doi.org/10.1145/3289600.3291008","url":null,"abstract":"Comparative summarization is an effective strategy to discover important similarities and differences in collections of documents biased to users' interests. A natural method of this task is to find important and corresponding content. In this paper, we propose a novel research task of automatic query-based across-time summarization in news archives as well as we introduce an effective method to solve this task. The proposed model first learns an orthogonal transformation between temporally distant news collections. Then, it generates a set of corresponding sentence pairs based on a concise integer linear programming framework. We experimentally demonstrate the effectiveness of our method on the New York Times Annotated Corpus.","PeriodicalId":143253,"journal":{"name":"Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133574638","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 23
Multi-Representation Fusion Network for Multi-Turn Response Selection in Retrieval-Based Chatbots 基于检索的聊天机器人多回合响应选择的多表示融合网络
Chongyang Tao, Wei Wu, Can Xu, Wenpeng Hu, Dongyan Zhao, Rui Yan
We consider context-response matching with multiple types of representations for multi-turn response selection in retrieval-based chatbots. The representations encode semantics of contexts and responses on words, n-grams, and sub-sequences of utterances, and capture both short-term and long-term dependencies among words. With such a number of representations in hand, we study how to fuse them in a deep neural architecture for matching and how each of them contributes to matching. To this end, we propose a multi-representation fusion network where the representations can be fused into matching at an early stage, at an intermediate stage, or at the last stage. We empirically compare different representations and fusing strategies on two benchmark data sets. Evaluation results indicate that late fusion is always better than early fusion, and by fusing the representations at the last stage, our model significantly outperforms the existing methods, and achieves new state-of-the-art performance on both data sets. Through a thorough ablation study, we demonstrate the effect of each representation to matching, which sheds light on how to select them in practical systems.
在基于检索的聊天机器人中,我们考虑了上下文-响应匹配与多种类型表示的多回合响应选择。表征对单词、n-gram和话语子序列上的上下文语义和反应进行编码,并捕获单词之间的短期和长期依赖关系。有了这么多的表征,我们研究如何将它们融合到一个深度神经结构中进行匹配,以及每个表征如何对匹配做出贡献。为此,我们提出了一种多表示融合网络,其中表示可以在早期阶段,中间阶段或最后阶段融合到匹配中。我们在两个基准数据集上比较了不同的表示和融合策略。评估结果表明,后期融合总是优于早期融合,并且通过在最后阶段融合表征,我们的模型显著优于现有方法,并在两个数据集上实现了新的最先进的性能。通过深入的烧蚀研究,我们证明了每种表示对匹配的影响,这有助于在实际系统中如何选择它们。
{"title":"Multi-Representation Fusion Network for Multi-Turn Response Selection in Retrieval-Based Chatbots","authors":"Chongyang Tao, Wei Wu, Can Xu, Wenpeng Hu, Dongyan Zhao, Rui Yan","doi":"10.1145/3289600.3290985","DOIUrl":"https://doi.org/10.1145/3289600.3290985","url":null,"abstract":"We consider context-response matching with multiple types of representations for multi-turn response selection in retrieval-based chatbots. The representations encode semantics of contexts and responses on words, n-grams, and sub-sequences of utterances, and capture both short-term and long-term dependencies among words. With such a number of representations in hand, we study how to fuse them in a deep neural architecture for matching and how each of them contributes to matching. To this end, we propose a multi-representation fusion network where the representations can be fused into matching at an early stage, at an intermediate stage, or at the last stage. We empirically compare different representations and fusing strategies on two benchmark data sets. Evaluation results indicate that late fusion is always better than early fusion, and by fusing the representations at the last stage, our model significantly outperforms the existing methods, and achieves new state-of-the-art performance on both data sets. Through a thorough ablation study, we demonstrate the effect of each representation to matching, which sheds light on how to select them in practical systems.","PeriodicalId":143253,"journal":{"name":"Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining","volume":"198 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133845865","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 126
期刊
Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1