首页 > 最新文献

Proceedings of the Web Conference 2021最新文献

英文 中文
Learning from Graph Propagation via Ordinal Distillation for One-Shot Automated Essay Scoring 基于顺序蒸馏的图传播学习用于一次性自动作文评分
Pub Date : 2021-04-19 DOI: 10.1145/3442381.3450017
Zhiwei Jiang, Meng Liu, Yafeng Yin, Hua Yu, Zifeng Cheng, Qing Gu
One-shot automated essay scoring (AES) aims to assign scores to a set of essays written specific to a certain prompt, with only one manually scored essay per distinct score. Compared to the previous-studied prompt-specific AES which usually requires a large number of manually scored essays for model training (e.g., about 600 manually scored essays out of totally 1000 essays), one-shot AES can greatly reduce the workload of manual scoring. In this paper, we propose a Transductive Graph-based Ordinal Distillation (TGOD) framework to tackle the task of one-shot AES. Specifically, we design a transductive graph-based model as a teacher model to generate pseudo labels of unlabeled essays based on the one-shot labeled essays. Then, we distill the knowledge in the teacher model into a neural student model by learning from the high confidence pseudo labels. Different from the general knowledge distillation, we propose an ordinal-aware unimodal distillation which makes a unimodal distribution constraint on the output of student model, to tolerate the minor errors existed in pseudo labels. Experimental results on the public dataset ASAP show that TGOD can improve the performance of existing neural AES models under the one-shot AES setting and achieve an acceptable average QWK of 0.69.
一次性自动论文评分(AES)旨在为一组特定于某个提示的文章分配分数,每个不同的分数只有一篇人工评分的文章。之前研究的针对提示的AES通常需要大量人工评分的文章进行模型训练(例如,1000篇文章中约有600篇文章是人工评分的),相比之下,一次性AES可以大大减少人工评分的工作量。在本文中,我们提出了一个基于换能图的有序蒸馏(TGOD)框架来解决一次性AES的任务。具体来说,我们设计了一个基于换能图的模型作为教师模型,在一次性标记文章的基础上生成未标记文章的伪标签。然后,我们通过学习高置信度的伪标签,将教师模型中的知识提炼成神经学生模型。与一般的知识蒸馏不同,我们提出了一种顺序感知的单峰蒸馏,对学生模型的输出进行单峰分布约束,以容忍伪标签中存在的微小误差。在公共数据集ASAP上的实验结果表明,TGOD可以提高现有神经AES模型在一次AES设置下的性能,达到可接受的0.69的平均QWK。
{"title":"Learning from Graph Propagation via Ordinal Distillation for One-Shot Automated Essay Scoring","authors":"Zhiwei Jiang, Meng Liu, Yafeng Yin, Hua Yu, Zifeng Cheng, Qing Gu","doi":"10.1145/3442381.3450017","DOIUrl":"https://doi.org/10.1145/3442381.3450017","url":null,"abstract":"One-shot automated essay scoring (AES) aims to assign scores to a set of essays written specific to a certain prompt, with only one manually scored essay per distinct score. Compared to the previous-studied prompt-specific AES which usually requires a large number of manually scored essays for model training (e.g., about 600 manually scored essays out of totally 1000 essays), one-shot AES can greatly reduce the workload of manual scoring. In this paper, we propose a Transductive Graph-based Ordinal Distillation (TGOD) framework to tackle the task of one-shot AES. Specifically, we design a transductive graph-based model as a teacher model to generate pseudo labels of unlabeled essays based on the one-shot labeled essays. Then, we distill the knowledge in the teacher model into a neural student model by learning from the high confidence pseudo labels. Different from the general knowledge distillation, we propose an ordinal-aware unimodal distillation which makes a unimodal distribution constraint on the output of student model, to tolerate the minor errors existed in pseudo labels. Experimental results on the public dataset ASAP show that TGOD can improve the performance of existing neural AES models under the one-shot AES setting and achieve an acceptable average QWK of 0.69.","PeriodicalId":106672,"journal":{"name":"Proceedings of the Web Conference 2021","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132171808","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Knowledge Embedding Based Graph Convolutional Network 基于知识嵌入的图卷积网络
Pub Date : 2021-04-19 DOI: 10.1145/3442381.3449925
Donghan Yu, Yiming Yang, Ruohong Zhang, Yuexin Wu
Recently, a considerable literature has grown up around the theme of Graph Convolutional Network (GCN). How to effectively leverage the rich structural information in complex graphs, such as knowledge graphs with heterogeneous types of entities and relations, is a primary open challenge in the field. Most GCN methods are either restricted to graphs with a homogeneous type of edges (e.g., citation links only), or focusing on representation learning for nodes only instead of jointly propagating and updating the embeddings of both nodes and edges for target-driven objectives. This paper addresses these limitations by proposing a novel framework, namely the Knowledge Embedding based Graph Convolutional Network (KE-GCN), which combines the power of GCNs in graph-based belief propagation and the strengths of advanced knowledge embedding (a.k.a. knowledge graph embedding) methods, and goes beyond. Our theoretical analysis shows that KE-GCN offers an elegant unification of several well-known GCN methods as specific cases, with a new perspective of graph convolution. Experimental results on benchmark datasets show the advantageous performance of KE-GCN over strong baseline methods in the tasks of knowledge graph alignment and entity classification1.
近年来,围绕图卷积网络(Graph Convolutional Network, GCN)这一主题出现了大量的文献。如何有效地利用复杂图中丰富的结构信息,如具有异构类型实体和关系的知识图,是该领域面临的主要挑战。大多数GCN方法要么局限于具有同质边缘类型的图(例如,仅引用链接),要么只关注节点的表示学习,而不是为目标驱动的目标联合传播和更新节点和边缘的嵌入。本文提出了一种新的框架,即基于知识嵌入的图卷积网络(KE-GCN),该框架结合了gcn在基于图的信念传播中的强大功能和高级知识嵌入(又称知识图嵌入)方法的优势,并超越了这些局限性。我们的理论分析表明,KE-GCN提供了几种著名的GCN方法作为具体案例的优雅统一,具有图卷积的新视角。在基准数据集上的实验结果表明,KE-GCN在知识图对齐和实体分类任务上优于强基线方法1。
{"title":"Knowledge Embedding Based Graph Convolutional Network","authors":"Donghan Yu, Yiming Yang, Ruohong Zhang, Yuexin Wu","doi":"10.1145/3442381.3449925","DOIUrl":"https://doi.org/10.1145/3442381.3449925","url":null,"abstract":"Recently, a considerable literature has grown up around the theme of Graph Convolutional Network (GCN). How to effectively leverage the rich structural information in complex graphs, such as knowledge graphs with heterogeneous types of entities and relations, is a primary open challenge in the field. Most GCN methods are either restricted to graphs with a homogeneous type of edges (e.g., citation links only), or focusing on representation learning for nodes only instead of jointly propagating and updating the embeddings of both nodes and edges for target-driven objectives. This paper addresses these limitations by proposing a novel framework, namely the Knowledge Embedding based Graph Convolutional Network (KE-GCN), which combines the power of GCNs in graph-based belief propagation and the strengths of advanced knowledge embedding (a.k.a. knowledge graph embedding) methods, and goes beyond. Our theoretical analysis shows that KE-GCN offers an elegant unification of several well-known GCN methods as specific cases, with a new perspective of graph convolution. Experimental results on benchmark datasets show the advantageous performance of KE-GCN over strong baseline methods in the tasks of knowledge graph alignment and entity classification1.","PeriodicalId":106672,"journal":{"name":"Proceedings of the Web Conference 2021","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114667904","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 61
Understanding User Sensemaking in Machine Learning Fairness Assessment Systems 理解机器学习公平性评估系统中的用户语义
Pub Date : 2021-04-19 DOI: 10.1145/3442381.3450092
Ziwei Gu, Jing Nathan Yan, Jeffrey M. Rzeszotarski
A variety of systems have been proposed to assist users in detecting machine learning (ML) fairness issues. These systems approach bias reduction from a number of perspectives, including recommender systems, exploratory tools, and dashboards. In this paper, we seek to inform the design of these systems by examining how individuals make sense of fairness issues as they use different de-biasing affordances. In particular, we consider the tension between de-biasing recommendations which are quick but may lack nuance and ”what-if” style exploration which is time consuming but may lead to deeper understanding and transferable insights. Using logs, think-aloud data, and semi-structured interviews we find that exploratory systems promote a rich pattern of hypothesis generation and testing, while recommendations deliver quick answers which satisfy participants at the cost of reduced information exposure. We highlight design requirements and trade-offs in the design of ML fairness systems to promote accurate and explainable assessments.
已经提出了各种系统来帮助用户检测机器学习(ML)公平性问题。这些系统从许多角度来减少偏见,包括推荐系统、探索工具和仪表板。在本文中,我们试图通过研究个人在使用不同的去偏性支持时如何理解公平问题来为这些系统的设计提供信息。特别地,我们考虑了快速但可能缺乏细微差别的去偏见建议和“假设”风格的探索之间的紧张关系,这是耗时的,但可能导致更深入的理解和可转移的见解。通过使用日志、有声思考数据和半结构化访谈,我们发现探索性系统促进了假设生成和测试的丰富模式,而建议以减少信息暴露为代价提供快速答案,以满足参与者。我们强调了机器学习公平性系统设计中的设计要求和权衡,以促进准确和可解释的评估。
{"title":"Understanding User Sensemaking in Machine Learning Fairness Assessment Systems","authors":"Ziwei Gu, Jing Nathan Yan, Jeffrey M. Rzeszotarski","doi":"10.1145/3442381.3450092","DOIUrl":"https://doi.org/10.1145/3442381.3450092","url":null,"abstract":"A variety of systems have been proposed to assist users in detecting machine learning (ML) fairness issues. These systems approach bias reduction from a number of perspectives, including recommender systems, exploratory tools, and dashboards. In this paper, we seek to inform the design of these systems by examining how individuals make sense of fairness issues as they use different de-biasing affordances. In particular, we consider the tension between de-biasing recommendations which are quick but may lack nuance and ”what-if” style exploration which is time consuming but may lead to deeper understanding and transferable insights. Using logs, think-aloud data, and semi-structured interviews we find that exploratory systems promote a rich pattern of hypothesis generation and testing, while recommendations deliver quick answers which satisfy participants at the cost of reduced information exposure. We highlight design requirements and trade-offs in the design of ML fairness systems to promote accurate and explainable assessments.","PeriodicalId":106672,"journal":{"name":"Proceedings of the Web Conference 2021","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114872641","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Incremental Spatio-Temporal Graph Learning for Online Query-POI Matching 在线查询- poi匹配的增量时空图学习
Pub Date : 2021-04-19 DOI: 10.1145/3442381.3449810
Zixuan Yuan, Hao Liu, Junming Liu, Yanchi Liu, Yang Yang, Renjun Hu, Hui Xiong
Query and Point-of-Interest (POI) matching, aiming at recommending the most relevant POIs from partial query keywords, has become one of the most essential functions in online navigation and ride-hailing applications. Existing methods for query-POI matching, such as Google Maps and Uber, have a natural focus on measuring the static semantic similarity between contextual information of queries and geographical information of POIs. However, it remains challenging for dynamic and personalized online query-POI matching because of the non-stationary and situational context-dependent query-POI relevance. Moreover, the large volume of online queries requires an adaptive and incremental model training strategy that is efficient and scalable in the online scenario. To this end, in this paper, we propose an Incremental Spatio-Temporal Graph Learning (IncreSTGL) framework for intelligent online query-POI matching. Specifically, we first model dynamic query-POI interactions as microscopic and macroscopic graphs. Then, we propose an incremental graph representation learning module to refine and update query-POI interaction graphs in an online incremental fashion, which includes: (i) a contextual graph attention operation quantifying query-POI correlation based on historical queries under dynamic situational context, (ii) a graph discrimination operation capturing the sequential query-POI relevance drift from a holistic view of personalized preference and social homophily, and (iii) a multi-level temporal attention operation summarizing the temporal variations of query-POI interaction graphs for subsequent query-POI matching. Finally, we introduce a lightweight semantic matching module for online query-POI similarity measurement. To demonstrate the effectiveness and efficiency of the proposed algorithm, we conduct extensive experiments on two real-world datasets collected from a leading online navigation and map service provider in China.
查询与兴趣点匹配(Query and Point-of-Interest, POI)旨在从部分查询关键字中推荐最相关的兴趣点,已成为在线导航和网约车应用中最重要的功能之一。现有的查询- poi匹配方法,如谷歌Maps和Uber,自然侧重于测量查询的上下文信息和poi的地理信息之间的静态语义相似度。然而,由于查询- poi相关性的非平稳和情景上下文依赖,动态和个性化在线查询- poi匹配仍然具有挑战性。此外,大量的在线查询需要一种在在线场景中高效且可扩展的自适应增量模型训练策略。为此,本文提出了一种用于智能在线查询- poi匹配的增量时空图学习(IncreSTGL)框架。具体来说,我们首先将动态查询- poi交互建模为微观和宏观图。然后,我们提出了一个增量图表示学习模块,以在线增量方式精炼和更新查询- poi交互图,其中包括:(i)基于动态情景背景下历史查询量化查询- poi相关性的上下文图注意操作;(ii)从个性化偏好和社会同质性的整体角度捕捉顺序查询- poi相关性漂移的图判别操作;(iii)总结查询- poi交互图的时间变化,为后续查询- poi匹配提供多层次时间注意操作。最后,我们引入了一个轻量级的在线查询语义匹配模块——poi相似度度量。为了证明所提出算法的有效性和效率,我们在中国一家领先的在线导航和地图服务提供商收集的两个真实数据集上进行了广泛的实验。
{"title":"Incremental Spatio-Temporal Graph Learning for Online Query-POI Matching","authors":"Zixuan Yuan, Hao Liu, Junming Liu, Yanchi Liu, Yang Yang, Renjun Hu, Hui Xiong","doi":"10.1145/3442381.3449810","DOIUrl":"https://doi.org/10.1145/3442381.3449810","url":null,"abstract":"Query and Point-of-Interest (POI) matching, aiming at recommending the most relevant POIs from partial query keywords, has become one of the most essential functions in online navigation and ride-hailing applications. Existing methods for query-POI matching, such as Google Maps and Uber, have a natural focus on measuring the static semantic similarity between contextual information of queries and geographical information of POIs. However, it remains challenging for dynamic and personalized online query-POI matching because of the non-stationary and situational context-dependent query-POI relevance. Moreover, the large volume of online queries requires an adaptive and incremental model training strategy that is efficient and scalable in the online scenario. To this end, in this paper, we propose an Incremental Spatio-Temporal Graph Learning (IncreSTGL) framework for intelligent online query-POI matching. Specifically, we first model dynamic query-POI interactions as microscopic and macroscopic graphs. Then, we propose an incremental graph representation learning module to refine and update query-POI interaction graphs in an online incremental fashion, which includes: (i) a contextual graph attention operation quantifying query-POI correlation based on historical queries under dynamic situational context, (ii) a graph discrimination operation capturing the sequential query-POI relevance drift from a holistic view of personalized preference and social homophily, and (iii) a multi-level temporal attention operation summarizing the temporal variations of query-POI interaction graphs for subsequent query-POI matching. Finally, we introduce a lightweight semantic matching module for online query-POI similarity measurement. To demonstrate the effectiveness and efficiency of the proposed algorithm, we conduct extensive experiments on two real-world datasets collected from a leading online navigation and map service provider in China.","PeriodicalId":106672,"journal":{"name":"Proceedings of the Web Conference 2021","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114908790","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Towards Content Provider Aware Recommender Systems: A Simulation Study on the Interplay between User and Provider Utilities 面向内容提供者感知的推荐系统:用户与提供者实用程序交互的模拟研究
Pub Date : 2021-04-19 DOI: 10.1145/3442381.3449889
Ruohan Zhan, Konstantina Christakopoulou, Ya Le, Jayden Ooi, Martin Mladenov, Alex Beutel, Craig Boutilier, Ed H. Chi, Minmin Chen
Most existing recommender systems focus primarily on matching users (content consumers) to content which maximizes user satisfaction on the platform. It is increasingly obvious, however, that content providers have a critical influence on user satisfaction through content creation, largely determining the content pool available for recommendation. A natural question thus arises: can we design recommenders taking into account the long-term utility of both users and content providers? By doing so, we hope to sustain more content providers and a more diverse content pool for long-term user satisfaction. Understanding the full impact of recommendations on both user and content provider groups is challenging. This paper aims to serve as a research investigation of one approach toward building a content provider aware recommender, and evaluating its impact in a simulated setup. To characterize the user-recommender-provider interdependence, we complement user modeling by formalizing provider dynamics as well. The resulting joint dynamical system gives rise to a weakly-coupled partially observable Markov decision process driven by recommender actions and user feedback to providers. We then build a REINFORCE recommender agent, coined EcoAgent, to optimize a joint objective of user utility and the counterfactual utility lift of the content provider associated with the recommended content, which we show to be equivalent to maximizing overall user utility and the utilities of all content providers on the platform under some mild assumptions. To evaluate our approach, we introduce a simulation environment capturing the key interactions among users, providers, and the recommender. We offer a number of simulated experiments that shed light on both the benefits and the limitations of our approach. These results help understand how and when a content provider aware recommender agent is of benefit in building multi-stakeholder recommender systems.
大多数现有的推荐系统主要关注用户(内容消费者)与内容的匹配,从而最大化用户在平台上的满意度。然而,越来越明显的是,内容提供商通过内容创建对用户满意度产生了关键影响,在很大程度上决定了可供推荐的内容池。因此,一个自然的问题出现了:我们能否在设计推荐时考虑到用户和内容提供者的长期效用?通过这样做,我们希望维持更多的内容提供商和更多样化的内容池,以获得长期的用户满意度。理解推荐对用户和内容提供者群体的全面影响是一项挑战。本文旨在研究构建内容提供商感知推荐的一种方法,并评估其在模拟设置中的影响。为了描述用户-推荐人-提供者的相互依赖关系,我们还通过形式化提供者动态来补充用户建模。由此产生的联合动力系统产生了由推荐行为和用户对提供者的反馈驱动的弱耦合部分可观察的马尔可夫决策过程。然后,我们建立了一个强化推荐代理,称为EcoAgent,以优化用户效用和与推荐内容相关的内容提供商的反事实效用提升的联合目标,我们证明,在一些温和的假设下,这相当于最大化整体用户效用和平台上所有内容提供商的效用。为了评估我们的方法,我们引入了一个模拟环境,捕捉用户、提供者和推荐者之间的关键交互。我们提供了一些模拟实验,阐明了我们的方法的优点和局限性。这些结果有助于理解内容提供者感知的推荐代理如何以及何时在构建多利益相关者推荐系统中受益。
{"title":"Towards Content Provider Aware Recommender Systems: A Simulation Study on the Interplay between User and Provider Utilities","authors":"Ruohan Zhan, Konstantina Christakopoulou, Ya Le, Jayden Ooi, Martin Mladenov, Alex Beutel, Craig Boutilier, Ed H. Chi, Minmin Chen","doi":"10.1145/3442381.3449889","DOIUrl":"https://doi.org/10.1145/3442381.3449889","url":null,"abstract":"Most existing recommender systems focus primarily on matching users (content consumers) to content which maximizes user satisfaction on the platform. It is increasingly obvious, however, that content providers have a critical influence on user satisfaction through content creation, largely determining the content pool available for recommendation. A natural question thus arises: can we design recommenders taking into account the long-term utility of both users and content providers? By doing so, we hope to sustain more content providers and a more diverse content pool for long-term user satisfaction. Understanding the full impact of recommendations on both user and content provider groups is challenging. This paper aims to serve as a research investigation of one approach toward building a content provider aware recommender, and evaluating its impact in a simulated setup. To characterize the user-recommender-provider interdependence, we complement user modeling by formalizing provider dynamics as well. The resulting joint dynamical system gives rise to a weakly-coupled partially observable Markov decision process driven by recommender actions and user feedback to providers. We then build a REINFORCE recommender agent, coined EcoAgent, to optimize a joint objective of user utility and the counterfactual utility lift of the content provider associated with the recommended content, which we show to be equivalent to maximizing overall user utility and the utilities of all content providers on the platform under some mild assumptions. To evaluate our approach, we introduce a simulation environment capturing the key interactions among users, providers, and the recommender. We offer a number of simulated experiments that shed light on both the benefits and the limitations of our approach. These results help understand how and when a content provider aware recommender agent is of benefit in building multi-stakeholder recommender systems.","PeriodicalId":106672,"journal":{"name":"Proceedings of the Web Conference 2021","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123453194","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Bidirectional Distillation for Top-K Recommender System Top-K推荐系统的双向蒸馏
Pub Date : 2021-04-19 DOI: 10.1145/3442381.3449878
Wonbin Kweon, SeongKu Kang, Hwanjo Yu
Recommender systems (RS) have started to employ knowledge distillation, which is a model compression technique training a compact model (student) with the knowledge transferred from a cumbersome model (teacher). The state-of-the-art methods rely on unidirectional distillation transferring the knowledge only from the teacher to the student, with an underlying assumption that the teacher is always superior to the student. However, we demonstrate that the student performs better than the teacher on a significant proportion of the test set, especially for RS. Based on this observation, we propose Bidirectional Distillation (BD) framework whereby both the teacher and the student collaboratively improve with each other. Specifically, each model is trained with the distillation loss that makes to follow the other’s prediction along with its original loss function. For effective bidirectional distillation, we propose rank discrepancy-aware sampling scheme to distill only the informative knowledge that can fully enhance each other. The proposed scheme is designed to effectively cope with a large performance gap between the teacher and the student. Trained in the bidirectional way, it turns out that both the teacher and the student are significantly improved compared to when being trained separately. Our extensive experiments on real-world datasets show that our proposed framework consistently outperforms the state-of-the-art competitors. We also provide analyses for an in-depth understanding of BD and ablation studies to verify the effectiveness of each proposed component.
推荐系统(RS)已经开始采用知识蒸馏,这是一种模型压缩技术,用从笨重模型(教师)转移过来的知识训练一个紧凑模型(学生)。最先进的方法依赖于单向蒸馏,只将知识从教师转移到学生身上,并假设教师总是优于学生。然而,我们证明了学生在很大比例的测试集上比老师表现得更好,特别是对于RS。基于这一观察,我们提出了双向蒸馏(BD)框架,即教师和学生相互协作改进。具体来说,每个模型都是用蒸馏损失来训练的,这使得它跟随另一个模型的预测以及它的原始损失函数。为了实现有效的双向蒸馏,我们提出了等级差异感知的采样方案,只提取能充分增强彼此的信息知识。所提出的方案旨在有效地应对教师和学生之间的巨大表现差距。结果表明,在双向训练的情况下,教师和学生都比单独训练有了明显的提高。我们在真实世界数据集上的广泛实验表明,我们提出的框架始终优于最先进的竞争对手。我们还提供了深入了解BD和消融研究的分析,以验证每个提议组成部分的有效性。
{"title":"Bidirectional Distillation for Top-K Recommender System","authors":"Wonbin Kweon, SeongKu Kang, Hwanjo Yu","doi":"10.1145/3442381.3449878","DOIUrl":"https://doi.org/10.1145/3442381.3449878","url":null,"abstract":"Recommender systems (RS) have started to employ knowledge distillation, which is a model compression technique training a compact model (student) with the knowledge transferred from a cumbersome model (teacher). The state-of-the-art methods rely on unidirectional distillation transferring the knowledge only from the teacher to the student, with an underlying assumption that the teacher is always superior to the student. However, we demonstrate that the student performs better than the teacher on a significant proportion of the test set, especially for RS. Based on this observation, we propose Bidirectional Distillation (BD) framework whereby both the teacher and the student collaboratively improve with each other. Specifically, each model is trained with the distillation loss that makes to follow the other’s prediction along with its original loss function. For effective bidirectional distillation, we propose rank discrepancy-aware sampling scheme to distill only the informative knowledge that can fully enhance each other. The proposed scheme is designed to effectively cope with a large performance gap between the teacher and the student. Trained in the bidirectional way, it turns out that both the teacher and the student are significantly improved compared to when being trained separately. Our extensive experiments on real-world datasets show that our proposed framework consistently outperforms the state-of-the-art competitors. We also provide analyses for an in-depth understanding of BD and ablation studies to verify the effectiveness of each proposed component.","PeriodicalId":106672,"journal":{"name":"Proceedings of the Web Conference 2021","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121979899","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 23
Long Short-Term Session Search: Joint Personalized Reranking and Next Query Prediction 长短期会话搜索:联合个性化重排序和下一个查询预测
Pub Date : 2021-04-19 DOI: 10.1145/3442381.3449941
Qiannan Cheng, Z. Ren, Yujie Lin, Pengjie Ren, Zhumin Chen, Xiangyuan Liu, M. de Rijke
DR and next query prediction (NQP) are two core tasks in session search. They are often driven by the same search intent and, hence, it is natural to jointly optimize both tasks. So far, most models proposed for jointly optimizing document reranking (DR) and NQP have focused on users’ short-term intent in an ongoing search session. Because of this limitation, these models fail to account for users’ long-term intent as captured in their historical search sessions. In contrast, we consider a personalized mechanism for learning a user’s profile from their long-term and short-term behavior to simultaneously enhance the performance of DR and NQP in an ongoing search session. We propose a personalized session search model, called Long short-term session search, Network (LostNet), that jointly learns to rerank documents for the current query and predict the next query. LostNet consists of three modules: The hierarchical session-based attention mechanism tracks the fine-grained short-term intent in an ongoing session. The personalized multi-hop memory network tracks a user’s dynamic profile information from their prior search sessions so as to infer their personal search intent. Jointly learning of DR and NQP is aimed at simultaneously reranking documents and predicting the next query based on outputs from the above two modules. We conduct experiments on two large-scale session search benchmark datasets. The results show that LostNet achieves significant improvements over state-of-the-art baselines.
DR和下一次查询预测(NQP)是会话搜索中的两个核心任务。它们通常是由相同的搜索意图驱动的,因此,联合优化这两个任务是很自然的。到目前为止,大多数联合优化文档重排序(DR)和NQP的模型都关注用户在持续搜索会话中的短期意图。由于这种限制,这些模型无法解释用户在历史搜索会话中捕获的长期意图。相比之下,我们考虑了一种个性化的机制,从用户的长期和短期行为中学习用户的个人资料,同时提高DR和NQP在持续搜索会话中的性能。我们提出了一种个性化的会话搜索模型,称为长短期会话搜索网络(LostNet),它共同学习为当前查询重新排序文档并预测下一个查询。LostNet由三个模块组成:基于会话的分层注意力机制跟踪正在进行的会话中的细粒度短期意图。所述个性化多跳存储器网络从其先前搜索会话中跟踪用户的动态配置信息,从而推断其个人搜索意图。DR和NQP的联合学习旨在同时对文档进行重新排序,并根据上述两个模块的输出预测下一个查询。我们在两个大规模会话搜索基准数据集上进行了实验。结果表明,与最先进的基线相比,LostNet实现了显著改进。
{"title":"Long Short-Term Session Search: Joint Personalized Reranking and Next Query Prediction","authors":"Qiannan Cheng, Z. Ren, Yujie Lin, Pengjie Ren, Zhumin Chen, Xiangyuan Liu, M. de Rijke","doi":"10.1145/3442381.3449941","DOIUrl":"https://doi.org/10.1145/3442381.3449941","url":null,"abstract":"DR and next query prediction (NQP) are two core tasks in session search. They are often driven by the same search intent and, hence, it is natural to jointly optimize both tasks. So far, most models proposed for jointly optimizing document reranking (DR) and NQP have focused on users’ short-term intent in an ongoing search session. Because of this limitation, these models fail to account for users’ long-term intent as captured in their historical search sessions. In contrast, we consider a personalized mechanism for learning a user’s profile from their long-term and short-term behavior to simultaneously enhance the performance of DR and NQP in an ongoing search session. We propose a personalized session search model, called Long short-term session search, Network (LostNet), that jointly learns to rerank documents for the current query and predict the next query. LostNet consists of three modules: The hierarchical session-based attention mechanism tracks the fine-grained short-term intent in an ongoing session. The personalized multi-hop memory network tracks a user’s dynamic profile information from their prior search sessions so as to infer their personal search intent. Jointly learning of DR and NQP is aimed at simultaneously reranking documents and predicting the next query based on outputs from the above two modules. We conduct experiments on two large-scale session search benchmark datasets. The results show that LostNet achieves significant improvements over state-of-the-art baselines.","PeriodicalId":106672,"journal":{"name":"Proceedings of the Web Conference 2021","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125765528","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
WiseKG: Balanced Access to Web Knowledge Graphs WiseKG:平衡访问网络知识图谱
Pub Date : 2021-04-19 DOI: 10.1145/3442381.3449911
Amr Azzam, Christian Aebeloe, Gabriela Montoya, Ilkcan Keles, A. Polleres, K. Hose
SPARQL query services that balance processing between clients and servers become more and more essential to handle the increasing load for open and decentralized knowledge graphs on the Web. To this end, Linked Data Fragments (LDF) have introduced a foundational framework that has sparked research exploring a spectrum of potential Web querying interfaces in between server-side query processing via SPARQL endpoints and client-side query processing of data dumps. Current proposals in between typically suffer from imbalanced load on either the client or the server. In this paper, to the best of our knowledge, we present the first work that combines both client-side and server-side query optimization techniques in a truly dynamic fashion: we introduce WiseKG, a system that employs a cost model that dynamically delegates the load between servers and clients by combining client-side processing of shipped partitions with efficient server-side processing of star-shaped sub-queries, based on current server workload and client capabilities. Our experiments show that WiseKG significantly outperforms state-of-the-art solutions in terms of average total query execution time per client, while at the same time decreasing network traffic and increasing server-side availability.
SPARQL查询服务平衡了客户机和服务器之间的处理,对于处理Web上开放和分散的知识图日益增加的负载变得越来越重要。为此,关联数据片段(Linked Data Fragments, LDF)引入了一个基础框架,该框架引发了对通过SPARQL端点进行服务器端查询处理和对数据转储进行客户端查询处理之间潜在Web查询接口的研究。当前处于两者之间的提案通常会受到客户机或服务器上负载不平衡的影响。在本文中,据我们所知,我们展示了第一个以真正动态的方式结合客户端和服务器端查询优化技术的工作:我们介绍了WiseKG,这是一个采用成本模型的系统,它根据当前服务器工作负载和客户端功能,将已交付分区的客户端处理与有效的服务器端星形子查询处理结合起来,动态地在服务器和客户端之间分配负载。我们的实验表明,WiseKG在每个客户机的平均总查询执行时间方面明显优于最先进的解决方案,同时减少了网络流量并提高了服务器端可用性。
{"title":"WiseKG: Balanced Access to Web Knowledge Graphs","authors":"Amr Azzam, Christian Aebeloe, Gabriela Montoya, Ilkcan Keles, A. Polleres, K. Hose","doi":"10.1145/3442381.3449911","DOIUrl":"https://doi.org/10.1145/3442381.3449911","url":null,"abstract":"SPARQL query services that balance processing between clients and servers become more and more essential to handle the increasing load for open and decentralized knowledge graphs on the Web. To this end, Linked Data Fragments (LDF) have introduced a foundational framework that has sparked research exploring a spectrum of potential Web querying interfaces in between server-side query processing via SPARQL endpoints and client-side query processing of data dumps. Current proposals in between typically suffer from imbalanced load on either the client or the server. In this paper, to the best of our knowledge, we present the first work that combines both client-side and server-side query optimization techniques in a truly dynamic fashion: we introduce WiseKG, a system that employs a cost model that dynamically delegates the load between servers and clients by combining client-side processing of shipped partitions with efficient server-side processing of star-shaped sub-queries, based on current server workload and client capabilities. Our experiments show that WiseKG significantly outperforms state-of-the-art solutions in terms of average total query execution time per client, while at the same time decreasing network traffic and increasing server-side availability.","PeriodicalId":106672,"journal":{"name":"Proceedings of the Web Conference 2021","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127272691","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
FANCY: Human-centered, Deep Learning-based Framework for Fashion Style Analysis FANCY:以人为本,基于深度学习的时尚风格分析框架
Pub Date : 2021-04-19 DOI: 10.1145/3442381.3449833
Youngseung Jeon, Seungwan Jin, Kyungsik Han
Fashion style analysis is of the utmost importance for fashion professionals. However, it has an issue of having different style classification criteria that rely heavily on professionals’ subjective experiences with no quantitative criteria. We present FANCY (Fashion Attributes detectioN for Clustering stYle), a human-centered, deep learning-based framework to support fashion professionals’ analytic tasks using a computational method integrated with their insights. We work closely with fashion professionals in the whole study process to reflect their domain knowledge and experience as much as possible. We redefine fashion attributes, demonstrate a strong association with fashion attributes and styles, and develop a deep learning model that detects attributes in a given fashion image and reflects fashion professionals’ insight. Based on attribute-annotated 302,772 runway fashion images, we developed 25 new fashion styles (FANCY dataset 1). We summarize quantitative standards of the fashion style groups and present fashion trends based on time, location, and brand.
时尚风格分析对时尚专业人士来说是至关重要的。然而,它有一个问题,即有不同的风格分类标准,严重依赖于专业人员的主观经验,没有定量的标准。我们提出了FANCY (Fashion Attributes detectioN for Clustering stYle),这是一个以人为中心的、基于深度学习的框架,它使用一种与时尚专业人士的见解相结合的计算方法来支持时尚专业人士的分析任务。在整个学习过程中,我们与时尚专业人士密切合作,尽可能多地反映他们的领域知识和经验。我们重新定义了时尚属性,展示了与时尚属性和风格的强烈关联,并开发了一个深度学习模型,可以检测给定时尚图像中的属性,并反映时尚专业人士的见解。基于属性标注的302772张t台时尚图像,我们开发了25种新的时尚风格(FANCY数据集1)。我们总结了时尚风格组的定量标准,并基于时间、地点和品牌呈现了时尚趋势。
{"title":"FANCY: Human-centered, Deep Learning-based Framework for Fashion Style Analysis","authors":"Youngseung Jeon, Seungwan Jin, Kyungsik Han","doi":"10.1145/3442381.3449833","DOIUrl":"https://doi.org/10.1145/3442381.3449833","url":null,"abstract":"Fashion style analysis is of the utmost importance for fashion professionals. However, it has an issue of having different style classification criteria that rely heavily on professionals’ subjective experiences with no quantitative criteria. We present FANCY (Fashion Attributes detectioN for Clustering stYle), a human-centered, deep learning-based framework to support fashion professionals’ analytic tasks using a computational method integrated with their insights. We work closely with fashion professionals in the whole study process to reflect their domain knowledge and experience as much as possible. We redefine fashion attributes, demonstrate a strong association with fashion attributes and styles, and develop a deep learning model that detects attributes in a given fashion image and reflects fashion professionals’ insight. Based on attribute-annotated 302,772 runway fashion images, we developed 25 new fashion styles (FANCY dataset 1). We summarize quantitative standards of the fashion style groups and present fashion trends based on time, location, and brand.","PeriodicalId":106672,"journal":{"name":"Proceedings of the Web Conference 2021","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131944200","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
IFSpard: An Information Fusion-based Framework for Spam Review Detection IFSpard:基于信息融合的垃圾邮件检测框架
Pub Date : 2021-04-19 DOI: 10.1145/3442381.3449920
Yao Zhu, Hongzhi Liu, Yingpeng Du, Zhonghai Wu
Online reviews, which contain the quality information and user experience about products, always affect the consumption decisions of customers. Unfortunately, quite a number of spammers attempt to mislead consumers by writing fake reviews for some intents. Existing methods for detecting spam reviews mainly focus on constructing discriminative features, which heavily depend on experts and may miss some complex but effective features. Recently, some models attempt to learn the latent representations of reviews, users, and items. However, the learned embeddings usually lack interpretability. Moreover, most of existing methods are based on single classification model while ignoring the complementarity of different classification models. To solve these problems, we propose IFSpard, a novel information fusion-based framework that aims at exploring and exploiting useful information from various aspects for spam review detection. First, we design a graph-based feature extraction method and an interaction-mining-based feature crossing method to automatically extract basic and complex features with consideration of different sources of data. Then, we propose a mutual-information-based feature selection and representation learning method to remove the irrelevant and redundant information contained in the automatically constructed features. Finally, we devise an adaptive ensemble model to make use of the information of constructed features and the abilities of different classifiers for spam review detection. Experimental results on several public datasets show that the proposed model performs better than state-of-the-art methods.
在线评论包含了产品的质量信息和用户体验,影响着消费者的消费决策。不幸的是,相当多的垃圾邮件发送者试图通过撰写虚假评论来误导消费者。现有的垃圾评论检测方法主要集中在构建判别特征上,严重依赖专家,可能会遗漏一些复杂但有效的特征。最近,一些模型试图学习评论、用户和项目的潜在表示。然而,学习到的嵌入通常缺乏可解释性。此外,现有的方法大多基于单一的分类模型,忽略了不同分类模型之间的互补性。为了解决这些问题,我们提出了一种新的基于信息融合的框架IFSpard,旨在从各个方面探索和利用有用的信息来检测垃圾邮件。首先,我们设计了一种基于图的特征提取方法和一种基于交互挖掘的特征交叉方法,在考虑不同数据源的情况下自动提取基本特征和复杂特征。然后,我们提出了一种基于互信息的特征选择和表示学习方法来去除自动构造的特征中包含的不相关和冗余信息。最后,我们设计了一个自适应集成模型,利用构造的特征信息和不同分类器的能力进行垃圾邮件审查检测。在多个公开数据集上的实验结果表明,该模型的性能优于现有的方法。
{"title":"IFSpard: An Information Fusion-based Framework for Spam Review Detection","authors":"Yao Zhu, Hongzhi Liu, Yingpeng Du, Zhonghai Wu","doi":"10.1145/3442381.3449920","DOIUrl":"https://doi.org/10.1145/3442381.3449920","url":null,"abstract":"Online reviews, which contain the quality information and user experience about products, always affect the consumption decisions of customers. Unfortunately, quite a number of spammers attempt to mislead consumers by writing fake reviews for some intents. Existing methods for detecting spam reviews mainly focus on constructing discriminative features, which heavily depend on experts and may miss some complex but effective features. Recently, some models attempt to learn the latent representations of reviews, users, and items. However, the learned embeddings usually lack interpretability. Moreover, most of existing methods are based on single classification model while ignoring the complementarity of different classification models. To solve these problems, we propose IFSpard, a novel information fusion-based framework that aims at exploring and exploiting useful information from various aspects for spam review detection. First, we design a graph-based feature extraction method and an interaction-mining-based feature crossing method to automatically extract basic and complex features with consideration of different sources of data. Then, we propose a mutual-information-based feature selection and representation learning method to remove the irrelevant and redundant information contained in the automatically constructed features. Finally, we devise an adaptive ensemble model to make use of the information of constructed features and the abilities of different classifiers for spam review detection. Experimental results on several public datasets show that the proposed model performs better than state-of-the-art methods.","PeriodicalId":106672,"journal":{"name":"Proceedings of the Web Conference 2021","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130235801","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
期刊
Proceedings of the Web Conference 2021
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1