Efficient Probabilistic Truss Indexing on Uncertain Graphs
Zitang Sun, Xin Huang, Jianliang Xu, F. Bonchi
DOI: 10.1145/3442381.3449976

Networks in many real-world applications come with inherent uncertainty in their structure, due to, e.g., noisy measurements, inference and prediction models, or privacy protection. Modeling and analyzing uncertain graphs has attracted a great deal of attention. Among the various graph analytic tasks studied, the extraction of dense substructures, such as cores or trusses, plays a central role. In this paper, we study the problem of (k, γ)-truss indexing and querying over an uncertain graph. A (k, γ)-truss is the largest subgraph such that the probability of each edge being contained in at least k − 2 triangles is no less than γ. Our first proposal, the CPT-index, keeps all the (k, γ)-trusses: retrieval for any given k and γ runs in optimal linear time w.r.t. the size of the queried (k, γ)-truss. We develop a bottom-up CPT-index construction scheme and an improved algorithm for fast CPT-index construction using top-down graph partitions. To trade off between (k, γ)-truss offline indexing and online querying, we further develop an approximate indexing approach, (ϵ, Δr)-APX, equipped with two parameters, ϵ and Δr, that govern the tolerated errors. Extensive experiments on large-scale uncertain graphs with 261 million edges validate the efficiency of our proposed indexing and querying algorithms against state-of-the-art methods.
{"title":"Efficient Probabilistic Truss Indexing on Uncertain Graphs","authors":"Zitang Sun, Xin Huang, Jianliang Xu, F. Bonchi","doi":"10.1145/3442381.3449976","DOIUrl":"https://doi.org/10.1145/3442381.3449976","url":null,"abstract":"Networks in many real-world applications come with an inherent uncertainty in their structure, due to e.g., noisy measurements, inference and prediction models, or for privacy purposes. Modeling and analyzing uncertain graphs has attracted a great deal of attention. Among the various graph analytic tasks studied, the extraction of dense substructures, such as cores or trusses, has a central role. In this paper, we study the problem of (k, γ)-truss indexing and querying over an uncertain graph . A (k, γ)-truss is the largest subgraph of , such that the probability of each edge being contained in at least k − 2 triangles is no less than γ. Our first proposal, CPT-index, keeps all the (k, γ)-trusses: retrieval for any given k and γ can be executed in an optimal linear time w.r.t. the graph size of the queried (k, γ)-truss. We develop a bottom-up CPT-indexconstruction scheme and an improved algorithm for fast CPT-indexconstruction using top-down graph partitions. For trading off between (k, γ)-truss offline indexing and online querying, we further develop an approximate indexing approach (ϵ, Δr)-APXequipped with two parameters, ϵ and Δr, that govern tolerated errors. Extensive experiments using large-scale uncertain graphs with 261 million edges validate the efficiency of our proposed indexing and querying algorithms against state-of-the-art methods.","PeriodicalId":106672,"journal":{"name":"Proceedings of the Web Conference 2021","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114540927","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Unsupervised Lifelong Learning with Curricula
Yi He, Sheng Chen, Baijun Wu, Xu Yuan, Xindong Wu
DOI: 10.1145/3442381.3449839

Lifelong machine learning (LML) has driven the development of extensive web applications, enabling learning systems deployed on web servers to deal with a sequence of tasks in an incremental fashion. Such systems can retain knowledge from learned tasks in a knowledge base and seamlessly apply it to improve future learning. Unfortunately, most existing LML methods require labels in every task, whereas providing persistent human labeling for all future tasks is costly, onerous, error-prone, and hence impractical. Motivated by this situation, we propose a new paradigm named unsupervised lifelong learning with curricula (ULLC), where only one task needs to be labeled for initialization and the system then performs lifelong learning for subsequent tasks in an unsupervised fashion. A main challenge in realizing this paradigm lies in negative knowledge transfer, where partial old knowledge becomes detrimental for learning a given task yet cannot be filtered out by the learner without the help of labels. To overcome this challenge, we draw insights from the learning behaviors of humans. Specifically, when faced with a difficult task that cannot be well tackled with our current knowledge, we usually postpone it and work on some easier tasks first, which allows us to grow our knowledge. Once we return to the postponed task, we are more likely to tackle it well as we are more knowledgeable now. The key idea of ULLC is similar: at any time, a pool of candidate tasks is organized into a curriculum by their distances to the knowledge base. The learner starts from the closer tasks, accumulates knowledge from learning them, and moves on to the faraway tasks with a gradually augmented knowledge base. The viability and effectiveness of our proposal are substantiated through extensive empirical studies on both synthetic and real datasets.
{"title":"Unsupervised Lifelong Learning with Curricula","authors":"Yi He, Sheng Chen, Baijun Wu, Xu Yuan, Xindong Wu","doi":"10.1145/3442381.3449839","DOIUrl":"https://doi.org/10.1145/3442381.3449839","url":null,"abstract":"Lifelong machine learning (LML) has driven the development of extensive web applications, enabling the learning systems deployed on web servers to deal with a sequence of tasks in an incremental fashion. Such systems can retain knowledge from learned tasks in a knowledge base and seamlessly apply it to improve the future learning. Unfortunately, most existing LML methods require labels in every task, whereas providing persistent human labeling for all future tasks is costly, onerous, error-prone, and hence impractical. Motivated by this situation, we propose a new paradigm named unsupervised lifelong learning with curricula (ULLC), where only one task needs to be labeled for initialization and the system then performs lifelong learning for subsequent tasks in an unsupervised fashion. A main challenge of realizing this paradigm lies in the occurrence of negative knowledge transfer, where partial old knowledge becomes detrimental for learning a given task yet cannot be filtered out by the learner without the help of labels. To overcome this challenge, we draw insights from the learning behaviors of humans. Specifically, when faced with a difficult task that cannot be well tackled by our current knowledge, we usually postpone it and work on some easier tasks first, which allows us to grow our knowledge. Thereafter, once we go back to the postponed task, we are more likely to tackle it well as we are more knowledgeable now. The key idea of ULLC is similar – at any time, a pool of candidate tasks are organized in a curriculum by their distances to the knowledge base. The learner then starts from the closer tasks, accumulates knowledge from learning them, and moves to learn the faraway tasks with a gradually augmented knowledge base. The viability and effectiveness of our proposal are substantiated through extensive empirical studies on both synthetic and real datasets.","PeriodicalId":106672,"journal":{"name":"Proceedings of the Web Conference 2021","volume":"67 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114285604","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Disentangling User Interest and Conformity for Recommendation with Causal Embedding
Y. Zheng, Chen Gao, Xiang Li, Xiangnan He, Depeng Jin, Yong Li
DOI: 10.1145/3442381.3449788
Recommendation models are usually trained on observational interaction data. However, observational interactions may result from users’ conformity towards popular items, which entangles conformity with users’ real interest. Existing methods treat this problem as one of eliminating popularity bias, e.g., by re-weighting training samples or leveraging a small fraction of unbiased data. However, these approaches ignore the variety of user conformity and bundle the different causes of an interaction into unified representations, so robustness and interpretability are not guaranteed when the underlying causes change. In this paper, we present DICE, a general framework that learns representations in which interest and conformity are structurally disentangled and into which various backbone recommendation models can be smoothly integrated. We assign users and items separate embeddings for interest and conformity, and make each embedding capture only one cause by training on cause-specific data obtained according to the colliding effect in causal inference. Our proposed methodology outperforms state-of-the-art baselines with remarkable improvements on two real-world datasets on top of various backbone models. We further demonstrate that the learned embeddings successfully capture the desired causes, and show that DICE guarantees the robustness and interpretability of recommendation.
{"title":"Disentangling User Interest and Conformity for Recommendation with Causal Embedding","authors":"Y. Zheng, Chen Gao, Xiang Li, Xiangnan He, Depeng Jin, Yong Li","doi":"10.1145/3442381.3449788","DOIUrl":"https://doi.org/10.1145/3442381.3449788","url":null,"abstract":"Recommendation models are usually trained on observational interaction data. However, observational interaction data could result from users’ conformity towards popular items, which entangles users’ real interest. Existing methods tracks this problem as eliminating popularity bias, e.g., by re-weighting training samples or leveraging a small fraction of unbiased data. However, the variety of user conformity is ignored by these approaches, and different causes of an interaction are bundled together as unified representations, hence robustness and interpretability are not guaranteed when underlying causes are changing. In this paper, we present DICE, a general framework that learns representations where interest and conformity are structurally disentangled, and various backbone recommendation models could be smoothly integrated. We assign users and items with separate embeddings for interest and conformity, and make each embedding capture only one cause by training with cause-specific data which is obtained according to the colliding effect of causal inference. Our proposed methodology outperforms state-of-the-art baselines with remarkable improvements on two real-world datasets on top of various backbone models. We further demonstrate that the learned embeddings successfully capture the desired causes, and show that DICE guarantees the robustness and interpretability of recommendation.","PeriodicalId":106672,"journal":{"name":"Proceedings of the Web Conference 2021","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116501207","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Elo-MMR: A Rating System for Massive Multiplayer Competitions
Aram Ebtekar, Paul Liu
DOI: 10.1145/3442381.3450091

Skill estimation mechanisms, colloquially known as rating systems, play an important role in competitive sports and games. They provide a measure of player skill, which incentivizes competitive performances and enables balanced match-ups. In this paper, we present a novel Bayesian rating system for contests with many participants. It is widely applicable to competition formats with discrete ranked matches, such as online programming competitions, obstacle course races, and video games. The system’s simplicity allows us to prove theoretical bounds on its robustness and runtime. In addition, we show that it is incentive-compatible: a player who seeks to maximize their rating will never want to underperform. Experimentally, the rating system surpasses existing systems in prediction accuracy and computes up to an order of magnitude faster.
{"title":"Elo-MMR: A Rating System for Massive Multiplayer Competitions","authors":"Aram Ebtekar, Paul Liu","doi":"10.1145/3442381.3450091","DOIUrl":"https://doi.org/10.1145/3442381.3450091","url":null,"abstract":"Skill estimation mechanisms, colloquially known as rating systems, play an important role in competitive sports and games. They provide a measure of player skill, which incentivizes competitive performances and enables balanced match-ups. In this paper, we present a novel Bayesian rating system for contests with many participants. It is widely applicable to competition formats with discrete ranked matches, such as online programming competitions, obstacle courses races, and video games. The system’s simplicity allows us to prove theoretical bounds on its robustness and runtime. In addition, we show that it is incentive-compatible: a player who seeks to maximize their rating will never want to underperform. Experimentally, the rating system surpasses existing systems in prediction accuracy, and computes faster than existing systems by up to an order of magnitude.","PeriodicalId":106672,"journal":{"name":"Proceedings of the Web Conference 2021","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122371209","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Constructing Explainable Opinion Graphs from Reviews
Nofar Carmeli, Xiaolan Wang, Yoshihiko Suhara, S. Angelidis, Yuliang Li, Jinfeng Li, W. Tan
DOI: 10.1145/3442381.3450081
The Web is a major resource of both factual and subjective information. While there are significant efforts to organize factual information into knowledge bases, there is much less work on organizing opinions, which are abundant in subjective data, into a structured format. We present ExplainIt, a system that extracts and organizes opinions into an opinion graph, which is useful for downstream applications such as generating explainable review summaries and facilitating search over opinion phrases. In such graphs, a node represents a set of semantically similar opinions extracted from reviews, and an edge between two nodes signifies that one node explains the other. ExplainIt mines explanations with a supervised method and groups similar opinions in a weakly supervised way, before combining the opinion clusters and their explanation relationships into an opinion graph. We experimentally demonstrate that the explanation relationships in the generated opinion graph are of good quality, and our labeled datasets for explanation mining and opinion grouping are publicly available at https://github.com/megagonlabs/explainit.
{"title":"Constructing Explainable Opinion Graphs from Reviews","authors":"Nofar Carmeli, Xiaolan Wang, Yoshihiko Suhara, S. Angelidis, Yuliang Li, Jinfeng Li, W. Tan","doi":"10.1145/3442381.3450081","DOIUrl":"https://doi.org/10.1145/3442381.3450081","url":null,"abstract":"The Web is a major resource of both factual and subjective information. While there are significant efforts to organize factual information into knowledge bases, there is much less work on organizing opinions, which are abundant in subjective data, into a structured format. We present ExplainIt, a system that extracts and organizes opinions into an opinion graph, which are useful for downstream applications such as generating explainable review summaries and facilitating search over opinion phrases. In such graphs, a node represents a set of semantically similar opinions extracted from reviews and an edge between two nodes signifies that one node explains the other. ExplainIt mines explanations in a supervised method and groups similar opinions together in a weakly supervised way before combining the clusters of opinions together with their explanation relationships into an opinion graph. We experimentally demonstrate that the explanation relationships generated in the opinion graph are of good quality and our labeled datasets for explanation mining and grouping opinions are publicly available at https://github.com/megagonlabs/explainit.","PeriodicalId":106672,"journal":{"name":"Proceedings of the Web Conference 2021","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130399135","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
An Experimental Study to Understand User Experience and Perception Bias Occurred by Fact-checking Messages
Sungkyu (Shaun) Park, Jamie Yejean Park, Hyojin Chin, Jeong-han Kang, M. Cha
DOI: 10.1145/3442381.3450121
Fact-checking has become the de facto solution for fighting fake news online. This research brings attention to the unexpected and diminished effect of fact-checking due to cognitive biases. We conducted an experiment (66,870 decisions) comparing the change in users’ stance toward unproven claims before and after they were presented with a hypothetical fact-checking condition. We found that, first, claims tagged with the ‘Lack of Evidence’ label are perceived similarly to false information, unlike claims with other borderline labels, indicating an uncertainty-aversion bias in response to insufficient information. Second, users who initially disapprove of a claim are less likely to correct their views later than those who initially approve of the same claim when the opposite fact-checking label is shown, an indication of disapproval bias. Finally, user interviews revealed that, among borderline messages, users are more likely to share claims labeled with Divided Evidence than those labeled with Lack of Evidence, reaffirming the uncertainty-aversion bias. On average, we confirm that fact-checking helps users correct their views and reduces the circulation of falsehoods by leading them to abandon extreme views. At the same time, the presence of these two biases shows that fact-checking does not always elicit the desired user experience and that outcomes vary with the design of fact-checking messages and people’s initial views. These observations have direct implications for multiple stakeholders, including platforms, policy-makers, and online users.
{"title":"An Experimental Study to Understand User Experience and Perception Bias Occurred by Fact-checking Messages","authors":"Sungkyu (Shaun) Park, Jamie Yejean Park, Hyojin Chin, Jeong-han Kang, M. Cha","doi":"10.1145/3442381.3450121","DOIUrl":"https://doi.org/10.1145/3442381.3450121","url":null,"abstract":"Fact-checking has become the de facto solution for fighting fake news online. This research brings attention to the unexpected and diminished effect of fact-checking due to cognitive biases. We experimented (66,870 decisions) comparing the change in users’ stance toward unproven claims before and after being presented with a hypothetical fact-checked condition. We found that, first, the claims tagged with the ‘Lack of Evidence’ label are recognized similarly as false information unlike other borderline labels, indicating the presence of uncertainty-aversion bias in response to insufficient information. Second, users who initially show disapproval toward a claim are less likely to correct their views later than those who initially approve of the same claim when opposite fact-checking labels are shown — an indication of disapproval bias. Finally, user interviews revealed that users are more likely to share claims with Divided Evidence than those with Lack of Evidence among borderline messages, reaffirming the presence of uncertainty-aversion bias. On average, we confirm that fact-checking helps users correct their views and reduces the circulation of falsehoods by leading them to abandon extreme views. Simultaneously, the presence of two biases reveals that fact-checking does not always elicit the desired user experience and that the outcome varies by the design of fact-checking messages and people’s initial view. These new observations have direct implications for multiple stakeholders, including platforms, policy-makers, and online users.","PeriodicalId":106672,"journal":{"name":"Proceedings of the Web Conference 2021","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127788691","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Learning Neural Point Processes with Latent Graphs
Qiang Zhang, Aldo Lipani, Emine Yilmaz
DOI: 10.1145/3442381.3450135

Neural point processes (NPPs) employ neural networks to capture the complicated dynamics of asynchronous event sequences. Existing NPPs feed all history events into neural networks, assuming that all event types contribute to the prediction of the target type. However, this assumption can be problematic, because in reality some event types do not contribute to the predictions of another type. To correct this defect, we learn to omit, during the formulation of an NPP, those event types that do not contribute to the prediction of a target type. Towards this end, we simultaneously consider two tasks: (1) finding the event types that contribute to predictions of the target types, and (2) learning an NPP model from event sequences. For the former, we formulate a latent graph, with event types as vertices and non-zero contributing relationships as directed edges; we then propose a probabilistic graph generator from which we sample a latent graph. For the latter, the sampled graph can be readily used as a plug-in to modify an existing NPP model. Because these two tasks are nested, we propose to optimize the model parameters through bilevel programming, and develop an efficient solution based on truncated gradient back-propagation. Experimental results on both synthetic and real-world datasets show improved performance over state-of-the-art baselines. This work removes the disturbance of non-contributing event types with the aid of a validation procedure, similar to the practice used to mitigate overfitting when training machine learning models.
{"title":"Learning Neural Point Processes with Latent Graphs","authors":"Qiang Zhang, Aldo Lipani, Emine Yilmaz","doi":"10.1145/3442381.3450135","DOIUrl":"https://doi.org/10.1145/3442381.3450135","url":null,"abstract":"Neural point processes (NPPs) employ neural networks to capture complicated dynamics of asynchronous event sequences. Existing NPPs feed all history events into neural networks, assuming that all event types contribute to the prediction of the target type. However, this assumption can be problematic because in reality some event types do not contribute to the predictions of another type. To correct this defect, we learn to omit those types of events that do not contribute to the prediction of one target type during the formulation of NPPs. Towards this end, we simultaneously consider the tasks of (1) finding event types that contribute to predictions of the target types and (2) learning a NPP model from event sequences. For the former, we formulate a latent graph, with event types being vertices and non-zero contributing relationships being directed edges; then we propose a probabilistic graph generator, from which we sample a latent graph. For the latter, the sampled graph can be readily used as a plug-in to modify an existing NPP model. Because these two tasks are nested, we propose to optimize the model parameters through bilevel programming, and develop an efficient solution based on truncated gradient back-propagation. Experimental results on both synthetic and real-world datasets show the improved performance against state-of-the-art baselines. This work removes disturbance of non-contributing event types with the aid of a validation procedure, similar to the practice to mitigate overfitting used when training machine learning models.","PeriodicalId":106672,"journal":{"name":"Proceedings of the Web Conference 2021","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117145366","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Cross-domain Knowledge Distillation for Retrieval-based Question Answering Systems
Cen Chen, Chengyu Wang, Minghui Qiu, D. Gao, Linbo Jin, Wang Li
DOI: 10.1145/3442381.3449814
Question Answering (QA) systems have been extensively studied in both academia and industry due to their wide real-world applications. When building such industrial-scale QA applications, we face two prominent challenges: (i) the lack of sufficient training data to learn an accurate model, and (ii) the need for high inference speed for online model serving. There are generally two ways to mitigate these problems. One is to adopt transfer learning to leverage information from other domains; the other is to distill the “dark knowledge” from a large teacher model into small student models. The former usually employs parameter-sharing mechanisms for knowledge transfer but does not utilize the “dark knowledge” of pre-trained large models. The latter usually does not consider cross-domain information. We argue that these two types of methods can complement each other. Hence, in this work, we provide a new perspective on the potential of the teacher-student paradigm to facilitate cross-domain transfer learning, where the teacher and student tasks belong to heterogeneous domains, with the goal of improving the student model’s performance in the target domain. Our framework leverages the “dark knowledge” learned by large teacher models and also uses adaptive hints to alleviate the domain differences between teacher and student models. Extensive experiments have been conducted on two text-matching tasks for retrieval-based QA systems. Results show that the proposed method outperforms competing methods, including existing state-of-the-art transfer learning methods. We have also deployed our method in an online production system and observed significant improvements over the existing approaches in terms of both accuracy and cross-domain robustness.
{"title":"Cross-domain Knowledge Distillation for Retrieval-based Question Answering Systems","authors":"Cen Chen, Chengyu Wang, Minghui Qiu, D. Gao, Linbo Jin, Wang Li","doi":"10.1145/3442381.3449814","DOIUrl":"https://doi.org/10.1145/3442381.3449814","url":null,"abstract":"Question Answering (QA) systems have been extensively studied in both academia and the research community due to their wide real-world applications. When building such industrial-scale QA applications, we are facing two prominent challenges, i.e., i) lacking a sufficient amount of training data to learn an accurate model and ii) requiring high inference speed for online model serving. There are generally two ways to mitigate the above-mentioned problems. One is to adopt transfer learning to leverage information from other domains; the other is to distill the “dark knowledge” from a large teacher model to small student models. The former usually employs parameter sharing mechanisms for knowledge transfer, but does not utilize the “dark knowledge” of pre-trained large models. The latter usually does not consider the cross-domain information from other domains. We argue that these two types of methods can be complementary to each other. Hence in this work, we provide a new perspective on the potential of the teacher-student paradigm facilitating cross-domain transfer learning, where the teacher and student tasks belong to heterogeneous domains, with the goal to improve the student model’s performance in the target domain. Our framework considers the “dark knowledge” learned from large teacher models and also leverages the adaptive hints to alleviate the domain differences between teacher and student models. Extensive experiments have been conducted on two text matching tasks for retrieval-based QA systems. Results show the proposed method has better performance than the competing methods including the existing state-of-the-art transfer learning methods. We have also deployed our method in an online production system and observed significant improvements compared to the existing approaches in terms of both accuracy and cross-domain robustness.","PeriodicalId":106672,"journal":{"name":"Proceedings of the Web Conference 2021","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115218371","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
High-dimensional Sparse Embeddings for Collaborative Filtering
J. V. Balen, Bart Goethals
DOI: 10.1145/3442381.3450054

A widely adopted paradigm in the design of recommender systems is to represent users and items as vectors, often referred to as latent factors or embeddings. Embeddings can be obtained using a variety of recommendation models and served in production using a variety of data engineering solutions. Embeddings also facilitate transfer learning, where trained embeddings from one model are reused in another. In contrast, some of the best-performing collaborative filtering models today are high-dimensional linear models that do not rely on factorization and so do not produce embeddings [27, 28]. They also require pruning, amounting to a trade-off between the model size and the density of the predicted affinities. This paper argues instead for the use of high-dimensional, sparse latent factor models. We propose a new recommendation model based on a full-rank factorization of the inverse Gram matrix. The resulting high-dimensional embeddings can be made sparse while still factorizing a dense affinity matrix. We show how the embeddings combine the advantages of latent representations with the performance of high-dimensional linear models.
{"title":"High-dimensional Sparse Embeddings for Collaborative Filtering","authors":"J. V. Balen, Bart Goethals","doi":"10.1145/3442381.3450054","DOIUrl":"https://doi.org/10.1145/3442381.3450054","url":null,"abstract":"A widely adopted paradigm in the design of recommender systems is to represent users and items as vectors, often referred to as latent factors or embeddings. Embeddings can be obtained using a variety of recommendation models and served in production using a variety of data engineering solutions. Embeddings also facilitate transfer learning, where trained embeddings from one model are reused in another. In contrast, some of the best-performing collaborative filtering models today are high-dimensional linear models that do not rely on factorization, and so they do not produce embeddings [27, 28]. They also require pruning, amounting to a trade-off between the model size and the density of the predicted affinities. This paper argues for the use of high-dimensional, sparse latent factor models, instead. We propose a new recommendation model based on a full-rank factorization of the inverse Gram matrix. The resulting high-dimensional embeddings can be made sparse while still factorizing a dense affinity matrix. We show how the embeddings combine the advantages of latent representations with the performance of high-dimensional linear models.","PeriodicalId":106672,"journal":{"name":"Proceedings of the Web Conference 2021","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115442795","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Incentive Mechanism for Horizontal Federated Learning Based on Reputation and Reverse Auction
Jingwen Zhang, Yuezhou Wu, Rong Pan
DOI: 10.1145/3442381.3449888

Current research on federated learning mainly focuses on joint optimization, improving efficiency and effectiveness, and protecting privacy. However, there are relatively few studies on incentive mechanisms. Most studies overlook the fact that, without profit, participants have no incentive to provide data and train models, and task requesters cannot identify and select reliable participants with high-quality data. Therefore, this paper proposes a federated learning incentive mechanism based on reputation and reverse auction theory. Participants bid for tasks, and reputation indirectly reflects their reliability and data quality. Under a limited budget, we select and reward participants by combining their reputations and bids. Theoretical analysis proves that the mechanism satisfies computational efficiency, individual rationality, budget feasibility, and truthfulness. Simulation results show the effectiveness of the mechanism.
{"title":"Incentive Mechanism for Horizontal Federated Learning Based on Reputation and Reverse Auction","authors":"Jingwen Zhang, Yuezhou Wu, Rong Pan","doi":"10.1145/3442381.3449888","DOIUrl":"https://doi.org/10.1145/3442381.3449888","url":null,"abstract":"Current research on federated learning mainly focuses on joint optimization, improving efficiency and effectiveness, and protecting privacy. However, there are relatively few studies on incentive mechanisms. Most studies fail to consider the fact that if there is no profit, participants have no incentive to provide data and training models, and task requesters cannot identify and select reliable participants with high-quality data. Therefore, this paper proposes a federated learning incentive mechanism based on reputation and reverse auction theory. Participants bid for tasks, and reputation indirectly reflects their reliability and data quality. In this federated learning program, we select and reward participants by combining the reputation and bids of the participants under a limited budget. Theoretical analysis proves that the mechanism satisfies computational efficiency, individual rationality, budget feasibility, and truthfulness. The simulation results show the effectiveness of the mechanism.","PeriodicalId":106672,"journal":{"name":"Proceedings of the Web Conference 2021","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128706275","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}