Proceedings of the 13th International Conference on Web Search and Data Mining最新文献_第5页

Key Opinion Leaders in Recommendation Systems: Opinion Elicitation and Diffusion 推荐系统中的关键意见领袖:意见的激发和传播

Proceedings of the 13th International Conference on Web Search and Data Mining

Pub Date : 2020-01-20 DOI: 10.1145/3336191.3371826

Jianling Wang, Kaize Ding, Ziwei Zhu, Yin Zhang, James Caverlee

Recommendation systems typically rely on the interactions between a crowd of ordinary users and items, ignoring the fact that many real-world communities are notably influenced by a small group of key opinion leaders, whose feedback on items wields outsize influence. With important positions in the community (e.g. have a large number of followers), their elite opinions are able to diffuse to the community and further impact what items we buy, what media we consume, and how we interact with online platforms. Hence, this paper investigates how to develop a novel recommendation system by explicitly capturing the influence from key opinion leaders to the whole community. Centering around opinion elicitation and diffusion, we propose an end-to-end Graph-based neural model - GoRec. Specifically, to preserve the multi-relations between key opinion leaders and items, GoRec elicits the opinions from key opinion leaders with a translation-based embedding method. Moreover, GoRec adopts the idea of Graph Neural Networks to model the elite opinion diffusion process for improved recommendation. Through experiments on Goodreads and Epinions, the proposed model outperforms state-of-the-art approaches by 10.75% and 9.28% on average in Top-K item recommendation.

推荐系统通常依赖于一群普通用户和物品之间的互动，忽略了一个事实，即许多现实世界的社区明显受到一小群关键意见领袖的影响，他们对物品的反馈具有巨大的影响力。他们在社区中占有重要的地位(例如拥有大量的粉丝)，他们的精英意见能够传播到社区，并进一步影响我们购买什么东西，消费什么媒体，以及我们如何与网络平台互动。因此，本文研究了如何通过明确捕获关键意见领袖对整个社区的影响来开发一种新的推荐系统。围绕意见的激发和扩散，我们提出了一个端到端的基于图的神经模型——GoRec。具体而言，为了保持关键意见领袖与项目之间的多重关系，GoRec采用基于翻译的嵌入方法从关键意见领袖中引出意见。此外，GoRec采用图神经网络的思想对精英意见扩散过程进行建模，以改进推荐。通过在Goodreads和Epinions上的实验，本文提出的模型在Top-K条目推荐上的平均表现比现有方法高出10.75%和9.28%。

{"title":"Key Opinion Leaders in Recommendation Systems: Opinion Elicitation and Diffusion","authors":"Jianling Wang, Kaize Ding, Ziwei Zhu, Yin Zhang, James Caverlee","doi":"10.1145/3336191.3371826","DOIUrl":"https://doi.org/10.1145/3336191.3371826","url":null,"abstract":"Recommendation systems typically rely on the interactions between a crowd of ordinary users and items, ignoring the fact that many real-world communities are notably influenced by a small group of key opinion leaders, whose feedback on items wields outsize influence. With important positions in the community (e.g. have a large number of followers), their elite opinions are able to diffuse to the community and further impact what items we buy, what media we consume, and how we interact with online platforms. Hence, this paper investigates how to develop a novel recommendation system by explicitly capturing the influence from key opinion leaders to the whole community. Centering around opinion elicitation and diffusion, we propose an end-to-end Graph-based neural model - GoRec. Specifically, to preserve the multi-relations between key opinion leaders and items, GoRec elicits the opinions from key opinion leaders with a translation-based embedding method. Moreover, GoRec adopts the idea of Graph Neural Networks to model the elite opinion diffusion process for improved recommendation. Through experiments on Goodreads and Epinions, the proposed model outperforms state-of-the-art approaches by 10.75% and 9.28% on average in Top-K item recommendation.","PeriodicalId":319008,"journal":{"name":"Proceedings of the 13th International Conference on Web Search and Data Mining","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122512015","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 19

Practice of Efficient Data Collection via Crowdsourcing: Aggregation, Incremental Relabelling, and Pricing 通过众包有效收集数据的实践:聚合、增量重新标签和定价

Proceedings of the 13th International Conference on Web Search and Data Mining

Pub Date : 2020-01-20 DOI: 10.1145/3336191.3371875

Alexey Drutsa, Valentina Fedorova, Dmitry Ustalov, Olga Megorskaya, Evfrosiniya Zerminova, Daria Baidakova

In this tutorial, we present a portion of unique industry experience in efficient data labelling via crowdsourcing shared by both leading researchers and engineers from Yandex. We will make an introduction to data labelling via public crowdsourcing marketplaces and will present key components of efficient label collection. This will be followed by a practice session, where participants will choose one of the real label collection tasks, experiment with selecting settings for the labelling process, and launch their label collection project on Yandex.Toloka, one of the largest crowdsourcing marketplaces. The projects will be run on real crowds within the tutorial session. Finally, participants will receive a feedback about their projects and practical advice to make them more efficient. We expect that our tutorial will address an audience with a wide range of background and interests. We do not require specific prerequisite knowledge or skills. We invite beginners, advanced specialists, and researchers to learn how to efficiently collect labelled data.

在本教程中，我们将介绍Yandex领先的研究人员和工程师通过众包分享的有效数据标签的部分独特行业经验。我们将通过公共众包市场介绍数据标签，并将介绍有效标签收集的关键组成部分。接下来是一个练习环节，参与者将选择一个真正的标签收集任务，尝试选择标签过程的设置，并在Yandex上启动他们的标签收集项目。Toloka，最大的众包市场之一。这些项目将在辅导课程中在真实人群中进行。最后，参加者将收到有关他们的项目的反馈和实用的建议，以提高他们的效率。我们希望我们的教程能够满足具有广泛背景和兴趣的读者。我们不需要特定的先决知识或技能。我们邀请初学者，高级专家和研究人员学习如何有效地收集标记数据。

{"title":"Practice of Efficient Data Collection via Crowdsourcing: Aggregation, Incremental Relabelling, and Pricing","authors":"Alexey Drutsa, Valentina Fedorova, Dmitry Ustalov, Olga Megorskaya, Evfrosiniya Zerminova, Daria Baidakova","doi":"10.1145/3336191.3371875","DOIUrl":"https://doi.org/10.1145/3336191.3371875","url":null,"abstract":"In this tutorial, we present a portion of unique industry experience in efficient data labelling via crowdsourcing shared by both leading researchers and engineers from Yandex. We will make an introduction to data labelling via public crowdsourcing marketplaces and will present key components of efficient label collection. This will be followed by a practice session, where participants will choose one of the real label collection tasks, experiment with selecting settings for the labelling process, and launch their label collection project on Yandex.Toloka, one of the largest crowdsourcing marketplaces. The projects will be run on real crowds within the tutorial session. Finally, participants will receive a feedback about their projects and practical advice to make them more efficient. We expect that our tutorial will address an audience with a wide range of background and interests. We do not require specific prerequisite knowledge or skills. We invite beginners, advanced specialists, and researchers to learn how to efficiently collect labelled data.","PeriodicalId":319008,"journal":{"name":"Proceedings of the 13th International Conference on Web Search and Data Mining","volume":"56 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126668955","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 8

Computer Vision for Fashion: From Individual Recommendations to World-wide Trends 时尚的计算机视觉:从个人推荐到全球趋势

Proceedings of the 13th International Conference on Web Search and Data Mining

Pub Date : 2020-01-20 DOI: 10.1145/3336191.3372192

K. Grauman

The fashion domain is a magnet for computer vision. New vision problems are emerging in step with the fashion industry's rapid evolution towards an online, social, and personalized business. Style models, trend forecasting, interactive search, and recommendation all require visual understanding with rich detail and subtlety. As a result, research in this area is poised to have great influence on how people shop, how the fashion industry analyzes its enterprise, and how we model the cultural trends revealed by what people wear. In this talk, I will present our work over the last few years developing computer vision methods for fashion. To begin, we explore how to discover styles from Web photos, learning how people assemble their outfits and the latent themes they share. Leveraging such styles, we show how to infer compatibility of new garments, optimize personalized mix-and-match capsule wardrobes, suggest minimal edits to make an outfit more fashionable, and recommend clothing that flatters diverse human body shapes. Next, turning to the world stage, we investigate fashion forecasting and influence. Given photos of fashion products, we learn to forecast what looks and styles will be popular in the future. We further boost those forecasts by modeling the spatio-temporal style influences between 44 major world cities. Throughout, by learning models from unlabeled Web photos, our approaches sidestep subjective manual annotations in favor of direct observations of what people choose to wear.

时尚领域对计算机视觉来说是一块磁石。随着时尚行业向在线、社交和个性化业务的快速发展，新的视力问题也随之出现。风格模型、趋势预测、交互式搜索和推荐都需要具有丰富细节和微妙的视觉理解。因此，这一领域的研究将对人们如何购物、时尚行业如何分析其企业、以及我们如何通过人们的穿着来塑造文化趋势产生重大影响。在这次演讲中，我将展示我们在过去几年里为时尚开发计算机视觉方法的工作。首先，我们探索如何从网络照片中发现风格，了解人们如何组合他们的服装以及他们共享的潜在主题。利用这些风格，我们展示了如何推断新衣服的兼容性，优化个性化的混搭胶囊衣橱，建议最小的编辑使一套衣服更时尚，并推荐适合不同人体形状的衣服。接下来，转向世界舞台，我们研究时尚预测和影响力。给定时尚产品的照片，我们学会预测未来流行的外观和风格。我们通过模拟44个世界主要城市之间的时空风格影响，进一步加强了这些预测。在整个过程中，通过从未标记的网络照片中学习模型，我们的方法避免了主观的手动注释，而是倾向于直接观察人们选择穿什么。

{"title":"Computer Vision for Fashion: From Individual Recommendations to World-wide Trends","authors":"K. Grauman","doi":"10.1145/3336191.3372192","DOIUrl":"https://doi.org/10.1145/3336191.3372192","url":null,"abstract":"The fashion domain is a magnet for computer vision. New vision problems are emerging in step with the fashion industry's rapid evolution towards an online, social, and personalized business. Style models, trend forecasting, interactive search, and recommendation all require visual understanding with rich detail and subtlety. As a result, research in this area is poised to have great influence on how people shop, how the fashion industry analyzes its enterprise, and how we model the cultural trends revealed by what people wear. In this talk, I will present our work over the last few years developing computer vision methods for fashion. To begin, we explore how to discover styles from Web photos, learning how people assemble their outfits and the latent themes they share. Leveraging such styles, we show how to infer compatibility of new garments, optimize personalized mix-and-match capsule wardrobes, suggest minimal edits to make an outfit more fashionable, and recommend clothing that flatters diverse human body shapes. Next, turning to the world stage, we investigate fashion forecasting and influence. Given photos of fashion products, we learn to forecast what looks and styles will be popular in the future. We further boost those forecasts by modeling the spatio-temporal style influences between 44 major world cities. Throughout, by learning models from unlabeled Web photos, our approaches sidestep subjective manual annotations in favor of direct observations of what people choose to wear.","PeriodicalId":319008,"journal":{"name":"Proceedings of the 13th International Conference on Web Search and Data Mining","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128133514","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 7

Improving the Estimation of Tail Ratings in Recommender System with Multi-Latent Representations 基于多潜在表示的推荐系统尾部评级估计改进

Proceedings of the 13th International Conference on Web Search and Data Mining

Pub Date : 2020-01-20 DOI: 10.1145/3336191.3371810

Xing Zhao, Ziwei Zhu, Yin Zhang, James Caverlee

The importance of the distribution of ratings on recommender systems (RS) is well-recognized. And yet, recommendation approaches based on latent factor models and recently introduced neural variants (e.g., NCF) optimize for the head of these distributions, potentially leading to large estimation errors for tail ratings. These errors in tail ratings that are far from the mean predicted rating fall out of a uni-modal assumption underlying these popular models, as we show in this paper. We propose to improve the estimation of tail ratings by extending traditional single latent representations (e.g., an item is represented by a single latent vector) with new multi-latent representations for better modeling these tail ratings. We show how to incorporate these multi-latent representations in an end-to-end neural prediction model that is designed to better reflect the underlying ratings distributions of items. Through experiments over six datasets, we find the proposed model leads to a significant improvement in RMSE versus a suite of benchmark methods. We also find that the predictions for the most polarized items are improved by more than 15%.

评分分布在推荐系统(RS)上的重要性是公认的。然而，基于潜在因素模型和最近引入的神经变体(例如NCF)的推荐方法针对这些分布的头部进行了优化，这可能会导致尾部评级的估计误差很大。正如我们在本文中所展示的那样，这些偏离平均预测评级的尾部评级错误脱离了这些流行模型的单模态假设。我们建议通过扩展传统的单潜在表示(例如，一个项目由单个潜在向量表示)来改进尾部评级的估计，并使用新的多潜在表示来更好地建模这些尾部评级。我们展示了如何将这些多潜表征合并到端到端神经预测模型中，该模型旨在更好地反映项目的潜在评分分布。通过对六个数据集的实验，我们发现与一套基准方法相比，所提出的模型导致RMSE的显着改进。我们还发现，对最极化的项目的预测提高了15%以上。

{"title":"Improving the Estimation of Tail Ratings in Recommender System with Multi-Latent Representations","authors":"Xing Zhao, Ziwei Zhu, Yin Zhang, James Caverlee","doi":"10.1145/3336191.3371810","DOIUrl":"https://doi.org/10.1145/3336191.3371810","url":null,"abstract":"The importance of the distribution of ratings on recommender systems (RS) is well-recognized. And yet, recommendation approaches based on latent factor models and recently introduced neural variants (e.g., NCF) optimize for the head of these distributions, potentially leading to large estimation errors for tail ratings. These errors in tail ratings that are far from the mean predicted rating fall out of a uni-modal assumption underlying these popular models, as we show in this paper. We propose to improve the estimation of tail ratings by extending traditional single latent representations (e.g., an item is represented by a single latent vector) with new multi-latent representations for better modeling these tail ratings. We show how to incorporate these multi-latent representations in an end-to-end neural prediction model that is designed to better reflect the underlying ratings distributions of items. Through experiments over six datasets, we find the proposed model leads to a significant improvement in RMSE versus a suite of benchmark methods. We also find that the predictions for the most polarized items are improved by more than 15%.","PeriodicalId":319008,"journal":{"name":"Proceedings of the 13th International Conference on Web Search and Data Mining","volume":"58 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121600475","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 12

Incremental Filter Pruning via Random Walk for Accelerating Deep Convolutional Neural Networks 基于随机行走的深度卷积神经网络增量滤波剪枝

Proceedings of the 13th International Conference on Web Search and Data Mining

Pub Date : 2020-01-20 DOI: 10.1145/3336191.3371849

Qinghua Li, Cuiping Li, Hong Chen

Accelerating Deep Convolutional Neural Networks (CNNs) has recently received ever-increasing research focus. Among various approaches proposed in the literature, filter pruning has been regarded as a promising solution, which is due to its advantage in significant speedup and memory reduction of both network model and intermediate feature maps. Previous works utilized "smaller-norm-less-important" criterion to prune filters with smaller ࡁp-norm values by pruning and retraining alternately. This trends to narrow the model capacity for the following reasons: (1) Violent pruning. Previous works adopt a violent strategy in which all filters are simultaneously pruned, which leaving the room to retain model accuracy limited. (2) Filter degradation. Previous works simply set the pruned filter to 0 and retrained it alterately, which easily led to the loss of learning ability of filters. To solve this problem, we propose a novel filter pruning method, namely Incremental Filter Pruning via Random Walk (IFPRW). IFPRW solves the problem of violent pruning by incremental method and Filter degradation by means of random walk. When applied to two image classification benchmarks, the usefulness and strength of IFPRW is validated. Notably, on CIFAR-10, IFPRW reduces more than 46% FLOPs on ResNet-110 with even 0.28% relative accuracy improvement. Moreover, on ILSVRC-2012, IFPRW reduces more than 54% FLOPs on ResNet-101 with only 0.7% top-5 accurcacy drop. which proving that IFPRW outperforms the state-of-the-art filter pruning methods.

加速深度卷积神经网络(cnn)近年来受到越来越多的研究热点。在文献中提出的各种方法中，过滤器修剪被认为是一种很有前途的解决方案，这是因为它在网络模型和中间特征映射的显著加速和内存减少方面具有优势。以前的工作采用“较小的规范-不重要”准则，通过修剪和再训练交替修剪ࡁp-norm值较小的过滤器。这将导致模型容量的缩小，原因如下:(1)剧烈修剪。以前的作品采用了一种暴力策略，即同时修剪所有滤波器，这使得保留模型精度的空间有限。(2)过滤器降解。以往的工作都是简单地将被修剪过的滤波器设为0，然后交替进行再训练，容易导致滤波器的学习能力丧失。为了解决这一问题，我们提出了一种新的滤波剪枝方法，即通过随机行走的增量滤波剪枝(IFPRW)。IFPRW用增量法解决了暴力剪枝问题，用随机漫步法解决了滤波器退化问题。当应用于两个图像分类基准时，验证了IFPRW的有效性和强度。值得注意的是，在CIFAR-10上，IFPRW在ResNet-110上减少了46%以上的FLOPs，相对精度甚至提高了0.28%。此外，在ILSVRC-2012上，IFPRW在ResNet-101上减少了54%以上的FLOPs，而前5名精度仅下降了0.7%。这证明IFPRW优于最先进的过滤器修剪方法。

{"title":"Incremental Filter Pruning via Random Walk for Accelerating Deep Convolutional Neural Networks","authors":"Qinghua Li, Cuiping Li, Hong Chen","doi":"10.1145/3336191.3371849","DOIUrl":"https://doi.org/10.1145/3336191.3371849","url":null,"abstract":"Accelerating Deep Convolutional Neural Networks (CNNs) has recently received ever-increasing research focus. Among various approaches proposed in the literature, filter pruning has been regarded as a promising solution, which is due to its advantage in significant speedup and memory reduction of both network model and intermediate feature maps. Previous works utilized \"smaller-norm-less-important\" criterion to prune filters with smaller ࡁp-norm values by pruning and retraining alternately. This trends to narrow the model capacity for the following reasons: (1) Violent pruning. Previous works adopt a violent strategy in which all filters are simultaneously pruned, which leaving the room to retain model accuracy limited. (2) Filter degradation. Previous works simply set the pruned filter to 0 and retrained it alterately, which easily led to the loss of learning ability of filters. To solve this problem, we propose a novel filter pruning method, namely Incremental Filter Pruning via Random Walk (IFPRW). IFPRW solves the problem of violent pruning by incremental method and Filter degradation by means of random walk. When applied to two image classification benchmarks, the usefulness and strength of IFPRW is validated. Notably, on CIFAR-10, IFPRW reduces more than 46% FLOPs on ResNet-110 with even 0.28% relative accuracy improvement. Moreover, on ILSVRC-2012, IFPRW reduces more than 54% FLOPs on ResNet-101 with only 0.7% top-5 accurcacy drop. which proving that IFPRW outperforms the state-of-the-art filter pruning methods.","PeriodicalId":319008,"journal":{"name":"Proceedings of the 13th International Conference on Web Search and Data Mining","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125252521","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Joint Recognition of Names and Publications in Academic Homepages 学术网页的名称和出版物的联合识别

Proceedings of the 13th International Conference on Web Search and Data Mining

Pub Date : 2020-01-20 DOI: 10.1145/3336191.3371771

Yimeng Dai, Jianzhong Qi, Rui Zhang

Academic homepages are an important source for learning researchers' profiles. Recognising person names and publications in academic homepages are two fundamental tasks for understanding the identities of the homepages and collaboration networks of the researchers. Existing studies have tackled person name recognition and publication recognition separately. We observe that these two tasks are correlated since person names and publications often co-occur. Further, there are strong position patterns for the occurrence of person names and publications. With these observations, we propose a novel deep learning model consisting of two main modules, an alternatingly updated memory module which exploits the knowledge and correlation from both tasks, and a position-aware memory module which captures the patterns of where in a homepage names and publications appear. Empirical results show that our proposed model outperforms the state-of-the-art publication recognition model by 3.64% in F1 score and outperforms the state-of-the-art person name recognition model by 2.06% in F1 score. Ablation studies and visualisation confirm the effectiveness of the proposed modules.

学术网站是学习研究者资料的重要来源。识别学术主页上的人名和出版物是了解研究人员的主页身份和合作网络的两项基本任务。现有的研究将人名识别和出版物识别分开处理。我们观察到这两个任务是相关的，因为人名和出版物经常同时出现。此外，人名和出版物的出现也有很强的位置模式。根据这些观察，我们提出了一个新的深度学习模型，该模型由两个主要模块组成，一个是交替更新的记忆模块，它利用来自两个任务的知识和相关性，另一个是位置感知记忆模块，它捕获主页名称和出版物出现的模式。实证结果表明，本文提出的模型在F1得分上优于目前最先进的出版物识别模型3.64%，在F1得分上优于目前最先进的人名识别模型2.06%。消融研究和可视化证实了所提出模块的有效性。

{"title":"Joint Recognition of Names and Publications in Academic Homepages","authors":"Yimeng Dai, Jianzhong Qi, Rui Zhang","doi":"10.1145/3336191.3371771","DOIUrl":"https://doi.org/10.1145/3336191.3371771","url":null,"abstract":"Academic homepages are an important source for learning researchers' profiles. Recognising person names and publications in academic homepages are two fundamental tasks for understanding the identities of the homepages and collaboration networks of the researchers. Existing studies have tackled person name recognition and publication recognition separately. We observe that these two tasks are correlated since person names and publications often co-occur. Further, there are strong position patterns for the occurrence of person names and publications. With these observations, we propose a novel deep learning model consisting of two main modules, an alternatingly updated memory module which exploits the knowledge and correlation from both tasks, and a position-aware memory module which captures the patterns of where in a homepage names and publications appear. Empirical results show that our proposed model outperforms the state-of-the-art publication recognition model by 3.64% in F1 score and outperforms the state-of-the-art person name recognition model by 2.06% in F1 score. Ablation studies and visualisation confirm the effectiveness of the proposed modules.","PeriodicalId":319008,"journal":{"name":"Proceedings of the 13th International Conference on Web Search and Data Mining","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114235774","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

LouvainNE

Proceedings of the 13th International Conference on Web Search and Data Mining

Pub Date : 2020-01-20 DOI: 10.1145/3336191.3371800

Ayan Kumar Bhowmick, Koushik Meneni, Maximilien Danisch, J. Guillaume, Bivas Mitra

Network embedding, that aims to learn low-dimensional vector representation of nodes such that the network structure is preserved, has gained significant research attention in recent years. However, most state-of-the-art network embedding methods are computationally expensive and hence unsuitable for representing nodes in billion-scale networks. In this paper, we present LouvainNE, a hierarchical clustering approach to network embedding. Precisely, we employ Louvain, an extremely fast and accurate community detection method, to build a hierarchy of successively smaller subgraphs. We obtain representations of individual nodes in the original graph at different levels of the hierarchy, then we aggregate these representations to learn the final embedding vectors. Our theoretical analysis shows that our proposed algorithm has quasi-linear run-time and memory complexity. Our extensive experimental evaluation, carried out on multiple real-world networks of different scales, demonstrates both (i) the scalability of our proposed approach that can handle graphs containing tens of billions of edges, as well as (ii) its effectiveness in performing downstream network mining tasks such as network reconstruction and node classification.

{"title":"LouvainNE","authors":"Ayan Kumar Bhowmick, Koushik Meneni, Maximilien Danisch, J. Guillaume, Bivas Mitra","doi":"10.1145/3336191.3371800","DOIUrl":"https://doi.org/10.1145/3336191.3371800","url":null,"abstract":"Network embedding, that aims to learn low-dimensional vector representation of nodes such that the network structure is preserved, has gained significant research attention in recent years. However, most state-of-the-art network embedding methods are computationally expensive and hence unsuitable for representing nodes in billion-scale networks. In this paper, we present LouvainNE, a hierarchical clustering approach to network embedding. Precisely, we employ Louvain, an extremely fast and accurate community detection method, to build a hierarchy of successively smaller subgraphs. We obtain representations of individual nodes in the original graph at different levels of the hierarchy, then we aggregate these representations to learn the final embedding vectors. Our theoretical analysis shows that our proposed algorithm has quasi-linear run-time and memory complexity. Our extensive experimental evaluation, carried out on multiple real-world networks of different scales, demonstrates both (i) the scalability of our proposed approach that can handle graphs containing tens of billions of edges, as well as (ii) its effectiveness in performing downstream network mining tasks such as network reconstruction and node classification.","PeriodicalId":319008,"journal":{"name":"Proceedings of the 13th International Conference on Web Search and Data Mining","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114259025","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 30

Ultra Fine-Grained Image Semantic Embedding 超细粒度图像语义嵌入

Proceedings of the 13th International Conference on Web Search and Data Mining

Pub Date : 2020-01-20 DOI: 10.1145/3336191.3371784

Da-Cheng Juan, Chun-Ta Lu, Zhuguo Li, Futang Peng, Aleksei Timofeev, Yi-Ting Chen, Y. Gao, Tom Duerig, A. Tomkins, Sujith Ravi

"How to learn image embeddings that capture fine-grained semantics based on the instance of an image?" "Is it possible for such embeddings to further understand image semantics closer to humans' perception?" In this paper, we present, Graph-Regularized Image Semantic Embedding (Graph-RISE), a web-scale neural graph learning framework deployed at Google, which allows us to train image embeddings to discriminate an unprecedented O(40M) ultra-fine-grained semantic labels. The proposed Graph-RISE outperforms state-of-the-art image embedding algorithms on several evaluation tasks, including kNN search and triplet ranking: the accuracy is improved by approximately 2X on the ImageNet dataset and by more than 5X on the iNaturalist dataset. Qualitatively, image retrieval from one billion images based on the proposed Graph-RISE effectively captures semantics and, compared to the state-of-the-art, differentiates nuances at levels that are closer to human-perception.

“如何学习基于图像实例捕获细粒度语义的图像嵌入?”“这种嵌入是否有可能进一步理解更接近人类感知的图像语义?”在本文中，我们提出了图-正则化图像语义嵌入(graph - rise)，这是谷歌部署的一个网络规模的神经图学习框架，它允许我们训练图像嵌入来区分前所未有的O(40M)超细粒度语义标签。提出的Graph-RISE在几个评估任务上优于最先进的图像嵌入算法，包括kNN搜索和三元组排名:在ImageNet数据集上精度提高了大约2倍，在iNaturalist数据集上提高了5倍以上。从质量上讲，基于所提出的Graph-RISE的10亿张图像的图像检索有效地捕获了语义，并且与最先进的技术相比，在更接近人类感知的水平上区分细微差别。

引用次数: 18

Pseudo Dyna-Q: A Reinforcement Learning Framework for Interactive Recommendation 伪动态- q:交互式推荐的强化学习框架

Proceedings of the 13th International Conference on Web Search and Data Mining

Pub Date : 2020-01-20 DOI: 10.1145/3336191.3371801

Lixin Zou, Long Xia, Pan Du, Zhuo Zhang, Ting Bai, Weidong Liu, J. Nie, Dawei Yin

Applying reinforcement learning (RL) in recommender systems is attractive but costly due to the constraint of the interaction with real customers, where performing online policy learning through interacting with real customers usually harms customer experiences. A practical alternative is to build a recommender agent offline from logged data, whereas directly using logged data offline leads to the problem of selection bias between logging policy and the recommendation policy. The existing direct offline learning algorithms, such as Monte Carlo methods and temporal difference methods are either computationally expensive or unstable on convergence. To address these issues, we propose Pseudo Dyna-Q (PDQ). In PDQ, instead of interacting with real customers, we resort to a customer simulator, referred to as the World Model, which is designed to simulate the environment and handle the selection bias of logged data. During policy improvement, the World Model is constantly updated and optimized adaptively, according to the current recommendation policy. This way, the proposed PDQ not only avoids the instability of convergence and high computation cost of existing approaches but also provides unlimited interactions without involving real customers. Moreover, a proved upper bound of empirical error of reward function guarantees that the learned offline policy has lower bias and variance. Extensive experiments demonstrated the advantages of PDQ on two real-world datasets against state-of-the-arts methods.

在推荐系统中应用强化学习(RL)是有吸引力的，但由于与真实客户交互的限制，成本很高，其中通过与真实客户交互进行在线策略学习通常会损害客户体验。一种实用的替代方法是从日志数据离线构建推荐代理，而直接离线使用日志数据会导致日志策略和推荐策略之间的选择偏差问题。现有的直接离线学习算法，如蒙特卡罗方法和时间差分方法，要么计算量大，要么收敛性不稳定。为了解决这些问题，我们提出了伪动态q (PDQ)。在PDQ中，我们不与真实的客户进行交互，而是使用客户模拟器(称为World Model)，该模拟器旨在模拟环境并处理记录数据的选择偏差。在政策改进过程中，World Model会根据当前的推荐策略不断更新和自适应优化。这样，所提出的PDQ既避免了现有方法收敛的不稳定性和较高的计算成本，又可以在不涉及真实客户的情况下提供无限的交互。此外，奖励函数经验误差上界的证明保证了学习到的离线策略具有较小的偏差和方差。大量的实验证明了PDQ在两个真实世界数据集上与最先进的方法相比的优势。

{"title":"Pseudo Dyna-Q: A Reinforcement Learning Framework for Interactive Recommendation","authors":"Lixin Zou, Long Xia, Pan Du, Zhuo Zhang, Ting Bai, Weidong Liu, J. Nie, Dawei Yin","doi":"10.1145/3336191.3371801","DOIUrl":"https://doi.org/10.1145/3336191.3371801","url":null,"abstract":"Applying reinforcement learning (RL) in recommender systems is attractive but costly due to the constraint of the interaction with real customers, where performing online policy learning through interacting with real customers usually harms customer experiences. A practical alternative is to build a recommender agent offline from logged data, whereas directly using logged data offline leads to the problem of selection bias between logging policy and the recommendation policy. The existing direct offline learning algorithms, such as Monte Carlo methods and temporal difference methods are either computationally expensive or unstable on convergence. To address these issues, we propose Pseudo Dyna-Q (PDQ). In PDQ, instead of interacting with real customers, we resort to a customer simulator, referred to as the World Model, which is designed to simulate the environment and handle the selection bias of logged data. During policy improvement, the World Model is constantly updated and optimized adaptively, according to the current recommendation policy. This way, the proposed PDQ not only avoids the instability of convergence and high computation cost of existing approaches but also provides unlimited interactions without involving real customers. Moreover, a proved upper bound of empirical error of reward function guarantees that the learned offline policy has lower bias and variance. Extensive experiments demonstrated the advantages of PDQ on two real-world datasets against state-of-the-arts methods.","PeriodicalId":319008,"journal":{"name":"Proceedings of the 13th International Conference on Web Search and Data Mining","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115474817","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 87

User Intent Inference for Web Search and Conversational Agents Web搜索和会话代理的用户意图推理

Proceedings of the 13th International Conference on Web Search and Data Mining

Pub Date : 2020-01-20 DOI: 10.1145/3336191.3372187

Ali Ahmadvand

User intent understanding is a crucial step in designing both conversational agents and search engines. Detecting or inferring user intent is challenging, since the user utterances or queries can be short, ambiguous, and contextually dependent. To address these research challenges, my thesis work focuses on: 1) Utterance topic and intent classification for conversational agents 2) Query intent mining and classification for Web search engines, focusing on the e-commerce domain. To address the first topic, I proposed novel models to incorporate entity information and conversation-context clues to predict both topic and intent of the user's utterances. For the second research topic, I plan to extend the existing state of the art methods in Web search intent prediction to the e-commerce domain, via: 1) Developing a joint learning model to predict search queries' intents and the product categories associated with them, 2) Discovering new hidden users' intents. All the models will be evaluated on the real queries available from a major e-commerce site search engine. The results from these studies can be leveraged to improve performance of various tasks such as natural language understanding, query scoping, query suggestion, and ranking, resulting in an enriched user experience.

用户意图理解是设计会话代理和搜索引擎的关键一步。检测或推断用户意图是具有挑战性的，因为用户的话语或查询可能很短、含糊不清，并且依赖于上下文。为了解决这些研究挑战，我的论文工作集中在:1)会话代理的话语主题和意图分类2)Web搜索引擎的查询意图挖掘和分类，重点是电子商务领域。为了解决第一个主题，我提出了新的模型来结合实体信息和对话上下文线索来预测用户话语的主题和意图。对于第二个研究课题，我计划将现有的Web搜索意图预测方法扩展到电子商务领域，通过:1)开发一个联合学习模型来预测搜索查询的意图和与之相关的产品类别，2)发现新的隐藏用户的意图。所有模型都将根据一个主要电子商务网站搜索引擎提供的真实查询进行评估。可以利用这些研究的结果来提高各种任务的性能，例如自然语言理解、查询范围、查询建议和排名，从而丰富用户体验。

{"title":"User Intent Inference for Web Search and Conversational Agents","authors":"Ali Ahmadvand","doi":"10.1145/3336191.3372187","DOIUrl":"https://doi.org/10.1145/3336191.3372187","url":null,"abstract":"User intent understanding is a crucial step in designing both conversational agents and search engines. Detecting or inferring user intent is challenging, since the user utterances or queries can be short, ambiguous, and contextually dependent. To address these research challenges, my thesis work focuses on: 1) Utterance topic and intent classification for conversational agents 2) Query intent mining and classification for Web search engines, focusing on the e-commerce domain. To address the first topic, I proposed novel models to incorporate entity information and conversation-context clues to predict both topic and intent of the user's utterances. For the second research topic, I plan to extend the existing state of the art methods in Web search intent prediction to the e-commerce domain, via: 1) Developing a joint learning model to predict search queries' intents and the product categories associated with them, 2) Discovering new hidden users' intents. All the models will be evaluated on the real queries available from a major e-commerce site search engine. The results from these studies can be leveraged to improve performance of various tasks such as natural language understanding, query scoping, query suggestion, and ranking, resulting in an enriched user experience.","PeriodicalId":319008,"journal":{"name":"Proceedings of the 13th International Conference on Web Search and Data Mining","volume":"102 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115594873","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2