首页 > 最新文献

Proceedings of the 2017 ACM on Conference on Information and Knowledge Management最新文献

英文 中文
Deep Neural Networks for News Recommendations 新闻推荐的深度神经网络
Keunchan Park, Jisoo Lee, Jaeho Choi
A fundamental role of news websites is to recommend articles that are interesting to read. The key challenge of news recommendation is to recommend newly published articles. Unlike other domains, outdated items are considered to be irrelevant in the news recommendation task. Another challenge is that the recommendation candidates are not seen in the training phase. In this paper, we introduce deep neural network models to overcome these challenges. we propose a modified session-based Recurrent Neural Network (RNN) model tailored to news recommendation as well as a history-based RNN model that spans the whole user's past histories. Finally, we propose a Convolutional Neural Network (CNN) model to capture user preferences and to personalize recommendation results. Experimental results on real-world news dataset shows that our model outperforms competitive baselines.
新闻网站的一个基本作用是推荐有趣的文章。新闻推荐的关键挑战是推荐新发表的文章。与其他领域不同,过时的条目在新闻推荐任务中被认为是不相关的。另一个挑战是在培训阶段看不到推荐候选人。在本文中,我们引入深度神经网络模型来克服这些挑战。我们提出了一种改进的基于会话的递归神经网络(RNN)模型,该模型适合于新闻推荐,以及一种基于历史的RNN模型,该模型涵盖了整个用户的过去历史。最后,我们提出了一个卷积神经网络(CNN)模型来捕获用户偏好并个性化推荐结果。在真实新闻数据集上的实验结果表明,我们的模型优于竞争基线。
{"title":"Deep Neural Networks for News Recommendations","authors":"Keunchan Park, Jisoo Lee, Jaeho Choi","doi":"10.1145/3132847.3133154","DOIUrl":"https://doi.org/10.1145/3132847.3133154","url":null,"abstract":"A fundamental role of news websites is to recommend articles that are interesting to read. The key challenge of news recommendation is to recommend newly published articles. Unlike other domains, outdated items are considered to be irrelevant in the news recommendation task. Another challenge is that the recommendation candidates are not seen in the training phase. In this paper, we introduce deep neural network models to overcome these challenges. we propose a modified session-based Recurrent Neural Network (RNN) model tailored to news recommendation as well as a history-based RNN model that spans the whole user's past histories. Finally, we propose a Convolutional Neural Network (CNN) model to capture user preferences and to personalize recommendation results. Experimental results on real-world news dataset shows that our model outperforms competitive baselines.","PeriodicalId":20449,"journal":{"name":"Proceedings of the 2017 ACM on Conference on Information and Knowledge Management","volume":"17 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75679456","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 48
A Study of Feature Construction for Text-based Forecasting of Time Series Variables 基于文本的时间序列变量预测特征构建研究
Yiren Wang, Dominic Seyler, Shubhra (Santu) Karmaker, ChengXiang Zhai
Time series are ubiquitous in the world since they are used to measure various phenomena (e.g., temperature, spread of a virus, sales, etc.). Forecasting of time series is highly beneficial (and necessary) for optimizing decisions, yet is a very challenging problem; using only the historical values of the time series is often insufficient. In this paper, we study how to construct effective additional features based on related text data for time series forecasting. Besides the commonly used n-gram features, we propose a general strategy for constructing multiple topical features based on the topics discovered by a topic model. We evaluate feature effectiveness using a data set for predicting stock price changes where we constructed additional features from news text articles for stock market prediction. We found that: 1) Text-based features outperform time series-based features, suggesting the great promise of leveraging text data for improving time series forecasting. 2) Topic-based features are not very effective stand-alone, but they can further improve performance when added on top of n-gram features. 3) The best topic-based feature appears to be a long-term aggregation of topics over time with high weights on recent topics.
时间序列在世界上无处不在,因为它们被用来测量各种现象(例如,温度、病毒传播、销售等)。时间序列的预测对优化决策非常有益(也是必要的),但也是一个非常具有挑战性的问题;仅使用时间序列的历史值通常是不够的。本文研究了如何基于相关文本数据构建有效的附加特征用于时间序列预测。除了常用的n-gram特征外,我们还提出了一种基于主题模型发现的主题构建多个主题特征的通用策略。我们使用预测股票价格变化的数据集来评估特征的有效性,其中我们从新闻文本文章中构建了用于股票市场预测的附加特征。我们发现:1)基于文本的特征优于基于时间序列的特征,这表明利用文本数据来改进时间序列预测的巨大前景。2)基于主题的特征在单独使用时不是很有效,但是当添加到n-gram特征上时,它们可以进一步提高性能。3)最好的基于主题的特征似乎是一个长期的主题聚合,随着时间的推移,最近的主题权重很高。
{"title":"A Study of Feature Construction for Text-based Forecasting of Time Series Variables","authors":"Yiren Wang, Dominic Seyler, Shubhra (Santu) Karmaker, ChengXiang Zhai","doi":"10.1145/3132847.3133109","DOIUrl":"https://doi.org/10.1145/3132847.3133109","url":null,"abstract":"Time series are ubiquitous in the world since they are used to measure various phenomena (e.g., temperature, spread of a virus, sales, etc.). Forecasting of time series is highly beneficial (and necessary) for optimizing decisions, yet is a very challenging problem; using only the historical values of the time series is often insufficient. In this paper, we study how to construct effective additional features based on related text data for time series forecasting. Besides the commonly used n-gram features, we propose a general strategy for constructing multiple topical features based on the topics discovered by a topic model. We evaluate feature effectiveness using a data set for predicting stock price changes where we constructed additional features from news text articles for stock market prediction. We found that: 1) Text-based features outperform time series-based features, suggesting the great promise of leveraging text data for improving time series forecasting. 2) Topic-based features are not very effective stand-alone, but they can further improve performance when added on top of n-gram features. 3) The best topic-based feature appears to be a long-term aggregation of topics over time with high weights on recent topics.","PeriodicalId":20449,"journal":{"name":"Proceedings of the 2017 ACM on Conference on Information and Knowledge Management","volume":"36 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74789308","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Interactive Spatial Keyword Querying with Semantics 具有语义的交互式空间关键字查询
Jiabao Sun, Jiajie Xu, Kai Zheng, Chengfei Liu
Conventional spatial keyword queries confront the difficulty of returning desired objects that are synonyms but morphologically different to query keywords. To overcome this flaw, this paper investigates the interactive spatial keyword querying with semantics. It aims to enhance the conventional queries by not only making sense of the query keywords, but also refining the understanding of query semantics through interactions. On top of the probabilistic topic model, a novel interactive strategy is proposed to precisely infer the latent query semantics by learning from user feedbacks. In each interaction, the returned objects are carefully selected to ensure effective inference of user intended query semantics. Query processing is carried out on a small candidate object set at each round of interaction, and the whole querying process terminates when the latent query semantics learned from user feedback becomes explicit enough. The experimental results on real check-in dataset demonstrates that the quality of results has been significantly improved through limited number of interactions.
传统的空间关键字查询面临返回与查询关键字在形态上不同的同义词的所需对象的困难。为了克服这一缺陷,本文研究了具有语义的交互式空间关键字查询。它不仅通过理解查询关键字,而且通过交互改进对查询语义的理解,从而增强传统查询。在概率主题模型的基础上,提出了一种新的交互策略,通过学习用户反馈来精确地推断潜在的查询语义。在每次交互中,都会仔细选择返回的对象,以确保对用户期望的查询语义进行有效推断。在每一轮交互中,对一个小的候选对象集进行查询处理,当从用户反馈中学习到的潜在查询语义足够明确时,整个查询过程终止。在真实签入数据集上的实验结果表明,通过有限的交互次数,结果的质量得到了显著提高。
{"title":"Interactive Spatial Keyword Querying with Semantics","authors":"Jiabao Sun, Jiajie Xu, Kai Zheng, Chengfei Liu","doi":"10.1145/3132847.3132969","DOIUrl":"https://doi.org/10.1145/3132847.3132969","url":null,"abstract":"Conventional spatial keyword queries confront the difficulty of returning desired objects that are synonyms but morphologically different to query keywords. To overcome this flaw, this paper investigates the interactive spatial keyword querying with semantics. It aims to enhance the conventional queries by not only making sense of the query keywords, but also refining the understanding of query semantics through interactions. On top of the probabilistic topic model, a novel interactive strategy is proposed to precisely infer the latent query semantics by learning from user feedbacks. In each interaction, the returned objects are carefully selected to ensure effective inference of user intended query semantics. Query processing is carried out on a small candidate object set at each round of interaction, and the whole querying process terminates when the latent query semantics learned from user feedback becomes explicit enough. The experimental results on real check-in dataset demonstrates that the quality of results has been significantly improved through limited number of interactions.","PeriodicalId":20449,"journal":{"name":"Proceedings of the 2017 ACM on Conference on Information and Knowledge Management","volume":"14 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74343076","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
A New Approach to Compute CNNs for Extremely Large Images 一种计算超大图像cnn的新方法
Sai Wu, Mengdan Zhang, Gang Chen, Ke Chen
CNN (Convolution Neural Network) is widely used in visual analysis and achieves exceptionally high performances in image classification, face detection, object recognition, image recoloring, and other learning jobs. Using deep learning frameworks, such as Torch and Tensorflow, CNN can be efficiently computed by leveraging the power of GPU. However, one drawback of GPU is its limited memory which prohibits us from handling large images. Passing a 4K resolution image to the VGG network will result in an exception of out-of-memory for Titan-X GPU. In this paper, we propose a new approach that adopts the BSP (bulk synchronization parallel) model to compute CNNs for images of any size. Before fed to a specific CNN layer, the image is split into smaller pieces which go through the neural network separately. Then, a specific padding and normalization technique is adopted to merge sub-images back into one image. Our approach can be easily extended to support distributed multi-GPUs. In this paper, we use neural style network as our example to illustrate the effectiveness of our approach. We show that using one Titan-X GPU, we can transfer the style of an image with 10,000×10,000 pixels within 1 minute.
卷积神经网络(convolutional Neural Network, CNN)广泛应用于视觉分析领域,在图像分类、人脸检测、物体识别、图像重着色等学习工作中都取得了非常高的性能。使用Torch和Tensorflow等深度学习框架,可以利用GPU的强大功能高效地计算CNN。然而,GPU的一个缺点是它有限的内存,这使我们无法处理大图像。通过4K分辨率的图像到VGG网络将导致Titan-X GPU内存不足的异常。在本文中,我们提出了一种采用BSP(批量同步并行)模型计算任意大小图像cnn的新方法。在输入到特定的CNN层之前,图像被分成更小的部分,分别通过神经网络。然后,采用特定的填充和归一化技术将子图像合并回一幅图像。我们的方法可以很容易地扩展到支持分布式多gpu。在本文中,我们以神经风格网络为例来说明我们的方法的有效性。我们表明,使用一个Titan-X GPU,我们可以在1分钟内传输具有10,000×10,000像素的图像的样式。
{"title":"A New Approach to Compute CNNs for Extremely Large Images","authors":"Sai Wu, Mengdan Zhang, Gang Chen, Ke Chen","doi":"10.1145/3132847.3132872","DOIUrl":"https://doi.org/10.1145/3132847.3132872","url":null,"abstract":"CNN (Convolution Neural Network) is widely used in visual analysis and achieves exceptionally high performances in image classification, face detection, object recognition, image recoloring, and other learning jobs. Using deep learning frameworks, such as Torch and Tensorflow, CNN can be efficiently computed by leveraging the power of GPU. However, one drawback of GPU is its limited memory which prohibits us from handling large images. Passing a 4K resolution image to the VGG network will result in an exception of out-of-memory for Titan-X GPU. In this paper, we propose a new approach that adopts the BSP (bulk synchronization parallel) model to compute CNNs for images of any size. Before fed to a specific CNN layer, the image is split into smaller pieces which go through the neural network separately. Then, a specific padding and normalization technique is adopted to merge sub-images back into one image. Our approach can be easily extended to support distributed multi-GPUs. In this paper, we use neural style network as our example to illustrate the effectiveness of our approach. We show that using one Titan-X GPU, we can transfer the style of an image with 10,000×10,000 pixels within 1 minute.","PeriodicalId":20449,"journal":{"name":"Proceedings of the 2017 ACM on Conference on Information and Knowledge Management","volume":"7 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73334645","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Recipe Popularity Prediction with Deep Visual-Semantic Fusion 基于深度视觉语义融合的配方流行度预测
Satoshi Sanjo, Marie Katsurai
Predicting the popularity of user-created recipes has great potential to be adopted in several applications on recipe-sharing websites. To ensure timely prediction when a recipe is uploaded, a prediction model needs to be trained based on the recipe's content features (i.e., its visual and semantic features). This paper presents a novel approach to predicting recipe popularity using deep visual-semantic fusion. We first pre-train a deep model that predicts the popularity of recipes based on each single modality. We insert additional layers to the two models and concatenate their activations. Finally, we train a network comprising fully connected (FC) layers on the fused features to learn more powerful features, which are used for training a regressor. Based on experiments conducted on more than 150K recipes collected from the Cookpad website, we present a comprehensive comparison with several baselines to verify the effectiveness of our method. The best practice for the proposed method is also described.
预测用户创建的食谱的受欢迎程度在食谱分享网站上的几个应用程序中具有很大的潜力。为了确保在菜谱上传时及时预测,需要根据菜谱的内容特征(即其视觉和语义特征)训练预测模型。本文提出了一种基于深度视觉语义融合的食谱流行度预测方法。我们首先预训练一个深度模型,该模型基于每种单一模态预测食谱的受欢迎程度。我们向两个模型插入额外的层,并连接它们的激活。最后,我们在融合的特征上训练一个由全连接层(FC)组成的网络,以学习更强大的特征,这些特征用于训练回归器。通过对Cookpad网站上收集的15万多份食谱进行实验,我们与几个基线进行了全面的比较,以验证我们方法的有效性。本文还描述了该方法的最佳实践。
{"title":"Recipe Popularity Prediction with Deep Visual-Semantic Fusion","authors":"Satoshi Sanjo, Marie Katsurai","doi":"10.1145/3132847.3133137","DOIUrl":"https://doi.org/10.1145/3132847.3133137","url":null,"abstract":"Predicting the popularity of user-created recipes has great potential to be adopted in several applications on recipe-sharing websites. To ensure timely prediction when a recipe is uploaded, a prediction model needs to be trained based on the recipe's content features (i.e., its visual and semantic features). This paper presents a novel approach to predicting recipe popularity using deep visual-semantic fusion. We first pre-train a deep model that predicts the popularity of recipes based on each single modality. We insert additional layers to the two models and concatenate their activations. Finally, we train a network comprising fully connected (FC) layers on the fused features to learn more powerful features, which are used for training a regressor. Based on experiments conducted on more than 150K recipes collected from the Cookpad website, we present a comprehensive comparison with several baselines to verify the effectiveness of our method. The best practice for the proposed method is also described.","PeriodicalId":20449,"journal":{"name":"Proceedings of the 2017 ACM on Conference on Information and Knowledge Management","volume":"94 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79279835","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 23
An Ad CTR Prediction Method Based on Feature Learning of Deep and Shallow Layers 基于深层和浅层特征学习的广告点击率预测方法
Zai Huang, Zhen Pan, Qi Liu, Bai Long, Haiping Ma, Enhong Chen
In online advertising, Click-Through Rate (CTR) prediction is a crucial task, as it may benefit the ranking and pricing of online ads. To the best of our knowledge, most of the existing CTR prediction methods are shallow layer models (e.g., Logistic Regression and Factorization Machines) or deep layer models (e.g., Neural Networks). Unfortunately, the shallow layer models cannot capture or utilize high-order nonlinear features in ad data. On the other side, the deep layer models cannot satisfy the necessity of updating CTR models online efficiently due to their high computational complexity. To address the shortcomings above, in this paper, we propose a novel hybrid method based on feature learning of both Deep and Shallow Layers (DSL). In DSL, we utilize Deep Neural Network as a deep layer model trained offline to learn high-order nonlinear features and use Factorization Machines as a shallow layer model for CTR prediction. Furthermore, we also develop an online learning implementation based on DSL, i.e., onlineDSL. Extensive experiments on large-scale real-world datasets clearly validate the effectiveness of our DSL method and onlineDSL algorithm compared with several state-of-the-art baselines.
在网络广告中,点击率(CTR)预测是一项至关重要的任务,因为它可能有利于在线广告的排名和定价。据我们所知,大多数现有的CTR预测方法是浅层模型(例如,逻辑回归和分解机器)或深层模型(例如,神经网络)。不幸的是,浅层模型不能捕获或利用广告数据中的高阶非线性特征。另一方面,深层模型的计算复杂度较高,不能满足在线高效更新CTR模型的需要。为了解决上述缺点,本文提出了一种基于深层和浅层特征学习(DSL)的新型混合方法。在DSL中,我们使用深度神经网络作为离线训练的深层模型来学习高阶非线性特征,并使用Factorization Machines作为CTR预测的浅层模型。此外,我们还开发了一种基于DSL的在线学习实现,即在线edsl。与几种最先进的基线相比,在大规模真实数据集上进行的大量实验清楚地验证了我们的DSL方法和在线edsl算法的有效性。
{"title":"An Ad CTR Prediction Method Based on Feature Learning of Deep and Shallow Layers","authors":"Zai Huang, Zhen Pan, Qi Liu, Bai Long, Haiping Ma, Enhong Chen","doi":"10.1145/3132847.3133072","DOIUrl":"https://doi.org/10.1145/3132847.3133072","url":null,"abstract":"In online advertising, Click-Through Rate (CTR) prediction is a crucial task, as it may benefit the ranking and pricing of online ads. To the best of our knowledge, most of the existing CTR prediction methods are shallow layer models (e.g., Logistic Regression and Factorization Machines) or deep layer models (e.g., Neural Networks). Unfortunately, the shallow layer models cannot capture or utilize high-order nonlinear features in ad data. On the other side, the deep layer models cannot satisfy the necessity of updating CTR models online efficiently due to their high computational complexity. To address the shortcomings above, in this paper, we propose a novel hybrid method based on feature learning of both Deep and Shallow Layers (DSL). In DSL, we utilize Deep Neural Network as a deep layer model trained offline to learn high-order nonlinear features and use Factorization Machines as a shallow layer model for CTR prediction. Furthermore, we also develop an online learning implementation based on DSL, i.e., onlineDSL. Extensive experiments on large-scale real-world datasets clearly validate the effectiveness of our DSL method and onlineDSL algorithm compared with several state-of-the-art baselines.","PeriodicalId":20449,"journal":{"name":"Proceedings of the 2017 ACM on Conference on Information and Knowledge Management","volume":"6 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79343953","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Simulating Zero-Resource Spoken Term Discovery 模拟零资源口语术语发现
Jerome White, Douglas W. Oard
If search engines are ever to index all of the spoken content in the world, they will need to handle hundreds of languages for which no automatic speech recognition systems exist. Zero-resource spoken term discovery, in which repeated content is detected in some acoustic representation, offers a potentially useful source of indexing features. This paper describes a text-based simulation of a zero-resource spoken term discovery system that allows any information retrieval test collection to be used as a basis for early development of information retrieval techniques. It is proposed that these techniques can be later applied to actual zero-resource spoken term discovery results.
如果搜索引擎要索引世界上所有的口语内容,它们将需要处理数百种语言,而这些语言还没有自动语音识别系统存在。零资源口语术语发现,在某些声学表示中检测到重复内容,提供了一个潜在的有用的索引特征来源。本文描述了一个基于文本的零资源口语术语发现系统的模拟,该系统允许将任何信息检索测试集合用作信息检索技术早期开发的基础。提出这些技术以后可以应用于实际的零资源口语术语发现结果。
{"title":"Simulating Zero-Resource Spoken Term Discovery","authors":"Jerome White, Douglas W. Oard","doi":"10.1145/3132847.3133160","DOIUrl":"https://doi.org/10.1145/3132847.3133160","url":null,"abstract":"If search engines are ever to index all of the spoken content in the world, they will need to handle hundreds of languages for which no automatic speech recognition systems exist. Zero-resource spoken term discovery, in which repeated content is detected in some acoustic representation, offers a potentially useful source of indexing features. This paper describes a text-based simulation of a zero-resource spoken term discovery system that allows any information retrieval test collection to be used as a basis for early development of information retrieval techniques. It is proposed that these techniques can be later applied to actual zero-resource spoken term discovery results.","PeriodicalId":20449,"journal":{"name":"Proceedings of the 2017 ACM on Conference on Information and Knowledge Management","volume":"2 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84210329","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
BayDNN
Daizong Ding, Mi Zhang, Shao-Yuan Li, Jie Tang, Xiaotie Chen, Zhi-Hua Zhou
Friendship is the cornerstone to build a social network. In online social networks, statistics show that the leading reason for user to create a new friendship is due to recommendation. Thus the accuracy of recommendation matters. In this paper, we propose a Bayesian Personalized Ranking Deep Neural Network (BayDNN) model for friend recommendation in social networks. With BayDNN, we achieve significant improvement on two public datasets: Epinions and Slashdot. For example, on Epinions dataset, BayDNN significantly outperforms the state-of-the-art algorithms, with a 5% improvement on NDCG over the best baseline. The advantages of the proposed BayDNN mainly come from its underlying convolutional neural network (CNN), which offers a mechanism to extract latent deep structural feature representations of the complicated network data, and a novel Bayesian personalized ranking idea, which precisely captures the users' personal bias based on the extracted deep features. To get good parameter estimation for the neural network, we present a fine-tuned pre-training strategy for the proposed BayDNN model based on Poisson and Bernoulli probabilistic models.
{"title":"BayDNN","authors":"Daizong Ding, Mi Zhang, Shao-Yuan Li, Jie Tang, Xiaotie Chen, Zhi-Hua Zhou","doi":"10.1145/3132847.3132941","DOIUrl":"https://doi.org/10.1145/3132847.3132941","url":null,"abstract":"Friendship is the cornerstone to build a social network. In online social networks, statistics show that the leading reason for user to create a new friendship is due to recommendation. Thus the accuracy of recommendation matters. In this paper, we propose a Bayesian Personalized Ranking Deep Neural Network (BayDNN) model for friend recommendation in social networks. With BayDNN, we achieve significant improvement on two public datasets: Epinions and Slashdot. For example, on Epinions dataset, BayDNN significantly outperforms the state-of-the-art algorithms, with a 5% improvement on NDCG over the best baseline. The advantages of the proposed BayDNN mainly come from its underlying convolutional neural network (CNN), which offers a mechanism to extract latent deep structural feature representations of the complicated network data, and a novel Bayesian personalized ranking idea, which precisely captures the users' personal bias based on the extracted deep features. To get good parameter estimation for the neural network, we present a fine-tuned pre-training strategy for the proposed BayDNN model based on Poisson and Bernoulli probabilistic models.","PeriodicalId":20449,"journal":{"name":"Proceedings of the 2017 ACM on Conference on Information and Knowledge Management","volume":"5 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81971266","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A Personalised Ranking Framework with Multiple Sampling Criteria for Venue Recommendation 个性化排名框架与多个采样标准的场地推荐
Jarana Manotumruksa, C. Macdonald, I. Ounis
Recommending a ranked list of interesting venues to users based on their preferences has become a key functionality in Location-Based Social Networks (LBSNs) such as Yelp and Gowalla. Bayesian Personalised Ranking (BPR) is a popular pairwise recommendation technique that is used to generate the ranked list of venues of interest to a user, by leveraging the user's implicit feedback such as their check-ins as instances of positive feedback, while randomly sampling other venues as negative instances. To alleviate the sparsity that affects the usefulness of recommendations by BPR for users with few check-ins, various approaches have been proposed in the literature to incorporate additional sources of information such as the social links between users, the textual content of comments, as well as the geographical location of the venues. However, such approaches can only readily leverage one source of additional information for negative sampling. Instead, we propose a novel Personalised Ranking Framework with Multiple sampling Criteria (PRFMC) that leverages both geographical influence and social correlation to enhance the effectiveness of BPR. In particular, we apply a multi-centre Gaussian model and a power-law distribution method, to capture geographical influence and social correlation when sampling negative venues, respectively. Finally, we conduct comprehensive experiments using three large-scale datasets from the Yelp, Gowalla and Brightkite LBSNs. The experimental results demonstrate the effectiveness of fusing both geographical influence and social correlation in our proposed PRFMC framework and its superiority in comparison to BPR-based and other similar ranking approaches. Indeed, our PRFMC approach attains a 37% improvement in MRR over a recently proposed approach that identifies negative venues only from social links.
根据用户的喜好向他们推荐有趣的地点已经成为Yelp和Gowalla等基于位置的社交网络(LBSNs)的一项关键功能。贝叶斯个性化排名(BPR)是一种流行的两两推荐技术,通过利用用户的隐式反馈(如他们的签到)作为积极反馈的实例,同时随机抽取其他场所作为消极实例,用于生成用户感兴趣的场所排名列表。为了减轻影响BPR建议对很少签到的用户有用性的稀疏性,文献中提出了各种方法来纳入其他信息来源,如用户之间的社会联系、评论的文本内容以及场所的地理位置。然而,这种方法只能很容易地利用一个来源的额外信息的负抽样。相反,我们提出了一种新的具有多采样标准的个性化排名框架(PRFMC),该框架利用地理影响和社会相关性来提高业务流程再造的有效性。特别是,我们应用多中心高斯模型和幂律分布方法,分别在采样负面场所时捕捉地理影响和社会相关性。最后,我们利用Yelp、Gowalla和Brightkite三个大型LBSNs数据集进行了综合实验。实验结果表明,我们提出的PRFMC框架融合地理影响和社会相关性的有效性,以及与基于bpr和其他类似排名方法相比的优越性。事实上,我们的PRFMC方法比最近提出的仅从社会联系中识别负面场所的方法在MRR方面提高了37%。
{"title":"A Personalised Ranking Framework with Multiple Sampling Criteria for Venue Recommendation","authors":"Jarana Manotumruksa, C. Macdonald, I. Ounis","doi":"10.1145/3132847.3132985","DOIUrl":"https://doi.org/10.1145/3132847.3132985","url":null,"abstract":"Recommending a ranked list of interesting venues to users based on their preferences has become a key functionality in Location-Based Social Networks (LBSNs) such as Yelp and Gowalla. Bayesian Personalised Ranking (BPR) is a popular pairwise recommendation technique that is used to generate the ranked list of venues of interest to a user, by leveraging the user's implicit feedback such as their check-ins as instances of positive feedback, while randomly sampling other venues as negative instances. To alleviate the sparsity that affects the usefulness of recommendations by BPR for users with few check-ins, various approaches have been proposed in the literature to incorporate additional sources of information such as the social links between users, the textual content of comments, as well as the geographical location of the venues. However, such approaches can only readily leverage one source of additional information for negative sampling. Instead, we propose a novel Personalised Ranking Framework with Multiple sampling Criteria (PRFMC) that leverages both geographical influence and social correlation to enhance the effectiveness of BPR. In particular, we apply a multi-centre Gaussian model and a power-law distribution method, to capture geographical influence and social correlation when sampling negative venues, respectively. Finally, we conduct comprehensive experiments using three large-scale datasets from the Yelp, Gowalla and Brightkite LBSNs. The experimental results demonstrate the effectiveness of fusing both geographical influence and social correlation in our proposed PRFMC framework and its superiority in comparison to BPR-based and other similar ranking approaches. Indeed, our PRFMC approach attains a 37% improvement in MRR over a recently proposed approach that identifies negative venues only from social links.","PeriodicalId":20449,"journal":{"name":"Proceedings of the 2017 ACM on Conference on Information and Knowledge Management","volume":"32 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82022287","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 29
A Compare-Aggregate Model with Dynamic-Clip Attention for Answer Selection 一种具有动态剪辑注意力的答案选择比较-聚合模型
Weijie Bian, Si Li, Zhao Yang, Guang Chen, Zhiqing Lin
Answer selection for question answering is a challenging task, since it requires effective capture of the complex semantic relations between questions and answers. Previous remarkable approaches mainly adopt general Compare-Aggregate framework that performs word-level comparison and aggregation. In this paper, unlike previous Compare-Aggregate models which utilize the traditional attention mechanism to generate corresponding word-level vector before comparison, we propose a novel attention mechanism named Dynamic-Clip Attention which is directly integrated into the Compare-Aggregate framework. Dynamic-Clip Attention focuses on filtering out noise in attention matrix, in order to better mine the semantic relevance of word-level vectors. At the same time, different from previous Compare-Aggregate works which treat answer selection task as a pointwise classification problem, we propose a listwise ranking approach to model this task to learn the relative order of candidate answers. Experiments on TrecQA and WikiQA datasets show that our proposed model achieves the state-of-the-art performance.
问题回答的答案选择是一项具有挑战性的任务,因为它需要有效地捕获问题和答案之间复杂的语义关系。以往的比较方法主要采用通用的Compare-Aggregate框架,进行词级比较和聚合。与以往的比较-聚合模型利用传统的注意力机制在比较前生成相应的词级向量不同,本文提出了一种新的注意力机制,即动态剪辑注意力,该机制直接集成到比较-聚合框架中。动态剪辑关注的重点是滤除注意矩阵中的噪声,以便更好地挖掘词级向量的语义相关性。同时,不同于以往的Compare-Aggregate将答案选择任务视为一个点分类问题,我们提出了一种列表排序的方法来建模该任务,以学习候选答案的相对顺序。在treqa和WikiQA数据集上的实验表明,我们提出的模型达到了最先进的性能。
{"title":"A Compare-Aggregate Model with Dynamic-Clip Attention for Answer Selection","authors":"Weijie Bian, Si Li, Zhao Yang, Guang Chen, Zhiqing Lin","doi":"10.1145/3132847.3133089","DOIUrl":"https://doi.org/10.1145/3132847.3133089","url":null,"abstract":"Answer selection for question answering is a challenging task, since it requires effective capture of the complex semantic relations between questions and answers. Previous remarkable approaches mainly adopt general Compare-Aggregate framework that performs word-level comparison and aggregation. In this paper, unlike previous Compare-Aggregate models which utilize the traditional attention mechanism to generate corresponding word-level vector before comparison, we propose a novel attention mechanism named Dynamic-Clip Attention which is directly integrated into the Compare-Aggregate framework. Dynamic-Clip Attention focuses on filtering out noise in attention matrix, in order to better mine the semantic relevance of word-level vectors. At the same time, different from previous Compare-Aggregate works which treat answer selection task as a pointwise classification problem, we propose a listwise ranking approach to model this task to learn the relative order of candidate answers. Experiments on TrecQA and WikiQA datasets show that our proposed model achieves the state-of-the-art performance.","PeriodicalId":20449,"journal":{"name":"Proceedings of the 2017 ACM on Conference on Information and Knowledge Management","volume":"26 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84699126","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 79
期刊
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management
全部 Appl. Clay Sci. Geochim. Cosmochim. Acta Org. Geochem. Geochem. J. ERN: Other Microeconomics: General Equilibrium & Disequilibrium Models of Financial Markets (Topic) Geochem. Trans. 非金属矿 Chem. Ecol. J. Hydrol. Erziehungswissenschaftliche Revue Engineering Structures and Technologies npj Clim. Atmos. Sci. Engineering Science and Technology, an International Journal Ocean Modell. Environmental Claims Journal EXPERT OPIN DRUG DEL Acta Oceanolog. Sin. Environ. Educ. Res, [Hokkaido igaku zasshi] The Hokkaido journal of medical science Geosci. Front. Conserv. Genet. Resour. Atmos. Chem. Phys. J PALAEONTOL SOC IND Geol. Ore Deposits Environmental Epigenetics 2013 IEEE International Symposium on Hardware-Oriented Security and Trust (HOST) 国际生物医学工程杂志 Bull. Geol. Soc. Den. IZV-PHYS SOLID EART+ Espacio Tiempo y Forma. Serie VI, Geografía Geosci. Model Dev. Energy Systems Astrophys. J. Suppl. Ser. J. Appl. Phys. Ocean Sci. Am. J. Phys. Anthropol. Mineral. Mag. ACTA ORTHOP TRAUMATO Communications Earth & Environment Yan Ke Xue Bao (Hong Kong) Environ. Res. Lett. J. Cosmol. Astropart. Phys. Exp. Parasitol. «Узбекский физический журнал» Archaeol. Anthropol. Sci. ECOL RESTOR Phys. Chem. Miner. GEOTECH LETT 电力系统及其自动化学报 2012 SC Companion: High Performance Computing, Networking Storage and Analysis ENVIRON HEALTH-GLOB Adv. Atmos. Sci. Carbon Balance Manage. Asia-Pac. J. Atmos. Sci. Acta Geochimica Environ. Geochem. Health AAPG Bull. Am. Mineral. ARCHAEOMETRY Environ. Pollut. Bioavailability ACTA GEOL SIN-ENGL ACTA PETROL SIN ACTA GEOL POL Ecol. Processes J. Atmos. Chem. ARCT ANTARCT ALP RES Atmos. Res. ATMOSPHERE-BASEL ENVIRONMENT Environ. Technol. Innovation Environ. Eng. Sci. Adv. Meteorol. Aquat. Geochem. COMP BIOCHEM PHYS C Environ. Toxicol. Pharmacol. Environ. Mol. Mutagen. Acta Geophys. Int. J. Biometeorol. J. Electron. Spectrosc. Relat. Phenom. Annu. Rev. Earth Planet. Sci. ERN: Regulation (IO) (Topic) Geobiology Aust. J. Earth Sci. GEOLOGY Appl. Geochem. Ann. Glaciol. IEEE Trans. Appl. Supercond. Geostand. Geoanal. Res. Ecol. Indic. European Journal of Biological Research Environ. Eng. Manage. J. CRIT REV ENV SCI TEC Eurasian Journal of Emergency Medicine Basin Res. Enzyme Research ACAD EMERG MED Energy Environ. Conserv. Biol. Energy Ecol Environ ACTA ONCOL
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1