ACM Transactions on Information Systems (TOIS)最新文献_第8页

Exploiting Positional Information for Session-Based Recommendation 利用位置信息进行基于会话的推荐

ACM Transactions on Information Systems (TOIS)

Pub Date : 2021-07-02 DOI: 10.1145/3473339

Ruihong Qiu, Zi Huang, Tong Chen, Hongzhi Yin

For present e-commerce platforms, it is important to accurately predict users’ preference for a timely next-item recommendation. To achieve this goal, session-based recommender systems are developed, which are based on a sequence of the most recent user-item interactions to avoid the influence raised from outdated historical records. Although a session can usually reflect a user’s current preference, a local shift of the user’s intention within the session may still exist. Specifically, the interactions that take place in the early positions within a session generally indicate the user’s initial intention, while later interactions are more likely to represent the latest intention. Such positional information has been rarely considered in existing methods, which restricts their ability to capture the significance of interactions at different positions. To thoroughly exploit the positional information within a session, a theoretical framework is developed in this paper to provide an in-depth analysis of the positional information. We formally define the properties of forward-awareness and backward-awareness to evaluate the ability of positional encoding schemes in capturing the initial and the latest intention. According to our analysis, existing positional encoding schemes are generally forward-aware only, which can hardly represent the dynamics of the intention in a session. To enhance the positional encoding scheme for the session-based recommendation, a dual positional encoding (DPE) is proposed to account for both forward-awareness and backward-awareness. Based on DPE, we propose a novel Positional Recommender (PosRec) model with a well-designed Position-aware Gated Graph Neural Network module to fully exploit the positional information for session-based recommendation tasks. Extensive experiments are conducted on two e-commerce benchmark datasets, Yoochoose and Diginetica and the experimental results show the superiority of the PosRec by comparing it with the state-of-the-art session-based recommender models.

对于目前的电子商务平台来说，准确预测用户对下一项商品的偏好是非常重要的。为了实现这一目标，基于会话的推荐系统被开发出来，它基于最近的用户-项目交互序列，以避免过时的历史记录带来的影响。虽然会话通常可以反映用户当前的偏好，但会话中仍然可能存在用户意图的局部变化。具体来说，在会话的早期位置发生的交互通常表明用户的初始意图，而后期的交互更可能代表最新的意图。这些位置信息在现有的方法中很少被考虑，这限制了它们捕捉不同位置相互作用的重要性的能力。为了充分利用会话中的位置信息，本文建立了一个理论框架，对会话中的位置信息进行了深入的分析。我们正式定义了前向意识和后向意识的性质，以评估位置编码方案在捕获初始意图和最新意图方面的能力。根据我们的分析，现有的位置编码方案一般都是前向感知的，很难表征会话中意图的动态。为了改进基于会话推荐的位置编码方案，提出了一种考虑前向感知和后向感知的双位置编码(DPE)。基于DPE，我们提出了一种新的位置推荐器(PosRec)模型，该模型具有良好的位置感知门控图神经网络模块，可以充分利用位置信息进行基于会话的推荐任务。在Yoochoose和Diginetica两个电子商务基准数据集上进行了大量的实验，实验结果表明，与最先进的基于会话的推荐模型相比，PosRec具有优势。

{"title":"Exploiting Positional Information for Session-Based Recommendation","authors":"Ruihong Qiu, Zi Huang, Tong Chen, Hongzhi Yin","doi":"10.1145/3473339","DOIUrl":"https://doi.org/10.1145/3473339","url":null,"abstract":"For present e-commerce platforms, it is important to accurately predict users’ preference for a timely next-item recommendation. To achieve this goal, session-based recommender systems are developed, which are based on a sequence of the most recent user-item interactions to avoid the influence raised from outdated historical records. Although a session can usually reflect a user’s current preference, a local shift of the user’s intention within the session may still exist. Specifically, the interactions that take place in the early positions within a session generally indicate the user’s initial intention, while later interactions are more likely to represent the latest intention. Such positional information has been rarely considered in existing methods, which restricts their ability to capture the significance of interactions at different positions. To thoroughly exploit the positional information within a session, a theoretical framework is developed in this paper to provide an in-depth analysis of the positional information. We formally define the properties of forward-awareness and backward-awareness to evaluate the ability of positional encoding schemes in capturing the initial and the latest intention. According to our analysis, existing positional encoding schemes are generally forward-aware only, which can hardly represent the dynamics of the intention in a session. To enhance the positional encoding scheme for the session-based recommendation, a dual positional encoding (DPE) is proposed to account for both forward-awareness and backward-awareness. Based on DPE, we propose a novel Positional Recommender (PosRec) model with a well-designed Position-aware Gated Graph Neural Network module to fully exploit the positional information for session-based recommendation tasks. Extensive experiments are conducted on two e-commerce benchmark datasets, Yoochoose and Diginetica and the experimental results show the superiority of the PosRec by comparing it with the state-of-the-art session-based recommender models.","PeriodicalId":6934,"journal":{"name":"ACM Transactions on Information Systems (TOIS)","volume":"1 1","pages":"1 - 24"},"PeriodicalIF":0.0,"publicationDate":"2021-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90242387","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 19

What and How long: Prediction of Mobile App Engagement 预测手机应用用户粘性

ACM Transactions on Information Systems (TOIS)

Pub Date : 2021-06-02 DOI: 10.1145/3464301

Yuan Tian, Keren Zhou, D. Pelleg

User engagement is crucial to the long-term success of a mobile app. Several metrics, such as dwell time, have been used for measuring user engagement. However, how to effectively predict user engagement in the context of mobile apps is still an open research question. For example, do the mobile usage contexts (e.g., time of day) in which users access mobile apps impact their dwell time? Answers to such questions could help mobile operating system and publishers to optimize advertising and service placement. In this article, we first conduct an empirical study for assessing how user characteristics, temporal features, and the short/long-term contexts contribute to gains in predicting users’ app dwell time on the population level. The comprehensive analysis is conducted on large app usage logs collected through a mobile advertising company. The dataset covers more than 12K anonymous users and 1.3 million log events. Based on the analysis, we further investigate a novel mobile app engagement prediction problem—can we predict simultaneously what app the user will use next and how long he/she will stay on that app? We propose several strategies for this joint prediction problem and demonstrate that our model can improve the performance significantly when compared with the state-of-the-art baselines. Our work can help mobile system developers in designing a better and more engagement-aware mobile app user experience.

用户粘性对于手机应用的长期成功至关重要。停留时间等多个指标可用于衡量用户粘性。然而，如何在移动应用的背景下有效地预测用户粘性仍然是一个有待研究的问题。例如，用户访问移动应用程序的移动使用环境(例如，一天中的时间)是否会影响他们的停留时间?这些问题的答案可以帮助手机操作系统和发行商优化广告和服务布局。在本文中，我们首先进行了一项实证研究，以评估用户特征、时间特征和短期/长期背景如何有助于预测用户在总体水平上的应用停留时间。综合分析通过移动广告公司收集的大量应用使用日志。该数据集涵盖了超过12K个匿名用户和130万个日志事件。在此基础上，我们进一步研究了一个新的手机应用粘性预测问题——我们能否同时预测用户接下来会使用什么应用，以及他/她会在该应用上停留多久?我们为这个联合预测问题提出了几种策略，并证明与最先进的基线相比，我们的模型可以显着提高性能。我们的工作可以帮助移动系统开发者设计出更好的、更具参与性的移动应用用户体验。

{"title":"What and How long: Prediction of Mobile App Engagement","authors":"Yuan Tian, Keren Zhou, D. Pelleg","doi":"10.1145/3464301","DOIUrl":"https://doi.org/10.1145/3464301","url":null,"abstract":"User engagement is crucial to the long-term success of a mobile app. Several metrics, such as dwell time, have been used for measuring user engagement. However, how to effectively predict user engagement in the context of mobile apps is still an open research question. For example, do the mobile usage contexts (e.g., time of day) in which users access mobile apps impact their dwell time? Answers to such questions could help mobile operating system and publishers to optimize advertising and service placement. In this article, we first conduct an empirical study for assessing how user characteristics, temporal features, and the short/long-term contexts contribute to gains in predicting users’ app dwell time on the population level. The comprehensive analysis is conducted on large app usage logs collected through a mobile advertising company. The dataset covers more than 12K anonymous users and 1.3 million log events. Based on the analysis, we further investigate a novel mobile app engagement prediction problem—can we predict simultaneously what app the user will use next and how long he/she will stay on that app? We propose several strategies for this joint prediction problem and demonstrate that our model can improve the performance significantly when compared with the state-of-the-art baselines. Our work can help mobile system developers in designing a better and more engagement-aware mobile app user experience.","PeriodicalId":6934,"journal":{"name":"ACM Transactions on Information Systems (TOIS)","volume":"21 1","pages":"1 - 38"},"PeriodicalIF":0.0,"publicationDate":"2021-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83280048","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 7

Popularity Bias in False-positive Metrics for Recommender Systems Evaluation 推荐系统评价中假阳性指标中的人气偏差

ACM Transactions on Information Systems (TOIS)

Pub Date : 2021-05-22 DOI: 10.1145/3452740

Elisa Mena-Maldonado, Rocío Cañamares, P. Castells, Yongli Ren, M. Sanderson

We investigate the impact of popularity bias in false-positive metrics in the offline evaluation of recommender systems. Unlike their true-positive complements, false-positive metrics reward systems that minimize recommendations disliked by users. Our analysis is, to the best of our knowledge, the first to show that false-positive metrics tend to penalise popular items, the opposite behavior of true-positive metrics—causing a disagreement trend between both types of metrics in the presence of popularity biases. We present a theoretical analysis of the metrics that identifies the reason that the metrics disagree and determines rare situations where the metrics might agree—the key to the situation lies in the relationship between popularity and relevance distributions, in terms of their agreement and steepness—two fundamental concepts we formalize. We then examine three well-known datasets using multiple popular true- and false-positive metrics on 16 recommendation algorithms. Specific datasets are chosen to allow us to estimate both biased and unbiased metric values. The results of the empirical study confirm and illustrate our analytical findings. With the conditions of the disagreement of the two types of metrics established, we then determine under which circumstances true-positive or false-positive metrics should be used by researchers of offline evaluation in recommender systems.1

我们研究了在推荐系统的离线评价中假阳性指标中的流行偏差的影响。与它们的真阳性补充不同，假阳性指标奖励系统将用户不喜欢的推荐最小化。据我们所知，我们的分析首次表明，假阳性指标倾向于惩罚受欢迎的项目，这与真阳性指标的行为相反——在受欢迎程度偏差的存在下，这两种指标之间出现了不一致的趋势。我们对指标进行了理论分析，确定了指标不一致的原因，并确定了指标可能一致的罕见情况——这种情况的关键在于流行度和相关性分布之间的关系，就它们的一致性和陡峭性而言——这是我们形式化的两个基本概念。然后，我们在16种推荐算法上使用多个流行的真阳性和假阳性指标来检查三个众所周知的数据集。选择特定的数据集，使我们能够估计有偏和无偏度量值。实证研究的结果证实并说明了我们的分析结果。在确定了两种度量标准不一致的条件下，我们确定了推荐系统中离线评价研究人员在哪些情况下应该使用真阳性或假阳性度量标准

{"title":"Popularity Bias in False-positive Metrics for Recommender Systems Evaluation","authors":"Elisa Mena-Maldonado, Rocío Cañamares, P. Castells, Yongli Ren, M. Sanderson","doi":"10.1145/3452740","DOIUrl":"https://doi.org/10.1145/3452740","url":null,"abstract":"We investigate the impact of popularity bias in false-positive metrics in the offline evaluation of recommender systems. Unlike their true-positive complements, false-positive metrics reward systems that minimize recommendations disliked by users. Our analysis is, to the best of our knowledge, the first to show that false-positive metrics tend to penalise popular items, the opposite behavior of true-positive metrics—causing a disagreement trend between both types of metrics in the presence of popularity biases. We present a theoretical analysis of the metrics that identifies the reason that the metrics disagree and determines rare situations where the metrics might agree—the key to the situation lies in the relationship between popularity and relevance distributions, in terms of their agreement and steepness—two fundamental concepts we formalize. We then examine three well-known datasets using multiple popular true- and false-positive metrics on 16 recommendation algorithms. Specific datasets are chosen to allow us to estimate both biased and unbiased metric values. The results of the empirical study confirm and illustrate our analytical findings. With the conditions of the disagreement of the two types of metrics established, we then determine under which circumstances true-positive or false-positive metrics should be used by researchers of offline evaluation in recommender systems.1","PeriodicalId":6934,"journal":{"name":"ACM Transactions on Information Systems (TOIS)","volume":"6 1","pages":"1 - 43"},"PeriodicalIF":0.0,"publicationDate":"2021-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86645591","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 16

Inductive Contextual Relation Learning for Personalization 个性化的归纳语境关系学习

ACM Transactions on Information Systems (TOIS)

Pub Date : 2021-05-22 DOI: 10.1145/3450353

Chuxu Zhang, Huaxiu Yao, Lu Yu, Chao Huang, Dongjin Song, Haifeng Chen, Meng Jiang, N. Chawla

Web personalization, e.g., recommendation or relevance search, tailoring a service/product to accommodate specific online users, is becoming increasingly important. Inductive personalization aims to infer the relations between existing entities and unseen new ones, e.g., searching relevant authors for new papers or recommending new items to users. This problem, however, is challenging since most of recent studies focus on transductive problem for existing entities. In addition, despite some inductive learning approaches have been introduced recently, their performance is sub-optimal due to relatively simple and inflexible architectures for aggregating entity’s content. To this end, we propose the inductive contextual personalization (ICP) framework through contextual relation learning. Specifically, we first formulate the pairwise relations between entities with a ranking optimization scheme that employs neural aggregator to fuse entity’s heterogeneous contents. Next, we introduce a node embedding term to capture entity’s contextual relations, as a smoothness constraint over the prior ranking objective. Finally, the gradient descent procedure with adaptive negative sampling is employed to learn the model parameters. The learned model is capable of inferring the relations between existing entities and inductive ones. Thorough experiments demonstrate that ICP outperforms numerous baseline methods for two different applications, i.e., relevant author search and new item recommendation.

网络个性化，例如，推荐或相关搜索，定制服务/产品以适应特定的在线用户，正变得越来越重要。归纳个性化旨在推断现有实体和未见的新实体之间的关系，例如，为新论文搜索相关作者或向用户推荐新项目。然而，这个问题具有挑战性，因为最近的研究大多集中在现有实体的转导问题上。此外，尽管最近引入了一些归纳学习方法，但由于聚合实体内容的体系结构相对简单和不灵活，它们的性能不是最优的。为此，我们提出了基于语境关系学习的归纳语境个性化(ICP)框架。具体来说，我们首先用一种排序优化方案来表达实体之间的两两关系，该方案利用神经聚合器来融合实体的异构内容。接下来，我们引入一个节点嵌入项来捕获实体的上下文关系，作为对先前排序目标的平滑约束。最后，采用自适应负采样梯度下降法学习模型参数。学习的模型能够推断现有实体和归纳实体之间的关系。深入的实验表明，在相关作者搜索和新项目推荐两种不同的应用中，ICP优于许多基线方法。

{"title":"Inductive Contextual Relation Learning for Personalization","authors":"Chuxu Zhang, Huaxiu Yao, Lu Yu, Chao Huang, Dongjin Song, Haifeng Chen, Meng Jiang, N. Chawla","doi":"10.1145/3450353","DOIUrl":"https://doi.org/10.1145/3450353","url":null,"abstract":"Web personalization, e.g., recommendation or relevance search, tailoring a service/product to accommodate specific online users, is becoming increasingly important. Inductive personalization aims to infer the relations between existing entities and unseen new ones, e.g., searching relevant authors for new papers or recommending new items to users. This problem, however, is challenging since most of recent studies focus on transductive problem for existing entities. In addition, despite some inductive learning approaches have been introduced recently, their performance is sub-optimal due to relatively simple and inflexible architectures for aggregating entity’s content. To this end, we propose the inductive contextual personalization (ICP) framework through contextual relation learning. Specifically, we first formulate the pairwise relations between entities with a ranking optimization scheme that employs neural aggregator to fuse entity’s heterogeneous contents. Next, we introduce a node embedding term to capture entity’s contextual relations, as a smoothness constraint over the prior ranking objective. Finally, the gradient descent procedure with adaptive negative sampling is employed to learn the model parameters. The learned model is capable of inferring the relations between existing entities and inductive ones. Thorough experiments demonstrate that ICP outperforms numerous baseline methods for two different applications, i.e., relevant author search and new item recommendation.","PeriodicalId":6934,"journal":{"name":"ACM Transactions on Information Systems (TOIS)","volume":"24 1","pages":"1 - 22"},"PeriodicalIF":0.0,"publicationDate":"2021-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78714374","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 5

Exploiting Real-time Search Engine Queries for Earthquake Detection: A Summary of Results 利用实时搜索引擎查询地震探测:结果摘要

ACM Transactions on Information Systems (TOIS)

Pub Date : 2021-05-22 DOI: 10.1145/3453842

Qi Zhang, Hengshu Zhu, Qi Liu, Enhong Chen, Hui Xiong

Online search engine has been widely regarded as the most convenient approach for information acquisition. Indeed, the intensive information-seeking behaviors of search engine users make it possible to exploit search engine queries as effective “crowd sensors” for event monitoring. While some researchers have investigated the feasibility of using search engine queries for coarse-grained event analysis, the capability of search engine queries for real-time event detection has been largely neglected. To this end, in this article, we introduce a large-scale and systematic study on exploiting real-time search engine queries for outbreak event detection, with a focus on earthquake rapid reporting. In particular, we propose a realistic system of real-time earthquake detection through monitoring millions of queries related to earthquakes from a dominant online search engine in China. Specifically, we first investigate a large set of queries for selecting the representative queries that are highly correlated with the outbreak of earthquakes. Then, based on the real-time streams of selected queries, we design a novel machine learning–enhanced two-stage burst detection approach for detecting earthquake events. Meanwhile, the location of an earthquake epicenter can be accurately estimated based on the spatial-temporal distribution of search engine queries. Finally, through the extensive comparison with earthquake catalogs from China Earthquake Networks Center, 2015, the detection precision of our system can achieve 87.9%, and the accuracy of location estimation (province level) is 95.7%. In particular, 50% of successfully detected results can be found within 62 s after earthquake, and 50% of successful locations can be found within 25.5 km of seismic epicenter. Our system also found more than 23.3% extra earthquakes that were felt by people but not publicly released, 12.1% earthquake-like special outbreaks, and meanwhile, revealed many interesting findings, such as the typical query patterns of earthquake rumor and regular memorial events. Based on these results, our system can timely feed back information to the search engine users according to various cases and accelerate the information release of felt earthquakes.

在线搜索引擎已被广泛认为是最方便的信息获取方式。事实上，搜索引擎用户密集的信息寻求行为使得利用搜索引擎查询作为有效的“人群传感器”来监控事件成为可能。虽然一些研究人员已经研究了使用搜索引擎查询进行粗粒度事件分析的可行性，但搜索引擎查询进行实时事件检测的能力在很大程度上被忽视了。为此，在本文中，我们介绍了一项大规模、系统的研究，利用实时搜索引擎查询来检测爆发事件，重点是地震快速报告。特别是，我们提出了一个现实的实时地震检测系统，通过监测数以百万计的查询与地震有关的在线搜索引擎在中国占主导地位。具体来说，我们首先调查了一大批查询，以选择与地震爆发高度相关的代表性查询。然后，基于所选查询的实时流，我们设计了一种新的机器学习增强的两阶段突发检测方法来检测地震事件。同时，可以根据搜索引擎查询的时空分布准确估计地震震中的位置。最后，通过与中国地震台网中心2015年地震台刊的广泛对比，系统的探测精度可达到87.9%，定位精度(省级)达到95.7%。特别是，50%的成功探测结果可以在地震后62秒内找到，50%的成功探测位置可以在震中25.5公里内找到。我们的系统还发现了超过23.3%的人感觉到但未公开发布的额外地震，12.1%的类似地震的特殊爆发，同时还发现了许多有趣的发现，例如地震谣言的典型查询模式和定期纪念活动。基于这些结果，我们的系统可以根据各种情况及时向搜索引擎用户反馈信息，加快震感信息的发布。

{"title":"Exploiting Real-time Search Engine Queries for Earthquake Detection: A Summary of Results","authors":"Qi Zhang, Hengshu Zhu, Qi Liu, Enhong Chen, Hui Xiong","doi":"10.1145/3453842","DOIUrl":"https://doi.org/10.1145/3453842","url":null,"abstract":"Online search engine has been widely regarded as the most convenient approach for information acquisition. Indeed, the intensive information-seeking behaviors of search engine users make it possible to exploit search engine queries as effective “crowd sensors” for event monitoring. While some researchers have investigated the feasibility of using search engine queries for coarse-grained event analysis, the capability of search engine queries for real-time event detection has been largely neglected. To this end, in this article, we introduce a large-scale and systematic study on exploiting real-time search engine queries for outbreak event detection, with a focus on earthquake rapid reporting. In particular, we propose a realistic system of real-time earthquake detection through monitoring millions of queries related to earthquakes from a dominant online search engine in China. Specifically, we first investigate a large set of queries for selecting the representative queries that are highly correlated with the outbreak of earthquakes. Then, based on the real-time streams of selected queries, we design a novel machine learning–enhanced two-stage burst detection approach for detecting earthquake events. Meanwhile, the location of an earthquake epicenter can be accurately estimated based on the spatial-temporal distribution of search engine queries. Finally, through the extensive comparison with earthquake catalogs from China Earthquake Networks Center, 2015, the detection precision of our system can achieve 87.9%, and the accuracy of location estimation (province level) is 95.7%. In particular, 50% of successfully detected results can be found within 62 s after earthquake, and 50% of successful locations can be found within 25.5 km of seismic epicenter. Our system also found more than 23.3% extra earthquakes that were felt by people but not publicly released, 12.1% earthquake-like special outbreaks, and meanwhile, revealed many interesting findings, such as the typical query patterns of earthquake rumor and regular memorial events. Based on these results, our system can timely feed back information to the search engine users according to various cases and accelerate the information release of felt earthquakes.","PeriodicalId":6934,"journal":{"name":"ACM Transactions on Information Systems (TOIS)","volume":"31 1","pages":"1 - 32"},"PeriodicalIF":0.0,"publicationDate":"2021-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75577573","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

Modeling Multiple Coexisting Category-Level Intentions for Next Item Recommendation 为下一个项目推荐建立多个共存的类别级意向模型

ACM Transactions on Information Systems (TOIS)

Pub Date : 2021-05-06 DOI: 10.1145/3441642

Yanan Xu, Yanmin Zhu, Jiadi Yu

Purchase intentions have a great impact on future purchases and thus can be exploited for making recommendations. However, purchase intentions are typically complex and may change from time to time. Through empirical study with two e-commerce datasets, we observe that behaviors of multiple types can indicate user intentions and a user may have multiple coexisting category-level intentions that evolve over time. In this article, we propose a novel Intention-Aware Recommender System (IARS) which consists of four components for mining such complex intentions from user behaviors of multiple types. In the first component, we utilize several Recurrent Neural Networks (RNNs) and an attention layer to model diverse user intentions simultaneously and design two kinds of Multi-behavior GRU (MGRU) cells to deal with heterogeneous behaviors. To reveal user intentions, we carefully design three tasks that share representations from MGRUs. The next-item recommendation is the main task and leverages attention to select user intentions according to candidate items. The remaining two (item prediction and sequence comparison) are auxiliary tasks and can reveal user intentions. Extensive experiments on the two real-world datasets demonstrate the effectiveness of our models compared with several state-of-the-art recommendation methods in terms of hit ratio and NDCG.

购买意向对未来的购买有很大的影响，因此可以用来做推荐。然而，购买意图通常是复杂的，可能会不时发生变化。通过对两个电子商务数据集的实证研究，我们发现多种类型的行为可以表明用户的意图，用户可能有多个共存的类别级意图，这些意图会随着时间的推移而演变。在本文中，我们提出了一种新的意图感知推荐系统(IARS)，该系统由四个组件组成，用于从多种类型的用户行为中挖掘复杂的意图。在第一部分中，我们利用几个递归神经网络(rnn)和一个注意层同时建模不同的用户意图，并设计了两种多行为GRU (MGRU)单元来处理异构行为。为了揭示用户意图，我们精心设计了三个任务，它们共享来自mgru的表示。下一项推荐是主要任务，它利用注意力根据候选项选择用户意图。剩下的两个(项目预测和序列比较)是辅助任务，可以揭示用户的意图。在两个真实数据集上进行的大量实验表明，与几种最先进的推荐方法相比，我们的模型在命中率和NDCG方面是有效的。

{"title":"Modeling Multiple Coexisting Category-Level Intentions for Next Item Recommendation","authors":"Yanan Xu, Yanmin Zhu, Jiadi Yu","doi":"10.1145/3441642","DOIUrl":"https://doi.org/10.1145/3441642","url":null,"abstract":"Purchase intentions have a great impact on future purchases and thus can be exploited for making recommendations. However, purchase intentions are typically complex and may change from time to time. Through empirical study with two e-commerce datasets, we observe that behaviors of multiple types can indicate user intentions and a user may have multiple coexisting category-level intentions that evolve over time. In this article, we propose a novel Intention-Aware Recommender System (IARS) which consists of four components for mining such complex intentions from user behaviors of multiple types. In the first component, we utilize several Recurrent Neural Networks (RNNs) and an attention layer to model diverse user intentions simultaneously and design two kinds of Multi-behavior GRU (MGRU) cells to deal with heterogeneous behaviors. To reveal user intentions, we carefully design three tasks that share representations from MGRUs. The next-item recommendation is the main task and leverages attention to select user intentions according to candidate items. The remaining two (item prediction and sequence comparison) are auxiliary tasks and can reveal user intentions. Extensive experiments on the two real-world datasets demonstrate the effectiveness of our models compared with several state-of-the-art recommendation methods in terms of hit ratio and NDCG.","PeriodicalId":6934,"journal":{"name":"ACM Transactions on Information Systems (TOIS)","volume":"37 1","pages":"1 - 24"},"PeriodicalIF":0.0,"publicationDate":"2021-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81650560","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 7

HGAT: Heterogeneous Graph Attention Networks for Semi-supervised Short Text Classification 半监督短文本分类的异构图注意网络

ACM Transactions on Information Systems (TOIS)

Pub Date : 2021-05-06 DOI: 10.1145/3450352

Tianchi Yang, Linmei Hu, C. Shi, Houye Ji, Xiaoli Li, Liqiang Nie

Short text classification has been widely explored in news tagging to provide more efficient search strategies and more effective search results for information retrieval. However, most existing studies, concentrating on long text classification, deliver unsatisfactory performance on short texts due to the sparsity issue and the insufficiency of labeled data. In this article, we propose a novel heterogeneous graph neural network-based method for semi-supervised short text classification, leveraging full advantage of limited labeled data and large unlabeled data through information propagation along the graph. Specifically, we first present a flexible heterogeneous information network (HIN) framework for modeling short texts, which can integrate any type of additional information and meanwhile capture their relations to address the semantic sparsity. Then, we propose Heterogeneous Graph Attention networks (HGAT) to embed the HIN for short text classification based on a dual-level attention mechanism, including node-level and type-level attentions. To efficiently classify new coming texts that do not previously exist in the HIN, we extend our model HGAT for inductive learning, avoiding re-training the model on the evolving HIN. Extensive experiments on single-/multi-label classification demonstrates that our proposed model HGAT significantly outperforms state-of-the-art methods across the benchmark datasets under both transductive and inductive learning.

短文本分类在新闻标注中得到了广泛的探索，为信息检索提供更高效的搜索策略和更有效的搜索结果。然而，大多数现有的研究都集中在长文本分类上，由于稀疏性问题和标记数据的不足，对短文本的分类效果并不理想。本文提出了一种基于异构图神经网络的半监督短文本分类方法，通过信息沿图传播，充分利用有限标记数据和大量未标记数据的优势。具体来说，我们首先提出了一个灵活的异构信息网络(HIN)框架，该框架可以集成任何类型的附加信息，同时捕获它们之间的关系以解决语义稀疏性问题。在此基础上，我们提出了基于节点级和类型级两种注意机制的异构图注意网络(HGAT)来嵌入HIN进行短文本分类。为了有效地对HIN中以前不存在的新文本进行分类，我们将模型HGAT扩展为归纳学习，避免了在不断发展的HIN上重新训练模型。在单/多标签分类上的大量实验表明，我们提出的HGAT模型在传导学习和归纳学习的基准数据集上都明显优于最先进的方法。

{"title":"HGAT: Heterogeneous Graph Attention Networks for Semi-supervised Short Text Classification","authors":"Tianchi Yang, Linmei Hu, C. Shi, Houye Ji, Xiaoli Li, Liqiang Nie","doi":"10.1145/3450352","DOIUrl":"https://doi.org/10.1145/3450352","url":null,"abstract":"Short text classification has been widely explored in news tagging to provide more efficient search strategies and more effective search results for information retrieval. However, most existing studies, concentrating on long text classification, deliver unsatisfactory performance on short texts due to the sparsity issue and the insufficiency of labeled data. In this article, we propose a novel heterogeneous graph neural network-based method for semi-supervised short text classification, leveraging full advantage of limited labeled data and large unlabeled data through information propagation along the graph. Specifically, we first present a flexible heterogeneous information network (HIN) framework for modeling short texts, which can integrate any type of additional information and meanwhile capture their relations to address the semantic sparsity. Then, we propose Heterogeneous Graph Attention networks (HGAT) to embed the HIN for short text classification based on a dual-level attention mechanism, including node-level and type-level attentions. To efficiently classify new coming texts that do not previously exist in the HIN, we extend our model HGAT for inductive learning, avoiding re-training the model on the evolving HIN. Extensive experiments on single-/multi-label classification demonstrates that our proposed model HGAT significantly outperforms state-of-the-art methods across the benchmark datasets under both transductive and inductive learning.","PeriodicalId":6934,"journal":{"name":"ACM Transactions on Information Systems (TOIS)","volume":"1 1","pages":"1 - 29"},"PeriodicalIF":0.0,"publicationDate":"2021-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78570339","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 65

RLPS: A Reinforcement Learning–Based Framework for Personalized Search RLPS:基于强化学习的个性化搜索框架

ACM Transactions on Information Systems (TOIS)

Pub Date : 2021-05-06 DOI: 10.1145/3446617

Jing Yao, Zhicheng Dou, Jun Xu, Jirong Wen

Personalized search is a promising way to improve search qualities by taking user interests into consideration. Recently, machine learning and deep learning techniques have been successfully applied to search result personalization. Most existing models simply regard the personal search history as a static set of user behaviors and learn fixed ranking strategies based on all the recorded data. Though improvements have been achieved, the essence that the search process is a sequence of interactions between the search engine and user is ignored. The user’s interests may dynamically change during the search process, therefore, it would be more helpful if a personalized search model could track the whole interaction process and adjust its ranking strategy continuously. In this article, we adapt reinforcement learning to personalized search and propose a framework, referred to as RLPS. It utilizes a Markov Decision Process (MDP) to track sequential interactions between the user and search engine, and continuously update the underlying personalized ranking model with the user’s real-time feedback to learn the user’s dynamic interests. Within this framework, we implement two models: the listwise RLPS-L and the hierarchical RLPS-H. RLPS-L interacts with users and trains the ranking model with document lists, while RLPS-H improves model training by designing a layered structure and introducing document pairs. In addition, we also design a feedback-aware personalized ranking component to capture the user’s feedback, which impacts the user interest profile for the next query. Significant improvements over existing personalized search models are observed in the experiments on the public AOL search log and a commercial log.

个性化搜索是一种很有前途的方法，可以通过考虑用户的兴趣来提高搜索质量。近年来，机器学习和深度学习技术已成功应用于搜索结果个性化。现有的大多数模型只是简单地将个人搜索历史视为一组静态的用户行为，并根据所有记录的数据学习固定的排名策略。虽然已经取得了改进，但搜索过程的本质是搜索引擎和用户之间的一系列交互，这一点被忽视了。在搜索过程中，用户的兴趣可能会发生动态变化，因此，个性化搜索模型如果能够跟踪整个交互过程并不断调整其排名策略，将会更有帮助。在本文中，我们将强化学习应用于个性化搜索，并提出了一个框架，称为RLPS。它利用马尔可夫决策过程(Markov Decision Process, MDP)跟踪用户与搜索引擎之间的顺序交互，并利用用户的实时反馈不断更新底层个性化排名模型，以了解用户的动态兴趣。在这个框架内，我们实现了两个模型:列表式RLPS-L和分层式RLPS-H。RLPS-L与用户交互，使用文档列表训练排名模型，而RLPS-H通过设计分层结构和引入文档对来改进模型训练。此外，我们还设计了一个反馈感知的个性化排名组件来捕获用户的反馈，这些反馈会影响下一个查询的用户兴趣配置文件。在公共AOL搜索日志和商业日志上的实验中，可以观察到对现有个性化搜索模型的显著改进。

{"title":"RLPS: A Reinforcement Learning–Based Framework for Personalized Search","authors":"Jing Yao, Zhicheng Dou, Jun Xu, Jirong Wen","doi":"10.1145/3446617","DOIUrl":"https://doi.org/10.1145/3446617","url":null,"abstract":"Personalized search is a promising way to improve search qualities by taking user interests into consideration. Recently, machine learning and deep learning techniques have been successfully applied to search result personalization. Most existing models simply regard the personal search history as a static set of user behaviors and learn fixed ranking strategies based on all the recorded data. Though improvements have been achieved, the essence that the search process is a sequence of interactions between the search engine and user is ignored. The user’s interests may dynamically change during the search process, therefore, it would be more helpful if a personalized search model could track the whole interaction process and adjust its ranking strategy continuously. In this article, we adapt reinforcement learning to personalized search and propose a framework, referred to as RLPS. It utilizes a Markov Decision Process (MDP) to track sequential interactions between the user and search engine, and continuously update the underlying personalized ranking model with the user’s real-time feedback to learn the user’s dynamic interests. Within this framework, we implement two models: the listwise RLPS-L and the hierarchical RLPS-H. RLPS-L interacts with users and trains the ranking model with document lists, while RLPS-H improves model training by designing a layered structure and introducing document pairs. In addition, we also design a feedback-aware personalized ranking component to capture the user’s feedback, which impacts the user interest profile for the next query. Significant improvements over existing personalized search models are observed in the experiments on the public AOL search log and a commercial log.","PeriodicalId":6934,"journal":{"name":"ACM Transactions on Information Systems (TOIS)","volume":"24 1","pages":"1 - 29"},"PeriodicalIF":0.0,"publicationDate":"2021-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76764037","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

A Hybrid Framework for Session Context Modeling 会话上下文建模的混合框架

ACM Transactions on Information Systems (TOIS)

Pub Date : 2021-05-06 DOI: 10.1145/3448127

Chenjia, Jiaxin Mao, LiuYiqun, YeZiyi, MaWeizhi, wangchao, Zhangmin, MaShaoping

Understanding user intent is essential for various retrieval tasks. By leveraging contextual information within sessions, e.g., query history and user click behaviors, search systems can capture user intent more accurately and thus perform better. However, most existing systems only consider intra-session contexts and may suffer from the problem of lacking contextual information, because short search sessions account for a large proportion in practical scenarios. We believe that in these scenarios, considering more contexts, e.g., cross-session dependencies, may help alleviate the problem and contribute to better performance. Therefore, we propose a novel Hybrid framework for Session Context Modeling (HSCM), which realizes session-level multi-task learning based on the self-attention mechanism. To alleviate the problem of lacking contextual information within current sessions, HSCM exploits the cross-session contexts by sampling user interactions under similar search intents in the historical sessions and further aggregating them into the local contexts. Besides, application of the self-attention mechanism rather than RNN-based frameworks in modeling session-level sequences also helps (1) better capture interactions within sessions, (2) represent the session contexts in parallelization. Experimental results on two practical search datasets show that HSCM not only outperforms strong baseline solutions such as HiNT, CARS, and BERTserini in document ranking, but also performs significantly better than most existing query suggestion methods. According to the results in an additional experiment, we have also found that HSCM is superior to most ranking models in click prediction.

理解用户意图对于各种检索任务至关重要。通过利用会话中的上下文信息，例如查询历史和用户点击行为，搜索系统可以更准确地捕获用户意图，从而更好地执行。然而，大多数现有系统只考虑会话内上下文，并且可能存在缺乏上下文信息的问题，因为在实际场景中，短搜索会话占很大比例。我们相信，在这些情况下，考虑更多的上下文，例如，跨会话依赖关系，可能有助于缓解问题，并有助于提高性能。为此，我们提出了一种新的会话上下文建模(HSCM)混合框架，该框架基于自注意机制实现会话级多任务学习。为了缓解当前会话中缺乏上下文信息的问题，HSCM通过对历史会话中相似搜索意图下的用户交互进行采样，并进一步将其聚合到本地上下文中，从而利用跨会话上下文。此外，在会话级序列建模中应用自注意机制而不是基于rnn的框架也有助于(1)更好地捕获会话内的交互，(2)并行化表示会话上下文。在两个实际搜索数据集上的实验结果表明，HSCM不仅在文档排序方面优于HiNT、CARS和BERTserini等强基线解决方案，而且显著优于大多数现有的查询建议方法。根据另一个实验的结果，我们还发现HSCM在点击预测方面优于大多数排名模型。

{"title":"A Hybrid Framework for Session Context Modeling","authors":"Chenjia, Jiaxin Mao, LiuYiqun, YeZiyi, MaWeizhi, wangchao, Zhangmin, MaShaoping","doi":"10.1145/3448127","DOIUrl":"https://doi.org/10.1145/3448127","url":null,"abstract":"Understanding user intent is essential for various retrieval tasks. By leveraging contextual information within sessions, e.g., query history and user click behaviors, search systems can capture user intent more accurately and thus perform better. However, most existing systems only consider intra-session contexts and may suffer from the problem of lacking contextual information, because short search sessions account for a large proportion in practical scenarios. We believe that in these scenarios, considering more contexts, e.g., cross-session dependencies, may help alleviate the problem and contribute to better performance. Therefore, we propose a novel Hybrid framework for Session Context Modeling (HSCM), which realizes session-level multi-task learning based on the self-attention mechanism. To alleviate the problem of lacking contextual information within current sessions, HSCM exploits the cross-session contexts by sampling user interactions under similar search intents in the historical sessions and further aggregating them into the local contexts. Besides, application of the self-attention mechanism rather than RNN-based frameworks in modeling session-level sequences also helps (1) better capture interactions within sessions, (2) represent the session contexts in parallelization. Experimental results on two practical search datasets show that HSCM not only outperforms strong baseline solutions such as HiNT, CARS, and BERTserini in document ranking, but also performs significantly better than most existing query suggestion methods. According to the results in an additional experiment, we have also found that HSCM is superior to most ranking models in click prediction.","PeriodicalId":6934,"journal":{"name":"ACM Transactions on Information Systems (TOIS)","volume":"30 1","pages":"1 - 35"},"PeriodicalIF":0.0,"publicationDate":"2021-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80948937","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 6

DGeye: Probabilistic Risk Perception and Prediction for Urban Dangerous Goods Management 城市危险货物管理的概率风险感知与预测

ACM Transactions on Information Systems (TOIS)

Pub Date : 2021-05-06 DOI: 10.1145/3448256

Jingyuan Wang, Xin Lin, Y. Zuo, Junjie Wu

Recent years have witnessed the emergence of worldwide megalopolises and the accompanying public safety events, making urban safety a top priority in modern urban management. Among various threats, dangerous goods such as gas and hazardous chemicals transported through cities have bred repeated tragedies and become the deadly “bomb” we sleep with every day. While tremendous research efforts have been devoted to dealing with dangerous goods transportation (DGT) issues, further study is still in great need to quantify this problem and explore its intrinsic dynamics from a big data perspective. In this article, we present a novel system called DGeye, to feature a fusion between DGT trajectory data and residential population data for dangers perception and prediction. Specifically, DGeye first develops a probabilistic graphical model-based approach to mine spatio-temporally adjacent risk patterns from population-aware risk trajectories. Then, DGeye builds the novel causality network among risk patterns for risk pain-point identification, risk source attribution, and online risky state prediction. Experiments on both Beijing and Tianjin cities demonstrate the effectiveness of DGeye in real-life DGT risk management. As a case in point, our report powered by DGeye successfully drove the government to lay down gas pipelines for the famous Guijie food street in Beijing.

近年来，世界性特大城市的出现和随之而来的公共安全事件，使城市安全成为现代城市管理的重中之重。在各种威胁中，通过城市运输的天然气和危险化学品等危险货物一再酿成悲剧，成为我们每天睡觉的致命“炸弹”。尽管对危险品运输问题的研究已经投入了大量的精力，但从大数据的角度对危险品运输问题进行量化，探索其内在动态，仍需要进一步的研究。在本文中，我们提出了一个新的系统，称为geye，其特点是融合DGT轨迹数据和居住人口数据，用于危险感知和预测。具体来说，geye首先开发了一种基于概率图形模型的方法，从人口感知风险轨迹中挖掘时空相邻风险模式。然后，在风险模式之间构建新的因果关系网络，用于风险痛点识别、风险源归因和在线风险状态预测。在北京和天津的实验证明了geye在实际DGT风险管理中的有效性。一个很好的例子是，我们的报告成功地推动了政府为北京著名的簋街美食街铺设天然气管道。

{"title":"DGeye: Probabilistic Risk Perception and Prediction for Urban Dangerous Goods Management","authors":"Jingyuan Wang, Xin Lin, Y. Zuo, Junjie Wu","doi":"10.1145/3448256","DOIUrl":"https://doi.org/10.1145/3448256","url":null,"abstract":"Recent years have witnessed the emergence of worldwide megalopolises and the accompanying public safety events, making urban safety a top priority in modern urban management. Among various threats, dangerous goods such as gas and hazardous chemicals transported through cities have bred repeated tragedies and become the deadly “bomb” we sleep with every day. While tremendous research efforts have been devoted to dealing with dangerous goods transportation (DGT) issues, further study is still in great need to quantify this problem and explore its intrinsic dynamics from a big data perspective. In this article, we present a novel system called DGeye, to feature a fusion between DGT trajectory data and residential population data for dangers perception and prediction. Specifically, DGeye first develops a probabilistic graphical model-based approach to mine spatio-temporally adjacent risk patterns from population-aware risk trajectories. Then, DGeye builds the novel causality network among risk patterns for risk pain-point identification, risk source attribution, and online risky state prediction. Experiments on both Beijing and Tianjin cities demonstrate the effectiveness of DGeye in real-life DGT risk management. As a case in point, our report powered by DGeye successfully drove the government to lay down gas pipelines for the famous Guijie food street in Beijing.","PeriodicalId":6934,"journal":{"name":"ACM Transactions on Information Systems (TOIS)","volume":"46 1","pages":"1 - 30"},"PeriodicalIF":0.0,"publicationDate":"2021-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89951433","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 5