首页 > 最新文献

Proceedings of the 30th ACM International Conference on Information & Knowledge Management最新文献

英文 中文
librec-auto: A Tool for Recommender Systems Experimentation librec-auto:一个推荐系统实验的工具
Nasim Sonboli, M. Mansoury, Ziyue Guo, Shreyas Kadekodi, Weiwen Liu, Zijun Liu, Andrew Schwartz, R. Burke
Recommender systems are complex. They integrate the individual needs of users with the characteristics of particular domains of application which may span items from large and potentially heterogeneous collections. Extensive experimentation is required to understand the multidimensional properties of recommendation algorithms and the fit between algorithm and application. librec-auto is a tool that automates many aspects of off-line batch recommender system experimentation. It has a large library of state-of-the-art and historical recommendation algorithms and a wide variety of evaluation metrics. It further supports the study of diversity and fairness in recommendation through the integration of re-ranking algorithms and fairness-aware metrics. It supports declarative configuration for reproducible experiment management and supports multiple forms of hyper-parameter optimization.
推荐系统是复杂的。它们将用户的个人需求与特定应用程序领域的特征集成在一起,这些领域可能跨越大型且可能异构的集合。为了理解推荐算法的多维特性以及算法与应用之间的匹配,需要进行大量的实验。Librec-auto是一个工具,它可以自动化离线批量推荐系统实验的许多方面。它有一个大型的最先进和历史推荐算法库,以及各种各样的评估指标。通过整合重新排序算法和公平感知指标,进一步支持了推荐多样性和公平性的研究。它支持可重复实验管理的声明式配置,并支持多种形式的超参数优化。
{"title":"librec-auto: A Tool for Recommender Systems Experimentation","authors":"Nasim Sonboli, M. Mansoury, Ziyue Guo, Shreyas Kadekodi, Weiwen Liu, Zijun Liu, Andrew Schwartz, R. Burke","doi":"10.1145/3459637.3482006","DOIUrl":"https://doi.org/10.1145/3459637.3482006","url":null,"abstract":"Recommender systems are complex. They integrate the individual needs of users with the characteristics of particular domains of application which may span items from large and potentially heterogeneous collections. Extensive experimentation is required to understand the multidimensional properties of recommendation algorithms and the fit between algorithm and application. librec-auto is a tool that automates many aspects of off-line batch recommender system experimentation. It has a large library of state-of-the-art and historical recommendation algorithms and a wide variety of evaluation metrics. It further supports the study of diversity and fairness in recommendation through the integration of re-ranking algorithms and fairness-aware metrics. It supports declarative configuration for reproducible experiment management and supports multiple forms of hyper-parameter optimization.","PeriodicalId":405296,"journal":{"name":"Proceedings of the 30th ACM International Conference on Information & Knowledge Management","volume":"173 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116942449","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
How to Leverage a Multi-layered Transformer Language Model for Text Clustering: an Ensemble Approach 如何利用多层转换语言模型进行文本聚类:一种集成方法
Mira Ait-Saada, François Role, M. Nadif
Pre-trained Transformer-based word embeddings are now widely used in text mining where they are known to significantly improve supervised tasks such as text classification, named entity recognition and question answering. Since the Transformer models create several different embeddings for the same input, one at each layer of their architecture, various studies have already tried to identify those of these embeddings that most contribute to the success of the above-mentioned tasks. In contrast the same performance analysis has not yet been carried out in the unsupervised setting. In this paper we evaluate the effectiveness of Transformer models on the important task of text clustering. In particular, we present a clustering ensemble approach that harnesses all the network's layers. Numerical experiments carried out on real datasets with different Transformer models show the effectiveness of the proposed method compared to several baselines.
基于预训练的transformer的词嵌入现在广泛应用于文本挖掘中,已知它们可以显着改善文本分类,命名实体识别和问答等监督任务。由于Transformer模型为相同的输入创建了几个不同的嵌入,在其体系结构的每一层都创建了一个嵌入,因此各种研究已经试图确定这些嵌入中最有助于上述任务成功的那些。相比之下,在无监督设置中尚未进行相同的性能分析。本文评估了Transformer模型在文本聚类这一重要任务上的有效性。特别地,我们提出了一种利用所有网络层的聚类集成方法。在不同Transformer模型的实际数据集上进行的数值实验表明,与几种基线相比,该方法是有效的。
{"title":"How to Leverage a Multi-layered Transformer Language Model for Text Clustering: an Ensemble Approach","authors":"Mira Ait-Saada, François Role, M. Nadif","doi":"10.1145/3459637.3482121","DOIUrl":"https://doi.org/10.1145/3459637.3482121","url":null,"abstract":"Pre-trained Transformer-based word embeddings are now widely used in text mining where they are known to significantly improve supervised tasks such as text classification, named entity recognition and question answering. Since the Transformer models create several different embeddings for the same input, one at each layer of their architecture, various studies have already tried to identify those of these embeddings that most contribute to the success of the above-mentioned tasks. In contrast the same performance analysis has not yet been carried out in the unsupervised setting. In this paper we evaluate the effectiveness of Transformer models on the important task of text clustering. In particular, we present a clustering ensemble approach that harnesses all the network's layers. Numerical experiments carried out on real datasets with different Transformer models show the effectiveness of the proposed method compared to several baselines.","PeriodicalId":405296,"journal":{"name":"Proceedings of the 30th ACM International Conference on Information & Knowledge Management","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117198564","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
SNPR SNPR
Mingwei Zhang, Yang Yang, Rizwan Abbas, Ke Deng, Jianxin Li, Bin Zhang
Next Point-of-Interest (POI) recommendation plays an important role in location-based services. The state-of-the-art methods utilize recurrent neural networks (RNNs) to model users' check-in sequences and have shown promising results. However, they tend to recommend POIs similar to those that the user has often visited. As a result, users become bored with obvious recommendations. To address this issue, we propose Serendipity-oriented Next POI Recommendation model (SNPR), a supervised multi-task learning problem, with objective to recommend unexpected and relevant POIs only. To this end, we define the quantitativeserendipity as a trade-off ofrelevance andunexpectedness in the context of next POI recommendation, and design a dedicated neural network with Transformer to capture complex interdependencies between POIs in user's check-in sequence. Extensive experimental results show that our model can improverelevance significantly while theunexpectedness outperforms the state-of-the-art serendipity-oriented recommendation methods.
{"title":"SNPR","authors":"Mingwei Zhang, Yang Yang, Rizwan Abbas, Ke Deng, Jianxin Li, Bin Zhang","doi":"10.1145/3459637.3482394","DOIUrl":"https://doi.org/10.1145/3459637.3482394","url":null,"abstract":"Next Point-of-Interest (POI) recommendation plays an important role in location-based services. The state-of-the-art methods utilize recurrent neural networks (RNNs) to model users' check-in sequences and have shown promising results. However, they tend to recommend POIs similar to those that the user has often visited. As a result, users become bored with obvious recommendations. To address this issue, we propose Serendipity-oriented Next POI Recommendation model (SNPR), a supervised multi-task learning problem, with objective to recommend unexpected and relevant POIs only. To this end, we define the quantitativeserendipity as a trade-off ofrelevance andunexpectedness in the context of next POI recommendation, and design a dedicated neural network with Transformer to capture complex interdependencies between POIs in user's check-in sequence. Extensive experimental results show that our model can improverelevance significantly while theunexpectedness outperforms the state-of-the-art serendipity-oriented recommendation methods.","PeriodicalId":405296,"journal":{"name":"Proceedings of the 30th ACM International Conference on Information & Knowledge Management","volume":"55 8","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120921861","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Vandalism Detection in OpenStreetMap via User Embeddings 基于用户嵌入的OpenStreetMap故意破坏检测
Yinxiao Li, T. J. Anderson, Yiqi Niu
OpenStreetMap (OSM) is a free and openly-editable database of geographic information. Over the years, OSM has evolved into the world's largest open knowledge base of geospatial data, and protecting OSM from the risk of vandalized and falsified information has become paramount to ensuring its continued success. However, despite the increasing usage of OSM and a wide interest in vandalism detection on open knowledge bases such as Wikipedia and Wikidata, OSM has not attracted as much attention from the research community, partially due to a lack of publicly available vandalism corpus. In this paper, we report on the construction of the first OSM vandalism corpus, and release it publicly. We describe a user embedding approach to create OSM user embeddings and add embedding features to a machine learning model to improve vandalism detection in OSM. We validate the model against our vandalism corpus, and observe solid improvements in key metrics. The validated model is deployed into production for vandalism detection on Daylight Map.
OpenStreetMap (OSM)是一个免费且开放编辑的地理信息数据库。多年来,OSM已发展成为世界上最大的地理空间数据开放知识库,保护OSM免受破坏和伪造信息的风险已成为确保其持续成功的最重要因素。然而,尽管OSM的使用越来越多,并且对维基百科和维基数据等开放知识库的破坏检测产生了广泛的兴趣,但OSM并没有引起研究界的太多关注,部分原因是缺乏公开的破坏语料库。在本文中,我们报告了第一个OSM故意破坏语料库的构建,并向公众发布。我们描述了一种用户嵌入方法来创建OSM用户嵌入,并将嵌入特征添加到机器学习模型中,以改进OSM中的破坏检测。我们根据我们的破坏语料库验证模型,并观察到关键指标的坚实改进。经过验证的模型已投入生产,用于日光地图上的破坏检测。
{"title":"Vandalism Detection in OpenStreetMap via User Embeddings","authors":"Yinxiao Li, T. J. Anderson, Yiqi Niu","doi":"10.1145/3459637.3482213","DOIUrl":"https://doi.org/10.1145/3459637.3482213","url":null,"abstract":"OpenStreetMap (OSM) is a free and openly-editable database of geographic information. Over the years, OSM has evolved into the world's largest open knowledge base of geospatial data, and protecting OSM from the risk of vandalized and falsified information has become paramount to ensuring its continued success. However, despite the increasing usage of OSM and a wide interest in vandalism detection on open knowledge bases such as Wikipedia and Wikidata, OSM has not attracted as much attention from the research community, partially due to a lack of publicly available vandalism corpus. In this paper, we report on the construction of the first OSM vandalism corpus, and release it publicly. We describe a user embedding approach to create OSM user embeddings and add embedding features to a machine learning model to improve vandalism detection in OSM. We validate the model against our vandalism corpus, and observe solid improvements in key metrics. The validated model is deployed into production for vandalism detection on Daylight Map.","PeriodicalId":405296,"journal":{"name":"Proceedings of the 30th ACM International Conference on Information & Knowledge Management","volume":"117 11","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120967726","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Trilateral Spatiotemporal Attention Network for User Behavior Modeling in Location-based Search 基于位置搜索的用户行为建模的三边时空注意网络
Yi Qi, Ke Hu, Bo Zhang, Jia Cheng, Jun Lei
In location-based search, user's click behavior is naturally bonded with trilateral spatiotemporal information, i.e., the locations of historical user requests, the locations of corresponding clicked items and the occurring time of historical clicks. Appropriate modeling of the trilateral spatiotemporal user click behavior sequence is key to the success of any location-based search service. Though abundant and helpful, existing user behavior modeling methods are insufficient for modeling the rich patterns in trilateral spatiotemporal sequence in that they ignore the interplay among request's geo- graphic information, item's geographic information and the click time. In this work, we study the user behavior modeling problem in location-based search systematically. We propose TRISAN, short for Trilateral Spatiotemporal Attention Network, a novel attention- based neural model that incorporates temporal relatedness into both the modeling of item's geographic closeness and the modeling of request's geographic closeness through a fusion mechanism. In addition, we propose to model the geographic closeness both by distance and by semantic similarity. Extensive experiments demonstrate that the proposed method outperforms existing methods by a large margin and every part of our modeling strategy contributes to its final success.
在基于位置的搜索中,用户的点击行为自然地与三边时空信息绑定在一起,即历史用户请求的位置、相应点击项的位置和历史点击发生的时间。适当的三边时空用户点击行为序列建模是任何基于位置的搜索服务成功的关键。现有的用户行为建模方法虽然丰富且有用,但由于忽略了请求的地理信息、项目的地理信息和点击时间三者之间的相互作用,对三边时空序列中丰富的模式建模存在不足。本文系统地研究了基于位置的搜索中的用户行为建模问题。TRISAN是三边时空注意网络(Trilateral Spatiotemporal Attention Network)的缩写,它是一种基于注意力的神经网络模型,通过融合机制将时间相关性融入到物品地理亲密度的建模和请求地理亲密度的建模中。此外,我们建议通过距离和语义相似度来建立地理接近度模型。大量的实验表明,所提出的方法大大优于现有的方法,并且我们的建模策略的每个部分都有助于其最终的成功。
{"title":"Trilateral Spatiotemporal Attention Network for User Behavior Modeling in Location-based Search","authors":"Yi Qi, Ke Hu, Bo Zhang, Jia Cheng, Jun Lei","doi":"10.1145/3459637.3482206","DOIUrl":"https://doi.org/10.1145/3459637.3482206","url":null,"abstract":"In location-based search, user's click behavior is naturally bonded with trilateral spatiotemporal information, i.e., the locations of historical user requests, the locations of corresponding clicked items and the occurring time of historical clicks. Appropriate modeling of the trilateral spatiotemporal user click behavior sequence is key to the success of any location-based search service. Though abundant and helpful, existing user behavior modeling methods are insufficient for modeling the rich patterns in trilateral spatiotemporal sequence in that they ignore the interplay among request's geo- graphic information, item's geographic information and the click time. In this work, we study the user behavior modeling problem in location-based search systematically. We propose TRISAN, short for Trilateral Spatiotemporal Attention Network, a novel attention- based neural model that incorporates temporal relatedness into both the modeling of item's geographic closeness and the modeling of request's geographic closeness through a fusion mechanism. In addition, we propose to model the geographic closeness both by distance and by semantic similarity. Extensive experiments demonstrate that the proposed method outperforms existing methods by a large margin and every part of our modeling strategy contributes to its final success.","PeriodicalId":405296,"journal":{"name":"Proceedings of the 30th ACM International Conference on Information & Knowledge Management","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124937437","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Fast k-NN Graph Construction by GPU based NN-Descent 基于GPU的快速k-NN图构建
Hui Wang, Wanlei Zhao, Xiangxiang Zeng, Jianye Yang
NN-Descent is a classic k-NN graph construction approach. It is still widely employed in machine learning, computer vision, and information retrieval tasks due to its efficiency and genericness. However, the current design only works well on CPU. In this paper, NN-Descent has been redesigned to adapt to the GPU architecture. A new graph update strategy called selective update is proposed. It reduces the data exchange between GPU cores and GPU global memory significantly, which is the processing bottleneck under GPU computation architecture. This redesign leads to full exploitation of the parallelism of the GPU hardware. In the meantime, the genericness, as well as the simplicity of NN-Descent, are well-preserved. Moreover, a procedure that allows to k-NN graph to be merged efficiently on GPU is proposed. It makes the construction of high-quality k-NN graphs for out-of-GPU-memory datasets tractable. Our approach is 100-250× faster than the single-thread NN-Descent and is 2.5-5× faster than the existing GPU-based approaches as we tested on million as well as billion scale datasets.
nn下降是一种经典的k-NN图构造方法。由于它的高效性和通用性,在机器学习、计算机视觉和信息检索任务中仍被广泛应用。然而,目前的设计只适用于CPU。本文对NN-Descent进行了重新设计,以适应GPU架构。提出了一种新的图更新策略——选择性更新。它显著减少了GPU内核与GPU全局内存之间的数据交换,这是GPU计算架构下的处理瓶颈。这种重新设计可以充分利用GPU硬件的并行性。同时,保留了神经网络下降法的通用性和简洁性。此外,还提出了一种在GPU上对k-NN图进行有效合并的方法。它使得构建高质量的k-NN图的gpu内存外的数据集易于处理。我们的方法比单线程NN-Descent快100-250倍,比现有的基于gpu的方法快2.5-5倍,我们在百万和十亿规模的数据集上进行了测试。
{"title":"Fast k-NN Graph Construction by GPU based NN-Descent","authors":"Hui Wang, Wanlei Zhao, Xiangxiang Zeng, Jianye Yang","doi":"10.1145/3459637.3482344","DOIUrl":"https://doi.org/10.1145/3459637.3482344","url":null,"abstract":"NN-Descent is a classic k-NN graph construction approach. It is still widely employed in machine learning, computer vision, and information retrieval tasks due to its efficiency and genericness. However, the current design only works well on CPU. In this paper, NN-Descent has been redesigned to adapt to the GPU architecture. A new graph update strategy called selective update is proposed. It reduces the data exchange between GPU cores and GPU global memory significantly, which is the processing bottleneck under GPU computation architecture. This redesign leads to full exploitation of the parallelism of the GPU hardware. In the meantime, the genericness, as well as the simplicity of NN-Descent, are well-preserved. Moreover, a procedure that allows to k-NN graph to be merged efficiently on GPU is proposed. It makes the construction of high-quality k-NN graphs for out-of-GPU-memory datasets tractable. Our approach is 100-250× faster than the single-thread NN-Descent and is 2.5-5× faster than the existing GPU-based approaches as we tested on million as well as billion scale datasets.","PeriodicalId":405296,"journal":{"name":"Proceedings of the 30th ACM International Conference on Information & Knowledge Management","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125164448","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
ScarceGAN
S. Chakrabarty, Rukma Talwadker, Tridib Mukherjee
This paper introduces ScarceGAN which focuses on identification of extremely rare or scarce samples from multi-dimensional longitudinal telemetry data with small and weak label prior. We specifically address: (i) severe scarcity in positive class, stemming from both underlying organic skew in the data, as well as extremely limited labels; (ii) multi-class nature of the negative samples, with uneven density distributions and partially overlapping feature distributions; and (iii) massively unlabelled data leading to tiny and weak prior on both positive and negative classes, and possibility of unseen or unknown behavior in the unlabelled set, especially in the negative class. Although related to PU learning problems, we contend that knowledge (or lack of it) on the negative class can be leveraged to learn the compliment of it (i.e., the positive class) better in a semi-supervised manner. To this effect, ScarceGAN re-formulates semi-supervised GAN by accommodating weakly labelled multi- class negative samples and the available positive samples. It relaxes the supervised discriminator's constraint on exact differentiation be- tween negative samples by introducing a 'leeway' term for samples with noisy prior. We propose modifications to the cost objectives of discriminator, in supervised and unsupervised path as well as that of the generator. For identifying risky players in skill gaming, this formulation in whole gives us a recall of over 85% (~60% jump over vanilla semi-supervised GAN) on our scarce class with very minimal verbosity in the unknown space. Further ScarceGAN out- performs the recall benchmarks established by recent GAN based specialized models for the positive imbalanced class identification and establishes a new benchmark in identifying one of rare attack classes (0.09%) in the intrusion dataset from the KDDCUP99 challenge. We establish ScarceGAN to be one of new competitive benchmark frameworks in the rare class identification for longitudinal telemetry data.
{"title":"ScarceGAN","authors":"S. Chakrabarty, Rukma Talwadker, Tridib Mukherjee","doi":"10.1145/3459637.3482474","DOIUrl":"https://doi.org/10.1145/3459637.3482474","url":null,"abstract":"This paper introduces ScarceGAN which focuses on identification of extremely rare or scarce samples from multi-dimensional longitudinal telemetry data with small and weak label prior. We specifically address: (i) severe scarcity in positive class, stemming from both underlying organic skew in the data, as well as extremely limited labels; (ii) multi-class nature of the negative samples, with uneven density distributions and partially overlapping feature distributions; and (iii) massively unlabelled data leading to tiny and weak prior on both positive and negative classes, and possibility of unseen or unknown behavior in the unlabelled set, especially in the negative class. Although related to PU learning problems, we contend that knowledge (or lack of it) on the negative class can be leveraged to learn the compliment of it (i.e., the positive class) better in a semi-supervised manner. To this effect, ScarceGAN re-formulates semi-supervised GAN by accommodating weakly labelled multi- class negative samples and the available positive samples. It relaxes the supervised discriminator's constraint on exact differentiation be- tween negative samples by introducing a 'leeway' term for samples with noisy prior. We propose modifications to the cost objectives of discriminator, in supervised and unsupervised path as well as that of the generator. For identifying risky players in skill gaming, this formulation in whole gives us a recall of over 85% (~60% jump over vanilla semi-supervised GAN) on our scarce class with very minimal verbosity in the unknown space. Further ScarceGAN out- performs the recall benchmarks established by recent GAN based specialized models for the positive imbalanced class identification and establishes a new benchmark in identifying one of rare attack classes (0.09%) in the intrusion dataset from the KDDCUP99 challenge. We establish ScarceGAN to be one of new competitive benchmark frameworks in the rare class identification for longitudinal telemetry data.","PeriodicalId":405296,"journal":{"name":"Proceedings of the 30th ACM International Conference on Information & Knowledge Management","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123675514","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Knowledge Graph Representation Learning as Groupoid: Unifying TransE, RotatE, QuatE, ComplEx Groupoid的知识图表示学习:统一TransE, RotatE, QuatE, ComplEx
Han Yang, Junfei Liu
Knowledge graph (KG) representation learning which aims to encode entities and relations into low-dimensional spaces, has been widely used in KG completion and link prediction. Although existing KG representation learning models have shown promising performance, the theoretical mechanism behind existing models is much less well-understood. It is challenging to accurately portray the internal connections between models and build a competitive model systematically. To overcome this problem, a unified KG representation learning framework, called GrpKG, is proposed in this paper to model the KG representation learning from a generic groupoid perspective. We discover that many existing models are essentially the same in the sense of groupoid isomorphism and further provide transformation methods between different models. Moreover, we explore the applications of GrpKG in the model classification as well as other processes. The experiments on several benchmark data sets validate the effectiveness and superiority of our framework by comparing two proposed models (GrpQ8 and GrpM2) with the state-of-the-art models.
知识图表示学习旨在将实体和关系编码到低维空间中,已广泛应用于知识图补全和链接预测。虽然现有的KG表示学习模型已经显示出良好的性能,但现有模型背后的理论机制却知之甚少。准确地描绘模型之间的内在联系,系统地构建竞争模型是一项挑战。为了克服这个问题,本文提出了一个统一的KG表示学习框架GrpKG,从一般类群的角度对KG表示学习进行建模。我们发现许多现有模型在类群同构意义上本质上是相同的,并进一步提供了不同模型之间的转换方法。此外,我们还探索了GrpKG在模型分类以及其他过程中的应用。在几个基准数据集上的实验通过将两个模型(GrpQ8和GrpM2)与最先进的模型进行比较,验证了我们框架的有效性和优越性。
{"title":"Knowledge Graph Representation Learning as Groupoid: Unifying TransE, RotatE, QuatE, ComplEx","authors":"Han Yang, Junfei Liu","doi":"10.1145/3459637.3482442","DOIUrl":"https://doi.org/10.1145/3459637.3482442","url":null,"abstract":"Knowledge graph (KG) representation learning which aims to encode entities and relations into low-dimensional spaces, has been widely used in KG completion and link prediction. Although existing KG representation learning models have shown promising performance, the theoretical mechanism behind existing models is much less well-understood. It is challenging to accurately portray the internal connections between models and build a competitive model systematically. To overcome this problem, a unified KG representation learning framework, called GrpKG, is proposed in this paper to model the KG representation learning from a generic groupoid perspective. We discover that many existing models are essentially the same in the sense of groupoid isomorphism and further provide transformation methods between different models. Moreover, we explore the applications of GrpKG in the model classification as well as other processes. The experiments on several benchmark data sets validate the effectiveness and superiority of our framework by comparing two proposed models (GrpQ8 and GrpM2) with the state-of-the-art models.","PeriodicalId":405296,"journal":{"name":"Proceedings of the 30th ACM International Conference on Information & Knowledge Management","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125400071","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Self-Supervised Learning on Users' Spontaneous Behaviors for Multi-Scenario Ranking in E-commerce 电子商务多场景排序中用户自发行为的自监督学习
Yulong Gu, Wentian Bao, Dan Ou, Xiang Li, Baoliang Cui, Biyu Ma, Haikuan Huang, Qingwen Liu, Xiaoyi Zeng
Multi-scenario Learning to Rank is essential for Recommender Systems, Search Engines and Online Advertising in e-commerce portals where the ranking models are usually applied in many scenarios. However, existing works mainly focus on learning the ranking model for a single scenario, and pay less attention to learning ranking models for multiple scenarios. We identify two practical challenges in industrial multi-scenario ranking systems: (1) The Feedback Loop problem that the model is always trained on the items chosen by the ranker itself. (2) Insufficient training data for small and new scenarios. To address the above issues, we present ZEUS, a novel framework that learns a Zoo of ranking modEls for mUltiple Scenarios based on pre-training on users' spontaneous behaviors (e.g. queries which are directly searched in the search box and not recommended by the ranking system). ZEUS decomposes the training process into two stages: self-supervised learning based pre-training and fine-tuning. Firstly, ZEUS performs self-supervised learning on users' spontaneous behaviors and generates a pre-trained model. Secondly, ZEUS fine-tunes the pre-trained model on users' implicit feedback in multiple scenarios. Extensive experiments on Alibaba's production dataset demonstrate the effectiveness of ZEUS, which significantly outperforms state-of-the-art methods. ZEUS averagely achieves 6.0%, 9.7%, 11.7% improvement in CTR, CVR and GMV respectively than state-of-the-art method.
多场景学习排名对于推荐系统、搜索引擎和电子商务门户网站的在线广告至关重要,其中排名模型通常应用于许多场景。然而,现有的工作主要集中在单一场景的排名模型学习上,而对多场景的排名模型学习关注较少。我们确定了工业多场景排名系统中的两个实际挑战:(1)反馈回路问题,即模型总是在排名者自己选择的项目上进行训练。(2)小场景和新场景的训练数据不足。为了解决上述问题,我们提出了ZEUS,这是一个新的框架,它基于对用户自发行为(例如,在搜索框中直接搜索的查询,而不是排名系统推荐的查询)的预训练,学习了多种场景的排名模型。ZEUS将训练过程分解为两个阶段:基于预训练的自监督学习和微调。首先,ZEUS对用户的自发行为进行自监督学习,生成预训练模型。其次,ZEUS根据用户在多个场景下的隐式反馈对预训练模型进行微调。在阿里巴巴生产数据集上的大量实验证明了ZEUS的有效性,它明显优于最先进的方法。与最先进的方法相比,ZEUS的CTR、CVR和GMV平均分别提高了6.0%、9.7%和11.7%。
{"title":"Self-Supervised Learning on Users' Spontaneous Behaviors for Multi-Scenario Ranking in E-commerce","authors":"Yulong Gu, Wentian Bao, Dan Ou, Xiang Li, Baoliang Cui, Biyu Ma, Haikuan Huang, Qingwen Liu, Xiaoyi Zeng","doi":"10.1145/3459637.3481953","DOIUrl":"https://doi.org/10.1145/3459637.3481953","url":null,"abstract":"Multi-scenario Learning to Rank is essential for Recommender Systems, Search Engines and Online Advertising in e-commerce portals where the ranking models are usually applied in many scenarios. However, existing works mainly focus on learning the ranking model for a single scenario, and pay less attention to learning ranking models for multiple scenarios. We identify two practical challenges in industrial multi-scenario ranking systems: (1) The Feedback Loop problem that the model is always trained on the items chosen by the ranker itself. (2) Insufficient training data for small and new scenarios. To address the above issues, we present ZEUS, a novel framework that learns a Zoo of ranking modEls for mUltiple Scenarios based on pre-training on users' spontaneous behaviors (e.g. queries which are directly searched in the search box and not recommended by the ranking system). ZEUS decomposes the training process into two stages: self-supervised learning based pre-training and fine-tuning. Firstly, ZEUS performs self-supervised learning on users' spontaneous behaviors and generates a pre-trained model. Secondly, ZEUS fine-tunes the pre-trained model on users' implicit feedback in multiple scenarios. Extensive experiments on Alibaba's production dataset demonstrate the effectiveness of ZEUS, which significantly outperforms state-of-the-art methods. ZEUS averagely achieves 6.0%, 9.7%, 11.7% improvement in CTR, CVR and GMV respectively than state-of-the-art method.","PeriodicalId":405296,"journal":{"name":"Proceedings of the 30th ACM International Conference on Information & Knowledge Management","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126807648","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
RCES RCES
Wei Li, Rishi Choudhary, A. Younus, Bruno Ohana, N. Baker, B. Leen, M. A. Qureshi
To assist the COVID-19 focused researchers in life science and healthcare in understanding the pandemic, we present an exploratory information retrieval system called RCES. The system employs a previously developed EVE (Explainable Vector-based Embedding) model using DBpedia and an adopted model using MeSH taxonomies to exploit concept relations related to COVID-19. Various expansion methods are also developed, along with explanations and facets that collectively form rapid cues for a valuable navigational and informed user experience.
{"title":"RCES","authors":"Wei Li, Rishi Choudhary, A. Younus, Bruno Ohana, N. Baker, B. Leen, M. A. Qureshi","doi":"10.1145/3459637.3481990","DOIUrl":"https://doi.org/10.1145/3459637.3481990","url":null,"abstract":"To assist the COVID-19 focused researchers in life science and healthcare in understanding the pandemic, we present an exploratory information retrieval system called RCES. The system employs a previously developed EVE (Explainable Vector-based Embedding) model using DBpedia and an adopted model using MeSH taxonomies to exploit concept relations related to COVID-19. Various expansion methods are also developed, along with explanations and facets that collectively form rapid cues for a valuable navigational and informed user experience.","PeriodicalId":405296,"journal":{"name":"Proceedings of the 30th ACM International Conference on Information & Knowledge Management","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115045766","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Proceedings of the 30th ACM International Conference on Information & Knowledge Management
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1