ACM Transactions on the Web最新文献_第9页

Learning Neighbor User Intention on User-Item Interaction Graphs for Better Sequential Recommendation 学习用户-项目交互图上的邻居用户意图以获得更好的顺序推荐

IF 3.5 4区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

ACM Transactions on the Web

Pub Date : 2023-02-01 DOI: 10.1145/3580520

Mei Yu, Kun Zhu, Mankun Zhao, Jian Yu, Tianyi Xu, Di Jin, Xuewei Li, Ruiguo Yu

The task of Sequential Recommendation aims to predict the user’s preference by analyzing the user’s historical behaviours. Existing methods model item transitions through leveraging sequential patterns. However, they mainly consider the target user’s own behaviours and dynamic characteristics, while often ignore the high-order collaborative connections when modelling user preferences. Some recent works try to use graph-based methods to introduce high-order collaborative signals for Sequential Recommendation, but they have two main problems. One is that the sequential patterns cannot be effectively mined, and the other is that their way of introducing high-order collaborative signals is not very suitable for Sequential Recommendation. To address these problems, we propose to fully exploit sequence features and model high-order collaborative signals for Sequential Recommendation. We propose a Neighbor user Intention based Sequential Recommender, namely NISRec, which utilizes the intentions of high-order connected neighbor users as high-order collaborative signals, in order to improve recommendation performance for the target user. To be specific, NISRec contains two main modules: the neighbor user intention embedding module (NIE) and the fusion module. The NIE describes both the long-term and the short-term intentions of neighbor users and aggregates them separately. The fusion module uses these two types of aggregated intentions to model high-order collaborative signals in both the embedding process and the user preference modelling phase for recommendation of the target user. Experimental results show that our new approach outperforms the state-of-the-art methods on both sparse and dense datasets. Extensive studies further show the effectiveness of the diverse neighbor intentions introduced by NISRec.

顺序推荐的任务旨在通过分析用户的历史行为来预测用户的偏好。现有方法通过利用顺序模式对项目转换进行建模。然而，他们主要考虑目标用户自身的行为和动态特征，而在建模用户偏好时往往忽略了高阶的协作连接。最近的一些工作试图使用基于图的方法来引入用于序列推荐的高阶协作信号，但它们存在两个主要问题。一个是序列模式无法有效挖掘，另一个是它们引入高阶协同信号的方式不太适合序列推荐。为了解决这些问题，我们建议充分利用序列特征，并为序列推荐建立高阶协作信号模型。我们提出了一种基于邻居用户意图的顺序推荐器，即NISRec，它利用高阶连接邻居用户的意图作为高阶协作信号，以提高对目标用户的推荐性能。具体来说，NISRec包含两个主要模块：邻居用户意图嵌入模块（NIE）和融合模块。NIE描述了邻居用户的长期和短期意图，并分别汇总了它们。融合模块在嵌入过程和用户偏好建模阶段使用这两种类型的聚合意图来对高阶协作信号进行建模，以推荐目标用户。实验结果表明，我们的新方法在稀疏和密集数据集上都优于最先进的方法。广泛的研究进一步证明了NISRec引入的不同邻居意图的有效性。

{"title":"Learning Neighbor User Intention on User-Item Interaction Graphs for Better Sequential Recommendation","authors":"Mei Yu, Kun Zhu, Mankun Zhao, Jian Yu, Tianyi Xu, Di Jin, Xuewei Li, Ruiguo Yu","doi":"10.1145/3580520","DOIUrl":"https://doi.org/10.1145/3580520","url":null,"abstract":"The task of Sequential Recommendation aims to predict the user’s preference by analyzing the user’s historical behaviours. Existing methods model item transitions through leveraging sequential patterns. However, they mainly consider the target user’s own behaviours and dynamic characteristics, while often ignore the high-order collaborative connections when modelling user preferences. Some recent works try to use graph-based methods to introduce high-order collaborative signals for Sequential Recommendation, but they have two main problems. One is that the sequential patterns cannot be effectively mined, and the other is that their way of introducing high-order collaborative signals is not very suitable for Sequential Recommendation. To address these problems, we propose to fully exploit sequence features and model high-order collaborative signals for Sequential Recommendation. We propose a Neighbor user Intention based Sequential Recommender, namely NISRec, which utilizes the intentions of high-order connected neighbor users as high-order collaborative signals, in order to improve recommendation performance for the target user. To be specific, NISRec contains two main modules: the neighbor user intention embedding module (NIE) and the fusion module. The NIE describes both the long-term and the short-term intentions of neighbor users and aggregates them separately. The fusion module uses these two types of aggregated intentions to model high-order collaborative signals in both the embedding process and the user preference modelling phase for recommendation of the target user. Experimental results show that our new approach outperforms the state-of-the-art methods on both sparse and dense datasets. Extensive studies further show the effectiveness of the diverse neighbor intentions introduced by NISRec.","PeriodicalId":50940,"journal":{"name":"ACM Transactions on the Web","volume":" ","pages":""},"PeriodicalIF":3.5,"publicationDate":"2023-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44213438","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Multi-Task Graph Neural Network with Variational Graph Auto-Encoders for Session-Based Travel Packages Recommendation 基于会话的旅行包推荐的变图自编码多任务图神经网络

IF 3.5 4区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

ACM Transactions on the Web

Pub Date : 2023-02-01 DOI: 10.1145/3577032

Guixiang Zhu, Jie Cao, Lei Chen, Youquan Wang, Zhan Bu, Shuxin Yang, Jianqing Wu, Zhiping Wang

Session-based travel packages recommendation aims to predict users’ next click based on their current and historical sessions recorded by Online Travel Agencies (OTAs). Recently, an increasing number of studies attempted to apply Graph Neural Networks (GNNs) to the session-based recommendation and obtained promising results. However, most of them do not take full advantage of the explicit latent structure from attributes of items, making learned representations of items less effective and difficult to interpret. Moreover, they only combine historical sessions (long-term preferences) with a current session (short-term preference) to learn a unified representation of users, ignoring the effects of historical sessions for the current session. To this end, this article proposes a novel session-based model named STR-VGAE, which fills subtasks of the travel packages recommendation and variational graph auto-encoders simultaneously. STR-VGAE mainly consists of three components: travel packages encoder, users behaviors encoder, and interaction modeling. Specifically, the travel packages encoder module is used to learn a unified travel package representation from co-occurrence attribute graphs by using multi-view variational graph auto-encoders and a multi-view attention network. The users behaviors encoder module is used to encode user’ historical and current sessions with a personalized GNN, which considers the effects of historical sessions on the current session, and coalesce these two kinds of session representations to learn the high-quality users’ representations by exploiting a gated fusion approach. The interaction modeling module is used to calculate recommendation scores over all candidate travel packages. Extensive experiments on a real-life tourism e-commerce dataset from China show that STR-VGAE yields significant performance advantages over several competitive methods, meanwhile provides an interpretation for the generated recommendation list.

基于会话的旅游套餐推荐旨在根据在线旅行社(ota)记录的用户当前和历史会话来预测用户的下一次点击。近年来，越来越多的研究尝试将图神经网络(GNNs)应用于基于会话的推荐，并取得了可喜的成果。然而，它们大多没有充分利用项目属性的显性潜在结构，使得学习后的项目表征效果不佳，难以解释。此外，它们只将历史会话(长期首选项)与当前会话(短期首选项)结合起来学习用户的统一表示，而忽略了历史会话对当前会话的影响。为此，本文提出了一种新的基于会话的STR-VGAE模型，该模型同时填充了旅游包推荐和变分图自编码器的子任务。STR-VGAE主要由三个部分组成:旅行包编码器、用户行为编码器和交互建模。其中，旅行包编码器模块利用多视图变分图自编码器和多视图关注网络，从共现属性图中学习统一的旅行包表示。用户行为编码器模块使用个性化的GNN对用户的历史会话和当前会话进行编码，该GNN考虑了历史会话对当前会话的影响，并利用门控融合方法将这两种会话表示合并以学习高质量的用户表示。交互建模模块用于计算所有候选旅行包的推荐分数。在中国真实的旅游电子商务数据集上进行的大量实验表明，STR-VGAE比几种竞争方法具有显著的性能优势，同时为生成的推荐列表提供了解释。

{"title":"A Multi-Task Graph Neural Network with Variational Graph Auto-Encoders for Session-Based Travel Packages Recommendation","authors":"Guixiang Zhu, Jie Cao, Lei Chen, Youquan Wang, Zhan Bu, Shuxin Yang, Jianqing Wu, Zhiping Wang","doi":"10.1145/3577032","DOIUrl":"https://doi.org/10.1145/3577032","url":null,"abstract":"Session-based travel packages recommendation aims to predict users’ next click based on their current and historical sessions recorded by Online Travel Agencies (OTAs). Recently, an increasing number of studies attempted to apply Graph Neural Networks (GNNs) to the session-based recommendation and obtained promising results. However, most of them do not take full advantage of the explicit latent structure from attributes of items, making learned representations of items less effective and difficult to interpret. Moreover, they only combine historical sessions (long-term preferences) with a current session (short-term preference) to learn a unified representation of users, ignoring the effects of historical sessions for the current session. To this end, this article proposes a novel session-based model named STR-VGAE, which fills subtasks of the travel packages recommendation and variational graph auto-encoders simultaneously. STR-VGAE mainly consists of three components: travel packages encoder, users behaviors encoder, and interaction modeling. Specifically, the travel packages encoder module is used to learn a unified travel package representation from co-occurrence attribute graphs by using multi-view variational graph auto-encoders and a multi-view attention network. The users behaviors encoder module is used to encode user’ historical and current sessions with a personalized GNN, which considers the effects of historical sessions on the current session, and coalesce these two kinds of session representations to learn the high-quality users’ representations by exploiting a gated fusion approach. The interaction modeling module is used to calculate recommendation scores over all candidate travel packages. Extensive experiments on a real-life tourism e-commerce dataset from China show that STR-VGAE yields significant performance advantages over several competitive methods, meanwhile provides an interpretation for the generated recommendation list.","PeriodicalId":50940,"journal":{"name":"ACM Transactions on the Web","volume":" ","pages":"1 - 30"},"PeriodicalIF":3.5,"publicationDate":"2023-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43010479","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

Decoding the Kodi Ecosystem 解码Kodi生态系统

IF 3.5 4区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

ACM Transactions on the Web

Pub Date : 2023-02-01 DOI: https://dl.acm.org/doi/10.1145/3563700

Yunming Xiao, Matteo Varvello, Marc Warrior, Aleksandar Kuzmanovic

Free and open-source media centers are experiencing a boom in popularity for the convenience they offer users seeking to remotely consume digital content. Kodi is today’s most popular home media center, with millions of users worldwide. Kodi’s popularity derives from its ability to centralize the sheer amount of media content available on the Web, both free and copyrighted. Researchers have been hinting at potential security concerns around Kodi, due to add-ons injecting unwanted content as well as user settings linked with security holes. Motivated by these observations, this article conducts the first comprehensive analysis of the Kodi ecosystem: 15,000 Kodi users from 104 countries, 11,000 unique add-ons, and data collected over 9 months.

Our work makes three important contributions. Our first contribution is that we build “crawling” software (de-Kodi) which can automatically install a Kodi add-on, explore its menu, and locate (video) content. This is challenging for two main reasons. First, Kodi largely relies on visual information and user input which intrinsically complicates automation. Second, the potential sheer size of this ecosystem (i.e., the number of available add-ons) requires a highly scalable crawling solution. Our second contribution is that we develop a solution to discover Kodi add-ons. Our solution combines Web crawling of popular websites where Kodi add-ons are published (LazyKodi and GitHub) and SafeKodi, a Kodi add-on we have developed which leverages the help of Kodi users to learn which add-ons are used in the wild and, in return, offers information about how safe these add-ons are, e.g., do they track user activity or contact sketchy URLs/IP addresses. Our third contribution is a classifier to passively detect Kodi traffic and add-on usage in the wild.

Our analysis of the Kodi ecosystem reveals the following findings. We find that most installed add-ons are unofficial but safe to use. Still, 78% of the users have installed at least one unsafe add-on, and even worse, such add-ons are among the most popular. In response to the information offered by SafeKodi, one-third of the users reacted by disabling some of their add-ons. However, the majority of users ignored our warnings for several months attracted by the content such unsafe add-ons have to offer. Last but not least, we show that Kodi’s auto-update, a feature active for 97.6% of SafeKodi users, makes Kodi users easily identifiable by their ISPs. While passively identifying which Kodi add-on is in use is, as expected, much harder, we also find that many unofficial add-ons do not use HTTPS yet, making their passive detection straightforward.¹

免费和开源媒体中心正因其为寻求远程消费数字内容的用户提供便利而受到广泛欢迎。Kodi是当今最受欢迎的家庭媒体中心，在全球拥有数百万用户。Kodi的受欢迎程度源于它能够集中网络上可用的大量媒体内容，无论是免费的还是受版权保护的。研究人员一直在暗示，由于附加组件注入不需要的内容以及与安全漏洞相关的用户设置，Kodi存在潜在的安全问题。受这些观察的启发，本文对Kodi生态系统进行了首次全面分析:来自104个国家的15,000名Kodi用户，11,000个独特的附加组件，以及9个多月收集的数据。我们的工作有三个重要贡献。我们的第一个贡献是我们建立了“爬行”软件(去Kodi)，它可以自动安装一个Kodi附加组件，探索它的菜单，并定位(视频)内容。这是一个挑战，主要有两个原因。首先，Kodi很大程度上依赖于视觉信息和用户输入，这本质上使自动化变得复杂。其次，这个生态系统的潜在规模(即可用附加组件的数量)需要一个高度可扩展的爬行解决方案。我们的第二个贡献是，我们开发了一个解决方案来发现Kodi附加组件。我们的解决方案结合了流行网站的网络爬行Kodi插件发布(LazyKodi和GitHub)和SafeKodi，一个Kodi插件，我们已经开发了它利用Kodi用户的帮助来学习哪些插件在野外使用，并作为回报，提供有关这些插件的安全性的信息，例如，他们是否跟踪用户活动或联系粗略的url /IP地址。我们的第三个贡献是一个分类器，用于被动地检测Kodi流量和附加组件的使用情况。我们对Kodi生态系统的分析揭示了以下发现。我们发现大多数安装的插件都是非官方的，但可以安全使用。尽管如此，78%的用户至少安装了一个不安全的附加组件，更糟糕的是，这些附加组件是最受欢迎的。作为对SafeKodi提供的信息的回应，三分之一的用户关闭了他们的一些插件。然而，大多数用户忽视了我们的警告几个月来吸引的内容，这些不安全的附加组件必须提供。最后但并非最不重要的是，我们显示Kodi的自动更新，97.6%的SafeKodi用户活跃的功能，使Kodi用户很容易被他们的互联网服务提供商识别。虽然被动识别哪个Kodi插件正在使用，正如预期的那样，困难得多，我们还发现许多非官方的插件还不使用HTTPS，使他们的被动检测变得直接

{"title":"Decoding the Kodi Ecosystem","authors":"Yunming Xiao, Matteo Varvello, Marc Warrior, Aleksandar Kuzmanovic","doi":"https://dl.acm.org/doi/10.1145/3563700","DOIUrl":"https://doi.org/https://dl.acm.org/doi/10.1145/3563700","url":null,"abstract":"Free and open-source media centers are experiencing a boom in popularity for the convenience they offer users seeking to remotely consume digital content. Kodi is today’s most popular home media center, with millions of users worldwide. Kodi’s popularity derives from its ability to centralize the sheer amount of media content available on the Web, both free and copyrighted. Researchers have been hinting at potential security concerns around Kodi, due to add-ons injecting unwanted content as well as user settings linked with security holes. Motivated by these observations, this article conducts the first comprehensive analysis of the Kodi ecosystem: 15,000 Kodi users from 104 countries, 11,000 unique add-ons, and data collected over 9 months.Our work makes three important contributions. Our first contribution is that we build “crawling” software (<monospace>de-Kodi</monospace>) which can automatically install a Kodi add-on, explore its menu, and locate (video) content. This is challenging for two main reasons. First, Kodi largely relies on visual information and user input which intrinsically complicates automation. Second, the potential sheer size of this ecosystem (i.e., the number of available add-ons) requires a highly scalable crawling solution. Our second contribution is that we develop a solution to discover Kodi add-ons. Our solution combines Web crawling of popular websites where Kodi add-ons are published (LazyKodi and GitHub) and <monospace>SafeKodi</monospace>, a Kodi add-on we have developed which leverages the help of Kodi users to learn which add-ons are used in the wild and, in return, offers information about how safe these add-ons are, e.g., do they track user activity or contact sketchy URLs/IP addresses. Our third contribution is a classifier to passively detect Kodi traffic and add-on usage in the wild.Our analysis of the Kodi ecosystem reveals the following findings. We find that most installed add-ons are unofficial but safe to use. Still, 78% of the users have installed at least one unsafe add-on, and even worse, such add-ons are among the most popular. In response to the information offered by SafeKodi, one-third of the users reacted by disabling some of their add-ons. However, the majority of users ignored our warnings for several months attracted by the content such unsafe add-ons have to offer. Last but not least, we show that Kodi’s auto-update, a feature active for 97.6% of SafeKodi users, makes Kodi users easily identifiable by their ISPs. While passively identifying which Kodi add-on is in use is, as expected, much harder, we also find that many unofficial add-ons do not use HTTPS yet, making their passive detection straightforward.1","PeriodicalId":50940,"journal":{"name":"ACM Transactions on the Web","volume":"43 34","pages":""},"PeriodicalIF":3.5,"publicationDate":"2023-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138495124","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Deep Adaptive Graph Clustering via von Mises-Fisher Distributions 基于von Mises-Fisher分布的深度自适应图聚类

IF 3.5 4区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

ACM Transactions on the Web

Pub Date : 2023-01-31 DOI: https://dl.acm.org/doi/10.1145/3580521

Pengfei Wang, Daqing Wu, Chong Chen, Kunpeng Liu, Yanjie Fu, Jianqiang Huang, Yuanchun Zhou, Jianfeng Zhan, Xiansheng Hua

Graph clustering has been a hot research topic and is widely used in many fields, such as community detection in social networks. Lots of works combining auto-encoder and graph neural networks have been applied to clustering tasks by utilizing node attributes and graph structure. These works usually assumed the inherent parameters (i.e. size and variance) of different clusters in the latent embedding space are homogeneous, and hence the assigned probability is monotonous over the Euclidean distance between node embeddings and centroids. Unfortunately, this assumption usually does not hold since the size and concentration of different clusters can be quite different, which limits the clustering accuracy. In addition, the node embeddings in deep graph clustering methods are usually L2 normalized so that it lies on the surface of a unit hyper-sphere. To solve this problem, we proposed Deep Adaptive Graph Clustering via von Mises-Fisher distributions, namely DAGC. DAGC assumes the node embeddings H can be drawn from a von Mises-Fisher distribution and each cluster k is associated with cluster inherent parameters ρ_k which includes cluster center μ and cluster cohesion degree κ. Then we adopt an EM-like approach (i.e. (mathcal {P}(mathbf {H}|mathbf {rho }) ) and (mathcal {P}(mathbf {rho }|mathbf {H}) ) respectively) to learn the embedding and cluster inherent parameters alternately. Specifically, with the node embeddings, we proposed to update the cluster centers in an attraction-repulsion manner to make the cluster centers more separable. And given the cluster inherent parameters, a likelihood-based loss is proposed to make node embeddings more concentrated around cluster centers. Thus, DAGC can simultaneously improve the intra-cluster compactness and inter-cluster heterogeneity. Finally, extensive experiments conducted on four benchmark datasets have demonstrated that the proposed DAGC consistently outperforms the state-of-the-art methods, especially on imbalanced datasets.

图聚类一直是一个研究热点，广泛应用于许多领域，如社交网络中的社区检测。利用节点属性和图结构，将自编码器和图神经网络相结合的大量工作已经被应用到聚类任务中。这些工作通常假设潜在嵌入空间中不同簇的固有参数(即大小和方差)是均匀的，因此分配的概率在节点嵌入与质心之间的欧几里德距离上是单调的。不幸的是，这种假设通常不成立，因为不同簇的大小和浓度可能会有很大的不同，这限制了聚类的准确性。此外，深度图聚类方法中的节点嵌入通常是L2归一化的，因此它位于单位超球的表面上。为了解决这个问题，我们提出了基于von Mises-Fisher分布的深度自适应图聚类，即DAGC。DAGC假设节点嵌入H可以从von Mises-Fisher分布中绘制，每个聚类k与聚类固有参数ρk相关联，其中包括聚类中心μ和聚类内聚度κ。然后我们采用类似em的方法(分别为(mathcal {P}(mathbf {H}|mathbf {rho }) )和(mathcal {P}(mathbf {rho }|mathbf {H}) ))交替学习嵌入和聚类固有参数。具体来说，通过节点嵌入，我们提出以吸引-排斥的方式更新簇中心，使簇中心更加可分离。在给定聚类固有参数的情况下，提出了一种基于似然的损失算法，使节点嵌入更加集中在聚类中心附近。因此，DAGC可以同时提高集群内的紧凑性和集群间的异构性。最后，在四个基准数据集上进行的大量实验表明，所提出的DAGC始终优于最先进的方法，特别是在不平衡数据集上。

{"title":"Deep Adaptive Graph Clustering via von Mises-Fisher Distributions","authors":"Pengfei Wang, Daqing Wu, Chong Chen, Kunpeng Liu, Yanjie Fu, Jianqiang Huang, Yuanchun Zhou, Jianfeng Zhan, Xiansheng Hua","doi":"https://dl.acm.org/doi/10.1145/3580521","DOIUrl":"https://doi.org/https://dl.acm.org/doi/10.1145/3580521","url":null,"abstract":"Graph clustering has been a hot research topic and is widely used in many fields, such as community detection in social networks. Lots of works combining auto-encoder and graph neural networks have been applied to clustering tasks by utilizing node attributes and graph structure. These works usually assumed the inherent parameters (i.e. size and variance) of different clusters in the latent embedding space are homogeneous, and hence the assigned probability is monotonous over the Euclidean distance between node embeddings and centroids. Unfortunately, this assumption usually does not hold since the size and concentration of different clusters can be quite different, which limits the clustering accuracy. In addition, the node embeddings in deep graph clustering methods are usually L2 normalized so that it lies on the surface of a unit hyper-sphere. To solve this problem, we proposed <underline>D</underline>eep <underline>A</underline>daptive <underline>G</underline>raph <underline>C</underline>lustering via von Mises-Fisher distributions, namely DAGC. DAGC assumes the node embeddings H can be drawn from a von Mises-Fisher distribution and each cluster k is associated with cluster inherent parameters ρk which includes cluster center μ and cluster cohesion degree κ. Then we adopt an EM-like approach (i.e. (mathcal {P}(mathbf {H}|mathbf {rho }) ) and (mathcal {P}(mathbf {rho }|mathbf {H}) ) respectively) to learn the embedding and cluster inherent parameters alternately. Specifically, with the node embeddings, we proposed to update the cluster centers in an attraction-repulsion manner to make the cluster centers more separable. And given the cluster inherent parameters, a likelihood-based loss is proposed to make node embeddings more concentrated around cluster centers. Thus, DAGC can simultaneously improve the intra-cluster compactness and inter-cluster heterogeneity. Finally, extensive experiments conducted on four benchmark datasets have demonstrated that the proposed DAGC consistently outperforms the state-of-the-art methods, especially on imbalanced datasets.","PeriodicalId":50940,"journal":{"name":"ACM Transactions on the Web","volume":"43 35","pages":""},"PeriodicalIF":3.5,"publicationDate":"2023-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138495123","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Deep Adaptive Graph Clustering via von Mises-Fisher Distributions 基于von Mises-Fisher分布的深度自适应图聚类

IF 3.5 4区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

ACM Transactions on the Web

Pub Date : 2023-01-31 DOI: 10.1145/3580521

P. Wang, Daqing Wu, Chong Chen, Kunpeng Liu, Yanjie Fu, Jianqiang Huang, Yuanchun Zhou, Jianfeng Zhan, Xiansheng Hua

Graph clustering has been a hot research topic and is widely used in many fields, such as community detection in social networks. Lots of works combining auto-encoder and graph neural networks have been applied to clustering tasks by utilizing node attributes and graph structure. These works usually assumed the inherent parameters (i.e. size and variance) of different clusters in the latent embedding space are homogeneous, and hence the assigned probability is monotonous over the Euclidean distance between node embeddings and centroids. Unfortunately, this assumption usually does not hold since the size and concentration of different clusters can be quite different, which limits the clustering accuracy. In addition, the node embeddings in deep graph clustering methods are usually L2 normalized so that it lies on the surface of a unit hyper-sphere. To solve this problem, we proposed Deep Adaptive Graph Clustering via von Mises-Fisher distributions, namely DAGC. DAGC assumes the node embeddings H can be drawn from a von Mises-Fisher distribution and each cluster k is associated with cluster inherent parameters ρk which includes cluster center μ and cluster cohesion degree κ. Then we adopt an EM-like approach (i.e. (mathcal {P}(mathbf {H}|mathbf {rho }) ) and (mathcal {P}(mathbf {rho }|mathbf {H}) ) respectively) to learn the embedding and cluster inherent parameters alternately. Specifically, with the node embeddings, we proposed to update the cluster centers in an attraction-repulsion manner to make the cluster centers more separable. And given the cluster inherent parameters, a likelihood-based loss is proposed to make node embeddings more concentrated around cluster centers. Thus, DAGC can simultaneously improve the intra-cluster compactness and inter-cluster heterogeneity. Finally, extensive experiments conducted on four benchmark datasets have demonstrated that the proposed DAGC consistently outperforms the state-of-the-art methods, especially on imbalanced datasets.

图聚类一直是一个热门的研究课题，并被广泛应用于许多领域，如社交网络中的社区检测。许多将自动编码器和图神经网络相结合的工作已经被应用于利用节点属性和图结构的聚类任务。这些工作通常假设潜在嵌入空间中不同簇的固有参数（即大小和方差）是齐次的，因此在节点嵌入和质心之间的欧几里得距离上，分配的概率是单调的。不幸的是，这种假设通常不成立，因为不同聚类的大小和浓度可能非常不同，这限制了聚类的准确性。此外，深度图聚类方法中的节点嵌入通常是L2归一化的，使得它位于单位超球面的表面上。为了解决这个问题，我们提出了通过von Mises Fisher分布的深度自适应图聚类，即DAGC。DAGC假设节点嵌入H可以从von Mises Fisher分布中得出，并且每个聚类k与聚类固有参数ρk相关，其中包括聚类中心μ和聚类内聚度κ。然后，我们采用类似EM的方法（即分别为（mathcal｛P｝（mathbf｛H｝|mathbf｛rho｝））和（math cal｛P｝（ mathb｛rho｝| mathbf｛H}））来交替学习嵌入和聚类固有参数。具体来说，通过节点嵌入，我们提出以吸引-排斥的方式更新聚类中心，使聚类中心更加可分离。在给定聚类固有参数的情况下，提出了一种基于似然的损失方法，使节点嵌入更加集中在聚类中心。因此，DAGC可以同时提高簇内的紧凑性和簇间的异质性。最后，在四个基准数据集上进行的大量实验表明，所提出的DAGC始终优于最先进的方法，尤其是在不平衡数据集上。

{"title":"Deep Adaptive Graph Clustering via von Mises-Fisher Distributions","authors":"P. Wang, Daqing Wu, Chong Chen, Kunpeng Liu, Yanjie Fu, Jianqiang Huang, Yuanchun Zhou, Jianfeng Zhan, Xiansheng Hua","doi":"10.1145/3580521","DOIUrl":"https://doi.org/10.1145/3580521","url":null,"abstract":"Graph clustering has been a hot research topic and is widely used in many fields, such as community detection in social networks. Lots of works combining auto-encoder and graph neural networks have been applied to clustering tasks by utilizing node attributes and graph structure. These works usually assumed the inherent parameters (i.e. size and variance) of different clusters in the latent embedding space are homogeneous, and hence the assigned probability is monotonous over the Euclidean distance between node embeddings and centroids. Unfortunately, this assumption usually does not hold since the size and concentration of different clusters can be quite different, which limits the clustering accuracy. In addition, the node embeddings in deep graph clustering methods are usually L2 normalized so that it lies on the surface of a unit hyper-sphere. To solve this problem, we proposed Deep Adaptive Graph Clustering via von Mises-Fisher distributions, namely DAGC. DAGC assumes the node embeddings H can be drawn from a von Mises-Fisher distribution and each cluster k is associated with cluster inherent parameters ρk which includes cluster center μ and cluster cohesion degree κ. Then we adopt an EM-like approach (i.e. (mathcal {P}(mathbf {H}|mathbf {rho }) ) and (mathcal {P}(mathbf {rho }|mathbf {H}) ) respectively) to learn the embedding and cluster inherent parameters alternately. Specifically, with the node embeddings, we proposed to update the cluster centers in an attraction-repulsion manner to make the cluster centers more separable. And given the cluster inherent parameters, a likelihood-based loss is proposed to make node embeddings more concentrated around cluster centers. Thus, DAGC can simultaneously improve the intra-cluster compactness and inter-cluster heterogeneity. Finally, extensive experiments conducted on four benchmark datasets have demonstrated that the proposed DAGC consistently outperforms the state-of-the-art methods, especially on imbalanced datasets.","PeriodicalId":50940,"journal":{"name":"ACM Transactions on the Web","volume":" ","pages":""},"PeriodicalIF":3.5,"publicationDate":"2023-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45250922","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Constructing Spatio-Temporal Graphs for Face Forgery Detection 构建人脸伪造检测的时空图

IF 3.5 4区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

ACM Transactions on the Web

Pub Date : 2023-01-30 DOI: 10.1145/3580512

Zhihua Shang, Hongtao Xie, Lingyun Yu, Zhengjun Zha, Yongdong Zhang

Recently, advanced development of facial manipulation techniques threatens web information security, thus, face forgery detection attracts a lot of attention. It is clear that both spatial and temporal information of facial videos contains the crucial manipulation traces, which are inevitably created during the generation process. However, most existing face forgery detectors only focus on the spatial artifacts or the temporal incoherence, and they are struggling to learn a significant and general kind of representations for manipulated facial videos. In this work, we propose to construct spatial-temporal graphs for fake videos to capture the spatial inconsistency and the temporal incoherence at the same time. To model the spatial-temporal relationship among the graph nodes, a novel forgery detector named Spatio-Temporal Graph Network (STGN) is proposed, which contains two kinds of graph-convolution-based units, the Spatial Relation Graph Unit (SRGU) and the Temporal Attention Graph Unit (TAGU). To exploit spatial information, the SRGU models the inconsistency between each pair of patches in the same frame, instead of focusing on the low-level local spatial artifacts which are vulnerable to samples created by unseen manipulation methods. And, the TAGU is proposed to model the long-distance temporal relation among the patches at the same spatial position in different frames with a graph attention mechanism based on the inter-node similarity. With the SRGU and the TAGU, our STGN can combine the discriminative power of spatial inconsistency and the generalization capacity of temporal incoherence for face forgery detection. Our STGN achieves state-of-the-art performances on several popular forgery detection datasets. Extensive experiments demonstrate both the superiority of our STGN on intra manipulation evaluation and the effectiveness for new sorts of face forgery videos on cross manipulation evaluation.

近年来，人脸操作技术的发展威胁着网络信息的安全，因此人脸伪造检测引起了人们的广泛关注。很明显，面部视频的空间和时间信息都包含着关键的操作痕迹，这些痕迹是在生成过程中不可避免地产生的。然而，大多数现有的人脸伪造检测器只关注空间伪影或时间不相干，并且他们正在努力学习被操纵的人脸视频的一种重要而通用的表示。在这项工作中，我们建议为假视频构建时空图，以同时捕捉空间不一致和时间不一致。为了对图节点之间的时空关系进行建模，提出了一种新的伪造检测器，称为时空图网络（STGN），该检测器包含两种基于图卷积的单元，即空间关系图单元（SRGU）和时间注意图单元（TAGU）。为了利用空间信息，SRGU对同一帧中每对补丁之间的不一致性进行建模，而不是关注低级别的局部空间伪影，这些伪影容易受到看不见的操作方法创建的样本的影响。并且，提出了TAGU，利用基于节点间相似性的图注意力机制，对不同帧中相同空间位置的补丁之间的长距离时间关系进行建模。通过SRGU和TAGU，我们的STGN可以将空间不一致的判别能力和时间不相干的泛化能力相结合，用于人脸伪造检测。我们的STGN在几个流行的伪造检测数据集上实现了最先进的性能。大量实验证明了我们的STGN在操作内评估方面的优越性，以及对新型人脸伪造视频在交叉操作评估方面的有效性。

{"title":"Constructing Spatio-Temporal Graphs for Face Forgery Detection","authors":"Zhihua Shang, Hongtao Xie, Lingyun Yu, Zhengjun Zha, Yongdong Zhang","doi":"10.1145/3580512","DOIUrl":"https://doi.org/10.1145/3580512","url":null,"abstract":"Recently, advanced development of facial manipulation techniques threatens web information security, thus, face forgery detection attracts a lot of attention. It is clear that both spatial and temporal information of facial videos contains the crucial manipulation traces, which are inevitably created during the generation process. However, most existing face forgery detectors only focus on the spatial artifacts or the temporal incoherence, and they are struggling to learn a significant and general kind of representations for manipulated facial videos. In this work, we propose to construct spatial-temporal graphs for fake videos to capture the spatial inconsistency and the temporal incoherence at the same time. To model the spatial-temporal relationship among the graph nodes, a novel forgery detector named Spatio-Temporal Graph Network (STGN) is proposed, which contains two kinds of graph-convolution-based units, the Spatial Relation Graph Unit (SRGU) and the Temporal Attention Graph Unit (TAGU). To exploit spatial information, the SRGU models the inconsistency between each pair of patches in the same frame, instead of focusing on the low-level local spatial artifacts which are vulnerable to samples created by unseen manipulation methods. And, the TAGU is proposed to model the long-distance temporal relation among the patches at the same spatial position in different frames with a graph attention mechanism based on the inter-node similarity. With the SRGU and the TAGU, our STGN can combine the discriminative power of spatial inconsistency and the generalization capacity of temporal incoherence for face forgery detection. Our STGN achieves state-of-the-art performances on several popular forgery detection datasets. Extensive experiments demonstrate both the superiority of our STGN on intra manipulation evaluation and the effectiveness for new sorts of face forgery videos on cross manipulation evaluation.","PeriodicalId":50940,"journal":{"name":"ACM Transactions on the Web","volume":"17 1","pages":"1 - 25"},"PeriodicalIF":3.5,"publicationDate":"2023-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44803752","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

Contrastive Graph Similarity Networks 对比图相似网络

IF 3.5 4区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

ACM Transactions on the Web

Pub Date : 2023-01-30 DOI: 10.1145/3580511

Luzhi Wang, Yizhen Zheng, Di Jin, Fuyi Li, Yongliang Qiao, Shirui Pan

Graph similarity learning is a significant and fundamental issue in the theory and analysis of graphs, which has been applied in a variety of fields, including object tracking, recommender systems, similarity search, etc. Recent methods for graph similarity learning that utilize deep learning typically share two deficiencies: (1) they leverage graph neural networks as backbones for learning graph representations but have not well captured the complex information inside data, and (2) they employ a cross-graph attention mechanism for graph similarity learning, which is computationally expensive. Taking these limitations into consideration, a method for graph similarity learning is devised in this study, namely, Contrastive Graph Similarity Network (CGSim). To enhance graph similarity learning, CGSim makes use of the complementary information of two input graphs and captures pairwise relations in a contrastive learning framework. By developing a dual contrastive learning module with a node-graph matching and a graph-graph matching mechanism, our method significantly reduces the quadratic time complexity for cross-graph interaction modeling to linear time complexity. Jointly learning in an end-to-end framework, the graph representation embedding module and the well-designed contrastive learning module can be beneficial to one another. A comprehensive series of experiments indicate that CGSim outperforms state-of-the-art baselines on six datasets and significantly reduces the computational cost, which demonstrates our CGSim model’s superiority over other baselines.

图相似性学习是图理论和分析中的一个重要而基础的问题，已被应用于多个领域，包括对象跟踪、推荐系统、相似性搜索等。最近使用深度学习的图相似性学习方法通常有两个不足：（1）它们利用图神经网络作为学习图表示的骨干，但没有很好地捕捉数据中的复杂信息；（2）它们使用交叉图注意力机制进行图相似性学习，这在计算上是昂贵的。考虑到这些局限性，本研究设计了一种图相似性学习方法，即对比图相似性网络（CGSim）。为了增强图的相似性学习，CGSim利用两个输入图的互补信息，并在对比学习框架中捕获成对关系。通过开发具有节点图匹配和图图匹配机制的双重对比学习模块，我们的方法将跨图交互建模的二次时间复杂度显著降低为线性时间复杂度。在端到端框架中的联合学习，图表示嵌入模块和设计良好的对比学习模块可以相互有益。一系列综合实验表明，CGSim在六个数据集上优于最先进的基线，并显著降低了计算成本，这表明我们的CGSim模型优于其他基线。

{"title":"Contrastive Graph Similarity Networks","authors":"Luzhi Wang, Yizhen Zheng, Di Jin, Fuyi Li, Yongliang Qiao, Shirui Pan","doi":"10.1145/3580511","DOIUrl":"https://doi.org/10.1145/3580511","url":null,"abstract":"Graph similarity learning is a significant and fundamental issue in the theory and analysis of graphs, which has been applied in a variety of fields, including object tracking, recommender systems, similarity search, etc. Recent methods for graph similarity learning that utilize deep learning typically share two deficiencies: (1) they leverage graph neural networks as backbones for learning graph representations but have not well captured the complex information inside data, and (2) they employ a cross-graph attention mechanism for graph similarity learning, which is computationally expensive. Taking these limitations into consideration, a method for graph similarity learning is devised in this study, namely, Contrastive Graph Similarity Network (CGSim). To enhance graph similarity learning, CGSim makes use of the complementary information of two input graphs and captures pairwise relations in a contrastive learning framework. By developing a dual contrastive learning module with a node-graph matching and a graph-graph matching mechanism, our method significantly reduces the quadratic time complexity for cross-graph interaction modeling to linear time complexity. Jointly learning in an end-to-end framework, the graph representation embedding module and the well-designed contrastive learning module can be beneficial to one another. A comprehensive series of experiments indicate that CGSim outperforms state-of-the-art baselines on six datasets and significantly reduces the computational cost, which demonstrates our CGSim model’s superiority over other baselines.","PeriodicalId":50940,"journal":{"name":"ACM Transactions on the Web","volume":" ","pages":""},"PeriodicalIF":3.5,"publicationDate":"2023-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43054099","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Heterogeneous Graph Transformer for Meta-structure Learning with Application in Text Classification 用于元结构学习的异构图转换器及其在文本分类中的应用

IF 3.5 4区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

ACM Transactions on the Web

Pub Date : 2023-01-30 DOI: 10.1145/3580508

Shuhai Wang, Xin Liu, Xiao-Bin Pan, Hanjie Xu, Mingrui Liu

The prevalent heterogeneous Graph Neural Network (GNN) models learn node and graph representations using pre-defined meta-paths or only automatically discovering meta-paths. However, the existing methods suffer from information loss due to neglecting undiscovered meta-structures with richer semantics than meta-paths in heterogeneous graphs. To take advantage of the current rich meta-structures in heterogeneous graphs, we propose a novel approach called HeGTM to automatically extract essential meta-structures (i.e., meta-paths and meta-graphs) from heterogeneous graphs. The discovered meta-structures can capture more prosperous relations between different types of nodes that can help the model to learn representations. Furthermore, we apply the proposed approach for text classification. Specifically, we first design a heterogeneous graph for the text corpus, and then apply HeGTM on the constructed text graph to learn better text representations that contain various semantic relations. In addition, our approach can also be used as a strong meta-structure extractor for other GNN models. In other words, the auto-discovered meta-structures can replace the pre-defined meta-paths. The experimental results on text classification demonstrate the effectiveness of our approach to automatically extracting informative meta-structures from heterogeneous graphs and its usefulness in acting as a meta-structure extractor for boosting other GNN models.

流行的异构图神经网络（GNN）模型使用预定义的元路径或仅自动发现元路径来学习节点和图表示。然而，现有的方法由于忽视了异构图中比元路径具有更丰富语义的未发现的元结构而遭受信息损失。为了利用当前异构图中丰富的元结构，我们提出了一种称为HeGTM的新方法来自动从异构图中提取基本元结构（即元路径和元图）。所发现的元结构可以捕捉不同类型节点之间更繁荣的关系，这可以帮助模型学习表示。此外，我们将所提出的方法应用于文本分类。具体来说，我们首先为文本语料库设计一个异构图，然后在构建的文本图上应用HeGTM来学习更好的包含各种语义关系的文本表示。此外，我们的方法还可以用作其他GNN模型的强元结构提取器。换句话说，自动发现的元结构可以替换预定义的元路径。关于文本分类的实验结果证明了我们的方法从异构图中自动提取信息元结构的有效性，以及它作为元结构提取器来增强其他GNN模型的有用性。

{"title":"Heterogeneous Graph Transformer for Meta-structure Learning with Application in Text Classification","authors":"Shuhai Wang, Xin Liu, Xiao-Bin Pan, Hanjie Xu, Mingrui Liu","doi":"10.1145/3580508","DOIUrl":"https://doi.org/10.1145/3580508","url":null,"abstract":"The prevalent heterogeneous Graph Neural Network (GNN) models learn node and graph representations using pre-defined meta-paths or only automatically discovering meta-paths. However, the existing methods suffer from information loss due to neglecting undiscovered meta-structures with richer semantics than meta-paths in heterogeneous graphs. To take advantage of the current rich meta-structures in heterogeneous graphs, we propose a novel approach called HeGTM to automatically extract essential meta-structures (i.e., meta-paths and meta-graphs) from heterogeneous graphs. The discovered meta-structures can capture more prosperous relations between different types of nodes that can help the model to learn representations. Furthermore, we apply the proposed approach for text classification. Specifically, we first design a heterogeneous graph for the text corpus, and then apply HeGTM on the constructed text graph to learn better text representations that contain various semantic relations. In addition, our approach can also be used as a strong meta-structure extractor for other GNN models. In other words, the auto-discovered meta-structures can replace the pre-defined meta-paths. The experimental results on text classification demonstrate the effectiveness of our approach to automatically extracting informative meta-structures from heterogeneous graphs and its usefulness in acting as a meta-structure extractor for boosting other GNN models.","PeriodicalId":50940,"journal":{"name":"ACM Transactions on the Web","volume":" ","pages":"1 - 27"},"PeriodicalIF":3.5,"publicationDate":"2023-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44890406","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

GroupAligner: A Deep Reinforcement Learning with Domain Adaptation for Social Group Alignment GroupAligner：一种基于领域自适应的深度强化学习方法

IF 3.5 4区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

ACM Transactions on the Web

Pub Date : 2023-01-24 DOI: 10.1145/3580509

Li Sun, Yang Du, Shuai Gao, Junda Ye, Feiyang Wang, Fuxin Ren, Mingchen Liang, Yue Wang, Shuhai Wang

Social network alignment, which aims to uncover the correspondence across different social networks, shows fundamental importance in a wide spectrum of applications such as cross-domain recommendation and information propagation. In the literature, the vast majority of the existing studies focus on the social network alignment at user level. In practice, the user-level alignment usually relies on abundant personal information and high-quality supervision, which is expensive and even impossible in the real-world scenario. Alternatively, we propose to study the problem of social group alignment across different social networks, focusing on the interests of social groups rather than personal information. However, social group alignment is non-trivial and faces significant challenges in both (i) feature inconsistency across different social networks and (ii) group discovery within a social network. To bridge this gap, we present a novel GroupAligner, a deep reinforcement learning with domain adaptation for social group alignment. In GroupAligner, to address the first issue, we propose the cycle domain adaptation approach with the Wasserstein distance to transfer the knowledge from the source social network, aligning the feature space of social networks in the distribution level. To address the second issue, we model the group discovery as a sequential decision process with reinforcement learning in which the policy is parameterized by a proposed proximity-enhanced Graph Neural Network (pGNN) and a GNN-based discriminator to score the reward. Finally, we utilize pre-training and teacher forcing to stabilize the learning process of GroupAligner. Extensive experiments on several real-world datasets are conducted to evaluate GroupAligner, and experimental results show that GroupAligner outperforms the alternative methods for social group alignment.

社交网络比对旨在揭示不同社交网络之间的对应关系，在跨领域推荐和信息传播等广泛应用中显示出根本重要性。在文献中，现有的绝大多数研究都集中在用户层面的社交网络对齐。在实践中，用户级别的对齐通常依赖于丰富的个人信息和高质量的监督，这在现实世界中是昂贵的，甚至是不可能的。或者，我们建议研究不同社交网络中的社会群体结盟问题，重点关注社会群体的利益，而不是个人信息。然而，社交群体对齐是不平凡的，并且在（i）不同社交网络之间的特征不一致性和（ii）社交网络内的群体发现方面都面临着重大挑战。为了弥补这一差距，我们提出了一种新的群体对齐器，这是一种具有领域自适应的深度强化学习，用于社会群体对齐。在GroupAligner中，为了解决第一个问题，我们提出了具有Wasserstein距离的循环域自适应方法，以转移来自源社交网络的知识，在分布级别上调整社交网络的特征空间。为了解决第二个问题，我们将群体发现建模为具有强化学习的顺序决策过程，其中通过所提出的邻近增强图神经网络（pGNN）和基于GNN的鉴别器来参数化策略，以对奖励进行评分。最后，我们利用预先培训和教师强制来稳定GroupAlign的学习过程。在几个真实世界的数据集上进行了广泛的实验来评估GroupAlign，实验结果表明，GroupAlign在社会群体对齐方面优于其他方法。

{"title":"GroupAligner: A Deep Reinforcement Learning with Domain Adaptation for Social Group Alignment","authors":"Li Sun, Yang Du, Shuai Gao, Junda Ye, Feiyang Wang, Fuxin Ren, Mingchen Liang, Yue Wang, Shuhai Wang","doi":"10.1145/3580509","DOIUrl":"https://doi.org/10.1145/3580509","url":null,"abstract":"Social network alignment, which aims to uncover the correspondence across different social networks, shows fundamental importance in a wide spectrum of applications such as cross-domain recommendation and information propagation. In the literature, the vast majority of the existing studies focus on the social network alignment at user level. In practice, the user-level alignment usually relies on abundant personal information and high-quality supervision, which is expensive and even impossible in the real-world scenario. Alternatively, we propose to study the problem of social group alignment across different social networks, focusing on the interests of social groups rather than personal information. However, social group alignment is non-trivial and faces significant challenges in both (i) feature inconsistency across different social networks and (ii) group discovery within a social network. To bridge this gap, we present a novel GroupAligner, a deep reinforcement learning with domain adaptation for social group alignment. In GroupAligner, to address the first issue, we propose the cycle domain adaptation approach with the Wasserstein distance to transfer the knowledge from the source social network, aligning the feature space of social networks in the distribution level. To address the second issue, we model the group discovery as a sequential decision process with reinforcement learning in which the policy is parameterized by a proposed proximity-enhanced Graph Neural Network (pGNN) and a GNN-based discriminator to score the reward. Finally, we utilize pre-training and teacher forcing to stabilize the learning process of GroupAligner. Extensive experiments on several real-world datasets are conducted to evaluate GroupAligner, and experimental results show that GroupAligner outperforms the alternative methods for social group alignment.","PeriodicalId":50940,"journal":{"name":"ACM Transactions on the Web","volume":"17 1","pages":"1 - 30"},"PeriodicalIF":3.5,"publicationDate":"2023-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42206032","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Community Enhanced Link Prediction in Dynamic Networks 动态网络中社区增强的链路预测

IF 3.5 4区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

ACM Transactions on the Web

Pub Date : 2023-01-24 DOI: 10.1145/3580513

Mukesh Kumar, S. Mishra, S. Singh, Bhaskar Biswas

The growing popularity of online social networks is quite evident nowadays and provides an opportunity to allow researchers in finding solutions for various practical applications. Link prediction is the technique of understanding network structure and identifying missing and future links in social networks. One of the well-known classes of methods in link prediction is a similarity-based method, which uses local and global topological information of the network to predict missing links. Some methods also exist based on quasi-local features to achieve a trade-off between local and global information on static networks. These quasi-local similarity-based methods are not best suited for considering community information in dynamic networks, failing to balance accuracy and efficiency. Therefore, a community enhanced framework is presented in this paper to predict missing links on dynamic social networks. First, a link prediction framework is presented to predict missing links using parameterized influence regions of nodes and their contribution in community partitions. Then, a unique feature set is generated using local, global, and quasi-local similarity-based as well as community information-based features. This feature set is further optimized using scoring-based feature selection methods to select only the most relevant features. Finally, four machine learning-based classification models are used for link prediction. The experiments are performed on six well-known dynamic networks and three performance metrics, and the results demonstrate that the proposed method outperforms the state-of-the-art methods.

如今，在线社交网络的日益普及是相当明显的，它为研究人员提供了一个机会，让他们找到各种实际应用的解决方案。链接预测是一种理解网络结构，识别社会网络中缺失和未来链接的技术。基于相似性的链路预测方法是一种众所周知的链路预测方法，它利用网络的局部和全局拓扑信息来预测缺失链路。在静态网络中，也存在一些基于准局部特征的方法来实现局部信息和全局信息之间的权衡。这些基于准局部相似度的方法不适合考虑动态网络中的社区信息，无法平衡准确性和效率。因此，本文提出了一个社区增强框架来预测动态社会网络中的缺失环节。首先，提出了一种链路预测框架，利用节点的参数化影响区域及其在社区划分中的贡献来预测缺失链路;然后，使用基于局部、全局和准局部相似度以及基于社区信息的特征生成唯一的特征集。使用基于评分的特征选择方法进一步优化该特征集，以只选择最相关的特征。最后，使用四种基于机器学习的分类模型进行链接预测。在六个知名的动态网络和三个性能指标上进行了实验，结果表明该方法优于现有的方法。

{"title":"Community Enhanced Link Prediction in Dynamic Networks","authors":"Mukesh Kumar, S. Mishra, S. Singh, Bhaskar Biswas","doi":"10.1145/3580513","DOIUrl":"https://doi.org/10.1145/3580513","url":null,"abstract":"The growing popularity of online social networks is quite evident nowadays and provides an opportunity to allow researchers in finding solutions for various practical applications. Link prediction is the technique of understanding network structure and identifying missing and future links in social networks. One of the well-known classes of methods in link prediction is a similarity-based method, which uses local and global topological information of the network to predict missing links. Some methods also exist based on quasi-local features to achieve a trade-off between local and global information on static networks. These quasi-local similarity-based methods are not best suited for considering community information in dynamic networks, failing to balance accuracy and efficiency. Therefore, a community enhanced framework is presented in this paper to predict missing links on dynamic social networks. First, a link prediction framework is presented to predict missing links using parameterized influence regions of nodes and their contribution in community partitions. Then, a unique feature set is generated using local, global, and quasi-local similarity-based as well as community information-based features. This feature set is further optimized using scoring-based feature selection methods to select only the most relevant features. Finally, four machine learning-based classification models are used for link prediction. The experiments are performed on six well-known dynamic networks and three performance metrics, and the results demonstrate that the proposed method outperforms the state-of-the-art methods.","PeriodicalId":50940,"journal":{"name":"ACM Transactions on the Web","volume":" ","pages":""},"PeriodicalIF":3.5,"publicationDate":"2023-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41646083","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1