首页 > 最新文献

ACM Transactions on the Web最新文献

英文 中文
Reinforced MOOCs Concept Recommendation in Heterogeneous Information Networks 异构信息网络下强化mooc概念推荐
4区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-05-22 DOI: 10.1145/3580510
Jibing Gong, Yao Wan, Ye Liu, Xuewen Li, Yi Zhao, Cheng Wang, Yuting Lin, Xiaohan Fang, Wenzheng Feng, Jingyi Zhang, Jie Tang
Massive open online courses (MOOCs) , which offer open access and widespread interactive participation through the internet, are quickly becoming the preferred method for online and remote learning. Several MOOC platforms offer the service of course recommendation to users, to improve the learning experience of users. Despite the usefulness of this service, we consider that recommending courses to users directly may neglect their varying degrees of expertise. To mitigate this gap, we examine an interesting problem of concept recommendation in this paper, which can be viewed as recommending knowledge to users in a fine-grained way. We put forward a novel approach, termed HinCRec-RL, for C oncept Rec ommendation in MOOCs, which is based on H eterogeneous I nformation N etworks and R einforcement L earning . In particular, we propose to shape the problem of concept recommendation within a reinforcement learning framework to characterize the dynamic interaction between users and knowledge concepts in MOOCs. Furthermore, we propose to form the interactions among users, courses, videos, and concepts into a heterogeneous information network (HIN) to learn the semantic user representations better. We then employ an attentional graph neural network to represent the users in the HIN, based on meta-paths. Extensive experiments are conducted on a real-world dataset collected from a Chinese MOOC platform, XuetangX , to validate the efficacy of our proposed HinCRec-RL. Experimental results and analysis demonstrate that our proposed HinCRec-RL performs well when compared with several state-of-the-art models.
大规模在线开放课程(MOOCs)通过互联网提供开放获取和广泛的互动参与,正迅速成为在线和远程学习的首选方法。一些MOOC平台为用户提供课程推荐服务,以改善用户的学习体验。尽管这项服务很有用,但我们认为直接向用户推荐课程可能会忽略他们不同程度的专业知识。为了缩小这一差距,我们在本文中研究了一个有趣的概念推荐问题,它可以被视为以细粒度的方式向用户推荐知识。我们提出了一种基于异构信息网络和R强化学习的mooc C概念推荐新方法,称为HinCRec-RL。特别是,我们建议在强化学习框架内塑造概念推荐问题,以表征mooc中用户与知识概念之间的动态交互。此外,我们提出将用户、课程、视频和概念之间的交互形成一个异构信息网络(HIN),以更好地学习语义用户表示。然后,我们使用一个基于元路径的注意力图神经网络来表示HIN中的用户。在从中国MOOC平台XuetangX收集的真实数据集上进行了大量实验,以验证我们提出的HinCRec-RL的有效性。实验结果和分析表明,与几种最先进的模型相比,我们提出的HinCRec-RL具有良好的性能。
{"title":"Reinforced MOOCs Concept Recommendation in Heterogeneous Information Networks","authors":"Jibing Gong, Yao Wan, Ye Liu, Xuewen Li, Yi Zhao, Cheng Wang, Yuting Lin, Xiaohan Fang, Wenzheng Feng, Jingyi Zhang, Jie Tang","doi":"10.1145/3580510","DOIUrl":"https://doi.org/10.1145/3580510","url":null,"abstract":"Massive open online courses (MOOCs) , which offer open access and widespread interactive participation through the internet, are quickly becoming the preferred method for online and remote learning. Several MOOC platforms offer the service of course recommendation to users, to improve the learning experience of users. Despite the usefulness of this service, we consider that recommending courses to users directly may neglect their varying degrees of expertise. To mitigate this gap, we examine an interesting problem of concept recommendation in this paper, which can be viewed as recommending knowledge to users in a fine-grained way. We put forward a novel approach, termed HinCRec-RL, for C oncept Rec ommendation in MOOCs, which is based on H eterogeneous I nformation N etworks and R einforcement L earning . In particular, we propose to shape the problem of concept recommendation within a reinforcement learning framework to characterize the dynamic interaction between users and knowledge concepts in MOOCs. Furthermore, we propose to form the interactions among users, courses, videos, and concepts into a heterogeneous information network (HIN) to learn the semantic user representations better. We then employ an attentional graph neural network to represent the users in the HIN, based on meta-paths. Extensive experiments are conducted on a real-world dataset collected from a Chinese MOOC platform, XuetangX , to validate the efficacy of our proposed HinCRec-RL. Experimental results and analysis demonstrate that our proposed HinCRec-RL performs well when compared with several state-of-the-art models.","PeriodicalId":50940,"journal":{"name":"ACM Transactions on the Web","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135288114","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Heterogeneous Graph Transformer for Meta-structure Learning with Application in Text Classification 面向元结构学习的异构图转换器及其在文本分类中的应用
IF 3.5 4区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-05-22 DOI: https://dl.acm.org/doi/10.1145/3580508
Shuhai Wang, Xin Liu, Xiao Pan, Hanjie Xu, Mingrui Liu

The prevalent heterogeneous Graph Neural Network (GNN) models learn node and graph representations using pre-defined meta-paths or only automatically discovering meta-paths. However, the existing methods suffer from information loss due to neglecting undiscovered meta-structures with richer semantics than meta-paths in heterogeneous graphs. To take advantage of the current rich meta-structures in heterogeneous graphs, we propose a novel approach called HeGTM to automatically extract essential meta-structures (i.e., meta-paths and meta-graphs) from heterogeneous graphs. The discovered meta-structures can capture more prosperous relations between different types of nodes that can help the model to learn representations. Furthermore, we apply the proposed approach for text classification. Specifically, we first design a heterogeneous graph for the text corpus, and then apply HeGTM on the constructed text graph to learn better text representations that contain various semantic relations. In addition, our approach can also be used as a strong meta-structure extractor for other GNN models. In other words, the auto-discovered meta-structures can replace the pre-defined meta-paths. The experimental results on text classification demonstrate the effectiveness of our approach to automatically extracting informative meta-structures from heterogeneous graphs and its usefulness in acting as a meta-structure extractor for boosting other GNN models.

普遍的异构图神经网络(GNN)模型使用预定义的元路径或仅自动发现元路径来学习节点和图表示。然而,现有的方法由于忽略了异构图中具有比元路径更丰富语义的未发现元结构而存在信息丢失的问题。为了利用当前异构图中丰富的元结构,我们提出了一种名为HeGTM的新方法来自动从异构图中提取基本元结构(即元路径和元图)。发现的元结构可以捕获不同类型节点之间更繁荣的关系,从而帮助模型学习表征。此外,我们将该方法应用于文本分类。具体来说,我们首先为文本语料库设计一个异构图,然后在构建的文本图上应用HeGTM来学习包含各种语义关系的更好的文本表示。此外,我们的方法也可以用作其他GNN模型的强元结构提取器。换句话说,自动发现的元结构可以取代预定义的元路径。文本分类的实验结果证明了我们的方法从异构图中自动提取信息元结构的有效性,以及它作为元结构提取器用于增强其他GNN模型的有效性。
{"title":"Heterogeneous Graph Transformer for Meta-structure Learning with Application in Text Classification","authors":"Shuhai Wang, Xin Liu, Xiao Pan, Hanjie Xu, Mingrui Liu","doi":"https://dl.acm.org/doi/10.1145/3580508","DOIUrl":"https://doi.org/https://dl.acm.org/doi/10.1145/3580508","url":null,"abstract":"<p>The prevalent heterogeneous Graph Neural Network (GNN) models learn node and graph representations using pre-defined meta-paths or only automatically discovering meta-paths. However, the existing methods suffer from information loss due to neglecting undiscovered meta-structures with richer semantics than meta-paths in heterogeneous graphs. To take advantage of the current rich meta-structures in heterogeneous graphs, we propose a novel approach called HeGTM to automatically extract essential meta-structures (i.e., meta-paths and meta-graphs) from heterogeneous graphs. The discovered meta-structures can capture more prosperous relations between different types of nodes that can help the model to learn representations. Furthermore, we apply the proposed approach for text classification. Specifically, we first design a heterogeneous graph for the text corpus, and then apply HeGTM on the constructed text graph to learn better text representations that contain various semantic relations. In addition, our approach can also be used as a strong meta-structure extractor for other GNN models. In other words, the auto-discovered meta-structures can replace the pre-defined meta-paths. The experimental results on text classification demonstrate the effectiveness of our approach to automatically extracting informative meta-structures from heterogeneous graphs and its usefulness in acting as a meta-structure extractor for boosting other GNN models.</p>","PeriodicalId":50940,"journal":{"name":"ACM Transactions on the Web","volume":"15 1","pages":""},"PeriodicalIF":3.5,"publicationDate":"2023-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138516945","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Introduction to the Special Issue on Advanced Graph Mining on the Web: Theory, Algorithms, and Applications: Part 1 Web上的高级图挖掘:理论、算法和应用特刊简介:第1部分
IF 3.5 4区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-05-22 DOI: https://dl.acm.org/doi/10.1145/3579360
Hao Peng, Jian Yang, Jia Wu, Philip S. Yu

No abstract available.

没有摘要。
{"title":"Introduction to the Special Issue on Advanced Graph Mining on the Web: Theory, Algorithms, and Applications: Part 1","authors":"Hao Peng, Jian Yang, Jia Wu, Philip S. Yu","doi":"https://dl.acm.org/doi/10.1145/3579360","DOIUrl":"https://doi.org/https://dl.acm.org/doi/10.1145/3579360","url":null,"abstract":"<p>No abstract available.</p>","PeriodicalId":50940,"journal":{"name":"ACM Transactions on the Web","volume":"43 14","pages":""},"PeriodicalIF":3.5,"publicationDate":"2023-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138495146","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Introduction to the Special Issue on Advanced Graph Mining on the Web: Theory, Algorithms, and Applications: Part 1 网络高级图挖掘特刊简介:理论、算法和应用:第1部分
IF 3.5 4区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-05-22 DOI: 10.1145/3579360
Hao Peng, Jian Yang, Jia Wu, Philip S. Yu
We are delighted to present this special issue on Advanced Graph Mining on the Web: Theory, Algorithms, and Applications. Graph mining plays an important role in data mining on the Web. It can take full advantage of the growing and easily accessible big data resources on the Web, such as rich semantic information in social media and complex associations between users in online social networks, which is crucial for the development of systems and applications such as event detection, social bot detection, and intelligent recommendation. However, extracting valuable and representative information from Web graph data is still a great challenge and requires research and development on advanced techniques. The purpose of this special issue is to provide a forum for researchers and practitioners to present their latest research findings and engineering experiences in the theoretical foundations, empirical studies, and novel applications of Graph Mining. This special issue consists of two parts. In Part 1, the guest editors selected 10 contributions that cover varying topics within this theme, ranging from reinforced and self-supervised GNN architecture search framework to the streaming growth algorithm of bipartite graphs. Yang et al. in “RoSGAS: Adaptive Social Bot Detection with Reinforced Self-supervised GNN Architecture Search” proposed a novel Reinforced and Self-supervised GNN Architecture Search framework named RoSGAS, which gains improvement in terms of accuracy, training efficiency, and stability. And has better generalization when handling unseen samples. Du et al. in “Niffler: Real-time Device-level Anomalies Detection in Smart Home” proposed a novel notion—a correlated graph, and with the aid of that, they developed a system to detect misbehaving devices without modifying the existing system, which is crucial for the device-level security in the smart home system. And then they further proposed a linkage path model and a sensitivity ranking method to assist in detecting the abnormalities. Sun et al. in “GroupAligner: A Deep Reinforcement Learning with Domain Adaptation for Social Group Alignment” presented a novel GroupAligner, a deep reinforcement learning with domain adaptation for social group alignment, which solves the problems of feature inconsistency across different social networks and group discovery within a social network in social group alignment. Zhu et al. in “A Multi-task Graph Neural Network with Variational Graph Auto-encoders for Session-based Travel Packages Recommendation” proposed a novel session-based model named STR-VGAE, which provides robust attributes’ representations and takes the effects of historical sessions for the current session into consideration. The model obtained promising results in the session-based recommendation, and can fill subtasks of the travel packages recommendation and variational graph auto-encoders simultaneously.
我们很高兴向大家介绍这期关于网络上的高级图挖掘:理论、算法和应用的特刊。图挖掘在Web数据挖掘中起着重要的作用。它可以充分利用Web上日益增长且易于获取的大数据资源,如社交媒体中丰富的语义信息和在线社交网络中用户之间复杂的关联,这对于事件检测、社交机器人检测、智能推荐等系统和应用的发展至关重要。然而,从Web图数据中提取有价值和代表性的信息仍然是一个巨大的挑战,需要先进的技术研究和开发。本期特刊的目的是为研究人员和实践者提供一个论坛,展示他们在图挖掘的理论基础、实证研究和新应用方面的最新研究成果和工程经验。本期特刊由两部分组成。在第1部分中,客座编辑选择了10篇文章,涵盖了这个主题中的不同主题,从强化和自监督GNN架构搜索框架到二部图的流增长算法。Yang等人在“RoSGAS: Adaptive Social Bot Detection with Reinforced Self-supervised GNN Architecture Search”一文中提出了一种新的增强自监督GNN Architecture Search框架RoSGAS,该框架在准确率、训练效率和稳定性方面都得到了提高。并且在处理看不见的样本时具有更好的泛化性。Du等人在《嗅嗅:智能家居中实时设备级异常检测》中提出了一种新颖的概念——关联图,并以此开发了一种无需修改现有系统即可检测异常设备的系统,这对于智能家居系统中设备级安全至关重要。然后,他们进一步提出了一种链接路径模型和灵敏度排序方法来帮助检测异常。Sun等人在“GroupAligner: A Deep Reinforcement Learning with Domain Adaptation for Social Group Alignment”一文中提出了一种新颖的GroupAligner,即一种具有领域适应的深度强化学习用于社会群体对齐,它解决了社会群体对齐中不同社会网络之间的特征不一致和社会网络内部的群体发现问题。Zhu等人在《基于会话的旅游包推荐的变分图自编码器多任务图神经网络》中提出了一种新的基于会话的模型STR-VGAE,该模型提供了鲁棒的属性表示,并考虑了历史会话对当前会话的影响。该模型在基于会话的推荐中取得了很好的效果,可以同时填补旅行包推荐和变分图自编码器的子任务。
{"title":"Introduction to the Special Issue on Advanced Graph Mining on the Web: Theory, Algorithms, and Applications: Part 1","authors":"Hao Peng, Jian Yang, Jia Wu, Philip S. Yu","doi":"10.1145/3579360","DOIUrl":"https://doi.org/10.1145/3579360","url":null,"abstract":"We are delighted to present this special issue on Advanced Graph Mining on the Web: Theory, Algorithms, and Applications. Graph mining plays an important role in data mining on the Web. It can take full advantage of the growing and easily accessible big data resources on the Web, such as rich semantic information in social media and complex associations between users in online social networks, which is crucial for the development of systems and applications such as event detection, social bot detection, and intelligent recommendation. However, extracting valuable and representative information from Web graph data is still a great challenge and requires research and development on advanced techniques. The purpose of this special issue is to provide a forum for researchers and practitioners to present their latest research findings and engineering experiences in the theoretical foundations, empirical studies, and novel applications of Graph Mining. This special issue consists of two parts. In Part 1, the guest editors selected 10 contributions that cover varying topics within this theme, ranging from reinforced and self-supervised GNN architecture search framework to the streaming growth algorithm of bipartite graphs. Yang et al. in “RoSGAS: Adaptive Social Bot Detection with Reinforced Self-supervised GNN Architecture Search” proposed a novel Reinforced and Self-supervised GNN Architecture Search framework named RoSGAS, which gains improvement in terms of accuracy, training efficiency, and stability. And has better generalization when handling unseen samples. Du et al. in “Niffler: Real-time Device-level Anomalies Detection in Smart Home” proposed a novel notion—a correlated graph, and with the aid of that, they developed a system to detect misbehaving devices without modifying the existing system, which is crucial for the device-level security in the smart home system. And then they further proposed a linkage path model and a sensitivity ranking method to assist in detecting the abnormalities. Sun et al. in “GroupAligner: A Deep Reinforcement Learning with Domain Adaptation for Social Group Alignment” presented a novel GroupAligner, a deep reinforcement learning with domain adaptation for social group alignment, which solves the problems of feature inconsistency across different social networks and group discovery within a social network in social group alignment. Zhu et al. in “A Multi-task Graph Neural Network with Variational Graph Auto-encoders for Session-based Travel Packages Recommendation” proposed a novel session-based model named STR-VGAE, which provides robust attributes’ representations and takes the effects of historical sessions for the current session into consideration. The model obtained promising results in the session-based recommendation, and can fill subtasks of the travel packages recommendation and variational graph auto-encoders simultaneously.","PeriodicalId":50940,"journal":{"name":"ACM Transactions on the Web","volume":"17 1","pages":"1 - 2"},"PeriodicalIF":3.5,"publicationDate":"2023-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46038205","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reinforced MOOCs Concept Recommendation in Heterogeneous Information Networks 异构信息网络下强化mooc概念推荐
IF 3.5 4区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-05-22 DOI: https://dl.acm.org/doi/10.1145/3580510
Jibing Gong, Yao Wan, Ye Liu, Xuewen Li, Yi Zhao, Cheng Wang, Yuting Lin, Xiaohan Fang, Wenzheng Feng, Jingyi Zhang, Jie Tang

Massive open online courses (MOOCs), which offer open access and widespread interactive participation through the internet, are quickly becoming the preferred method for online and remote learning. Several MOOC platforms offer the service of course recommendation to users, to improve the learning experience of users. Despite the usefulness of this service, we consider that recommending courses to users directly may neglect their varying degrees of expertise. To mitigate this gap, we examine an interesting problem of concept recommendation in this paper, which can be viewed as recommending knowledge to users in a fine-grained way. We put forward a novel approach, termed HinCRec-RL, for Concept Recommendation in MOOCs, which is based on Heterogeneous Information Networks and Reinforcement Learning. In particular, we propose to shape the problem of concept recommendation within a reinforcement learning framework to characterize the dynamic interaction between users and knowledge concepts in MOOCs. Furthermore, we propose to form the interactions among users, courses, videos, and concepts into a heterogeneous information network (HIN) to learn the semantic user representations better. We then employ an attentional graph neural network to represent the users in the HIN, based on meta-paths. Extensive experiments are conducted on a real-world dataset collected from a Chinese MOOC platform, XuetangX, to validate the efficacy of our proposed HinCRec-RL. Experimental results and analysis demonstrate that our proposed HinCRec-RL performs well when compared with several state-of-the-art models.

大规模在线开放课程(MOOCs)通过互联网提供开放获取和广泛的互动参与,正迅速成为在线和远程学习的首选方法。一些MOOC平台为用户提供课程推荐服务,以改善用户的学习体验。尽管这项服务很有用,但我们认为直接向用户推荐课程可能会忽略他们不同程度的专业知识。为了缩小这一差距,我们在本文中研究了一个有趣的概念推荐问题,它可以被视为以细粒度的方式向用户推荐知识。我们提出了一种基于异构信息网络和强化学习的MOOCs概念推荐新方法,称为HinCRec-RL。特别是,我们建议在强化学习框架内塑造概念推荐问题,以表征mooc中用户与知识概念之间的动态交互。此外,我们提出将用户、课程、视频和概念之间的交互形成一个异构信息网络(HIN),以更好地学习语义用户表示。然后,我们使用一个基于元路径的注意力图神经网络来表示HIN中的用户。在从中国MOOC平台XuetangX收集的真实数据集上进行了大量实验,以验证我们提出的HinCRec-RL的有效性。实验结果和分析表明,与几种最先进的模型相比,我们提出的HinCRec-RL具有良好的性能。
{"title":"Reinforced MOOCs Concept Recommendation in Heterogeneous Information Networks","authors":"Jibing Gong, Yao Wan, Ye Liu, Xuewen Li, Yi Zhao, Cheng Wang, Yuting Lin, Xiaohan Fang, Wenzheng Feng, Jingyi Zhang, Jie Tang","doi":"https://dl.acm.org/doi/10.1145/3580510","DOIUrl":"https://doi.org/https://dl.acm.org/doi/10.1145/3580510","url":null,"abstract":"<p><b>Massive open online courses (MOOCs)</b>, which offer open access and widespread interactive participation through the internet, are quickly becoming the preferred method for online and remote learning. Several MOOC platforms offer the service of course recommendation to users, to improve the learning experience of users. Despite the usefulness of this service, we consider that recommending courses to users directly may neglect their varying degrees of expertise. To mitigate this gap, we examine an interesting problem of concept recommendation in this paper, which can be viewed as recommending knowledge to users in a fine-grained way. We put forward a novel approach, termed <b>HinCRec-RL, for <underline>C</underline>oncept <underline>Rec</underline>ommendation in MOOCs, which is based on <underline>H</underline>eterogeneous <underline>I</underline>nformation <underline>N</underline>etworks and <underline>R</underline>einforcement <underline>L</underline>earning</b>. In particular, we propose to shape the problem of concept recommendation within a reinforcement learning framework to characterize the dynamic interaction between users and knowledge concepts in MOOCs. Furthermore, we propose to form the interactions among users, courses, videos, and concepts into a <b>heterogeneous information network (HIN)</b> to learn the semantic user representations better. We then employ an attentional graph neural network to represent the users in the HIN, based on meta-paths. Extensive experiments are conducted on a real-world dataset collected from a Chinese MOOC platform, <i>XuetangX</i>, to validate the efficacy of our proposed HinCRec-RL. Experimental results and analysis demonstrate that our proposed HinCRec-RL performs well when compared with several state-of-the-art models.</p>","PeriodicalId":50940,"journal":{"name":"ACM Transactions on the Web","volume":"43 20","pages":""},"PeriodicalIF":3.5,"publicationDate":"2023-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138495140","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Constructing Spatio-Temporal Graphs for Face Forgery Detection 人脸伪造检测的时空图谱构建
IF 3.5 4区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-05-22 DOI: https://dl.acm.org/doi/10.1145/3580512
Zhihua Shang, Hongtao Xie, Lingyun Yu, Zhengjun Zha, Yongdong Zhang

Recently, advanced development of facial manipulation techniques threatens web information security, thus, face forgery detection attracts a lot of attention. It is clear that both spatial and temporal information of facial videos contains the crucial manipulation traces, which are inevitably created during the generation process. However, most existing face forgery detectors only focus on the spatial artifacts or the temporal incoherence, and they are struggling to learn a significant and general kind of representations for manipulated facial videos. In this work, we propose to construct spatial-temporal graphs for fake videos to capture the spatial inconsistency and the temporal incoherence at the same time. To model the spatial-temporal relationship among the graph nodes, a novel forgery detector named Spatio-Temporal Graph Network (STGN) is proposed, which contains two kinds of graph-convolution-based units, the Spatial Relation Graph Unit (SRGU) and the Temporal Attention Graph Unit (TAGU). To exploit spatial information, the SRGU models the inconsistency between each pair of patches in the same frame, instead of focusing on the low-level local spatial artifacts which are vulnerable to samples created by unseen manipulation methods. And, the TAGU is proposed to model the long-distance temporal relation among the patches at the same spatial position in different frames with a graph attention mechanism based on the inter-node similarity. With the SRGU and the TAGU, our STGN can combine the discriminative power of spatial inconsistency and the generalization capacity of temporal incoherence for face forgery detection. Our STGN achieves state-of-the-art performances on several popular forgery detection datasets. Extensive experiments demonstrate both the superiority of our STGN on intra manipulation evaluation and the effectiveness for new sorts of face forgery videos on cross manipulation evaluation.

近年来,人脸处理技术的飞速发展对网络信息安全构成了威胁,人脸伪造检测备受关注。显然,人脸视频的时空信息中都包含着关键的操作痕迹,这些痕迹是在生成过程中不可避免地产生的。然而,大多数现有的人脸伪造检测器只关注空间伪影或时间不相干,并且他们正在努力学习一种重要的和通用的面部伪造视频表示。在这项工作中,我们建议为假视频构建时空图,同时捕捉空间不一致性和时间不一致性。为了对图节点之间的时空关系进行建模,提出了一种新的伪造检测方法——时空图网络(STGN),它包含两种基于图卷积的单元,即空间关系图单元(SRGU)和时间注意图单元(TAGU)。为了利用空间信息,SRGU对同一帧中每对补丁之间的不一致性进行建模,而不是专注于低级的局部空间伪影,这些伪影容易受到由看不见的操作方法创建的样本的影响。利用基于节点间相似度的图注意机制,提出了基于节点间相似度的图注意模型,对不同帧内同一空间位置的小块间的远距离时间关系进行建模。结合SRGU和TAGU,我们的STGN可以将空间不一致的判别能力和时间不相干的泛化能力结合起来用于人脸伪造检测。我们的STGN在几个流行的伪造检测数据集上实现了最先进的性能。大量的实验证明了我们的STGN在内部操作评估上的优越性和在交叉操作评估上对新型人脸伪造视频的有效性。
{"title":"Constructing Spatio-Temporal Graphs for Face Forgery Detection","authors":"Zhihua Shang, Hongtao Xie, Lingyun Yu, Zhengjun Zha, Yongdong Zhang","doi":"https://dl.acm.org/doi/10.1145/3580512","DOIUrl":"https://doi.org/https://dl.acm.org/doi/10.1145/3580512","url":null,"abstract":"<p>Recently, advanced development of facial manipulation techniques threatens web information security, thus, face forgery detection attracts a lot of attention. It is clear that both spatial and temporal information of facial videos contains the crucial manipulation traces, which are inevitably created during the generation process. However, most existing face forgery detectors only focus on the spatial artifacts or the temporal incoherence, and they are struggling to learn a significant and general kind of representations for manipulated facial videos. In this work, we propose to construct spatial-temporal graphs for fake videos to capture the spatial inconsistency and the temporal incoherence at the same time. To model the spatial-temporal relationship among the graph nodes, a novel forgery detector named Spatio-Temporal Graph Network (STGN) is proposed, which contains two kinds of graph-convolution-based units, the Spatial Relation Graph Unit (SRGU) and the Temporal Attention Graph Unit (TAGU). To exploit spatial information, the SRGU models the inconsistency between each pair of patches in the same frame, instead of focusing on the low-level local spatial artifacts which are vulnerable to samples created by unseen manipulation methods. And, the TAGU is proposed to model the long-distance temporal relation among the patches at the same spatial position in different frames with a graph attention mechanism based on the inter-node similarity. With the SRGU and the TAGU, our STGN can combine the discriminative power of spatial inconsistency and the generalization capacity of temporal incoherence for face forgery detection. Our STGN achieves state-of-the-art performances on several popular forgery detection datasets. Extensive experiments demonstrate both the superiority of our STGN on intra manipulation evaluation and the effectiveness for new sorts of face forgery videos on cross manipulation evaluation.</p>","PeriodicalId":50940,"journal":{"name":"ACM Transactions on the Web","volume":"43 21","pages":""},"PeriodicalIF":3.5,"publicationDate":"2023-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138495139","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Graph Attention Network for Text Classification and Detection of Mental Disorder 精神障碍文本分类与检测的图注意网络
IF 3.5 4区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-05-22 DOI: https://dl.acm.org/doi/10.1145/3572406
Usman Ahmed, Jerry Chun-Wei Lin, Gautam Srivastava

A serious issue in today’s society is Depression, which can have a devastating impact on a person’s ability to cope in daily life. Numerous studies have examined the use of data generated directly from users using social media to diagnose and detect Depression as a mental illness. Therefore, this paper investigates the language used in individuals’ personal expressions to identify depressive symptoms via social media. Graph Attention Networks (GATs) are used in this study as a solution to the problems associated with text classification of depression. These GATs can be constructed using masked self-attention layers. Rather than requiring expensive matrix operations such as similarity or knowledge of network architecture, this study implicitly assigns weights to each node in a neighbourhood. This is possible because nodes and words can carry properties and sentiments of their neighbours. Another aspect of the study that contributed to the expansion of the emotion lexicon was the use of hypernyms. As a result, our method performs better when applied to data from the Reddit subreddit Depression. Our experiments show that the emotion lexicon constructed by using the Graph Attention Network ROC achieves 0.91 while remaining simple and interpretable.

抑郁症是当今社会的一个严重问题,它会对一个人应对日常生活的能力产生毁灭性的影响。许多研究调查了使用社交媒体用户直接产生的数据来诊断和检测抑郁症作为一种精神疾病的情况。因此,本文研究了个体通过社交媒体识别抑郁症状的个人表达中使用的语言。本研究使用图形注意网络(GATs)来解决与抑郁症文本分类相关的问题。这些GATs可以使用遮罩自注意层来构建。该研究不需要昂贵的矩阵运算,如相似性或网络架构知识,而是隐式地为邻居中的每个节点分配权重。这是可能的,因为节点和单词可以携带它们邻居的属性和情感。该研究的另一个有助于扩展情感词汇的方面是使用首字母缩略词。因此,我们的方法在应用于Reddit subreddit Depression的数据时表现更好。我们的实验表明,使用图注意网络构建的情绪词汇在保持简单和可解释性的前提下达到了0.91。
{"title":"Graph Attention Network for Text Classification and Detection of Mental Disorder","authors":"Usman Ahmed, Jerry Chun-Wei Lin, Gautam Srivastava","doi":"https://dl.acm.org/doi/10.1145/3572406","DOIUrl":"https://doi.org/https://dl.acm.org/doi/10.1145/3572406","url":null,"abstract":"<p>A serious issue in today’s society is Depression, which can have a devastating impact on a person’s ability to cope in daily life. Numerous studies have examined the use of data generated directly from users using social media to diagnose and detect Depression as a mental illness. Therefore, this paper investigates the language used in individuals’ personal expressions to identify depressive symptoms via social media. Graph Attention Networks (GATs) are used in this study as a solution to the problems associated with text classification of depression. These GATs can be constructed using masked self-attention layers. Rather than requiring expensive matrix operations such as similarity or knowledge of network architecture, this study implicitly assigns weights to each node in a neighbourhood. This is possible because nodes and words can carry properties and sentiments of their neighbours. Another aspect of the study that contributed to the expansion of the emotion lexicon was the use of hypernyms. As a result, our method performs better when applied to data from the Reddit subreddit Depression. Our experiments show that the emotion lexicon constructed by using the Graph Attention Network ROC achieves 0.91 while remaining simple and interpretable.</p>","PeriodicalId":50940,"journal":{"name":"ACM Transactions on the Web","volume":"43 18","pages":""},"PeriodicalIF":3.5,"publicationDate":"2023-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138495142","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Type Information Utilized Event Detection via Multi-Channel GNNs in Electrical Power Systems 基于多通道gnn的电力系统事件检测类型信息
IF 3.5 4区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-05-22 DOI: https://dl.acm.org/doi/10.1145/3577031
Qian Li, Jianxin Li, Lihong Wang, Cheng Ji, Yiming Hei, Jiawei Sheng, Qingyun Sun, Shan Xue, Pengtao Xie

Event detection in power systems aims to identify triggers and event types, which helps relevant personnel respond to emergencies promptly and facilitates the optimization of power supply strategies. However, the limited length of short electrical record texts causes severe information sparsity, and numerous domain-specific terminologies of power systems makes it difficult to transfer knowledge from language models pre-trained on general-domain texts. Traditional event detection approaches primarily focus on the general domain and ignore these two problems in the power system domain. To address the above issues, we propose a Multi-Channel graph neural network utilizing Type information for Event Detection in power systems, named MC-TED, leveraging a semantic channel and a topological channel to enrich information interaction from short texts. Concretely, the semantic channel refines textual representations with semantic similarity, building the semantic information interaction among potential event-related words. The topological channel generates a relation-type-aware graph modeling word dependencies, and a word-type-aware graph integrating part-of-speech tags. To further reduce errors worsened by professional terminologies in type analysis, a type learning mechanism is designed for updating the representations of both the word type and relation type in the topological channel. In this way, the information sparsity and professional term occurrence problems can be alleviated by enabling interaction between topological and semantic information. Furthermore, to address the lack of labeled data in power systems, we built a Chinese event detection dataset based on electrical Power Event texts, named PoE. In experiments, our model achieves compelling results not only on the PoE dataset, but on general-domain event detection datasets including ACE 2005 and MAVEN.

电力系统事件检测的目的是识别触发事件和事件类型,帮助相关人员及时应对突发事件,优化供电策略。然而,短电记录文本的有限长度导致了严重的信息稀疏性,并且电力系统的许多领域特定术语使得从通用领域文本上预训练的语言模型中转移知识变得困难。传统的事件检测方法主要关注一般领域,而忽略了电力系统领域的这两个问题。为了解决上述问题,我们提出了一种利用类型信息进行电力系统事件检测的多通道图神经网络,命名为MC-TED,利用语义通道和拓扑通道来丰富短文本的信息交互。具体而言,语义通道通过语义相似度提炼文本表示,构建潜在事件相关词之间的语义信息交互。拓扑通道生成一个关系类型感知图(建模词依赖关系)和一个词类型感知图(集成词性标记)。为了进一步减少类型分析中专业术语造成的错误,设计了一种类型学习机制,用于更新拓扑通道中单词类型和关系类型的表示。通过实现拓扑信息和语义信息的交互,可以缓解信息稀疏和专业术语出现问题。此外,为了解决电力系统中缺乏标记数据的问题,我们建立了一个基于电力事件文本的中文事件检测数据集,命名为PoE。在实验中,我们的模型不仅在PoE数据集上取得了令人信服的结果,而且在ACE 2005和MAVEN等通用领域事件检测数据集上也取得了令人信服的结果。
{"title":"Type Information Utilized Event Detection via Multi-Channel GNNs in Electrical Power Systems","authors":"Qian Li, Jianxin Li, Lihong Wang, Cheng Ji, Yiming Hei, Jiawei Sheng, Qingyun Sun, Shan Xue, Pengtao Xie","doi":"https://dl.acm.org/doi/10.1145/3577031","DOIUrl":"https://doi.org/https://dl.acm.org/doi/10.1145/3577031","url":null,"abstract":"<p>Event detection in power systems aims to identify triggers and event types, which helps relevant personnel respond to emergencies promptly and facilitates the optimization of power supply strategies. However, the limited length of short electrical record texts causes severe information sparsity, and numerous domain-specific terminologies of power systems makes it difficult to transfer knowledge from language models pre-trained on general-domain texts. Traditional event detection approaches primarily focus on the general domain and ignore these two problems in the power system domain. To address the above issues, we propose a <b>Multi-Channel graph neural network utilizing Type information for Event Detection</b> in power systems, named <b>MC-TED</b>, leveraging a semantic channel and a topological channel to enrich information interaction from short texts. Concretely, the semantic channel refines textual representations with semantic similarity, building the semantic information interaction among potential event-related words. The topological channel generates a relation-type-aware graph modeling word dependencies, and a word-type-aware graph integrating part-of-speech tags. To further reduce errors worsened by professional terminologies in type analysis, a type learning mechanism is designed for updating the representations of both the word type and relation type in the topological channel. In this way, the information sparsity and professional term occurrence problems can be alleviated by enabling interaction between topological and semantic information. Furthermore, to address the lack of labeled data in power systems, we built a Chinese event detection dataset based on electrical Power Event texts, named <b>PoE</b>. In experiments, our model achieves compelling results not only on the PoE dataset, but on general-domain event detection datasets including ACE 2005 and MAVEN.</p>","PeriodicalId":50940,"journal":{"name":"ACM Transactions on the Web","volume":"43 19","pages":""},"PeriodicalIF":3.5,"publicationDate":"2023-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138495141","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
RoSGAS: Adaptive Social Bot Detection with Reinforced Self-supervised GNN Architecture Search 基于增强自监督GNN架构搜索的自适应社交机器人检测
IF 3.5 4区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-05-22 DOI: https://dl.acm.org/doi/10.1145/3572403
Yingguang Yang, Renyu Yang, Yangyang Li, Kai Cui, Zhiqin Yang, Yue Wang, Jie Xu, Haiyong Xie

Social bots are referred to as the automated accounts on social networks that make attempts to behave like humans. While Graph Neural Networks (GNNs) have been massively applied to the field of social bot detection, a huge amount of domain expertise and prior knowledge is heavily engaged in the state-of-the-art approaches to design a dedicated neural network architecture for a specific classification task. Involving oversized nodes and network layers in the model design, however, usually causes the over-smoothing problem and the lack of embedding discrimination. In this article, we propose RoSGAS, a novel Reinforced and Self-supervised GNN Architecture Search framework to adaptively pinpoint the most suitable multi-hop neighborhood and the number of layers in the GNN architecture. More specifically, we consider the social bot detection problem as a user-centric subgraph embedding and classification task. We exploit the heterogeneous information network to present the user connectivity by leveraging account metadata, relationships, behavioral features, and content features. RoSGAS uses a multi-agent deep reinforcement learning (RL), 31 pages. mechanism for navigating the search of optimal neighborhood and network layers to learn individually the subgraph embedding for each target user. A nearest neighbor mechanism is developed for accelerating the RL training process, and RoSGAS can learn more discriminative subgraph embedding with the aid of self-supervised learning. Experiments on five Twitter datasets show that RoSGAS outperforms the state-of-the-art approaches in terms of accuracy, training efficiency, and stability and has better generalization when handling unseen samples.

社交机器人指的是社交网络上的自动账户,它们试图像人类一样行事。虽然图神经网络(gnn)已被大量应用于社交机器人检测领域,但为特定分类任务设计专用神经网络架构的最新方法大量涉及领域专业知识和先验知识。然而,在模型设计中涉及过大的节点和网络层,通常会导致过度平滑问题和缺乏嵌入判别。在本文中,我们提出了一种新的增强和自监督GNN架构搜索框架RoSGAS,用于自适应地确定GNN架构中最合适的多跳邻域和层数。更具体地说,我们认为社交机器人检测问题是一个以用户为中心的子图嵌入和分类任务。我们利用异构信息网络,通过利用账户元数据、关系、行为特征和内容特征来呈现用户连通性。RoSGAS使用多智能体深度强化学习(RL), 31页。导航搜索最优邻域和网络层的机制,以单独学习每个目标用户的子图嵌入。提出了一种加速RL训练过程的最近邻机制,利用自监督学习,RoSGAS可以学习到更多的判别子图嵌入。在五个Twitter数据集上的实验表明,RoSGAS在准确率、训练效率和稳定性方面都优于最先进的方法,并且在处理未见过的样本时具有更好的泛化效果。
{"title":"RoSGAS: Adaptive Social Bot Detection with Reinforced Self-supervised GNN Architecture Search","authors":"Yingguang Yang, Renyu Yang, Yangyang Li, Kai Cui, Zhiqin Yang, Yue Wang, Jie Xu, Haiyong Xie","doi":"https://dl.acm.org/doi/10.1145/3572403","DOIUrl":"https://doi.org/https://dl.acm.org/doi/10.1145/3572403","url":null,"abstract":"<p>Social bots are referred to as the automated accounts on social networks that make attempts to behave like humans. While Graph Neural Networks (GNNs) have been massively applied to the field of social bot detection, a huge amount of domain expertise and prior knowledge is heavily engaged in the state-of-the-art approaches to design a dedicated neural network architecture for a specific classification task. Involving oversized nodes and network layers in the model design, however, usually causes the over-smoothing problem and the lack of embedding discrimination. In this article, we propose <span>RoSGAS</span>, a novel <underline>R</underline>einf<underline>o</underline>rced and <underline>S</underline>elf-supervised <underline>G</underline>NN <underline>A</underline>rchitecture <underline>S</underline>earch framework to adaptively pinpoint the most suitable multi-hop neighborhood and the number of layers in the GNN architecture. More specifically, we consider the social bot detection problem as a user-centric subgraph embedding and classification task. We exploit the heterogeneous information network to present the user connectivity by leveraging account metadata, relationships, behavioral features, and content features. <span>RoSGAS</span> uses a multi-agent deep reinforcement learning (RL), 31 pages. mechanism for navigating the search of optimal neighborhood and network layers to learn individually the subgraph embedding for each target user. A nearest neighbor mechanism is developed for accelerating the RL training process, and <span>RoSGAS</span> can learn more discriminative subgraph embedding with the aid of self-supervised learning. Experiments on five Twitter datasets show that <span>RoSGAS</span> outperforms the state-of-the-art approaches in terms of accuracy, training efficiency, and stability and has better generalization when handling unseen samples.</p>","PeriodicalId":50940,"journal":{"name":"ACM Transactions on the Web","volume":"43 15","pages":""},"PeriodicalIF":3.5,"publicationDate":"2023-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138495145","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
sGrow: Explaining the Scale-Invariant Strength Assortativity of Streaming Butterflies sGrow:解释流动蝴蝶的尺度不变强度协调性
IF 3.5 4区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-05-22 DOI: https://dl.acm.org/doi/10.1145/3572408
Aida Sheshbolouki, M. Tamer Özsu

Bipartite graphs are rich data structures with prevalent applications and characteristic structural features. However, less is known about their growth patterns, particularly in streaming settings. Current works study the patterns of static or aggregated temporal graphs optimized for certain downstream analytics or ignoring multipartite/non-stationary data distributions, emergence patterns of subgraphs, and streaming paradigms. To address these, we perform statistical network analysis over web log streams and identify the governing patterns underlying the bursty emergence of mesoscopic building blocks, 2, 2-bicliques, leading to a phenomenon that we call scale-invariant strength assortativity of streaming butterflies. We provide the graph-theoretic explanation of this phenomenon. We further introduce a set of micro-mechanics in the body of a streaming growth algorithm, sGrow, to pinpoint the generative origins. sGrow supports streaming paradigms, emergence of four-vertex graphlets, and provides user-specified configurations for the scale, burstiness, level of strength assortativity, probability of out-of-order records, generation time, and time-sensitive connections. Comprehensive evaluations on pattern reproducing and stress testing validate the effectiveness, efficiency, and robustness of sGrow in realization of the observed patterns independent of initial conditions, scale, temporal characteristics, and model configurations. Theoretical and experimental analysis verify sGrow’s robustness in generating streaming graphs based on user-specified configurations that affect the scale and burstiness of the stream, level of strength assortativity, probability of out-of-order streaming records, generation time, and time-sensitive connections.

二部图是一种丰富的数据结构,具有广泛的应用和独特的结构特征。然而,人们对它们的增长模式知之甚少,尤其是在流媒体环境中。目前的工作是研究静态或聚合时间图的模式,为某些下游分析优化,或忽略多部/非平稳数据分布、子图的出现模式和流范式。为了解决这些问题,我们对网络日志流进行了统计网络分析,并确定了介观构建块(2,2 -bicliques)突然出现的控制模式,从而导致了一种我们称之为流蝴蝶的尺度不变强度分类的现象。我们提供了这种现象的图论解释。我们进一步在流生长算法sGrow的主体中引入了一套微观力学,以确定生成的起源。sGrow支持流范式、四顶点石墨烯的出现,并为规模、突发性、强度分类等级、乱序记录的概率、生成时间和时间敏感连接提供用户指定的配置。对模式再现和压力测试的综合评估验证了sGrow在独立于初始条件、规模、时间特征和模型配置实现所观察模式方面的有效性、效率和鲁棒性。理论和实验分析验证了sGrow在生成流图方面的鲁棒性,这些流图基于用户指定的配置,这些配置会影响流的规模和突发性、强度匹配程度、乱序流记录的概率、生成时间和时间敏感连接。
{"title":"sGrow: Explaining the Scale-Invariant Strength Assortativity of Streaming Butterflies","authors":"Aida Sheshbolouki, M. Tamer Özsu","doi":"https://dl.acm.org/doi/10.1145/3572408","DOIUrl":"https://doi.org/https://dl.acm.org/doi/10.1145/3572408","url":null,"abstract":"<p>Bipartite graphs are rich data structures with prevalent applications and characteristic structural features. However, less is known about their growth patterns, particularly in streaming settings. Current works study the patterns of static or aggregated temporal graphs optimized for certain downstream analytics or ignoring multipartite/non-stationary data distributions, emergence patterns of subgraphs, and streaming paradigms. To address these, we perform statistical network analysis over web log streams and identify the governing patterns underlying the bursty emergence of mesoscopic building blocks, 2, 2-bicliques, leading to a phenomenon that we call <i>scale-invariant strength assortativity of streaming butterflies</i>. We provide the graph-theoretic explanation of this phenomenon. We further introduce a set of micro-mechanics in the body of a streaming growth algorithm, <i>sGrow</i>, to pinpoint the generative origins. <i>sGrow</i> supports streaming paradigms, emergence of four-vertex graphlets, and provides user-specified configurations for the scale, burstiness, level of strength assortativity, probability of out-of-order records, generation time, and time-sensitive connections. Comprehensive evaluations on pattern reproducing and stress testing validate the effectiveness, efficiency, and robustness of <i>sGrow</i> in realization of the observed patterns independent of initial conditions, scale, temporal characteristics, and model configurations. Theoretical and experimental analysis verify <i>sGrow</i>’s robustness in generating streaming graphs based on user-specified configurations that affect the scale and burstiness of the stream, level of strength assortativity, probability of out-of-order streaming records, generation time, and time-sensitive connections.</p>","PeriodicalId":50940,"journal":{"name":"ACM Transactions on the Web","volume":"43 22","pages":""},"PeriodicalIF":3.5,"publicationDate":"2023-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138495138","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
ACM Transactions on the Web
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1