Heterogeneous information networks have been proven to effectively improve recommendations due to their diverse information content. However, there are still two challenges for recommendation methods based on heterogeneous information networks. To begin with, current methods often depend on experts to manually craft meta-paths, and it can be challenging to define an adequate set of meta-paths for complex task scenarios. Second, most models fail to fully explore user preferences for paths or meta-paths whileimultaneously learning path or meta-path explicit representations. To tackle the aforementioned issues, we propose a model for recommendation utilizing meta-path automatically extracted from heterogeneous information network, called MAERec. Specifically, MAERec employs an automatic approach to extract high-quality path instances from heterogeneous information networks and construct meta-paths. These meta-paths are then utilized by a hierarchical attention network to learn an explicit representation of the meta-path-based context. Extensive experiments conducted on various real-world datasets not only showcase the superior performance of MAERec when compared to state-of-the-art methods but also underscore its capability to automatically discover high-quality path instances for meta-path extraction.
{"title":"Meta-path automatically extracted from heterogeneous information network for recommendation","authors":"Yihao Zhang, Weiwen Liao, Yulin Wang, Junlin Zhu, Ruizhen Chen, Yunjia Zhang","doi":"10.1007/s11280-024-01265-4","DOIUrl":"https://doi.org/10.1007/s11280-024-01265-4","url":null,"abstract":"<p>Heterogeneous information networks have been proven to effectively improve recommendations due to their diverse information content. However, there are still two challenges for recommendation methods based on heterogeneous information networks. To begin with, current methods often depend on experts to manually craft meta-paths, and it can be challenging to define an adequate set of meta-paths for complex task scenarios. Second, most models fail to fully explore user preferences for paths or meta-paths whileimultaneously learning path or meta-path explicit representations. To tackle the aforementioned issues, we propose a model for recommendation utilizing meta-path automatically extracted from heterogeneous information network, called MAERec. Specifically, MAERec employs an automatic approach to extract high-quality path instances from heterogeneous information networks and construct meta-paths. These meta-paths are then utilized by a hierarchical attention network to learn an explicit representation of the meta-path-based context. Extensive experiments conducted on various real-world datasets not only showcase the superior performance of MAERec when compared to state-of-the-art methods but also underscore its capability to automatically discover high-quality path instances for meta-path extraction.</p>","PeriodicalId":501180,"journal":{"name":"World Wide Web","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140572460","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-04-12DOI: 10.1007/s11280-024-01248-5
Yehong Xu, Mengxuan Zhang, Ruizhong Wu, Lei Li, Xiaofang Zhou
Coverage Centrality is an important metric to evaluate vertex importance in road networks. However, current solutions have to compute the coverage centrality of all the vertices together, which is resource-wasting, especially when only some vertices centrality is required. In addition, they have poor adaption to the dynamic scenario because of the computation inefficiency. In this paper, we focus on the coverage centrality query problem and propose a method that efficiently computes the centrality of single vertices without relying on the underlying graph being static by employing the intra-region pruning, inter-region pruning, and top-down search. We further propose the bottom-up search and mixed search to improve efficiency. Experiments validate the efficiency and effectiveness of our algorithms compared with the state-of-the-art method.
{"title":"Efficient processing of coverage centrality queries on road networks","authors":"Yehong Xu, Mengxuan Zhang, Ruizhong Wu, Lei Li, Xiaofang Zhou","doi":"10.1007/s11280-024-01248-5","DOIUrl":"https://doi.org/10.1007/s11280-024-01248-5","url":null,"abstract":"<p>Coverage Centrality is an important metric to evaluate vertex importance in road networks. However, current solutions have to compute the coverage centrality of all the vertices together, which is resource-wasting, especially when only some vertices centrality is required. In addition, they have poor adaption to the dynamic scenario because of the computation inefficiency. In this paper, we focus on the coverage centrality query problem and propose a method that efficiently computes the centrality of single vertices without relying on the underlying graph being static by employing the intra-region pruning, inter-region pruning, and top-down search. We further propose the bottom-up search and mixed search to improve efficiency. Experiments validate the efficiency and effectiveness of our algorithms compared with the state-of-the-art method.</p>","PeriodicalId":501180,"journal":{"name":"World Wide Web","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-04-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140572450","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-04-12DOI: 10.1007/s11280-024-01264-5
Han Yu, Ziniu Liu, Hongkui Tu, Kai Chen, Aiping Li
Inductive relation prediction is an important learning task for knowledge graph reasoning that aims to infer new facts from existing ones. Previous graph neural networks (GNNs) based methods have demonstrated great success in inductive relation prediction by capturing more subgraph information. However, they aggregate all reasoning paths which might introduces redundant information. Such redundant information changes with the context of entity and easily outside the training distribution making existing GNN-base methods suffer from poor generalization. In this work, we propose a novel causal knowledge graph reasoning (CKGR) framework for inductive relation prediction task with better generalization. We first take a causal view of inductive relation prediction and construct a structural causal model (SCM) that reveals the relationship between variables. With our assumption, CKGR extracts causal and shortcut subgraphs conditioned on query triplet. Then, we parameter the backdoor adjustment of causality theory by making intervention in representation space. In this way, CKGR can learn stable causal feature and alleviates the confounding effect of shortcut features that are spuriously correlated to relation prediction. Extensive experiments on various tasks with real-world and synthetic datasets demonstrate the effectiveness of CKGR.
{"title":"Generalizable inductive relation prediction with causal subgraph","authors":"Han Yu, Ziniu Liu, Hongkui Tu, Kai Chen, Aiping Li","doi":"10.1007/s11280-024-01264-5","DOIUrl":"https://doi.org/10.1007/s11280-024-01264-5","url":null,"abstract":"<p>Inductive relation prediction is an important learning task for knowledge graph reasoning that aims to infer new facts from existing ones. Previous graph neural networks (GNNs) based methods have demonstrated great success in inductive relation prediction by capturing more subgraph information. However, they aggregate all reasoning paths which might introduces redundant information. Such redundant information changes with the context of entity and easily outside the training distribution making existing GNN-base methods suffer from poor generalization. In this work, we propose a novel causal knowledge graph reasoning (CKGR) framework for inductive relation prediction task with better generalization. We first take a causal view of inductive relation prediction and construct a structural causal model (SCM) that reveals the relationship between variables. With our assumption, CKGR extracts causal and shortcut subgraphs conditioned on query triplet. Then, we parameter the backdoor adjustment of causality theory by making intervention in representation space. In this way, CKGR can learn stable causal feature and alleviates the confounding effect of shortcut features that are spuriously correlated to relation prediction. Extensive experiments on various tasks with real-world and synthetic datasets demonstrate the effectiveness of CKGR.</p>","PeriodicalId":501180,"journal":{"name":"World Wide Web","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-04-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140572770","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-04-11DOI: 10.1007/s11280-024-01254-7
Zhengyun Zhou, Guojia Wan, Shirui Pan, Jia Wu, Wenbin Hu, Bo Du
Answering complex queries with First-order logical operators over knowledge graphs, such as conjunction ((wedge )), disjunction ((vee )), and negation ((lnot )) is immensely useful for identifying missing knowledge. Recently, neural symbolic reasoning methods have been proposed to map entities and relations into a continuous real vector space and model logical operators as differential neural networks. However, traditional methodss employ negative sampling, which corrupts complex queries to train embeddings. Consequently, these embeddings are susceptible to divergence in the open manifold of (mathbb {R}^n). The appropriate regularization is crucial for addressing the divergence of embeddings. In this paper, we introduces a Lie group as a compact embedding space for complex query embedding, enhancing ability to handle the intricacies of knowledge graphs the foundation model. Our method aims to solve the query of disjunctive and conjunctive problems. Entities and queries are represented as a region of a high-dimensional torus, where the projection, intersection, union, and negation of the torus naturally simulate entities and queries. After simulating the operations on the region of the torus we defined, we found that the resulting geometry remains unchanged. Experiments show that our method achieved a significant improvement on FB15K, FB15K-237, and NELL995. Through extensive experiments on datasets FB15K, FB15K-237, and NELL995, our approach demonstrates significant improvements, leveraging the strengths of knowledge graphs foundation model and complex query processing.
{"title":"Complex query answering over knowledge graphs foundation model using region embeddings on a lie group","authors":"Zhengyun Zhou, Guojia Wan, Shirui Pan, Jia Wu, Wenbin Hu, Bo Du","doi":"10.1007/s11280-024-01254-7","DOIUrl":"https://doi.org/10.1007/s11280-024-01254-7","url":null,"abstract":"<p>Answering complex queries with First-order logical operators over knowledge graphs, such as conjunction (<span>(wedge )</span>), disjunction (<span>(vee )</span>), and negation (<span>(lnot )</span>) is immensely useful for identifying missing knowledge. Recently, neural symbolic reasoning methods have been proposed to map entities and relations into a continuous real vector space and model logical operators as differential neural networks. However, traditional methodss employ negative sampling, which corrupts complex queries to train embeddings. Consequently, these embeddings are susceptible to divergence in the open manifold of <span>(mathbb {R}^n)</span>. The appropriate regularization is crucial for addressing the divergence of embeddings. In this paper, we introduces a Lie group as a compact embedding space for complex query embedding, enhancing ability to handle the intricacies of knowledge graphs the foundation model. Our method aims to solve the query of disjunctive and conjunctive problems. Entities and queries are represented as a region of a high-dimensional torus, where the projection, intersection, union, and negation of the torus naturally simulate entities and queries. After simulating the operations on the region of the torus we defined, we found that the resulting geometry remains unchanged. Experiments show that our method achieved a significant improvement on FB15K, FB15K-237, and NELL995. Through extensive experiments on datasets FB15K, FB15K-237, and NELL995, our approach demonstrates significant improvements, leveraging the strengths of knowledge graphs foundation model and complex query processing.</p>","PeriodicalId":501180,"journal":{"name":"World Wide Web","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140572282","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-04-03DOI: 10.1007/s11280-024-01259-2
Tianming Zhang, Xinwei Cai, Lu Chen, Zhengyi Yang, Yunjun Gao, Bin Cao, Jing Fan
In the context of searching a single data graph G, graph pattern matching is to find all the occurrences of a pattern graph Q in G, specified by a matching rule. It is of paramount importance in many real applications such as social network analysis and cyber security, among others. A wide spectrum of studies target general graph pattern matching. However, to analyze time-relevant services such as studying the spread of diseases and detecting attack patterns, it is attractive to study inexact temporal graph pattern matching. Hence, in this paper, we propose a relaxed matching rule called constrained temporal dual simulation, and study simulation-based constrained temporal graph pattern matching which guarantees that the matching result (i) preserves the ancestor and descendant temporal connectivities; and (ii) implements edge-to-temporal path mapping. We devise a decomposition-based matching method, which first decomposes the data graph into Source Temporal Connected Components, and then performs matching on decomposed subgraphs. To speed up the matching, we define child/parent dependency relation tables and propose an efficient double hierarchical traverse strategy. Considering that the temporal graphs are naturally dynamic, we further propose update algorithms. An extensive empirical study over real-world and synthetic temporal graphs has demonstrated the effectiveness and efficiency of our approach.
在搜索单个数据图 G 的情况下,图模式匹配是指根据匹配规则的规定,找到模式图 Q 在 G 中的所有出现。它在社交网络分析和网络安全等许多实际应用中至关重要。针对一般图模式匹配的研究范围很广。然而,要分析与时间相关的服务,如研究疾病传播和检测攻击模式,研究非精确的时间图模式匹配是很有吸引力的。因此,在本文中,我们提出了一种称为受限时空二元模拟的宽松匹配规则,并研究了基于模拟的受限时空图模式匹配,它能保证匹配结果:(i) 保留祖先和后代的时空连接性;(ii) 实现边到时空路径映射。我们设计了一种基于分解的匹配方法,它首先将数据图分解为源时空连接组件,然后在分解后的子图上执行匹配。为了加快匹配速度,我们定义了子/父依赖关系表,并提出了一种高效的双分层遍历策略。考虑到时态图是天然动态的,我们进一步提出了更新算法。对现实世界和合成时空图进行的广泛实证研究证明了我们方法的有效性和效率。
{"title":"Towards efficient simulation-based constrained temporal graph pattern matching","authors":"Tianming Zhang, Xinwei Cai, Lu Chen, Zhengyi Yang, Yunjun Gao, Bin Cao, Jing Fan","doi":"10.1007/s11280-024-01259-2","DOIUrl":"https://doi.org/10.1007/s11280-024-01259-2","url":null,"abstract":"<p>In the context of searching a single data graph <i>G</i>, graph pattern matching is to find all the occurrences of a pattern graph <i>Q</i> in <i>G</i>, specified by a matching rule. It is of paramount importance in many real applications such as social network analysis and cyber security, among others. A wide spectrum of studies target general graph pattern matching. However, to analyze time-relevant services such as studying the spread of diseases and detecting attack patterns, it is attractive to study inexact temporal graph pattern matching. Hence, in this paper, we propose a relaxed matching rule called constrained temporal dual simulation, and study simulation-based constrained temporal graph pattern matching which guarantees that the matching result (i) preserves the ancestor and descendant temporal connectivities; and (ii) implements edge-to-temporal path mapping. We devise a decomposition-based matching method, which first decomposes the data graph into Source Temporal Connected Components, and then performs matching on decomposed subgraphs. To speed up the matching, we define child/parent dependency relation tables and propose an efficient double hierarchical traverse strategy. Considering that the temporal graphs are naturally dynamic, we further propose update algorithms. An extensive empirical study over real-world and synthetic temporal graphs has demonstrated the effectiveness and efficiency of our approach.</p>","PeriodicalId":501180,"journal":{"name":"World Wide Web","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140572455","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The sequential recommendation task based on the multi-interest framework aims to model multiple interests of users from different aspects to predict their future interactions. However, researchers rarely consider the differences in features between the interests generated by the model. In extreme cases, all interest capsules have the same meaning, leading to the failure of modeling users with multiple interests. To address this issue, we propose the High-level Preferences as positive examples in Contrastive Learning for multi-interest Sequence Recommendation framework (HPCL4SR), which uses contrastive learning to distinguish differences in interests based on user item interaction information. In order to find high-quality comparative examples, this paper introduces the category information to construct a global graph, learning the association between categories for high-level preference interest of users. Then, a multi-layer perceptron is used to adaptively fuse the low-level preference interest features of the user’s items and the high-level preference interest features of the categories. Finally, user multi-interest contrastive samples are obtained through item sequence information and corresponding categories, which are fed into contrastive learning to optimize model parameters and generate multi-interest representations that are more in line with the user sequence. In addition, when modeling the user’s item sequence information, in order to increase the differentiation between item representations, the category of the item is used to supervise the learning process. Extensive experiments on three real datasets demonstrate that our method outperforms existing multi-interest recommendation models.
{"title":"High-level preferences as positive examples in contrastive learning for multi-interest sequential recommendation","authors":"Zizhong Zhu, Shuang Li, Yaokun Liu, Xiaowang Zhang, Zhiyong Feng, Yuexian Hou","doi":"10.1007/s11280-024-01263-6","DOIUrl":"https://doi.org/10.1007/s11280-024-01263-6","url":null,"abstract":"<p>The sequential recommendation task based on the multi-interest framework aims to model multiple interests of users from different aspects to predict their future interactions. However, researchers rarely consider the differences in features between the interests generated by the model. In extreme cases, all interest capsules have the same meaning, leading to the failure of modeling users with multiple interests. To address this issue, we propose the High-level Preferences as positive examples in Contrastive Learning for multi-interest Sequence Recommendation framework (HPCL4SR), which uses contrastive learning to distinguish differences in interests based on user item interaction information. In order to find high-quality comparative examples, this paper introduces the category information to construct a global graph, learning the association between categories for high-level preference interest of users. Then, a multi-layer perceptron is used to adaptively fuse the low-level preference interest features of the user’s items and the high-level preference interest features of the categories. Finally, user multi-interest contrastive samples are obtained through item sequence information and corresponding categories, which are fed into contrastive learning to optimize model parameters and generate multi-interest representations that are more in line with the user sequence. In addition, when modeling the user’s item sequence information, in order to increase the differentiation between item representations, the category of the item is used to supervise the learning process. Extensive experiments on three real datasets demonstrate that our method outperforms existing multi-interest recommendation models.</p>","PeriodicalId":501180,"journal":{"name":"World Wide Web","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140149653","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-03-06DOI: 10.1007/s11280-024-01258-3
Abstract
It is challenging to deploy convolutional neural networks (CNNs) for image denoising on low-power devices which can suffer from computational and memory constraints. To address this limitation, a simple yet effective and efficient feature redundancy reduction-based network (FRRN) is proposed in this paper, which integrates a feature refinement block (FRB), an attention fusion block (AFB), and an enhancement block (EB). Specifically, the FRB distills structural information via two parallel sub-networks, selecting representative feature representations while suppressing spatial-channel redundancy. The AFB absorbs an attentive fusion mechanism to facilitate diverse features extracted from two sub-networks, emphasizing texture and structure details but alleviating harmful features from problematic regions. The subsequent EB further boosts the feature representation abilities. Aiming to enhance denoising performance at both pixel level and semantic level, a multi-loss scheme comprising three popular loss functions is leveraged to improve the robustness of the denoiser. Comprehensive quantitative and qualitative analyses demonstrate the superiority of the proposed FRRN.
{"title":"Efficient feature redundancy reduction for image denoising","authors":"","doi":"10.1007/s11280-024-01258-3","DOIUrl":"https://doi.org/10.1007/s11280-024-01258-3","url":null,"abstract":"<h3>Abstract</h3> <p>It is challenging to deploy convolutional neural networks (CNNs) for image denoising on low-power devices which can suffer from computational and memory constraints. To address this limitation, a simple yet effective and efficient feature redundancy reduction-based network (FRRN) is proposed in this paper, which integrates a feature refinement block (FRB), an attention fusion block (AFB), and an enhancement block (EB). Specifically, the FRB distills structural information via two parallel sub-networks, selecting representative feature representations while suppressing spatial-channel redundancy. The AFB absorbs an attentive fusion mechanism to facilitate diverse features extracted from two sub-networks, emphasizing texture and structure details but alleviating harmful features from problematic regions. The subsequent EB further boosts the feature representation abilities. Aiming to enhance denoising performance at both pixel level and semantic level, a multi-loss scheme comprising three popular loss functions is leveraged to improve the robustness of the denoiser. Comprehensive quantitative and qualitative analyses demonstrate the superiority of the proposed FRRN.</p>","PeriodicalId":501180,"journal":{"name":"World Wide Web","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140043956","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-03-05DOI: 10.1007/s11280-024-01250-x
Abstract
Nowadays, with the development of recommender systems, an emerging recommendation scenario called group-to-group recommendation has played a vital role in information acquisition for users. The new recommendation scenario seeks to recommend a group of related items to users with similar interests. To some extent, it alleviates the problem of point-to-point recommendations getting trapped in an information cocoon due to an over-reliance on user behaviors. For the new recommendation scenario, the existing recommendation methods cannot model the complex interactions between user groups and item groups, thus affecting the accuracy of the group-to-group recommendation. In this paper, we propose a group-to-group recommendation method, which abstracts user groups and item groups into graphs and calculates the similarity between two graphs based on graph matching, dubbed as GMRec. Specifically, we construct the graph of user groups and item groups and then calculate the graph similarity scores between user groups and item groups from two perspectives of feature matching and structure matching. Experimental results show that our model achieves higher accuracy than state-of-the-art models on three industrial datasets with different group sizes, with a maximum improvement of 8.2%.
{"title":"Group-to-group recommendation with neural graph matching","authors":"","doi":"10.1007/s11280-024-01250-x","DOIUrl":"https://doi.org/10.1007/s11280-024-01250-x","url":null,"abstract":"<h3>Abstract</h3> <p>Nowadays, with the development of recommender systems, an emerging recommendation scenario called group-to-group recommendation has played a vital role in information acquisition for users. The new recommendation scenario seeks to recommend a group of related items to users with similar interests. To some extent, it alleviates the problem of point-to-point recommendations getting trapped in an information cocoon due to an over-reliance on user behaviors. For the new recommendation scenario, the existing recommendation methods cannot model the complex interactions between user groups and item groups, thus affecting the accuracy of the group-to-group recommendation. In this paper, we propose a group-to-group recommendation method, which abstracts user groups and item groups into graphs and calculates the similarity between two graphs based on graph matching, dubbed as GMRec. Specifically, we construct the graph of user groups and item groups and then calculate the graph similarity scores between user groups and item groups from two perspectives of feature matching and structure matching. Experimental results show that our model achieves higher accuracy than state-of-the-art models on three industrial datasets with different group sizes, with a maximum improvement of 8.2%.</p>","PeriodicalId":501180,"journal":{"name":"World Wide Web","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140035920","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-02-29DOI: 10.1007/s11280-024-01257-4
Lingling Zhang, Zhiping Shi, Zhiwei Zhang, Ye Yuan, Guoren Wang
The huge amount of graph data necessitates sampling methods to support graph-based analysis applications. Node influence is to count the influential nodes with a given node in large graphs that has wide applications including product promotion and information diffusion in social networks. However, existing sampling methods mainly consider node degree to compute the node influence while ignoring the important connections in terms of groups in which nodes participate, resulting in inaccuracy of influence estimations. To this end, this paper proposes group sampling, called GVRW, to count the groups along with node degrees to evaluate node influence in large graphs. Specifically, GVRW changes the way of random walker traversing a large graph from one node to a random neighbor node of the groups to enlarge the sampling space for the sake of characterizing the nodes and groups simultaneously. Furthermore, we carefully design the corresponding estimated method to employ the samples to estimate the specific distributions of groups and node degrees to compute the node influence. Experimental results on real-world graph datasets show that our proposed sampling and estimating methods can accurately obtain the properties and approximate the node influences closer to the real values than existing methods.
{"title":"Efficiently estimating node influence through group sampling over large graphs","authors":"Lingling Zhang, Zhiping Shi, Zhiwei Zhang, Ye Yuan, Guoren Wang","doi":"10.1007/s11280-024-01257-4","DOIUrl":"https://doi.org/10.1007/s11280-024-01257-4","url":null,"abstract":"<p>The huge amount of graph data necessitates sampling methods to support graph-based analysis applications. Node influence is to count the influential nodes with a given node in large graphs that has wide applications including product promotion and information diffusion in social networks. However, existing sampling methods mainly consider node degree to compute the node influence while ignoring the important connections in terms of groups in which nodes participate, resulting in inaccuracy of influence estimations. To this end, this paper proposes group sampling, called GVRW, to count the groups along with node degrees to evaluate node influence in large graphs. Specifically, GVRW changes the way of random walker traversing a large graph from one node to a random neighbor node of the groups to enlarge the sampling space for the sake of characterizing the nodes and groups simultaneously. Furthermore, we carefully design the corresponding estimated method to employ the samples to estimate the specific distributions of groups and node degrees to compute the node influence. Experimental results on real-world graph datasets show that our proposed sampling and estimating methods can accurately obtain the properties and approximate the node influences closer to the real values than existing methods.</p>","PeriodicalId":501180,"journal":{"name":"World Wide Web","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-02-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140006201","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-02-26DOI: 10.1007/s11280-024-01255-6
Yanmin Chen, Enhong Chen, Kun Zhang, Qi Liu, Ruijun Sun
Online question matching is the process of comparing user queries with system questions to find appropriate answers. This task has become increasingly important with the popularity of knowledge sharing social networks in product search and intelligent Q &A in customer service. Many previous studies have focused on designing complex semantic structures through the questions themselves. In fact, the online user’s queries accumulate a large number of similar sentences, which have been grouped by semantics in the retrieval system. However, how to use these sentences to enhance the understanding of system questions is rarely studied. In this paper, we propose a novel Relation-aware Semantic Enhancement Network (RSEN) model. Specifically, we leverage the labels of the history records to identify different semantically related sentences. Then, we construct an expanded relation network to integrate the representation of different semantic relations. Furthermore, we interact we integrate the features of the system question with the semantically related sentences to augment the semantic information. Finally, we evaluate our proposed RSEN on two publicly available datasets. The results demonstrate the effectiveness of our proposed RSEN method compared to the advanced baselines.
{"title":"A relation-aware representation approach for the question matching system","authors":"Yanmin Chen, Enhong Chen, Kun Zhang, Qi Liu, Ruijun Sun","doi":"10.1007/s11280-024-01255-6","DOIUrl":"https://doi.org/10.1007/s11280-024-01255-6","url":null,"abstract":"<p>Online question matching is the process of comparing user queries with system questions to find appropriate answers. This task has become increasingly important with the popularity of knowledge sharing social networks in product search and intelligent Q &A in customer service. Many previous studies have focused on designing complex semantic structures through the questions themselves. In fact, the online user’s queries accumulate a large number of similar sentences, which have been grouped by semantics in the retrieval system. However, how to use these sentences to enhance the understanding of system questions is rarely studied. In this paper, we propose a novel Relation-aware Semantic Enhancement Network (RSEN) model. Specifically, we leverage the labels of the history records to identify different semantically related sentences. Then, we construct an expanded relation network to integrate the representation of different semantic relations. Furthermore, we interact we integrate the features of the system question with the semantically related sentences to augment the semantic information. Finally, we evaluate our proposed RSEN on two publicly available datasets. The results demonstrate the effectiveness of our proposed RSEN method compared to the advanced baselines.</p>","PeriodicalId":501180,"journal":{"name":"World Wide Web","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139978073","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}