首页 > 最新文献

IEEE Transactions on Big Data最新文献

英文 中文
Guest Editorial TBD Special Issue on Graph Machine Learning for Recommender Systems 特约编辑 TBD 特刊:面向推荐系统的图式机器学习
IF 7.5 3区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-12 DOI: 10.1109/TBDATA.2024.3452328
Senzhang Wang;Changdong Wang;Di Jin;Shirui Pan;Philip S. Yu
{"title":"Guest Editorial TBD Special Issue on Graph Machine Learning for Recommender Systems","authors":"Senzhang Wang;Changdong Wang;Di Jin;Shirui Pan;Philip S. Yu","doi":"10.1109/TBDATA.2024.3452328","DOIUrl":"https://doi.org/10.1109/TBDATA.2024.3452328","url":null,"abstract":"","PeriodicalId":13106,"journal":{"name":"IEEE Transactions on Big Data","volume":"10 6","pages":"682-682"},"PeriodicalIF":7.5,"publicationDate":"2024-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10750533","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142600265","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reliable Data Augmented Contrastive Learning for Sequential Recommendation 用于序列推荐的可靠数据增强对比学习
IF 7.5 3区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-09-18 DOI: 10.1109/TBDATA.2024.3453752
Mankun Zhao;Aitong Sun;Jian Yu;Xuewei Li;Dongxiao He;Ruiguo Yu;Mei Yu
Sequential recommendation aims to capture users’ dynamic preferences. Due to the limited information in the sequence and the uncertain user behavior, data sparsity has always been a key problem. Although data augmentation methods can alleviate this issue, unreliable data can affect the performance of such models. To solve the above problems, we propose a new framework, namely Reliable Data Augmented Contrastive Learning Recommender (RDCRec). Specifically, in order to generate more high-quality reliable items for data augmentation, we design a multi-attributes oriented sequence generator. It moves auxiliary information from the input layer to the attention layer for learning a better attention distribution. Then, we replace a percentage of items in the original sequence with reliable items generated by the generator as the augmented sequence, for creating a high-quality view for contrastive learning. In this way, RDCRec can extract more meaningful user patterns by using the self-supervised signals of the reliable items, thereby improving recommendation performance. Finally, we train a discriminator to identify unreplaced items in the augmented sequence thus we can update item embeddings selectively in order to increase the exposure of more reliable items and improve the accuracy of recommendation results. The discriminator, as an auxiliary model, is jointly trained with the generative task and the contrastive learning task. Large experiments on four popular datasets that are commonly used demonstrate the effectiveness of our new method for sequential recommendation.
序列推荐旨在捕捉用户的动态偏好。由于序列信息有限且用户行为不确定,数据稀疏一直是一个关键问题。尽管数据增强方法可以缓解这一问题,但不可靠的数据会影响此类模型的性能。为了解决上述问题,我们提出了一个新的框架,即可靠数据增强对比学习推荐器(RDCRec)。具体来说,为了生成更多用于数据增强的高质量可靠项目,我们设计了一种面向多属性的序列生成器。它将辅助信息从输入层转移到注意力层,以学习更好的注意力分布。然后,我们用生成器生成的可靠项目替换原始序列中一定比例的项目,作为增强序列,为对比学习创建高质量的视图。这样,RDCRec 就能利用可靠项目的自监督信号提取更有意义的用户模式,从而提高推荐性能。最后,我们会训练一个判别器来识别增强序列中未替换的项目,从而有选择性地更新项目嵌入,以增加更可靠项目的曝光率,提高推荐结果的准确性。判别器作为一个辅助模型,与生成任务和对比学习任务共同训练。在四个常用数据集上进行的大型实验证明了我们的新方法在顺序推荐方面的有效性。
{"title":"Reliable Data Augmented Contrastive Learning for Sequential Recommendation","authors":"Mankun Zhao;Aitong Sun;Jian Yu;Xuewei Li;Dongxiao He;Ruiguo Yu;Mei Yu","doi":"10.1109/TBDATA.2024.3453752","DOIUrl":"https://doi.org/10.1109/TBDATA.2024.3453752","url":null,"abstract":"Sequential recommendation aims to capture users’ dynamic preferences. Due to the limited information in the sequence and the uncertain user behavior, data sparsity has always been a key problem. Although data augmentation methods can alleviate this issue, unreliable data can affect the performance of such models. To solve the above problems, we propose a new framework, namely \u0000<bold>R</b>\u0000eliable \u0000<bold>D</b>\u0000ata Augmented \u0000<bold>C</b>\u0000ontrastive Learning \u0000<bold>Rec</b>\u0000ommender (RDCRec). Specifically, in order to generate more high-quality reliable items for data augmentation, we design a multi-attributes oriented sequence generator. It moves auxiliary information from the input layer to the attention layer for learning a better attention distribution. Then, we replace a percentage of items in the original sequence with reliable items generated by the generator as the augmented sequence, for creating a high-quality view for contrastive learning. In this way, RDCRec can extract more meaningful user patterns by using the self-supervised signals of the reliable items, thereby improving recommendation performance. Finally, we train a discriminator to identify unreplaced items in the augmented sequence thus we can update item embeddings selectively in order to increase the exposure of more reliable items and improve the accuracy of recommendation results. The discriminator, as an auxiliary model, is jointly trained with the generative task and the contrastive learning task. Large experiments on four popular datasets that are commonly used demonstrate the effectiveness of our new method for sequential recommendation.","PeriodicalId":13106,"journal":{"name":"IEEE Transactions on Big Data","volume":"10 6","pages":"694-705"},"PeriodicalIF":7.5,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142600248","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
EduGraph: Learning Path-Based Hypergraph Neural Networks for MOOC Course Recommendation EduGraph:用于 MOOC 课程推荐的基于学习路径的超图神经网络
IF 7.5 3区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-09-03 DOI: 10.1109/TBDATA.2024.3453757
Ming Li;Zhao Li;Changqin Huang;Yunliang Jiang;Xindong Wu
In online learning, personalized course recommendations that align with learners’ preferences and future needs are essential. Thus, the development of efficient recommender systems is crucial to guide learners to appropriate courses. Graph learning in recommender systems has been extensively studied, yet many models focus on low-frequency information, underscoring similar learner preferences and overlooking high-frequency data that indicates varied learning trajectories. Furthermore, course co-occurrence and sequential relationships are often insufficiently investigated. In this paper, we introduce EduGraph, a novel framework developed specifically for MOOC course recommendation systems. EduGraph is characterized by its incorporation of a learning path-based hypergraph, a unique perspective wherein learners are represented as hyperedges, and courses are delineated as vertices. The framework incorporates a framelet-based hypergraph convolution, integrating low-pass filters to highlight similarities and high-pass filters to underscore distinct learning paths among learners. Furthermore, EduGraph features a dual hypergraph learning model, with channels designated for vertex and hyperedge encoding, fostering a collaborative information exchange that refines the learners’ preference embeddings. The empirical assessment of EduGraph is conducted through a comprehensive comparison with many existing baselines, utilizing two distinct MOOC datasets. Our experimental studies not only emphasize the enhanced recommendation performance of EduGraph but also elucidate the significant contributions of its individual components, such as the integration of low-pass and high-pass filters and the framelet-wise collaborative strategy that effectively bridges hyperedge-level and vertex-level representations, augmenting the overall efficacy of the course recommendation system.
在在线学习中,符合学习者偏好和未来需求的个性化课程推荐至关重要。因此,开发高效的推荐系统对于引导学习者选择合适的课程至关重要。人们对推荐系统中的图学习进行了广泛的研究,但许多模型只关注低频信息,强调类似的学习者偏好,而忽略了显示不同学习轨迹的高频数据。此外,课程共现和顺序关系往往没有得到充分研究。在本文中,我们介绍了 EduGraph,这是一个专为 MOOC 课程推荐系统开发的新型框架。EduGraph 的特点是结合了基于学习路径的超图,这是一个独特的视角,学习者被表示为超边,课程被划分为顶点。该框架采用了基于小帧的超图卷积,集成了低通滤波器以突出学习者之间的相似性,以及高通滤波器以强调学习者之间不同的学习路径。此外,EduGraph 还采用了双重超图学习模型,为顶点和超边编码指定了通道,从而促进了协作式信息交流,完善了学习者的偏好嵌入。我们利用两个不同的MOOC数据集,通过与许多现有基线进行综合比较,对EduGraph进行了实证评估。我们的实验研究不仅强调了EduGraph增强的推荐性能,还阐明了其各个组件的重要贡献,如低通滤波器和高通滤波器的整合,以及有效连接超边级和顶点级表征的小帧协作策略,从而增强了课程推荐系统的整体功效。
{"title":"EduGraph: Learning Path-Based Hypergraph Neural Networks for MOOC Course Recommendation","authors":"Ming Li;Zhao Li;Changqin Huang;Yunliang Jiang;Xindong Wu","doi":"10.1109/TBDATA.2024.3453757","DOIUrl":"https://doi.org/10.1109/TBDATA.2024.3453757","url":null,"abstract":"In online learning, personalized course recommendations that align with learners’ preferences and future needs are essential. Thus, the development of efficient recommender systems is crucial to guide learners to appropriate courses. Graph learning in recommender systems has been extensively studied, yet many models focus on low-frequency information, underscoring similar learner preferences and overlooking high-frequency data that indicates varied learning trajectories. Furthermore, course co-occurrence and sequential relationships are often insufficiently investigated. In this paper, we introduce \u0000<monospace><b>EduGraph</b></monospace>\u0000, a novel framework developed specifically for MOOC course recommendation systems. \u0000<monospace><b>EduGraph</b></monospace>\u0000 is characterized by its incorporation of a learning path-based hypergraph, a unique perspective wherein learners are represented as hyperedges, and courses are delineated as vertices. The framework incorporates a framelet-based hypergraph convolution, integrating low-pass filters to highlight similarities and high-pass filters to underscore distinct learning paths among learners. Furthermore, \u0000<monospace><b>EduGraph</b></monospace>\u0000 features a dual hypergraph learning model, with channels designated for vertex and hyperedge encoding, fostering a collaborative information exchange that refines the learners’ preference embeddings. The empirical assessment of \u0000<monospace><b>EduGraph</b></monospace>\u0000 is conducted through a comprehensive comparison with many existing baselines, utilizing two distinct MOOC datasets. Our experimental studies not only emphasize the enhanced recommendation performance of \u0000<monospace><b>EduGraph</b></monospace>\u0000 but also elucidate the significant contributions of its individual components, such as the integration of low-pass and high-pass filters and the framelet-wise collaborative strategy that effectively bridges hyperedge-level and vertex-level representations, augmenting the overall efficacy of the course recommendation system.","PeriodicalId":13106,"journal":{"name":"IEEE Transactions on Big Data","volume":"10 6","pages":"706-719"},"PeriodicalIF":7.5,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142600268","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AKGNN: Attribute Knowledge Graph Neural Networks Recommendation for Corporate Volunteer Activities AKGNN:企业志愿者活动的属性知识图谱神经网络推荐
IF 7.5 3区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-09-03 DOI: 10.1109/TBDATA.2024.3453761
Dan Du;Pei-Yuan Lai;Yan-Fei Wang;De-Zhang Liao;Min Chen
Due to the collective decision-making nature of enterprises, the process of accepting recommendations is predominantly characterized by an analytical synthesis of objective requirements and cost-effectiveness, rather than being rooted in individual interests. This distinguishes enterprise recommendation scenarios from those tailored for individuals or groups formed by similar individuals, rendering traditional recommendation algorithms less applicable in the corporate context. To overcome the challenges, by taking the corporate volunteer as an example, which aims to recommend volunteer activities to enterprises, we propose a novel recommendation model called Attribute Knowledge Graph Neural Networks (AKGNN). Specifically, a novel comprehensive attribute knowledge graph is constructed for enterprises and volunteer activities, based on which we obtain the feature representation. Then we utilize an extended Variational Auto-Encoder (eVAE) model to learn the preferences representation and then we utilize a GNN model to learn the comprehensive representation with representation of the similar nodes. Finally, all the comprehensive representations are input to the prediction layer. Extensive experiments have been conducted on real datasets, confirming the advantages of the AKGNN model. We delineate the challenges faced by recommendation algorithms in Business-to-Business (B2B) platforms and introduces a novel research approach utilizing attribute knowledge graphs.
由于企业的集体决策性质,接受推荐的过程主要是对客观要求和成本效益进行分析综合,而不是以个人利益为基础。这就使企业推荐方案有别于为个人或由相似个体组成的团体量身定制的方案,从而使传统推荐算法在企业环境中的适用性降低。为了克服这些挑战,我们以企业志愿者为例,向企业推荐志愿者活动,提出了一种名为属性知识图神经网络(AKGNN)的新型推荐模型。具体来说,我们为企业和志愿者活动构建了一个新颖的综合属性知识图谱,并在此基础上获得了特征表示。然后,我们利用扩展变异自动编码器(eVAE)模型来学习偏好表示,再利用 GNN 模型来学习带有相似节点表示的综合表示。最后,所有综合表征被输入到预测层。我们在真实数据集上进行了大量实验,证实了 AKGNN 模型的优势。我们描述了企业对企业(B2B)平台中推荐算法所面临的挑战,并介绍了一种利用属性知识图谱的新型研究方法。
{"title":"AKGNN: Attribute Knowledge Graph Neural Networks Recommendation for Corporate Volunteer Activities","authors":"Dan Du;Pei-Yuan Lai;Yan-Fei Wang;De-Zhang Liao;Min Chen","doi":"10.1109/TBDATA.2024.3453761","DOIUrl":"https://doi.org/10.1109/TBDATA.2024.3453761","url":null,"abstract":"Due to the collective decision-making nature of enterprises, the process of accepting recommendations is predominantly characterized by an analytical synthesis of objective requirements and cost-effectiveness, rather than being rooted in individual interests. This distinguishes enterprise recommendation scenarios from those tailored for individuals or groups formed by similar individuals, rendering traditional recommendation algorithms less applicable in the corporate context. To overcome the challenges, by taking the corporate volunteer as an example, which aims to recommend volunteer activities to enterprises, we propose a novel recommendation model called \u0000<bold>A</b>\u0000ttribute \u0000<bold>K</b>\u0000nowledge \u0000<bold>G</b>\u0000raph \u0000<bold>N</b>\u0000eural \u0000<bold>N</b>\u0000etworks (AKGNN). Specifically, a novel comprehensive attribute knowledge graph is constructed for enterprises and volunteer activities, based on which we obtain the feature representation. Then we utilize an \u0000<bold>e</b>\u0000xtended \u0000<bold>V</b>\u0000ariational \u0000<bold>A</b>\u0000uto-\u0000<bold>E</b>\u0000ncoder (eVAE) model to learn the preferences representation and then we utilize a GNN model to learn the comprehensive representation with representation of the similar nodes. Finally, all the comprehensive representations are input to the prediction layer. Extensive experiments have been conducted on real datasets, confirming the advantages of the AKGNN model. We delineate the challenges faced by recommendation algorithms in Business-to-Business (B2B) platforms and introduces a novel research approach utilizing attribute knowledge graphs.","PeriodicalId":13106,"journal":{"name":"IEEE Transactions on Big Data","volume":"10 6","pages":"720-730"},"PeriodicalIF":7.5,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142600247","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Higher-Order Smoothness Enhanced Graph Collaborative Filtering 高阶平滑度增强型图协同过滤
IF 7.5 3区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-09-03 DOI: 10.1109/TBDATA.2024.3453758
Ling Huang;Zhi-Yuan Li;Zhen-Yu He;Yuefang Gao
Graph Neural Networks (GNNs) based recommendations have shown significant performance improvement by explicitly modeling the user-item interactions as a bipartite graph. However, the existing GNNs-based recommendation methods suffer from the over-smoothing problem caused by utilizing the uniform distance of the reception field. To address this issue, we propose to explicitly incorporate the higher-order smoothness information into the node representation learning, and propose a new GNNs-based recommendation model named Higher-order Smoothness enhanced Graph Collaborative Filtering (HS-GCF). The proposed model is mainly composed of two parts, namely lower-order module and higher-order module. The lower-order module guarantees that the lower-order smoothness is well obtained by using the user-item interactions. The higher-order module uses the latent group assumption to restrict too much noise introduced by the uniform distance property, which we call the higher-order smoothness information. Experiments are conducted on three real-world public datasets, and the experimental results show the performance improvements compared with several state-of-the-art methods and verify the importance of explicitly incorporating the higher-order smoothness information into the node representation learning.
基于图神经网络(GNNs)的推荐方法将用户与物品之间的交互明确建模为双向图,从而显著提高了性能。然而,现有的基于图神经网络的推荐方法由于利用了接收场的均匀距离而存在过度平滑问题。针对这一问题,我们提出将高阶平滑度信息明确纳入节点表示学习,并提出了一种基于 GNNs 的新推荐模型,即高阶平滑度增强图协同过滤(HS-GCF)。该模型主要由低阶模块和高阶模块两部分组成。低阶模块通过用户与项目的交互来保证低阶平滑度的良好获得。高阶模块利用潜在群体假设来限制均匀距离属性带来的过多噪声,我们称之为高阶平滑度信息。我们在三个真实世界的公共数据集上进行了实验,实验结果表明,与几种最先进的方法相比,该方法的性能有所提高,并验证了将高阶平滑度信息明确纳入节点表示学习的重要性。
{"title":"Higher-Order Smoothness Enhanced Graph Collaborative Filtering","authors":"Ling Huang;Zhi-Yuan Li;Zhen-Yu He;Yuefang Gao","doi":"10.1109/TBDATA.2024.3453758","DOIUrl":"https://doi.org/10.1109/TBDATA.2024.3453758","url":null,"abstract":"Graph Neural Networks (GNNs) based recommendations have shown significant performance improvement by explicitly modeling the user-item interactions as a bipartite graph. However, the existing GNNs-based recommendation methods suffer from the over-smoothing problem caused by utilizing the uniform distance of the reception field. To address this issue, we propose to explicitly incorporate the higher-order smoothness information into the node representation learning, and propose a new GNNs-based recommendation model named \u0000<underline>H</u>\u0000igher-order \u0000<underline>S</u>\u0000moothness enhanced \u0000<underline>G</u>\u0000raph \u0000<underline>C</u>\u0000ollaborative \u0000<underline>F</u>\u0000iltering (HS-GCF). The proposed model is mainly composed of two parts, namely lower-order module and higher-order module. The lower-order module guarantees that the lower-order smoothness is well obtained by using the user-item interactions. The higher-order module uses the latent group assumption to restrict too much noise introduced by the uniform distance property, which we call the higher-order smoothness information. Experiments are conducted on three real-world public datasets, and the experimental results show the performance improvements compared with several state-of-the-art methods and verify the importance of explicitly incorporating the higher-order smoothness information into the node representation learning.","PeriodicalId":13106,"journal":{"name":"IEEE Transactions on Big Data","volume":"10 6","pages":"731-741"},"PeriodicalIF":7.5,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142600150","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Denoised Graph Collaborative Filtering via Neighborhood Similarity and Dynamic Thresholding 通过邻域相似性和动态阈值进行去噪图协同过滤
IF 7.5 3区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-09-03 DOI: 10.1109/TBDATA.2024.3453765
Haibo Ye;Lijun Zhang;Yuan Yao;Sheng-Jun Huang
Graph collaborative filtering (GCF) has achieved great success in recommender systems due to its ability in mining high-order collaborative signals from historical user-item interactions. However, GCF's performance could be severely affected by the intrinsic noise within the user-item interactions. To this end, several denoised GCF frameworks have been proposed, whose heart is to estimate and handle the reliability of existing interactions. However, most of them suffer from two limitations: 1) the reliability computation itself is noisy, and 2) the reliability threshold is difficult to determine. To address the two limitations, in this paper, we propose a new Neighborhood-informed Denoising framework NiDen for GCF. Specifically, for an existing user-item interaction, NiDen first estimates its reliability by employing the neighborhood information of the user and the item, and then determines whether the interaction is noisy or not via a dynamic thresholding strategy. After that, NiDen mitigates the negative impact of noise by both structure denoising and sample re-weighting. We instantiate NiDen on two representative GCF models and conduct extensive experiments on four widely-used datasets. The results show that NiDen achieves the best performance compared to the existing denoising methods, especially on datasets with heavy noise.
图协同过滤(Graph collaborative filtering,GCF)能够从历史用户-物品交互中挖掘高阶协同信号,因此在推荐系统中取得了巨大成功。然而,GCF 的性能可能会受到用户-项目交互中固有噪声的严重影响。为此,人们提出了几种去噪 GCF 框架,其核心是估计和处理现有交互的可靠性。然而,它们大多存在两个局限性:1) 可靠性计算本身存在噪声;2) 可靠性阈值难以确定。为了解决这两个局限性,我们在本文中为 GCF 提出了一个新的邻域信息去噪框架 NiDen。具体来说,对于已有的用户-物品交互,NiDen 首先利用用户和物品的邻域信息估计其可靠性,然后通过动态阈值策略确定交互是否有噪声。然后,NiDen 通过结构去噪和样本重新加权来减轻噪声的负面影响。我们在两个具有代表性的 GCF 模型上实例化了 NiDen,并在四个广泛使用的数据集上进行了大量实验。结果表明,与现有的去噪方法相比,NiDen 实现了最佳性能,尤其是在噪声严重的数据集上。
{"title":"Denoised Graph Collaborative Filtering via Neighborhood Similarity and Dynamic Thresholding","authors":"Haibo Ye;Lijun Zhang;Yuan Yao;Sheng-Jun Huang","doi":"10.1109/TBDATA.2024.3453765","DOIUrl":"https://doi.org/10.1109/TBDATA.2024.3453765","url":null,"abstract":"Graph collaborative filtering (GCF) has achieved great success in recommender systems due to its ability in mining high-order collaborative signals from historical user-item interactions. However, GCF's performance could be severely affected by the intrinsic noise within the user-item interactions. To this end, several denoised GCF frameworks have been proposed, whose heart is to estimate and handle the reliability of existing interactions. However, most of them suffer from two limitations: 1) the reliability computation itself is noisy, and 2) the reliability threshold is difficult to determine. To address the two limitations, in this paper, we propose a new \u0000<underline>N</u>\u0000eighborhood-\u0000<underline>i</u>\u0000nformed \u0000<underline>Den</u>\u0000oising framework NiDen for GCF. Specifically, for an existing user-item interaction, NiDen first estimates its reliability by employing the neighborhood information of the user and the item, and then determines whether the interaction is noisy or not via a dynamic thresholding strategy. After that, NiDen mitigates the negative impact of noise by both structure denoising and sample re-weighting. We instantiate NiDen on two representative GCF models and conduct extensive experiments on four widely-used datasets. The results show that NiDen achieves the best performance compared to the existing denoising methods, especially on datasets with heavy noise.","PeriodicalId":13106,"journal":{"name":"IEEE Transactions on Big Data","volume":"10 6","pages":"683-693"},"PeriodicalIF":7.5,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142600149","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Fast and Robust Attention-Free Heterogeneous Graph Convolutional Network 快速稳健的无注意力异构图卷积网络
IF 7.5 3区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-03-12 DOI: 10.1109/TBDATA.2024.3375152
Yeyu Yan;Zhongying Zhao;Zhan Yang;Yanwei Yu;Chao Li
Due to the widespread applications of heterogeneous graphs in the real world, heterogeneous graph neural networks (HGNNs) have developed rapidly and made a great success in recent years. To effectively capture the complex interactions in heterogeneous graphs, various attention mechanisms are widely used in designing HGNNs. However, the employment of these attention mechanisms brings two key problems: high computational complexity and poor robustness. To address these problems, we propose a Fast and Robust attention-free Heterogeneous Graph Convolutional Network (FastRo-HGCN) without any attention mechanisms. Specifically, we first construct virtual links based on the topology similarity and feature similarity of the nodes to strengthen the connections between the target nodes. Then, we design type normalization to aggregate and transfer the intra-type and inter-type node information. The above methods are used to reduce the interference of noisy information. Finally, we further enhance the robustness and relieve the negative effects of oversmoothing with the self-loops of nodes. Extensive experimental results on three real-world datasets fully demonstrate that the proposed FastRo-HGCN significantly outperforms the state-of-the-art models.
由于异构图在现实世界中的广泛应用,异构图神经网络(HGNN)近年来发展迅速并取得了巨大成功。为了有效捕捉异构图中的复杂交互,各种注意力机制被广泛应用于异构图神经网络的设计中。然而,这些注意力机制的使用带来了两个关键问题:计算复杂度高和鲁棒性差。为了解决这些问题,我们提出了一种无需任何注意力机制的快速鲁棒无注意力异构图卷积网络(FastRo-HGCN)。具体来说,我们首先根据节点的拓扑相似性和特征相似性构建虚拟链接,以加强目标节点之间的连接。然后,我们设计了类型归一化,以聚合和传递类型内和类型间的节点信息。通过上述方法,可以减少噪声信息的干扰。最后,我们利用节点的自循环进一步增强了鲁棒性并缓解了过平滑的负面影响。在三个真实世界数据集上的广泛实验结果充分证明,所提出的 FastRo-HGCN 明显优于最先进的模型。
{"title":"A Fast and Robust Attention-Free Heterogeneous Graph Convolutional Network","authors":"Yeyu Yan;Zhongying Zhao;Zhan Yang;Yanwei Yu;Chao Li","doi":"10.1109/TBDATA.2024.3375152","DOIUrl":"https://doi.org/10.1109/TBDATA.2024.3375152","url":null,"abstract":"Due to the widespread applications of heterogeneous graphs in the real world, heterogeneous graph neural networks (HGNNs) have developed rapidly and made a great success in recent years. To effectively capture the complex interactions in heterogeneous graphs, various attention mechanisms are widely used in designing HGNNs. However, the employment of these attention mechanisms brings two key problems: high computational complexity and poor robustness. To address these problems, we propose a \u0000<bold>Fast</b>\u0000 and \u0000<bold>Ro</b>\u0000bust attention-free \u0000<bold>H</b>\u0000eterogeneous \u0000<bold>G</b>\u0000raph \u0000<bold>C</b>\u0000onvolutional \u0000<bold>N</b>\u0000etwork (FastRo-HGCN) without any attention mechanisms. Specifically, we first construct virtual links based on the topology similarity and feature similarity of the nodes to strengthen the connections between the target nodes. Then, we design type normalization to aggregate and transfer the intra-type and inter-type node information. The above methods are used to reduce the interference of noisy information. Finally, we further enhance the robustness and relieve the negative effects of oversmoothing with the self-loops of nodes. Extensive experimental results on three real-world datasets fully demonstrate that the proposed FastRo-HGCN significantly outperforms the state-of-the-art models.","PeriodicalId":13106,"journal":{"name":"IEEE Transactions on Big Data","volume":"10 5","pages":"669-681"},"PeriodicalIF":7.5,"publicationDate":"2024-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142130219","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
FAER: Fairness-Aware Event-Participant Recommendation in Event-Based Social Networks FAER:基于事件的社交网络中的公平意识事件参与者推荐
IF 7.5 3区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-03-01 DOI: 10.1109/TBDATA.2024.3372409
Yuan Liang
The event-based social network (EBSN) is a new type of social network that combines online and offline networks. In recent years, an important task in EBSN recommendation systems has been to design better and more reasonable recommendation algorithms to improve the accuracy of recommendation and enhance user satisfaction. However, the current research seldom considers how to coordinate fairness among individual users and reduce the impact of individual unreasonable feedback in group event recommendation. In addition, when considering the fairness to individuals, the accuracy of recommendation is not greatly improved by fully incorporating the key context information. To solve these problems, we propose a prefiltering algorithm to filter the candidate event set, a multidimensional context recommendation method to provide personalized event recommendations for each user in the group, and a group consensus function fusion strategy to fuse the recommendation results of the members of the group. To improve overall satisfaction with the recommendations, we propose a ranking adjustment strategy for the key context. Finally, we verify the effectiveness of our proposed algorithm on real data sets and find that FAER is superior to the latest algorithms in terms of global satisfaction, distance satisfaction and user fairness.
基于事件的社交网络(EBSN)是一种结合了线上和线下网络的新型社交网络。近年来,EBSN 推荐系统的一个重要任务是设计更好、更合理的推荐算法,以提高推荐的准确性和用户满意度。然而,目前的研究很少考虑如何协调个体用户之间的公平性,减少个体不合理反馈对群体事件推荐的影响。此外,在考虑对个体的公平性时,并不能通过充分结合关键上下文信息来大幅提高推荐的准确性。为了解决这些问题,我们提出了一种预过滤算法来过滤候选事件集,一种多维情境推荐方法来为群组中的每个用户提供个性化的事件推荐,以及一种群组共识函数融合策略来融合群组成员的推荐结果。为了提高推荐结果的整体满意度,我们提出了关键情境的排序调整策略。最后,我们在真实数据集上验证了所提算法的有效性,发现 FAER 在全局满意度、距离满意度和用户公平性方面都优于最新算法。
{"title":"FAER: Fairness-Aware Event-Participant Recommendation in Event-Based Social Networks","authors":"Yuan Liang","doi":"10.1109/TBDATA.2024.3372409","DOIUrl":"https://doi.org/10.1109/TBDATA.2024.3372409","url":null,"abstract":"The \u0000<underline>e</u>\u0000vent-\u0000<underline>b</u>\u0000ased \u0000<underline>s</u>\u0000ocial \u0000<underline>n</u>\u0000etwork (EBSN) is a new type of social network that combines online and offline networks. In recent years, an important task in EBSN recommendation systems has been to design better and more reasonable recommendation algorithms to improve the accuracy of recommendation and enhance user satisfaction. However, the current research seldom considers how to coordinate fairness among individual users and reduce the impact of individual unreasonable feedback in group event recommendation. In addition, when considering the fairness to individuals, the accuracy of recommendation is not greatly improved by fully incorporating the key context information. To solve these problems, we propose a prefiltering algorithm to filter the candidate event set, a multidimensional context recommendation method to provide personalized event recommendations for each user in the group, and a group consensus function fusion strategy to fuse the recommendation results of the members of the group. To improve overall satisfaction with the recommendations, we propose a ranking adjustment strategy for the key context. Finally, we verify the effectiveness of our proposed algorithm on real data sets and find that FAER is superior to the latest algorithms in terms of global satisfaction, distance satisfaction and user fairness.","PeriodicalId":13106,"journal":{"name":"IEEE Transactions on Big Data","volume":"10 5","pages":"655-668"},"PeriodicalIF":7.5,"publicationDate":"2024-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142130298","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An End-to-End Approach for Graph-Based Multi-View Data Clustering 基于图形的多视图数据聚类的端到端方法
IF 7.5 3区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-02-28 DOI: 10.1109/TBDATA.2024.3371357
Fadi Dornaika;Sally El Hajjar
Clustering data from different sources or views is a key challenge in real-world applications. While traditional graph-based methods are effective at capturing data structures, they often require separate steps to estimate graphs of views or a consensus graph from the raw data. This reliance on intermediate steps can make these clustering methods susceptible to noisy graphs, which affects the overall performance of clustering. In response to this limitation, and with an emphasis on advocating end-to-end solutions for multi-view clustering, two comprehensive approaches are presented in this paper. Each approach starts from either the raw data or its kernelized features. The first proposal introduces a unified objective function that enables the simultaneous recovery of the graph for each view, the unified graph, the spectral projection matrices for all views, the soft cluster assignments, and the scores assigned to each view. The second proposal uses a global criterion that integrates regularization and constraints for the soft cluster assignment matrix based on the consensus graph matrix and the consensus data representation. Both proposed methods enable direct and straightforward clustering of the data without the need for additional steps. Extensive tests with various real-world image and text datasets confirm the superior performance of the two proposed methods.
对来自不同来源或视图的数据进行聚类是现实世界应用中的一项关键挑战。虽然传统的基于图的方法能有效捕捉数据结构,但它们通常需要单独的步骤来估算视图图或原始数据的共识图。这种对中间步骤的依赖会使这些聚类方法容易受到噪声图的影响,从而影响聚类的整体性能。针对这一局限性,本文重点倡导多视图聚类的端到端解决方案,提出了两种综合方法。每种方法都从原始数据或其核特征出发。第一种方案引入了一个统一的目标函数,可以同时恢复每个视图的图形、统一图形、所有视图的光谱投影矩阵、软聚类分配以及分配给每个视图的分数。第二项建议使用了一种全局标准,该标准基于共识图矩阵和共识数据表示,整合了软聚类分配矩阵的正则化和约束条件。这两种方法都能直接对数据进行聚类,无需额外步骤。利用各种真实世界的图像和文本数据集进行的广泛测试证实了这两种建议方法的卓越性能。
{"title":"An End-to-End Approach for Graph-Based Multi-View Data Clustering","authors":"Fadi Dornaika;Sally El Hajjar","doi":"10.1109/TBDATA.2024.3371357","DOIUrl":"https://doi.org/10.1109/TBDATA.2024.3371357","url":null,"abstract":"Clustering data from different sources or views is a key challenge in real-world applications. While traditional graph-based methods are effective at capturing data structures, they often require separate steps to estimate graphs of views or a consensus graph from the raw data. This reliance on intermediate steps can make these clustering methods susceptible to noisy graphs, which affects the overall performance of clustering. In response to this limitation, and with an emphasis on advocating end-to-end solutions for multi-view clustering, two comprehensive approaches are presented in this paper. Each approach starts from either the raw data or its kernelized features. The first proposal introduces a unified objective function that enables the simultaneous recovery of the graph for each view, the unified graph, the spectral projection matrices for all views, the soft cluster assignments, and the scores assigned to each view. The second proposal uses a global criterion that integrates regularization and constraints for the soft cluster assignment matrix based on the consensus graph matrix and the consensus data representation. Both proposed methods enable direct and straightforward clustering of the data without the need for additional steps. Extensive tests with various real-world image and text datasets confirm the superior performance of the two proposed methods.","PeriodicalId":13106,"journal":{"name":"IEEE Transactions on Big Data","volume":"10 5","pages":"644-654"},"PeriodicalIF":7.5,"publicationDate":"2024-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142130217","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
TgStore: An Efficient Storage System for Large Time-Evolving Graphs TgStore:大型时间演化图的高效存储系统
IF 7.2 3区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-02-14 DOI: 10.1109/TBDATA.2024.3366087
Yongli Cheng;Yan Ma;Hong Jiang;Lingfang Zeng;Fang Wang;Xianghao Xu;Yuhang Wu
Existing graph systems focus mainly on the execution efficiency of the graph analysis tasks, often ignoring the importance and efficiency of time-evolving graph storage. However, to effectively mine the potential application values, an efficient storage system is important for time-evolving graphs whose storage requirement scales with the increasing number of snapshots. Storage cost and snapshot access speed are the two most important performance indicators for a time-evolving graph storage system, which are challenging for designers of such systems because they are conflicting goals. In this article, we address these challenges by proposing an efficient storage scheme for the large time-evolving graphs. We first design a Snapshot-level Data Deduplication (SLDD) strategy to eliminate the large number of repeated vertices and edges among the snapshots, and then a Structure-Changing Graph Representation (SCGR) to significantly improve the snapshot access speed. We implement an efficient time-evolving graph storage system, TgStore, based on this scheme to effectively store large-scale time-evolving graphs, aiming to efficiently support the time-evolving graph analysis tasks. Experimental results show that TgStore can obtain a high compression ratio of 43.03:1 when storing 100 snapshots of Twitter, while with an average snapshot access speedup of 16×. Efficient storage scheme enables TgStore to efficiently support time-evolving graph algorithms. For example, when executing the Pagerank algorithm on the time-evolving graph of Twitter, TgStore outperforms Graphone, a state-of-the-art time-evolving graph storage system, by 15.9× in algorithm execution speed and 1.45× in memory usage.
现有的图形系统主要关注图形分析任务的执行效率,往往忽视了随时间变化的图形存储的重要性和效率。然而,为了有效挖掘潜在的应用价值,高效的存储系统对于随时间变化的图来说非常重要,因为随快照数量的增加,存储需求也会随之增加。存储成本和快照访问速度是时间演化图存储系统的两个最重要的性能指标,这两个指标对于此类系统的设计者来说具有挑战性,因为它们是相互冲突的目标。在本文中,我们针对这些挑战,提出了一种针对大型时间演化图的高效存储方案。我们首先设计了一种快照级重复数据删除(SLDD)策略来消除快照中大量重复的顶点和边,然后设计了一种结构变化图表示(SCGR)来显著提高快照访问速度。在此基础上,我们实现了高效的时间演化图存储系统 TgStore,以有效存储大规模时间演化图,从而高效地支持时间演化图分析任务。实验结果表明,当存储100个Twitter快照时,TgStore可以获得43.03:1的高压缩比,同时快照平均访问速度提高了16倍。高效的存储方案使TgStore能够有效地支持时间演进图算法。例如,在Twitter的时间演化图上执行Pagerank算法时,TgStore的算法执行速度和内存使用量分别比最先进的时间演化图存储系统Graphone快15.9倍和1.45倍。
{"title":"TgStore: An Efficient Storage System for Large Time-Evolving Graphs","authors":"Yongli Cheng;Yan Ma;Hong Jiang;Lingfang Zeng;Fang Wang;Xianghao Xu;Yuhang Wu","doi":"10.1109/TBDATA.2024.3366087","DOIUrl":"https://doi.org/10.1109/TBDATA.2024.3366087","url":null,"abstract":"Existing graph systems focus mainly on the execution efficiency of the graph analysis tasks, often ignoring the importance and efficiency of time-evolving graph storage. However, to effectively mine the potential application values, an efficient storage system is important for time-evolving graphs whose storage requirement scales with the increasing number of snapshots. Storage cost and snapshot access speed are the two most important performance indicators for a time-evolving graph storage system, which are challenging for designers of such systems because they are conflicting goals. In this article, we address these challenges by proposing an efficient storage scheme for the large time-evolving graphs. We first design a \u0000<italic>Snapshot-level Data Deduplication (SLDD)</i>\u0000 strategy to eliminate the large number of repeated vertices and edges among the snapshots, and then a \u0000<italic>Structure-Changing Graph Representation (SCGR)</i>\u0000 to significantly improve the snapshot access speed. We implement an efficient time-evolving graph storage system, TgStore, based on this scheme to effectively store large-scale time-evolving graphs, aiming to efficiently support the time-evolving graph analysis tasks. Experimental results show that TgStore can obtain a high compression ratio of 43.03:1 when storing 100 snapshots of Twitter, while with an average snapshot access speedup of 16×. Efficient storage scheme enables TgStore to efficiently support time-evolving graph algorithms. For example, when executing the Pagerank algorithm on the time-evolving graph of Twitter, TgStore outperforms Graphone, a state-of-the-art time-evolving graph storage system, by 15.9× in algorithm execution speed and 1.45× in memory usage.","PeriodicalId":13106,"journal":{"name":"IEEE Transactions on Big Data","volume":"10 2","pages":"158-173"},"PeriodicalIF":7.2,"publicationDate":"2024-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140123521","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Transactions on Big Data
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1