Pub Date : 2025-11-11 | DOI: 10.1109/TKDE.2025.3631376
He Zhang;Shuang Wang;Long Chen;Xiaoping Li;Qing Gao;Quan Z. Sheng
In the era of Big Data and generative artificial intelligence (AI), discovering the truth about various objects from different sources has become a pressing topic. Existing studies primarily focus on dependent sources with conflicting information, where sources may copy information from each other. However, real-world scenarios are often more complex, with dependence relationships among sources that change dynamically over time, which makes the truth much harder to discover. One of the key challenges centers on measuring the dynamic dependence among sources. To address this challenge, we have developed three models: $Depen_{Simple}$, $Depen_{Complex}$, and $Depen_{Dynamic}$. These models are based on the Hidden Markov Model (HMM) and are designed to handle different types of dependencies, namely simple source dependence, complex source dependence, and dynamic source dependence. Based on the constructed models, we propose a generic framework for discovering the latent truth, which is evaluated with three HMM-based methods. We conduct extensive experiments on three real-world datasets to evaluate the performance of the proposed methods, and the results demonstrate that all three methods achieve higher accuracy than state-of-the-art methods.
Title: "Reliable Truth Discovery for Dynamic and Dependent Sources". IEEE Transactions on Knowledge and Data Engineering, vol. 38, no. 1, pp. 546-558.
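The models above build on the Hidden Markov Model (HMM). As background, here is a minimal sketch of the standard HMM forward recursion, with the hidden state standing in for a source's reliability; all probabilities below are illustrative, not from the paper.

```python
import numpy as np

# Toy HMM: hidden state = whether a source is "reliable" at each time step.
pi = np.array([0.6, 0.4])            # P(initial state): [reliable, unreliable]
A = np.array([[0.8, 0.2],            # transitions from "reliable"
              [0.3, 0.7]])           # transitions from "unreliable"
B = np.array([[0.9, 0.1],            # P(claim correct / wrong | reliable)
              [0.4, 0.6]])           # P(claim correct / wrong | unreliable)

def forward(obs):
    """Forward algorithm: likelihood of an observation sequence under the HMM."""
    alpha = pi * B[:, obs[0]]                 # alpha_0(j) = pi_j * b_j(o_0)
    for o in obs[1:]:
        alpha = (alpha @ A) * B[:, o]         # alpha_t(j) = sum_i alpha_{t-1}(i) A_ij * b_j(o_t)
    return alpha.sum()

# Observations: 0 = the source's claim matched the truth, 1 = it conflicted.
print(forward([0, 0, 1]))
```

Inference over such chains (e.g. via forward-backward or EM) is the standard machinery a framework like this would adapt to estimate source reliability over time.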
Multi-hop Knowledge Graph Reasoning (KGR) seeks to identify accurate answers within Knowledge Graphs (KGs) via multi-step reasoning, predominantly utilizing reinforcement learning (RL) to enhance the efficiency of the reasoning process. Unlike traditional Knowledge Graph Embedding (KGE) methods, RL-based approaches offer superior interpretability. However, these methods often underperform due to two critical limitations: (1) their over-reliance on Horn rules for reasoning paths, which restricts their expressive power; and (2) inadequate utilization of reasoning states during the process. To address these issues, we propose a novel RL-based framework, RAR, which shifts focus from individual paths to subgraph structures for more robust predictions. RAR frames the retrieval of reasoning subgraphs from the KG as a Markov Decision Process (MDP) and incorporates a subgraph retriever. To efficiently explore the extensive subgraph space, we integrate multi-agent RL to enhance the retriever’s capabilities. Additionally, RAR features an advanced analyst module that meticulously examines reasoning states. These modules function iteratively: the retriever expands the subgraph, followed by the analyst module’s in-depth analysis. The insights gained are then used to inform subsequent retrieval steps. Ultimately, the predicted scores from both modules are synthesized to produce more precise posterior scores. Experimental results across multiple datasets demonstrate RAR’s efficacy, showcasing a notable improvement over existing state-of-the-art RL-based KGR methods.
Title: "Subgraph-Centric Multi-Agent Reinforcement Learning for Multi-Hop Knowledge Graph Reasoning". Authors: Tao He;Zerui Chen;Lizi Liao;Yixin Cao;Yuanxing Liu;Wei Tang;Xun Mao;Kai Lv;Ming Liu;Bing Qin. Pub Date : 2025-11-11 | DOI: 10.1109/TKDE.2025.3631495. IEEE Transactions on Knowledge and Data Engineering, vol. 38, no. 2, pp. 1319-1333.
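RAR frames subgraph retrieval as a Markov Decision Process: the state is the current subgraph and each action adds to it. A toy sketch of that framing follows, with a hand-written greedy scoring heuristic standing in for the paper's learned multi-agent retriever; the graph and scoring rule are purely illustrative.

```python
# Toy sketch: subgraph retrieval as sequential decision making.
# State = current node set; action = add one neighboring edge.
KG = {  # tiny knowledge graph: node -> [(relation, neighbor)]
    "einstein": [("born_in", "ulm"), ("field", "physics")],
    "ulm": [("located_in", "germany")],
    "physics": [("studies", "matter")],
}

def expand(start, budget, score):
    """Grow a subgraph from `start`, one greedy edge-addition per step."""
    nodes, edges = {start}, []
    for _ in range(budget):
        frontier = [(u, r, v) for u in nodes
                    for r, v in KG.get(u, []) if v not in nodes]
        if not frontier:
            break
        best = max(frontier, key=score)   # greedy stand-in for a learned policy
        edges.append(best)
        nodes.add(best[2])
    return nodes, edges

# e.g. prefer location relations when answering a "where was X born" query
nodes, edges = expand("einstein", budget=2,
                      score=lambda e: e[1] in ("born_in", "located_in"))
print(nodes)
```

In the paper's setting the scoring function is learned with multi-agent RL and the analyst module's feedback steers subsequent expansions, rather than a fixed heuristic.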
Pub Date : 2025-11-10 | DOI: 10.1109/TKDE.2025.3631025
Meng Chen;Hongwei Jia;Zechen Li;Weiming Huang;Kai Zhao;Yongshun Gong;Haoran Xu;Hongjun Dai
A recent trend in urban computing involves utilizing multi-modal data for urban region embedding, which can then be leveraged in a variety of downstream urban sensing tasks. Many previous studies rely on multi-graph embedding techniques and follow a two-stage paradigm: first building a $k$-nearest neighbor graph based on fixed region correlations for each view, and then blending multi-view information in a posterior stage to learn region representations. However, in most existing two-stage studies, multi-graph construction and multi-graph representation learning are decoupled, so the relationship between them, which could provide complementary information to each stage, is left unexploited. In this paper, we unify these two stages into one by constructing learnable weighted complete graphs of regions and propose a new one-stage Region Embedding method with Adaptive region correlation Discovery (READ). Specifically, READ comprises three modules: a disentangled region feature learning module that utilizes a city-context Transformer to encode regions' semantic and mobility features; an adaptive weighted multi-graph construction module that builds multiple complete graphs with learnable weights based on the disentangled region features; and a multi-graph representation learning module that yields effective region representations integrating information from multiple graphs. We conduct thorough experiments on three downstream tasks to assess READ. Experimental results demonstrate that READ considerably outperforms state-of-the-art baseline methods in urban region embedding.
Title: "Region Embedding With Adaptive Correlation Discovery for Predicting Urban Socioeconomic Indicators". IEEE Transactions on Knowledge and Data Engineering, vol. 38, no. 2, pp. 1280-1291.
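The two-stage baselines contrasted above start from a $k$-nearest-neighbor graph over fixed region correlations. A minimal sketch of that first stage under cosine similarity follows; the feature vectors are toy values for illustration only.

```python
import numpy as np

def knn_graph(features, k):
    """Adjacency matrix of a k-nearest-neighbor graph under cosine similarity."""
    X = features / np.linalg.norm(features, axis=1, keepdims=True)
    sim = X @ X.T
    np.fill_diagonal(sim, -np.inf)        # exclude self-loops
    adj = np.zeros_like(sim)
    for i, row in enumerate(sim):
        for j in np.argsort(row)[-k:]:    # keep the top-k most similar regions
            adj[i, j] = 1.0
    return adj

# Four toy "regions" with 3-d feature vectors (illustrative values only)
regions = np.array([[1.0, 0.0, 0.1],
                    [0.9, 0.1, 0.0],
                    [0.0, 1.0, 0.9],
                    [0.1, 0.9, 1.0]])
A = knn_graph(regions, k=1)
print(A)
```

READ's point is precisely that such a graph is fixed before representation learning begins; its learnable weighted complete graphs let the correlations adapt during training instead.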
Pub Date : 2025-11-10 | DOI: 10.1109/TKDE.2025.3631112
Jiaqi Jiang;Rong-Hua Li;Longlong Lin;Yalong Zhang;Yue Zeng;Xiaowei Ye;Guoren Wang
Important communities are densely connected subgraphs containing vertices with high importance values, and they have received wide attention recently. However, existing methods, predominantly based on the $k$-core model, suffer from limitations such as rigid degree constraints and suboptimal density, often failing to capture highly important vertices. To address these limitations, we propose a new community model based on pseudoarboricity that guarantees near-optimal density while preserving important vertices. Further, we introduce the novel problem of Pseudoarboricity-based Skyline Important Community (PSIC) search, which uniquely treats density and importance as independent attributes. To efficiently address PSIC, we first devise a basic algorithm $\mathsf{ClimbStairs}$, which iteratively refines communities by peeling vertices with low importance. To boost efficiency, we develop an advanced algorithm $\mathsf{DivAndCon}$, which employs a recursive divide-and-conquer strategy combined with weight-based and pseudoarboricity-based pruning techniques, significantly reducing the search space. For massive graphs with billions of edges, inspired by a recursive division tree, we develop several parallel algorithms utilizing a thread pool and a synchronization-free mechanism. Finally, we conduct extensive experiments on 10 real-world networks, and the results demonstrate the superiority of our solutions in terms of effectiveness, efficiency, and scalability.
Title: "Pseudoarboricity-Based Skyline Important Community Search in Large Networks". IEEE Transactions on Knowledge and Data Engineering, vol. 38, no. 2, pp. 1264-1279.
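Treating density and importance as independent attributes puts PSIC in the classic skyline (Pareto) setting. A minimal sketch of a generic skyline filter over (density, importance) pairs follows; the scores are illustrative, and the paper's algorithms of course avoid this brute-force pairwise check.

```python
def skyline(communities):
    """Keep (density, importance) pairs not dominated by any other pair.
    c is dominated if some other pair is >= in both attributes and differs."""
    def dominated(c, by):
        return by[0] >= c[0] and by[1] >= c[1] and by != c
    return [c for c in communities
            if not any(dominated(c, other) for other in communities)]

# Illustrative (density, importance) scores for candidate communities
cands = [(0.9, 0.2), (0.7, 0.5), (0.4, 0.8), (0.6, 0.4), (0.3, 0.7)]
print(skyline(cands))
```

The surviving pairs are exactly the trade-off frontier: no community in the result can be improved in one attribute without losing in the other.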
Identifying locally dense communities closely connected to a user-initiated query node is crucial for a wide range of applications. Existing approaches either solely depend on rule-based constraints or exclusively utilize deep learning technologies to identify target communities. This raises an important question: can deep learning be integrated with rule-based constraints to elevate the quality of community search? In this paper, we affirmatively address this question by introducing a novel approach called Neural Community Search via Attribute-augmented Conductance, abbreviated as NCSAC. Specifically, NCSAC first proposes a novel concept of attribute-augmented conductance, which blends (internal and external) structural proximity with attribute similarity. Then, NCSAC extracts a coarse candidate community of satisfactory quality using the proposed attribute-augmented conductance. Subsequently, NCSAC frames community search as a graph optimization task, refining the candidate community through reinforcement learning techniques, thereby producing high-quality results. Extensive experiments on six real-world graphs against ten competitors demonstrate the superiority of our solutions in terms of accuracy, efficiency, and scalability. Notably, the proposed solution outperforms state-of-the-art methods, achieving an F1-score improvement ranging from 5.3% to 42.4%.
Title: "NCSAC: Effective Neural Community Search via Attribute-Augmented Conductance". Authors: Longlong Lin;Quanao Li;Miao Qiao;Zeli Wang;Jin Zhao;Rong-Hua Li;Xin Luo;Tao Jia. Pub Date : 2025-11-07 | DOI: 10.1109/TKDE.2025.3630626. IEEE Transactions on Knowledge and Data Engineering, vol. 38, no. 2, pp. 1221-1235.
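Attribute-augmented conductance extends classic graph conductance. For reference, a minimal sketch of the plain structural version, the cut size divided by the smaller side's volume, on a toy graph; the attribute-similarity term is the paper's addition and is not shown here.

```python
def conductance(adj, S):
    """Classic conductance: cut(S, rest) / min(vol(S), vol(rest))."""
    n = len(adj)
    cut = sum(adj[u][v] for u in S for v in range(n) if v not in S)
    vol_S = sum(adj[u][v] for u in S for v in range(n))   # sum of degrees in S
    vol_rest = sum(sum(row) for row in adj) - vol_S
    return cut / min(vol_S, vol_rest)

# Two triangles (0-1-2 and 3-4-5) joined by a single bridge edge 2-3
adj = [[0, 1, 1, 0, 0, 0],
       [1, 0, 1, 0, 0, 0],
       [1, 1, 0, 1, 0, 0],
       [0, 0, 1, 0, 1, 1],
       [0, 0, 0, 1, 0, 1],
       [0, 0, 0, 1, 1, 0]]
print(conductance(adj, {0, 1, 2}))  # low value: {0,1,2} is a good community
```

Low conductance means few edges leave the set relative to its internal volume, which is why it is a natural quality score for candidate communities around a query node.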
As a pivotal variant of multi-label classification, hierarchical text classification (HTC) faces unique challenges due to its intricate taxonomic hierarchy. Recent state-of-the-art approaches improve performance by considering both the global hierarchy covering all labels and the local hierarchy indicating the substructure of sample-specific ground-truth labels. However, they often over-condense hierarchical information into one or a few tokens, which may cause the loss of useful knowledge. Accordingly, we propose a dual classifier model with global and local hierarchies (DCGL). It adopts prompt tuning-based BERT as the backbone, where the global hierarchy is integrated into the soft prompt template; the resulting classifier branch is termed the global pipeline. To mitigate the information loss caused by hierarchy condensation, we introduce a parallel local hierarchy-aware classifier pipeline. This local pipeline acquires label-level classification features through text propagation on the label hierarchy and aligns these features with oracle label representations of the local hierarchy via graph contrastive learning, which serves as a novel strategy for incorporating the local hierarchy. DCGL thereby obtains more granular and targeted features and captures local hierarchy information such as label co-occurrence and local structure. Moreover, since the global and local pipelines capture distinct yet complementary information, we further apply mutual knowledge distillation to bridge the gap between their output logits and facilitate mutual learning. To better control the degree of distillation, we design a dynamic temperature negatively correlated with label confidence. Comprehensive experiments demonstrate that our DCGL outperforms several representative HTC methods.
Title: "Exploring Global and Local Hierarchies: Dual Classifier With Mutual Distillation for Hierarchical Text Classification". Authors: Yubin Li;Zhaojian Cui;Haokai Gao;Jiale Liu;Yuncheng Jiang. Pub Date : 2025-11-06 | DOI: 10.1109/TKDE.2025.3629743. IEEE Transactions on Knowledge and Data Engineering, vol. 38, no. 2, pp. 1084-1098.
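The dynamic-temperature idea can be sketched generically: scale one pipeline's logits by a temperature that shrinks as its label confidence grows, so confident predictions yield sharper distillation targets. The linear schedule below is a hypothetical stand-in, not the paper's formula.

```python
import numpy as np

def softmax(z, T=1.0):
    """Temperature-scaled softmax (numerically stabilized)."""
    z = np.asarray(z, dtype=float) / T
    e = np.exp(z - z.max())
    return e / e.sum()

def dynamic_temperature(confidence, t_min=1.0, t_max=4.0):
    # Hypothetical schedule: temperature decreases linearly as the
    # teacher pipeline's label confidence increases.
    return t_max - (t_max - t_min) * confidence

def distill_targets(teacher_logits):
    """Soft targets for the other pipeline, using a confidence-driven T."""
    conf = softmax(teacher_logits).max()       # label confidence
    T = dynamic_temperature(conf)
    return softmax(teacher_logits, T), T

probs, T = distill_targets([3.0, 0.5, 0.2])    # a confident prediction
print(T, probs)                                # low T, sharper targets
```

In mutual distillation, each pipeline would use such soft targets from the other as an auxiliary KL-divergence loss alongside the ground-truth labels.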
Pub Date : 2025-11-06 | DOI: 10.1109/TKDE.2025.3629816
Wenyu Fang;Wei Huang;Yanhua Liu;Jia Liu;Tianrui Li
Multi-party urban flow analysis is a crucial task in smart cities. However, existing methods struggle to balance data privacy protection with spatio-temporal feature capture; how to capture the complete spatio-temporal features of multi-party urban flow data while protecting data privacy is therefore of great importance. To this end, this paper proposes a spatio-temporal federated analysis model for multi-party urban flow mining that effectively protects data privacy while completely capturing spatio-temporal features. First, a multi-party urban flow mining framework based on federated learning is proposed to fully capture the spatio-temporal features of multi-party urban flow data and mine urban flow pattern knowledge under the premise of protecting data privacy. Second, to reduce the communication cost of multi-party urban flow analysis, we propose a lazy aggregation method based on similarity clustering, which improves the communication efficiency between clients and the server. Further, we propose a similarity evaluation criterion for urban flow data based on a step function, which can effectively calculate the similarity between urban flow data. Finally, we compare the proposed model with benchmark methods on Chengdu Didi order data and point-of-interest data to demonstrate its effectiveness, and we visualize and analyze the spatio-temporal features.
Title: "Multi-Party Federated Urban Flow Mining and Analysis Based on Lazy Aggregation". IEEE Transactions on Knowledge and Data Engineering, vol. 38, no. 1, pp. 475-488.
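The general idea behind lazy aggregation via similarity clustering, grouping clients whose model updates point the same way so that fewer uploads reach the server, can be sketched with a greedy cosine-similarity clustering; the threshold and clustering rule below are illustrative stand-ins, not the paper's method.

```python
import numpy as np

def cluster_updates(updates, sim_threshold=0.95):
    """Greedily cluster client updates by cosine similarity.
    Only one representative per cluster then needs to reach the server."""
    clusters = []                       # lists of client indices
    reps = []                           # normalized representative per cluster
    for i, u in enumerate(updates):
        u = u / np.linalg.norm(u)
        for c, r in zip(clusters, reps):
            if float(u @ r) >= sim_threshold:
                c.append(i)             # similar enough: join this cluster
                break
        else:
            clusters.append([i])        # otherwise start a new cluster
            reps.append(u)
    return clusters

# Four toy client updates: clients 0,1 move one way, clients 2,3 another
ups = [np.array([1.0, 0.1]), np.array([0.9, 0.12]),
       np.array([-1.0, 0.8]), np.array([-0.95, 0.85])]
print(cluster_updates(ups))  # two clusters -> two uploads instead of four
```

The communication saving is the ratio of clusters to clients per round; the paper's step-function similarity criterion would replace the plain cosine comparison here.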
Pub Date : 2025-11-05 | DOI: 10.1109/TKDE.2025.3621708
Chang Cui;Yongqiang Tang;Yuxun Qu;Wensheng Zhang
Survival analysis is extensively employed to analyze the probability of the event of interest, particularly in the medical field. Most current research treats patients as isolated entities, neglecting the complex associations among them, which leads to underutilization of valuable information. Recently, several studies address this limitation by incorporating patient graph structures. However, these approaches generally overlook two critical issues: 1) the exploration of heterogeneous inter-patient relationships, and 2) flexible and scalable inductive inference for test samples. To overcome these challenges, this study introduces a novel framework, Multiplex Graph Guided Deep Survival Analysis (MGG-Surv). Specifically, we employ multiplex patient graphs to capture comprehensive inter-patient associative information. Furthermore, we propose a teacher-student dual network architecture, where the teacher network encodes multiplex graphs, and the learned graph knowledge is transferred to the student network via a unidirectional connection termed Graph-Guided Distillation. The student network integrates this graph knowledge to predict survival outcomes without requiring the patient graphs. These innovative designs facilitate comprehensive integration of inter-patient relationships while achieving flexible and scalable graph-free inference. Experiments on four datasets, encompassing both single and competing risks, demonstrate the superior performance of our framework.
{"title":"Multiplex Graph Guided Deep Survival Analysis","authors":"Chang Cui;Yongqiang Tang;Yuxun Qu;Wensheng Zhang","doi":"10.1109/TKDE.2025.3621708","DOIUrl":"https://doi.org/10.1109/TKDE.2025.3621708","url":null,"abstract":"Survival analysis is extensively employed to analyze the probability of the event of interest, particularly in the medical field. Most current research treats patients as isolated entities, neglecting the complex associations among them, which leads to underutilization of valuable information. Recently, several studies address this limitation by incorporating patient graph structures. However, these approaches generally overlook two critical issues: 1) the exploration of heterogeneous inter-patient relationships, and 2) flexible and scalable inductive inference for test samples. To overcome these challenges, this study introduces a novel framework, Multiplex Graph Guided Deep Survival Analysis (MGG-Surv). Specifically, we employ multiplex patient graphs to capture comprehensive inter-patient associative information. Furthermore, we propose a teacher-student dual network architecture, where the teacher network encodes multiplex graphs, and the learned graph knowledge is transferred to the student network via a unidirectional connection termed Graph-Guided Distillation. The student network integrates this graph knowledge to predict survival outcomes without requiring the patient graphs. These innovative designs facilitate comprehensive integration of inter-patient relationships while achieving flexible and scalable graph-free inference. 
Experiments on four datasets, encompassing both single and competing risks, demonstrate the superior performance of our framework.","PeriodicalId":13496,"journal":{"name":"IEEE Transactions on Knowledge and Data Engineering","volume":"38 1","pages":"489-505"},"PeriodicalIF":10.4,"publicationDate":"2025-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145705868","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"Computer Science","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
While stock prediction has traditionally relied on volume-price and fundamental data to predict the return ratio or price movement trend, sentiment factors derived from social media platforms such as StockTwits offer a complementary and useful source of real-time market information. However, we find that most social media posts, along with the public sentiment they reflect, provide limited value for trading predictions due to their noisy nature. To tackle this, we propose a novel dynamic expert tracing algorithm that filters out non-informative posts and identifies both true and inverse experts whose consistent predictions can serve as valuable trading signals. Our approach achieves significant improvements over existing expert identification methods in stock trend prediction. However, when using binary expert predictions to predict the return ratio, our approach, like all other expert identification methods, faces a common challenge of signal sparsity, with expert signals covering only about 4% of all stock-day combinations in our dataset. To address this challenge, we propose a dual graph attention neural network that effectively propagates expert signals across related stocks, enabling accurate prediction of return ratios and significantly increasing signal coverage. Empirical results show that our propagated expert-based signals not only exhibit strong predictive power independently but also work synergistically with traditional financial features. These combined signals significantly outperform representative baseline models in all quant-related metrics, including predictive accuracy, return metrics, and correlation metrics, resulting in more robust investment strategies. We hope this work inspires further research into leveraging social media data for enhancing quantitative investment strategies.
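The core of expert tracing as described above — keep only users whose directional calls are consistently right (true experts) or consistently wrong (inverse experts, whose calls can be traded against) — can be illustrated with a minimal accuracy-threshold filter. The function name, the post-count minimum, and the 0.7/0.3 thresholds are illustrative assumptions, not the authors' algorithm.

```python
def classify_experts(history, min_posts=5, hi=0.7, lo=0.3):
    """history maps a user to a list of (predicted_up, actual_up) pairs.
    Returns {user: (kind, accuracy)}: 'true' experts are right at least
    `hi` of the time; 'inverse' experts are right at most `lo` of the
    time, so flipping their calls yields a usable signal. Thresholds
    are hypothetical."""
    experts = {}
    for user, recs in history.items():
        if len(recs) < min_posts:  # too short a track record: treat as noise
            continue
        acc = sum(p == a for p, a in recs) / len(recs)
        if acc >= hi:
            experts[user] = ("true", acc)
        elif acc <= lo:
            experts[user] = ("inverse", acc)
    return experts
```

Users near 50% accuracy are dropped entirely, which is consistent with the abstract's observation that most posts carry little predictive value; the sparsity of the surviving signals then motivates the graph-based propagation step.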
{"title":"Unleashing Expert Opinion From Social Media for Stock Prediction","authors":"Wanyun Zhou;Saizhuo Wang;Xiang Li;Yiyan Qi;Jian Guo;Xiaowen Chu","doi":"10.1109/TKDE.2025.3626439","DOIUrl":"https://doi.org/10.1109/TKDE.2025.3626439","url":null,"abstract":"While stock prediction has traditionally relied on volume-price and fundamental data to predict the return ratio or price movement trend, sentiment factors derived from social media platforms such as StockTwits offer a complementary and useful source of real-time market information. However, we find that most social media posts, along with the public sentiment they reflect, provide limited value for trading predictions due to their noisy nature. To tackle this, we propose a novel dynamic expert tracing algorithm that filters out non-informative posts and identifies both true and inverse experts whose consistent predictions can serve as valuable trading signals. Our approach achieves significant improvements over existing expert identification methods in stock trend prediction. However, when using binary expert predictions to predict the return ratio, our approach, like all other expert identification methods, faces a common challenge of signal sparsity, with expert signals covering only about 4% of all stock-day combinations in our dataset. To address this challenge, we propose a dual graph attention neural network that effectively propagates expert signals across related stocks, enabling accurate prediction of return ratios and significantly increasing signal coverage. Empirical results show that our propagated expert-based signals not only exhibit strong predictive power independently but also work synergistically with traditional financial features. These combined signals significantly outperform representative baseline models in all quant-related metrics, including predictive accuracy, return metrics, and correlation metrics, resulting in more robust investment strategies. 
We hope this work inspires further research into leveraging social media data for enhancing quantitative investment strategies.","PeriodicalId":13496,"journal":{"name":"IEEE Transactions on Knowledge and Data Engineering","volume":"38 2","pages":"1380-1394"},"PeriodicalIF":10.4,"publicationDate":"2025-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145898200","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"Computer Science","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Weakly-Supervised Anomaly Detection (WSAD) has garnered increasing research interest in recent years, as it enables superior detection performance while demanding only a small fraction of labeled data. However, existing WSAD methods face two major limitations. From the data aspect, they struggle to detect anomalies between normal clusters or collective anomalies because they overlook the multi-distribution nature and complex manifolds of real-world data. From the label aspect, they fall short of detecting unknown anomalies because of label insufficiency and anomaly contamination. To address these issues, we propose MMM, a unified WSAD framework for multi-distributional data. The framework consists of three components: a Multi-distribution data modeler captures latent representations of complex data distributions, followed by a Multiform feature extractor that extracts multiple underlying features from the modeler, highlighting the characteristics of potential anomalies. Finally, a Multi-strategy anomaly score estimator converts these features into anomaly scores, with the aid of a novel training approach with three strategies that maximize the utility of both data and labels. Experimental results showed that MMM achieved superior performance and robustness compared to state-of-the-art WSAD methods, while providing interpretable results that facilitate practical anomaly analysis.
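The weakly-supervised setting above — a small labeled subset guiding scores over mostly unlabeled data — can be illustrated with a deliberately minimal baseline: score each sample by its deviation from the statistics of the weakly labeled normal subset. This z-score-style estimator is a hedged stand-in for MMM's multi-strategy score estimator, not the paper's method, and it notably lacks the multi-distribution modeling the paper argues is necessary.

```python
import numpy as np

def anomaly_scores(X, normal_idx):
    """Score each row of X by its mean per-feature |z-score| relative
    to the weakly labeled normal subset X[normal_idx]. Higher scores
    indicate stronger deviation from the labeled-normal statistics."""
    mu = X[normal_idx].mean(axis=0)
    sd = X[normal_idx].std(axis=0) + 1e-8  # guard against zero variance
    return np.abs((X - mu) / sd).mean(axis=1)
```

A single global mean and standard deviation is exactly what fails when normals form several clusters (a point between two normal clusters can look "average"), which is the multi-distribution limitation MMM is designed to overcome.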
{"title":"MMM: A Unified Weakly-Supervised Anomaly Detection Framework for Multi-Distributional Data","authors":"Xu Tan;Junqi Chen;Jiawei Yang;Jie Chen;Susanto Rahardja","doi":"10.1109/TKDE.2025.3626561","DOIUrl":"https://doi.org/10.1109/TKDE.2025.3626561","url":null,"abstract":"Weakly-Supervised Anomaly Detection (WSAD) has garnered increasing research interest in recent years, as it enables superior detection performance while demanding only a small fraction of labeled data. However, existing WSAD methods face two major limitations. From the data aspect, they struggle to detect anomalies between normal clusters or collective anomalies because they overlook the multi-distribution nature and complex manifolds of real-world data. From the label aspect, they fall short of detecting unknown anomalies because of label insufficiency and anomaly contamination. To address these issues, we propose MMM, a unified WSAD framework for multi-distributional data. The framework consists of three components: a Multi-distribution data modeler captures latent representations of complex data distributions, followed by a Multiform feature extractor that extracts multiple underlying features from the modeler, highlighting the characteristics of potential anomalies. Finally, a Multi-strategy anomaly score estimator converts these features into anomaly scores, with the aid of a novel training approach with three strategies that maximize the utility of both data and labels. 
Experimental results showed that MMM achieved superior performance and robustness compared to state-of-the-art WSAD methods, while providing interpretable results that facilitate practical anomaly analysis.","PeriodicalId":13496,"journal":{"name":"IEEE Transactions on Knowledge and Data Engineering","volume":"38 1","pages":"442-456"},"PeriodicalIF":10.4,"publicationDate":"2025-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145705860","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"Computer Science","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}