Pub Date: 2025-10-15, DOI: 10.1109/TKDE.2025.3621843
Yunfan Kang;Yiyang Bian;Qinma Kang;Amr Magdy
Spatial regionalization is the process of grouping a set of spatial areas into spatially contiguous and homogeneous regions. This paper introduces the Incremental Max-P regionalization with statistical constraints (IMS) problem: a regionalization process that supports enriched user-defined constraints based on statistical aggregate functions and supports incremental updates. In addition to enabling richer constraints, it allows users to employ multiple constraints simultaneously, significantly extending the expressiveness and effectiveness of the existing regionalization literature. The IMS problem is NP-hard and significantly enriches existing regionalization problems. Such a major enrichment introduces several challenges in both feasibility and scalability. To address these challenges, we propose the FaCT algorithm, a three-phase heuristic approach that finds a feasible set of spatial regions satisfying the IMS constraints while scaling to larger datasets than the existing literature. FaCT supports local and global incremental updates when attribute values or constraints change. In addition, we incorporate the Iterated Greedy algorithm with FaCT to further improve the solution quality for the IMS problem and the classical max-p regions problem. Our extensive experimental evaluation demonstrates the effectiveness and scalability of our techniques on several real datasets.
{"title":"IMS: Incremental Max-P Regionalization With Statistical Constraints","authors":"Yunfan Kang;Yiyang Bian;Qinma Kang;Amr Magdy","doi":"10.1109/TKDE.2025.3621843","DOIUrl":"https://doi.org/10.1109/TKDE.2025.3621843","url":null,"abstract":"Spatial regionalization is the process of grouping a set of spatial areas into spatially contiguous and homogeneous regions. This paper introduces an <italic>Incremental Max-P regionalization with statistical constraints</i> (IMS) problem; a regionalization process that supports enriched user-defined constraints based on statistical aggregate functions and supports incremental updates. In addition to enabling richer constraints, it allows users to employ multiple constraints simultaneously to significantly push the expressiveness and effectiveness of the existing regionalization literature. The IMS problem is NP-hard and significantly enriches the existing regionalization problems. Such a major enrichment introduces several challenges in both feasibility and scalability. To address these challenges, we propose the <italic>FaCT</i> algorithm, a three-phase heuristic approach that finds a feasible set of spatial regions that satisfy IMS constraints while supporting large datasets compared to the existing literature. <italic>FaCT</i> supports local and global incremental updates when there are changes in attribute values or constraints. In addition, we incorporate the Iterated Greedy algorithm with <italic>FaCT</i> to further improve the solution quality of the IMS problem and the classical max-p regions problem. Our extensive experimental evaluation has demonstrated the effectiveness and scalability of our techniques on several real datasets.","PeriodicalId":13496,"journal":{"name":"IEEE Transactions on Knowledge and Data Engineering","volume":"38 1","pages":"380-398"},"PeriodicalIF":10.4,"publicationDate":"2025-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145705924","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Natural Language Processing (NLP) aims to analyze text or speech using techniques from computer science. It serves applications in domains such as healthcare, commerce, and education. In particular, NLP has been widely applied to the education domain, where its applications have enormous potential to support teaching and learning. In this survey, we review recent advances in NLP with a focus on solving problems relevant to the education domain. We begin by introducing the related background and the real-world scenarios in education to which NLP techniques could contribute. Then, we present a taxonomy of NLP in the education domain and highlight typical NLP applications, including question answering, question construction, automated assessment, and error correction. Next, we illustrate the task definitions, challenges, and corresponding cutting-edge techniques based on the above taxonomy. In particular, LLM-involved methods are discussed due to the wide usage of LLMs in diverse NLP applications. After that, we showcase some off-the-shelf demonstrations in this domain, which are designed for educators or researchers. Finally, we conclude with five promising directions for future research, including generalization over subjects and languages, deployed LLM-based systems for education, adaptive learning for teaching and learning, interpretability for education, and ethical considerations of NLP techniques.
{"title":"Survey of Natural Language Processing for Education: Taxonomy, Systematic Review, and Future Trends","authors":"Yunshi Lan;Xinyuan Li;Hanyue Du;Xuesong Lu;Ming Gao;Weining Qian;Aoying Zhou","doi":"10.1109/TKDE.2025.3621181","DOIUrl":"https://doi.org/10.1109/TKDE.2025.3621181","url":null,"abstract":"Natural Language Processing (NLP) aims to analyze text or speech via techniques in the computer science field. It serves applications in the domains of healthcare, commerce, education, and so on. Particularly, NLP has been widely applied to the education domain and its applications have enormous potential to help teaching and learning. In this survey, we review recent advances in NLP with a focus on solving problems relevant to the education domain. In detail, we begin with introducing the related background and the real-world scenarios in education to which NLP techniques could contribute. Then, we present a taxonomy of NLP in the education domain and highlight typical NLP applications including question answering, question construction, automated assessment, and error correction. Next, we illustrate the task definition, challenges, and corresponding cutting-edge techniques based on the above taxonomy. In particular, LLM-involved methods are included for discussion due to the wide usage of LLMs in diverse NLP applications. After that, we showcase some off-the-shelf demonstrations in this domain, which are designed for educators or researchers. At last, we conclude with five promising directions for future research, including generalization over subjects and languages, deployed LLM-based systems for education, adaptive learning for teaching and learning, interpretability for education, and ethical consideration of NLP techniques.","PeriodicalId":13496,"journal":{"name":"IEEE Transactions on Knowledge and Data Engineering","volume":"38 1","pages":"659-678"},"PeriodicalIF":10.4,"publicationDate":"2025-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145705887","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2025-10-14, DOI: 10.1109/TKDE.2025.3620577
Songnian Zhang;Hao Yuan;Hui Zhu;Jun Shao;Yandong Zheng;Fengwei Wang
Merging multi-source time series data in cloud servers significantly enhances the effectiveness of analyses. However, privacy concerns are hindering time series analytics in the cloud. In response, numerous secure time series analytics schemes have been designed to address these privacy concerns. Unfortunately, existing schemes suffer from severe performance issues, making them impractical for real-world applications. In this work, we propose novel secure time series analytics schemes that break through the performance bottleneck by substantially improving both communication and computational efficiency without compromising security. To attain this, we chart a new technical roadmap that leverages the idea of a mixed model. Specifically, we design a non-interactive secure Euclidean distance protocol by tailoring homomorphic secret sharing to suit subtractive secret sharing. Additionally, we devise a different approach to securely compute the minimum of three elements, simultaneously reducing computational and communication costs. Moreover, we introduce a rotation concept, design a rotation-based hybrid comparison mode, and finally propose a fast secure top-$k$ protocol that dramatically reduces comparison complexity. With the above secure protocols, we propose a practical secure time series analytics scheme with exceptional performance and a security-enhanced scheme that considers stronger adversaries. Formal security analyses demonstrate that our proposed schemes achieve the desired security requirements, while comprehensive experimental evaluations illustrate that our schemes outperform the state-of-the-art scheme in both computation and communication.
{"title":"Secure and Practical Time Series Analytics With Mixed Model","authors":"Songnian Zhang;Hao Yuan;Hui Zhu;Jun Shao;Yandong Zheng;Fengwei Wang","doi":"10.1109/TKDE.2025.3620577","DOIUrl":"https://doi.org/10.1109/TKDE.2025.3620577","url":null,"abstract":"Merging multi-source time series data in cloud servers significantly enhances the effectiveness of analyses. However, privacy concerns are hindering time series analytics in the cloud. Responsively, numerous secure time series analytics schemes have been designed to address privacy concerns. Unfortunately, existing schemes suffer from severe performance issues, making them impractical for real-world applications. In this work, we propose novel secure time series analytics schemes that break through the performance bottleneck by substantially improving both communication and computational efficiency without compromising security. To attain this, we open up a new technique roadmap that leverages the idea of mixed model. Specifically, we design a non-interactive secure Euclidean distance protocol by tailoring homomorphic secret sharing to suit subtractive secret sharing. Additionally, we devise a different approach to securely compute the minimum of three elements, simultaneously reducing computational and communication costs. Moreover, we delicately introduce a rotation concept, design a rotation-based hybrid comparison mode, and finally propose our fast secure top-<inline-formula><tex-math>$k$</tex-math></inline-formula> protocol that can dramatically reduce comparison complexity. With the above secure protocols, we propose a practical secure time series analytics scheme with exceptional performance and a security-enhanced scheme that considers stronger adversaries. Formal security analyses demonstrate that our proposed schemes can achieve the desired security requirements, while the comprehensive experimental evaluations illustrate that our schemes outperform the state-of-the-art scheme in both computation and communication.","PeriodicalId":13496,"journal":{"name":"IEEE Transactions on Knowledge and Data Engineering","volume":"38 1","pages":"588-601"},"PeriodicalIF":10.4,"publicationDate":"2025-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145705862","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Graph retrieval (GR), a ranking procedure that aims to sort the graphs in a database by their relevance to a query graph in decreasing order, has wide applications across diverse domains, such as visual object detection and drug discovery. Existing GR approaches usually compare graph pairs at a fine-grained level and generate similarity scores at quadratic cost. In realistic scenarios, conducting such quadratic fine-grained comparisons is costly, yet coarse-grained comparisons result in performance loss. Moreover, label scarcity in real-world data brings extra challenges. To tackle these issues, we investigate a more realistic GR problem, namely efficient graph retrieval (EGR). Our key intuition is that, since there are numerous underutilized unlabeled pairs in realistic scenarios, we can achieve a speed-up while simplifying the model, without sacrificing performance, by leveraging the additional information these pairs provide. Following this intuition, we propose an efficient model called Dual-Tower Model with Dividing, Contrasting and Alignment (TowerDNA). TowerDNA uses a GNN-based dual-tower model as a backbone to quickly compare graph pairs in a coarse-grained manner. In addition, to effectively utilize unlabeled pairs, TowerDNA first identifies confident pairs among them to expand the labeled dataset. It then learns from the remaining unconfident pairs via graph contrastive learning with geometric correspondence. To integrate all semantics with reduced bias, TowerDNA generates prototypes using labeled pairs, which are aligned within both confident and unconfident pairs. Extensive experiments on diverse realistic datasets demonstrate that TowerDNA achieves performance comparable to fine-grained methods while providing a 10× speed-up.
{"title":"TowerDNA: Fast and Accurate Graph Retrieval With Dividing, Contrasting and Alignment","authors":"Junwei Yang;Yiyang Gu;Yifang Qin;Xiao Luo;Zhiping Xiao;Kangjie Zheng;Wei Ju;Xian-Sheng Hua;Ming Zhang","doi":"10.1109/TKDE.2025.3621493","DOIUrl":"https://doi.org/10.1109/TKDE.2025.3621493","url":null,"abstract":"Graph retrieval (GR), a ranking procedure that aims to sort the graphs in a database by their relevance to a query graph in decreasing order, has wide applications across diverse domains, such as visual object detection and drug discovery. Existing Graph Retrieval (GR) approaches usually compare graph pairs at a detailed level and generate quadratic similarity scores. In realistic scenarios, conducting quadratic fine-grained comparisons is costly. However, coarse-grained comparisons would result in performance loss. Moreover, label scarcity in real-world data brings extra challenges. To tackle these issues, we investigate a more realistic GR problem, namely, efficient graph retrieval (EGR). Our key intuition is that, since there are numerous underutilized unlabeled pairs in realistic scenarios, by leveraging the additional information they provide, we can achieve speed-up while simplifying the model without sacrificing performance. Following our intuition, we propose an efficient model called Dual-<bold>Tower</b> Model with <bold>D</b>ividing, Co<bold>n</b>trasting and <bold>A</b>lignment (TowerDNA). TowerDNA utilizes a GNN-based dual-tower model as a backbone to quickly compare graph pairs in a coarse-grained manner. In addition, to effectively utilize unlabeled pairs, TowerDNA first identifies confident pairs from unlabeled pairs to expand labeled datasets. It then learns from remaining unconfident pairs via graph contrastive learning with geometric correspondence. To integrate all semantics with reduced biases, TowerDNA generates prototypes using labeled pairs, which are aligned within both confident and unconfident pairs. Extensive experiments on diverse realistic datasets demonstrate that TowerDNA achieves comparable performance to fine-grained methods while providing a 10× speed-up.","PeriodicalId":13496,"journal":{"name":"IEEE Transactions on Knowledge and Data Engineering","volume":"38 2","pages":"1364-1379"},"PeriodicalIF":10.4,"publicationDate":"2025-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145898184","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2025-10-13, DOI: 10.1109/TKDE.2025.3620605
Yanping Wu;Renjie Sun;Xiaoyang Wang;Ying Zhang;Lu Qin;Wenjie Zhang;Xuemin Lin
A $k$-plex is a subgraph in which each vertex can miss edges to at most $k$ vertices, including itself. The $k$-plex model finds many real-world applications, such as social network analysis and product recommendation. Previous studies on $k$-plexes mainly focus on static graphs. However, in reality, relationships between two entities often occur at specific timestamps, which can be modeled as temporal graphs. Directly extending the $k$-plex model may fail to find some critical groups in temporal graphs that exhibit frequently occurring patterns. To fill this gap, in this paper, we develop a novel model, named $(k,l)$-plex, which is a vertex set that exists in no fewer than $l$ timestamps, at each of which the induced subgraph is a $k$-plex. To identify practical results, we propose and investigate two important problems, i.e., large maximal $(k,l)$-plex (MalKLP) enumeration and maximum $(k,l)$-plex (MaxKLP) identification. For the MalKLP enumeration problem, a reasonable baseline method is first proposed by extending the Bron-Kerbosch (BK) framework. To overcome the limitations of the baseline and scale to large graphs, optimized strategies are developed, including a novel graph reduction approach and search-branch pruning techniques. For the MaxKLP identification task, we first design a baseline method by extending the proposed enumeration framework. Additionally, to accelerate the search, a new search framework with efficient branch pruning rules and a refined graph reduction method is developed. Finally, comprehensive experiments are conducted on 14 real-world datasets to validate the efficiency and effectiveness of the proposed techniques.
{"title":"Efficient $k$k-Plex Mining in Temporal Graphs","authors":"Yanping Wu;Renjie Sun;Xiaoyang Wang;Ying Zhang;Lu Qin;Wenjie Zhang;Xuemin Lin","doi":"10.1109/TKDE.2025.3620605","DOIUrl":"https://doi.org/10.1109/TKDE.2025.3620605","url":null,"abstract":"A <inline-formula><tex-math>$k$</tex-math></inline-formula>-plex is a subgraph in which each vertex can miss edges to at most <inline-formula><tex-math>$k$</tex-math></inline-formula> vertices, including itself. <inline-formula><tex-math>$k$</tex-math></inline-formula>-plex can find many real-world applications such as social network analysis and product recommendation. Previous studies about <inline-formula><tex-math>$k$</tex-math></inline-formula>-plex mainly focus on static graphs. However, in reality, relationships between two entities often occur at some specific timestamps, which can be modeled as temporal graphs. Directly extending the <inline-formula><tex-math>$k$</tex-math></inline-formula>-plex model may fail to find some critical groups in temporal graphs, which exhibit certain frequent occurring patterns. To fill the gap, in this paper, we develop a novel model, named <inline-formula><tex-math>$(k,l)$</tex-math></inline-formula>-plex, which is a vertex set that exists in no less than <inline-formula><tex-math>$l$</tex-math></inline-formula> timestamps, at each of which the subgraph induced is a <inline-formula><tex-math>$k$</tex-math></inline-formula>-plex. To identify practical results, we propose and investigate two important problems, i.e., large maximal <inline-formula><tex-math>$(k,l)$</tex-math></inline-formula>-plex (MalKLP) enumeration and maximum <inline-formula><tex-math>$(k,l)$</tex-math></inline-formula>-plex (MaxKLP) identification. For the MalKLP enumeration problem, a reasonable baseline method is first proposed by extending the Bron-Kerbosch (BK) framework. To overcome the limitations in baseline and scale for large graphs, optimized strategies are developed, including novel graph reduction approach and search branch pruning techniques. For the MaxKLP identification task, we first design a baseline method by extending the proposed enumeration framework. Additionally, to accelerate the search, a new search framework with efficient branch pruning rules and refined graph reduction method is developed. Finally, comprehensive experiments are conducted on 14 real-world datasets to validate the efficiency and effectiveness of the proposed techniques.","PeriodicalId":13496,"journal":{"name":"IEEE Transactions on Knowledge and Data Engineering","volume":"37 12","pages":"7105-7119"},"PeriodicalIF":10.4,"publicationDate":"2025-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145455828","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Graph Contrastive Learning (GCL) has recently garnered significant attention for enhancing recommender systems. Most existing GCL-based methods perturb the raw data graph to generate views, performing contrastive learning across these views to learn generalizable representations. However, most of these methods rely on data- or model-based augmentation techniques that may disrupt interest consistency. In this paper, we propose a novel interest-aware augmentation approach based on diffusion models to address this issue. Specifically, we leverage a conditional diffusion model to generate interest-consistent views by conditioning on node interaction information, ensuring that the generated views align with the interests of the nodes. Based on this augmentation method, we introduce DiffCL, a graph contrastive learning framework for recommendation. Furthermore, we propose an easy-to-hard generation strategy. By progressively adjusting the starting point of the reverse denoising process, this strategy further enhances effective contrastive learning. We evaluate DiffCL on three public real-world datasets, and results indicate that our method outperforms state-of-the-art techniques, demonstrating its effectiveness.
{"title":"Interest-Aware Graph Contrastive Learning for Recommendation With Diffusion-Based Augmentation","authors":"Mengyuan Jing;Yanmin Zhu;Zhaobo Wang;Jiadi Yu;Feilong Tang","doi":"10.1109/TKDE.2025.3620600","DOIUrl":"https://doi.org/10.1109/TKDE.2025.3620600","url":null,"abstract":"Graph Contrastive Learning (GCL) has recently garnered significant attention for enhancing recommender systems. Most existing GCL-based methods perturb the raw data graph to generate views, performing contrastive learning across these views to learn generalizable representations. However, most of these methods rely on data- or model-based augmentation techniques that may disrupt interest consistency. In this paper, we propose a novel interest-aware augmentation approach based on diffusion models to address this issue. Specifically, we leverage a conditional diffusion model to generate interest-consistent views by conditioning on node interaction information, ensuring that the generated views align with the interests of the nodes. Based on this augmentation method, we introduce DiffCL, a graph contrastive learning framework for recommendation. Furthermore, we propose an easy-to-hard generation strategy. By progressively adjusting the starting point of the reverse denoising process, this strategy further enhances effective contrastive learning. We evaluate DiffCL on three public real-world datasets, and results indicate that our method outperforms state-of-the-art techniques, demonstrating its effectiveness.","PeriodicalId":13496,"journal":{"name":"IEEE Transactions on Knowledge and Data Engineering","volume":"38 1","pages":"414-427"},"PeriodicalIF":10.4,"publicationDate":"2025-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145705885","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A precise workload forecaster is key to effective resource management, system scalability, and overall operational efficiency in cloud environments. However, real-world cloud systems frequently operate in dynamic and unpredictable settings, producing workloads that exhibit significant diversity and fluctuations. To address these problems, we introduce OMCR, a novel online multivariate forecaster for cloud resource management that overcomes the limitations of existing static forecasting methods through online learning. OMCR integrates long-term memory with a mechanism for rapid response to short-term changes in cloud systems, while also considering the impact of multivariate relationships on workload prediction. OMCR minimizes its reliance on historical data, thereby reducing training difficulty and maintaining lower prediction loss in the long run. OMCR also offers an adaptive approach to forecasting peak workloads over a given time span, which helps cloud resource management. Experimental results demonstrate the superior performance of our proposed framework compared to state-of-the-art methods in MAE and MSE metrics when forecasting cloud workloads.
{"title":"OMCR: An Online Multivariate Forecaster for Cloud Resource Management","authors":"Xu Gao;Xiu Tang;Chang Yao;Sai Wu;Gongsheng Yuan;Wenchao Zhou;Feifei Li;Gang Chen","doi":"10.1109/TKDE.2025.3619097","DOIUrl":"https://doi.org/10.1109/TKDE.2025.3619097","url":null,"abstract":"A precise workload forecaster is the key to effective resource management, system scalability, and overall operational efficiency in cloud environments. However, real-world cloud systems frequently operate in dynamic and unpredictable settings, causing workloads that exhibit significant diversity and fluctuations. To address these problems, we introduce OMCR, a novel online multivariate forecaster for cloud resource management, that overcomes the limitations of existing static forecasting methods through online learning. OMCR integrates long-term memory with a rapid response mechanism to short-term changes in cloud systems, while also considering the impact of multivariate relationships on workload prediction. OMCR minimizes its reliance on historical data, thereby reducing training difficulty and maintaining lower prediction loss in the long run. OMCR also offers an adaptive approach to forecasting peak workloads in a certain time span, which helps cloud resource management. Experimental results demonstrate the superior performance of our proposed framework compared to state-of-the-art methods in MAE and MSE metrics when forecasting cloud workloads.","PeriodicalId":13496,"journal":{"name":"IEEE Transactions on Knowledge and Data Engineering","volume":"38 1","pages":"532-545"},"PeriodicalIF":10.4,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145705895","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
In recent years, the medical industry has been generating a large amount of data, and how to securely store and reliably share these medical data has become a hot research topic. Cloud storage technology can be applied to the medical industry to accommodate the rapid growth of medical data. However, cloud-based data storage and sharing systems face a series of security issues: the integrity of outsourced medical data must be guaranteed, and malicious access between different medical institutions may leak users’ privacy. This article proposes a system that simultaneously supports integrity auditing of medical data and secure data sharing between different medical institutions under a terminal-edge-cloud framework. Specifically, patients/doctors are treated as terminal users, medical institutions are viewed as edge nodes, and medical clouds form the central storage layer. In the auditing process, a third-party auditor can verify the integrity of data stored in the medical cloud. Moreover, different medical institutions use private-set-intersection technology to share the electronic medical data of common users, while the data of users outside the intersection set need not be shared. Finally, security and performance analyses show that our proposed system is provably secure and achieves high computational and communication efficiency.
{"title":"Private-Set-Intersection-Based Medical Data Sharing Scheme With Integrity Auditing for IoMT Cloud Storage Systems","authors":"Zekun Li;Jinyong Chang;Bei Liang;Kaijing Ling;Yifan Dong;Yanyan Ji;Maozhi Xu","doi":"10.1109/TKDE.2025.3619426","DOIUrl":"https://doi.org/10.1109/TKDE.2025.3619426","url":null,"abstract":"In recent years, the medical industry is generating a large amount of data. How to securely store and reliably share these medical data has been a hot research topic. Cloud storage technology can be applied to the medical industry to adapt to the rapid growth of medical data. However, cloud-based data storage and sharing systems face a series of security issues: whether the integrity of outsourced medical data can be guaranteed, and malicious access between different medical institutions may leak user’s privacy. This article proposes a system that simultaneously solves the integrity auditing of medical data and securely data sharing between different medical institutions under the terminal-edge-cloud framework. Specifically, patients/doctors are treated as terminal users, medical institutions are viewed as edge nodes, and medical clouds form the central storage layer. In the process of data auditing, third-party auditor can achieve integrity auditing of medical cloud storage data. Moreover, different medical institutions use private-set-intersection technology to share the common user’s electronic medical data, while for other users not in intersection set, their data does not need to be shared. Finally, security and performance analyses show that our proposed system is provable secure and has high computational and communication efficiency.","PeriodicalId":13496,"journal":{"name":"IEEE Transactions on Knowledge and Data Engineering","volume":"37 12","pages":"7402-7413"},"PeriodicalIF":10.4,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145455947","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Graph-level anomaly detection (GLAD) aims to distinguish anomalous graphs that exhibit significant deviations from others. The graph-graph relationship, revealing the deviation and similarity between graphs, offers global insights at the whole-graph level for highlighting how anomalies diverge from normal graph patterns. Thus, understanding graph-graph relationships is critical to boosting models on GLAD tasks. However, existing deep GLAD algorithms rely heavily on Graph Neural Networks that primarily focus on analyzing individual graphs. These methods overlook the significance of graph-graph relationships in distinguishing anomalies from normal graphs. In this paper, we propose a novel model for Graph-level Anomaly Detection using the Transformer technique, namely GADTrans. Specifically, GADTrans builds the transformer upon crucial subgraphs mined by a parametrized extractor to model precise graph-graph relationships. The learned graph-graph relationships help distinguish normal graphs from anomalous ones. In addition, a specific loss is introduced to guide GADTrans in highlighting the deviation between anomalous and normal graphs while underlining the similarities among normal graphs. GADTrans achieves model interpretability by delivering human-interpretable results, namely the learned graph-graph relationships and crucial subgraphs. Extensive experiments on six real-world datasets verify the effectiveness and superiority of GADTrans for GLAD tasks.
{"title":"Learning From Graph-Graph Relationship: A New Perspective on Graph-Level Anomaly Detection","authors":"Zhenyu Yang;Ge Zhang;Jia Wu;Jian Yang;Hao Peng;Pietro Lió","doi":"10.1109/TKDE.2025.3618929","DOIUrl":"https://doi.org/10.1109/TKDE.2025.3618929","url":null,"abstract":"Graph-level anomaly detection (GLAD) aims to distinguish anomalous graphs that exhibit significant deviations from others. The graph-graph relationship, revealing the deviation and similarity between graphs, offers global insights into the entire graph level for highlighting the anomalies’ divergence from normal graph patterns. Thus, understanding graph-graph relationships is critical to boosting models on GLAD tasks. However, existing deep GLAD algorithms heavily rely on Graph Neural Networks that primarily focus on analyzing individual graphs. These methods overlook the significance of graph-graph relationships in telling anomalies from normal graphs. In this paper, we propose a novel model for Graph-level Anomaly Detection using the Transformer technique, namely GADTrans. Specifically, GADTrans builds the transformer upon crucial subgraphs mined by a parametrized extractor, for modeling precise graph-graph relationships. The learned graph-graph relationships put effort into distinguishing normal and anomalous graphs. In addition, a specific loss is introduced to guide GADTrans in highlighting the deviation between anomalous and normal graphs while underlining the similarities among normal graphs. GADTrans achieves model interpretability by delivering human-interpretable results, which are learned graph-graph relationships and crucial subgraphs. Extensive experiments on six real-world datasets verify the effectiveness and superiority of GADTrans for GLAD tasks.","PeriodicalId":13496,"journal":{"name":"IEEE Transactions on Knowledge and Data Engineering","volume":"38 1","pages":"428-441"},"PeriodicalIF":10.4,"publicationDate":"2025-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145705928","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2025-10-03, DOI: 10.1109/TKDE.2025.3617894
Hao Wu;Qu Wang;Xin Luo;Zidong Wang
A nonstandard tensor is frequently adopted to model a large-scale complex dynamic network. A Tensor Representation Learning (TRL) model enables extracting valuable knowledge from a dynamic network by learning a low-dimensional representation of a target nonstandard tensor. Nevertheless, the representation learning ability of existing TRL models is limited for a nonstandard tensor due to their inability to accurately represent its specific nature, i.e., mode imbalance, high dimensionality, and incompleteness. To address this issue, this study proposes a Mode-Aware Tucker Network-based Tensor Representation Learning (MTN-TRL) model with three-fold ideas: a) designing a mode-aware Tucker network to accurately represent the imbalanced modes of a nonstandard tensor, b) building a highly efficient MTN-based TRL model that fuses a data-density-oriented modeling principle with an adaptive parameter learning scheme, and c) theoretically proving the MTN-TRL model’s convergence. Extensive experiments on eight nonstandard tensors generated from real-world dynamic networks demonstrate that MTN-TRL significantly outperforms state-of-the-art models in terms of representation accuracy.
{"title":"Learning Accurate Representation to Nonstandard Tensors via a Mode-Aware Tucker Network","authors":"Hao Wu;Qu Wang;Xin Luo;Zidong Wang","doi":"10.1109/TKDE.2025.3617894","DOIUrl":"https://doi.org/10.1109/TKDE.2025.3617894","url":null,"abstract":"A nonstandard tensor is frequently adopted to model a large-sale complex dynamic network. A Tensor Representation Learning (TRL) model enables extracting valuable knowledge form a dynamic network via learning low-dimensional representation of a target nonstandard tensor. Nevertheless, the representation learning ability of existing TRL models are limited for a nonstandard tensor due to its inability to accurately represent the specific nature of the nonstandard tensor, i.e., mode imbalance, high-dimension, and incompleteness. To address this issue, this study innovatively proposes a Mode-Aware Tucker Network-based Tensor Representation Learning (MTN-TRL) model with three-fold ideas: a) designing a mode-aware Tucker network to accurately represent the imbalanced mode of a nonstandard tensor, b) building an MTN-based high-efficient TRL model that fuses both data density-oriented modeling principle and adaptive parameters learning scheme, and c) theoretically proving the MTN-TRL model’s convergence. Extensive experiments on eight nonstandard tensors generating from real-world dynamic networks demonstrate that MTN-TRL significantly outperforms state-of-the-art models in terms of representation accuracy.","PeriodicalId":13496,"journal":{"name":"IEEE Transactions on Knowledge and Data Engineering","volume":"37 12","pages":"7272-7285"},"PeriodicalIF":10.4,"publicationDate":"2025-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145455807","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}