首页 > 最新文献

IEEE Transactions on Knowledge and Data Engineering最新文献

英文 中文
Reliable Truth Discovery for Dynamic and Dependent Sources 动态和依赖源的可靠真相发现
IF 10.4 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-11 DOI: 10.1109/TKDE.2025.3631376
He Zhang;Shuang Wang;Long Chen;Xiaoping Li;Qing Gao;Quan Z. Sheng
In the era of Big Data and generative artificial intelligence (AI), discovering the truth about various objects from different sources has become a pressing topic. Existing studies primarily focus on dependent sources with conflicting information, where sources may copy information from each other. However, real-world scenarios are often more complex, with dynamic dependence relationships among sources over time. This complexity makes it much more difficult to discover the truth. One of the key challenges centers on measuring the dynamic dependence among sources. To address this challenge, we have developed three models: $Depen_{S}imple$, $Depen_{C}omplex$, and $Depen_{D}ynamic$. These models are based on the Hidden Markov Model (HMM) and are designed to handle different types of dependencies, namely simple source dependence, complex source dependence, and dynamic source dependence. Based on the constructed models, we propose a generic framework for discovering the latent truth which are evaluated by three HMM-based methods. We conduct extensive experiments on three real-world datasets to evaluate the performance of the proposed methods, and the results demonstrate that all three methods achieve high accuracy over the state-of-the-art methods.
在大数据和生成式人工智能(AI)时代,从不同来源发现各种物体的真相已成为一个紧迫的话题。现有的研究主要集中在信息相互冲突的依赖来源,其中来源可能相互复制信息。然而,现实世界的场景往往更加复杂,随着时间的推移,源之间的依赖关系是动态的。这种复杂性使得发现真相变得更加困难。关键的挑战之一集中在测量源之间的动态依赖关系。为了应对这一挑战,我们开发了三个模型:$Depen_{S} simple $、$Depen_{C} complex $和$Depen_{D} dynamic $。这些模型基于隐马尔可夫模型(HMM),设计用于处理不同类型的依赖,即简单源依赖、复杂源依赖和动态源依赖。在构建模型的基础上,提出了一种发现潜在真值的通用框架,并通过三种基于hmm的方法对潜在真值进行了评估。我们在三个真实世界的数据集上进行了广泛的实验来评估所提出的方法的性能,结果表明,所有三种方法都比最先进的方法具有更高的精度。
{"title":"Reliable Truth Discovery for Dynamic and Dependent Sources","authors":"He Zhang;Shuang Wang;Long Chen;Xiaoping Li;Qing Gao;Quan Z. Sheng","doi":"10.1109/TKDE.2025.3631376","DOIUrl":"https://doi.org/10.1109/TKDE.2025.3631376","url":null,"abstract":"In the era of Big Data and generative artificial intelligence (AI), discovering the truth about various objects from different sources has become a pressing topic. Existing studies primarily focus on dependent sources with conflicting information, where sources may copy information from each other. However, real-world scenarios are often more complex, with dynamic dependence relationships among sources over time. This complexity makes it much more difficult to discover the truth. One of the key challenges centers on measuring the dynamic dependence among sources. To address this challenge, we have developed three models: <inline-formula><tex-math>$Depen_{S}imple$</tex-math></inline-formula>, <inline-formula><tex-math>$Depen_{C}omplex$</tex-math></inline-formula>, and <inline-formula><tex-math>$Depen_{D}ynamic$</tex-math></inline-formula>. These models are based on the Hidden Markov Model (HMM) and are designed to handle different types of dependencies, namely <i>simple source dependence</i>, <i>complex source dependence</i>, and <i>dynamic source dependence</i>. Based on the constructed models, we propose a generic framework for discovering the latent truth which are evaluated by three HMM-based methods. We conduct extensive experiments on three real-world datasets to evaluate the performance of the proposed methods, and the results demonstrate that all three methods achieve high accuracy over the state-of-the-art methods.","PeriodicalId":13496,"journal":{"name":"IEEE Transactions on Knowledge and Data Engineering","volume":"38 1","pages":"546-558"},"PeriodicalIF":10.4,"publicationDate":"2025-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145705908","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Subgraph-Centric Multi-Agent Reinforcement Learning for Multi-Hop Knowledge Graph Reasoning 多跳知识图推理的子图中心多智能体强化学习
IF 10.4 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-11 DOI: 10.1109/TKDE.2025.3631495
Tao He;Zerui Chen;Lizi Liao;Yixin Cao;Yuanxing Liu;Wei Tang;Xun Mao;Kai Lv;Ming Liu;Bing Qin
Multi-hop Knowledge Graph Reasoning (KGR) seeks to identify accurate answers within Knowledge Graphs (KGs) via multi-step reasoning, predominantly utilizing reinforcement learning (RL) to enhance the efficiency of the reasoning process. Unlike traditional Knowledge Graph Embedding (KGE) methods, RL-based approaches offer superior interpretability. However, these methods often underperform due to two critical limitations: (1) their over-reliance on Horn rules for reasoning paths, which restricts their expressive power; and (2) inadequate utilization of reasoning states during the process. To address these issues, we propose a novel RL-based framework, RAR, which shifts focus from individual paths to subgraph structures for more robust predictions. RAR frames the retrieval of reasoning subgraphs from the KG as a Markov Decision Process (MDP) and incorporates a subgraph retriever. To efficiently explore the extensive subgraph space, we integrate multi-agent RL to enhance the retriever’s capabilities. Additionally, RAR features an advanced analyst module that meticulously examines reasoning states. These modules function iteratively: the retriever expands the subgraph, followed by the analyst module’s in-depth analysis. The insights gained are then used to inform subsequent retrieval steps. Ultimately, the predicted scores from both modules are synthesized to produce more precise posterior scores. Experimental results across multiple datasets demonstrate RAR’s efficacy, showcasing a notable improvement over existing state-of-the-art RL-based KGR methods.
多跳知识图推理(KGR)寻求通过多步推理在知识图(KGs)中识别准确的答案,主要利用强化学习(RL)来提高推理过程的效率。与传统的知识图嵌入(KGE)方法不同,基于强化学习的方法具有更好的可解释性。然而,由于两个关键的限制,这些方法往往表现不佳:(1)它们过度依赖霍恩规则的推理路径,这限制了它们的表达能力;(2)过程中推理状态的利用不足。为了解决这些问题,我们提出了一个新的基于强化学习的框架,RAR,它将焦点从单个路径转移到子图结构,以获得更稳健的预测。RAR将从KG中检索推理子图作为马尔可夫决策过程(MDP),并包含子图检索器。为了有效地探索广泛的子图空间,我们集成了多智能体强化学习来增强检索器的能力。此外,RAR还具有高级分析模块,可以仔细检查推理状态。这些模块迭代地工作:检索器展开子图,然后是分析模块的深入分析。获得的信息将用于通知后续的检索步骤。最后,将两个模块的预测分数进行综合,以产生更精确的后验分数。跨多个数据集的实验结果证明了RAR的有效性,显示了比现有最先进的基于rl的KGR方法的显着改进。
{"title":"Subgraph-Centric Multi-Agent Reinforcement Learning for Multi-Hop Knowledge Graph Reasoning","authors":"Tao He;Zerui Chen;Lizi Liao;Yixin Cao;Yuanxing Liu;Wei Tang;Xun Mao;Kai Lv;Ming Liu;Bing Qin","doi":"10.1109/TKDE.2025.3631495","DOIUrl":"https://doi.org/10.1109/TKDE.2025.3631495","url":null,"abstract":"Multi-hop Knowledge Graph Reasoning (KGR) seeks to identify accurate answers within Knowledge Graphs (KGs) via multi-step reasoning, predominantly utilizing reinforcement learning (RL) to enhance the efficiency of the reasoning process. Unlike traditional Knowledge Graph Embedding (KGE) methods, RL-based approaches offer superior interpretability. However, these methods often underperform due to two critical limitations: (1) their over-reliance on Horn rules for reasoning paths, which restricts their expressive power; and (2) inadequate utilization of reasoning states during the process. To address these issues, we propose a novel RL-based framework, RAR, which shifts focus from individual paths to subgraph structures for more robust predictions. RAR frames the retrieval of reasoning subgraphs from the KG as a Markov Decision Process (MDP) and incorporates a subgraph retriever. To efficiently explore the extensive subgraph space, we integrate multi-agent RL to enhance the retriever’s capabilities. Additionally, RAR features an advanced analyst module that meticulously examines reasoning states. These modules function iteratively: the retriever expands the subgraph, followed by the analyst module’s in-depth analysis. The insights gained are then used to inform subsequent retrieval steps. Ultimately, the predicted scores from both modules are synthesized to produce more precise posterior scores. Experimental results across multiple datasets demonstrate RAR’s efficacy, showcasing a notable improvement over existing state-of-the-art RL-based KGR methods.","PeriodicalId":13496,"journal":{"name":"IEEE Transactions on Knowledge and Data Engineering","volume":"38 2","pages":"1319-1333"},"PeriodicalIF":10.4,"publicationDate":"2025-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145898224","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Region Embedding With Adaptive Correlation Discovery for Predicting Urban Socioeconomic Indicators 基于自适应关联发现的区域嵌入预测城市社会经济指标
IF 10.4 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-10 DOI: 10.1109/TKDE.2025.3631025
Meng Chen;Hongwei Jia;Zechen Li;Weiming Huang;Kai Zhao;Yongshun Gong;Haoran Xu;Hongjun Dai
A recent trend in urban computing involves utilizing multi-modal data for urban region embedding, which can be further expanded in a variety of downstream urban sensing tasks. Many previous studies rely on multi-graph embedding techniques and follow a two-stage paradigm: first building a $k$-nearest neighbor graph based on fixed region correlations for each view, and then blending multi-view information in a posterior stage to learn region representations. However, multi-graph construction and multi-graph representation learning are not associated in most existing two-stage studies, and the relationship between them is not leveraged, which can provide complementary information to each other. In this paper, we unify these two stages into one by constructing learnable weighted complete graphs of regions and propose a new one-stage Region Embedding method with Adaptive region correlation Discovery (READ). Specifically, READ comprises three modules, including a disentangled region feature learning module utilizing a city-context Transformer to encode regions’ semantic and mobility features, and an adaptive weighted multi-graph construction module that builds multiple complete graphs with learnable weights based on disentangled features of regions. In addition, we propose a multi-graph representation learning module to yield effective region representations that integrate information from multiple graphs. We conduct thorough experiments on three downstream tasks to assess READ. Experimental results demonstrate that READ considerably outperforms state-of-the-art baseline methods in urban region embedding.
城市计算的最新趋势是利用多模态数据进行城市区域嵌入,这可以进一步扩展到各种下游城市传感任务中。许多先前的研究依赖于多图嵌入技术,并遵循两阶段范式:首先基于每个视图的固定区域相关性构建$k$最近邻图,然后在后一阶段混合多视图信息以学习区域表示。然而,在现有的大多数两阶段研究中,没有将多图构建和多图表示学习联系起来,也没有利用它们之间的关系,可以相互提供互补的信息。本文通过构造可学习的区域加权完全图,将这两个阶段统一为一个阶段,提出了一种新的基于自适应区域相关发现(READ)的单阶段区域嵌入方法。具体来说,READ包括三个模块,一个是利用city-context Transformer对区域的语义和迁移特征进行编码的解纠缠区域特征学习模块,另一个是基于区域解纠缠特征构建具有可学习权的多个完全图的自适应加权多图构建模块。此外,我们提出了一个多图表示学习模块,以产生集成多个图信息的有效区域表示。我们对三个下游任务进行了深入的实验来评估READ。实验结果表明,READ在城市区域嵌入中明显优于最先进的基线方法。
{"title":"Region Embedding With Adaptive Correlation Discovery for Predicting Urban Socioeconomic Indicators","authors":"Meng Chen;Hongwei Jia;Zechen Li;Weiming Huang;Kai Zhao;Yongshun Gong;Haoran Xu;Hongjun Dai","doi":"10.1109/TKDE.2025.3631025","DOIUrl":"https://doi.org/10.1109/TKDE.2025.3631025","url":null,"abstract":"A recent trend in urban computing involves utilizing multi-modal data for urban region embedding, which can be further expanded in a variety of downstream urban sensing tasks. Many previous studies rely on multi-graph embedding techniques and follow a two-stage paradigm: first building a <inline-formula><tex-math>$k$</tex-math></inline-formula>-nearest neighbor graph based on fixed region correlations for each view, and then blending multi-view information in a posterior stage to learn region representations. However, multi-graph construction and multi-graph representation learning are not associated in most existing two-stage studies, and the relationship between them is not leveraged, which can provide complementary information to each other. In this paper, we unify these two stages into one by constructing learnable weighted complete graphs of regions and propose a new one-stage Region Embedding method with Adaptive region correlation Discovery (READ). Specifically, READ comprises three modules, including a disentangled region feature learning module utilizing a city-context Transformer to encode regions’ semantic and mobility features, and an adaptive weighted multi-graph construction module that builds multiple complete graphs with learnable weights based on disentangled features of regions. In addition, we propose a multi-graph representation learning module to yield effective region representations that integrate information from multiple graphs. We conduct thorough experiments on three downstream tasks to assess READ. Experimental results demonstrate that READ considerably outperforms state-of-the-art baseline methods in urban region embedding.","PeriodicalId":13496,"journal":{"name":"IEEE Transactions on Knowledge and Data Engineering","volume":"38 2","pages":"1280-1291"},"PeriodicalIF":10.4,"publicationDate":"2025-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145898251","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Pseudoarboricity-Based Skyline Important Community Search in Large Networks 基于伪树性的大型网络Skyline重要社区搜索
IF 10.4 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-10 DOI: 10.1109/TKDE.2025.3631112
Jiaqi Jiang;Rong-Hua Li;Longlong Lin;Yalong Zhang;Yue Zeng;Xiaowei Ye;Guoren Wang
Important communities are densely connected subgraphs containing vertices with high importance values, which have received wide attention recently. However, existing methods, predominantly based on the $k$-core model, suffer from limitations such as rigid degree constraints and suboptimal density, often failing to capture highly important vertices. To address these limitations, we propose a new community model based on pseudoarboricity that guarantees near-optimal density while preserving important vertices. Further, we introduce a novel problem of Psudoarboricity-based Skyline Important Community (PSIC), which uniquely treats density and importance as independent attributes. To efficiently address PSIC, we first devise a basic algorithm $mathsf {ClimbStairs}$, which iteratively refines communities by peeling vertices with low importance. To boost efficiency, we develop an advanced algorithm $mathsf {DivAndCon}$, which employs a recursive divide-and-conquer strategy combined with weight-based and pseudoarboricity-based pruning techniques, significantly reducing the search space. For massive graphs with billions of edges, inspired by a recursive division tree, we develop several parallel algorithms utilizing thread-pool and free-synchronization mechanism. Finally, we conduct extensive experiments on 10 real-world networks, and the results demonstrate the superiority of our solutions in terms of effectiveness, efficiency, and scalability.
重要群落是由具有高重要值的顶点组成的紧密连接的子图,近年来受到广泛关注。然而,现有的方法,主要基于$k$-core模型,受到诸如刚性程度约束和次优密度等限制,经常无法捕获高度重要的顶点。为了解决这些限制,我们提出了一个基于伪树性的新社区模型,该模型在保留重要顶点的同时保证了接近最佳的密度。在此基础上,提出了一种将密度和重要性作为独立属性来处理的基于拟邻域的Skyline重要社区(PSIC)问题。为了有效地解决PSIC问题,我们首先设计了一个基本算法$mathsf {ClimbStairs}$,该算法通过剥离低重要性的顶点来迭代地优化社区。为了提高效率,我们开发了一种高级算法$mathsf {DivAndCon}$,该算法采用递归分治策略,结合基于权重和基于伪树性的修剪技术,显著减少了搜索空间。对于具有数十亿条边的海量图,受递归除法树的启发,我们开发了几种利用线程池和自由同步机制的并行算法。最后,我们在10个真实网络上进行了广泛的实验,结果证明了我们的解决方案在有效性、效率和可扩展性方面的优势。
{"title":"Pseudoarboricity-Based Skyline Important Community Search in Large Networks","authors":"Jiaqi Jiang;Rong-Hua Li;Longlong Lin;Yalong Zhang;Yue Zeng;Xiaowei Ye;Guoren Wang","doi":"10.1109/TKDE.2025.3631112","DOIUrl":"https://doi.org/10.1109/TKDE.2025.3631112","url":null,"abstract":"Important communities are densely connected subgraphs containing vertices with high importance values, which have received wide attention recently. However, existing methods, predominantly based on the <inline-formula><tex-math>$k$</tex-math></inline-formula>-core model, suffer from limitations such as rigid degree constraints and suboptimal density, often failing to capture highly important vertices. To address these limitations, we propose a new community model based on pseudoarboricity that guarantees near-optimal density while preserving important vertices. Further, we introduce a novel problem of Psudoarboricity-based Skyline Important Community (PSIC), which uniquely treats density and importance as independent attributes. To efficiently address PSIC, we first devise a basic algorithm <inline-formula><tex-math>$mathsf {ClimbStairs}$</tex-math></inline-formula>, which iteratively refines communities by peeling vertices with low importance. To boost efficiency, we develop an advanced algorithm <inline-formula><tex-math>$mathsf {DivAndCon}$</tex-math></inline-formula>, which employs a recursive divide-and-conquer strategy combined with weight-based and pseudoarboricity-based pruning techniques, significantly reducing the search space. For massive graphs with billions of edges, inspired by a recursive division tree, we develop several parallel algorithms utilizing thread-pool and free-synchronization mechanism. Finally, we conduct extensive experiments on 10 real-world networks, and the results demonstrate the superiority of our solutions in terms of effectiveness, efficiency, and scalability.","PeriodicalId":13496,"journal":{"name":"IEEE Transactions on Knowledge and Data Engineering","volume":"38 2","pages":"1264-1279"},"PeriodicalIF":10.4,"publicationDate":"2025-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145898223","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
NCSAC: Effective Neural Community Search via Attribute-Augmented Conductance 基于属性增强电导的有效神经社区搜索
IF 10.4 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-07 DOI: 10.1109/TKDE.2025.3630626
Longlong Lin;Quanao Li;Miao Qiao;Zeli Wang;Jin Zhao;Rong-Hua Li;Xin Luo;Tao Jia
Identifying locally dense communities closely connected to the user-initiated query node is crucial for a wide range of applications. Existing approaches either solely depend on rule-based constraints or exclusively utilize deep learning technologies to identify target communities. Therefore, an important question is proposed: can deep learning be integrated with rule-based constraints to elevate the quality of community search? In this paper, we affirmatively address this question by introducing a novel approach called Neural Community Search via Attribute-augmented Conductance, abbreviated as NCSAC. Specifically, NCSAC first proposes a novel concept of attribute-augmented conductance, which harmoniously blends the (internal and external) structural proximity and the attribute similarity. Then, NCSAC extracts a coarse candidate community of satisfactory quality using the proposed attribute-augmented conductance. Subsequently, NCSAC frames the community search as a graph optimization task, refining the candidate community through sophisticated reinforcement learning techniques, thereby producing high-quality results. Extensive experiments on six real-world graphs and ten competitors demonstrate the superiority of our solutions in terms of accuracy, efficiency, and scalability. Notably, the proposed solution outperforms state-of-the-art methods, achieving an impressive F1-score improvement ranging from 5.3% to 42.4%.
识别与用户发起的查询节点紧密相连的本地密集社区对于广泛的应用程序至关重要。现有的方法要么完全依赖于基于规则的约束,要么完全利用深度学习技术来识别目标社区。因此,提出了一个重要的问题:能否将深度学习与基于规则的约束相结合,从而提升社区搜索的质量?在本文中,我们通过引入一种称为基于属性增强电导的神经社区搜索(简称NCSAC)的新方法,肯定地解决了这个问题。具体而言,NCSAC首先提出了一种新的属性增强电导概念,该概念将(内部和外部)结构接近性和属性相似性和谐地融合在一起。然后,NCSAC利用提出的属性增强电导提取质量满意的粗候选群体。随后,NCSAC将社区搜索框架为图优化任务,通过复杂的强化学习技术精炼候选社区,从而产生高质量的结果。在六个真实世界的图和十个竞争对手上进行的广泛实验表明,我们的解决方案在准确性、效率和可扩展性方面具有优势。值得注意的是,所提出的解决方案优于最先进的方法,实现了令人印象深刻的f1分数提高,从5.3%到42.4%不等。
{"title":"NCSAC: Effective Neural Community Search via Attribute-Augmented Conductance","authors":"Longlong Lin;Quanao Li;Miao Qiao;Zeli Wang;Jin Zhao;Rong-Hua Li;Xin Luo;Tao Jia","doi":"10.1109/TKDE.2025.3630626","DOIUrl":"https://doi.org/10.1109/TKDE.2025.3630626","url":null,"abstract":"Identifying locally dense communities closely connected to the user-initiated query node is crucial for a wide range of applications. Existing approaches either solely depend on rule-based constraints or exclusively utilize deep learning technologies to identify target communities. Therefore, an important question is proposed: can deep learning be integrated with rule-based constraints to elevate the quality of community search? In this paper, we affirmatively address this question by introducing a novel approach called Neural Community Search via Attribute-augmented Conductance, abbreviated as NCSAC. Specifically, NCSAC first proposes a novel concept of attribute-augmented conductance, which harmoniously blends the (internal and external) structural proximity and the attribute similarity. Then, NCSAC extracts a coarse candidate community of satisfactory quality using the proposed attribute-augmented conductance. Subsequently, NCSAC frames the community search as a graph optimization task, refining the candidate community through sophisticated reinforcement learning techniques, thereby producing high-quality results. Extensive experiments on six real-world graphs and ten competitors demonstrate the superiority of our solutions in terms of accuracy, efficiency, and scalability. Notably, the proposed solution outperforms state-of-the-art methods, achieving an impressive F1-score improvement ranging from 5.3% to 42.4%.","PeriodicalId":13496,"journal":{"name":"IEEE Transactions on Knowledge and Data Engineering","volume":"38 2","pages":"1221-1235"},"PeriodicalIF":10.4,"publicationDate":"2025-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145898233","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploring Global and Local Hierarchies: Dual Classifier With Mutual Distillation for Hierarchical Text Classification 探索全局和局部层次:用于层次文本分类的互蒸馏双分类器
IF 10.4 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-06 DOI: 10.1109/TKDE.2025.3629743
Yubin Li;Zhaojian Cui;Haokai Gao;Jiale Liu;Yuncheng Jiang
As a pivotal variant of multi-label classification, hierarchical text classification (HTC) faces unique challenges due to its intricate taxonomic hierarchy. Recent state-of-the-art approaches improve performance by considering both global hierarchy covering all labels and local hierarchy indicating substructure of sample-specific ground-truth labels. However, they often over-condense hierarchical information into one or several tokens, which may cause the loss of useful knowledge. Accordingly, we propose a dual classifier model with global and local hierarchies (DCGL). It adopts prompt tuning-based BERT as the backbone, where global hierarchy is integrated into the soft prompt template. And this resulting classifier branch is termed global pipeline. To mitigate information loss caused by hierarchy condensation, we introduce a parallel local hierarchy-aware classifier pipeline. This local pipeline acquires label-level classification features through text propagation on the label hierarchy and aligns these features with oracle label representations of local hierarchy via graph contrastive learning, which serve as a novel strategy for local hierarchy incorporation. Thereby, DCGL obtains more granular and targeted features and captures local hierarchy information such as label co-occurrence and local structure. Moreover, since global and local pipelines capture distinct yet complementary information, we further apply mutual knowledge distillation to bridge the gap between their output logits and facilitate mutual learning. And to better control the distillation degree, we design a dynamic temperature negatively correlated with label confidence. Comprehensive experiments demonstrate that our DCGL outperforms several representative HTC methods.
作为多标签分类的重要变体,层次文本分类由于其复杂的分类层次结构而面临着独特的挑战。最近最先进的方法通过考虑覆盖所有标签的全局层次结构和指示特定样本的基础真值标签子结构的局部层次结构来提高性能。然而,它们经常将层次信息过度压缩为一个或几个令牌,这可能导致有用知识的丢失。因此,我们提出了一种具有全局和局部层次结构的双分类器模型(DCGL)。它采用基于提示调优的BERT作为主干,将全局层次结构集成到软提示模板中。这个分类器分支被称为全局管道。为了减少层次结构压缩造成的信息丢失,我们引入了一个并行的局部层次感知分类器管道。该局部管道通过标签层次结构上的文本传播获取标签级分类特征,并通过图对比学习将这些特征与本地层次结构的oracle标签表示对齐,这是一种新的本地层次结构合并策略。因此,DCGL获得了更细粒度和更有针对性的特征,并捕获了标签共现和局部结构等局部层次信息。此外,由于全球和本地管道捕获不同但互补的信息,我们进一步应用相互知识蒸馏来弥合其输出逻辑之间的差距,并促进相互学习。为了更好地控制蒸馏程度,我们设计了一个与标签置信度负相关的动态温度。综合实验表明,我们的DCGL优于几种具有代表性的HTC方法。
{"title":"Exploring Global and Local Hierarchies: Dual Classifier With Mutual Distillation for Hierarchical Text Classification","authors":"Yubin Li;Zhaojian Cui;Haokai Gao;Jiale Liu;Yuncheng Jiang","doi":"10.1109/TKDE.2025.3629743","DOIUrl":"https://doi.org/10.1109/TKDE.2025.3629743","url":null,"abstract":"As a pivotal variant of multi-label classification, hierarchical text classification (HTC) faces unique challenges due to its intricate taxonomic hierarchy. Recent state-of-the-art approaches improve performance by considering both global hierarchy covering all labels and local hierarchy indicating substructure of sample-specific ground-truth labels. However, they often over-condense hierarchical information into one or several tokens, which may cause the loss of useful knowledge. Accordingly, we propose a dual classifier model with global and local hierarchies (DCGL). It adopts prompt tuning-based BERT as the backbone, where global hierarchy is integrated into the soft prompt template. And this resulting classifier branch is termed global pipeline. To mitigate information loss caused by hierarchy condensation, we introduce a parallel local hierarchy-aware classifier pipeline. This local pipeline acquires label-level classification features through text propagation on the label hierarchy and aligns these features with oracle label representations of local hierarchy via graph contrastive learning, which serve as a novel strategy for local hierarchy incorporation. Thereby, DCGL obtains more granular and targeted features and captures local hierarchy information such as label co-occurrence and local structure. Moreover, since global and local pipelines capture distinct yet complementary information, we further apply mutual knowledge distillation to bridge the gap between their output logits and facilitate mutual learning. And to better control the distillation degree, we design a dynamic temperature negatively correlated with label confidence. Comprehensive experiments demonstrate that our DCGL outperforms several representative HTC methods.","PeriodicalId":13496,"journal":{"name":"IEEE Transactions on Knowledge and Data Engineering","volume":"38 2","pages":"1084-1098"},"PeriodicalIF":10.4,"publicationDate":"2025-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145898222","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-Party Federated Urban Flow Mining and Analysis Based on Lazy Aggregation 基于惰性聚合的多方联合城市流挖掘与分析
IF 10.4 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-06 DOI: 10.1109/TKDE.2025.3629816
Wenyu Fang;Wei Huang;Yanhua Liu;Jia Liu;Tianrui Li
Multi-party urban flow analysis is a crucial task in smart cities. However, existing analysis methods has difficulty in trade-off between data privacy security and spatio-temporal feature capture. The solution to the problem of how to capture the complete spatio-temporal features of multi-party urban flow data while protecting data privacy is of great importance in multi-party urban flow analysis. Therefore, to address data privacy and spatio-temporal feature capture in multi-party urban flow analysis, this paper proposes a spatio-temporal federated analysis model, for multi-party urban flow mining, which is able to effectively protect data privacy and capture spatio-temporal features completely at the same time. First, a multi-party urban flow mining framework based on federated learning is proposed to realize complete capture of spatio-temporal feature information of multi-party urban flow data and mining urban flow pattern knowledge under the premise of protecting data privacy. Second, to address the communication cost of the multi-party urban flow analysis, we propose a lazy aggregation method based on similarity clustering, which improves the communication efficiency between clients and the server. Further, we propose a similarity evaluation criteria for urban flow data based on step function, which can effectively calculate the similarity between urban flow data. Finally, we compare the proposed model with some benchmark methods on Chengdu Didi order data and point of interest data to prove the effectiveness of the proposed model and visualize and analyze the spatio-temporal features.
多方城市流分析是智慧城市的一项重要任务。然而,现有的分析方法难以在数据隐私安全和时空特征捕获之间权衡。如何在保护数据隐私的前提下完整地捕捉多方城市流数据的时空特征,对多方城市流分析具有重要意义。因此,为了解决多方城市流分析中的数据隐私和时空特征捕获问题,本文提出了一种用于多方城市流挖掘的时空联邦分析模型,该模型能够有效地保护数据隐私,同时完整地捕获时空特征。首先,提出了一种基于联邦学习的多方城市流挖掘框架,在保护数据隐私的前提下,实现多方城市流数据时空特征信息的完整捕获和城市流模式知识的挖掘。其次,针对多方城市流分析中的通信成本问题,提出了一种基于相似聚类的延迟聚合方法,提高了客户端与服务器端的通信效率。在此基础上,提出了一种基于阶跃函数的城市流数据相似性评价准则,可以有效地计算城市流数据之间的相似性。最后,将所提模型与成都滴滴订单数据和兴趣点数据的一些基准方法进行比较,验证所提模型的有效性,并对模型的时空特征进行可视化分析。
{"title":"Multi-Party Federated Urban Flow Mining and Analysis Based on Lazy Aggregation","authors":"Wenyu Fang;Wei Huang;Yanhua Liu;Jia Liu;Tianrui Li","doi":"10.1109/TKDE.2025.3629816","DOIUrl":"https://doi.org/10.1109/TKDE.2025.3629816","url":null,"abstract":"Multi-party urban flow analysis is a crucial task in smart cities. However, existing analysis methods has difficulty in trade-off between data privacy security and spatio-temporal feature capture. The solution to the problem of how to capture the complete spatio-temporal features of multi-party urban flow data while protecting data privacy is of great importance in multi-party urban flow analysis. Therefore, to address data privacy and spatio-temporal feature capture in multi-party urban flow analysis, this paper proposes a spatio-temporal federated analysis model, for multi-party urban flow mining, which is able to effectively protect data privacy and capture spatio-temporal features completely at the same time. First, a multi-party urban flow mining framework based on federated learning is proposed to realize complete capture of spatio-temporal feature information of multi-party urban flow data and mining urban flow pattern knowledge under the premise of protecting data privacy. Second, to address the communication cost of the multi-party urban flow analysis, we propose a lazy aggregation method based on similarity clustering, which improves the communication efficiency between clients and the server. Further, we propose a similarity evaluation criteria for urban flow data based on step function, which can effectively calculate the similarity between urban flow data. Finally, we compare the proposed model with some benchmark methods on Chengdu Didi order data and point of interest data to prove the effectiveness of the proposed model and visualize and analyze the spatio-temporal features.","PeriodicalId":13496,"journal":{"name":"IEEE Transactions on Knowledge and Data Engineering","volume":"38 1","pages":"475-488"},"PeriodicalIF":10.4,"publicationDate":"2025-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145705907","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multiplex Graph Guided Deep Survival Analysis 多重图引导深度生存分析
IF 10.4 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-11-05 DOI: 10.1109/TKDE.2025.3621708
Chang Cui;Yongqiang Tang;Yuxun Qu;Wensheng Zhang
Survival analysis is extensively employed to analyze the probability of the event of interest, particularly in the medical field. Most current research treats patients as isolated entities, neglecting the complex associations among them, which leads to underutilization of valuable information. Recently, several studies address this limitation by incorporating patient graph structures. However, these approaches generally overlook two critical issues: 1) the exploration of heterogeneous inter-patient relationships, and 2) flexible and scalable inductive inference for test samples. To overcome these challenges, this study introduces a novel framework, Multiplex Graph Guided Deep Survival Analysis (MGG-Surv). Specifically, we employ multiplex patient graphs to capture comprehensive inter-patient associative information. Furthermore, we propose a teacher-student dual network architecture, where the teacher network encodes multiplex graphs, and the learned graph knowledge is transferred to the student network via a unidirectional connection termed Graph-Guided Distillation. The student network integrates this graph knowledge to predict survival outcomes without requiring the patient graphs. These innovative designs facilitate comprehensive integration of inter-patient relationships while achieving flexible and scalable graph-free inference. Experiments on four datasets, encompassing both single and competing risks, demonstrate the superior performance of our framework.
生存分析被广泛用于分析感兴趣的事件的概率,特别是在医学领域。目前大多数研究将患者视为孤立的个体,忽视了他们之间的复杂联系,导致有价值的信息没有得到充分利用。最近,一些研究通过合并患者图结构来解决这一限制。然而,这些方法通常忽略了两个关键问题:1)对异质患者间关系的探索,以及2)对测试样本的灵活和可扩展的归纳推理。为了克服这些挑战,本研究引入了一个新的框架,即Multiplex Graph Guided Deep Survival Analysis (MGG-Surv)。具体来说,我们采用多路患者图来捕获全面的患者间关联信息。此外,我们提出了一种师生双网络架构,其中教师网络编码多路图,学习到的图知识通过称为图引导蒸馏的单向连接传输到学生网络。学生网络整合了这些图表知识来预测生存结果,而不需要病人的图表。这些创新的设计促进了患者间关系的全面集成,同时实现了灵活和可扩展的无图推理。在四个数据集上的实验,包括单一和竞争风险,证明了我们的框架的优越性能。
{"title":"Multiplex Graph Guided Deep Survival Analysis","authors":"Chang Cui;Yongqiang Tang;Yuxun Qu;Wensheng Zhang","doi":"10.1109/TKDE.2025.3621708","DOIUrl":"https://doi.org/10.1109/TKDE.2025.3621708","url":null,"abstract":"Survival analysis is extensively employed to analyze the probability of the event of interest, particularly in the medical field. Most current research treats patients as isolated entities, neglecting the complex associations among them, which leads to underutilization of valuable information. Recently, several studies address this limitation by incorporating patient graph structures. However, these approaches generally overlook two critical issues: 1) the exploration of heterogeneous inter-patient relationships, and 2) flexible and scalable inductive inference for test samples. To overcome these challenges, this study introduces a novel framework, Multiplex Graph Guided Deep Survival Analysis (MGG-Surv). Specifically, we employ multiplex patient graphs to capture comprehensive inter-patient associative information. Furthermore, we propose a teacher-student dual network architecture, where the teacher network encodes multiplex graphs, and the learned graph knowledge is transferred to the student network via a unidirectional connection termed Graph-Guided Distillation. The student network integrates this graph knowledge to predict survival outcomes without requiring the patient graphs. These innovative designs facilitate comprehensive integration of inter-patient relationships while achieving flexible and scalable graph-free inference. Experiments on four datasets, encompassing both single and competing risks, demonstrate the superior performance of our framework.","PeriodicalId":13496,"journal":{"name":"IEEE Transactions on Knowledge and Data Engineering","volume":"38 1","pages":"489-505"},"PeriodicalIF":10.4,"publicationDate":"2025-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145705868","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unleashing Expert Opinion From Social Media for Stock Prediction 利用社交媒体的专家意见进行股票预测
IF 10.4 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-10-28 DOI: 10.1109/TKDE.2025.3626439
Wanyun Zhou;Saizhuo Wang;Xiang Li;Yiyan Qi;Jian Guo;Xiaowen Chu
While stock prediction task traditionally relies on volume-price and fundamental data to predict the return ratio or price movement trend, sentiment factors derived from social media platforms such as StockTwits offer a complementary and useful source of real-time market information. However, we find that most social media posts, along with the public sentiment they reflect, provide limited value for trading predictions due to their noisy nature. To tackle this, we propose a novel dynamic expert tracing algorithm that filters out non-informative posts and identifies both true and inverse experts whose consistent predictions can serve as valuable trading signals. Our approach achieves significant improvements over existing expert identification methods in stock trend prediction. However, when using binary expert predictions to predict the return ratio, similar to all other expert identification methods, our approach faces a common challenge of signal sparsity with expert signals cover only about 4% of all stock-day combinations in our dataset. To address this challenge, we propose a dual graph attention neural network that effectively propagates expert signals across related stocks, enabling accurate prediction of return ratios and significantly increasing signal coverage. Empirical results show that our propagated expert-based signals not only exhibit strong predictive power independently but also work synergistically with traditional financial features. These combined signals significantly outperform representative baseline models in all quant-related metrics including predictive accuracy, return metrics, and correlation metrics, resulting in more robust investment strategies. We hope this work inspires further research into leveraging social media data for enhancing quantitative investment strategies.
虽然股票预测任务传统上依赖于量价和基本数据来预测回报率或价格运动趋势,但从社交媒体平台(如StockTwits)衍生的情绪因素提供了一个补充和有用的实时市场信息来源。然而,我们发现大多数社交媒体帖子以及它们所反映的公众情绪,由于其嘈杂的性质,对交易预测的价值有限。为了解决这个问题,我们提出了一种新的动态专家跟踪算法,该算法过滤掉非信息性帖子,并识别真实和反向专家,其一致的预测可以作为有价值的交易信号。我们的方法在股票趋势预测方面比现有的专家识别方法有了显著的改进。然而,当使用二元专家预测来预测回报率时,与所有其他专家识别方法类似,我们的方法面临着信号稀疏的共同挑战,专家信号仅覆盖我们数据集中所有股票日组合的4%左右。为了解决这一挑战,我们提出了一种双图注意神经网络,该网络有效地在相关股票之间传播专家信号,从而能够准确预测回报率并显着增加信号覆盖。实证结果表明,我们的传播专家信号不仅具有较强的独立预测能力,而且与传统金融特征具有协同作用。这些组合信号在所有量化相关指标(包括预测准确性、回报指标和相关性指标)中显著优于代表性基线模型,从而产生更稳健的投资策略。我们希望这项工作能激发进一步的研究,利用社交媒体数据来增强量化投资策略。
{"title":"Unleashing Expert Opinion From Social Media for Stock Prediction","authors":"Wanyun Zhou;Saizhuo Wang;Xiang Li;Yiyan Qi;Jian Guo;Xiaowen Chu","doi":"10.1109/TKDE.2025.3626439","DOIUrl":"https://doi.org/10.1109/TKDE.2025.3626439","url":null,"abstract":"While stock prediction task traditionally relies on volume-price and fundamental data to predict the return ratio or price movement trend, sentiment factors derived from social media platforms such as StockTwits offer a complementary and useful source of real-time market information. However, we find that most social media posts, along with the public sentiment they reflect, provide limited value for trading predictions due to their noisy nature. To tackle this, we propose a novel dynamic expert tracing algorithm that filters out non-informative posts and identifies both true and inverse experts whose consistent predictions can serve as valuable trading signals. Our approach achieves significant improvements over existing expert identification methods in stock trend prediction. However, when using binary expert predictions to predict the return ratio, similar to all other expert identification methods, our approach faces a common challenge of signal sparsity with expert signals cover only about 4% of all stock-day combinations in our dataset. To address this challenge, we propose a dual graph attention neural network that effectively propagates expert signals across related stocks, enabling accurate prediction of return ratios and significantly increasing signal coverage. Empirical results show that our propagated expert-based signals not only exhibit strong predictive power independently but also work synergistically with traditional financial features. These combined signals significantly outperform representative baseline models in all quant-related metrics including predictive accuracy, return metrics, and correlation metrics, resulting in more robust investment strategies. We hope this work inspires further research into leveraging social media data for enhancing quantitative investment strategies.","PeriodicalId":13496,"journal":{"name":"IEEE Transactions on Knowledge and Data Engineering","volume":"38 2","pages":"1380-1394"},"PeriodicalIF":10.4,"publicationDate":"2025-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145898200","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MMM: A Unified Weakly-Supervised Anomaly Detection Framework for Multi-Distributional Data 一种统一的多分布数据弱监督异常检测框架
IF 10.4 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-10-28 DOI: 10.1109/TKDE.2025.3626561
Xu Tan;Junqi Chen;Jiawei Yang;Jie Chen;Susanto Rahardja
Weakly-Supervised Anomaly Detection (WSAD) has garnered increasing research interest in recent years, as it enables superior detection performance while demanding only a small fraction of labeled data. However, existing WSAD methods face two major limitations. From the data aspect, they struggle to detect anomalies between normal clusters or collective anomalies due to overlooking the multi-distribution and complex manifolds of real-world data. From the label aspect, they fall short of detecting unknown anomalies because of the label-insufficiency and anomaly contamination. To address these issues, we propose MMM, a unified WSAD framework for multi-distributional data. The framework consists of three components: a Multi-distribution data modeler captures latent representations of complex data distributions, followed by a Multiform feature extractor that extracts multiple underlying features from the modeler, highlighting the characteristics of potential anomalies. Finally, a Multi-strategy anomaly score estimator converts these features into anomaly scores, with the aid of a novel training approach with three strategies that maximize the utility of both data and labels. Experimental results showed that MMM achieved superior performance and robustness compared to state-of-the-art WSAD methods, while providing interpretable results that facilitate practical anomaly analysis.
近年来,弱监督异常检测(WSAD)获得了越来越多的研究兴趣,因为它能够在只需要一小部分标记数据的情况下实现卓越的检测性能。然而,现有的WSAD方法面临两个主要的局限性。从数据方面来看,由于忽略了现实世界数据的多分布和复杂流形,他们很难检测正常集群或集体异常之间的异常。在标记方面,由于标记不足和异常污染,它们无法检测到未知异常。为了解决这些问题,我们提出了一个用于多分布数据的统一WSAD框架MMM。该框架由三个组件组成:一个多分布数据建模器捕获复杂数据分布的潜在表示,然后是一个多形式特征提取器,从建模器中提取多个底层特征,突出显示潜在异常的特征。最后,多策略异常分数估计器将这些特征转换为异常分数,借助一种新颖的训练方法,该方法具有三种策略,可以最大限度地利用数据和标签。实验结果表明,与最先进的WSAD方法相比,MMM具有更好的性能和鲁棒性,同时提供了有助于实际异常分析的可解释结果。
{"title":"MMM: A Unified Weakly-Supervised Anomaly Detection Framework for Multi-Distributional Data","authors":"Xu Tan;Junqi Chen;Jiawei Yang;Jie Chen;Susanto Rahardja","doi":"10.1109/TKDE.2025.3626561","DOIUrl":"https://doi.org/10.1109/TKDE.2025.3626561","url":null,"abstract":"Weakly-Supervised Anomaly Detection (WSAD) has garnered increasing research interest in recent years, as it enables superior detection performance while demanding only a small fraction of labeled data. However, existing WSAD methods face two major limitations. From the data aspect, they struggle to detect anomalies between normal clusters or collective anomalies due to overlooking the multi-distribution and complex manifolds of real-world data. From the label aspect, they fall short of detecting unknown anomalies because of the label-insufficiency and anomaly contamination. To address these issues, we propose MMM, a unified WSAD framework for multi-distributional data. The framework consists of three components: a Multi-distribution data modeler captures latent representations of complex data distributions, followed by a Multiform feature extractor that extracts multiple underlying features from the modeler, highlighting the characteristics of potential anomalies. Finally, a Multi-strategy anomaly score estimator converts these features into anomaly scores, with the aid of a novel training approach with three strategies that maximize the utility of both data and labels. Experimental results showed that MMM achieved superior performance and robustness compared to state-of-the-art WSAD methods, while providing interpretable results that facilitate practical anomaly analysis.","PeriodicalId":13496,"journal":{"name":"IEEE Transactions on Knowledge and Data Engineering","volume":"38 1","pages":"442-456"},"PeriodicalIF":10.4,"publicationDate":"2025-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145705860","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Transactions on Knowledge and Data Engineering
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1