Yue Wang , Shizhong Yuan , Weimin Li , Yifan Feng , Xiao Yu , Fangfang Liu , Can Wang , Quanke Pan
{"title":"Heterogeneous network for Hierarchical Fine-Grained Domain Fake News Detection","authors":"Yue Wang , Shizhong Yuan , Weimin Li , Yifan Feng , Xiao Yu , Fangfang Liu , Can Wang , Quanke Pan","doi":"10.1016/j.ipm.2025.104141","DOIUrl":null,"url":null,"abstract":"<div><div>Fake news on social media has significant negative consequences for individuals and society. However, existing multi-domain detection methods exhibit two primary limitations: dependence on precise domain annotation and inherent bias arising from single-domain categorization. To address these challenges, this paper introduces the Domain-Specific Narrow-Coverage Tree-Based Taxonomy (<span><math><msup><mrow><mtext>DNT</mtext></mrow><mrow><mn>2</mn></mrow></msup></math></span>), which enables more precise domain classification and domain relationship elucidation through refined categories. The constructed dataset is annotated with multiple labels by Large Language Models (LLMs), mitigating reliance on manual efforts and reducing annotation costs while maintaining annotation quality. Furthermore, a Hierarchical Fine-Grained Domain (HFGD) Fake News Detection Method is proposed, which explicitly employs a heterogeneous network to model multi-relationships. This method can mitigate domain bias and comprehensively capture news diversity and domain interactions. Specifically, domain cohesion based on news semantics is designed to reflect the relevance of news within a domain. News items are integrated as intersection nodes into the tree structure of multilevel domains to construct the heterogeneous network. Graph representation learning then fuses directly or indirectly connected news and domain information during feature enhancement. Finally, a composite loss is designed for news and domain node classification. HFGD captures potential differences and commonalities in domains and enhances label adaptation through domain interactions. Experiments on our dataset demonstrate that HFGD outperforms state-of-the-art methods by 1.08% and 0.91% in overall accuracy and macro-F1 score, respectively. Specifically, in the education and military domains with limited sample sizes, HFGD achieves 5.74% and 4.1% improvements in macro-F1 score over the second-best method. The results demonstrate our method’s effectiveness in mitigating domain bias and enhancing detection performance, providing valuable insights for practical multi-domain fake news detection systems.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"62 4","pages":"Article 104141"},"PeriodicalIF":6.9000,"publicationDate":"2025-03-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Processing & Management","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0306457325000822","RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Fake news on social media has significant negative consequences for individuals and society. However, existing multi-domain detection methods exhibit two primary limitations: dependence on precise domain annotation and inherent bias arising from single-domain categorization. To address these challenges, this paper introduces the Domain-Specific Narrow-Coverage Tree-Based Taxonomy (), which enables more precise domain classification and domain relationship elucidation through refined categories. The constructed dataset is annotated with multiple labels by Large Language Models (LLMs), mitigating reliance on manual efforts and reducing annotation costs while maintaining annotation quality. Furthermore, a Hierarchical Fine-Grained Domain (HFGD) Fake News Detection Method is proposed, which explicitly employs a heterogeneous network to model multi-relationships. This method can mitigate domain bias and comprehensively capture news diversity and domain interactions. Specifically, domain cohesion based on news semantics is designed to reflect the relevance of news within a domain. News items are integrated as intersection nodes into the tree structure of multilevel domains to construct the heterogeneous network. Graph representation learning then fuses directly or indirectly connected news and domain information during feature enhancement. Finally, a composite loss is designed for news and domain node classification. HFGD captures potential differences and commonalities in domains and enhances label adaptation through domain interactions. Experiments on our dataset demonstrate that HFGD outperforms state-of-the-art methods by 1.08% and 0.91% in overall accuracy and macro-F1 score, respectively. Specifically, in the education and military domains with limited sample sizes, HFGD achieves 5.74% and 4.1% improvements in macro-F1 score over the second-best method. The results demonstrate our method’s effectiveness in mitigating domain bias and enhancing detection performance, providing valuable insights for practical multi-domain fake news detection systems.
社交媒体上的假新闻对个人和社会都有严重的负面影响。然而,现有的多领域检测方法存在两个主要的局限性:依赖于精确的领域标注和单一领域分类带来的固有偏差。为了解决这些问题,本文引入了基于特定领域的窄覆盖树分类法(DNT2),该分类法通过细化的分类实现更精确的领域分类和领域关系阐明。构建的数据集由大型语言模型(Large Language Models, llm)用多个标签进行标注,减轻了对人工的依赖,在保持标注质量的同时降低了标注成本。在此基础上,提出了一种分层细粒度域(HFGD)假新闻检测方法,该方法明确地采用异构网络对多关系进行建模。该方法可以减轻领域偏见,全面捕捉新闻多样性和领域相互作用。具体来说,基于新闻语义的领域内聚旨在反映新闻在一个领域内的相关性。将新闻条目作为交叉节点集成到多层域的树状结构中,构建异构网络。图表示学习在特征增强过程中融合直接或间接关联的新闻和领域信息。最后,设计了一种用于新闻和域节点分类的复合损失算法。HFGD捕获域的潜在差异和共性,并通过域相互作用增强标签适应。在我们的数据集上的实验表明,HFGD在总体精度和宏观f1得分方面分别比最先进的方法高出1.08%和0.91%。具体而言,在样本量有限的教育和军事领域,HFGD比次优方法的宏观f1得分分别提高了5.74%和4.1%。结果表明,该方法在减轻领域偏差和提高检测性能方面是有效的,为实际的多领域假新闻检测系统提供了有价值的见解。
期刊介绍:
Information Processing and Management is dedicated to publishing cutting-edge original research at the convergence of computing and information science. Our scope encompasses theory, methods, and applications across various domains, including advertising, business, health, information science, information technology marketing, and social computing.
We aim to cater to the interests of both primary researchers and practitioners by offering an effective platform for the timely dissemination of advanced and topical issues in this interdisciplinary field. The journal places particular emphasis on original research articles, research survey articles, research method articles, and articles addressing critical applications of research. Join us in advancing knowledge and innovation at the intersection of computing and information science.