{"title":"Do not wait: Preemptive rumor detection with cooperative LLMs and accessible social context","authors":"Junyi Chen, Leyuan Liu, Fan Zhou","doi":"10.1016/j.ipm.2024.103995","DOIUrl":null,"url":null,"abstract":"<div><div>Concerning the dissemination of rumors via tweets, current methods for detecting rumors – both content-based and graph-based – do not adequately address the need for preemptive action to suppress rumors before public exposure. Although the advancement of large language models (LLMs) indicates a positive trend, their application in rumor detection remains either overly simplistic or excessively complex. Motivated by these, we put forward EvidenceRD, which employs an efficient yet effective cooperative strategy of three types of LLMs to mine informative evidence that augments content context behind tweets warranting a fact check. This is then integrated with a credibility network, an automatically generated social context based on the social homophily theory, to depict potential credibility relationships between authors of corresponding tweets. Extensive experiments across four public datasets confirm the efficacy of the proposed EvidenceRD. Specifically, EvidenceRD outperforms state-of-the-art baselines across various categories, achieving an improvement in general detection performance ranging from 3% to 16%. This superiority is achieved by exclusively utilizing pre-public information, a constraint not imposed on the comparative baselines. Besides, As EvidenceRD considers both evidence and the social context behind a tweet, it not only offers enhanced explainability through its multi-perspective evidence presented in natural language but also demonstrates greater robustness and transferability across different scenarios. Additional efficiency analysis demonstrates that the above-enhanced characteristics brought by EvidenceRD are cost-effective in terms of both financial and computational expenses. To summarize, compared to existing studies, this work theoretically presents a generally effective and efficient paradigm for utilizing LLMs in the context of rumor detection and demonstrates how integrating accessible social context can effectively detect rumors. Practically, the proposed method can accurately detect rumors before they emerge publicly with several notable metrics improvements and aid human decision-making on them from multiple dimensions.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"62 3","pages":"Article 103995"},"PeriodicalIF":7.4000,"publicationDate":"2024-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Processing & Management","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0306457324003546","RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Concerning the dissemination of rumors via tweets, current methods for detecting rumors – both content-based and graph-based – do not adequately address the need for preemptive action to suppress rumors before public exposure. Although the advancement of large language models (LLMs) indicates a positive trend, their application in rumor detection remains either overly simplistic or excessively complex. Motivated by these, we put forward EvidenceRD, which employs an efficient yet effective cooperative strategy of three types of LLMs to mine informative evidence that augments content context behind tweets warranting a fact check. This is then integrated with a credibility network, an automatically generated social context based on the social homophily theory, to depict potential credibility relationships between authors of corresponding tweets. Extensive experiments across four public datasets confirm the efficacy of the proposed EvidenceRD. Specifically, EvidenceRD outperforms state-of-the-art baselines across various categories, achieving an improvement in general detection performance ranging from 3% to 16%. This superiority is achieved by exclusively utilizing pre-public information, a constraint not imposed on the comparative baselines. Besides, As EvidenceRD considers both evidence and the social context behind a tweet, it not only offers enhanced explainability through its multi-perspective evidence presented in natural language but also demonstrates greater robustness and transferability across different scenarios. Additional efficiency analysis demonstrates that the above-enhanced characteristics brought by EvidenceRD are cost-effective in terms of both financial and computational expenses. To summarize, compared to existing studies, this work theoretically presents a generally effective and efficient paradigm for utilizing LLMs in the context of rumor detection and demonstrates how integrating accessible social context can effectively detect rumors. Practically, the proposed method can accurately detect rumors before they emerge publicly with several notable metrics improvements and aid human decision-making on them from multiple dimensions.
期刊介绍:
Information Processing and Management is dedicated to publishing cutting-edge original research at the convergence of computing and information science. Our scope encompasses theory, methods, and applications across various domains, including advertising, business, health, information science, information technology marketing, and social computing.
We aim to cater to the interests of both primary researchers and practitioners by offering an effective platform for the timely dissemination of advanced and topical issues in this interdisciplinary field. The journal places particular emphasis on original research articles, research survey articles, research method articles, and articles addressing critical applications of research. Join us in advancing knowledge and innovation at the intersection of computing and information science.