首页 > 最新文献

Information Processing & Management最新文献

英文 中文
Measuring and forecasting global digital economy efficiency: An integrated approach using the super-efficiency sequential SBM model and machine learning algorithms 测量和预测全球数字经济效率:使用超效率顺序SBM模型和机器学习算法的综合方法
IF 6.9 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-01-19 DOI: 10.1016/j.ipm.2026.104638
Zhishuo Zhang , Hu Liu , Tao Shi , Qian Li , Huayong Niu
This study develops a systematic Measurement–Analysis–Prediction framework to evaluate global digital economy efficiency using data from 114 countries over 2006–2023. Efficiency is decomposed into two stages—infrastructure transformation and value creation and international competitiveness—measured via a super-efficiency sequential Slack-Based Measure (SBM) model. Regional disparities are examined with the Dagum Gini coefficient, and machine learning models are employed for prediction, with Random Forest (RF) identified as the optimal predictor. Results show that global digital economy efficiency has shown a fluctuating upward trend, with Stage 1 (infrastructure transformation) consistently outperforms Stage 2 (value creation). Notably, 2021 marked a significant turning point for infrastructure transformation efficiency, with the efficiency value surging to 0.2883 due to pandemic-induced digital demand. Europe achieves the highest efficiency, while Asia and the Americas exhibit strong internal polarization; overall disparities are driven mainly by net inter-regional gaps. Machine learning predictions indicate efficiency will increase from 0.3210 in 2024 to 0.3566 in 2028, though regional imbalances are expected to persist. Overall, this study provides robust empirical evidence and a comprehensive framework for understanding the transmission mechanisms of digital economy efficiency, interpreting global disparity patterns, and guiding policy formulation.
本研究开发了一个系统的测量-分析-预测框架,利用2006-2023年114个国家的数据来评估全球数字经济效率。将效率分解为基础设施改造和价值创造两个阶段,并通过超效率序列基于松弛测度(SBM)模型对国际竞争力进行测度。使用Dagum基尼系数检查区域差异,并使用机器学习模型进行预测,随机森林(RF)被确定为最佳预测器。结果表明,全球数字经济效率呈现波动上升趋势,第一阶段(基础设施改造)的表现始终优于第二阶段(价值创造)。值得注意的是,2021年是基础设施转型效率的重要转折点,受疫情引发的数字化需求影响,效率值飙升至0.2883。欧洲的效率最高,而亚洲和美洲则表现出强烈的内部极化;总体差距主要是由区域间净差距造成的。机器学习预测表明,效率将从2024年的0.3210提高到2028年的0.3566,尽管区域失衡预计将持续存在。总体而言,本研究为理解数字经济效率的传导机制、解释全球差距格局和指导政策制定提供了强有力的实证证据和全面的框架。
{"title":"Measuring and forecasting global digital economy efficiency: An integrated approach using the super-efficiency sequential SBM model and machine learning algorithms","authors":"Zhishuo Zhang ,&nbsp;Hu Liu ,&nbsp;Tao Shi ,&nbsp;Qian Li ,&nbsp;Huayong Niu","doi":"10.1016/j.ipm.2026.104638","DOIUrl":"10.1016/j.ipm.2026.104638","url":null,"abstract":"<div><div>This study develops a systematic <em>Measurement–Analysis–Prediction</em> framework to evaluate global digital economy efficiency using data from 114 countries over 2006–2023. Efficiency is decomposed into two stages—<em>infrastructure transformation</em> and <em>value creation and international competitiveness</em>—measured via a super-efficiency sequential Slack-Based Measure (SBM) model. Regional disparities are examined with the Dagum Gini coefficient, and machine learning models are employed for prediction, with Random Forest (RF) identified as the optimal predictor. Results show that global digital economy efficiency has shown a fluctuating upward trend, with Stage 1 (infrastructure transformation) consistently outperforms Stage 2 (value creation). Notably, 2021 marked a significant turning point for infrastructure transformation efficiency, with the efficiency value surging to 0.2883 due to pandemic-induced digital demand. Europe achieves the highest efficiency, while Asia and the Americas exhibit strong internal polarization; overall disparities are driven mainly by net inter-regional gaps. Machine learning predictions indicate efficiency will increase from 0.3210 in 2024 to 0.3566 in 2028, though regional imbalances are expected to persist. Overall, this study provides robust empirical evidence and a comprehensive framework for understanding the transmission mechanisms of digital economy efficiency, interpreting global disparity patterns, and guiding policy formulation.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"63 4","pages":"Article 104638"},"PeriodicalIF":6.9,"publicationDate":"2026-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146023314","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correlation guided multi-teacher distillation for lightweight image retrieval 基于相关性的多教师蒸馏轻量图像检索
IF 6.9 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-01-18 DOI: 10.1016/j.ipm.2025.104583
Xiangkun Ban , Guang-Hai Liu , Zhou Lu , Bo-Jian Zhang
Image retrieval based on knowledge distillation has enormous potential for lightweight image retrieval. However, existing methods often transfer redundant or inconsistent knowledge to a single student model, resulting in it still rely on the teacher model during the inference phase. To address this problem, an efficient knowledge distillation method based on the multi-teacher distillation framework is proposed to form the basis for lightweight image retrieval, namely the correlation guided distillation (CGD), where correlation coefficient fusion and Cosine-based product quantization are two novel components. The former effectively mitigates noise and conflicting supervision in multi-teacher fusion, the latter encodes rich directional semantics to enhance structural alignment. The whitened teacher features are fused via using correlation coefficient fusion method, which is used to guide the student model through the structural similarity supervision provided by Cosine-based product quantization. Extensive experiments on several benchmark datasets demonstrate that our distillation framework achieves superior performance compared with the latest teacher-dependent methods under the same settings. In terms of the mAP scores with the M and H protocol setups, the performances are improved by 4.1% and 4.5% on the ROxford dataset, and by 2.1% and 4.4% on the RParis dataset. On the large-scale variants with 1 M distractor images, the respective improvements reach 7.9% and 9.0% on ROxford+1M, and 3.3% and 6.6% on RParis+1M. Furthermore, when using only the student model, our CGD method is up to 48.83 × faster than the teacher-dependent methods, without reducing retrieval performance.
基于知识蒸馏的图像检索具有极大的轻量级图像检索潜力。然而,现有方法往往将冗余或不一致的知识转移到单个学生模型中,导致其在推理阶段仍然依赖于教师模型。为了解决这一问题,提出了一种基于多教师蒸馏框架的高效知识蒸馏方法,即相关引导蒸馏(CGD),以相关系数融合和基于余弦的积量化为两个新组成部分,为轻量化图像检索奠定了基础。前者有效地缓解了多教师融合中的噪声和冲突监督,后者编码了丰富的方向语义,增强了结构一致性。将白化后的教师特征通过相关系数融合方法进行融合,利用余弦积量化提供的结构相似性监督来指导学生模型。在多个基准数据集上进行的大量实验表明,在相同的设置下,我们的蒸馏框架与最新的依赖于教师的方法相比取得了更好的性能。就M和H协议设置的mAP分数而言,在ROxford数据集上的性能提高了4.1%和4.5%,在RParis数据集上的性能提高了2.1%和4.4%。在1张 M分心图像的大规模变体上,ROxford+1M的改进率分别为7.9%和9.0%,RParis+1M的改进率分别为3.3%和6.6%。此外,当只使用学生模型时,我们的CGD方法比依赖教师的方法快48.83 × ,而不降低检索性能。
{"title":"Correlation guided multi-teacher distillation for lightweight image retrieval","authors":"Xiangkun Ban ,&nbsp;Guang-Hai Liu ,&nbsp;Zhou Lu ,&nbsp;Bo-Jian Zhang","doi":"10.1016/j.ipm.2025.104583","DOIUrl":"10.1016/j.ipm.2025.104583","url":null,"abstract":"<div><div>Image retrieval based on knowledge distillation has enormous potential for lightweight image retrieval. However, existing methods often transfer redundant or inconsistent knowledge to a single student model, resulting in it still rely on the teacher model during the inference phase. To address this problem, an efficient knowledge distillation method based on the multi-teacher distillation framework is proposed to form the basis for lightweight image retrieval, namely the correlation guided distillation (CGD), where correlation coefficient fusion and Cosine-based product quantization are two novel components. The former effectively mitigates noise and conflicting supervision in multi-teacher fusion, the latter encodes rich directional semantics to enhance structural alignment. The whitened teacher features are fused via using correlation coefficient fusion method, which is used to guide the student model through the structural similarity supervision provided by Cosine-based product quantization. Extensive experiments on several benchmark datasets demonstrate that our distillation framework achieves superior performance compared with the latest teacher-dependent methods under the same settings. In terms of the mAP scores with the M and H protocol setups, the performances are improved by 4.1% and 4.5% on the ROxford dataset, and by 2.1% and 4.4% on the RParis dataset. On the large-scale variants with 1 M distractor images, the respective improvements reach 7.9% and 9.0% on ROxford+1M, and 3.3% and 6.6% on RParis+1M. Furthermore, when using only the student model, our CGD method is up to 48.83 × faster than the teacher-dependent methods, without reducing retrieval performance.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"63 4","pages":"Article 104583"},"PeriodicalIF":6.9,"publicationDate":"2026-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146023313","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An information processing framework for education: Supporting automatic question generation with NLP to minimize human intervention 教育信息处理框架:支持NLP自动问题生成,减少人为干预
IF 6.9 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-01-16 DOI: 10.1016/j.ipm.2026.104613
Memoona Saleem , Zahoor Ur Rehman , Raja Hashim Ali , Ujala Akmal , Ali Zeeshan Ijaz , Raja Manzar Abbas
The recent model and conceptual advancements in Artificial Intelligence (AI) and Natural Language Processing (NLP) are changing how knowledge-intensive tasks are giving more reliable and creative decisions. Particularly, the advancements in NLP has completely changed the way human language is processed, is understood, and is then used for generating human language. These updates are useful for the deeper analysis of textual content that support information systems and their management. For example, in the field of education, these technologies offer advanced and more intelligent tools, which enhance education through improved learning experiences, optimized assessments, and other teaching and study mechanisms. In this paper, we have worked on a unified framework for automatic questions generation (AQG) and educational content analysis. For this purpose, we have developed Questify-TheEduBot, which integrates transformer-based models (BERT, GPT), Latent Dirichlet Allocation (LDA), sentiment analysis, and keyword extraction into a single pipeline. Existing tools typically address isolated tasks but our tool generates multiple types of questions, i.e., multiple choice questions (MCQs), cloze, as well as descriptive type of questions. In addition to AQG, Questify-TheEduBot simultaneously validates semantic coherence, topic coverage, and sentiment appropriateness. We have evaluated and compared our model on SQuAD v2.0 and LearningQ datasets, which consists of over 300,000 Question Answer pairs. Questify-TheEduBot demonstrated excellent performance on the test datasets, with cosine similarity above 0.85, keyword overlap of 87%, and topic modeling precision of 89%. Human evaluation further confirms the pedagogical relevance of generated questions, where our study shows significant improvements over template-based and Seq2Seq competing baseline models. The web-based platform of our tool offers instructors and learners with a tested, interpretable, and resource-efficient tool for automated assessment, support in curriculum development, and enables personalized learning. By merging automated question generation and content analytics, Questify-TheEduBot advances the state of NLP in the field of education, where it provides actionable insights for information management in digital learning environments.
人工智能(AI)和自然语言处理(NLP)的最新模型和概念进展正在改变知识密集型任务如何提供更可靠和创造性的决策。特别是,NLP的进步已经完全改变了人类语言的处理方式,理解,然后用于生成人类语言。这些更新有助于对支持信息系统及其管理的文本内容进行更深入的分析。例如,在教育领域,这些技术提供了更先进、更智能的工具,通过改善学习体验、优化评估和其他教学和学习机制来加强教育。在本文中,我们研究了一个用于自动问题生成(AQG)和教育内容分析的统一框架。为此,我们开发了Questify-TheEduBot,它将基于转换器的模型(BERT, GPT),潜在狄利克雷分配(LDA),情感分析和关键字提取集成到单个管道中。现有的工具通常解决孤立的任务,但我们的工具生成多种类型的问题,即选择题(mcq),完形填空,以及描述性问题。除了AQG, Questify-TheEduBot还可以同时验证语义一致性、主题覆盖和情感适当性。我们在SQuAD v2.0和LearningQ数据集上对我们的模型进行了评估和比较,这些数据集由超过30万个问答对组成。Questify-TheEduBot在测试数据集上表现出色,余弦相似度在0.85以上,关键词重叠率为87%,主题建模精度为89%。人类评估进一步证实了生成问题的教学相关性,我们的研究显示了基于模板和Seq2Seq竞争基线模型的显着改进。我们的工具的网络平台为教师和学习者提供了一个经过测试的、可解释的、资源高效的工具,用于自动评估,支持课程开发,并实现个性化学习。通过合并自动问题生成和内容分析,Questify-TheEduBot推动了NLP在教育领域的发展,为数字学习环境中的信息管理提供了可操作的见解。
{"title":"An information processing framework for education: Supporting automatic question generation with NLP to minimize human intervention","authors":"Memoona Saleem ,&nbsp;Zahoor Ur Rehman ,&nbsp;Raja Hashim Ali ,&nbsp;Ujala Akmal ,&nbsp;Ali Zeeshan Ijaz ,&nbsp;Raja Manzar Abbas","doi":"10.1016/j.ipm.2026.104613","DOIUrl":"10.1016/j.ipm.2026.104613","url":null,"abstract":"<div><div>The recent model and conceptual advancements in Artificial Intelligence (AI) and Natural Language Processing (NLP) are changing how knowledge-intensive tasks are giving more reliable and creative decisions. Particularly, the advancements in NLP has completely changed the way human language is processed, is understood, and is then used for generating human language. These updates are useful for the deeper analysis of textual content that support information systems and their management. For example, in the field of education, these technologies offer advanced and more intelligent tools, which enhance education through improved learning experiences, optimized assessments, and other teaching and study mechanisms. In this paper, we have worked on a unified framework for automatic questions generation (AQG) and educational content analysis. For this purpose, we have developed Questify-TheEduBot, which integrates transformer-based models (BERT, GPT), Latent Dirichlet Allocation (LDA), sentiment analysis, and keyword extraction into a single pipeline. Existing tools typically address isolated tasks but our tool generates multiple types of questions, i.e., multiple choice questions (MCQs), cloze, as well as descriptive type of questions. In addition to AQG, Questify-TheEduBot simultaneously validates semantic coherence, topic coverage, and sentiment appropriateness. We have evaluated and compared our model on SQuAD v2.0 and LearningQ datasets, which consists of over 300,000 Question Answer pairs. Questify-TheEduBot demonstrated excellent performance on the test datasets, with cosine similarity above 0.85, keyword overlap of 87%, and topic modeling precision of 89%. Human evaluation further confirms the pedagogical relevance of generated questions, where our study shows significant improvements over template-based and Seq2Seq competing baseline models. The web-based platform of our tool offers instructors and learners with a tested, interpretable, and resource-efficient tool for automated assessment, support in curriculum development, and enables personalized learning. By merging automated question generation and content analytics, Questify-TheEduBot advances the state of NLP in the field of education, where it provides actionable insights for information management in digital learning environments.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"63 4","pages":"Article 104613"},"PeriodicalIF":6.9,"publicationDate":"2026-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145978436","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Q-value guided Text-to-SQL generation: Structured reasoning meets efficient inference exploration q值引导的文本到sql生成:结构化推理满足高效推理探索
IF 6.9 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-01-15 DOI: 10.1016/j.ipm.2025.104607
Min Tang , Lixin Zou , Shujie Cui , Weiqing Wang , Zhe Jin , Chengliang Li , Shiuan-Ni Liang
Translating natural language queries into executable SQL queries (Text-to-SQL) is essential for human-database interaction but remains a challenging task, especially for complex queries. Existing methods based on large language models (LLMs) often rely on supervised fine-tuning or iterative refinement based on execution success. However, these methods overlook errors that occur during the generation process, leading to error propagation issues. To address this, we propose the Q-value guided Text-to-SQL, which trains a Q-value network to evaluate SQL potential and avoid generation errors. Specifically, QSQL decomposes the complex SQL generation process into simpler and helpful intermediate steps, such as selection, schema linking, sub-query generation, and combination. Then, we adopt Monte Carlo Tree Search (MCTS) to estimate Q-values of each step and incorporate rule-based consensus filtering to eliminate inconsistent/low-quality data, removing the need for extra human annotations. Finally, a Q-value probe network approximates the estimated values for efficient generation. Experimental results on the Spider and Bird benchmarks show that QSQL achieves a superior balance between execution accuracy and inference efficiency. Compared to a baseline MCTS approach, QSQL improves execution accuracy by 9% while reducing inference time to just 11.9%.
将自然语言查询转换为可执行的SQL查询(文本到SQL)对于人-数据库交互至关重要,但仍然是一项具有挑战性的任务,特别是对于复杂的查询。基于大型语言模型(llm)的现有方法通常依赖于基于执行成功的监督微调或迭代改进。然而,这些方法忽略了在生成过程中发生的错误,从而导致错误传播问题。为了解决这个问题,我们提出了q值引导的文本到SQL,它训练一个q值网络来评估SQL的潜力并避免生成错误。具体来说,QSQL将复杂的SQL生成过程分解为更简单和有用的中间步骤,例如选择、模式链接、子查询生成和组合。然后,我们采用蒙特卡罗树搜索(MCTS)来估计每一步的q值,并结合基于规则的共识过滤来消除不一致/低质量的数据,从而消除了额外的人工注释的需要。最后,一个q值探测网络近似于有效生成的估计值。在Spider和Bird基准测试上的实验结果表明,QSQL在执行精度和推理效率之间取得了很好的平衡。与基线MCTS方法相比,QSQL将执行精度提高了9%,同时将推理时间减少到11.9%。
{"title":"Q-value guided Text-to-SQL generation: Structured reasoning meets efficient inference exploration","authors":"Min Tang ,&nbsp;Lixin Zou ,&nbsp;Shujie Cui ,&nbsp;Weiqing Wang ,&nbsp;Zhe Jin ,&nbsp;Chengliang Li ,&nbsp;Shiuan-Ni Liang","doi":"10.1016/j.ipm.2025.104607","DOIUrl":"10.1016/j.ipm.2025.104607","url":null,"abstract":"<div><div>Translating natural language queries into executable SQL queries (Text-to-SQL) is essential for human-database interaction but remains a challenging task, especially for complex queries. Existing methods based on large language models (LLMs) often rely on supervised fine-tuning or iterative refinement based on execution success. However, these methods overlook errors that occur during the generation process, leading to error propagation issues. To address this, we propose the <em>Q</em>-value guided Text-to-SQL, which trains a <em>Q</em>-value network to evaluate SQL potential and avoid generation errors. Specifically, QSQL decomposes the complex SQL generation process into simpler and helpful intermediate steps, such as selection, schema linking, sub-query generation, and combination. Then, we adopt Monte Carlo Tree Search (MCTS) to estimate <em>Q</em>-values of each step and incorporate rule-based consensus filtering to eliminate inconsistent/low-quality data, removing the need for extra human annotations. Finally, a <em>Q</em>-value probe network approximates the estimated values for efficient generation. Experimental results on the <span>Spider</span> and <span>Bird</span> benchmarks show that QSQL achieves a superior balance between execution accuracy and inference efficiency. Compared to a baseline MCTS approach, QSQL improves execution accuracy by <strong>9%</strong> while reducing inference time to just <strong>11.9%</strong>.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"63 4","pages":"Article 104607"},"PeriodicalIF":6.9,"publicationDate":"2026-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145978910","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-view dynamic perception framework for Chinese harmful meme detection 中文有害模因检测的多视角动态感知框架
IF 6.9 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-01-15 DOI: 10.1016/j.ipm.2025.104602
Jiapei Hu , Yifan Lyu , Yifan Chen , Kuntao Li , Yun Xue , Jinghua Liang
Chinese harmful memes convey toxicity on social media through varied semantic complexity and diverse modality combinations. However, existing detection methods typically adopt static architectures with fixed interaction patterns, which lack the flexibility to accurately identify harmful cues embedded in heterogeneous semantic and modal content across different memes, hindering a comprehensive understanding of toxic intent. To address this limitation, we propose the Multi-View Dynamic Perception (MDP) framework, a dynamic interaction paradigm specifically designed for Chinese harmful meme detection. Specifically, we develop five types of semantic perception nodes to synchronously extract features from diverse views. These nodes are densely stacked to form two perception branches, respectively guided by textual and visual features, to effectively capture modality-specific cues. To enhance adaptability, each node is equipped with an independent soft router that dynamically regulates information flow and enables flexible interaction patterns tailored to different memes. Furthermore, we introduce a Hierarchical Mutual Learning module to promote complementary representation learning between the two branches via mutual information maximization. Extensive experiments on the publicly available dataset TOXICN MM, comprising 12,000 samples, demonstrate the effectiveness of the proposed framework, with F1 score improvements of 1.06% in harmful meme detection and 2.77% in harmful type identification over the previous state-of-the-art method. We further evaluate the generalization of the MDP framework on a Chinese multimodal sarcasm detection dataset, where the proposed method also achieves competitive results.
中文有害模因通过不同的语义复杂性和不同的情态组合在社交媒体上传递毒性。然而,现有的检测方法通常采用具有固定交互模式的静态架构,缺乏灵活性,无法准确识别跨不同模因的异构语义和模态内容中嵌入的有害线索,阻碍了对有毒意图的全面理解。为了解决这一限制,我们提出了多视图动态感知(MDP)框架,这是一个专门为中文有害模因检测设计的动态交互范式。具体来说,我们开发了五种类型的语义感知节点来同步提取不同视图的特征。这些节点密集堆叠形成两个感知分支,分别由文本和视觉特征引导,以有效捕获特定于模态的线索。为了增强适应性,每个节点都配备了独立的软路由器,动态调节信息流,实现针对不同模因的灵活交互模式。此外,我们引入了一个分层互学习模块,通过互信息最大化来促进两个分支之间的互补表示学习。在公开数据集TOXICN MM上进行的大量实验,包括12,000个样本,证明了所提出框架的有效性,与之前最先进的方法相比,有害模因检测的F1分数提高了1.06%,有害类型识别的F1分数提高了2.77%。我们进一步评估了MDP框架在中文多模态讽刺检测数据集上的泛化效果,该方法也取得了具有竞争力的结果。
{"title":"Multi-view dynamic perception framework for Chinese harmful meme detection","authors":"Jiapei Hu ,&nbsp;Yifan Lyu ,&nbsp;Yifan Chen ,&nbsp;Kuntao Li ,&nbsp;Yun Xue ,&nbsp;Jinghua Liang","doi":"10.1016/j.ipm.2025.104602","DOIUrl":"10.1016/j.ipm.2025.104602","url":null,"abstract":"<div><div>Chinese harmful memes convey toxicity on social media through varied semantic complexity and diverse modality combinations. However, existing detection methods typically adopt static architectures with fixed interaction patterns, which lack the flexibility to accurately identify harmful cues embedded in heterogeneous semantic and modal content across different memes, hindering a comprehensive understanding of toxic intent. To address this limitation, we propose the Multi-View Dynamic Perception (MDP) framework, a dynamic interaction paradigm specifically designed for Chinese harmful meme detection. Specifically, we develop five types of semantic perception nodes to synchronously extract features from diverse views. These nodes are densely stacked to form two perception branches, respectively guided by textual and visual features, to effectively capture modality-specific cues. To enhance adaptability, each node is equipped with an independent soft router that dynamically regulates information flow and enables flexible interaction patterns tailored to different memes. Furthermore, we introduce a Hierarchical Mutual Learning module to promote complementary representation learning between the two branches via mutual information maximization. Extensive experiments on the publicly available dataset TOXICN MM, comprising 12,000 samples, demonstrate the effectiveness of the proposed framework, with F1 score improvements of 1.06% in harmful meme detection and 2.77% in harmful type identification over the previous state-of-the-art method. We further evaluate the generalization of the MDP framework on a Chinese multimodal sarcasm detection dataset, where the proposed method also achieves competitive results.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"63 4","pages":"Article 104602"},"PeriodicalIF":6.9,"publicationDate":"2026-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145978435","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
NeuPath: A hybrid learning-based optimization approach for emergency search path planning NeuPath:一种基于混合学习的紧急搜索路径规划优化方法
IF 6.9 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-01-14 DOI: 10.1016/j.ipm.2026.104615
Yingying Gao , Tianle Pu , Qingqing Yang , Zhiwei Yang , Kewei Yang , Changjun Fan
Search operations play a vital role in humanitarian emergencies, where maximizing the probability of finding survivors requires highly efficient solutions. A key challenge lies in the limitations of existing methods, which often struggle with high-dimensional constraints and sparse decision spaces, particularly in large-scale scenarios. To address this, we propose NeuPath, a hybrid learning-based optimization framework that accelerates the discovery of high-quality solutions. NeuPath first formulates the optimal search problem (OSP) as a bipartite graph representation to enhance feature extraction and scalability. It then predicts an initial solution using a graph neural network (GNN) augmented with a two-stage aggregation mechanism, followed by refinement via a block-wise trust-region scheme. Extensive experiments on OSP static scenarios (500 instances) demonstrate that NeuPath achieves significant speedups over exact solvers, with performance gains of 2.48 ×  (Gurobi) and 2.74 ×  (SCIP) across varying problem sizes. For large-scale random scenarios (500 instances), the solution quality of this method also significantly exceeds that of the exact solver in a finite time (3600s). Moreover, the framework exhibits strong generalization capabilities by learning meaningful problem structure features. Ablation studies further validate the effectiveness of each module.
搜索行动在人道主义紧急情况中发挥着至关重要的作用,在这种情况下,最大限度地找到幸存者的可能性需要高效的解决方案。一个关键的挑战在于现有方法的局限性,这些方法经常与高维约束和稀疏决策空间作斗争,特别是在大规模场景中。为了解决这个问题,我们提出了NeuPath,这是一个基于学习的混合优化框架,可以加速发现高质量的解决方案。NeuPath首先将最优搜索问题(OSP)表述为二部图表示,以增强特征提取和可扩展性。然后,它使用带有两阶段聚合机制的图神经网络(GNN)预测初始解决方案,然后通过块方向的信任区域方案进行细化。在OSP静态场景(500个实例)上进行的大量实验表明,NeuPath比精确求解器实现了显著的加速,在不同的问题规模上,性能提高了2.48 × (Gurobi)和2.74 × (SCIP)。对于大规模随机场景(500个实例),该方法在有限时间(3600s)内的求解质量也明显优于精确求解器。此外,该框架通过学习有意义的问题结构特征,表现出较强的泛化能力。烧蚀研究进一步验证了各模块的有效性。
{"title":"NeuPath: A hybrid learning-based optimization approach for emergency search path planning","authors":"Yingying Gao ,&nbsp;Tianle Pu ,&nbsp;Qingqing Yang ,&nbsp;Zhiwei Yang ,&nbsp;Kewei Yang ,&nbsp;Changjun Fan","doi":"10.1016/j.ipm.2026.104615","DOIUrl":"10.1016/j.ipm.2026.104615","url":null,"abstract":"<div><div>Search operations play a vital role in humanitarian emergencies, where maximizing the probability of finding survivors requires highly efficient solutions. A key challenge lies in the limitations of existing methods, which often struggle with high-dimensional constraints and sparse decision spaces, particularly in large-scale scenarios. To address this, we propose NeuPath, a hybrid learning-based optimization framework that accelerates the discovery of high-quality solutions. NeuPath first formulates the optimal search problem (OSP) as a bipartite graph representation to enhance feature extraction and scalability. It then predicts an initial solution using a graph neural network (GNN) augmented with a two-stage aggregation mechanism, followed by refinement via a block-wise trust-region scheme. Extensive experiments on OSP static scenarios (500 instances) demonstrate that NeuPath achieves significant speedups over exact solvers, with performance gains of 2.48 ×  (Gurobi) and 2.74 ×  (SCIP) across varying problem sizes. For large-scale random scenarios (500 instances), the solution quality of this method also significantly exceeds that of the exact solver in a finite time (3600s). Moreover, the framework exhibits strong generalization capabilities by learning meaningful problem structure features. Ablation studies further validate the effectiveness of each module.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"63 4","pages":"Article 104615"},"PeriodicalIF":6.9,"publicationDate":"2026-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145978434","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
GRAIN: Gravity-resistance adaptive framework for identifying influential nodes using multi-order structural diversity GRAIN:利用多阶结构多样性识别影响节点的重力-阻力自适应框架
IF 6.9 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-01-13 DOI: 10.1016/j.ipm.2026.104618
Yirun Ruan , Xinghua Qin , Sizheng Liu , Mengmeng Zhang , Jun Tang , Yanming Guo , Tianyuan Yu
Identifying influential spreaders is critical for applications like disease control and information dissemination. Existing methods often face trade-offs among accuracy, resolution, and computational cost. To achieve a more balanced and accurate identification, we propose GRAIN (Gravity-Resistance Adaptive Influential Node identification framework), a novel approach that effectively combines multi-order structural diversity and topological cohesion for influence assessment. Specifically, GRAIN introduces two key innovations: (1) A node-adaptive propagation diversity metric based on k-shell entropy. This metric dynamically balances the contributions of a node's direct (1-hop) and indirect (2-hop) neighbors to its diversity through an adaptive weighting parameter λ(v). (2) Resistance distance Ωij to quantify the topological cohesion between nodes i and j, capturing the "multiple-route distance diminishment" effect observed in networks. We integrate these components within a physics-inspired gravity model: A node's "mass" combines its core position (k-shell) and neighborhood propagation diversity, while "distance" uses Ωij with exponential attenuation for farther neighbors. A node’s total influence is the sum of gravitational forces from its 1-hop and 2-hop neighbors. Evaluations on 20 real-world and 15 synthetic networks using the Susceptible-Infected-Recovered (SIR) model show that GRAIN significantly outperforms a range of benchmarks in the majority of cases, achieving higher kendall’s tau correlation across a wide range of transmission rates and network topologies.
确定有影响力的传播者对于疾病控制和信息传播等应用至关重要。现有的方法经常面临精度、分辨率和计算成本之间的权衡。为了实现更平衡和准确的识别,我们提出了一种将多阶结构多样性和拓扑内聚有效地结合起来进行影响评估的新方法GRAIN(重力-阻力自适应影响节点识别框架)。具体来说,GRAIN引入了两个关键创新:(1)基于k壳熵的节点自适应传播多样性度量。该度量通过自适应加权参数λ(v)动态平衡节点的直接(1跳)和间接(2跳)邻居对其多样性的贡献。(2)阻力距离Ωij量化节点i和j之间的拓扑内聚性,捕捉网络中观察到的“多路距离递减”效应。我们将这些组件集成到一个物理启发的重力模型中:节点的“质量”结合了它的核心位置(k壳)和邻域传播多样性,而“距离”使用Ωij和指数衰减来表示更远的邻居。一个节点的总影响是来自它的1跳和2跳邻居的引力的总和。使用敏感性-感染-恢复(SIR)模型对20个真实网络和15个合成网络进行的评估表明,在大多数情况下,GRAIN显著优于一系列基准,在广泛的传输速率和网络拓扑结构范围内实现更高的肯德尔tau相关性。
{"title":"GRAIN: Gravity-resistance adaptive framework for identifying influential nodes using multi-order structural diversity","authors":"Yirun Ruan ,&nbsp;Xinghua Qin ,&nbsp;Sizheng Liu ,&nbsp;Mengmeng Zhang ,&nbsp;Jun Tang ,&nbsp;Yanming Guo ,&nbsp;Tianyuan Yu","doi":"10.1016/j.ipm.2026.104618","DOIUrl":"10.1016/j.ipm.2026.104618","url":null,"abstract":"<div><div>Identifying influential spreaders is critical for applications like disease control and information dissemination. Existing methods often face trade-offs among accuracy, resolution, and computational cost. To achieve a more balanced and accurate identification, we propose GRAIN (Gravity-Resistance Adaptive Influential Node identification framework), a novel approach that effectively combines multi-order structural diversity and topological cohesion for influence assessment. Specifically, GRAIN introduces two key innovations: (1) A node-adaptive propagation diversity metric based on k-shell entropy. This metric dynamically balances the contributions of a node's direct (1-hop) and indirect (2-hop) neighbors to its diversity through an adaptive weighting parameter <em>λ</em>(<em>v</em>). (2) Resistance distance Ω<em><sub>ij</sub></em> to quantify the topological cohesion between nodes <em>i</em> and <em>j</em>, capturing the \"multiple-route distance diminishment\" effect observed in networks. We integrate these components within a physics-inspired gravity model: A node's \"mass\" combines its core position (k-shell) and neighborhood propagation diversity, while \"distance\" uses Ω<em><sub>ij</sub></em> with exponential attenuation for farther neighbors. A node’s total influence is the sum of gravitational forces from its 1-hop and 2-hop neighbors. Evaluations on 20 real-world and 15 synthetic networks using the Susceptible-Infected-Recovered (SIR) model show that GRAIN significantly outperforms a range of benchmarks in the majority of cases, achieving higher kendall’s tau correlation across a wide range of transmission rates and network topologies.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"63 4","pages":"Article 104618"},"PeriodicalIF":6.9,"publicationDate":"2026-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145978422","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Knowledge-based visual question classification using quaternion hypergraph consistent network 基于四元数超图一致网络的知识视觉问题分类
IF 6.9 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-01-13 DOI: 10.1016/j.ipm.2025.104591
Jing Wang , Duantengchuan Li , Xu Du , Hao Li , Zhuang Hu
Visual questions are an important means to evaluate students’ knowledge. Knowledge-based visual question classification can effectively excavate the knowledge intention of the question, and realize the effective organization and management of online question resources at the knowledge level. The existing methods simply regard it as a multimodal classification task, ignoring the capture of implicit knowledge information and the fine-grained interactions between multimodal and multi-granularity features. To mitigate this, we propose a quaternion hypergraph consistent network (QHCN). This approach can not only extract explicit semantic features and implicit knowledge features from text and images simultaneously, but also considers three key properties among explicit-implicit features: modality complementation, modality independence, and knowledge consistency. Specifically, a visual question is represented as a quaternion vector consisting of two modalities and four-dimensional features. To achieve multimodal complementation, the consistency of vision and language guides the construction of a quaternion hypergraph, and a quaternion convolution operator deeply fuses explicit-implicit features. To capture inter-dependencies between explicit-implicit features, the independence loss and knowledge consistency loss are designed to optimize hypergraph network parameters and enhance the hypergraph structure. Extensive experiments on visual question sets verify that our QHCN achieved an accuracy of 94.82% and an F1 score of 94.76%, outperforming the optimal baseline by +1.46% and +1.53%, respectively.
可视化问题是评价学生知识水平的重要手段。基于知识的可视化问题分类可以有效挖掘问题的知识意图,实现对在线问题资源在知识层面的有效组织和管理。现有方法简单地将其视为一个多模态分类任务,忽略了隐性知识信息的获取以及多模态和多粒度特征之间的细粒度交互。为了解决这个问题,我们提出了一个四元数超图一致网络(QHCN)。该方法不仅可以同时从文本和图像中提取显式语义特征和隐含知识特征,而且考虑了显式和隐含特征之间的三个关键特性:情态互补、情态独立性和知识一致性。具体来说,视觉问题被表示为由两个模态和四维特征组成的四元数向量。为了实现多模态互补,视觉和语言的一致性指导了四元数超图的构造,四元数卷积算子深度融合了显式-隐式特征。为了捕获显隐特征之间的相互依赖关系,设计了独立性损失和知识一致性损失来优化超图网络参数,增强超图结构。在视觉问题集上的大量实验验证了我们的QHCN达到了94.82%的准确率和94.76%的F1分数,分别比最优基线提高了+1.46%和+1.53%。
{"title":"Knowledge-based visual question classification using quaternion hypergraph consistent network","authors":"Jing Wang ,&nbsp;Duantengchuan Li ,&nbsp;Xu Du ,&nbsp;Hao Li ,&nbsp;Zhuang Hu","doi":"10.1016/j.ipm.2025.104591","DOIUrl":"10.1016/j.ipm.2025.104591","url":null,"abstract":"<div><div>Visual questions are an important means to evaluate students’ knowledge. Knowledge-based visual question classification can effectively excavate the knowledge intention of the question, and realize the effective organization and management of online question resources at the knowledge level. The existing methods simply regard it as a multimodal classification task, ignoring the capture of implicit knowledge information and the fine-grained interactions between multimodal and multi-granularity features. To mitigate this, we propose a quaternion hypergraph consistent network (QHCN). This approach can not only extract explicit semantic features and implicit knowledge features from text and images simultaneously, but also considers three key properties among explicit-implicit features: modality complementation, modality independence, and knowledge consistency. Specifically, a visual question is represented as a quaternion vector consisting of two modalities and four-dimensional features. To achieve multimodal complementation, the consistency of vision and language guides the construction of a quaternion hypergraph, and a quaternion convolution operator deeply fuses explicit-implicit features. To capture inter-dependencies between explicit-implicit features, the independence loss and knowledge consistency loss are designed to optimize hypergraph network parameters and enhance the hypergraph structure. Extensive experiments on visual question sets verify that our QHCN achieved an accuracy of 94.82% and an F1 score of 94.76%, outperforming the optimal baseline by +1.46% and +1.53%, respectively.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"63 4","pages":"Article 104591"},"PeriodicalIF":6.9,"publicationDate":"2026-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145978416","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing large language models for knowledge graph question answering via multi-granularity knowledge injection and structured reasoning path-augmented prompting 通过多粒度知识注入和结构化推理路径增强提示增强知识图问答的大型语言模型
IF 6.9 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-01-13 DOI: 10.1016/j.ipm.2026.104614
Chuanyang Gong , Zhihua Wei , Wenhao Tao , Duoqian Miao
Large language models (LLMs) often exhibit factual errors when handling complex knowledge reasoning. To address this issue, we propose MGPrompt, a novel knowledge graph question answering (KGQA) framework that enhances LLM performance by integrating multi-granularity knowledge with structured reasoning path-augmented prompting. MGPrompt consists of three core modules-knowledge refinement, semantic association, and information fusion-to dynamically filter and integrate entity-level, relation-level, and subgraph-level knowledge retrieved from the knowledge graph. Subsequently, we inject these refined semantic representations as prefix vectors into the LLM and fine-tune the model using Low-Rank Adaptation (LoRA) to guide it in generating accurate reasoning paths. We conducted extensive experiments on two benchmark datasets, WebQSP and CWQ. The results show that MGPrompt achieves highly competitive performance compared to 30 baseline methods. Experimental results show that MGPrompt achieves highly competitive performance on both WebQSP and CWQ; in particular, it improves the Hits@1 score on WebQSP by 1.1% over the strongest baseline (85.7%), thereby clearly demonstrating the effectiveness of the proposed framework for complex KGQA tasks.
大型语言模型(llm)在处理复杂的知识推理时经常出现事实错误。为了解决这个问题,我们提出了MGPrompt,这是一个新的知识图问答(KGQA)框架,通过将多粒度知识与结构化推理路径增强提示集成在一起来提高LLM的性能。MGPrompt由知识精化、语义关联和信息融合三个核心模块组成,用于动态过滤和集成从知识图中检索到的实体级、关系级和子图级知识。随后,我们将这些精炼的语义表示作为前缀向量注入到LLM中,并使用低秩自适应(Low-Rank Adaptation, LoRA)对模型进行微调,引导其生成准确的推理路径。我们在WebQSP和CWQ两个基准数据集上进行了大量的实验。结果表明,与30种基准方法相比,MGPrompt实现了极具竞争力的性能。实验结果表明,MGPrompt在WebQSP和CWQ上都具有很强的竞争力;特别是,它在WebQSP上的Hits@1得分比最强基线(85.7%)提高了1.1%,从而清楚地证明了所提出的框架对于复杂的KGQA任务的有效性。
{"title":"Enhancing large language models for knowledge graph question answering via multi-granularity knowledge injection and structured reasoning path-augmented prompting","authors":"Chuanyang Gong ,&nbsp;Zhihua Wei ,&nbsp;Wenhao Tao ,&nbsp;Duoqian Miao","doi":"10.1016/j.ipm.2026.104614","DOIUrl":"10.1016/j.ipm.2026.104614","url":null,"abstract":"<div><div>Large language models (LLMs) often exhibit factual errors when handling complex knowledge reasoning. To address this issue, we propose MGPrompt, a novel knowledge graph question answering (KGQA) framework that enhances LLM performance by integrating multi-granularity knowledge with structured reasoning path-augmented prompting. MGPrompt consists of three core modules-knowledge refinement, semantic association, and information fusion-to dynamically filter and integrate entity-level, relation-level, and subgraph-level knowledge retrieved from the knowledge graph. Subsequently, we inject these refined semantic representations as prefix vectors into the LLM and fine-tune the model using Low-Rank Adaptation (LoRA) to guide it in generating accurate reasoning paths. We conducted extensive experiments on two benchmark datasets, WebQSP and CWQ. The results show that MGPrompt achieves highly competitive performance compared to 30 baseline methods. Experimental results show that MGPrompt achieves highly competitive performance on both WebQSP and CWQ; in particular, it improves the Hits@1 score on WebQSP by 1.1% over the strongest baseline (85.7%), thereby clearly demonstrating the effectiveness of the proposed framework for complex KGQA tasks.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"63 4","pages":"Article 104614"},"PeriodicalIF":6.9,"publicationDate":"2026-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145978423","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Performance unfairness of large language models in cross-language fact-checking 大型语言模型在跨语言事实检验中的性能不公平性
IF 6.9 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-01-13 DOI: 10.1016/j.ipm.2026.104616
Dandan Wang , Stephanie Jean Tsang , Yadong Zhou
Large language models (LLMs) are increasingly used for automated fact-checking, yet their performance often varies across languages, raising global fairness concerns. This study evaluated cross-language inequality in LLM-based fact-checking using 4,500 claims spanning nine languages across six language families. Besides building a systematic performance-evaluation pipeline covering instruction following, authenticity classification, evidence generation, and checking-worthiness scoring, we quantified inequality using standard deviation, coefficient of variation, Gini coefficient, and Theil index. Results showed substantial cross-language disparities, with higher performance on claims from rich-resource languages. To mitigate inequality, we tested two interventions, role-restricted prompt engineering and model fine-tuning. Both approaches reduced disparities, with fine-tuning achieving the largest and most consistent improvement across languages, particularly in checking-worthiness scoring. This study provides a reproducible framework for quantifying multilingual performance and fairness in LLM-based fact-checking and offers practical guidance for developing more equitable verification systems across diverse linguistic contexts.
大型语言模型(llm)越来越多地用于自动事实检查,但它们的性能通常因语言而不同,这引起了全球公平性问题。这项研究评估了基于法学硕士的事实核查中的跨语言不平等,使用了跨越6个语系的9种语言的4,500个索赔。构建了包括指令遵循、真实性分类、证据生成和检验价值评分在内的系统绩效评价管道,并使用标准差、变异系数、基尼系数和泰尔指数对不平等进行量化。结果显示了显著的跨语言差异,在资源丰富的语言中表现更好。为了减轻不平等,我们测试了两种干预措施,角色限制提示工程和模型微调。这两种方法都减少了差异,通过微调实现了跨语言的最大和最一致的改进,特别是在检查值评分方面。本研究为量化基于法学硕士的事实核查中的多语言表现和公平性提供了一个可重复的框架,并为在不同语言背景下开发更公平的核查系统提供了实践指导。
{"title":"Performance unfairness of large language models in cross-language fact-checking","authors":"Dandan Wang ,&nbsp;Stephanie Jean Tsang ,&nbsp;Yadong Zhou","doi":"10.1016/j.ipm.2026.104616","DOIUrl":"10.1016/j.ipm.2026.104616","url":null,"abstract":"<div><div>Large language models (LLMs) are increasingly used for automated fact-checking, yet their performance often varies across languages, raising global fairness concerns. This study evaluated cross-language inequality in LLM-based fact-checking using 4,500 claims spanning nine languages across six language families. Besides building a systematic performance-evaluation pipeline covering instruction following, authenticity classification, evidence generation, and checking-worthiness scoring, we quantified inequality using standard deviation, coefficient of variation, Gini coefficient, and Theil index. Results showed substantial cross-language disparities, with higher performance on claims from rich-resource languages. To mitigate inequality, we tested two interventions, role-restricted prompt engineering and model fine-tuning. Both approaches reduced disparities, with fine-tuning achieving the largest and most consistent improvement across languages, particularly in checking-worthiness scoring. This study provides a reproducible framework for quantifying multilingual performance and fairness in LLM-based fact-checking and offers practical guidance for developing more equitable verification systems across diverse linguistic contexts.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"63 4","pages":"Article 104616"},"PeriodicalIF":6.9,"publicationDate":"2026-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145978415","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Information Processing & Management
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1