Pub Date : 2026-06-01Epub Date: 2026-01-13DOI: 10.1016/j.ipm.2026.104614
Chuanyang Gong , Zhihua Wei , Wenhao Tao , Duoqian Miao
Large language models (LLMs) often exhibit factual errors when handling complex knowledge reasoning. To address this issue, we propose MGPrompt, a novel knowledge graph question answering (KGQA) framework that enhances LLM performance by integrating multi-granularity knowledge with structured reasoning path-augmented prompting. MGPrompt consists of three core modules-knowledge refinement, semantic association, and information fusion-to dynamically filter and integrate entity-level, relation-level, and subgraph-level knowledge retrieved from the knowledge graph. Subsequently, we inject these refined semantic representations as prefix vectors into the LLM and fine-tune the model using Low-Rank Adaptation (LoRA) to guide it in generating accurate reasoning paths. We conducted extensive experiments on two benchmark datasets, WebQSP and CWQ. The results show that MGPrompt achieves highly competitive performance compared to 30 baseline methods. Experimental results show that MGPrompt achieves highly competitive performance on both WebQSP and CWQ; in particular, it improves the Hits@1 score on WebQSP by 1.1% over the strongest baseline (85.7%), thereby clearly demonstrating the effectiveness of the proposed framework for complex KGQA tasks.
{"title":"Enhancing large language models for knowledge graph question answering via multi-granularity knowledge injection and structured reasoning path-augmented prompting","authors":"Chuanyang Gong , Zhihua Wei , Wenhao Tao , Duoqian Miao","doi":"10.1016/j.ipm.2026.104614","DOIUrl":"10.1016/j.ipm.2026.104614","url":null,"abstract":"<div><div>Large language models (LLMs) often exhibit factual errors when handling complex knowledge reasoning. To address this issue, we propose MGPrompt, a novel knowledge graph question answering (KGQA) framework that enhances LLM performance by integrating multi-granularity knowledge with structured reasoning path-augmented prompting. MGPrompt consists of three core modules-knowledge refinement, semantic association, and information fusion-to dynamically filter and integrate entity-level, relation-level, and subgraph-level knowledge retrieved from the knowledge graph. Subsequently, we inject these refined semantic representations as prefix vectors into the LLM and fine-tune the model using Low-Rank Adaptation (LoRA) to guide it in generating accurate reasoning paths. We conducted extensive experiments on two benchmark datasets, WebQSP and CWQ. The results show that MGPrompt achieves highly competitive performance compared to 30 baseline methods. Experimental results show that MGPrompt achieves highly competitive performance on both WebQSP and CWQ; in particular, it improves the Hits@1 score on WebQSP by 1.1% over the strongest baseline (85.7%), thereby clearly demonstrating the effectiveness of the proposed framework for complex KGQA tasks.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"63 4","pages":"Article 104614"},"PeriodicalIF":6.9,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145978423","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-06-01Epub Date: 2026-01-14DOI: 10.1016/j.ipm.2026.104615
Yingying Gao , Tianle Pu , Qingqing Yang , Zhiwei Yang , Kewei Yang , Changjun Fan
Search operations play a vital role in humanitarian emergencies, where maximizing the probability of finding survivors requires highly efficient solutions. A key challenge lies in the limitations of existing methods, which often struggle with high-dimensional constraints and sparse decision spaces, particularly in large-scale scenarios. To address this, we propose NeuPath, a hybrid learning-based optimization framework that accelerates the discovery of high-quality solutions. NeuPath first formulates the optimal search problem (OSP) as a bipartite graph representation to enhance feature extraction and scalability. It then predicts an initial solution using a graph neural network (GNN) augmented with a two-stage aggregation mechanism, followed by refinement via a block-wise trust-region scheme. Extensive experiments on OSP static scenarios (500 instances) demonstrate that NeuPath achieves significant speedups over exact solvers, with performance gains of 2.48 × (Gurobi) and 2.74 × (SCIP) across varying problem sizes. For large-scale random scenarios (500 instances), the solution quality of this method also significantly exceeds that of the exact solver in a finite time (3600s). Moreover, the framework exhibits strong generalization capabilities by learning meaningful problem structure features. Ablation studies further validate the effectiveness of each module.
{"title":"NeuPath: A hybrid learning-based optimization approach for emergency search path planning","authors":"Yingying Gao , Tianle Pu , Qingqing Yang , Zhiwei Yang , Kewei Yang , Changjun Fan","doi":"10.1016/j.ipm.2026.104615","DOIUrl":"10.1016/j.ipm.2026.104615","url":null,"abstract":"<div><div>Search operations play a vital role in humanitarian emergencies, where maximizing the probability of finding survivors requires highly efficient solutions. A key challenge lies in the limitations of existing methods, which often struggle with high-dimensional constraints and sparse decision spaces, particularly in large-scale scenarios. To address this, we propose NeuPath, a hybrid learning-based optimization framework that accelerates the discovery of high-quality solutions. NeuPath first formulates the optimal search problem (OSP) as a bipartite graph representation to enhance feature extraction and scalability. It then predicts an initial solution using a graph neural network (GNN) augmented with a two-stage aggregation mechanism, followed by refinement via a block-wise trust-region scheme. Extensive experiments on OSP static scenarios (500 instances) demonstrate that NeuPath achieves significant speedups over exact solvers, with performance gains of 2.48 × (Gurobi) and 2.74 × (SCIP) across varying problem sizes. For large-scale random scenarios (500 instances), the solution quality of this method also significantly exceeds that of the exact solver in a finite time (3600s). Moreover, the framework exhibits strong generalization capabilities by learning meaningful problem structure features. Ablation studies further validate the effectiveness of each module.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"63 4","pages":"Article 104615"},"PeriodicalIF":6.9,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145978434","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-06-01Epub Date: 2026-01-06DOI: 10.1016/j.ipm.2025.104595
Tongji Chen , Guoliang Zou , Shizhe Hu, Yangdong Ye
Multi-modal clustering aims to exploit relationships between different modalities to enhance clustering performance. However, existing methods face two main challenges. First, feature extraction fails to fully utilize the relationships between samples from different modalities, which is crucial for capturing global multi-modal information. Second, most clustering methods struggle to correct significantly erroneous assignments. To address these challenges, we propose Cross-modal Information Propagation for Contrastive Multi-modal Clustering (CIPCMC), a novel method driven by cross-modal information (CMI) and contrastive learning. We progressively obtain private CMI and integrate it into a unified CMI, which is then propagated to optimize the entire model. First, a cross-attention mechanism introduces CMI for each modality, enabling the model to focus on relationships between different modalities. This allows the model to uncover semantic associations and effectively exploit the complementary nature of multi-modal data. Next, we fuse modality-specific representations to derive a unified CMI representation, which helps each modality correct erroneous assignments, leading to high-confidence clustering. The end-to-end training of CIPCMC ensures module synergy, improving performance and generalization. Experiments on challenging datasets show that CIPCMC outperforms existing methods, achieving accuracy improvements of 10.0% on the Caltech-3M dataset and 16.6% on the PBMC dataset.
{"title":"Cross-modal information propagation for contrastive multi-modal clustering","authors":"Tongji Chen , Guoliang Zou , Shizhe Hu, Yangdong Ye","doi":"10.1016/j.ipm.2025.104595","DOIUrl":"10.1016/j.ipm.2025.104595","url":null,"abstract":"<div><div>Multi-modal clustering aims to exploit relationships between different modalities to enhance clustering performance. However, existing methods face two main challenges. First, feature extraction fails to fully utilize the relationships between samples from different modalities, which is crucial for capturing global multi-modal information. Second, most clustering methods struggle to correct significantly erroneous assignments. To address these challenges, we propose Cross-modal Information Propagation for Contrastive Multi-modal Clustering (CIPCMC), a novel method driven by cross-modal information (CMI) and contrastive learning. We progressively obtain private CMI and integrate it into a unified CMI, which is then propagated to optimize the entire model. First, a cross-attention mechanism introduces CMI for each modality, enabling the model to focus on relationships between different modalities. This allows the model to uncover semantic associations and effectively exploit the complementary nature of multi-modal data. Next, we fuse modality-specific representations to derive a unified CMI representation, which helps each modality correct erroneous assignments, leading to high-confidence clustering. The end-to-end training of CIPCMC ensures module synergy, improving performance and generalization. Experiments on challenging datasets show that CIPCMC outperforms existing methods, achieving accuracy improvements of 10.0% on the Caltech-3M dataset and 16.6% on the PBMC dataset.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"63 4","pages":"Article 104595"},"PeriodicalIF":6.9,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145927588","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-06-01Epub Date: 2026-01-09DOI: 10.1016/j.ipm.2026.104611
Ke Huang , Chenghao Xiao , Yao Xiao , Ming Cai , Noura Al Moubayed
Large Language Models (LLMs) have advanced zero-shot Information Extraction (IE), particularly in Sentence-level Relation Extraction (SentRE), through in-context learning and instruction tuning. However, the current evaluation of LLMs’ zero-shot ability on IE tasks remains fragile and unreliable. In this work, we provide a systematic examination of the fragility underlying current evaluation practices across three interrelated levels. At the data level, we demonstrate that the commonly adopted random sampling strategy introduces significant biases in class-imbalanced datasets, whereas balanced sampling provides more stable and faithful assessments of LLMs performance. At the task level, we reveal that three domain prompt frameworks on SentRE transfer inconsistently to Document-level Relation Extraction (DocRE) and Named Entity Recognition (NER), showing partial effectiveness on NER but notable limitations on DocRE due to long contexts and complex entity structures. At the method level, through extensive experiments on three IE tasks and seven datasets, we conduct the first comprehensive comparison of five general prompt frameworks, including Chain-of-Thought, Self-Improvement, and Self-Debate, showing that prompt effectiveness is highly task-dependent, with no single strategy dominating across tasks. For each task, the CoT prompt framework achieves the best performance on SentRE, the Vanilla prompt framework performs best on DocRE, and the Self-Consistency prompt framework excels on NER. These insights challenge current landscape of information extraction, providing guidelines for robust evaluation and prompt designs.
{"title":"Reevaluating zero-shot information extraction: Sampling bias, prompting transferability and sensitivity in large language models","authors":"Ke Huang , Chenghao Xiao , Yao Xiao , Ming Cai , Noura Al Moubayed","doi":"10.1016/j.ipm.2026.104611","DOIUrl":"10.1016/j.ipm.2026.104611","url":null,"abstract":"<div><div>Large Language Models (LLMs) have advanced zero-shot Information Extraction (IE), particularly in Sentence-level Relation Extraction (SentRE), through in-context learning and instruction tuning. However, the current evaluation of LLMs’ zero-shot ability on IE tasks remains fragile and unreliable. In this work, we provide a systematic examination of the fragility underlying current evaluation practices across three interrelated levels. At the data level, we demonstrate that the commonly adopted random sampling strategy introduces significant biases in class-imbalanced datasets, whereas balanced sampling provides more stable and faithful assessments of LLMs performance. At the task level, we reveal that three domain prompt frameworks on SentRE transfer inconsistently to Document-level Relation Extraction (DocRE) and Named Entity Recognition (NER), showing partial effectiveness on NER but notable limitations on DocRE due to long contexts and complex entity structures. At the method level, through extensive experiments on three IE tasks and seven datasets, we conduct the first comprehensive comparison of five general prompt frameworks, including Chain-of-Thought, Self-Improvement, and Self-Debate, showing that prompt effectiveness is highly task-dependent, with no single strategy dominating across tasks. For each task, the CoT prompt framework achieves the best performance on SentRE, the Vanilla prompt framework performs best on DocRE, and the Self-Consistency prompt framework excels on NER. These insights challenge current landscape of information extraction, providing guidelines for robust evaluation and prompt designs.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"63 4","pages":"Article 104611"},"PeriodicalIF":6.9,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145927678","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-06-01Epub Date: 2026-01-02DOI: 10.1016/j.ipm.2025.104586
Chenhui Shi , Yongjie Xin , Haifeng Yang , Jianghui Cai , Jie Wang , Lichan Zhou , Yanting He , Fuxing Cui , Xujun Zhao , Yaling Xun
Graph-based multi-view clustering methods have gained considerable attention in recent years. However, most existing techniques ignore the association of graph and feature distributions between different views. In addition, noise and redundant information in data will leads to an inability to accurately learn consistent distributions among multiple views. To overcome these issues, this study proposes a framework termed “multi-view clustering based on the association of graph structure and feature distribution” (MLGF). Specifically, we provide collaborative training based on a similar distribution comparison mechanism that unifies the graph structures and feature distributions of different views, to construct multiple high-quality similarity matrices. Noisy information is effectively eliminated from the raw data by embedding graph spectral decomposition and automatic weighting methods into the graph encoder to learn clean, low-dimensional embedded representations of the data. Finally, multiple similarity matrices are fused in a locally weighted manner to obtain consistent similarity matrices. Experiments on five benchmark datasets demonstrated the superiority of our method, achieving 100%, 97.28% on COIL-20 and Handwritten datasets. This is attributed to the effective joint optimization of graph structure and feature distribution, which is validated by its outstanding performance across diverse datasets. The code will be available at https://github.com/shichenhui/MLGF.
{"title":"Multi-view clustering based on the association of graph structure and feature distribution","authors":"Chenhui Shi , Yongjie Xin , Haifeng Yang , Jianghui Cai , Jie Wang , Lichan Zhou , Yanting He , Fuxing Cui , Xujun Zhao , Yaling Xun","doi":"10.1016/j.ipm.2025.104586","DOIUrl":"10.1016/j.ipm.2025.104586","url":null,"abstract":"<div><div>Graph-based multi-view clustering methods have gained considerable attention in recent years. However, most existing techniques ignore the association of graph and feature distributions between different views. In addition, noise and redundant information in data will leads to an inability to accurately learn consistent distributions among multiple views. To overcome these issues, this study proposes a framework termed “multi-view clustering based on the association of graph structure and feature distribution” (MLGF). Specifically, we provide collaborative training based on a similar distribution comparison mechanism that unifies the graph structures and feature distributions of different views, to construct multiple high-quality similarity matrices. Noisy information is effectively eliminated from the raw data by embedding graph spectral decomposition and automatic weighting methods into the graph encoder to learn clean, low-dimensional embedded representations of the data. Finally, multiple similarity matrices are fused in a locally weighted manner to obtain consistent similarity matrices. Experiments on five benchmark datasets demonstrated the superiority of our method, achieving 100%, 97.28% on COIL-20 and Handwritten datasets. This is attributed to the effective joint optimization of graph structure and feature distribution, which is validated by its outstanding performance across diverse datasets. The code will be available at <span><span>https://github.com/shichenhui/MLGF</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"63 4","pages":"Article 104586"},"PeriodicalIF":6.9,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145886174","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-06-01Epub Date: 2026-01-02DOI: 10.1016/j.ipm.2025.104587
Xinru Han , Yukun Bao , Jianming Zhan , Yufeng Shen
Existing multi-criteria sorting methods predominantly rely on preset classification thresholds or fixed numbers of alternatives for classification, exhibiting strong subjectivity and overlooking potential consensus correlations between classifications. In group decision-making (GDM), the consensus feedback mechanism drives the consensus reaching process (CRP) and gives rise to the problem of adjustment amount allocation among decision-makers (DMs). However, existing studies over-rely on consensus thresholds and neglect differences in DMs’ adjustment capabilities and sequences, which significantly reduces the applicability and accuracy of the methods. To address the above issues, this study proposes a novel group consensus method (NS-FPR-PM) integrating the Nash-Stackelberg game and preference maps within the framework of fuzzy preference relations (FPRs). Specifically, class probability thresholds are objectively derived through an optimization model; the classification results are then converted into preference maps based on these class probability thresholds to explore the inherent consensus relations, thereby eliminating reliance on consensus thresholds. The Nash-Stackelberg game model can characterize the differences in bargaining power among DMs, and an asynchronous adjustment mechanism is designed accordingly to achieve fair allocation of adjustment amount. Finally, we provide an example to illustrate the proposed method, the experimental results and analysis demonstrate that the method exhibits significant advantages over similar methods in terms of consensus reaching efficiency and unit adjustment conversion rate.
{"title":"A multi-criteria sorting method for preference maps based on Nash-Stackelberg game","authors":"Xinru Han , Yukun Bao , Jianming Zhan , Yufeng Shen","doi":"10.1016/j.ipm.2025.104587","DOIUrl":"10.1016/j.ipm.2025.104587","url":null,"abstract":"<div><div>Existing multi-criteria sorting methods predominantly rely on preset classification thresholds or fixed numbers of alternatives for classification, exhibiting strong subjectivity and overlooking potential consensus correlations between classifications. In group decision-making (GDM), the consensus feedback mechanism drives the consensus reaching process (CRP) and gives rise to the problem of adjustment amount allocation among decision-makers (DMs). However, existing studies over-rely on consensus thresholds and neglect differences in DMs’ adjustment capabilities and sequences, which significantly reduces the applicability and accuracy of the methods. To address the above issues, this study proposes a novel group consensus method (NS-FPR-PM) integrating the Nash-Stackelberg game and preference maps within the framework of fuzzy preference relations (FPRs). Specifically, class probability thresholds are objectively derived through an optimization model; the classification results are then converted into preference maps based on these class probability thresholds to explore the inherent consensus relations, thereby eliminating reliance on consensus thresholds. The Nash-Stackelberg game model can characterize the differences in bargaining power among DMs, and an asynchronous adjustment mechanism is designed accordingly to achieve fair allocation of adjustment amount. Finally, we provide an example to illustrate the proposed method, the experimental results and analysis demonstrate that the method exhibits significant advantages over similar methods in terms of consensus reaching efficiency and unit adjustment conversion rate.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"63 4","pages":"Article 104587"},"PeriodicalIF":6.9,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145886177","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-06-01Epub Date: 2025-12-31DOI: 10.1016/j.ipm.2025.104575
Haibo Zhang , Zhenyu Liu , Yang Wu , Jiaqian Yuan , Gang Li , Zhijie Ding , Bin Hu
Text-based automated depression detection is one of the current hot topics. However, current research lacks the exploration of key verbal behaviors in depression detection scenarios, resulting in insufficient generalization performance of the models. To address this issue, we propose a depression detection method based on emotional pattern discrepancies, as the discrepancies are one of the fundamental features of depression as an affective disorder. Specifically, we propose an Emotional Pattern Discrepancy Aware Depression Detection Model (EPDAD). The EPDAD employs specially designed modules and loss functions to train the model. This approach enables the model to dynamically and comprehensively perceive the different emotional patterns reflected by depressed and healthy individuals in response to various emotional stimuli. As a result, it enhances the model’s ability to learn the essential features of depression. We evaluate the generalization performance of our model from a cross-dataset and cross-topic perspective using MODMA (52 samples) and MIDD (520 samples) datasets. In cross-topic generalization experiments, our method improves F1 score by 10.39% and 1.77% on MODMA and MIDD, respectively, in comparison to the state-of-the-art method. In cross-dataset generalization experiments, our method improves the F1 score by a maximum of 6.37%. We also compare our model with large language models, and the results indicate it is more effective for depression detection tasks. Our research contributes to the practical application of depression detection models. Our code is available at: https://github.com/hbZhzzz/EPDAD.
{"title":"A text-based emotional pattern discrepancy aware model for enhanced generalization in depression detection","authors":"Haibo Zhang , Zhenyu Liu , Yang Wu , Jiaqian Yuan , Gang Li , Zhijie Ding , Bin Hu","doi":"10.1016/j.ipm.2025.104575","DOIUrl":"10.1016/j.ipm.2025.104575","url":null,"abstract":"<div><div>Text-based automated depression detection is one of the current hot topics. However, current research lacks the exploration of key verbal behaviors in depression detection scenarios, resulting in insufficient generalization performance of the models. To address this issue, we propose a depression detection method based on emotional pattern discrepancies, as the discrepancies are one of the fundamental features of depression as an affective disorder. Specifically, we propose an Emotional Pattern Discrepancy Aware Depression Detection Model (EPDAD). The EPDAD employs specially designed modules and loss functions to train the model. This approach enables the model to dynamically and comprehensively perceive the different emotional patterns reflected by depressed and healthy individuals in response to various emotional stimuli. As a result, it enhances the model’s ability to learn the essential features of depression. We evaluate the generalization performance of our model from a cross-dataset and cross-topic perspective using MODMA (52 samples) and MIDD (520 samples) datasets. In cross-topic generalization experiments, our method improves F1 score by 10.39% and 1.77% on MODMA and MIDD, respectively, in comparison to the state-of-the-art method. In cross-dataset generalization experiments, our method improves the F1 score by a maximum of 6.37%. We also compare our model with large language models, and the results indicate it is more effective for depression detection tasks. Our research contributes to the practical application of depression detection models. Our code is available at: <span><span>https://github.com/hbZhzzz/EPDAD</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"63 4","pages":"Article 104575"},"PeriodicalIF":6.9,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145886271","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-06-01Epub Date: 2026-01-23DOI: 10.1016/j.ipm.2026.104636
Gian Marco Orlando , Valerio La Gatta , Diego Russo , Vincenzo Moscato
Generative Agent-Based Modeling (GABM) is an emerging simulation paradigm that integrates the reasoning capabilities of Large Language Models (LLMs) with traditional Agent-Based Modeling to replicate complex social behaviors, including user interactions on social media platforms. While GABM has been employed to study localized phenomena on social media, such as opinion formation and information propagation, its capacity to capture global network-level phenomena remains underexplored. In this paper, we address this gap by investigating whether GABM-based social media simulations exhibit the Friendship Paradox (FP) – a counterintuitive network phenomenon where individuals, on average, have fewer friends than their friends. We design a GABM-based framework for social media simulation, featuring generative agents that emulate real users by incorporating distinct personalities, interests, and behaviors. Leveraging three real-world Twitter datasets centered on the US 2020 Election, UK Brexit, and the QAnon conspiracy, we demonstrate that the FP and its generalized forms emerge in GABM-based simulations. Consistent with real-world social media, we observe a hierarchical structure where generative agents preferentially connect with others exhibiting superior attributes, such as greater activity or influence, without being instructed with any behavioral rules. Furthermore, our analysis reveals that infrequent connections with highly connected agents primarily drive the Friendship Paradox, mirroring established patterns in real-world networks. Overall, our findings validate the ability of GABM to replicate global social media phenomena, highlighting its potential as a robust framework for modeling and analyzing complex social behaviors at scale.
{"title":"Validating generative agent-Based modeling in social media simulations through the lens of the friendship paradox","authors":"Gian Marco Orlando , Valerio La Gatta , Diego Russo , Vincenzo Moscato","doi":"10.1016/j.ipm.2026.104636","DOIUrl":"10.1016/j.ipm.2026.104636","url":null,"abstract":"<div><div>Generative Agent-Based Modeling (GABM) is an emerging simulation paradigm that integrates the reasoning capabilities of Large Language Models (LLMs) with traditional Agent-Based Modeling to replicate complex social behaviors, including user interactions on social media platforms. While GABM has been employed to study localized phenomena on social media, such as opinion formation and information propagation, its capacity to capture global network-level phenomena remains underexplored. In this paper, we address this gap by investigating whether GABM-based social media simulations exhibit the Friendship Paradox (FP) – a counterintuitive network phenomenon where individuals, on average, have fewer friends than their friends. We design a GABM-based framework for social media simulation, featuring <em>generative agents</em> that emulate real users by incorporating distinct personalities, interests, and behaviors. Leveraging three real-world Twitter datasets centered on the US 2020 Election, UK Brexit, and the QAnon conspiracy, we demonstrate that the FP and its generalized forms emerge in GABM-based simulations. Consistent with real-world social media, we observe a hierarchical structure where <em>generative agents</em> preferentially connect with others exhibiting superior attributes, such as greater activity or influence, without being instructed with any behavioral rules. Furthermore, our analysis reveals that infrequent connections with highly connected agents primarily drive the Friendship Paradox, mirroring established patterns in real-world networks. Overall, our findings validate the ability of GABM to replicate global social media phenomena, highlighting its potential as a robust framework for modeling and analyzing complex social behaviors at scale.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"63 4","pages":"Article 104636"},"PeriodicalIF":6.9,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146023310","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-06-01Epub Date: 2026-01-08DOI: 10.1016/j.ipm.2025.104597
Jiaying Ren , Fengming Han , Yadong Xu
Deep learning has advanced EEG-based human emotion recognition, yet most existing approaches rely on either temporal or spectral features and insufficiently model the fine-grained spatiotemporal structure of neural activity. To address these challenges, this paper develops a dual-stream spatiotemporal graph convolutional network (DSSGCN) for human emotion recognition. In the time domain, a multi-scale modern temporal convolutional network (MS-MTCN) is designed to capture rich temporal information across diverse receptive fields and model long-range temporal dependencies. In the frequency domain, a fully-connected multi-scale graph attention network (FM-GAT) is introduced to learn complex inter-channel relationships and spatial dependencies from the spectral representation of EEG signals. Furthermore, a cross-domain feature fusion module (CFFM) is employed to integrate the complementary information from both temporal and spectral branches, followed by an adaptive ensemble classifier (AEC) to enhance recognition robustness. Finally, an improved online knowledge distillation (IOKD) algorithm is devised to enhance the model’s robustness and generalization. Evaluated on two public dataset and a self-collected music-emotion dataset, DSSGCN achieves 93.98%, 85.00%, and 99.20% accuracy, consistently surpassing eleven state-of-the-art methods and validating its effectiveness for decoding affective states from EEG signals.
{"title":"Dual-stream spatiotemporal graph convolutional networks for EEG-based human emotion recognition","authors":"Jiaying Ren , Fengming Han , Yadong Xu","doi":"10.1016/j.ipm.2025.104597","DOIUrl":"10.1016/j.ipm.2025.104597","url":null,"abstract":"<div><div>Deep learning has advanced EEG-based human emotion recognition, yet most existing approaches rely on either temporal or spectral features and insufficiently model the fine-grained spatiotemporal structure of neural activity. To address these challenges, this paper develops a dual-stream spatiotemporal graph convolutional network (DSSGCN) for human emotion recognition. In the time domain, a multi-scale modern temporal convolutional network (MS-MTCN) is designed to capture rich temporal information across diverse receptive fields and model long-range temporal dependencies. In the frequency domain, a fully-connected multi-scale graph attention network (FM-GAT) is introduced to learn complex inter-channel relationships and spatial dependencies from the spectral representation of EEG signals. Furthermore, a cross-domain feature fusion module (CFFM) is employed to integrate the complementary information from both temporal and spectral branches, followed by an adaptive ensemble classifier (AEC) to enhance recognition robustness. Finally, an improved online knowledge distillation (IOKD) algorithm is devised to enhance the model’s robustness and generalization. Evaluated on two public dataset and a self-collected music-emotion dataset, DSSGCN achieves 93.98%, 85.00%, and 99.20% accuracy, consistently surpassing eleven state-of-the-art methods and validating its effectiveness for decoding affective states from EEG signals.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"63 4","pages":"Article 104597"},"PeriodicalIF":6.9,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145927585","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-06-01Epub Date: 2026-01-05DOI: 10.1016/j.ipm.2025.104590
Changjiu Li , Wei Han , Yong Zhang , Xinwei Wang , Xichao Su
Efficient scheduling of carrier-based aircraft sorties is essential for enhancing the effectiveness of aircraft carriers. The key research challenges stem from the limitations of traditional algorithms, which struggle with this complex scheduling problem due to their high computational complexity, poor adaptability to dynamic events, and a tendency to converge to local optima, rendering them unsuitable for meeting real-time operational demands. To tackle these challenges, we propose an end-to-end deep reinforcement learning scheduling framework that leverages a multi-head attention mechanism to extract features from a heterogeneous graph of the scheduling environment. Using the proximal policy optimization-clip algorithm, the framework enables iterative interaction with a simulation environment to train the scheduling agent. Our experimental findings quantitatively demonstrate the superiority of the proposed framework: the agent outperforms traditional combined rules by over 5% and metaheuristic algorithms by approximately 1%, while achieving an average decision-making time of just 0.7 seconds. The model also demonstrates strong robustness, maintaining a minimal optimality gap even under a 30% reduction in resources. This research provides commanders with a more efficient decision support tool, thereby improving their battlefield response capabilities.
{"title":"End-to-end scheduling for carrier-based aircraft sortie operations using deep reinforcement learning","authors":"Changjiu Li , Wei Han , Yong Zhang , Xinwei Wang , Xichao Su","doi":"10.1016/j.ipm.2025.104590","DOIUrl":"10.1016/j.ipm.2025.104590","url":null,"abstract":"<div><div>Efficient scheduling of carrier-based aircraft sorties is essential for enhancing the effectiveness of aircraft carriers. The key research challenges stem from the limitations of traditional algorithms, which struggle with this complex scheduling problem due to their high computational complexity, poor adaptability to dynamic events, and a tendency to converge to local optima, rendering them unsuitable for meeting real-time operational demands. To tackle these challenges, we propose an end-to-end deep reinforcement learning scheduling framework that leverages a multi-head attention mechanism to extract features from a heterogeneous graph of the scheduling environment. Using the proximal policy optimization-clip algorithm, the framework enables iterative interaction with a simulation environment to train the scheduling agent. Our experimental findings quantitatively demonstrate the superiority of the proposed framework: the agent outperforms traditional combined rules by over 5% and metaheuristic algorithms by approximately 1%, while achieving an average decision-making time of just 0.7 seconds. The model also demonstrates strong robustness, maintaining a minimal optimality gap even under a 30% reduction in resources. This research provides commanders with a more efficient decision support tool, thereby improving their battlefield response capabilities.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"63 4","pages":"Article 104590"},"PeriodicalIF":6.9,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145927586","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}