首页 > 最新文献

Information Processing & Management最新文献

英文 中文
PERStance: Personality-guided enhanced multimodal stance detection 持久性:个性导向的增强多模态姿态检测
IF 6.9 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-06-01 Epub Date: 2026-01-23 DOI: 10.1016/j.ipm.2025.104593
Guoqi Geng, Qianyi Zhan, Heng-Yang Lu
Stance detection aims to identify users’ attitudes toward specific targets in social media, playing a crucial role in information processing and public opinion management. However, existing research on stance detection often overlooks the potential influence of user personality traits on stance expression. In view of the shortcomings of existing stance detection methods, this paper studies the impact of personality traits on stance judgment. To this end, we propose PERStance, a personality-guided enhanced multimodal zero-shot stance detection method. Specifically, PERStance uses a Large Language Model (LLM) to infer users’ personality traits from a multi-dimensional perspective, thereby more accurately understanding users’ potential stances on specific targets. To mitigate issues such as LLM hallucinations and reasoning confusion, we incorporate the Chain-of-Thought framework in the stance detection stage and optimize its reasoning path. Experimental results on multiple multimodal stance detection datasets show that the PERStance method proposed in this paper achieves the best performance in stance detection, with an average increase of 23.88% in the Macro-F1 score. Ablation experiments verify the effectiveness of each module in this method. The source code of our proposed framework is released at https://github.com/jncsnlp/PERStance.
姿态检测旨在识别用户在社交媒体中对特定目标的态度,在信息处理和舆论管理中起着至关重要的作用。然而,现有的姿态检测研究往往忽略了用户人格特征对姿态表达的潜在影响。针对现有姿态检测方法的不足,本文研究了人格特征对姿态判断的影响。为此,我们提出了一种人格引导的增强多模态零射击姿态检测方法PERStance。具体而言,PERStance使用LLM (Large Language Model)从多维度角度推断用户的人格特征,从而更准确地了解用户对特定目标的潜在立场。为了减轻LLM幻觉和推理混乱等问题,我们将思维链框架纳入姿态检测阶段,并优化其推理路径。在多个多模态姿态检测数据集上的实验结果表明,本文提出的PERStance方法在姿态检测中表现最佳,其Macro-F1得分平均提高23.88%。烧蚀实验验证了该方法中各个模块的有效性。我们提出的框架的源代码发布在https://github.com/jncsnlp/PERStance。
{"title":"PERStance: Personality-guided enhanced multimodal stance detection","authors":"Guoqi Geng,&nbsp;Qianyi Zhan,&nbsp;Heng-Yang Lu","doi":"10.1016/j.ipm.2025.104593","DOIUrl":"10.1016/j.ipm.2025.104593","url":null,"abstract":"<div><div>Stance detection aims to identify users’ attitudes toward specific targets in social media, playing a crucial role in information processing and public opinion management. However, existing research on stance detection often overlooks the potential influence of user personality traits on stance expression. In view of the shortcomings of existing stance detection methods, this paper studies the impact of personality traits on stance judgment. To this end, we propose PERStance, a personality-guided enhanced multimodal zero-shot stance detection method. Specifically, PERStance uses a Large Language Model (LLM) to infer users’ personality traits from a multi-dimensional perspective, thereby more accurately understanding users’ potential stances on specific targets. To mitigate issues such as LLM hallucinations and reasoning confusion, we incorporate the Chain-of-Thought framework in the stance detection stage and optimize its reasoning path. Experimental results on multiple multimodal stance detection datasets show that the PERStance method proposed in this paper achieves the best performance in stance detection, with an average increase of <strong>23.88%</strong> in the Macro-F1 score. Ablation experiments verify the effectiveness of each module in this method. The source code of our proposed framework is released at <span><span>https://github.com/jncsnlp/PERStance</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"63 4","pages":"Article 104593"},"PeriodicalIF":6.9,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146023312","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Measuring and forecasting global digital economy efficiency: An integrated approach using the super-efficiency sequential SBM model and machine learning algorithms 测量和预测全球数字经济效率:使用超效率顺序SBM模型和机器学习算法的综合方法
IF 6.9 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-06-01 Epub Date: 2026-01-19 DOI: 10.1016/j.ipm.2026.104638
Zhishuo Zhang , Hu Liu , Tao Shi , Qian Li , Huayong Niu
This study develops a systematic Measurement–Analysis–Prediction framework to evaluate global digital economy efficiency using data from 114 countries over 2006–2023. Efficiency is decomposed into two stages—infrastructure transformation and value creation and international competitiveness—measured via a super-efficiency sequential Slack-Based Measure (SBM) model. Regional disparities are examined with the Dagum Gini coefficient, and machine learning models are employed for prediction, with Random Forest (RF) identified as the optimal predictor. Results show that global digital economy efficiency has shown a fluctuating upward trend, with Stage 1 (infrastructure transformation) consistently outperforms Stage 2 (value creation). Notably, 2021 marked a significant turning point for infrastructure transformation efficiency, with the efficiency value surging to 0.2883 due to pandemic-induced digital demand. Europe achieves the highest efficiency, while Asia and the Americas exhibit strong internal polarization; overall disparities are driven mainly by net inter-regional gaps. Machine learning predictions indicate efficiency will increase from 0.3210 in 2024 to 0.3566 in 2028, though regional imbalances are expected to persist. Overall, this study provides robust empirical evidence and a comprehensive framework for understanding the transmission mechanisms of digital economy efficiency, interpreting global disparity patterns, and guiding policy formulation.
本研究开发了一个系统的测量-分析-预测框架,利用2006-2023年114个国家的数据来评估全球数字经济效率。将效率分解为基础设施改造和价值创造两个阶段,并通过超效率序列基于松弛测度(SBM)模型对国际竞争力进行测度。使用Dagum基尼系数检查区域差异,并使用机器学习模型进行预测,随机森林(RF)被确定为最佳预测器。结果表明,全球数字经济效率呈现波动上升趋势,第一阶段(基础设施改造)的表现始终优于第二阶段(价值创造)。值得注意的是,2021年是基础设施转型效率的重要转折点,受疫情引发的数字化需求影响,效率值飙升至0.2883。欧洲的效率最高,而亚洲和美洲则表现出强烈的内部极化;总体差距主要是由区域间净差距造成的。机器学习预测表明,效率将从2024年的0.3210提高到2028年的0.3566,尽管区域失衡预计将持续存在。总体而言,本研究为理解数字经济效率的传导机制、解释全球差距格局和指导政策制定提供了强有力的实证证据和全面的框架。
{"title":"Measuring and forecasting global digital economy efficiency: An integrated approach using the super-efficiency sequential SBM model and machine learning algorithms","authors":"Zhishuo Zhang ,&nbsp;Hu Liu ,&nbsp;Tao Shi ,&nbsp;Qian Li ,&nbsp;Huayong Niu","doi":"10.1016/j.ipm.2026.104638","DOIUrl":"10.1016/j.ipm.2026.104638","url":null,"abstract":"<div><div>This study develops a systematic <em>Measurement–Analysis–Prediction</em> framework to evaluate global digital economy efficiency using data from 114 countries over 2006–2023. Efficiency is decomposed into two stages—<em>infrastructure transformation</em> and <em>value creation and international competitiveness</em>—measured via a super-efficiency sequential Slack-Based Measure (SBM) model. Regional disparities are examined with the Dagum Gini coefficient, and machine learning models are employed for prediction, with Random Forest (RF) identified as the optimal predictor. Results show that global digital economy efficiency has shown a fluctuating upward trend, with Stage 1 (infrastructure transformation) consistently outperforms Stage 2 (value creation). Notably, 2021 marked a significant turning point for infrastructure transformation efficiency, with the efficiency value surging to 0.2883 due to pandemic-induced digital demand. Europe achieves the highest efficiency, while Asia and the Americas exhibit strong internal polarization; overall disparities are driven mainly by net inter-regional gaps. Machine learning predictions indicate efficiency will increase from 0.3210 in 2024 to 0.3566 in 2028, though regional imbalances are expected to persist. Overall, this study provides robust empirical evidence and a comprehensive framework for understanding the transmission mechanisms of digital economy efficiency, interpreting global disparity patterns, and guiding policy formulation.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"63 4","pages":"Article 104638"},"PeriodicalIF":6.9,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146023314","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Artificial intelligence applications and innovation ambidexterity: The “inverted U-shaped” regulating effect of risk-taking 人工智能应用与创新双重性:风险承担的“倒u型”调节效应
IF 6.9 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-06-01 Epub Date: 2026-01-20 DOI: 10.1016/j.ipm.2026.104635
Lihua Fu , Luo Jin , Yaokuang Li
Artificial intelligence (AI) is driving technological and industrial transformation, reshaping enterprise structure and innovation. Despite its importance, research lacks insight into AI’s impact on innovation ambidexterity and the underexplored role of risk-taking. Drawing on information processing theory, this study constructs a theoretical model to examine the impact of AI applications on enterprise exploration, exploitation, and innovation ambidexterity, as well as incorporates the level of risk-taking as a moderating variable. Using Stata 17 to perform panel data regression and a series of robustness tests, we analyze 1,908 firm-year observations from the information transmission, software, and information technology service industries, as well as the scientific research and technology service industries listed on the Shanghai and Shenzhen Stock Exchanges from 2012 to 2022. The empirical analyses propose that AI applications remarkably enhance innovation ambidexterity at the 1 % level. This positive effect is the most pronounced when risk-taking is moderate. This study extends information processing theory to the AI-enabled innovation context and further enriches its boundary conditions by introducing risk-taking as a nonlinear moderator. Managerially, the findings suggest that enterprises should calibrate risk-taking levels to complement AI deployment, enabling a balanced approach to exploration and exploitation.
人工智能(AI)正在推动技术和产业变革,重塑企业结构和创新。尽管它很重要,但研究缺乏对人工智能对创新的影响的洞察力,以及对风险承担作用的探索不足。利用信息处理理论,构建了人工智能应用对企业探索、开发和创新双元性影响的理论模型,并将风险承担水平作为调节变量。利用Stata 17进行面板数据回归和一系列稳健性检验,对2012 - 2022年沪深两市上市的信息传输、软件、信息技术服务行业以及科研和技术服务行业的1908个企业年观测数据进行了分析。实证分析表明,人工智能应用在1%的水平上显著提高了创新的双元性。当冒险适度时,这种积极影响最为明显。本研究将信息处理理论扩展到人工智能驱动的创新环境中,并通过引入风险承担作为非线性调节因子进一步丰富了其边界条件。在管理方面,研究结果表明,企业应调整风险承担水平,以补充人工智能的部署,从而实现勘探和开发的平衡方法。
{"title":"Artificial intelligence applications and innovation ambidexterity: The “inverted U-shaped” regulating effect of risk-taking","authors":"Lihua Fu ,&nbsp;Luo Jin ,&nbsp;Yaokuang Li","doi":"10.1016/j.ipm.2026.104635","DOIUrl":"10.1016/j.ipm.2026.104635","url":null,"abstract":"<div><div>Artificial intelligence (AI) is driving technological and industrial transformation, reshaping enterprise structure and innovation. Despite its importance, research lacks insight into AI’s impact on innovation ambidexterity and the underexplored role of risk-taking. Drawing on information processing theory, this study constructs a theoretical model to examine the impact of AI applications on enterprise exploration, exploitation, and innovation ambidexterity, as well as incorporates the level of risk-taking as a moderating variable. Using Stata 17 to perform panel data regression and a series of robustness tests, we analyze 1,908 firm-year observations from the information transmission, software, and information technology service industries, as well as the scientific research and technology service industries listed on the Shanghai and Shenzhen Stock Exchanges from 2012 to 2022. The empirical analyses propose that AI applications remarkably enhance innovation ambidexterity at the 1 % level. This positive effect is the most pronounced when risk-taking is moderate. This study extends information processing theory to the AI-enabled innovation context and further enriches its boundary conditions by introducing risk-taking as a nonlinear moderator. Managerially, the findings suggest that enterprises should calibrate risk-taking levels to complement AI deployment, enabling a balanced approach to exploration and exploitation.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"63 4","pages":"Article 104635"},"PeriodicalIF":6.9,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146023315","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SEGA: Selective cross-lingual representation via sparse guided attention for low-resource multilingual named entity recognition SEGA:基于稀疏引导注意力的低资源多语言命名实体识别的选择性跨语言表示
IF 6.9 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-06-01 Epub Date: 2026-01-22 DOI: 10.1016/j.ipm.2026.104627
Paerhati Tulajiang , Jinzhong Ning , Yuanyuan Sun , Liang Yang , Yuanyu Zhang , Kelaiti Xiao , Zhixing Lu , Yijia Zhang , Hongfei Lin
Multilingual named entity recognition (NER) is especially challenging in low-resource and typologically diverse languages, where translation drift, morphological variation, and noisy alignments degrade performance. Existing encoder-based methods often rely on dense attention or uniform alignment, which tends to propagate irrelevant signals across languages. We present SEGA, a lightweight and typology-aware framework that incorporates sparse guided attention to select auxiliary signals, alongside a weighted fusion layer that balances representations between cross-lingual and monolingual contexts. Unlike prior approaches, SEGA requires no parallel corpora and supports fully monolingual inference. We evaluate SEGA on six multilingual NER benchmarks spanning over 60 languages, including CoNLL, WikiANN, MasakhaNER 2.0, XTREME-40, WikiNEuRal, and MultiNERD. SEGA achieves new state-of-the-art results on five datasets, with absolute gains of up to +24.2 F1 over strong encoder baselines, and outperforming prompt-based large language models by up to +18.9 F1 in low-resource scenarios. Efficiency analyses show that SEGA adds only  ∼ 30M parameters beyond a standard dual encoder, making it lightweight and deployable on a single GPU. Comprehensive ablation, visualization, and error analyses confirm that SEGA is robust to alignment noise, morphological complexity, and boundary ambiguity, offering a practical and scalable solution for real-world multilingual NER.
多语言命名实体识别(NER)在低资源和类型多样化的语言中尤其具有挑战性,其中翻译漂移、形态变化和噪声对齐会降低性能。现有的基于编码器的方法通常依赖于密集关注或均匀对齐,这倾向于跨语言传播不相关的信号。我们提出了一种轻量级的类型感知框架SEGA,它结合了稀疏引导注意力来选择辅助信号,以及一个加权融合层,以平衡跨语言和单语言上下文之间的表示。与之前的方法不同,SEGA不需要并行语料库,并且完全支持单语言推理。我们对世嘉进行了六种多语言NER基准测试,涵盖60多种语言,包括CoNLL、WikiANN、MasakhaNER 2.0、XTREME-40、WikiNEuRal和multierd。世嘉在五个数据集上实现了新的最先进的结果,在强大的编码器基线上的绝对增益高达+24.2 F1,在低资源场景下优于基于提示的大型语言模型高达+18.9 F1。效率分析表明,世嘉在标准双编码器之外仅增加了 ~ 30M参数,使其重量轻,可部署在单个GPU上。综合消融、可视化和误差分析证实,SEGA对对齐噪声、形态复杂性和边界模糊具有鲁棒性,为现实世界的多语言NER提供了实用且可扩展的解决方案。
{"title":"SEGA: Selective cross-lingual representation via sparse guided attention for low-resource multilingual named entity recognition","authors":"Paerhati Tulajiang ,&nbsp;Jinzhong Ning ,&nbsp;Yuanyuan Sun ,&nbsp;Liang Yang ,&nbsp;Yuanyu Zhang ,&nbsp;Kelaiti Xiao ,&nbsp;Zhixing Lu ,&nbsp;Yijia Zhang ,&nbsp;Hongfei Lin","doi":"10.1016/j.ipm.2026.104627","DOIUrl":"10.1016/j.ipm.2026.104627","url":null,"abstract":"<div><div>Multilingual named entity recognition (NER) is especially challenging in low-resource and typologically diverse languages, where translation drift, morphological variation, and noisy alignments degrade performance. Existing encoder-based methods often rely on dense attention or uniform alignment, which tends to propagate irrelevant signals across languages. We present SEGA, a lightweight and typology-aware framework that incorporates sparse guided attention to select auxiliary signals, alongside a weighted fusion layer that balances representations between cross-lingual and monolingual contexts. Unlike prior approaches, SEGA requires no parallel corpora and supports fully monolingual inference. We evaluate SEGA on six multilingual NER benchmarks spanning over 60 languages, including CoNLL, WikiANN, MasakhaNER 2.0, XTREME-40, WikiNEuRal, and MultiNERD. SEGA achieves new state-of-the-art results on five datasets, with absolute gains of up to +24.2 F1 over strong encoder baselines, and outperforming prompt-based large language models by up to +18.9 F1 in low-resource scenarios. Efficiency analyses show that SEGA adds only  ∼ 30M parameters beyond a standard dual encoder, making it lightweight and deployable on a single GPU. Comprehensive ablation, visualization, and error analyses confirm that SEGA is robust to alignment noise, morphological complexity, and boundary ambiguity, offering a practical and scalable solution for real-world multilingual NER.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"63 4","pages":"Article 104627"},"PeriodicalIF":6.9,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146023254","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Language models for environmental, social, and governance analysis: A review 环境、社会和治理分析的语言模型:综述
IF 6.9 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-06-01 Epub Date: 2026-01-02 DOI: 10.1016/j.ipm.2025.104596
Kelvin Du , Rui Mao , Frank Xing , Gianmarco Mengaldo , Erik Cambria
Language models have revolutionized information processing, elevating it to new levels and generating opportunities to positively impact our society, e.g., in Environmental, Social, and Governance (ESG) domains. This article surveys the current use of language models for ESG analysis, focusing on their applicable scope, effectiveness, and transformative impact. It highlights how these models facilitate a deeper understanding of ESG practices and impacts by integrating unstructured data while acknowledging existing limitations and challenges. Specifically, based on a review of over ninety ESG studies published since the introduction of Transformers in 2018, we discovered that the potential of language models is particularly notable in four primary themes: (1) ESG Frameworks and Standards, which involve the classification of ESG-related texts into binary categories, coarse-grained ESG factors, or fine-grained ESG topics. This theme also includes identifying ESG topic trends and assessing the alignment of corporate ESG disclosures with sustainable development goals; (2) ESG Reporting and Disclosure, which include ESG narrative processing, ESG reporting assurance and ESG report generation; (3) ESG Measurement and Evaluation, which involves calculating ESG ratings, extracting key performance indicators (KPIs), assessing ESG risks, detecting ESG controversy categories, analyzing ESG impact and duration, and assessing the effects of ESG on sustainable growth and corporate financial performance, among other functions; (4) ESG Integration and Application, aiming to incorporate ESG factors into broader financial applications and thereby innovate financial tasks, including ESG sentiment analysis, ESG chatbots and AI assistants, ESG-based financial risk and credit analysis, and ESG investing strategies. We conclude by emphasizing the significance of language models in advancing ESG studies and discussing future research directions.
语言模型已经彻底改变了信息处理,将其提升到新的水平,并产生了积极影响我们社会的机会,例如在环境,社会和治理(ESG)领域。本文调查了ESG分析中语言模型的当前使用情况,重点关注它们的适用范围、有效性和变革影响。它强调了这些模型如何通过整合非结构化数据促进对ESG实践和影响的更深入理解,同时承认现有的局限性和挑战。具体来说,基于对自2018年《变压器》推出以来发表的90多项ESG研究的回顾,我们发现语言模型的潜力在四个主要主题中尤为显著:(1)ESG框架和标准,其中涉及将ESG相关文本分为二元类别、粗粒度ESG因素或细粒度ESG主题。该主题还包括确定ESG主题趋势,评估企业ESG披露与可持续发展目标的一致性;(2) ESG报告与披露,包括ESG叙事处理、ESG报告保证和ESG报告生成;(3) ESG测量与评估,包括计算ESG评级、提取关键绩效指标、评估ESG风险、发现ESG争议类别、分析ESG影响和持续时间、评估ESG对可持续增长和公司财务绩效的影响等功能;(4) ESG整合与应用,旨在将ESG因素纳入更广泛的金融应用,从而创新金融任务,包括ESG情绪分析、ESG聊天机器人和人工智能助手、基于ESG的金融风险和信用分析以及ESG投资策略。最后,我们强调了语言模型在推进ESG研究中的重要意义,并讨论了未来的研究方向。
{"title":"Language models for environmental, social, and governance analysis: A review","authors":"Kelvin Du ,&nbsp;Rui Mao ,&nbsp;Frank Xing ,&nbsp;Gianmarco Mengaldo ,&nbsp;Erik Cambria","doi":"10.1016/j.ipm.2025.104596","DOIUrl":"10.1016/j.ipm.2025.104596","url":null,"abstract":"<div><div>Language models have revolutionized information processing, elevating it to new levels and generating opportunities to positively impact our society, e.g., in Environmental, Social, and Governance (ESG) domains. This article surveys the current use of language models for ESG analysis, focusing on their applicable scope, effectiveness, and transformative impact. It highlights how these models facilitate a deeper understanding of ESG practices and impacts by integrating unstructured data while acknowledging existing limitations and challenges. Specifically, based on a review of over ninety ESG studies published since the introduction of Transformers in 2018, we discovered that the potential of language models is particularly notable in four primary themes: (1) <strong>ESG Frameworks and Standards</strong>, which involve the classification of ESG-related texts into binary categories, coarse-grained ESG factors, or fine-grained ESG topics. This theme also includes identifying ESG topic trends and assessing the alignment of corporate ESG disclosures with sustainable development goals; (2) <strong>ESG Reporting and Disclosure</strong>, which include ESG narrative processing, ESG reporting assurance and ESG report generation; (3) <strong>ESG Measurement and Evaluation</strong>, which involves calculating ESG ratings, extracting key performance indicators (KPIs), assessing ESG risks, detecting ESG controversy categories, analyzing ESG impact and duration, and assessing the effects of ESG on sustainable growth and corporate financial performance, among other functions; (4) <strong>ESG Integration and Application</strong>, aiming to incorporate ESG factors into broader financial applications and thereby innovate financial tasks, including ESG sentiment analysis, ESG chatbots and AI assistants, ESG-based financial risk and credit analysis, and ESG investing strategies. We conclude by emphasizing the significance of language models in advancing ESG studies and discussing future research directions.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"63 4","pages":"Article 104596"},"PeriodicalIF":6.9,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145886272","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CEREM: A segment-wise attention network for chinese highly aggregated semantic extraction 中文高度聚合语义提取的分段关注网络
IF 6.9 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-06-01 Epub Date: 2026-01-12 DOI: 10.1016/j.ipm.2026.104617
Bin Liu , Jiaqi Han , Zhenyu Zhang , Shijun Li , Haixi Zhang , Yijie Chen , Keqin Li
The demand of large models for data has revitalized information extraction research, particularly for Chinese texts, where semantic isolation poses unique challenges. Existing methods often rely on Chinese word segmentation, but their capacity to capture full semantic meaning is constrained by polysemy, flexible word order, and other unique characteristics of the Chinese language. To address this limitation, we propose three-level semantic division and design CEREM, a prompt- and pointer-based IE network, to extract highly aggregated semantics. In our design, prompts unify multiple IE tasks while preserving semantic interactions, a Segment Information Attention mechanism implicitly aggregates the high-level semantics to enhance Chinese understanding, and an Independent Branches strategy decouples parameters to focus separately on the sub-tasks of start and end index prediction. We evaluate CEREM on four datasets–DiaKG, CMedCausal, Title2Event, and the self-constructed CAIT–covering named entity recognition (NER), relation extraction (RE), and event extraction tasks. CEREM achieves state-of-the-art performance: on CAIT, 88.59% F1 for NER and 71.82% for RE; on DiaKG, 81.77% for NER and 65.44% for RE; and for causal relation extraction on CMedCausal, 45.30% F1. These results demonstrate CEREM’s effectiveness across domains and task types, highlighting its potential as a unified framework for Chinese information extraction.
对大型数据模型的需求振兴了信息提取研究,特别是对语义隔离带来独特挑战的中文文本。现有的分词方法往往依赖于汉语的分词,但由于汉语的多义性、词序的灵活性以及其他独特的特点,限制了其捕捉完整语义的能力。为了解决这一限制,我们提出了三层语义划分,并设计了基于提示和指针的IE网络CEREM,以提取高度聚合的语义。在我们的设计中,提示统一多个IE任务,同时保持语义交互;段信息关注机制隐式聚合高级语义以增强中文理解;独立分支策略解耦参数以分别关注开始和结束索引预测的子任务。我们在四个数据集(diakg、CMedCausal、Title2Event)和自构建的cait上评估了CEREM,包括命名实体识别(NER)、关系提取(RE)和事件提取任务。CEREM达到了最先进的性能:在CAIT上,NER的F1为88.59%,RE为71.82%;在DiaKG上,NER为81.77%,RE为65.44%;CMedCausal上的因果关系提取,F1为45.30%。这些结果证明了CEREM在跨领域和任务类型方面的有效性,突出了其作为中文信息提取统一框架的潜力。
{"title":"CEREM: A segment-wise attention network for chinese highly aggregated semantic extraction","authors":"Bin Liu ,&nbsp;Jiaqi Han ,&nbsp;Zhenyu Zhang ,&nbsp;Shijun Li ,&nbsp;Haixi Zhang ,&nbsp;Yijie Chen ,&nbsp;Keqin Li","doi":"10.1016/j.ipm.2026.104617","DOIUrl":"10.1016/j.ipm.2026.104617","url":null,"abstract":"<div><div>The demand of large models for data has revitalized information extraction research, particularly for Chinese texts, where semantic isolation poses unique challenges. Existing methods often rely on Chinese word segmentation, but their capacity to capture full semantic meaning is constrained by polysemy, flexible word order, and other unique characteristics of the Chinese language. To address this limitation, we propose three-level semantic division and design CEREM, a prompt- and pointer-based IE network, to extract highly aggregated semantics. In our design, prompts unify multiple IE tasks while preserving semantic interactions, a Segment Information Attention mechanism implicitly aggregates the high-level semantics to enhance Chinese understanding, and an Independent Branches strategy decouples parameters to focus separately on the sub-tasks of start and end index prediction. We evaluate CEREM on four datasets–DiaKG, CMedCausal, Title2Event, and the self-constructed CAIT–covering named entity recognition (NER), relation extraction (RE), and event extraction tasks. CEREM achieves state-of-the-art performance: on CAIT, 88.59% F1 for NER and 71.82% for RE; on DiaKG, 81.77% for NER and 65.44% for RE; and for causal relation extraction on CMedCausal, 45.30% F1. These results demonstrate CEREM’s effectiveness across domains and task types, highlighting its potential as a unified framework for Chinese information extraction.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"63 4","pages":"Article 104617"},"PeriodicalIF":6.9,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145978417","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Knowledge-based visual question classification using quaternion hypergraph consistent network 基于四元数超图一致网络的知识视觉问题分类
IF 6.9 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-06-01 Epub Date: 2026-01-13 DOI: 10.1016/j.ipm.2025.104591
Jing Wang , Duantengchuan Li , Xu Du , Hao Li , Zhuang Hu
Visual questions are an important means to evaluate students’ knowledge. Knowledge-based visual question classification can effectively excavate the knowledge intention of the question, and realize the effective organization and management of online question resources at the knowledge level. The existing methods simply regard it as a multimodal classification task, ignoring the capture of implicit knowledge information and the fine-grained interactions between multimodal and multi-granularity features. To mitigate this, we propose a quaternion hypergraph consistent network (QHCN). This approach can not only extract explicit semantic features and implicit knowledge features from text and images simultaneously, but also considers three key properties among explicit-implicit features: modality complementation, modality independence, and knowledge consistency. Specifically, a visual question is represented as a quaternion vector consisting of two modalities and four-dimensional features. To achieve multimodal complementation, the consistency of vision and language guides the construction of a quaternion hypergraph, and a quaternion convolution operator deeply fuses explicit-implicit features. To capture inter-dependencies between explicit-implicit features, the independence loss and knowledge consistency loss are designed to optimize hypergraph network parameters and enhance the hypergraph structure. Extensive experiments on visual question sets verify that our QHCN achieved an accuracy of 94.82% and an F1 score of 94.76%, outperforming the optimal baseline by +1.46% and +1.53%, respectively.
可视化问题是评价学生知识水平的重要手段。基于知识的可视化问题分类可以有效挖掘问题的知识意图,实现对在线问题资源在知识层面的有效组织和管理。现有方法简单地将其视为一个多模态分类任务,忽略了隐性知识信息的获取以及多模态和多粒度特征之间的细粒度交互。为了解决这个问题,我们提出了一个四元数超图一致网络(QHCN)。该方法不仅可以同时从文本和图像中提取显式语义特征和隐含知识特征,而且考虑了显式和隐含特征之间的三个关键特性:情态互补、情态独立性和知识一致性。具体来说,视觉问题被表示为由两个模态和四维特征组成的四元数向量。为了实现多模态互补,视觉和语言的一致性指导了四元数超图的构造,四元数卷积算子深度融合了显式-隐式特征。为了捕获显隐特征之间的相互依赖关系,设计了独立性损失和知识一致性损失来优化超图网络参数,增强超图结构。在视觉问题集上的大量实验验证了我们的QHCN达到了94.82%的准确率和94.76%的F1分数,分别比最优基线提高了+1.46%和+1.53%。
{"title":"Knowledge-based visual question classification using quaternion hypergraph consistent network","authors":"Jing Wang ,&nbsp;Duantengchuan Li ,&nbsp;Xu Du ,&nbsp;Hao Li ,&nbsp;Zhuang Hu","doi":"10.1016/j.ipm.2025.104591","DOIUrl":"10.1016/j.ipm.2025.104591","url":null,"abstract":"<div><div>Visual questions are an important means to evaluate students’ knowledge. Knowledge-based visual question classification can effectively excavate the knowledge intention of the question, and realize the effective organization and management of online question resources at the knowledge level. The existing methods simply regard it as a multimodal classification task, ignoring the capture of implicit knowledge information and the fine-grained interactions between multimodal and multi-granularity features. To mitigate this, we propose a quaternion hypergraph consistent network (QHCN). This approach can not only extract explicit semantic features and implicit knowledge features from text and images simultaneously, but also considers three key properties among explicit-implicit features: modality complementation, modality independence, and knowledge consistency. Specifically, a visual question is represented as a quaternion vector consisting of two modalities and four-dimensional features. To achieve multimodal complementation, the consistency of vision and language guides the construction of a quaternion hypergraph, and a quaternion convolution operator deeply fuses explicit-implicit features. To capture inter-dependencies between explicit-implicit features, the independence loss and knowledge consistency loss are designed to optimize hypergraph network parameters and enhance the hypergraph structure. Extensive experiments on visual question sets verify that our QHCN achieved an accuracy of 94.82% and an F1 score of 94.76%, outperforming the optimal baseline by +1.46% and +1.53%, respectively.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"63 4","pages":"Article 104591"},"PeriodicalIF":6.9,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145978416","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An information processing framework for education: Supporting automatic question generation with NLP to minimize human intervention 教育信息处理框架:支持NLP自动问题生成,减少人为干预
IF 6.9 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-06-01 Epub Date: 2026-01-16 DOI: 10.1016/j.ipm.2026.104613
Memoona Saleem , Zahoor Ur Rehman , Raja Hashim Ali , Ujala Akmal , Ali Zeeshan Ijaz , Raja Manzar Abbas
The recent model and conceptual advancements in Artificial Intelligence (AI) and Natural Language Processing (NLP) are changing how knowledge-intensive tasks are giving more reliable and creative decisions. Particularly, the advancements in NLP has completely changed the way human language is processed, is understood, and is then used for generating human language. These updates are useful for the deeper analysis of textual content that support information systems and their management. For example, in the field of education, these technologies offer advanced and more intelligent tools, which enhance education through improved learning experiences, optimized assessments, and other teaching and study mechanisms. In this paper, we have worked on a unified framework for automatic questions generation (AQG) and educational content analysis. For this purpose, we have developed Questify-TheEduBot, which integrates transformer-based models (BERT, GPT), Latent Dirichlet Allocation (LDA), sentiment analysis, and keyword extraction into a single pipeline. Existing tools typically address isolated tasks but our tool generates multiple types of questions, i.e., multiple choice questions (MCQs), cloze, as well as descriptive type of questions. In addition to AQG, Questify-TheEduBot simultaneously validates semantic coherence, topic coverage, and sentiment appropriateness. We have evaluated and compared our model on SQuAD v2.0 and LearningQ datasets, which consists of over 300,000 Question Answer pairs. Questify-TheEduBot demonstrated excellent performance on the test datasets, with cosine similarity above 0.85, keyword overlap of 87%, and topic modeling precision of 89%. Human evaluation further confirms the pedagogical relevance of generated questions, where our study shows significant improvements over template-based and Seq2Seq competing baseline models. The web-based platform of our tool offers instructors and learners with a tested, interpretable, and resource-efficient tool for automated assessment, support in curriculum development, and enables personalized learning. By merging automated question generation and content analytics, Questify-TheEduBot advances the state of NLP in the field of education, where it provides actionable insights for information management in digital learning environments.
人工智能(AI)和自然语言处理(NLP)的最新模型和概念进展正在改变知识密集型任务如何提供更可靠和创造性的决策。特别是,NLP的进步已经完全改变了人类语言的处理方式,理解,然后用于生成人类语言。这些更新有助于对支持信息系统及其管理的文本内容进行更深入的分析。例如,在教育领域,这些技术提供了更先进、更智能的工具,通过改善学习体验、优化评估和其他教学和学习机制来加强教育。在本文中,我们研究了一个用于自动问题生成(AQG)和教育内容分析的统一框架。为此,我们开发了Questify-TheEduBot,它将基于转换器的模型(BERT, GPT),潜在狄利克雷分配(LDA),情感分析和关键字提取集成到单个管道中。现有的工具通常解决孤立的任务,但我们的工具生成多种类型的问题,即选择题(mcq),完形填空,以及描述性问题。除了AQG, Questify-TheEduBot还可以同时验证语义一致性、主题覆盖和情感适当性。我们在SQuAD v2.0和LearningQ数据集上对我们的模型进行了评估和比较,这些数据集由超过30万个问答对组成。Questify-TheEduBot在测试数据集上表现出色,余弦相似度在0.85以上,关键词重叠率为87%,主题建模精度为89%。人类评估进一步证实了生成问题的教学相关性,我们的研究显示了基于模板和Seq2Seq竞争基线模型的显着改进。我们的工具的网络平台为教师和学习者提供了一个经过测试的、可解释的、资源高效的工具,用于自动评估,支持课程开发,并实现个性化学习。通过合并自动问题生成和内容分析,Questify-TheEduBot推动了NLP在教育领域的发展,为数字学习环境中的信息管理提供了可操作的见解。
{"title":"An information processing framework for education: Supporting automatic question generation with NLP to minimize human intervention","authors":"Memoona Saleem ,&nbsp;Zahoor Ur Rehman ,&nbsp;Raja Hashim Ali ,&nbsp;Ujala Akmal ,&nbsp;Ali Zeeshan Ijaz ,&nbsp;Raja Manzar Abbas","doi":"10.1016/j.ipm.2026.104613","DOIUrl":"10.1016/j.ipm.2026.104613","url":null,"abstract":"<div><div>The recent model and conceptual advancements in Artificial Intelligence (AI) and Natural Language Processing (NLP) are changing how knowledge-intensive tasks are giving more reliable and creative decisions. Particularly, the advancements in NLP has completely changed the way human language is processed, is understood, and is then used for generating human language. These updates are useful for the deeper analysis of textual content that support information systems and their management. For example, in the field of education, these technologies offer advanced and more intelligent tools, which enhance education through improved learning experiences, optimized assessments, and other teaching and study mechanisms. In this paper, we have worked on a unified framework for automatic questions generation (AQG) and educational content analysis. For this purpose, we have developed Questify-TheEduBot, which integrates transformer-based models (BERT, GPT), Latent Dirichlet Allocation (LDA), sentiment analysis, and keyword extraction into a single pipeline. Existing tools typically address isolated tasks but our tool generates multiple types of questions, i.e., multiple choice questions (MCQs), cloze, as well as descriptive type of questions. In addition to AQG, Questify-TheEduBot simultaneously validates semantic coherence, topic coverage, and sentiment appropriateness. We have evaluated and compared our model on SQuAD v2.0 and LearningQ datasets, which consists of over 300,000 Question Answer pairs. Questify-TheEduBot demonstrated excellent performance on the test datasets, with cosine similarity above 0.85, keyword overlap of 87%, and topic modeling precision of 89%. Human evaluation further confirms the pedagogical relevance of generated questions, where our study shows significant improvements over template-based and Seq2Seq competing baseline models. The web-based platform of our tool offers instructors and learners with a tested, interpretable, and resource-efficient tool for automated assessment, support in curriculum development, and enables personalized learning. By merging automated question generation and content analytics, Questify-TheEduBot advances the state of NLP in the field of education, where it provides actionable insights for information management in digital learning environments.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"63 4","pages":"Article 104613"},"PeriodicalIF":6.9,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145978436","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SemiBCP-SAM2 : Semi-supervised model via enhanced bidirectional copy-paste based on SAM2 for medical image segmentation SemiBCP-SAM2:基于SAM2的增强双向复制粘贴半监督模型,用于医学图像分割
IF 6.9 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-06-01 Epub Date: 2025-12-31 DOI: 10.1016/j.ipm.2025.104576
Guangqi Yang , Xiaoxin Guo , Haoran Zhang , Zhenyuan Zheng , Hongliang Dong , Songbai Xu
Insufficient use of unlabeled data often leads to inaccurate medical image segmentation, and noise in pseudo-labels can further destabilize training. In this paper, we propose a semi-supervised model based on the SAM2 combined with a bidirectional copy-paste mean teacher model (SemiBCP-SAM2). Specifically, we use a student model to generate segmentation results, which are then used as input prompts for SAM2 to generate additional pseudo-labels, providing auxiliary supervision to guide student learning. We also introduce a Masked Prompt (MP) mechanism that reduces prompt confidence to better handle uncertainty and noise, improving its performance in complex or incomplete information scenarios. Another major contribution is the transplantability of this model that can be achieved by replacing the baseline network in the student-teacher model, and can enhance the performance of other semi-supervised segmentation networks at a lower cost. We conduct comparative experiments and performance evaluations of SemiBCP-SAM2 on the ACDC (100 MRI scans) and PROMISE12 (50 MRI scans) datasets. On ACDC, with 5% and 10% labeled data, SemiBCP-SAM2 improves Dice by 0.29% and 1.16%, and Jaccard by 0.39% and 1.84%. On PROMISE12, with 5% and 20% labeled data, it improves Dice by 1.61% and 2.03%, and Jaccard by 1.99% and 2.79%. Source code is released at https://github.com/ydlam/SemiBCP-SAM2.
未标记数据的使用不足往往导致医学图像分割不准确,而伪标签中的噪声会进一步破坏训练的稳定性。在本文中,我们提出了一个基于SAM2和双向复制粘贴平均教师模型(SemiBCP-SAM2)的半监督模型。具体来说,我们使用学生模型来生成分割结果,然后将其用作SAM2的输入提示,以生成额外的伪标签,为指导学生学习提供辅助监督。我们还引入了掩蔽提示(MP)机制,该机制降低了提示置信度,以更好地处理不确定性和噪声,提高了其在复杂或不完整信息场景中的性能。另一个主要贡献是该模型的可移植性,可以通过替换学生-教师模型中的基线网络来实现,并且可以以较低的成本提高其他半监督分割网络的性能。我们在ACDC(100次MRI扫描)和PROMISE12(50次MRI扫描)数据集上对SemiBCP-SAM2进行了比较实验和性能评估。在ACDC上,当标记数据分别为5%和10%时,SemiBCP-SAM2分别提高Dice 0.29%和1.16%,Jaccard提高0.39%和1.84%。在PROMISE12上,标记数据分别为5%和20%时,Dice分别提高了1.61%和2.03%,Jaccard分别提高了1.99%和2.79%。源代码发布在https://github.com/ydlam/SemiBCP-SAM2。
{"title":"SemiBCP-SAM2 : Semi-supervised model via enhanced bidirectional copy-paste based on SAM2 for medical image segmentation","authors":"Guangqi Yang ,&nbsp;Xiaoxin Guo ,&nbsp;Haoran Zhang ,&nbsp;Zhenyuan Zheng ,&nbsp;Hongliang Dong ,&nbsp;Songbai Xu","doi":"10.1016/j.ipm.2025.104576","DOIUrl":"10.1016/j.ipm.2025.104576","url":null,"abstract":"<div><div>Insufficient use of unlabeled data often leads to inaccurate medical image segmentation, and noise in pseudo-labels can further destabilize training. In this paper, we propose a semi-supervised model based on the SAM2 combined with a bidirectional copy-paste mean teacher model (SemiBCP-SAM2). Specifically, we use a student model to generate segmentation results, which are then used as input prompts for SAM2 to generate additional pseudo-labels, providing auxiliary supervision to guide student learning. We also introduce a Masked Prompt (MP) mechanism that reduces prompt confidence to better handle uncertainty and noise, improving its performance in complex or incomplete information scenarios. Another major contribution is the transplantability of this model that can be achieved by replacing the baseline network in the student-teacher model, and can enhance the performance of other semi-supervised segmentation networks at a lower cost. We conduct comparative experiments and performance evaluations of SemiBCP-SAM2 on the ACDC (100 MRI scans) and PROMISE12 (50 MRI scans) datasets. On ACDC, with 5% and 10% labeled data, SemiBCP-SAM2 improves Dice by 0.29% and 1.16%, and Jaccard by 0.39% and 1.84%. On PROMISE12, with 5% and 20% labeled data, it improves Dice by 1.61% and 2.03%, and Jaccard by 1.99% and 2.79%. Source code is released at <span><span>https://github.com/ydlam/SemiBCP-SAM2</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"63 4","pages":"Article 104576"},"PeriodicalIF":6.9,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145886178","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
From tracking to thinking: Facilitating post-exercise reflection by a large language model-mediated journaling system 从跟踪到思考:通过大型语言模型介导的日志系统促进运动后反思
IF 6.9 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-06-01 Epub Date: 2025-12-31 DOI: 10.1016/j.ipm.2025.104574
Xianglin Zhao , Yucheng Jin , Annie Yan Wang , Ming Zhang
Wearable devices provide rich quantitative data for self-reflection on physical activity. However, users often struggle to derive meaningful insights from these data, highlighting the need for enhanced support. To investigate whether Large Language Models (LLMs) can facilitate this process, we propose and evaluate a human-LLM collaborative reflective journaling paradigm. We developed PaceMind, an LLM-mediated journaling system that implements this paradigm based on a three-stage reflection framework. It can generate data-driven drafts and personalized questions to guide users in integrating exercise data with personal insights. A two-week within-subjects study (N=21) compared the LLM-mediated system with a template-based journaling baseline. The LLM-mediated design significantly improved the perceived effectiveness of reflection support and increased users’ intention to use the system. However, perceived ease of use did not improve significantly. Users appreciated the LLM’s scaffolding for easing data sense-making, but also reported added cognitive work in verifying and personalizing the LLM-generated content. Although objective activity levels did not change significantly, the LLM-mediated condition showed a trend toward more adaptive exercise planning and sustained engagement. Our findings provide empirical evidence for a human-LLM collaborative reflection paradigm in a data-intensive exercise context. They highlight both the potential to deepen user reflection and underscore the critical design challenge of balancing automation with meaningful cognitive engagement and user control.
可穿戴设备为身体活动的自我反思提供了丰富的定量数据。然而,用户往往很难从这些数据中获得有意义的见解,这突出了对增强支持的需求。为了研究大型语言模型(llm)是否能促进这一过程,我们提出并评估了一个人类- llm协作反射日志范式。我们开发了PaceMind,这是一个基于llm的日志系统,它基于三阶段反射框架实现了这种范式。它可以生成数据驱动的草稿和个性化问题,引导用户将锻炼数据与个人见解相结合。一项为期两周的受试者研究(N=21)将llm介导的系统与基于模板的日志基线进行了比较。法学硕士介导的设计显著提高了反思支持的感知有效性,增加了用户使用系统的意愿。然而,感知易用性并没有显著提高。用户对LLM简化数据意义构建的框架表示赞赏,但也报告了在验证和个性化LLM生成的内容方面增加的认知工作。虽然客观活动水平没有显著变化,但llm介导的条件显示出更适应性的运动计划和持续参与的趋势。我们的研究结果为数据密集型练习环境中的人类-法学硕士协作反思范式提供了经验证据。他们强调了深化用户反思的潜力,并强调了平衡自动化与有意义的认知参与和用户控制的关键设计挑战。
{"title":"From tracking to thinking: Facilitating post-exercise reflection by a large language model-mediated journaling system","authors":"Xianglin Zhao ,&nbsp;Yucheng Jin ,&nbsp;Annie Yan Wang ,&nbsp;Ming Zhang","doi":"10.1016/j.ipm.2025.104574","DOIUrl":"10.1016/j.ipm.2025.104574","url":null,"abstract":"<div><div>Wearable devices provide rich quantitative data for self-reflection on physical activity. However, users often struggle to derive meaningful insights from these data, highlighting the need for enhanced support. To investigate whether Large Language Models (LLMs) can facilitate this process, we propose and evaluate a human-LLM collaborative reflective journaling paradigm. We developed <em>PaceMind</em>, an LLM-mediated journaling system that implements this paradigm based on a three-stage reflection framework. It can generate data-driven drafts and personalized questions to guide users in integrating exercise data with personal insights. A two-week within-subjects study (<span><math><mrow><mi>N</mi><mo>=</mo><mn>21</mn></mrow></math></span>) compared the LLM-mediated system with a template-based journaling baseline. The LLM-mediated design significantly improved the perceived effectiveness of reflection support and increased users’ intention to use the system. However, perceived ease of use did not improve significantly. Users appreciated the LLM’s scaffolding for easing data sense-making, but also reported added cognitive work in verifying and personalizing the LLM-generated content. Although objective activity levels did not change significantly, the LLM-mediated condition showed a trend toward more adaptive exercise planning and sustained engagement. Our findings provide empirical evidence for a human-LLM collaborative reflection paradigm in a data-intensive exercise context. They highlight both the potential to deepen user reflection and underscore the critical design challenge of balancing automation with meaningful cognitive engagement and user control.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"63 4","pages":"Article 104574"},"PeriodicalIF":6.9,"publicationDate":"2026-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145886181","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Information Processing & Management
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1