首页 > 最新文献

Information Processing & Management最新文献

英文 中文
PIE: A Personalized Information Embedded model for text-based depression detection PIE:基于文本的抑郁检测个性化信息嵌入模型
IF 7.4 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-07-16 DOI: 10.1016/j.ipm.2024.103830

Depression detection based on text analysis has emerged as a research hotspot. Existing research indicates that patients’ personalized characteristics are the primary factor contributing to differences in reported experiences, which poses challenges for automated depression detection methods. To address this, we pioneered defining the fundamental components of personalized information within the text-based depression detection field and proposed the Personalized Information Embedding (PIE) model. The model narrows the gap between generic clinical symptoms and personalized patient experiences in detection, introducing a novel method for computing personalized information representations. Then, we constructed a unique depression intervention dataset containing 108 cases of subjects, the first longitudinally gathering experimental dataset in text-based depression detection. Extensive experimental evidence demonstrates that compared to advanced models, PIE demonstrates statistically significant improvements in performance (with the maximum reductions in RMSE of 0.309 and MAE of 0.232) and generalizability (with standard deviation reductions in RMSE by 75.43% and MAE by 69.77%), and the out-of-domain generalizability of personalized information representations has been validated on two larger external datasets. Additionally, we conducted case studies to analyze how personalized information leads to improved model capabilities. This research serves as a pilot and reference for developing personalized models in text-based depression detection.

基于文本分析的抑郁检测已成为研究热点。现有研究表明,患者的个性化特征是导致报告经历差异的主要因素,这给自动抑郁检测方法带来了挑战。为此,我们率先在基于文本的抑郁检测领域定义了个性化信息的基本组成部分,并提出了个性化信息嵌入(PIE)模型。该模型缩小了通用临床症状与个性化患者检测经验之间的差距,引入了一种计算个性化信息表征的新方法。然后,我们构建了一个包含 108 例受试者的独特抑郁干预数据集,这是首个基于文本的抑郁检测的纵向收集实验数据集。大量实验证据表明,与高级模型相比,PIE 在性能(RMSE 最大降低 0.309,MAE 最大降低 0.232)和泛化能力(RMSE 标准差降低 75.43%,MAE 标准差降低 69.77%)方面都有统计学意义上的显著提高,而且个性化信息表征的域外泛化能力已在两个更大的外部数据集上得到验证。此外,我们还进行了案例研究,分析个性化信息如何提高模型能力。这项研究为开发基于文本的抑郁检测个性化模型提供了试点和参考。
{"title":"PIE: A Personalized Information Embedded model for text-based depression detection","authors":"","doi":"10.1016/j.ipm.2024.103830","DOIUrl":"10.1016/j.ipm.2024.103830","url":null,"abstract":"<div><p>Depression detection based on text analysis has emerged as a research hotspot. Existing research indicates that patients’ personalized characteristics are the primary factor contributing to differences in reported experiences, which poses challenges for automated depression detection methods. To address this, we pioneered defining the fundamental components of personalized information within the text-based depression detection field and proposed the <strong>P</strong>ersonalized <strong>I</strong>nformation <strong>E</strong>mbedding (PIE) model. The model narrows the gap between generic clinical symptoms and personalized patient experiences in detection, introducing a novel method for computing personalized information representations. Then, we constructed a unique depression intervention dataset containing 108 cases of subjects, the first longitudinally gathering experimental dataset in text-based depression detection. Extensive experimental evidence demonstrates that compared to advanced models, PIE demonstrates statistically significant improvements in performance (with the maximum reductions in RMSE of 0.309 and MAE of 0.232) and generalizability (with standard deviation reductions in RMSE by 75.43% and MAE by 69.77%), and the out-of-domain generalizability of personalized information representations has been validated on two larger external datasets. Additionally, we conducted case studies to analyze how personalized information leads to improved model capabilities. This research serves as a pilot and reference for developing personalized models in text-based depression detection.</p></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":null,"pages":null},"PeriodicalIF":7.4,"publicationDate":"2024-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141630172","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CMACF: Transformer-based cross-modal attention cross-fusion model for systemic lupus erythematosus diagnosis combining Raman spectroscopy, FTIR spectroscopy, and metabolomics CMACF:基于变压器的跨模态注意力交叉融合模型,结合拉曼光谱、傅立叶变换红外光谱和代谢组学诊断系统性红斑狼疮
IF 7.4 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-07-15 DOI: 10.1016/j.ipm.2024.103804

As complex multi-omics data in the medical field tend to be multi-modal. Integrating these multimodal information into novel disease diagnosis models has become challenging. However, previous methods mainly focus on single omics, which cannot effectively capture the contributions between different combinations of multi-omics information. To solve this problem, based on Raman spectroscopy, FTIR spectroscopy, and metabolomics data, this paper proposes a new Cross-modal Cross-fusion network based on the Transformer self-attention mechanism (CMACF). The research focuses on effectively combining the feature patterns of different omics for disease prediction. Specifically, by constructing the Raman-IR, Raman-metabolomic, and IR spectral-metabolomic feature pairs and reasonably focusing on the information of different combination pairs through multiple stages of feature sub-network, attention cross-fusion, bimodal interaction, and sequence interaction feature level fusion, it is interesting to find that the information contribution between different pairs is different. We conducted extensive experiments on the systemic lupus erythematosus multi-omics dataset, and the accuracy and AUC values are as high as 99.44 % and 99.98 %, respectively, with the best classification effect. The results show that CMACF can efficiently fuse multi-omics medical data, provide an efficient baseline for processing medical multimodal data, and analyze the contribution of multi-omics data fusion.

由于医学领域复杂的多组学数据往往是多模态的。将这些多模态信息整合到新型疾病诊断模型中已成为一项挑战。然而,以往的方法主要关注单一的组学信息,无法有效捕捉不同组合的多组学信息之间的贡献。为解决这一问题,本文基于拉曼光谱、傅立叶变换红外光谱和代谢组学数据,提出了一种新的基于变换器自注意机制的跨模态交叉融合网络(CMACF)。研究重点是有效结合不同 omics 的特征模式进行疾病预测。具体来说,通过构建拉曼-红外、拉曼-代谢组、红外光谱-代谢组特征对,并通过特征子网络、注意交叉融合、双模交互、序列交互特征级融合等多个阶段合理关注不同组合对的信息,有趣地发现不同组合对之间的信息贡献是不同的。我们在系统性红斑狼疮多组学数据集上进行了大量实验,准确率和 AUC 值分别高达 99.44 % 和 99.98 %,分类效果最佳。结果表明,CMACF 可以高效地融合多组学医学数据,为医学多模态数据的处理提供了一个高效的基线,并分析了多组学数据融合的贡献。
{"title":"CMACF: Transformer-based cross-modal attention cross-fusion model for systemic lupus erythematosus diagnosis combining Raman spectroscopy, FTIR spectroscopy, and metabolomics","authors":"","doi":"10.1016/j.ipm.2024.103804","DOIUrl":"10.1016/j.ipm.2024.103804","url":null,"abstract":"<div><p>As complex multi-omics data in the medical field tend to be multi-modal. Integrating these multimodal information into novel disease diagnosis models has become challenging. However, previous methods mainly focus on single omics, which cannot effectively capture the contributions between different combinations of multi-omics information. To solve this problem, based on Raman spectroscopy, FTIR spectroscopy, and metabolomics data, this paper proposes a new Cross-modal Cross-fusion network based on the Transformer self-attention mechanism (CMACF). The research focuses on effectively combining the feature patterns of different omics for disease prediction. Specifically, by constructing the Raman-IR, Raman-metabolomic, and IR spectral-metabolomic feature pairs and reasonably focusing on the information of different combination pairs through multiple stages of feature sub-network, attention cross-fusion, bimodal interaction, and sequence interaction feature level fusion, it is interesting to find that the information contribution between different pairs is different. We conducted extensive experiments on the systemic lupus erythematosus multi-omics dataset, and the accuracy and AUC values are as high as 99.44 % and 99.98 %, respectively, with the best classification effect. The results show that CMACF can efficiently fuse multi-omics medical data, provide an efficient baseline for processing medical multimodal data, and analyze the contribution of multi-omics data fusion.</p></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":null,"pages":null},"PeriodicalIF":7.4,"publicationDate":"2024-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141630170","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sensing the diversity of rumors: Rumor detection with hierarchical prototype contrastive learning 感知谣言的多样性:利用分层原型对比学习检测谣言
IF 7.4 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-07-14 DOI: 10.1016/j.ipm.2024.103832

The proliferation of rumors on social networks poses a serious threat to cybersecurity, justice and public trust, increasing the urgent need for rumor detection. Existing detection methods typically treat all rumors as a single homogeneous category, neglecting the diverse semantic hierarchies within rumors. Rumors pervade various domains, each with its distinct characteristics. These methods tend to lag in expressiveness when confronted with real-world scenarios involving multiple semantic levels. Furthermore, the diversity of rumors also complicates the collection of datasets, and inevitably introduces noisy data, which hinders the correctness of the learned representations. To address these challenges, we propose a rumor detection framework with Hierarchical Prototype Contrastive Learning (HPCL). In this framework, we construct a set of dynamically updated hierarchical prototypes through contrastive learning to encourage capturing the hierarchical semantic structure within rumors. Additionally, we design a difficulty metric function based on the distance between instances and prototypes, and introduce curriculum learning to mitigate the adverse effects of noisy data. Experiments on four public datasets demonstrate that our approach achieves state-of-the-art performance. Our code is publicly released at https://github.com/Coder-HenryZa/HPCL.

社交网络上谣言的泛滥对网络安全、司法公正和公众信任构成了严重威胁,因此对谣言检测的需求日益迫切。现有的检测方法通常将所有谣言视为单一的同质类别,忽略了谣言内部不同的语义层次。谣言遍布各个领域,每个领域都有其独特的特征。这些方法在面对涉及多个语义层次的真实世界场景时,往往表现力不足。此外,谣言的多样性也使数据集的收集变得复杂,并不可避免地引入了噪声数据,从而影响了所学表征的正确性。为了应对这些挑战,我们提出了一个采用分层原型对比学习(Hierarchical Prototype Contrastive Learning,HPCL)的谣言检测框架。在这个框架中,我们通过对比学习构建了一组动态更新的分层原型,以鼓励捕捉谣言中的分层语义结构。此外,我们还根据实例与原型之间的距离设计了一个难度度量函数,并引入课程学习来减轻噪声数据的不利影响。在四个公共数据集上的实验证明,我们的方法达到了最先进的性能。我们的代码已在 https://github.com/Coder-HenryZa/HPCL 上公开发布。
{"title":"Sensing the diversity of rumors: Rumor detection with hierarchical prototype contrastive learning","authors":"","doi":"10.1016/j.ipm.2024.103832","DOIUrl":"10.1016/j.ipm.2024.103832","url":null,"abstract":"<div><p>The proliferation of rumors on social networks poses a serious threat to cybersecurity, justice and public trust, increasing the urgent need for rumor detection. Existing detection methods typically treat all rumors as a single homogeneous category, neglecting the diverse semantic hierarchies within rumors. Rumors pervade various domains, each with its distinct characteristics. These methods tend to lag in expressiveness when confronted with real-world scenarios involving multiple semantic levels. Furthermore, the diversity of rumors also complicates the collection of datasets, and inevitably introduces noisy data, which hinders the correctness of the learned representations. To address these challenges, we propose a rumor detection framework with Hierarchical Prototype Contrastive Learning (HPCL). In this framework, we construct a set of dynamically updated hierarchical prototypes through contrastive learning to encourage capturing the hierarchical semantic structure within rumors. Additionally, we design a difficulty metric function based on the distance between instances and prototypes, and introduce curriculum learning to mitigate the adverse effects of noisy data. Experiments on four public datasets demonstrate that our approach achieves state-of-the-art performance. Our code is publicly released at <span><span>https://github.com/Coder-HenryZa/HPCL</span><svg><path></path></svg></span>.</p></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":null,"pages":null},"PeriodicalIF":7.4,"publicationDate":"2024-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141630514","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Situation-aware empathetic response generation 情境感知移情反应生成
IF 7.4 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-07-12 DOI: 10.1016/j.ipm.2024.103824

Empathetic response generation endeavours to perceive the interlocutor’s emotional and cognitive states in the dialogue and express proper responses. Previous studies detect the interlocutor’s states by understanding the immediate context of the dialogue. However, these methods are at an elementary/intermediate level of empathetic understanding due to the neglect of the broader context (i.e., the situation) and its associations with the dialogue, leading to inaccurate comprehension of the interlocutor’s states. In this paper, we utilize the EMPATHETIC-DIALOGUES dataset consisting of 25k dialogues, and on this basis, we propose a Situation-Dialogue Association Model (SDAM). SDAM focuses on the broader context, i.e., the situation, and enhances the understanding of empathy from explicit and implicit associations. Regarding explicit associations, we propose a bidirectional filtering encoder. It selects relevant keywords between the situation and dialogue, learning their direct lexical relevance. For implicit associations, we use a knowledge-based hypergraph network grounded to learn convoluted connections between the situation and the dialogue. Moreover, we also introduce a simple fine-tuning approach that combines SDAM with large language models to further strengthen the empathetic understanding capability. Compared to the baseline, SDAM demonstrates superior empathetic ability. In terms of emotion accuracy, fluency, and response diversity (Distinct-1/Distinct-2), SDAM achieves improvements of 12.25 (a 30.47% increase), 0.3 (a 0.85% increase), and 0.86/1.23 (116.22% and 30.67% increases), respectively. Additionally, our variant model based on large language models exhibits better emotion recognition capability without compromising response quality, specifically achieving an improvement of 0.23 (a 0.37% increase) in emotion accuracy.

移情反应生成致力于感知对话者在对话中的情感和认知状态,并表达适当的反应。以往的研究通过理解对话的直接语境来检测对话者的状态。然而,这些方法由于忽视了更广泛的语境(即情境)及其与对话的关联,导致对对话者状态的理解不准确,处于初级/中级移情理解水平。本文利用由 25k 对话组成的 EMPATHETIC-DIALOGUES 数据集,在此基础上提出了情境-对话关联模型(Situation-Dialogue Association Model,SDAM)。SDAM 侧重于更广泛的语境,即情境,并从显性和隐性关联中加强对移情的理解。关于显性关联,我们提出了一种双向过滤编码器。它可以选择情境和对话之间的相关关键词,学习它们的直接词汇相关性。对于隐性关联,我们使用基于知识的超图网络来学习情境和对话之间的复杂联系。此外,我们还引入了一种简单的微调方法,将 SDAM 与大型语言模型相结合,以进一步加强移情理解能力。与基线相比,SDAM 表现出了卓越的移情能力。在情感准确性、流畅性和反应多样性(Distinct-1/Distinct-2)方面,SDAM 分别提高了 12.25(提高了 30.47%)、0.3(提高了 0.85%)和 0.86/1.23(提高了 116.22% 和 30.67%)。此外,我们基于大型语言模型的变体模型在不影响响应质量的情况下表现出了更好的情感识别能力,特别是在情感准确率方面提高了 0.23(提高了 0.37%)。
{"title":"Situation-aware empathetic response generation","authors":"","doi":"10.1016/j.ipm.2024.103824","DOIUrl":"10.1016/j.ipm.2024.103824","url":null,"abstract":"<div><p>Empathetic response generation endeavours to perceive the interlocutor’s emotional and cognitive states in the dialogue and express proper responses. Previous studies detect the interlocutor’s states by understanding the immediate context of the dialogue. However, these methods are at an elementary/intermediate level of empathetic understanding due to the neglect of the broader context (i.e., the situation) and its associations with the dialogue, leading to inaccurate comprehension of the interlocutor’s states. In this paper, we utilize the EMPATHETIC-DIALOGUES dataset consisting of 25k dialogues, and on this basis, we propose a Situation-Dialogue Association Model (SDAM). SDAM focuses on the broader context, i.e., the situation, and enhances the understanding of empathy from explicit and implicit associations. Regarding explicit associations, we propose a bidirectional filtering encoder. It selects relevant keywords between the situation and dialogue, learning their direct lexical relevance. For implicit associations, we use a knowledge-based hypergraph network grounded to learn convoluted connections between the situation and the dialogue. Moreover, we also introduce a simple fine-tuning approach that combines SDAM with large language models to further strengthen the empathetic understanding capability. Compared to the baseline, SDAM demonstrates superior empathetic ability. In terms of emotion accuracy, fluency, and response diversity (Distinct-1/Distinct-2), SDAM achieves improvements of 12.25 (a 30.47% increase), 0.3 (a 0.85% increase), and 0.86/1.23 (116.22% and 30.67% increases), respectively. Additionally, our variant model based on large language models exhibits better emotion recognition capability without compromising response quality, specifically achieving an improvement of 0.23 (a 0.37% increase) in emotion accuracy.</p></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":null,"pages":null},"PeriodicalIF":7.4,"publicationDate":"2024-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141630513","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
FEDS-ICL: Enhancing translation ability and efficiency of large language model by optimizing demonstration selection FEDS-ICL:通过优化示范选择提高大型语言模型的翻译能力和效率
IF 7.4 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-07-03 DOI: 10.1016/j.ipm.2024.103825
Shaolin Zhu, Leiyu Pan, Deyi Xiong

Large language models (LLMs) that exhibit a remarkable ability by in-context learning (ICL) with bilingual demonstrations have been recognized as a potential solution for machine translation. However, the process of selecting these demonstrations from vast datastores is notoriously time-consuming and inefficient. Moreover, the strategies for designing effective in-context demonstrations are not well-established. To address these critical gaps, we introduce a novel Fast and Effective approach for Demonstration Selection in-Context learning (FEDS-ICL) tailored to LLMs. Our method is designed to mainly enhance the efficiency and accuracy of translation of LLMs. Our approach revolutionizes demonstration selection by designing new product quantization technique that rapidly extracts neighboring target tokens from a strategically curated subset of sentences. This method significantly deviates from the conventional exhaustive search across entire datastores, leading to a remarkable increase in speed. Furthermore, FEDS-ICL pioneers an innovative template design for in-context demonstrations, specifically crafted to amplify the translation capabilities of multilingual LLMs. In experiments, we compare our FEDS-ICL with various existing methods on across diverse language pairs on ten different LLMs. The results reveal an up to 2.1-fold increase in selection speed and an impressive enhancement in translation accuracy, outperforming existing baselines by up to 2.0 BLEU points at least on ten different LLMs. The ablation study show the proposed product quantization and multi-view demonstration can effectively enhance the efficiency and accuracy of LLMs in machine translation. The analysis on robustness of FEDS-ICL shows that the incorporation of a greater number of demonstrations can lead a positive correlation between the quantity of contextually rich demonstrations and the translation quality of LLMs. These advancements position FEDS-ICL as a transformative methodology in the domain of machine translation and pattern analysis, marking a significant leap towards more efficient and precise machine translation.

大型语言模型(LLMs)通过双语示范的上下文学习(ICL)展现出非凡的能力,已被视为机器翻译的潜在解决方案。然而,从庞大的数据库中选择这些示例的过程耗时长、效率低是众所周知的。此外,设计有效的上下文示范的策略也不成熟。为了解决这些关键问题,我们引入了一种专为 LLM 量身定制的快速有效的上下文学习演示选择方法(FEDS-ICL)。我们的方法主要是为了提高 LLM 翻译的效率和准确性。我们的方法通过设计新的产品量化技术,从经过战略策划的句子子集中快速提取相邻的目标标记,从而彻底改变了示范选择。这种方法大大偏离了传统的对整个数据存储进行穷举搜索的方法,从而显著提高了速度。此外,FEDS-ICL 还开创了用于上下文演示的创新模板设计,专门用于增强多语言 LLM 的翻译能力。在实验中,我们将 FEDS-ICL 与现有的各种方法进行了比较,这些方法适用于十种不同 LLM 上的不同语言对。结果表明,我们的选择速度提高了 2.1 倍,翻译准确性也得到了显著提升,在十种不同的 LLM 上,我们的翻译准确性至少比现有基线高出 2.0 个 BLEU 点。消融研究表明,建议的乘积量化和多视图演示可以有效提高机器翻译中 LLM 的效率和准确性。对 FEDS-ICL 鲁棒性的分析表明,加入更多的演示可以使上下文丰富的演示数量与 LLM 的翻译质量呈正相关。这些进步将 FEDS-ICL 定位为机器翻译和模式分析领域的变革性方法,标志着向更高效、更精确的机器翻译迈出了重要一步。
{"title":"FEDS-ICL: Enhancing translation ability and efficiency of large language model by optimizing demonstration selection","authors":"Shaolin Zhu,&nbsp;Leiyu Pan,&nbsp;Deyi Xiong","doi":"10.1016/j.ipm.2024.103825","DOIUrl":"https://doi.org/10.1016/j.ipm.2024.103825","url":null,"abstract":"<div><p>Large language models (LLMs) that exhibit a remarkable ability by in-context learning (ICL) with bilingual demonstrations have been recognized as a potential solution for machine translation. However, the process of selecting these demonstrations from vast datastores is notoriously time-consuming and inefficient. Moreover, the strategies for designing effective in-context demonstrations are not well-established. To address these critical gaps, we introduce a novel Fast and Effective approach for Demonstration Selection in-Context learning (FEDS-ICL) tailored to LLMs. Our method is designed to mainly enhance the efficiency and accuracy of translation of LLMs. Our approach revolutionizes demonstration selection by designing new product quantization technique that rapidly extracts neighboring target tokens from a strategically curated subset of sentences. This method significantly deviates from the conventional exhaustive search across entire datastores, leading to a remarkable increase in speed. Furthermore, FEDS-ICL pioneers an innovative template design for in-context demonstrations, specifically crafted to amplify the translation capabilities of multilingual LLMs. In experiments, we compare our FEDS-ICL with various existing methods on across diverse language pairs on ten different LLMs. The results reveal an up to 2.1-fold increase in selection speed and an impressive enhancement in translation accuracy, outperforming existing baselines by up to 2.0 BLEU points at least on ten different LLMs. The ablation study show the proposed product quantization and multi-view demonstration can effectively enhance the efficiency and accuracy of LLMs in machine translation. The analysis on robustness of FEDS-ICL shows that the incorporation of a greater number of demonstrations can lead a positive correlation between the quantity of contextually rich demonstrations and the translation quality of LLMs. These advancements position FEDS-ICL as a transformative methodology in the domain of machine translation and pattern analysis, marking a significant leap towards more efficient and precise machine translation.</p></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":null,"pages":null},"PeriodicalIF":7.4,"publicationDate":"2024-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141539758","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Incorporating target-aware knowledge into prompt-tuning for few-shot stance detection 将目标感知知识纳入几发姿态检测的提示调整中
IF 7.4 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-07-02 DOI: 10.1016/j.ipm.2024.103815
Shaokang Wang , Fuhui Sun , Xiaoyan Wang , Li Pan

Stance detection, a fundamental task in natural language processing, identifies user stances in texts towards specific targets. The diverse targets and ever-changing expressions make it challenging to attain comprehensive knowledge from limited data. Existing methods focus on incorporating supplementary knowledge, neglecting fusion consistency during training, which is critical for preserving the rationality of the inference. In this paper, we introduce TAP, a novel approach for few-shot stance detection. TAP extends the verbalizer hierarchically, a mapping function in prompt-tuning. Constructed using a log-odds ratio of topics and targets, the verbalizer refines candidates with prior knowledge, forming the foundation for subsequent hierarchical distillation. The hierarchical distillation, a technique based on pilot experiments on the hierarchical verbalizer, ensures the fusion of diverse knowledge during prompt-tuning, maintaining consistency throughout the training process. Notably, TAP constructs verbalizers without external knowledge augmentation. The hierarchical distillation involves a joint loss function, contributing to the model’s robustness and training consistency. Extensive experiments are conducted on SemEval2016t6 and ArgMin datasets with 13 different targets. The proposed method is evaluated on various few-shot and full-data settings with F1-Macro and F1-Micro scores. On average, TAP achieves overall improvements of 4.71% and 3.76% over state-of-the-art baselines on Semeval2016t6 and ArgMin datasets, respectively, in few-shot scenarios.

姿态检测是自然语言处理中的一项基本任务,用于识别文本中用户对特定目标的姿态。目标多种多样,表达方式千变万化,要从有限的数据中获得全面的知识具有挑战性。现有方法侧重于纳入补充知识,忽略了训练过程中的融合一致性,而这对于保持推理的合理性至关重要。在本文中,我们介绍了 TAP,这是一种用于少量语态检测的新方法。TAP 对动词化器进行了分层扩展,这是提示调整中的一个映射函数。口头化器使用主题和目标的对数比率构建,利用先验知识完善候选词,为后续的分层提炼奠定基础。分层提炼是一种基于分层口头表达器试点实验的技术,可确保在提示调整过程中融合各种知识,从而在整个训练过程中保持一致性。值得注意的是,TAP 无需外部知识扩充即可构建言语表达器。分层蒸馏涉及联合损失函数,有助于提高模型的鲁棒性和训练的一致性。在 SemEval2016t6 和 ArgMin 数据集上对 13 个不同目标进行了广泛的实验。所提出的方法在 F1-Macro 和 F1-Micro 分数的各种少量和全数据设置上进行了评估。平均而言,在 Semeval2016t6 和 ArgMin 数据集上,TAP 在少量数据场景下的整体性能分别比最先进的基线提高了 4.71% 和 3.76%。
{"title":"Incorporating target-aware knowledge into prompt-tuning for few-shot stance detection","authors":"Shaokang Wang ,&nbsp;Fuhui Sun ,&nbsp;Xiaoyan Wang ,&nbsp;Li Pan","doi":"10.1016/j.ipm.2024.103815","DOIUrl":"https://doi.org/10.1016/j.ipm.2024.103815","url":null,"abstract":"<div><p>Stance detection, a fundamental task in natural language processing, identifies user stances in texts towards specific targets. The diverse targets and ever-changing expressions make it challenging to attain comprehensive knowledge from limited data. Existing methods focus on incorporating supplementary knowledge, neglecting fusion consistency during training, which is critical for preserving the rationality of the inference. In this paper, we introduce TAP, a novel approach for few-shot stance detection. TAP extends the verbalizer hierarchically, a mapping function in prompt-tuning. Constructed using a log-odds ratio of topics and targets, the verbalizer refines candidates with prior knowledge, forming the foundation for subsequent hierarchical distillation. The hierarchical distillation, a technique based on pilot experiments on the hierarchical verbalizer, ensures the fusion of diverse knowledge during prompt-tuning, maintaining consistency throughout the training process. Notably, TAP constructs verbalizers without external knowledge augmentation. The hierarchical distillation involves a joint loss function, contributing to the model’s robustness and training consistency. Extensive experiments are conducted on SemEval2016t6 and ArgMin datasets with 13 different targets. The proposed method is evaluated on various few-shot and full-data settings with F1-Macro and F1-Micro scores. On average, TAP achieves overall improvements of 4.71% and 3.76% over state-of-the-art baselines on Semeval2016t6 and ArgMin datasets, respectively, in few-shot scenarios.</p></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":null,"pages":null},"PeriodicalIF":7.4,"publicationDate":"2024-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141486479","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Chinese nested entity recognition method for the finance domain based on heterogeneous graph network 基于异构图网络的中文金融领域嵌套实体识别方法
IF 7.4 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-07-01 DOI: 10.1016/j.ipm.2024.103812
Han Zhang , Yiping Dang , Yazhou Zhang , Siyuan Liang , Junxiu Liu , Lixia Ji

In the finance domain, nested named entities recognition has become a hot topic in named entity recognition tasks. Traditional nested entity recognition methods easily ignore the dependency relationships between entities, and these methods are mostly suitable for English general domain. Therefore, we propose a Chinese nested entity recognition method for the finance domain based on heterogeneous graph network(HGFNER). This method consists of two parts: the boundary division model of candidate entities and the internal relationship graph model of candidate entities. First, the boundary division model of candidate entities that introduces expert knowledge is used to partition the flat entities contained in the text and segment the text to address issues such as long entity boundaries and strong domain features in the Chinese finance domain. Then, by using heterogeneous graphs to represent the internal structure of entities from both spatial and syntactic dependencies to achieve the goal of learning dependency relationships between entities from multiple perspectives. Meanwhile, so as not to affect the operational efficiency of the model, we also propose a fast matching algorithm DAAC_BM for n-gram sequences in domain dictionaries to solve the problems of memory overflow and space waste faced by multi-pattern fast matching algorithms in Chinese matching. In addition, we propose a Chinese nested entity dataset CFNE for the financial field, which, as far as we know, is the first publicly available annotated dataset in the field. HGFNER achieves state-of-the-art macro-F1 value on CFNE, reaching 86.41%.

在金融领域,嵌套命名实体识别已成为命名实体识别任务中的热门话题。传统的嵌套实体识别方法容易忽略实体之间的依赖关系,而且这些方法大多适用于英语通用领域。因此,我们提出了一种基于异构图网络(HGFNER)的金融领域中文嵌套实体识别方法。该方法由两部分组成:候选实体的边界划分模型和候选实体的内部关系图模型。首先,通过引入专家知识的候选实体边界划分模型,对文本中包含的平面实体进行划分,并针对中文金融领域实体边界长、领域特征强等问题对文本进行分割。然后,利用异构图从空间依赖和句法依赖两方面来表示实体的内部结构,实现从多角度学习实体间依赖关系的目标。同时,为了不影响模型的运行效率,我们还提出了针对领域词典中 n-gram 序列的快速匹配算法 DAAC_BM,以解决中文匹配中多模式快速匹配算法面临的内存溢出和空间浪费问题。此外,我们还提出了金融领域的中文嵌套实体数据集 CFNE,据我们所知,这是金融领域第一个公开的注释数据集。HGFNER 在 CFNE 上实现了最先进的宏 F1 值,达到了 86.41%。
{"title":"Chinese nested entity recognition method for the finance domain based on heterogeneous graph network","authors":"Han Zhang ,&nbsp;Yiping Dang ,&nbsp;Yazhou Zhang ,&nbsp;Siyuan Liang ,&nbsp;Junxiu Liu ,&nbsp;Lixia Ji","doi":"10.1016/j.ipm.2024.103812","DOIUrl":"https://doi.org/10.1016/j.ipm.2024.103812","url":null,"abstract":"<div><p>In the finance domain, nested named entities recognition has become a hot topic in named entity recognition tasks. Traditional nested entity recognition methods easily ignore the dependency relationships between entities, and these methods are mostly suitable for English general domain. Therefore, we propose a Chinese nested entity recognition method for the finance domain based on heterogeneous graph network(HGFNER). This method consists of two parts: the boundary division model of candidate entities and the internal relationship graph model of candidate entities. First, the boundary division model of candidate entities that introduces expert knowledge is used to partition the flat entities contained in the text and segment the text to address issues such as long entity boundaries and strong domain features in the Chinese finance domain. Then, by using heterogeneous graphs to represent the internal structure of entities from both spatial and syntactic dependencies to achieve the goal of learning dependency relationships between entities from multiple perspectives. Meanwhile, so as not to affect the operational efficiency of the model, we also propose a fast matching algorithm DAAC_BM for n-gram sequences in domain dictionaries to solve the problems of memory overflow and space waste faced by multi-pattern fast matching algorithms in Chinese matching. In addition, we propose a Chinese nested entity dataset CFNE for the financial field, which, as far as we know, is the first publicly available annotated dataset in the field. HGFNER achieves state-of-the-art macro-F1 value on CFNE, reaching 86.41%.</p></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":null,"pages":null},"PeriodicalIF":7.4,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141486478","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Monetizing entrepreneur response to crowdfunding with text analytics 利用文本分析将企业家对众筹的回应货币化
IF 7.4 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-07-01 DOI: 10.1016/j.ipm.2024.103818
Wei Wang , Yuting Xu , Yenchun Jim Wu , Mark Goh

This paper examines the role of response in crowdfunding to guide fundraisers to monetize their responses better. In all, 6,405 commenters on a large crowdfunding platform in China (Modian.com) are observed. Grounded on the interaction texts, we identify, summarize, and extract the comments and the strategies for fundraisers to respond through several rounds of coding. It is found that potential investors comment to collect information or to express themselves, while fundraisers are either project-oriented or investor-oriented when responding to online reviews. We adopt the Naïve Bayes classifier to classify the crowdfunding demand of commenters, and entrepreneur's response strategy, as being either investor-oriented (60.8 %) or project-oriented (39.2 %). Econometric models are constructed to empirically test and quantify the impact of fundraiser response strategies on commenter investment decisions. The findings inform that commenters who are responded to by fundraisers are about 4 times more likely to invest than those who receive no response or receive responses from peer commenters. Also, a fundraiser project-oriented response strategy achieves 2.25 times better crowdfunding performance over an investor-oriented one. Robustness testing is performed using other criteria for classifying response strategies, as well as testing with a sample from another platform.

本文探讨了回复在众筹中的作用,以指导筹款者更好地将回复货币化。我们对中国某大型众筹平台(摩店网)上的 6405 位评论者进行了观察。我们以互动文本为基础,通过多轮编码,识别、总结并提取了评论内容和筹资者的回应策略。研究发现,潜在投资者发表评论是为了收集信息或表达自己的观点,而筹款人在回应在线评论时,要么以项目为导向,要么以投资者为导向。我们采用 Naïve Bayes 分类器将评论者的众筹需求和创业者的回应策略划分为投资者导向型(60.8%)或项目导向型(39.2%)。通过构建计量经济学模型,实证检验并量化了众筹者回应策略对评论者投资决策的影响。研究结果表明,得到筹款人回复的评论者进行投资的可能性比没有得到回复或得到同行评论者回复的评论者高出约 4 倍。此外,以募捐者项目为导向的回应策略比以投资者为导向的回应策略的众筹绩效高出 2.25 倍。使用其他响应策略分类标准进行了稳健性测试,并使用另一个平台的样本进行了测试。
{"title":"Monetizing entrepreneur response to crowdfunding with text analytics","authors":"Wei Wang ,&nbsp;Yuting Xu ,&nbsp;Yenchun Jim Wu ,&nbsp;Mark Goh","doi":"10.1016/j.ipm.2024.103818","DOIUrl":"https://doi.org/10.1016/j.ipm.2024.103818","url":null,"abstract":"<div><p>This paper examines the role of response in crowdfunding to guide fundraisers to monetize their responses better. In all, 6,405 commenters on a large crowdfunding platform in China (<em>Modian.com</em>) are observed. Grounded on the interaction texts, we identify, summarize, and extract the comments and the strategies for fundraisers to respond through several rounds of coding. It is found that potential investors comment to collect information or to express themselves, while fundraisers are either project-oriented or investor-oriented when responding to online reviews. We adopt the Naïve Bayes classifier to classify the crowdfunding demand of commenters, and entrepreneur's response strategy, as being either investor-oriented (60.8 %) or project-oriented (39.2 %). Econometric models are constructed to empirically test and quantify the impact of fundraiser response strategies on commenter investment decisions. The findings inform that commenters who are responded to by fundraisers are about 4 times more likely to invest than those who receive no response or receive responses from peer commenters. Also, a fundraiser project-oriented response strategy achieves 2.25 times better crowdfunding performance over an investor-oriented one. Robustness testing is performed using other criteria for classifying response strategies, as well as testing with a sample from another platform.</p></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":null,"pages":null},"PeriodicalIF":7.4,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141486489","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing graph neural networks for self-explainable modeling: A causal perspective with multi-granularity receptive fields 增强图神经网络的自解释建模功能:多粒度感受野的因果视角
IF 7.4 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-07-01 DOI: 10.1016/j.ipm.2024.103821
Yuan Li, Li Liu, Penggang Chen, Chenglin Zhang, Guoyin Wang

Self-explainable Graph Neural Networks (GNNs) provide explanations alongside their predictions, making the model transparent and facilitating their wide adoption in high-stakes tasks. Current studies on constructing such GNNs are limited by the single receptive field, resulting in the modeling of spurious correlations in self-explainable GNNs. To address this issue, this paper introduces a GNN model with incorporated multi-granularity receptive fields, capturing causal correlations during the model construction and providing explanations alongside its predictions. Specifically, we employ closeness matrices with multiple structural orders to construct multi-granularity receptive fields for the model. Subsequently, we design a model architecture with sliced channels to integrate representations learned from multiple receptive fields heuristically. Objective functions from a causal perspective are further designed to guide the optimization of the proposed model. Experiments conducted on five real-world datasets and one synthetic dataset demonstrate the superior performance of the proposed model. In terms of classification accuracy, compared to SOTA baseline, the proposed GNN achieves the improvement of 0.17%, 1.99%, 0.70%, 0.83%, and 0.78% on the real-world datasets MUTAG, PTC, PROTEINS, IMDB-M, and IMDB-B, respectively. Compared to three self-explainable baselines, qualitative and quantitative studies are conducted on MUTAG, PTC, PROTEINS, IMDB-M, IMDB-B, and the synthetic dataset Spurious-Motif. Experimental results confirm that the proposed model can accurately identify the essential substructures, such as NO2 in the MUTAG dataset. Additionally, the proposed model assigns significant weights to the motif part and distinguishes it from the base part in the Spurious-Motif dataset, enhancing the accuracy of graph classification and the explanations of the predictions. The classifications along with explanations obtained with this approach align with human cognition and experience.

可自我解释的图神经网络(GNNs)在预测的同时还提供解释,使模型透明化,有利于在高风险任务中广泛采用。目前有关构建此类图神经网络的研究受到单一感受野的限制,导致在可自我解释的图神经网络中出现虚假相关性建模。为了解决这个问题,本文引入了一个包含多粒度感受野的 GNN 模型,在构建模型的过程中捕捉因果相关性,并在预测的同时提供解释。具体来说,我们采用具有多种结构阶数的接近度矩阵来构建模型的多粒度感受野。随后,我们设计了一个具有分片通道的模型架构,以启发式整合从多个感受野中学习到的表征。我们还进一步设计了从因果角度出发的目标函数,以指导优化所提出的模型。在五个真实世界数据集和一个合成数据集上进行的实验证明了所提模型的卓越性能。在分类准确率方面,与 SOTA 基线相比,所提出的 GNN 在真实世界数据集 MUTAG、PTC、PROTEINS、IMDB-M 和 IMDB-B 上分别提高了 0.17%、1.99%、0.70%、0.83% 和 0.78%。与三个可自行解释的基线相比,在 MUTAG、PTC、PROTEINS、IMDB-M、IMDB-B 和合成数据集 Spurious-Motif 上进行了定性和定量研究。实验结果证实,所提出的模型可以准确地识别重要的子结构,如 MUTAG 数据集中的 NO2。此外,在 Spurious-Motif 数据集中,所提出的模型为图案部分分配了重要权重,并将其与基底部分区分开来,从而提高了图分类的准确性和预测解释的准确性。这种方法获得的分类和解释与人类的认知和经验相吻合。
{"title":"Enhancing graph neural networks for self-explainable modeling: A causal perspective with multi-granularity receptive fields","authors":"Yuan Li,&nbsp;Li Liu,&nbsp;Penggang Chen,&nbsp;Chenglin Zhang,&nbsp;Guoyin Wang","doi":"10.1016/j.ipm.2024.103821","DOIUrl":"https://doi.org/10.1016/j.ipm.2024.103821","url":null,"abstract":"<div><p>Self-explainable Graph Neural Networks (GNNs) provide explanations alongside their predictions, making the model transparent and facilitating their wide adoption in high-stakes tasks. Current studies on constructing such GNNs are limited by the single receptive field, resulting in the modeling of spurious correlations in self-explainable GNNs. To address this issue, this paper introduces a GNN model with incorporated multi-granularity receptive fields, capturing causal correlations during the model construction and providing explanations alongside its predictions. Specifically, we employ closeness matrices with multiple structural orders to construct multi-granularity receptive fields for the model. Subsequently, we design a model architecture with sliced channels to integrate representations learned from multiple receptive fields heuristically. Objective functions from a causal perspective are further designed to guide the optimization of the proposed model. Experiments conducted on five real-world datasets and one synthetic dataset demonstrate the superior performance of the proposed model. In terms of classification accuracy, compared to SOTA baseline, the proposed GNN achieves the improvement of 0.17%, 1.99%, 0.70%, 0.83%, and 0.78% on the real-world datasets MUTAG, PTC, PROTEINS, IMDB-M, and IMDB-B, respectively. Compared to three self-explainable baselines, qualitative and quantitative studies are conducted on MUTAG, PTC, PROTEINS, IMDB-M, IMDB-B, and the synthetic dataset Spurious-Motif. Experimental results confirm that the proposed model can accurately identify the essential substructures, such as <span><math><mrow><mi>N</mi><msub><mrow><mi>O</mi></mrow><mrow><mn>2</mn></mrow></msub></mrow></math></span> in the MUTAG dataset. Additionally, the proposed model assigns significant weights to the motif part and distinguishes it from the base part in the Spurious-Motif dataset, enhancing the accuracy of graph classification and the explanations of the predictions. The classifications along with explanations obtained with this approach align with human cognition and experience.</p></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":null,"pages":null},"PeriodicalIF":7.4,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141486463","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Span-level bidirectional retention scheme for aspect sentiment triplet extraction 用于方面情感三连音提取的跨度级双向保留方案
IF 7.4 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-06-29 DOI: 10.1016/j.ipm.2024.103823
Xuan Yang , Tao Peng , Haijia Bi , Jiayu Han

The objective of the Aspect Sentiment Triplet Extraction (ASTE) task is to identify triplets of (aspect, opinion, sentiment) from user-generated reviews. The current study does not extensively integrate the interaction between word pairs and aspect-opinion pairs during the learning process at the granularity of sentence analysis. Furthermore, the bidirectional inference for the triplet, along with the parallel computing approach for long-span texts, also fail to achieve efficient unification. We introduce a new perspective: Span-level Bidirectional Retention Scheme(SBRS) for Aspect Sentiment Triplet Extraction model. The model comprises two pathways. The first pathway involves extracting effective aspect-opinion pair outcomes via two progressive submodules that operate on words and word pairs at varying scales. Building on the first pathway, the second pathway senses the interaction information of word pairs through bidirectional recursion and combines an efficient parallel computing approach. This combination allows the model to utilize three features – context, semantics, and relationship – to accurately identify the sentimental orientation. Thus, the two pathways facilitate the learning of relation-aware representations of word pairs. We carried out experiments on two public datasets, showing an average enhancement of 3.34% and 1.72% in F1 scores compared to the most recent baselines models, and multiple experiments from diverse angles proved the model’s superiority.

方面-情感三元组提取(ASTE)任务的目标是从用户生成的评论中识别(方面、观点、情感)三元组。目前的研究并未在句子分析的粒度上广泛整合学习过程中词对与方面-观点对之间的交互。此外,三元组的双向推理以及长跨度文本的并行计算方法也无法实现高效统一。我们引入了一个新的视角:跨度级双向保留方案(SBRS)的三重情感提取模型。该模型包括两个途径。第一条途径是通过两个渐进的子模块,以不同的尺度对词和词对进行操作,从而提取有效的方面-观点对结果。在第一条路径的基础上,第二条路径通过双向递归感知词对的交互信息,并结合高效的并行计算方法。这种组合使模型能够利用语境、语义和关系这三种特征来准确识别情感取向。因此,这两种途径有助于学习词对的关系感知表征。我们在两个公开数据集上进行了实验,结果表明,与最新的基线模型相比,F1 分数平均提高了 3.34% 和 1.72%,而且多个角度的实验证明了该模型的优越性。
{"title":"Span-level bidirectional retention scheme for aspect sentiment triplet extraction","authors":"Xuan Yang ,&nbsp;Tao Peng ,&nbsp;Haijia Bi ,&nbsp;Jiayu Han","doi":"10.1016/j.ipm.2024.103823","DOIUrl":"https://doi.org/10.1016/j.ipm.2024.103823","url":null,"abstract":"<div><p>The objective of the Aspect Sentiment Triplet Extraction (ASTE) task is to identify triplets of (aspect, opinion, sentiment) from user-generated reviews. The current study does not extensively integrate the interaction between word pairs and aspect-opinion pairs during the learning process at the granularity of sentence analysis. Furthermore, the bidirectional inference for the triplet, along with the parallel computing approach for long-span texts, also fail to achieve efficient unification. We introduce a new perspective: <em>Span-level Bidirectional Retention Scheme(SBRS) for Aspect Sentiment Triplet Extraction model</em>. The model comprises two pathways. The first pathway involves extracting effective aspect-opinion pair outcomes via two progressive submodules that operate on words and word pairs at varying scales. Building on the first pathway, the second pathway senses the interaction information of word pairs through bidirectional recursion and combines an efficient parallel computing approach. This combination allows the model to utilize three features – context, semantics, and relationship – to accurately identify the sentimental orientation. Thus, the two pathways facilitate the learning of relation-aware representations of word pairs. We carried out experiments on two public datasets, showing an average enhancement of 3.34% and 1.72% in F1 scores compared to the most recent baselines models, and multiple experiments from diverse angles proved the model’s superiority.</p></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":null,"pages":null},"PeriodicalIF":7.4,"publicationDate":"2024-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141486459","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Information Processing & Management
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1