Person re-identification (ReID) under real-world multi-modal settings remains constrained by the lack of unified, diverse datasets and modality-aware learning strategies. To bridge this gap, we propose Multi-modal ReID (MM-ReID), a large-scale dataset encompassing 8,537 unique identities, 0.85 million images spanning three aligned image modalities (RGB, Infrared (IR), and Thermal Infrared (TI)), and 0.83 million natural language descriptions. MM-ReID captures diverse scenarios, including indoor/outdoor scenes, cross-camera views, day and night conditions, and clothing changes, offering a comprehensive foundation for multi-modal ReID research. To build a unified multi-modal person ReID model, we introduce Cross-Modal Semantic Anchoring (CMSA): CMSA injects fixed vision-language embeddings as parameter-free semantic anchors that steer a ViT towards a modality-agnostic, language-aware space, enabling rich semantic transfer through text-vision alignment. Our training incorporates two synergistic loss functions. The Caption-Adaptive Triplet loss dynamically adjusts the triplet margin according to caption similarity, enforcing harder negatives when textual descriptions overlap and yielding stronger discrimination. The Caption-Aware CIM-T loss (Cross-Identity Inter-modal Margin with Text) simultaneously enlarges inter-identity gaps and contracts intra-identity distances across RGB-IR-TI views, guided by caption context to resolve ambiguous appearances. Our method attains 79.4 mAP and 97.5 Rank-5 accuracy (R-5) on the Market1501-MM dataset, improving on prior state-of-the-art approaches by +1.4 mAP and +0.7 R-5. Extensive experiments on MM-ReID demonstrate superior generalization and adaptability across unseen modalities and domains. Our approach establishes a new paradigm for modality-extensible and interpretable multi-modal ReID research.
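
To make the caption-adaptive margin idea concrete, the sketch below shows one plausible PyTorch formulation: the triplet margin grows with the cosine similarity of the anchor and negative caption embeddings, so identities with overlapping textual descriptions are pushed further apart. The function name, the parameters `base_margin` and `scale`, and the linear margin schedule are our own illustrative assumptions, not the paper's exact formulation.

```python
import torch
import torch.nn.functional as F

def caption_adaptive_triplet_loss(anchor, positive, negative,
                                  cap_anchor, cap_negative,
                                  base_margin=0.3, scale=0.2):
    """Illustrative caption-adaptive triplet loss (assumed design, not the paper's).

    anchor/positive/negative: image feature embeddings, shape (B, D).
    cap_anchor/cap_negative:  caption embeddings for the anchor and negative IDs.
    """
    # Caption similarity in [0, 1]; higher means textual descriptions overlap more.
    cap_sim = F.cosine_similarity(cap_anchor, cap_negative, dim=-1).clamp(min=0.0)
    # Enlarge the margin for textually similar (harder) negatives.
    margin = base_margin + scale * cap_sim
    d_pos = F.pairwise_distance(anchor, positive)
    d_neg = F.pairwise_distance(anchor, negative)
    return F.relu(d_pos - d_neg + margin).mean()

# Example usage with random embeddings (batch of 4, 256-dim features, 512-dim captions).
a, p, n = (torch.randn(4, 256) for _ in range(3))
ca, cn = torch.randn(4, 512), torch.randn(4, 512)
loss = caption_adaptive_triplet_loss(a, p, n, ca, cn)
```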
