Information Systems最新文献_第2页

Hands-on analysis of using large language models for the auto evaluation of programming assignments 使用大型语言模型自动评估编程作业的实践分析

IF 3 2区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

Information Systems

Pub Date : 2024-10-15 DOI: 10.1016/j.is.2024.102473

Kareem Mohamed , Mina Yousef , Walaa Medhat , Ensaf Hussein Mohamed , Ghada Khoriba , Tamer Arafa

The increasing adoption of programming education necessitates efficient and accurate methods for evaluating students’ coding assignments. Traditional manual grading is time-consuming, often inconsistent, and prone to subjective biases. This paper explores the application of large language models (LLMs) for the automated evaluation of programming assignments. LLMs can use advanced natural language processing capabilities to assess code quality, functionality, and adherence to best practices, providing detailed feedback and grades. We demonstrate the effectiveness of LLMs through experiments comparing their performance with human evaluators across various programming tasks. Our study evaluates the performance of several LLMs for automated grading. Gemini 1.5 Pro achieves an exact match accuracy of 86% and a

\pm 1

accuracy of 98%. GPT-4o also demonstrates strong performance, with exact match and

\pm 1

accuracies of 69% and 97%, respectively. Both models correlate highly with human evaluations, indicating their potential for reliable automated grading. However, models such as Llama 3 70B and Mixtral 8

\times

7B exhibit low accuracy and alignment with human grading, particularly in problem-solving tasks. These findings suggest that advanced LLMs are instrumental in scalable, automated educational assessment. Additionally, LLMs enhance the learning experience by offering personalized, instant feedback, fostering an iterative learning process. The findings suggest that LLMs could play a pivotal role in the future of programming education, ensuring scalability and consistency in evaluation.

随着编程教育的日益普及，我们需要高效、准确的方法来评估学生的编码作业。传统的人工评分费时费力，往往不一致，而且容易产生主观偏见。本文探讨了大语言模型（LLM）在编程作业自动评估中的应用。LLM 可以使用先进的自然语言处理能力来评估代码质量、功能和是否符合最佳实践，并提供详细的反馈和评分。我们通过比较 LLM 与人类评估员在各种编程任务中的表现，证明了 LLM 的有效性。我们的研究评估了几种用于自动分级的 LLM 的性能。Gemini 1.5 Pro 的精确匹配准确率为 86%，±1 准确率为 98%。GPT-4o 也表现出强劲的性能，精确匹配准确率和 ±1 准确率分别为 69% 和 97%。这两个模型都与人类评估结果高度相关，表明它们具有可靠的自动分级潜力。然而，Llama 3 70B 和 Mixtral 8 × 7B 等模型的准确度和与人类分级的一致性较低，尤其是在解决问题的任务中。这些研究结果表明，先进的 LLM 在可扩展的自动教育评估中具有重要作用。此外，LLM 还能提供个性化的即时反馈，促进迭代学习过程，从而增强学习体验。研究结果表明，LLM 可在未来的编程教育中发挥关键作用，确保评估的可扩展性和一致性。

{"title":"Hands-on analysis of using large language models for the auto evaluation of programming assignments","authors":"Kareem Mohamed , Mina Yousef , Walaa Medhat , Ensaf Hussein Mohamed , Ghada Khoriba , Tamer Arafa","doi":"10.1016/j.is.2024.102473","DOIUrl":"10.1016/j.is.2024.102473","url":null,"abstract":"<div><div>The increasing adoption of programming education necessitates efficient and accurate methods for evaluating students’ coding assignments. Traditional manual grading is time-consuming, often inconsistent, and prone to subjective biases. This paper explores the application of large language models (LLMs) for the automated evaluation of programming assignments. LLMs can use advanced natural language processing capabilities to assess code quality, functionality, and adherence to best practices, providing detailed feedback and grades. We demonstrate the effectiveness of LLMs through experiments comparing their performance with human evaluators across various programming tasks. Our study evaluates the performance of several LLMs for automated grading. Gemini 1.5 Pro achieves an exact match accuracy of 86% and a <span><math><mrow><mo>±</mo><mn>1</mn></mrow></math></span> accuracy of 98%. GPT-4o also demonstrates strong performance, with exact match and <span><math><mrow><mo>±</mo><mn>1</mn></mrow></math></span> accuracies of 69% and 97%, respectively. Both models correlate highly with human evaluations, indicating their potential for reliable automated grading. However, models such as Llama 3 70B and Mixtral 8 <span><math><mo>×</mo></math></span> 7B exhibit low accuracy and alignment with human grading, particularly in problem-solving tasks. These findings suggest that advanced LLMs are instrumental in scalable, automated educational assessment. Additionally, LLMs enhance the learning experience by offering personalized, instant feedback, fostering an iterative learning process. The findings suggest that LLMs could play a pivotal role in the future of programming education, ensuring scalability and consistency in evaluation.</div></div>","PeriodicalId":50363,"journal":{"name":"Information Systems","volume":"128 ","pages":"Article 102473"},"PeriodicalIF":3.0,"publicationDate":"2024-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142529772","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Influence maximization based on discrete particle swarm optimization on multilayer network 基于离散粒子群优化的多层网络影响最大化

IF 3 2区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

Information Systems

Pub Date : 2024-10-09 DOI: 10.1016/j.is.2024.102466

Saiwei Wang , Wei Liu , Ling Chen , Shijie Zong

Influence maximization (IM) aims to strategically select influential users to maximize information propagation in social networks. Most of the existing studies focus on IM in single-layer networks. However, we have observed that individuals often engage in multiple social platforms to fulfill various social needs. To make better use of this observation, we consider an extended problem of how to maximize influence spread in multilayer networks. The Multilayer Influence Maximization (MLIM) problem is different from the IM problem because information propagation behaves differently in multilayer networks compared to single-layer networks: users influenced on one layer may trigger the propagation of information on another layer. Our work successfully models the information propagation process as a Multilayer Independent Cascade model in multilayer networks. Based on the characteristics of this model, we introduce an approximation function called Multilayer Expected Diffusion Value (MLEDV) for it. However, the NP-hardness of the MLIM problem has posed significant challenges to our work. To tackle the issue, we devise a novel algorithm based on Discrete Particle Swarm Optimization. Our algorithm consists of two stages: 1) the candidate node selection, where we devise a novel centrality metric called Random connectivity Centrality to select candidate nodes, which assesses the importance of nodes from a connectivity perspective. 2)the seed selection, where we employ a discrete particle swarm algorithm to find seed nodes from the candidate nodes. We use MLEDV as a fitness function to measure the spreading power of candidate solutions in our algorithm. Additionally, we introduce a Neighborhood Optimization strategy to increase the convergence of the algorithm. We conduct experiments on four real-world networks and two self-built networks and demonstrate that our algorithms are effective for the MLIM problem.

影响力最大化（IM）旨在战略性地选择有影响力的用户，以最大限度地扩大社交网络中的信息传播。现有研究大多关注单层网络中的 IM。然而，我们注意到，个人通常会参与多个社交平台，以满足各种社交需求。为了更好地利用这一观察结果，我们考虑了如何在多层网络中实现影响力传播最大化的扩展问题。多层影响力最大化（MLIM）问题与 IM 问题不同，因为信息传播在多层网络中的表现与单层网络不同：在一层受到影响的用户可能会引发信息在另一层的传播。我们的工作成功地将信息传播过程建模为多层网络中的多层独立级联模型。根据该模型的特点，我们为其引入了一个名为多层期望扩散值（MLEDV）的近似函数。然而，MLIM 问题的 NP 难度给我们的工作带来了巨大挑战。为了解决这个问题，我们设计了一种基于离散粒子群优化的新算法。我们的算法包括两个阶段：1) 候选节点选择，我们设计了一种名为 "随机连接中心性 "的新型中心性度量来选择候选节点，该度量从连接性角度评估节点的重要性。2）种子选择，我们采用离散粒子群算法从候选节点中寻找种子节点。在算法中，我们使用 MLEDV 作为适配函数来衡量候选方案的传播能力。此外，我们还引入了邻域优化策略，以提高算法的收敛性。我们在四个真实世界网络和两个自建网络上进行了实验，证明我们的算法对 MLIM 问题是有效的。

{"title":"Influence maximization based on discrete particle swarm optimization on multilayer network","authors":"Saiwei Wang , Wei Liu , Ling Chen , Shijie Zong","doi":"10.1016/j.is.2024.102466","DOIUrl":"10.1016/j.is.2024.102466","url":null,"abstract":"<div><div>Influence maximization (IM) aims to strategically select influential users to maximize information propagation in social networks. Most of the existing studies focus on IM in single-layer networks. However, we have observed that individuals often engage in multiple social platforms to fulfill various social needs. To make better use of this observation, we consider an extended problem of how to maximize influence spread in multilayer networks. The Multilayer Influence Maximization (MLIM) problem is different from the IM problem because information propagation behaves differently in multilayer networks compared to single-layer networks: users influenced on one layer may trigger the propagation of information on another layer. Our work successfully models the information propagation process as a Multilayer Independent Cascade model in multilayer networks. Based on the characteristics of this model, we introduce an approximation function called Multilayer Expected Diffusion Value (MLEDV) for it. However, the NP-hardness of the MLIM problem has posed significant challenges to our work. To tackle the issue, we devise a novel algorithm based on Discrete Particle Swarm Optimization. Our algorithm consists of two stages: 1) the candidate node selection, where we devise a novel centrality metric called Random connectivity Centrality to select candidate nodes, which assesses the importance of nodes from a connectivity perspective. 2)the seed selection, where we employ a discrete particle swarm algorithm to find seed nodes from the candidate nodes. We use MLEDV as a fitness function to measure the spreading power of candidate solutions in our algorithm. Additionally, we introduce a Neighborhood Optimization strategy to increase the convergence of the algorithm. We conduct experiments on four real-world networks and two self-built networks and demonstrate that our algorithms are effective for the MLIM problem.</div></div>","PeriodicalId":50363,"journal":{"name":"Information Systems","volume":"127 ","pages":"Article 102466"},"PeriodicalIF":3.0,"publicationDate":"2024-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142420321","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Runtime integration of machine learning and simulation for business processes: Time and decision mining predictions 业务流程中机器学习与模拟的运行时集成：时间和决策挖掘预测

IF 3 2区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

Information Systems

Pub Date : 2024-10-09 DOI: 10.1016/j.is.2024.102472

Francesca Meneghello , Chiara Di Francescomarino , Chiara Ghidini , Massimiliano Ronzani

Recent research in Computer Science has investigated the use of Deep Learning (DL) techniques to complement outcomes or decisions within a Discrete Event Simulation (DES) model. The main idea of this combination is to maintain a white box simulation model complement it with information provided by DL models to overcome the unrealistic or oversimplified assumptions of traditional DESs. State-of-the-art techniques in BPM combine Deep Learning and Discrete Event Simulation in a post-integration fashion: first an entire simulation is performed, and then a DL model is used to add waiting times and processing times to the events produced by the simulation model.

In this paper, we aim at taking a step further by introducing Rims (Runtime Integration of Machine Learning and Simulation). Instead of complementing the outcome of a complete simulation with the results of predictions a posteriori, Rims provides a tight integration of the predictions of the DL model at runtime during the simulation. This runtime-integration enables us to fully exploit the specific predictions while respecting simulation execution, thus enhancing the performance of the overall system both w.r.t. the single techniques (Business Process Simulation and DL) separately and the post-integration approach. In particular, the runtime integration ensures the accuracy of intercase features for time prediction, such as the number of ongoing traces at a given time, by calculating them during directly the simulation, where all traces are executed in parallel. Additionally, it allows for the incorporation of online queue information in the DL model and enables the integration of other predictive models into the simulator to enhance decision point management within the process model. These enhancements improve the performance of Rims in accurately simulating the real process in terms of control flow, as well as in terms of time and congestion dimensions. Especially in process scenarios with significant congestion – when a limited availability of resources leads to significant event queues for their allocation – the ability of Rims to use queue features to predict waiting times allows it to surpass the state-of-the-art. We evaluated our approach with real-world and synthetic event logs, using various metrics to assess the simulation model’s quality in terms of control-flow, time, and congestion dimensions.

计算机科学领域的最新研究调查了深度学习（DL）技术的使用情况，以补充离散事件仿真（DES）模型中的结果或决策。这种组合的主要理念是，利用深度学习模型提供的信息对白盒仿真模型进行补充，以克服传统 DES 不切实际或过于简化的假设。BPM 领域最先进的技术以一种后整合的方式将深度学习和离散事件仿真结合在一起：首先执行整个仿真，然后使用 DL 模型为仿真模型生成的事件添加等待时间和处理时间。Rims 不是用事后预测的结果来补充完整的仿真结果，而是在仿真过程中的运行时对 DL 模型的预测进行紧密集成。这种运行时集成使我们能够在尊重仿真执行的前提下充分利用特定的预测结果，从而提高整个系统的性能，无论是与单独的技术（业务流程仿真和 DL）相比，还是与后集成方法相比，都是如此。特别是，运行时集成确保了用于时间预测的案例间特征的准确性，如在给定时间内正在进行的跟踪数量，方法是在所有跟踪都并行执行的模拟过程中直接计算这些特征。此外，它还允许在 DL 模型中纳入在线队列信息，并允许将其他预测模型集成到模拟器中，以加强流程模型中的决策点管理。这些改进提高了 Rims 在控制流、时间和拥塞维度方面精确模拟实际流程的性能。特别是在拥堵严重的流程场景中，当有限的可用资源导致大量事件排队等待分配时，Rims 利用队列特征预测等待时间的能力使其超越了最先进的技术。我们利用真实世界和合成事件日志对我们的方法进行了评估，并使用各种指标来评估仿真模型在控制流、时间和拥塞方面的质量。

{"title":"Runtime integration of machine learning and simulation for business processes: Time and decision mining predictions","authors":"Francesca Meneghello , Chiara Di Francescomarino , Chiara Ghidini , Massimiliano Ronzani","doi":"10.1016/j.is.2024.102472","DOIUrl":"10.1016/j.is.2024.102472","url":null,"abstract":"<div><div>Recent research in Computer Science has investigated the use of Deep Learning (DL) techniques to complement outcomes or decisions within a Discrete Event Simulation (DES) model. The main idea of this combination is to maintain a white box simulation model complement it with information provided by DL models to overcome the unrealistic or oversimplified assumptions of traditional DESs. State-of-the-art techniques in BPM combine Deep Learning and Discrete Event Simulation in a post-integration fashion: first an entire simulation is performed, and then a DL model is used to add waiting times and processing times to the events produced by the simulation model.</div><div>In this paper, we aim at taking a step further by introducing <span>Rims</span> (Runtime Integration of Machine Learning and Simulation). Instead of complementing the outcome of a complete simulation with the results of predictions a posteriori, <span>Rims</span> provides a tight integration of the predictions of the DL model <em>at runtime</em> during the simulation. This runtime-integration enables us to fully exploit the specific predictions while respecting simulation execution, thus enhancing the performance of the overall system both w.r.t. the single techniques (Business Process Simulation and DL) separately and the post-integration approach. In particular, the runtime integration ensures the accuracy of intercase features for time prediction, such as the number of ongoing traces at a given time, by calculating them during directly the simulation, where all traces are executed in parallel. Additionally, it allows for the incorporation of online queue information in the DL model and enables the integration of other predictive models into the simulator to enhance decision point management within the process model. These enhancements improve the performance of <span>Rims</span> in accurately simulating the real process in terms of control flow, as well as in terms of time and congestion dimensions. Especially in process scenarios with significant congestion – when a limited availability of resources leads to significant event queues for their allocation – the ability of <span>Rims</span> to use queue features to predict waiting times allows it to surpass the state-of-the-art. We evaluated our approach with real-world and synthetic event logs, using various metrics to assess the simulation model’s quality in terms of control-flow, time, and congestion dimensions.</div></div>","PeriodicalId":50363,"journal":{"name":"Information Systems","volume":"128 ","pages":"Article 102472"},"PeriodicalIF":3.0,"publicationDate":"2024-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142442017","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Business process simulation: Probabilistic modeling of intermittent resource availability and multitasking behavior 业务流程模拟：间歇性资源可用性和多任务行为的概率建模

IF 3 2区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

Information Systems

Pub Date : 2024-10-09 DOI: 10.1016/j.is.2024.102471

Orlenys López-Pintado, Marlon Dumas

In business process simulation, resource availability is typically modeled by assigning a calendar to each resource, e.g., Monday–Friday, 9:00–18:00. Resources are assumed to be always available during each time slot in their availability calendar. This assumption often becomes invalid due to interruptions, breaks, or time-sharing across processes. In other words, existing approaches fail to capture intermittent availability. Another limitation of existing approaches is that they either do not consider multitasking behavior, or if they do, they assume that resources always multitask (up to a maximum capacity) whenever available. However, studies have shown that the multitasking patterns vary across days. This paper introduces a probabilistic approach to model resource availability and multitasking behavior for business process simulation. In this approach, each time slot in a resource calendar has an associated availability probability and a multitasking probability per multitasking level. For example, a resource may be available on Fridays between 14:00–15:00 with 90% probability, and given that they are performing one task during this slot, they may take on a second concurrent task with 60% probability. We propose algorithms to discover probabilistic calendars and probabilistic multitasking capacities from event logs. An evaluation shows that, with these enhancements, simulation models discovered from event logs better replicate the distribution of activities and cycle times, relative to approaches with crisp calendars and monotasking assumptions.

在业务流程模拟中，资源可用性通常是通过为每个资源分配一个日历来建模的，例如，周一至周五，9:00-18:00。假设资源在其可用性日历中的每个时间段内始终可用。由于中断、休息或跨流程分时，这一假设往往变得无效。换句话说，现有方法无法捕捉间歇性可用性。现有方法的另一个局限性在于，它们要么不考虑多任务处理行为，要么即使考虑了，也会假设资源在可用时总是多任务处理（达到最大容量）。然而，研究表明，多任务模式在不同的日子会有所不同。本文介绍了一种概率方法，用于为业务流程模拟中的资源可用性和多任务行为建模。在这种方法中，资源日历中的每个时间段都有相关的可用性概率和每个多任务级别的多任务概率。例如，资源在周五 14:00-15:00 之间可用的概率为 90%，考虑到他们在此时间段内正在执行一项任务，他们可能会以 60% 的概率同时执行第二项任务。我们提出了从事件日志中发现概率日历和概率多任务容量的算法。评估结果表明，与采用清晰日历和单任务假设的方法相比，通过这些增强功能从事件日志中发现的仿真模型能更好地复制活动和周期时间的分布。

{"title":"Business process simulation: Probabilistic modeling of intermittent resource availability and multitasking behavior","authors":"Orlenys López-Pintado, Marlon Dumas","doi":"10.1016/j.is.2024.102471","DOIUrl":"10.1016/j.is.2024.102471","url":null,"abstract":"<div><div>In business process simulation, resource availability is typically modeled by assigning a calendar to each resource, e.g., Monday–Friday, 9:00–18:00. Resources are assumed to be always available during each time slot in their availability calendar. This assumption often becomes invalid due to interruptions, breaks, or time-sharing across processes. In other words, existing approaches fail to capture intermittent availability. Another limitation of existing approaches is that they either do not consider multitasking behavior, or if they do, they assume that resources always multitask (up to a maximum capacity) whenever available. However, studies have shown that the multitasking patterns vary across days. This paper introduces a probabilistic approach to model resource availability and multitasking behavior for business process simulation. In this approach, each time slot in a resource calendar has an associated availability probability and a multitasking probability per multitasking level. For example, a resource may be available on Fridays between 14:00–15:00 with 90% probability, and given that they are performing one task during this slot, they may take on a second concurrent task with 60% probability. We propose algorithms to discover probabilistic calendars and probabilistic multitasking capacities from event logs. An evaluation shows that, with these enhancements, simulation models discovered from event logs better replicate the distribution of activities and cycle times, relative to approaches with crisp calendars and monotasking assumptions.</div></div>","PeriodicalId":50363,"journal":{"name":"Information Systems","volume":"127 ","pages":"Article 102471"},"PeriodicalIF":3.0,"publicationDate":"2024-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142432610","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Modeling higher-order social influence using multi-head graph attention autoencoder 利用多头图注意力自动编码器建立高阶社会影响力模型

IF 3 2区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

Information Systems

Pub Date : 2024-10-05 DOI: 10.1016/j.is.2024.102474

Elnaz Meydani , Christoph Duesing , Matthias Trier

Recommender systems are powerful tools developed to mitigate information overload in e-commerce platforms. Social recommender systems leverage social relations among users to predict their preferences. Recently, graph neural networks have been utilized for social recommendations, modeling user-user social relations and user–item interactions as graph-structured data. Despite their improvement over traditional systems, most existing social recommender systems exploit only first-order social relations and overlook the importance of social influence diffusion from higher-order neighbors in social networks. Additionally, these techniques often treat all neighboring nodes equally, without highlighting the most influential ones. To address these challenges, we introduce GATE-SR, a novel model that leverages a multi-head graph attention autoencoder to capture indirect social influence from higher-order neighbors while emphasizing the most relevant users. Moreover, we incorporate implicit social connections derived from coherent communities within the network. While GATE-SR performs comparably to baseline models in rich data environments, its strength lies in excelling at cold-start scenarios—where other models often fall short. This focus on cold-start performance aligns with our goal of building a robust recommender system for real-world challenges. Through extensive experiments on three real-world datasets, we demonstrate that GATE-SR outperforms several state-of-the-art baselines in cold-start scenarios. These results highlight the crucial role of accentuating the most influential neighbors, both explicit and implicit, when modeling higher-order social connections for more accurate recommendations.

推荐系统是为减轻电子商务平台信息过载而开发的强大工具。社交推荐系统利用用户之间的社交关系来预测他们的偏好。最近，图神经网络被用于社交推荐，将用户与用户之间的社交关系和用户与物品之间的互动建模为图结构数据。尽管与传统系统相比有了改进，但大多数现有的社交推荐系统只利用了一阶社交关系，而忽视了社交网络中来自高阶邻居的社交影响扩散的重要性。此外，这些技术往往对所有邻接节点一视同仁，而没有突出最有影响力的节点。为了应对这些挑战，我们引入了 GATE-SR，这是一种新型模型，它利用多头图注意力自动编码器捕捉来自高阶邻居的间接社会影响，同时强调最相关的用户。此外，我们还纳入了来自网络内连贯社区的隐式社交联系。虽然 GATE-SR 在丰富数据环境中的表现与基线模型不相上下，但它的优势在于在冷启动场景中表现出色--而其他模型往往在这种场景中表现不佳。对冷启动性能的关注与我们的目标一致，即为现实世界的挑战建立一个强大的推荐系统。通过在三个真实世界数据集上的广泛实验，我们证明了 GATE-SR 在冷启动场景中的表现优于几个最先进的基线模型。这些结果凸显了在为更准确的推荐建立高阶社交关系模型时，突出最有影响力的邻居（包括显性和隐性邻居）的关键作用。

{"title":"Modeling higher-order social influence using multi-head graph attention autoencoder","authors":"Elnaz Meydani , Christoph Duesing , Matthias Trier","doi":"10.1016/j.is.2024.102474","DOIUrl":"10.1016/j.is.2024.102474","url":null,"abstract":"<div><div>Recommender systems are powerful tools developed to mitigate information overload in e-commerce platforms. Social recommender systems leverage social relations among users to predict their preferences. Recently, graph neural networks have been utilized for social recommendations, modeling user-user social relations and user–item interactions as graph-structured data. Despite their improvement over traditional systems, most existing social recommender systems exploit only first-order social relations and overlook the importance of social influence diffusion from higher-order neighbors in social networks. Additionally, these techniques often treat all neighboring nodes equally, without highlighting the most influential ones. To address these challenges, we introduce GATE-SR, a novel model that leverages a multi-head graph attention autoencoder to capture indirect social influence from higher-order neighbors while emphasizing the most relevant users. Moreover, we incorporate implicit social connections derived from coherent communities within the network. While GATE-SR performs comparably to baseline models in rich data environments, its strength lies in excelling at cold-start scenarios—where other models often fall short. This focus on cold-start performance aligns with our goal of building a robust recommender system for real-world challenges. Through extensive experiments on three real-world datasets, we demonstrate that GATE-SR outperforms several state-of-the-art baselines in cold-start scenarios. These results highlight the crucial role of accentuating the most influential neighbors, both explicit and implicit, when modeling higher-order social connections for more accurate recommendations.</div></div>","PeriodicalId":50363,"journal":{"name":"Information Systems","volume":"128 ","pages":"Article 102474"},"PeriodicalIF":3.0,"publicationDate":"2024-10-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142445834","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Exploiting explicit item–item correlations from knowledge graphs for enhanced sequential recommendation 利用知识图谱中明确的项目与项目之间的相关性来增强顺序推荐功能

IF 3 2区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

Information Systems

Pub Date : 2024-10-05 DOI: 10.1016/j.is.2024.102470

Yanlin Zhang , Yuchen Shi , Deqing Yang , Xiaodong Gu

In recent years, the research of employing knowledge graphs (KGs) in sequential recommendation (SR) has received a lot of attention, since the side information extracted from KGs, especially the information of the correlations between items, indeed helps the SR models achieve better performance. However, many previous KG-based SR models tend to introduce some noise information when learning item embeddings, or insufficiently fuse item–item correlations into their sequential modeling, thus limiting their performance improvements. In this paper, we propose a Distance-Aware Knowledge-based Sequential Recommendation model (DAKSR), which exploits the explicit item–item correlations from KGs to achieve enhanced SR. Specifically, as one critical component in our DAKSR, the distance score matrix (DSM) is first obtained to indicate the correlations between items, and then leveraged in the following three major modules of DAKSR. First, in the Item-Set Embedding layer (ISE) all item embeddings are learned based on DSM, in which the noise information is eliminated effectively. Meanwhile, the Knowledge-Infused Transformer (KIT) incorporates DSM into its attention mechanism to improve the feature extraction. Furthermore, the Knowledge Contrastive Learning module (KCL) also leverages the item–item correlations presented in DSM to generate two credible sequence views, which are used to refine sample representations through a contrastive learning strategy, and thus improve the model’s robustness. Our extensive experiments on three SR benchmarks obviously demonstrate our DAKSR’s superior performance over the state-of-the-art (SOTA) KG-based recommendation models. The implementation of our DAKSR is available at https://github.com/Easonsi/DAKSR for reproducing our experiment results conveniently.

近年来，知识图谱（KG）在序列推荐（SR）中的应用研究受到了广泛关注，因为从知识图谱中提取的侧信息，尤其是条目间的相关性信息，确实有助于序列推荐模型获得更好的性能。然而，以往许多基于 KG 的 SR 模型在学习条目嵌入时往往会引入一些噪声信息，或者在建立序列模型时没有充分融合条目与条目之间的相关性，从而限制了其性能的提高。在本文中，我们提出了一种距离感知的基于知识的序列推荐模型（DAKSR），该模型利用基于项目嵌入的显式项目-项目相关性来实现增强的序列推荐。具体来说，作为我们的 DAKSR 的一个重要组成部分，距离得分矩阵（DSM）首先用来表示项目之间的相关性，然后在 DAKSR 的以下三个主要模块中加以利用。首先，在项目集嵌入层（ISE）中，所有项目嵌入都是基于 DSM 学习的，其中有效地消除了噪声信息。同时，知识注入转换器（KIT）将 DSM 纳入其注意机制，以改进特征提取。此外，知识对比学习模块（KCL）还利用 DSM 中的项目-项目相关性生成两个可信的序列视图，通过对比学习策略来完善样本表示，从而提高模型的鲁棒性。我们在三个推荐基准上进行的大量实验清楚地证明了我们的 DAKSR 比基于 KG 的最先进（SOTA）推荐模型具有更优越的性能。我们的 DAKSR 的实现方法可在 https://github.com/Easonsi/DAKSR 上获取，以便于重现我们的实验结果。

{"title":"Exploiting explicit item–item correlations from knowledge graphs for enhanced sequential recommendation","authors":"Yanlin Zhang , Yuchen Shi , Deqing Yang , Xiaodong Gu","doi":"10.1016/j.is.2024.102470","DOIUrl":"10.1016/j.is.2024.102470","url":null,"abstract":"<div><div>In recent years, the research of employing knowledge graphs (KGs) in sequential recommendation (SR) has received a lot of attention, since the side information extracted from KGs, especially the information of the correlations between items, indeed helps the SR models achieve better performance. However, many previous KG-based SR models tend to introduce some noise information when learning item embeddings, or insufficiently fuse item–item correlations into their sequential modeling, thus limiting their performance improvements. In this paper, we propose a <strong>D</strong>istance-<strong>A</strong>ware <strong>K</strong>nowledge-based <strong>S</strong>equential <strong>R</strong>ecommendation model (<strong>DAKSR</strong>), which exploits the explicit item–item correlations from KGs to achieve enhanced SR. Specifically, as one critical component in our DAKSR, the <em>distance score matrix</em> (DSM) is first obtained to indicate the correlations between items, and then leveraged in the following three major modules of DAKSR. First, in the Item-Set Embedding layer (ISE) all item embeddings are learned based on DSM, in which the noise information is eliminated effectively. Meanwhile, the Knowledge-Infused Transformer (KIT) incorporates DSM into its attention mechanism to improve the feature extraction. Furthermore, the Knowledge Contrastive Learning module (KCL) also leverages the item–item correlations presented in DSM to generate two credible sequence views, which are used to refine sample representations through a contrastive learning strategy, and thus improve the model’s robustness. Our extensive experiments on three SR benchmarks obviously demonstrate our DAKSR’s superior performance over the state-of-the-art (SOTA) KG-based recommendation models. The implementation of our DAKSR is available at <span><span>https://github.com/Easonsi/DAKSR</span><svg><path></path></svg></span> for reproducing our experiment results conveniently.</div></div>","PeriodicalId":50363,"journal":{"name":"Information Systems","volume":"128 ","pages":"Article 102470"},"PeriodicalIF":3.0,"publicationDate":"2024-10-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142529775","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Advances on data management systems 数据管理系统的进步

IF 3 2区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

Information Systems

Pub Date : 2024-10-03 DOI: 10.1016/j.is.2024.102467

Ladjel Bellatreche, Marlon Dumas, Panagiotis Karras, Raimundas Matulevičius, Silvia Chiusano, Tania Cerquitelli, Robert Wrembel

引用次数: 0

Special Issue of CAiSE 2023 Best Papers CAiSE 2023 最佳论文特刊

IF 3 2区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

Information Systems

Pub Date : 2024-10-01 DOI: 10.1016/j.is.2024.102469

Iris Reinhartz-Berger , Marta Indulska

引用次数: 0

Finding meaningful paths in heterogeneous graphs with PathWays 利用 PathWays 在异构图中查找有意义的路径

IF 3 2区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

Information Systems

Pub Date : 2024-09-30 DOI: 10.1016/j.is.2024.102463

Nelly Barret , Antoine Gauquier , Jia-Jean Law , Ioana Manolescu

Graphs, and notably RDF graphs, are a prominent way of sharing data. As data usage democratizes, users need help figuring out the useful content of a graph dataset. In particular, journalists with whom we collaborate are interested in identifying, in a graph, the connections between entities, e.g., people, organizations, emails, etc. We present a novel method for exploring data graphs through their data paths connecting Named Entities (NEs, in short); each data path leads to a tabular-looking set of results. NEs are extracted from the data through dedicated Information Extraction modules. Our method builds upon the pre-existing ConnectionLens platform and follow-up work in the Abstra project, which builds simple, visual ER-style summaries of semi-structured data. The contribution of the present work, and its novelty, is twofold. First, we propose a novel analysis of entity-to-entity paths contained in datasets of any nature, and propose a new method for ranking paths, leveraging a novel Information Extraction (IE) module we built on top of ChatGPT. Second, we present an efficient approach to enumerate and compute NE paths, based on an algorithm which automatically recommends sub-paths to materialize, and rewrites the path queries using these subpaths. Our experiments demonstrate the interest of NE paths and the efficiency of our method for computing and ranking them.

图形，尤其是 RDF 图形，是一种重要的数据共享方式。随着数据使用的民主化，用户需要有人帮助他们找出图表数据集的有用内容。特别是与我们合作的记者，他们对在图中识别实体（如人、组织、电子邮件等）之间的联系很感兴趣。我们提出了一种通过连接命名实体（Named Entities，简称 NEs）的数据路径来探索数据图的新方法；每条数据路径都会产生一组表格形式的结果。通过专用的信息提取模块从数据中提取 NE。我们的方法建立在已有的 ConnectionLens 平台和 Abstra 项目的后续工作基础之上，后者可为半结构化数据建立简单、可视化的 ER 风格摘要。本工作的贡献及其新颖性体现在两个方面。首先，我们对任何性质的数据集中包含的实体到实体路径提出了一种新的分析方法，并利用我们在 ChatGPT 基础上构建的新颖信息提取（IE）模块，提出了一种新的路径排序方法。其次，我们提出了一种枚举和计算近义词路径的高效方法，该方法基于一种自动推荐子路径并使用这些子路径重写路径查询的算法。我们的实验证明了近邻路径的重要性以及我们计算和排列近邻路径的方法的效率。

{"title":"Finding meaningful paths in heterogeneous graphs with PathWays","authors":"Nelly Barret , Antoine Gauquier , Jia-Jean Law , Ioana Manolescu","doi":"10.1016/j.is.2024.102463","DOIUrl":"10.1016/j.is.2024.102463","url":null,"abstract":"<div><div>Graphs, and notably RDF graphs, are a prominent way of sharing data. As data usage democratizes, users need help figuring out the useful content of a graph dataset. In particular, journalists with whom we collaborate are interested in identifying, in a graph, the <em>connections between entities</em>, e.g., people, organizations, emails, etc. We present a novel method for exploring data graphs through <em>their data paths connecting Named Entities</em> (NEs, in short); each data path leads to a tabular-looking set of results. NEs are extracted from the data through dedicated Information Extraction modules. Our method builds upon the pre-existing ConnectionLens platform and follow-up work in the Abstra project, which builds simple, visual ER-style summaries of semi-structured data. The contribution of the present work, and its novelty, is twofold. First, we propose a novel analysis of entity-to-entity paths contained in datasets of any nature, and propose a new method for ranking paths, leveraging a novel Information Extraction (IE) module we built on top of ChatGPT. Second, we present an efficient approach to enumerate and compute NE paths, based on an algorithm which automatically recommends sub-paths to materialize, and rewrites the path queries using these subpaths. Our experiments demonstrate the interest of NE paths and the efficiency of our method for computing and ranking them.</div></div>","PeriodicalId":50363,"journal":{"name":"Information Systems","volume":"127 ","pages":"Article 102463"},"PeriodicalIF":3.0,"publicationDate":"2024-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142420464","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Using AI explainable models and handwriting/drawing tasks for psychological well-being 利用人工智能可解释模型和手写/绘画任务促进心理健康

IF 3 2区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

Information Systems

Pub Date : 2024-09-28 DOI: 10.1016/j.is.2024.102465

Francesco Prinzi , Pietro Barbiero , Claudia Greco , Terry Amorese , Gennaro Cordasco , Pietro Liò , Salvatore Vitabile , Anna Esposito

This study addresses the increasing threat to Psychological Well-Being (PWB) posed by Depression, Anxiety, and Stress conditions. Machine learning methods have shown promising results for several psychological conditions. However, the lack of transparency in existing models impedes practical application. The study aims to develop explainable machine learning models for depression, anxiety and stress prediction, focusing on features extracted from tasks involving handwriting and drawing.

Two hundred patients completed the Depression, Anxiety, and Stress Scale (DASS-21) and performed seven tasks related to handwriting and drawing. Extracted features, encompassing pressure, stroke pattern, time, space, and pen inclination, were used to train the explainable-by-design Entropy-based Logic Explained Network (e-LEN) model, employing first-order logic rules for explanation. Performance comparison was performed with XGBoost, enhanced by the SHAP explanation method.

The trained models achieved notable accuracy in predicting depression (0.749 ±0.089), anxiety (0.721 ±0.088), and stress (0.761 ±0.086) through 10-fold cross-validation (repeated 20 times). The e-LEN model’s logic rules facilitated clinical validation, uncovering correlations with existing clinical literature. While performance remained consistent for depression and anxiety on an independent test dataset, a slight degradation was observed for stress prediction in the test task.

本研究探讨了抑郁、焦虑和压力对心理健康（PWB）造成的日益严重的威胁。机器学习方法已在几种心理状况方面取得了可喜的成果。然而，现有模型缺乏透明度，妨碍了实际应用。这项研究旨在开发用于预测抑郁、焦虑和压力的可解释机器学习模型，重点是从涉及手写和绘画的任务中提取的特征。200 名患者完成了抑郁、焦虑和压力量表（DASS-21），并完成了七项与手写和绘画有关的任务。提取的特征包括压力、笔画模式、时间、空间和笔的倾斜度，用于训练基于熵的可解释逻辑解释网络（e-LEN）模型，该模型采用一阶逻辑规则进行解释。通过 10 倍交叉验证（重复 20 次），训练出的模型在预测抑郁（0.749 ±0.089 ）、焦虑（0.721 ±0.088 ）和压力（0.761 ±0.086 ）方面取得了显著的准确性。e-LEN 模型的逻辑规则促进了临床验证，发现了与现有临床文献的相关性。在独立的测试数据集上，抑郁和焦虑的表现保持一致，但在测试任务中，压力预测的表现略有下降。

{"title":"Using AI explainable models and handwriting/drawing tasks for psychological well-being","authors":"Francesco Prinzi , Pietro Barbiero , Claudia Greco , Terry Amorese , Gennaro Cordasco , Pietro Liò , Salvatore Vitabile , Anna Esposito","doi":"10.1016/j.is.2024.102465","DOIUrl":"10.1016/j.is.2024.102465","url":null,"abstract":"<div><div>This study addresses the increasing threat to Psychological Well-Being (PWB) posed by Depression, Anxiety, and Stress conditions. Machine learning methods have shown promising results for several psychological conditions. However, the lack of transparency in existing models impedes practical application. The study aims to develop explainable machine learning models for depression, anxiety and stress prediction, focusing on features extracted from tasks involving handwriting and drawing.</div><div>Two hundred patients completed the Depression, Anxiety, and Stress Scale (DASS-21) and performed seven tasks related to handwriting and drawing. Extracted features, encompassing pressure, stroke pattern, time, space, and pen inclination, were used to train the explainable-by-design Entropy-based Logic Explained Network (e-LEN) model, employing first-order logic rules for explanation. Performance comparison was performed with XGBoost, enhanced by the SHAP explanation method.</div><div>The trained models achieved notable accuracy in predicting depression (0.749 ±0.089), anxiety (0.721 ±0.088), and stress (0.761 ±0.086) through 10-fold cross-validation (repeated 20 times). The e-LEN model’s logic rules facilitated clinical validation, uncovering correlations with existing clinical literature. While performance remained consistent for depression and anxiety on an independent test dataset, a slight degradation was observed for stress prediction in the test task.</div></div>","PeriodicalId":50363,"journal":{"name":"Information Systems","volume":"127 ","pages":"Article 102465"},"PeriodicalIF":3.0,"publicationDate":"2024-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142420465","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0