Pub Date: 2024-08-31 | DOI: 10.1007/s10844-024-00880-x
Allegra De Filippo, Emanuele Di Giacomo, Andrea Borghesi
Predicting the execution time of weather forecast models is a complex task, since these models are usually executed on High Performance Computing (HPC) systems and require large computing capabilities. A reliable prediction brings several benefits: improved planning of the model execution, better allocation of available resources, and the identification of possible anomalies. However, making such predictions is usually hard, since there is a scarcity of datasets that benchmark existing meteorological simulation models. In this work, we focus on runtime prediction for the COSMO (COnsortium for SMall-scale MOdeling) weather forecasting model used at the Hydro-Meteo-Climate Structure of the Regional Agency for the Environment and Energy Prevention Emilia-Romagna. We show how a range of Machine Learning approaches can obtain accurate runtime predictions for this complex model, by designing a new, well-defined benchmark for this application task. Our contribution is twofold: 1) the creation of a large public dataset reporting the runtime of COSMO runs under a variety of different configurations; 2) a comparative study of ML models, which greatly outperform the current state of practice used by the domain experts. This data collection represents an essential initial benchmark for this application field and a useful resource for analyzing the model's performance: better accuracy in runtime predictions could help facility owners improve job scheduling and resource allocation across the entire system, while, for a final user, a posteriori analysis could help identify anomalous runs.
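The regression setup described in the abstract can be sketched as follows. This is an illustrative example, not the paper's pipeline: the job-configuration features (node count, forecast horizon, grid resolution) and the synthetic runtime formula are invented stand-ins for the COSMO dataset, and the mean-runtime baseline is a generic stand-in for the domain experts' state of practice.

```python
# Hypothetical sketch: predict simulation runtime from job configuration
# features and compare an ML regressor against a naive mean baseline.
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import train_test_split
from sklearn.metrics import mean_absolute_error

rng = np.random.default_rng(0)
n = 500
X = np.column_stack([
    rng.integers(1, 32, n),     # nodes allocated (invented feature)
    rng.integers(6, 73, n),     # forecast horizon in hours (invented feature)
    rng.uniform(1.0, 5.0, n),   # grid resolution in km (invented feature)
])
# Synthetic runtime: longer horizons and finer grids cost more; more nodes help.
y = 60 * X[:, 1] / (X[:, 2] * np.sqrt(X[:, 0])) + rng.normal(0, 20, n)

X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)
model = RandomForestRegressor(n_estimators=200, random_state=0).fit(X_tr, y_tr)

# Naive baseline: always predict the mean runtime observed in training.
baseline_mae = mean_absolute_error(y_te, np.full(len(y_te), y_tr.mean()))
model_mae = mean_absolute_error(y_te, model.predict(X_te))
print(f"baseline MAE: {baseline_mae:.1f}s  model MAE: {model_mae:.1f}s")
```

On data with this much configuration-driven signal, the learned regressor comfortably beats the constant baseline, which mirrors the comparison the paper reports against the experts' current practice.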
Title: Machine learning approaches to predict the execution time of the meteorological simulation software COSMO (Journal of Intelligent Information Systems, IF 3.4)
Pub Date: 2024-08-22 | DOI: 10.1007/s10844-024-00881-w
Shuxia Ren, Zewei Guo, Xiaohan Li, Ruikun Zhong
Aspect-Based Sentiment Triplet Extraction (ASTE), a critical sub-task of Aspect-Based Sentiment Analysis (ABSA), has received extensive attention in recent years. ASTE aims to extract structured sentiment triplets from texts, with most existing studies focusing on designing new strategic frameworks. Nonetheless, these methods often overlook the complex characteristics of linguistic expression and the deeper semantic nuances, leading to deficiencies in extracting the semantic representations of triplets and effectively utilizing syntactic relationships in texts. To address these challenges, this paper introduces a span-based semantic and syntactic Dual-Enhanced model that deeply integrates rich syntactic information, such as part-of-speech tagging, constituent syntax, and dependency syntax structures. Specifically, we designed a semantic encoder and a syntactic encoder to capture the semantic-syntactic information closely related to the sentence's underlying intent. Through a Feature Interaction Module, we effectively integrate information across different dimensions and promote a more comprehensive understanding of the relationships between aspects and opinions. We also adopted a span-based tagging scheme that generates more precise aspect sentiment triplet extractions by exploring cross-level information and constraints. Experimental results on benchmark datasets derived from the SemEval challenge prove that our model significantly outperforms existing baselines.
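The span-based scheme the abstract refers to can be illustrated with the usual scaffolding: enumerate candidate token spans up to a maximum width, then pair spans as potential (aspect, opinion) candidates for classification. This is a generic sketch of span-based ASTE, not the paper's specific model; the width limit and pairing strategy are illustrative assumptions.

```python
# Generic span-based ASTE scaffolding: span enumeration and candidate pairing.
from itertools import product

def enumerate_spans(tokens, max_width=3):
    """All (start, end) spans, end inclusive, of at most max_width tokens."""
    return [(i, j) for i in range(len(tokens))
            for j in range(i, min(i + max_width, len(tokens)))]

def candidate_pairs(tokens, max_width=3):
    """Cartesian product of spans; each pair is a potential (aspect, opinion)."""
    spans = enumerate_spans(tokens, max_width)
    return [(a, o) for a, o in product(spans, spans) if a != o]

tokens = "the battery life is great".split()
spans = enumerate_spans(tokens)
print(len(spans), "candidate spans,", len(candidate_pairs(tokens)), "pairs")
```

A real model scores every pair with learned span representations (here, the semantic and syntactic encoders) and assigns a sentiment label, so keeping the candidate set small via a width limit matters for efficiency.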
Title: Span-based semantic syntactic dual enhancement for aspect sentiment triplet extraction
Pub Date: 2024-08-14 | DOI: 10.1007/s10844-024-00875-8
Anti Alman, Alessio Arleo, Iris Beerepoot, Andrea Burattin, Claudio Di Ciccio, Manuel Resinas
Knowledge-intensive processes represent a particularly challenging scenario for process mining. The flexibility that such processes allow constitutes a hurdle, as they are hard to capture in a single model. To tackle this problem, multiple visual representations of the same processes could be beneficial, each addressing different information dimensions according to the specific needs and background knowledge of the process workers and stakeholders. In this paper, we propose, describe, and evaluate a framework, named Tiramisù, that leverages visual analytics for the interactive visualization of multi-faceted process information, aimed at supporting users' investigation and insight generation in their process analysis tasks. Tiramisù is based on a multi-layer visualization methodology that includes a visual backdrop providing context and an arbitrary number of superimposed, on-demand dimension layers. This arrangement allows our framework to display process information from different perspectives and to project this information onto a domain-friendly representation of the context in which the process unfolds. We provide an in-depth description of the approach's founding principles, deeply rooted in visualization research, that justify our design choices for the whole framework. We demonstrate the feasibility of the framework through its application in two use-case scenarios in the context of healthcare and personal information management. In addition, we conducted qualitative evaluations with potential end users in both scenarios, gathering valuable insights about the efficacy and applicability of our framework to various application domains.
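The "backdrop plus stacked, on-demand layers" arrangement can be captured in a small data structure. This is a toy sketch of the layering idea only; the class names, layer names, and the hospital floor-plan backdrop are invented examples, not Tiramisù's actual API.

```python
# Toy model of a layered visualization: a context backdrop plus toggleable
# dimension layers rendered bottom-up.
from dataclasses import dataclass, field

@dataclass
class Layer:
    name: str
    visible: bool = True

@dataclass
class LayeredView:
    backdrop: str                      # domain-friendly context, e.g. a floor plan
    layers: list[Layer] = field(default_factory=list)

    def toggle(self, name: str) -> None:
        """Show or hide a dimension layer on demand."""
        for layer in self.layers:
            if layer.name == name:
                layer.visible = not layer.visible

    def render_order(self) -> list[str]:
        """Backdrop first, then the currently visible layers, bottom-up."""
        return [self.backdrop] + [l.name for l in self.layers if l.visible]

view = LayeredView("hospital-floor-plan",
                   [Layer("patient-flows"), Layer("waiting-times")])
view.toggle("waiting-times")
print(view.render_order())
```

Keeping layers independent of the backdrop is what lets the same process information be projected onto different domain representations, as the framework proposes.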
Title: Tiramisù: making sense of multi-faceted process information through time and space
Pub Date: 2024-08-13 | DOI: 10.1007/s10844-024-00873-w
Gyunam Park, Lukas Liss, Wil M. P. van der Aalst
This paper presents a novel approach for generating actionable recommendations from educational event data collected by Campus Management Systems (CMS) to enhance study planning in higher education. The approach unfolds in three phases: feature identification tailored to the educational context, predictive modeling employing the RuleFit algorithm, and extracting actionable recommendations. We utilize diverse features, encompassing academic histories and course sequences, to capture the multi-dimensional nature of student academic behaviors. The effectiveness of our approach is empirically validated using data from the computer science bachelor’s program at RWTH Aachen University, with the goal of predicting overall GPA and formulating recommendations to enhance academic performance. Our contributions lie in the novel adaptation of behavioral features for the educational domain and the strategic use of the RuleFit algorithm for both predictive modeling and the generation of practical recommendations, offering a data-driven foundation for informed study planning and academic decision-making.
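The RuleFit idea the abstract relies on, deriving binary rule features from a tree ensemble and then fitting a sparse linear model so that each surviving rule reads as a recommendation, can be sketched briefly. The student features and GPA-like target below are invented for illustration; this approximates the RuleFit recipe with scikit-learn parts rather than reproducing the paper's implementation.

```python
# Sketch of the RuleFit recipe: tree-ensemble leaves -> binary rule features
# -> sparse linear model whose nonzero weights mark the "active" rules.
import numpy as np
from sklearn.ensemble import GradientBoostingRegressor
from sklearn.linear_model import Lasso

rng = np.random.default_rng(1)
n = 400
X = np.column_stack([
    rng.uniform(0, 30, n),   # credits passed in first year (invented feature)
    rng.uniform(0, 1, n),    # share of math courses taken early (invented)
])
y = 1.5 + 0.06 * X[:, 0] + 0.8 * X[:, 1] + rng.normal(0, 0.2, n)  # GPA-like target

# Step 1: grow shallow trees; membership in each leaf is a binary rule feature.
ensemble = GradientBoostingRegressor(n_estimators=20, max_depth=2,
                                     random_state=0).fit(X, y)
leaves = ensemble.apply(X)               # (n_samples, n_trees) leaf indices
rules = np.column_stack([(leaves[:, t] == leaf).astype(float)
                         for t in range(leaves.shape[1])
                         for leaf in np.unique(leaves[:, t])])

# Step 2: sparse linear fit over rule features; L1 prunes uninformative rules.
lasso = Lasso(alpha=0.01).fit(rules, y)
active = int(np.count_nonzero(lasso.coef_))
print("active rules:", active, "of", rules.shape[1])
```

Each nonzero coefficient corresponds to a readable condition on the original features (a root-to-leaf path), which is what makes the approach usable for actionable study-planning advice rather than just prediction.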
Title: Learning recommendations from educational event data in higher education
Pub Date: 2024-08-13 | DOI: 10.1007/s10844-024-00878-5
Teng Fu, Gang Zhou
Temporal graph entity alignment aims at finding equivalent entity pairs across different temporal knowledge graphs (TKGs). Existing methods primarily adopt time-aware and relationship-aware approaches to embed and align entities. However, long-tail entities in TKGs still restrict alignment accuracy: their limited neighborhood information hampers the construction of high-quality embeddings and hence the effectiveness of entity alignment in the representation space. Moreover, most previous research is supervised, with heavy dependence on seed labels for alignment, restricting applicability in scenarios with limited resources. To tackle these challenges, we propose Temporal Knowledge Completion enhanced Self-supervised Entity Alignment (TSEA). We argue that, with high-quality embeddings, entities can be aligned in a self-supervised manner. To this end, TSEA consists of two modules: a graph completion module that predicts missing links for long-tail entities, and a self-supervised entity alignment module that operates on the improved graph to achieve unsupervised alignment. Experimental results on widely adopted benchmarks demonstrate improved performance compared to several recent baseline methods. Additional ablation experiments further corroborate the efficacy of the proposed modules.
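The claim that high-quality embeddings make alignment possible without seed labels can be illustrated with the simplest possible alignment step: nearest-neighbor matching by cosine similarity. The embeddings below are random stand-ins, not trained TKG embeddings, and real self-supervised alignment adds refinements this sketch omits.

```python
# Toy alignment step: match each entity in graph A to its nearest neighbor
# in graph B by cosine similarity, with no supervision.
import numpy as np

def cosine_align(emb_a: np.ndarray, emb_b: np.ndarray) -> np.ndarray:
    a = emb_a / np.linalg.norm(emb_a, axis=1, keepdims=True)
    b = emb_b / np.linalg.norm(emb_b, axis=1, keepdims=True)
    sim = a @ b.T                      # pairwise cosine similarities
    return sim.argmax(axis=1)          # best match in B for each entity in A

rng = np.random.default_rng(0)
base = rng.normal(size=(10, 16))                         # "graph A" embeddings
noisy = base + rng.normal(scale=0.05, size=base.shape)   # same entities, perturbed
matches = cosine_align(base, noisy)
print(matches)
```

When the two embedding spaces agree (small perturbation), nearest-neighbor matching recovers the identity mapping; when long-tail entities have poor embeddings, their neighbors become unreliable, which is exactly the failure mode the completion module is meant to mitigate.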
Title: Temporal knowledge completion enhanced self-supervised entity alignment
Pub Date: 2024-08-07 | DOI: 10.1007/s10844-024-00870-z
Ahmed A. Ewees, Marwa A. Gaheen, Mohammed M. Alshahrani, Ahmed M. Anter, Fatma H. Ismail
This paper introduces MPAG, a new feature selection method aimed at overcoming the limitations of the conventional Marine Predators Algorithm (MPA). The MPA may experience stagnation and become trapped in local optima during optimization. To address this challenge, we propose a refined version of the MPA, termed MPAG, which incorporates the Local Escape Operator (LEO) from the gradient-based optimizer (GBO). By leveraging the LEO operator, MPAG enhances the exploration ability of the MPA, particularly during the initial one-third of iterations. This enhancement injects more diversity into populations, thereby improving search space discovery and mitigating the risk of premature convergence. The performance of MPAG is evaluated on 14 feature selection benchmark datasets, employing seven performance measures including fitness value, classification accuracy, and the number of selected features. Our findings indicate that MPAG outperforms other algorithms on 86% of the datasets, underscoring its capability to select the most relevant features across various datasets while maintaining stability. Additionally, MPAG is evaluated on two cybersecurity applications, specifically spam detection datasets, where it demonstrates superior performance across most performance measures compared to other methods.
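Wrapper-based feature selection methods of this kind optimize a fitness function that trades classification error against the number of selected features. The sketch below shows one common formulation of that fitness; the weighting alpha = 0.99 is a frequent convention in the metaheuristic feature-selection literature, not necessarily the paper's exact setting, and iris with a KNN classifier stands in for the benchmark datasets.

```python
# A common wrapper-selection fitness: lower is better, balancing cross-validated
# classification error against the fraction of features kept.
import numpy as np
from sklearn.datasets import load_iris
from sklearn.neighbors import KNeighborsClassifier
from sklearn.model_selection import cross_val_score

X, y = load_iris(return_X_y=True)

def fitness(mask: np.ndarray, alpha: float = 0.99) -> float:
    """alpha * CV error + (1 - alpha) * selected-feature ratio."""
    if not mask.any():
        return 1.0                               # selecting nothing is worst
    acc = cross_val_score(KNeighborsClassifier(), X[:, mask], y, cv=5).mean()
    return alpha * (1 - acc) + (1 - alpha) * mask.mean()

all_features = np.array([True, True, True, True])
petal_only = np.array([False, False, True, True])
print(f"all: {fitness(all_features):.4f}  petal only: {fitness(petal_only):.4f}")
```

A metaheuristic such as MPA (or the proposed MPAG) searches over binary masks like these, using operators such as LEO to escape local optima in that discrete search space.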
Title: Improved machine learning technique for feature reduction and its application in spam email detection
Entity relation extraction is a key technology for extracting structured information from unstructured text and serves as the foundation for building large-scale knowledge graphs. Current joint entity relation extraction methods primarily focus on improving the recognition of overlapping triplets to enhance the overall performance of the model. However, such models still face numerous challenges in managing intra-triplet and inter-triplet interactions, expanding the breadth of semantic encoding, and reducing information redundancy during the extraction process. These issues make it difficult to achieve satisfactory performance in both normal and overlapping triplet extraction. To address these challenges, this study proposes a comprehensive prediction network with multi-feature semantic fusion. We developed a semantic fusion module that integrates entity mask embedding sequences, which strengthen connections between entities, and context embedding sequences, which provide richer semantic information, to enhance inter-triplet interactions and expand semantic encoding. A parallel decoder then generates a set of triplets simultaneously, improving the interaction among them. Additionally, we utilize an entity mask sequence to finely prune these triplets, optimizing the final triplet set. Experimental results on the publicly available NYT and WebNLG datasets demonstrate that, with BERT as the encoder, our model outperforms baseline models in terms of accuracy and F1 score.
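The fusion of the two embedding views (entity-mask sequence and context sequence) can be sketched with a simple gated combination. This is an illustrative mechanism only: the gating formulation, dimensions, and random inputs below are assumptions for demonstration, not the paper's actual fusion module.

```python
# Sketch of fusing two per-token embedding views with a learned sigmoid gate.
import numpy as np

def fuse(mask_emb: np.ndarray, ctx_emb: np.ndarray,
         W: np.ndarray, b: np.ndarray) -> np.ndarray:
    """Per-token, per-dimension gate decides how much of each view to keep."""
    z = np.concatenate([mask_emb, ctx_emb], axis=-1) @ W + b
    gate = 1 / (1 + np.exp(-z))                  # sigmoid in (0, 1)
    return gate * mask_emb + (1 - gate) * ctx_emb

rng = np.random.default_rng(0)
seq_len, d = 6, 8
mask_emb = rng.normal(size=(seq_len, d))   # stand-in entity-mask embeddings
ctx_emb = rng.normal(size=(seq_len, d))    # stand-in context embeddings
W = rng.normal(size=(2 * d, d)) * 0.1      # stand-in learned projection
b = np.zeros(d)

fused = fuse(mask_emb, ctx_emb, W, b)
print(fused.shape)
```

The fused sequence keeps one vector per token, so it can feed a downstream decoder unchanged while carrying information from both views.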
Title: Joint entity and relation extraction with fusion of multi-feature semantics
Ting Wang, Wenjie Yang, Tao Wu, Chuan Yang, Jiaying Liang, Hongyang Wang, Jia Li, Dong Xiang, Zheng Zhou
Pub Date: 2024-08-01 | DOI: 10.1007/s10844-024-00871-y
Pub Date: 2024-07-30 | DOI: 10.1007/s10844-024-00869-6
Vitor A. Batista, Diogo S. M. Gomes, Alexandre Evsukoff
Question Answering is one of the most relevant areas in the field of Natural Language Processing, rapidly evolving with promising results due to the increasing availability of suitable datasets and the advent of new technologies, such as Generative Models. This article introduces SESAME, a Self-supervised framework for Extractive queStion Answering over docuMent collEctions. SESAME aims to enhance open-domain question answering (ODQA) systems by leveraging domain adaptation with synthetic datasets, enabling efficient question answering over private document collections with low resource usage. The framework incorporates recent advances in large language models and an efficient hybrid method for context retrieval. We conducted several sets of experiments with the Machine Reading for Question Answering (MRQA) 2019 Shared Task datasets, FAQuAD (a Brazilian Portuguese reading comprehension dataset), Wikipedia, and the Retrieval-Augmented Generation Benchmark to demonstrate SESAME's effectiveness. The results indicate that SESAME's domain adaptation using synthetic data significantly improves QA performance, generalizes across different domains and languages, and competes with or surpasses state-of-the-art systems in ODQA. Finally, SESAME is an open-source tool, and all code, datasets, and experimental data are available for public use in our repository.
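The hybrid context retrieval mentioned above is commonly realized by blending a lexical signal (e.g. BM25) with a dense-embedding signal per passage. The sketch below shows one standard way to combine them; the scores are made-up numbers and the equal interpolation weight is a common default, not SESAME's actual configuration.

```python
# Generic hybrid retrieval: min-max normalize each score signal, interpolate,
# and rank passages by the combined score.
import numpy as np

def hybrid_rank(lexical: np.ndarray, dense: np.ndarray, w: float = 0.5):
    """Return passage indices ranked best-first by the blended score."""
    def norm(s):
        return (s - s.min()) / (s.max() - s.min() + 1e-9)
    combined = w * norm(lexical) + (1 - w) * norm(dense)
    return np.argsort(-combined)

lexical = np.array([12.1, 3.4, 8.7, 0.5])   # per-passage lexical scores (invented)
dense = np.array([0.82, 0.91, 0.40, 0.15])  # per-passage cosine sims (invented)
ranking = hybrid_rank(lexical, dense)
print(ranking)
```

Normalizing before interpolation matters because lexical and dense scores live on different scales; without it, whichever signal has the larger range dominates the ranking.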
Title: SESAME - self-supervised framework for extractive question answering over document collections
Pub Date : 2024-07-25 DOI: 10.1007/s10844-024-00867-8
Camilla Sancricca, Giovanni Siracusa, Cinzia Cappiello
Data play a key role in AI systems that support decision-making processes. Data-centric AI highlights the importance of high-quality input data for obtaining reliable results. However, preparing data well for machine learning is becoming difficult due to the variety of data quality issues and of available data preparation tasks. For this reason, approaches that help users perform this demanding phase are needed. This work proposes DIANA, a framework for data-centric AI that supports data exploration and preparation, suggesting suitable cleaning tasks to obtain valuable analysis results. We design an adaptive self-service environment that can handle the analysis and preparation of different types of sources, i.e., tabular and streaming data. The central component of our framework is a knowledge base that collects evidence on the effectiveness of data preparation actions, along with the type of input data and the machine learning model considered. In this paper, we first describe the framework, the knowledge base model, and its enrichment process. Then, we present the experiments conducted to enrich the knowledge base in a particular case study: time series data streams.
{"title":"Enhancing data preparation: insights from a time series case study","authors":"Camilla Sancricca, Giovanni Siracusa, Cinzia Cappiello","doi":"10.1007/s10844-024-00867-8","DOIUrl":"https://doi.org/10.1007/s10844-024-00867-8","url":null,"abstract":"<p>Data play a key role in AI systems that support decision-making processes. Data-centric AI highlights the importance of having high-quality input data to obtain reliable results. However, well-preparing data for machine learning is becoming difficult due to the variety of data quality issues and available data preparation tasks. For this reason, approaches that help users in performing this demanding phase are needed. This work proposes DIANA, a framework for data-centric AI to support data exploration and preparation, suggesting suitable cleaning tasks to obtain valuable analysis results. We design an adaptive self-service environment that can handle the analysis and preparation of different types of sources, i.e., tabular, and streaming data. The central component of our framework is a knowledge base that collects evidence related to the effectiveness of the data preparation actions along with the type of input data and the considered machine learning model. In this paper, we first describe the framework, the knowledge base model, and its enrichment process. 
Then, we show the experiments conducted to enrich the knowledge base in a particular case study: time series data streams.</p>","PeriodicalId":56119,"journal":{"name":"Journal of Intelligent Information Systems","volume":"78 1","pages":""},"PeriodicalIF":3.4,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141777860","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
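The knowledge-base idea described in the DIANA abstract can be sketched as a toy evidence store: for each (data type, model) context it records the observed accuracy gain of each preparation action, then suggests the action with the best average gain. Class and method names below are hypothetical illustrations, not DIANA's implementation.

```python
from collections import defaultdict

class PrepKnowledgeBase:
    """Toy evidence store for data-preparation effectiveness.

    Maps a (data_type, model) context to a list of (action, accuracy_gain)
    observations and suggests the action with the highest mean gain.
    """

    def __init__(self):
        self.evidence = defaultdict(list)  # (data_type, model) -> [(action, gain)]

    def record(self, data_type: str, model: str, action: str, accuracy_gain: float):
        """Store one piece of evidence from a completed pipeline run."""
        self.evidence[(data_type, model)].append((action, accuracy_gain))

    def suggest(self, data_type: str, model: str):
        """Return the best-scoring preparation action, or None if no evidence."""
        runs = self.evidence.get((data_type, model), [])
        if not runs:
            return None
        gains = defaultdict(list)
        for action, gain in runs:
            gains[action].append(gain)
        return max(gains, key=lambda a: sum(gains[a]) / len(gains[a]))
```

The enrichment process described in the paper would correspond to repeated `record` calls over experimental runs (e.g., on time series streams), gradually sharpening the suggestions.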
Pub Date : 2024-07-10 DOI: 10.1007/s10844-024-00858-9
Yang Shi, Jinglang Cai, Lei Liao
The effectiveness of multimodal sentiment analysis (MSA) hinges on the seamless integration of information from diverse modalities, where the quality of modality fusion directly influences sentiment analysis accuracy. Prior methods often rely on intricate fusion strategies, raising computational costs and potentially yielding inaccurate multimodal representations due to distribution gaps and information redundancy across heterogeneous modalities. This paper centers on the backpropagation of loss and introduces a Transformer-based model called Multi-Task Learning and Mutual Information Maximization with Crossmodal Transformer (MMMT). To address inaccurate multimodal representations in MSA, MMMT combines mutual information maximization with a crossmodal Transformer to convey more modality-invariant information to the multimodal representation, fully exploiting modal commonalities. Notably, it uses multimodal labels for unimodal training, offering a fresh perspective on multi-task learning in MSA. Comparative experiments on the CMU-MOSI and CMU-MOSEI datasets demonstrate that MMMT improves accuracy while reducing computational burden, making it suitable for resource-constrained application scenarios that require real-time performance. Additionally, ablation experiments validate the efficacy of multi-task learning and probe the specific impact of combining mutual information maximization with the Transformer in MSA.
{"title":"Multi-task learning and mutual information maximization with crossmodal transformer for multimodal sentiment analysis","authors":"Yang Shi, Jinglang Cai, Lei Liao","doi":"10.1007/s10844-024-00858-9","DOIUrl":"https://doi.org/10.1007/s10844-024-00858-9","url":null,"abstract":"<p>The effectiveness of multimodal sentiment analysis hinges on the seamless integration of information from diverse modalities, where the quality of modality fusion directly influences sentiment analysis accuracy. Prior methods often rely on intricate fusion strategies, elevating computational costs and potentially yielding inaccurate multimodal representations due to distribution gaps and information redundancy across heterogeneous modalities. This paper centers on the backpropagation of loss and introduces a Transformer-based model called Multi-Task Learning and Mutual Information Maximization with Crossmodal Transformer (MMMT). Addressing the issue of inaccurate multimodal representation for MSA, MMMT effectively combines mutual information maximization with crossmodal Transformer to convey more modality-invariant information to multimodal representation, fully exploring modal commonalities. Notably, it utilizes multi-modal labels for uni-modal training, presenting a fresh perspective on multi-task learning in MSA. Comparative experiments on the CMU-MOSI and CMU-MOSEI datasets demonstrate that MMMT improves model accuracy while reducing computational burden, making it suitable for resource-constrained and real-time performance-requiring application scenarios. 
Additionally, ablation experiments validate the efficacy of multi-task learning and probe the specific impact of combining mutual information maximization with Transformer in MSA.</p>","PeriodicalId":56119,"journal":{"name":"Journal of Intelligent Information Systems","volume":"16 1","pages":""},"PeriodicalIF":3.4,"publicationDate":"2024-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141567460","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
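Mutual information maximization between paired modality embeddings is commonly realized with an InfoNCE-style contrastive objective, a lower bound on mutual information: each sample's embedding in one modality should score highest against its own pair in the other modality. The plain-Python sketch below illustrates this general kind of term; it is an assumption about the flavor of objective involved, not MMMT's actual loss formulation.

```python
import math

def cosine(u, v) -> float:
    """Cosine similarity between two embedding vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0

def info_nce(text_batch, audio_batch, temperature=0.1) -> float:
    """Mean InfoNCE loss over a batch of paired modality embeddings.

    For each text embedding i, the matching audio embedding i is the
    positive and all other audio embeddings are negatives. Lower loss
    corresponds to a higher mutual-information lower bound.
    """
    losses = []
    for i, t in enumerate(text_batch):
        logits = [cosine(t, a) / temperature for a in audio_batch]
        m = max(logits)  # log-sum-exp with max-shift for numerical stability
        log_denom = m + math.log(sum(math.exp(l - m) for l in logits))
        losses.append(-(logits[i] - log_denom))
    return sum(losses) / len(losses)
```

With perfectly aligned pairs the loss approaches zero, while mismatched pairs drive it up; in a training loop this term would be added to the task losses so that gradients pull the modality encoders toward shared, modality-invariant information.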