Skeleton-based gait recognition has improved significantly with the advent of graph convolutional networks (GCNs). Nevertheless, the classical ST-GCN has a key drawback: its limited receptive field cannot capture global correlations among joints, restricting its ability to model long-range dependencies. To address this, we present GSCTN, a GCN and self-attention contemporary network with temporal convolution. The method combines a GCN with a self-attention (SA) mechanism through a learnable weighted fusion: the GCN contributes local joint details while self-attention supplies broader context, yielding a strong representation of skeletal movement. Our approach employs decoupled self-attention (DSA), which splits the tightly coupled (TiC) SA module into two learnable components, unary and pairwise SA, to model joint relationships separately. The unary SA captures the global relationship between a single key joint and all query joints, while the pairwise SA captures local gait features from each pair of body joints. We also introduce a Depthwise Multi-scale Temporal Convolutional Network (DMS-TCN) that captures the temporal dynamics of joint movements, efficiently handling both short-term and long-term motion patterns. To further strengthen the model's ability to fuse spatial and temporal joint features dynamically, we apply Global Aware Attention (GAA) within the GSCTN module. We evaluate our method on the OUMVLP-Pose, CASIA-B, and GREW datasets. The proposed method achieves strong accuracy on the widely used CASIA-B dataset: 97.9% for normal walking, 94.8% for carrying a bag, and 91.91% under clothing change. On OUMVLP-Pose and GREW it attains rank-1 accuracies of 93.5% and 75.7%, respectively. The experimental results demonstrate that the proposed model offers a holistic approach to gait recognition, using GCN, DSA, and GAA with DMS-TCN to capture both the temporal and spatial aspects of human locomotion.
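As a rough illustration of the fusion described above, the sketch below shows one possible way a GCN branch and a decoupled self-attention branch could be combined with a learnable weight. All module names, tensor shapes, the placeholder adjacency, and the single fusion scalar are assumptions for illustration only; this is a minimal sketch and does not reproduce the authors' implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class DecoupledSA(nn.Module):
    """Illustrative decoupled self-attention: a unary term (each key joint scored on
    its own, shared across all query joints) plus a pairwise query-key affinity term.
    Names and shapes are assumptions, not the paper's code."""
    def __init__(self, channels):
        super().__init__()
        self.q = nn.Linear(channels, channels)
        self.k = nn.Linear(channels, channels)
        self.v = nn.Linear(channels, channels)
        self.unary = nn.Linear(channels, 1)  # per-key-joint score, independent of the query

    def forward(self, x):  # x: (batch, joints, channels)
        q, k, v = self.q(x), self.k(x), self.v(x)
        pairwise = torch.matmul(q, k.transpose(-2, -1)) / (x.size(-1) ** 0.5)  # (B, J, J)
        unary = self.unary(k).transpose(-2, -1)        # (B, 1, J), broadcast over queries
        attn = F.softmax(pairwise + unary, dim=-1)     # recombine the two decoupled terms
        return torch.matmul(attn, v)

class GCNSAFusion(nn.Module):
    """Learnable weighted fusion of a (placeholder) GCN branch and the SA branch."""
    def __init__(self, channels, adjacency):
        super().__init__()
        self.register_buffer("A", adjacency)           # (joints, joints) normalized adjacency
        self.gcn = nn.Linear(channels, channels)
        self.sa = DecoupledSA(channels)
        self.alpha = nn.Parameter(torch.tensor(0.5))   # learnable fusion weight

    def forward(self, x):  # x: (batch, joints, channels)
        local = F.relu(self.gcn(torch.matmul(self.A, x)))  # local joint detail from the graph
        global_ctx = self.sa(x)                             # broader context from self-attention
        return self.alpha * local + (1.0 - self.alpha) * global_ctx

# Toy usage: 17 joints with 64-channel features and an identity adjacency as a stand-in.
A = torch.eye(17)
block = GCNSAFusion(64, A)
out = block(torch.randn(2, 17, 64))  # -> (2, 17, 64)
```

In this sketch a single scalar balances the two branches; a per-channel or per-joint weighting would be an equally plausible reading of "learnable weighted fusion".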
