Multiple kernel k-means clustering (MKKC) can efficiently incorporate multiple base kernels to generate an optimal kernel. Most existing MKKC methods require a two-step procedure: learning a clustering indicator matrix and then performing clustering on it. However, the optimal clustering result of this two-step procedure is not equivalent to that of the original problem. To address this issue, we propose a novel method named one-step multiple kernel k-means clustering based on block diagonal representation (OS-MKKC-BD). By imposing a block diagonal constraint on the product of the indicator matrix and its transpose, the method encourages the indicator matrix to be block diagonal. The indicator matrix then yields an explicit clustering indicator, enabling one-step clustering and avoiding the drawback of the two-step procedure. Furthermore, a simple kernel weighting strategy is used to obtain the optimal kernel, which boosts its quality. In addition, a three-step iterative algorithm is designed to solve the corresponding optimization problem, where the Riemannian conjugate gradient method is used to solve the subproblem for the indicator matrix. Finally, extensive experiments on eleven real datasets and comparisons with ten MKC methods show that OS-MKKC-BD is effective.
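To make the kernel-combination idea concrete, here is a minimal sketch, not the authors' OS-MKKC-BD algorithm: base kernels are summed with weights, and the standard spectral relaxation of kernel k-means recovers a relaxed indicator matrix H. OS-MKKC-BD additionally constrains H H^T to be block diagonal so that H yields cluster labels directly, without a second clustering step. All function names and the toy data below are illustrative.

import numpy as np

def combine_kernels(kernels, weights):
    # Weighted sum of base kernel matrices: K = sum_m w_m * K_m
    return sum(w * K for w, K in zip(weights, kernels))

def kernel_kmeans_relaxation(K, k):
    # Top-k eigenvectors of K give a relaxed indicator H with H^T H = I;
    # OS-MKKC-BD instead pushes H H^T toward block diagonal form so that
    # H itself encodes explicit cluster membership.
    vals, vecs = np.linalg.eigh(K)
    return vecs[:, -k:]              # n x k relaxed indicator matrix

# Toy usage: two RBF kernels on random data, three clusters
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 5))
D2 = ((X[:, None] - X[None]) ** 2).sum(-1)
kernels = [np.exp(-D2 / (2 * s ** 2)) for s in (0.5, 2.0)]
H = kernel_kmeans_relaxation(combine_kernels(kernels, [0.6, 0.4]), k=3)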
Multivariate time series are complex and high-dimensional, which makes accurate analysis and prediction difficult. In this paper, a new multivariate time series prediction method is proposed: a generative adversarial network (GAN) based on the Fourier transform and a bi-directional gated recurrent unit (Bi-GRU). First, the Fourier transform is used to extend the data features, which helps the GAN learn the distributional features of the original data. Second, to guide the model to fully learn the distribution of the original time series, a Bi-GRU is introduced as the generator of the GAN. Third, to mitigate the mode collapse and vanishing gradients that afflict GAN training, the Wasserstein distance is used as the loss function. Finally, the proposed method is applied to the prediction of air quality, stock prices, and the RMB exchange rate. The experimental results show that, compared with nine baseline models, the model effectively predicts the trend of the time series. It significantly improves the accuracy and flexibility of multivariate time series forecasting and offers new ideas and methods for accurate forecasting in the industrial, financial, and environmental fields.
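As a hedged illustration of the components named above (our own minimal shapes and names, not the authors' code), one could extend the inputs with Fourier magnitudes, use a Bi-GRU generator, and train the critic with the Wasserstein loss:

import numpy as np
import torch
import torch.nn as nn

def fourier_extend(x):
    # x: (batch, time, features); append |rFFT| along time, zero-padded
    # back to the original length, as extra feature channels
    spec = np.abs(np.fft.rfft(x, axis=1))
    pad = np.zeros_like(x)
    pad[:, :spec.shape[1], :] = spec
    return np.concatenate([x, pad], axis=2)

class BiGRUGenerator(nn.Module):
    def __init__(self, in_dim, hidden, out_dim):
        super().__init__()
        self.gru = nn.GRU(in_dim, hidden, batch_first=True, bidirectional=True)
        self.head = nn.Linear(2 * hidden, out_dim)  # 2x: forward + backward

    def forward(self, x):            # x: (batch, time, in_dim)
        h, _ = self.gru(x)
        return self.head(h[:, -1])   # predict the next time step

def critic_loss(d_real, d_fake):     # Wasserstein (WGAN) critic objective
    return d_fake.mean() - d_real.mean()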
This paper presents a novel implementation of the Coral Reef Optimization with Substrate Layers (CRO-SL) algorithm. Our approach, which we call TensorCRO, takes advantage of the TensorFlow framework to represent CRO-SL as a series of tensor operations, allowing it to run on GPU and search for solutions faster and more efficiently. We evaluate the performance of the proposed implementation across a wide range of benchmark functions commonly used in optimization research (such as the Rastrigin, Rosenbrock, Ackley, and Griewank functions) and show that GPU execution yields considerable speedups over the CPU counterpart. When comparing TensorCRO to other state-of-the-art optimization algorithms (such as the Genetic Algorithm, Simulated Annealing, and Particle Swarm Optimization), the results show that TensorCRO achieves better convergence rates and solutions within a fixed execution time, provided that the fitness functions are also implemented in TensorFlow. Furthermore, we evaluate the proposed approach on a real-world problem: optimizing the power production of wind farms by selecting turbine locations. In every evaluated scenario, TensorCRO outperformed the other meta-heuristics and achieved solutions close to the best known in the literature. Overall, our GPU implementation of CRO-SL in TensorFlow provides a new, fast, and efficient approach to solving optimization problems, with significant potential in domains such as engineering, finance, and machine learning, where optimization is often used to solve complex problems. Because it is gradient-free, it is also well suited to optimizing models through which an error gradient cannot be propagated.
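The core trick is easy to sketch: if the fitness is a batched tensor operation, an entire population is evaluated in a single GPU call per generation. Below is an illustrative TensorFlow snippet for the Rastrigin benchmark, not the TensorCRO code itself; the population size and dimensionality are arbitrary.

import math
import tensorflow as tf

@tf.function
def rastrigin(pop):                   # pop: (pop_size, dims) tensor
    return tf.reduce_sum(pop ** 2 - 10.0 * tf.cos(2.0 * math.pi * pop) + 10.0, axis=1)

pop = tf.random.uniform((4096, 30), -5.12, 5.12)  # whole population at once
fitness = rastrigin(pop)              # one batched, GPU-friendly op
best = tf.gather(pop, tf.argmin(fitness))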
Traditionally, designing an expert system involves acquiring knowledge, in the form of symbolic rules, directly from the expert(s), which is a complex and time-consuming task. Although the expert systems approach is quite old, it remains in use, especially where explicit knowledge representation and reasoning, which ensure interpretability and explainability, are necessary. Machine learning methods have therefore been devised to extract rules from data and facilitate that task. However, such methods are inflexible in adapting to the application domain and provide no help in designing the expert system. In this work, we present a framework and corresponding tool, named ACRES, for semi-automatically generating expert systems from datasets. ACRES supports data preprocessing, which helps structure knowledge as a tree, called a rule hierarchy, that represents (possible) dependencies among data variables and is used for rule formation. This improves the interpretability and explainability of the produced systems. We have also designed and evaluated alternative methods for extracting rules from data and for calculating and using certainty factors (CFs) to represent uncertainty; CFs can be dynamically updated. Experimental results on seven well-known datasets show that, on the classification task, the proposed rule extraction methods are comparable to popular machine learning approaches such as decision trees (CART), JRip, PART, and Random Forest. Finally, we give insights into two applications of ACRES.
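For concreteness, the classic MYCIN-style combination rule for certainty factors is sketched below. We do not know the exact CF formulas ACRES uses, so this is only the standard scheme that such systems traditionally build on; the function name is ours.

def combine_cf(cf1, cf2):
    # Combine two certainty factors (each in [-1, 1]) for the same
    # conclusion, MYCIN-style.
    if cf1 >= 0 and cf2 >= 0:
        return cf1 + cf2 * (1 - cf1)
    if cf1 < 0 and cf2 < 0:
        return cf1 + cf2 * (1 + cf1)
    return (cf1 + cf2) / (1 - min(abs(cf1), abs(cf2)))

# Two rules supporting the same class with CF 0.6 and 0.5 combine to 0.8:
assert abs(combine_cf(0.6, 0.5) - 0.8) < 1e-9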
Hand gesture recognition and classification play a pivotal role in automating Human-Computer Interaction (HCI) and have garnered substantial research attention. This study focuses on applying gesture recognition in surgical settings to provide valuable feedback during medical training. A tool-gesture classification system based on Deep Learning (DL) techniques is proposed, specifically a Long Short-Term Memory (LSTM) model with an attention mechanism. The research is structured in three key stages: data pre-processing to eliminate outliers and smooth trajectories, addressing the noise introduced when acquiring surgical instrument data; data augmentation to overcome data scarcity by generating new trajectories through controlled spatial transformations; and the implementation and evaluation of the DL-based classification strategy. The dataset comprises recordings from ten participants of varying surgical experience, covering three types of trajectories and involving both right and left arms. The proposed classifier, combined with the data augmentation strategy, is assessed for its effectiveness in classifying all acquired gestures, and its performance is compared against other DL-based methodologies commonly employed in surgical gesture classification. The results indicate that the proposed approach outperforms these benchmarks, achieving higher classification accuracy and robustness in distinguishing diverse surgical gestures.
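A minimal sketch of an LSTM classifier with additive attention over time steps, in the spirit of the model described; layer sizes, names, and the toy input shapes are ours, not the authors'.

import torch
import torch.nn as nn

class AttnLSTMClassifier(nn.Module):
    def __init__(self, in_dim, hidden, n_classes):
        super().__init__()
        self.lstm = nn.LSTM(in_dim, hidden, batch_first=True)
        self.attn = nn.Linear(hidden, 1)        # scores each time step
        self.out = nn.Linear(hidden, n_classes)

    def forward(self, x):                       # x: (batch, time, in_dim)
        h, _ = self.lstm(x)                     # (batch, time, hidden)
        w = torch.softmax(self.attn(h), dim=1)  # attention weights over time
        ctx = (w * h).sum(dim=1)                # weighted sum of states
        return self.out(ctx)

# Toy usage: 8 trajectories, 100 time steps, 6 kinematic channels, 3 gestures
logits = AttnLSTMClassifier(6, 64, 3)(torch.randn(8, 100, 6))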
Coronary artery disease (CAD) remains the leading cause of death globally, and invasive coronary angiography (ICA) is considered the gold standard of anatomical imaging evaluation when CAD is suspected. However, risk evaluation based on ICA has several limitations, such as the visual assessment of stenosis severity, which suffers from significant interobserver variability. This motivates the development of a lesion classification system that can support specialists in their clinical procedures. Although deep learning classification methods are well developed in other areas of medical imaging, ICA image classification is still at an early stage, largely because of the lack of available, high-quality open-access datasets. In this paper, we present CADICA, a new annotated ICA image dataset that provides the research community with a comprehensive and rigorous collection of coronary angiography, consisting of acquired patient videos and associated disease-related metadata. The dataset can be used by clinicians to train their skills in the angiographic assessment of CAD severity, by computer scientists to create computer-aided diagnostic systems to help in such assessment, and to validate existing methods for CAD detection. In addition, baseline classification methods are proposed and analysed, validating the functionality of CADICA with deep learning-based methods and giving the scientific community a starting point for improving CAD detection.
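As an example of what such a deep learning baseline might look like (an assumption on our part; the paper's exact models and the CADICA label layout are not specified here), one could fine-tune an ImageNet-pretrained CNN for frame-level lesion classification:

import torch
import torch.nn as nn
from torchvision import models

# Hypothetical baseline: binary lesion vs. no-lesion classification of
# angiography frames with a fine-tuned ResNet-18.
model = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)
model.fc = nn.Linear(model.fc.in_features, 2)   # replace the ImageNet head
criterion = nn.CrossEntropyLoss()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)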