Expert Systems with Applications最新文献_第3页

SF-GAN: Semantic fusion generative adversarial networks for text-to-image synthesis SF-GAN：用于文本到图像合成的语义融合生成对抗网络

IF 7.5 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Expert Systems with Applications

Pub Date : 2024-10-29 DOI: 10.1016/j.eswa.2024.125583

Text-to-image synthesis aims to generate high-quality realistic images conditioned on text description. The major challenge of this task rests on the deep and seamless integration of text and image features. Therefore, in this paper, we present a novel approach, e.g., semantic fusion generative adversarial networks (SF-GAN), for fine-grained text-to-image generation, which enables efficient semantic interactions. Specifically, our proposed SF-GAN leverages a novel recurrent semantic fusion network to seamlessly manipulate the global allocation of text information across discrete fusion blocks. Moreover, with the usage of the contrastive loss and the dynamic convolution, SF-GAN could fuse the text and image information more accurately and further improve the semantic consistency in the generate stage. During the discrimination stage, we introduce a word-level discriminator designed to offer the generator precise feedback pertaining to each individual word. When compared to current state-of-the-art techniques, our SF-GAN demonstrates remarkable efficiency in generating realistic and text-aligned images, outperforming its contemporaries on challenging benchmark datasets.

文本到图像的合成旨在根据文本描述生成高质量的逼真图像。这项任务的主要挑战在于如何深入、无缝地整合文本和图像特征。因此，在本文中，我们提出了一种新方法，如语义融合生成对抗网络（SF-GAN），用于细粒度文本到图像的生成，从而实现高效的语义交互。具体来说，我们提出的 SF-GAN 利用新颖的递归语义融合网络，在离散融合块之间无缝操作文本信息的全局分配。此外，通过使用对比损失和动态卷积，SF-GAN 可以更准确地融合文本和图像信息，进一步提高生成阶段的语义一致性。在判别阶段，我们引入了词级判别器，旨在为生成器提供与每个单词相关的精确反馈。与目前最先进的技术相比，我们的 SF-GAN 在生成逼真的文本对齐图像方面表现出了卓越的效率，在具有挑战性的基准数据集上，我们的 SF-GAN 优于同类技术。

{"title":"SF-GAN: Semantic fusion generative adversarial networks for text-to-image synthesis","authors":"","doi":"10.1016/j.eswa.2024.125583","DOIUrl":"10.1016/j.eswa.2024.125583","url":null,"abstract":"<div><div>Text-to-image synthesis aims to generate high-quality realistic images conditioned on text description. The major challenge of this task rests on the deep and seamless integration of text and image features. Therefore, in this paper, we present a novel approach, e.g., semantic fusion generative adversarial networks (SF-GAN), for fine-grained text-to-image generation, which enables efficient semantic interactions. Specifically, our proposed SF-GAN leverages a novel recurrent semantic fusion network to seamlessly manipulate the global allocation of text information across discrete fusion blocks. Moreover, with the usage of the contrastive loss and the dynamic convolution, SF-GAN could fuse the text and image information more accurately and further improve the semantic consistency in the generate stage. During the discrimination stage, we introduce a word-level discriminator designed to offer the generator precise feedback pertaining to each individual word. When compared to current state-of-the-art techniques, our SF-GAN demonstrates remarkable efficiency in generating realistic and text-aligned images, outperforming its contemporaries on challenging benchmark datasets.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":null,"pages":null},"PeriodicalIF":7.5,"publicationDate":"2024-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142539020","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Artificial bee colony algorithm based on multiple indicators for many-objective optimization with irregular Pareto fronts 基于多指标的人工蜂群算法，用于具有不规则帕累托前沿的多目标优化

IF 7.5 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Expert Systems with Applications

Pub Date : 2024-10-29 DOI: 10.1016/j.eswa.2024.125613

Artificial bee colony (ABC) algorithm has shown excellent performance over many single and multi-objective optimization problems (MOPs). However, ABC encounters some difficulties when solving many-objective optimization problems (MaOPs) with irregular Pareto fronts (PFs). The possible reasons include two aspects: (1) there are many non-dominated solutions in the population and the low selection pressure cannot move the population toward the PF; and (2) it is hard to maintain population diversity for PFs having irregular geometric structures. To address these issues, a new many-objective ABC variant based on multiple indicators (called MIMaOABC) is proposed in this paper. Firstly, a convergence indicator

I_{ɛ_{+}}

and a diversity indicator (

D i v

) based on parallel distance are utilized. A single indicator may have preferences and it easily causes the population to converge to a subregion of the PF. Then, a two-stage environmental selection method is designed based on the two indicators. In the first stage, the

I_{ɛ_{+}}

based environmental selection is used to improve the convergence. In the second stage, the

D i v

based environmental selection is employed to maintain diversity and handle irregular PFs. To balance exploration and exploitation during the search, multiple search strategies are used in different search stages, respectively. In the onlooker bee stage, solutions with good convergence are chosen for further search based on a new selection mechanism. In order to verify the performance of MIMaOABC, a set of well-known benchmark problems with degenerate, discontinuous, inverted, and regular PFs are tested. Performance of MIMaOABC is compared with eight state-of-the-art algorithms. Computational results shows that the proposed MIMaOABC is competitive in solving MaOPs with both irregular and regular PFs.

人工蜂群（ABC）算法在许多单目标和多目标优化问题（MOPs）中表现出了卓越的性能。然而，在解决具有不规则帕累托前沿（PFs）的多目标优化问题（MaOPs）时，人工蜂群算法遇到了一些困难。可能的原因包括两个方面：（1）种群中有许多非主导解，低选择压力无法使种群向帕累托前沿移动；（2）对于具有不规则几何结构的帕累托前沿，很难保持种群的多样性。针对这些问题，本文提出了一种新的基于多指标的多目标 ABC 变体（称为 MIMaOABC）。首先，利用收敛指标 Iɛ+ 和基于平行距离的多样性指标 (Div)。单一指标可能具有偏好性，容易使种群收敛到 PF 的一个子区域。然后，根据这两个指标设计了一种两阶段环境选择方法。在第一阶段，使用基于 Iɛ+ 的环境选择来提高收敛性。在第二阶段，采用基于 Div 的环境选择来保持多样性和处理不规则的 PF。为了平衡搜索过程中的探索和利用，在不同的搜索阶段分别采用了多种搜索策略。在观望蜂阶段，基于新的选择机制，选择收敛性好的解决方案进行进一步搜索。为了验证 MIMaOABC 的性能，测试了一组具有退化、不连续、倒置和规则 PF 的著名基准问题。MIMaOABC 的性能与八种最先进的算法进行了比较。计算结果表明，所提出的 MIMaOABC 在解决具有不规则和规则 PF 的 MaOPs 时都具有竞争力。

{"title":"Artificial bee colony algorithm based on multiple indicators for many-objective optimization with irregular Pareto fronts","authors":"","doi":"10.1016/j.eswa.2024.125613","DOIUrl":"10.1016/j.eswa.2024.125613","url":null,"abstract":"<div><div>Artificial bee colony (ABC) algorithm has shown excellent performance over many single and multi-objective optimization problems (MOPs). However, ABC encounters some difficulties when solving many-objective optimization problems (MaOPs) with irregular Pareto fronts (PFs). The possible reasons include two aspects: (1) there are many non-dominated solutions in the population and the low selection pressure cannot move the population toward the PF; and (2) it is hard to maintain population diversity for PFs having irregular geometric structures. To address these issues, a new many-objective ABC variant based on multiple indicators (called MIMaOABC) is proposed in this paper. Firstly, a convergence indicator <span><math><msub><mrow><mi>I</mi></mrow><mrow><msub><mrow><mi>ɛ</mi></mrow><mrow><mo>+</mo></mrow></msub></mrow></msub></math></span> and a diversity indicator (<span><math><mrow><mi>D</mi><mi>i</mi><mi>v</mi></mrow></math></span>) based on parallel distance are utilized. A single indicator may have preferences and it easily causes the population to converge to a subregion of the PF. Then, a two-stage environmental selection method is designed based on the two indicators. In the first stage, the <span><math><msub><mrow><mi>I</mi></mrow><mrow><msub><mrow><mi>ɛ</mi></mrow><mrow><mo>+</mo></mrow></msub></mrow></msub></math></span> based environmental selection is used to improve the convergence. In the second stage, the <span><math><mrow><mi>D</mi><mi>i</mi><mi>v</mi></mrow></math></span> based environmental selection is employed to maintain diversity and handle irregular PFs. To balance exploration and exploitation during the search, multiple search strategies are used in different search stages, respectively. In the onlooker bee stage, solutions with good convergence are chosen for further search based on a new selection mechanism. In order to verify the performance of MIMaOABC, a set of well-known benchmark problems with degenerate, discontinuous, inverted, and regular PFs are tested. Performance of MIMaOABC is compared with eight state-of-the-art algorithms. Computational results shows that the proposed MIMaOABC is competitive in solving MaOPs with both irregular and regular PFs.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":null,"pages":null},"PeriodicalIF":7.5,"publicationDate":"2024-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142571808","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A two-stage hybrid flow-shop formulation for sterilization processes in hospitals 医院消毒流程的两阶段混合流程-车间配方

IF 7.5 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Expert Systems with Applications

Pub Date : 2024-10-29 DOI: 10.1016/j.eswa.2024.125624

Sterile processing is a critical secondary process and a major cost factor in the processing, acquisition, and storage of costly medical devices. This article aims to improve the performance of sterile processing by developing, implementing, and evaluating a dispatching rule-based algorithm to reduce the time medical devices spend in the central sterile supply department using a two-stage hybrid flow-shop formulation. The algorithm combines dispatching rules with stage decomposition and compatibility conditions. A genetic algorithm is designed to benchmark the performance in addition to an analytic bound. Real-world data from a large German hospital were used to test the effectiveness of the heuristics. The case study demonstrated the practical implications of the approach, leading to a reduction in the time medical devices spend in the system and improved utilization of washer-disinfector machines and sterilizers. It also highlighted the importance of aligning machine capacity with demand and the potential trade-offs associated with batch processing decisions. Our approach can contribute to substantial operational cost savings and efficiency gains, offering significant benefits to decision makers at both the operational and tactical levels.

无菌处理是一个关键的辅助流程，也是昂贵医疗器械的加工、购置和储存过程中的主要成本因素。本文旨在通过开发、实施和评估一种基于调度规则的算法来提高无菌处理的性能，该算法采用两阶段混合流-shop公式来减少医疗器械在中央无菌供应部门的停留时间。该算法将调度规则与阶段分解和兼容性条件相结合。除分析约束外，还设计了一种遗传算法来确定性能基准。来自德国一家大型医院的真实数据被用来测试启发式算法的有效性。案例研究证明了该方法的实际意义，它缩短了医疗设备在系统中的停留时间，提高了清洗消毒机和灭菌器的利用率。它还强调了根据需求调整机器容量的重要性，以及与批量处理决策相关的潜在权衡。我们的方法有助于节省大量运营成本并提高效率，为运营和战术层面的决策者带来巨大收益。

{"title":"A two-stage hybrid flow-shop formulation for sterilization processes in hospitals","authors":"","doi":"10.1016/j.eswa.2024.125624","DOIUrl":"10.1016/j.eswa.2024.125624","url":null,"abstract":"<div><div>Sterile processing is a critical secondary process and a major cost factor in the processing, acquisition, and storage of costly medical devices. This article aims to improve the performance of sterile processing by developing, implementing, and evaluating a dispatching rule-based algorithm to reduce the time medical devices spend in the central sterile supply department using a two-stage hybrid flow-shop formulation. The algorithm combines dispatching rules with stage decomposition and compatibility conditions. A genetic algorithm is designed to benchmark the performance in addition to an analytic bound. Real-world data from a large German hospital were used to test the effectiveness of the heuristics. The case study demonstrated the practical implications of the approach, leading to a reduction in the time medical devices spend in the system and improved utilization of washer-disinfector machines and sterilizers. It also highlighted the importance of aligning machine capacity with demand and the potential trade-offs associated with batch processing decisions. Our approach can contribute to substantial operational cost savings and efficiency gains, offering significant benefits to decision makers at both the operational and tactical levels.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":null,"pages":null},"PeriodicalIF":7.5,"publicationDate":"2024-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142577872","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Industrial robot energy consumption model identification: A coupling model-driven and data-driven paradigm 工业机器人能耗模型识别：模型驱动和数据驱动的耦合范式

IF 7.5 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Expert Systems with Applications

Pub Date : 2024-10-28 DOI: 10.1016/j.eswa.2024.125604

Due to wide distribution and low energy efficiency, the energy-saving in industrial robots (IRs) is attracting extensive attention. Accurate energy consumption (EC) models of IRs lay the foundation for energy-saving. However, most dynamic and electrical parameters of IRs are not disclosed by manufacturers, which leads to the invalidity of most model-based EC prediction methods. To bridge this gap, a mechanism-data hybrid-driven method is proposed to predict the EC of IRs in this paper. First, a joint torque prediction model integrating a hybrid-driven parameter identification is developed based on deep reinforcement learning (DRL). The framework for DRL-based parameter identification is constructed through tailored design of interfaces and training mechanisms, wherein the DRL agent can learn to identify the dynamic parameters from the trajectory database. And a deep neural network based on long short-term memory (LSTM) is proposed to predict the EC of IRs according to the joint torques and velocities. The nonlinear item, which is not modeled in the robot dynamic equation, are also encapsulated in the deep neural network with one-dimensional convolutional neural network (1D-CNN) layers to improve the prediction accuracy. To validate the accuracy and efficacy of the proposed method, experiments are conducted on a KUKA KR60-3 industrial robot with different loads. The results demonstrate that the proposed method can predict EC with a mean absolute percentage error of less than 2% under a fixed load and less than 3% under loads not used for agent training.

由于工业机器人（IR）分布广、能效低，其节能问题受到广泛关注。精确的工业机器人能耗（EC）模型为节能奠定了基础。然而，由于制造商并未公布工业机器人的大部分动态参数和电气参数，导致大多数基于模型的能耗预测方法无效。为了弥补这一缺陷，本文提出了一种机制-数据混合驱动的方法来预测 IR 的 EC。首先，在深度强化学习（DRL）的基础上，开发了一种集成了混合驱动参数识别的联合扭矩预测模型。通过量身定制的界面设计和训练机制，构建了基于 DRL 的参数识别框架，其中 DRL 代理可以从轨迹数据库中学习识别动态参数。此外，还提出了一种基于长短期记忆（LSTM）的深度神经网络，用于根据关节扭矩和速度预测 IR 的 EC。机器人动态方程中没有建模的非线性项目也被封装在深度神经网络中的一维卷积神经网络（1D-CNN）层中，以提高预测精度。为了验证所提方法的准确性和有效性，我们在不同负载的 KUKA KR60-3 工业机器人上进行了实验。结果表明，在固定负载下，所提出的方法能够以低于 2% 的平均绝对百分比误差预测 EC，而在未用于代理培训的负载下，误差则低于 3%。

{"title":"Industrial robot energy consumption model identification: A coupling model-driven and data-driven paradigm","authors":"","doi":"10.1016/j.eswa.2024.125604","DOIUrl":"10.1016/j.eswa.2024.125604","url":null,"abstract":"<div><div>Due to wide distribution and low energy efficiency, the energy-saving in industrial robots (IRs) is attracting extensive attention. Accurate energy consumption (EC) models of IRs lay the foundation for energy-saving. However, most dynamic and electrical parameters of IRs are not disclosed by manufacturers, which leads to the invalidity of most model-based EC prediction methods. To bridge this gap, a mechanism-data hybrid-driven method is proposed to predict the EC of IRs in this paper. First, a joint torque prediction model integrating a hybrid-driven parameter identification is developed based on deep reinforcement learning (DRL). The framework for DRL-based parameter identification is constructed through tailored design of interfaces and training mechanisms, wherein the DRL agent can learn to identify the dynamic parameters from the trajectory database. And a deep neural network based on long short-term memory (LSTM) is proposed to predict the EC of IRs according to the joint torques and velocities. The nonlinear item, which is not modeled in the robot dynamic equation, are also encapsulated in the deep neural network with one-dimensional convolutional neural network (1D-CNN) layers to improve the prediction accuracy. To validate the accuracy and efficacy of the proposed method, experiments are conducted on a KUKA KR60-3 industrial robot with different loads. The results demonstrate that the proposed method can predict EC with a mean absolute percentage error of less than 2% under a fixed load and less than 3% under loads not used for agent training.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":null,"pages":null},"PeriodicalIF":7.5,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142571807","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Multiplex graph fusion network with reinforcement structure learning for fraud detection in online e-commerce platforms 利用强化结构学习的多重图融合网络检测在线电子商务平台中的欺诈行为

IF 7.5 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Expert Systems with Applications

Pub Date : 2024-10-28 DOI: 10.1016/j.eswa.2024.125598

Fraudulent activities on e-commerce platforms, such as spamming product reviews or fake payment behaviors, seriously mislead users’ purchasing decisions and harm platform integrity. To effectively identify fraudsters, recent research mainly attempts to employ graph neural networks (GNNs) with aggregating neighborhood features for detecting the fraud suspiciousness. However, GNNs are vulnerable to carefully-crafted perturbations in the graph structure, and the camouflage strategies of collusive fraudsters limit the effectiveness of GNNs-based fraud detectors. To address these issues, a novel multiplex graph fusion network with reinforcement structure learning (RestMGFN) is proposed in this paper to reveal the collaborative camouflage review fraud. Specifically, an adaptive graph structure learning module is designed to generate high-quality graph representation by utilizing paradigm constraints on the intrinsic properties of graph. Multiple relation-specific graphs are then constructed using meta-path search for capturing the deep semantic features of fraudulent activities. Finally, we incorporate the multiplex graph representations module into a unified framework, jointly optimizing the graph structure and corresponding embedding representations. Comprehensive experiments on real-world datasets verify the effectiveness and robustness of the proposed model compared with state-of-the-art approaches.

电子商务平台上的欺诈活动，如垃圾商品评论或虚假支付行为，严重误导了用户的购买决策，损害了平台的诚信。为了有效识别欺诈者，近年来的研究主要尝试采用具有聚合邻域特征的图神经网络（GNN）来检测欺诈可疑性。然而，图神经网络容易受到精心设计的图结构扰动的影响，而且合谋欺诈者的伪装策略也限制了基于图神经网络的欺诈检测器的有效性。为解决这些问题，本文提出了一种具有强化结构学习功能的新型多重图融合网络（RestMGFN），以揭示协同伪装审查欺诈行为。具体来说，本文设计了一个自适应图结构学习模块，利用对图内在属性的范式约束生成高质量的图表示。然后，利用元路径搜索构建多个特定关系图，以捕捉欺诈活动的深层语义特征。最后，我们将多重图表示模块纳入统一框架，共同优化图结构和相应的嵌入表示。在真实世界数据集上进行的综合实验验证了与最先进的方法相比，所提出模型的有效性和鲁棒性。

{"title":"Multiplex graph fusion network with reinforcement structure learning for fraud detection in online e-commerce platforms","authors":"","doi":"10.1016/j.eswa.2024.125598","DOIUrl":"10.1016/j.eswa.2024.125598","url":null,"abstract":"<div><div>Fraudulent activities on e-commerce platforms, such as spamming product reviews or fake payment behaviors, seriously mislead users’ purchasing decisions and harm platform integrity. To effectively identify fraudsters, recent research mainly attempts to employ graph neural networks (GNNs) with aggregating neighborhood features for detecting the fraud suspiciousness. However, GNNs are vulnerable to carefully-crafted perturbations in the graph structure, and the camouflage strategies of collusive fraudsters limit the effectiveness of GNNs-based fraud detectors. To address these issues, a novel multiplex graph fusion network with reinforcement structure learning (RestMGFN) is proposed in this paper to reveal the collaborative camouflage review fraud. Specifically, an adaptive graph structure learning module is designed to generate high-quality graph representation by utilizing paradigm constraints on the intrinsic properties of graph. Multiple relation-specific graphs are then constructed using meta-path search for capturing the deep semantic features of fraudulent activities. Finally, we incorporate the multiplex graph representations module into a unified framework, jointly optimizing the graph structure and corresponding embedding representations. Comprehensive experiments on real-world datasets verify the effectiveness and robustness of the proposed model compared with state-of-the-art approaches.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":null,"pages":null},"PeriodicalIF":7.5,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142593546","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Phase spectrogram of EEG from S-transform Enhances epileptic seizure detection 根据 S 变换绘制脑电图相位频谱图提高癫痫发作检测能力

IF 7.5 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Expert Systems with Applications

Pub Date : 2024-10-28 DOI: 10.1016/j.eswa.2024.125621

Automatic epilepsy seizure detection has high clinical value since it can alleviate the burden of manual monitoring. Nevertheless, it remains a technically challenging task to achieve a reliable system. In this study, we investigated the significance of the phase information in EEG signals in seizure detection using machine learning. We used the Stockwell transform (S-transform) to extract both phase and power spectra of the EEG signal in epilepsy patients. A dual-stream convolution neural network (CNN) model was adopted as the classifier, which takes both spectra as inputs. We demonstrated that the phase input allows the CNN model to capture the heightened phase synchronization among EEG channels in seizure and add network attention to both the low- and high-frequency features of the inputs in the CHB-MIT and Bonn databases. We improved the detection AUC-ROC by 6.68% on the CHB-MIT database when adding phase inputs to the power inputs. By incorporating a channel fusion post-processing to the outputs of this CNN model, it achieves a sensitivity and specificity of 79.59% and 92.23%, respectively, surpassing some of the state-of-the-art methods. Our results show that the phase inputs are useful features in seizure detection. This discovery has significant implications for improving the effectiveness of automatic seizure detection systems.

癫痫发作自动检测具有很高的临床价值，因为它可以减轻人工监测的负担。然而，要实现一个可靠的系统，在技术上仍是一项具有挑战性的任务。在本研究中，我们利用机器学习研究了脑电信号中的相位信息在癫痫发作检测中的重要性。我们使用斯托克韦尔变换（S-transform）来提取癫痫患者脑电信号的相位和功率谱。分类器采用双流卷积神经网络（CNN）模型，将两个频谱作为输入。我们证明，相位输入可使 CNN 模型捕捉到癫痫发作时脑电图通道间更高的相位同步性，并增加网络对 CHB-MIT 和波恩数据库中输入的低频和高频特征的关注。在 CHB-MIT 数据库中，将相位输入添加到功率输入后，我们将检测 AUC-ROC 提高了 6.68%。通过对该 CNN 模型的输出进行信道融合后处理，其灵敏度和特异度分别达到了 79.59% 和 92.23%，超过了一些最先进的方法。我们的结果表明，相位输入是癫痫发作检测的有用特征。这一发现对提高癫痫发作自动检测系统的有效性具有重要意义。

{"title":"Phase spectrogram of EEG from S-transform Enhances epileptic seizure detection","authors":"","doi":"10.1016/j.eswa.2024.125621","DOIUrl":"10.1016/j.eswa.2024.125621","url":null,"abstract":"<div><div>Automatic epilepsy seizure detection has high clinical value since it can alleviate the burden of manual monitoring. Nevertheless, it remains a technically challenging task to achieve a reliable system. In this study, we investigated the significance of the phase information in EEG signals in seizure detection using machine learning. We used the Stockwell transform (S-transform) to extract both phase and power spectra of the EEG signal in epilepsy patients. A dual-stream convolution neural network (CNN) model was adopted as the classifier, which takes both spectra as inputs. We demonstrated that the phase input allows the CNN model to capture the heightened phase synchronization among EEG channels in seizure and add network attention to both the low- and high-frequency features of the inputs in the CHB-MIT and Bonn databases. We improved the detection AUC-ROC by 6.68% on the CHB-MIT database when adding phase inputs to the power inputs. By incorporating a channel fusion post-processing to the outputs of this CNN model, it achieves a sensitivity and specificity of 79.59% and 92.23%, respectively, surpassing some of the state-of-the-art methods. Our results show that the phase inputs are useful features in seizure detection. This discovery has significant implications for improving the effectiveness of automatic seizure detection systems.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":null,"pages":null},"PeriodicalIF":7.5,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142577871","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

MEFDPN: Mixture exponential family distribution posterior networks for evaluating data uncertainty MEFDPN：用于评估数据不确定性的混合指数族分布后验网络

IF 7.5 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Expert Systems with Applications

Pub Date : 2024-10-28 DOI: 10.1016/j.eswa.2024.125593

The computation of uncertainty are crucial for developing a reliable machine learning model. The natural posterior network (NatPN) provides uncertainty estimation for any single exponential family distribution, but real-world data is often complex. Therefore, we introduce a mixture exponential family posterior network (MEFDPN), which extends the prior distribution to a mixture of exponential family distributions, aiming to fit complex distributions that better represent real data. During network training, MEFDPN independently updates the posterior Bayesian estimates for each prior distribution, and the weights of these distributions are updated based on the forward propagation results. Furthermore, MEFDPN calculates two types of uncertainty (aleatoric and epistemic) and combines them using entropy weighting to obtain a comprehensive confidence measure for each data point. Theoretically, MEFDPN achieves higher prediction accuracy, and experimental results demonstrate its capability to compute high-quality data comprehensive confidence. Moreover, it shows encouraging accuracy in Out-of-Distribution(OOD) detection and validation experiments. Finally, we apply MEFDPN to a materials dataset, efficiently filtering out OOD data. This results in a significant enhancement of prediction accuracy for machine learning models. Specifically, removing only 5% of outlier data leads to a 2%–5% improvement in accuracy.

不确定性的计算对于开发可靠的机器学习模型至关重要。自然后验网络（NatPN）可对任何单一指数族分布进行不确定性估计，但现实世界的数据往往很复杂。因此，我们引入了混合指数族后验网络（MEFDPN），它将先验分布扩展为指数族分布的混合分布，旨在拟合更能代表真实数据的复杂分布。在网络训练过程中，MEFDPN 会独立更新每个先验分布的后验贝叶斯估计值，并根据前向传播结果更新这些分布的权重。此外，MEFDPN 还计算两种类型的不确定性（先验不确定性和认识不确定性），并使用熵加权法将它们结合起来，从而为每个数据点获得综合置信度。从理论上讲，MEFDPN 可以获得更高的预测精度，实验结果也证明了它计算高质量数据综合置信度的能力。此外，它在失配（OOD）检测和验证实验中也表现出了令人鼓舞的准确性。最后，我们将 MEFDPN 应用于材料数据集，有效地过滤掉了 OOD 数据。这大大提高了机器学习模型的预测准确性。具体来说，只需去除 5%的离群数据，就能提高 2%-5% 的准确率。

{"title":"MEFDPN: Mixture exponential family distribution posterior networks for evaluating data uncertainty","authors":"","doi":"10.1016/j.eswa.2024.125593","DOIUrl":"10.1016/j.eswa.2024.125593","url":null,"abstract":"<div><div>The computation of uncertainty are crucial for developing a reliable machine learning model. The natural posterior network (NatPN) provides uncertainty estimation for any single exponential family distribution, but real-world data is often complex. Therefore, we introduce a mixture exponential family posterior network (MEFDPN), which extends the prior distribution to a mixture of exponential family distributions, aiming to fit complex distributions that better represent real data. During network training, MEFDPN independently updates the posterior Bayesian estimates for each prior distribution, and the weights of these distributions are updated based on the forward propagation results. Furthermore, MEFDPN calculates two types of uncertainty (aleatoric and epistemic) and combines them using entropy weighting to obtain a comprehensive confidence measure for each data point. Theoretically, MEFDPN achieves higher prediction accuracy, and experimental results demonstrate its capability to compute high-quality data comprehensive confidence. Moreover, it shows encouraging accuracy in Out-of-Distribution(OOD) detection and validation experiments. Finally, we apply MEFDPN to a materials dataset, efficiently filtering out OOD data. This results in a significant enhancement of prediction accuracy for machine learning models. Specifically, removing only 5% of outlier data leads to a 2%–5% improvement in accuracy.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":null,"pages":null},"PeriodicalIF":7.5,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142571830","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Recommendation feedback-based dynamic adaptive training for efficient social item recommendation 基于推荐反馈的动态自适应训练，实现高效的社交项目推荐

IF 7.5 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Expert Systems with Applications

Pub Date : 2024-10-28 DOI: 10.1016/j.eswa.2024.125605

For the application of social item recommendation, how to effectively dig out the implicit relationships between different items plays a crucial role in its performance. However, existing social item recommendation systems constructed their item graphs using a static method based on item features. Considering the fact that most items, such as live streams, can hardly be characterized with limited number of feature tags in reality, the static construction methods make it hard to accurately grasp the underlying item–item relationships. To address the problem, we propose an item graph generation method based on Recommendation Feedback and Dynamic Adaptive Training (RFDAT) to achieve an efficient social item recommendation. Specifically, a multi-task learning technique is leveraged to concurrently predict the item graph and user–item interaction graph, allowing the recommendation task itself to directly participate in the dynamic construction process of the item graph, which is adaptively constructed based on feedback from recommendation results iteratively during the training procedure. Compared with the static construction methods, this allows us to fully explore item–item relationships and item feature representations, therefore improving recommendation accuracy. Furthermore, a lightweight graph convolutional denoising and fusion method based on Laplacian smoothing filter is employed to achieve deep interaction and fusion among multi-graph features, and effectively mitigate the influence of noise in the process of feature learning. Finally, extensive experimental results on four public datasets show that compared with eight state-of-the-art methods, our proposed method achieves improvements of 4.97%, 2.90%, 2.03%, and 4.82% in the important evaluation metric NDCG@10 on Yelp, Ciao, LastFM, and Douban datasets, respectively. It also illustrates very competitive performance against these baselines in the recommendation accuracy for cold users and the recommendation rate for cold items.

对于社交物品推荐的应用而言，如何有效挖掘不同物品之间的隐含关系对其性能起着至关重要的作用。然而，现有的社交项目推荐系统都是基于项目特征的静态方法来构建项目图。考虑到大多数项目（如直播流）在现实中很难用有限的特征标签来表征，静态构建方法很难准确把握项目与项目之间的隐含关系。针对这一问题，我们提出了一种基于推荐反馈和动态自适应训练（RFDAT）的项目图生成方法，以实现高效的社交项目推荐。具体来说，该方法利用多任务学习技术同时预测物品图和用户-物品交互图，让推荐任务本身直接参与物品图的动态构建过程，并在训练过程中根据推荐结果的反馈迭代自适应地构建物品图。与静态构建方法相比，这使我们能够充分探索项目与项目之间的关系和项目特征表征，从而提高推荐的准确性。此外，我们还采用了基于拉普拉斯平滑滤波器的轻量级图卷积去噪与融合方法，实现了多图特征之间的深度交互与融合，并有效降低了特征学习过程中的噪声影响。最后，在四个公共数据集上的大量实验结果表明，与八种最先进的方法相比，我们提出的方法在 Yelp、Ciao、LastFM 和豆瓣数据集上的重要评价指标 NDCG@10 分别提高了 4.97%、2.90%、2.03% 和 4.82%。此外，在冷用户推荐准确率和冷项目推荐率方面，与这些基线相比也具有很强的竞争力。

{"title":"Recommendation feedback-based dynamic adaptive training for efficient social item recommendation","authors":"","doi":"10.1016/j.eswa.2024.125605","DOIUrl":"10.1016/j.eswa.2024.125605","url":null,"abstract":"<div><div>For the application of social item recommendation, how to effectively dig out the implicit relationships between different items plays a crucial role in its performance. However, existing social item recommendation systems constructed their item graphs using a static method based on item features. Considering the fact that most items, such as live streams, can hardly be characterized with limited number of feature tags in reality, the static construction methods make it hard to accurately grasp the underlying item–item relationships. To address the problem, we propose an item graph generation method based on Recommendation Feedback and Dynamic Adaptive Training (RFDAT) to achieve an efficient social item recommendation. Specifically, a multi-task learning technique is leveraged to concurrently predict the item graph and user–item interaction graph, allowing the recommendation task itself to directly participate in the dynamic construction process of the item graph, which is adaptively constructed based on feedback from recommendation results iteratively during the training procedure. Compared with the static construction methods, this allows us to fully explore item–item relationships and item feature representations, therefore improving recommendation accuracy. Furthermore, a lightweight graph convolutional denoising and fusion method based on Laplacian smoothing filter is employed to achieve deep interaction and fusion among multi-graph features, and effectively mitigate the influence of noise in the process of feature learning. Finally, extensive experimental results on four public datasets show that compared with eight state-of-the-art methods, our proposed method achieves improvements of 4.97%, 2.90%, 2.03%, and 4.82% in the important evaluation metric NDCG@10 on Yelp, Ciao, LastFM, and Douban datasets, respectively. It also illustrates very competitive performance against these baselines in the recommendation accuracy for cold users and the recommendation rate for cold items.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":null,"pages":null},"PeriodicalIF":7.5,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142552577","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Weighted ensemble based on differentiated sampling rates for imbalanced classification and application to credit risk assessment 基于不同采样率的加权集合，用于不平衡分类并应用于信用风险评估

IF 7.5 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Expert Systems with Applications

Pub Date : 2024-10-28 DOI: 10.1016/j.eswa.2024.125595

Imbalanced data classification is an important research topic in machine learning. The class imbalance problem has a great impact on the classification performance of the algorithm. In this research direction, proposing an effective sampling strategy for imbalanced data is a challenging task. Although a lot of methods have been proposed to classify imbalanced data, the problem remains open. If a method reflects the data distribution and removes noisy samples, then good classification results will be obtained. Therefore, this paper proposes a weighted ensemble algorithm based on differentiated sampling rates (KSDE) and apply it to the field of credit risk assessment. KSDE removes noisy samples using the outlier detection technique. Then, multiple balanced training subsets are generated to train submodels using differentiated sampling rates. These training subsets sufficiently represent the distribution of data. Finally, the well-performing submodels are weighted and integrated to obtain the prediction result. We conducted comprehensive experiments to validate the performance of the proposed method. Comparing 12 state-of-the-art methods on 23 datasets. KSDE outperforms the recently proposed SPE (Self-paced Ensemble) by 12.46% in terms of TPR (True Positive Rate). In addition, KSDE achieves good results on 7 credit risk datasets. The experimental results show that the proposed method is competitive in solving the imbalanced data classification problem.

不平衡数据分类是机器学习中的一个重要研究课题。类不平衡问题对算法的分类性能有很大影响。在这一研究方向中，针对不平衡数据提出有效的采样策略是一项具有挑战性的任务。虽然人们已经提出了很多方法来对不平衡数据进行分类，但这个问题仍然没有解决。如果一种方法能反映数据分布并去除噪声样本，那么就能获得良好的分类结果。因此，本文提出了一种基于差异采样率的加权集合算法（KSDE），并将其应用于信用风险评估领域。KSDE 利用离群点检测技术去除噪声样本。然后，生成多个平衡的训练子集，利用差异化采样率训练子模型。这些训练子集充分代表了数据的分布。最后，对表现良好的子模型进行加权和整合，得到预测结果。我们进行了全面的实验来验证所提方法的性能。在 23 个数据集上比较了 12 种最先进的方法。KSDE的TPR（真阳性率）比最近提出的SPE（自步调集合）高出12.46%。此外，KSDE 在 7 个信用风险数据集上也取得了良好的结果。实验结果表明，所提出的方法在解决不平衡数据分类问题上具有竞争力。

{"title":"Weighted ensemble based on differentiated sampling rates for imbalanced classification and application to credit risk assessment","authors":"","doi":"10.1016/j.eswa.2024.125595","DOIUrl":"10.1016/j.eswa.2024.125595","url":null,"abstract":"<div><div>Imbalanced data classification is an important research topic in machine learning. The class imbalance problem has a great impact on the classification performance of the algorithm. In this research direction, proposing an effective sampling strategy for imbalanced data is a challenging task. Although a lot of methods have been proposed to classify imbalanced data, the problem remains open. If a method reflects the data distribution and removes noisy samples, then good classification results will be obtained. Therefore, this paper proposes a weighted ensemble algorithm based on differentiated sampling rates (KSDE) and apply it to the field of credit risk assessment. KSDE removes noisy samples using the outlier detection technique. Then, multiple balanced training subsets are generated to train submodels using differentiated sampling rates. These training subsets sufficiently represent the distribution of data. Finally, the well-performing submodels are weighted and integrated to obtain the prediction result. We conducted comprehensive experiments to validate the performance of the proposed method. Comparing 12 state-of-the-art methods on 23 datasets. KSDE outperforms the recently proposed SPE (Self-paced Ensemble) by 12.46% in terms of TPR (True Positive Rate). In addition, KSDE achieves good results on 7 credit risk datasets. The experimental results show that the proposed method is competitive in solving the imbalanced data classification problem.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":null,"pages":null},"PeriodicalIF":7.5,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142552580","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Class activation map-based slicing-concatenation and contrastive learning: A novel strategy for record-level atrial fibrillation detection 基于类激活图的切分-合并和对比学习：记录级心房颤动检测的新策略

IF 7.5 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Expert Systems with Applications

Pub Date : 2024-10-28 DOI: 10.1016/j.eswa.2024.125619

Background

Deep learning-based models for atrial fibrillation (AF) detection require extensive training data, which often necessitates labor-intensive professional annotation. While data augmentation techniques have been employed to mitigate the scarcity of annotated electrocardiogram (ECG) data, specific augmentation methods tailored for recording-level ECG annotations are lacking. This gap hampers the development of robust deep learning models for AF detection.

Methods

We propose a novel strategy, a combination of Class Activation Map-based Slicing-Concatenation (CAM-SC) data augmentation and contrastive learning, to address the current challenges. Initially, a baseline model incorporating a global average pooling layer is trained for classification and to generate class activation maps (CAMs), which highlight indicative ECG segments. After that, in each recording, indicative and non-indicative segments are sliced. These segments are subsequently concatenated randomly based on starting and ending Q points of QRS complexes, with indicative segments preserved to maintain label correctness. Finally, the augmented dataset undergoes contrastive learning to learn general representations, thereby enhancing AF detection performance.

Results

Using ResNet-101 as the baseline model, training with the augmented data yielded the highest F1-score of 0.861 on the Computing in Cardiology (CinC) Challenge 2017 dataset, a typical AF dataset with recording-level annotations. The metrics outperform most previous studies.

Conclusions

This study introduces an innovative data augmentation method specifically designed for recording-level ECG annotations, significantly enhancing AF detection using deep learning models. This approach has substantial implications for future AF detection research.

背景基于深度学习的心房颤动（AF）检测模型需要大量的训练数据，而这往往需要劳动密集型的专业注释。虽然数据扩增技术已被用于缓解心电图（ECG）注释数据稀缺的问题，但目前还缺乏针对记录级心电图注释的特定扩增方法。我们提出了一种新颖的策略，将基于类激活图的切片-合并（CAM-SC）数据增强和对比学习相结合，以应对当前的挑战。首先，对包含全局平均汇集层的基线模型进行分类训练，并生成类激活图（CAM），突出指示性心电图片段。然后，在每个记录中，对指示性和非指示性片段进行切片。随后，根据 QRS 波群的起始和终止 Q 点随机连接这些片段，并保留指示性片段以保持标签的正确性。结果使用 ResNet-101 作为基线模型，在 2017 年心脏病学计算（CinC）挑战赛数据集（一个具有记录级注释的典型房颤数据集）上，使用增强数据进行训练获得了最高的 F1 分数 0.861。结论本研究介绍了一种专门针对记录级心电图注释的创新数据增强方法，显著增强了使用深度学习模型的房颤检测能力。这种方法对未来的房颤检测研究具有重大意义。

{"title":"Class activation map-based slicing-concatenation and contrastive learning: A novel strategy for record-level atrial fibrillation detection","authors":"","doi":"10.1016/j.eswa.2024.125619","DOIUrl":"10.1016/j.eswa.2024.125619","url":null,"abstract":"<div><h3>Background</h3><div>Deep learning-based models for atrial fibrillation (AF) detection require extensive training data, which often necessitates labor-intensive professional annotation. While data augmentation techniques have been employed to mitigate the scarcity of annotated electrocardiogram (ECG) data, specific augmentation methods tailored for recording-level ECG annotations are lacking. This gap hampers the development of robust deep learning models for AF detection.</div></div><div><h3>Methods</h3><div>We propose a novel strategy, a combination of Class Activation Map-based Slicing-Concatenation (CAM-SC) data augmentation and contrastive learning, to address the current challenges. Initially, a baseline model incorporating a global average pooling layer is trained for classification and to generate class activation maps (CAMs), which highlight indicative ECG segments. After that, in each recording, indicative and non-indicative segments are sliced. These segments are subsequently concatenated randomly based on starting and ending Q points of QRS complexes, with indicative segments preserved to maintain label correctness. Finally, the augmented dataset undergoes contrastive learning to learn general representations, thereby enhancing AF detection performance.</div></div><div><h3>Results</h3><div>Using ResNet-101 as the baseline model, training with the augmented data yielded the highest F1-score of 0.861 on the Computing in Cardiology (CinC) Challenge 2017 dataset, a typical AF dataset with recording-level annotations. The metrics outperform most previous studies.</div></div><div><h3>Conclusions</h3><div>This study introduces an innovative data augmentation method specifically designed for recording-level ECG annotations, significantly enhancing AF detection using deep learning models. This approach has substantial implications for future AF detection research.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":null,"pages":null},"PeriodicalIF":7.5,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142552578","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0