Pub Date: 2024-08-12 | DOI: 10.1007/s00521-024-10281-4
Adrian Pekar, Laszlo Arpad Makara, Gergely Biczok
This paper presents a comparative analysis of federated learning (FL) and centralized learning (CL) models for multi-class traffic flow classification in network applications, a timely study given increasing privacy-preservation concerns. Unlike existing literature, which often omits detailed class-wise performance evaluation and consistent data handling and feature selection approaches, our study rectifies these gaps by implementing a feed-forward neural network and assessing FL performance under both independent and identically distributed (IID) and non-independent and identically distributed (non-IID) conditions, with a particular focus on incremental training. In our cross-silo experimental setup involving five clients per round, FL models exhibit notable adaptability. Under IID conditions, the accuracy of the FL model peaked at 96.65%, demonstrating its robustness. Moreover, despite the challenges presented by non-IID environments, our FL models demonstrated significant resilience, adapting incrementally over rounds to optimize performance; in most scenarios, they performed comparably to the idealized CL model across multiple well-established metrics. Through a comprehensive traffic flow classification use case, this work (i) contributes to a better understanding of the capabilities and limitations of FL, offering valuable insights for real-world FL deployment, and (ii) provides a novel, large, carefully curated traffic flow dataset for the research community.
Title: Incremental federated learning for traffic flow classification in heterogeneous data scenarios
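The cross-silo aggregation described in this abstract is commonly realized with FedAvg-style weighted parameter averaging over the clients of each round. A minimal illustrative sketch (not the authors' implementation; the dummy weights and equal client sizes are assumptions for demonstration):

```python
import numpy as np

def fedavg(client_weights, client_sizes):
    """Weighted average of per-client parameter lists (FedAvg-style)."""
    total = float(sum(client_sizes))
    n_layers = len(client_weights[0])
    averaged = []
    for layer in range(n_layers):
        acc = np.zeros_like(client_weights[0][layer], dtype=float)
        for weights, n in zip(client_weights, client_sizes):
            acc += weights[layer] * (n / total)  # weight by local sample count
        averaged.append(acc)
    return averaged

# One incremental round: five clients train locally (here: dummy weights),
# then the server aggregates and would redistribute the result.
clients = [[np.full((2, 2), float(i))] for i in range(5)]
global_weights = fedavg(clients, [100, 100, 100, 100, 100])
```

In incremental training, this aggregate seeds the next round's local models, letting the global model adapt round by round under non-IID client data.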
Pub Date: 2024-08-12 | DOI: 10.1007/s00521-024-10142-0
Harry Rogers, Beatriz De La Iglesia, Tahmina Zebin, Grzegorz Cielniak, Ben Magri
Modern agriculture relies heavily on the precise application of chemicals such as fertilisers, herbicides, and pesticides, which directly affect both crop yield and environmental footprint. Therefore, it is crucial to assess the accuracy of precision sprayers regarding the spatial location of spray deposits. However, there is currently no fully automated evaluation method for this. In this study, we collected a novel dataset from a precision spot spraying system to enable us to classify and detect spray deposits on target weeds and non-target crops. We employed multiple deep convolutional backbones for this task and subsequently proposed a robustness testing methodology for evaluation. We experimented with two novel data augmentation techniques, subtraction and thresholding, which enhanced the classification accuracy and robustness of the developed models. On average, across nine different tests and four distinct convolutional neural networks, subtraction improves robustness by 50.83% and thresholding by 42.26% over a baseline. Additionally, we present results from a novel weakly supervised object detection task using our dataset, establishing a baseline Intersection over Union score of 42.78%. Our proposed pipeline includes an explainable artificial intelligence stage and provides insights not only into the spatial location of the spray deposits but also into the specific filtering methods utilised for classification within that spatial location.
Title: Advancing precision agriculture: domain-specific augmentations and robustness testing for convolutional neural networks in precision spraying evaluation
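The abstract does not define the two augmentations precisely; a plausible minimal sketch, assuming "subtraction" means differencing against a clean reference frame and "thresholding" means binarisation at a fixed intensity:

```python
import numpy as np

def subtraction_augment(image, reference):
    # Difference against a clean reference frame to isolate deposits,
    # clipped back to the valid 8-bit range.
    return np.clip(image.astype(int) - reference.astype(int), 0, 255).astype(np.uint8)

def threshold_augment(image, t=128):
    # Binarise: keep pixels brighter than the threshold, zero the rest.
    return np.where(image > t, 255, 0).astype(np.uint8)

img = np.array([[100, 200], [50, 240]], dtype=np.uint8)
ref = np.full((2, 2), 50, dtype=np.uint8)
```

Both operations suppress background texture, which is consistent with the reported robustness gains against dynamic field backgrounds.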
Pub Date: 2024-08-12 | DOI: 10.1007/s00521-024-10286-z
Lucas O. Teixeira, Diego Bertolini, Luiz S. Oliveira, George D. C. Cavalcanti, Yandre M. G. Costa
A primary challenge in pattern recognition is imbalanced datasets, which result in skewed and biased predictions. This problem is exacerbated by limited data availability, which increases the reliance on expensive expert data labeling. This study introduces a novel method called contrastive dissimilarity, which combines dissimilarity-based representation with contrastive learning to improve classification performance under class imbalance and data scarcity. Based on pairwise sample differences, dissimilarity representation excels in situations with numerous overlapping classes and limited samples per class. Unlike traditional methods that use fixed distance functions such as Euclidean or cosine, our proposal employs metric learning with a contrastive loss to estimate a custom dissimilarity function. We conducted extensive evaluations on 13 databases across multiple training–test splits. The results show that this approach outperforms traditional models such as SVM, random forest, and naive Bayes, particularly in settings with limited training data.
Title: Contrastive dissimilarity: optimizing performance on imbalanced and limited data sets
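The contrastive loss such metric learning typically minimises pulls same-class pairs together and pushes other pairs at least a margin apart; a hedged sketch (illustrative, the paper's exact formulation may differ):

```python
import numpy as np

def contrastive_loss(emb_a, emb_b, same_class, margin=1.0):
    """Mean contrastive loss over a batch of embedding pairs."""
    d = np.linalg.norm(emb_a - emb_b, axis=1)                  # learned dissimilarity
    pos = same_class * d ** 2                                  # pull same-class pairs
    neg = (1 - same_class) * np.maximum(0.0, margin - d) ** 2  # push others apart
    return float(np.mean(pos + neg))
```

Training an encoder under this objective yields the custom dissimilarity function that replaces fixed Euclidean or cosine distances.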
Pub Date: 2024-08-12 | DOI: 10.1007/s00521-024-10237-8
Alberto Manastarla, Leandro A. Silva
In dynamic ensemble selection (DES) techniques, the competence level of each classifier is estimated from a pool of classifiers, and only the most competent ones are selected to classify a specific test sample and predict its class labels. A significant challenge in DES is efficiently estimating classifier competence for accurate prediction, especially when these techniques employ the K-Nearest Neighbors (KNN) algorithm to define the competence region of a test sample based on a validation set (known as the dynamic selection dataset or DSEL). This challenge is exacerbated when the DSEL does not accurately reflect the original data distribution or contains noisy data. Such conditions can reduce the precision of the system, induce unexpected behaviors, and compromise stability. To address these issues, this paper introduces the self-generating prototype ensemble selection (SGP.DES) framework, which combines meta-learning with prototype selection. The proposed meta-classifier of SGP.DES supports multiple classification algorithms and utilizes meta-features from prototypes derived from the original training set, enhancing the selection of the best classifiers for a test sample. The method improves the efficiency of KNN in defining competence regions by generating a reduced and noise-free DSEL set that preserves the original data distribution. Furthermore, the SGP.DES framework facilitates tailored optimization for specific classification challenges through the use of hyperparameters that control prototype selection and the meta-classifier operation mode to select the most appropriate classification algorithm for dynamic selection. Empirical evaluations of twenty-four classification problems have demonstrated that SGP.DES outperforms state-of-the-art DES methods as well as traditional single-model and ensemble methods in terms of accuracy, confirming its effectiveness across a wide range of classification contexts.
Title: Enhancing dynamic ensemble selection: combining self-generating prototypes and meta-classifier for data classification
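KNN-based competence estimation over a DSEL, as described above, can be sketched as follows (an illustrative simplification; the classifier pool is represented as plain prediction functions, and the threshold value is an assumption):

```python
import numpy as np

def select_competent(query, dsel_X, dsel_y, classifiers, k=3, threshold=0.5):
    """Keep classifiers whose accuracy on the query's k nearest DSEL
    neighbours (the competence region) meets the threshold."""
    dists = np.linalg.norm(dsel_X - query, axis=1)
    region = np.argsort(dists)[:k]
    selected = []
    for clf in classifiers:
        competence = np.mean(clf(dsel_X[region]) == dsel_y[region])
        if competence >= threshold:
            selected.append(clf)
    return selected

# Toy DSEL and two stand-in classifiers (plain prediction functions).
dsel_X = np.array([[0.0], [0.1], [0.2], [5.0]])
dsel_y = np.array([0, 0, 0, 1])
always_zero = lambda X: np.zeros(len(X), dtype=int)
always_one = lambda X: np.ones(len(X), dtype=int)
pool = select_competent(np.array([0.05]), dsel_X, dsel_y, [always_zero, always_one])
```

This makes the framework's motivation concrete: if `dsel_X` is noisy or unrepresentative, the competence region misleads the selection, which is exactly what the reduced, noise-free prototype DSEL is meant to prevent.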
Pub Date: 2024-08-12 | DOI: 10.1007/s00521-024-10049-w
Farshad Farahbod
The distillation tower is a crucial component of the refining process. Its energy efficiency has been a major area of research, especially following the oil crisis. This study focuses on optimizing energy consumption in the Shiraz refinery’s distillation unit. The unit is simulated using ASPEN-HYSYS software. Simulation results are validated against real data to ensure model accuracy. The operational data aligns well with model predictions. Following the creation of a data bank using HYSYS software, the tower’s operating conditions are optimized using neural networks and MATLAB software. In this study, a neural network model is developed for the distillation tower. This modeling approach is cost-effective, does not require complex theories, and does not rely on prior system knowledge. Additionally, real-time modeling is achievable through parallel distributed processing. The findings indicate that the optimal feed tray is 9 and the optimal feed temperature is 283.5°C. Furthermore, the optimized number of trays in the distillation tower is 47. Results show that in optimal conditions, cold and hot energy consumption are reduced by approximately 9.7% and 10.8%, respectively. Moreover, implementing optimal conditions results in a reduction of hot energy consumption in the reboiler by 60,000 MW and a reduction of cold energy consumption in the condenser by 30,000 MW.
Title: Optimization of energy consumption of oil refinery reboiler and condenser using neural network
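As a rough illustration of surrogate-based optimisation of operating conditions, the sketch below fits a quadratic least-squares surrogate (a deliberate stand-in for the paper's neural network) to hypothetical feed-temperature/energy samples and minimises it over a grid; all numbers are invented for illustration:

```python
import numpy as np

# Hypothetical (feed temperature, hot duty) samples standing in for the
# HYSYS-generated data bank; the quadratic shape is invented.
temps = np.linspace(250.0, 320.0, 15)
energy = 0.01 * (temps - 283.5) ** 2 + 5.0

# Quadratic least-squares surrogate (a stand-in for the paper's neural
# network), minimised over a fine grid of candidate feed temperatures.
coeffs = np.polyfit(temps, energy, deg=2)
grid = np.linspace(250.0, 320.0, 7001)
best_temp = grid[np.argmin(np.polyval(coeffs, grid))]
```

The paper's workflow is the same shape: train a cheap model on simulator data, then search its input space for the operating point that minimises predicted energy use.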
Pub Date: 2024-08-12 | DOI: 10.1007/s00521-024-10236-9
Youmin Zhang, Lei Sun, Ye Wang, Qun Liu, Li Liu
One-shot knowledge graph completion (KGC) aims to infer unseen facts when only one support entity pair is available for a particular relationship. Prior studies learn reference representations from one support pair for matching query pairs. This strategy can be challenging, particularly when multiple relationships hold between identical support pairs, resulting in indistinguishable reference representations. To this end, we propose a disentangled representation learning framework for one-shot KGC. Specifically, to learn sufficient representations, we construct an entity encoder with a fine-grained attention mechanism to explicitly model the input and output neighbors. We adopt an orthogonal regularizer to promote the independence of learned factors in entity representation, enabling the matching processor with max pooling to adaptively identify the semantic roles associated with a particular relation. The one-shot KGC task is then accomplished by seamlessly integrating these modules in an end-to-end learning manner. Extensive experiments on real-world datasets demonstrate that the proposed framework outperforms existing approaches.
Title: One-shot knowledge graph completion based on disentangled representation learning
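An orthogonal regularizer of the kind mentioned above typically penalises the factor matrix's Gram matrix for deviating from the identity; a minimal sketch (illustrative, not necessarily the authors' exact formulation):

```python
import numpy as np

def orthogonal_regularizer(W):
    """Penalty ||W W^T - I||_F^2 on a (n_factors, dim) matrix of factor
    directions; zero exactly when the rows are orthonormal."""
    G = W @ W.T
    return float(np.sum((G - np.eye(W.shape[0])) ** 2))
```

Added to the training loss, this pushes the learned factors of an entity representation toward mutual independence, which is what lets max pooling pick out distinct semantic roles per relation.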
Pub Date: 2024-08-12 | DOI: 10.1007/s00521-024-10266-3
Ujwalla Gawande, Kamal Hajari, Yogesh Golhar, Punit Fulzele
In this paper, we propose a novel keyframe extraction method based on the gray wolf optimization (GWO) algorithm, addressing the information loss that redundant and similar frames cause in traditional methods. The proposed method, GWOKConvLSTM, prioritizes speed, accuracy, and compression efficiency while preserving semantic information. Inspired by wolf behavior, we construct a fitness function that minimizes reconstruction error and achieves optimal compression ratios below 8%. Compared to traditional methods, our GWO method achieves the lowest reconstruction error for a given compression rate, providing a concise and visually coherent summary of keyframes while maintaining consistency across similar motions. Additionally, we propose a template-based method for video classification tasks, achieving the highest accuracy when combined with pre-trained CNNs and ConvLSTM. Our method effectively prevents dynamic background noise from affecting keyframe selection, leading to significantly improved video classification performance with deep neural networks.
Title: A novel gray wolf optimization-based key frame extraction method for video classification using ConvLSTM
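A compact sketch of the gray wolf optimization loop (standard GWO update rules; the paper's fitness would score a candidate keyframe subset by reconstruction error and compression ratio, here replaced by a simple stand-in objective):

```python
import numpy as np

def gwo_minimize(fitness, dim, n_wolves=10, iters=100, lb=-5.0, ub=5.0, seed=0):
    """Minimal gray wolf optimizer: wolves move toward the three current
    best solutions (alpha, beta, delta) under a shrinking exploration factor."""
    rng = np.random.default_rng(seed)
    X = rng.uniform(lb, ub, (n_wolves, dim))
    for t in range(iters):
        scores = np.array([fitness(x) for x in X])
        alpha, beta, delta = X[np.argsort(scores)[:3]]
        a = 2.0 - 2.0 * t / iters  # linearly decreasing exploration
        new_X = np.empty_like(X)
        for i, x in enumerate(X):
            moves = []
            for leader in (alpha, beta, delta):
                r1, r2 = rng.random(dim), rng.random(dim)
                A, C = 2 * a * r1 - a, 2 * r2
                moves.append(leader - A * np.abs(C * leader - x))
            new_X[i] = np.clip(np.mean(moves, axis=0), lb, ub)
        X = new_X
    scores = np.array([fitness(x) for x in X])
    return X[np.argmin(scores)]

# Stand-in objective; for keyframe extraction, x would encode a candidate
# frame subset and the fitness would trade off reconstruction error
# against compression ratio.
best = gwo_minimize(lambda x: float(np.sum(x ** 2)), dim=3)
```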
In this paper, we propose a novel voxel-based 3D single object tracking (3D SOT) method called Voxel Pseudo Image Tracking (VPIT). VPIT is the first method that uses voxel pseudo images for 3D SOT. The input point cloud is structured by pillar-based voxelization, and the resulting pseudo image is used as input to a 2D-like Siamese SOT method. The pseudo image is created in Bird's-eye View (BEV) coordinates, so the objects in it have constant size; only the object rotation can change in the new coordinate system, not the object scale. For this reason, we replace multi-scale search with a multi-rotation search, where differently rotated search regions are compared against a single target representation to predict both the position and rotation of the object. Experiments on the KITTI [1] tracking dataset show that VPIT is the fastest 3D SOT method while maintaining competitive Success and Precision values. Applying an SOT method in a real-world scenario faces limitations such as the lower computational capabilities of embedded devices and a latency-unforgiving environment, where the method is forced to skip data frames if its inference speed is not high enough. We implement a real-time evaluation protocol and show that other methods lose most of their performance on embedded devices, while VPIT maintains its ability to track the object.
Title: VPIT: real-time embedded single object 3D tracking using voxel pseudo images
Authors: Illia Oleksiienko, Paraskevi Nousi, Nikolaos Passalis, Anastasios Tefas, Alexandros Iosifidis
Pub Date: 2024-08-12 | DOI: 10.1007/s00521-024-10259-2
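Pillar-based voxelization into a BEV pseudo image, as described above, can be sketched with a simple per-pillar point-count grid (an illustrative stand-in for learned pillar features; ranges and resolution are arbitrary):

```python
import numpy as np

def bev_pseudo_image(points, x_range=(0.0, 4.0), y_range=(0.0, 4.0), resolution=1.0):
    """Project 3D points into a Bird's-eye View count grid, one cell per
    pillar (a simple stand-in for learned pillar features)."""
    nx = int((x_range[1] - x_range[0]) / resolution)
    ny = int((y_range[1] - y_range[0]) / resolution)
    img = np.zeros((nx, ny), dtype=np.int32)
    for x, y, _z in points:  # height is collapsed into the pillar
        ix = int((x - x_range[0]) / resolution)
        iy = int((y - y_range[0]) / resolution)
        if 0 <= ix < nx and 0 <= iy < ny:
            img[ix, iy] += 1
    return img

pseudo = bev_pseudo_image([(0.5, 0.5, 0.0), (0.5, 0.7, 1.2), (3.5, 3.5, -0.3)])
```

Because the grid is metric, an object occupies the same number of cells regardless of distance, which is why the method can drop multi-scale search in favour of multi-rotation search.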
Pub Date: 2024-08-12 | DOI: 10.1007/s00521-024-10179-1
Swash Sami Mohammed, Hülya Gökalp Clarke
The availability of comprehensive datasets is a crucial challenge for developing artificial intelligence (AI) models in various applications and fields. The lack of large and diverse public fabric defect datasets forms a major obstacle to properly and accurately developing and training AI models for detecting and classifying fabric defects in real-life applications. Models trained on limited datasets struggle to identify underrepresented defects, reducing their practicality. To address these issues, this study suggests using a conditional generative adversarial network (cGAN) for fabric defect data augmentation. The proposed image-to-image translator GAN features a conditional U-Net generator and a 6-layered PatchGAN discriminator. The conditional U-Net generator can produce highly realistic synthetic defective samples and offers the ability to control various characteristics of the generated samples by taking two input images: a segmented defect mask and a clean fabric image. The segmented defect mask provides information about various aspects of the defects to be added to the clean fabric sample, including their type, shape, size, and location. By augmenting the training dataset with diverse and realistic synthetic samples, the AI models can learn to identify a broader range of defects more accurately. This technique helps overcome the limitations of small or unvaried datasets, leading to improved defect detection accuracy and generalizability.
{"title":"Conditional image-to-image translation generative adversarial network (cGAN) for fabric defect data augmentation","authors":"Swash Sami Mohammed, Hülya Gökalp Clarke","doi":"10.1007/s00521-024-10179-1","DOIUrl":"https://doi.org/10.1007/s00521-024-10179-1","url":null,"abstract":"<p>The availability of comprehensive datasets is a crucial challenge for developing artificial intelligence (AI) models in various applications and fields. The lack of large and diverse public fabric defect datasets forms a major obstacle to properly and accurately developing and training AI models for detecting and classifying fabric defects in real-life applications. Models trained on limited datasets struggle to identify underrepresented defects, reducing their practicality. To address these issues, this study suggests using a conditional generative adversarial network (cGAN) for fabric defect data augmentation. The proposed image-to-image translator GAN features a conditional U-Net generator and a 6-layered PatchGAN discriminator. The conditional U-Network (U-Net) generator can produce highly realistic synthetic defective samples and offers the ability to control various characteristics of the generated samples by taking two input images: a segmented defect mask and a clean fabric image. The segmented defect mask provides information about various aspects of the defects to be added to the clean fabric sample, including their type, shape, size, and location. By augmenting the training dataset with diverse and realistic synthetic samples, the AI models can learn to identify a broader range of defects more accurately. This technique helps overcome the limitations of small or unvaried datasets, leading to improved defect detection accuracy and generalizability. 
Moreover, this proposed augmentation method can find applications in other challenging fields, such as generating synthetic samples for medical imaging datasets related to brain and lung tumors.</p>","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141933008","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
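The two-input conditioning described in the abstract above can be sketched in a few lines. This is a minimal stand-in, not the authors' implementation: the real generator is a full U-Net and the real discriminator has six layers, whereas the layer counts, channel widths, and image sizes below are illustrative assumptions. What the sketch does show faithfully is the conditioning scheme (defect mask and clean fabric concatenated channel-wise) and the PatchGAN idea of one real/fake score per local patch.

```python
import torch
import torch.nn as nn

class TinyCondGenerator(nn.Module):
    """Stand-in for the conditional U-Net: maps (mask, clean) -> defective."""
    def __init__(self, ch=16):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(1 + 3, ch, 3, padding=1), nn.ReLU(),
            nn.Conv2d(ch, ch, 3, padding=1), nn.ReLU(),
            nn.Conv2d(ch, 3, 3, padding=1), nn.Tanh(),
        )

    def forward(self, mask, clean):
        # Conditioning: concatenate the segmented defect mask (1 channel)
        # with the clean fabric image (3 channels) along the channel axis.
        return self.net(torch.cat([mask, clean], dim=1))

class TinyPatchDiscriminator(nn.Module):
    """PatchGAN-style critic: outputs a grid of per-patch real/fake scores
    rather than a single scalar for the whole image."""
    def __init__(self, ch=16):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(1 + 3 + 3, ch, 4, stride=2, padding=1), nn.LeakyReLU(0.2),
            nn.Conv2d(ch, 1, 4, stride=2, padding=1),
        )

    def forward(self, mask, clean, candidate):
        return self.net(torch.cat([mask, clean, candidate], dim=1))

mask = torch.rand(2, 1, 64, 64)   # defect mask encodes type/shape/size/location
clean = torch.rand(2, 3, 64, 64)  # clean fabric sample
fake = TinyCondGenerator()(mask, clean)
scores = TinyPatchDiscriminator()(mask, clean, fake)
```

Here `fake` keeps the input resolution (2, 3, 64, 64), while `scores` is a (2, 1, 16, 16) grid, one logit per receptive-field patch.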
Entity alignment (EA) aims to discover the equivalent entities in different knowledge graphs (KGs), which plays an important role in knowledge engineering. Recently, EA with dangling entities has been proposed as a more realistic setting, which assumes that not all entities have corresponding equivalent entities. In this paper, we focus on this setting. Some work has explored this problem by leveraging translation APIs, pre-trained word embeddings, and other off-the-shelf tools. However, these approaches over-rely on side information (e.g., entity names) and fail to work when that information is absent. Moreover, they insufficiently exploit the most fundamental information in a KG: its graph structure. To improve the exploitation of structural information, we propose a novel entity alignment framework called Structure-aware Wasserstein Graph Contrastive Learning (SWGCL), which is refined along three dimensions: (i) Model. We propose a novel Gated Graph Attention Network to capture local and global graph structure attention. (ii) Training. Two learning objectives, contrastive learning and optimal transport learning, are designed to obtain distinguishable entity representations. (iii) Inference. In the inference phase, a PageRank-based method, HOSS (Higher-Order Structural Similarity), is proposed to compute higher-order graph structural similarity. Extensive experiments on two dangling benchmarks demonstrate that our SWGCL outperforms the current state-of-the-art methods with pure structural information in both traditional (relaxed) and dangling (consolidated) settings.
{"title":"Advancing entity alignment with dangling cases: a structure-aware approach through optimal transport learning and contrastive learning","authors":"Jin Xu, Yangning Li, Xiangjin Xie, Niu Hu, Yinghui Li, Hai-Tao Zheng, Yong Jiang","doi":"10.1007/s00521-024-10276-1","DOIUrl":"https://doi.org/10.1007/s00521-024-10276-1","url":null,"abstract":"<p>Entity alignment (EA) aims to discover the equivalent entities in different knowledge graphs (KGs), which plays an important role in knowledge engineering. Recently, EA with dangling entities has been proposed as a more realistic setting, which assumes that not all entities have corresponding equivalent entities. In this paper, we focus on this setting. Some work has explored this problem by leveraging translation API, pre-trained word embeddings, and other off-the-shelf tools. However, these approaches over-rely on the side information (e.g., entity names) and fail to work when the side information is absent. On the contrary, they still insufficiently exploit the most fundamental graph structure information in KG. To improve the exploitation of the structural information, we propose a novel entity alignment framework called Structure-aware Wasserstein Graph Contrastive Learning (SWGCL), which is refined on three dimensions: (i) Model. We propose a novel Gated Graph Attention Network to capture local and global graph structure attention. (ii) Training. Two learning objectives: contrastive learning and optimal transport learning, are designed to obtain distinguishable entity representations. (iii) Inference. In the inference phase, a PageRank-based method HOSS (Higher-Order Structural Similarity) is proposed to calculate higher-order graph structural similarity. 
Extensive experiments on two dangling benchmarks demonstrate that our SWGCL outperforms the current state-of-the-art methods with pure structural information in both traditional (relaxed) and dangling (consolidated) settings.</p>","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141933173","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
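The optimal transport component of the alignment described above can be illustrated with a small Sinkhorn sketch. This is not the authors' SWGCL code: the embeddings here are random stand-ins rather than outputs of a gated graph attention network, and uniform marginals with entropic regularization are illustrative assumptions. The sketch shows the core mechanic: turn pairwise embedding distances into a cost matrix, solve an entropic OT problem between the two entity sets, and read an alignment off the soft transport plan.

```python
import numpy as np

def sinkhorn(cost, reg=0.1, iters=200):
    """Entropic optimal transport between uniform marginals over the
    source- and target-KG entity sets; returns the transport plan."""
    K = np.exp(-cost / reg)
    a = np.full(cost.shape[0], 1.0 / cost.shape[0])
    b = np.full(cost.shape[1], 1.0 / cost.shape[1])
    v = np.ones(cost.shape[1])
    for _ in range(iters):
        u = a / (K @ v)        # alternately rescale rows ...
        v = b / (K.T @ u)      # ... and columns to match the marginals
    return np.diag(u) @ K @ np.diag(v)

rng = np.random.default_rng(1)
src = rng.normal(size=(4, 8))                     # source-KG entity embeddings
perm = np.array([2, 0, 3, 1])                     # ground-truth correspondence
tgt = src[perm] + 0.01 * rng.normal(size=(4, 8))  # permuted, noisy copies

# Cost = pairwise Euclidean distance between embeddings.
cost = np.linalg.norm(src[:, None, :] - tgt[None, :, :], axis=-1)
plan = sinkhorn(cost)
pred = plan.argmax(axis=1)  # hard alignment read off the soft plan
```

Because matched pairs have near-zero cost, the plan concentrates mass on the true correspondence; in a dangling setting, a row whose mass stays diffuse (no dominant column) is a natural candidate for an entity with no counterpart.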