首页 > 最新文献

Cognitive Computation最新文献

英文 中文
Intelligent Fisheries: Cognitive Solutions for Improving Aquaculture Commercial Efficiency Through Enhanced Biomass Estimation and Early Disease Detection 智能渔业:通过增强生物量估算和早期疾病检测提高水产养殖商业效率的认知解决方案
IF 5.4 3区 计算机科学 Q1 Computer Science Pub Date : 2024-06-01 DOI: 10.1007/s12559-024-10292-2
Kanwal Aftab, Linda Tschirren, Boris Pasini, Peter Zeller, Bostan Khan, Muhammad Moazam Fraz

With the burgeoning global demand for seafood, potential solutions like aquaculture are increasingly significant, provided they address issues like pollution and food security challenges in a sustainable manner. However, significant obstacles such as disease outbreaks and inaccurate biomass estimation underscore the need for optimized solutions. This paper proposes “Fish-Sense”, a deep learning-based pipeline inspired by the human visual system’s ability to recognize and classify objects, developed in conjunction with fish farms, aiming to enhance disease detection and biomass estimation in the aquaculture industry. Our automated framework is two-pronged: one module for biomass estimation using deep learning algorithms to segment fish, classify species, and estimate biomass; and another for disease symptom detection symptoms, employing deep learning algorithms to classify fish into healthy and unhealthy categories, and subsequently identifying symptoms and locations of bacterial infections if a fish is classified as unhealthy. To overcome data scarcity in this field, we have created four novel real-world datasets for fish segmentation, health classification, species classification, and fish part segmentation. Our biomass estimation algorithms demonstrated substantial accuracy across five species, and the health classification. These algorithms provide a foundation for the development of industrial software solutions to improve fish health monitoring in aquaculture farms. Our integrated pipeline facilitates the transition from research to real-world applications, potentially encouraging responsible aquaculture practices. Nevertheless, these advancements must be seen as part of a comprehensive strategy aimed at improving the aquaculture industry’s sustainability and efficiency, in line with the United Nations’ Sustainable Development Goals’ evolving interpretations. The code, trained models, and the data for this project can be obtained from the following GitHub repository: https://github.com/Vision-At-SEECS/Fish-Sense.

随着全球对海产品的需求急剧增长,水产养殖等潜在解决方案的重要性日益凸显,前提是它们能以可持续的方式解决污染和粮食安全挑战等问题。然而,疾病爆发和生物量估算不准确等重大障碍凸显了优化解决方案的必要性。本文提出的 "鱼感 "是一种基于深度学习的管道,其灵感来源于人类视觉系统识别和分类物体的能力,与养鱼场共同开发,旨在提高水产养殖业的疾病检测和生物量估算能力。我们的自动化框架是双管齐下的:一个模块用于生物量估算,利用深度学习算法对鱼类进行分割、物种分类和生物量估算;另一个模块用于疾病症状检测,利用深度学习算法将鱼类分为健康和不健康两类,如果鱼类被归类为不健康,则随后识别细菌感染的症状和位置。为了克服该领域数据稀缺的问题,我们创建了四个新的真实世界数据集,用于鱼类分割、健康分类、物种分类和鱼类部位分割。我们的生物量估算算法在五个物种和健康分类中都表现出了相当高的准确性。这些算法为开发工业软件解决方案,改善水产养殖场的鱼类健康监测奠定了基础。我们的集成管道促进了从研究到实际应用的过渡,有可能鼓励负责任的水产养殖实践。不过,这些进展必须被视为旨在提高水产养殖业可持续性和效率的综合战略的一部分,以符合联合国可持续发展目标不断发展的解释。本项目的代码、训练有素的模型和数据可从以下 GitHub 存储库获取:https://github.com/Vision-At-SEECS/Fish-Sense。
{"title":"Intelligent Fisheries: Cognitive Solutions for Improving Aquaculture Commercial Efficiency Through Enhanced Biomass Estimation and Early Disease Detection","authors":"Kanwal Aftab, Linda Tschirren, Boris Pasini, Peter Zeller, Bostan Khan, Muhammad Moazam Fraz","doi":"10.1007/s12559-024-10292-2","DOIUrl":"https://doi.org/10.1007/s12559-024-10292-2","url":null,"abstract":"<p>With the burgeoning global demand for seafood, potential solutions like aquaculture are increasingly significant, provided they address issues like pollution and food security challenges in a sustainable manner. However, significant obstacles such as disease outbreaks and inaccurate biomass estimation underscore the need for optimized solutions. This paper proposes “Fish-Sense”, a deep learning-based pipeline inspired by the human visual system’s ability to recognize and classify objects, developed in conjunction with fish farms, aiming to enhance disease detection and biomass estimation in the aquaculture industry. Our automated framework is two-pronged: one module for biomass estimation using deep learning algorithms to segment fish, classify species, and estimate biomass; and another for disease symptom detection symptoms, employing deep learning algorithms to classify fish into healthy and unhealthy categories, and subsequently identifying symptoms and locations of bacterial infections if a fish is classified as unhealthy. To overcome data scarcity in this field, we have created four novel real-world datasets for fish segmentation, health classification, species classification, and fish part segmentation. Our biomass estimation algorithms demonstrated substantial accuracy across five species, and the health classification. These algorithms provide a foundation for the development of industrial software solutions to improve fish health monitoring in aquaculture farms. Our integrated pipeline facilitates the transition from research to real-world applications, potentially encouraging responsible aquaculture practices. Nevertheless, these advancements must be seen as part of a comprehensive strategy aimed at improving the aquaculture industry’s sustainability and efficiency, in line with the United Nations’ Sustainable Development Goals’ evolving interpretations. The code, trained models, and the data for this project can be obtained from the following GitHub repository: https://github.com/Vision-At-SEECS/Fish-Sense.</p>","PeriodicalId":51243,"journal":{"name":"Cognitive Computation","volume":null,"pages":null},"PeriodicalIF":5.4,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141192117","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Review of Key Technologies for Emotion Analysis Using Multimodal Information 利用多模态信息进行情感分析的关键技术综述
IF 5.4 3区 计算机科学 Q1 Computer Science Pub Date : 2024-06-01 DOI: 10.1007/s12559-024-10287-z
Xianxun Zhu, Chaopeng Guo, Heyang Feng, Yao Huang, Yichen Feng, Xiangyang Wang, Rui Wang

Emotion analysis, an integral aspect of human–machine interactions, has witnessed significant advancements in recent years. With the rise of multimodal data sources such as speech, text, and images, there is a profound need for a comprehensive review of pivotal elements within this domain. Our paper delves deep into the realm of emotion analysis, examining multimodal data sources encompassing speech, text, images, and physiological signals. We provide a curated overview of relevant literature, academic forums, and competitions. Emphasis is laid on dissecting unimodal processing methods, including preprocessing, feature extraction, and tools across speech, text, images, and physiological signals. We further discuss the nuances of multimodal data fusion techniques, spotlighting early, late, model, and hybrid fusion strategies. Key findings indicate the essentiality of analyzing emotions across multiple modalities. Detailed discussions on emotion elicitation, expression, and representation models are presented. Moreover, we uncover challenges such as dataset creation, modality synchronization, model efficiency, limited data scenarios, cross-domain applicability, and the handling of missing modalities. Practical solutions and suggestions are provided to address these challenges. The realm of multimodal emotion analysis is vast, with numerous applications ranging from driver sentiment detection to medical evaluations. Our comprehensive review serves as a valuable resource for both scholars and industry professionals. It not only sheds light on the current state of research but also highlights potential directions for future innovations. The insights garnered from this paper are expected to pave the way for subsequent advancements in deep multimodal emotion analysis tailored for real-world deployments.

情感分析是人机交互不可或缺的一个方面,近年来取得了长足的进步。随着语音、文本和图像等多模态数据源的兴起,我们亟需对这一领域的关键要素进行全面回顾。我们的论文深入情感分析领域,研究了包括语音、文本、图像和生理信号在内的多模态数据源。我们提供了相关文献、学术论坛和竞赛的策划概述。重点是剖析单模态处理方法,包括预处理、特征提取和跨语音、文本、图像和生理信号的工具。我们进一步讨论了多模态数据融合技术的细微差别,重点介绍了早期、后期、模型和混合融合策略。主要研究结果表明了通过多种模式分析情绪的重要性。我们详细讨论了情绪激发、表达和表现模型。此外,我们还揭示了诸如数据集创建、模态同步、模型效率、有限数据场景、跨领域适用性以及处理缺失模态等方面的挑战。针对这些挑战,我们提供了实用的解决方案和建议。多模态情感分析的领域十分广阔,从驾驶员情感检测到医疗评估等应用不胜枚举。我们的全面综述对学者和行业专业人士来说都是宝贵的资源。它不仅揭示了研究现状,还强调了未来创新的潜在方向。从本文中获得的真知灼见有望为后续针对现实世界部署的深度多模态情感分析的进步铺平道路。
{"title":"A Review of Key Technologies for Emotion Analysis Using Multimodal Information","authors":"Xianxun Zhu, Chaopeng Guo, Heyang Feng, Yao Huang, Yichen Feng, Xiangyang Wang, Rui Wang","doi":"10.1007/s12559-024-10287-z","DOIUrl":"https://doi.org/10.1007/s12559-024-10287-z","url":null,"abstract":"<p>Emotion analysis, an integral aspect of human–machine interactions, has witnessed significant advancements in recent years. With the rise of multimodal data sources such as speech, text, and images, there is a profound need for a comprehensive review of pivotal elements within this domain. Our paper delves deep into the realm of emotion analysis, examining multimodal data sources encompassing speech, text, images, and physiological signals. We provide a curated overview of relevant literature, academic forums, and competitions. Emphasis is laid on dissecting unimodal processing methods, including preprocessing, feature extraction, and tools across speech, text, images, and physiological signals. We further discuss the nuances of multimodal data fusion techniques, spotlighting early, late, model, and hybrid fusion strategies. Key findings indicate the essentiality of analyzing emotions across multiple modalities. Detailed discussions on emotion elicitation, expression, and representation models are presented. Moreover, we uncover challenges such as dataset creation, modality synchronization, model efficiency, limited data scenarios, cross-domain applicability, and the handling of missing modalities. Practical solutions and suggestions are provided to address these challenges. The realm of multimodal emotion analysis is vast, with numerous applications ranging from driver sentiment detection to medical evaluations. Our comprehensive review serves as a valuable resource for both scholars and industry professionals. It not only sheds light on the current state of research but also highlights potential directions for future innovations. The insights garnered from this paper are expected to pave the way for subsequent advancements in deep multimodal emotion analysis tailored for real-world deployments.</p>","PeriodicalId":51243,"journal":{"name":"Cognitive Computation","volume":null,"pages":null},"PeriodicalIF":5.4,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141191886","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhanced Android Ransomware Detection Through Hybrid Simultaneous Swarm-Based Optimization 通过基于蜂群的混合同步优化增强安卓勒索软件检测能力
IF 5.4 3区 计算机科学 Q1 Computer Science Pub Date : 2024-06-01 DOI: 10.1007/s12559-024-10301-4
Moutaz Alazab, Ruba Abu Khurma, David Camacho, Alejandro Martín

Ransomware is a significant security threat that poses a serious risk to the security of smartphones, and its impact on portable devices has been extensively discussed in a number of research papers. In recent times, this threat has witnessed a significant increase, causing substantial losses for both individuals and organizations. The emergence and widespread occurrence of diverse forms of ransomware present a significant impediment to the pursuit of reliable security measures that can effectively combat them. This constitutes a formidable challenge due to the dynamic nature of ransomware, which renders traditional security protocols inadequate, as they might have a high false alarm rate and exert significant processing demands on mobile devices that are restricted by limited battery life, CPU, and memory. This paper proposes a novel intelligent method for detecting ransomware that is based on a hybrid multi-solution binary JAYA algorithm with a single-solution simulated annealing (SA). The primary objective is to leverage the exploitation power of SA in supporting the exploration power of the binary JAYA algorithm. This approach results in a better balance between global and local search milestones. The empirical results of our research demonstrate the superiority of the proposed SMO-BJAYA-SA-SVM method over other algorithms based on the evaluation measures used. The proposed method achieved an accuracy rate of 98.7%, a precision of 98.6%, a recall of 98.7%, and an F1 score of 98.6%. Therefore, we believe that our approach is an effective method for detecting ransomware on portable devices. It has the potential to provide a more reliable and efficient solution to this growing security threat.

勒索软件是一种对智能手机安全构成严重威胁的重大安全威胁,其对便携式设备的影响已在许多研究论文中进行了广泛讨论。近来,这种威胁显著增加,给个人和组织都造成了巨大损失。各种形式的勒索软件不断涌现并广泛传播,严重阻碍了可靠安全措施的有效实施。由于勒索软件的动态特性,传统的安全协议可能会有较高的误报率,并对受限于电池寿命、CPU 和内存的移动设备提出了大量的处理要求,这就构成了一个巨大的挑战。本文提出了一种新型智能方法来检测勒索软件,该方法基于混合多解二进制 JAYA 算法和单解模拟退火(SA)。其主要目的是利用 SA 的开发能力来支持二进制 JAYA 算法的探索能力。这种方法能更好地平衡全局和局部搜索里程碑。我们的研究实证结果表明,根据所使用的评估指标,所提出的 SMO-BJAYA-SA-SVM 方法优于其他算法。提出的方法达到了 98.7% 的准确率、98.6% 的精确率、98.7% 的召回率和 98.6% 的 F1 分数。因此,我们认为我们的方法是检测便携式设备上勒索软件的有效方法。它有望为这一日益严重的安全威胁提供更可靠、更高效的解决方案。
{"title":"Enhanced Android Ransomware Detection Through Hybrid Simultaneous Swarm-Based Optimization","authors":"Moutaz Alazab, Ruba Abu Khurma, David Camacho, Alejandro Martín","doi":"10.1007/s12559-024-10301-4","DOIUrl":"https://doi.org/10.1007/s12559-024-10301-4","url":null,"abstract":"<p>Ransomware is a significant security threat that poses a serious risk to the security of smartphones, and its impact on portable devices has been extensively discussed in a number of research papers. In recent times, this threat has witnessed a significant increase, causing substantial losses for both individuals and organizations. The emergence and widespread occurrence of diverse forms of ransomware present a significant impediment to the pursuit of reliable security measures that can effectively combat them. This constitutes a formidable challenge due to the dynamic nature of ransomware, which renders traditional security protocols inadequate, as they might have a high false alarm rate and exert significant processing demands on mobile devices that are restricted by limited battery life, CPU, and memory. This paper proposes a novel intelligent method for detecting ransomware that is based on a hybrid multi-solution binary JAYA algorithm with a single-solution simulated annealing (SA). The primary objective is to leverage the exploitation power of SA in supporting the exploration power of the binary JAYA algorithm. This approach results in a better balance between global and local search milestones. The empirical results of our research demonstrate the superiority of the proposed SMO-BJAYA-SA-SVM method over other algorithms based on the evaluation measures used. The proposed method achieved an accuracy rate of 98.7%, a precision of 98.6%, a recall of 98.7%, and an F1 score of 98.6%. Therefore, we believe that our approach is an effective method for detecting ransomware on portable devices. It has the potential to provide a more reliable and efficient solution to this growing security threat.</p>","PeriodicalId":51243,"journal":{"name":"Cognitive Computation","volume":null,"pages":null},"PeriodicalIF":5.4,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141198428","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
RA-Net: Region-Aware Attention Network for Skin Lesion Segmentation RA-Net:用于皮损分割的区域感知注意力网络
IF 5.4 3区 计算机科学 Q1 Computer Science Pub Date : 2024-06-01 DOI: 10.1007/s12559-024-10304-1
Asim Naveed, Syed S. Naqvi, Shahzaib Iqbal, Imran Razzak, Haroon Ahmed Khan, Tariq M. Khan

The precise segmentation of skin lesion in dermoscopic images is essential for the early detection of skin cancer. However, the irregular shapes of the lesions, the absence of sharp edges, the existence of artifacts like hair follicles, and marker color make this task difficult. Currently, fully connected networks (FCNs) and U-Nets are the most commonly used techniques for melanoma segmentation. However, as the depth of these neural network models increases, they become prone to various challenges. The most pertinent of these challenges are the vanishing gradient problem and the parameter redundancy problem. These can result in a decline in Jaccard index of the segmentation model. This study introduces a novel end-to-end trainable network designed for skin lesion segmentation. The proposed methodology consists of an encoder-decoder, a region-aware attention approach, and guided loss function. The trainable parameters are reduced using depth-wise separable convolution, and the attention features are refined using a guided loss, resulting in a high Jaccard index. We assessed the effectiveness of our proposed RA-Net on four frequently utilized benchmark datasets for skin lesion segmentation: ISIC 2016, ISIC 2017, ISIC 2018, and PH2. The empirical results validate that our method achieves state-of-the-art performance, as indicated by a notably high Jaccard index.

精确分割皮肤镜图像中的皮损对于早期检测皮肤癌至关重要。然而,由于皮损形状不规则、没有锐利边缘、存在毛囊等伪影以及标记颜色等原因,这项任务很难完成。目前,全连接网络(FCN)和 U-Nets 是最常用的黑色素瘤分割技术。然而,随着这些神经网络模型深度的增加,它们容易面临各种挑战。其中最相关的挑战是梯度消失问题和参数冗余问题。这些问题会导致分割模型的 Jaccard 指数下降。本研究介绍了一种新颖的端到端可训练网络,设计用于皮损分割。所提出的方法包括编码器-解码器、区域感知注意力方法和引导损失函数。使用深度可分离卷积减少了可训练参数,并使用引导损失对注意力特征进行了改进,从而获得了较高的 Jaccard 指数。我们在四个常用的皮损分割基准数据集上评估了所提出的 RA-Net 的有效性:ISIC 2016、ISIC 2017、ISIC 2018 和 PH2。实证结果验证了我们的方法达到了最先进的性能,这一点从明显较高的 Jaccard 指数可以看出。
{"title":"RA-Net: Region-Aware Attention Network for Skin Lesion Segmentation","authors":"Asim Naveed, Syed S. Naqvi, Shahzaib Iqbal, Imran Razzak, Haroon Ahmed Khan, Tariq M. Khan","doi":"10.1007/s12559-024-10304-1","DOIUrl":"https://doi.org/10.1007/s12559-024-10304-1","url":null,"abstract":"<p>The precise segmentation of skin lesion in dermoscopic images is essential for the early detection of skin cancer. However, the irregular shapes of the lesions, the absence of sharp edges, the existence of artifacts like hair follicles, and marker color make this task difficult. Currently, fully connected networks (FCNs) and U-Nets are the most commonly used techniques for melanoma segmentation. However, as the depth of these neural network models increases, they become prone to various challenges. The most pertinent of these challenges are the vanishing gradient problem and the parameter redundancy problem. These can result in a decline in Jaccard index of the segmentation model. This study introduces a novel end-to-end trainable network designed for skin lesion segmentation. The proposed methodology consists of an encoder-decoder, a region-aware attention approach, and guided loss function. The trainable parameters are reduced using depth-wise separable convolution, and the attention features are refined using a guided loss, resulting in a high Jaccard index. We assessed the effectiveness of our proposed RA-Net on four frequently utilized benchmark datasets for skin lesion segmentation: ISIC 2016, ISIC 2017, ISIC 2018, and PH2. The empirical results validate that our method achieves state-of-the-art performance, as indicated by a notably high Jaccard index.</p>","PeriodicalId":51243,"journal":{"name":"Cognitive Computation","volume":null,"pages":null},"PeriodicalIF":5.4,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141192042","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Smart Data Driven Decision Trees Ensemble Methodology for Imbalanced Big Data 针对不平衡大数据的智能数据驱动决策树集合方法学
IF 5.4 3区 计算机科学 Q1 Computer Science Pub Date : 2024-05-31 DOI: 10.1007/s12559-024-10295-z
Diego García-Gil, Salvador García, Ning Xiong, Francisco Herrera

Differences in data size per class, also known as imbalanced data distribution, have become a common problem affecting data quality. Big Data scenarios pose a new challenge to traditional imbalanced classification algorithms, since they are not prepared to work with such amount of data. Split data strategies and lack of data in the minority class due to the use of MapReduce paradigm have posed new challenges for tackling the imbalance between classes in Big Data scenarios. Ensembles have been shown to be able to successfully address imbalanced data problems. Smart Data refers to data of enough quality to achieve high-performance models. The combination of ensembles and Smart Data, achieved through Big Data preprocessing, should be a great synergy. In this paper, we propose a novel Smart Data driven Decision Trees Ensemble methodology for addressing the imbalanced classification problem in Big Data domains, namely SD_DeTE methodology. This methodology is based on the learning of different decision trees using distributed quality data for the ensemble process. This quality data is achieved by fusing random discretization, principal components analysis, and clustering-based random oversampling for obtaining different Smart Data versions of the original data. Experiments carried out in 21 binary adapted datasets have shown that our methodology outperforms random forest.

每类数据大小的差异(也称为不平衡数据分布)已成为影响数据质量的常见问题。大数据场景对传统的不平衡分类算法提出了新的挑战,因为它们还没有准备好处理如此大的数据量。分割数据策略和 MapReduce 范式的使用导致少数类别数据的缺乏,为解决大数据场景中类别间的不平衡问题提出了新的挑战。事实证明,集合能够成功解决不平衡数据问题。智能数据指的是数据质量足以实现高性能模型。通过大数据预处理实现的数据集与智能数据的结合应能产生巨大的协同效应。本文提出了一种新颖的智能数据驱动决策树集合方法,即 SD_DeTE 方法,用于解决大数据领域的不平衡分类问题。该方法基于在集合过程中使用分布式高质量数据来学习不同的决策树。这种高质量数据是通过融合随机离散化、主成分分析和基于聚类的随机超采样来获得原始数据的不同智能数据版本。在 21 个二元适配数据集上进行的实验表明,我们的方法优于随机森林。
{"title":"Smart Data Driven Decision Trees Ensemble Methodology for Imbalanced Big Data","authors":"Diego García-Gil, Salvador García, Ning Xiong, Francisco Herrera","doi":"10.1007/s12559-024-10295-z","DOIUrl":"https://doi.org/10.1007/s12559-024-10295-z","url":null,"abstract":"<p>Differences in data size per class, also known as imbalanced data distribution, have become a common problem affecting data quality. Big Data scenarios pose a new challenge to traditional imbalanced classification algorithms, since they are not prepared to work with such amount of data. Split data strategies and lack of data in the minority class due to the use of MapReduce paradigm have posed new challenges for tackling the imbalance between classes in Big Data scenarios. Ensembles have been shown to be able to successfully address imbalanced data problems. Smart Data refers to data of enough quality to achieve high-performance models. The combination of ensembles and Smart Data, achieved through Big Data preprocessing, should be a great synergy. In this paper, we propose a novel Smart Data driven Decision Trees Ensemble methodology for addressing the imbalanced classification problem in Big Data domains, namely SD_DeTE methodology. This methodology is based on the learning of different decision trees using distributed quality data for the ensemble process. This quality data is achieved by fusing random discretization, principal components analysis, and clustering-based random oversampling for obtaining different Smart Data versions of the original data. Experiments carried out in 21 binary adapted datasets have shown that our methodology outperforms random forest.</p>","PeriodicalId":51243,"journal":{"name":"Cognitive Computation","volume":null,"pages":null},"PeriodicalIF":5.4,"publicationDate":"2024-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141192114","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Efficient Deep Learning Approach for Diagnosis of Attention-Deficit/Hyperactivity Disorder in Children Based on EEG Signals 基于脑电信号诊断儿童注意力缺陷/多动症的高效深度学习方法
IF 5.4 3区 计算机科学 Q1 Computer Science Pub Date : 2024-05-31 DOI: 10.1007/s12559-024-10302-3
Hamid Jahani, Ali Asghar Safaei

Attention-deficit/hyperactivity disorder (ADHD) is a behavioral disorder in children that can persist into adulthood if not treated. Early diagnosis of this condition is crucial for effective treatment. The database includes 61 children with attention-deficit/hyperactivity disorder and 60 healthy children as a control group. To diagnose children with ADHD, features were first extracted from EEG signals. Next, a convolutional neural network model was trained, and a new residual network was introduced. The two proposed models were evaluated using tenfold cross-validation on the test data. The average accuracy and F1 score were 92.52% and 93.6%, respectively, for the convolutional model and 96.8% and 97.1% for the ResNet model on the epoch data, respectively. On the other hand, accuracy for subject-based prediction was 96.5% for the convolution model and 98.6% for the modified ResNet model. Accuracy, precision, recall, and F1 score for the proposed ResNet model are better than the convolution model proposed in previous studies and better than the proposed model in the literature. This work presents a paradigm shift in the cognitive-inspired domain by introducing a novel ResNet model for ADHD diagnosis. The model’s exceptional accuracy, exceeding conventional methods, showcases its potential as a biologically inspired tool. This opens avenues for exploring the neurological underpinnings of ADHD because the model can be used for the manifold learning of EEG signals. Analyzing the proposed network can lead to a deeper understanding of EEG, bridging the gap between artificial intelligence and cognitive neuroscience. The paper’s innovative approach has far-reaching implications, offering a concrete application of cognitive principles to improve mental health diagnostics in children. It is important to note that the data were augmented and the classification model is based on a single experiment containing a very small number of children but the results, and accuracy of classification, are based on classifying augmented data samples that compose the EEG signals of this small number of individuals. It is prudent to undertake a comprehensive investigation into the efficacy of these models across a broad cohort of subjects.

注意力缺陷/多动症(ADHD)是一种儿童行为障碍,如果不加以治疗,可能会持续到成年。早期诊断这种疾病对有效治疗至关重要。该数据库包括 61 名患有注意力缺陷/多动症的儿童和 60 名健康儿童作为对照组。为了诊断多动症儿童,首先从脑电图信号中提取特征。接着,训练了一个卷积神经网络模型,并引入了一个新的残差网络。通过对测试数据进行十倍交叉验证,对提出的两个模型进行了评估。在历时数据上,卷积模型的平均准确率和 F1 分数分别为 92.52% 和 93.6%,ResNet 模型的平均准确率和 F1 分数分别为 96.8% 和 97.1%。另一方面,基于主题的预测准确率,卷积模型为 96.5%,修改后的 ResNet 模型为 98.6%。所提出的 ResNet 模型的准确度、精确度、召回率和 F1 分数均优于之前研究中提出的卷积模型,也优于文献中提出的模型。这项研究通过引入用于多动症诊断的新型 ResNet 模型,实现了认知启发领域的范式转变。该模型的准确性超过了传统方法,展示了其作为生物启发工具的潜力。由于该模型可用于脑电信号的流形学习,这为探索多动症的神经基础开辟了道路。分析所提出的网络可以加深对脑电图的理解,弥补人工智能和认知神经科学之间的差距。本文的创新方法具有深远的意义,它提供了认知原理的具体应用,以改善儿童的心理健康诊断。值得注意的是,数据是经过扩增的,分类模型也是基于包含极少数儿童的单一实验,但结果和分类的准确性是基于对组成这一小部分人的脑电信号的扩增数据样本进行分类得出的。为了谨慎起见,应该对这些模型在大量受试者中的有效性进行全面调查。
{"title":"Efficient Deep Learning Approach for Diagnosis of Attention-Deficit/Hyperactivity Disorder in Children Based on EEG Signals","authors":"Hamid Jahani, Ali Asghar Safaei","doi":"10.1007/s12559-024-10302-3","DOIUrl":"https://doi.org/10.1007/s12559-024-10302-3","url":null,"abstract":"<p>Attention-deficit/hyperactivity disorder (ADHD) is a behavioral disorder in children that can persist into adulthood if not treated. Early diagnosis of this condition is crucial for effective treatment. The database includes 61 children with attention-deficit/hyperactivity disorder and 60 healthy children as a control group. To diagnose children with ADHD, features were first extracted from EEG signals. Next, a convolutional neural network model was trained, and a new residual network was introduced. The two proposed models were evaluated using tenfold cross-validation on the test data. The average accuracy and F1 score were 92.52% and 93.6%, respectively, for the convolutional model and 96.8% and 97.1% for the ResNet model on the epoch data, respectively. On the other hand, accuracy for subject-based prediction was 96.5% for the convolution model and 98.6% for the modified ResNet model. Accuracy, precision, recall, and F1 score for the proposed ResNet model are better than the convolution model proposed in previous studies and better than the proposed model in the literature. This work presents a paradigm shift in the cognitive-inspired domain by introducing a novel ResNet model for ADHD diagnosis. The model’s exceptional accuracy, exceeding conventional methods, showcases its potential as a biologically inspired tool. This opens avenues for exploring the neurological underpinnings of ADHD because the model can be used for the manifold learning of EEG signals. Analyzing the proposed network can lead to a deeper understanding of EEG, bridging the gap between artificial intelligence and cognitive neuroscience. The paper’s innovative approach has far-reaching implications, offering a concrete application of cognitive principles to improve mental health diagnostics in children. It is important to note that the data were augmented and the classification model is based on a single experiment containing a very small number of children but the results, and accuracy of classification, are based on classifying augmented data samples that compose the EEG signals of this small number of individuals. It is prudent to undertake a comprehensive investigation into the efficacy of these models across a broad cohort of subjects.\u0000</p>","PeriodicalId":51243,"journal":{"name":"Cognitive Computation","volume":null,"pages":null},"PeriodicalIF":5.4,"publicationDate":"2024-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141191954","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluating Explainable Machine Learning Models for Clinicians 为临床医生评估可解释的机器学习模型
IF 5.4 3区 计算机科学 Q1 Computer Science Pub Date : 2024-05-31 DOI: 10.1007/s12559-024-10297-x
Noemi Scarpato, Aria Nourbakhsh, Patrizia Ferroni, Silvia Riondino, Mario Roselli, Francesca Fallucchi, Piero Barbanti, Fiorella Guadagni, Fabio Massimo Zanzotto

Gaining clinicians’ trust will unleash the full potential of artificial intelligence (AI) in medicine, and explaining AI decisions is seen as the way to build trustworthy systems. However, explainable artificial intelligence (XAI) methods in medicine often lack a proper evaluation. In this paper, we present our evaluation methodology for XAI methods using forward simulatability. We define the Forward Simulatability Score (FSS) and analyze its limitations in the context of clinical predictors. Then, we applied FSS to our XAI approach defined over an ML-RO, a machine learning clinical predictor based on random optimization over a multiple kernel support vector machine (SVM) algorithm. To Compare FSS values before and after the explanation phase, we test our evaluation methodology for XAI methods on three clinical datasets, namely breast cancer, VTE, and migraine. The ML-RO system is a good model on which to test our XAI evaluation strategy based on the FSS. Indeed, ML-RO outperforms two other base models—a decision tree (DT) and a plain SVM—in the three datasets and gives the possibility of defining different XAI models: TOPK, MIGF, and F4G. The FSS evaluation score suggests that the explanation method F4G for the ML-RO is the most effective in two datasets out of the three tested, and it shows the limits of the learned model for one dataset. Our study aims to introduce a standard practice for evaluating XAI methods in medicine. By establishing a rigorous evaluation framework, we seek to provide healthcare professionals with reliable tools for assessing the performance of XAI methods to enhance the adoption of AI systems in clinical practice.

赢得临床医生的信任将充分释放人工智能(AI)在医疗领域的潜力,而解释人工智能的决策被视为建立可信系统的途径。然而,医学中的可解释人工智能(XAI)方法往往缺乏适当的评估。在本文中,我们介绍了利用前向可模拟性对 XAI 方法进行评估的方法。我们定义了前向可模拟性评分(FSS),并分析了其在临床预测方面的局限性。然后,我们将 FSS 应用于在 ML-RO 上定义的 XAI 方法,ML-RO 是一种基于多核支持向量机 (SVM) 算法随机优化的机器学习临床预测器。为了比较解释阶段前后的 FSS 值,我们在三个临床数据集(即乳腺癌、VTE 和偏头痛)上测试了 XAI 方法的评估方法。ML-RO 系统是测试我们基于 FSS 的 XAI 评估策略的良好模型。事实上,ML-RO 在三个数据集上的表现优于其他两个基础模型--决策树(DT)和普通 SVM,并为定义不同的 XAI 模型提供了可能性:TOPK、MIGF 和 F4G。FSS 评估得分表明,ML-RO 的解释方法 F4G 在三个测试数据集中的两个数据集中最为有效,同时也显示了所学模型在一个数据集中的局限性。我们的研究旨在为医学领域的 XAI 方法评估引入标准实践。通过建立一个严格的评估框架,我们试图为医疗保健专业人员提供可靠的工具来评估 XAI 方法的性能,从而促进人工智能系统在临床实践中的应用。
{"title":"Evaluating Explainable Machine Learning Models for Clinicians","authors":"Noemi Scarpato, Aria Nourbakhsh, Patrizia Ferroni, Silvia Riondino, Mario Roselli, Francesca Fallucchi, Piero Barbanti, Fiorella Guadagni, Fabio Massimo Zanzotto","doi":"10.1007/s12559-024-10297-x","DOIUrl":"https://doi.org/10.1007/s12559-024-10297-x","url":null,"abstract":"<p>Gaining clinicians’ trust will unleash the full potential of artificial intelligence (AI) in medicine, and explaining AI decisions is seen as the way to build trustworthy systems. However, explainable artificial intelligence (XAI) methods in medicine often lack a proper evaluation. In this paper, we present our evaluation methodology for XAI methods using forward simulatability. We define the Forward Simulatability Score (FSS) and analyze its limitations in the context of clinical predictors. Then, we applied FSS to our XAI approach defined over an ML-RO, a machine learning clinical predictor based on random optimization over a multiple kernel support vector machine (SVM) algorithm. To Compare FSS values before and after the explanation phase, we test our evaluation methodology for XAI methods on three clinical datasets, namely breast cancer, VTE, and migraine. The ML-RO system is a good model on which to test our XAI evaluation strategy based on the FSS. Indeed, ML-RO outperforms two other base models—a decision tree (DT) and a plain SVM—in the three datasets and gives the possibility of defining different XAI models: TOPK, MIGF, and F4G. The FSS evaluation score suggests that the explanation method F4G for the ML-RO is the most effective in two datasets out of the three tested, and it shows the limits of the learned model for one dataset. Our study aims to introduce a standard practice for evaluating XAI methods in medicine. By establishing a rigorous evaluation framework, we seek to provide healthcare professionals with reliable tools for assessing the performance of XAI methods to enhance the adoption of AI systems in clinical practice.</p>","PeriodicalId":51243,"journal":{"name":"Cognitive Computation","volume":null,"pages":null},"PeriodicalIF":5.4,"publicationDate":"2024-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141191947","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Counterfactual Explanations in the Big Picture: An Approach for Process Prediction-Driven Job-Shop Scheduling Optimization 大局中的反事实解释:流程预测驱动的作业车间调度优化方法
IF 5.4 3区 计算机科学 Q1 Computer Science Pub Date : 2024-05-30 DOI: 10.1007/s12559-024-10294-0
Nijat Mehdiyev, Maxim Majlatow, Peter Fettke

In this study, we propose a pioneering framework for generating multi-objective counterfactual explanations in job-shop scheduling contexts, combining predictive process monitoring with advanced mathematical optimization techniques. Using the Non-dominated Sorting Genetic Algorithm II (NSGA-II) for multi-objective optimization, our approach enhances the generation of counterfactual explanations that illuminate potential enhancements at both the operational and systemic levels. Validated with real-world data, our methodology underscores the superiority of NSGA-II in crafting pertinent and actionable counterfactual explanations, surpassing traditional methods in both efficiency and practical relevance. This work advances the domains of explainable artificial intelligence (XAI), predictive process monitoring, and combinatorial optimization, providing an effective tool for improving automated scheduling systems’ clarity, and decision-making capabilities.

在本研究中,我们提出了一个开创性的框架,将预测性流程监控与先进的数学优化技术相结合,用于生成作业车间调度背景下的多目标反事实解释。利用非支配排序遗传算法 II(NSGA-II)进行多目标优化,我们的方法增强了反事实解释的生成,从而揭示了操作和系统层面的潜在改进。经过真实世界数据的验证,我们的方法强调了 NSGA-II 在制作中肯、可操作的反事实解释方面的优越性,在效率和实用性方面都超越了传统方法。这项工作推动了可解释人工智能(XAI)、预测过程监控和组合优化领域的发展,为提高自动调度系统的清晰度和决策能力提供了有效工具。
{"title":"Counterfactual Explanations in the Big Picture: An Approach for Process Prediction-Driven Job-Shop Scheduling Optimization","authors":"Nijat Mehdiyev, Maxim Majlatow, Peter Fettke","doi":"10.1007/s12559-024-10294-0","DOIUrl":"https://doi.org/10.1007/s12559-024-10294-0","url":null,"abstract":"<p>In this study, we propose a pioneering framework for generating multi-objective counterfactual explanations in job-shop scheduling contexts, combining predictive process monitoring with advanced mathematical optimization techniques. Using the Non-dominated Sorting Genetic Algorithm II (NSGA-II) for multi-objective optimization, our approach enhances the generation of counterfactual explanations that illuminate potential enhancements at both the operational and systemic levels. Validated with real-world data, our methodology underscores the superiority of NSGA-II in crafting pertinent and actionable counterfactual explanations, surpassing traditional methods in both efficiency and practical relevance. This work advances the domains of explainable artificial intelligence (XAI), predictive process monitoring, and combinatorial optimization, providing an effective tool for improving automated scheduling systems’ clarity, and decision-making capabilities.</p>","PeriodicalId":51243,"journal":{"name":"Cognitive Computation","volume":null,"pages":null},"PeriodicalIF":5.4,"publicationDate":"2024-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141192110","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detection of Cardiovascular Diseases Using Data Mining Approaches: Application of an Ensemble-Based Model 利用数据挖掘方法检测心血管疾病:基于集合的模型的应用
IF 5.4 3区 计算机科学 Q1 Computer Science Pub Date : 2024-05-30 DOI: 10.1007/s12559-024-10306-z
Mojdeh Nazari, Hassan Emami, Reza Rabiei, Azamossadat Hosseini, Shahabedin Rahmatizadeh

Cardiovascular diseases are the leading contributor of mortality worldwide. Accurate cardiovascular disease prediction is crucial, and the application of machine learning and data mining techniques could facilitate decision-making and improve predictive capabilities. This study aimed to present a model for accurate prediction of cardiovascular diseases and identifying key contributing factors with the greatest impact. The Cleveland dataset besides the locally collected dataset, called the Noor dataset, was used in this study. Accordingly, various data mining techniques besides four ensemble learning-based models were implemented on both datasets. Moreover, a novel model for combining individual classifiers in ensemble learning, wherein weights were assigned to each classifier (using a genetic algorithm), was developed. The predictive strength of each feature was also investigated to ensure the generalizability of the outcomes. The ultimate ensemble-based model achieved a precision rate of 88.05% and 90.12% on the Cleveland and Noor datasets, respectively, demonstrating its reliability and suitability for future research in predicting the likelihood of cardiovascular diseases. Not only the proposed model introduces an innovative approach for specifying cardiovascular diseases by unraveling the intricate relationships between various biological variables but also facilitates early detection of cardiovascular diseases.

心血管疾病是导致全球死亡的主要因素。准确预测心血管疾病至关重要,而应用机器学习和数据挖掘技术可以促进决策并提高预测能力。本研究旨在提出一个模型,用于准确预测心血管疾病,并确定影响最大的关键诱因。除本地收集的数据集(称为 Noor 数据集)外,本研究还使用了克利夫兰数据集。因此,除了四个基于集合学习的模型外,还在这两个数据集上实施了各种数据挖掘技术。此外,还开发了一种在集合学习中组合单个分类器的新模型,其中为每个分类器分配了权重(使用遗传算法)。此外,还对每个特征的预测强度进行了研究,以确保结果的通用性。最终基于集合的模型在克利夫兰和努尔数据集上的精确率分别达到了 88.05% 和 90.12%,证明了其可靠性以及在未来预测心血管疾病可能性研究中的适用性。所提出的模型不仅通过揭示各种生物变量之间错综复杂的关系,为心血管疾病的诊断引入了一种创新方法,而且有助于心血管疾病的早期检测。
{"title":"Detection of Cardiovascular Diseases Using Data Mining Approaches: Application of an Ensemble-Based Model","authors":"Mojdeh Nazari, Hassan Emami, Reza Rabiei, Azamossadat Hosseini, Shahabedin Rahmatizadeh","doi":"10.1007/s12559-024-10306-z","DOIUrl":"https://doi.org/10.1007/s12559-024-10306-z","url":null,"abstract":"<p>Cardiovascular diseases are the leading contributor of mortality worldwide. Accurate cardiovascular disease prediction is crucial, and the application of machine learning and data mining techniques could facilitate decision-making and improve predictive capabilities. This study aimed to present a model for accurate prediction of cardiovascular diseases and identifying key contributing factors with the greatest impact. The Cleveland dataset besides the locally collected dataset, called the Noor dataset, was used in this study. Accordingly, various data mining techniques besides four ensemble learning-based models were implemented on both datasets. Moreover, a novel model for combining individual classifiers in ensemble learning, wherein weights were assigned to each classifier (using a genetic algorithm), was developed. The predictive strength of each feature was also investigated to ensure the generalizability of the outcomes. The ultimate ensemble-based model achieved a precision rate of 88.05% and 90.12% on the Cleveland and Noor datasets, respectively, demonstrating its reliability and suitability for future research in predicting the likelihood of cardiovascular diseases. Not only the proposed model introduces an innovative approach for specifying cardiovascular diseases by unraveling the intricate relationships between various biological variables but also facilitates early detection of cardiovascular diseases.</p>","PeriodicalId":51243,"journal":{"name":"Cognitive Computation","volume":null,"pages":null},"PeriodicalIF":5.4,"publicationDate":"2024-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141191950","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Generative Adversarial Network-Assisted Framework for Power Management 生成式对抗网络辅助电源管理框架
IF 5.4 3区 计算机科学 Q1 Computer Science Pub Date : 2024-05-27 DOI: 10.1007/s12559-024-10284-2
Noman Khan, Samee Ullah Khan, Ahmed Farouk, Sung Wook Baik

The rise in power consumption (PC) is caused by several factors such as the growing global population, urbanization, technological advances, economic development, and growth of businesses and commercial sectors. In these days, intermittent renewable energy sources (RESs) are widely utilized in electric grids to meet the need for power. Data-driven techniques are essential to assuring the steady operation of the electric grid and accurate power consumption and generation forecasting. Conversely, the available datasets for time series electric power forecasting in the energy industry are not as large as those for other domains such as in computer vision. Thus, a deep learning (DL) framework for predicting PC in residential and commercial buildings as well as the power generation (PG) from RESs is introduced. The raw power data obtained from buildings and RESs-based power plants are conceded by the purging process where absent values are filled in and noise and outliers are eliminated. Next, the proposed generative adversarial network (GAN) uses a portion of the cleaned data to generate synthetic parallel data, which is combined with the actual data to make a hybrid dataset. Subsequently, the stacked gated recurrent unit (GRU) model, which is optimized for power forecasting, is trained using the hybrid dataset. Six existent power data are used to train and test sixteen linear and nonlinear models for energy forecasting. The best-performing network is selected as the proposed method for forecasting tasks. For Korea Yeongam solar power (KYSP), individual household electric power consumption (IHEPC), and advanced institute of convergence technology (AICT) datasets, the proposed model obtains mean absolute error (MAE) values of 0.0716, 0.0819, and 0.0877, respectively. Similarly, its MAE values are 0.1215, 0.5093, and 0.5751, for Australia Alice Springs solar power (AASSP), Korea south east wind power (KSEWP), and, Korea south east solar power (KSESP) datasets, respectively.

全球人口增长、城市化、技术进步、经济发展以及企业和商业部门的增长等因素导致了电力消耗(PC)的增加。如今,间歇性可再生能源(RES)被广泛应用于电网,以满足电力需求。数据驱动技术对于确保电网稳定运行、准确预测用电量和发电量至关重要。相反,能源行业中用于时间序列电力预测的可用数据集不如计算机视觉等其他领域的数据集大。因此,我们引入了一个深度学习(DL)框架,用于预测住宅和商业建筑的 PC 以及可再生能源的发电量(PG)。从建筑物和基于可再生能源的发电厂获得的原始电力数据会经过净化过程,其中缺失的值会被填补,噪声和异常值会被消除。接下来,建议的生成式对抗网络(GAN)使用部分净化数据生成合成并行数据,并将其与实际数据相结合,形成混合数据集。随后,使用混合数据集训练针对电力预测进行了优化的叠加门控递归单元(GRU)模型。六个现有电力数据用于训练和测试十六个线性和非线性模型,以进行电能预测。选择表现最好的网络作为预测任务的建议方法。对于韩国永岩太阳能发电(KYSP)、个人家庭电力消耗(IHEPC)和高级融合技术研究所(AICT)数据集,建议模型获得的平均绝对误差(MAE)值分别为 0.0716、0.0819 和 0.0877。同样,澳大利亚爱丽斯泉太阳能发电数据集(AASSP)、韩国东南部风力发电数据集(KSEWP)和韩国东南部太阳能发电数据集(KSESP)的平均绝对误差(MAE)值分别为 0.1215、0.5093 和 0.5751。
{"title":"Generative Adversarial Network-Assisted Framework for Power Management","authors":"Noman Khan, Samee Ullah Khan, Ahmed Farouk, Sung Wook Baik","doi":"10.1007/s12559-024-10284-2","DOIUrl":"https://doi.org/10.1007/s12559-024-10284-2","url":null,"abstract":"<p>The rise in power consumption (PC) is caused by several factors such as the growing global population, urbanization, technological advances, economic development, and growth of businesses and commercial sectors. In these days, intermittent renewable energy sources (RESs) are widely utilized in electric grids to meet the need for power. Data-driven techniques are essential to assuring the steady operation of the electric grid and accurate power consumption and generation forecasting. Conversely, the available datasets for time series electric power forecasting in the energy industry are not as large as those for other domains such as in computer vision. Thus, a deep learning (DL) framework for predicting PC in residential and commercial buildings as well as the power generation (PG) from RESs is introduced. The raw power data obtained from buildings and RESs-based power plants are conceded by the purging process where absent values are filled in and noise and outliers are eliminated. Next, the proposed generative adversarial network (GAN) uses a portion of the cleaned data to generate synthetic parallel data, which is combined with the actual data to make a hybrid dataset. Subsequently, the stacked gated recurrent unit (GRU) model, which is optimized for power forecasting, is trained using the hybrid dataset. Six existent power data are used to train and test sixteen linear and nonlinear models for energy forecasting. The best-performing network is selected as the proposed method for forecasting tasks. For Korea Yeongam solar power (KYSP), individual household electric power consumption (IHEPC), and advanced institute of convergence technology (AICT) datasets, the proposed model obtains mean absolute error (MAE) values of 0.0716, 0.0819, and 0.0877, respectively. Similarly, its MAE values are 0.1215, 0.5093, and 0.5751, for Australia Alice Springs solar power (AASSP), Korea south east wind power (KSEWP), and, Korea south east solar power (KSESP) datasets, respectively.</p>","PeriodicalId":51243,"journal":{"name":"Cognitive Computation","volume":null,"pages":null},"PeriodicalIF":5.4,"publicationDate":"2024-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141173345","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Cognitive Computation
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1