Pub Date : 2024-07-06 DOI: 10.1007/s12559-024-10319-8
Mengke Liu, Wenbing Zhang, Guanglei Wu
This article examines the prescribed-time sampled-data control problem for multi-agent systems on signed networks. A time-varying high-gain protocol is devised to solve the prescribed-time bipartite consensus problem for linear multi-agent systems, with the control gain matrix obtained from the parametric Lyapunov equation. Using a scalarization method, sufficient conditions are derived that ensure prescribed-time bipartite consensus of linear multi-agent systems, where the maximum allowable sampling interval (MASI) guaranteeing prescribed-time consensus is determined by the initial system state, the system's linear dynamics, and the maximum eigenvalue of the Laplacian matrix. Specifically, the MASI is inversely proportional to the maximum eigenvalue of the Laplacian matrix. Finally, the validity of the conclusions is verified through numerical simulation.
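The sampled-data bipartite-consensus mechanism can be illustrated with a minimal sketch (not the paper's time-varying high-gain protocol): single-integrator agents on a structurally balanced signed graph under a zero-order-hold update x⁺ = x − h·Ls·x with a constant gain. The graph, initial states, and sampling interval h below are illustrative; the stability requirement h < 2/λmax(Ls) echoes the abstract's point that the MASI shrinks as the largest Laplacian eigenvalue grows.

```python
import numpy as np

# Signed adjacency: agents {0,1} cooperate, {2,3} cooperate, the two camps compete.
A = np.array([[ 0,  1,  0, -1],
              [ 1,  0, -1,  0],
              [ 0, -1,  0,  1],
              [-1,  0,  1,  0]], dtype=float)
L = np.diag(np.abs(A).sum(axis=1)) - A   # signed Laplacian

h = 0.1                                  # sampling interval; needs h < 2 / lambda_max(L)
x = np.array([1.0, 2.0, -0.5, 3.0])      # initial states
for _ in range(500):                     # zero-order-hold sampled-data update
    x = x - h * (L @ x)

print(x)  # approx [0.125, 0.125, -0.125, -0.125]: equal magnitudes, opposite signs
```

The two camps converge to values of identical magnitude and opposite sign (bipartite consensus); choosing h above 2/λmax(L) = 0.5 here would instead make the iteration diverge.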
Title: Prescribed-Time Sampled-Data Control for the Bipartite Consensus of Linear Multi-Agent Systems in Signed Networks (Cognitive Computation)
Pub Date : 2024-07-06 DOI: 10.1007/s12559-024-10307-y
Huiru Wang, Xiaoqing Hong, Siyuan Zhang
To obtain a more robust and sparse classifier, this paper proposes a novel classifier termed the twin bounded support vector machine with capped pinball loss (CPin-TBSVM), which is insensitive to both feature and label noise. Since the proposed model is non-convex, we use the concave-convex procedure (CCCP) to solve a series of two smaller-sized quadratic programming problems to find the optimal solution. When solving each iterative subproblem, the dual coordinate descent method (DCDM) is used to speed up the optimization. Moreover, we analyze the model's theoretical properties, including that the capped pinball loss satisfies Bayes' rule and that CPin-TBSVM enjoys a degree of noise insensitivity and sparsity. These properties are also verified on an artificial dataset. Numerical experiments are conducted on 24 UCI datasets, and the results are compared with four other models: SVM, TSVM, Pin-GTSVM and TPin-TSVM. The results show that the proposed CPin-TBSVM achieves better classification performance and noise insensitivity.
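The loss at the heart of the model can be sketched in one line. The form below (pinball loss truncated at a cap) is one common formulation in the capped-pinball literature; the parameter values τ and cap are illustrative, not the paper's settings.

```python
import numpy as np

def capped_pinball(u, tau=0.5, cap=1.5):
    """Capped pinball loss: the pinball loss max(u, -tau*u), truncated at
    `cap` so that large outliers contribute only a bounded penalty -- the
    source of the model's noise insensitivity.  tau and cap are illustrative."""
    return np.minimum(np.maximum(u, -tau * u), cap)

print(capped_pinball(np.array([-2.0, 0.5, 2.0])))  # [1.  0.5 1.5]
```

Positive margins are penalized linearly until the cap kicks in; negative margins are penalized at the gentler slope τ, and the cap bounds both tails.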
Title: Twin Bounded Support Vector Machine with Capped Pinball Loss (Cognitive Computation)
Pub Date : 2024-07-05 DOI: 10.1007/s12559-024-10313-0
Jihene Tmamna, Emna Ben Ayed, Rahma Fourati, Mandar Gogate, Tughrul Arslan, Amir Hussain, Mounir Ben Ayed
Over the past few years, larger and deeper neural network models, particularly convolutional neural networks (CNNs), have consistently advanced state-of-the-art performance across various disciplines. Yet, the computational demands of these models have escalated exponentially. Intensive computation not only hinders research inclusiveness and deployment on resource-constrained devices, such as Edge Internet of Things (IoT) devices, but also results in a substantial carbon footprint. Green deep learning has emerged as a research field that emphasizes energy consumption and carbon emissions during model training and inference, aiming to innovate with light and energy-efficient neural networks. Various techniques are available to achieve this goal. Studies show that conventional deep models often contain redundant parameters that do not alter outcomes significantly, underpinning the theoretical basis for model pruning. This timely review paper therefore seeks to systematically summarize recent breakthroughs in CNN pruning methods, offering the necessary background knowledge for researchers in this interdisciplinary domain. We then spotlight the challenges of current model pruning methods to inform future avenues of research. Additionally, the survey highlights the pressing need for innovative metrics that effectively balance diverse pruning objectives. Lastly, it investigates pruning techniques oriented towards sophisticated deep learning models, including hybrid feedforward CNNs and long short-term memory (LSTM) recurrent neural networks, a field ripe for exploration within green deep learning research.
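As a concrete instance of the idea the survey builds on (redundant parameters can be removed with little effect on outputs), here is a minimal sketch of L1-magnitude filter selection, one of the classic structured-pruning criteria; the layer shape and keep ratio are illustrative.

```python
import numpy as np

def select_filters(conv_weight, keep_ratio):
    """Rank convolutional filters by L1 norm and keep the strongest ones.
    `conv_weight` follows the usual (out_channels, in_channels, kH, kW)
    layout; filters with small norms are treated as redundant and dropped."""
    norms = np.abs(conv_weight).sum(axis=(1, 2, 3))     # one L1 norm per filter
    k = max(1, int(round(keep_ratio * conv_weight.shape[0])))
    keep = np.sort(np.argsort(norms)[::-1][:k])         # indices of surviving filters
    return keep, conv_weight[keep]

w = np.zeros((4, 2, 3, 3))
w[0] += 1.0; w[1] += 0.1; w[2] += 2.0; w[3] += 0.5      # filter L1 norms: 18, 1.8, 36, 9
keep, pruned = select_filters(w, keep_ratio=0.5)
print(keep, pruned.shape)  # [0 2] (2, 2, 3, 3)
```

In a real pipeline the corresponding input channels of the next layer are pruned to match, and the network is fine-tuned to recover accuracy.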
Title: Pruning Deep Neural Networks for Green Energy-Efficient Models: A Survey (Cognitive Computation)
Pub Date : 2024-07-02 DOI: 10.1007/s12559-024-10318-9
Rana Alabdan, Jamal Alsamri, Siwar Ben Haj Hassine, Faiz Abdullah Alotaibi, Saud S. Alotaibi, Ayman Yafoz, Mrim M. Alnfiai, Mesfer Al Duhayyim
The emergence of deep learning (DL) has improved the quality of generated media. However, with increasing photorealism, synthetic media has become very similar to real media, raising serious concerns about fake or manipulated data spreading over the Internet. In this situation, it is important to develop automatic tools that reliably identify synthetic media at an early stage. Generative adversarial network (GAN)-based models can create realistic faces that raise profound social and security issues. Existing techniques for identifying GAN-generated faces perform well on restricted public datasets. However, images in existing datasets do not sufficiently represent real-world situations in terms of view variants and data distributions, where real faces greatly outnumber artificial ones. Therefore, this study develops an optimal DL-based GAN-generated face detection and classification (ODL-GANFDC) technique. The ODL-GANFDC technique aims to examine input images and recognize whether they were generated by a GAN. To accomplish this, the ODL-GANFDC technique starts with a CLAHE-based contrast enhancement step. A deep residual network (DRN) model is then employed to learn complex and intrinsic patterns from the preprocessed images. The hyperparameters of the DRN model are optimally chosen using an improved sand cat swarm optimization (ISCSO) algorithm. Finally, GAN-generated faces are detected using a variational autoencoder (VAE). An extensive set of experiments was carried out to evaluate the performance of the ODL-GANFDC technique. The experimental outcomes show promising results for the ODL-GANFDC technique compared with other approaches on the GAN-generated face detection task.
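The pipeline's first stage, contrast enhancement, can be sketched with plain global histogram equalization — a simplified stand-in for CLAHE, which applies the same idea per tile with histogram clipping to limit noise amplification. The test image below is an illustrative low-contrast gradient.

```python
import numpy as np

def hist_equalize(img):
    """Global histogram equalization for an 8-bit grayscale image: remap
    intensities through the normalized cumulative histogram so the output
    spreads over the full 0..255 range.  A simplified stand-in for CLAHE."""
    hist = np.bincount(img.ravel(), minlength=256)
    cdf = hist.cumsum()
    cdf_min = cdf[cdf > 0][0]                       # first non-empty bin
    lut = np.round((cdf - cdf_min) / (cdf[-1] - cdf_min) * 255.0)
    return lut.clip(0, 255).astype(np.uint8)[img]   # apply lookup table

low_contrast = np.tile(np.arange(64, 192, dtype=np.uint8), (10, 1))  # values 64..191
enhanced = hist_equalize(low_contrast)
print(enhanced.min(), enhanced.max())  # 0 255
```

CLAHE proper (e.g. as implemented in image libraries) divides the image into tiles, clips each tile's histogram at a threshold, and interpolates between tile mappings; the global version above captures only the remapping idea.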
Title: Unmasking GAN-Generated Faces with Optimal Deep Learning and Cognitive Computing-Based Cutting-Edge Detection System (Cognitive Computation)
In contemporary real-time applications, small devices increasingly occupy a greater portion of the spectrum to transmit data, even though the data itself is relatively small. The demand for big data in cloud storage networks is rising, as cognitive networks can enable intelligent decision-making with minimal spectrum utilization. The introduction of cognitive networks has provided a novel channel that enables the allocation of low-power resources while minimizing path loss. The proposed method integrates three algorithms to examine big data processing. When big data applications are examined, distance measurement, decision mechanisms, and learning from past data are all important; the algorithms are therefore chosen according to the requirements of big data and cloud storage networks. The effect of the integration process is further examined in three case studies considering low-resource, path-loss, and weight functions, where optimized outcomes are achieved in all defined case studies compared with the existing approach.
Title: Cognitive Intelligent Decisions for Big Data and Cloud Computing in Industrial Applications using Trifold Algorithms (Cognitive Computation)
Authors: Shitharth Selvarajan, Hariprasath Manoharan, Rakan A. Alsowail, Achyut Shankar, Saravanan Pandiaraj, Carsten Maple, Wattana Viriyasitavat
Pub Date : 2024-06-28 DOI: 10.1007/s12559-024-10317-w
In recent years, automatic disease diagnosis has gained immense popularity in research and industry communities. Humans learn a task through both successful and unsuccessful attempts in real life, and physicians are no different. When doctors fail to diagnose a disease correctly, they re-assess the extracted symptoms and re-diagnose the patient by inspecting a few more symptoms, guided by their previous experience and current context. Motivated by the experience gained from failure assessment, we propose a novel end-to-end automatic disease diagnosis dialogue system called the Failure Assessment incorporated Symptom Investigation and Disease Diagnosis (FA-SIDD) Assistant. The proposed FA-SIDD model includes a knowledge-guided, incorrect disease projection-aware failure assessment module that analyzes unsuccessful diagnosis attempts and reinforces the assessment for further investigation and re-diagnosis. We formulate a novel Markov decision process for the proposed failure assessment, incorporating symptom investigation and disease diagnosis frameworks, and optimize the policy using deep reinforcement learning. The proposed model has outperformed several baselines and the existing symptom investigation and diagnosis methods by a significant margin (1-3%) in all evaluation metrics (including human evaluation). The improvements over multiple datasets and across multiple algorithms firmly establish the efficacy of learning from unsuccessful diagnoses. This work is the first attempt to investigate the importance of learning from unsuccessful diagnoses. The developed assistant learns the diagnosis task more efficiently than traditional assistants and shows robust behavior. The code is available at https://github.com/AbhisekTiwari/FA-SIDA.
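The Markov decision process view of symptom inquiry can be sketched with a toy tabular Q-learning agent. The paper uses deep reinforcement learning with a far richer state space and a failure-assessment module; the two-symptom, two-disease world, rewards, and hyperparameters below are all illustrative assumptions.

```python
import random
from collections import defaultdict

# Toy world: disease 0 presents symptoms (1, 0); disease 1 presents (0, 1).
# State = symptoms observed so far (-1 = unknown).
# Actions 0/1 = ask about symptom 0/1; actions 2/3 = diagnose disease 0/1.
SYMPTOMS = [(1, 0), (0, 1)]

def run_episode(Q, disease, eps=0.2, alpha=0.5, gamma=0.9):
    state = (-1, -1)
    while True:
        if random.random() < eps:
            a = random.randrange(4)
        else:
            a = max(range(4), key=lambda b: Q[state, b])
        if a < 2:                                   # ask a question (small cost)
            nxt = list(state); nxt[a] = SYMPTOMS[disease][a]; nxt = tuple(nxt)
            target = -0.1 + gamma * max(Q[nxt, b] for b in range(4))
            Q[state, a] += alpha * (target - Q[state, a])
            state = nxt
        else:                                       # commit to a diagnosis
            r = 1.0 if a - 2 == disease else -1.0
            Q[state, a] += alpha * (r - Q[state, a])
            return

def greedy_diagnosis(Q, disease, max_steps=4):
    state = (-1, -1)
    for _ in range(max_steps):
        a = max(range(4), key=lambda b: Q[state, b])
        if a < 2:
            nxt = list(state); nxt[a] = SYMPTOMS[disease][a]; state = tuple(nxt)
        else:
            return a - 2
    return -1

random.seed(0)
Q = defaultdict(float)
for ep in range(3000):
    run_episode(Q, ep % 2)
print(greedy_diagnosis(Q, 0), greedy_diagnosis(Q, 1))  # greedy diagnoses for each disease
```

The per-question penalty makes the agent trade off information gathering against diagnostic confidence, the core tension the paper's MDP formulation captures at scale.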
Title: Learning from Failure: Towards Developing a Disease Diagnosis Assistant That Also Learns from Unsuccessful Diagnoses (Cognitive Computation)
Authors: Abhisek Tiwari, Swarna S, Sriparna Saha, Pushpak Bhattacharyya, Minakshi Dhar, Sarbajeet Tiwari
Pub Date : 2024-06-27 DOI: 10.1007/s12559-024-10274-4
With the widespread adoption of deep learning, there has been a notable increase in the prevalence of multimodal deepfake content. These deepfakes pose a substantial risk to both individuals' privacy and the security of their assets. In response to this pressing issue, researchers have made substantial efforts to utilize generative AI and cognitive computation to leverage multimodal data for deepfake detection. However, the efforts thus far have fallen short of fully exploiting the extensive reservoir of multimodal feature information, which leads to a deficiency in leveraging spatial information across multiple dimensions. In this study, we introduce a framework called Visual-Language Pretraining with Gate Fusion (VLP-GF), designed to identify multimodal deceptive content and enhance the accurate localization of manipulated regions within both images and textual annotations. Specifically, we introduce an adaptive fusion module tailored to integrate local and global information simultaneously. This module captures global context and local details concurrently, thereby improving the performance of image bounding-box grounding within the system. Additionally, to maximize the utilization of semantic information from diverse modalities, we incorporate a gating mechanism to strengthen the interaction of multimodal information further. Through a series of ablation experiments and comprehensive comparisons with state-of-the-art approaches on extensive benchmark datasets, we empirically demonstrate the superior efficacy of VLP-GF.
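The gating idea can be sketched as a per-dimension sigmoid gate over two modality embeddings: the gate decides, element by element, how much of the visual versus textual signal to pass through. The dimensionality, random parameters, and exact gating form below are illustrative, not VLP-GF's actual architecture.

```python
import numpy as np

rng = np.random.default_rng(0)

def gate_fuse(visual, text, W, b):
    """Gated fusion of two modality embeddings: a sigmoid gate computed from
    the concatenated inputs mixes the modalities per dimension.  W and b are
    the gate's learnable parameters (randomly initialized for this sketch)."""
    z = np.concatenate([visual, text])
    gate = 1.0 / (1.0 + np.exp(-(W @ z + b)))     # values in (0, 1)
    return gate * visual + (1.0 - gate) * text

d = 4
visual = rng.normal(size=d)
text = rng.normal(size=d)
W = rng.normal(scale=0.1, size=(d, 2 * d))
b = np.zeros(d)
fused = gate_fuse(visual, text, W, b)
print(fused.shape)  # (4,)
```

Because the gate lies in (0, 1), each fused coordinate is a convex combination of the two modalities; with zero gate parameters it reduces to a plain average.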
Title: Multi-Modal Generative DeepFake Detection via Visual-Language Pretraining with Gate Fusion for Cognitive Computation (Cognitive Computation)
Authors: Guisheng Zhang, Mingliang Gao, Qilei Li, Wenzhe Zhai, Gwanggil Jeon
Pub Date : 2024-06-25 DOI: 10.1007/s12559-024-10316-x
Pub Date : 2024-06-21 DOI: 10.1007/s12559-024-10314-z
Ziqin Zhao, Fan Lyu, Linyan Li, Fuyuan Hu, Minming Gu, Li Sun
Background
Federated Continual Learning (FCL) involves learning from distributed data on edge devices with incremental knowledge. However, current FCL methods struggle to retain long-term memories on the server.
Method
In this paper, we introduce a method called Fisher INformation Accumulation Learning (FINAL) to address catastrophic forgetting in FCL. First, we accumulate a global Fisher information matrix from federated Fisher matrices formed by clients, task by task, to remember long-term knowledge. Second, we present a novel multi-node collaborative integration strategy to assemble the federated Fisher matrix, which reveals the task-specific co-importance of parameters among clients. Finally, we propose a Fisher balancing method to combine the global and federated Fisher matrices, avoiding both the neglect of new learning and catastrophic forgetting.
Results
We conducted evaluations on four FCL datasets, and the findings demonstrate that the proposed FINAL effectively maintains long-term knowledge on the server.
Conclusions
The exceptional performance of this method indicates its significant value for future FCL research.
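The accumulation step described in the Method section can be sketched in a few lines: keep a running sum of per-task (diagonal) Fisher information and use it to weight a quadratic penalty that discourages moving parameters that mattered for past tasks, in the spirit of elastic weight consolidation. The diagonal approximation, the balancing coefficient, and the omission of the federated assembly and Fisher-balancing steps are all simplifications for illustration.

```python
import numpy as np

class FisherAccumulator:
    """Running sum of diagonal Fisher information across tasks; parameters
    important for past tasks incur a large penalty when moved away from
    their post-task values (the anchor)."""
    def __init__(self, n_params):
        self.fisher = np.zeros(n_params)
        self.anchor = np.zeros(n_params)           # parameters after the last task

    def add_task(self, task_fisher, new_anchor):
        self.fisher += task_fisher                 # accumulate importance task by task
        self.anchor = new_anchor.copy()

    def penalty(self, theta, lam=2.0):
        return 0.5 * lam * np.sum(self.fisher * (theta - self.anchor) ** 2)

acc = FisherAccumulator(2)
acc.add_task(np.array([1.0, 0.0]), np.array([0.0, 0.0]))   # task 1: parameter 0 matters
acc.add_task(np.array([0.0, 2.0]), np.array([0.0, 0.0]))   # task 2: parameter 1 matters
print(acc.penalty(np.array([1.0, 1.0])))  # 0.5 * 2 * (1*1 + 2*1) = 3.0
```

During training on a new task, this penalty is added to the task loss, so parameters with accumulated importance stay near the anchor while unimportant ones remain free to adapt.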
Title: Towards Long-Term Remembering in Federated Continual Learning (Cognitive Computation)
Pub Date : 2024-06-17 DOI: 10.1007/s12559-024-10310-3
Mahmuda Akter, Nour Moustafa, Benjamin Turnbull
Smart healthcare systems promise significant benefits for fast and accurate medical decisions. However, working with personal health data presents new privacy issues and constraints that must be solved from a cybersecurity perspective. Edge intelligence-enabled federated learning is a new scheme that utilises decentralised computing to carry out data analytics at the edge of a network, enhancing data privacy. However, this scheme suffers from privacy attacks, including inference, free-riding, and man-in-the-middle attacks, especially when serverless computing allocates resources to user needs. Edge intelligence-enabled federated learning requires client insertion and deletion to authenticate genuine clients, and a serverless computing capability to ensure the security of collaborative machine learning models. This work introduces a serverless privacy edge intelligence-based federated learning (SPEI-FL) framework to address these issues. SPEI-FL includes a federated edge aggregator and an authentication method that improve the data privacy of federated learning and allow clients to be added or removed without impacting the overall learning process. It can also classify intruders through serverless computing processes. The proposed framework was evaluated on the unstructured COVID-19 medical chest X-ray and MNIST digit datasets, and on the structured BoT-IoT dataset. Its performance is comparable with existing authentication methods, and it reported higher accuracy than comparable methods (approximately 90% versus the 81% reported by peer methods). The proposed authentication method prevents the exposure of sensitive patient information during medical device authentication and would become a cornerstone of next-generation medical security with serverless computing.
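The aggregation step a federated edge aggregator performs can be sketched as size-weighted federated averaging (FedAvg); the client weights and sample counts below are illustrative, and the authentication, intruder-classification, and serverless layers of SPEI-FL are out of scope for this sketch.

```python
import numpy as np

def fedavg(client_params, client_sizes):
    """Size-weighted federated averaging: each client's parameter vector
    contributes in proportion to how much local data it trained on."""
    total = sum(client_sizes)
    return sum((n / total) * p for p, n in zip(client_params, client_sizes))

# Two clients with different amounts of local data.
w_a = np.array([0.0, 0.0])
w_b = np.array([2.0, 2.0])
global_w = fedavg([w_a, w_b], client_sizes=[1, 3])
print(global_w)  # [1.5 1.5]
```

Because the aggregate depends only on the submitted parameter vectors and sizes, adding or removing a client between rounds simply changes the weighting, which is what lets a framework admit and evict clients without restarting training.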
{"title":"SPEI-FL: Serverless Privacy Edge Intelligence-Enabled Federated Learning in Smart Healthcare Systems","authors":"Mahmuda Akter, Nour Moustafa, Benjamin Turnbull","doi":"10.1007/s12559-024-10310-3","DOIUrl":"https://doi.org/10.1007/s12559-024-10310-3","url":null,"abstract":"<p>Smart healthcare systems promise significant benefits for fast and accurate medical decisions. However, working with personal health data raises new privacy issues and constraints that must be addressed from a cybersecurity perspective. Edge intelligence-enabled federated learning is a new scheme that utilises decentralised computing, allowing data analytics to be carried out at the edge of a network and thereby enhancing data privacy. However, this scheme is vulnerable to privacy attacks, including inference, free-riding, and man-in-the-middle attacks, especially when serverless computing is used to allocate resources to user needs. Edge intelligence-enabled federated learning requires client data insertion and deletion to authenticate genuine clients, and a serverless computing capability to ensure the security of collaborative machine learning models. This work introduces a serverless privacy edge intelligence-based federated learning (SPEI-FL) framework to address these issues. SPEI-FL includes a federated edge aggregator and an authentication method that improve the data privacy of federated learning and allow clients to be added or removed without impacting the overall learning process. It can also classify intruders through serverless computing processes. The proposed framework was evaluated on the unstructured COVID-19 chest X-ray and MNIST digit datasets, and on the structured BoT-IoT dataset. Its performance is comparable with existing authentication methods, and it achieves higher accuracy (approximately 90%, compared with the 81% reported by peer methods). The proposed authentication method prevents the exposure of sensitive patient information during medical device authentication and could become the cornerstone of the next generation of medical security with serverless computing.</p>","PeriodicalId":51243,"journal":{"name":"Cognitive Computation","volume":"86 1","pages":""},"PeriodicalIF":5.4,"publicationDate":"2024-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141502720","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
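The client insertion and deletion this abstract calls for — aggregating updates only from authenticated clients, and removing clients without disrupting the learning process — can be sketched as a toy federated-averaging round over a registry of registered clients. This is a generic illustration, not the SPEI-FL protocol itself; the class and method names are invented for the example, and real authentication would use cryptographic credentials rather than a plain token.

```python
def fed_avg(client_updates):
    """Average parameter vectors from accepted clients, weighted by each
    client's sample count (standard federated averaging)."""
    total = sum(n for _, n in client_updates)
    dim = len(client_updates[0][0])
    agg = [0.0] * dim
    for params, n in client_updates:
        for i, v in enumerate(params):
            agg[i] += v * (n / total)
    return agg

class EdgeAggregator:
    """Toy registry: only updates from registered (authenticated) clients
    are aggregated; removal takes effect from the next round."""
    def __init__(self):
        self.clients = {}

    def register(self, cid, token):
        self.clients[cid] = token  # stand-in for a real authentication step

    def remove(self, cid):
        self.clients.pop(cid, None)

    def aggregate(self, updates):
        # updates: list of (client_id, params, sample_count)
        accepted = [(p, n) for cid, p, n in updates if cid in self.clients]
        return fed_avg(accepted) if accepted else None
```

Note how an unregistered "free-riding" client's update is simply dropped, and how removing a client changes nothing about the aggregation logic itself.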
Pub Date : 2024-06-14DOI: 10.1007/s12559-024-10315-y
Zhirui Zeng, Jialing He, Tao Xiang, Ning Wang, Biwen Chen, Shangwei Guo
The burgeoning practice of unauthorized acquisition and utilization of personal textual data (e.g., social media comments and search histories) by certain entities has become a discernible trend. To uphold data protection regulations such as the Asia–Pacific Privacy Initiative (APPI) and to identify instances of unpermitted exploitation of personal data, we propose a novel and efficient audit framework that helps users conduct cognitive analysis to determine whether their textual data was used for data augmentation and for training a discriminative model. In particular, we focus on auditing models that use BERT as the backbone for discriminating text and are at the core of popular online services. We first propose an accumulated discrepancy score, which involves not only the response of the target model to the auditing sample but also the discrepancies between the responses of the pre-trained and fine-tuned models, to identify membership. We implement two types of audit methods (i.e., sample-level and user-level) under our framework and conduct comprehensive experiments on two downstream applications to evaluate their performance. The experimental results demonstrate that our sample-level auditing achieves an AUC of 89.7% and an accuracy of 83%, whereas the user-level method can audit membership with an AUC of 89.7% and an accuracy of 88%. Additionally, we analyse how augmentation methods impact auditing performance and explain the underlying reasons for these observations.
{"title":"Cognitive Tracing Data Trails: Auditing Data Provenance in Discriminative Language Models Using Accumulated Discrepancy Score","authors":"Zhirui Zeng, Jialing He, Tao Xiang, Ning Wang, Biwen Chen, Shangwei Guo","doi":"10.1007/s12559-024-10315-y","DOIUrl":"https://doi.org/10.1007/s12559-024-10315-y","url":null,"abstract":"<p>The burgeoning practice of unauthorized acquisition and utilization of personal textual data (e.g., social media comments and search histories) by certain entities has become a discernible trend. To uphold data protection regulations such as the Asia–Pacific Privacy Initiative (APPI) and to identify instances of unpermitted exploitation of personal data, we propose a novel and efficient audit framework that helps users conduct cognitive analysis to determine whether their textual data was used for data augmentation and for training a discriminative model. In particular, we focus on auditing models that use BERT as the backbone for discriminating text and are at the core of popular online services. We first propose an accumulated discrepancy score, which involves not only the response of the target model to the auditing sample but also the discrepancies between the responses of the pre-trained and fine-tuned models, to identify membership. We implement two types of audit methods (i.e., sample-level and user-level) under our framework and conduct comprehensive experiments on two downstream applications to evaluate their performance. The experimental results demonstrate that our sample-level auditing achieves an AUC of 89.7% and an accuracy of 83%, whereas the user-level method can audit membership with an AUC of 89.7% and an accuracy of 88%. Additionally, we analyse how augmentation methods impact auditing performance and explain the underlying reasons for these observations.</p>","PeriodicalId":51243,"journal":{"name":"Cognitive Computation","volume":"1 1","pages":""},"PeriodicalIF":5.4,"publicationDate":"2024-06-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141512680","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
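The accumulated-discrepancy idea in this abstract can be pictured as summing, over an auditing sample and its augmented variants, how much more confident the fine-tuned model is than its pre-trained backbone, then thresholding the total to decide membership. The sketch below (a simple confidence difference, additive accumulation, and a fixed calibrated threshold) is a hedged reading of the abstract, not the paper's exact score; the function names and the threshold are assumptions.

```python
def accumulated_discrepancy(sample, variants, finetuned, pretrained):
    """Sum, over the sample and its augmented variants, of how much more
    confident the fine-tuned model is than the pre-trained backbone.
    Training members tend to show a larger accumulated gap, because
    fine-tuning sharpened the model's responses on them specifically."""
    score = 0.0
    for x in [sample] + variants:
        score += finetuned(x) - pretrained(x)
    return score

def audit_membership(score, threshold):
    """Flag the sample as a likely training member when the accumulated
    discrepancy exceeds a threshold calibrated on known non-members."""
    return score > threshold
```

Here `finetuned` and `pretrained` stand in for the target model's and backbone's confidence on the true label; in practice these would be BERT classifier outputs, and the variants would come from the same augmentation pipeline being audited.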