{"title":"基于新型解释混合模型的Deepfake图像分类","authors":"Sudarshana Kerenalli, Vamsidhar Yendapalli, Mylarareddy Chinnaiah","doi":"10.21512/commit.v17i2.8761","DOIUrl":null,"url":null,"abstract":"In court, criminal investigations and identity management tools, like check-in and payment logins, face videos, and photos, are used as evidence more frequently. Although deeply falsified information may be found using deep learning classifiers, block-box decisionmaking makes forensic investigation in criminal trials more challenging. Therefore, the research suggests a three-step classification technique to classify the deceptive deepfake image content. The research examines the visual assessments of an EfficientNet and Shifted Window Transformer (SWinT) hybrid model based on Convolutional Neural Network (CNN) and Transformer architectures. The classifier generality is improved in the first stage using a different augmentation. Then, the hybrid model is developed in the second step by combining the EfficientNet and Shifted Window Transformer architectures. Next, the GradCAM approach for assessing human understanding demonstrates deepfake visual interpretation. In 14,204 images for the validation set, there are 7,096 fake photos and 7,108 real images. In contrast to focusing only on a few discrete face parts, the research shows that the entire deepfake image should be investigated. On a custom dataset of real, Generative Adversarial Networks (GAN)-generated, and human-altered web photos, the proposed method achieves an accuracy of 98.45%, a recall of 99.12%, and a loss of 0.11125. The proposed method successfully distinguishes between real and manipulated images. Moreover, the presented approach can assist investigators in clarifying the composition of the artificially produced material.","PeriodicalId":31276,"journal":{"name":"CommIT Journal","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Classification of Deepfake Images Using a Novel Explanatory Hybrid Model\",\"authors\":\"Sudarshana Kerenalli, Vamsidhar Yendapalli, Mylarareddy Chinnaiah\",\"doi\":\"10.21512/commit.v17i2.8761\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In court, criminal investigations and identity management tools, like check-in and payment logins, face videos, and photos, are used as evidence more frequently. Although deeply falsified information may be found using deep learning classifiers, block-box decisionmaking makes forensic investigation in criminal trials more challenging. Therefore, the research suggests a three-step classification technique to classify the deceptive deepfake image content. The research examines the visual assessments of an EfficientNet and Shifted Window Transformer (SWinT) hybrid model based on Convolutional Neural Network (CNN) and Transformer architectures. The classifier generality is improved in the first stage using a different augmentation. Then, the hybrid model is developed in the second step by combining the EfficientNet and Shifted Window Transformer architectures. Next, the GradCAM approach for assessing human understanding demonstrates deepfake visual interpretation. In 14,204 images for the validation set, there are 7,096 fake photos and 7,108 real images. In contrast to focusing only on a few discrete face parts, the research shows that the entire deepfake image should be investigated. 
On a custom dataset of real, Generative Adversarial Networks (GAN)-generated, and human-altered web photos, the proposed method achieves an accuracy of 98.45%, a recall of 99.12%, and a loss of 0.11125. The proposed method successfully distinguishes between real and manipulated images. Moreover, the presented approach can assist investigators in clarifying the composition of the artificially produced material.\",\"PeriodicalId\":31276,\"journal\":{\"name\":\"CommIT Journal\",\"volume\":\"29 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-09-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"CommIT Journal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.21512/commit.v17i2.8761\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"Computer Science\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"CommIT Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21512/commit.v17i2.8761","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Computer Science","Score":null,"Total":0}
Classification of Deepfake Images Using a Novel Explanatory Hybrid Model
Face videos and photos are increasingly used as evidence in court, in criminal investigations, and in identity management tools such as check-in and payment logins. Although deepfake content can be detected by deep learning classifiers, their black-box decision-making makes forensic investigation in criminal trials more challenging. Therefore, the research proposes a three-step technique to classify deceptive deepfake image content. The research examines the visual assessments of an EfficientNet and Shifted Window Transformer (SWinT) hybrid model based on Convolutional Neural Network (CNN) and Transformer architectures. In the first stage, classifier generality is improved using different augmentations. In the second step, the hybrid model is developed by combining the EfficientNet and Shifted Window Transformer architectures. Next, the GradCAM approach is applied to provide human-understandable visual interpretations of the deepfake decisions. The validation set of 14,204 images contains 7,096 fake photos and 7,108 real images. In contrast to focusing on only a few discrete face parts, the research shows that the entire deepfake image should be examined. On a custom dataset of real, Generative Adversarial Networks (GAN)-generated, and human-altered web photos, the proposed method achieves an accuracy of 98.45%, a recall of 99.12%, and a loss of 0.11125. The proposed method successfully distinguishes between real and manipulated images. Moreover, it can assist investigators in clarifying the composition of artificially produced material.
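To make the hybrid architecture concrete, below is a minimal sketch of how an EfficientNet backbone and a Swin Transformer backbone could be fused into a single binary (real/fake) classifier, assuming PyTorch and the timm model zoo. The specific backbone variants (efficientnet_b0, swin_tiny_patch4_window7_224), the concatenation-based fusion, and the classifier head are illustrative assumptions, not the authors' exact configuration; GradCAM heatmaps for interpretation would then typically be computed on the CNN branch's final convolutional layer.

```python
# A minimal sketch (not the paper's released code) of a hybrid
# EfficientNet + Shifted Window Transformer (SWinT) classifier.
# Assumes PyTorch and the `timm` library; backbone names and the
# feature-fusion head are illustrative choices.
import torch
import torch.nn as nn
import timm


class HybridEffNetSwin(nn.Module):
    """Concatenate pooled CNN and Transformer features, then classify."""

    def __init__(self, num_classes: int = 2, pretrained: bool = False):
        super().__init__()
        # num_classes=0 makes timm return pooled feature vectors
        # instead of classification logits.
        self.cnn = timm.create_model(
            "efficientnet_b0", pretrained=pretrained, num_classes=0)
        self.vit = timm.create_model(
            "swin_tiny_patch4_window7_224", pretrained=pretrained, num_classes=0)
        fused_dim = self.cnn.num_features + self.vit.num_features
        self.head = nn.Sequential(
            nn.LayerNorm(fused_dim),
            nn.Dropout(0.2),
            nn.Linear(fused_dim, num_classes),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Both backbones here expect 224x224 RGB input.
        feats = torch.cat([self.cnn(x), self.vit(x)], dim=1)
        return self.head(feats)


if __name__ == "__main__":
    model = HybridEffNetSwin()
    logits = model(torch.randn(2, 3, 224, 224))  # [batch, 2] real/fake logits
    print(logits.shape)
```

Concatenating the two pooled feature vectors is the simplest fusion strategy; it lets the linear head weigh local texture cues from the CNN against the global, window-attention features from the Transformer, which matches the abstract's point that the entire image, not just isolated face parts, should inform the decision.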