{"title":"Polling Mechanism for Video Deepfake","authors":"Jian-Jiun Ding, Hsuan-Wei Hsu, Chien-Wei Huang","doi":"10.1109/ECICE52819.2021.9645661","DOIUrl":null,"url":null,"abstract":"A polling-based mechanism is developed to identify whether a video is forged. It is important for forensic image processing. However, most of the existing video deepfake algorithms are frame-based. In other words, a learning-based method is applied to identify whether a frame is forged then the mean of the fake score for all frames is applied to determine whether the whole video is forged. In this work, we propose a polling mechanism to well integrate the deepfake score of each frame. We found that the misidentification of a deepfake algorithm usually occurs in the frame with drastic motion, a tilted head, and blinking eyes. Therefore, we determine the weights of each frame according to the frame difference, the head orientation, whether the eyes are blinking, and the accuracy rate of the validation data. With the proposed polling mechanism, the accuracy of video deepfake can be improved and whether a video is forged can be well determined using a much smaller number of frames.","PeriodicalId":176225,"journal":{"name":"2021 IEEE 3rd Eurasia Conference on IOT, Communication and Engineering (ECICE)","volume":"190 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 3rd Eurasia Conference on IOT, Communication and Engineering (ECICE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ECICE52819.2021.9645661","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
A polling-based mechanism is developed to identify whether a video is forged. It is important for forensic image processing. However, most of the existing video deepfake algorithms are frame-based. In other words, a learning-based method is applied to identify whether a frame is forged then the mean of the fake score for all frames is applied to determine whether the whole video is forged. In this work, we propose a polling mechanism to well integrate the deepfake score of each frame. We found that the misidentification of a deepfake algorithm usually occurs in the frame with drastic motion, a tilted head, and blinking eyes. Therefore, we determine the weights of each frame according to the frame difference, the head orientation, whether the eyes are blinking, and the accuracy rate of the validation data. With the proposed polling mechanism, the accuracy of video deepfake can be improved and whether a video is forged can be well determined using a much smaller number of frames.