Generative Steganography via Live Comments on Streaming Video Frames

IF 4.5 2区计算机科学 Q1 COMPUTER SCIENCE, CYBERNETICS IEEE Transactions on Computational Social Systems Pub Date : 2024-03-12 DOI:10.1109/TCSS.2024.3352979

Yuling Liu;Cuilin Wang;Jie Wang;Bo Ou;Xin Liao

{"title":"Generative Steganography via Live Comments on Streaming Video Frames","authors":"Yuling Liu;Cuilin Wang;Jie Wang;Bo Ou;Xin Liao","doi":"10.1109/TCSS.2024.3352979","DOIUrl":null,"url":null,"abstract":"Generative text steganography has received considerable attention in the covert communication community for the benefit of sending secret messages without the need to modify carriers. Existing methods typically choose the next word when generating a stego-text based on conditional probability encoding of candidates, which may lead to generating inadequate words for the underlying secret message. How to generate a semantically controllable stego-text with a high capacity on secure embedding of a secret message is a main challenge. We address this challenge by proposing a new paradigm to generative text steganography that takes advantage of certain social media through apparently normal behaviors from the sender. In particular, we make use of the live commenting feature provided by public video sharing platforms (PVSPs), which allow viewers to make comments on video scenes that will fly on screens when the scenes are shown. We show that this feature can be used to construct a generative steganographic system. The sender generates at random a number of distracting words and a certain invertible matrix called W-\n<inline-formula><tex-math>$d$</tex-math></inline-formula>\n matrix based on the total number of message words and distracting words. The sender then transforms a sequence of indexes of these words to a sequence, selects one or more videos with a sufficiently large number of total frames, and generates a comment on each frame in the sequence. The receiver extracts commented frame indexes, uses the shared stego-key to generate the same W-\n<inline-formula><tex-math>$d$</tex-math></inline-formula>\n matrix as the sender, and obtains the secret message using the inverse of the matrix. The stego-key consists of a vocabulary generator and a W-\n<inline-formula><tex-math>$d$</tex-math></inline-formula>\n matrix generator (WMG) based on pseudorandomly generated numbers. To generate comments on frames that conform to comments made by viewers, we devise a neural ResNet-LSTM model to generate a comment for an input image based on its content. Theoretical analysis shows that commented video frames (CVF) is covert, secure, efficient, and feasible to conceal any message of arbitrary length. We implement CVF and present evaluation results from multiple aspects that our work outperforms the existing stego-methods.","PeriodicalId":13044,"journal":{"name":"IEEE Transactions on Computational Social Systems","volume":null,"pages":null},"PeriodicalIF":4.5000,"publicationDate":"2024-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Computational Social Systems","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10462497/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, CYBERNETICS","Score":null,"Total":0}

引用次数: 0

Abstract

Generative text steganography has received considerable attention in the covert communication community for the benefit of sending secret messages without the need to modify carriers. Existing methods typically choose the next word when generating a stego-text based on conditional probability encoding of candidates, which may lead to generating inadequate words for the underlying secret message. How to generate a semantically controllable stego-text with a high capacity on secure embedding of a secret message is a main challenge. We address this challenge by proposing a new paradigm to generative text steganography that takes advantage of certain social media through apparently normal behaviors from the sender. In particular, we make use of the live commenting feature provided by public video sharing platforms (PVSPs), which allow viewers to make comments on video scenes that will fly on screens when the scenes are shown. We show that this feature can be used to construct a generative steganographic system. The sender generates at random a number of distracting words and a certain invertible matrix called W-

$d$

matrix based on the total number of message words and distracting words. The sender then transforms a sequence of indexes of these words to a sequence, selects one or more videos with a sufficiently large number of total frames, and generates a comment on each frame in the sequence. The receiver extracts commented frame indexes, uses the shared stego-key to generate the same W-

$d$

matrix as the sender, and obtains the secret message using the inverse of the matrix. The stego-key consists of a vocabulary generator and a W-

$d$

matrix generator (WMG) based on pseudorandomly generated numbers. To generate comments on frames that conform to comments made by viewers, we devise a neural ResNet-LSTM model to generate a comment for an input image based on its content. Theoretical analysis shows that commented video frames (CVF) is covert, secure, efficient, and feasible to conceal any message of arbitrary length. We implement CVF and present evaluation results from multiple aspects that our work outperforms the existing stego-methods.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

通过对流媒体视频帧的实时评论生成隐写术

生成式文本隐写术无需修改载体即可发送秘密信息，因此在隐蔽通信领域受到广泛关注。现有方法在生成隐写文本时，通常根据候选词的条件概率编码来选择下一个词，这可能会导致生成的词不适合底层密文。如何在安全嵌入密文的基础上生成高容量、语义可控的隐去文本是一个主要挑战。为了应对这一挑战，我们提出了一种新的生成文本隐写术范式，即通过发送者表面上的正常行为来利用某些社交媒体。特别是，我们利用了公共视频共享平台（PVSP）提供的实时评论功能，该功能允许观众对视频场景发表评论，这些评论会在场景播放时出现在屏幕上。我们证明，这一功能可用于构建生成式隐写系统。发送者根据信息字词和干扰字词的总数随机生成一定数量的干扰字词和称为 W-$d$ 矩阵的可逆矩阵。然后，发送方将这些词的索引序列转换为序列，选择一个或多个总帧数足够多的视频，并为序列中的每个帧生成注释。接收方提取注释帧索引，使用共享的隐密密钥生成与发送方相同的 W-$d$ 矩阵，并利用矩阵的逆变换获取密文。隐密密钥由词汇生成器和基于伪随机生成数字的 W-$d$ 矩阵生成器 (WMG) 组成。为了生成符合观众评论的帧评论，我们设计了一个 ResNet-LSTM 神经模型，根据输入图像的内容生成评论。理论分析表明，评论视频帧（CVF）具有隐蔽性、安全性、高效性和可行性，可以隐藏任意长度的信息。我们实现了 CVF，并从多个方面给出了评估结果，证明我们的工作优于现有的偷窃方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Transactions on Computational Social Systems Social Sciences-Social Sciences (miscellaneous)

CiteScore

10.00

自引率

20.00%

发文量

316

期刊介绍： IEEE Transactions on Computational Social Systems focuses on such topics as modeling, simulation, analysis and understanding of social systems from the quantitative and/or computational perspective. "Systems" include man-man, man-machine and machine-machine organizations and adversarial situations as well as social media structures and their dynamics. More specifically, the proposed transactions publishes articles on modeling the dynamics of social systems, methodologies for incorporating and representing socio-cultural and behavioral aspects in computational modeling, analysis of social system behavior and structure, and paradigms for social systems modeling and simulation. The journal also features articles on social network dynamics, social intelligence and cognition, social systems design and architectures, socio-cultural modeling and representation, and computational behavior modeling, and their applications.

期刊最新文献

Table of Contents Guest Editorial: Special Issue on Dark Side of the Socio-Cyber World: Media Manipulation, Fake News, and Misinformation IEEE Transactions on Computational Social Systems Publication Information IEEE Transactions on Computational Social Systems Information for Authors IEEE Systems, Man, and Cybernetics Society Information