Dynamic and Super-Personalized Media Ecosystem Driven by Generative AI: Unpredictable Plays Never Repeating the Same

IF 4.8 1区计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Transactions on Broadcasting Pub Date : 2024-04-22 DOI:10.1109/TBC.2024.3380474

Sungjun Ahn;Hyun-Jeong Yim;Youngwan Lee;Sung-Ik Park

{"title":"Dynamic and Super-Personalized Media Ecosystem Driven by Generative AI: Unpredictable Plays Never Repeating the Same","authors":"Sungjun Ahn;Hyun-Jeong Yim;Youngwan Lee;Sung-Ik Park","doi":"10.1109/TBC.2024.3380474","DOIUrl":null,"url":null,"abstract":"This paper introduces a media service model that exploits artificial intelligence (AI) video generators at the receive end. This proposal deviates from the traditional multimedia ecosystem, completely relying on in-house production, by shifting part of the content creation onto the receiver. We bring a semantic process into the framework, allowing the distribution network to provide service elements that prompt the content generator rather than distributing encoded data of fully finished programs. The service elements include fine-tailored text descriptions, lightweight image data of some objects, or application programming interfaces, comprehensively referred to as semantic sources, and the user terminal translates the received semantic data into video frames. Empowered by the random nature of generative AI, users can experience super-personalized services accordingly. The proposed idea incorporates situations in which the user receives different service providers’ element packages, either in a sequence over time or multiple packages at the same time. Given promised in-context coherence and content integrity, the combinatory dynamics will amplify the service diversity, allowing the users to always chance upon new experiences. This work particularly aims at short-form videos and advertisements, which the users would easily feel fatigued by seeing the same frame sequence every time. In those use cases, the content provider’s role will be recast as scripting semantic sources, transformed from a thorough producer. Overall, this work explores a new form of media ecosystem facilitated by receiver-embedded generative models, featuring both random content dynamics and enhanced delivery efficiency simultaneously.","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"70 3","pages":"980-994"},"PeriodicalIF":4.8000,"publicationDate":"2024-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Broadcasting","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10506327/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

Abstract

This paper introduces a media service model that exploits artificial intelligence (AI) video generators at the receive end. This proposal deviates from the traditional multimedia ecosystem, completely relying on in-house production, by shifting part of the content creation onto the receiver. We bring a semantic process into the framework, allowing the distribution network to provide service elements that prompt the content generator rather than distributing encoded data of fully finished programs. The service elements include fine-tailored text descriptions, lightweight image data of some objects, or application programming interfaces, comprehensively referred to as semantic sources, and the user terminal translates the received semantic data into video frames. Empowered by the random nature of generative AI, users can experience super-personalized services accordingly. The proposed idea incorporates situations in which the user receives different service providers’ element packages, either in a sequence over time or multiple packages at the same time. Given promised in-context coherence and content integrity, the combinatory dynamics will amplify the service diversity, allowing the users to always chance upon new experiences. This work particularly aims at short-form videos and advertisements, which the users would easily feel fatigued by seeing the same frame sequence every time. In those use cases, the content provider’s role will be recast as scripting semantic sources, transformed from a thorough producer. Overall, this work explores a new form of media ecosystem facilitated by receiver-embedded generative models, featuring both random content dynamics and enhanced delivery efficiency simultaneously.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

由生成式人工智能驱动的动态和超级个性化媒体生态系统：不可预测的剧目永不重演

本文介绍了一种在接收端利用人工智能（AI）视频生成器的媒体服务模式。这一建议偏离了完全依赖内部制作的传统多媒体生态系统，将部分内容创作转移到了接收端。我们将语义流程引入框架，允许分发网络提供服务元素，提示内容生成器，而不是分发已完全完成节目的编码数据。服务元素包括量身定制的文本描述、某些对象的轻量级图像数据或应用程序编程接口，统称为语义源，用户终端将接收到的语义数据转化为视频帧。借助生成式人工智能的随机性，用户可以体验到超级个性化的服务。所提出的想法包含了用户接收不同服务提供商元素包的情况，这些元素包可以是一段时间内的序列包，也可以是同时接收的多个包。在保证上下文一致性和内容完整性的前提下，组合动态将扩大服务的多样性，让用户始终有机会获得新体验。这项工作尤其针对短视频和广告，因为用户很容易因为每次都看到相同的帧序列而感到疲劳。在这些使用案例中，内容提供商的角色将从彻底的制作者转变为脚本语义源。总之，这项工作探索了一种由嵌入式接收器生成模型促进的新型媒体生态系统，其特点是同时具有随机内容动态和更高的传输效率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Transactions on Broadcasting 工程技术-电信学

CiteScore

9.40

自引率

31.10%

发文量

审稿时长

6-12 weeks

期刊介绍： The Society’s Field of Interest is “Devices, equipment, techniques and systems related to broadcast technology, including the production, distribution, transmission, and propagation aspects.” In addition to this formal FOI statement, which is used to provide guidance to the Publications Committee in the selection of content, the AdCom has further resolved that “broadcast systems includes all aspects of transmission, propagation, and reception.”

期刊最新文献

Table of Contents 2025 Scott Helt Memorial Award for the Best Paper Published in IEEE Transactions on Broadcasting IEEE Transactions on Broadcasting Information for Readers and Authors IEEE Transactions on Broadcasting Publication Information Blind Light Field Image Quality Assessment Using Multiplane Texture and Multilevel Wavelet Information