一个低成本的虚拟2D代言人角色广告框架

2022 IEEE International Conference on Multimedia and Expo Workshops (ICMEW) Pub Date : 2022-07-18 DOI:10.1109/ICMEW56448.2022.9859278

Jiarun Zhang, Zhao Li, Jialun Zhang, Zhiqiang Zhang

{"title":"一个低成本的虚拟2D代言人角色广告框架","authors":"Jiarun Zhang, Zhao Li, Jialun Zhang, Zhiqiang Zhang","doi":"10.1109/ICMEW56448.2022.9859278","DOIUrl":null,"url":null,"abstract":"Live-streaming advertising has achieved huge success in modern retail platforms. However, small-scaled merchants are neither economically nor technically capable of having their own spokes-person. Addressing the need for the massive online interactive advertising, this paper proposes an economic-efficient approach, Virtual spokes-Character Advertising (VSCA). VSCA generates 2-D Virtual spokes-Character advertising video and provides it to the merchants as a supplementary marketing method. VSCA first generates the simplified natural language description of the merchandise from its original long title using text generation methods and then passes it to the Text-to-Speech model for the audio description. Secondly, VSCA remits the audio to our remodeled two-phases lip-syncing network to generate virtual advertising videos about the given merchandise. With our novelly designed two-phases lip-syncing network, it is the first in the industry able to generate lip-syncing video of given audio with human face image input instead of video input. As the industry’s first application on 2D spokes-character advertising, VSCA has its large potential in real world applications.","PeriodicalId":106759,"journal":{"name":"2022 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Low-Cost Virtual 2D Spokes-Character Advertising Framework\",\"authors\":\"Jiarun Zhang, Zhao Li, Jialun Zhang, Zhiqiang Zhang\",\"doi\":\"10.1109/ICMEW56448.2022.9859278\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Live-streaming advertising has achieved huge success in modern retail platforms. However, small-scaled merchants are neither economically nor technically capable of having their own spokes-person. Addressing the need for the massive online interactive advertising, this paper proposes an economic-efficient approach, Virtual spokes-Character Advertising (VSCA). VSCA generates 2-D Virtual spokes-Character advertising video and provides it to the merchants as a supplementary marketing method. VSCA first generates the simplified natural language description of the merchandise from its original long title using text generation methods and then passes it to the Text-to-Speech model for the audio description. Secondly, VSCA remits the audio to our remodeled two-phases lip-syncing network to generate virtual advertising videos about the given merchandise. With our novelly designed two-phases lip-syncing network, it is the first in the industry able to generate lip-syncing video of given audio with human face image input instead of video input. As the industry’s first application on 2D spokes-character advertising, VSCA has its large potential in real world applications.\",\"PeriodicalId\":106759,\"journal\":{\"name\":\"2022 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-07-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICMEW56448.2022.9859278\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMEW56448.2022.9859278","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

直播广告在现代零售平台上取得了巨大的成功。然而，小商家既没有经济能力，也没有技术能力拥有自己的代言人。针对大规模网络互动广告的需求，本文提出了一种经济高效的方式——虚拟代言人广告(VSCA)。VSCA生成二维虚拟代言人广告视频，作为一种补充营销手段提供给商家。VSCA首先使用文本生成方法从商品的原始长标题生成简化的自然语言描述，然后将其传递给文本到语音模型进行音频描述。其次，VSCA将音频发送到我们改造的两阶段对口型网络，以生成关于给定商品的虚拟广告视频。凭借我们新颖设计的两阶段对口型网络，它是业界第一个能够以人脸图像输入代替视频输入的给定音频生成对口型视频的网络。作为业界首个2D代言人广告应用，VSCA在现实世界的应用潜力巨大。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

A Low-Cost Virtual 2D Spokes-Character Advertising Framework

Live-streaming advertising has achieved huge success in modern retail platforms. However, small-scaled merchants are neither economically nor technically capable of having their own spokes-person. Addressing the need for the massive online interactive advertising, this paper proposes an economic-efficient approach, Virtual spokes-Character Advertising (VSCA). VSCA generates 2-D Virtual spokes-Character advertising video and provides it to the merchants as a supplementary marketing method. VSCA first generates the simplified natural language description of the merchandise from its original long title using text generation methods and then passes it to the Text-to-Speech model for the audio description. Secondly, VSCA remits the audio to our remodeled two-phases lip-syncing network to generate virtual advertising videos about the given merchandise. With our novelly designed two-phases lip-syncing network, it is the first in the industry able to generate lip-syncing video of given audio with human face image input instead of video input. As the industry’s first application on 2D spokes-character advertising, VSCA has its large potential in real world applications.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)

自引率

0.00%

发文量