{"title":"Research on Speech-based Digital Human Driving System Based on Unreal Engine","authors":"Lai Wei, Yutong Wang, Dongdong Li","doi":"10.1109/BMSB58369.2023.10211201","DOIUrl":null,"url":null,"abstract":"This paper proposed and realized a speech-based digital human driving system in Unreal engine. Based on blendshapes, this paper proposed a micro-expression driving algorithm to realize the change of expressions dynamically. In order to drive lip shapes efficiently, shape libraries are used to cache basic facial data instead of storing animations, which can reduce additional overhead of hardware resources, and can also achieve reuse and expansion of data. Moreover, this paper introduces the proxy middleware into the architecture, which can reduce bandwidth consumption. This system can be applied in intelligent services such as tour guiding, educational counseling, exhibition interpreting and other scenes, which does not affect the experiences of activities, but can greatly reduce the cost in terms of transportation, time and financial resources, and has broad application prospects.","PeriodicalId":13080,"journal":{"name":"IEEE international Symposium on Broadband Multimedia Systems and Broadcasting","volume":"1 1","pages":"1-6"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE international Symposium on Broadband Multimedia Systems and Broadcasting","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BMSB58369.2023.10211201","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This paper proposed and realized a speech-based digital human driving system in Unreal engine. Based on blendshapes, this paper proposed a micro-expression driving algorithm to realize the change of expressions dynamically. In order to drive lip shapes efficiently, shape libraries are used to cache basic facial data instead of storing animations, which can reduce additional overhead of hardware resources, and can also achieve reuse and expansion of data. Moreover, this paper introduces the proxy middleware into the architecture, which can reduce bandwidth consumption. This system can be applied in intelligent services such as tour guiding, educational counseling, exhibition interpreting and other scenes, which does not affect the experiences of activities, but can greatly reduce the cost in terms of transportation, time and financial resources, and has broad application prospects.