Simultaneous 3-D motion estimation and wire-frame model adaptation including photometric effects for knowledge-based video coding

Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing Pub Date : 1994-04-19 DOI:10.1109/ICASSP.1994.389400

G. Akar, A. Tekalp, L. Onural

{"title":"Simultaneous 3-D motion estimation and wire-frame model adaptation including photometric effects for knowledge-based video coding","authors":"G. Akar, A. Tekalp, L. Onural","doi":"10.1109/ICASSP.1994.389400","DOIUrl":null,"url":null,"abstract":"We address the problem of 3-D motion estimation in the context of knowledge-based coding of facial image sequences. The proposed method handles the global and local motion estimation and the adaptation of a generic wire-frame to a particular speaker simultaneously within an optical flow based framework including the photometric effects of motion. We use a flexible wire-frame model whose local structure is characterized by the normal vectors of the patches which are related to the coordinates of the nodes. Geometrical constraints that describe the propagation of the movement of the nodes are introduced, which are then efficiently utilized to reduce the number of independent structure parameters. A stochastic relaxation algorithm has been used to determine optimum global motion estimates and the parameters describing the structure of the wire-frame model. For the initialization of the motion and structure parameters, a modified feature based algorithm is used. Experimental results with simulated facial image sequences are given.<<ETX>>","PeriodicalId":290798,"journal":{"name":"Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1994-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICASSP.1994.389400","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 7

Abstract

We address the problem of 3-D motion estimation in the context of knowledge-based coding of facial image sequences. The proposed method handles the global and local motion estimation and the adaptation of a generic wire-frame to a particular speaker simultaneously within an optical flow based framework including the photometric effects of motion. We use a flexible wire-frame model whose local structure is characterized by the normal vectors of the patches which are related to the coordinates of the nodes. Geometrical constraints that describe the propagation of the movement of the nodes are introduced, which are then efficiently utilized to reduce the number of independent structure parameters. A stochastic relaxation algorithm has been used to determine optimum global motion estimates and the parameters describing the structure of the wire-frame model. For the initialization of the motion and structure parameters, a modified feature based algorithm is used. Experimental results with simulated facial image sequences are given.<>

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

同时三维运动估计和线框模型适应，包括基于知识的视频编码的光度效应

在基于知识的人脸图像序列编码背景下，研究了三维运动估计问题。该方法在包含运动光度效应的基于光流的框架内同时处理全局和局部运动估计以及通用线框对特定说话者的自适应。我们使用一种柔性线框模型，其局部结构由与节点坐标相关的补丁的法向量来表征。引入描述节点运动传播的几何约束，然后有效地利用这些约束来减少独立结构参数的数量。采用随机松弛算法确定最优全局运动估计和描述线框模型结构的参数。对于运动和结构参数的初始化，采用了一种改进的基于特征的算法。给出了模拟人脸图像序列的实验结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing

自引率

0.00%

发文量