Perceptually-Aligned Dynamic Facial Projection Mapping by High-Speed Face-Tracking Method and Lens-Shift Co-Axial Setup

Hao-Lun Peng;Kengo Sato;Soran Nakagawa;Yoshihiro Watanabe
{"title":"Perceptually-Aligned Dynamic Facial Projection Mapping by High-Speed Face-Tracking Method and Lens-Shift Co-Axial Setup","authors":"Hao-Lun Peng;Kengo Sato;Soran Nakagawa;Yoshihiro Watanabe","doi":"10.1109/TVCG.2025.3527203","DOIUrl":null,"url":null,"abstract":"Dynamic Facial Projection Mapping (DFPM) overlays computer-generated images onto human faces to create immersive experiences that have been used in the makeup and entertainment industries. In this study, we propose two concepts to reduce the misalignment artifacts between projected images and target faces, which is a persistent challenge for DFPM. Our first concept is a high-speed face-tracking method that exploits temporal information. We first introduce a cropped-area-limited inter/extrapolation-based face detection framework, which allows parallel execution with facial landmark detection. We then propose a novel hybrid facial landmark detection method that combines fast Ensemble of Regression Trees (ERT)-based detections and an auxiliary detection. ERT-based detections rapidly produce results in 0.107 ms using temporal information with the support of auxiliary detection to recover from detection errors. To train the facial landmark detection method, we propose an innovative method for simulating high-frame-rate video annotations to address the lack of publicly available high-frame-rate annotated datasets. Our second concept is a lens-shift co-axial projector-camera setup that maintains a high optical alignment with only a 1.274-pixel error between 1 m and 2 m depth. This setup reduces misalignment by applying the same optical designs to the projector and camera without causing large misalignment as in conventional methods. Based on these concepts, we developed a novel high-speed DFPM system that achieves nearly perfect alignment with human visual perception.","PeriodicalId":94035,"journal":{"name":"IEEE transactions on visualization and computer graphics","volume":"31 10","pages":"6824-6838"},"PeriodicalIF":6.5000,"publicationDate":"2025-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10845069","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on visualization and computer graphics","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10845069/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Dynamic Facial Projection Mapping (DFPM) overlays computer-generated images onto human faces to create immersive experiences that have been used in the makeup and entertainment industries. In this study, we propose two concepts to reduce the misalignment artifacts between projected images and target faces, which is a persistent challenge for DFPM. Our first concept is a high-speed face-tracking method that exploits temporal information. We first introduce a cropped-area-limited inter/extrapolation-based face detection framework, which allows parallel execution with facial landmark detection. We then propose a novel hybrid facial landmark detection method that combines fast Ensemble of Regression Trees (ERT)-based detections and an auxiliary detection. ERT-based detections rapidly produce results in 0.107 ms using temporal information with the support of auxiliary detection to recover from detection errors. To train the facial landmark detection method, we propose an innovative method for simulating high-frame-rate video annotations to address the lack of publicly available high-frame-rate annotated datasets. Our second concept is a lens-shift co-axial projector-camera setup that maintains a high optical alignment with only a 1.274-pixel error between 1 m and 2 m depth. This setup reduces misalignment by applying the same optical designs to the projector and camera without causing large misalignment as in conventional methods. Based on these concepts, we developed a novel high-speed DFPM system that achieves nearly perfect alignment with human visual perception.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于高速人脸跟踪和镜头移位同轴设置的感知对齐动态面部投影映射。
动态面部投影映射(DFPM)将计算机生成的图像覆盖到人脸上,创造出身临其境的体验,这种体验已被用于化妆品和娱乐行业。在本研究中,我们提出了两个概念来减少投影图像与目标表面之间的不对准伪影,这是DFPM一直面临的挑战。我们的第一个概念是利用时间信息的高速面部跟踪方法。我们首先引入了一个基于裁剪区域限制的内部/外推的人脸检测框架,该框架允许与面部地标检测并行执行。然后,我们提出了一种新的混合人脸特征检测方法,该方法将基于快速回归树集成(ERT)的检测与辅助检测相结合。基于ert的检测在0.107 ms内快速产生结果,利用时间信息在辅助检测的支持下从检测错误中恢复。为了训练人脸标记检测方法,我们提出了一种模拟高帧率视频注释的创新方法,以解决公开可用的高帧率注释数据集的不足。我们的第二个概念是镜头移位同轴投影仪-相机设置,它保持高度的光学对准,在1米和2米深度之间只有1.274像素的误差。这种设置通过将相同的光学设计应用于投影仪和相机,而不会像传统方法那样造成大的不校准,从而减少了不校准。基于这些概念,我们开发了一种新的高速DFPM系统,该系统与人类的视觉感知几乎完全一致。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
DynAvatar: Dynamic 3D Head Avatar Deformation With Expression Guided Gaussian Splatting. Understanding the Research-Practice Gap in Visualization Design Guidelines. QuRAFT: Enhancing Quantum Algorithm Design by Visual Linking Between Mathematical Concepts and Quantum Circuits. DanceAgent: Dance Movement Refinement With LLM Agent. Do You "Trust" This Visualization? An Inventory to Measure Trust in Visualizations.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1