Quality-Based rPPG Compensation With Temporal Difference Transformer for Camera-Based Driver Monitoring

IF 8.4 1区 工程技术 Q1 ENGINEERING, CIVIL IEEE Transactions on Intelligent Transportation Systems Pub Date : 2024-12-31 DOI:10.1109/TITS.2024.3504605
Kunyoung Lee;Hyunsoo Seo;Seunghyun Kim;Byeong Seon An;Shinwi Park;Yonggwon Jeon;Eui Chul Lee
{"title":"Quality-Based rPPG Compensation With Temporal Difference Transformer for Camera-Based Driver Monitoring","authors":"Kunyoung Lee;Hyunsoo Seo;Seunghyun Kim;Byeong Seon An;Shinwi Park;Yonggwon Jeon;Eui Chul Lee","doi":"10.1109/TITS.2024.3504605","DOIUrl":null,"url":null,"abstract":"Remote photoplethysmography (rPPG) is a method for monitoring pulse signal by utilizing a camera sensor to capture a facial video including variations in blood flow beneath the skin. Recently, rPPG advancements have enabled the measurement of an individual’s heart rate with a Root Mean Square Error (RMSE) of approximately 1.0 in controlled indoor environments. However, when applied in car dataset including driving environments, the RMSE of rPPG measurements significantly increases to over 9.07. This limitation, caused by motion-related artifacts and fluctuations in ambient illumination, becomes particularly noticeable while driving, resulting in a Percentage of Time that Error is less than 6 beats per minute (PTE6) of up to 65.1%. To address these limitations, we focus on the assessment of rPPG noise, with an emphasis on evaluating noise components within facial video and quantifying quality of the rPPG measurement. In this paper, we propose a deep learning framework that infers rPPG signal and quality based on video vision transformer. the proposed method demonstrates that the top 10% quality measurements yield PTE6 of 91.98% and 99.59% in driving and garage environments, respectively. Additionally, we introduce a quality-based rPPG compensation method that improves accuracy in driving environments by predicting rPPG quality based on noise assessment. This compensation method demonstrates superior accuracy compared to the current state-of-the-art, achieving a PTE6 of 68.24% in driving scenarios.","PeriodicalId":13416,"journal":{"name":"IEEE Transactions on Intelligent Transportation Systems","volume":"26 2","pages":"1951-1963"},"PeriodicalIF":8.4000,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Intelligent Transportation Systems","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10818975/","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
引用次数: 0

Abstract

Remote photoplethysmography (rPPG) is a method for monitoring pulse signal by utilizing a camera sensor to capture a facial video including variations in blood flow beneath the skin. Recently, rPPG advancements have enabled the measurement of an individual’s heart rate with a Root Mean Square Error (RMSE) of approximately 1.0 in controlled indoor environments. However, when applied in car dataset including driving environments, the RMSE of rPPG measurements significantly increases to over 9.07. This limitation, caused by motion-related artifacts and fluctuations in ambient illumination, becomes particularly noticeable while driving, resulting in a Percentage of Time that Error is less than 6 beats per minute (PTE6) of up to 65.1%. To address these limitations, we focus on the assessment of rPPG noise, with an emphasis on evaluating noise components within facial video and quantifying quality of the rPPG measurement. In this paper, we propose a deep learning framework that infers rPPG signal and quality based on video vision transformer. the proposed method demonstrates that the top 10% quality measurements yield PTE6 of 91.98% and 99.59% in driving and garage environments, respectively. Additionally, we introduce a quality-based rPPG compensation method that improves accuracy in driving environments by predicting rPPG quality based on noise assessment. This compensation method demonstrates superior accuracy compared to the current state-of-the-art, achieving a PTE6 of 68.24% in driving scenarios.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于时序差分变压器的基于质量的rPPG补偿用于摄像机驱动监控
远程光电容积脉搏波描记(rPPG)是一种监测脉搏信号的方法,它利用相机传感器捕捉面部视频,包括皮肤下血液流动的变化。最近,rPPG的进步使得在受控的室内环境中测量个人心率的均方根误差(RMSE)约为1.0。然而,当应用于包含驾驶环境的汽车数据集时,rPPG测量值的RMSE显著增加到9.07以上。这种限制是由运动相关的伪影和环境照明的波动引起的,在驾驶时变得特别明显,导致误差百分比每分钟少于6次(PTE6)高达65.1%。为了解决这些限制,我们将重点放在rPPG噪声的评估上,重点是评估面部视频中的噪声成分和量化rPPG测量的质量。在本文中,我们提出了一个基于视频视觉变压器的rPPG信号和质量推断的深度学习框架。该方法表明,在驾驶和车库环境中,前10%的质量测量的PTE6率分别为91.98%和99.59%。此外,我们还介绍了一种基于质量的rPPG补偿方法,该方法通过基于噪声评估预测rPPG质量来提高驾驶环境中的准确性。与目前最先进的技术相比,这种补偿方法具有更高的精度,在驾驶场景中实现了68.24%的PTE6。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Transactions on Intelligent Transportation Systems
IEEE Transactions on Intelligent Transportation Systems 工程技术-工程:电子与电气
CiteScore
14.80
自引率
12.90%
发文量
1872
审稿时长
7.5 months
期刊介绍: The theoretical, experimental and operational aspects of electrical and electronics engineering and information technologies as applied to Intelligent Transportation Systems (ITS). Intelligent Transportation Systems are defined as those systems utilizing synergistic technologies and systems engineering concepts to develop and improve transportation systems of all kinds. The scope of this interdisciplinary activity includes the promotion, consolidation and coordination of ITS technical activities among IEEE entities, and providing a focus for cooperative activities, both internally and externally.
期刊最新文献
PCD-DB: Enhancing Popular Content Dissemination by Incentivizing V2X Cooperation Among Electric Vehicles Using DAG-Based Blockchain A Predictive UAV Framework for Tracking Fast-Moving Vehicles in Dynamic Environments Occlusion-Aware Diffusion Model for Pedestrian Intention Prediction Transparent and Trustworthy Blockchain-Based Scheme for the Protection of Vehicular Soft Integrity in Shared Mobility Social-WITRAN: Multi-Modal Trajectory Prediction With Social-Aware Information Transmission
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1