Heart Rate Estimation from RGB Facial Videos Using Robust Face Demarcation and VMD

Arya Deo Mehta, H. Sharma
{"title":"Heart Rate Estimation from RGB Facial Videos Using Robust Face Demarcation and VMD","authors":"Arya Deo Mehta, H. Sharma","doi":"10.1109/NCC52529.2021.9530067","DOIUrl":null,"url":null,"abstract":"The recent studies suggest the feasibility of accessing crucial health parameters through contactless means with an RGB camera placed at a distance. As high-quality RGB cameras are getting more cost-effective due to the drastic evolution in imaging technology, the camera-based health monitoring is evoking a considerable interest among researchers. This development may provide a better alternative to the conventional contact-based methods, as it promises a convenient and contactless long term vital sign monitoring solution that doesn't restrict personal mobility. This paper introduces an effective approach towards monitoring heart rate (HR) from facial videos using an RGB camera in wild practical scenarios. The proposed approach introduces the face symmetry-based quality scoring, which is an essential step to ensure quality face detection and avoid false face detections in videos captured in a practical scenario. Further, steps such as feature points generation for optimum masking and variational mode decomposition (VMD) based filtering assist in obtaining a signal dominated mainly by the HR component. Two publicly available datasets comprising the video signals at different frame rates collected from the subjects with diverse ethnicities and skin tones are used to access the performance of the technique. The proposed approach achieved a mean absolute error of 6.58 beats per minute (BPM) on the COHFACE (Good illumination) dataset class, 9.11 BPM on the COHFACE (Bad illumination) dataset class and 6.37 BPM on the DEAP dataset class outperforming some of the state-of-art methods affirming its effectiveness in the estimation of HR in more realistic scenarios.","PeriodicalId":414087,"journal":{"name":"2021 National Conference on Communications (NCC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 National Conference on Communications (NCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NCC52529.2021.9530067","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

Abstract

The recent studies suggest the feasibility of accessing crucial health parameters through contactless means with an RGB camera placed at a distance. As high-quality RGB cameras are getting more cost-effective due to the drastic evolution in imaging technology, the camera-based health monitoring is evoking a considerable interest among researchers. This development may provide a better alternative to the conventional contact-based methods, as it promises a convenient and contactless long term vital sign monitoring solution that doesn't restrict personal mobility. This paper introduces an effective approach towards monitoring heart rate (HR) from facial videos using an RGB camera in wild practical scenarios. The proposed approach introduces the face symmetry-based quality scoring, which is an essential step to ensure quality face detection and avoid false face detections in videos captured in a practical scenario. Further, steps such as feature points generation for optimum masking and variational mode decomposition (VMD) based filtering assist in obtaining a signal dominated mainly by the HR component. Two publicly available datasets comprising the video signals at different frame rates collected from the subjects with diverse ethnicities and skin tones are used to access the performance of the technique. The proposed approach achieved a mean absolute error of 6.58 beats per minute (BPM) on the COHFACE (Good illumination) dataset class, 9.11 BPM on the COHFACE (Bad illumination) dataset class and 6.37 BPM on the DEAP dataset class outperforming some of the state-of-art methods affirming its effectiveness in the estimation of HR in more realistic scenarios.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于鲁棒人脸分割和VMD的RGB人脸视频心率估计
最近的研究表明,通过放置在远处的RGB相机通过非接触式方式获取关键健康参数的可行性。随着成像技术的飞速发展,高质量的RGB摄像机越来越具有成本效益,基于摄像机的健康监测引起了研究人员的极大兴趣。这一发展可能为传统的基于接触的方法提供更好的替代方案,因为它承诺提供一种方便和非接触的长期生命体征监测解决方案,不会限制个人的行动能力。本文介绍了一种在野外实际场景中使用RGB相机从面部视频中监测心率的有效方法。该方法引入了基于人脸对称性的质量评分,这是确保在实际场景中捕获的视频中进行高质量人脸检测和避免虚假人脸检测的关键步骤。此外,用于最佳掩蔽的特征点生成和基于变分模态分解(VMD)的滤波等步骤有助于获得主要由HR分量主导的信号。使用两个公开可用的数据集,包括从不同种族和肤色的受试者收集的不同帧速率的视频信号,以访问该技术的性能。所提出的方法在COHFACE(光照良好)数据集类上的平均绝对误差为6.58次/分钟(BPM),在COHFACE(光照不足)数据集类上的平均绝对误差为9.11次/分钟,在DEAP数据集类上的平均绝对误差为6.37次/分钟,优于一些最先进的方法,证实了其在更现实的场景中估计HR的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Biomedical Image Retrieval using Muti-Scale Local Bit-plane Arbitrary Shaped Patterns Forensics of Decompressed JPEG Color Images Based on Chroma Subsampling Optimized Bio-inspired Spiking Neural Models based Anatomical and Functional Neurological Image Fusion in NSST Domain Improved Hankel Norm Criterion for Interfered Nonlinear Digital Filters Subjected to Hardware Constraints The Capacity of Photonic Erasure Channels with Detector Dead Times
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1