Non-contact rPPG-based human status assessment via a spatial–temporal attention feature fusion network with anti-aliasing

IF 8.2 1区 计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Computers in Industry Pub Date : 2025-02-01 DOI:10.1016/j.compind.2024.104227
Qiwei Xue , Xi Zhang , Yuchong Zhang , Amin Hekmatmanesh , Huapeng Wu , Yuntao Song , Yong Cheng
{"title":"Non-contact rPPG-based human status assessment via a spatial–temporal attention feature fusion network with anti-aliasing","authors":"Qiwei Xue ,&nbsp;Xi Zhang ,&nbsp;Yuchong Zhang ,&nbsp;Amin Hekmatmanesh ,&nbsp;Huapeng Wu ,&nbsp;Yuntao Song ,&nbsp;Yong Cheng","doi":"10.1016/j.compind.2024.104227","DOIUrl":null,"url":null,"abstract":"<div><div>Remote Photoplethysmography (rPPG) is a cost-effective and non-contact technology that enables real-time monitoring of physiological status by extracting vital information such as heart rate (HR). This capability enables the assessment of fatigue and stress, helping to prevent accidents by identifying risky conditions early. Continuous monitoring with rPPG reduces operational risks, contributing to safer industrial and medical environments. However, the performance of rPPG is challenged by complex backgrounds and facial motions in industrial environments, which complicates feature extraction. To address these challenges, this paper proposes a spatial–temporal attention feature fusion network with anti-aliasing (ST-ASENet) for human status assessment. The ST-ASENet encodes spatial–temporal facial signals from multiple regions of interest (ROI) and enhances feature extraction through the attention mechanism. The network integrates anti-aliasing by low-pass filtering during the downsampling process to improve the accuracy of rPPG signals in complex environments. It calculates HR, respiratory rate (RR), and heart rate variability (HRV) for status evaluation. Additionally, the Robotics Operator Factors Assessment (ROFA) dataset is introduced, featuring diverse individuals and environments to improve the robustness of ST-ASENet. Experimental results demonstrate that ST-ASENet outperforms state-of-the-art methods in HR estimation and shows effectiveness across various industrial scenarios. The proposed method fosters operational efficiency and a data-driven approach to human-centric safety, making rPPG invaluable in modern, health-focused workplaces.</div></div>","PeriodicalId":55219,"journal":{"name":"Computers in Industry","volume":"165 ","pages":"Article 104227"},"PeriodicalIF":8.2000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in Industry","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0166361524001556","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

Remote Photoplethysmography (rPPG) is a cost-effective and non-contact technology that enables real-time monitoring of physiological status by extracting vital information such as heart rate (HR). This capability enables the assessment of fatigue and stress, helping to prevent accidents by identifying risky conditions early. Continuous monitoring with rPPG reduces operational risks, contributing to safer industrial and medical environments. However, the performance of rPPG is challenged by complex backgrounds and facial motions in industrial environments, which complicates feature extraction. To address these challenges, this paper proposes a spatial–temporal attention feature fusion network with anti-aliasing (ST-ASENet) for human status assessment. The ST-ASENet encodes spatial–temporal facial signals from multiple regions of interest (ROI) and enhances feature extraction through the attention mechanism. The network integrates anti-aliasing by low-pass filtering during the downsampling process to improve the accuracy of rPPG signals in complex environments. It calculates HR, respiratory rate (RR), and heart rate variability (HRV) for status evaluation. Additionally, the Robotics Operator Factors Assessment (ROFA) dataset is introduced, featuring diverse individuals and environments to improve the robustness of ST-ASENet. Experimental results demonstrate that ST-ASENet outperforms state-of-the-art methods in HR estimation and shows effectiveness across various industrial scenarios. The proposed method fosters operational efficiency and a data-driven approach to human-centric safety, making rPPG invaluable in modern, health-focused workplaces.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于特征融合嵌入抗混叠的非接触rppg工业人体状态评估
远程光电容积脉搏波描记(rPPG)是一种具有成本效益的非接触式技术,可以通过提取心率(HR)等重要信息来实时监测生理状态。这种能力可以评估疲劳和压力,帮助通过早期识别危险情况来防止事故的发生。使用rPPG进行持续监测可降低操作风险,有助于提高工业和医疗环境的安全性。然而,rPPG的性能受到工业环境中复杂背景和面部运动的挑战,这使得特征提取变得复杂。为了解决这些问题,本文提出了一种用于人类状态评估的具有抗混叠的时空注意力特征融合网络(ST-ASENet)。ST-ASENet对来自多个感兴趣区域(ROI)的时空面部信号进行编码,并通过注意机制增强特征提取。该网络在下采样过程中通过低通滤波实现抗混叠,提高了rPPG信号在复杂环境下的精度。它计算HR、呼吸率(RR)和心率变异性(HRV)来评估状态。此外,引入了机器人操作员因素评估(ROFA)数据集,具有不同的个体和环境,以提高ST-ASENet的鲁棒性。实验结果表明,ST-ASENet在人力资源估计方面优于最先进的方法,并在各种工业场景中显示出有效性。拟议的方法提高了操作效率和数据驱动的方法,以实现以人为本的安全,使rPPG在以健康为重点的现代工作场所中变得非常宝贵。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Computers in Industry
Computers in Industry 工程技术-计算机:跨学科应用
CiteScore
18.90
自引率
8.00%
发文量
152
审稿时长
22 days
期刊介绍: The objective of Computers in Industry is to present original, high-quality, application-oriented research papers that: • Illuminate emerging trends and possibilities in the utilization of Information and Communication Technology in industry; • Establish connections or integrations across various technology domains within the expansive realm of computer applications for industry; • Foster connections or integrations across diverse application areas of ICT in industry.
期刊最新文献
Intelligent chatter detection in high-speed milling using successive variational mode decomposition and a multi-channel feature fusion network A decentralised approach to cyber-physical systems as a service: Managing shared access worldwide through blockchain standards On design of cognitive situation-adaptive autonomous mobile robotic applications Editorial Board Adaptive fault diagnosis of machining processes enabled by hybrid deep learning and incremental transfer learning
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1