TranSenseFusers: A temporal CNN-Transformer neural network family for explainable PPG-based stress detection

IF 4.9 2区 医学 Q1 ENGINEERING, BIOMEDICAL Biomedical Signal Processing and Control Pub Date : 2025-04-01 Epub Date: 2024-12-03 DOI:10.1016/j.bspc.2024.107248
Panagiotis Kasnesis , Christos Chatzigeorgiou , Michalis Feidakis , Álvaro Gutiérrez , Charalampos Z. Patrikakis
{"title":"TranSenseFusers: A temporal CNN-Transformer neural network family for explainable PPG-based stress detection","authors":"Panagiotis Kasnesis ,&nbsp;Christos Chatzigeorgiou ,&nbsp;Michalis Feidakis ,&nbsp;Álvaro Gutiérrez ,&nbsp;Charalampos Z. Patrikakis","doi":"10.1016/j.bspc.2024.107248","DOIUrl":null,"url":null,"abstract":"<div><div>Stress is a common everyday emotional state in modern society contributing to both physical and mental illnesses. Thus, detecting and managing the degree of stress is crucial to improve well-being. Wearable devices equipped with biosensors, such as PhotoPlethysmoGraphy (PPG), can measure reliably a person’s affective state. However, PPG-based approaches suffer from the presence of Motion Artifacts (MA) affecting their overall performance. Classical machine learning and deep learning approaches have been proposed over the years for PPG-based stress detection, exploiting signal processing techniques to remove the recorded noise, but lack explainability or their performance fails to generalize across subjects. In the current work, we present a novel architecture, <em>TranSenseFuser</em> comprised of temporal convolutions followed by feature-level or sequence-level multi-head attention to improve sensor fusion’s effectiveness and exploit the provided attention maps as a form of explainability. The developed models are evaluated on highly benchmarked public dataset, namely WESAD, achieving state-of-the-art results (up to 98.46% accuracy and 97.03% F1-score) using different window sizes and cross-validation set-ups. Moreover, we demonstrate the explainability of the model towards filtering out the motion artifacts by visualizing the obtained attention maps and quantify the performance of this artifact segmentation feature in a zeros-shot manner.</div></div>","PeriodicalId":55362,"journal":{"name":"Biomedical Signal Processing and Control","volume":"102 ","pages":"Article 107248"},"PeriodicalIF":4.9000,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Signal Processing and Control","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1746809424013065","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/3 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0

Abstract

Stress is a common everyday emotional state in modern society contributing to both physical and mental illnesses. Thus, detecting and managing the degree of stress is crucial to improve well-being. Wearable devices equipped with biosensors, such as PhotoPlethysmoGraphy (PPG), can measure reliably a person’s affective state. However, PPG-based approaches suffer from the presence of Motion Artifacts (MA) affecting their overall performance. Classical machine learning and deep learning approaches have been proposed over the years for PPG-based stress detection, exploiting signal processing techniques to remove the recorded noise, but lack explainability or their performance fails to generalize across subjects. In the current work, we present a novel architecture, TranSenseFuser comprised of temporal convolutions followed by feature-level or sequence-level multi-head attention to improve sensor fusion’s effectiveness and exploit the provided attention maps as a form of explainability. The developed models are evaluated on highly benchmarked public dataset, namely WESAD, achieving state-of-the-art results (up to 98.46% accuracy and 97.03% F1-score) using different window sizes and cross-validation set-ups. Moreover, we demonstrate the explainability of the model towards filtering out the motion artifacts by visualizing the obtained attention maps and quantify the performance of this artifact segmentation feature in a zeros-shot manner.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
TranSenseFusers:一个时序CNN-Transformer神经网络家族,用于可解释的基于ppg的应力检测
压力是现代社会中常见的日常情绪状态,会导致身体和精神疾病。因此,检测和管理压力的程度是提高幸福感的关键。配备生物传感器的可穿戴设备,如光电容积脉搏描记仪(PPG),可以可靠地测量一个人的情感状态。然而,基于ppg的方法受到运动伪影(MA)影响其整体性能的影响。多年来,经典的机器学习和深度学习方法已经被提出用于基于ppg的应力检测,利用信号处理技术去除记录的噪声,但缺乏可解释性,或者它们的性能无法在受试者中推广。在目前的工作中,我们提出了一种新的架构,TranSenseFuser由时间卷积和特征级或序列级多头注意组成,以提高传感器融合的有效性,并利用所提供的注意图作为可解释性的一种形式。开发的模型在高度基准化的公共数据集(即WESAD)上进行评估,使用不同的窗口大小和交叉验证设置获得了最先进的结果(高达98.46%的准确率和97.03%的f1分数)。此外,我们通过可视化获得的注意图来证明该模型对过滤掉运动伪影的可解释性,并以零镜头方式量化该伪影分割特征的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Biomedical Signal Processing and Control
Biomedical Signal Processing and Control 工程技术-工程:生物医学
CiteScore
9.80
自引率
13.70%
发文量
822
审稿时长
4 months
期刊介绍: Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management. Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.
期刊最新文献
Calibration-based continuous blood pressure estimation using pulse-derived signals and feature normalization PPG-CAGAN: A PPG signal with coronary artery disease synthesis method based on generative adversarial learning ATSF-SFL: Adaptive spatiotemporal feature fusion and split federated learning framework based on rs-fMRI for multi-center brain disease diagnosis Assessment and characterization of the impact of pulmonary pathology on flow-induced acoustics using computational fluid dynamics A watermarking-based multimodal healthcare data protection using blockchain and deep learning
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1