首页 > 最新文献

Biomedical Engineering Letters最新文献

英文 中文
Artificial intelligence in ECG-based diagnosis of low left ventricular ejection fraction: a systematic review and meta-analysis. 人工智能在低左心室射血分数诊断中的应用:系统回顾和荟萃分析。
IF 3.2 4区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2025-05-14 eCollection Date: 2025-07-01 DOI: 10.1007/s13534-025-00479-3
Gayathiri R R, Arya Bhardwaj, R Pradeep Kumar, Bala Chakravarthy Neelapu, Kunal Pal, J Sivaraman

Detecting Left Ventricular Systolic Dysfunction (LVSD) is crucial for counteracting heart failure progression. While Electrocardiograms (ECG) are widely used, their standalone diagnostic accuracy is insufficient. Integrating Artificial Intelligence (AI) with ECG analysis offers a promising approach to enhance precision. A systematic review was conducted to assess AI-enabled ECG for LVSD detection. Of 394 initial studies, 19 qualified for the systematic review, with 17 incorporated into meta-analysis. Study quality was gauged using QUADAS-2. Univariate meta-analysis, Spearman correlation, and bivariate meta-analyses were performed, along with publication bias assessment. The pooled sensitivity and specificity for AI-enabled ECG models were 86.9% and 84.4%, respectively. Studies with an ejection fraction (EF) threshold of 35% had the highest sensitivity, while those with 50% showed lower sensitivity and specificity. A weak positive Spearman correlation was found across all studies (ρ = 0.374, p = 0.066). A strong positive correlation for externally validated studies (ρ = 0.696, p = 0.008), and a weak negative correlation for test-only studies, indicated a threshold effect. Hierarchical summary receiver operating characteristic curve showed diagnostic robustness for studies with a 40% EF threshold; however, it showed a lack of real-world generalizability for test-only studies. AI-enabled ECG models show strong diagnostic potential for severe LVSD but remain limited for mild cases. External validation is essential for robustness and generalizability. Future research should enhance diagnostic accuracy for mild LVSD and address publication bias to optimize AI-based tools.

检测左心室收缩功能障碍(LVSD)是对抗心力衰竭进展的关键。虽然心电图(ECG)被广泛使用,但其单独诊断的准确性不足。将人工智能(AI)与ECG分析相结合提供了一种很有前途的提高精度的方法。我们进行了一项系统评价,以评估人工智能心电图对LVSD的检测效果。在394项初始研究中,19项符合系统评价,17项纳入荟萃分析。使用QUADAS-2评估研究质量。进行单因素荟萃分析、Spearman相关分析和双因素荟萃分析,并进行发表偏倚评估。人工智能心电图模型的敏感性和特异性分别为86.9%和84.4%。射血分数(EF)阈值为35%的研究灵敏度最高,而阈值为50%的研究灵敏度和特异性较低。在所有研究中均发现微弱的正Spearman相关(ρ = 0.374, p = 0.066)。外部验证的研究有很强的正相关(ρ = 0.696, p = 0.008),仅测试的研究有弱的负相关,表明存在阈值效应。分级汇总受者工作特征曲线对具有40% EF阈值的研究具有诊断稳健性;然而,它显示出对仅限测试的研究缺乏现实世界的普遍性。人工智能支持的心电图模型显示出对严重LVSD的强大诊断潜力,但对轻度病例的诊断仍然有限。外部验证对于鲁棒性和泛化性至关重要。未来的研究应提高轻度LVSD的诊断准确性,解决发表偏倚问题,优化基于人工智能的工具。
{"title":"Artificial intelligence in ECG-based diagnosis of low left ventricular ejection fraction: a systematic review and meta-analysis.","authors":"Gayathiri R R, Arya Bhardwaj, R Pradeep Kumar, Bala Chakravarthy Neelapu, Kunal Pal, J Sivaraman","doi":"10.1007/s13534-025-00479-3","DOIUrl":"https://doi.org/10.1007/s13534-025-00479-3","url":null,"abstract":"<p><p>Detecting Left Ventricular Systolic Dysfunction (LVSD) is crucial for counteracting heart failure progression. While Electrocardiograms (ECG) are widely used, their standalone diagnostic accuracy is insufficient. Integrating Artificial Intelligence (AI) with ECG analysis offers a promising approach to enhance precision. A systematic review was conducted to assess AI-enabled ECG for LVSD detection. Of 394 initial studies, 19 qualified for the systematic review, with 17 incorporated into meta-analysis. Study quality was gauged using QUADAS-2. Univariate meta-analysis, Spearman correlation, and bivariate meta-analyses were performed, along with publication bias assessment. The pooled sensitivity and specificity for AI-enabled ECG models were 86.9% and 84.4%, respectively. Studies with an ejection fraction (EF) threshold of 35% had the highest sensitivity, while those with 50% showed lower sensitivity and specificity. A weak positive Spearman correlation was found across all studies (ρ = 0.374, <i>p</i> = 0.066). A strong positive correlation for externally validated studies (ρ = 0.696, <i>p</i> = 0.008), and a weak negative correlation for test-only studies, indicated a threshold effect. Hierarchical summary receiver operating characteristic curve showed diagnostic robustness for studies with a 40% EF threshold; however, it showed a lack of real-world generalizability for test-only studies. AI-enabled ECG models show strong diagnostic potential for severe LVSD but remain limited for mild cases. External validation is essential for robustness and generalizability. Future research should enhance diagnostic accuracy for mild LVSD and address publication bias to optimize AI-based tools.</p>","PeriodicalId":46898,"journal":{"name":"Biomedical Engineering Letters","volume":"15 4","pages":"661-676"},"PeriodicalIF":3.2,"publicationDate":"2025-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12229370/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144585253","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sensor arrangement strategy for effective bowel sound source localization. 有效定位肠道声源的传感器布置策略。
IF 3.2 4区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2025-05-06 eCollection Date: 2025-07-01 DOI: 10.1007/s13534-025-00476-6
Kenji Takawaki, Takeyuki Haraguchi, Takahiro Emoto

The main purpose of this study is to develop a method that can objectively evaluate the intestinal motility of patients with functional gastrointestinal disorders through non-invasive means. The research question focuses on whether the asymmetry in electronic stethoscope (ES) arrangements can enhance the accuracy of bowel sound (BS) source localization, which is crucial for detailed assessments of intestinal motility. This study introduced a new index called the angle-based asymmetry degree ([Formula: see text]), derived from the interior angles of the quadrilateral formed by the ESs, to quantitatively evaluate the asymmetry of four-ES-based arrangement patterns. We conducted simulations in an abdominal acoustic environment to compare the effects of symmetric and asymmetric ES arrangements on BS source localization accuracy. The influence of different [Formula: see text] values on localization performance was also analyzed under various signal-to-noise ratio ([Formula: see text]) conditions. The simulations revealed that BS source localization accuracy improved as the [Formula: see text] increased. Asymmetric ES arrangements significantly enhanced the localization accuracy compared to conventional symmetric arrangements, even in environments with high levels of noise. Additionally, various ES arrangement patterns corresponding to different [Formula: see text] values demonstrated improvements in localization accuracy. The study concludes that using asymmetric ES arrangements based on the newly proposed [Formula: see text] index substantially improves BS source localization accuracy. These findings suggest that asymmetry in ES placements can be a critical factor in enhancing non-invasive evaluations of intestinal motility, thereby contributing to the development of more effective BS source localization technologies. The results hold promise for practical applications in diagnosing and managing functional gastrointestinal disorders.

本研究的主要目的是开发一种能够通过非侵入性手段客观评价功能性胃肠疾病患者肠道运动的方法。研究问题集中在电子听诊器(ES)排列的不对称是否可以提高肠声(BS)源定位的准确性,这对于详细评估肠道运动至关重要。本研究引入了一种新的指标——基于角度的不对称度([公式:见文]),该指标由四种es形成的四边形的内角推导而来,用于定量评价四种es排列模式的不对称性。我们在腹部声环境中进行了模拟,比较对称和不对称ES排列对BS源定位精度的影响。分析了在不同信噪比(公式:见文)条件下,不同[公式:见文]值对定位性能的影响。仿真结果表明,BS源定位精度随着[公式:见文本]的增加而提高。与传统的对称配置相比,非对称ES配置显著提高了定位精度,即使在高噪音环境中也是如此。此外,不同的ES排列模式对应不同的[公式:见文本]值,显示了定位精度的提高。研究表明,采用基于新提出的[公式:见文本]索引的非对称ES排列,可以显著提高BS源定位精度。这些发现表明,ES放置的不对称可能是增强肠道运动的非侵入性评估的关键因素,从而有助于开发更有效的BS源定位技术。该结果有望在功能性胃肠疾病的诊断和管理中得到实际应用。
{"title":"Sensor arrangement strategy for effective bowel sound source localization.","authors":"Kenji Takawaki, Takeyuki Haraguchi, Takahiro Emoto","doi":"10.1007/s13534-025-00476-6","DOIUrl":"https://doi.org/10.1007/s13534-025-00476-6","url":null,"abstract":"<p><p>The main purpose of this study is to develop a method that can objectively evaluate the intestinal motility of patients with functional gastrointestinal disorders through non-invasive means. The research question focuses on whether the asymmetry in electronic stethoscope (ES) arrangements can enhance the accuracy of bowel sound (BS) source localization, which is crucial for detailed assessments of intestinal motility. This study introduced a new index called the angle-based asymmetry degree ([Formula: see text]), derived from the interior angles of the quadrilateral formed by the ESs, to quantitatively evaluate the asymmetry of four-ES-based arrangement patterns. We conducted simulations in an abdominal acoustic environment to compare the effects of symmetric and asymmetric ES arrangements on BS source localization accuracy. The influence of different [Formula: see text] values on localization performance was also analyzed under various signal-to-noise ratio ([Formula: see text]) conditions. The simulations revealed that BS source localization accuracy improved as the [Formula: see text] increased. Asymmetric ES arrangements significantly enhanced the localization accuracy compared to conventional symmetric arrangements, even in environments with high levels of noise. Additionally, various ES arrangement patterns corresponding to different [Formula: see text] values demonstrated improvements in localization accuracy. The study concludes that using asymmetric ES arrangements based on the newly proposed [Formula: see text] index substantially improves BS source localization accuracy. These findings suggest that asymmetry in ES placements can be a critical factor in enhancing non-invasive evaluations of intestinal motility, thereby contributing to the development of more effective BS source localization technologies. The results hold promise for practical applications in diagnosing and managing functional gastrointestinal disorders.</p>","PeriodicalId":46898,"journal":{"name":"Biomedical Engineering Letters","volume":"15 4","pages":"763-772"},"PeriodicalIF":3.2,"publicationDate":"2025-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12229448/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144585257","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Facial electromyogram-based emotion recognition for virtual reality applications using machine learning classifiers trained on posed expressions. 基于面部肌电图的虚拟现实应用情感识别,使用机器学习分类器训练姿势表情。
IF 3.2 4区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2025-05-03 eCollection Date: 2025-07-01 DOI: 10.1007/s13534-025-00477-5
Jung-Hwan Kim, Ho-Seung Cha, Chang-Hwan Im

Recognition of human emotions holds great potential for various daily-life applications. With the increasing interest in virtual reality (VR) technologies, numerous studies have proposed new approaches to integrating emotion recognition into VR environments. However, despite recent advancements, camera-based emotion-recognition technology faces critical limitations due to the physical obstruction caused by head-mounted displays (HMDs). Facial electromyography (fEMG) offers a promising alternative for human emotion-recognition in VR environments, as electrodes can be readily embedded in the padding of commercial HMD devices. However, conventional fEMG-based emotion recognition approaches, although not yet developed for VR applications, require lengthy and tedious calibration sessions. These sessions typically involve collecting fEMG data during the presentation of audio-visual stimuli for eliciting specific emotions. We trained a machine learning classifier using fEMG data acquired while users intentionally made posed facial expressions. This approach simplifies the traditionally time-consuming calibration process, making it less burdensome for users. The proposed method was validated using 20 participants who made posed facial expressions for calibration and then watched emotion-evoking video clips for validation. The results demonstrated the effectiveness of our method in classifying high- and low-valence states, achieving a macro F1-score of 88.20%. This underscores the practicality and efficiency of the proposed method. To the best of our knowledge, this is the first study to successfully build an fEMG-based emotion-recognition model using posed facial expressions. This approach paves the way for developing user-friendly interface technologies in VR-immersive environments.

人类情感识别在日常生活中有着巨大的应用潜力。随着人们对虚拟现实技术的兴趣日益浓厚,许多研究提出了将情感识别集成到虚拟现实环境中的新方法。然而,尽管最近取得了进展,但由于头戴式显示器(hmd)造成的物理障碍,基于摄像头的情感识别技术面临着严重的限制。面部肌电图(fEMG)为VR环境中的人类情绪识别提供了一个有前途的替代方案,因为电极可以很容易地嵌入商业头戴设备的填充物中。然而,传统的基于femg的情感识别方法虽然尚未开发用于VR应用,但需要冗长而繁琐的校准过程。这些课程通常包括在呈现视听刺激以引发特定情绪的过程中收集fEMG数据。我们训练了一个机器学习分类器,使用用户故意做出面部表情时获得的fEMG数据。这种方法简化了传统上耗时的校准过程,减少了用户的负担。该方法通过20名参与者的面部表情来验证,然后观看情感唤起的视频片段来验证。结果表明,我们的方法对高价态和低价态的分类是有效的,宏观f1得分为88.20%。这突出了所提出方法的实用性和有效性。据我们所知,这是第一次成功地利用摆姿势的面部表情建立基于femg的情绪识别模型的研究。这种方法为在vr沉浸式环境中开发用户友好的界面技术铺平了道路。
{"title":"Facial electromyogram-based emotion recognition for virtual reality applications using machine learning classifiers trained on posed expressions.","authors":"Jung-Hwan Kim, Ho-Seung Cha, Chang-Hwan Im","doi":"10.1007/s13534-025-00477-5","DOIUrl":"https://doi.org/10.1007/s13534-025-00477-5","url":null,"abstract":"<p><p>Recognition of human emotions holds great potential for various daily-life applications. With the increasing interest in virtual reality (VR) technologies, numerous studies have proposed new approaches to integrating emotion recognition into VR environments. However, despite recent advancements, camera-based emotion-recognition technology faces critical limitations due to the physical obstruction caused by head-mounted displays (HMDs). Facial electromyography (fEMG) offers a promising alternative for human emotion-recognition in VR environments, as electrodes can be readily embedded in the padding of commercial HMD devices. However, conventional fEMG-based emotion recognition approaches, although not yet developed for VR applications, require lengthy and tedious calibration sessions. These sessions typically involve collecting fEMG data during the presentation of audio-visual stimuli for eliciting specific emotions. We trained a machine learning classifier using fEMG data acquired while users intentionally made posed facial expressions. This approach simplifies the traditionally time-consuming calibration process, making it less burdensome for users. The proposed method was validated using 20 participants who made posed facial expressions for calibration and then watched emotion-evoking video clips for validation. The results demonstrated the effectiveness of our method in classifying high- and low-valence states, achieving a macro F1-score of 88.20%. This underscores the practicality and efficiency of the proposed method. To the best of our knowledge, this is the first study to successfully build an fEMG-based emotion-recognition model using posed facial expressions. This approach paves the way for developing user-friendly interface technologies in VR-immersive environments.</p>","PeriodicalId":46898,"journal":{"name":"Biomedical Engineering Letters","volume":"15 4","pages":"773-783"},"PeriodicalIF":3.2,"publicationDate":"2025-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12229427/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144585255","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A study for expert-informed active pulmonary nodule segmentation. 专家指导下活动性肺结节分割的研究。
IF 3.2 4区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2025-04-25 eCollection Date: 2025-07-01 DOI: 10.1007/s13534-025-00474-8
Shuangping Tan, Tong Zhang, Youfeng Deng, Zhimin Nie, Xiali Wu, Xinyue Yan, Xiaojuan Zhang, Huike Yi, Xianci Song, Jun Li

Accurate segmentation of pulmonary nodule based on computed tomography (CT) images is of great significance for the diagnosis and treatment of lung cancer. However, the current popular segmentation algorithms usually do not involve expert knowledge from radiologists, thereby carrying the risk of failing to produce generalizable and trustworthy models. In this study, we develop an expert-informed active pulmonary nodule segmentation method that iteratively optimize a deep segmentation model using an active learning scheme. The uncertainties from both intermediate segmentation results and correction inputs from radiologists are combined effectively. Interactive graph interfaces are developed to enable online corrections, significantly facilitating the integration of expert knowledge from radiologists. Evaluation results on the Luna16 dataset demonstrate that the proposed approach significantly promotes the segmentation performance of pulmonary nodules. The proposed method can effectively incorporate expert knowledge of multiple radiologists into deep segmentation algorithms, which not only promote the segmentation performance, but also enhance the validity, reliability, and generalizability of computer-aided diagnosis methods.

基于计算机断层扫描(CT)图像的肺结节准确分割对肺癌的诊断和治疗具有重要意义。然而,目前流行的分割算法通常不涉及放射科医生的专业知识,因此存在无法产生可推广和值得信赖的模型的风险。在这项研究中,我们开发了一种专家知情的活动性肺结节分割方法,该方法使用主动学习方案迭代优化深度分割模型。中间分割结果和放射科医生的校正输入的不确定性被有效地结合起来。交互式图形界面的开发,使在线更正,大大促进专家知识从放射科医师的整合。在Luna16数据集上的评估结果表明,该方法显著提高了肺结节的分割性能。该方法能有效地将多名放射科医生的专家知识整合到深度分割算法中,不仅提高了分割性能,而且提高了计算机辅助诊断方法的有效性、可靠性和泛化性。
{"title":"A study for expert-informed active pulmonary nodule segmentation.","authors":"Shuangping Tan, Tong Zhang, Youfeng Deng, Zhimin Nie, Xiali Wu, Xinyue Yan, Xiaojuan Zhang, Huike Yi, Xianci Song, Jun Li","doi":"10.1007/s13534-025-00474-8","DOIUrl":"https://doi.org/10.1007/s13534-025-00474-8","url":null,"abstract":"<p><p>Accurate segmentation of pulmonary nodule based on computed tomography (CT) images is of great significance for the diagnosis and treatment of lung cancer. However, the current popular segmentation algorithms usually do not involve expert knowledge from radiologists, thereby carrying the risk of failing to produce generalizable and trustworthy models. In this study, we develop an expert-informed active pulmonary nodule segmentation method that iteratively optimize a deep segmentation model using an active learning scheme. The uncertainties from both intermediate segmentation results and correction inputs from radiologists are combined effectively. Interactive graph interfaces are developed to enable online corrections, significantly facilitating the integration of expert knowledge from radiologists. Evaluation results on the Luna16 dataset demonstrate that the proposed approach significantly promotes the segmentation performance of pulmonary nodules. The proposed method can effectively incorporate expert knowledge of multiple radiologists into deep segmentation algorithms, which not only promote the segmentation performance, but also enhance the validity, reliability, and generalizability of computer-aided diagnosis methods.</p>","PeriodicalId":46898,"journal":{"name":"Biomedical Engineering Letters","volume":"15 4","pages":"735-748"},"PeriodicalIF":3.2,"publicationDate":"2025-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12229428/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144585251","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Advances in cardiovascular signal analysis with future directions: a review of machine learning and deep learning models for cardiovascular disease classification based on ECG, PCG, and PPG signals. 心血管信号分析的进展与未来方向:基于ECG、PCG和PPG信号的心血管疾病分类的机器学习和深度学习模型综述
IF 3.2 4区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2025-04-24 eCollection Date: 2025-07-01 DOI: 10.1007/s13534-025-00473-9
Yunendah Nur Fuadah, Ki Moo Lim

This systematic review examines the transformative impact of artificial intelligence (AI), including machine learning (ML) and deep learning (DL), on cardiovascular signal analysis, focusing on electrocardiograms (ECG), phonocardiograms (PCG), and photoplethysmograms (PPG). It evaluates state-of-the-art methodologies that enhance diagnostic accuracy and predictive analytics by leveraging AI-driven systems. A wide range of public and private datasets are assessed, with attention to their strengths and limitations in supporting cardiovascular diagnostics. Key preprocessing techniques, such as noise reduction, signal normalization, and artifact mitigation, are explored for their role in improving signal quality. The review also highlights feature extraction methods, from time-domain and frequency-domain analyses to advanced morphological and spectral approaches, which contribute to robust classifier performance. Traditional ML models, such as K-Nearest Neighbor (KNN), Support Vector Machines (SVM) and Random Forests (RF), are compared with advanced DL architectures, including Convolutional Neural Networks (CNNs), Long Short-Term Memory networks (LSTMs), and transfer learning models for detecting cardiovascular diseases. Despite these advancements, challenges such as dataset heterogeneity, preprocessing variability, and computational complexity persist, hindering clinical adoption. The review underscores the importance of large-scale, diverse datasets, multi-modal signal integration, and explainable AI to foster clinical trust and facilitate ethical deployment. By addressing these challenges, this review highlights the potential of AI to revolutionize cardiovascular healthcare through early diagnosis, wearable technology, and real-time decision support, paving the way for precision medicine and improved patient outcomes.

本系统综述研究了人工智能(AI)的变革性影响,包括机器学习(ML)和深度学习(DL),对心血管信号分析的影响,重点是心电图(ECG),心音图(PCG)和光容积描记图(PPG)。它评估最先进的方法,通过利用人工智能驱动的系统提高诊断准确性和预测分析。评估了广泛的公共和私人数据集,并注意了它们在支持心血管诊断方面的优势和局限性。关键的预处理技术,如降噪、信号归一化和伪影缓解,探讨了它们在提高信号质量方面的作用。该综述还强调了特征提取方法,从时域和频域分析到先进的形态学和频谱方法,这些方法有助于增强分类器的性能。传统的机器学习模型,如k -最近邻(KNN)、支持向量机(SVM)和随机森林(RF),与先进的深度学习架构,包括卷积神经网络(cnn)、长短期记忆网络(LSTMs)和用于检测心血管疾病的迁移学习模型进行了比较。尽管取得了这些进步,但数据集异质性、预处理可变性和计算复杂性等挑战仍然存在,阻碍了临床应用。该综述强调了大规模、多样化数据集、多模态信号集成和可解释的人工智能对于促进临床信任和促进伦理部署的重要性。通过应对这些挑战,本综述强调了人工智能通过早期诊断、可穿戴技术和实时决策支持彻底改变心血管医疗保健的潜力,为精准医疗和改善患者预后铺平了道路。
{"title":"Advances in cardiovascular signal analysis with future directions: a review of machine learning and deep learning models for cardiovascular disease classification based on ECG, PCG, and PPG signals.","authors":"Yunendah Nur Fuadah, Ki Moo Lim","doi":"10.1007/s13534-025-00473-9","DOIUrl":"https://doi.org/10.1007/s13534-025-00473-9","url":null,"abstract":"<p><p>This systematic review examines the transformative impact of artificial intelligence (AI), including machine learning (ML) and deep learning (DL), on cardiovascular signal analysis, focusing on electrocardiograms (ECG), phonocardiograms (PCG), and photoplethysmograms (PPG). It evaluates state-of-the-art methodologies that enhance diagnostic accuracy and predictive analytics by leveraging AI-driven systems. A wide range of public and private datasets are assessed, with attention to their strengths and limitations in supporting cardiovascular diagnostics. Key preprocessing techniques, such as noise reduction, signal normalization, and artifact mitigation, are explored for their role in improving signal quality. The review also highlights feature extraction methods, from time-domain and frequency-domain analyses to advanced morphological and spectral approaches, which contribute to robust classifier performance. Traditional ML models, such as K-Nearest Neighbor (KNN), Support Vector Machines (SVM) and Random Forests (RF), are compared with advanced DL architectures, including Convolutional Neural Networks (CNNs), Long Short-Term Memory networks (LSTMs), and transfer learning models for detecting cardiovascular diseases. Despite these advancements, challenges such as dataset heterogeneity, preprocessing variability, and computational complexity persist, hindering clinical adoption. The review underscores the importance of large-scale, diverse datasets, multi-modal signal integration, and explainable AI to foster clinical trust and facilitate ethical deployment. By addressing these challenges, this review highlights the potential of AI to revolutionize cardiovascular healthcare through early diagnosis, wearable technology, and real-time decision support, paving the way for precision medicine and improved patient outcomes.</p>","PeriodicalId":46898,"journal":{"name":"Biomedical Engineering Letters","volume":"15 4","pages":"619-660"},"PeriodicalIF":3.2,"publicationDate":"2025-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12229450/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144585252","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Convolutional channel modulator for transformer and LSTM networks in EEG-based emotion recognition. 基于脑电图的情感识别中变压器和LSTM网络的卷积信道调制器。
IF 3.2 4区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2025-04-21 eCollection Date: 2025-07-01 DOI: 10.1007/s13534-025-00475-7
Hyunwook Kang, Jin Woo Choi, Byung Hyung Kim

Electroencephalogram (EEG) signal is receiving much attention from recent studies since it is highly associated with intrinsic emotion. However, EEG signals contain underlying factors of variations across different sessions of the same subject, which make it difficult to learn temporal relationships between successive time steps. To disentangle invariant features, we propose a feature re-weighting mechanism on the extracted EEG features for temporal sequence modeling. Based on this method, our proposed model, called Convolutional Channel Modulator for Transformer and LSTM networks (CCMTL), extracts emotion-related inter-channel correlations using convolution operations and emphasizes important features by generating a channel attention map. This attention map is then used to perform matrix multiplication on the extracted features, which helps the subsequent Transformer to focus on important affective features. Furthermore, the sequential temporal modeling enhances the overall model's capability to understand temporal relationships both in global and sequential contexts. Experimental settings on public EEG emotion datasets demonstrate the superiority of the proposed CCMTL, surpassing six state-of-the-art models. Our code is publicly available at https://github.com/affctivai/CCMTL.

脑电图(EEG)信号与内在情绪密切相关,近年来备受关注。然而,脑电图信号包含了同一受试者不同时段的潜在变化因素,这使得学习连续时间步长之间的时间关系变得困难。为了去除不变性特征,我们提出了一种特征重加权机制,对提取的脑电特征进行时序建模。基于这种方法,我们提出的模型,称为变压器和LSTM网络的卷积信道调制器(CCMTL),使用卷积运算提取与情感相关的信道间相关性,并通过生成信道注意图来强调重要特征。然后使用此注意图对提取的特征执行矩阵乘法,这有助于后续Transformer将重点放在重要的情感特征上。此外,时序时序建模增强了整体模型在全局和时序上下文中理解时序关系的能力。在公开的EEG情绪数据集上的实验设置证明了所提出的CCMTL的优越性,超过了目前最先进的六种模型。我们的代码可以在https://github.com/affctivai/CCMTL上公开获得。
{"title":"Convolutional channel modulator for transformer and LSTM networks in EEG-based emotion recognition.","authors":"Hyunwook Kang, Jin Woo Choi, Byung Hyung Kim","doi":"10.1007/s13534-025-00475-7","DOIUrl":"https://doi.org/10.1007/s13534-025-00475-7","url":null,"abstract":"<p><p>Electroencephalogram (EEG) signal is receiving much attention from recent studies since it is highly associated with intrinsic emotion. However, EEG signals contain underlying factors of variations across different sessions of the same subject, which make it difficult to learn temporal relationships between successive time steps. To disentangle invariant features, we propose a feature re-weighting mechanism on the extracted EEG features for temporal sequence modeling. Based on this method, our proposed model, called Convolutional Channel Modulator for Transformer and LSTM networks (CCMTL), extracts emotion-related inter-channel correlations using convolution operations and emphasizes important features by generating a channel attention map. This attention map is then used to perform matrix multiplication on the extracted features, which helps the subsequent Transformer to focus on important affective features. Furthermore, the sequential temporal modeling enhances the overall model's capability to understand temporal relationships both in global and sequential contexts. Experimental settings on public EEG emotion datasets demonstrate the superiority of the proposed CCMTL, surpassing six state-of-the-art models. Our code is publicly available at https://github.com/affctivai/CCMTL.</p>","PeriodicalId":46898,"journal":{"name":"Biomedical Engineering Letters","volume":"15 4","pages":"749-761"},"PeriodicalIF":3.2,"publicationDate":"2025-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12229436/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144585254","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Diffusion-Phys: noise-robust heart rate estimation from facial videos via diffusion models. 扩散物理:基于扩散模型的面部视频噪声鲁棒心率估计。
IF 3.2 4区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2025-04-09 eCollection Date: 2025-05-01 DOI: 10.1007/s13534-025-00472-w
Yong-Hoon Jeong, Young-Seok Choi

Remote photoplethysmography (rPPG) offers significant potential for health monitoring and emotional analysis through non-contact physiological measurement from facial videos. However, noise remains a crucial challenge, limiting the generalizability of current rPPG methods. This paper introduces Diffusion-Phys, a novel framework using diffusion models for robust heart rate (HR) estimation from facial videos. Diffusion-Phys employs Multi-scale Spatial-Temporal Maps (MSTmaps) to preprocess input data and introduces Gaussian noise to simulate real-world conditions. The model is trained using a denoising network for accurate HR estimation. Experimental evaluations on the VIPL-HR, UBFC-rPPG and PURE datasets demonstrate that Diffusion-Phys achieves comparable or superior performance to state-of-the-art methods, with lower computational complexity. These results highlight the effectiveness of explicitly addressing noise through diffusion modeling, improving the reliability and generalization of non-contact physiological measurement systems.

远程光电容积脉搏波(rPPG)通过非接触的面部视频生理测量,为健康监测和情绪分析提供了巨大的潜力。然而,噪声仍然是一个关键的挑战,限制了当前rPPG方法的推广。本文介绍了一种利用扩散模型对面部视频进行鲁棒心率估计的新框架diffusion - phys。扩散物理使用多尺度时空图(MSTmaps)预处理输入数据,并引入高斯噪声来模拟现实世界的条件。该模型使用去噪网络进行训练,以获得准确的HR估计。在VIPL-HR、UBFC-rPPG和PURE数据集上的实验评估表明,Diffusion-Phys具有与最先进的方法相当或更好的性能,且计算复杂度更低。这些结果强调了通过扩散建模明确处理噪声的有效性,提高了非接触式生理测量系统的可靠性和通用性。
{"title":"Diffusion-Phys: noise-robust heart rate estimation from facial videos via diffusion models.","authors":"Yong-Hoon Jeong, Young-Seok Choi","doi":"10.1007/s13534-025-00472-w","DOIUrl":"10.1007/s13534-025-00472-w","url":null,"abstract":"<p><p>Remote photoplethysmography (rPPG) offers significant potential for health monitoring and emotional analysis through non-contact physiological measurement from facial videos. However, noise remains a crucial challenge, limiting the generalizability of current rPPG methods. This paper introduces Diffusion-Phys, a novel framework using diffusion models for robust heart rate (HR) estimation from facial videos. Diffusion-Phys employs Multi-scale Spatial-Temporal Maps (MSTmaps) to preprocess input data and introduces Gaussian noise to simulate real-world conditions. The model is trained using a denoising network for accurate HR estimation. Experimental evaluations on the VIPL-HR, UBFC-rPPG and PURE datasets demonstrate that Diffusion-Phys achieves comparable or superior performance to state-of-the-art methods, with lower computational complexity. These results highlight the effectiveness of explicitly addressing noise through diffusion modeling, improving the reliability and generalization of non-contact physiological measurement systems.</p>","PeriodicalId":46898,"journal":{"name":"Biomedical Engineering Letters","volume":"15 3","pages":"575-585"},"PeriodicalIF":3.2,"publicationDate":"2025-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12011667/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143990201","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluation of a silicone-based flexible dry electrode for measuring human bioelectrical signals. 用于人体生物电信号测量的硅基柔性干电极的评价。
IF 3.2 4区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2025-04-05 eCollection Date: 2025-05-01 DOI: 10.1007/s13534-025-00471-x
Chang-Hee Han, Seong-Uk Kim, Kyung-Soo Lim, Young-Jin Jung, Sangho Lee, Sung Hoon Kim, Han-Jeong Hwang

The development of conductive polymer-based dry electrodes with high conductivity is promising for practical applications in daily life due to their biocompatibility, flexibility, lightweight, and comfort. The objective of this study is to demonstrate the feasibility of using a novel silicone-based dry electrode for measuring various bioelectrical signals.The silicone-based electrode, manufactured using an optimized polymer matrix, combines high conductivity with flexibility, ensuring superior wearability and reliable bioelectrical signal monitoring. To evaluate its performance, we compared its impedance and flexibility with those of a commercial electrode. Additionally, its compatibility for measuring biological signals was assessed through performance comparisons across various bioelectrical signals. Fourteen healthy participants performed three experimental paradigms: (1) eyes closed and open to measure alpha electroencephalography (EEG) as well as resting-state electrocardiography (ECG), (2) eye blinking to measure electrooculography (EOG), and (3) wrist movement to measure electromyography (EMG). All bioelectrical signals were measured simultaneously using both the silicone-based dry electrode and a commercial dry electrode. The performance of the silicone-based dry electrode was evaluated by comparing the signal quality of both electrodes. The silicone-based dry electrode exhibited lower electrical impedance (39.43 kΩ on average, p = 0.0058) and greater flexibility (Young's modulus: silicone 1.51 ± 0.10 MPa vs. commercial 2.46 ± 0.38 MPa) compared to the commercial dry electrode. Overall, there were minimal differences in signal quality between the two electrodes: i) EEG (α power SNR: silicone 1.39 ± 0.34 vs. commercial 1.36 ± 0.29), ii) ECG (R-peak recall: 99.20 ± 2.50%, correlation coefficient: 0.96 ± 0.08), iii) EOG (eye blink recall: 100.00%, correlation coefficient: 0.98 ± 0.03), and iv) EMG (no significant difference in SNR values). These findings indicate that the developed electrode not only ensures superior flexibility but also maintains compatible electrical properties for measuring various bioelectrical signals.

高导电性聚合物基干电极具有生物相容性、柔韧性、轻量化和舒适性等优点,在日常生活中具有广阔的应用前景。本研究的目的是证明使用新型硅基干电极测量各种生物电信号的可行性。硅基电极使用优化的聚合物基质制造,结合了高导电性和灵活性,确保了卓越的可穿戴性和可靠的生物电信号监测。为了评估其性能,我们将其阻抗和柔韧性与商业电极进行了比较。此外,通过各种生物电信号的性能比较,评估了其测量生物信号的兼容性。14名健康参与者进行了三种实验范式:(1)闭眼和睁眼测量α脑电图(EEG)和静息状态心电图(ECG),(2)眨眼测量眼电图(EOG),(3)手腕运动测量肌电图(EMG)。使用硅基干电极和商用干电极同时测量所有生物电信号。通过比较两种电极的信号质量来评价硅基干电极的性能。与商业干电极相比,硅基干电极表现出更低的电阻抗(平均39.43 kΩ, p = 0.0058)和更大的柔韧性(杨氏模量:硅基为1.51±0.10 MPa,商用为2.46±0.38 MPa)。总的来说,两种电极之间的信号质量差异很小:1)EEG (α功率信噪比:硅胶1.39±0.34 vs.商用1.36±0.29),2)ECG (r -峰召回率:99.20±2.50%,相关系数:0.96±0.08),3)EOG(眨眼召回率:100.00%,相关系数:0.98±0.03),4)EMG(信噪比值无显著差异)。这些发现表明,所开发的电极不仅保证了优越的灵活性,而且保持了测量各种生物电信号的兼容电性能。
{"title":"Evaluation of a silicone-based flexible dry electrode for measuring human bioelectrical signals.","authors":"Chang-Hee Han, Seong-Uk Kim, Kyung-Soo Lim, Young-Jin Jung, Sangho Lee, Sung Hoon Kim, Han-Jeong Hwang","doi":"10.1007/s13534-025-00471-x","DOIUrl":"10.1007/s13534-025-00471-x","url":null,"abstract":"<p><p>The development of conductive polymer-based dry electrodes with high conductivity is promising for practical applications in daily life due to their biocompatibility, flexibility, lightweight, and comfort. The objective of this study is to demonstrate the feasibility of using a novel silicone-based dry electrode for measuring various bioelectrical signals.The silicone-based electrode, manufactured using an optimized polymer matrix, combines high conductivity with flexibility, ensuring superior wearability and reliable bioelectrical signal monitoring. To evaluate its performance, we compared its impedance and flexibility with those of a commercial electrode. Additionally, its compatibility for measuring biological signals was assessed through performance comparisons across various bioelectrical signals. Fourteen healthy participants performed three experimental paradigms: (1) eyes closed and open to measure alpha electroencephalography (EEG) as well as resting-state electrocardiography (ECG), (2) eye blinking to measure electrooculography (EOG), and (3) wrist movement to measure electromyography (EMG). All bioelectrical signals were measured simultaneously using both the silicone-based dry electrode and a commercial dry electrode. The performance of the silicone-based dry electrode was evaluated by comparing the signal quality of both electrodes. The silicone-based dry electrode exhibited lower electrical impedance (39.43 kΩ on average, <i>p</i> = 0.0058) and greater flexibility (Young's modulus: silicone 1.51 ± 0.10 MPa vs. commercial 2.46 ± 0.38 MPa) compared to the commercial dry electrode. Overall, there were minimal differences in signal quality between the two electrodes: i) EEG (α power SNR: silicone 1.39 ± 0.34 vs. commercial 1.36 ± 0.29), ii) ECG (R-peak recall: 99.20 ± 2.50%, correlation coefficient: 0.96 ± 0.08), iii) EOG (eye blink recall: 100.00%, correlation coefficient: 0.98 ± 0.03), and iv) EMG (no significant difference in SNR values). These findings indicate that the developed electrode not only ensures superior flexibility but also maintains compatible electrical properties for measuring various bioelectrical signals.</p>","PeriodicalId":46898,"journal":{"name":"Biomedical Engineering Letters","volume":"15 3","pages":"563-574"},"PeriodicalIF":3.2,"publicationDate":"2025-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12011699/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144020137","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Real-time monitoring and quantitative analysis of 3D tumor spheroids using portable cellular imaging system. 利用便携式细胞成像系统实时监测和定量分析三维肿瘤球体。
IF 3.2 4区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2025-03-28 eCollection Date: 2025-05-01 DOI: 10.1007/s13534-025-00470-y
Ji Heon Lim, Ji Wook Choi, Na Yeon Kim, Taewook Kang, Bong Geun Chung

Three-dimensional (3D) tumor spheroid models closely mimic in vivo tumor environment and play a vital role in studying oncological research. Despite their significance, the existing methods for analyzing 3D tumor spheroids often suffer from limitations, including low throughput, high cost, and insufficient resolution. To address these challenges, we developed a portable imaging system for the real-time sensing and quantitative analysis of the 3D tumor spheroids. The system integrated the seamless workflow of spheroid generation, cell morphology tracking, and drug screening. The spheroid generation was successfully characterized using MCF-7 breast cancer cells by optimizing cell concentration (5-20 × 106 cells/mL), incubation time (24-96 h) and microwell diameter (400-600 μm). A custom-written algorithm was developed for automated analysis of spheroids, exhibiting high sensitivity (98.99%) and specificity (98.21%). Confusion matrices and receiver operating characteristic curve analysis further confirmed the robustness of the algorithm with an area under the curve value of 93.75% and an equal error rate of 0.79%. Following the characterization, the real-time sensing of spheroid generation and the response of spheroids to drug treatment were successfully demonstrated. Furthermore, the live/dead assays with chemotherapy provided a detailed insight into the efficacy and cytotoxic effects of the drug, demonstrating a significant dose-dependent decrease in a spheroid viability. Therefore, our system offers considerable potential for enhancing drug development processes and personalized treatment strategies, thereby contributing to more effective cancer therapies.

Supplementary information: The online version contains supplementary material available at 10.1007/s13534-025-00470-y.

三维肿瘤球体模型紧密地模拟了体内肿瘤环境,在肿瘤研究中起着至关重要的作用。尽管具有重要意义,但现有的分析三维肿瘤球体的方法往往存在局限性,包括低通量、高成本和分辨率不足。为了解决这些挑战,我们开发了一种便携式成像系统,用于实时传感和3D肿瘤球体的定量分析。该系统集成了球体生成、细胞形态跟踪和药物筛选的无缝工作流程。通过优化细胞浓度(5-20 × 106细胞/mL)、孵育时间(24-96 h)和微孔直径(400-600 μm),成功表征了MCF-7乳腺癌细胞的球形生成。该算法具有较高的灵敏度(98.99%)和特异性(98.21%)。混淆矩阵和接收者工作特征曲线分析进一步证实了算法的鲁棒性,曲线下面积为93.75%,错误率为0.79%。根据表征,球体生成的实时传感和球体对药物治疗的响应被成功演示。此外,化疗的活/死试验提供了药物疗效和细胞毒性作用的详细见解,证明了球体活力的显着剂量依赖性降低。因此,我们的系统为加强药物开发过程和个性化治疗策略提供了相当大的潜力,从而有助于更有效的癌症治疗。补充信息:在线版本包含补充资料,提供地址为10.1007/s13534-025-00470-y。
{"title":"Real-time monitoring and quantitative analysis of 3D tumor spheroids using portable cellular imaging system.","authors":"Ji Heon Lim, Ji Wook Choi, Na Yeon Kim, Taewook Kang, Bong Geun Chung","doi":"10.1007/s13534-025-00470-y","DOIUrl":"10.1007/s13534-025-00470-y","url":null,"abstract":"<p><p>Three-dimensional (3D) tumor spheroid models closely mimic in vivo tumor environment and play a vital role in studying oncological research. Despite their significance, the existing methods for analyzing 3D tumor spheroids often suffer from limitations, including low throughput, high cost, and insufficient resolution. To address these challenges, we developed a portable imaging system for the real-time sensing and quantitative analysis of the 3D tumor spheroids. The system integrated the seamless workflow of spheroid generation, cell morphology tracking, and drug screening. The spheroid generation was successfully characterized using MCF-7 breast cancer cells by optimizing cell concentration (5-20 × 10<sup>6</sup> cells/mL), incubation time (24-96 h) and microwell diameter (400-600 μm). A custom-written algorithm was developed for automated analysis of spheroids, exhibiting high sensitivity (98.99%) and specificity (98.21%). Confusion matrices and receiver operating characteristic curve analysis further confirmed the robustness of the algorithm with an area under the curve value of 93.75% and an equal error rate of 0.79%. Following the characterization, the real-time sensing of spheroid generation and the response of spheroids to drug treatment were successfully demonstrated. Furthermore, the live/dead assays with chemotherapy provided a detailed insight into the efficacy and cytotoxic effects of the drug, demonstrating a significant dose-dependent decrease in a spheroid viability. Therefore, our system offers considerable potential for enhancing drug development processes and personalized treatment strategies, thereby contributing to more effective cancer therapies.</p><p><strong>Supplementary information: </strong>The online version contains supplementary material available at 10.1007/s13534-025-00470-y.</p>","PeriodicalId":46898,"journal":{"name":"Biomedical Engineering Letters","volume":"15 3","pages":"549-561"},"PeriodicalIF":3.2,"publicationDate":"2025-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12011688/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144041732","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A review of hybrid EEG-based multimodal human-computer interfaces using deep learning: applications, advances, and challenges. 基于脑电图的混合多模态人机界面的深度学习综述:应用、进展和挑战。
IF 3.2 4区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2025-03-22 eCollection Date: 2025-07-01 DOI: 10.1007/s13534-025-00469-5
Hyung-Tak Lee, Miseon Shim, Xianghong Liu, Hye-Ran Cheon, Sang-Gyu Kim, Chang-Hee Han, Han-Jeong Hwang

Human-computer interaction (HCI) focuses on designing efficient and intuitive interactions between humans and computer systems. Recent advancements have utilized multimodal approaches, such as electroencephalography (EEG)-based systems combined with other biosignals, along with deep learning to enhance performance and reliability. However, no systematic review has consolidated findings on EEG-based multimodal HCI systems. This review examined 124 studies published from 2016 to 2024, retrieved from the Web of Science database, focusing on hybrid EEG-based multimodal HCI systems employing deep learning. The keywords used for evaluation were as follows: 'Deep Learning' AND 'EEG' AND ('fNIRS' OR 'NIRS' OR 'MEG' OR 'fMRI' OR 'EOG' OR 'EMG' OR 'ECG' OR 'PPG' OR 'GSR'). Main topics explored are: (1) types of biosignals used with EEG, (2) neural network architectures, (3) fusion strategies, (4) system performance, and (5) target applications. Frequently paired signals, such as EOG, EMG, and fNIRS, effectively complement EEG by addressing its limitations. Convolutional neural networks are extensively employed for spatio-temporal-spectral feature extraction, with early and intermediate fusion strategies being the most commonly used. Applications, such as sleep stage classification, emotion recognition, and mental state decoding, have shown notable performance improvements. Despite these advancements, challenges remain, including the lack of real-time online systems, difficulties in signal synchronization, limited data availability, and insufficient explainable AI (XAI) methods to interpret signal interactions. Emerging solutions, such as portable systems, lightweight deep learning models, and data augmentation techniques, offer promising pathways to address these issues. This review highlights the potential of EEG-based multimodal HCI systems and emphasizes the need for advancements in real-time interaction, fusion algorithms, and XAI to enhance their adaptability, interpretability, and reliability.

人机交互(HCI)侧重于设计人与计算机系统之间有效和直观的交互。最近的进展是利用多模式方法,例如基于脑电图(EEG)的系统与其他生物信号相结合,以及深度学习来提高性能和可靠性。然而,目前还没有系统性的综述对基于脑电图的多模态HCI系统的研究结果进行整合。本综述从Web of Science数据库中检索了2016年至2024年发表的124项研究,重点关注采用深度学习的基于脑电图的混合多模态HCI系统。用于评估的关键词如下:“深度学习”和“脑电图”和(“fNIRS”或“NIRS”或“MEG”或“fMRI”或“EOG”或“EMG”或“ECG”或“PPG”或“GSR”)。主要探讨的主题有:(1)脑电图使用的生物信号类型,(2)神经网络架构,(3)融合策略,(4)系统性能,以及(5)目标应用。频繁配对的信号,如EOG、EMG和fNIRS,通过解决EEG的局限性,有效地补充了EEG。卷积神经网络广泛用于时空光谱特征提取,其中早期和中期融合策略是最常用的。睡眠阶段分类、情绪识别和精神状态解码等应用已经显示出显著的性能改善。尽管取得了这些进步,但挑战依然存在,包括缺乏实时在线系统、信号同步困难、数据可用性有限以及解释信号相互作用的可解释人工智能(XAI)方法不足。新兴的解决方案,如便携式系统、轻量级深度学习模型和数据增强技术,为解决这些问题提供了有希望的途径。这篇综述强调了基于脑电图的多模态HCI系统的潜力,并强调需要在实时交互、融合算法和XAI方面取得进展,以增强其适应性、可解释性和可靠性。
{"title":"A review of hybrid EEG-based multimodal human-computer interfaces using deep learning: applications, advances, and challenges.","authors":"Hyung-Tak Lee, Miseon Shim, Xianghong Liu, Hye-Ran Cheon, Sang-Gyu Kim, Chang-Hee Han, Han-Jeong Hwang","doi":"10.1007/s13534-025-00469-5","DOIUrl":"https://doi.org/10.1007/s13534-025-00469-5","url":null,"abstract":"<p><p>Human-computer interaction (HCI) focuses on designing efficient and intuitive interactions between humans and computer systems. Recent advancements have utilized multimodal approaches, such as electroencephalography (EEG)-based systems combined with other biosignals, along with deep learning to enhance performance and reliability. However, no systematic review has consolidated findings on EEG-based multimodal HCI systems. This review examined 124 studies published from 2016 to 2024, retrieved from the Web of Science database, focusing on hybrid EEG-based multimodal HCI systems employing deep learning. The keywords used for evaluation were as follows: 'Deep Learning' AND 'EEG' AND ('fNIRS' OR 'NIRS' OR 'MEG' OR 'fMRI' OR 'EOG' OR 'EMG' OR 'ECG' OR 'PPG' OR 'GSR'). Main topics explored are: (1) types of biosignals used with EEG, (2) neural network architectures, (3) fusion strategies, (4) system performance, and (5) target applications. Frequently paired signals, such as EOG, EMG, and fNIRS, effectively complement EEG by addressing its limitations. Convolutional neural networks are extensively employed for spatio-temporal-spectral feature extraction, with early and intermediate fusion strategies being the most commonly used. Applications, such as sleep stage classification, emotion recognition, and mental state decoding, have shown notable performance improvements. Despite these advancements, challenges remain, including the lack of real-time online systems, difficulties in signal synchronization, limited data availability, and insufficient explainable AI (XAI) methods to interpret signal interactions. Emerging solutions, such as portable systems, lightweight deep learning models, and data augmentation techniques, offer promising pathways to address these issues. This review highlights the potential of EEG-based multimodal HCI systems and emphasizes the need for advancements in real-time interaction, fusion algorithms, and XAI to enhance their adaptability, interpretability, and reliability.</p>","PeriodicalId":46898,"journal":{"name":"Biomedical Engineering Letters","volume":"15 4","pages":"587-618"},"PeriodicalIF":3.2,"publicationDate":"2025-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12229323/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144585250","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Biomedical Engineering Letters
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1