首页 > 最新文献

IEEE Transactions on Biomedical Engineering最新文献

英文 中文
Machine Learning Methods to Track Dynamic Facial Function in Facial Palsy. 跟踪面瘫患者动态面部功能的机器学习方法。
IF 4.5 2区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2025-11-01 DOI: 10.1109/TBME.2025.3567984
Akshita A Rao, Jacqueline J Greene, Todd P Coleman

Objective: For patients with facial palsy, the wait for return of facial function and resulting vision risk from poor eye closure, difficulty speaking and eating from flaccid oral sphincter muscles, and psychological morbidity from the inability to smile or express emotions can be devastating. There are limited methods to assess ongoing facial nerve regeneration: clinicians rely on subjective descriptions, imprecise scales, and static photographs to evaluate facial functional recovery. We propose a more precise evaluation of dynamic facial function through video-based machine learning analysis to facilitate a better understanding of the sometimes subtle onset of facial nerve recovery and improve guidance for facial reanimation surgery.

Methods: We present machine learning methods employing likelihood ratio tests, optimal transport theory, and Mahalanobis distances to: 1) assess the use of defined facial landmarks for binary classification of different facial palsy types; 2) identify regions of asymmetry and potential palsy during specific facial cues; and 3) quantify palsy severity and map it directly to widely used clinical scores, offering clinicians an objective way to assess facial nerve function.

Results: Our results demonstrate that video analysis provides a significantly more accurate and detailed assessment of facial movements than previously reported.

Conclusions: Our work allows for precise classification of facial palsy types, identification of asymmetric regions, and assessment of palsy severity.

Significance: This project enables clinicians to have more accurate and timely information to make decisions for facial reanimation surgery, which will have drastic consequences on the quality of life for affected patients.

目的:对于面瘫患者来说,闭眼不良、口腔括约肌松弛导致说话和进食困难、无法微笑或表达情绪导致的心理疾病等面瘫功能恢复和视力风险的等待是毁灭性的。评估正在进行的面神经再生的方法有限:临床医生依靠主观描述,不精确的尺度和静态照片来评估面部功能恢复。我们建议通过基于视频的机器学习分析对动态面部功能进行更精确的评估,以促进更好地理解面神经恢复有时微妙的开始,并改善面部再生手术的指导。方法:我们提出了采用似然比检验、最优传输理论和马氏距离的机器学习方法:1)评估定义面部标志对不同面瘫类型二值分类的使用;2)在特定的面部线索中识别不对称区域和潜在的麻痹;3)量化麻痹严重程度,并将其直接映射到广泛使用的临床评分,为临床医生提供客观评估面神经功能的方法。结果:我们的研究结果表明,视频分析提供了比以前报道的更准确和详细的面部运动评估。结论:我们的工作允许面瘫类型的精确分类,识别不对称区域,和评估麻痹的严重程度。意义:本项目使临床医生有更准确、及时的信息来决定是否进行面部再生手术,这将对影响患者的生活质量产生重大影响。
{"title":"Machine Learning Methods to Track Dynamic Facial Function in Facial Palsy.","authors":"Akshita A Rao, Jacqueline J Greene, Todd P Coleman","doi":"10.1109/TBME.2025.3567984","DOIUrl":"10.1109/TBME.2025.3567984","url":null,"abstract":"<p><strong>Objective: </strong>For patients with facial palsy, the wait for return of facial function and resulting vision risk from poor eye closure, difficulty speaking and eating from flaccid oral sphincter muscles, and psychological morbidity from the inability to smile or express emotions can be devastating. There are limited methods to assess ongoing facial nerve regeneration: clinicians rely on subjective descriptions, imprecise scales, and static photographs to evaluate facial functional recovery. We propose a more precise evaluation of dynamic facial function through video-based machine learning analysis to facilitate a better understanding of the sometimes subtle onset of facial nerve recovery and improve guidance for facial reanimation surgery.</p><p><strong>Methods: </strong>We present machine learning methods employing likelihood ratio tests, optimal transport theory, and Mahalanobis distances to: 1) assess the use of defined facial landmarks for binary classification of different facial palsy types; 2) identify regions of asymmetry and potential palsy during specific facial cues; and 3) quantify palsy severity and map it directly to widely used clinical scores, offering clinicians an objective way to assess facial nerve function.</p><p><strong>Results: </strong>Our results demonstrate that video analysis provides a significantly more accurate and detailed assessment of facial movements than previously reported.</p><p><strong>Conclusions: </strong>Our work allows for precise classification of facial palsy types, identification of asymmetric regions, and assessment of palsy severity.</p><p><strong>Significance: </strong>This project enables clinicians to have more accurate and timely information to make decisions for facial reanimation surgery, which will have drastic consequences on the quality of life for affected patients.</p>","PeriodicalId":13245,"journal":{"name":"IEEE Transactions on Biomedical Engineering","volume":"PP ","pages":"3359-3373"},"PeriodicalIF":4.5,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12584918/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144002848","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Conditional Autonomy in Robot-Assisted Transbronchial Interventions. 机器人辅助经支气管介入治疗中的条件自主。
IF 4.5 2区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2025-11-01 DOI: 10.1109/TBME.2025.3565915
Artur Banach, Fumitaro Masaki, Lambros Athanasiou, Franklin King, Hussein Kharroubi, Bassel Tfayli, Hisashi Tsukada, Yolonda Colson, Nobuhiko Hata

Lung cancer is one of the leading causes of cancer-related deaths, and accurate staging is critical for determining the appropriate treatment. Robotic Navigation Bronchoscopy has shown advantages over traditional manual procedures, offering benefits in safety, efficiency, and accessibility. Although there is ongoing discussion regarding autonomous RNB, there is limited focus on the autonomy in advancing the bronchoscope. In this study, we introduce a novel method for conditional autonomy in advancing and aligning a robotic bronchoscope, which was validated in vitro, ex vivo, and in vivo. This conditional autonomy utilizes a monoscopic bronchoscopic view as input, with operators guiding the system by specifying the next airway to enter at branching points. The reachability of target lesions using this conditional autonomy was 73.3% in the phantom study and 77.5% in the ex vivo study. Statistical significance was found in success rates between bifurcations and trifurcations (p = 0.03) and across lobe segments (p = 0.005). The presence of breathing motion did not affect lesion reachability or the success of turns at branching points in the ex vivo studies. In the in vivo study, when comparing conditional automation to human-operated navigation, the conditional automation took less time to reach the target lesions than human operators. The median time for passing each bifurcation was 2.5 seconds for human operators and 1.3 seconds for conditional automation. By improving precision and consistency in tissue sampling, this technology could redefine the standard of care for lung cancer patients, leading to more accurate diagnoses and therapies.

肺癌是癌症相关死亡的主要原因之一,准确的分期对于确定适当的治疗至关重要。机器人导航支气管镜检查显示出优于传统人工操作的优势,在安全性、效率和可及性方面具有优势。尽管关于自主支气管镜的讨论正在进行中,但对自主推进支气管镜的关注有限。在这项研究中,我们介绍了一种新的方法来实现机器人支气管镜的条件自主推进和对准,并在体外、离体和体内进行了验证。这种条件自主利用单镜支气管镜视图作为输入,操作员通过指定在分支点进入的下一个气道来指导系统。使用这种条件自主的目标病变可达性在假体研究中为73.3%,在离体研究中为77.5%。分岔和分岔之间的成功率(p = 0.03)和跨叶段的成功率(p = 0.005)具有统计学意义。在离体研究中,呼吸运动的存在并不影响病变的可达性或分支点转弯的成功。在体内研究中,当将条件自动化与人工操作导航进行比较时,条件自动化比人工操作所需的时间更短。人工操作人员通过每个分叉的平均时间为2.5秒,条件自动化操作人员为1.3秒。通过提高组织采样的精确度和一致性,这项技术可以重新定义肺癌患者的护理标准,从而实现更准确的诊断和治疗。
{"title":"Conditional Autonomy in Robot-Assisted Transbronchial Interventions.","authors":"Artur Banach, Fumitaro Masaki, Lambros Athanasiou, Franklin King, Hussein Kharroubi, Bassel Tfayli, Hisashi Tsukada, Yolonda Colson, Nobuhiko Hata","doi":"10.1109/TBME.2025.3565915","DOIUrl":"10.1109/TBME.2025.3565915","url":null,"abstract":"<p><p>Lung cancer is one of the leading causes of cancer-related deaths, and accurate staging is critical for determining the appropriate treatment. Robotic Navigation Bronchoscopy has shown advantages over traditional manual procedures, offering benefits in safety, efficiency, and accessibility. Although there is ongoing discussion regarding autonomous RNB, there is limited focus on the autonomy in advancing the bronchoscope. In this study, we introduce a novel method for conditional autonomy in advancing and aligning a robotic bronchoscope, which was validated in vitro, ex vivo, and in vivo. This conditional autonomy utilizes a monoscopic bronchoscopic view as input, with operators guiding the system by specifying the next airway to enter at branching points. The reachability of target lesions using this conditional autonomy was 73.3% in the phantom study and 77.5% in the ex vivo study. Statistical significance was found in success rates between bifurcations and trifurcations (p = 0.03) and across lobe segments (p = 0.005). The presence of breathing motion did not affect lesion reachability or the success of turns at branching points in the ex vivo studies. In the in vivo study, when comparing conditional automation to human-operated navigation, the conditional automation took less time to reach the target lesions than human operators. The median time for passing each bifurcation was 2.5 seconds for human operators and 1.3 seconds for conditional automation. By improving precision and consistency in tissue sampling, this technology could redefine the standard of care for lung cancer patients, leading to more accurate diagnoses and therapies.</p>","PeriodicalId":13245,"journal":{"name":"IEEE Transactions on Biomedical Engineering","volume":"PP ","pages":"3256-3267"},"PeriodicalIF":4.5,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144003383","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Noise Propagation and MP-PCA Image Denoising for High-Resolution Quantitative $R_2^{rm{*}}$, $T_2^{rm{*}}$, and Magnetic Susceptibility Mapping (QSM). 高分辨率定量R2*, T2*和磁化率图(QSM)的噪声传播和MP-PCA图像去噪。
IF 4.5 2区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2025-11-01 DOI: 10.1109/TBME.2025.3566561
Liad Doniza, Mitchel Lee, Tamar Blumenfeld-Katzir, Moran Artzi, Dafna Ben-Bashat, Orna Aizenstein, Dvir Radunsky, Fenella Kirkham, George Thomas, Rimona S Weil, Karin Shmueli, Noam Ben-Eliezer

Objective: Quantitative Susceptibility Mapping (QSM) measures magnetic susceptibility of tissues, aiding in the detection of pathologies like traumatic brain injury, cerebral microbleeds, Parkinson's disease, and multiple sclerosis, through analysis of variations in substances such as iron and calcium. Despite its clinical value, using high-resolution QSM (voxel sizes < 1 mm3) reduces signal-to-noise ratio (SNR), which compromises diagnostic quality.

Methods: Denoising of T2*-weighted (T2*w) data was implemented using Marchenko-Pastur Principal Component Analysis (MP-PCA), allowing to enhance the quality of R2*, T2*, and QSM maps. Proof of concept of the denoising technique was demonstrated on a numerical phantom, healthy subjects, and patients with brain metastases and sickle cell anemia.

Results: Effective and robust denoising was observed across different scan settings, offering higher SNR and improved accuracy. Noise propagation was analyzed between T2*w, R2*, and T2* values, revealing augmentation of noise in T2*w compared to R2* values.

Conclusions: The use of MP-PCA denoising allows the collection of high resolution (∼0.5 mm3) QSM data at clinical scan times, without compromising SNR.

Significance: The presented pipeline could enhance the diagnosis of various neurological diseases by providing higher-definition mapping of small vessels and of variations in iron or calcium.

定量易感性绘图(QSM)测量组织的磁化率,通过分析铁和钙等物质的变化,帮助检测创伤性脑损伤、脑微出血、帕金森病和多发性硬化症等疾病。尽管具有临床价值,但使用高分辨率QSM(体素尺寸< 1 mm3)会降低信噪比(SNR),从而影响诊断质量。方法:采用Marchenko-Pastur主成分分析(MP-PCA)对T2*加权(T2*)数据进行去噪,提高R2*、T2*和QSM图谱的质量。降噪技术的概念已在数字幻影、健康受试者和脑转移和镰状细胞性贫血患者身上得到验证。结果:在不同的扫描设置中观察到有效和鲁棒的去噪,提供更高的信噪比和提高的准确性。分析T2*w、R2*和T2*值之间的噪声传播,发现T2*w中的噪声比R2*值增强。结论:使用MP-PCA去噪可以在临床扫描时间收集高分辨率(~ 0.5 mm3)的QSM数据,而不会影响信噪比。意义:所提出的管道可以通过提供小血管和铁或钙变化的高清晰度制图来增强各种神经系统疾病的诊断。
{"title":"Noise Propagation and MP-PCA Image Denoising for High-Resolution Quantitative $R_2^{rm{*}}$, $T_2^{rm{*}}$, and Magnetic Susceptibility Mapping (QSM).","authors":"Liad Doniza, Mitchel Lee, Tamar Blumenfeld-Katzir, Moran Artzi, Dafna Ben-Bashat, Orna Aizenstein, Dvir Radunsky, Fenella Kirkham, George Thomas, Rimona S Weil, Karin Shmueli, Noam Ben-Eliezer","doi":"10.1109/TBME.2025.3566561","DOIUrl":"10.1109/TBME.2025.3566561","url":null,"abstract":"<p><strong>Objective: </strong>Quantitative Susceptibility Mapping (QSM) measures magnetic susceptibility of tissues, aiding in the detection of pathologies like traumatic brain injury, cerebral microbleeds, Parkinson's disease, and multiple sclerosis, through analysis of variations in substances such as iron and calcium. Despite its clinical value, using high-resolution QSM (voxel sizes < 1 mm<sup>3</sup>) reduces signal-to-noise ratio (SNR), which compromises diagnostic quality.</p><p><strong>Methods: </strong>Denoising of T<sub>2</sub><sup>*</sup>-weighted (T<sub>2</sub><sup>*</sup>w) data was implemented using Marchenko-Pastur Principal Component Analysis (MP-PCA), allowing to enhance the quality of R<sub>2</sub><sup>*</sup>, T<sub>2</sub><sup>*</sup>, and QSM maps. Proof of concept of the denoising technique was demonstrated on a numerical phantom, healthy subjects, and patients with brain metastases and sickle cell anemia.</p><p><strong>Results: </strong>Effective and robust denoising was observed across different scan settings, offering higher SNR and improved accuracy. Noise propagation was analyzed between T<sub>2</sub><sup>*</sup>w, R<sub>2</sub><sup>*</sup>, and T<sub>2</sub><sup>*</sup> values, revealing augmentation of noise in T<sub>2</sub><sup>*</sup>w compared to R<sub>2</sub><sup>*</sup> values.</p><p><strong>Conclusions: </strong>The use of MP-PCA denoising allows the collection of high resolution (∼0.5 mm<sup>3</sup>) QSM data at clinical scan times, without compromising SNR.</p><p><strong>Significance: </strong>The presented pipeline could enhance the diagnosis of various neurological diseases by providing higher-definition mapping of small vessels and of variations in iron or calcium.</p>","PeriodicalId":13245,"journal":{"name":"IEEE Transactions on Biomedical Engineering","volume":"PP ","pages":"3277-3287"},"PeriodicalIF":4.5,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143989392","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Building and Sustaining Open-Source Medical Device Projects. 建立和维护开源医疗设备项目。
IF 4.5 2区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2025-11-01 DOI: 10.1109/TBME.2025.3563102
Alex Baldwin, Sahar Elyahoodayan, Pallavi Gunalan, Victor Pikov, Ellis Meng

The open-source development model has been successfully applied to consumer and enterprise software, and recently to consumer hardware. Medical devices may become a beneficiary of this trend, as open-source medical device development has the potential to reduce costs, democratize patient access, and provide continued support to abandoned devices from failed companies. Unlike the consumer device market, the medical device market is highly regulated and involves considerable manufacturer liability that may limit the use of open-source technology. This review of open-source medical device development explores the current state of development in research and clinical products and suggests best practices for creating sustainable and effective open-source medical devices.

开源开发模型已经成功地应用于消费者和企业软件,最近也应用于消费者硬件。医疗设备可能成为这一趋势的受益者,因为开源医疗设备开发有可能降低成本,使患者更容易使用,并为失败公司的废弃设备提供持续支持。与消费设备市场不同,医疗设备市场受到高度监管,涉及相当大的制造商责任,这可能限制开源技术的使用。这篇对开源医疗设备开发的综述探讨了研究和临床产品开发的现状,并提出了创建可持续和有效的开源医疗设备的最佳实践建议。
{"title":"Building and Sustaining Open-Source Medical Device Projects.","authors":"Alex Baldwin, Sahar Elyahoodayan, Pallavi Gunalan, Victor Pikov, Ellis Meng","doi":"10.1109/TBME.2025.3563102","DOIUrl":"10.1109/TBME.2025.3563102","url":null,"abstract":"<p><p>The open-source development model has been successfully applied to consumer and enterprise software, and recently to consumer hardware. Medical devices may become a beneficiary of this trend, as open-source medical device development has the potential to reduce costs, democratize patient access, and provide continued support to abandoned devices from failed companies. Unlike the consumer device market, the medical device market is highly regulated and involves considerable manufacturer liability that may limit the use of open-source technology. This review of open-source medical device development explores the current state of development in research and clinical products and suggests best practices for creating sustainable and effective open-source medical devices.</p>","PeriodicalId":13245,"journal":{"name":"IEEE Transactions on Biomedical Engineering","volume":"PP ","pages":"3159-3173"},"PeriodicalIF":4.5,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12684831/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143997812","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Migration of Deep Learning Models Across Ultrasound Scanners. 跨超声扫描仪的深度学习模型迁移。
IF 4.5 2区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2025-11-01 DOI: 10.1109/TBME.2025.3564567
Ufuk Soylu, Varun Chandrasekeran, Gregory J Czarnota, Michael L Oelze

A transfer function approach has recently proven effective for calibrating deep learning (DL) algorithms in quantitative ultrasound (QUS), addressing data shifts at both the acquisition and machine levels. Expanding on this approach, we develop a strategy to acquire the functionality of a DL model from one ultrasound machine and implement it on another in a black-box setting, in the context of QUS. This demonstrates the ease with which the functionality of a DL model can be transferred between machines. While the proposed approach can also assist regulatory bodies in comparing and approving DL models, it also highlights the security risks associated with deploying such models in a commercial scanner for clinical use. The method is a black-box unsupervised domain adaptation technique that integrates the transfer function approach with an iterative schema. It does not utilize any information related to model internals but it solely relies on the availability of an input-output interface. Additionally, we assume the availability of unlabeled data from a testing machine. This scenario could become relevant as companies begin deploying their DL functionalities for clinical use. In the experiments, we used a SonixOne and a Verasonics machine. The model was trained on SonixOne data, and its functionality was then transferred to the Verasonics machine. The proposed method successfully transferred the functionality to the Verasonics machine, achieving a remarkable 98 percent classification accuracy in a binary decision task. This study underscores the need to establish security measures prior to deploying DL models in clinical settings.

传递函数方法最近被证明可以有效地校准定量超声(QUS)中的深度学习(DL)算法,解决采集和机器层面的数据转移问题。在此方法的基础上,我们开发了一种策略,从一台超声机器获取DL模型的功能,并在QUS背景下的黑盒设置中在另一台超声机器上实现它。这证明了深度学习模型的功能可以在机器之间轻松地转移。虽然所提出的方法也可以帮助监管机构比较和批准DL模型,但它也强调了在商用扫描仪中部署此类模型用于临床使用的安全风险。该方法是一种将传递函数方法与迭代模式相结合的黑盒无监督域自适应技术。它不利用任何与模型内部相关的信息,而仅仅依赖于输入-输出接口的可用性。此外,我们假设从测试机器获得未标记数据的可用性。随着公司开始部署临床使用的深度学习功能,这种情况可能会变得相关。在实验中,我们使用了SonixOne和Verasonics机器。该模型在SonixOne数据上进行训练,然后将其功能转移到Verasonics机器上。该方法成功地将该功能转移到Verasonics机器上,在二元决策任务中实现了98%的分类准确率。这项研究强调了在临床环境中部署DL模型之前建立安全措施的必要性。
{"title":"Migration of Deep Learning Models Across Ultrasound Scanners.","authors":"Ufuk Soylu, Varun Chandrasekeran, Gregory J Czarnota, Michael L Oelze","doi":"10.1109/TBME.2025.3564567","DOIUrl":"10.1109/TBME.2025.3564567","url":null,"abstract":"<p><p>A transfer function approach has recently proven effective for calibrating deep learning (DL) algorithms in quantitative ultrasound (QUS), addressing data shifts at both the acquisition and machine levels. Expanding on this approach, we develop a strategy to acquire the functionality of a DL model from one ultrasound machine and implement it on another in a black-box setting, in the context of QUS. This demonstrates the ease with which the functionality of a DL model can be transferred between machines. While the proposed approach can also assist regulatory bodies in comparing and approving DL models, it also highlights the security risks associated with deploying such models in a commercial scanner for clinical use. The method is a black-box unsupervised domain adaptation technique that integrates the transfer function approach with an iterative schema. It does not utilize any information related to model internals but it solely relies on the availability of an input-output interface. Additionally, we assume the availability of unlabeled data from a testing machine. This scenario could become relevant as companies begin deploying their DL functionalities for clinical use. In the experiments, we used a SonixOne and a Verasonics machine. The model was trained on SonixOne data, and its functionality was then transferred to the Verasonics machine. The proposed method successfully transferred the functionality to the Verasonics machine, achieving a remarkable 98 percent classification accuracy in a binary decision task. This study underscores the need to establish security measures prior to deploying DL models in clinical settings.</p>","PeriodicalId":13245,"journal":{"name":"IEEE Transactions on Biomedical Engineering","volume":"PP ","pages":"3210-3220"},"PeriodicalIF":4.5,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12758015/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143970113","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
High Sensitivity Sensor based on Bound State in the Continuum in Detection for Cancer Cells. 基于连续体束缚态的高灵敏度传感器在癌细胞检测中的应用。
IF 4.5 2区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2025-10-31 DOI: 10.1109/TBME.2025.3627465
Yuqi Cao, Liran Shen, Heng Liu, Weiting Ge, Wei Huang, Yi Zhang, Jiani Chen, Pingjie Huang, Dibo Hou, Guangxin Zhang

Although terahertz (THz) metasurfaces based on bound states in the continuum (BIC) have garnered significant attention in biomedical applications, their technical implementation in high-sensitivity cancer cells detection remains a critical challenge. In this work, we present a THz biosensor employing dual split-ring resonator (DSRR) arrays quasi-bound states in the continuum (Q-BIC). Numerical simulations reveal a high-Q resonance dip at 2.35 THz with a detection sensitivity of 522 GHz/RIU. Experimentally, the performance was validated by detecting normal cells (murine splenocytes) and three cancer cell lines (LLC, LoVo, and MC38). In addition, analysis of cell type discrimination was achieved by integrating machine learning algorithms to project high dimensional spectral data into a low-dimensional space. This study establishes a label-free approach for long-term cellular monitoring, advancing THz technology as an innovative platform for practical biomedical applications.

尽管基于连续体结合态(BIC)的太赫兹(THz)超表面在生物医学应用中引起了极大的关注,但其在高灵敏度癌细胞检测中的技术实现仍然是一个关键挑战。在这项工作中,我们提出了一种采用双分裂环谐振器(DSRR)阵列连续介质准束缚态(Q-BIC)的太赫兹生物传感器。数值模拟结果表明,在2.35 THz处存在高q共振倾角,探测灵敏度为522 GHz/RIU。实验中,通过检测正常细胞(小鼠脾细胞)和三种癌细胞系(LLC, LoVo和MC38),验证了该性能。此外,通过集成机器学习算法将高维光谱数据投影到低维空间,实现了细胞类型判别分析。本研究建立了一种无标签的长期细胞监测方法,推动太赫兹技术作为实际生物医学应用的创新平台。
{"title":"High Sensitivity Sensor based on Bound State in the Continuum in Detection for Cancer Cells.","authors":"Yuqi Cao, Liran Shen, Heng Liu, Weiting Ge, Wei Huang, Yi Zhang, Jiani Chen, Pingjie Huang, Dibo Hou, Guangxin Zhang","doi":"10.1109/TBME.2025.3627465","DOIUrl":"https://doi.org/10.1109/TBME.2025.3627465","url":null,"abstract":"<p><p>Although terahertz (THz) metasurfaces based on bound states in the continuum (BIC) have garnered significant attention in biomedical applications, their technical implementation in high-sensitivity cancer cells detection remains a critical challenge. In this work, we present a THz biosensor employing dual split-ring resonator (DSRR) arrays quasi-bound states in the continuum (Q-BIC). Numerical simulations reveal a high-Q resonance dip at 2.35 THz with a detection sensitivity of 522 GHz/RIU. Experimentally, the performance was validated by detecting normal cells (murine splenocytes) and three cancer cell lines (LLC, LoVo, and MC38). In addition, analysis of cell type discrimination was achieved by integrating machine learning algorithms to project high dimensional spectral data into a low-dimensional space. This study establishes a label-free approach for long-term cellular monitoring, advancing THz technology as an innovative platform for practical biomedical applications.</p>","PeriodicalId":13245,"journal":{"name":"IEEE Transactions on Biomedical Engineering","volume":"PP ","pages":""},"PeriodicalIF":4.5,"publicationDate":"2025-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145421565","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Hierarchical Multimodal Framework for Sedation Monitoring in ICU Patients. ICU患者镇静监测的分层多模式框架。
IF 4.5 2区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2025-10-28 DOI: 10.1109/TBME.2025.3626584
Ke Zhang, Zhelong Wang, Shiguo Zang, Zhenglin Li, Hongyu Zhao, Jiaxi Li, Fang Lin, Hongkai Zhao

In the intensive care unit (ICU), monitoring sedation levels is crucial. Clinicians often rely on intermittent behavioral scales like the Richmond Agitation-Sedation Scale (RASS), which can be subjective and delay timely interventions. While electroencephalography (EEG) offers a continuous and non-invasive alternative, but the complexity of consciousness renders a unimodal signal insufficient for comprehensive representation. To address these challenges, we propose a novel multimodal deep learning framework, Hierarchical Multimodal Fusion with Dynamic Correction (HMDC), that synergistically integrates EEG with peripheral physiological signals including blood pressure, heart rate, and oxygen saturation. The architecture features a dual-stream pathway to process both raw temporal EEG data and its spectral features from spectrograms. These neural representations are then intelligently fused and refined by a Dynamic Correction Module using a confidence-weighting mechanism. The model was developed and validated on a dataset comprising 2,880 labeled RASS assessments from 105 ICU patients, with scores ranging from -5 (comatose) to +1 (restless). The HMDC framework achieved a classification accuracy of 83.8%, significantly outperforming unimodal and simpler fusion baselines. By providing a temporally precise and physiologically grounded sedation assessment, this integrative approach establishes a robust correlation between multimodal signal patterns and clinical states, offering clinicians a unified tool for optimizing sedative titration and potentially minimizing delirium risks.

在重症监护病房(ICU),监测镇静水平至关重要。临床医生经常依赖间歇性行为量表,如里士满激动镇静量表(RASS),这可能是主观的,并延迟及时干预。虽然脑电图(EEG)提供了一种连续和非侵入性的替代方法,但意识的复杂性使得单峰信号不足以全面表征。为了解决这些挑战,我们提出了一种新的多模态深度学习框架,即分层多模态融合动态校正(HMDC),该框架将EEG与周围生理信号(包括血压、心率和血氧饱和度)协同集成。该体系结构具有双流路径来处理原始时间脑电图数据和来自频谱图的频谱特征。然后,使用置信度加权机制的动态校正模块对这些神经表征进行智能融合和细化。该模型是在一个数据集上开发和验证的,该数据集包括来自105名ICU患者的2,880个标记RASS评估,评分范围从-5(昏迷)到+1(不安)。HMDC框架实现了83.8%的分类准确率,显著优于单峰和更简单的融合基线。通过提供时间上精确和生理上的镇静评估,这种综合方法在多模态信号模式和临床状态之间建立了强大的相关性,为临床医生提供了一个统一的工具来优化镇静滴定,并可能最大限度地降低谵妄风险。
{"title":"A Hierarchical Multimodal Framework for Sedation Monitoring in ICU Patients.","authors":"Ke Zhang, Zhelong Wang, Shiguo Zang, Zhenglin Li, Hongyu Zhao, Jiaxi Li, Fang Lin, Hongkai Zhao","doi":"10.1109/TBME.2025.3626584","DOIUrl":"https://doi.org/10.1109/TBME.2025.3626584","url":null,"abstract":"<p><p>In the intensive care unit (ICU), monitoring sedation levels is crucial. Clinicians often rely on intermittent behavioral scales like the Richmond Agitation-Sedation Scale (RASS), which can be subjective and delay timely interventions. While electroencephalography (EEG) offers a continuous and non-invasive alternative, but the complexity of consciousness renders a unimodal signal insufficient for comprehensive representation. To address these challenges, we propose a novel multimodal deep learning framework, Hierarchical Multimodal Fusion with Dynamic Correction (HMDC), that synergistically integrates EEG with peripheral physiological signals including blood pressure, heart rate, and oxygen saturation. The architecture features a dual-stream pathway to process both raw temporal EEG data and its spectral features from spectrograms. These neural representations are then intelligently fused and refined by a Dynamic Correction Module using a confidence-weighting mechanism. The model was developed and validated on a dataset comprising 2,880 labeled RASS assessments from 105 ICU patients, with scores ranging from -5 (comatose) to +1 (restless). The HMDC framework achieved a classification accuracy of 83.8%, significantly outperforming unimodal and simpler fusion baselines. By providing a temporally precise and physiologically grounded sedation assessment, this integrative approach establishes a robust correlation between multimodal signal patterns and clinical states, offering clinicians a unified tool for optimizing sedative titration and potentially minimizing delirium risks.</p>","PeriodicalId":13245,"journal":{"name":"IEEE Transactions on Biomedical Engineering","volume":"PP ","pages":""},"PeriodicalIF":4.5,"publicationDate":"2025-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145389114","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Manifold Learning Approaches for Characterizing Photoplethysmographic Signals. 表征光容积脉搏波信号的流形学习方法。
IF 4.5 2区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2025-10-27 DOI: 10.1109/TBME.2025.3625858
Enrique Feito-Casares, Francisco M Melgarejo-Meseguer, Alejandro Cobo, Luis Baumela, Jose-Luis Rojo-Alvarez

Objective: Photoplethysmography (PPG) is widely used for cardiovascular monitoring, but its analysis is challenged by signal variability, inconsistent acquisition settings, and limited interpretability. This study investigates the use of low-dimensional embeddings to support down-stream tasks, including anomaly detection, activity classification, and signal authenticity verification across diverse PPG modalities.

Methods: We developed a pipeline lever aging dimensionality reduction techniques, Autoencoder (AE), Fully Connected Neural Network (FCNN), and Uniform Manifold Approximation and Projection (UMAP) to extract compact signal representations. These methods were evaluated across four datasets representing clinical (BIDMC, MIMIC-PERFORM), wearable (Wrist PPG), and remote PPG (UBFC) recordings. Performance was assessed through clustering indices, classification metrics, and anomaly detection rates under varying noise levels.

Results: Quantitative evaluation demonstrated that AE-based embeddings enabled accurate discrimination between neonatal and adult signals in the MIMIC-PERFORM dataset (F1 = 0.92, AUC = 0.90), while UMAP outperformed AE and FCNN in clustering physical activities from Wrist PPG data (Davies Bouldin Index = 5.40). In the BIDMC dataset, the framework detected synthetic anomalies with an AUC of 0.77 at 2 dB SNR, with detection rates declining consistently with reduced noise. On the UBFC dataset, UMAP embeddings supported the detection of manipulated rPPG signals with an F1 score of 0.75 and an AUC of 0.73.

Conclusion: Low dimensional representations provide a compact and task relevant encoding of PPG signals that enhances classification and detection performance in multiple scenarios. While interpretability gains remain task-dependent, these findings support the utility of embedding-based approaches in biomedical signal analysis and their robustness across modalities and noise conditions.

目的:光容积脉搏波(PPG)广泛用于心血管监测,但其分析受到信号变异性、采集设置不一致和可解释性有限的挑战。本研究探讨了使用低维嵌入来支持下游任务,包括不同PPG模式下的异常检测、活动分类和信号真实性验证。方法:我们开发了管道杆老化降维技术、自动编码器(AE)、全连接神经网络(FCNN)和均匀流形逼近与投影(UMAP)来提取紧凑的信号表示。这些方法通过四个数据集进行评估,这些数据集代表临床(BIDMC, MIMIC-PERFORM),可穿戴(手腕PPG)和远程PPG (UBFC)记录。通过聚类指数、分类指标和不同噪声水平下的异常检测率来评估性能。结果:定量评估表明,基于AE的嵌入能够准确区分MIMIC-PERFORM数据集中的新生儿和成人信号(F1 = 0.92, AUC = 0.90),而UMAP在从手腕PPG数据中聚类身体活动方面优于AE和FCNN (Davies Bouldin指数= 5.40)。在BIDMC数据集中,该框架在2 dB信噪比下检测到的合成异常AUC为0.77,检测率随着噪声的降低而持续下降。在UBFC数据集上,UMAP嵌入支持检测被操纵的rPPG信号,F1得分为0.75,AUC为0.73。结论:低维表示提供了一种紧凑且与任务相关的PPG信号编码,提高了多种场景下的分类和检测性能。虽然可解释性增益仍然依赖于任务,但这些发现支持基于嵌入的方法在生物医学信号分析中的效用,以及它们在模式和噪声条件下的鲁棒性。
{"title":"Manifold Learning Approaches for Characterizing Photoplethysmographic Signals.","authors":"Enrique Feito-Casares, Francisco M Melgarejo-Meseguer, Alejandro Cobo, Luis Baumela, Jose-Luis Rojo-Alvarez","doi":"10.1109/TBME.2025.3625858","DOIUrl":"https://doi.org/10.1109/TBME.2025.3625858","url":null,"abstract":"<p><strong>Objective: </strong>Photoplethysmography (PPG) is widely used for cardiovascular monitoring, but its analysis is challenged by signal variability, inconsistent acquisition settings, and limited interpretability. This study investigates the use of low-dimensional embeddings to support down-stream tasks, including anomaly detection, activity classification, and signal authenticity verification across diverse PPG modalities.</p><p><strong>Methods: </strong>We developed a pipeline lever aging dimensionality reduction techniques, Autoencoder (AE), Fully Connected Neural Network (FCNN), and Uniform Manifold Approximation and Projection (UMAP) to extract compact signal representations. These methods were evaluated across four datasets representing clinical (BIDMC, MIMIC-PERFORM), wearable (Wrist PPG), and remote PPG (UBFC) recordings. Performance was assessed through clustering indices, classification metrics, and anomaly detection rates under varying noise levels.</p><p><strong>Results: </strong>Quantitative evaluation demonstrated that AE-based embeddings enabled accurate discrimination between neonatal and adult signals in the MIMIC-PERFORM dataset (F1 = 0.92, AUC = 0.90), while UMAP outperformed AE and FCNN in clustering physical activities from Wrist PPG data (Davies Bouldin Index = 5.40). In the BIDMC dataset, the framework detected synthetic anomalies with an AUC of 0.77 at 2 dB SNR, with detection rates declining consistently with reduced noise. On the UBFC dataset, UMAP embeddings supported the detection of manipulated rPPG signals with an F1 score of 0.75 and an AUC of 0.73.</p><p><strong>Conclusion: </strong>Low dimensional representations provide a compact and task relevant encoding of PPG signals that enhances classification and detection performance in multiple scenarios. While interpretability gains remain task-dependent, these findings support the utility of embedding-based approaches in biomedical signal analysis and their robustness across modalities and noise conditions.</p>","PeriodicalId":13245,"journal":{"name":"IEEE Transactions on Biomedical Engineering","volume":"PP ","pages":""},"PeriodicalIF":4.5,"publicationDate":"2025-10-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145377198","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Tumor Treating Fields: A Review of Computational Strategies for Thermal Safety and Personalization Treatment. 肿瘤治疗领域:热安全和个性化治疗的计算策略综述。
IF 4.5 2区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2025-10-24 DOI: 10.1109/TBME.2025.3625565
Yueyue Xiao, Chunxiao Chen, Jing Xia, Yubin Zheng, Liang Wang, Ming Lu, Jagath C Rajapakse

Objective: Tumor Treating Fields (TTFields) therapy, a clinically established modality that disrupts cancer cell mitosis through biophysical mechanisms, presents a unique paradigm in oncology. Despite its proven efficacy, its broad application is hindered by significant challenges in optimizing treatment delivery for individual patients.

Methods: This review synthesizes the landscape of advanced computational strategies designed to overcome these barriers. We argue that personalizing TTFields therapy requires tackling three interdependent obstacles: achieving accurate electric field dosimetry, ensuring thermal safety, and enabling adaptive treatment planning.

Results: this review systematically analyzes the state-of-the-art computational solutions corresponding to each challenge. We first examine patient-specific electric field modeling, emphasizing the critical roles of high-fidelity segmentation and quantitative dosimetric criteria. We then delve into thermal safety analysis, focusing on coupled electro-thermal simulations for predicting and mitigating thermal risks. Finally, we explore the multifaceted approaches to personalization, reviewing the convergence of algorithmic array layout optimization, real-time monitoring systems, and synergistic surgical interventions.

Significance: By structuring the current body of research within this "problem-solution" framework, this review provides a clear and cohesive synthesis of how computational engineering is paving the way for a new era of precise, safe, and adaptive TTFields therapy.

目的:肿瘤治疗场(TTFields)治疗是一种临床建立的通过生物物理机制破坏癌细胞有丝分裂的治疗方式,在肿瘤学中呈现出独特的范式。尽管其已被证明有效,但其广泛应用受到优化个体患者治疗递送的重大挑战的阻碍。方法:本综述综合了旨在克服这些障碍的先进计算策略的景观。我们认为个性化的TTFields疗法需要解决三个相互依存的障碍:实现准确的电场剂量测定,确保热安全性,并实现适应性治疗计划。结果:本综述系统地分析了每个挑战对应的最先进的计算解决方案。我们首先检查患者特异性电场建模,强调高保真分割和定量剂量标准的关键作用。然后,我们深入研究热安全分析,重点关注耦合电热模拟,以预测和减轻热风险。最后,我们探讨了个性化的多方面方法,回顾了算法阵列布局优化、实时监测系统和协同手术干预的融合。意义:通过在这个“问题-解决方案”框架内构建当前的研究主体,本综述提供了一个清晰而有凝聚力的综合,即计算工程如何为精确、安全和适应性TTFields治疗的新时代铺平道路。
{"title":"Tumor Treating Fields: A Review of Computational Strategies for Thermal Safety and Personalization Treatment.","authors":"Yueyue Xiao, Chunxiao Chen, Jing Xia, Yubin Zheng, Liang Wang, Ming Lu, Jagath C Rajapakse","doi":"10.1109/TBME.2025.3625565","DOIUrl":"https://doi.org/10.1109/TBME.2025.3625565","url":null,"abstract":"<p><strong>Objective: </strong>Tumor Treating Fields (TTFields) therapy, a clinically established modality that disrupts cancer cell mitosis through biophysical mechanisms, presents a unique paradigm in oncology. Despite its proven efficacy, its broad application is hindered by significant challenges in optimizing treatment delivery for individual patients.</p><p><strong>Methods: </strong>This review synthesizes the landscape of advanced computational strategies designed to overcome these barriers. We argue that personalizing TTFields therapy requires tackling three interdependent obstacles: achieving accurate electric field dosimetry, ensuring thermal safety, and enabling adaptive treatment planning.</p><p><strong>Results: </strong>this review systematically analyzes the state-of-the-art computational solutions corresponding to each challenge. We first examine patient-specific electric field modeling, emphasizing the critical roles of high-fidelity segmentation and quantitative dosimetric criteria. We then delve into thermal safety analysis, focusing on coupled electro-thermal simulations for predicting and mitigating thermal risks. Finally, we explore the multifaceted approaches to personalization, reviewing the convergence of algorithmic array layout optimization, real-time monitoring systems, and synergistic surgical interventions.</p><p><strong>Significance: </strong>By structuring the current body of research within this \"problem-solution\" framework, this review provides a clear and cohesive synthesis of how computational engineering is paving the way for a new era of precise, safe, and adaptive TTFields therapy.</p>","PeriodicalId":13245,"journal":{"name":"IEEE Transactions on Biomedical Engineering","volume":"PP ","pages":""},"PeriodicalIF":4.5,"publicationDate":"2025-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145367672","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cross-Hemispheric Spatial-Temporal Attention Network for Decoding Silent Speech From EEG. 脑电沉默语音解码的跨半球时空注意网络。
IF 4.5 2区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2025-10-23 DOI: 10.1109/TBME.2025.3624878
Yanru Bai, Shuming Zhang, Ran Zhao, Xu Han, Guangjian Ni, Dong Ming

Objective: Speech, as the core of advanced human cognition, is fundamental to social interaction and daily life. Electroencephalogram (EEG)-based speech brain-computer interface (BCI) offers a novel communication pathway for patients with speech disorders, where deep learning has demonstrated significant advantages. Given the established dominance of the left hemisphere in speech processing, exploring methods to extract speech related neural features fully is crucial for enhancing decoding per formance.

Approach: In this study, EEG signals were recorded during a silent speech task involving the articulation of 10 distinct Chinese characters. Leveraging the principle of language function lateralization, we proposed a novel deep learning model, the cross hemispheric spatial-temporal attention network (CHSTAN), for EEG-based silent speech recognition. A multiscale temporal con volution block was employed to extract the temporal dynamics of EEG signals. A hemispheric spatial convolutional block was designed to independently process spatial information from the left and right hemispheres. Furthermore, the cross-attention mechanism was introduced to enhance inter-hemispheric feature inter action and specifically reinforce left-hemispheric feature representation for the final classification.

Results: We compared CHSTAN with other existing methods using 5-fold cross-validation on the collected dataset. CHSTAN achieved an average classification accuracy of 49.88% and an average F1-score of 48.75% in decoding the 10 Chinese characters, significantly outperforming other methods.

Conclusion: The results indicate that the CHSTAN performs effectively in silent speech EEG classification tasks. Notably, the feature patterns learned through its innovative architecture correspond to neural speech processing mechanism.

Significance: CHSTAN provides valuable insights and practical solutions for improving the performance of EEG-based speech decoding.

目的:语言是人类高级认知的核心,是社会交往和日常生活的基础。基于脑电图(EEG)的言语脑机接口(BCI)为言语障碍患者提供了一种新的交流途径,深度学习在这方面具有显著优势。鉴于左半球在语音处理中的主导地位,探索充分提取语音相关神经特征的方法对于提高解码性能至关重要。方法:在本研究中,在一个涉及10个不同汉字发音的无声言语任务中记录脑电图信号。利用语言功能侧化原理,提出了一种新的深度学习模型——跨半球时空注意网络(CHSTAN),用于基于脑电图的无声语音识别。采用多尺度时间卷积块提取脑电信号的时间动态特征。设计了一个半球空间卷积块来独立处理来自左右半球的空间信息。在此基础上,引入交叉注意机制,增强左半球特征交互作用,强化左半球特征表征,为最终分类提供依据。结果:我们对收集的数据集进行了5倍交叉验证,将CHSTAN与其他现有方法进行了比较。CHSTAN解码10个汉字的平均分类准确率为49.88%,平均f1得分为48.75%,明显优于其他方法。结论:CHSTAN在无声语音脑电分类任务中表现良好。值得注意的是,通过其创新架构学习到的特征模式与神经语音处理机制相对应。意义:CHSTAN为提高基于脑电图的语音解码性能提供了有价值的见解和实用的解决方案。
{"title":"Cross-Hemispheric Spatial-Temporal Attention Network for Decoding Silent Speech From EEG.","authors":"Yanru Bai, Shuming Zhang, Ran Zhao, Xu Han, Guangjian Ni, Dong Ming","doi":"10.1109/TBME.2025.3624878","DOIUrl":"https://doi.org/10.1109/TBME.2025.3624878","url":null,"abstract":"<p><strong>Objective: </strong>Speech, as the core of advanced human cognition, is fundamental to social interaction and daily life. Electroencephalogram (EEG)-based speech brain-computer interface (BCI) offers a novel communication pathway for patients with speech disorders, where deep learning has demonstrated significant advantages. Given the established dominance of the left hemisphere in speech processing, exploring methods to extract speech related neural features fully is crucial for enhancing decoding per formance.</p><p><strong>Approach: </strong>In this study, EEG signals were recorded during a silent speech task involving the articulation of 10 distinct Chinese characters. Leveraging the principle of language function lateralization, we proposed a novel deep learning model, the cross hemispheric spatial-temporal attention network (CHSTAN), for EEG-based silent speech recognition. A multiscale temporal con volution block was employed to extract the temporal dynamics of EEG signals. A hemispheric spatial convolutional block was designed to independently process spatial information from the left and right hemispheres. Furthermore, the cross-attention mechanism was introduced to enhance inter-hemispheric feature inter action and specifically reinforce left-hemispheric feature representation for the final classification.</p><p><strong>Results: </strong>We compared CHSTAN with other existing methods using 5-fold cross-validation on the collected dataset. CHSTAN achieved an average classification accuracy of 49.88% and an average F1-score of 48.75% in decoding the 10 Chinese characters, significantly outperforming other methods.</p><p><strong>Conclusion: </strong>The results indicate that the CHSTAN performs effectively in silent speech EEG classification tasks. Notably, the feature patterns learned through its innovative architecture correspond to neural speech processing mechanism.</p><p><strong>Significance: </strong>CHSTAN provides valuable insights and practical solutions for improving the performance of EEG-based speech decoding.</p>","PeriodicalId":13245,"journal":{"name":"IEEE Transactions on Biomedical Engineering","volume":"PP ","pages":""},"PeriodicalIF":4.5,"publicationDate":"2025-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145354690","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Transactions on Biomedical Engineering
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1