首页 > 最新文献

IEEE Transactions on Biomedical Engineering最新文献

英文 中文
Muscle Synergy-Guided Reinforcement Learning for Embodied Musculoskeletal Motion Skill Learning. 肌肉协同引导的强化学习对具身肌肉骨骼运动技能的学习。
IF 4.5 2区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2026-01-12 DOI: 10.1109/TBME.2026.3651379
Lijun Han, Long Cheng, Muyuan Ma, Feiyang Yang, Houcheng Li

Background: Acquiring human- like motor skills in embodied musculoskeletal models is challenging due to the high dimensionality and redundancy of muscle actuators.

Methods: Inspired by human motor control coordination patterns, we introduce a synergy-guided reinforcement learning framework integrating physiological priors derived from muscle synergies into the control policy of an embodied musculoskeletal model. It leverages coordinated muscle activation patterns to guide learning, generating muscle excitation signals via a synergy-guided control component and a residual control component. To evaluate the proposed method, four badminton stroke skills are selected as benchmark tasks (forehand/backhand, inward/outward net slices).

Results: The experimental results demonstrate that our method achieves an average root mean square error of under 0.015 radians across all stroke types, demonstrating its ability to accurately learn expert motion. Furthermore, it outperforms the baseline proximal policy optimization (PPO) model in terms of trajectory accuracy, energy efficiency, and training convergence speed, particularly excelling in energy efficiency with up to a 14.9% reduction in energy consumption. The forehand high serve is also tested to validate the method's effectiveness in learning longer and larger-ranges movements, showing the same advantages. Moreover, the muscle synergies learned by the model exhibit moderate resemblance to human synergies, indicating potential interpretability and biological plausibility.

Conclusion: This work highlights that integrating neurophysiological priors into reinforcement learning provides a promising pathway for efficient, interpretable, and human- like motor control.

Significance: The approach holds promise for advancing motor skill assessment, human-machine interfaces, and rehabilitation technologies by enabling more efficient and human- like motion skill learning.

背景:由于肌肉致动器的高维性和冗余性,在具身肌肉骨骼模型中获得类似人类的运动技能是具有挑战性的。方法:受人类运动控制协调模式的启发,我们引入了一个协同引导的强化学习框架,将来自肌肉协同的生理先验整合到一个具体肌肉骨骼模型的控制策略中。它利用协调的肌肉激活模式来指导学习,通过协同引导的控制成分和剩余控制成分产生肌肉兴奋信号。为了评估所提出的方法,选择了四种羽毛球击球技术作为基准任务(正手/反手,向内/向外网切)。结果:实验结果表明,我们的方法在所有笔划类型下的平均均方根误差小于0.015弧度,证明了它能够准确地学习专家运动。此外,它在轨迹精度、能源效率和训练收敛速度方面优于基线近端策略优化(PPO)模型,特别是在能源效率方面表现出色,能耗降低高达14.9%。正手高发球也进行了测试,以验证该方法在学习更长和更大范围动作方面的有效性,显示出同样的优势。此外,该模型学习的肌肉协同作用与人类协同作用表现出适度的相似性,表明潜在的可解释性和生物学合理性。结论:这项工作强调了将神经生理学先验整合到强化学习中,为高效、可解释和类似人类的运动控制提供了一条有希望的途径。意义:该方法通过实现更高效和人性化的运动技能学习,有望推进运动技能评估、人机界面和康复技术。
{"title":"Muscle Synergy-Guided Reinforcement Learning for Embodied Musculoskeletal Motion Skill Learning.","authors":"Lijun Han, Long Cheng, Muyuan Ma, Feiyang Yang, Houcheng Li","doi":"10.1109/TBME.2026.3651379","DOIUrl":"https://doi.org/10.1109/TBME.2026.3651379","url":null,"abstract":"<p><strong>Background: </strong>Acquiring human- like motor skills in embodied musculoskeletal models is challenging due to the high dimensionality and redundancy of muscle actuators.</p><p><strong>Methods: </strong>Inspired by human motor control coordination patterns, we introduce a synergy-guided reinforcement learning framework integrating physiological priors derived from muscle synergies into the control policy of an embodied musculoskeletal model. It leverages coordinated muscle activation patterns to guide learning, generating muscle excitation signals via a synergy-guided control component and a residual control component. To evaluate the proposed method, four badminton stroke skills are selected as benchmark tasks (forehand/backhand, inward/outward net slices).</p><p><strong>Results: </strong>The experimental results demonstrate that our method achieves an average root mean square error of under 0.015 radians across all stroke types, demonstrating its ability to accurately learn expert motion. Furthermore, it outperforms the baseline proximal policy optimization (PPO) model in terms of trajectory accuracy, energy efficiency, and training convergence speed, particularly excelling in energy efficiency with up to a 14.9% reduction in energy consumption. The forehand high serve is also tested to validate the method's effectiveness in learning longer and larger-ranges movements, showing the same advantages. Moreover, the muscle synergies learned by the model exhibit moderate resemblance to human synergies, indicating potential interpretability and biological plausibility.</p><p><strong>Conclusion: </strong>This work highlights that integrating neurophysiological priors into reinforcement learning provides a promising pathway for efficient, interpretable, and human- like motor control.</p><p><strong>Significance: </strong>The approach holds promise for advancing motor skill assessment, human-machine interfaces, and rehabilitation technologies by enabling more efficient and human- like motion skill learning.</p>","PeriodicalId":13245,"journal":{"name":"IEEE Transactions on Biomedical Engineering","volume":"PP ","pages":""},"PeriodicalIF":4.5,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145959263","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Discrete Hemodynamic Control Framework: Proof-of-Concept Study for Autonomous Drug Therapy in Acute Heart Failure. 离散血流动力学控制框架:急性心力衰竭自主药物治疗的概念验证研究。
IF 4.5 2区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2026-01-12 DOI: 10.1109/TBME.2026.3653399
Yasuyuki Kataoka, Kazunori Uemura, Mitsuji Sampei, Yukiko Fukuda, Jon Peterson, Riku Funada, Keita Saku, Joe Alexander, Kenji Sunagawa

Objectives: Managing acute heart failure (AHF) involves the dual challenges of stabilizing hemodynamics while minimizing myocardial oxygen consumption. This is a clinically significant and technically challenging issue due to complex cardiovascular interactions and trade-offs between therapeutic goals. We aim to develop a discrete hemodynamic control framework for autonomous drug therapy that enhances both control versatility and clinical potential.

Methods: A comprehensive multi-input, multi-output model was derived to capture causal relationships linking drug infusion, cardiovascular parameters, and hemodynamic responses. Based on this model, we designed an optimal control framework capable of regulating arbitrary hemodynamic targets (e.g., arterial pressure, cardiac output, left atrial pressure, right atrial pressure), minimizing myocardial oxygen consumption, accommodating multiple guideline-concordant drug classes, and enforcing clinical constraints such as dosage limits and contraindications. The discrete design enables compatibility with standard hemodynamic measurements in ICU/CCU settings. Control performance was validated using a cardiovascular simulator.

Results: The system achieved multi-dimensional regulation of hemodynamics and reduced myocardial oxygen consumption across a range of AHF scenarios. It also identified pharmacologically untreatable cases by evaluating the reachable hemodynamic range. Performance remained stable despite inter-patient variability in drug sensitivities, circulatory properties, and baroreflex activity.

Conclusions: The proposed system may function as a physician-assistive tool and a foundation for autonomous drug therapy in AHF.

Significances: By integrating physiologically grounded modeling with practical clinical constraints, the system offers a novel and scalable approach to optimize drug therapy in AHF. It has the potential to improve treatment precision, reduce clinician burden, and advance next-generation critical care.

目的:管理急性心力衰竭(AHF)涉及稳定血流动力学和最小化心肌耗氧量的双重挑战。由于复杂的心血管相互作用和治疗目标之间的权衡,这是一个具有临床意义和技术挑战性的问题。我们的目标是为自主药物治疗开发一个离散的血流动力学控制框架,以提高控制的通用性和临床潜力。方法:建立一个综合的多输入、多输出模型,以捕捉药物输注、心血管参数和血流动力学反应之间的因果关系。基于该模型,我们设计了一个最优控制框架,能够调节任意血流动力学目标(如动脉压、心输出量、左房压、右房压),最大限度地减少心肌耗氧量,适应多种指南一致的药物类别,并执行临床限制,如剂量限制和禁忌症。离散设计使其与ICU/CCU设置中的标准血流动力学测量兼容。使用心血管模拟器验证控制性能。结果:该系统实现了血流动力学的多维调节,降低了心肌耗氧量。它还通过评估可达到的血流动力学范围来确定药理学上无法治疗的病例。尽管患者之间在药物敏感性、循环特性和压力反射活性方面存在差异,但其表现仍保持稳定。结论:该系统可作为AHF的医师辅助工具和自主药物治疗的基础。意义:通过将生理基础模型与实际临床约束相结合,该系统为优化AHF的药物治疗提供了一种新颖且可扩展的方法。它有可能提高治疗精度,减轻临床医生的负担,并推进下一代重症监护。
{"title":"A Discrete Hemodynamic Control Framework: Proof-of-Concept Study for Autonomous Drug Therapy in Acute Heart Failure.","authors":"Yasuyuki Kataoka, Kazunori Uemura, Mitsuji Sampei, Yukiko Fukuda, Jon Peterson, Riku Funada, Keita Saku, Joe Alexander, Kenji Sunagawa","doi":"10.1109/TBME.2026.3653399","DOIUrl":"https://doi.org/10.1109/TBME.2026.3653399","url":null,"abstract":"<p><strong>Objectives: </strong>Managing acute heart failure (AHF) involves the dual challenges of stabilizing hemodynamics while minimizing myocardial oxygen consumption. This is a clinically significant and technically challenging issue due to complex cardiovascular interactions and trade-offs between therapeutic goals. We aim to develop a discrete hemodynamic control framework for autonomous drug therapy that enhances both control versatility and clinical potential.</p><p><strong>Methods: </strong>A comprehensive multi-input, multi-output model was derived to capture causal relationships linking drug infusion, cardiovascular parameters, and hemodynamic responses. Based on this model, we designed an optimal control framework capable of regulating arbitrary hemodynamic targets (e.g., arterial pressure, cardiac output, left atrial pressure, right atrial pressure), minimizing myocardial oxygen consumption, accommodating multiple guideline-concordant drug classes, and enforcing clinical constraints such as dosage limits and contraindications. The discrete design enables compatibility with standard hemodynamic measurements in ICU/CCU settings. Control performance was validated using a cardiovascular simulator.</p><p><strong>Results: </strong>The system achieved multi-dimensional regulation of hemodynamics and reduced myocardial oxygen consumption across a range of AHF scenarios. It also identified pharmacologically untreatable cases by evaluating the reachable hemodynamic range. Performance remained stable despite inter-patient variability in drug sensitivities, circulatory properties, and baroreflex activity.</p><p><strong>Conclusions: </strong>The proposed system may function as a physician-assistive tool and a foundation for autonomous drug therapy in AHF.</p><p><strong>Significances: </strong>By integrating physiologically grounded modeling with practical clinical constraints, the system offers a novel and scalable approach to optimize drug therapy in AHF. It has the potential to improve treatment precision, reduce clinician burden, and advance next-generation critical care.</p>","PeriodicalId":13245,"journal":{"name":"IEEE Transactions on Biomedical Engineering","volume":"PP ","pages":""},"PeriodicalIF":4.5,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145959114","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ECG-Adapt: A Novel Framework for Robust Electrocardiogram Classification Across Diverse Populations and Recording Conditions. ECG-Adapt:跨不同人群和记录条件的稳健心电图分类的新框架。
IF 4.5 2区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2026-01-12 DOI: 10.1109/TBME.2026.3653051
Ahmadreza Argha, Hamid Alinejad-Rokny, Farshid Hajati, Joseph Magdy, Joan Li, Zi Zai Lim, Jennifer Yu, Min Yang, Ken Butcher, Sze-Yuan Ooi, Nigel H Lovell

The electrocardiogram (ECG) is a vital diagnostic tool used to monitor and diagnose a wide range of cardiac conditions. However, ECG signals can exhibit significant variability across different patient populations, recording devices, and environmental conditions, creating challenges in developing universally robust and accurate classification models. This research addresses these challenges by exploring and advancing domain adaptation techniques to enhance the robustness and generalizability of ECG classification models. By leveraging unsupervised domain adaptation (UDA), we aim to mitigate the performance degradation that typically occurs when models trained on one dataset are applied to another, thereby improving diagnostic accuracy and reliability across diverse clinical settings. We introduce ECG-Adapt, an integrated approach that aligns features both within classes and across domains. Unlike existing methods that rely on clustering as a preprocessing step, ECG-Adapt does not require clustering, simplifying the workflow. It further incorporates weakly supervised learning to prevent overfitting of the discriminator to pseudo-labels generated by the classifier, enhancing robustness and generalizability. Applying our novel unsupervised domain adaptation framework led to substantial performance gains. For instance, ECG-Adapt improved the average $F_{1}$-score by 8% on single-lead problems and 7% on 12-lead problems. By leveraging ECG-Adapt, performance degradation when applying models across datasets can be mitigated, enhancing diagnostic accuracy and reliability in diverse clinical settings and demonstrating strong potential for real-world deployment.

心电图(ECG)是一种重要的诊断工具,用于监测和诊断各种心脏疾病。然而,ECG信号在不同的患者群体、记录设备和环境条件下会表现出显著的可变性,这给开发普遍鲁棒和准确的分类模型带来了挑战。本研究通过探索和推进领域自适应技术来提高心电分类模型的鲁棒性和泛化性,从而解决了这些挑战。通过利用无监督域自适应(UDA),我们的目标是减轻在一个数据集上训练的模型应用于另一个数据集时通常发生的性能下降,从而提高不同临床环境下诊断的准确性和可靠性。我们介绍了ECG-Adapt,这是一种集成的方法,可以在类和跨域内对齐功能。与现有的依赖聚类作为预处理步骤的方法不同,ECG-Adapt不需要聚类,简化了工作流程。它进一步结合了弱监督学习,以防止判别器对分类器生成的伪标签过拟合,增强鲁棒性和泛化性。应用我们的新无监督域自适应框架导致了实质性的性能提升。例如,ECG-Adapt将单导联问题的平均得分提高了8%,将12导联问题的平均得分提高了7%。通过利用ECG-Adapt,可以减轻跨数据集应用模型时的性能下降,提高不同临床环境下诊断的准确性和可靠性,并显示出在实际应用中的强大潜力。
{"title":"ECG-Adapt: A Novel Framework for Robust Electrocardiogram Classification Across Diverse Populations and Recording Conditions.","authors":"Ahmadreza Argha, Hamid Alinejad-Rokny, Farshid Hajati, Joseph Magdy, Joan Li, Zi Zai Lim, Jennifer Yu, Min Yang, Ken Butcher, Sze-Yuan Ooi, Nigel H Lovell","doi":"10.1109/TBME.2026.3653051","DOIUrl":"https://doi.org/10.1109/TBME.2026.3653051","url":null,"abstract":"<p><p>The electrocardiogram (ECG) is a vital diagnostic tool used to monitor and diagnose a wide range of cardiac conditions. However, ECG signals can exhibit significant variability across different patient populations, recording devices, and environmental conditions, creating challenges in developing universally robust and accurate classification models. This research addresses these challenges by exploring and advancing domain adaptation techniques to enhance the robustness and generalizability of ECG classification models. By leveraging unsupervised domain adaptation (UDA), we aim to mitigate the performance degradation that typically occurs when models trained on one dataset are applied to another, thereby improving diagnostic accuracy and reliability across diverse clinical settings. We introduce ECG-Adapt, an integrated approach that aligns features both within classes and across domains. Unlike existing methods that rely on clustering as a preprocessing step, ECG-Adapt does not require clustering, simplifying the workflow. It further incorporates weakly supervised learning to prevent overfitting of the discriminator to pseudo-labels generated by the classifier, enhancing robustness and generalizability. Applying our novel unsupervised domain adaptation framework led to substantial performance gains. For instance, ECG-Adapt improved the average $F_{1}$-score by 8% on single-lead problems and 7% on 12-lead problems. By leveraging ECG-Adapt, performance degradation when applying models across datasets can be mitigated, enhancing diagnostic accuracy and reliability in diverse clinical settings and demonstrating strong potential for real-world deployment.</p>","PeriodicalId":13245,"journal":{"name":"IEEE Transactions on Biomedical Engineering","volume":"PP ","pages":""},"PeriodicalIF":4.5,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145959241","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dynamic Cardiac Event Detection from Single-Arm Wearable ECG via a Contrastive Multitask Framework. 基于对比多任务框架的单臂可穿戴心电图动态心脏事件检测。
IF 4.5 2区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2026-01-12 DOI: 10.1109/TBME.2026.3651455
Xianbin Zhang, Haoke Zhang, Gui-Bin Bian, Richard Millham, Victor Hugo C de Albuquerque, Wanqing Wu

Objective: Upper-limb electrocardiogram (ECG) acquired from a single-arm wearable device offers a practical approach for dynamic cardiac monitoring. This study addresses whether single-channel Arm-ECG, characterized by higher noise and distinct morphology compared to standard 12-lead ECG, can reliably detect arrhythmias.

Methods: We propose CLMF-Net, a contrastive multitask framework featuring multiscale convolutional layers to capture temporal-morphological patterns. A fine-grained reconstruction branch preserves subtle clinical features, including P waves and ST segments, while a contrastive module ensures robustness against signal quality variations. The complete framework is trained end-to-end using a unified loss function that balances classification, reconstruction, and consistency objectives.

Results: Validation with 132 elderly participants undergoing six-minute walk tests confirmed the feasibility of dynamic Arm-ECG acquisition. CLMF-Net, trained for arrhythmia identification from single-channel ECG, achieved 95.19% accuracy on the Arm-ECG dataset. It effectively detects arrhythmias, despite challenges with noise and morphology. Generalizability was evidenced by 95.55% and 84.23% accuracy on the Chapman and China Physiological Signal Challenge 2018 (CPSC2018) datasets, respectively, demonstrating the model's ability to classify multiple arrhythmia types across diverse datasets.

Conclusion: Beyond precise identification of arrhythmias, deep feature and morphology analyses showed that the learned representations do not always align with clinically emphasized intervals. This discrepancy highlights both the promise of leveraging Arm-ECG signals and the caution required when translating model-derived interpretations into clinical workflows.

Significance: This study validates the clinical utility of single-arm ECG monitoring, demonstrating feature reliability for real-world cardiac health assessment and supporting the translational potential through consistency with standard recordings.

目的:单臂可穿戴设备获取上肢心电图为动态心脏监测提供了一种实用的方法。与标准12导联心电图相比,单通道臂式心电图具有更高的噪声和不同的形态学特征,该研究探讨了单通道臂式心电图是否能够可靠地检测心律失常。方法:我们提出了CLMF-Net,这是一个具有多尺度卷积层的对比多任务框架,用于捕获时间形态模式。细粒度重建分支保留了细微的临床特征,包括P波和ST段,而对比模块确保了对信号质量变化的鲁棒性。完整的框架是端到端的训练,使用统一的损失函数来平衡分类、重建和一致性目标。结果:对132名老年参与者进行6分钟步行测试的验证证实了动态臂电采集的可行性。CLMF-Net在Arm-ECG数据集上的准确率达到95.19%。它有效地检测心律失常,尽管噪声和形态学的挑战。在查普曼和中国生理信号挑战2018 (CPSC2018)数据集上,该模型的准确率分别为95.55%和84.23%,证明了该模型能够对不同数据集的多种心律失常类型进行分类。结论:除了心律失常的精确识别,深度特征和形态学分析表明,学习表征并不总是与临床强调的间隔一致。这种差异既强调了利用臂电信号的前景,也强调了将模型衍生的解释转化为临床工作流程时所需的谨慎。意义:本研究验证了单臂心电图监测的临床实用性,证明了真实世界心脏健康评估的特征可靠性,并通过与标准记录的一致性支持了转化潜力。
{"title":"Dynamic Cardiac Event Detection from Single-Arm Wearable ECG via a Contrastive Multitask Framework.","authors":"Xianbin Zhang, Haoke Zhang, Gui-Bin Bian, Richard Millham, Victor Hugo C de Albuquerque, Wanqing Wu","doi":"10.1109/TBME.2026.3651455","DOIUrl":"https://doi.org/10.1109/TBME.2026.3651455","url":null,"abstract":"<p><strong>Objective: </strong>Upper-limb electrocardiogram (ECG) acquired from a single-arm wearable device offers a practical approach for dynamic cardiac monitoring. This study addresses whether single-channel Arm-ECG, characterized by higher noise and distinct morphology compared to standard 12-lead ECG, can reliably detect arrhythmias.</p><p><strong>Methods: </strong>We propose CLMF-Net, a contrastive multitask framework featuring multiscale convolutional layers to capture temporal-morphological patterns. A fine-grained reconstruction branch preserves subtle clinical features, including P waves and ST segments, while a contrastive module ensures robustness against signal quality variations. The complete framework is trained end-to-end using a unified loss function that balances classification, reconstruction, and consistency objectives.</p><p><strong>Results: </strong>Validation with 132 elderly participants undergoing six-minute walk tests confirmed the feasibility of dynamic Arm-ECG acquisition. CLMF-Net, trained for arrhythmia identification from single-channel ECG, achieved 95.19% accuracy on the Arm-ECG dataset. It effectively detects arrhythmias, despite challenges with noise and morphology. Generalizability was evidenced by 95.55% and 84.23% accuracy on the Chapman and China Physiological Signal Challenge 2018 (CPSC2018) datasets, respectively, demonstrating the model's ability to classify multiple arrhythmia types across diverse datasets.</p><p><strong>Conclusion: </strong>Beyond precise identification of arrhythmias, deep feature and morphology analyses showed that the learned representations do not always align with clinically emphasized intervals. This discrepancy highlights both the promise of leveraging Arm-ECG signals and the caution required when translating model-derived interpretations into clinical workflows.</p><p><strong>Significance: </strong>This study validates the clinical utility of single-arm ECG monitoring, demonstrating feature reliability for real-world cardiac health assessment and supporting the translational potential through consistency with standard recordings.</p>","PeriodicalId":13245,"journal":{"name":"IEEE Transactions on Biomedical Engineering","volume":"PP ","pages":""},"PeriodicalIF":4.5,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145959282","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
RUNet: A Zero-Calibration Framework for Cross-Domain EEG Decoding via Riemannian and Unsupervised Representation Learning. RUNet:一种基于黎曼和无监督表示学习的跨域EEG解码零校准框架。
IF 4.5 2区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2026-01-12 DOI: 10.1109/TBME.2026.3653024
Jing Jin, Chongfeng Wang, Ren Xu, Xijie He, Xiao Wu, Junxian Li, Weijie Chen, Xingyu Wang, Andrzej Cichocki

Objective: Inter-session and inter-subject variability in electroencephalography (EEG) signals, resulting from individual differences and environmental factors, poses a major challenge for neural decoding in brain-computer interface (BCI) applications.

Methods: To address this issue, we propose RUNet, a zero-calibration motor imagery EEG decoding framework based on Riemannian manifold learning and unsupervised representation learning. RUNet incorporates a multi-scale spatiotemporal convolutional module that jointly captures local global spatial and multi-resolution temporal dynamics features. To enhance the robustness of EEG features against non stationarity, a polysynergistic covariance optimization module is employed, which strengthens the covariance matrix representation through multiple regularizations and adaptive fusion. In addition, RUNet integrates the Riemannian Affine Log Mapping layer, based on Affine-Invariant Transformation and Log-Euclidean Mapping, in an end-to-end manner to mitigate cross-domain covariance drift and enhance domain-invariant feature learning. A transfer learning framework is further integrated into RUNet: during pre-training, an unsupervised contrastive loss is applied to resting-state EEG data to learn domain-invariant spatiotemporal features; during retraining, task-specific data are used to enhance discriminability and feature disentanglement.

Conclusion: Experimental results on the BCI Competition IV 2a, 2b datasets and a self-collected laboratory dataset show that RUNet achieves average cross-session accuracies of 87.19%, 88.03% and 85.45%, and cross-subject accuracies of 68.09%, 78.29% and 87.25%, respectively. On the PhysioNet dataset, a cross-subject accuracy of 78.14% is achieved. These results demonstrate the effectiveness of RUNet's unified pipeline and its robust cross-domain generalization.

目的:脑机接口(BCI)的神经解码技术在脑机接口(BCI)应用中,由于个体差异和环境因素导致的脑电信号在会话间和主体间的差异是一个重大挑战。为了解决这一问题,我们提出了基于黎曼流形学习和无监督表示学习的零校准运动图像脑电解码框架RUNet。RUNet包含一个多尺度时空卷积模块,可以联合捕获局部全局空间和多分辨率时间动态特征。为了增强脑电特征对非平稳性的鲁棒性,采用多协同协方差优化模块,通过多次正则化和自适应融合增强协方差矩阵的表示。此外,RUNet以端到端方式集成了基于仿射不变变换和对数欧氏映射的黎曼仿射日志映射层,以减轻跨域协方差漂移,增强域不变特征学习。将迁移学习框架进一步集成到RUNet中:在预训练过程中,对静息状态脑电数据应用无监督对比损失来学习域不变的时空特征;在再训练过程中,使用特定于任务的数据来增强可辨别性和特征解纠缠。结论:在BCI Competition IV 2a、2b数据集和自选实验室数据集上的实验结果表明,RUNet的平均跨会话准确率分别为87.19%、88.03%和85.45%,跨主题准确率分别为68.09%、78.29%和87.25%。在PhysioNet数据集上,实现了78.14%的跨主题准确率。这些结果证明了RUNet统一管道的有效性和跨域泛化的鲁棒性。
{"title":"RUNet: A Zero-Calibration Framework for Cross-Domain EEG Decoding via Riemannian and Unsupervised Representation Learning.","authors":"Jing Jin, Chongfeng Wang, Ren Xu, Xijie He, Xiao Wu, Junxian Li, Weijie Chen, Xingyu Wang, Andrzej Cichocki","doi":"10.1109/TBME.2026.3653024","DOIUrl":"https://doi.org/10.1109/TBME.2026.3653024","url":null,"abstract":"<p><strong>Objective: </strong>Inter-session and inter-subject variability in electroencephalography (EEG) signals, resulting from individual differences and environmental factors, poses a major challenge for neural decoding in brain-computer interface (BCI) applications.</p><p><strong>Methods: </strong>To address this issue, we propose RUNet, a zero-calibration motor imagery EEG decoding framework based on Riemannian manifold learning and unsupervised representation learning. RUNet incorporates a multi-scale spatiotemporal convolutional module that jointly captures local global spatial and multi-resolution temporal dynamics features. To enhance the robustness of EEG features against non stationarity, a polysynergistic covariance optimization module is employed, which strengthens the covariance matrix representation through multiple regularizations and adaptive fusion. In addition, RUNet integrates the Riemannian Affine Log Mapping layer, based on Affine-Invariant Transformation and Log-Euclidean Mapping, in an end-to-end manner to mitigate cross-domain covariance drift and enhance domain-invariant feature learning. A transfer learning framework is further integrated into RUNet: during pre-training, an unsupervised contrastive loss is applied to resting-state EEG data to learn domain-invariant spatiotemporal features; during retraining, task-specific data are used to enhance discriminability and feature disentanglement.</p><p><strong>Conclusion: </strong>Experimental results on the BCI Competition IV 2a, 2b datasets and a self-collected laboratory dataset show that RUNet achieves average cross-session accuracies of 87.19%, 88.03% and 85.45%, and cross-subject accuracies of 68.09%, 78.29% and 87.25%, respectively. On the PhysioNet dataset, a cross-subject accuracy of 78.14% is achieved. These results demonstrate the effectiveness of RUNet's unified pipeline and its robust cross-domain generalization.</p>","PeriodicalId":13245,"journal":{"name":"IEEE Transactions on Biomedical Engineering","volume":"PP ","pages":""},"PeriodicalIF":4.5,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145959298","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Segment Anything Model 2: An Application to 2D and 3D Medical Images. 细分任何模型2:应用于2D和3D医学图像。
IF 4.5 2区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2026-01-12 DOI: 10.1109/TBME.2026.3653267
Haoyu Dong, Hanxue Gu, Yaqian Chen, Jichen Yang, Yuwen Chen, Maciej A Mazurowski

Segment Anything Model (SAM) has gained significant attention because of its ability to segment a variety of objects in images upon providing a prompt. Recently developed SAM 2 has extended this ability to video segmentation, and by substituting the third spatial dimension in 3D images for the time dimension in videos, it opens an opportunity to apply SAM 2 to 3D medical images. In this paper, we extensively evaluate SAM 2's ability to segment both 2D and 3D medical images using 80 prompt strategies across 21 medical imaging datasets, including 2D modalities (X-ray and ultrasound), 3D modalities (magnetic resonance imaging, computed tomography, and positron emission tomography), and surgical videos. We find that in the 2D setting, SAM 2 performs similarly to SAM, while in the 3D setting we observe that: (1) selecting the first mask is more effective than choosing the one with the highest confidence, (2) prompting the slice with the largest object appears is the most cost-effective strategy when only one slice is prompted, (3) box prompts result in higher performance than point prompts at a slightly higher annotation cost, (4) bidirectional propagation outperforms front-to-end propagation, (5) interactive annotation is rarely effective, (6) SAM 2, without fine-tuning, achieves 3D IoU from 0.32 with a single point prompt to 0.51 with a ground truth mask on one slice, and exceeds 0.8 on certain datasets when using box or ground-truth prompts, a level that begins to approach clinical usefulness. These findings demonstrate that SAM 2's ability to segment 3D medical images can be improved with our proposed strategies over the default ones, providing practical guidance for using SAM 2 for prompt-based 3D medical image segmentation.

任何物体分割模型(SAM)由于能够在提供提示的情况下分割图像中的各种物体而受到广泛关注。最近开发的SAM 2已经将这种能力扩展到视频分割,并且通过用3D图像中的第三个空间维度代替视频中的时间维度,它打开了将SAM 2应用于3D医学图像的机会。在本文中,我们广泛评估了SAM 2在21个医学成像数据集中使用80种提示策略分割2D和3D医学图像的能力,这些数据集包括2D模式(x射线和超声波)、3D模式(磁共振成像、计算机断层扫描和正电子发射断层扫描)和手术视频。我们发现,在2D环境中,SAM 2的表现与SAM相似,而在3D环境中,我们观察到:(1)选择第一个掩码比选择置信度最高的掩码更有效;(2)当只提示一个切片时,提示出现最大对象的切片是最经济有效的策略;(3)框提示比点提示性能更高,注释成本略高;(4)双向传播优于前端传播;(5)交互式注释很少有效;(6)没有微调的SAM 2。使用单点提示实现3D IoU从0.32到0.51,在一个切片上使用真实值掩膜,并且在使用盒或真实值提示时在某些数据集上超过0.8,这一水平开始接近临床有用性。这些结果表明,与默认策略相比,我们提出的策略可以提高SAM 2对3D医学图像的分割能力,为使用SAM 2进行基于提示的3D医学图像分割提供了实践指导。
{"title":"Segment Anything Model 2: An Application to 2D and 3D Medical Images.","authors":"Haoyu Dong, Hanxue Gu, Yaqian Chen, Jichen Yang, Yuwen Chen, Maciej A Mazurowski","doi":"10.1109/TBME.2026.3653267","DOIUrl":"https://doi.org/10.1109/TBME.2026.3653267","url":null,"abstract":"<p><p>Segment Anything Model (SAM) has gained significant attention because of its ability to segment a variety of objects in images upon providing a prompt. Recently developed SAM 2 has extended this ability to video segmentation, and by substituting the third spatial dimension in 3D images for the time dimension in videos, it opens an opportunity to apply SAM 2 to 3D medical images. In this paper, we extensively evaluate SAM 2's ability to segment both 2D and 3D medical images using 80 prompt strategies across 21 medical imaging datasets, including 2D modalities (X-ray and ultrasound), 3D modalities (magnetic resonance imaging, computed tomography, and positron emission tomography), and surgical videos. We find that in the 2D setting, SAM 2 performs similarly to SAM, while in the 3D setting we observe that: (1) selecting the first mask is more effective than choosing the one with the highest confidence, (2) prompting the slice with the largest object appears is the most cost-effective strategy when only one slice is prompted, (3) box prompts result in higher performance than point prompts at a slightly higher annotation cost, (4) bidirectional propagation outperforms front-to-end propagation, (5) interactive annotation is rarely effective, (6) SAM 2, without fine-tuning, achieves 3D IoU from 0.32 with a single point prompt to 0.51 with a ground truth mask on one slice, and exceeds 0.8 on certain datasets when using box or ground-truth prompts, a level that begins to approach clinical usefulness. These findings demonstrate that SAM 2's ability to segment 3D medical images can be improved with our proposed strategies over the default ones, providing practical guidance for using SAM 2 for prompt-based 3D medical image segmentation.</p>","PeriodicalId":13245,"journal":{"name":"IEEE Transactions on Biomedical Engineering","volume":"PP ","pages":""},"PeriodicalIF":4.5,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145959261","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SW-VEI-Net: A Physics-Informed Deep Neural Network for Shear Wave Viscoelasticity Imaging. SW-VEI-Net:用于横波粘弹性成像的物理信息深度神经网络。
IF 4.5 2区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2026-01-12 DOI: 10.1109/TBME.2026.3652121
Haoming Lin, Zhongjun Ma, Yunxiang Wang, Muqing Lin, Shuming Xu, Mian Chen, Minhua Lu, Siping Chen, Xin Chen

Quantitative viscoelasticity imaging via shear wave elastography (SWE) remains challenging due to complex wave physics and limitations of conventional reconstruction methods. To address this, we present SW-VEI-Net, a physics-informed neural network (PINN) that simultaneously reconstructs the shear elastic modulus and viscous modulus by integrating viscoelastic wave equations into a dual-network architecture. The framework employs a dual-loss function to balance data fidelity and physics-based regularization, significantly reducing reliance on empirical data while improving interpretability. Extensive validation on tissue-mimicking phantoms, rat liver fibrosis model, and clinical cases demonstrates that SW-VEI-Net outperforms state-of-the-art SWE methods. Compared to SWENet (a PINN-based method using a linear elastic model), SW-VEI-Net not only enables simultaneous assessment of shear elastic and viscous moduli, but also achieves higher accuracy in shear elastic modulus reconstruction. Furthermore, when benchmarked against the dispersion fitting (DF) method (based on a viscoelastic model), SW-VEI-Net produces comparable viscoelastic parameter maps while exhibiting enhanced robustness and consistency. For liver fibrosis staging, SW-VEI-Net achieves AUC values of 0.85 ($geq$F2) and 0.91 ($=$F4) based on elastic modulus classification, surpassing both SWENet (0.84, 0.85) and DF (0.78, 0.88). Additional validation in healthy volunteers shows strong agreement with a commercial ultrasound system. By synergizing deep learning with fundamental wave physics, this study represents a significant advancement in SWE, offering substantial clinical potential for early detection of hepatic fibrosis and malignant lesions through precise viscoelastic biomarker mapping.

由于复杂的波物理特性和传统重建方法的局限性,通过剪切波弹性成像(SWE)进行定量粘弹性成像仍然具有挑战性。为了解决这个问题,我们提出了SW-VEI-Net,这是一个物理信息神经网络(PINN),通过将粘弹性波动方程集成到双网络架构中,同时重建剪切弹性模量和粘性模量。该框架采用双损失函数来平衡数据保真度和基于物理的正则化,显著减少了对经验数据的依赖,同时提高了可解释性。对组织模拟模型、大鼠肝纤维化模型和临床病例的广泛验证表明,SW-VEI-Net优于最先进的SWE方法。与SWENet(基于pup的线性弹性模型方法)相比,SW-VEI-Net不仅可以同时评估剪切弹性模量和粘性模量,而且在剪切弹性模量重建方面具有更高的精度。此外,当与离散拟合(DF)方法(基于粘弹性模型)进行基准测试时,SW-VEI-Net可以生成可比的粘弹性参数图,同时表现出增强的鲁棒性和一致性。对于肝纤维化分期,基于弹性模量分类,SW-VEI-Net的AUC值分别为0.85 ($geq$ F2)和0.91 ($=$ F4),超过了SWENet(0.84, 0.85)和DF(0.78, 0.88)。在健康志愿者中进行的额外验证显示与商业超声系统有很强的一致性。通过将深度学习与基本波物理相结合,该研究代表了SWE的重大进步,通过精确的粘弹性生物标志物定位,为早期检测肝纤维化和恶性病变提供了巨大的临床潜力。
{"title":"SW-VEI-Net: A Physics-Informed Deep Neural Network for Shear Wave Viscoelasticity Imaging.","authors":"Haoming Lin, Zhongjun Ma, Yunxiang Wang, Muqing Lin, Shuming Xu, Mian Chen, Minhua Lu, Siping Chen, Xin Chen","doi":"10.1109/TBME.2026.3652121","DOIUrl":"https://doi.org/10.1109/TBME.2026.3652121","url":null,"abstract":"<p><p>Quantitative viscoelasticity imaging via shear wave elastography (SWE) remains challenging due to complex wave physics and limitations of conventional reconstruction methods. To address this, we present SW-VEI-Net, a physics-informed neural network (PINN) that simultaneously reconstructs the shear elastic modulus and viscous modulus by integrating viscoelastic wave equations into a dual-network architecture. The framework employs a dual-loss function to balance data fidelity and physics-based regularization, significantly reducing reliance on empirical data while improving interpretability. Extensive validation on tissue-mimicking phantoms, rat liver fibrosis model, and clinical cases demonstrates that SW-VEI-Net outperforms state-of-the-art SWE methods. Compared to SWENet (a PINN-based method using a linear elastic model), SW-VEI-Net not only enables simultaneous assessment of shear elastic and viscous moduli, but also achieves higher accuracy in shear elastic modulus reconstruction. Furthermore, when benchmarked against the dispersion fitting (DF) method (based on a viscoelastic model), SW-VEI-Net produces comparable viscoelastic parameter maps while exhibiting enhanced robustness and consistency. For liver fibrosis staging, SW-VEI-Net achieves AUC values of 0.85 ($geq$F2) and 0.91 ($=$F4) based on elastic modulus classification, surpassing both SWENet (0.84, 0.85) and DF (0.78, 0.88). Additional validation in healthy volunteers shows strong agreement with a commercial ultrasound system. By synergizing deep learning with fundamental wave physics, this study represents a significant advancement in SWE, offering substantial clinical potential for early detection of hepatic fibrosis and malignant lesions through precise viscoelastic biomarker mapping.</p>","PeriodicalId":13245,"journal":{"name":"IEEE Transactions on Biomedical Engineering","volume":"PP ","pages":""},"PeriodicalIF":4.5,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145959277","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Feasibility of dual probe pulse wave imaging of the abdominal aorta. 腹主动脉双探头脉冲波成像的可行性。
IF 4.5 2区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2026-01-12 DOI: 10.1109/TBME.2026.3651584
Larissa C Jansen, Richard G P Lopata, Hans-Martin Schwab

Pulse wave velocity (PWV) is an indirect measure of vessel stiffness, that has the potential to serve as a meaningful parameter for risk stratification of vascular diseases, such as abdominal aortic aneurysms (AAAs). However, assessing the PWV and pulse wave patterns in the complete abdominal aorta using ultrasound-based pulse wave imaging (PWI) is challenging due to the limited field of view (FOV) and contrast of a single ultrasound (US) probe. Hence, an approach is required that can capture distension of aortas with different levels of stiffness accurately in a large FOV. Therefore, we propose PWI based on dual probe, bistatic US. Single and dual probe ultrasound simulations were performed using finite element models of pressure waves propagating in aortas with different stiffness levels. Next, the approach was tested on an aorta and AAA mimicking phantom in a mock circulation setup. The simulation results show that the FOV, image quality, and PWV-estimation accuracy improve when using the dual probe approach (accuracy range: 94.9 - 99.8 $%$; R$^{2}$ range: 0.92 - 0.98) compared to conventional US (accuracy range: 12.6 - 93.9 $%$; R$^{2}$ range: 0.52 - 0.91). The approach was successfully expanded to the phantom study, which demonstrated expected wave patterns within a larger FOV. With dual probe PWI of the non-dilated phantom, the R$^{2}$-value improves (monostatic: 0.95; bistatic: 0.96) compared to use of single probe PWI (0.85). The proposed method shows to be promising for PWV-estimations in less compliant vessels with high wave speeds.

脉搏波速度(PWV)是血管刚度的间接测量,有可能作为血管疾病(如腹主动脉瘤(AAAs))风险分层的有意义参数。然而,由于单个超声(US)探头的视野(FOV)和对比度有限,使用基于超声的脉冲波成像(PWI)评估完整腹主动脉的PWV和脉冲波模式具有挑战性。因此,需要一种能够在大视场中准确捕捉不同硬度的主动脉扩张的方法。因此,我们提出了基于双探头、双稳态US的PWI。利用有限元模型对不同刚度水平主动脉内压力波的传播进行了单探头和双探头超声模拟。接下来,该方法在模拟循环装置中对主动脉和AAA模拟幻影进行了测试。仿真结果表明,与传统的US方法(精度范围:12.6 ~ 93.9 $%$;R$^{2}$范围:0.52 ~ 0.91)相比,采用双探头方法的FOV、图像质量和pwv估计精度(精度范围:94.9 ~ 99.8 $%$;R$^{2}$范围:0.92 ~ 0.98)得到了提高。该方法成功地扩展到幻影研究中,在更大的视场内展示了预期的波模式。与使用单探头PWI(0.85)相比,使用非扩张幻体的双探头PWI, R$^{2}$-值(单静:0.95;双静:0.96)得到改善。所提出的方法对于在不太适应的高波速船舶中进行pwv估计是有希望的。
{"title":"Feasibility of dual probe pulse wave imaging of the abdominal aorta.","authors":"Larissa C Jansen, Richard G P Lopata, Hans-Martin Schwab","doi":"10.1109/TBME.2026.3651584","DOIUrl":"https://doi.org/10.1109/TBME.2026.3651584","url":null,"abstract":"<p><p>Pulse wave velocity (PWV) is an indirect measure of vessel stiffness, that has the potential to serve as a meaningful parameter for risk stratification of vascular diseases, such as abdominal aortic aneurysms (AAAs). However, assessing the PWV and pulse wave patterns in the complete abdominal aorta using ultrasound-based pulse wave imaging (PWI) is challenging due to the limited field of view (FOV) and contrast of a single ultrasound (US) probe. Hence, an approach is required that can capture distension of aortas with different levels of stiffness accurately in a large FOV. Therefore, we propose PWI based on dual probe, bistatic US. Single and dual probe ultrasound simulations were performed using finite element models of pressure waves propagating in aortas with different stiffness levels. Next, the approach was tested on an aorta and AAA mimicking phantom in a mock circulation setup. The simulation results show that the FOV, image quality, and PWV-estimation accuracy improve when using the dual probe approach (accuracy range: 94.9 - 99.8 $%$; R$^{2}$ range: 0.92 - 0.98) compared to conventional US (accuracy range: 12.6 - 93.9 $%$; R$^{2}$ range: 0.52 - 0.91). The approach was successfully expanded to the phantom study, which demonstrated expected wave patterns within a larger FOV. With dual probe PWI of the non-dilated phantom, the R$^{2}$-value improves (monostatic: 0.95; bistatic: 0.96) compared to use of single probe PWI (0.85). The proposed method shows to be promising for PWV-estimations in less compliant vessels with high wave speeds.</p>","PeriodicalId":13245,"journal":{"name":"IEEE Transactions on Biomedical Engineering","volume":"PP ","pages":""},"PeriodicalIF":4.5,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145959297","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multimodal Spiking Neural Network With Generalized Distributive Law for Biosignal and Sensory Fusion. 具有广义分配律的多模态脉冲神经网络用于生物信号和感觉融合。
IF 4.5 2区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2026-01-12 DOI: 10.1109/TBME.2026.3653109
Zenan Huang, Bingrui Guo, Hailing Xu, Haojie Ruan, Donghui Guo

Multimodal signal fusion is a cornerstone of biomedical engineering and intelligent sensing, enabling holistic analysis of heterogeneous sources such as electroencephalography (EEG), peripheral signals, speech, and imaging data. However, integrating diverse modalities in a computationally efficient and biologically plausible manner remains a significant challenge. Transformer-based fusion architectures rely on global cross-attention to integrate multimodal information but incur high computational costs. In contrast, STDP-driven fully connected layers adopt local learning rules, which restrict their ability to autonomously form efficient sparse topologies for complex multimodal tasks. To address these issues, we propose a novel end-to-end framework-the Multimodal Spiking Neural Network (MSNN)-featuring a fusion module grounded in the Generalized Distributive Law (GDL). This principled mechanism provides an efficient and interpretable means of integrating heterogeneous biomedical and sensory signals. The MSNN further incorporates structure-adaptive leaky integrate-and-fire (SALIF) neurons, enabling dynamic optimization of sparse connectivity to enhance fusion efficiency. The proposed MSNN is validated on a range of datasets, demonstrating strong versatility: it achieves binary classification accuracies of 92.29% (valence) and 91.08% (arousal) on the DEAP dataset for affective state decoding and 99.77% on the WESAD dataset for stress detection, while delivering state-of-the-art performance on standard pattern recognition tasks (MNIST & TIDIGITS: 99.01%) and event-driven neuromorphic datasets (MNIST-DVS & N-TIDIGITS: 99.98%). These results demonstrate that MSNN offers an effective and energy-efficient solution for multimodal sensor fusion in biomedical and intelligent sensing applications.

多模态信号融合是生物医学工程和智能传感的基石,能够对脑电图(EEG)、外围信号、语音和成像数据等异构源进行整体分析。然而,以计算效率和生物学上合理的方式整合多种模式仍然是一个重大挑战。基于变压器的融合体系结构依赖全局交叉关注来集成多模态信息,但计算成本较高。相比之下,stdp驱动的全连接层采用局部学习规则,这限制了它们在复杂多模态任务中自主形成高效稀疏拓扑的能力。为了解决这些问题,我们提出了一种新的端到端框架-多模态峰值神经网络(MSNN)-其特征是基于广义分配律(GDL)的融合模块。这一原则机制提供了一种有效且可解释的整合异质生物医学和感官信号的方法。MSNN进一步引入了结构自适应的泄漏集成与火灾(SALIF)神经元,实现了稀疏连接的动态优化,以提高融合效率。所提出的MSNN在一系列数据集上进行了验证,显示出强大的通用性:它在情感状态解码的DEAP数据集上实现了92.29%(价态)和91.08%(唤醒)的二元分类准确率,在压力检测的WESAD数据集上实现了99.77%的二元分类准确率,同时在标准模式识别任务(MNIST和TIDIGITS: 99.01%)和事件驱动的神经形态数据集(MNIST- dvs和N-TIDIGITS: 99.98%)上提供了最先进的性能。这些结果表明,MSNN为生物医学和智能传感应用中的多模态传感器融合提供了一种有效且节能的解决方案。
{"title":"Multimodal Spiking Neural Network With Generalized Distributive Law for Biosignal and Sensory Fusion.","authors":"Zenan Huang, Bingrui Guo, Hailing Xu, Haojie Ruan, Donghui Guo","doi":"10.1109/TBME.2026.3653109","DOIUrl":"https://doi.org/10.1109/TBME.2026.3653109","url":null,"abstract":"<p><p>Multimodal signal fusion is a cornerstone of biomedical engineering and intelligent sensing, enabling holistic analysis of heterogeneous sources such as electroencephalography (EEG), peripheral signals, speech, and imaging data. However, integrating diverse modalities in a computationally efficient and biologically plausible manner remains a significant challenge. Transformer-based fusion architectures rely on global cross-attention to integrate multimodal information but incur high computational costs. In contrast, STDP-driven fully connected layers adopt local learning rules, which restrict their ability to autonomously form efficient sparse topologies for complex multimodal tasks. To address these issues, we propose a novel end-to-end framework-the Multimodal Spiking Neural Network (MSNN)-featuring a fusion module grounded in the Generalized Distributive Law (GDL). This principled mechanism provides an efficient and interpretable means of integrating heterogeneous biomedical and sensory signals. The MSNN further incorporates structure-adaptive leaky integrate-and-fire (SALIF) neurons, enabling dynamic optimization of sparse connectivity to enhance fusion efficiency. The proposed MSNN is validated on a range of datasets, demonstrating strong versatility: it achieves binary classification accuracies of 92.29% (valence) and 91.08% (arousal) on the DEAP dataset for affective state decoding and 99.77% on the WESAD dataset for stress detection, while delivering state-of-the-art performance on standard pattern recognition tasks (MNIST & TIDIGITS: 99.01%) and event-driven neuromorphic datasets (MNIST-DVS & N-TIDIGITS: 99.98%). These results demonstrate that MSNN offers an effective and energy-efficient solution for multimodal sensor fusion in biomedical and intelligent sensing applications.</p>","PeriodicalId":13245,"journal":{"name":"IEEE Transactions on Biomedical Engineering","volume":"PP ","pages":""},"PeriodicalIF":4.5,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145959214","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
FocFormer-UNet: UNet With Focal Modulation and Transformers for Ultrasound Needle Tracking Using Photoacoustic Ground Truth. FocFormer-UNet: UNet与聚焦调制和变压器的超声针跟踪利用光声地面真相。
IF 4.5 2区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2026-01-12 DOI: 10.1109/TBME.2026.3652428
S M Mahim, Md Emamul Hossen, Manojit Pramanik

Ultrasound (US)-guided needle tracking is a critical procedure for various clinical diagnoses and treatment planning, highlighting the need for improved visualization methods to enhance accuracy. While deep learning (DL) techniques have been employed to boost needle visibility in US images, they often rely heavily on manual annotations or simulated datasets, which can introduce biases and limit real-world applicability. Photoacoustic (PA) imaging, known for its high contrast capabilities, offers a promising solution by providing superior needle visualization compared to conventional US images. In this work, we present FocFormer-UNet, a DL network that leverages PA images of the needle as ground truth for training, eliminating the need for manual annotations. This approach significantly improves needle localization accuracy in US images, reducing the reliance on time-consuming manual labeling. FocFormer-UNet achieves excellent needle localization accuracy, demonstrated by a modified Hausdorff distance of 1.43 1.23 and a targeting error of 1.22 1.14 on human clinical dataset, indicating minimal deviation from actual needle positions. Our method offers robust needle tracking across diverse US systems, improving the precision and reliability of US-guided needle insertion procedures. It holds great promise for advancing AI-driven clinical support tools in medical imaging. The following is the source code: https://github.com/DeeplearningBILAB/FocFormer-UNet. Open Science Framework (OSF) provides datasets and checkpoints at: https://osf.io/yxt9v/.

超声(US)引导的针头跟踪是各种临床诊断和治疗计划的关键程序,强调需要改进可视化方法以提高准确性。虽然深度学习(DL)技术已被用于提高美国图像的针尖可见性,但它们通常严重依赖于手动注释或模拟数据集,这可能会引入偏见并限制现实世界的适用性。光声成像(PA)以其高对比度能力而闻名,与传统的美国图像相比,它提供了一种很有前途的解决方案,提供了优越的针头可视化。在这项工作中,我们提出了FocFormer-UNet,这是一种深度学习网络,它利用针的PA图像作为训练的基础真理,消除了手动注释的需要。这种方法显著提高了美国图像中针头定位的准确性,减少了对耗时的人工标记的依赖。FocFormer-UNet实现了优异的针头定位精度,在人类临床数据集上,改进的Hausdorff距离为1.43 1.23,靶向误差为1.22 1.14,与实际针头位置的偏差最小。我们的方法在不同的美国系统中提供强大的针头跟踪,提高了美国引导的针头插入程序的精度和可靠性。它为推进医学成像中人工智能驱动的临床支持工具提供了巨大的希望。源代码如下:https://github.com/DeeplearningBILAB/FocFormer-UNet。开放科学框架(OSF)提供数据集和检查点:https://osf.io/yxt9v/。
{"title":"FocFormer-UNet: UNet With Focal Modulation and Transformers for Ultrasound Needle Tracking Using Photoacoustic Ground Truth.","authors":"S M Mahim, Md Emamul Hossen, Manojit Pramanik","doi":"10.1109/TBME.2026.3652428","DOIUrl":"https://doi.org/10.1109/TBME.2026.3652428","url":null,"abstract":"<p><p>Ultrasound (US)-guided needle tracking is a critical procedure for various clinical diagnoses and treatment planning, highlighting the need for improved visualization methods to enhance accuracy. While deep learning (DL) techniques have been employed to boost needle visibility in US images, they often rely heavily on manual annotations or simulated datasets, which can introduce biases and limit real-world applicability. Photoacoustic (PA) imaging, known for its high contrast capabilities, offers a promising solution by providing superior needle visualization compared to conventional US images. In this work, we present FocFormer-UNet, a DL network that leverages PA images of the needle as ground truth for training, eliminating the need for manual annotations. This approach significantly improves needle localization accuracy in US images, reducing the reliance on time-consuming manual labeling. FocFormer-UNet achieves excellent needle localization accuracy, demonstrated by a modified Hausdorff distance of 1.43 1.23 and a targeting error of 1.22 1.14 on human clinical dataset, indicating minimal deviation from actual needle positions. Our method offers robust needle tracking across diverse US systems, improving the precision and reliability of US-guided needle insertion procedures. It holds great promise for advancing AI-driven clinical support tools in medical imaging. The following is the source code: https://github.com/DeeplearningBILAB/FocFormer-UNet. Open Science Framework (OSF) provides datasets and checkpoints at: https://osf.io/yxt9v/.</p>","PeriodicalId":13245,"journal":{"name":"IEEE Transactions on Biomedical Engineering","volume":"PP ","pages":""},"PeriodicalIF":4.5,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145959234","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Transactions on Biomedical Engineering
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1