Pub Date : 2026-03-01DOI: 10.1109/JBHI.2025.3604620
Honggang Liu, Han Yang, Dongjun Liu, Xuanyu Jin, Yong Peng, Wanzeng Kong
Biometric recognition using visually evoked potentials (VEPs), a type of neural response to visual stimuli recorded via electroencephalography (EEG), has shown great promise. However, the non-stationary nature of EEG signals poses a major challenge in cross-session scenarios, where data collected on different days often leads to performance degradation. To address this, we propose the Discriminative Robust Feature Network (DRFNet) to enhance the robustness and inter-subject discriminability of identity representations across sessions. DRFNet incorporates two key components: (1) A log-power transformation that amplifies inter-individual differences by capturing non-linear energy patterns from VEP features via signal squaring and logarithmic scaling; and (2) A hierarchical normalization strategy with adaptive attention to balance discriminative identity cues with inter-session invariance by stabilizing feature distributions across multiple levels (feature map, batch, and sample). On two public multi-session SSVEP datasets (Dataset A: 30 subjects, 6 s trials; Dataset B: 54 subjects, 4 s trials), our model outperformed state-of-the-art methods, achieving identification accuracies of 92.92% and 86.30%, and equal error rates of 3.92% and 4.09%, respectively. Further analysis demonstrates that filter bank processing and a reduced set of parietal-occipital electrodes can provide more discriminative features while offering a practical path toward system lightweighting.
{"title":"DRFNet: Enhancing Identity Discriminability and Feature Robustness for Cross-Session VEP-Based EEG Biometrics.","authors":"Honggang Liu, Han Yang, Dongjun Liu, Xuanyu Jin, Yong Peng, Wanzeng Kong","doi":"10.1109/JBHI.2025.3604620","DOIUrl":"10.1109/JBHI.2025.3604620","url":null,"abstract":"<p><p>Biometric recognition using visually evoked potentials (VEPs), a type of neural response to visual stimuli recorded via electroencephalography (EEG), has shown great promise. However, the non-stationary nature of EEG signals poses a major challenge in cross-session scenarios, where data collected on different days often leads to performance degradation. To address this, we propose the Discriminative Robust Feature Network (DRFNet) to enhance the robustness and inter-subject discriminability of identity representations across sessions. DRFNet incorporates two key components: (1) A log-power transformation that amplifies inter-individual differences by capturing non-linear energy patterns from VEP features via signal squaring and logarithmic scaling; and (2) A hierarchical normalization strategy with adaptive attention to balance discriminative identity cues with inter-session invariance by stabilizing feature distributions across multiple levels (feature map, batch, and sample). On two public multi-session SSVEP datasets (Dataset A: 30 subjects, 6 s trials; Dataset B: 54 subjects, 4 s trials), our model outperformed state-of-the-art methods, achieving identification accuracies of 92.92% and 86.30%, and equal error rates of 3.92% and 4.09%, respectively. Further analysis demonstrates that filter bank processing and a reduced set of parietal-occipital electrodes can provide more discriminative features while offering a practical path toward system lightweighting.</p>","PeriodicalId":13073,"journal":{"name":"IEEE Journal of Biomedical and Health Informatics","volume":"PP ","pages":"2181-2193"},"PeriodicalIF":6.8,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144952105","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-03-01DOI: 10.1109/JBHI.2025.3605298
Serhii Reznichenko, John Whitaker, Zixuan Ni, Amir AbdelWahab, Usha Tedrow, John L Sapp, Shijie Zhou
Identifying the onset of the QRS complex is an important step for localizing the site of origin (SOO) of premature ventricular complexes (PVCs) and the exit site of Ventricular Tachycardia (VT). However, identifying the QRS onset is challenging due to signal noise, baseline wander, motion artifact, and muscle artifact. Furthermore, in VT, QRS onset detection is especially difficult due to the overlap with repolarization from the prior beat. In this study, 7706 captured bipolar pacing beats (Stim-QRS < 40 ms) pooled from 384 anatomically widely dispersed pacing sites of 15 patients were used for an attention-based Swin-Unet neural network. We also utilized a self-supervised pretraining technique using 88253 unannotated ECG records. The algorithm correctly identified most of the onsets for ECG signals with bipolar pacing-site ECG dataset, achieving a sensitivity of 0.958 and a 1.924 ± 4.275 milliseconds prediction error. Our algorithm also achieved a prediction error of 1.518 ± 8.702 milliseconds for the QT Database (QTDB), and a prediction error of 1.333 ± 7.575 milliseconds for the Lobachevsky University Electrocardiography Database (LUDB) public datasets. We also achieved high inter-dataset performance, which supports the practical performance of the method, with a sensitivity of 0.927 for QTDB and a sensitivity of 0.981 for LUDB. The AI model achieves accurate onset detection in paced ECGs with spike-removed inputs, providing a controlled, high-fidelity training setting for future efforts in generalizing to VT ECGs. The use of self-supervised pretraining further improves the detector's accuracy, showcasing the applicability of the approach and using unannotated ECG signals for downstream tasks.
确定QRS复合体的起始点是确定室性早搏复合体(室性早搏复合体)起始点和室性心动过速(室性心动过速)结束点的重要步骤。然而,由于信号噪声、基线漂移、运动伪影和肌肉伪影,识别QRS的发病具有挑战性。此外,在VT中,由于与前拍的复极化重叠,QRS的开始检测特别困难。在这项研究中,从解剖学上广泛分布的384个起搏部位收集了15例患者的7706次双极起搏心跳(Stim-QRS < 40ms),用于基于注意的swun - unet神经网络。我们还利用88253个无注释心电图记录进行了自监督预训练技术。该算法正确识别了双极起搏点心电数据集的大部分起搏信号,预测灵敏度为0.958,预测误差为1.924±4.275毫秒。该算法对QT数据库(QTDB)的预测误差为1.518±8.702毫秒,对Lobachevsky University Electrocardiography Database (LUDB)公共数据集的预测误差为1.333±7.575毫秒。我们还实现了高数据集间性能,这支持了该方法的实际性能,QTDB的灵敏度为0.927,LUDB的灵敏度为0.981。人工智能模型在去除尖刺输入的有节奏心电图中实现了准确的发作检测,为将来推广到VT心电图提供了一个可控的、高保真的训练设置。使用自监督预训练进一步提高了检测器的准确性,展示了该方法的适用性,并将未注释的心电信号用于下游任务。
{"title":"AI-Based QRS Onset Detection in the Early Ventricular Activation Site ECGs.","authors":"Serhii Reznichenko, John Whitaker, Zixuan Ni, Amir AbdelWahab, Usha Tedrow, John L Sapp, Shijie Zhou","doi":"10.1109/JBHI.2025.3605298","DOIUrl":"10.1109/JBHI.2025.3605298","url":null,"abstract":"<p><p>Identifying the onset of the QRS complex is an important step for localizing the site of origin (SOO) of premature ventricular complexes (PVCs) and the exit site of Ventricular Tachycardia (VT). However, identifying the QRS onset is challenging due to signal noise, baseline wander, motion artifact, and muscle artifact. Furthermore, in VT, QRS onset detection is especially difficult due to the overlap with repolarization from the prior beat. In this study, 7706 captured bipolar pacing beats (Stim-QRS < 40 ms) pooled from 384 anatomically widely dispersed pacing sites of 15 patients were used for an attention-based Swin-Unet neural network. We also utilized a self-supervised pretraining technique using 88253 unannotated ECG records. The algorithm correctly identified most of the onsets for ECG signals with bipolar pacing-site ECG dataset, achieving a sensitivity of 0.958 and a 1.924 ± 4.275 milliseconds prediction error. Our algorithm also achieved a prediction error of 1.518 ± 8.702 milliseconds for the QT Database (QTDB), and a prediction error of 1.333 ± 7.575 milliseconds for the Lobachevsky University Electrocardiography Database (LUDB) public datasets. We also achieved high inter-dataset performance, which supports the practical performance of the method, with a sensitivity of 0.927 for QTDB and a sensitivity of 0.981 for LUDB. The AI model achieves accurate onset detection in paced ECGs with spike-removed inputs, providing a controlled, high-fidelity training setting for future efforts in generalizing to VT ECGs. The use of self-supervised pretraining further improves the detector's accuracy, showcasing the applicability of the approach and using unannotated ECG signals for downstream tasks.</p>","PeriodicalId":13073,"journal":{"name":"IEEE Journal of Biomedical and Health Informatics","volume":"PP ","pages":"2073-2086"},"PeriodicalIF":6.8,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145006089","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-03-01DOI: 10.1109/JBHI.2025.3614546
Haridimos Kondylakis, Richard Osuala, Xenia Puig-Bosch, Noussair Lazrak, Oliver Diaz, Kaisar Kushibar, Ioanna Chouvarda, Stefanie Charalambous, Martijn Pa Starmans, Sara Colantonio, Nikos Tachos, Smriti Joshi, Henry C Woodruff, Zohaib Salahuddin, Gianna Tsakou, Susanna Ausso, Leonor Cerda Alberich, Nickolas Papanikolaou, Philippe Lambin, Kostas Marias, Manolis Tsiknakis, Dimitrios I Fotiadis, Luis Marti-Bonmati, Karim Lekadir
Recent advancements in artificial intelligence (AI) and the vast data generated by modern clinical systems have driven the development of AI solutions in medical imaging, encompassing image reconstruction, segmentation, diagnosis, and treatment planning. Despite these successes and potential, many stakeholders worry about the risks and ethical implications of imaging AI, viewing it as complex, opaque, and challenging to understand, use, and trust in critical clinical applications. The FUTURE-AI guideline for trustworthy AI in healthcare was established based on six guiding principles: Fairness, Universality, Traceability, Usability, Robustness, and Explainability. Through international consensus, a set of recommendations was defined, covering the entire lifecycle of medical AI tools, from design, development, and validation to regulation, deployment, and monitoring. In this paper, we describe how these specific recommendations can be instantiated in the domain of medical imaging, providing an overview of current best practices along with guidelines and concrete metrics on how those recommendations could be met, offering a valuable resource to the international medical imaging community.
{"title":"A Review of Methods for Trustworthy AI in Medical Imaging: The FUTURE-AI Guidelines.","authors":"Haridimos Kondylakis, Richard Osuala, Xenia Puig-Bosch, Noussair Lazrak, Oliver Diaz, Kaisar Kushibar, Ioanna Chouvarda, Stefanie Charalambous, Martijn Pa Starmans, Sara Colantonio, Nikos Tachos, Smriti Joshi, Henry C Woodruff, Zohaib Salahuddin, Gianna Tsakou, Susanna Ausso, Leonor Cerda Alberich, Nickolas Papanikolaou, Philippe Lambin, Kostas Marias, Manolis Tsiknakis, Dimitrios I Fotiadis, Luis Marti-Bonmati, Karim Lekadir","doi":"10.1109/JBHI.2025.3614546","DOIUrl":"10.1109/JBHI.2025.3614546","url":null,"abstract":"<p><p>Recent advancements in artificial intelligence (AI) and the vast data generated by modern clinical systems have driven the development of AI solutions in medical imaging, encompassing image reconstruction, segmentation, diagnosis, and treatment planning. Despite these successes and potential, many stakeholders worry about the risks and ethical implications of imaging AI, viewing it as complex, opaque, and challenging to understand, use, and trust in critical clinical applications. The FUTURE-AI guideline for trustworthy AI in healthcare was established based on six guiding principles: Fairness, Universality, Traceability, Usability, Robustness, and Explainability. Through international consensus, a set of recommendations was defined, covering the entire lifecycle of medical AI tools, from design, development, and validation to regulation, deployment, and monitoring. In this paper, we describe how these specific recommendations can be instantiated in the domain of medical imaging, providing an overview of current best practices along with guidelines and concrete metrics on how those recommendations could be met, offering a valuable resource to the international medical imaging community.</p>","PeriodicalId":13073,"journal":{"name":"IEEE Journal of Biomedical and Health Informatics","volume":"PP ","pages":"2299-2315"},"PeriodicalIF":6.8,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145191511","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-03-01DOI: 10.1109/JBHI.2025.3542561
Sibo Qiao, Qiang Guo, Fengdong Shi, Min Wang, Haohao Zhu, Fazlullah Khan, Joel J P C Rodrigues, Zhihan Lyu
The exponential growth of sensitive patient information and diagnostic records in digital healthcare systems has increased the complexity of data protection, while frequent medical data breaches severely compromise system security and reliability. Existing privacy protection techniques often lack robustness and real-time capabilities in high-noise, high-packet-loss, and dynamic network environments, limiting their effectiveness in detecting healthcare data leaks. To address these challenges, we propose a Swarm Intelligence-Based Network Watermarking (SIBW) method for real-time privacy data leakage detection in digital healthcare systems. SIBW integrates fountain codes with outer error correction codes and employs a Multi-Phase Synergistic Swarm Optimization Algorithm (MPSSOA) to dynamically optimize encoding parameters, significantly enhancing the robustness and interference resistance of watermark detection. Additionally, a reliable synchronization sequence and lightweight embedding mechanism are designed to ensure adaptability to complex, dynamic networks. Experimental results demonstrate that SIBW achieves over 90% detection accuracy under high latency jitter and packet loss conditions, surpassing existing methods in both robustness and efficiency. With a compact design of only 3.7 MB, SIBW is particularly suited for rapid deployment in resource-constrained digital healthcare systems.
{"title":"SIBW: A Swarm Intelligence-Based Network Flow Watermarking Approach for Privacy Leakage Detection in Digital Healthcare Systems.","authors":"Sibo Qiao, Qiang Guo, Fengdong Shi, Min Wang, Haohao Zhu, Fazlullah Khan, Joel J P C Rodrigues, Zhihan Lyu","doi":"10.1109/JBHI.2025.3542561","DOIUrl":"10.1109/JBHI.2025.3542561","url":null,"abstract":"<p><p>The exponential growth of sensitive patient information and diagnostic records in digital healthcare systems has increased the complexity of data protection, while frequent medical data breaches severely compromise system security and reliability. Existing privacy protection techniques often lack robustness and real-time capabilities in high-noise, high-packet-loss, and dynamic network environments, limiting their effectiveness in detecting healthcare data leaks. To address these challenges, we propose a Swarm Intelligence-Based Network Watermarking (SIBW) method for real-time privacy data leakage detection in digital healthcare systems. SIBW integrates fountain codes with outer error correction codes and employs a Multi-Phase Synergistic Swarm Optimization Algorithm (MPSSOA) to dynamically optimize encoding parameters, significantly enhancing the robustness and interference resistance of watermark detection. Additionally, a reliable synchronization sequence and lightweight embedding mechanism are designed to ensure adaptability to complex, dynamic networks. Experimental results demonstrate that SIBW achieves over 90% detection accuracy under high latency jitter and packet loss conditions, surpassing existing methods in both robustness and efficiency. With a compact design of only 3.7 MB, SIBW is particularly suited for rapid deployment in resource-constrained digital healthcare systems.</p>","PeriodicalId":13073,"journal":{"name":"IEEE Journal of Biomedical and Health Informatics","volume":"PP ","pages":"1912-1924"},"PeriodicalIF":6.8,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143556727","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-03-01DOI: 10.1109/JBHI.2025.3599210
Huijie Li, Yide Yu, Si Shi, Anmin Hu, Jian Huo, Wei Lin, Chaoran Wu, Wuman Luo
Automated control of personalized multiple anesthetics in clinical Total Intravenous Anesthesia (TIVA) is crucial yet challenging. Current systems, including target-controlled infusion (TCI) and closed-loop systems, either rely on relatively static pharmacokinetic/pharmacodynamic (PK/PD) models or focus on single anesthetic control. So they limit both personalization and collaborative control. To address these issues, we propose a novel Value Decomposition Multi-Agent Deep Reinforcement Learning (VD-MADRL) framework based on Markov Game (MG) for Personalized Multiple Anesthetics Control in a Closed-Loop system (PMAC-CL). VD-MADRL optimizes the collaboration between two anesthetics propofol (Agent I) and remifentanil (Agent II) by leveraging a MG to identify optimal actions among heterogeneous agents. We employ various value function decomposition methods to resolve the credit allocation problem and enhance collaborative control. We also introduce a multivariate environment model based on random forest (RF) for anesthesia state simulation. To ensure data validity, we design a data resampling and alignment technique to synchronize trajectory data from different devices, avoiding gradient explosion and maintaining conformity to Markov property. Extensive experiments on general and thoracic surgery datasets demonstrate that VD-MADRL provides more refined dose adjustments and maintains multiple anesthesia state indicators more stably at target levels compared to human experience. Especially, the best-performing algorithm, VDN in general surgery with online training, achieved a 16.4% increase in cumulative reward (CR) and a 58.0% reduction in mean MDPE compared to human experience. This demonstrates its great clinical value.
{"title":"Value Decomposition-Based Multi-Agent Learning for Anesthetics Collaborative Control.","authors":"Huijie Li, Yide Yu, Si Shi, Anmin Hu, Jian Huo, Wei Lin, Chaoran Wu, Wuman Luo","doi":"10.1109/JBHI.2025.3599210","DOIUrl":"10.1109/JBHI.2025.3599210","url":null,"abstract":"<p><p>Automated control of personalized multiple anesthetics in clinical Total Intravenous Anesthesia (TIVA) is crucial yet challenging. Current systems, including target-controlled infusion (TCI) and closed-loop systems, either rely on relatively static pharmacokinetic/pharmacodynamic (PK/PD) models or focus on single anesthetic control. So they limit both personalization and collaborative control. To address these issues, we propose a novel Value Decomposition Multi-Agent Deep Reinforcement Learning (VD-MADRL) framework based on Markov Game (MG) for Personalized Multiple Anesthetics Control in a Closed-Loop system (PMAC-CL). VD-MADRL optimizes the collaboration between two anesthetics propofol (Agent I) and remifentanil (Agent II) by leveraging a MG to identify optimal actions among heterogeneous agents. We employ various value function decomposition methods to resolve the credit allocation problem and enhance collaborative control. We also introduce a multivariate environment model based on random forest (RF) for anesthesia state simulation. To ensure data validity, we design a data resampling and alignment technique to synchronize trajectory data from different devices, avoiding gradient explosion and maintaining conformity to Markov property. Extensive experiments on general and thoracic surgery datasets demonstrate that VD-MADRL provides more refined dose adjustments and maintains multiple anesthesia state indicators more stably at target levels compared to human experience. Especially, the best-performing algorithm, VDN in general surgery with online training, achieved a 16.4% increase in cumulative reward (CR) and a 58.0% reduction in mean MDPE compared to human experience. This demonstrates its great clinical value.</p>","PeriodicalId":13073,"journal":{"name":"IEEE Journal of Biomedical and Health Informatics","volume":"PP ","pages":"2167-2180"},"PeriodicalIF":6.8,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144882809","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-03-01DOI: 10.1109/JBHI.2025.3592643
Jian Zhong, Haochen Zhao, Xiao Liang, Qichang Zhao, Jianxin Wang
Accurately predicting drug-drug interaction events (DDIEs) is critical for improving medication safety and guiding clinical decision-making. However, existing graph neural network (GNN)-based methods often struggle to effectively integrate multi-view features and generalize to novel or understudied drugs. To address these limitations, we propose MRLF-DDI, a multi-view representation learning framework that jointly models information from individual drug features, local interaction contexts, and global interaction patterns. MRLF-DDI introduces the use of atom-level structural features enriched with bond angle information-marking the first incorporation of this geometry-aware feature in DDIE prediction. It further employs a multi-granularity GNN and a gated knowledge transfer strategy to enhance feature learning and cold-start generalization. Extensive experiments on benchmark datasets demonstrate that MRLF-DDI achieves superior performance in both warm-start and cold-start scenarios. Case studies and visualization analyses further highlight its practical utility in identifying clinically relevant DDIEs.
{"title":"MRLF-DDI: A Multi-View Representation Learning Framework for Drug-Drug Interaction Event Prediction.","authors":"Jian Zhong, Haochen Zhao, Xiao Liang, Qichang Zhao, Jianxin Wang","doi":"10.1109/JBHI.2025.3592643","DOIUrl":"10.1109/JBHI.2025.3592643","url":null,"abstract":"<p><p>Accurately predicting drug-drug interaction events (DDIEs) is critical for improving medication safety and guiding clinical decision-making. However, existing graph neural network (GNN)-based methods often struggle to effectively integrate multi-view features and generalize to novel or understudied drugs. To address these limitations, we propose MRLF-DDI, a multi-view representation learning framework that jointly models information from individual drug features, local interaction contexts, and global interaction patterns. MRLF-DDI introduces the use of atom-level structural features enriched with bond angle information-marking the first incorporation of this geometry-aware feature in DDIE prediction. It further employs a multi-granularity GNN and a gated knowledge transfer strategy to enhance feature learning and cold-start generalization. Extensive experiments on benchmark datasets demonstrate that MRLF-DDI achieves superior performance in both warm-start and cold-start scenarios. Case studies and visualization analyses further highlight its practical utility in identifying clinically relevant DDIEs.</p>","PeriodicalId":13073,"journal":{"name":"IEEE Journal of Biomedical and Health Informatics","volume":"PP ","pages":"2217-2227"},"PeriodicalIF":6.8,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144707316","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-03-01DOI: 10.1109/JBHI.2025.3593617
Mengni Zhou, Rongkun Mi, Ang Zhao, Xin Wen, Yan Niu, Xubin Wu, Yanqing Dong, Yaru Xu, Yanan Li, Jie Xiang
Major Depressive Disorder (MDD) is a common mental disorder, and making an early and accurate diagnosis is crucial for effective treatment. Functional Connectivity Network (FCN) constructed based on functional Magnetic Resonance Imaging (fMRI) have demonstrated the potential to reveal the mechanisms underlying brain abnormalities. Deep learning has been widely employed to extract features from FCN, but existing methods typically operate directly on the network, failing to fully exploit their deep information. Although graph coarsening techniques offer certain advantages in extracting the brain's complex structure, they may also result in the loss of critical information. To address this issue, we propose the Multi-Granularity Brain Networks Fusion (MGBNF) framework. MGBNF models brain networks through multi-granularity analysis and constructs combinatorial modules to enhance feature extraction. Finally, the Constrained Attention Pooling (CAP) mechanism is employed to achieve the effective integration of multi-channel features. In the feature extraction stage, the parameter sharing mechanism is introduced and applied to multiple channels to capture similar connectivity patterns between different channels while reducing the number of parameters. We validate the effectiveness of the MGBNF model on multiple classification tasks and various brain atlases. The results demonstrate that MGBNF outperforms baseline models in terms of classification performance. Ablation experiments further validate its effectiveness. In addition, we conducted a thorough analysis of the variability of different subtypes of MDD by multiple classification tasks, and the results support further clinical applications.
{"title":"Diagnosis of Major Depressive Disorder Based on Multi-Granularity Brain Networks Fusion.","authors":"Mengni Zhou, Rongkun Mi, Ang Zhao, Xin Wen, Yan Niu, Xubin Wu, Yanqing Dong, Yaru Xu, Yanan Li, Jie Xiang","doi":"10.1109/JBHI.2025.3593617","DOIUrl":"10.1109/JBHI.2025.3593617","url":null,"abstract":"<p><p>Major Depressive Disorder (MDD) is a common mental disorder, and making an early and accurate diagnosis is crucial for effective treatment. Functional Connectivity Network (FCN) constructed based on functional Magnetic Resonance Imaging (fMRI) have demonstrated the potential to reveal the mechanisms underlying brain abnormalities. Deep learning has been widely employed to extract features from FCN, but existing methods typically operate directly on the network, failing to fully exploit their deep information. Although graph coarsening techniques offer certain advantages in extracting the brain's complex structure, they may also result in the loss of critical information. To address this issue, we propose the Multi-Granularity Brain Networks Fusion (MGBNF) framework. MGBNF models brain networks through multi-granularity analysis and constructs combinatorial modules to enhance feature extraction. Finally, the Constrained Attention Pooling (CAP) mechanism is employed to achieve the effective integration of multi-channel features. In the feature extraction stage, the parameter sharing mechanism is introduced and applied to multiple channels to capture similar connectivity patterns between different channels while reducing the number of parameters. We validate the effectiveness of the MGBNF model on multiple classification tasks and various brain atlases. The results demonstrate that MGBNF outperforms baseline models in terms of classification performance. Ablation experiments further validate its effectiveness. In addition, we conducted a thorough analysis of the variability of different subtypes of MDD by multiple classification tasks, and the results support further clinical applications.</p>","PeriodicalId":13073,"journal":{"name":"IEEE Journal of Biomedical and Health Informatics","volume":"PP ","pages":"2328-2339"},"PeriodicalIF":6.8,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144742034","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-03-01DOI: 10.1109/JBHI.2025.3598354
Hika Barki, Ngoc-Dau Mai, Wan-Young Chung
Automated emotion identification via physiological data from wearable devices is a growing field, yet traditional electroencephalography (EEG) and photoplethysmography (PPG) collection methods can be uncomfortable. This research introduces a novel structure of the in-ear wearable device that captures both PPG and EEG signals to enhance user comfort for emotion recognition. Data were collected from 21 individuals experiencing four emotional states (fear, happy, calm, sad) induced by video stimuli. Following signal preprocessing, temporal and frequency domain features were extracted and selected using the ReliefF approach. Classification accuracy was assessed for PPG, EEG, and combined features, with combined features yielding superior results. An XGBoost classifier, optimized with Bayesian hyperparameter tuning, achieved 97.58% accuracy, 97.57% precision, 97.57% recall, and a 97.58% F1 score, outperforming support vector machine, decision tree, random forest, and K-Nearest Neighbor classifiers. These findings highlight the benefits of multimodal physiological sensing and optimized machine learning for reliable emotion characterization, with implications for mental health monitoring and human-computer interaction.
{"title":"Optimized XGBoost for Multimodal Affective State Classification Using In-Ear PPG and Behind-the-Ear EEG Signals.","authors":"Hika Barki, Ngoc-Dau Mai, Wan-Young Chung","doi":"10.1109/JBHI.2025.3598354","DOIUrl":"10.1109/JBHI.2025.3598354","url":null,"abstract":"<p><p>Automated emotion identification via physiological data from wearable devices is a growing field, yet traditional electroencephalography (EEG) and photoplethysmography (PPG) collection methods can be uncomfortable. This research introduces a novel structure of the in-ear wearable device that captures both PPG and EEG signals to enhance user comfort for emotion recognition. Data were collected from 21 individuals experiencing four emotional states (fear, happy, calm, sad) induced by video stimuli. Following signal preprocessing, temporal and frequency domain features were extracted and selected using the ReliefF approach. Classification accuracy was assessed for PPG, EEG, and combined features, with combined features yielding superior results. An XGBoost classifier, optimized with Bayesian hyperparameter tuning, achieved 97.58% accuracy, 97.57% precision, 97.57% recall, and a 97.58% F1 score, outperforming support vector machine, decision tree, random forest, and K-Nearest Neighbor classifiers. These findings highlight the benefits of multimodal physiological sensing and optimized machine learning for reliable emotion characterization, with implications for mental health monitoring and human-computer interaction.</p>","PeriodicalId":13073,"journal":{"name":"IEEE Journal of Biomedical and Health Informatics","volume":"PP ","pages":"2139-2152"},"PeriodicalIF":6.8,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144845820","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-03-01DOI: 10.1109/JBHI.2025.3601807
Zhijun Xiao, Maarten De Vos, Christos Chatzichristos, Yunyi Jiang, Minghui Zhao, Fei Ding, Chenxi Yang, Jianqing Li, Chengyu Liu
In recent years, the demand for smart healthcare solutions have heightened the need for accuracy, reliability, and comfort in bedside ECG recording and analysis. This study presents a bedside non-direct contact ECG recording system based on capacitive coupling electrocardiography (cECG) and verifies its performance in accurately capturing Heart Rate Variability (HRV) during the night. Firstly, cECG collects ECG data through clothing, avoiding skin irritation from conventional wet electrodes. Secondly, leveraging the unique characteristics of cECG signals, a deep learning framework assesses the quality of cECG, filtering noise and identifying off-bed information, enhancing HRV analysis precision. Subsequently, the system was employed to recording sleep data from 6 subjects overnight, with our proposed algorithm utilized for signal quality assessment (SQA) and HRV analysis. Finally, HRV features were compared with synchronously collected wet electrode ECG signals, encompassing time domain features, frequency domain features, and nonlinear features, totaling 13 HRV features. Experimental findings demonstrate that for the SQA task, the model achieved a classification accuracy of 94.7%, with a Recall of 0.941, Precision of 0.940, F1 score of 0.941, and Cohen's Kappa of 0.927. The accuracy of on/off-bed monitoring reached 99.79%. Additionally, HRV features showed a strong correlation with the reference ECG. In the time-domain metrics, the largest mean absolute percentage error (MAPE) is for PNN50, with a value of 8.148%. In the frequency-domain features, the largest MAPE is for HF, with a value of 13.253%. For nonlinear features, the largest MAPE is for SD1, with a value of 5.182%. Generally, the system exhibited a reliable solution for cECG recording, on/off-bed status detection, and bedside HRV analysis.
{"title":"Non-Direct Contact ECG Signal Classification Using a Hybrid Deep Learning Framework With Validation in Bedside Heart Rate Variability Analysis.","authors":"Zhijun Xiao, Maarten De Vos, Christos Chatzichristos, Yunyi Jiang, Minghui Zhao, Fei Ding, Chenxi Yang, Jianqing Li, Chengyu Liu","doi":"10.1109/JBHI.2025.3601807","DOIUrl":"10.1109/JBHI.2025.3601807","url":null,"abstract":"<p><p>In recent years, the demand for smart healthcare solutions have heightened the need for accuracy, reliability, and comfort in bedside ECG recording and analysis. This study presents a bedside non-direct contact ECG recording system based on capacitive coupling electrocardiography (cECG) and verifies its performance in accurately capturing Heart Rate Variability (HRV) during the night. Firstly, cECG collects ECG data through clothing, avoiding skin irritation from conventional wet electrodes. Secondly, leveraging the unique characteristics of cECG signals, a deep learning framework assesses the quality of cECG, filtering noise and identifying off-bed information, enhancing HRV analysis precision. Subsequently, the system was employed to recording sleep data from 6 subjects overnight, with our proposed algorithm utilized for signal quality assessment (SQA) and HRV analysis. Finally, HRV features were compared with synchronously collected wet electrode ECG signals, encompassing time domain features, frequency domain features, and nonlinear features, totaling 13 HRV features. Experimental findings demonstrate that for the SQA task, the model achieved a classification accuracy of 94.7%, with a Recall of 0.941, Precision of 0.940, F1 score of 0.941, and Cohen's Kappa of 0.927. The accuracy of on/off-bed monitoring reached 99.79%. Additionally, HRV features showed a strong correlation with the reference ECG. In the time-domain metrics, the largest mean absolute percentage error (MAPE) is for PNN50, with a value of 8.148%. In the frequency-domain features, the largest MAPE is for HF, with a value of 13.253%. For nonlinear features, the largest MAPE is for SD1, with a value of 5.182%. Generally, the system exhibited a reliable solution for cECG recording, on/off-bed status detection, and bedside HRV analysis.</p>","PeriodicalId":13073,"journal":{"name":"IEEE Journal of Biomedical and Health Informatics","volume":"PP ","pages":"1959-1971"},"PeriodicalIF":6.8,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144952225","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
For digital health platforms, the challenge is balancing patient privacy with the ability to match patients to the right providers quickly and accurately. Existing systems often suffer from privacy leakage, insufficient matching precision, and degraded performance when dealing with large-scale data. In this paper, we propose 4PM, a novel privacy-preserving patient-provider matching scheme that leverages secure computation to deliver strong privacy guarantees while ensuring efficient and accurate matching. Our method partitions patient data between two non-colluding servers via secret sharing, employing the optimized Millionaires' Protocol for secure ranking and leveraging oblivious retrieval techniques for privacy-preserving matching. 4PM significantly reduces the computational complexity of high-dimensional data, achieving end-to-end latency within 0.5 seconds in scenarios with 200 doctors and 200-dimensional symptom vectors. Our work contributes to fostering secure and trustworthy healthcare in the digital era.
{"title":"4PM: Privacy-Preserving Patient-Provider Matching Service in Digital Healthcare System.","authors":"Jing Lei, Haobo Zhang, Fake Lyu, Jinghui Qin, Qingqi Pei","doi":"10.1109/JBHI.2025.3644174","DOIUrl":"10.1109/JBHI.2025.3644174","url":null,"abstract":"<p><p>For digital health platforms, the challenge is balancing patient privacy with the ability to match patients to the right providers quickly and accurately. Existing systems often suffer from privacy leakage, insufficient matching precision, and degraded performance when dealing with large-scale data. In this paper, we propose 4PM, a novel privacy-preserving patient-provider matching scheme that leverages secure computation to deliver strong privacy guarantees while ensuring efficient and accurate matching. Our method partitions patient data between two non-colluding servers via secret sharing, employing the optimized Millionaires' Protocol for secure ranking and leveraging oblivious retrieval techniques for privacy-preserving matching. 4PM significantly reduces the computational complexity of high-dimensional data, achieving end-to-end latency within 0.5 seconds in scenarios with 200 doctors and 200-dimensional symptom vectors. Our work contributes to fostering secure and trustworthy healthcare in the digital era.</p>","PeriodicalId":13073,"journal":{"name":"IEEE Journal of Biomedical and Health Informatics","volume":"PP ","pages":"1947-1958"},"PeriodicalIF":6.8,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145762784","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}