首页 > 最新文献

IEEE Journal of Biomedical and Health Informatics最新文献

英文 中文
Score Prior Guided Iterative Solver for Speckles Removal in Optical Coherent Tomography Images. 光学相干断层扫描图像中斑点去除的分数先验引导迭代求解器
IF 6.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-10-21 DOI: 10.1109/JBHI.2024.3480928
Sanqian Li, Risa Higashita, Huazhu Fu, Bing Yang, Jiang Liu

Optical coherence tomography (OCT) is a widely used non-invasive imaging modality for ophthalmic diagnosis. However, the inherent speckle noise becomes the leading cause of OCT image quality, and efficient speckle removal algorithms can improve image readability and benefit automated clinical analysis. As an ill-posed inverse problem, it is of utmost importance for speckle removal to learn suitable priors. In this work, we develop a score prior guided iterative solver (SPIS) with logarithmic space to remove speckles in OCT images. Specifically, we model the posterior distribution of raw OCT images as a data consistency term and transform the speckle removal from a nonlinear into a linear inverse problem in the logarithmic domain. Subsequently, the learned prior distribution through the score function from the diffusion model is utilized as a constraint for the data consistency term into the linear inverse optimization, resulting in an iterative speckle removal procedure that alternates between the score prior predictor and the subsequent non-expansive data consistency corrector. Experimental results on the private and public OCT datasets demonstrate that the proposed SPIS has an excellent performance in speckle removal and out-of-distribution (OOD) generalization. Further downstream automatic analysis on the OCT images verifies that the proposed SPIS can benefit clinical applications. The data and code are available at https://github.com/ lisanqian1212/SPIS.

光学相干断层扫描(OCT)是一种广泛应用于眼科诊断的无创成像模式。然而,固有的斑点噪声成为影响 OCT 图像质量的主要原因,高效的斑点去除算法可以提高图像的可读性,有利于自动临床分析。作为一个难以解决的逆问题,学习合适的前验对于斑点去除至关重要。在这项工作中,我们开发了一种具有对数空间的分数先验引导迭代求解器(SPIS),用于去除 OCT 图像中的斑点。具体来说,我们将原始 OCT 图像的后验分布建模为数据一致性项,并将斑点去除从非线性问题转化为对数域的线性逆问题。随后,通过扩散模型中的分数函数学习到的先验分布被用作线性逆优化中数据一致性项的约束条件,从而形成一个在分数先验预测器和随后的非扩展数据一致性校正器之间交替进行的迭代斑点去除程序。在私有和公共 OCT 数据集上的实验结果表明,所提出的 SPIS 在斑点去除和分布外(OOD)泛化方面表现出色。对 OCT 图像的进一步下游自动分析验证了所提出的 SPIS 能为临床应用带来益处。数据和代码见 https://github.com/ lisanqian1212/SPIS。
{"title":"Score Prior Guided Iterative Solver for Speckles Removal in Optical Coherent Tomography Images.","authors":"Sanqian Li, Risa Higashita, Huazhu Fu, Bing Yang, Jiang Liu","doi":"10.1109/JBHI.2024.3480928","DOIUrl":"https://doi.org/10.1109/JBHI.2024.3480928","url":null,"abstract":"<p><p>Optical coherence tomography (OCT) is a widely used non-invasive imaging modality for ophthalmic diagnosis. However, the inherent speckle noise becomes the leading cause of OCT image quality, and efficient speckle removal algorithms can improve image readability and benefit automated clinical analysis. As an ill-posed inverse problem, it is of utmost importance for speckle removal to learn suitable priors. In this work, we develop a score prior guided iterative solver (SPIS) with logarithmic space to remove speckles in OCT images. Specifically, we model the posterior distribution of raw OCT images as a data consistency term and transform the speckle removal from a nonlinear into a linear inverse problem in the logarithmic domain. Subsequently, the learned prior distribution through the score function from the diffusion model is utilized as a constraint for the data consistency term into the linear inverse optimization, resulting in an iterative speckle removal procedure that alternates between the score prior predictor and the subsequent non-expansive data consistency corrector. Experimental results on the private and public OCT datasets demonstrate that the proposed SPIS has an excellent performance in speckle removal and out-of-distribution (OOD) generalization. Further downstream automatic analysis on the OCT images verifies that the proposed SPIS can benefit clinical applications. The data and code are available at https://github.com/ lisanqian1212/SPIS.</p>","PeriodicalId":13073,"journal":{"name":"IEEE Journal of Biomedical and Health Informatics","volume":"PP ","pages":""},"PeriodicalIF":6.7,"publicationDate":"2024-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142499419","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Benchmarking Large Language Models in Evidence-Based Medicine. 为循证医学中的大型语言模型设定基准。
IF 6.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-10-21 DOI: 10.1109/JBHI.2024.3483816
Jin Li, Yiyan Deng, Qi Sun, Junjie Zhu, Yu Tian, Jingsong Li, Tingting Zhu

Evidence-based medicine (EBM) represents a paradigm of providing patient care grounded in the most current and rigorously evaluated research. Recent advances in large language models (LLMs) offer a potential solution to transform EBM by automating labor-intensive tasks and thereby improving the efficiency of clinical decision-making. This study explores integrating LLMs into the key stages in EBM, evaluating their ability across evidence retrieval (PICO extraction, biomedical question answering), synthesis (summarizing randomized controlled trials), and dissemination (medical text simplification). We conducted a comparative analysis of seven LLMs, including both proprietary and open-source models, as well as those fine-tuned on medical corpora. Specifically, we benchmarked the performance of various LLMs on each EBM task under zero-shot settings as baselines, and employed prompting techniques, including in-context learning, chain-of-thought reasoning, and knowledge-guided prompting to enhance their capabilities. Our extensive experiments revealed the strengths of LLMs, such as remarkable understanding capabilities even in zero-shot settings, strong summarization skills, and effective knowledge transfer via prompting. Promoting strategies such as knowledge-guided prompting proved highly effective (e.g., improving the performance of GPT-4 by 13.10% over zero-shot in PICO extraction). However, the experiments also showed limitations, with LLM performance falling well below state-of-the-art baselines like PubMedBERT in handling named entity recognition tasks. Moreover, human evaluation revealed persisting challenges with factual inconsistencies and domain inaccuracies, underscoring the need for rigorous quality control before clinical application. This study provides insights into enhancing EBM using LLMs while highlighting critical areas for further research. The code is publicly available on Github.

循证医学(EBM)是一种以最新的、经过严格评估的研究成果为基础为患者提供医疗服务的模式。大型语言模型(LLMs)的最新进展提供了一种潜在的解决方案,通过将劳动密集型任务自动化来改变循证医学,从而提高临床决策的效率。本研究探讨了将 LLMs 整合到 EBM 关键阶段的问题,评估了它们在证据检索(PICO 提取、生物医学问题解答)、综合(随机对照试验总结)和传播(医学文本简化)方面的能力。我们对七种 LLM 进行了比较分析,其中包括专有模型和开源模型,以及在医学语料库中经过微调的模型。具体来说,我们以零拍设置为基准,对各种 LLM 在每个 EBM 任务上的性能进行了基准测试,并采用了提示技术,包括上下文学习、思维链推理和知识引导提示,以增强它们的能力。我们的大量实验揭示了 LLMs 的优势,例如即使在零镜头设置下也有出色的理解能力、很强的总结技能以及通过提示进行有效的知识转移。事实证明,知识引导提示等促进策略非常有效(例如,在 PICO 提取方面,GPT-4 的性能比零镜头提高了 13.10%)。不过,实验也显示出了局限性,在处理命名实体识别任务时,LLM 的性能远远低于 PubMedBERT 等最先进的基线。此外,人工评估显示,事实不一致和领域不准确的问题依然存在,这突出表明在临床应用之前需要进行严格的质量控制。这项研究为利用 LLMs 增强 EBM 提供了见解,同时也突出了有待进一步研究的关键领域。代码可在 Github 上公开获取。
{"title":"Benchmarking Large Language Models in Evidence-Based Medicine.","authors":"Jin Li, Yiyan Deng, Qi Sun, Junjie Zhu, Yu Tian, Jingsong Li, Tingting Zhu","doi":"10.1109/JBHI.2024.3483816","DOIUrl":"https://doi.org/10.1109/JBHI.2024.3483816","url":null,"abstract":"<p><p>Evidence-based medicine (EBM) represents a paradigm of providing patient care grounded in the most current and rigorously evaluated research. Recent advances in large language models (LLMs) offer a potential solution to transform EBM by automating labor-intensive tasks and thereby improving the efficiency of clinical decision-making. This study explores integrating LLMs into the key stages in EBM, evaluating their ability across evidence retrieval (PICO extraction, biomedical question answering), synthesis (summarizing randomized controlled trials), and dissemination (medical text simplification). We conducted a comparative analysis of seven LLMs, including both proprietary and open-source models, as well as those fine-tuned on medical corpora. Specifically, we benchmarked the performance of various LLMs on each EBM task under zero-shot settings as baselines, and employed prompting techniques, including in-context learning, chain-of-thought reasoning, and knowledge-guided prompting to enhance their capabilities. Our extensive experiments revealed the strengths of LLMs, such as remarkable understanding capabilities even in zero-shot settings, strong summarization skills, and effective knowledge transfer via prompting. Promoting strategies such as knowledge-guided prompting proved highly effective (e.g., improving the performance of GPT-4 by 13.10% over zero-shot in PICO extraction). However, the experiments also showed limitations, with LLM performance falling well below state-of-the-art baselines like PubMedBERT in handling named entity recognition tasks. Moreover, human evaluation revealed persisting challenges with factual inconsistencies and domain inaccuracies, underscoring the need for rigorous quality control before clinical application. This study provides insights into enhancing EBM using LLMs while highlighting critical areas for further research. The code is publicly available on Github.</p>","PeriodicalId":13073,"journal":{"name":"IEEE Journal of Biomedical and Health Informatics","volume":"PP ","pages":""},"PeriodicalIF":6.7,"publicationDate":"2024-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142499410","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MRGCDDI: Multi-Relation Graph Contrastive Learning without Data Augmentation for Drug-Drug Interaction Events Prediction. MRGCDDI:用于药物相互作用事件预测的无数据增强多关系图对比学习。
IF 6.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-10-21 DOI: 10.1109/JBHI.2024.3483812
Yu Li, Lin-Xuan Hou, Zhu-Hong You, Yang Yuan, Cheng-Gang Mi, Yu-An Huang, Hai-Cheng Yi

Predicting drug-drug interactions (DDIs) is a significant concern in the field of deep learning. It can effectively reduce potential adverse consequences and improve therapeutic safety. Graph neural network (GNN)-based models have made satisfactory progress in DDI event prediction. However, most existing models overlook crucial drug structure and interaction information, which is necessary for accurate DDI event prediction. To tackle this issue, we introduce a new method called MRGCDDI. This approach employs contrastive learning, but unlike conventional methods, it does not require data augmentation, thereby avoiding additional noise. MRGCDDI maintains the semantics of the graphical data during encoder perturbation through a simple yet effective contrastive learning approach, without the need for manual trial and error, tedious searching, or expensive domain knowledge to select enhancements. The approach presented in this study effectively integrates drug features extracted from drug molecular graphs and information from multi-relational drug-drug interaction (DDI) networks. Extensive experimental results demonstrate that MRGCDDI outperforms state-of-the-art methods on both datasets. Specifically, on Deng's dataset, MRGCDDI achieves an average increase of 4.33% in accuracy, 11.57% in Macro-F1, 10.97% in Macro-Recall, and 10.64% in Macro-Precision. Similarly, on Ryu's dataset, the model shows improvements with an average increase of 2.42% in accuracy, 3.86% in Macro-F1, 3.49% in Macro-Recall, and 2.75% in Macro-Precision. All the data and codes of this work are available at https://github.com/Nokeli/MRGCDDI.

预测药物间相互作用(DDIs)是深度学习领域的一个重要问题。它可以有效减少潜在的不良后果,提高治疗安全性。基于图神经网络(GNN)的模型在 DDI 事件预测方面取得了令人满意的进展。然而,大多数现有模型都忽略了关键的药物结构和相互作用信息,而这正是准确预测 DDI 事件所必需的。为了解决这个问题,我们引入了一种名为 MRGCDDI 的新方法。这种方法采用对比学习,但与传统方法不同的是,它不需要数据扩增,从而避免了额外的噪音。MRGCDDI 通过一种简单而有效的对比学习方法,在编码器扰动期间保持图形数据的语义,而无需人工试错、繁琐的搜索或昂贵的领域知识来选择增强。本研究提出的方法有效地整合了从药物分子图中提取的药物特征和从多关系药物相互作用(DDI)网络中提取的信息。广泛的实验结果表明,MRGCDDI 在这两个数据集上的表现都优于最先进的方法。具体来说,在 Deng 的数据集上,MRGCDDI 的准确率平均提高了 4.33%,宏 F1 提高了 11.57%,宏调用提高了 10.97%,宏精度提高了 10.64%。同样,在 Ryu 的数据集上,该模型的准确率平均提高了 2.42%,Macro-F1 提高了 3.86%,Macro-Recall 提高了 3.49%,Macro-Precision 提高了 2.75%。这项工作的所有数据和代码可在 https://github.com/Nokeli/MRGCDDI 网站上查阅。
{"title":"MRGCDDI: Multi-Relation Graph Contrastive Learning without Data Augmentation for Drug-Drug Interaction Events Prediction.","authors":"Yu Li, Lin-Xuan Hou, Zhu-Hong You, Yang Yuan, Cheng-Gang Mi, Yu-An Huang, Hai-Cheng Yi","doi":"10.1109/JBHI.2024.3483812","DOIUrl":"https://doi.org/10.1109/JBHI.2024.3483812","url":null,"abstract":"<p><p>Predicting drug-drug interactions (DDIs) is a significant concern in the field of deep learning. It can effectively reduce potential adverse consequences and improve therapeutic safety. Graph neural network (GNN)-based models have made satisfactory progress in DDI event prediction. However, most existing models overlook crucial drug structure and interaction information, which is necessary for accurate DDI event prediction. To tackle this issue, we introduce a new method called MRGCDDI. This approach employs contrastive learning, but unlike conventional methods, it does not require data augmentation, thereby avoiding additional noise. MRGCDDI maintains the semantics of the graphical data during encoder perturbation through a simple yet effective contrastive learning approach, without the need for manual trial and error, tedious searching, or expensive domain knowledge to select enhancements. The approach presented in this study effectively integrates drug features extracted from drug molecular graphs and information from multi-relational drug-drug interaction (DDI) networks. Extensive experimental results demonstrate that MRGCDDI outperforms state-of-the-art methods on both datasets. Specifically, on Deng's dataset, MRGCDDI achieves an average increase of 4.33% in accuracy, 11.57% in Macro-F1, 10.97% in Macro-Recall, and 10.64% in Macro-Precision. Similarly, on Ryu's dataset, the model shows improvements with an average increase of 2.42% in accuracy, 3.86% in Macro-F1, 3.49% in Macro-Recall, and 2.75% in Macro-Precision. All the data and codes of this work are available at https://github.com/Nokeli/MRGCDDI.</p>","PeriodicalId":13073,"journal":{"name":"IEEE Journal of Biomedical and Health Informatics","volume":"PP ","pages":""},"PeriodicalIF":6.7,"publicationDate":"2024-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142499415","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Physical Activity Integration in Blood Glucose Level Prediction: Different Levels of Data Fusion. 血糖水平预测中的体育活动整合:不同层次的数据融合。
IF 6.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-10-21 DOI: 10.1109/JBHI.2024.3483999
Hoda Nemat, Heydar Khadem, Jackie Elliott, Mohammed Benaissa

Blood glucose level (BGL) prediction contributes to more effective management of type 1 diabetes. Physical activity (PA) is a crucial factor in diabetes management. It affects BGL, and it is imperative to effectively deploy PA in BGL prediction to support diabetes management systems by incorporating this crucial factor. Due to the erratic nature of PA's impact on BGL inter- and intra-patients and insufficient knowledge, deploying PA in BGL prediction is challenging. Hence, optimal approaches for PA fusion with BGL are demanded to improve the performance of BGL prediction. To address this gap, we propose novel methodologies for extracting and integrating information from PA data into BGL prediction. This paper proposes several novel PA-informed prediction models by developing different approaches for extracting information from PA data and fusing this information with BGL data in signal, feature, and decision levels to find the optimal approach for deploying PA in BGL prediction models. For signal-level fusion, different automatically-recorded PA data are fused with BGL data. Also, three feature engineering approaches are developed for feature-level fusion: subjective assessments of PA, objective assessments of PA, and statistics of PA. Furthermore, in decision-level fusion, ensemble learning is used to combine predictions from models trained with different inputs. Then, a comparative investigation is performed between the developed PA-informed approaches and the no-fusion approach, as well as between themselves. The analyses are performed on the publicly available Ohio dataset with rigorous evaluation. The results show that deploying PA can statistically significantly improve BGL prediction performance. The results show that deploying PA can statistically significantly improve BGL prediction performance. Also, among the developed approaches to leveraging PA in BGL prediction, fusing heart rate data at the signal-level and PA intensity categories at the feature-level with BGL data are the most effective ways. Our developed methodologies contribute to determining optimal approaches, including the kind of PA information and fusion method, to improve the performance of BGL prediction effectively.

预测血糖水平(BGL)有助于更有效地管理 1 型糖尿病。体力活动(PA)是糖尿病管理的一个关键因素。它影响血糖水平,因此必须在血糖水平预测中有效利用体力活动,通过纳入这一关键因素来支持糖尿病管理系统。由于患者之间和患者内部的 PA 对血糖胆固醇的影响不稳定,加之知识不足,在血糖胆固醇预测中应用 PA 具有挑战性。因此,需要将 PA 与 BGL 融合的最佳方法来提高 BGL 预测的性能。针对这一差距,我们提出了从 PA 数据中提取信息并将其整合到 BGL 预测中的新方法。本文通过开发从 PA 数据中提取信息并将这些信息与 BGL 数据在信号、特征和决策层面进行融合的不同方法,提出了几种新型 PA 信息预测模型,以找到在 BGL 预测模型中部署 PA 的最佳方法。在信号级融合方面,将不同的自动记录 PA 数据与 BGL 数据进行融合。此外,还为特征级融合开发了三种特征工程方法:PA 的主观评估、PA 的客观评估和 PA 的统计。此外,在决策级融合中,采用了集合学习的方法,将不同输入训练的模型的预测结果结合起来。然后,对已开发的 PA 信息方法和无融合方法以及它们之间进行了比较研究。分析在公开的俄亥俄州数据集上进行,并进行了严格的评估。结果表明,采用 PA 可以在统计上显著提高 BGL 预测性能。结果表明,从统计学角度看,部署 PA 可以显著提高 BGL 预测性能。此外,在已开发的利用 PA 进行 BGL 预测的方法中,信号级心率数据和特征级 PA 强度类别与 BGL 数据的融合是最有效的方法。我们开发的方法有助于确定最佳方法,包括 PA 信息的种类和融合方法,从而有效提高 BGL 预测的性能。
{"title":"Physical Activity Integration in Blood Glucose Level Prediction: Different Levels of Data Fusion.","authors":"Hoda Nemat, Heydar Khadem, Jackie Elliott, Mohammed Benaissa","doi":"10.1109/JBHI.2024.3483999","DOIUrl":"https://doi.org/10.1109/JBHI.2024.3483999","url":null,"abstract":"<p><p>Blood glucose level (BGL) prediction contributes to more effective management of type 1 diabetes. Physical activity (PA) is a crucial factor in diabetes management. It affects BGL, and it is imperative to effectively deploy PA in BGL prediction to support diabetes management systems by incorporating this crucial factor. Due to the erratic nature of PA's impact on BGL inter- and intra-patients and insufficient knowledge, deploying PA in BGL prediction is challenging. Hence, optimal approaches for PA fusion with BGL are demanded to improve the performance of BGL prediction. To address this gap, we propose novel methodologies for extracting and integrating information from PA data into BGL prediction. This paper proposes several novel PA-informed prediction models by developing different approaches for extracting information from PA data and fusing this information with BGL data in signal, feature, and decision levels to find the optimal approach for deploying PA in BGL prediction models. For signal-level fusion, different automatically-recorded PA data are fused with BGL data. Also, three feature engineering approaches are developed for feature-level fusion: subjective assessments of PA, objective assessments of PA, and statistics of PA. Furthermore, in decision-level fusion, ensemble learning is used to combine predictions from models trained with different inputs. Then, a comparative investigation is performed between the developed PA-informed approaches and the no-fusion approach, as well as between themselves. The analyses are performed on the publicly available Ohio dataset with rigorous evaluation. The results show that deploying PA can statistically significantly improve BGL prediction performance. The results show that deploying PA can statistically significantly improve BGL prediction performance. Also, among the developed approaches to leveraging PA in BGL prediction, fusing heart rate data at the signal-level and PA intensity categories at the feature-level with BGL data are the most effective ways. Our developed methodologies contribute to determining optimal approaches, including the kind of PA information and fusion method, to improve the performance of BGL prediction effectively.</p>","PeriodicalId":13073,"journal":{"name":"IEEE Journal of Biomedical and Health Informatics","volume":"PP ","pages":""},"PeriodicalIF":6.7,"publicationDate":"2024-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142499417","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Interpretable Dynamic Directed Graph Convolutional Network for Multi-Relational Prediction of Missense Mutation and Drug Response. 用于错义突变和药物反应多关系预测的可解释动态定向图卷积网络
IF 6.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-10-18 DOI: 10.1109/JBHI.2024.3483316
Qian Gao, Tao Xu, Xiaodi Li, Wanling Gao, Haoyuan Shi, Youhua Zhang, Jie Chen, Zhenyu Yue

Tumor heterogeneity presents a significant challenge in predicting drug responses, especially as missense mutations within the same gene can lead to varied outcomes such as drug resistance, enhanced sensitivity, or therapeutic ineffectiveness. These complex relationships highlight the need for advanced analytical approaches in oncology. Due to their powerful ability to handle heterogeneous data, graph convolutional networks (GCNs) represent a promising approach for predicting drug responses. However, simple bipartite graphs cannot accurately capture the complex relationships involved in missense mutation and drug response. Furthermore, Deep learning models for drug response are often considered "black boxes", and their interpretability remains a widely discussed issue. To address these challenges, we propose an Interpretable Dynamic Directed Graph Convolutional Network (IDDGCN) framework, which incorporates four key features: (1) the use of directed graphs to differentiate between sensitivity and resistance relationships, (2) the dynamic updating of node weights based on node-specific interactions, (3) the exploration of associations between different mutations within the same gene and drug response, and (4) the enhancement of interpretability models through the integration of a weighted mechanism that accounts for the biological significance, alongside a ground truth construction method to evaluate prediction transparency. The experimental results demonstrate that IDDGCN outperforms existing state-of-the-art models, exhibiting excellent predictive power. Both qualitative and quantitative evaluations of its interpretability further highlight its ability to explain predictions, offering a fresh perspective for precision oncology and targeted drug development.

肿瘤的异质性给预测药物反应带来了巨大挑战,尤其是同一基因的错义突变会导致不同的结果,如耐药性、敏感性增强或治疗无效。这些复杂的关系凸显了肿瘤学对先进分析方法的需求。图卷积网络(GCN)具有处理异构数据的强大能力,是预测药物反应的一种有前途的方法。然而,简单的双向图无法准确捕捉错义突变与药物反应之间的复杂关系。此外,针对药物反应的深度学习模型通常被认为是 "黑盒子",其可解释性仍然是一个被广泛讨论的问题。为了应对这些挑战,我们提出了可解释动态有向图卷积网络(IDDGCN)框架,该框架包含四个关键特征:(1) 使用有向图区分敏感性和耐药性关系;(2) 根据节点特异性相互作用动态更新节点权重;(3) 探索同一基因内不同突变与药物反应之间的关联;(4) 通过整合考虑生物学意义的加权机制来增强可解释性模型,同时采用地面实况构建方法来评估预测的透明度。实验结果表明,IDDGCN 优于现有的先进模型,表现出卓越的预测能力。对其可解释性的定性和定量评估进一步突出了其解释预测的能力,为精准肿瘤学和靶向药物开发提供了一个全新的视角。
{"title":"Interpretable Dynamic Directed Graph Convolutional Network for Multi-Relational Prediction of Missense Mutation and Drug Response.","authors":"Qian Gao, Tao Xu, Xiaodi Li, Wanling Gao, Haoyuan Shi, Youhua Zhang, Jie Chen, Zhenyu Yue","doi":"10.1109/JBHI.2024.3483316","DOIUrl":"10.1109/JBHI.2024.3483316","url":null,"abstract":"<p><p>Tumor heterogeneity presents a significant challenge in predicting drug responses, especially as missense mutations within the same gene can lead to varied outcomes such as drug resistance, enhanced sensitivity, or therapeutic ineffectiveness. These complex relationships highlight the need for advanced analytical approaches in oncology. Due to their powerful ability to handle heterogeneous data, graph convolutional networks (GCNs) represent a promising approach for predicting drug responses. However, simple bipartite graphs cannot accurately capture the complex relationships involved in missense mutation and drug response. Furthermore, Deep learning models for drug response are often considered \"black boxes\", and their interpretability remains a widely discussed issue. To address these challenges, we propose an Interpretable Dynamic Directed Graph Convolutional Network (IDDGCN) framework, which incorporates four key features: (1) the use of directed graphs to differentiate between sensitivity and resistance relationships, (2) the dynamic updating of node weights based on node-specific interactions, (3) the exploration of associations between different mutations within the same gene and drug response, and (4) the enhancement of interpretability models through the integration of a weighted mechanism that accounts for the biological significance, alongside a ground truth construction method to evaluate prediction transparency. The experimental results demonstrate that IDDGCN outperforms existing state-of-the-art models, exhibiting excellent predictive power. Both qualitative and quantitative evaluations of its interpretability further highlight its ability to explain predictions, offering a fresh perspective for precision oncology and targeted drug development.</p>","PeriodicalId":13073,"journal":{"name":"IEEE Journal of Biomedical and Health Informatics","volume":"PP ","pages":""},"PeriodicalIF":6.7,"publicationDate":"2024-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142464108","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
rU-Net, Multi-Scale Feature Fusion and Transfer Learning: Unlocking the Potential of Cuffless Blood Pressure Monitoring with PPG and ECG. rU-Net、多尺度特征融合和迁移学习:释放无袖带血压监测与 PPG 和 ECG 的潜力。
IF 6.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-10-18 DOI: 10.1109/JBHI.2024.3483301
Jiaming Chen, Xueling Zhou, Lei Feng, Bingo Wing-Kuen Ling, Lianyi Han, Hongtao Zhang

This study introduces an innovative deep-learning model for cuffless blood pressure estimation using PPG and ECG signals, demonstrating state-of-the-art performance on the largest clean dataset, PulseDB. The rU-Net architecture, a fusion of U-Net and ResNet, enhances both generalization and feature extraction accuracy. Accurate multi-scale feature capture is facilitated by short-time Fourier transform (STFT) time-frequency distributions and multi-head attention mechanisms, allowing data-driven feature selection. The inclusion of demographic parameters as supervisory information further elevates performance. On the calibration-based dataset, our model excels, achieving outstanding accuracy (SBP MAE ± std: 4.49 ± 4.86 mmHg, DBP MAE ± std: 2.69 ± 3.10 mmHg), surpassing AAMI standards and earning a BHS Grade A rating. Addressing the challenge of calibration-free data, we propose a fine-tuning-based transfer learning approach. Remarkably, with only 10% data transfer, our model attains exceptional accuracy (SBP MAE ± std: 4.14 ± 5.01 mmHg, DBP MAE ± std: 2.48 ± 2.93 mmHg). This study sets the stage for the development of highly accurate and reliable wearable cuffless blood pressure monitoring devices.

本研究介绍了一种利用 PPG 和 ECG 信号进行无袖带血压估算的创新型深度学习模型,在最大的清洁数据集 PulseDB 上展示了最先进的性能。融合了 U-Net 和 ResNet 的 rU-Net 架构提高了泛化和特征提取的准确性。短时傅立叶变换 (STFT) 时频分布和多头关注机制有助于准确捕捉多尺度特征,从而实现数据驱动的特征选择。将人口统计参数作为监督信息,可进一步提高性能。在基于校准的数据集上,我们的模型表现出色,实现了出色的准确性(SBP MAE ± std:4.49 ± 4.86 mmHg,DBP MAE ± std:2.69 ± 3.10 mmHg),超过了 AAMI 标准,并获得了 BHS A 级评级。为了应对无校准数据的挑战,我们提出了一种基于微调的迁移学习方法。值得注意的是,只需传输 10% 的数据,我们的模型就能达到极高的准确度(SBP MAE ± std:4.14 ± 5.01 mmHg,DBP MAE ± std:2.48 ± 2.93 mmHg)。这项研究为开发高度准确可靠的可穿戴式无袖带血压监测设备奠定了基础。
{"title":"rU-Net, Multi-Scale Feature Fusion and Transfer Learning: Unlocking the Potential of Cuffless Blood Pressure Monitoring with PPG and ECG.","authors":"Jiaming Chen, Xueling Zhou, Lei Feng, Bingo Wing-Kuen Ling, Lianyi Han, Hongtao Zhang","doi":"10.1109/JBHI.2024.3483301","DOIUrl":"10.1109/JBHI.2024.3483301","url":null,"abstract":"<p><p>This study introduces an innovative deep-learning model for cuffless blood pressure estimation using PPG and ECG signals, demonstrating state-of-the-art performance on the largest clean dataset, PulseDB. The rU-Net architecture, a fusion of U-Net and ResNet, enhances both generalization and feature extraction accuracy. Accurate multi-scale feature capture is facilitated by short-time Fourier transform (STFT) time-frequency distributions and multi-head attention mechanisms, allowing data-driven feature selection. The inclusion of demographic parameters as supervisory information further elevates performance. On the calibration-based dataset, our model excels, achieving outstanding accuracy (SBP MAE ± std: 4.49 ± 4.86 mmHg, DBP MAE ± std: 2.69 ± 3.10 mmHg), surpassing AAMI standards and earning a BHS Grade A rating. Addressing the challenge of calibration-free data, we propose a fine-tuning-based transfer learning approach. Remarkably, with only 10% data transfer, our model attains exceptional accuracy (SBP MAE ± std: 4.14 ± 5.01 mmHg, DBP MAE ± std: 2.48 ± 2.93 mmHg). This study sets the stage for the development of highly accurate and reliable wearable cuffless blood pressure monitoring devices.</p>","PeriodicalId":13073,"journal":{"name":"IEEE Journal of Biomedical and Health Informatics","volume":"PP ","pages":""},"PeriodicalIF":6.7,"publicationDate":"2024-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142464113","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Camera-Based Respiratory Imaging System for Monitoring Infant Thoracoabdominal Patterns of Respiration. 基于摄像头的呼吸成像系统,用于监测婴儿胸腹呼吸模式。
IF 6.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-10-17 DOI: 10.1109/JBHI.2024.3482569
Dongmin Huang, Yongshen Zeng, Yingen Zhu, Xiaoyan Song, Liping Pan, Jie Yang, Yanrong Wang, Hongzhou Lu, Wenjin Wang

Existing respiratory monitoring techniques primarily focus on respiratory rate measurement, neglecting the potential of using thoracoabdominal patterns of respiration for infant lung health assessment. To bridge this gap, we exploit the unique advantage of spatial redundancy of a camera sensor to analyze the infant thoracoabdominal respiratory motion. Specifically, we propose a camera-based respiratory imaging (CRI) system that utilizes optical flow to construct a spatio-temporal respiratory imager for comparing the infant chest and abdominal respiratory motion, and employs deep learning algorithms to identify infant abdominal, thoracoabdominal synchronous, and thoracoabdominal asynchronous patterns of respiration. To alleviate the challenges posed by limited clinical training data and subject variability, we introduce a novel multiple-expert contrastive learning (MECL) strategy to CRI. It enriches training samples by reversing and pairing different-class data, and promotes the representation consistency of same-class data through multi-expert collaborative optimization. Clinical validation involving 44 infants shows that MECL achieves 70% in sensitivity and 80.21% in specificity, which validates the feasibility of CRI for respiratory pattern recognition. This work investigates a novel video-based approach for assessing the infant thoracoabdominal patterns of respiration, revealing a new value stream of video health monitoring in neonatal care.

现有的呼吸监测技术主要侧重于呼吸频率的测量,忽视了利用胸腹式呼吸模式进行婴儿肺部健康评估的潜力。为了弥补这一不足,我们利用摄像头传感器空间冗余的独特优势来分析婴儿胸腹呼吸运动。具体来说,我们提出了一种基于相机的呼吸成像(CRI)系统,该系统利用光流构建时空呼吸成像仪,用于比较婴儿胸腹呼吸运动,并采用深度学习算法识别婴儿腹部、胸腹同步和胸腹异步呼吸模式。为了缓解有限的临床训练数据和受试者差异性带来的挑战,我们在 CRI 中引入了一种新颖的多专家对比学习(MECL)策略。它通过反转和配对不同类数据来丰富训练样本,并通过多专家协作优化来提高同类数据的表征一致性。44 名婴儿的临床验证表明,MECL 的灵敏度和特异度分别达到了 70% 和 80.21%,验证了呼吸模式识别 CRI 的可行性。这项工作研究了一种基于视频评估婴儿胸腹呼吸模式的新方法,揭示了新生儿护理中视频健康监测的新价值流。
{"title":"Camera-Based Respiratory Imaging System for Monitoring Infant Thoracoabdominal Patterns of Respiration.","authors":"Dongmin Huang, Yongshen Zeng, Yingen Zhu, Xiaoyan Song, Liping Pan, Jie Yang, Yanrong Wang, Hongzhou Lu, Wenjin Wang","doi":"10.1109/JBHI.2024.3482569","DOIUrl":"https://doi.org/10.1109/JBHI.2024.3482569","url":null,"abstract":"<p><p>Existing respiratory monitoring techniques primarily focus on respiratory rate measurement, neglecting the potential of using thoracoabdominal patterns of respiration for infant lung health assessment. To bridge this gap, we exploit the unique advantage of spatial redundancy of a camera sensor to analyze the infant thoracoabdominal respiratory motion. Specifically, we propose a camera-based respiratory imaging (CRI) system that utilizes optical flow to construct a spatio-temporal respiratory imager for comparing the infant chest and abdominal respiratory motion, and employs deep learning algorithms to identify infant abdominal, thoracoabdominal synchronous, and thoracoabdominal asynchronous patterns of respiration. To alleviate the challenges posed by limited clinical training data and subject variability, we introduce a novel multiple-expert contrastive learning (MECL) strategy to CRI. It enriches training samples by reversing and pairing different-class data, and promotes the representation consistency of same-class data through multi-expert collaborative optimization. Clinical validation involving 44 infants shows that MECL achieves 70% in sensitivity and 80.21% in specificity, which validates the feasibility of CRI for respiratory pattern recognition. This work investigates a novel video-based approach for assessing the infant thoracoabdominal patterns of respiration, revealing a new value stream of video health monitoring in neonatal care.</p>","PeriodicalId":13073,"journal":{"name":"IEEE Journal of Biomedical and Health Informatics","volume":"PP ","pages":""},"PeriodicalIF":6.7,"publicationDate":"2024-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142464061","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Cross Attention Approach to Diagnostic Explainability Using Clinical Practice Guidelines for Depression. 使用《抑郁症临床实践指南》对诊断可解释性进行交叉关注。
IF 6.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-10-17 DOI: 10.1109/JBHI.2024.3483577
Sumit Dalal, Deepa Tilwani, Manas Gaur, Sarika Jain, Valerie L Shalin, Amit P Sheth

The lack of explainability in using relevant clinical knowledge hinders the adoption of artificial intelligence-powered analysis of unstructured clinical dialogue. A wealth of relevant, untapped Mental Health (MH) data is available in online communities, providing the opportunity to address the explainability problem with substantial potential impact as a screening tool for both online and offline applications. Inspired by how clinicians rely on their expertise when interacting with patients, we leverage relevant clinical knowledge to classify and explain depression-related data, reducing manual review time and engendering trust. We developed a method to enhance attention in contemporary transformer models and generate explanations for classifications that are understandable by mental health practitioners (MHPs) by incorporating external clinical knowledge. We propose a domain-general architecture called ProcesS knowledgeinfused cross ATtention (PSAT) that incorporates clinical practice guidelines (CPG) when computing attention. We transform a CPG resource focused on depression, such as the Patient Health Questionnaire (e.g. PHQ-9) and related questions, into a machine-readable ontology using SNOMED-CT. With this resource, PSAT enhances the ability of models like GPT-3.5 to generate application-relevant explanations. Evaluation of four expert-curated datasets related to depression demonstrates PSAT's applicationrelevant explanations. PSAT surpasses the performance of twelve baseline models and can provide explanations where other baselines fall short.

在使用相关临床知识时缺乏可解释性,这阻碍了对非结构化临床对话进行人工智能分析。在线社区中存在大量相关的、尚未开发的心理健康(MH)数据,这为解决可解释性问题提供了机会,可作为在线和离线应用的筛选工具产生巨大的潜在影响。受临床医生在与患者互动时如何依赖专业知识的启发,我们利用相关临床知识对抑郁症相关数据进行分类和解释,从而减少人工审核时间并赢得信任。我们开发了一种方法来提高当代转换器模型的注意力,并通过结合外部临床知识生成心理健康从业人员(MHPs)可以理解的分类解释。我们提出了一种名为 "ProcesS knowledgeinfused cross ATtention (PSAT) "的领域通用架构,该架构在计算注意力时结合了临床实践指南(CPG)。我们利用 SNOMED-CT 将以抑郁症为重点的 CPG 资源(如患者健康问卷(PHQ-9)和相关问题)转化为机器可读的本体。有了这一资源,PSAT 就能增强 GPT-3.5 等模型生成应用相关解释的能力。对四个由专家编辑的抑郁症相关数据集的评估证明了 PSAT 的应用相关解释能力。PSAT 的性能超过了 12 个基线模型,可以提供其他基线模型无法提供的解释。
{"title":"A Cross Attention Approach to Diagnostic Explainability Using Clinical Practice Guidelines for Depression.","authors":"Sumit Dalal, Deepa Tilwani, Manas Gaur, Sarika Jain, Valerie L Shalin, Amit P Sheth","doi":"10.1109/JBHI.2024.3483577","DOIUrl":"https://doi.org/10.1109/JBHI.2024.3483577","url":null,"abstract":"<p><p>The lack of explainability in using relevant clinical knowledge hinders the adoption of artificial intelligence-powered analysis of unstructured clinical dialogue. A wealth of relevant, untapped Mental Health (MH) data is available in online communities, providing the opportunity to address the explainability problem with substantial potential impact as a screening tool for both online and offline applications. Inspired by how clinicians rely on their expertise when interacting with patients, we leverage relevant clinical knowledge to classify and explain depression-related data, reducing manual review time and engendering trust. We developed a method to enhance attention in contemporary transformer models and generate explanations for classifications that are understandable by mental health practitioners (MHPs) by incorporating external clinical knowledge. We propose a domain-general architecture called ProcesS knowledgeinfused cross ATtention (PSAT) that incorporates clinical practice guidelines (CPG) when computing attention. We transform a CPG resource focused on depression, such as the Patient Health Questionnaire (e.g. PHQ-9) and related questions, into a machine-readable ontology using SNOMED-CT. With this resource, PSAT enhances the ability of models like GPT-3.5 to generate application-relevant explanations. Evaluation of four expert-curated datasets related to depression demonstrates PSAT's applicationrelevant explanations. PSAT surpasses the performance of twelve baseline models and can provide explanations where other baselines fall short.</p>","PeriodicalId":13073,"journal":{"name":"IEEE Journal of Biomedical and Health Informatics","volume":"PP ","pages":""},"PeriodicalIF":6.7,"publicationDate":"2024-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142464056","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CATransformer: A Cycle-Aware Transformer for High-Fidelity ECG Generation From PPG. CATransformer:从 PPG 生成高保真心电图的周期感知变压器。
IF 6.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-10-17 DOI: 10.1109/JBHI.2024.3482853
Xiaoyan Yuan, Wei Wang, Xiaohe Li, Yuanting Zhang, Xiping Hu, M Jamal Deen

Electrocardiography (ECG) is the gold standard for monitoring heart function and is crucial for preventing the worsening of cardiovascular diseases (CVDs). However, the inconvenience of ECG acquisition poses challenges for long-term continuous monitoring. Consequently, researchers have explored non-invasive and easily accessible photoplethysmography (PPG) as an alternative, converting it into ECG. Previous studies have focused on peaks or simple mapping to generate ECG, ignoring the inherent periodicity of cardiovascular signals. This results in an inability to accurately extract physiological information during the cycle, thus compromising the generated ECG signals' clinical utility. To this end, we introduce a novel PPG-to-ECG translation model called CATransformer, capable of adaptive modeling based on the cardiac cycle. Specifically, CATransformer automatically extracts the cycle using a cycle-aware module and creates multiple semantic views of the cardiac cycle. It leverages a transformer to capture detailed features within each cycle and the dynamics across cycles. Our method outperforms existing approaches, exhibiting the lowest RMSE across five paired PPG-ECG databases. Additionally, extensive experiments are conducted on four cardiovascular-related tasks to assess the clinical utility of the generated ECG, achieving consistent state-of-the-art performance. Experimental results confirm that CATransformer generates highly faithful ECG signals while preserving their physiological characteristics.

心电图(ECG)是监测心脏功能的黄金标准,对预防心血管疾病(CVD)恶化至关重要。然而,心电图采集的不便给长期连续监测带来了挑战。因此,研究人员探索了一种非侵入性且易于获取的光电血压计(PPG)作为替代方法,将其转换为心电图。以往的研究侧重于峰值或简单映射来生成心电图,忽略了心血管信号固有的周期性。这导致无法准确提取周期内的生理信息,从而影响了生成的心电信号的临床实用性。为此,我们引入了一种名为 CATransformer 的新型 PPG 到 ECG 转换模型,它能够根据心动周期自适应建模。具体来说,CATransformer 使用周期感知模块自动提取周期,并创建多个心动周期语义视图。它利用转换器捕捉每个周期内的详细特征和跨周期的动态变化。我们的方法优于现有方法,在五个配对的 PPG-ECG 数据库中显示出最低的 RMSE。此外,我们还在四项心血管相关任务中进行了广泛的实验,以评估生成的心电图的临床实用性,并取得了一致的先进性能。实验结果证实,CATransformer 可生成高度忠实的心电图信号,同时保留其生理特征。
{"title":"CATransformer: A Cycle-Aware Transformer for High-Fidelity ECG Generation From PPG.","authors":"Xiaoyan Yuan, Wei Wang, Xiaohe Li, Yuanting Zhang, Xiping Hu, M Jamal Deen","doi":"10.1109/JBHI.2024.3482853","DOIUrl":"https://doi.org/10.1109/JBHI.2024.3482853","url":null,"abstract":"<p><p>Electrocardiography (ECG) is the gold standard for monitoring heart function and is crucial for preventing the worsening of cardiovascular diseases (CVDs). However, the inconvenience of ECG acquisition poses challenges for long-term continuous monitoring. Consequently, researchers have explored non-invasive and easily accessible photoplethysmography (PPG) as an alternative, converting it into ECG. Previous studies have focused on peaks or simple mapping to generate ECG, ignoring the inherent periodicity of cardiovascular signals. This results in an inability to accurately extract physiological information during the cycle, thus compromising the generated ECG signals' clinical utility. To this end, we introduce a novel PPG-to-ECG translation model called CATransformer, capable of adaptive modeling based on the cardiac cycle. Specifically, CATransformer automatically extracts the cycle using a cycle-aware module and creates multiple semantic views of the cardiac cycle. It leverages a transformer to capture detailed features within each cycle and the dynamics across cycles. Our method outperforms existing approaches, exhibiting the lowest RMSE across five paired PPG-ECG databases. Additionally, extensive experiments are conducted on four cardiovascular-related tasks to assess the clinical utility of the generated ECG, achieving consistent state-of-the-art performance. Experimental results confirm that CATransformer generates highly faithful ECG signals while preserving their physiological characteristics.</p>","PeriodicalId":13073,"journal":{"name":"IEEE Journal of Biomedical and Health Informatics","volume":"PP ","pages":""},"PeriodicalIF":6.7,"publicationDate":"2024-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142464062","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Facial Expression Recognition for Healthcare Monitoring Systems Using Neural Random Forest. 使用神经随机森林为医疗监控系统识别面部表情。
IF 6.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-10-16 DOI: 10.1109/JBHI.2024.3482450
Muhammad Hameed Siddiqi, Irshad Ahmad, Yousef Alhwaiti, Faheem Khan

Facial expressions vary with different health conditions, making a facial expression recognition (FER) system valuable within a healthcare framework. Achieving accurate recognition of facial expressions is a considerable challenge due to the difficulty in capturing subtle features. This research introduced an ensemble neural random forest method that utilizes convolutional neural network (CNN) architecture for feature extraction and optimized random forest for classification. For feature extraction, four convolutional layers with different numbers of filters and kernel sizes are used. Further, the maxpooling, batch normalization, and dropout layers are used in the model to expedite the process of feature extraction and avoid the overfitting of the model. The extracted features are provided to the optimized random forest for classification, which is based on the number of trees, criterion, maximum tree depth, maximum terminal nodes, minimum sample split, and maximum features per tree, and applied to the classification process. To demonstrate the significance of the proposed model, we conducted a thorough assessment of the proposed neural random forest through an extensive experiment encompassing six publicly available datasets. The remarkable weighted average recognition rate of 97.3% achieved across these diverse datasets highlights the effectiveness of our approach in the context of FER systems.

面部表情会随着不同的健康状况而变化,因此面部表情识别(FER)系统在医疗保健框架内非常有价值。由于难以捕捉细微特征,因此实现面部表情的准确识别是一项相当大的挑战。这项研究引入了一种集合神经随机森林方法,利用卷积神经网络(CNN)架构进行特征提取,并利用优化的随机森林进行分类。在特征提取方面,使用了四个具有不同数量过滤器和内核大小的卷积层。此外,模型中还使用了 maxpooling、batch normalization 和 dropout 层,以加快特征提取过程,避免模型的过度拟合。提取的特征将提供给优化的随机森林进行分类,该分类基于树的数量、准则、最大树深、最大终端节点、最小样本分割和每棵树的最大特征,并应用于分类过程。为了证明所提模型的重要意义,我们通过一项包含六个公开数据集的广泛实验,对所提神经随机森林进行了全面评估。这些不同数据集的加权平均识别率高达 97.3%,这充分证明了我们的方法在 FER 系统中的有效性。
{"title":"Facial Expression Recognition for Healthcare Monitoring Systems Using Neural Random Forest.","authors":"Muhammad Hameed Siddiqi, Irshad Ahmad, Yousef Alhwaiti, Faheem Khan","doi":"10.1109/JBHI.2024.3482450","DOIUrl":"https://doi.org/10.1109/JBHI.2024.3482450","url":null,"abstract":"<p><p>Facial expressions vary with different health conditions, making a facial expression recognition (FER) system valuable within a healthcare framework. Achieving accurate recognition of facial expressions is a considerable challenge due to the difficulty in capturing subtle features. This research introduced an ensemble neural random forest method that utilizes convolutional neural network (CNN) architecture for feature extraction and optimized random forest for classification. For feature extraction, four convolutional layers with different numbers of filters and kernel sizes are used. Further, the maxpooling, batch normalization, and dropout layers are used in the model to expedite the process of feature extraction and avoid the overfitting of the model. The extracted features are provided to the optimized random forest for classification, which is based on the number of trees, criterion, maximum tree depth, maximum terminal nodes, minimum sample split, and maximum features per tree, and applied to the classification process. To demonstrate the significance of the proposed model, we conducted a thorough assessment of the proposed neural random forest through an extensive experiment encompassing six publicly available datasets. The remarkable weighted average recognition rate of 97.3% achieved across these diverse datasets highlights the effectiveness of our approach in the context of FER systems.</p>","PeriodicalId":13073,"journal":{"name":"IEEE Journal of Biomedical and Health Informatics","volume":"PP ","pages":""},"PeriodicalIF":6.7,"publicationDate":"2024-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142464063","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Journal of Biomedical and Health Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1