首页 > 最新文献

IEEE Journal of Biomedical and Health Informatics最新文献

英文 中文
Alzheimer's Disease Detection in EEG Sleep Signals. 从脑电图睡眠信号中检测阿尔茨海默病
IF 6.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-10-11 DOI: 10.1109/JBHI.2024.3478380
Lorena Gallego-Vinaras, Juan Miguel Mira-Tomas, Anna Michela Gaeta, Gerard Pinol-Ripoll, Ferran Barbe, Pablo M Olmos, Arrate Munoz-Barrutia

Alzheimer's disease (AD) and sleep disorders exhibit a close association, where disruptions in sleep patterns often precede the onset of Mild Cognitive Impairment (MCI) and early-stage AD. This study delves into the potential of utilizing sleep-related electroencephalography (EEG) signals acquired through polysomnography (PSG) for the early detection of AD. Our primary focus is on exploring semi-supervised Deep Learning techniques for the classification of EEG signals due to the clinical scenario characterized by the limited data availability. The methodology entails testing and comparing the performance of semi-supervised models, benchmarked against an unsupervised and a supervised model. The study highlights the significance of spatial and temporal analysis capabilities, conducting independent analyses of each sleep stage. Results demonstrate the effectiveness of one semi-supervised model in leveraging limited labeled data, achieving stable metrics across all sleep stages, and reaching 90% accuracy in its supervised form. Comparative analyses reveal this superior performance over the unsupervised model, while the supervised model ranges between 92-94% . These findings underscore the potential of semi-supervised models in early AD detection, particularly in overcoming the challenges associated with the scarcity of labeled data. Ablation tests affirm the critical role of spatio-temporal feature extraction in semi-supervised predictive performance, and t-SNE visualizations validate the model's proficiency in distinguishing AD patterns. Overall, this research contributes to the advancement of AD detection through innovative Deep Learning approaches, highlighting the crucial role of semi-supervised learning in addressing data limitations.

阿尔茨海默病(AD)与睡眠障碍有着密切的联系,睡眠模式紊乱往往先于轻度认知功能障碍(MCI)和早期阿尔茨海默病发病。本研究探讨了利用通过多导睡眠图(PSG)获得的与睡眠相关的脑电图(EEG)信号来早期检测老年痴呆症的潜力。由于临床场景的特点是数据可用性有限,我们的主要重点是探索用于脑电信号分类的半监督深度学习技术。该方法需要测试和比较半监督模型的性能,并以无监督模型和有监督模型为基准。研究强调了空间和时间分析能力的重要性,并对每个睡眠阶段进行了独立分析。结果表明,一种半监督模型能有效利用有限的标记数据,在所有睡眠阶段都能获得稳定的指标,其监督形式的准确率达到 90%。对比分析表明,该模型的准确率高于无监督模型,而有监督模型的准确率在 92-94% 之间。这些发现凸显了半监督模型在早期注意力缺失症检测中的潜力,尤其是在克服标记数据稀缺带来的挑战方面。消融测试肯定了时空特征提取在半监督预测性能中的关键作用,t-SNE 可视化验证了模型在区分注意力缺失症模式方面的能力。总之,这项研究通过创新的深度学习方法推动了注意力缺失检测的发展,突出了半监督学习在解决数据限制方面的关键作用。
{"title":"Alzheimer's Disease Detection in EEG Sleep Signals.","authors":"Lorena Gallego-Vinaras, Juan Miguel Mira-Tomas, Anna Michela Gaeta, Gerard Pinol-Ripoll, Ferran Barbe, Pablo M Olmos, Arrate Munoz-Barrutia","doi":"10.1109/JBHI.2024.3478380","DOIUrl":"https://doi.org/10.1109/JBHI.2024.3478380","url":null,"abstract":"<p><p>Alzheimer's disease (AD) and sleep disorders exhibit a close association, where disruptions in sleep patterns often precede the onset of Mild Cognitive Impairment (MCI) and early-stage AD. This study delves into the potential of utilizing sleep-related electroencephalography (EEG) signals acquired through polysomnography (PSG) for the early detection of AD. Our primary focus is on exploring semi-supervised Deep Learning techniques for the classification of EEG signals due to the clinical scenario characterized by the limited data availability. The methodology entails testing and comparing the performance of semi-supervised models, benchmarked against an unsupervised and a supervised model. The study highlights the significance of spatial and temporal analysis capabilities, conducting independent analyses of each sleep stage. Results demonstrate the effectiveness of one semi-supervised model in leveraging limited labeled data, achieving stable metrics across all sleep stages, and reaching 90% accuracy in its supervised form. Comparative analyses reveal this superior performance over the unsupervised model, while the supervised model ranges between 92-94% . These findings underscore the potential of semi-supervised models in early AD detection, particularly in overcoming the challenges associated with the scarcity of labeled data. Ablation tests affirm the critical role of spatio-temporal feature extraction in semi-supervised predictive performance, and t-SNE visualizations validate the model's proficiency in distinguishing AD patterns. Overall, this research contributes to the advancement of AD detection through innovative Deep Learning approaches, highlighting the crucial role of semi-supervised learning in addressing data limitations.</p>","PeriodicalId":13073,"journal":{"name":"IEEE Journal of Biomedical and Health Informatics","volume":"PP ","pages":""},"PeriodicalIF":6.7,"publicationDate":"2024-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142406368","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
LKAN: LLM-Based Knowledge-Aware Attention Network for Clinical Staging of Liver Cancer. LKAN:基于 LLM 的肝癌临床分期知识感知注意力网络。
IF 6.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-10-11 DOI: 10.1109/JBHI.2024.3478809
Ya Li, Xuecong Zheng, Jiaping Li, Qingyun Dai, Chang-Dong Wang, Min Chen

Clinical staging of liver cancer (CSoLC), an important indicator for evaluating the degree of deterioration of primary liver cancer cells (PLCCs), is key in the diagnosis, treatment, and rehabilitation of liver cancer. In China, the current CSoLC adopts the China liver cancer (CNLC) staging, which is usually evaluated by clinicians based on the patient's radiology reports. Therefore, inferring clinical information from unstructured radiology reports can provide auxiliary decision support for clinicians. The key to solving the challenging task is to guide the model to pay attention to the staging-related words or sentences, and the following issues may occur: 1) Imbalanced categories: The symptoms of liver cancer in the early- or mid-stage are not obvious, resulting in more data in the end-stage. 2) Domain sensitivity of liver cancer data: The liver cancer dataset contains a large amount of domain knowledge, and the conventional methods can exacerbate out-of-vocabulary, which greatly affects the accuracy of classification. 3) Free-text and lengthy report: The radiology report of liver cancer sparsely describes various lesions with domain-specific terms, which poses difficulties in mining key information related to staging. To tackle these challenges, this article proposes a large language model (LLM)-based Knowledge-aware Attention Network (LKAN) for CSoLC. First, for maintaining semantic consistency, LLM and a rule-based algorithm are integrated to generate more diverse and reasonable data. Second, unlabeled radiology corpus of liver cancer are pre-trained to introduce domain knowledge for subsequent representation learning. Third, attention is improved by incorporating both global and local features, which can provide professional guidance for the classifier to focus on the important information. Compared with the baseline models, the classification accuracy of LKAN has achieved the best results with 90.3% Accuracy, 90.0% Macro_F1 score, and 90.0% Macro_Recall. The code is available at https://github.com/xczhh/Supplemental-Material.

肝癌临床分期(CSoLC)是评价原发性肝癌细胞(PLCC)恶化程度的重要指标,是肝癌诊断、治疗和康复的关键。在中国,目前的 CSoLC 采用的是中国肝癌(CNLC)分期,通常由临床医生根据患者的放射学报告进行评估。因此,从非结构化的放射学报告中推断临床信息可为临床医生提供辅助决策支持。解决这一挑战性任务的关键在于引导模型关注分期相关的单词或句子,可能会出现以下问题:1)分类失衡:肝癌早期或中期症状不明显,导致末期数据较多。2) 肝癌数据的领域敏感性:肝癌数据集包含大量领域知识,传统方法会加剧词汇缺失,大大影响分类的准确性。3) 自由文本和冗长报告:肝癌的放射报告用特定领域的术语对各种病变进行了稀疏描述,这给挖掘与分期相关的关键信息带来了困难。针对这些难题,本文提出了一种基于大语言模型(LLM)的知识感知注意力网络(LKAN),用于 CSoLC。首先,为了保持语义的一致性,LLM 与基于规则的算法相结合,以生成更多样、更合理的数据。其次,对未标记的肝癌放射学语料进行预训练,为后续的表征学习引入领域知识。第三,通过结合全局和局部特征来提高注意力,为分类器关注重要信息提供专业指导。与基线模型相比,LKAN 的分类准确率达到了最佳效果,准确率为 90.3%,Macro_F1 分数为 90.0%,Macro_Recall 分数为 90.0%。代码见 https://github.com/xczhh/Supplemental-Material。
{"title":"LKAN: LLM-Based Knowledge-Aware Attention Network for Clinical Staging of Liver Cancer.","authors":"Ya Li, Xuecong Zheng, Jiaping Li, Qingyun Dai, Chang-Dong Wang, Min Chen","doi":"10.1109/JBHI.2024.3478809","DOIUrl":"https://doi.org/10.1109/JBHI.2024.3478809","url":null,"abstract":"<p><p>Clinical staging of liver cancer (CSoLC), an important indicator for evaluating the degree of deterioration of primary liver cancer cells (PLCCs), is key in the diagnosis, treatment, and rehabilitation of liver cancer. In China, the current CSoLC adopts the China liver cancer (CNLC) staging, which is usually evaluated by clinicians based on the patient's radiology reports. Therefore, inferring clinical information from unstructured radiology reports can provide auxiliary decision support for clinicians. The key to solving the challenging task is to guide the model to pay attention to the staging-related words or sentences, and the following issues may occur: 1) Imbalanced categories: The symptoms of liver cancer in the early- or mid-stage are not obvious, resulting in more data in the end-stage. 2) Domain sensitivity of liver cancer data: The liver cancer dataset contains a large amount of domain knowledge, and the conventional methods can exacerbate out-of-vocabulary, which greatly affects the accuracy of classification. 3) Free-text and lengthy report: The radiology report of liver cancer sparsely describes various lesions with domain-specific terms, which poses difficulties in mining key information related to staging. To tackle these challenges, this article proposes a large language model (LLM)-based Knowledge-aware Attention Network (LKAN) for CSoLC. First, for maintaining semantic consistency, LLM and a rule-based algorithm are integrated to generate more diverse and reasonable data. Second, unlabeled radiology corpus of liver cancer are pre-trained to introduce domain knowledge for subsequent representation learning. Third, attention is improved by incorporating both global and local features, which can provide professional guidance for the classifier to focus on the important information. Compared with the baseline models, the classification accuracy of LKAN has achieved the best results with 90.3% Accuracy, 90.0% Macro_F1 score, and 90.0% Macro_Recall. The code is available at https://github.com/xczhh/Supplemental-Material.</p>","PeriodicalId":13073,"journal":{"name":"IEEE Journal of Biomedical and Health Informatics","volume":"PP ","pages":""},"PeriodicalIF":6.7,"publicationDate":"2024-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142406369","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
TrustEMG-Net: Using Representation-Masking Transformer with U-Net for Surface Electromyography Enhancement. TrustEMG-Net:使用表征-屏蔽转换器和 U-Net 增强表面肌电图。
IF 6.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-10-10 DOI: 10.1109/JBHI.2024.3475817
Kuan-Chen Wang, Kai-Chun Liu, Ping-Cheng Yeh, Sheng-Yu Peng, Yu Tsao

Surface electromyography (sEMG) is a widely employed bio-signal that captures human muscle activity via electrodes placed on the skin. Several studies have proposed methods to remove sEMG contaminants, as non-invasive measurements render sEMG susceptible to various contaminants. However, these approaches often rely on heuristic-based optimization and are sensitive to the contaminant type. A more potent, robust, and generalized sEMG denoising approach should be developed for various healthcare and human-computer interaction applications. This paper proposes a novel neural network (NN)-based sEMG denoising method called TrustEMG-Net. It leverages the potent nonlinear mapping capability and data-driven nature of NNs. TrustEMG-Net adopts a denoising autoencoder structure by combining U-Net with a Transformer encoder using a representation-masking approach. The proposed approach is evaluated using the Ninapro sEMG database with five common contamination types and signal-to-noise ratio (SNR) conditions. Compared with existing sEMG denoising methods, TrustEMG-Net achieves exceptional performance across the five evaluation metrics, exhibiting a minimum improvement of 20%. Its superiority is consistent under various conditions, including SNRs ranging from -14 to 2 dB and five contaminant types. An ablation study further proves that the design of TrustEMG-Net contributes to its optimality, providing high-quality sEMG and serving as an effective, robust, and generalized denoising solution for sEMG applications.

表面肌电图(sEMG)是一种广泛使用的生物信号,它通过放置在皮肤上的电极捕捉人体的肌肉活动。一些研究提出了去除 sEMG 杂质的方法,因为非侵入式测量使 sEMG 容易受到各种杂质的影响。不过,这些方法通常依赖于启发式优化,对污染物类型很敏感。应针对各种医疗保健和人机交互应用开发一种更有效、稳健和通用的 sEMG 去噪方法。本文提出了一种基于神经网络(NN)的新型 sEMG 去噪方法,称为 TrustEMG-Net。它充分利用了神经网络强大的非线性映射能力和数据驱动特性。TrustEMG-Net 采用去噪自动编码器结构,通过表示掩码方法将 U-Net 与 Transformer 编码器相结合。我们使用 Ninapro sEMG 数据库对所提出的方法进行了评估,该数据库包含五种常见的污染类型和信噪比(SNR)条件。与现有的 sEMG 去噪方法相比,TrustEMG-Net 在五个评估指标上都取得了优异的性能,至少提高了 20%。在 SNR 为 -14 到 2 dB 以及五种污染物类型等各种条件下,其优越性始终如一。一项消融研究进一步证明,TrustEMG-Net 的设计有助于实现其最优性,从而提供高质量的 sEMG,并为 sEMG 应用提供有效、稳健和通用的去噪解决方案。
{"title":"TrustEMG-Net: Using Representation-Masking Transformer with U-Net for Surface Electromyography Enhancement.","authors":"Kuan-Chen Wang, Kai-Chun Liu, Ping-Cheng Yeh, Sheng-Yu Peng, Yu Tsao","doi":"10.1109/JBHI.2024.3475817","DOIUrl":"https://doi.org/10.1109/JBHI.2024.3475817","url":null,"abstract":"<p><p>Surface electromyography (sEMG) is a widely employed bio-signal that captures human muscle activity via electrodes placed on the skin. Several studies have proposed methods to remove sEMG contaminants, as non-invasive measurements render sEMG susceptible to various contaminants. However, these approaches often rely on heuristic-based optimization and are sensitive to the contaminant type. A more potent, robust, and generalized sEMG denoising approach should be developed for various healthcare and human-computer interaction applications. This paper proposes a novel neural network (NN)-based sEMG denoising method called TrustEMG-Net. It leverages the potent nonlinear mapping capability and data-driven nature of NNs. TrustEMG-Net adopts a denoising autoencoder structure by combining U-Net with a Transformer encoder using a representation-masking approach. The proposed approach is evaluated using the Ninapro sEMG database with five common contamination types and signal-to-noise ratio (SNR) conditions. Compared with existing sEMG denoising methods, TrustEMG-Net achieves exceptional performance across the five evaluation metrics, exhibiting a minimum improvement of 20%. Its superiority is consistent under various conditions, including SNRs ranging from -14 to 2 dB and five contaminant types. An ablation study further proves that the design of TrustEMG-Net contributes to its optimality, providing high-quality sEMG and serving as an effective, robust, and generalized denoising solution for sEMG applications.</p>","PeriodicalId":13073,"journal":{"name":"IEEE Journal of Biomedical and Health Informatics","volume":"PP ","pages":""},"PeriodicalIF":6.7,"publicationDate":"2024-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142400195","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Nonparametric Dynamic Granger Causality based on Multi-Space Spectrum Fusion for Time-varying Directed Brain Network Construction. 基于多空间频谱融合的非参数动态格兰杰因果关系用于时变定向脑网络构建
IF 6.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-10-10 DOI: 10.1109/JBHI.2024.3477944
Chanlin Yi, Jiamin Zhang, Zihan Weng, Wanjun Chen, Dezhong Yao, Fali Li, Zehong Cao, Peiyang Li, Peng Xu

Nonparametric estimation of time-varying directed networks can unveil the intricate transient organization of directed brain communication while circumventing constraints imposed by prescribed model-driven methods. A robust time-frequency representation - the foundation of its causality inference - is critical for enhancing its reliability. This study proposed a novel method, i.e., nonparametric dynamic Granger causality based on Multi-space Spectrum Fusion (ndGCMSF), which integrates complementary spectrum information from different spaces to generate reliable spectral representations to estimate dynamic causalities across brain regions. Systematic simulations and validations demonstrate that ndGCMSF exhibits superior noise resistance and a powerful ability to capture subtle dynamic changes in directed brain networks. Particularly, ndGCMSF revealed that during instruction response movements, the laterality in the hemisphere ipsilateral to the hemiplegic limb emerges upon instruction onset and diminishes upon task accomplishment. These intrinsic variations further provide reliable features for distinguishing two types of hemiplegia (left vs. right) and assessing motor functions. The ndGCMSF offers powerful functional patterns to derive effective brain networks in dynamically changing operational settings and contributes to extensive areas involving dynamical and directed communications.

对时变有向网络的非参数估计可以揭示有向大脑通信的复杂瞬态组织,同时规避规定的模型驱动方法所带来的限制。稳健的时频表示是其因果推断的基础,对于提高其可靠性至关重要。本研究提出了一种新方法,即基于多空间频谱融合的非参数动态格兰杰因果关系(ndGCMSF),它整合了来自不同空间的互补频谱信息,生成可靠的频谱表示,以估计跨脑区的动态因果关系。系统模拟和验证表明,ndGCMSF 具有卓越的抗噪能力和捕捉定向脑网络中微妙动态变化的强大能力。特别是,ndGCMSF 发现,在指令响应运动过程中,偏瘫肢体同侧半球的侧向性在指令开始时出现,并在任务完成时减弱。这些内在变化进一步为区分两种偏瘫类型(左侧偏瘫和右侧偏瘫)和评估运动功能提供了可靠的特征。ndGCMSF提供了强大的功能模式,可在动态变化的操作环境中推导出有效的大脑网络,并有助于涉及动态和定向通信的广泛领域。
{"title":"Nonparametric Dynamic Granger Causality based on Multi-Space Spectrum Fusion for Time-varying Directed Brain Network Construction.","authors":"Chanlin Yi, Jiamin Zhang, Zihan Weng, Wanjun Chen, Dezhong Yao, Fali Li, Zehong Cao, Peiyang Li, Peng Xu","doi":"10.1109/JBHI.2024.3477944","DOIUrl":"https://doi.org/10.1109/JBHI.2024.3477944","url":null,"abstract":"<p><p>Nonparametric estimation of time-varying directed networks can unveil the intricate transient organization of directed brain communication while circumventing constraints imposed by prescribed model-driven methods. A robust time-frequency representation - the foundation of its causality inference - is critical for enhancing its reliability. This study proposed a novel method, i.e., nonparametric dynamic Granger causality based on Multi-space Spectrum Fusion (ndGCMSF), which integrates complementary spectrum information from different spaces to generate reliable spectral representations to estimate dynamic causalities across brain regions. Systematic simulations and validations demonstrate that ndGCMSF exhibits superior noise resistance and a powerful ability to capture subtle dynamic changes in directed brain networks. Particularly, ndGCMSF revealed that during instruction response movements, the laterality in the hemisphere ipsilateral to the hemiplegic limb emerges upon instruction onset and diminishes upon task accomplishment. These intrinsic variations further provide reliable features for distinguishing two types of hemiplegia (left vs. right) and assessing motor functions. The ndGCMSF offers powerful functional patterns to derive effective brain networks in dynamically changing operational settings and contributes to extensive areas involving dynamical and directed communications.</p>","PeriodicalId":13073,"journal":{"name":"IEEE Journal of Biomedical and Health Informatics","volume":"PP ","pages":""},"PeriodicalIF":6.7,"publicationDate":"2024-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142400183","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SCKansformer: Fine-Grained Classification of Bone Marrow Cells via Kansformer Backbone and Hierarchical Attention Mechanisms SCKansformer:通过 Kansformer 骨干和分层注意机制对骨髓细胞进行精细分类。
IF 6.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-10-10 DOI: 10.1109/JBHI.2024.3471928
Yifei Chen;Zhu Zhu;Shenghao Zhu;Linwei Qiu;Binfeng Zou;Fan Jia;Yunpeng Zhu;Chenyan Zhang;Zhaojie Fang;Feiwei Qin;Jin Fan;Changmiao Wang;Gang Yu;Yu Gao
The incidence and mortality rates of malignant tumors, such as acute leukemia, have risen significantly. Clinically, hospitals rely on cytological examination of peripheral blood and bone marrow smears to diagnose malignant tumors, with accurate blood cell counting being crucial. Existing automated methods face challenges such as low feature expression capability, poor interpretability, and redundant feature extraction when processing high-dimensional microimage data. We propose a novel fine-grained classification model, SCKansformer, for bone marrow blood cells, which addresses these challenges and enhances classification accuracy and efficiency. The model integrates the Kansformer Encoder, SCConv Encoder, and Global-Local Attention Encoder. The Kansformer Encoder replaces the traditional MLP layer with the KAN, improving nonlinear feature representation and interpretability. The SCConv Encoder, with its Spatial and Channel Reconstruction Units, enhances feature representation and reduces redundancy. The Global-Local Attention Encoder combines Multi-head Self-Attention with a Local Part module to capture both global and local features. We validated our model using the Bone Marrow Blood Cell Fine-Grained Classification Dataset (BMCD-FGCD), comprising over 10,000 samples and nearly 40 classifications, developed with a partner hospital. Comparative experiments on our private dataset, as well as the publicly available PBC and ALL-IDB datasets, demonstrate that SCKansformer outperforms both typical and advanced microcell classification methods across all datasets.
急性白血病等恶性肿瘤的发病率和死亡率大幅上升。在临床上,医院依靠外周血和骨髓涂片的细胞学检查来诊断恶性肿瘤,其中准确的血细胞计数至关重要。现有的自动化方法在处理高维显微图像数据时面临着特征表达能力低、可解释性差和冗余特征提取等挑战。我们针对骨髓血细胞提出了一种新的细粒度分类模型 SCKansformer,它能解决这些难题,并提高分类的准确性和效率。该模型集成了 Kansformer 编码器、SCConv 编码器和全局-局部注意力编码器。Kansformer 编码器用 KAN 取代了传统的 MLP 层,改进了非线性特征表示和可解释性。SCConv 编码器具有空间和通道重构单元,可增强特征表示并减少冗余。全局-局部注意力编码器将多头自我注意力与局部模块相结合,以捕捉全局和局部特征。我们使用与合作医院共同开发的骨髓血细胞精细分类数据集(BMCD-FGCD)验证了我们的模型,该数据集包含 10,000 多个样本和近 40 个分类。在我们的私有数据集以及公开的 PBC 和 ALL-IDB 数据集上进行的对比实验表明,SCKansformer 在所有数据集上的表现都优于典型的和先进的微细胞分类方法。
{"title":"SCKansformer: Fine-Grained Classification of Bone Marrow Cells via Kansformer Backbone and Hierarchical Attention Mechanisms","authors":"Yifei Chen;Zhu Zhu;Shenghao Zhu;Linwei Qiu;Binfeng Zou;Fan Jia;Yunpeng Zhu;Chenyan Zhang;Zhaojie Fang;Feiwei Qin;Jin Fan;Changmiao Wang;Gang Yu;Yu Gao","doi":"10.1109/JBHI.2024.3471928","DOIUrl":"10.1109/JBHI.2024.3471928","url":null,"abstract":"The incidence and mortality rates of malignant tumors, such as acute leukemia, have risen significantly. Clinically, hospitals rely on cytological examination of peripheral blood and bone marrow smears to diagnose malignant tumors, with accurate blood cell counting being crucial. Existing automated methods face challenges such as low feature expression capability, poor interpretability, and redundant feature extraction when processing high-dimensional microimage data. We propose a novel fine-grained classification model, SCKansformer, for bone marrow blood cells, which addresses these challenges and enhances classification accuracy and efficiency. The model integrates the Kansformer Encoder, SCConv Encoder, and Global-Local Attention Encoder. The Kansformer Encoder replaces the traditional MLP layer with the KAN, improving nonlinear feature representation and interpretability. The SCConv Encoder, with its Spatial and Channel Reconstruction Units, enhances feature representation and reduces redundancy. The Global-Local Attention Encoder combines Multi-head Self-Attention with a Local Part module to capture both global and local features. We validated our model using the Bone Marrow Blood Cell Fine-Grained Classification Dataset (BMCD-FGCD), comprising over 10,000 samples and nearly 40 classifications, developed with a partner hospital. Comparative experiments on our private dataset, as well as the publicly available PBC and ALL-IDB datasets, demonstrate that SCKansformer outperforms both typical and advanced microcell classification methods across all datasets.","PeriodicalId":13073,"journal":{"name":"IEEE Journal of Biomedical and Health Informatics","volume":"29 1","pages":"558-571"},"PeriodicalIF":6.7,"publicationDate":"2024-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142400194","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MiRS-HF: A Novel Deep Learning Predictor for Cancer Classification and miRNA Expression Patterns MiRS-HF:用于癌症分类和 miRNA 表达模式的新型深度学习预测器。
IF 6.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-10-09 DOI: 10.1109/JBHI.2024.3476672
Jie Ni;Donghui Yan;Shan Lu;Zhuoying Xie;Yun Liu;Xin Zhang
Cancer classification and biomarker identification are crucial for guiding personalized treatment. To make effective use of miRNA associations and expression data, we have developed a deep learning model for cancer classification and biomarker identification. We propose an approach for cancer classification called MiRNA Selection and Hybrid Fusion (MiRS-HF), which consists of early fusion and intermediate fusion. The early fusion involves applying a Layer Attention Graph Convolutional Network (LAGCN) to a miRNA-disease heterogeneous network, resulting in a miRNA-disease association degree score matrix. The intermediate fusion employs a Graph Convolutional Network (GCN) in the classification tasks, weighting the expression data based on the miRNA-disease association degree score. Furthermore, MiRS-HF can identify the important miRNA biomarkers and their expression patterns. The proposed method demonstrates superior performance in the classification tasks of six cancers compared to other methods. Simultaneously, we incorporated the feature weighting strategy into the comparison algorithm, leading to a significant improvement in the algorithm's results, highlighting the extreme importance of this strategy.
癌症分类和生物标记物鉴定对于指导个性化治疗至关重要。为了有效利用 miRNA 关联和表达数据,我们开发了一种用于癌症分类和生物标记物鉴定的深度学习模型。为了有效利用 miRNA 关联和表达数据,我们开发了一种用于癌症分类和生物标记物鉴定的深度学习模型。我们提出了一种名为 MiRNA 选择和混合融合(MiRS-HF)的癌症分类方法,它包括早期融合和中期融合。早期融合是将层注意图卷积网络(LAGCN)应用于 miRNA-疾病异构网络,从而得到 miRNA-疾病关联度得分矩阵。中期融合在分类任务中采用图卷积网络(GCN),根据 miRNA-疾病关联度得分对表达数据进行加权。此外,MiRS-HF 还能识别重要的 miRNA 生物标记物及其表达模式。与其他方法相比,所提出的方法在六种癌症的分类任务中表现出更优越的性能。同时,我们在比较算法中加入了特征加权策略,从而显著改善了算法结果,凸显了这一策略的极端重要性。
{"title":"MiRS-HF: A Novel Deep Learning Predictor for Cancer Classification and miRNA Expression Patterns","authors":"Jie Ni;Donghui Yan;Shan Lu;Zhuoying Xie;Yun Liu;Xin Zhang","doi":"10.1109/JBHI.2024.3476672","DOIUrl":"10.1109/JBHI.2024.3476672","url":null,"abstract":"Cancer classification and biomarker identification are crucial for guiding personalized treatment. To make effective use of miRNA associations and expression data, we have developed a deep learning model for cancer classification and biomarker identification. We propose an approach for cancer classification called MiRNA Selection and Hybrid Fusion (MiRS-HF), which consists of early fusion and intermediate fusion. The early fusion involves applying a Layer Attention Graph Convolutional Network (LAGCN) to a miRNA-disease heterogeneous network, resulting in a miRNA-disease association degree score matrix. The intermediate fusion employs a Graph Convolutional Network (GCN) in the classification tasks, weighting the expression data based on the miRNA-disease association degree score. Furthermore, MiRS-HF can identify the important miRNA biomarkers and their expression patterns. The proposed method demonstrates superior performance in the classification tasks of six cancers compared to other methods. Simultaneously, we incorporated the feature weighting strategy into the comparison algorithm, leading to a significant improvement in the algorithm's results, highlighting the extreme importance of this strategy.","PeriodicalId":13073,"journal":{"name":"IEEE Journal of Biomedical and Health Informatics","volume":"29 1","pages":"679-689"},"PeriodicalIF":6.7,"publicationDate":"2024-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142390172","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Inhibitory Components in Muscle Synergies Factorized by The Rectified Latent Variable Model from Electromyographic Data. 通过整流潜变量模型从肌电图数据推断肌肉协同作用中的抑制成分
IF 6.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-10-09 DOI: 10.1109/JBHI.2024.3453603
Xiaoyu Guo, Subing Huang, Borong He, Chuanlin Lan, Jodie J Xie, Kelvin Y S Lau, Tomohiko Takei, Arthur D P Mak, Roy T H Cheung, Kazuhiko Seki, Vincent C K Cheung, Rosa H M Chan

Non-negative matrix factorization (NMF), widely used in motor neuroscience for identifying muscle synergies from electromyographical signals (EMGs), extracts non-negative synergies and is yet unable to identify potential negative components (NegCps) in synergies underpinned by inhibitory spinal interneurons. To overcome this constraint, we propose to utilize rectified latent variable model (RLVM) to extract muscle synergies. RLVM uses an autoencoder neural network, and the weight matrix of its neural network could be negative, while latent variables must remain non-negative. If inputs to the model are EMGs, the weight matrix and latent variables represent muscle synergies and their temporal activation coefficients, respectively. We compared performances of NMF and RLVM in identifying muscle synergies in simulated and experimental datasets. Our simulated results showed that RLVM performed better in identifying muscle-synergy subspace and NMF had a good correlation with ground truth. Finally, we applied RLVM to a previously published experimental dataset comprising EMGs from upper-limb muscles and spike recordings of spinal premotor interneurons (PreM-INs) collected from two macaque monkeys during grasping tasks. RLVM and NMF synergies were highly similar, but a few small negative muscle components were observed in RLVM synergies. The muscles with NegCps identified by RLVM exhibited near-zero values in their corresponding synergies identified by NMF. Importantly, NegCps of RLVM synergies showed correspondence with the muscle connectivity of PreM-INs with inhibitory muscle fields, as identified by spike-triggered averaging of EMGs. Our results demonstrate the feasibility of RLVM in extracting potential inhibitory muscle-synergy components from EMGs.

非负矩阵因式分解(NMF)在运动神经科学中被广泛用于从肌电信号(EMG)中识别肌肉协同作用,但它提取的是非负协同作用,无法识别由抑制性脊髓中间神经元支撑的协同作用中的潜在负成分(NegCps)。为了克服这一限制,我们建议利用整流潜变量模型(RLVM)来提取肌肉协同作用。RLVM 使用自编码器神经网络,其神经网络的权重矩阵可以为负,而潜变量必须保持非负。如果模型的输入是肌电图,则权重矩阵和潜变量分别代表肌肉协同作用及其时间激活系数。我们比较了 NMF 和 RLVM 在模拟和实验数据集中识别肌肉协同作用的性能。模拟结果表明,RLVM 在识别肌肉协同子空间方面表现更好,而 NMF 与地面实况具有良好的相关性。最后,我们将 RLVM 应用于之前发表的实验数据集,该数据集包括两只猕猴在抓握任务中采集的上肢肌肉肌电图和脊髓前运动中间神经元(PreM-INs)的尖峰记录。RLVM 和 NMF 协同作用高度相似,但在 RLVM 协同作用中观察到了一些小的负肌肉成分。RLVM 识别出的具有负肌肉成分的肌肉在 NMF 识别出的相应协同作用中表现出接近零的值。重要的是,RLVM 协同作用的 NegCps 与 EMG 的尖峰触发平均化所识别出的具有抑制性肌场的 PreM-IN 的肌肉连通性相对应。我们的研究结果证明了 RLVM 从肌电图中提取潜在抑制性肌肉协同成分的可行性。
{"title":"Inhibitory Components in Muscle Synergies Factorized by The Rectified Latent Variable Model from Electromyographic Data.","authors":"Xiaoyu Guo, Subing Huang, Borong He, Chuanlin Lan, Jodie J Xie, Kelvin Y S Lau, Tomohiko Takei, Arthur D P Mak, Roy T H Cheung, Kazuhiko Seki, Vincent C K Cheung, Rosa H M Chan","doi":"10.1109/JBHI.2024.3453603","DOIUrl":"https://doi.org/10.1109/JBHI.2024.3453603","url":null,"abstract":"<p><p>Non-negative matrix factorization (NMF), widely used in motor neuroscience for identifying muscle synergies from electromyographical signals (EMGs), extracts non-negative synergies and is yet unable to identify potential negative components (NegCps) in synergies underpinned by inhibitory spinal interneurons. To overcome this constraint, we propose to utilize rectified latent variable model (RLVM) to extract muscle synergies. RLVM uses an autoencoder neural network, and the weight matrix of its neural network could be negative, while latent variables must remain non-negative. If inputs to the model are EMGs, the weight matrix and latent variables represent muscle synergies and their temporal activation coefficients, respectively. We compared performances of NMF and RLVM in identifying muscle synergies in simulated and experimental datasets. Our simulated results showed that RLVM performed better in identifying muscle-synergy subspace and NMF had a good correlation with ground truth. Finally, we applied RLVM to a previously published experimental dataset comprising EMGs from upper-limb muscles and spike recordings of spinal premotor interneurons (PreM-INs) collected from two macaque monkeys during grasping tasks. RLVM and NMF synergies were highly similar, but a few small negative muscle components were observed in RLVM synergies. The muscles with NegCps identified by RLVM exhibited near-zero values in their corresponding synergies identified by NMF. Importantly, NegCps of RLVM synergies showed correspondence with the muscle connectivity of PreM-INs with inhibitory muscle fields, as identified by spike-triggered averaging of EMGs. Our results demonstrate the feasibility of RLVM in extracting potential inhibitory muscle-synergy components from EMGs.</p>","PeriodicalId":13073,"journal":{"name":"IEEE Journal of Biomedical and Health Informatics","volume":"PP ","pages":""},"PeriodicalIF":6.7,"publicationDate":"2024-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142390171","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automated Quantification of HER2 Amplification Levels Using Deep Learning 利用深度学习自动量化 HER2 扩增水平。
IF 6.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-10-09 DOI: 10.1109/JBHI.2024.3476554
Ching-Wei Wang;Kai-Lin Chu;Ting-Sheng Su;Keng-Wei Liu;Yi-Jia Lin;Tai-Kuang Chao
HER2 assessment is necessary for patient selection in anti-HER2 targeted treatment. However, manual assessment of HER2 amplification is time-costly, labor-intensive, highly subjective and error-prone. Challenges in HER2 analysis in fluorescence in situ hybridization (FISH) and dual in situ hybridization (DISH) images include unclear and blurry cell boundaries, large variations in cell shapes and signals, overlapping and clustered cells and sparse label issues with manual annotations only on cells with high confidences, producing subjective assessment scores according to the individual choices on cell selection. To address the above-mentioned issues, we have developed a soft-sampling cascade deep learning model and a signal detection model in quantifying CEN17 and HER2 of cells to assist assessment of HER2 amplification status for patient selection of HER2 targeting therapy to breast cancer. In evaluation with two different kinds of clinical datasets, including a FISH data set and a DISH data set, the proposed method achieves high accuracy, recall and F1-score for both datasets in instance segmentation of HER2 related cells that must contain both CEN17 and HER2 signals. Moreover, the proposed method is demonstrated to significantly outperform seven state of the art recently published deep learning methods, including contour proposal network (CPN), soft label-based FCN (SL-FCN), modified fully convolutional network (M-FCN), bilayer convolutional network (BCNet), SOLOv2, Cascade R-CNN and DeepLabv3+ with three different backbones (p $leq$ 0.01). Clinically, anti-HER2 therapy can also be applied to gastric cancer patients. We applied the developed model to assist in HER2 DISH amplification assessment for gastric cancer patients, and it also showed promising predictive results (accuracy 97.67 $pm$ 1.46%, precision 96.15 $pm$ 5.82%, respectively).
在抗 HER2 靶向治疗中,HER2 评估是选择患者的必要条件。然而,人工评估 HER2 扩增耗时、耗力、主观性强且容易出错。在荧光原位杂交(FISH)和双原位杂交(DISH)图像中进行 HER2 分析所面临的挑战包括细胞边界不清晰和模糊、细胞形状和信号差异大、细胞重叠和聚集以及标签稀疏等问题,而人工标注仅针对置信度高的细胞,因此会根据个人对细胞选择的不同而产生主观评估分数。针对上述问题,我们开发了一种软采样级联深度学习模型和信号检测模型,用于量化细胞的CEN17和HER2,以辅助评估HER2扩增状态,帮助患者选择HER2靶向治疗乳腺癌。在对两种不同类型的临床数据集(包括 FISH 数据集和 DISH 数据集)进行评估时,在对必须同时包含 CEN17 和 HER2 信号的 HER2 相关细胞进行实例分割时,所提出的方法在这两种数据集上都取得了较高的准确率、召回率和 F1 分数。此外,该方法还明显优于七种最新发表的深度学习方法,包括轮廓提议网络(CPN)、基于软标签的FCN(SL-FCN)、修正的全卷积网络(M-FCN)、双层卷积网络(BCNet)、SOLOv2、级联R-CNN和具有三种不同骨架的DeepLabv3+(P≤0.01)。在临床上,抗 HER2 治疗也可用于胃癌患者。我们将所开发的模型用于辅助胃癌患者的 HER2 DISH 扩增评估,也显示出了良好的预测结果(准确率分别为 97.67 ±1.46%,精确率分别为 96.15 ±5.82%)。
{"title":"Automated Quantification of HER2 Amplification Levels Using Deep Learning","authors":"Ching-Wei Wang;Kai-Lin Chu;Ting-Sheng Su;Keng-Wei Liu;Yi-Jia Lin;Tai-Kuang Chao","doi":"10.1109/JBHI.2024.3476554","DOIUrl":"10.1109/JBHI.2024.3476554","url":null,"abstract":"HER2 assessment is necessary for patient selection in anti-HER2 targeted treatment. However, manual assessment of HER2 amplification is time-costly, labor-intensive, highly subjective and error-prone. Challenges in HER2 analysis in fluorescence in situ hybridization (FISH) and dual in situ hybridization (DISH) images include unclear and blurry cell boundaries, large variations in cell shapes and signals, overlapping and clustered cells and sparse label issues with manual annotations only on cells with high confidences, producing subjective assessment scores according to the individual choices on cell selection. To address the above-mentioned issues, we have developed a soft-sampling cascade deep learning model and a signal detection model in quantifying CEN17 and HER2 of cells to assist assessment of HER2 amplification status for patient selection of HER2 targeting therapy to breast cancer. In evaluation with two different kinds of clinical datasets, including a FISH data set and a DISH data set, the proposed method achieves high accuracy, recall and F1-score for both datasets in instance segmentation of HER2 related cells that must contain both CEN17 and HER2 signals. Moreover, the proposed method is demonstrated to significantly outperform seven state of the art recently published deep learning methods, including contour proposal network (CPN), soft label-based FCN (SL-FCN), modified fully convolutional network (M-FCN), bilayer convolutional network (BCNet), SOLOv2, Cascade R-CNN and DeepLabv3+ with three different backbones (\u0000<italic>p</i>\u0000 \u0000<inline-formula><tex-math>$leq$</tex-math></inline-formula>\u0000 0.01). Clinically, anti-HER2 therapy can also be applied to gastric cancer patients. We applied the developed model to assist in HER2 DISH amplification assessment for gastric cancer patients, and it also showed promising predictive results (accuracy 97.67 \u0000<inline-formula><tex-math>$pm$</tex-math></inline-formula>\u0000 1.46%, precision 96.15 \u0000<inline-formula><tex-math>$pm$</tex-math></inline-formula>\u0000 5.82%, respectively).","PeriodicalId":13073,"journal":{"name":"IEEE Journal of Biomedical and Health Informatics","volume":"29 1","pages":"333-344"},"PeriodicalIF":6.7,"publicationDate":"2024-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142390168","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DTI-MvSCA: An Anti-Over-Smoothing Multi-View Framework With Negative Sample Selection for Predicting Drug-Target Interactions DTI-MvSCA:具有负样本选择的抗过度平滑多视图框架预测药物-靶标相互作用
IF 6.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-10-08 DOI: 10.1109/JBHI.2024.3476120
Lihong Peng;Zongzheng Bai;Longlong Liu;Long Yang;Xin Liu;Min Chen;Xing Chen
Predicting potential drug-target interactions (DTIs) facilitates to accelerate drug discovery and reduce development cost. Current deep learning-based methods exhibit high-performance predictions, but three challenges remain: first, the absence of negative DTIs severely limits the model performance. Moreover, existing graph neural networks are beset with the scalability due to the model complexity and graph size. More importantly, most methods focus on learning the topological features while ignoring node features during DTI representation learning. To solve the limitations, here, we develop a multi-view neural network framework called DTI-MvSCA for DTI identification. This framework begins with constructing a drug-protein pair (DPP) network with matrix operation-based negative DTI selection, and then learns the DPP representations through a Multi-view neural network, finally classifies each DPP based on multilayer perceptron. Particularly, the multi-view neural network integrates graph topological feature learning based on the self-attention mechanism and SHADOW graph attention network, node feature learning based on 1D Convolutional neural network, and the Attention mechanism. An in-depth experiment on DrugBank V3.0 and V5.0 showed that DTI-MvSCA obtained precise and robust predictions against five state-of-the-art baseline methods. Furthermore, visualizing the feature distributions of the selected negative DTIs exhibits a more distinguishable and clearer boundary. In summary, DTI-MvSCA provides a useful deep learning tool to investigate potential DTIs.
预测潜在的药物-靶标相互作用(DTIs)有助于加速药物发现和降低开发成本。目前基于深度学习的方法表现出高性能的预测,但仍然存在三个挑战:首先,缺乏负dti严重限制了模型的性能。此外,现有的图神经网络由于模型的复杂性和图的大小而受到可扩展性的困扰。更重要的是,在DTI表示学习过程中,大多数方法都侧重于学习拓扑特征,而忽略了节点特征。为了解决这些限制,我们开发了一个称为DTI- mvsca的多视图神经网络框架,用于DTI识别。该框架首先构建基于矩阵运算的负DTI选择的药物蛋白对(drug-protein pair, DPP)网络,然后通过多视图神经网络学习DPP表示,最后基于多层感知器对每个DPP进行分类。其中,多视图神经网络集成了基于自注意机制和SHADOW图注意网络的图拓扑特征学习、基于一维卷积神经网络的节点特征学习和注意机制。在DrugBank V3.0和V5.0上进行的深入实验表明,DTI-MvSCA针对五种最先进的基线方法获得了精确而稳健的预测。此外,可视化所选择的负dti的特征分布显示出更可区分和更清晰的边界。总之,DTI-MvSCA提供了一个有用的深度学习工具来调查潜在的dti。
{"title":"DTI-MvSCA: An Anti-Over-Smoothing Multi-View Framework With Negative Sample Selection for Predicting Drug-Target Interactions","authors":"Lihong Peng;Zongzheng Bai;Longlong Liu;Long Yang;Xin Liu;Min Chen;Xing Chen","doi":"10.1109/JBHI.2024.3476120","DOIUrl":"https://doi.org/10.1109/JBHI.2024.3476120","url":null,"abstract":"Predicting potential drug-target interactions (DTIs) facilitates to accelerate drug discovery and reduce development cost. Current deep learning-based methods exhibit high-performance predictions, but three challenges remain: first, the absence of negative DTIs severely limits the model performance. Moreover, existing graph neural networks are beset with the scalability due to the model complexity and graph size. More importantly, most methods focus on learning the topological features while ignoring node features during DTI representation learning. To solve the limitations, here, we develop a multi-view neural network framework called DTI-MvSCA for DTI identification. This framework begins with constructing a drug-protein pair (DPP) network with matrix operation-based negative DTI selection, and then learns the DPP representations through a \u0000<underline><b>M</b></u>\u0000ulti-\u0000<bold><u>v</u></b>\u0000iew neural network, finally classifies each DPP based on multilayer perceptron. Particularly, the multi-view neural network integrates graph topological feature learning based on the self-attention mechanism and \u0000<bold><u>S</u></b>\u0000HADOW graph attention network, node feature learning based on 1D \u0000<underline><b>C</b></u>\u0000onvolutional neural network, and the \u0000<bold><u>A</u></b>\u0000ttention mechanism. An in-depth experiment on DrugBank V3.0 and V5.0 showed that DTI-MvSCA obtained precise and robust predictions against five state-of-the-art baseline methods. Furthermore, visualizing the feature distributions of the selected negative DTIs exhibits a more distinguishable and clearer boundary. In summary, DTI-MvSCA provides a useful deep learning tool to investigate potential DTIs.","PeriodicalId":13073,"journal":{"name":"IEEE Journal of Biomedical and Health Informatics","volume":"29 1","pages":"711-723"},"PeriodicalIF":6.7,"publicationDate":"2024-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142938181","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Trustworthy Curriculum Learning Guided Multi-Target Domain Adaptation Network for Autism Spectrum Disorder Classification 用于自闭症谱系障碍分类的可信课程学习引导的多目标领域适应网络。
IF 6.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-10-08 DOI: 10.1109/JBHI.2024.3476076
Jiale Dun;Jun Wang;Juncheng Li;Qianhui Yang;Wenlong Hang;Xiaofeng Lu;Shihui Ying;Jun Shi
Domain adaptation has demonstrated success in classification of multi-center autism spectrum disorder (ASD). However, current domain adaptation methods primarily focus on classifying data in a single target domain with the assistance of one or multiple source domains, lacking the capability to address the clinical scenario of identifying ASD in multiple target domains. In response to this limitation, we propose a Trustworthy Curriculum Learning Guided Multi-Target Domain Adaptation (TCL-MTDA) network for identifying ASD in multiple target domains. To effectively handle varying degrees of data shift in multiple target domains, we propose a trustworthy curriculum learning procedure based on the Dempster-Shafer (D-S) Theory of Evidence. Additionally, a domain-contrastive adaptation method is integrated into the TCL-MTDA process to align data distributions between source and target domains, facilitating the learning of domain-invariant features. The proposed TCL-MTDA method is evaluated on 437 subjects (including 220 ASD patients and 217 NCs) from the Autism Brain Imaging Data Exchange (ABIDE). Experimental results validate the effectiveness of our proposed method in multi-target ASD classification, achieving an average accuracy of 71.46% (95% CI: 68.85% - 74.06%) across four target domains, significantly outperforming most baseline methods (p<0.05).
在多中心自闭症谱系障碍(ASD)分类方面,领域适应已取得了成功。然而,目前的领域适应方法主要侧重于在一个或多个源领域的辅助下对单个目标领域的数据进行分类,缺乏在多个目标领域识别自闭症谱系障碍的临床场景的能力。针对这一局限,我们提出了一种可信课程学习引导的多目标域自适应(TCL-MTDA)网络,用于识别多个目标域中的 ASD。为了有效处理多个目标领域中不同程度的数据偏移,我们提出了基于 Dempster-Shafer (D-S) 证据理论的可信课程学习程序。此外,我们还在 TCL-MTDA 过程中集成了领域对比适应方法,以调整源领域和目标领域之间的数据分布,从而促进领域不变特征的学习。我们在自闭症脑成像数据交换中心(ABIDE)的 437 名受试者(包括 220 名 ASD 患者和 217 名 NCs)上对所提出的 TCL-MTDA 方法进行了评估。实验结果验证了我们提出的方法在多目标 ASD 分类中的有效性,在四个目标领域中取得了 71.46% (95% CI: 68.85% - 74.06%) 的平均准确率,明显优于大多数基线方法(p
{"title":"A Trustworthy Curriculum Learning Guided Multi-Target Domain Adaptation Network for Autism Spectrum Disorder Classification","authors":"Jiale Dun;Jun Wang;Juncheng Li;Qianhui Yang;Wenlong Hang;Xiaofeng Lu;Shihui Ying;Jun Shi","doi":"10.1109/JBHI.2024.3476076","DOIUrl":"10.1109/JBHI.2024.3476076","url":null,"abstract":"Domain adaptation has demonstrated success in classification of multi-center autism spectrum disorder (ASD). However, current domain adaptation methods primarily focus on classifying data in a single target domain with the assistance of one or multiple source domains, lacking the capability to address the clinical scenario of identifying ASD in multiple target domains. In response to this limitation, we propose a Trustworthy Curriculum Learning Guided Multi-Target Domain Adaptation (TCL-MTDA) network for identifying ASD in multiple target domains. To effectively handle varying degrees of data shift in multiple target domains, we propose a trustworthy curriculum learning procedure based on the Dempster-Shafer (D-S) Theory of Evidence. Additionally, a domain-contrastive adaptation method is integrated into the TCL-MTDA process to align data distributions between source and target domains, facilitating the learning of domain-invariant features. The proposed TCL-MTDA method is evaluated on 437 subjects (including 220 ASD patients and 217 NCs) from the Autism Brain Imaging Data Exchange (ABIDE). Experimental results validate the effectiveness of our proposed method in multi-target ASD classification, achieving an average accuracy of 71.46% (95% CI: 68.85% - 74.06%) across four target domains, significantly outperforming most baseline methods (p<0.05).","PeriodicalId":13073,"journal":{"name":"IEEE Journal of Biomedical and Health Informatics","volume":"29 1","pages":"310-323"},"PeriodicalIF":6.7,"publicationDate":"2024-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142390167","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Journal of Biomedical and Health Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1