首页 > 最新文献

Journal of Biomedical Informatics最新文献

英文 中文
Quantifying the effect of Behaviour Self-Regulation on well-being through causal analysis: A methodological framework for longitudinal health data 通过因果分析量化行为自我调节对健康的影响:纵向健康数据的方法学框架。
IF 4.5 2区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-02-01 Epub Date: 2026-01-10 DOI: 10.1016/j.jbi.2026.104984
Jialou Wang , Pingfan Wang , Wai Lok Woo , Kandianos Emmanouil Sakalidis , Florentina Johanna Hettinga , Angela Rodrigues , Helen Dawes , Gavin Daniel Tempest
Understanding the drivers of well-being from longitudinal behavioural data is a fundamental challenge in biomedical informatics, where traditional analyses often conflate correlation with causation. This paper presents a rigorous application of causal inference to disentangle the drivers of well-being from complex longitudinal self-report data (N=141 enrolled; N=94 analysed after a priori completeness threshold of 20 of 28 daily entries). We introduce a novel computational metric, the Behaviour Self-Regulation Score (BSRS), to quantify both trait-like (long-term) and state-like (short-term) behavioural consistency from daily reports of physical activity and sleep. Employing causal graphical models and propensity score methods, we estimate the causal effects of these behavioural patterns, controlling for motivational and perceptual confounders. Our analysis uncovers distinct causal pathways: while long-term self-regulation (BSRS-L) has a stable positive causal effect, short-term behavioural consistency (BSRS-S) demonstrates a significantly stronger causal impact on daily well-being, despite a near-zero correlation. Furthermore, we demonstrate that features selected via our causal framework significantly improve the predictive accuracy of well-being in machine learning models compared to conventional feature selection methods. This work contributes a robust methodological framework for causal analysis of longitudinal self-report data and provides evidence that causally-informed modelling can identify more potent targets for digital health interventions.
从纵向行为数据中理解幸福的驱动因素是生物医学信息学的一个基本挑战,传统的分析通常将相关性与因果关系混为一谈。本文采用严格的因果推理方法,从复杂的纵向自我报告数据中找出幸福感的驱动因素(纳入的N=141;在28个日常条目的先验完备性阈值≥20后,分析N=94)。我们引入了一种新的计算度量,即行为自我调节评分(BSRS),从日常的身体活动和睡眠报告中量化特征(长期)和状态(短期)行为一致性。采用因果图模型和倾向评分方法,我们估计这些行为模式的因果效应,控制动机和知觉混杂因素。我们的分析揭示了不同的因果途径:虽然长期自我调节(BSRS-L)具有稳定的正因果效应,但短期行为一致性(BSRS-S)对日常幸福感的因果影响明显更强,尽管相关性接近于零。此外,我们证明,与传统的特征选择方法相比,通过我们的因果框架选择的特征显著提高了机器学习模型中幸福感的预测准确性。这项工作为纵向自我报告数据的因果分析提供了一个强有力的方法框架,并提供证据表明,因果关系知情的建模可以确定更有效的数字卫生干预目标。
{"title":"Quantifying the effect of Behaviour Self-Regulation on well-being through causal analysis: A methodological framework for longitudinal health data","authors":"Jialou Wang ,&nbsp;Pingfan Wang ,&nbsp;Wai Lok Woo ,&nbsp;Kandianos Emmanouil Sakalidis ,&nbsp;Florentina Johanna Hettinga ,&nbsp;Angela Rodrigues ,&nbsp;Helen Dawes ,&nbsp;Gavin Daniel Tempest","doi":"10.1016/j.jbi.2026.104984","DOIUrl":"10.1016/j.jbi.2026.104984","url":null,"abstract":"<div><div>Understanding the drivers of well-being from longitudinal behavioural data is a fundamental challenge in biomedical informatics, where traditional analyses often conflate correlation with causation. This paper presents a rigorous application of causal inference to disentangle the drivers of well-being from complex longitudinal self-report data (<span><math><mrow><mi>N</mi><mo>=</mo><mn>141</mn></mrow></math></span> enrolled; <span><math><mrow><mi>N</mi><mo>=</mo><mn>94</mn></mrow></math></span> analysed after a priori completeness threshold of <span><math><mrow><mo>≥</mo><mn>20</mn></mrow></math></span> of 28 daily entries). We introduce a novel computational metric, the Behaviour Self-Regulation Score (BSRS), to quantify both trait-like (long-term) and state-like (short-term) behavioural consistency from daily reports of physical activity and sleep. Employing causal graphical models and propensity score methods, we estimate the causal effects of these behavioural patterns, controlling for motivational and perceptual confounders. Our analysis uncovers distinct causal pathways: while long-term self-regulation (BSRS-L) has a stable positive causal effect, short-term behavioural consistency (BSRS-S) demonstrates a significantly stronger causal impact on daily well-being, despite a near-zero correlation. Furthermore, we demonstrate that features selected via our causal framework significantly improve the predictive accuracy of well-being in machine learning models compared to conventional feature selection methods. This work contributes a robust methodological framework for causal analysis of longitudinal self-report data and provides evidence that causally-informed modelling can identify more potent targets for digital health interventions.</div></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"174 ","pages":"Article 104984"},"PeriodicalIF":4.5,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145951928","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MDD-MARF: a multimodal depression detection model based on multi-level attention mechanism and residual fusion MDD-MARF:基于多层次注意机制和残差融合的多模态抑郁检测模型
IF 4.5 2区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-01-01 Epub Date: 2025-11-29 DOI: 10.1016/j.jbi.2025.104965
Jianghai Zhou , Jike Ge , Zuqin Chen , Jie Tan , You Li

Objective

Depression is a serious mental disorder that significantly affects patients’ work ability and social functioning. With the rapid development of artificial intelligence, researchers have begun to explore automatic depression detection methods based on multimodal data. However, multimodal data are often accompanied by a large amount of noise. Existing methods usually lack sufficient feature screening after extraction and are directly applied to downstream tasks, which may limit the model’s generalization ability. In addition, current multimodal fusion strategies still face several challenges.

Methods

To address these challenges, we propose a novel multimodal depression detection model that integrates three modalities: audio, vision, and text. The model extracts depression-related key features through a multi-level attention mechanism and achieves efficient multimodal feature fusion using skip connections with a residual structure.

Results

Experiments conducted on the DAIC-WOZ dataset showed that the proposed method achieved a mean absolute error (MAE) of 3.13 and a root mean square error (RMSE) of 3.59, outperforming existing state-of-the-art models. The generalization ability of the model was further validated on the E-DAIC dataset, demonstrating its effectiveness and robustness.

Conclusion

The proposed method provides an efficient and reliable solution for depression detection using multimodal data and multi-level attention mechanisms. The findings highlight the significant value of multimodal learning in the medical field and offer strong support for the development of AI-assisted clinical decision-making systems.
目的抑郁症是一种严重的精神障碍,严重影响患者的工作能力和社会功能。随着人工智能的快速发展,研究人员开始探索基于多模态数据的抑郁症自动检测方法。然而,多模态数据往往伴随着大量的噪声。现有方法通常在提取后缺乏足够的特征筛选,直接应用于下游任务,这可能会限制模型的泛化能力。此外,当前的多模态融合策略还面临着一些挑战。为了解决这些挑战,我们提出了一种新的多模态抑郁症检测模型,该模型集成了三种模态:音频、视觉和文本。该模型通过多层次注意机制提取抑郁相关关键特征,并利用带有残余结构的跳跃连接实现高效的多模态特征融合。结果在DAIC-WOZ数据集上进行的实验表明,该方法的平均绝对误差(MAE)为3.13,均方根误差(RMSE)为3.59,优于现有的最先进模型。在e - aic数据集上进一步验证了模型的泛化能力,证明了模型的有效性和鲁棒性。结论该方法利用多模态数据和多层次注意机制,为抑郁症检测提供了高效可靠的解决方案。研究结果突出了多模式学习在医学领域的重要价值,并为人工智能辅助临床决策系统的发展提供了强有力的支持。
{"title":"MDD-MARF: a multimodal depression detection model based on multi-level attention mechanism and residual fusion","authors":"Jianghai Zhou ,&nbsp;Jike Ge ,&nbsp;Zuqin Chen ,&nbsp;Jie Tan ,&nbsp;You Li","doi":"10.1016/j.jbi.2025.104965","DOIUrl":"10.1016/j.jbi.2025.104965","url":null,"abstract":"<div><h3>Objective</h3><div>Depression is a serious mental disorder that significantly affects patients’ work ability and social functioning. With the rapid development of artificial intelligence, researchers have begun to explore automatic depression detection methods based on multimodal data. However, multimodal data are often accompanied by a large amount of noise. Existing methods usually lack sufficient feature screening after extraction and are directly applied to downstream tasks, which may limit the model’s generalization ability. In addition, current multimodal fusion strategies still face several challenges.</div></div><div><h3>Methods</h3><div>To address these challenges, we propose a novel multimodal depression detection model that integrates three modalities: audio, vision, and text. The model extracts depression-related key features through a multi-level attention mechanism and achieves efficient multimodal feature fusion using skip connections with a residual structure.</div></div><div><h3>Results</h3><div>Experiments conducted on the DAIC-WOZ dataset showed that the proposed method achieved a mean absolute error (MAE) of 3.13 and a root mean square error (RMSE) of 3.59, outperforming existing state-of-the-art models. The generalization ability of the model was further validated on the E-DAIC dataset, demonstrating its effectiveness and robustness.</div></div><div><h3>Conclusion</h3><div>The proposed method provides an efficient and reliable solution for depression detection using multimodal data and multi-level attention mechanisms. The findings highlight the significant value of multimodal learning in the medical field and offer strong support for the development of AI-assisted clinical decision-making systems.</div></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"173 ","pages":"Article 104965"},"PeriodicalIF":4.5,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145645673","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fusion framework: Conditional-aware one-stage nested event extraction model 融合框架:条件感知的单阶段嵌套事件提取模型。
IF 4.5 2区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-01-01 Epub Date: 2025-12-24 DOI: 10.1016/j.jbi.2025.104972
Sen Niu , Xiaohong Han , Liu Cao , Ye Tian , Ding Yuan , Longlong Cheng
We present CA-NEE, a Conditional-Aware one-stage model for overlapping and nested biomedical event extraction. CA-NEE integrates an event-type-aware conditioning mechanism with token-pair relation modeling to jointly identify triggers, argument spans, and roles. A Conditional Layer Normalization (CLN) dynamically adapts token representations to candidate event types, and a parallel word-pair scorer predicts span boundaries and roles in a single pass. Evaluations on GENIA11 and GENIA13 show consistent gains in Trigger Classification (TC) and Argument Classification (AC) over strong baselines, particularly on complex overlapping and nested structures. These results demonstrate that CA-NEE offers an effective and efficient solution for biomedical event extraction.
我们提出了一种用于重叠和嵌套生物医学事件提取的条件感知单阶段模型CA-NEE。CA-NEE将事件类型感知的条件调节机制与令牌对关系建模集成在一起,以联合识别触发器、参数范围和角色。条件层规范化(Conditional Layer Normalization, CLN)动态地将令牌表示适应候选事件类型,并行词对评分器在一次传递中预测跨度边界和角色。对GENIA11和GENIA13的评估显示,在强基线上,触发分类(TC)和参数分类(AC)取得了一致的进展,特别是在复杂的重叠和嵌套结构上。这些结果表明CA-NEE为生物医学事件提取提供了有效的解决方案。
{"title":"Fusion framework: Conditional-aware one-stage nested event extraction model","authors":"Sen Niu ,&nbsp;Xiaohong Han ,&nbsp;Liu Cao ,&nbsp;Ye Tian ,&nbsp;Ding Yuan ,&nbsp;Longlong Cheng","doi":"10.1016/j.jbi.2025.104972","DOIUrl":"10.1016/j.jbi.2025.104972","url":null,"abstract":"<div><div>We present CA-NEE, a Conditional-Aware one-stage model for overlapping and nested biomedical event extraction. CA-NEE integrates an event-type-aware conditioning mechanism with token-pair relation modeling to jointly identify triggers, argument spans, and roles. A Conditional Layer Normalization (CLN) dynamically adapts token representations to candidate event types, and a parallel word-pair scorer predicts span boundaries and roles in a single pass. Evaluations on <strong>GENIA11</strong> and <strong>GENIA13</strong> show consistent gains in Trigger Classification (TC) and Argument Classification (AC) over strong baselines, particularly on complex overlapping and nested structures. These results demonstrate that CA-NEE offers an effective and efficient solution for biomedical event extraction.</div></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"173 ","pages":"Article 104972"},"PeriodicalIF":4.5,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145843751","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reviewer Acknowledgement 2025 审稿人致谢2025。
IF 4.5 2区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-01-01 Epub Date: 2025-12-25 DOI: 10.1016/j.jbi.2025.104974
{"title":"Reviewer Acknowledgement 2025","authors":"","doi":"10.1016/j.jbi.2025.104974","DOIUrl":"10.1016/j.jbi.2025.104974","url":null,"abstract":"","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"173 ","pages":"Article 104974"},"PeriodicalIF":4.5,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145900440","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Modeling temporal self and interactive evolution for biomedical hypothesis generation 生物医学假设生成的时间自我和交互进化建模。
IF 4.5 2区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-01-01 Epub Date: 2025-12-11 DOI: 10.1016/j.jbi.2025.104970
Hongyun Zeng , Huiwei Zhou , Weihong Yao , Hao Zhou , Yan Zhao , Zhecheng Wang

Objectives

Hypothesis generation (HG) aims to reveal meaningful hidden relationships between scientific terms from literature for accelerating innovation in drug discovery, disease prognosis and treatment. Recent studies have successfully employed the dynamic nature of term-pair relations for HG. However, the existing methods focus on capturing the evolution of term pairs by modeling the temporal meaning of terms themselves, which is hard to accurately model intricate spatio-temporal relations between term pairs.

Methods

In this paper, a Temporal Self and Interactive Evolution (TSIE) modeling method is proposed to accurately characterize complex dynamics of term-pair relations in HG. Specifically, for each term pair, we first employ Gated Recurrent Unit (GRU) to model its Temporal Self-evolution (TSE) and Temporal Interactive Evolution (TIE) for learning its TSE Embedding (TSE_emb) and TIE Embedding (TIE_emb), respectively. Then, we adopt a dual-tower Transformer to further model the temporal dependencies of both TSE_emb and TIE_emb, which are finally integrated by a gated fusion layer for inferring the future connectivity of the term pair.

Results

Experiments on three real-world datasets Immunotherapy, Virology, and Neurology demonstrate that TSIE can effectively capture complex evolutional patterns for biomedical hypothesis generation and achieve the state-of-the-art performance.

Conclusion

This paper proposes a novel TSIE method to learn temporal interactive difference features and enhance the model’s understanding of temporal relation inference. Our TSIE learns both TSE and TIE to effectively model the dynamic relationship between terms. By adapting a dual-tower Transformer encoder, TSIE can further model the temporal dependencies of TSE and TIE.
目的:假设生成(Hypothesis generation, HG)旨在揭示文献中科学术语之间有意义的隐含关系,以加速药物发现、疾病预后和治疗的创新。最近的研究成功地利用了术语对关系的动态特性,但现有的方法主要是通过对术语本身的时间意义建模来捕捉术语对的演变,难以准确地模拟术语对之间复杂的时空关系。方法:本文提出了一种时间自交互演化(TSIE)建模方法,以准确表征HG中术语对关系的复杂动态。具体而言,对于每个术语对,我们首先使用门控循环单元(GRU)对其时间自演化(TSE)和时间交互演化(TIE)建模,分别学习其TSE嵌入(TSE_emb)和TIE嵌入(TIE_emb)。然后,我们采用双塔变压器进一步建模TSE_emb和TIE_emb的时间依赖性,最后通过门控融合层进行集成,以推断术语对的未来连通性。结果:在免疫疗法、病毒学和神经学三个真实数据集上的实验表明,TSIE可以有效地捕获生物医学假设生成的复杂进化模式,并达到最先进的性能。结论:本文提出了一种新的TSIE方法来学习时间交互差异特征,增强模型对时间关系推理的理解。我们的TSIE同时学习TSE和TIE,以有效地建模术语之间的动态关系。通过采用双塔变压器编码器,TSIE可以进一步对TSE和TIE的时间依赖性进行建模。
{"title":"Modeling temporal self and interactive evolution for biomedical hypothesis generation","authors":"Hongyun Zeng ,&nbsp;Huiwei Zhou ,&nbsp;Weihong Yao ,&nbsp;Hao Zhou ,&nbsp;Yan Zhao ,&nbsp;Zhecheng Wang","doi":"10.1016/j.jbi.2025.104970","DOIUrl":"10.1016/j.jbi.2025.104970","url":null,"abstract":"<div><h3>Objectives</h3><div>Hypothesis generation (HG) aims to reveal meaningful hidden relationships between scientific terms from literature for accelerating innovation in drug discovery, disease prognosis and treatment. Recent studies have successfully employed the dynamic nature of term-pair relations for HG. However, the existing methods focus on capturing the evolution of term pairs by modeling the temporal meaning of terms themselves, which is hard to accurately model intricate spatio-temporal relations between term pairs.</div></div><div><h3>Methods</h3><div>In this paper, a Temporal Self and Interactive Evolution (TSIE) modeling method is proposed to accurately characterize complex dynamics of term-pair relations in HG. Specifically, for each term pair, we first employ Gated Recurrent Unit (GRU) to model its Temporal Self-evolution (TSE) and Temporal Interactive Evolution (TIE) for learning its TSE Embedding (TSE_emb) and TIE Embedding (TIE_emb), respectively. Then, we adopt a dual-tower Transformer to further model the temporal dependencies of both TSE_emb and TIE_emb, which are finally integrated by a gated fusion layer for inferring the future connectivity of the term pair.</div></div><div><h3>Results</h3><div>Experiments on three real-world datasets Immunotherapy, Virology, and Neurology demonstrate that TSIE can effectively capture complex evolutional patterns for biomedical hypothesis generation and achieve the state-of-the-art performance.</div></div><div><h3>Conclusion</h3><div>This paper proposes a novel TSIE method to learn temporal interactive difference features and enhance the model’s understanding of temporal relation inference. Our TSIE learns both TSE and TIE to effectively model the dynamic relationship between terms. By adapting a dual-tower Transformer encoder, TSIE can further model the temporal dependencies of TSE and TIE.</div></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"173 ","pages":"Article 104970"},"PeriodicalIF":4.5,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145742797","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Beyond manual transcripts: Exploring the potential of automatic speech recognition errors in improving Alzheimer’s disease detection 超越手工转录:探索自动语音识别错误在提高阿尔茨海默病检测中的潜力
IF 4.5 2区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-01-01 Epub Date: 2025-12-18 DOI: 10.1016/j.jbi.2025.104968
Yin-Long Liu , Yuanchao Li , Rui Feng , Jiaxin Chen , Yiming Wang , Yu-Ang Chen , Nan Ding , Jiahong Yuan , Zhen-Hua Ling

Objective:

This study aims to extend the counterintuitive observation that Automatic Speech Recognition (ASR) errors can be beneficial for Alzheimer’s Disease (AD) detection. Our objective is to conduct a large-scale investigation to validate this phenomenon and, more importantly, to elucidate the specific mechanisms by which ASR errors can serve as valuable diagnostic clues for distinguishing individuals with AD from Healthy Controls (HC).

Methods:

We employed 18 ASR models, in both their original and fine-tuned versions, to generate 36 sets of transcripts from the ADReSS dataset. We also synthesized speech from both manual and ASR transcripts using a text-to-speech (TTS) model. Knowledge-based features and pre-trained embeddings were extracted and fed into two proposed AD detection models : a self-attention model and a cross-attention-based interpretability model. To uncover the underlying mechanisms, we conducted a multi-faceted set of analyses, including examinations of ASR error types, words affected by ASR errors, linguistic comparisons, attention weight analysis, and case studies.

Results:

We demonstrate that transcripts generated by certain ASR models achieve higher AD detection accuracy than gold-standard manual transcripts. This performance gain stems not from errors in general or a high Word Error Rate (WER), but from specific and asymmetric error patterns. Our analyses reveal that these patterns amplify some pre-existing linguistic deficits in AD speech (e.g., disfluencies), thereby increasing the feature-level divergence between the AD and HC groups. Furthermore, we show that these diagnostic clues are effectively preserved when speech is synthesized from ASR transcripts, holding significant implications for data augmentation strategies in AD research.

Conclusion:

The specific, asymmetric error patterns introduced by certain ASR models enhance the distinction between AD and HC groups by amplifying pathological linguistic deficits associated with AD. This work suggests a paradigm shift for clinical ASR development: optimizing models not merely for transcription accuracy, but for their downstream diagnostic utility.
目的:本研究旨在扩展反直觉的观察,即自动语音识别(ASR)错误可能有利于阿尔茨海默病(AD)的检测。我们的目标是进行大规模的调查来验证这一现象,更重要的是,阐明ASR错误作为区分AD个体和健康对照(HC)的有价值的诊断线索的具体机制。方法:我们采用了18个原始版本和微调版本的ASR模型,从address数据集中生成36组转录本。我们还使用文本到语音(TTS)模型从手动和ASR转录本合成语音。提取基于知识的特征和预训练的嵌入,并将其输入到两种AD检测模型中:自注意模型和基于交叉注意的可解释性模型。为了揭示潜在的机制,我们进行了多方面的分析,包括检查ASR错误类型、受ASR错误影响的单词、语言比较、注意权重分析和案例研究。结果:我们证明由某些ASR模型生成的转录本比金标准人工转录本具有更高的AD检测精度。这种性能的提高不是来自一般的错误或较高的单词错误率,而是来自特定的和不对称的错误模式。我们的分析表明,这些模式放大了AD言语中一些先前存在的语言缺陷(例如,不流利),从而增加了AD和HC群体之间的特征水平差异。此外,我们发现当从ASR转录本合成语音时,这些诊断线索有效地保留了下来,这对AD研究中的数据增强策略具有重要意义。结论:某些ASR模型引入的特定的、不对称的错误模式通过放大与AD相关的病理性语言缺陷来增强AD和HC组之间的区别。这项工作提示了临床ASR发展的范式转变:优化模型不仅是为了转录准确性,而且为了它们的下游诊断效用。
{"title":"Beyond manual transcripts: Exploring the potential of automatic speech recognition errors in improving Alzheimer’s disease detection","authors":"Yin-Long Liu ,&nbsp;Yuanchao Li ,&nbsp;Rui Feng ,&nbsp;Jiaxin Chen ,&nbsp;Yiming Wang ,&nbsp;Yu-Ang Chen ,&nbsp;Nan Ding ,&nbsp;Jiahong Yuan ,&nbsp;Zhen-Hua Ling","doi":"10.1016/j.jbi.2025.104968","DOIUrl":"10.1016/j.jbi.2025.104968","url":null,"abstract":"<div><h3>Objective:</h3><div>This study aims to extend the counterintuitive observation that Automatic Speech Recognition (ASR) errors can be beneficial for Alzheimer’s Disease (AD) detection. Our objective is to conduct a large-scale investigation to validate this phenomenon and, more importantly, to elucidate the specific mechanisms by which ASR errors can serve as valuable diagnostic clues for distinguishing individuals with AD from Healthy Controls (HC).</div></div><div><h3>Methods:</h3><div>We employed 18 ASR models, in both their original and fine-tuned versions, to generate 36 sets of transcripts from the ADReSS dataset. We also synthesized speech from both manual and ASR transcripts using a text-to-speech (TTS) model. Knowledge-based features and pre-trained embeddings were extracted and fed into two proposed AD detection models : a self-attention model and a cross-attention-based interpretability model. To uncover the underlying mechanisms, we conducted a multi-faceted set of analyses, including examinations of ASR error types, words affected by ASR errors, linguistic comparisons, attention weight analysis, and case studies.</div></div><div><h3>Results:</h3><div>We demonstrate that transcripts generated by certain ASR models achieve higher AD detection accuracy than gold-standard manual transcripts. This performance gain stems not from errors in general or a high Word Error Rate (WER), but from specific and asymmetric error patterns. Our analyses reveal that these patterns amplify some pre-existing linguistic deficits in AD speech (e.g., disfluencies), thereby increasing the feature-level divergence between the AD and HC groups. Furthermore, we show that these diagnostic clues are effectively preserved when speech is synthesized from ASR transcripts, holding significant implications for data augmentation strategies in AD research.</div></div><div><h3>Conclusion:</h3><div>The specific, asymmetric error patterns introduced by certain ASR models enhance the distinction between AD and HC groups by amplifying pathological linguistic deficits associated with AD. This work suggests a paradigm shift for clinical ASR development: optimizing models not merely for transcription accuracy, but for their downstream diagnostic utility.</div></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"173 ","pages":"Article 104968"},"PeriodicalIF":4.5,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145798054","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Domain adaptation of stable diffusion for ultrasound inpainting: a synthetic data approach for enhanced thyroid nodule segmentation 超声成像稳定扩散的域自适应:一种增强甲状腺结节分割的综合数据方法。
IF 4.5 2区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-01-01 Epub Date: 2025-11-30 DOI: 10.1016/j.jbi.2025.104963
Antonin Prochazka, Jan Zeman

Objective

To enhance the cross-domain generalization of thyroid-nodule segmentation models by augmenting limited ultrasound training data with synthetic images generated by a fine-tuned Stable Diffusion model.

Methods

Three public thyroid ultrasound datasets with heterogeneous acquisition characteristics were used: TN3K (training + testing), TDID, and TUCC. The denoising UNet inside Stable Diffusion v1.4 was fine-tuned on 2303 TN3K nodules and then used to synthesize realistic thyroid nodules. Using the model’s inpainting capability, same number of synthetic nodules were inserted into original ultrasound images. The combined data were then used to train ResUNet, DeepLabV3+ and MITUnet segmentation networks with identical hyper-parameters. Performance between the models trained on native data only and native + synthetic data was quantified with the Dice similarity coefficient (Dice score) and Intersection-over-Union (IoU).

Results

Across the in-domain TN3K test set (n = 614), performance gains were modest, with the best improvements reaching + 2.2 % in Dice score for DeepLabV3+. In contrast, substantial gains were observed on the external datasets. On the TDID dataset (n = 462), DeepLabV3+ improved from 38.2 % to 59.1 % Dice (+20.9 %), while MITUNet and ResUNet also gained up by 7.1 % and 6.9 % respectively. On the TUCC dataset (n = 192), DeepLabV3+ improved by 11.4 % in Dice, MITUNet by 6.9 %, and ResUNet by 3.1 %. All improvements—except for in-domain TN3K—were statistically significant (p < 0.01, paired t-test or Wilcoxon signed-rank test), confirming that synthetic images generated by Stable Diffusion enhance cross-domain segmentation robustness.

Conclusion

Augmenting ultrasound dataset with synthetic images generated by a task-specific Stable Diffusion model substantially improves the robustness of thyroid nodule segmentation across datasets acquired with different devices, at different institutions, and by different operators.
目的:利用经微调的稳定扩散模型生成的合成图像增强有限超声训练数据,增强甲状腺结节分割模型的跨域泛化。方法:使用三个具有异构采集特征的公共甲状腺超声数据集:TN3K(训练 + 测试)、TDID和TUCC。利用Stable Diffusion v1.4中的去噪UNet对2303个TN3K结节进行微调,合成真实的甲状腺结节。利用模型的绘制能力,将相同数量的合成结节插入到原始超声图像中。然后使用组合数据训练具有相同超参数的ResUNet、DeepLabV3+和MITUnet分割网络。使用Dice相似系数(Dice score)和Intersection-over-Union (IoU)来量化仅在本地数据和本地 + 合成数据上训练的模型之间的性能。结果:在域内TN3K测试集(n = 614)中,性能的提高是适度的,DeepLabV3+的Dice得分的最佳改进达到 + 2.2 %。相比之下,在外部数据集上观察到实质性的收益。在TDID数据集(n = 462)上,DeepLabV3+从38.2 %提高到59.1 % Dice(+20.9 %),而MITUNet和ResUNet也分别提高了7.1 %和6.9 %。在TUCC数据集(n = 192)上,DeepLabV3+在Dice上提高了11.4 %,在MITUNet上提高了6.9 %,在ResUNet上提高了3.1 %。结论:用特定任务的稳定扩散模型生成的合成图像增强超声数据集,大大提高了不同设备、不同机构和不同操作人员获得的数据集的甲状腺结节分割的鲁棒性。
{"title":"Domain adaptation of stable diffusion for ultrasound inpainting: a synthetic data approach for enhanced thyroid nodule segmentation","authors":"Antonin Prochazka,&nbsp;Jan Zeman","doi":"10.1016/j.jbi.2025.104963","DOIUrl":"10.1016/j.jbi.2025.104963","url":null,"abstract":"<div><h3>Objective</h3><div>To enhance the cross-domain generalization of thyroid-nodule segmentation models by augmenting limited ultrasound training data with synthetic images generated by a fine-tuned Stable Diffusion model.</div></div><div><h3>Methods</h3><div>Three public thyroid ultrasound datasets with heterogeneous acquisition characteristics were used: TN3K (training + testing), TDID, and TUCC. The denoising UNet inside Stable Diffusion v1.4 was fine-tuned on 2303 TN3K nodules and then used to synthesize realistic thyroid nodules. Using the model’s inpainting capability, same number of synthetic nodules were inserted into original ultrasound images. The combined data were then used to train ResUNet, DeepLabV3+ and MITUnet segmentation networks with identical hyper-parameters. Performance between the models trained on native data only and native + synthetic data was quantified with the Dice similarity coefficient (Dice score) and Intersection-over-Union (IoU).</div></div><div><h3>Results</h3><div>Across the in-domain TN3K test set (n = 614), performance gains were modest, with the best improvements reaching + 2.2 % in Dice score for DeepLabV3+. In contrast, substantial gains were observed on the external datasets. On the TDID dataset (n = 462), DeepLabV3+ improved from 38.2 % to 59.1 % Dice (+20.9 %), while MITUNet and ResUNet also gained up by 7.1 % and 6.9 % respectively. On the TUCC dataset (n = 192), DeepLabV3+ improved by 11.4 % in Dice, MITUNet by 6.9 %, and ResUNet by 3.1 %. All improvements—except for in-domain TN3K—were statistically significant (p &lt; 0.01, paired <em>t</em>-test or Wilcoxon signed-rank test), confirming that synthetic images generated by Stable Diffusion enhance cross-domain segmentation robustness.</div></div><div><h3>Conclusion</h3><div>Augmenting ultrasound dataset with synthetic images generated by a task-specific Stable Diffusion model substantially improves the robustness of thyroid nodule segmentation across datasets acquired with different devices, at different institutions, and by different operators.</div></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"173 ","pages":"Article 104963"},"PeriodicalIF":4.5,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145661395","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Journal's cover 杂志的封面
IF 4.5 2区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-01-01 Epub Date: 2026-01-07 DOI: 10.1016/S1532-0464(26)00002-X
{"title":"Journal's cover","authors":"","doi":"10.1016/S1532-0464(26)00002-X","DOIUrl":"10.1016/S1532-0464(26)00002-X","url":null,"abstract":"","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"173 ","pages":"Article 104978"},"PeriodicalIF":4.5,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145938768","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integrating retrospective quality assessment with real-time guideline application to support the episodic application of clinical guidelines over significant time periods 将回顾性质量评估与实时指南应用相结合,支持临床指南在重要时间段内的阶段性应用。
IF 4.5 2区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-01-01 Epub Date: 2025-12-29 DOI: 10.1016/j.jbi.2025.104975
Bruria Ben Shahar , Yuval Shahar , Shai Jaffe , Odeya Cohen , Erez Shalom , Maya Selivanova , Ephraim Rimon , Irit Hochberg , Ayelet Goldstein

Background

Evidence-based clinical guidelines (GLs) are essential for standardizing care, yet often difficult to apply. Most clinical decision support systems (CDSSs) assume continuous application, which misaligns with the episodic nature of real-world workflows.

Objectives

To design, implement, and evaluate e-Picard, a CDSS that provides GL-based recommendations through episodic, intermittent, on-demand consultations. The system supports retrospective assessment of past care and prospective identification of required actions. The evaluation focused on system validity and on its potential, in a retrospective simulation on real-world data, to enhance staff adherence to the GLs and to assess the potential effect of varying the frequency of the consultations.

Methods

The system development involved three preprocessing steps: (1) acquisition of free-text GLs with domain experts; (2) modeling procedural logic as workflows; and (3) flattening these into declarative temporal patterns for retrospective quality assessment and prospective recommendations. At runtime, e-Picard analyzes offline patient data to identify missed actions, computes compliance using fuzzy logic, and generates context-specific recommendations. e-Picard was applied to pressure-ulcer (PU) and diabetes management (DM) GLs, adapted for episodic use. Technical validation was performed on records from 43 PU and 82 DM patients. A retrospective simulation using 1,000 patients per domain estimated potential increases in adherence under varying consultation frequencies.

Results

Technical manual validation showed high correctness (≥99 %) and completeness (up to 98 %), based on 3,110 PU and 12,538 DM data instances (i.e., clinical measurements or actions), across various clinical scenarios over two-week observation periods.
Retrospective simulation covered 57,860 PU and 100,940 DM data instances with estimated adherence potentially increasing from 68 %–69 % to 89 %–97 % for PU and from 14 %–15 % to 60 %–87 % for DM, in the real-world data retrospective simulation, assuming full adherence of the staff to the system’s recommendations, depending on the scenario. Higher consultation frequency yielded greater gains, and adherence variability across hospital units and patient subgroups was reduced.

Conclusions

Episodic CDSSs can deliver accurate, context-aware recommendations in environments with intermittent use and incomplete data, with the potential, assuming that the real-world data retrospective simulation results hold, to enhance adherence and consistency in care.
背景:循证临床指南(GLs)是标准化护理必不可少的,但往往难以应用。大多数临床决策支持系统(cdss)假设持续应用,这与现实世界工作流程的偶然性不一致。目的:设计、实施和评估e-Picard,这是一个CDSS,通过偶发的、间歇的、按需咨询提供基于gl的建议。该系统支持对过去护理的回顾性评估和对所需行动的前瞻性识别。评价的重点是系统有效性及其潜力,通过对真实世界数据的回顾性模拟,加强工作人员对全球目标的遵守,并评估改变咨询频率的潜在影响。方法:系统开发包括三个预处理步骤:(1)与领域专家一起获取自由文本GLs;(2)将过程逻辑建模为工作流;(3)将这些扁平化为陈述性时间模式,用于回顾性质量评估和前瞻性建议。在运行时,e-Picard分析离线患者数据以识别遗漏的操作,使用模糊逻辑计算遵从性,并生成特定于上下文的建议。e-Picard应用于压疮(PU)和糖尿病管理(DM) GLs,适用于间歇性使用。对43例PU和82例DM患者的记录进行了技术验证。每个领域使用1000名患者的回顾性模拟估计了不同咨询频率下依从性的潜在增加。结果:技术手册验证显示高正确性(≥99 %)和完整性(高达98 %),基于3,110 PU和12,538 DM数据实例(即临床测量或行动),跨越两周观察期的各种临床场景。在现实世界的数据回顾性模拟中,假设员工完全遵守系统的建议,根据具体情况,回顾性模拟涵盖了57,860个PU和100,940个DM数据实例,估计依从性可能从PU的68 %-69 %增加到89 %-97 %,DM的14 %-15 %增加到60 %-87 %。更高的咨询频率产生更大的收益,并且降低了医院单位和患者亚组之间的依从性差异。结论:偶发性cdss可以在间歇性使用和数据不完整的环境中提供准确的、情境感知的建议,假设真实世界数据回顾性模拟结果有效,具有增强护理依从性和一致性的潜力。
{"title":"Integrating retrospective quality assessment with real-time guideline application to support the episodic application of clinical guidelines over significant time periods","authors":"Bruria Ben Shahar ,&nbsp;Yuval Shahar ,&nbsp;Shai Jaffe ,&nbsp;Odeya Cohen ,&nbsp;Erez Shalom ,&nbsp;Maya Selivanova ,&nbsp;Ephraim Rimon ,&nbsp;Irit Hochberg ,&nbsp;Ayelet Goldstein","doi":"10.1016/j.jbi.2025.104975","DOIUrl":"10.1016/j.jbi.2025.104975","url":null,"abstract":"<div><h3>Background</h3><div>Evidence-based clinical guidelines (GLs) are essential for standardizing care, yet often difficult to apply. Most clinical decision support systems (CDSSs) assume continuous application, which misaligns with the episodic nature of real-world workflows.</div></div><div><h3>Objectives</h3><div>To design, implement, and evaluate e-Picard, a CDSS that provides GL-based recommendations through episodic, intermittent, on-demand consultations. The system supports retrospective assessment of past care and prospective identification of required actions. The evaluation focused on system validity and on its potential, in a retrospective simulation on real-world data, to enhance staff adherence to the GLs and to assess the potential effect of varying the frequency of the consultations.</div></div><div><h3>Methods</h3><div>The system development involved three preprocessing steps: (1) acquisition of free-text GLs with domain experts; (2) modeling procedural logic as workflows; and (3) flattening these into declarative temporal patterns for retrospective quality assessment and prospective recommendations. At runtime, e-Picard analyzes offline patient data to identify missed actions, computes compliance using fuzzy logic, and generates context-specific recommendations. e-Picard was applied to pressure-ulcer (PU) and diabetes management (DM) GLs, adapted for episodic use. Technical validation was performed on records from 43 PU and 82 DM patients. A retrospective simulation using 1,000 patients per domain estimated potential increases in adherence under varying consultation frequencies.</div></div><div><h3>Results</h3><div>Technical manual validation showed high correctness (≥99 %) and completeness (up to 98 %), based on 3,110 PU and 12,538 DM data instances (i.e., clinical measurements or actions), across various clinical scenarios over two-week observation periods.</div><div>Retrospective simulation covered 57,860 PU and 100,940 DM data instances with estimated adherence potentially increasing from 68 %–69 % to 89 %–97 % for PU and from 14 %–15 % to 60 %–87 % for DM, in the real-world data retrospective simulation, assuming full adherence of the staff to the system’s recommendations, depending on the scenario. Higher consultation frequency yielded greater gains, and adherence variability across hospital units and patient subgroups was reduced.</div></div><div><h3>Conclusions</h3><div>Episodic CDSSs can deliver accurate, context-aware recommendations in environments with intermittent use and incomplete data, with the potential, assuming that the real-world data retrospective simulation results hold, to enhance adherence and consistency in care.</div></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"173 ","pages":"Article 104975"},"PeriodicalIF":4.5,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145878473","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mamba-enhanced disease semantic knowledge graph for interpretable automatic ICD coding 用于可解释的自动ICD编码的mamba增强疾病语义知识图。
IF 4.5 2区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-01-01 Epub Date: 2025-12-22 DOI: 10.1016/j.jbi.2025.104973
Pengli Lu , Chao Dong , Jingjin Xue , Fentang Gao
Automatic ICD coding refers to the process of using artificial intelligence methods to automatically extract information related to diseases, symptoms, diagnoses, treatments, and other relevant details from electronic health records, and convert it into codes that comply with the International Classification of Diseases (ICD) standard. Automatic ICD coding technology has been gradually improved with the advancement of deep learning, but in practical deployment, it still faces challenges such as inconsistent semantics, ambiguous labels, and limited interpretability. To address these issues, we propose a novel automatic ICD coding framework MKHCNet (Mamba-Knowledge-HPLA-ContraNorm Network) which integrates unstructured clinical knowledge representation, long-range dependency modeling, and contrastive normalization techniques to enhance coding performance. Specifically, we construct a disease semantic knowledge graph to enrich ICD label representations, employ the Mamba network to capture cross-domain dependencies, apply the ContraNorm module to enhance label separability, and propose the Hierarchical Position Label Attention (HPLA) mechanism to achieve fine-grained, attention-based interpretability. Finally, with the purpose of capturing complex nonlinear relationships more effectively and better adapting to complex patterns in EHR data, FastKAN acts as a classifier and utilizes radial basis function (RBF) for feature transformation. We conducted systematic experiments on the benchmark datasets MIMIC-FULL and MIMIC-50. The experimental results show that MKHCNet improves MaAUC and P@8 by 2.1% and 0.3% on MIMIC-FULL respectively compared with the best existing mainstream model. Furthermore, case studies demonstrate that the model is able to effectively identify complex semantic cues and provide strong clinical interpretability.
ICD自动编码是指利用人工智能方法,从电子健康记录中自动提取与疾病、症状、诊断、治疗等相关细节信息,并将其转换为符合国际疾病分类(ICD)标准的代码的过程。随着深度学习的推进,自动ICD编码技术逐渐得到完善,但在实际部署中,仍然面临语义不一致、标签模糊、可解释性有限等挑战。为了解决这些问题,我们提出了一种新的ICD自动编码框架MKHCNet (mamba - knowledge - hpla - contransform Network),该框架集成了非结构化临床知识表示、远程依赖建模和对比归一化技术来提高编码性能。具体而言,我们构建了疾病语义知识图来丰富ICD标签表示,使用Mamba网络捕获跨领域依赖关系,应用contransform模块增强标签可分离性,并提出了分层位置标签注意(HPLA)机制来实现细粒度的、基于注意的可解释性。最后,为了更有效地捕捉复杂的非线性关系,更好地适应电子病历数据中的复杂模式,FastKAN作为分类器,利用径向基函数(RBF)进行特征转换。我们在基准数据集MIMIC- full和MIMIC 50上进行了系统的实验。实验结果表明,与现有最佳主流模型相比,MKHCNet在MIMIC-FULL上的MaAUC和MaF分别提高了2.2%和0.9%。此外,案例研究表明,该模型能够有效识别复杂的语义线索,并提供强大的临床可解释性。
{"title":"Mamba-enhanced disease semantic knowledge graph for interpretable automatic ICD coding","authors":"Pengli Lu ,&nbsp;Chao Dong ,&nbsp;Jingjin Xue ,&nbsp;Fentang Gao","doi":"10.1016/j.jbi.2025.104973","DOIUrl":"10.1016/j.jbi.2025.104973","url":null,"abstract":"<div><div>Automatic ICD coding refers to the process of using artificial intelligence methods to automatically extract information related to diseases, symptoms, diagnoses, treatments, and other relevant details from electronic health records, and convert it into codes that comply with the International Classification of Diseases (ICD) standard. Automatic ICD coding technology has been gradually improved with the advancement of deep learning, but in practical deployment, it still faces challenges such as inconsistent semantics, ambiguous labels, and limited interpretability. To address these issues, we propose a novel automatic ICD coding framework <strong>MKHCNet</strong> (<strong>M</strong>amba-<strong>K</strong>nowledge-<strong>H</strong>PLA-<strong>C</strong>ontraNorm <strong>Net</strong>work) which integrates unstructured clinical knowledge representation, long-range dependency modeling, and contrastive normalization techniques to enhance coding performance. Specifically, we construct a disease semantic knowledge graph to enrich ICD label representations, employ the Mamba network to capture cross-domain dependencies, apply the ContraNorm module to enhance label separability, and propose the Hierarchical Position Label Attention (HPLA) mechanism to achieve fine-grained, attention-based interpretability. Finally, with the purpose of capturing complex nonlinear relationships more effectively and better adapting to complex patterns in EHR data, FastKAN acts as a classifier and utilizes radial basis function (RBF) for feature transformation. We conducted systematic experiments on the benchmark datasets MIMIC-FULL and MIMIC-50. The experimental results show that MKHCNet improves MaAUC and P<span><math><mi>@</mi></math></span>8 by 2.1% and 0.3% on MIMIC-FULL respectively compared with the best existing mainstream model. Furthermore, case studies demonstrate that the model is able to effectively identify complex semantic cues and provide strong clinical interpretability.</div></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"173 ","pages":"Article 104973"},"PeriodicalIF":4.5,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145827758","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Biomedical Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1