首页 > 最新文献

Journal of Biomedical Informatics最新文献

英文 中文
Promoting smartphone-based keratitis screening using meta-learning: A multicenter study 利用元学习推广基于智能手机的角膜炎筛查:一项多中心研究。
IF 4 2区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-09-01 DOI: 10.1016/j.jbi.2024.104722
Zhongwen Li , Yangyang Wang , Kuan Chen , Wei Qiang , Xihang Zong , Ke Ding , Shihong Wang , Shiqi Yin , Jiewei Jiang , Wei Chen

Objective

Keratitis is the primary cause of corneal blindness worldwide. Prompt identification and referral of patients with keratitis are fundamental measures to improve patient prognosis. Although deep learning can assist ophthalmologists in automatically detecting keratitis through a slit lamp camera, remote and underserved areas often lack this professional equipment. Smartphones, a widely available device, have recently been found to have potential in keratitis screening. However, given the limited data available from smartphones, employing traditional deep learning algorithms to construct a robust intelligent system presents a significant challenge. This study aimed to propose a meta-learning framework, cosine nearest centroid-based metric learning (CNCML), for developing a smartphone-based keratitis screening model in the case of insufficient smartphone data by leveraging the prior knowledge acquired from slit-lamp photographs.

Methods

We developed and assessed CNCML based on 13,009 slit-lamp photographs and 4,075 smartphone photographs that were obtained from 3 independent clinical centers. To mimic real-world scenarios with various degrees of sample scarcity, we used training sets of different sizes (0 to 20 photographs per class) from the HUAWEI smartphone to train CNCML. We evaluated the performance of CNCML not only on an internal test dataset but also on two external datasets that were collected by two different brands of smartphones (VIVO and XIAOMI) in another clinical center. Furthermore, we compared the performance of CNCML with that of traditional deep learning models on these smartphone datasets. The accuracy and macro-average area under the curve (macro-AUC) were utilized to evaluate the performance of models.

Results

With merely 15 smartphone photographs per class used for training, CNCML reached accuracies of 84.59%, 83.15%, and 89.99% on three smartphone datasets, with corresponding macro-AUCs of 0.96, 0.95, and 0.98, respectively. The accuracies of CNCML on these datasets were 0.56% to 9.65% higher than those of the most competitive traditional deep learning models.

Conclusions

CNCML exhibited fast learning capabilities, attaining remarkable performance with a small number of training samples. This approach presents a potential solution for transitioning intelligent keratitis detection from professional devices (e.g., slit-lamp cameras) to more ubiquitous devices (e.g., smartphones), making keratitis screening more convenient and effective.

目的:角膜炎是全球角膜失明的主要原因。及时发现和转诊角膜炎患者是改善患者预后的基本措施。虽然深度学习可以帮助眼科医生通过裂隙灯相机自动检测角膜炎,但偏远地区和服务不足的地区往往缺乏这种专业设备。智能手机作为一种广泛使用的设备,最近被发现在角膜炎筛查方面具有潜力。然而,由于智能手机提供的数据有限,采用传统的深度学习算法来构建一个强大的智能系统是一个巨大的挑战。本研究旨在提出一种元学习框架--基于余弦最近中心点的度量学习(CNCML),通过利用从裂隙灯照片中获取的先验知识,在智能手机数据不足的情况下开发基于智能手机的角膜炎筛查模型:我们根据从 3 个独立临床中心获得的 13,009 张裂隙灯照片和 4,075 张智能手机照片开发并评估了 CNCML。为了模拟真实世界中不同程度的样本稀缺情况,我们使用 HUAWEI 智能手机中不同大小的训练集(每类 0 到 20 张照片)来训练 CNCML。我们不仅在内部测试数据集上评估了 CNCML 的性能,还在两个外部数据集上评估了 CNCML 的性能,这两个外部数据集是由另一个临床中心的两个不同品牌的智能手机(VIVO 和 XIAOMI)收集的。此外,我们还比较了 CNCML 与传统深度学习模型在这些智能手机数据集上的表现。我们利用准确率和宏观平均曲线下面积(macro-AUC)来评估模型的性能:在每个类别仅使用 15 张智能手机照片进行训练的情况下,CNCML 在三个智能手机数据集上的准确率分别达到 84.59%、83.15% 和 89.99%,相应的宏观平均曲线下面积(macro-AUC)分别为 0.96、0.95 和 0.98。CNCML 在这些数据集上的准确率比最具竞争力的传统深度学习模型高出 0.56% 至 9.65%:CNCML 展示了快速学习能力,只需少量训练样本就能获得出色的性能。这种方法为将智能角膜炎检测从专业设备(如裂隙灯照相机)过渡到更普及的设备(如智能手机)提供了一种潜在的解决方案,使角膜炎筛查更加方便有效。
{"title":"Promoting smartphone-based keratitis screening using meta-learning: A multicenter study","authors":"Zhongwen Li ,&nbsp;Yangyang Wang ,&nbsp;Kuan Chen ,&nbsp;Wei Qiang ,&nbsp;Xihang Zong ,&nbsp;Ke Ding ,&nbsp;Shihong Wang ,&nbsp;Shiqi Yin ,&nbsp;Jiewei Jiang ,&nbsp;Wei Chen","doi":"10.1016/j.jbi.2024.104722","DOIUrl":"10.1016/j.jbi.2024.104722","url":null,"abstract":"<div><h3>Objective</h3><p>Keratitis is the primary cause of corneal blindness worldwide. Prompt identification and referral of patients with keratitis are fundamental measures to improve patient prognosis. Although deep learning can assist ophthalmologists in automatically detecting keratitis through a slit lamp camera, remote and underserved areas often lack this professional equipment. Smartphones, a widely available device, have recently been found to have potential in keratitis screening. However, given the limited data available from smartphones, employing traditional deep learning algorithms to construct a robust intelligent system presents a significant challenge. This study aimed to propose a meta-learning framework, cosine nearest centroid-based metric learning (CNCML), for developing a smartphone-based keratitis screening model in the case of insufficient smartphone data by leveraging the prior knowledge acquired from slit-lamp photographs.</p></div><div><h3>Methods</h3><p>We developed and assessed CNCML based on 13,009 slit-lamp photographs and 4,075 smartphone photographs that were obtained from 3 independent clinical centers. To mimic real-world scenarios with various degrees of sample scarcity, we used training sets of different sizes (0 to 20 photographs per class) from the HUAWEI smartphone to train CNCML. We evaluated the performance of CNCML not only on an internal test dataset but also on two external datasets that were collected by two different brands of smartphones (VIVO and XIAOMI) in another clinical center. Furthermore, we compared the performance of CNCML with that of traditional deep learning models on these smartphone datasets. The accuracy and macro-average area under the curve (macro-AUC) were utilized to evaluate the performance of models.</p></div><div><h3>Results</h3><p>With merely 15 smartphone photographs per class used for training, CNCML reached accuracies of 84.59%, 83.15%, and 89.99% on three smartphone datasets, with corresponding macro-AUCs of 0.96, 0.95, and 0.98, respectively. The accuracies of CNCML on these datasets were 0.56% to 9.65% higher than those of the most competitive traditional deep learning models.</p></div><div><h3>Conclusions</h3><p>CNCML exhibited fast learning capabilities, attaining remarkable performance with a small number of training samples. This approach presents a potential solution for transitioning intelligent keratitis detection from professional devices (e.g., slit-lamp cameras) to more ubiquitous devices (e.g., smartphones), making keratitis screening more convenient and effective.</p></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"157 ","pages":"Article 104722"},"PeriodicalIF":4.0,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142145736","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MAFT-SO: A novel multi-atlas fusion template based on spatial overlap for ASD diagnosis MAFT-SO:基于空间重叠的新型多图集融合模板,用于 ASD 诊断。
IF 4 2区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-09-01 DOI: 10.1016/j.jbi.2024.104714
Yuefeng Ma , Xiaochen Mu , Tengfei Zhang , Yu Zhao

Autism spectrum disorder (ASD) is a common neurological condition. Early diagnosis and treatment are essential for enhancing the life quality of individuals with ASD. However, most existing studies either focus solely on the brain networks of subjects within a single atlas or merely employ simple matrix concatenation to represent the fusion of multi-atlas. These approaches neglected the natural spatial overlap that exists between brain regions across multi-atlas and did not fully capture the comprehensive information of brain regions under different atlases. To tackle this weakness, in this paper, we propose a novel multi-atlas fusion template based on spatial overlap degree of brain regions, which aims to obtain a comprehensive representation of brain networks. Specifically, we formally define a measurement of the spatial overlap among brain regions across different atlases, named spatial overlap degree. Then, we fuse the multi-atlas to obtain brain networks of each subject based on spatial overlap. Finally, the GCN is used to perform the final classification. The experimental results on Autism Brain Imaging Data Exchange (ABIDE) demonstrate that our proposed method achieved an accuracy of 0.757. Overall, our method outperforms SOTA methods in ASD/TC classification.

自闭症谱系障碍(ASD)是一种常见的神经系统疾病。早期诊断和治疗对提高自闭症患者的生活质量至关重要。然而,现有的大多数研究要么只关注单一图集中受试者的大脑网络,要么只采用简单的矩阵连接来表示多图集的融合。这些方法忽视了多图谱中脑区之间存在的天然空间重叠,无法全面捕捉不同图谱下脑区的综合信息。针对这一缺陷,本文提出了一种基于脑区空间重叠度的新型多图集融合模板,旨在获得脑网络的综合表征。具体来说,我们正式定义了不同图集中脑区空间重叠度的测量方法,命名为空间重叠度。然后,我们融合多图集,根据空间重叠度获得每个受试者的脑网络。最后,利用 GCN 进行最终分类。自闭症脑成像数据交换(ABIDE)的实验结果表明,我们提出的方法达到了 0.757 的准确率。总体而言,我们的方法在 ASD/TC 分类方面优于 SOTA 方法。
{"title":"MAFT-SO: A novel multi-atlas fusion template based on spatial overlap for ASD diagnosis","authors":"Yuefeng Ma ,&nbsp;Xiaochen Mu ,&nbsp;Tengfei Zhang ,&nbsp;Yu Zhao","doi":"10.1016/j.jbi.2024.104714","DOIUrl":"10.1016/j.jbi.2024.104714","url":null,"abstract":"<div><p>Autism spectrum disorder (ASD) is a common neurological condition. Early diagnosis and treatment are essential for enhancing the life quality of individuals with ASD. However, most existing studies either focus solely on the brain networks of subjects within a single atlas or merely employ simple matrix concatenation to represent the fusion of multi-atlas. These approaches neglected the natural spatial overlap that exists between brain regions across multi-atlas and did not fully capture the comprehensive information of brain regions under different atlases. To tackle this weakness, in this paper, we propose a novel multi-atlas fusion template based on spatial overlap degree of brain regions, which aims to obtain a comprehensive representation of brain networks. Specifically, we formally define a measurement of the spatial overlap among brain regions across different atlases, named spatial overlap degree. Then, we fuse the multi-atlas to obtain brain networks of each subject based on spatial overlap. Finally, the GCN is used to perform the final classification. The experimental results on Autism Brain Imaging Data Exchange (ABIDE) demonstrate that our proposed method achieved an accuracy of 0.757. Overall, our method outperforms SOTA methods in ASD/TC classification.</p></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"157 ","pages":"Article 104714"},"PeriodicalIF":4.0,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142072869","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integrating graph convolutional networks to enhance prompt learning for biomedical relation extraction 整合图卷积网络,加强生物医学关系提取的及时学习
IF 4 2区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-09-01 DOI: 10.1016/j.jbi.2024.104717
Bocheng Guo , Jiana Meng , Di Zhao , Xiangxing Jia , Yonghe Chu , Hongfei Lin

Background and Objective:

Biomedical relation extraction aims to reveal the relation between entities in medical texts. Currently, the relation extraction models that have attracted much attention are mainly to fine-tune the pre-trained language models (PLMs) or add template prompt learning, which also limits the ability of the model to deal with grammatical dependencies. Graph convolutional networks (GCNs) can play an important role in processing syntactic dependencies in biomedical texts.

Methods:

In this work, we propose a biomedical relation extraction model that fuses GCNs enhanced prompt learning to handle limitations in syntactic dependencies and achieve good performance. Specifically, we propose a model that combines prompt learning with GCNs for relation extraction, by integrating the syntactic dependency information analyzed by GCNs into the prompt learning model, by predicting the correspondence with [MASK] tokens labels for relation extraction.

Results:

Our model achieved F1 scores of 85.57%, 80.15%, 95.10%, and 84.11% in the biomedical relation extraction datasets GAD, ChemProt, PGR, and DDI, respectively, all of which outperform some existing baseline models.

Conclusions:

In this paper, we propose enhancing prompt learning through GCNs, integrating syntactic information into biomedical relation extraction tasks. Experimental results show that our proposed method achieves excellent performance in the biomedical relation extraction task.

背景与目的:生物医学关系提取旨在揭示医学文本中实体之间的关系。目前,备受关注的关系提取模型主要是对预训练语言模型(PLM)进行微调或添加模板提示学习,这也限制了模型处理语法依赖关系的能力。方法:在这项工作中,我们提出了一种生物医学关系提取模型,该模型融合了增强型提示学习(GCNs enhanced prompt learning),以处理语法依赖关系的局限性并获得良好的性能。具体来说,我们提出了一种将提示学习与 GCNs 结合起来进行关系提取的模型,将 GCNs 分析的句法依赖信息整合到提示学习模型中,通过预测与 [MASK] 标记的对应关系进行关系提取。结果:在生物医学关系提取数据集 GAD、ChemProt、PGR 和 DDI 中,我们的模型分别取得了 85.57%、80.15%、95.10% 和 84.11% 的 F1 分数,均优于现有的一些基线模型。实验结果表明,我们提出的方法在生物医学关系提取任务中取得了优异的性能。
{"title":"Integrating graph convolutional networks to enhance prompt learning for biomedical relation extraction","authors":"Bocheng Guo ,&nbsp;Jiana Meng ,&nbsp;Di Zhao ,&nbsp;Xiangxing Jia ,&nbsp;Yonghe Chu ,&nbsp;Hongfei Lin","doi":"10.1016/j.jbi.2024.104717","DOIUrl":"10.1016/j.jbi.2024.104717","url":null,"abstract":"<div><h3>Background and Objective:</h3><p>Biomedical relation extraction aims to reveal the relation between entities in medical texts. Currently, the relation extraction models that have attracted much attention are mainly to fine-tune the pre-trained language models (PLMs) or add template prompt learning, which also limits the ability of the model to deal with grammatical dependencies. Graph convolutional networks (GCNs) can play an important role in processing syntactic dependencies in biomedical texts.</p></div><div><h3>Methods:</h3><p>In this work, we propose a biomedical relation extraction model that fuses GCNs enhanced prompt learning to handle limitations in syntactic dependencies and achieve good performance. Specifically, we propose a model that combines prompt learning with GCNs for relation extraction, by integrating the syntactic dependency information analyzed by GCNs into the prompt learning model, by predicting the correspondence with [MASK] tokens labels for relation extraction.</p></div><div><h3>Results:</h3><p>Our model achieved F1 scores of 85.57%, 80.15%, 95.10%, and 84.11% in the biomedical relation extraction datasets GAD, ChemProt, PGR, and DDI, respectively, all of which outperform some existing baseline models.</p></div><div><h3>Conclusions:</h3><p>In this paper, we propose enhancing prompt learning through GCNs, integrating syntactic information into biomedical relation extraction tasks. Experimental results show that our proposed method achieves excellent performance in the biomedical relation extraction task.</p></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"157 ","pages":"Article 104717"},"PeriodicalIF":4.0,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142094932","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A conditional multi-label model to improve prediction of a rare outcome: An illustration predicting autism diagnosis 改善罕见结果预测的条件多标签模型:自闭症诊断预测示例。
IF 4 2区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-09-01 DOI: 10.1016/j.jbi.2024.104711
Wei A. Huang , Matthew Engelhard , Marika Coffman , Elliot D. Hill , Qin Weng , Abby Scheer , Gary Maslow , Ricardo Henao , Geraldine Dawson , Benjamin A. Goldstein

Objective

This study aimed to develop a novel approach using routinely collected electronic health records (EHRs) data to improve the prediction of a rare event. We illustrated this using an example of improving early prediction of an autism diagnosis, given its low prevalence, by leveraging correlations between autism and other neurodevelopmental conditions (NDCs).

Methods

To achieve this, we introduced a conditional multi-label model by merging conditional learning and multi-label methodologies. The conditional learning approach breaks a hard task into more manageable pieces in each stage, and the multi-label approach utilizes information from related neurodevelopmental conditions to learn predictive latent features. The study involved forecasting autism diagnosis by age 5.5 years, utilizing data from the first 18 months of life, and the analysis of feature importance correlations to explore the alignment within the feature space across different conditions.

Results

Upon analysis of health records from 18,156 children, we are able to generate a model that predicts a future autism diagnosis with moderate performance (AUROC=0.76). The proposed conditional multi-label method significantly improves predictive performance with an AUROC of 0.80 (p < 0.001). Further examination shows that both the conditional and multi-label approach alone provided marginal lift to the model performance compared to a one-stage one-label approach. We also demonstrated the generalizability and applicability of this method using simulated data with high correlation between feature vectors for different labels.

Conclusion

Our findings underscore the effectiveness of the developed conditional multi-label model for early prediction of an autism diagnosis. The study introduces a versatile strategy applicable to prediction tasks involving limited target populations but sharing underlying features or etiology among related groups.

研究目的本研究旨在利用常规收集的电子健康记录 (EHR) 数据开发一种新方法,以改善对罕见事件的预测。我们以自闭症为例进行了说明,鉴于自闭症发病率较低,我们利用自闭症与其他神经发育疾病(NDCs)之间的相关性,改善了对自闭症诊断的早期预测:为此,我们将条件学习和多标签方法相结合,引入了条件多标签模型。条件学习法将一项艰巨的任务分解为更易于管理的各个阶段,而多标签法则利用相关神经发育状况的信息来学习预测性潜在特征。研究涉及利用出生后头 18 个月的数据预测 5.5 岁前的自闭症诊断,并分析特征重要性相关性,以探索不同条件下特征空间内的一致性:通过对 18156 名儿童的健康记录进行分析,我们能够生成一个预测未来自闭症诊断的模型,该模型的性能处于中等水平(AUROC=0.76)。所提出的条件多标签方法显著提高了预测性能,AUROC 为 0.80(p 结论:我们的研究结果强调了多标签方法的有效性:我们的研究结果凸显了所开发的条件多标签模型在早期预测自闭症诊断方面的有效性。该研究介绍了一种多功能策略,适用于涉及有限目标人群但相关群体具有相同基本特征或病因的预测任务。
{"title":"A conditional multi-label model to improve prediction of a rare outcome: An illustration predicting autism diagnosis","authors":"Wei A. Huang ,&nbsp;Matthew Engelhard ,&nbsp;Marika Coffman ,&nbsp;Elliot D. Hill ,&nbsp;Qin Weng ,&nbsp;Abby Scheer ,&nbsp;Gary Maslow ,&nbsp;Ricardo Henao ,&nbsp;Geraldine Dawson ,&nbsp;Benjamin A. Goldstein","doi":"10.1016/j.jbi.2024.104711","DOIUrl":"10.1016/j.jbi.2024.104711","url":null,"abstract":"<div><h3>Objective</h3><p>This study aimed to develop a novel approach using routinely collected electronic health records (EHRs) data to improve the prediction of a rare event. We illustrated this using an example of improving early prediction of an autism diagnosis, given its low prevalence, by leveraging correlations between autism and other neurodevelopmental conditions (NDCs).</p></div><div><h3>Methods</h3><p>To achieve this, we introduced a conditional multi-label model by merging conditional learning and multi-label methodologies. The conditional learning approach breaks a hard task into more manageable pieces in each stage, and the multi-label approach utilizes information from related neurodevelopmental conditions to learn predictive latent features. The study involved forecasting autism diagnosis by age 5.5 years, utilizing data from the first 18 months of life, and the analysis of feature importance correlations to explore the alignment within the feature space across different conditions.</p></div><div><h3>Results</h3><p>Upon analysis of health records from 18,156 children, we are able to generate a model that predicts a future autism diagnosis with moderate performance (AUROC=0.76). The proposed conditional multi-label method significantly improves predictive performance with an AUROC of 0.80 (<em>p</em> &lt; 0.001). Further examination shows that both the conditional and multi-label approach alone provided marginal lift to the model performance compared to a one-stage one-label approach. We also demonstrated the generalizability and applicability of this method using simulated data with high correlation between feature vectors for different labels.</p></div><div><h3>Conclusion</h3><p>Our findings underscore the effectiveness of the developed conditional multi-label model for early prediction of an autism diagnosis. The study introduces a versatile strategy applicable to prediction tasks involving limited target populations but sharing underlying features or etiology among related groups.</p></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"157 ","pages":"Article 104711"},"PeriodicalIF":4.0,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142055705","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Assessing gait dysfunction severity in Parkinson’s Disease using 2-Stream Spatial–Temporal Neural Network 利用双流时空神经网络评估帕金森病步态功能障碍的严重程度
IF 4 2区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-09-01 DOI: 10.1016/j.jbi.2024.104679

Parkinson’s Disease (PD), a neurodegenerative disorder, significantly impacts the quality of life for millions of people worldwide. PD primarily impacts dopaminergic neurons in the brain’s substantia nigra, resulting in dopamine deficiency and gait impairments such as bradykinesia and rigidity. Currently, several well-established tools, such as the Movement Disorder Society-Unified Parkinson’s Disease Rating Scale (MDS-UPDRS) and Hoehn and Yahr (H&Y) Scale, are used for evaluating gait dysfunction in PD. While insightful, these methods are subjective, time-consuming, and often ineffective in early-stage diagnosis. Other methods using specialized sensors and equipment to measure movement disorders are cumbersome and expensive, limiting their accessibility. This study introduces a hierarchical approach to evaluating gait dysfunction in PD through videos. The novel 2-Stream Spatial–Temporal Neural Network (2S-STNN) leverages the spatial–temporal features from the skeleton and silhouette streams for PD classification. This approach achieves an accuracy rate of 89% and outperforms other state-of-the-art models. The study also employs saliency values to highlight critical body regions that significantly influence model decisions and are severely affected by the disease. For a more detailed analysis, the study investigates 21 specific gait attributes for a nuanced quantification of gait disorders. Parameters such as walking pace, step length, and neck forward angle are found to be strongly correlated with PD gait severity categories. This approach offers a comprehensive and convenient solution for PD management in clinical settings, enabling patients to receive a more precise evaluation and monitoring of their gait impairments.

帕金森病(PD)是一种神经退行性疾病,严重影响着全球数百万人的生活质量。帕金森病主要影响大脑黑质中的多巴胺能神经元,导致多巴胺缺乏和步态障碍,如运动迟缓和僵硬。目前,运动障碍协会-统一帕金森病评定量表(MDS-UPDRS)和Hoehn and Yahr(H&Y)量表等几种成熟的工具被用于评估帕金森病的步态功能障碍。这些方法虽然很有见地,但都很主观、耗时,而且在早期诊断中往往效果不佳。其他使用专门传感器和设备测量运动障碍的方法既繁琐又昂贵,限制了其普及性。本研究介绍了一种通过视频评估帕金森病步态功能障碍的分层方法。新颖的双流空间-时间神经网络(2S-STNN)利用骨架流和轮廓流的空间-时间特征进行帕金森病分类。这种方法的准确率达到 89%,优于其他最先进的模型。该研究还采用了显著性值来突出关键的身体区域,这些区域对模型的决策有重大影响,并受到疾病的严重影响。为了进行更详细的分析,该研究调查了 21 个特定的步态属性,以对步态障碍进行更细致的量化。研究发现,步行速度、步长和颈部前倾角度等参数与帕金森病步态严重程度类别密切相关。这种方法为临床上的帕金森病管理提供了一种全面、便捷的解决方案,使患者能够得到更精确的步态障碍评估和监测。
{"title":"Assessing gait dysfunction severity in Parkinson’s Disease using 2-Stream Spatial–Temporal Neural Network","authors":"","doi":"10.1016/j.jbi.2024.104679","DOIUrl":"10.1016/j.jbi.2024.104679","url":null,"abstract":"<div><p>Parkinson’s Disease (PD), a neurodegenerative disorder, significantly impacts the quality of life for millions of people worldwide. PD primarily impacts dopaminergic neurons in the brain’s substantia nigra, resulting in dopamine deficiency and gait impairments such as bradykinesia and rigidity. Currently, several well-established tools, such as the Movement Disorder Society-Unified Parkinson’s Disease Rating Scale (MDS-UPDRS) and Hoehn and Yahr (H&amp;Y) Scale, are used for evaluating gait dysfunction in PD. While insightful, these methods are subjective, time-consuming, and often ineffective in early-stage diagnosis. Other methods using specialized sensors and equipment to measure movement disorders are cumbersome and expensive, limiting their accessibility. This study introduces a hierarchical approach to evaluating gait dysfunction in PD through videos. The novel 2-Stream Spatial–Temporal Neural Network (2S-STNN) leverages the spatial–temporal features from the skeleton and silhouette streams for PD classification. This approach achieves an accuracy rate of 89% and outperforms other state-of-the-art models. The study also employs saliency values to highlight critical body regions that significantly influence model decisions and are severely affected by the disease. For a more detailed analysis, the study investigates 21 specific gait attributes for a nuanced quantification of gait disorders. Parameters such as walking pace, step length, and neck forward angle are found to be strongly correlated with PD gait severity categories. This approach offers a comprehensive and convenient solution for PD management in clinical settings, enabling patients to receive a more precise evaluation and monitoring of their gait impairments.</p></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"157 ","pages":"Article 104679"},"PeriodicalIF":4.0,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141457162","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Interactive dual-stream contrastive learning for radiology report generation 用于生成放射学报告的交互式双流对比学习
IF 4 2区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-09-01 DOI: 10.1016/j.jbi.2024.104718
Ziqi Zhang, Ailian Jiang

Radiology report generation automates diagnostic narrative synthesis from medical imaging data. Current report generation methods primarily employ knowledge graphs for image enhancement, neglecting the interpretability and guiding function of the knowledge graphs themselves. Additionally, few approaches leverage the stable modal alignment information from multimodal pre-trained models to facilitate the generation of radiology reports. We propose the Terms-Guided Radiology Report Generation (TGR), a simple and practical model for generating reports guided primarily by anatomical terms. Specifically, we utilize a dual-stream visual feature extraction module comprised of detail extraction module and a frozen multimodal pre-trained model to separately extract visual detail features and semantic features. Furthermore, a Visual Enhancement Module (VEM) is proposed to further enrich the visual features, thereby facilitating the generation of a list of anatomical terms. We integrate anatomical terms with image features and proceed to engage contrastive learning with frozen text embeddings, utilizing the stable feature space from these embeddings to boost modal alignment capabilities further. Our model incorporates the capability for manual input, enabling it to generate a list of organs for specifically focused abnormal areas or to produce more accurate single-sentence descriptions based on selected anatomical terms. Comprehensive experiments demonstrate the effectiveness of our method in report generation tasks, our TGR-S model reduces training parameters by 38.9% while performing comparably to current state-of-the-art models, and our TGR-B model exceeds the best baseline models across multiple metrics.

放射学报告生成可自动根据医学影像数据进行诊断叙述综合。目前的报告生成方法主要采用知识图谱来增强图像,而忽略了知识图谱本身的可解释性和指导功能。此外,很少有方法利用多模态预训练模型的稳定模态配准信息来促进放射报告的生成。我们提出的术语指导放射报告生成(TGR)是一种简单实用的模型,主要以解剖术语为指导生成报告。具体来说,我们利用由细节提取模块和冷冻多模态预训练模型组成的双流视觉特征提取模块,分别提取视觉细节特征和语义特征。此外,我们还提出了视觉增强模块(VEM)来进一步丰富视觉特征,从而促进解剖术语列表的生成。我们将解剖术语与图像特征整合在一起,然后使用冻结文本嵌入进行对比学习,利用这些嵌入的稳定特征空间进一步提高模态配准能力。我们的模型具有手动输入功能,可针对特定的异常区域生成器官列表,或根据选定的解剖术语生成更准确的单句描述。综合实验证明了我们的方法在报告生成任务中的有效性,我们的 TGR-S 模型减少了 38.9% 的训练参数,同时与当前最先进的模型性能相当,而我们的 TGR-B 模型在多个指标上都超过了最佳基线模型。
{"title":"Interactive dual-stream contrastive learning for radiology report generation","authors":"Ziqi Zhang,&nbsp;Ailian Jiang","doi":"10.1016/j.jbi.2024.104718","DOIUrl":"10.1016/j.jbi.2024.104718","url":null,"abstract":"<div><p>Radiology report generation automates diagnostic narrative synthesis from medical imaging data. Current report generation methods primarily employ knowledge graphs for image enhancement, neglecting the interpretability and guiding function of the knowledge graphs themselves. Additionally, few approaches leverage the stable modal alignment information from multimodal pre-trained models to facilitate the generation of radiology reports. We propose the Terms-Guided Radiology Report Generation (TGR), a simple and practical model for generating reports guided primarily by anatomical terms. Specifically, we utilize a dual-stream visual feature extraction module comprised of detail extraction module and a frozen multimodal pre-trained model to separately extract visual detail features and semantic features. Furthermore, a Visual Enhancement Module (VEM) is proposed to further enrich the visual features, thereby facilitating the generation of a list of anatomical terms. We integrate anatomical terms with image features and proceed to engage contrastive learning with frozen text embeddings, utilizing the stable feature space from these embeddings to boost modal alignment capabilities further. Our model incorporates the capability for manual input, enabling it to generate a list of organs for specifically focused abnormal areas or to produce more accurate single-sentence descriptions based on selected anatomical terms. Comprehensive experiments demonstrate the effectiveness of our method in report generation tasks, our TGR-S model reduces training parameters by 38.9% while performing comparably to current state-of-the-art models, and our TGR-B model exceeds the best baseline models across multiple metrics.</p></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"157 ","pages":"Article 104718"},"PeriodicalIF":4.0,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142094701","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SSGU-CD: A combined semantic and structural information graph U-shaped network for document-level Chemical-Disease interaction extraction SSGU-CD:用于文档级化学-疾病交互提取的语义和结构信息图 U 型组合网络。
IF 4 2区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-09-01 DOI: 10.1016/j.jbi.2024.104719
Pengyuan Nie , Jinzhong Ning , Mengxuan Lin , Zhihao Yang , Lei Wang

Document-level interaction extraction for Chemical-Disease is aimed at inferring the interaction relations between chemical entities and disease entities across multiple sentences. Compared with sentence-level relation extraction, document-level relation extraction can capture the associations between different entities throughout the entire document, which is found to be more practical for biomedical text information. However, current biomedical extraction methods mainly concentrate on sentence-level relation extraction, making it difficult to access the rich structural information contained in documents in practical application scenarios. We put forward SSGU-CD, a combined Semantic and Structural information Graph U-shaped network for document-level Chemical-Disease interaction extraction. This framework effectively stores document semantic and structure information as graphs and can fuse the original context information of documents. Using the framework, we propose a balanced combination of cross-entropy loss function to facilitate collaborative optimization among models with the aim of enhancing the ability to extract Chemical-Disease interaction relations. We evaluated SSGU-CD on the document-level relation extraction dataset CDR and BioRED, and the results demonstrate that the framework can significantly improve the extraction performance.

化学-疾病的文档级交互关系抽取旨在推断多个句子中化学实体与疾病实体之间的交互关系。与句子级关系提取相比,文档级关系提取可以捕捉整个文档中不同实体之间的关联,这对于生物医学文本信息来说更为实用。然而,目前的生物医学提取方法主要集中于句子级关系提取,在实际应用场景中很难获取文档中包含的丰富结构信息。我们提出了一种用于文档级化学-疾病交互提取的语义与结构信息图U形网络(Semantic and Structural information Graph U-shaped network)。该框架能有效地将文档语义和结构信息存储为图,并能融合文档的原始上下文信息。利用该框架,我们提出了交叉熵损失函数的平衡组合,以促进模型间的协同优化,从而提高提取化学-疾病交互关系的能力。我们在文档级关系提取数据集 CDR 和 BioRED 上对 SSGU-CD 进行了评估,结果表明该框架能显著提高提取性能。
{"title":"SSGU-CD: A combined semantic and structural information graph U-shaped network for document-level Chemical-Disease interaction extraction","authors":"Pengyuan Nie ,&nbsp;Jinzhong Ning ,&nbsp;Mengxuan Lin ,&nbsp;Zhihao Yang ,&nbsp;Lei Wang","doi":"10.1016/j.jbi.2024.104719","DOIUrl":"10.1016/j.jbi.2024.104719","url":null,"abstract":"<div><p>Document-level interaction extraction for Chemical-Disease is aimed at inferring the interaction relations between chemical entities and disease entities across multiple sentences. Compared with sentence-level relation extraction, document-level relation extraction can capture the associations between different entities throughout the entire document, which is found to be more practical for biomedical text information. However, current biomedical extraction methods mainly concentrate on sentence-level relation extraction, making it difficult to access the rich structural information contained in documents in practical application scenarios. We put forward SSGU-CD, a combined <strong><u>S</u></strong>emantic and <strong><u>S</u></strong>tructural information <strong><u>G</u></strong>raph <strong><u>U</u></strong>-shaped network for document-level <strong><u>C</u></strong>hemical-<strong><u>D</u></strong>isease interaction extraction. This framework effectively stores document semantic and structure information as graphs and can fuse the original context information of documents. Using the framework, we propose a balanced combination of cross-entropy loss function to facilitate collaborative optimization among models with the aim of enhancing the ability to extract Chemical-Disease interaction relations. We evaluated SSGU-CD on the document-level relation extraction dataset CDR and BioRED, and the results demonstrate that the framework can significantly improve the extraction performance.</p></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"157 ","pages":"Article 104719"},"PeriodicalIF":4.0,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1532046424001370/pdfft?md5=ccbd03895ffdd2c9164f4a506fad5a18&pid=1-s2.0-S1532046424001370-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142107838","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MolCFL: A personalized and privacy-preserving drug discovery framework based on generative clustered federated learning MolCFL:基于生成聚类联合学习的个性化和保护隐私的药物发现框架。
IF 4 2区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-09-01 DOI: 10.1016/j.jbi.2024.104712
Yan Guo, Yongqiang Gao, Jiawei Song

In today’s era of rapid development of large models, the traditional drug development process is undergoing a profound transformation. The vast demand for data and consumption of computational resources are making independent drug discovery increasingly difficult. By integrating federated learning technology into the drug discovery field, we have found a solution that both protects privacy and shares computational power. However, the differences in data held by various pharmaceutical institutions and the diversity in drug design objectives have exacerbated the issue of data heterogeneity, making traditional federated learning consensus models unable to meet the personalized needs of all parties. In this study, we introduce and evaluate an innovative drug discovery framework, MolCFL, which utilizes a multi-layer perceptron (MLP) as the generator and a graph convolutional network (GCN) as the discriminator in a generative adversarial network (GAN). By learning the graph structure of molecules, it generates new molecules in a highly personalized manner and then optimizes the learning process by clustering federated learning, grouping compound data with high similarity. MolCFL not only enhances the model’s ability to protect privacy but also significantly improves the efficiency and personalization of molecular design. MolCFL exhibits superior performance when handling non-independently and identically distributed data compared to traditional models. Experimental results show that the framework demonstrates outstanding performance on two benchmark datasets, with the generated new molecules achieving over 90% in Uniqueness and close to 100% in Novelty. MolCFL not only improves the quality and efficiency of drug molecule design but also, through its highly customized clustered federated learning environment, promotes collaboration and specialization in the drug discovery process while ensuring data privacy. These features make MolCFL a powerful tool suitable for addressing the various challenges faced in the modern drug research and development field.

在当今大型模型快速发展的时代,传统的药物研发过程正在经历一场深刻的变革。对数据的巨大需求和对计算资源的消耗使得独立药物发现变得越来越困难。通过将联合学习技术融入药物研发领域,我们找到了一种既能保护隐私又能共享计算能力的解决方案。然而,不同制药机构所掌握数据的差异和药物设计目标的多样性加剧了数据异构问题,使得传统的联合学习共识模型无法满足各方的个性化需求。在本研究中,我们介绍并评估了一种创新的药物发现框架--MolCFL,它在生成对抗网络(GAN)中使用多层感知器(MLP)作为生成器,图卷积网络(GCN)作为判别器。通过学习分子的图结构,它能以高度个性化的方式生成新分子,然后通过聚类联合学习优化学习过程,将具有高度相似性的复合数据分组。MolCFL 不仅增强了模型保护隐私的能力,还显著提高了分子设计的效率和个性化程度。与传统模型相比,MolCFL 在处理非独立和同分布数据时表现出更优越的性能。实验结果表明,该框架在两个基准数据集上表现出色,生成的新分子唯一性超过90%,新颖性接近100%。MolCFL不仅提高了药物分子设计的质量和效率,而且通过其高度定制的集群联合学习环境,促进了药物发现过程中的协作和专业化,同时确保了数据隐私。这些特点使 MolCFL 成为一个强大的工具,适用于应对现代药物研发领域面临的各种挑战。
{"title":"MolCFL: A personalized and privacy-preserving drug discovery framework based on generative clustered federated learning","authors":"Yan Guo,&nbsp;Yongqiang Gao,&nbsp;Jiawei Song","doi":"10.1016/j.jbi.2024.104712","DOIUrl":"10.1016/j.jbi.2024.104712","url":null,"abstract":"<div><p>In today’s era of rapid development of large models, the traditional drug development process is undergoing a profound transformation. The vast demand for data and consumption of computational resources are making independent drug discovery increasingly difficult. By integrating federated learning technology into the drug discovery field, we have found a solution that both protects privacy and shares computational power. However, the differences in data held by various pharmaceutical institutions and the diversity in drug design objectives have exacerbated the issue of data heterogeneity, making traditional federated learning consensus models unable to meet the personalized needs of all parties. In this study, we introduce and evaluate an innovative drug discovery framework, MolCFL, which utilizes a multi-layer perceptron (MLP) as the generator and a graph convolutional network (GCN) as the discriminator in a generative adversarial network (GAN). By learning the graph structure of molecules, it generates new molecules in a highly personalized manner and then optimizes the learning process by clustering federated learning, grouping compound data with high similarity. MolCFL not only enhances the model’s ability to protect privacy but also significantly improves the efficiency and personalization of molecular design. MolCFL exhibits superior performance when handling non-independently and identically distributed data compared to traditional models. Experimental results show that the framework demonstrates outstanding performance on two benchmark datasets, with the generated new molecules achieving over 90% in Uniqueness and close to 100% in Novelty. MolCFL not only improves the quality and efficiency of drug molecule design but also, through its highly customized clustered federated learning environment, promotes collaboration and specialization in the drug discovery process while ensuring data privacy. These features make MolCFL a powerful tool suitable for addressing the various challenges faced in the modern drug research and development field.</p></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"157 ","pages":"Article 104712"},"PeriodicalIF":4.0,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142055706","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Advancing Chinese biomedical text mining with community challenges 以社区挑战推进中文生物医学文本挖掘。
IF 4 2区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-09-01 DOI: 10.1016/j.jbi.2024.104716
Hui Zong , Rongrong Wu , Jiaxue Cha , Weizhe Feng , Erman Wu , Jiakun Li , Aibin Shao , Liang Tao , Zuofeng Li , Buzhou Tang , Bairong Shen

Objective

This study aims to review the recent advances in community challenges for biomedical text mining in China.

Methods

We collected information of evaluation tasks released in community challenges of biomedical text mining, including task description, dataset description, data source, task type and related links. A systematic summary and comparative analysis were conducted on various biomedical natural language processing tasks, such as named entity recognition, entity normalization, attribute extraction, relation extraction, event extraction, text classification, text similarity, knowledge graph construction, question answering, text generation, and large language model evaluation.

Results

We identified 39 evaluation tasks from 6 community challenges that spanned from 2017 to 2023. Our analysis revealed the diverse range of evaluation task types and data sources in biomedical text mining. We explored the potential clinical applications of these community challenge tasks from a translational biomedical informatics perspective. We compared with their English counterparts, and discussed the contributions, limitations, lessons and guidelines of these community challenges, while highlighting future directions in the era of large language models.

Conclusion

Community challenge evaluation competitions have played a crucial role in promoting technology innovation and fostering interdisciplinary collaboration in the field of biomedical text mining. These challenges provide valuable platforms for researchers to develop state-of-the-art solutions.

研究目的本研究旨在回顾中国生物医学文本挖掘社区挑战赛的最新进展:我们收集了生物医学文本挖掘社区挑战赛发布的评估任务信息,包括任务描述、数据集描述、数据来源、任务类型和相关链接。对命名实体识别、实体规范化、属性提取、关系提取、事件提取、文本分类、文本相似性、知识图谱构建、问题解答、文本生成、大型语言模型评估等各类生物医学自然语言处理任务进行了系统总结和对比分析:我们从 2017 年至 2023 年的 6 个社区挑战中确定了 39 项评估任务。我们的分析揭示了生物医学文本挖掘中评估任务类型和数据来源的多样性。我们从转化生物医学信息学的角度探讨了这些社区挑战任务的潜在临床应用。我们将这些社区挑战赛与英文版挑战赛进行了比较,并讨论了这些社区挑战赛的贡献、局限性、经验教训和指导原则,同时强调了大语言模型时代的未来发展方向:社区挑战评估竞赛在促进生物医学文本挖掘领域的技术创新和跨学科合作方面发挥了重要作用。这些挑战赛为研究人员开发最先进的解决方案提供了宝贵的平台。
{"title":"Advancing Chinese biomedical text mining with community challenges","authors":"Hui Zong ,&nbsp;Rongrong Wu ,&nbsp;Jiaxue Cha ,&nbsp;Weizhe Feng ,&nbsp;Erman Wu ,&nbsp;Jiakun Li ,&nbsp;Aibin Shao ,&nbsp;Liang Tao ,&nbsp;Zuofeng Li ,&nbsp;Buzhou Tang ,&nbsp;Bairong Shen","doi":"10.1016/j.jbi.2024.104716","DOIUrl":"10.1016/j.jbi.2024.104716","url":null,"abstract":"<div><h3>Objective</h3><p>This study aims to review the recent advances in community challenges for biomedical text mining in China.</p></div><div><h3>Methods</h3><p>We collected information of evaluation tasks released in community challenges of biomedical text mining, including task description, dataset description, data source, task type and related links. A systematic summary and comparative analysis were conducted on various biomedical natural language processing tasks, such as named entity recognition, entity normalization, attribute extraction, relation extraction, event extraction, text classification, text similarity, knowledge graph construction, question answering, text generation, and large language model evaluation.</p></div><div><h3>Results</h3><p>We identified 39 evaluation tasks from 6 community challenges that spanned from 2017 to 2023. Our analysis revealed the diverse range of evaluation task types and data sources in biomedical text mining. We explored the potential clinical applications of these community challenge tasks from a translational biomedical informatics perspective. We compared with their English counterparts, and discussed the contributions, limitations, lessons and guidelines of these community challenges, while highlighting future directions in the era of large language models.</p></div><div><h3>Conclusion</h3><p>Community challenge evaluation competitions have played a crucial role in promoting technology innovation and fostering interdisciplinary collaboration in the field of biomedical text mining. These challenges provide valuable platforms for researchers to develop state-of-the-art solutions.</p></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"157 ","pages":"Article 104716"},"PeriodicalIF":4.0,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1532046424001345/pdfft?md5=90f19a6b5c337cb24358bf3c1497f985&pid=1-s2.0-S1532046424001345-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142093211","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
BGformer: An improved Informer model to enhance blood glucose prediction BGformer:改进的 Informer 模型可提高血糖预测能力
IF 4 2区 医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-08-26 DOI: 10.1016/j.jbi.2024.104715
Yuewei Xue, Shaopeng Guan, Wanhai Jia

Accurately predicting blood glucose levels is crucial in diabetes management to mitigate patients’ risk of complications. However, blood glucose values exhibit instability, and existing prediction methods often struggle to capture their volatile nature, leading to inaccurate trend forecasts. To address these challenges, we propose a novel blood glucose level prediction model based on the Informer architecture: BGformer. Our model introduces a feature enhancement module and a microscale overlapping concerns mechanism. The feature enhancement module integrates periodic and trend feature extractors, enhancing the model’s ability to capture relevant information from the data. By extending the feature extraction capacity of time series data, it provides richer feature representations for analysis. Meanwhile, the microscale overlapping concerns mechanism adopts a window-based strategy, computing attention scores only within specific windows. This approach reduces computational complexity while enhancing the model’s capacity to capture local temporal dependencies. Furthermore, we introduce a dual attention enhancement module to augment the model’s expressive capability. Through prediction experiments on blood glucose values from sixteen diabetic patients, our model outperformed eight benchmark models in terms of both MAE and RMSE metrics for future 60-minute and 90-minute predictions. Our proposed scheme significantly improves the model’s dependency-capturing ability, resulting in more accurate blood glucose level predictions.

准确预测血糖水平对糖尿病管理至关重要,可降低患者出现并发症的风险。然而,血糖值具有不稳定性,现有的预测方法往往难以捕捉其波动性,导致趋势预测不准确。为了应对这些挑战,我们提出了一种基于 Informer 架构的新型血糖水平预测模型:BGformer。我们的模型引入了特征增强模块和微尺度重叠关注机制。特征增强模块集成了周期和趋势特征提取器,增强了模型从数据中捕捉相关信息的能力。通过扩展时间序列数据的特征提取能力,它为分析提供了更丰富的特征表示。同时,微尺度重叠关注机制采用基于窗口的策略,只计算特定窗口内的关注分数。这种方法既降低了计算复杂度,又增强了模型捕捉局部时间依赖性的能力。此外,我们还引入了双重注意力增强模块,以提高模型的表达能力。通过对 16 名糖尿病患者的血糖值进行预测实验,我们的模型在未来 60 分钟和 90 分钟预测的 MAE 和 RMSE 指标方面均优于 8 个基准模型。我们提出的方案大大提高了模型的依赖捕捉能力,使血糖水平预测更加准确。
{"title":"BGformer: An improved Informer model to enhance blood glucose prediction","authors":"Yuewei Xue,&nbsp;Shaopeng Guan,&nbsp;Wanhai Jia","doi":"10.1016/j.jbi.2024.104715","DOIUrl":"10.1016/j.jbi.2024.104715","url":null,"abstract":"<div><p>Accurately predicting blood glucose levels is crucial in diabetes management to mitigate patients’ risk of complications. However, blood glucose values exhibit instability, and existing prediction methods often struggle to capture their volatile nature, leading to inaccurate trend forecasts. To address these challenges, we propose a novel blood glucose level prediction model based on the Informer architecture: BGformer. Our model introduces a feature enhancement module and a microscale overlapping concerns mechanism. The feature enhancement module integrates periodic and trend feature extractors, enhancing the model’s ability to capture relevant information from the data. By extending the feature extraction capacity of time series data, it provides richer feature representations for analysis. Meanwhile, the microscale overlapping concerns mechanism adopts a window-based strategy, computing attention scores only within specific windows. This approach reduces computational complexity while enhancing the model’s capacity to capture local temporal dependencies. Furthermore, we introduce a dual attention enhancement module to augment the model’s expressive capability. Through prediction experiments on blood glucose values from sixteen diabetic patients, our model outperformed eight benchmark models in terms of both MAE and RMSE metrics for future 60-minute and 90-minute predictions. Our proposed scheme significantly improves the model’s dependency-capturing ability, resulting in more accurate blood glucose level predictions.</p></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"157 ","pages":"Article 104715"},"PeriodicalIF":4.0,"publicationDate":"2024-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142089056","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Biomedical Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1