Journal of Biomedical Informatics最新文献_第5页

Taming vision transformers for clinical laryngoscopy assessment 用于临床喉镜检查评估的驯服视力变换器。

IF 4 2区医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Biomedical Informatics

Pub Date : 2025-02-01 DOI: 10.1016/j.jbi.2024.104766

Xinzhu Zhang , Jing Zhao , Daoming Zong , Henglei Ren , Chunli Gao

Objective:

Laryngoscopy, essential for diagnosing laryngeal cancer (LCA), faces challenges due to high inter-observer variability and the reliance on endoscopist expertise. Distinguishing precancerous from early-stage cancerous lesions is particularly challenging, even for experienced practitioners, given their similar appearances. This study aims to enhance laryngoscopic image analysis to improve early screening/detection of cancer or precancerous conditions.

Methods:

We propose MedFormer, a laryngeal cancer classification method based on the Vision Transformer (ViT). To address data scarcity, MedFormer employs a customized transfer learning approach that leverages the representational power of pre-trained transformers. This method enables robust out-of-domain generalization by fine-tuning a minimal set of additional parameters.

Results:

MedFormer exhibits sensitivity-specificity values of 98%–89% for identifying precancerous lesions (leukoplakia) and 89%–97% for detecting cancer, surpassing CNN counterparts significantly. Additionally, when compared to the two selected ViT-based models, MedFormer also demonstrates superior performance. It also outperforms physician visual evaluations (PVE) in certain scenarios and matches PVE performance in all cases. Visualizations using class activation maps (CAM) and deformable patches demonstrate MedFormer’s interpretability, aiding clinicians in understanding the model’s predictions.

Conclusion:

We highlight the potential of visual transformers in clinical laryngoscopic assessments, presenting MedFormer as an effective method for the early detection of laryngeal cancer.

目的：喉镜检查是诊断喉癌（LCA）的关键，由于观察者之间的高度变异性和对内镜专家专业知识的依赖，喉镜检查面临着挑战。区分癌前病变和早期癌性病变尤其具有挑战性，即使对经验丰富的医生来说也是如此，因为它们的外观相似。本研究旨在加强喉镜图像分析，以提高早期筛查/发现癌症或癌前病变。方法：提出一种基于视觉变换器（Vision Transformer, ViT）的喉癌分类方法MedFormer。为了解决数据短缺问题，MedFormer采用了一种定制的迁移学习方法，利用了预训练变压器的表征能力。该方法通过微调最小附加参数集实现鲁棒的域外泛化。结果：MedFormer识别癌前病变（白斑）的敏感性-特异性值为98%-89%，检测癌症的敏感性-特异性值为89%-97%，明显超过CNN。此外，与两种选定的基于vit的模型相比，MedFormer也表现出卓越的性能。在某些情况下，它也优于医生的视觉评估（PVE），并在所有情况下匹配PVE的表现。使用类激活图（CAM）和可变形贴片的可视化展示了MedFormer的可解释性，帮助临床医生理解模型的预测。结论：我们强调了视觉变形在临床喉镜评估中的潜力，表明MedFormer是一种早期发现喉癌的有效方法。

{"title":"Taming vision transformers for clinical laryngoscopy assessment","authors":"Xinzhu Zhang , Jing Zhao , Daoming Zong , Henglei Ren , Chunli Gao","doi":"10.1016/j.jbi.2024.104766","DOIUrl":"10.1016/j.jbi.2024.104766","url":null,"abstract":"<div><h3>Objective:</h3><div>Laryngoscopy, essential for diagnosing laryngeal cancer (LCA), faces challenges due to high inter-observer variability and the reliance on endoscopist expertise. Distinguishing precancerous from early-stage cancerous lesions is particularly challenging, even for experienced practitioners, given their similar appearances. This study aims to enhance laryngoscopic image analysis to improve early screening/detection of cancer or precancerous conditions.</div></div><div><h3>Methods:</h3><div>We propose MedFormer, a laryngeal cancer classification method based on the Vision Transformer (ViT). To address data scarcity, MedFormer employs a customized transfer learning approach that leverages the representational power of pre-trained transformers. This method enables robust out-of-domain generalization by fine-tuning a minimal set of additional parameters.</div></div><div><h3>Results:</h3><div>MedFormer exhibits sensitivity-specificity values of 98%–89% for identifying precancerous lesions (leukoplakia) and 89%–97% for detecting cancer, surpassing CNN counterparts significantly. Additionally, when compared to the two selected ViT-based models, MedFormer also demonstrates superior performance. It also outperforms physician visual evaluations (PVE) in certain scenarios and matches PVE performance in all cases. Visualizations using class activation maps (CAM) and deformable patches demonstrate MedFormer’s interpretability, aiding clinicians in understanding the model’s predictions.</div></div><div><h3>Conclusion:</h3><div>We highlight the potential of visual transformers in clinical laryngoscopic assessments, presenting MedFormer as an effective method for the early detection of laryngeal cancer.</div></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"162 ","pages":"Article 104766"},"PeriodicalIF":4.0,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143006140","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Examining implementation outcomes in health information exchange systems: A scoping review

IF 4 2区医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Biomedical Informatics

Pub Date : 2025-01-20 DOI: 10.1016/j.jbi.2025.104782

Bonnie Lum , Navisha Weerasinghe , Charlene H. Chu , Dan Perri , Lisa Cranley

Background

Health information exchange (HIE) facilitates the secure exchange of digital health data across disparate health systems and settings. The implementation of information technology projects in healthcare is complex, further complicated by the fact that implementation success, through the measure of implementation outcomes, has been inconsistently defined and evaluated. There is no known scoping review examining implementation success through implementation outcomes in the field of HIE technologies. The aim of this scoping review was to provide a synthesis of studies related to reported implementation outcomes of HIE solutions (and related interoperability technologies) with a goal to inform the implementation of large-scale HIE projects in the future.

Methods

A scoping review, guided by the Arksey and O’Malley Framework, was conducted in four databases (Medline, Embase, CINAHL, and Web of Science), gathering studies from January 2010 to June 2023. Studies that described the implementation of a technology supporting interoperability or HIE across different organizations and/or across different healthcare settings and described the evaluation of one or more implementation outcomes from the Implementation Outcome Framework (IOF) were included.

Results

37 studies were included in this review. The implementation outcome adoption was most frequently reported (n = 24). Fidelity and penetration were not reported. Few studies provided definitions for the outcomes being evaluated. Few studies provided details surrounding the stage of implementation as it relates to the outcome examined. No studies used the IOF or other similar implementation science evaluation frameworks.

Conclusion

This review highlights the existing gaps in the field of HIE/interoperability solutions implementation studies. Future studies should employ theoretical frameworks to guide their research, standardize language used to describe implementation outcomes, and expand knowledge of salient outcomes at varying stages of implementation.

{"title":"Examining implementation outcomes in health information exchange systems: A scoping review","authors":"Bonnie Lum , Navisha Weerasinghe , Charlene H. Chu , Dan Perri , Lisa Cranley","doi":"10.1016/j.jbi.2025.104782","DOIUrl":"10.1016/j.jbi.2025.104782","url":null,"abstract":"<div><h3>Background</h3><div>Health information exchange (HIE) facilitates the secure exchange of digital health data across disparate health systems and settings. The implementation of information technology projects in healthcare is complex, further complicated by the fact that implementation success, through the measure of implementation outcomes, has been inconsistently defined and evaluated. There is no known scoping review examining implementation success through implementation outcomes in the field of HIE technologies. The aim of this scoping review was to provide a synthesis of studies related to reported implementation outcomes of HIE solutions (and related interoperability technologies) with a goal to inform the implementation of large-scale HIE projects in the future.</div></div><div><h3>Methods</h3><div>A scoping review, guided by the Arksey and O’Malley Framework, was conducted in four databases (Medline, Embase, CINAHL, and Web of Science), gathering studies from January 2010 to June 2023. Studies that described the implementation of a technology supporting interoperability or HIE across different organizations and/or across different healthcare settings and described the evaluation of one or more implementation outcomes from the Implementation Outcome Framework (IOF) were included.</div></div><div><h3>Results</h3><div>37 studies were included in this review. The implementation outcome adoption was most frequently reported (n = 24). Fidelity and penetration were not reported. Few studies provided definitions for the outcomes being evaluated. Few studies provided details surrounding the stage of implementation as it relates to the outcome examined. No studies used the IOF or other similar implementation science evaluation frameworks.</div></div><div><h3>Conclusion</h3><div>This review highlights the existing gaps in the field of HIE/interoperability solutions implementation studies. Future studies should employ theoretical frameworks to guide their research, standardize language used to describe implementation outcomes, and expand knowledge of salient outcomes at varying stages of implementation.</div></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"163 ","pages":"Article 104782"},"PeriodicalIF":4.0,"publicationDate":"2025-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143023464","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Coherence and comprehensibility: Large language models predict lay understanding of health-related content 连贯性和可理解性：大型语言模型预测外行对健康相关内容的理解。

IF 4 2区医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Biomedical Informatics

Pub Date : 2025-01-01 DOI: 10.1016/j.jbi.2024.104758

Trevor Cohen , Weizhe Xu , Yue Guo , Serguei Pakhomov , Gondy Leroy

Health literacy is a prerequisite to informed health-related decision making. To facilitate understanding of information, text should be presented at an appropriate reading level for the reader. Cognitive studies suggest that the coherence of a text – the interconnectedness between the ideas it expresses – is especially important for low-knowledge readers, who lack the background knowledge to draw inferences from text that is implicitly connected only. Prior work in cognitive science has yielded automated methods to estimate coherence. These methods estimate the proximity between text representations in a semantic vector space, with the underlying idea that units of text that are poorly connected will be further apart in this space. In addition, recent work with large language models (LLMs) has produced probabilistic methodological analogues that have yet to be evaluated for this purpose. This work concerns the relationship between these automated measures and layperson comprehension of biomedical text. To characterize this relationship, we applied a range of automated measures of text coherence to a set of text snippets, some of which were deliberately modified to improve their accessibility in a series of reading comprehension experiments. Results indicate significant associations between reader comprehension – as estimated using multiple-choice questions – and LLM-derived coherence metrics. Interventions designed to improve the comprehensibility of passages also improved their coherence, as measured with the best-performing LLM-derived models and shown by improved reader understanding of the text. These findings support the utility of LLM-derived measures of text coherence as a means to identify gaps in connectedness that make biomedical text difficult for laypeople to understand, with the potential to inform both manual and automated methods to improve the accessibility of the biomedical literature.

卫生知识普及是作出与卫生有关的知情决策的先决条件。为了便于理解信息，文本应该以适合读者的阅读水平呈现。认知研究表明，文本的连贯性——即文本所表达的观点之间的相互联系——对低知识的读者来说尤为重要，因为他们缺乏背景知识，无法从只有隐含联系的文本中得出推论。认知科学先前的工作已经产生了估计连贯性的自动化方法。这些方法估计语义向量空间中文本表示之间的接近度，其基本思想是连接不良的文本单元在该空间中会进一步分开。此外，最近使用大型语言模型（llm）的工作已经产生了概率方法类似物，但尚未为此目的进行评估。这项工作涉及这些自动化措施和外行人对生物医学文本的理解之间的关系。为了描述这种关系，我们对一组文本片段应用了一系列文本连贯性的自动测量，其中一些片段在一系列阅读理解实验中被故意修改以提高其可访问性。结果表明，读者理解-估计使用多项选择题-和法学硕士衍生的连贯性指标之间的显著关联。旨在提高段落可理解性的干预措施也提高了它们的连贯性，正如用表现最好的法学硕士衍生模型所衡量的那样，并通过提高读者对文本的理解来显示。这些发现支持了法学硕士衍生的文本连贯性测量的效用，作为识别使外行难以理解的生物医学文本的连通性差距的一种手段，具有通知手动和自动化方法以提高生物医学文献的可及性的潜力。

{"title":"Coherence and comprehensibility: Large language models predict lay understanding of health-related content","authors":"Trevor Cohen , Weizhe Xu , Yue Guo , Serguei Pakhomov , Gondy Leroy","doi":"10.1016/j.jbi.2024.104758","DOIUrl":"10.1016/j.jbi.2024.104758","url":null,"abstract":"<div><div>Health literacy is a prerequisite to informed health-related decision making. To facilitate understanding of information, text should be presented at an appropriate reading level for the reader. Cognitive studies suggest that the coherence of a text – the interconnectedness between the ideas it expresses – is especially important for low-knowledge readers, who lack the background knowledge to draw inferences from text that is implicitly connected only. Prior work in cognitive science has yielded automated methods to estimate coherence. These methods estimate the <em>proximity</em> between text representations in a semantic vector space, with the underlying idea that units of text that are poorly connected will be further apart in this space. In addition, recent work with large language models (LLMs) has produced <em>probabilistic</em> methodological analogues that have yet to be evaluated for this purpose. This work concerns the relationship between these automated measures and layperson comprehension of biomedical text. To characterize this relationship, we applied a range of automated measures of text coherence to a set of text snippets, some of which were deliberately modified to improve their accessibility in a series of reading comprehension experiments. Results indicate significant associations between reader comprehension – as estimated using multiple-choice questions – and LLM-derived coherence metrics. Interventions designed to improve the comprehensibility of passages also improved their coherence, as measured with the best-performing LLM-derived models and shown by improved reader understanding of the text. These findings support the utility of LLM-derived measures of text coherence as a means to identify gaps in connectedness that make biomedical text difficult for laypeople to understand, with the potential to inform both manual and automated methods to improve the accessibility of the biomedical literature.</div></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"161 ","pages":"Article 104758"},"PeriodicalIF":4.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142813110","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Efficient strabismus diagnosis from small samples: Harnessing spatial features for improved accuracy 从小样本有效的斜视诊断：利用空间特征提高准确性。

IF 4 2区医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Biomedical Informatics

Pub Date : 2025-01-01 DOI: 10.1016/j.jbi.2024.104759

Renzhong Wu , Shenghui Liao , Yongrong Ji , Xiaoyan Kui , Fuchang Han , Ziyang Hu , Xuefei Song

Strabismus is a common ophthalmological condition, and early diagnosis is crucial to preventing visual impairment and loss of stereopsis. However, traditional methods for diagnosing strabismus often rely on specialized ophthalmic equipment and trained personnel, limiting the widespread accessibility of strabismus diagnosis. Computer-aided strabismus diagnosis is an effective and widely used technology that assists clinicians in making clinical diagnoses and improving efficiency. To address this, we designed an efficient strabismus diagnosis model, RIS-MLP, based on a small number of samples derived from frontal facial images captured under natural lighting conditions via the Hirschberg test. The RIS-MLP combines light reflex point detection and iris detection modules to accurately extract key spatial features even under noisy and occluded conditions. The optimized spatial feature strategies further enhances the performance of the classification module. To validate the superiority of RIS-MLP, we conducted both direct and indirect comparative experiments. Indirect comparisons demonstrate that the RIS-MLP has advantages in terms of sample efficiency. While direct comparisons show that the RIS-MLP can mitigate overfitting to a certain extent, and the RIS-MLP along with its variants (e.g., RIS-SVM) have outperformed state-of-the-art models on our noisy and imbalanced dataset.

斜视是一种常见的眼科疾病，早期诊断对预防视力损害和立体视觉丧失至关重要。然而，传统的斜视诊断方法往往依赖于专业的眼科设备和训练有素的人员，限制了斜视诊断的广泛可及性。计算机辅助斜视诊断是一种有效的、广泛应用的技术，可以帮助临床医生进行临床诊断，提高诊断效率。为了解决这个问题，我们设计了一个有效的斜视诊断模型RIS-MLP，该模型基于Hirschberg测试在自然光条件下捕获的少量正面面部图像样本。RIS-MLP结合光反射点检测和虹膜检测模块，即使在噪声和遮挡条件下也能准确提取关键空间特征。优化后的空间特征策略进一步提高了分类模块的性能。为了验证RIS-MLP的优越性，我们进行了直接和间接的对比实验。间接比较表明，RIS-MLP在样本效率方面具有优势。虽然直接比较表明RIS-MLP可以在一定程度上缓解过拟合，而且RIS-MLP及其变体（例如RIS-SVM）在我们的嘈杂和不平衡数据集上的表现优于最先进的模型。

{"title":"Efficient strabismus diagnosis from small samples: Harnessing spatial features for improved accuracy","authors":"Renzhong Wu , Shenghui Liao , Yongrong Ji , Xiaoyan Kui , Fuchang Han , Ziyang Hu , Xuefei Song","doi":"10.1016/j.jbi.2024.104759","DOIUrl":"10.1016/j.jbi.2024.104759","url":null,"abstract":"<div><div>Strabismus is a common ophthalmological condition, and early diagnosis is crucial to preventing visual impairment and loss of stereopsis. However, traditional methods for diagnosing strabismus often rely on specialized ophthalmic equipment and trained personnel, limiting the widespread accessibility of strabismus diagnosis. Computer-aided strabismus diagnosis is an effective and widely used technology that assists clinicians in making clinical diagnoses and improving efficiency. To address this, we designed an efficient strabismus diagnosis model, RIS-MLP, based on a small number of samples derived from frontal facial images captured under natural lighting conditions via the Hirschberg test. The RIS-MLP combines light reflex point detection and iris detection modules to accurately extract key spatial features even under noisy and occluded conditions. The optimized spatial feature strategies further enhances the performance of the classification module. To validate the superiority of RIS-MLP, we conducted both direct and indirect comparative experiments. Indirect comparisons demonstrate that the RIS-MLP has advantages in terms of sample efficiency. While direct comparisons show that the RIS-MLP can mitigate overfitting to a certain extent, and the RIS-MLP along with its variants (e.g., RIS-SVM) have outperformed state-of-the-art models on our noisy and imbalanced dataset.</div></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"161 ","pages":"Article 104759"},"PeriodicalIF":4.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142818178","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Enhancing suicidal behavior detection in EHRs: A multi-label NLP framework with transformer models and semantic retrieval-based annotation 在电子病历中增强自杀行为检测：一个具有转换模型和基于语义检索的注释的多标签NLP框架。

IF 4 2区医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Biomedical Informatics

Pub Date : 2025-01-01 DOI: 10.1016/j.jbi.2024.104755

Kimia Zandbiglari , Shobhan Kumar , Muhammad Bilal , Amie Goodin , Masoud Rouhizadeh

Background:

Suicide is a leading cause of death worldwide, making early identification of suicidal behaviors crucial for clinicians. Current Natural Language Processing (NLP) approaches for identifying suicidal behaviors in Electronic Health Records (EHRs) rely on keyword searches, rule-based methods, and binary classification, which may not fully capture the complexity and spectrum of suicidal behaviors. This study aims to create a multi-class labeled dataset with annotation guidelines and develop a novel NLP approach for fine-grained, multi-label classification of suicidal behaviors, improving the efficiency of the annotation process and accuracy of the NLP methods.

Methods:

We develop a multi-class labeling system based on guidelines from FDA, CDC, and WHO, distinguishing between six categories of suicidal behaviors and allowing for multiple labels per data sample. To efficiently create an annotated dataset, we use an MPNet-based semantic retrieval framework to extract relevant sentences from a large EHR dataset, reducing annotation space while capturing diverse expressions. Experts annotate the extracted sentences using the multi-class system. We then formulate the task as a multi-label classification problem and fine-tune transformer-based models on the curated dataset to accurately classify suicidal behaviors in EHRs.

Results:

Lexical analysis revealed key themes in assessing suicide risk, considering an individual’s history, mental health, substance use, and family background. Fine-tuned transformer-based models effectively identified suicidal behaviors from EHRs, with Bio_ClinicalBERT, BioBERT, and XLNet achieving the F1 scores (0.81), outperforming BERT and RoBERTa. The proposed approach, based on a multi-label classification system, captures the complexity of suicidal behaviors effectively particularly “Suicide Attempt” and “Family History” instances. The proposed approach, using task-specific NLP models and a multi-label classification system, captures the complexity of suicidal behaviors more effectively than traditional binary classification. However, direct comparisons with existing studies are difficult due to varying metrics and label definitions.