Journal of Biomedical Informatics最新文献_第6页

Unveiling pathology-related predictive uncertainty of glomerular lesion recognition using prototype learning 利用原型学习揭示肾小球病变识别的病理相关预测不确定性。

IF 4 2区医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Biomedical Informatics

Pub Date : 2025-01-01 DOI: 10.1016/j.jbi.2024.104745

Qiming He , Yingming Xu , Qiang Huang , Yanxia Wang , Jing Ye , Yonghong He , Jing Li , Lianghui Zhu , Zhe Wang , Tian Guan

Objective

Recognizing glomerular lesions is essential in diagnosing chronic kidney disease. However, deep learning faces challenges due to the lesion heterogeneity, superposition, progression, and tissue incompleteness, leading to uncertainty in model predictions. Therefore, it is crucial to analyze pathology-related predictive uncertainty in glomerular lesion recognition and unveil its relationship with pathological properties and its impact on model performance.

Methods

This paper presents a novel framework for pathology-related predictive uncertainty analysis towards glomerular lesion recognition, including prototype learning based predictive uncertainty estimation, pathology-characterized correlation analysis and weight-redistributed prediction rectification. The prototype learning based predictive uncertainty estimation includes deep prototyping, affinity embedding, and multi-dimensional uncertainty fusion. The pathology-characterized correlation analysis is the first to use expert-based and learning- based approach to construct the pathology-related characterization of lesions and tissues. The weight-redistributed prediction rectification module performs reweighting- based lesion recognition.

Results

To validate the performance, extensive experiments were conducted. Based on the Spearman and Pearson correlation analysis, the proposed framework enables more efficient correlation analysis, and strong correlation with pathology-related characterization can be achieved (c index > 0.6 and p < 0.01). Furthermore, the prediction rectification module demonstrated improved lesion recognition performance across most metrics, with enhancements of up to 6.36 %.

Conclusion

The proposed predictive uncertainty analysis in glomerular lesion recognition offers a valuable approach for assessing computational pathology’s predictive uncertainty from a pathology-related perspective.

Significance

The paper provides a solution for pathology-related predictive uncertainty estimation in algorithm development and clinical practice.

目的：鉴别肾小球病变对慢性肾脏病的诊断至关重要。然而，由于病变的异质性、叠加性、进展性和组织的不完全性，深度学习面临着挑战，导致模型预测的不确定性。因此，分析肾小球病变识别中病理相关的预测不确定性，揭示其与病理性质的关系及其对模型性能的影响至关重要。方法：提出基于原型学习的肾小球病变预测不确定性估计、病理特征相关性分析和权重重分布预测校正的病理相关预测不确定性分析框架。基于原型学习的预测不确定性估计包括深度原型、亲和嵌入和多维不确定性融合。病理特征相关分析是首次使用基于专家和基于学习的方法来构建病变和组织的病理相关特征。权重重分配预测校正模块执行基于权重重的病灶识别。结果：为了验证其性能，进行了大量的实验。基于Spearman和Pearson相关性分析，本文提出的框架能够实现更高效的相关性分析，并与病理相关表征具有较强的相关性（c index > 0.6和p ）。结论：本文提出的肾小球病变识别预测不确定性分析，为从病理相关角度评估计算病理学的预测不确定性提供了一种有价值的方法。意义：为算法开发和临床实践中与病理相关的预测不确定性估计提供了解决方案。

{"title":"Unveiling pathology-related predictive uncertainty of glomerular lesion recognition using prototype learning","authors":"Qiming He , Yingming Xu , Qiang Huang , Yanxia Wang , Jing Ye , Yonghong He , Jing Li , Lianghui Zhu , Zhe Wang , Tian Guan","doi":"10.1016/j.jbi.2024.104745","DOIUrl":"10.1016/j.jbi.2024.104745","url":null,"abstract":"<div><h3>Objective</h3><div>Recognizing glomerular lesions is essential in diagnosing chronic kidney disease. However, deep learning faces challenges due to the lesion heterogeneity, superposition, progression, and tissue incompleteness, leading to uncertainty in model predictions. Therefore, it is crucial to analyze pathology-related predictive uncertainty in glomerular lesion recognition and unveil its relationship with pathological properties and its impact on model performance.</div></div><div><h3>Methods</h3><div>This paper presents a novel framework for pathology-related predictive uncertainty analysis towards glomerular lesion recognition, including prototype learning based predictive uncertainty estimation, pathology-characterized correlation analysis and weight-redistributed prediction rectification. The prototype learning based predictive uncertainty estimation includes deep prototyping, affinity embedding, and multi-dimensional uncertainty fusion. The pathology-characterized correlation analysis is the first to use expert-based and learning- based approach to construct the pathology-related characterization of lesions and tissues. The weight-redistributed prediction rectification module performs reweighting- based lesion recognition.</div></div><div><h3>Results</h3><div>To validate the performance, extensive experiments were conducted. Based on the Spearman and Pearson correlation analysis, the proposed framework enables more efficient correlation analysis, and strong correlation with pathology-related characterization can be achieved (c index > 0.6 and p < 0.01). Furthermore, the prediction rectification module demonstrated improved lesion recognition performance across most metrics, with enhancements of up to 6.36 %.</div></div><div><h3>Conclusion</h3><div>The proposed predictive uncertainty analysis in glomerular lesion recognition offers a valuable approach for assessing computational pathology’s predictive uncertainty from a pathology-related perspective.</div></div><div><h3>Significance</h3><div>The paper provides a solution for pathology-related predictive uncertainty estimation in algorithm development and clinical practice.</div></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"161 ","pages":"Article 104745"},"PeriodicalIF":4.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142921240","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Early multi-cancer detection through deep learning: An anomaly detection approach using Variational Autoencoder 通过深度学习进行早期多癌检测：使用变异自动编码器的异常检测方法。

IF 4 2区医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Biomedical Informatics

Pub Date : 2024-12-01 DOI: 10.1016/j.jbi.2024.104751

Innocent Tatchum Sado , Louis Fippo Fitime , Geraud Fokou Pelap , Claude Tinku , Gaelle Mireille Meudje , Thomas Bouetou Bouetou

Cancer is a disease that causes many deaths worldwide. The treatment of cancer is first and foremost a matter of detection, a treatment that is most effective when the disease is detected at an early stage. With the evolution of technology, several computer-aided diagnosis tools have been developed around cancer; several image-based cancer detection methods have been developed too. However, cancer detection faces many difficulties related to early detection which is crucial for patient survival rate. To detect cancer early, scientists have been using transcriptomic data. However, this presents some challenges such as unlabeled data, a large amount of data, and image-based techniques that only focus on one type of cancer. The purpose of this work is to develop a deep learning model that can effectively detect as soon as possible, specifically in the early stages, any type of cancer as an anomaly in transcriptomic data. This model must have the ability to act independently and not be restricted to any specific type of cancer. To achieve this goal, we modeled a deep neural network (a Variational Autoencoder) and then defined an algorithm for detecting anomalies in the output of the Variational Autoencoder. The Variational Autoencoder consists of an encoder and a decoder with a hidden layer. With the TCGA and GTEx data, we were able to train the model for six types of cancer using the Adam optimizer with decay learning for training, and a two-component loss function. As a result, we obtained the lowest value of accuracy 0.950, and the lowest value of recall 0.830. This research leads us to the design of a deep learning model for the detection of cancer as an anomaly in transcriptomic data.

癌症是一种导致全球许多人死亡的疾病。癌症的治疗首先要靠检测，只有在早期发现癌症，治疗效果才会最好。随着技术的发展，围绕癌症开发出了多种计算机辅助诊断工具，还开发出了多种基于图像的癌症检测方法。然而，癌症检测面临着许多与早期检测有关的困难，而早期检测对患者的存活率至关重要。为了早期检测癌症，科学家们一直在使用转录组数据。然而，这也带来了一些挑战，如无标记数据、数据量大以及基于图像的技术只关注一种类型的癌症。这项工作的目的是开发一种深度学习模型，它能尽快（特别是在早期阶段）有效检测转录组数据中任何类型癌症的异常。该模型必须具备独立行动的能力，并且不局限于任何特定类型的癌症。为了实现这一目标，我们建立了一个深度神经网络模型（变异自动编码器），然后定义了一种算法，用于检测变异自动编码器输出中的异常。变异自动编码器由一个编码器和一个带隐藏层的解码器组成。利用 TCGA 和 GTEx 数据，我们使用亚当优化器（Adam optimizer）、衰减学习训练和双分量损失函数对六种类型的癌症进行了模型训练。结果，我们获得了准确率最低值 0.950 和召回率最低值 0.830。这项研究为我们设计了一种深度学习模型，用于检测转录组数据中的癌症异常。

{"title":"Early multi-cancer detection through deep learning: An anomaly detection approach using Variational Autoencoder","authors":"Innocent Tatchum Sado , Louis Fippo Fitime , Geraud Fokou Pelap , Claude Tinku , Gaelle Mireille Meudje , Thomas Bouetou Bouetou","doi":"10.1016/j.jbi.2024.104751","DOIUrl":"10.1016/j.jbi.2024.104751","url":null,"abstract":"<div><div>Cancer is a disease that causes many deaths worldwide. The treatment of cancer is first and foremost a matter of detection, a treatment that is most effective when the disease is detected at an early stage. With the evolution of technology, several computer-aided diagnosis tools have been developed around cancer; several image-based cancer detection methods have been developed too. However, cancer detection faces many difficulties related to early detection which is crucial for patient survival rate. To detect cancer early, scientists have been using transcriptomic data. However, this presents some challenges such as unlabeled data, a large amount of data, and image-based techniques that only focus on one type of cancer. The purpose of this work is to develop a deep learning model that can effectively detect as soon as possible, specifically in the early stages, any type of cancer as an anomaly in transcriptomic data. This model must have the ability to act independently and not be restricted to any specific type of cancer. To achieve this goal, we modeled a deep neural network (a Variational Autoencoder) and then defined an algorithm for detecting anomalies in the output of the Variational Autoencoder. The Variational Autoencoder consists of an encoder and a decoder with a hidden layer. With the TCGA and GTEx data, we were able to train the model for six types of cancer using the Adam optimizer with decay learning for training, and a two-component loss function. As a result, we obtained the lowest value of accuracy 0.950, and the lowest value of recall 0.830. This research leads us to the design of a deep learning model for the detection of cancer as an anomaly in transcriptomic data.</div></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"160 ","pages":"Article 104751"},"PeriodicalIF":4.0,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142687219","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

How to identify patient perception of AI voice robots in the follow-up scenario? A multimodal identity perception method based on deep learning 在后续场景中如何识别患者对AI语音机器人的感知？一种基于深度学习的多模态身份感知方法。

IF 4 2区医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Biomedical Informatics

Pub Date : 2024-12-01 DOI: 10.1016/j.jbi.2024.104757

Mingjie Liu , Kuiyou Chen , Qing Ye , Hong Wu

Objectives

Post-discharge follow-up stands as a critical component of post-diagnosis management, and the constraints of healthcare resources impede comprehensive manual follow-up. However, patients are less cooperative with AI follow-up calls or may even hang up once AI voice robots are perceived. To improve the effectiveness of follow-up, alternative measures should be taken when patients perceive AI voice robots. Therefore, identifying how patients perceive AI voice robots is crucial. This study aims to construct a multimodal identity perception model based on deep learning to identify how patients perceive AI voice robots.

Methods

Our dataset includes 2030 response audio recordings and corresponding texts from patients. We conduct comparative experiments and perform an ablation study. The proposed model employs a transfer learning approach, utilizing BERT and TextCNN for text feature extraction, AST and LSTM for audio feature extraction, and self-attention for feature fusion.

Results

Our model demonstrates superior performance against existing baselines, with a precision of 86.67%, an AUC of 84%, and an accuracy of 94.38%. Additionally, a generalization experiment was conducted using 144 patients’ response audio recordings and corresponding text data from other departments in the hospital, confirming the model’s robustness and effectiveness.

Conclusion

Our multimodal identity perception model can identify how patients perceive AI voice robots effectively. Identifying how patients perceive AI not only helps to optimize the follow-up process and improve patient cooperation, but also provides support for the evaluation and optimization of AI voice robots.

目的：出院后随访是诊断后管理的重要组成部分，医疗资源的限制阻碍了全面的人工随访。然而，一旦感知到人工智能语音机器人，患者对人工智能的随访电话不太配合，甚至可能会挂断电话。为提高随访效果，患者感知AI语音机器人时应采取替代措施。因此，确定患者如何看待人工智能语音机器人至关重要。本研究旨在构建基于深度学习的多模态身份感知模型，识别患者对AI语音机器人的感知。方法：我们的数据集包括2030个患者的回应录音和相应的文本。我们进行了对比实验并进行了消融研究。该模型采用迁移学习方法，利用BERT和TextCNN进行文本特征提取，利用AST和LSTM进行音频特征提取，利用自关注进行特征融合。结果：我们的模型在现有基线上表现出优异的性能，精度为86.67%，AUC为84%，准确度为94.38%。此外，利用144例患者的应答录音和医院其他科室的相应文本数据进行了泛化实验，验证了模型的稳健性和有效性。结论：我们的多模态身份感知模型可以有效识别患者对AI语音机器人的感知。识别患者对AI的感知，不仅有助于优化随访流程，提高患者配合度，也为AI语音机器人的评估和优化提供支持。

{"title":"How to identify patient perception of AI voice robots in the follow-up scenario? A multimodal identity perception method based on deep learning","authors":"Mingjie Liu , Kuiyou Chen , Qing Ye , Hong Wu","doi":"10.1016/j.jbi.2024.104757","DOIUrl":"10.1016/j.jbi.2024.104757","url":null,"abstract":"<div><h3>Objectives</h3><div>Post-discharge follow-up stands as a critical component of post-diagnosis management, and the constraints of healthcare resources impede comprehensive manual follow-up. However, patients are less cooperative with AI follow-up calls or may even hang up once AI voice robots are perceived. To improve the effectiveness of follow-up, alternative measures should be taken when patients perceive AI voice robots. Therefore, identifying how patients perceive AI voice robots is crucial. This study aims to construct a multimodal identity perception model based on deep learning to identify how patients perceive AI voice robots.</div></div><div><h3>Methods</h3><div>Our dataset includes 2030 response audio recordings and corresponding texts from patients. We conduct comparative experiments and perform an ablation study. The proposed model employs a transfer learning approach, utilizing BERT and TextCNN for text feature extraction, AST and LSTM for audio feature extraction, and self-attention for feature fusion.</div></div><div><h3>Results</h3><div>Our model demonstrates superior performance against existing baselines, with a precision of 86.67%, an AUC of 84%, and an accuracy of 94.38%. Additionally, a generalization experiment was conducted using 144 patients’ response audio recordings and corresponding text data from other departments in the hospital, confirming the model’s robustness and effectiveness.</div></div><div><h3>Conclusion</h3><div>Our multimodal identity perception model can identify how patients perceive AI voice robots effectively. Identifying how patients perceive AI not only helps to optimize the follow-up process and improve patient cooperation, but also provides support for the evaluation and optimization of AI voice robots.</div></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"160 ","pages":"Article 104757"},"PeriodicalIF":4.0,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142780183","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Biomedical document-level relation extraction with thematic capture and localized entity pooling 基于主题捕获和局部实体池的生物医学文档级关系提取。

IF 4 2区医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Biomedical Informatics

Pub Date : 2024-12-01 DOI: 10.1016/j.jbi.2024.104756

Yuqing Li, Xinhui Shao

In contrast to sentence-level relational extraction, document-level relation extraction poses greater challenges as a document typically contains multiple entities, and one entity may be associated with multiple other entities. Existing methods often rely on graph structures to capture path representations between entity pairs. However, this paper introduces a novel approach called local entity pooling that solely relies on the pre-training model to identify the bridge entity related to the current entity pair and generate the reasoning path representation. This technique effectively mitigates the multi-entity problem. Additionally, the model leverages the multi-entity and multi-label characteristics of the document to acquire the document’s thematic representation, thereby enhancing the document-level relation extraction task. Experimental evaluations conducted on two biomedical datasets, CDR and GDA. Our TCLEP (Thematic Capture and Localized Entity Pooling) model achieved the Macro-F1 scores of 71.7% and 85.3%, respectively. Simultaneously, we incorporated local entity pooling and thematic capture modules into the state-of-the-art model, resulting in performance improvements of 1.5% and 0.2% on the respective datasets. These results highlight the advanced performance of our proposed approach.

与句子级关系提取相比，文档级关系提取面临更大的挑战，因为文档通常包含多个实体，并且一个实体可能与多个其他实体相关联。现有的方法通常依赖于图结构来捕获实体对之间的路径表示。然而，本文引入了一种称为局部实体池的新方法，该方法仅依赖于预训练模型来识别与当前实体对相关的桥实体并生成推理路径表示。该技术有效地缓解了多实体问题。此外，该模型利用文档的多实体和多标签特征来获取文档的主题表示，从而增强了文档级关系提取任务。对CDR和GDA两个生物医学数据集进行了实验评估。我们的TCLEP （Thematic Capture and localization Entity Pooling）模型的Macro-F1得分分别为71.7%和85.3%。同时，我们将本地实体池和主题捕获模块合并到最先进的模型中，从而使各自数据集的性能提高1.5%和0.2%。这些结果突出了我们提出的方法的先进性能。

{"title":"Biomedical document-level relation extraction with thematic capture and localized entity pooling","authors":"Yuqing Li, Xinhui Shao","doi":"10.1016/j.jbi.2024.104756","DOIUrl":"10.1016/j.jbi.2024.104756","url":null,"abstract":"<div><div>In contrast to sentence-level relational extraction, document-level relation extraction poses greater challenges as a document typically contains multiple entities, and one entity may be associated with multiple other entities. Existing methods often rely on graph structures to capture path representations between entity pairs. However, this paper introduces a novel approach called local entity pooling that solely relies on the pre-training model to identify the bridge entity related to the current entity pair and generate the reasoning path representation. This technique effectively mitigates the multi-entity problem. Additionally, the model leverages the multi-entity and multi-label characteristics of the document to acquire the document’s thematic representation, thereby enhancing the document-level relation extraction task. Experimental evaluations conducted on two biomedical datasets, CDR and GDA. Our TCLEP (<strong>T</strong>hematic <strong>C</strong>apture and <strong>L</strong>ocalized <strong>E</strong>ntity <strong>P</strong>ooling) model achieved the Macro-F1 scores of 71.7% and 85.3%, respectively. Simultaneously, we incorporated local entity pooling and thematic capture modules into the state-of-the-art model, resulting in performance improvements of 1.5% and 0.2% on the respective datasets. These results highlight the advanced performance of our proposed approach.</div></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"160 ","pages":"Article 104756"},"PeriodicalIF":4.0,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142769374","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Taxonomy-based prompt engineering to generate synthetic drug-related patient portal messages 基于分类学的提示工程，生成合成的药物相关患者门户信息。

IF 4 2区医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Biomedical Informatics

Pub Date : 2024-12-01 DOI: 10.1016/j.jbi.2024.104752

Natalie Wang , Sukrit Treewaree , Ayah Zirikly , Yuzhi L. Lu , Michelle H. Nguyen , Bhavik Agarwal , Jash Shah , James Michael Stevenson , Casey Overby Taylor

Objective:

The objectives of this study were to: (1) create a corpus of synthetic drug-related patient portal messages to address the current lack of publicly available datasets for model development, (2) assess differences in language used and linguistics among the synthetic patient portal messages, and (3) assess the accuracy of patient-reported drug side effects for different racial groups.

Methods:

We leveraged a taxonomy for patient- and clinician-generated content to guide prompt engineering for synthetic drug-related patient portal messages. We generated two groups of messages: the first group (200 messages) used a subset of the taxonomy relevant to a broad range of drug-related messages and the second group (250 messages) used a subset of the taxonomy relevant to a narrow range of messages focused on side effects. Prompts also include one of five racial groups. Next, we assessed linguistic characteristics among message parts (subject, beginning, body, ending) across different prompt specifications (urgency, patient portal taxa, race). We also assessed the performance and frequency of patient-reported side effects across different racial groups and compared to data present in a real world data source (SIDER).

Results:

The study generated 450 synthetic patient portal messages, and we assessed linguistic patterns, accuracy of drug-side effect pairs, frequency of pairs compared to real world data. Linguistic analysis revealed variations in language usage and politeness and analysis of positive predictive values identified differences in symptoms reported based on urgency levels and racial groups in the prompt. We also found that low incident SIDER drug-side effect pairs were observed less frequently in our dataset.

Conclusion:

This study demonstrates the potential of synthetic patient portal messages as a valuable resource for healthcare research. After creating a corpus of synthetic drug-related patient portal messages, we identified significant language differences and provided evidence that drug-side effect pairs observed in messages are comparable to what is expected in real world settings.

研究目的本研究的目的是(1）创建一个合成药物相关患者门户网站信息的语料库，以解决目前缺乏公开可用数据集来开发模型的问题；（2）评估合成患者门户网站信息在语言使用和语言学方面的差异；以及（3）评估不同种族群体患者报告的药物副作用的准确性：我们利用患者和临床医生生成的内容分类法来指导合成药物相关患者门户网站信息的提示工程。我们生成了两组信息：第一组（200 条信息）使用了与广泛的药物相关信息相关的分类标准子集，第二组（250 条信息）使用了与范围较窄的副作用相关的分类标准子集。提示还包括五个种族群体中的一个。接下来，我们评估了不同提示规格（紧急程度、患者门户分类群、种族）下信息各部分（主题、开头、主体、结尾）的语言特点。我们还评估了不同种族群体患者报告副作用的准确性和频率，并与真实世界的数据进行了比较：研究生成了 450 条合成的患者门户信息，我们评估了语言模式、药物副作用配对的准确性以及与真实世界数据相比的配对频率。使用LIWC进行的语言分析揭示了语言使用和礼貌方面的差异，对阳性预测值的分析确定了根据紧急程度和提示中的种族群体报告症状的差异。我们还发现了与 SIDER 数据库相似的药物副作用配对发生率：本研究证明了合成患者门户网站信息作为医疗保健研究宝贵资源的潜力。在创建了与药物相关的合成患者门户网站信息语料库后，我们发现了显著的语言差异，并评估了各种提示中药物副作用配对的准确性和频率。

{"title":"Taxonomy-based prompt engineering to generate synthetic drug-related patient portal messages","authors":"Natalie Wang , Sukrit Treewaree , Ayah Zirikly , Yuzhi L. Lu , Michelle H. Nguyen , Bhavik Agarwal , Jash Shah , James Michael Stevenson , Casey Overby Taylor","doi":"10.1016/j.jbi.2024.104752","DOIUrl":"10.1016/j.jbi.2024.104752","url":null,"abstract":"<div><h3>Objective:</h3><div>The objectives of this study were to: (1) create a corpus of synthetic drug-related patient portal messages to address the current lack of publicly available datasets for model development, (2) assess differences in language used and linguistics among the synthetic patient portal messages, and (3) assess the accuracy of patient-reported drug side effects for different racial groups.</div></div><div><h3>Methods:</h3><div>We leveraged a taxonomy for patient- and clinician-generated content to guide prompt engineering for synthetic drug-related patient portal messages. We generated two groups of messages: the first group (200 messages) used a subset of the taxonomy relevant to a broad range of drug-related messages and the second group (250 messages) used a subset of the taxonomy relevant to a narrow range of messages focused on side effects. Prompts also include one of five racial groups. Next, we assessed linguistic characteristics among message parts (subject, beginning, body, ending) across different prompt specifications (urgency, patient portal taxa, race). We also assessed the performance and frequency of patient-reported side effects across different racial groups and compared to data present in a real world data source (SIDER).</div></div><div><h3>Results:</h3><div>The study generated 450 synthetic patient portal messages, and we assessed linguistic patterns, accuracy of drug-side effect pairs, frequency of pairs compared to real world data. Linguistic analysis revealed variations in language usage and politeness and analysis of positive predictive values identified differences in symptoms reported based on urgency levels and racial groups in the prompt. We also found that low incident SIDER drug-side effect pairs were observed less frequently in our dataset.</div></div><div><h3>Conclusion:</h3><div>This study demonstrates the potential of synthetic patient portal messages as a valuable resource for healthcare research. After creating a corpus of synthetic drug-related patient portal messages, we identified significant language differences and provided evidence that drug-side effect pairs observed in messages are comparable to what is expected in real world settings.</div></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"160 ","pages":"Article 104752"},"PeriodicalIF":4.0,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142739561","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Sleep apnea test prediction based on Electronic Health Records 基于电子健康记录的睡眠呼吸暂停测试预测。

IF 4 2区医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Biomedical Informatics

Pub Date : 2024-12-01 DOI: 10.1016/j.jbi.2024.104737

Lama Abu Tahoun , Amit Shay Green , Tal Patalon , Yaron Dagan , Robert Moskovitch

The identification of Obstructive Sleep Apnea (OSA) is done by a Polysomnography test which is often done in later ages. Being able to notify potential insured members at earlier ages is desirable. For that, we develop predictive models that rely on Electronic Health Records (EHR) and predict whether a person will go through a sleep apnea test after the age of 50. A major challenge is the variability in EHR records in various insured members over the years, which this study investigates as well in the context of controls matching, and prediction. Since there are many temporal variables, the RankLi method was introduced for temporal variable selection. This approach employs the t-test to calculate a divergence score for each temporal variable between the target classes. We also investigate here the need to consider the number of EHR records, as part of control matching, and whether modeling separately for subgroups according to the number of EHR records is more effective. For each prediction task, we trained 4 different classifiers including 1-CNN, LSTM, Random Forest, and Logistic Regression, on data until the age of 40 or 50, and on several numbers of temporal variables. Using the number of EHR records for control matching was found crucial, and using learning models for subsets of the population according to the number of EHR records they have was found more effective. The deep learning models, particularly the 1-CNN, achieved the highest balanced accuracy and AUC scores in both male and female groups. In the male group, the highest results were also observed at age 50 with 100 temporal variables, resulting in a balanced accuracy of 90% and an AUC of 93%.

阻塞性睡眠呼吸暂停（OSA）是通过多导睡眠图检查来确定的，通常在晚年进行。我们希望能够在潜在投保人较早的年龄就通知他们。为此，我们开发了依赖电子健康记录（EHR）的预测模型，预测一个人是否会在 50 岁以后接受睡眠呼吸暂停测试。一个主要的挑战是不同参保人员多年来的电子健康记录存在差异，本研究在对照匹配和预测方面也对此进行了调查。由于存在许多时间变量，因此引入了 RankLi 方法来选择时间变量。这种方法采用 t 检验来计算目标类别之间每个时间变量的分歧分值。在此，我们还研究了作为控制匹配的一部分，是否需要考虑电子病历记录的数量，以及根据电子病历记录的数量为亚组单独建模是否更有效。针对每项预测任务，我们在 40 岁或 50 岁之前的数据和多个时间变量上训练了 4 种不同的分类器，包括 1-CNN、LSTM、随机森林和逻辑回归。我们发现，使用电子病历记录数量进行对照匹配至关重要，而根据电子病历记录数量对人群子集使用学习模型则更为有效。在男性组和女性组中，深度学习模型，尤其是 1-CNN 获得了最高的平衡准确率和 AUC 分数。在男性组中，50 岁时的结果也是最高的，有 100 个时间变量，平衡准确率为 90%，AUC 为 93%。

{"title":"Sleep apnea test prediction based on Electronic Health Records","authors":"Lama Abu Tahoun , Amit Shay Green , Tal Patalon , Yaron Dagan , Robert Moskovitch","doi":"10.1016/j.jbi.2024.104737","DOIUrl":"10.1016/j.jbi.2024.104737","url":null,"abstract":"<div><div>The identification of Obstructive Sleep Apnea (OSA) is done by a Polysomnography test which is often done in later ages. Being able to notify potential insured members at earlier ages is desirable. For that, we develop predictive models that rely on Electronic Health Records (EHR) and predict whether a person will go through a sleep apnea test after the age of 50. A major challenge is the variability in EHR records in various insured members over the years, which this study investigates as well in the context of controls matching, and prediction. Since there are many temporal variables, the RankLi method was introduced for temporal variable selection. This approach employs the t-test to calculate a divergence score for each temporal variable between the target classes. We also investigate here the need to consider the number of EHR records, as part of control matching, and whether modeling separately for subgroups according to the number of EHR records is more effective. For each prediction task, we trained 4 different classifiers including 1-CNN, LSTM, Random Forest, and Logistic Regression, on data until the age of 40 or 50, and on several numbers of temporal variables. Using the number of EHR records for control matching was found crucial, and using learning models for subsets of the population according to the number of EHR records they have was found more effective. The deep learning models, particularly the 1-CNN, achieved the highest balanced accuracy and AUC scores in both male and female groups. In the male group, the highest results were also observed at age 50 with 100 temporal variables, resulting in a balanced accuracy of 90% and an AUC of 93%.</div></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"160 ","pages":"Article 104737"},"PeriodicalIF":4.0,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142568735","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Structural analysis and intelligent classification of clinical trial eligibility criteria based on deep learning and medical text mining 基于深度学习和医学文本挖掘的临床试验资格标准的结构分析和智能分类。

IF 4 2区医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Biomedical Informatics

Pub Date : 2024-12-01 DOI: 10.1016/j.jbi.2024.104753

Yongzhong Han , Qianmin Su , Liang Liu , Ying Li , Jihan Huang

Objective:

To enhance the efficiency, quality, and innovation capability of clinical trials, this paper introduces a novel model called CTEC-AC (Clinical Trial Eligibility Criteria Automatic Classification), aimed at structuring clinical trial eligibility criteria into computationally explainable classifications.

Methods:

We obtained detailed information on the latest 2,500 clinical trials from ClinicalTrials.gov, generating over 20,000 eligibility criteria data entries. To enhance the expressiveness of these criteria, we integrated two powerful methods: ClinicalBERT and MetaMap. The resulting enhanced features were used as input for a hierarchical clustering algorithm. Post-processing included expert validation of the algorithm’s output to ensure the accuracy of the constructed annotated eligibility text corpus. Ultimately, our model was employed to automate the classification of eligibility criteria.

Results:

We identified 31 distinct categories to summarize the eligibility criteria written by clinical researchers and uncovered common themes in how these criteria are expressed. Using our automated classification model on a labeled dataset, we achieved a macro-average F1 score of 0.94.

Conclusion:

This work can automatically extract structured representations from unstructured eligibility criteria text, significantly advancing the informatization of clinical trials. This, in turn, can significantly enhance the intelligence of automated participant recruitment for clinical researchers.

目的为了提高临床试验的效率、质量和创新能力，本文介绍了一种名为 CTEC-AC（临床试验资格标准自动分类）的新型模型，旨在将临床试验资格标准结构化为可计算解释的分类：方法：我们从 ClinicalTrials.gov 获取了最新 2,500 项临床试验的详细信息，生成了 20,000 多个资格标准数据条目。为了增强这些标准的表达能力，我们整合了两种强大的方法：ClinicalBERT 和 MetaMap。由此产生的增强特征被用作分层聚类算法的输入。后处理包括对算法输出的专家验证，以确保所构建的注释资格文本语料库的准确性。最终，我们的模型被用于对资格标准进行自动分类：结果：我们确定了 31 个不同的类别来概括临床研究人员撰写的资格标准，并发现了这些标准表达方式的共同主题。在标注数据集上使用我们的自动分类模型，我们取得了 0.94 的宏观平均 F1 分数：这项工作可以从非结构化的资格标准文本中自动提取结构化表述，极大地推动了临床试验的信息化进程。这反过来又能大大提高临床研究人员自动招募参与者的智能化程度。

{"title":"Structural analysis and intelligent classification of clinical trial eligibility criteria based on deep learning and medical text mining","authors":"Yongzhong Han , Qianmin Su , Liang Liu , Ying Li , Jihan Huang","doi":"10.1016/j.jbi.2024.104753","DOIUrl":"10.1016/j.jbi.2024.104753","url":null,"abstract":"<div><h3>Objective:</h3><div>To enhance the efficiency, quality, and innovation capability of clinical trials, this paper introduces a novel model called CTEC-AC (Clinical Trial Eligibility Criteria Automatic Classification), aimed at structuring clinical trial eligibility criteria into computationally explainable classifications.</div></div><div><h3>Methods:</h3><div>We obtained detailed information on the latest 2,500 clinical trials from ClinicalTrials.gov, generating over 20,000 eligibility criteria data entries. To enhance the expressiveness of these criteria, we integrated two powerful methods: ClinicalBERT and MetaMap. The resulting enhanced features were used as input for a hierarchical clustering algorithm. Post-processing included expert validation of the algorithm’s output to ensure the accuracy of the constructed annotated eligibility text corpus. Ultimately, our model was employed to automate the classification of eligibility criteria.</div></div><div><h3>Results:</h3><div>We identified 31 distinct categories to summarize the eligibility criteria written by clinical researchers and uncovered common themes in how these criteria are expressed. Using our automated classification model on a labeled dataset, we achieved a macro-average F1 score of 0.94.</div></div><div><h3>Conclusion:</h3><div>This work can automatically extract structured representations from unstructured eligibility criteria text, significantly advancing the informatization of clinical trials. This, in turn, can significantly enhance the intelligence of automated participant recruitment for clinical researchers.</div></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"160 ","pages":"Article 104753"},"PeriodicalIF":4.0,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142739557","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Importance of variables from different time frames for predicting self-harm using health system data 利用医疗系统数据预测自残时不同时间段变量的重要性。

IF 4 2区医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Biomedical Informatics

Pub Date : 2024-11-16 DOI: 10.1016/j.jbi.2024.104750

Charles J. Wolock , Brian D. Williamson , Susan M. Shortreed , Gregory E. Simon , Karen J. Coleman , Rodney Yeargans , Brian K. Ahmedani , Yihe Daida , Frances L. Lynch , Rebecca C. Rossom , Rebecca A. Ziebell , Maricela Cruz , Robert D. Wellman , R. Yates Coley

Objective:

Self-harm risk prediction models developed using health system data (electronic health records and insurance claims information) often use patient information from up to several years prior to the index visit when the prediction is made. Measurements from some time periods may not be available for all patients. Using the framework of algorithm-agnostic variable importance, we study the predictive potential of variables corresponding to different time horizons prior to the index visit and demonstrate the application of variable importance techniques in the biomedical informatics setting.

Materials and Methods:

We use variable importance to quantify the potential of recent (up to three months before the index visit) and distant (more than one year before the index visit) patient mental health information for predicting self-harm risk using data from seven health systems. We quantify importance as the decrease in predictiveness when the variable set of interest is excluded from the prediction task. We define predictiveness using discriminative metrics: area under the receiver operating characteristic curve (AUC), sensitivity, and positive predictive value.

Results:

Mental health predictors corresponding to the three months prior to the index visit show strong signal of importance; in one setting, excluding these variables decreased AUC from 0.85 to 0.77. Predictors corresponding to more distant information were less important.

Discussion:

Predictors from the months immediately preceding the index visit are highly important. Implementation of self-harm prediction models may be challenging in settings where recent data are not completely available (e.g., due to lags in insurance claims processing) at the time a prediction is made.

Conclusion:

Clinically derived variables from different time frames exhibit varying levels of importance for predicting self-harm. Variable importance analyses can inform whether and how to implement risk prediction models into clinical practice given real-world data limitations. These analyses be applied more broadly in biomedical informatics research to provide insight into general clinical risk prediction tasks.

目的：利用医疗系统数据（电子健康记录和保险理赔信息）开发的自残风险预测模型通常会使用患者在进行预测时的就诊指数之前长达数年的信息。可能无法获得所有患者在某些时间段的测量数据。利用算法诊断变量重要性框架，我们研究了指数就诊前不同时间段相应变量的预测潜力，并展示了变量重要性技术在生物医学信息学中的应用：我们使用七个医疗系统的数据，利用变量重要性来量化近期（指数就诊前三个月内）和远期（指数就诊前一年以上）患者心理健康信息预测自残风险的潜力。我们将重要性量化为预测任务中排除相关变量集后预测性的下降幅度。我们使用判别指标来定义预测性：接收者操作特征曲线下面积（AUC）、灵敏度和阳性预测值：结果：与指标就诊前三个月相对应的心理健康预测因子显示出强烈的重要性信号；在一种情况下，排除这些变量后，AUC 从 0.85 降至 0.77。与更远的信息相对应的预测因子则不那么重要：讨论：指标就诊前几个月的预测因素非常重要。在预测时近期数据不完全可用（例如，由于保险理赔处理的滞后性）的情况下，自残预测模型的实施可能具有挑战性：结论：不同时间段的临床变量在预测自残时表现出不同程度的重要性。鉴于现实世界数据的局限性，变量重要性分析可以为是否以及如何在临床实践中实施风险预测模型提供信息。这些分析可更广泛地应用于生物医学信息学研究，为一般临床风险预测任务提供见解。

{"title":"Importance of variables from different time frames for predicting self-harm using health system data","authors":"Charles J. Wolock , Brian D. Williamson , Susan M. Shortreed , Gregory E. Simon , Karen J. Coleman , Rodney Yeargans , Brian K. Ahmedani , Yihe Daida , Frances L. Lynch , Rebecca C. Rossom , Rebecca A. Ziebell , Maricela Cruz , Robert D. Wellman , R. Yates Coley","doi":"10.1016/j.jbi.2024.104750","DOIUrl":"10.1016/j.jbi.2024.104750","url":null,"abstract":"<div><h3>Objective:</h3><div>Self-harm risk prediction models developed using health system data (electronic health records and insurance claims information) often use patient information from up to several years prior to the index visit when the prediction is made. Measurements from some time periods may not be available for all patients. Using the framework of algorithm-agnostic variable importance, we study the predictive potential of variables corresponding to different time horizons prior to the index visit and demonstrate the application of variable importance techniques in the biomedical informatics setting.</div></div><div><h3>Materials and Methods:</h3><div>We use variable importance to quantify the potential of recent (up to three months before the index visit) and distant (more than one year before the index visit) patient mental health information for predicting self-harm risk using data from seven health systems. We quantify importance as the decrease in predictiveness when the variable set of interest is excluded from the prediction task. We define predictiveness using discriminative metrics: area under the receiver operating characteristic curve (AUC), sensitivity, and positive predictive value.</div></div><div><h3>Results:</h3><div>Mental health predictors corresponding to the three months prior to the index visit show strong signal of importance; in one setting, excluding these variables decreased AUC from 0.85 to 0.77. Predictors corresponding to more distant information were less important.</div></div><div><h3>Discussion:</h3><div>Predictors from the months immediately preceding the index visit are highly important. Implementation of self-harm prediction models may be challenging in settings where recent data are not completely available (e.g., due to lags in insurance claims processing) at the time a prediction is made.</div></div><div><h3>Conclusion:</h3><div>Clinically derived variables from different time frames exhibit varying levels of importance for predicting self-harm. Variable importance analyses can inform whether and how to implement risk prediction models into clinical practice given real-world data limitations. These analyses be applied more broadly in biomedical informatics research to provide insight into general clinical risk prediction tasks.</div></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"160 ","pages":"Article 104750"},"PeriodicalIF":4.0,"publicationDate":"2024-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142668134","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Machine learning approaches for the discovery of clinical pathways from patient data: A systematic review 从患者数据中发现临床路径的机器学习方法：系统综述。

IF 4 2区医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Biomedical Informatics

Pub Date : 2024-11-12 DOI: 10.1016/j.jbi.2024.104746

Lillian Muyama , Antoine Neuraz , Adrien Coulet

Background:

Clinical pathways are sequences of events followed during the clinical care of a group of patients who meet pre-defined criteria. They have many applications ranging from healthcare evaluation and optimization to clinical decision support. These pathways can be discovered from existing healthcare data, in particular with machine learning which is a family of methods used to learn patterns from data. This review provides a comprehensive overview of the literature concerning the use of machine learning methods for clinical pathway discovery from patient data.

Methods:

Guided by the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) method , we conducted a systematic review of the existing literature. We searched 6 databases, i.e., ACM Digital Library, ScienceDirect, Web of Science, PubMed, IEEE Xplore, and Scopus spanning from January 2004 to December 2023 using search terms pertinent to clinical pathways and their development. Subsequently, the retrieved papers were analyzed to assess their relevance to the scope of this study.

Results:

In total, 131 papers that met the specified inclusion criteria were identified. These papers expressed diverse motivations behind data-driven clinical pathway discovery ranging from knowledge discovery to conformance checking with established clinical guidelines (derived from existing literature and clinical experts). Notably, the predominant methods employed (67.2%,

n

=88) involved unsupervised machine learning techniques, such as clustering and process mining.

Conclusions:

Relevant clinical pathways can be discovered from patient data using machine learning methods, with the desirable potential to aid clinical decision-making in healthcare. However, to reach this objective, the methods used to discover pathways should be reproducible, and rigorous performance evaluation by clinical experts needs to be conducted for validation.

背景：临床路径是对一组符合预定标准的患者进行临床治疗时所遵循的事件序列。它们有很多应用，从医疗评估和优化到临床决策支持。这些路径可以从现有的医疗数据中发现，特别是通过机器学习，机器学习是从数据中学习模式的一系列方法。本综述全面概述了有关使用机器学习方法从患者数据中发现临床路径的文献：在系统综述和荟萃分析首选报告项目（PRISMA）方法的指导下，我们对现有文献进行了系统综述。我们检索了 6 个数据库，即 ACM Digital Library、ScienceDirect、Web of Science、PubMed、IEEE Xplore 和 Scopus，检索时间跨度为 2004 年 1 月至 2023 年 12 月，检索词与临床路径及其开发相关。随后，对检索到的论文进行了分析，以评估它们与本研究范围的相关性：共有 131 篇论文符合特定的纳入标准。这些论文表达了数据驱动临床路径发现背后的各种动机，从知识发现到与既定临床指南（来自现有文献和临床专家）的一致性检查。值得注意的是，采用的主要方法（67.2%，n=88）涉及无监督机器学习方法，如聚类和流程挖掘：结论：使用机器学习方法可以从患者数据中发现相关的临床路径，具有帮助医疗保健临床决策的理想潜力。然而，要实现这一目标，用于发现路径的方法应具有可重复性，并由临床专家进行严格的性能评估以进行验证。

{"title":"Machine learning approaches for the discovery of clinical pathways from patient data: A systematic review","authors":"Lillian Muyama , Antoine Neuraz , Adrien Coulet","doi":"10.1016/j.jbi.2024.104746","DOIUrl":"10.1016/j.jbi.2024.104746","url":null,"abstract":"<div><h3>Background:</h3><div>Clinical pathways are sequences of events followed during the clinical care of a group of patients who meet pre-defined criteria. They have many applications ranging from healthcare evaluation and optimization to clinical decision support. These pathways can be discovered from existing healthcare data, in particular with machine learning which is a family of methods used to learn patterns from data. This review provides a comprehensive overview of the literature concerning the use of machine learning methods for clinical pathway discovery from patient data.</div></div><div><h3>Methods:</h3><div>Guided by the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) method , we conducted a systematic review of the existing literature. We searched 6 databases, <em>i.e.</em>, ACM Digital Library, ScienceDirect, Web of Science, PubMed, IEEE Xplore, and Scopus spanning from January 2004 to December 2023 using search terms pertinent to clinical pathways and their development. Subsequently, the retrieved papers were analyzed to assess their relevance to the scope of this study.</div></div><div><h3>Results:</h3><div>In total, 131 papers that met the specified inclusion criteria were identified. These papers expressed diverse motivations behind data-driven clinical pathway discovery ranging from knowledge discovery to conformance checking with established clinical guidelines (derived from existing literature and clinical experts). Notably, the predominant methods employed (67.2%, <span><math><mi>n</mi></math></span>=88) involved unsupervised machine learning techniques, such as clustering and process mining.</div></div><div><h3>Conclusions:</h3><div>Relevant clinical pathways can be discovered from patient data using machine learning methods, with the desirable potential to aid clinical decision-making in healthcare. However, to reach this objective, the methods used to discover pathways should be reproducible, and rigorous performance evaluation by clinical experts needs to be conducted for validation.</div></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"160 ","pages":"Article 104746"},"PeriodicalIF":4.0,"publicationDate":"2024-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142621220","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Cross-Modal self-supervised vision language pre-training with multiple objectives for medical visual question answering 针对医学视觉问题解答的多目标跨模态自监督视觉语言预训练。

IF 4 2区医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Biomedical Informatics

Pub Date : 2024-11-12 DOI: 10.1016/j.jbi.2024.104748

Gang Liu , Jinlong He , Pengfei Li , Zixu Zhao , Shenjun Zhong

Medical Visual Question Answering (VQA) is a task that aims to provide answers to questions about medical images, which utilizes both visual and textual information in the reasoning process. The absence of large-scale annotated medical VQA datasets presents a formidable obstacle to training a medical VQA model from scratch in an end-to-end manner. Existing works have been using image captioning dataset in the pre-training stage and fine-tuning to downstream VQA tasks. Following the same paradigm, we use a collection of public medical image captioning datasets to pre-train multimodality models in a self-supervised setup, and fine-tune to downstream medical VQA tasks. In the work, we propose a method that featured with Cross-Modal pre-training with Multiple Objectives (CMMO), which includes masked image modeling, masked language modeling, image-text matching, and image-text contrastive learning. The proposed method is designed to associate the visual features of medical images with corresponding medical concepts in captions, for learning aligned vision and language feature representations, and multi-modal interactions. The experimental results reveal that our proposed CMMO method outperforms state-of-the-art methods on three public medical VQA datasets, showing absolute improvements of 2.6%, 0.9%, and 4.0% on the VQA-RAD, PathVQA, and SLAKE dataset, respectively. We also conduct comprehensive ablation studies to validate our method, and visualize the attention maps which show a strong interpretability. The code and pre-trained weights will be released at https://github.com/pengfeiliHEU/CMMO.

医学视觉问题解答（VQA）是一项旨在为医学图像问题提供答案的任务，它在推理过程中同时利用了视觉和文本信息。由于缺乏大规模的注释医学 VQA 数据集，要以端到端的方式从头开始训练医学 VQA 模型面临着巨大的障碍。现有的工作都是在预训练阶段使用图像标题数据集，然后根据下游的 VQA 任务进行微调。按照同样的模式，我们使用公共医疗图像标题数据集在自监督设置中预训练多模态模型，并根据下游医疗 VQA 任务进行微调。在这项工作中，我们提出了一种以多目标交叉模态预训练（CMMO）为特色的方法，其中包括屏蔽图像建模、屏蔽语言建模、图像-文本匹配和图像-文本对比学习。所提出的方法旨在将医学图像的视觉特征与标题中相应的医学概念联系起来，以学习一致的视觉和语言特征表征以及多模态交互。实验结果表明，我们提出的 CMMO 方法在三个公共医疗 VQA 数据集上的表现优于最先进的方法，在 VQA-RAD、PathVQA 和 SLAKE 数据集上的绝对改进率分别为 2.6%、0.9% 和 4.0%。我们还进行了全面的消融研究，以验证我们的方法，并对注意力图进行了可视化，结果显示了很强的可解释性。代码和预训练权重将在 https://github.com/pengfeiliHEU/CMMO 上发布。

{"title":"Cross-Modal self-supervised vision language pre-training with multiple objectives for medical visual question answering","authors":"Gang Liu , Jinlong He , Pengfei Li , Zixu Zhao , Shenjun Zhong","doi":"10.1016/j.jbi.2024.104748","DOIUrl":"10.1016/j.jbi.2024.104748","url":null,"abstract":"<div><div>Medical Visual Question Answering (VQA) is a task that aims to provide answers to questions about medical images, which utilizes both visual and textual information in the reasoning process. The absence of large-scale annotated medical VQA datasets presents a formidable obstacle to training a medical VQA model from scratch in an end-to-end manner. Existing works have been using image captioning dataset in the pre-training stage and fine-tuning to downstream VQA tasks. Following the same paradigm, we use a collection of public medical image captioning datasets to pre-train multimodality models in a self-supervised setup, and fine-tune to downstream medical VQA tasks. In the work, we propose a method that featured with Cross-Modal pre-training with Multiple Objectives (CMMO), which includes masked image modeling, masked language modeling, image-text matching, and image-text contrastive learning. The proposed method is designed to associate the visual features of medical images with corresponding medical concepts in captions, for learning aligned vision and language feature representations, and multi-modal interactions. The experimental results reveal that our proposed CMMO method outperforms state-of-the-art methods on three public medical VQA datasets, showing absolute improvements of 2.6%, 0.9%, and 4.0% on the VQA-RAD, PathVQA, and SLAKE dataset, respectively. We also conduct comprehensive ablation studies to validate our method, and visualize the attention maps which show a strong interpretability. The code and pre-trained weights will be released at <span><span>https://github.com/pengfeiliHEU/CMMO</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"160 ","pages":"Article 104748"},"PeriodicalIF":4.0,"publicationDate":"2024-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142621216","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0