Journal of Biomedical Informatics最新文献_第10页

Interactive dual-stream contrastive learning for radiology report generation 用于生成放射学报告的交互式双流对比学习

IF 4 2区医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Biomedical Informatics

Pub Date : 2024-09-01 DOI: 10.1016/j.jbi.2024.104718

Ziqi Zhang, Ailian Jiang

Radiology report generation automates diagnostic narrative synthesis from medical imaging data. Current report generation methods primarily employ knowledge graphs for image enhancement, neglecting the interpretability and guiding function of the knowledge graphs themselves. Additionally, few approaches leverage the stable modal alignment information from multimodal pre-trained models to facilitate the generation of radiology reports. We propose the Terms-Guided Radiology Report Generation (TGR), a simple and practical model for generating reports guided primarily by anatomical terms. Specifically, we utilize a dual-stream visual feature extraction module comprised of detail extraction module and a frozen multimodal pre-trained model to separately extract visual detail features and semantic features. Furthermore, a Visual Enhancement Module (VEM) is proposed to further enrich the visual features, thereby facilitating the generation of a list of anatomical terms. We integrate anatomical terms with image features and proceed to engage contrastive learning with frozen text embeddings, utilizing the stable feature space from these embeddings to boost modal alignment capabilities further. Our model incorporates the capability for manual input, enabling it to generate a list of organs for specifically focused abnormal areas or to produce more accurate single-sentence descriptions based on selected anatomical terms. Comprehensive experiments demonstrate the effectiveness of our method in report generation tasks, our TGR-S model reduces training parameters by 38.9% while performing comparably to current state-of-the-art models, and our TGR-B model exceeds the best baseline models across multiple metrics.

放射学报告生成可自动根据医学影像数据进行诊断叙述综合。目前的报告生成方法主要采用知识图谱来增强图像，而忽略了知识图谱本身的可解释性和指导功能。此外，很少有方法利用多模态预训练模型的稳定模态配准信息来促进放射报告的生成。我们提出的术语指导放射报告生成（TGR）是一种简单实用的模型，主要以解剖术语为指导生成报告。具体来说，我们利用由细节提取模块和冷冻多模态预训练模型组成的双流视觉特征提取模块，分别提取视觉细节特征和语义特征。此外，我们还提出了视觉增强模块（VEM）来进一步丰富视觉特征，从而促进解剖术语列表的生成。我们将解剖术语与图像特征整合在一起，然后使用冻结文本嵌入进行对比学习，利用这些嵌入的稳定特征空间进一步提高模态配准能力。我们的模型具有手动输入功能，可针对特定的异常区域生成器官列表，或根据选定的解剖术语生成更准确的单句描述。综合实验证明了我们的方法在报告生成任务中的有效性，我们的 TGR-S 模型减少了 38.9% 的训练参数，同时与当前最先进的模型性能相当，而我们的 TGR-B 模型在多个指标上都超过了最佳基线模型。

{"title":"Interactive dual-stream contrastive learning for radiology report generation","authors":"Ziqi Zhang, Ailian Jiang","doi":"10.1016/j.jbi.2024.104718","DOIUrl":"10.1016/j.jbi.2024.104718","url":null,"abstract":"<div><p>Radiology report generation automates diagnostic narrative synthesis from medical imaging data. Current report generation methods primarily employ knowledge graphs for image enhancement, neglecting the interpretability and guiding function of the knowledge graphs themselves. Additionally, few approaches leverage the stable modal alignment information from multimodal pre-trained models to facilitate the generation of radiology reports. We propose the Terms-Guided Radiology Report Generation (TGR), a simple and practical model for generating reports guided primarily by anatomical terms. Specifically, we utilize a dual-stream visual feature extraction module comprised of detail extraction module and a frozen multimodal pre-trained model to separately extract visual detail features and semantic features. Furthermore, a Visual Enhancement Module (VEM) is proposed to further enrich the visual features, thereby facilitating the generation of a list of anatomical terms. We integrate anatomical terms with image features and proceed to engage contrastive learning with frozen text embeddings, utilizing the stable feature space from these embeddings to boost modal alignment capabilities further. Our model incorporates the capability for manual input, enabling it to generate a list of organs for specifically focused abnormal areas or to produce more accurate single-sentence descriptions based on selected anatomical terms. Comprehensive experiments demonstrate the effectiveness of our method in report generation tasks, our TGR-S model reduces training parameters by 38.9% while performing comparably to current state-of-the-art models, and our TGR-B model exceeds the best baseline models across multiple metrics.</p></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"157 ","pages":"Article 104718"},"PeriodicalIF":4.0,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142094701","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

SSGU-CD: A combined semantic and structural information graph U-shaped network for document-level Chemical-Disease interaction extraction SSGU-CD：用于文档级化学-疾病交互提取的语义和结构信息图 U 型组合网络。

IF 4 2区医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Biomedical Informatics

Pub Date : 2024-09-01 DOI: 10.1016/j.jbi.2024.104719

Pengyuan Nie , Jinzhong Ning , Mengxuan Lin , Zhihao Yang , Lei Wang

Document-level interaction extraction for Chemical-Disease is aimed at inferring the interaction relations between chemical entities and disease entities across multiple sentences. Compared with sentence-level relation extraction, document-level relation extraction can capture the associations between different entities throughout the entire document, which is found to be more practical for biomedical text information. However, current biomedical extraction methods mainly concentrate on sentence-level relation extraction, making it difficult to access the rich structural information contained in documents in practical application scenarios. We put forward SSGU-CD, a combined Semantic and Structural information Graph U-shaped network for document-level Chemical-Disease interaction extraction. This framework effectively stores document semantic and structure information as graphs and can fuse the original context information of documents. Using the framework, we propose a balanced combination of cross-entropy loss function to facilitate collaborative optimization among models with the aim of enhancing the ability to extract Chemical-Disease interaction relations. We evaluated SSGU-CD on the document-level relation extraction dataset CDR and BioRED, and the results demonstrate that the framework can significantly improve the extraction performance.

化学-疾病的文档级交互关系抽取旨在推断多个句子中化学实体与疾病实体之间的交互关系。与句子级关系提取相比，文档级关系提取可以捕捉整个文档中不同实体之间的关联，这对于生物医学文本信息来说更为实用。然而，目前的生物医学提取方法主要集中于句子级关系提取，在实际应用场景中很难获取文档中包含的丰富结构信息。我们提出了一种用于文档级化学-疾病交互提取的语义与结构信息图U形网络（Semantic and Structural information Graph U-shaped network）。该框架能有效地将文档语义和结构信息存储为图，并能融合文档的原始上下文信息。利用该框架，我们提出了交叉熵损失函数的平衡组合，以促进模型间的协同优化，从而提高提取化学-疾病交互关系的能力。我们在文档级关系提取数据集 CDR 和 BioRED 上对 SSGU-CD 进行了评估，结果表明该框架能显著提高提取性能。

{"title":"SSGU-CD: A combined semantic and structural information graph U-shaped network for document-level Chemical-Disease interaction extraction","authors":"Pengyuan Nie , Jinzhong Ning , Mengxuan Lin , Zhihao Yang , Lei Wang","doi":"10.1016/j.jbi.2024.104719","DOIUrl":"10.1016/j.jbi.2024.104719","url":null,"abstract":"<div><p>Document-level interaction extraction for Chemical-Disease is aimed at inferring the interaction relations between chemical entities and disease entities across multiple sentences. Compared with sentence-level relation extraction, document-level relation extraction can capture the associations between different entities throughout the entire document, which is found to be more practical for biomedical text information. However, current biomedical extraction methods mainly concentrate on sentence-level relation extraction, making it difficult to access the rich structural information contained in documents in practical application scenarios. We put forward SSGU-CD, a combined <strong><u>S</u></strong>emantic and <strong><u>S</u></strong>tructural information <strong><u>G</u></strong>raph <strong><u>U</u></strong>-shaped network for document-level <strong><u>C</u></strong>hemical-<strong><u>D</u></strong>isease interaction extraction. This framework effectively stores document semantic and structure information as graphs and can fuse the original context information of documents. Using the framework, we propose a balanced combination of cross-entropy loss function to facilitate collaborative optimization among models with the aim of enhancing the ability to extract Chemical-Disease interaction relations. We evaluated SSGU-CD on the document-level relation extraction dataset CDR and BioRED, and the results demonstrate that the framework can significantly improve the extraction performance.</p></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"157 ","pages":"Article 104719"},"PeriodicalIF":4.0,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1532046424001370/pdfft?md5=ccbd03895ffdd2c9164f4a506fad5a18&pid=1-s2.0-S1532046424001370-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142107838","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

MolCFL: A personalized and privacy-preserving drug discovery framework based on generative clustered federated learning MolCFL：基于生成聚类联合学习的个性化和保护隐私的药物发现框架。

IF 4 2区医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Biomedical Informatics

Pub Date : 2024-09-01 DOI: 10.1016/j.jbi.2024.104712

Yan Guo, Yongqiang Gao, Jiawei Song

In today’s era of rapid development of large models, the traditional drug development process is undergoing a profound transformation. The vast demand for data and consumption of computational resources are making independent drug discovery increasingly difficult. By integrating federated learning technology into the drug discovery field, we have found a solution that both protects privacy and shares computational power. However, the differences in data held by various pharmaceutical institutions and the diversity in drug design objectives have exacerbated the issue of data heterogeneity, making traditional federated learning consensus models unable to meet the personalized needs of all parties. In this study, we introduce and evaluate an innovative drug discovery framework, MolCFL, which utilizes a multi-layer perceptron (MLP) as the generator and a graph convolutional network (GCN) as the discriminator in a generative adversarial network (GAN). By learning the graph structure of molecules, it generates new molecules in a highly personalized manner and then optimizes the learning process by clustering federated learning, grouping compound data with high similarity. MolCFL not only enhances the model’s ability to protect privacy but also significantly improves the efficiency and personalization of molecular design. MolCFL exhibits superior performance when handling non-independently and identically distributed data compared to traditional models. Experimental results show that the framework demonstrates outstanding performance on two benchmark datasets, with the generated new molecules achieving over 90% in Uniqueness and close to 100% in Novelty. MolCFL not only improves the quality and efficiency of drug molecule design but also, through its highly customized clustered federated learning environment, promotes collaboration and specialization in the drug discovery process while ensuring data privacy. These features make MolCFL a powerful tool suitable for addressing the various challenges faced in the modern drug research and development field.

在当今大型模型快速发展的时代，传统的药物研发过程正在经历一场深刻的变革。对数据的巨大需求和对计算资源的消耗使得独立药物发现变得越来越困难。通过将联合学习技术融入药物研发领域，我们找到了一种既能保护隐私又能共享计算能力的解决方案。然而，不同制药机构所掌握数据的差异和药物设计目标的多样性加剧了数据异构问题，使得传统的联合学习共识模型无法满足各方的个性化需求。在本研究中，我们介绍并评估了一种创新的药物发现框架--MolCFL，它在生成对抗网络（GAN）中使用多层感知器（MLP）作为生成器，图卷积网络（GCN）作为判别器。通过学习分子的图结构，它能以高度个性化的方式生成新分子，然后通过聚类联合学习优化学习过程，将具有高度相似性的复合数据分组。MolCFL 不仅增强了模型保护隐私的能力，还显著提高了分子设计的效率和个性化程度。与传统模型相比，MolCFL 在处理非独立和同分布数据时表现出更优越的性能。实验结果表明，该框架在两个基准数据集上表现出色，生成的新分子唯一性超过90%，新颖性接近100%。MolCFL不仅提高了药物分子设计的质量和效率，而且通过其高度定制的集群联合学习环境，促进了药物发现过程中的协作和专业化，同时确保了数据隐私。这些特点使 MolCFL 成为一个强大的工具，适用于应对现代药物研发领域面临的各种挑战。

{"title":"MolCFL: A personalized and privacy-preserving drug discovery framework based on generative clustered federated learning","authors":"Yan Guo, Yongqiang Gao, Jiawei Song","doi":"10.1016/j.jbi.2024.104712","DOIUrl":"10.1016/j.jbi.2024.104712","url":null,"abstract":"<div><p>In today’s era of rapid development of large models, the traditional drug development process is undergoing a profound transformation. The vast demand for data and consumption of computational resources are making independent drug discovery increasingly difficult. By integrating federated learning technology into the drug discovery field, we have found a solution that both protects privacy and shares computational power. However, the differences in data held by various pharmaceutical institutions and the diversity in drug design objectives have exacerbated the issue of data heterogeneity, making traditional federated learning consensus models unable to meet the personalized needs of all parties. In this study, we introduce and evaluate an innovative drug discovery framework, MolCFL, which utilizes a multi-layer perceptron (MLP) as the generator and a graph convolutional network (GCN) as the discriminator in a generative adversarial network (GAN). By learning the graph structure of molecules, it generates new molecules in a highly personalized manner and then optimizes the learning process by clustering federated learning, grouping compound data with high similarity. MolCFL not only enhances the model’s ability to protect privacy but also significantly improves the efficiency and personalization of molecular design. MolCFL exhibits superior performance when handling non-independently and identically distributed data compared to traditional models. Experimental results show that the framework demonstrates outstanding performance on two benchmark datasets, with the generated new molecules achieving over 90% in Uniqueness and close to 100% in Novelty. MolCFL not only improves the quality and efficiency of drug molecule design but also, through its highly customized clustered federated learning environment, promotes collaboration and specialization in the drug discovery process while ensuring data privacy. These features make MolCFL a powerful tool suitable for addressing the various challenges faced in the modern drug research and development field.</p></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"157 ","pages":"Article 104712"},"PeriodicalIF":4.0,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142055706","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Advancing Chinese biomedical text mining with community challenges 以社区挑战推进中文生物医学文本挖掘。

IF 4 2区医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Biomedical Informatics

Pub Date : 2024-09-01 DOI: 10.1016/j.jbi.2024.104716

Hui Zong , Rongrong Wu , Jiaxue Cha , Weizhe Feng , Erman Wu , Jiakun Li , Aibin Shao , Liang Tao , Zuofeng Li , Buzhou Tang , Bairong Shen

Objective

This study aims to review the recent advances in community challenges for biomedical text mining in China.

Methods

We collected information of evaluation tasks released in community challenges of biomedical text mining, including task description, dataset description, data source, task type and related links. A systematic summary and comparative analysis were conducted on various biomedical natural language processing tasks, such as named entity recognition, entity normalization, attribute extraction, relation extraction, event extraction, text classification, text similarity, knowledge graph construction, question answering, text generation, and large language model evaluation.

Results

We identified 39 evaluation tasks from 6 community challenges that spanned from 2017 to 2023. Our analysis revealed the diverse range of evaluation task types and data sources in biomedical text mining. We explored the potential clinical applications of these community challenge tasks from a translational biomedical informatics perspective. We compared with their English counterparts, and discussed the contributions, limitations, lessons and guidelines of these community challenges, while highlighting future directions in the era of large language models.

Conclusion

Community challenge evaluation competitions have played a crucial role in promoting technology innovation and fostering interdisciplinary collaboration in the field of biomedical text mining. These challenges provide valuable platforms for researchers to develop state-of-the-art solutions.

研究目的本研究旨在回顾中国生物医学文本挖掘社区挑战赛的最新进展：我们收集了生物医学文本挖掘社区挑战赛发布的评估任务信息，包括任务描述、数据集描述、数据来源、任务类型和相关链接。对命名实体识别、实体规范化、属性提取、关系提取、事件提取、文本分类、文本相似性、知识图谱构建、问题解答、文本生成、大型语言模型评估等各类生物医学自然语言处理任务进行了系统总结和对比分析：我们从 2017 年至 2023 年的 6 个社区挑战中确定了 39 项评估任务。我们的分析揭示了生物医学文本挖掘中评估任务类型和数据来源的多样性。我们从转化生物医学信息学的角度探讨了这些社区挑战任务的潜在临床应用。我们将这些社区挑战赛与英文版挑战赛进行了比较，并讨论了这些社区挑战赛的贡献、局限性、经验教训和指导原则，同时强调了大语言模型时代的未来发展方向：社区挑战评估竞赛在促进生物医学文本挖掘领域的技术创新和跨学科合作方面发挥了重要作用。这些挑战赛为研究人员开发最先进的解决方案提供了宝贵的平台。

{"title":"Advancing Chinese biomedical text mining with community challenges","authors":"Hui Zong , Rongrong Wu , Jiaxue Cha , Weizhe Feng , Erman Wu , Jiakun Li , Aibin Shao , Liang Tao , Zuofeng Li , Buzhou Tang , Bairong Shen","doi":"10.1016/j.jbi.2024.104716","DOIUrl":"10.1016/j.jbi.2024.104716","url":null,"abstract":"<div><h3>Objective</h3><p>This study aims to review the recent advances in community challenges for biomedical text mining in China.</p></div><div><h3>Methods</h3><p>We collected information of evaluation tasks released in community challenges of biomedical text mining, including task description, dataset description, data source, task type and related links. A systematic summary and comparative analysis were conducted on various biomedical natural language processing tasks, such as named entity recognition, entity normalization, attribute extraction, relation extraction, event extraction, text classification, text similarity, knowledge graph construction, question answering, text generation, and large language model evaluation.</p></div><div><h3>Results</h3><p>We identified 39 evaluation tasks from 6 community challenges that spanned from 2017 to 2023. Our analysis revealed the diverse range of evaluation task types and data sources in biomedical text mining. We explored the potential clinical applications of these community challenge tasks from a translational biomedical informatics perspective. We compared with their English counterparts, and discussed the contributions, limitations, lessons and guidelines of these community challenges, while highlighting future directions in the era of large language models.</p></div><div><h3>Conclusion</h3><p>Community challenge evaluation competitions have played a crucial role in promoting technology innovation and fostering interdisciplinary collaboration in the field of biomedical text mining. These challenges provide valuable platforms for researchers to develop state-of-the-art solutions.</p></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"157 ","pages":"Article 104716"},"PeriodicalIF":4.0,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1532046424001345/pdfft?md5=90f19a6b5c337cb24358bf3c1497f985&pid=1-s2.0-S1532046424001345-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142093211","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

BGformer: An improved Informer model to enhance blood glucose prediction BGformer：改进的 Informer 模型可提高血糖预测能力

IF 4 2区医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Biomedical Informatics

Pub Date : 2024-08-26 DOI: 10.1016/j.jbi.2024.104715

Yuewei Xue, Shaopeng Guan, Wanhai Jia

Accurately predicting blood glucose levels is crucial in diabetes management to mitigate patients’ risk of complications. However, blood glucose values exhibit instability, and existing prediction methods often struggle to capture their volatile nature, leading to inaccurate trend forecasts. To address these challenges, we propose a novel blood glucose level prediction model based on the Informer architecture: BGformer. Our model introduces a feature enhancement module and a microscale overlapping concerns mechanism. The feature enhancement module integrates periodic and trend feature extractors, enhancing the model’s ability to capture relevant information from the data. By extending the feature extraction capacity of time series data, it provides richer feature representations for analysis. Meanwhile, the microscale overlapping concerns mechanism adopts a window-based strategy, computing attention scores only within specific windows. This approach reduces computational complexity while enhancing the model’s capacity to capture local temporal dependencies. Furthermore, we introduce a dual attention enhancement module to augment the model’s expressive capability. Through prediction experiments on blood glucose values from sixteen diabetic patients, our model outperformed eight benchmark models in terms of both MAE and RMSE metrics for future 60-minute and 90-minute predictions. Our proposed scheme significantly improves the model’s dependency-capturing ability, resulting in more accurate blood glucose level predictions.

准确预测血糖水平对糖尿病管理至关重要，可降低患者出现并发症的风险。然而，血糖值具有不稳定性，现有的预测方法往往难以捕捉其波动性，导致趋势预测不准确。为了应对这些挑战，我们提出了一种基于 Informer 架构的新型血糖水平预测模型：BGformer。我们的模型引入了特征增强模块和微尺度重叠关注机制。特征增强模块集成了周期和趋势特征提取器，增强了模型从数据中捕捉相关信息的能力。通过扩展时间序列数据的特征提取能力，它为分析提供了更丰富的特征表示。同时，微尺度重叠关注机制采用基于窗口的策略，只计算特定窗口内的关注分数。这种方法既降低了计算复杂度，又增强了模型捕捉局部时间依赖性的能力。此外，我们还引入了双重注意力增强模块，以提高模型的表达能力。通过对 16 名糖尿病患者的血糖值进行预测实验，我们的模型在未来 60 分钟和 90 分钟预测的 MAE 和 RMSE 指标方面均优于 8 个基准模型。我们提出的方案大大提高了模型的依赖捕捉能力，使血糖水平预测更加准确。

{"title":"BGformer: An improved Informer model to enhance blood glucose prediction","authors":"Yuewei Xue, Shaopeng Guan, Wanhai Jia","doi":"10.1016/j.jbi.2024.104715","DOIUrl":"10.1016/j.jbi.2024.104715","url":null,"abstract":"<div><p>Accurately predicting blood glucose levels is crucial in diabetes management to mitigate patients’ risk of complications. However, blood glucose values exhibit instability, and existing prediction methods often struggle to capture their volatile nature, leading to inaccurate trend forecasts. To address these challenges, we propose a novel blood glucose level prediction model based on the Informer architecture: BGformer. Our model introduces a feature enhancement module and a microscale overlapping concerns mechanism. The feature enhancement module integrates periodic and trend feature extractors, enhancing the model’s ability to capture relevant information from the data. By extending the feature extraction capacity of time series data, it provides richer feature representations for analysis. Meanwhile, the microscale overlapping concerns mechanism adopts a window-based strategy, computing attention scores only within specific windows. This approach reduces computational complexity while enhancing the model’s capacity to capture local temporal dependencies. Furthermore, we introduce a dual attention enhancement module to augment the model’s expressive capability. Through prediction experiments on blood glucose values from sixteen diabetic patients, our model outperformed eight benchmark models in terms of both MAE and RMSE metrics for future 60-minute and 90-minute predictions. Our proposed scheme significantly improves the model’s dependency-capturing ability, resulting in more accurate blood glucose level predictions.</p></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"157 ","pages":"Article 104715"},"PeriodicalIF":4.0,"publicationDate":"2024-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142089056","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Identifying cooperating cancer driver genes in individual patients through hypergraph random walk 通过超图随机漫步识别单个患者中相互合作的癌症驱动基因

IF 4 2区医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Biomedical Informatics

Pub Date : 2024-08-17 DOI: 10.1016/j.jbi.2024.104710

Tong Zhang , Shao-Wu Zhang , Ming-Yu Xie , Yan Li

Objective

Identifying cancer driver genes, especially rare or patient-specific cancer driver genes, is a primary goal in cancer therapy. Although researchers have proposed some methods to tackle this problem, these methods mostly identify cancer driver genes at single gene level, overlooking the cooperative relationship among cancer driver genes. Identifying cooperating cancer driver genes in individual patients is pivotal for understanding cancer etiology and advancing the development of personalized therapies.

Methods

Here, we propose a novel Personalized Cooperating cancer Driver Genes (PCoDG) method by using hypergraph random walk to identify the cancer driver genes that cooperatively drive individual patient cancer progression. By leveraging the powerful ability of hypergraph in representing multi-way relationships, PCoDG first employs the personalized hypergraph to depict the complex interactions among mutated genes and differentially expressed genes of an individual patient. Then, a hypergraph random walk algorithm based on hyperedge similarity is utilized to calculate the importance scores of mutated genes, integrating these scores with signaling pathway data to identify the cooperating cancer driver genes in individual patients.

Results

The experimental results on three TCGA cancer datasets (i.e., BRCA, LUAD, and COADREAD) demonstrate the effectiveness of PCoDG in identifying personalized cooperating cancer driver genes. These genes identified by PCoDG not only offer valuable insights into patient stratification correlating with clinical outcomes, but also provide an useful reference resource for tailoring personalized treatments.

Conclusion

We propose a novel method that can effectively identify cooperating cancer driver genes for individual patients, thereby deepening our understanding of the cooperative relationship among personalized cancer driver genes and advancing the development of precision oncology.

目的：确定癌症驱动基因，尤其是罕见或患者特异性癌症驱动基因，是癌症治疗的首要目标。虽然研究人员提出了一些方法来解决这一问题，但这些方法大多是从单个基因水平识别癌症驱动基因，忽略了癌症驱动基因之间的合作关系。方法：在此，我们提出了一种新型的个性化合作癌症驱动基因（PCoDG）方法，利用超图随机游走来识别合作驱动个体患者癌症进展的癌症驱动基因。PCoDG 利用超图在表示多向关系方面的强大能力，首先使用个性化超图来描述个体患者的突变基因和差异表达基因之间的复杂相互作用。然后，利用基于超边缘相似性的超图随机漫步算法计算突变基因的重要性得分，并将这些得分与信号通路数据整合，以确定个体患者中的合作癌症驱动基因：在三个 TCGA 癌症数据集（即 BRCA、LUAD 和 COADREAD）上的实验结果表明，PCoDG 在识别个性化合作癌症驱动基因方面非常有效。PCoDG 发现的这些基因不仅为患者分层提供了与临床结果相关的宝贵见解，还为定制个性化治疗提供了有用的参考资源：我们提出了一种新方法，它能有效识别个体患者的合作癌症驱动基因，从而加深我们对个性化癌症驱动基因之间合作关系的理解，推动精准肿瘤学的发展。

{"title":"Identifying cooperating cancer driver genes in individual patients through hypergraph random walk","authors":"Tong Zhang , Shao-Wu Zhang , Ming-Yu Xie , Yan Li","doi":"10.1016/j.jbi.2024.104710","DOIUrl":"10.1016/j.jbi.2024.104710","url":null,"abstract":"<div><h3>Objective</h3><p>Identifying cancer driver genes, especially rare or patient-specific cancer driver genes, is a primary goal in cancer therapy. Although researchers have proposed some methods to tackle this problem, these methods mostly identify cancer driver genes at single gene level, overlooking the cooperative relationship among cancer driver genes. Identifying cooperating cancer driver genes in individual patients is pivotal for understanding cancer etiology and advancing the development of personalized therapies.</p></div><div><h3>Methods</h3><p>Here, we propose a novel Personalized Cooperating cancer Driver Genes (PCoDG) method by using hypergraph random walk to identify the cancer driver genes that cooperatively drive individual patient cancer progression. By leveraging the powerful ability of hypergraph in representing multi-way relationships, PCoDG first employs the personalized hypergraph to depict the complex interactions among mutated genes and differentially expressed genes of an individual patient. Then, a hypergraph random walk algorithm based on hyperedge similarity is utilized to calculate the importance scores of mutated genes, integrating these scores with signaling pathway data to identify the cooperating cancer driver genes in individual patients.</p></div><div><h3>Results</h3><p>The experimental results on three TCGA cancer datasets (i.e., BRCA, LUAD, and COADREAD) demonstrate the effectiveness of PCoDG in identifying personalized cooperating cancer driver genes. These genes identified by PCoDG not only offer valuable insights into patient stratification correlating with clinical outcomes, but also provide an useful reference resource for tailoring personalized treatments.</p></div><div><h3>Conclusion</h3><p>We propose a novel method that can effectively identify cooperating cancer driver genes for individual patients, thereby deepening our understanding of the cooperative relationship among personalized cancer driver genes and advancing the development of precision oncology.</p></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"157 ","pages":"Article 104710"},"PeriodicalIF":4.0,"publicationDate":"2024-08-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142004329","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Investigation of bias in the automated assessment of school violence 调查校园暴力自动评估中的偏差。

IF 4 2区医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Biomedical Informatics

Pub Date : 2024-08-15 DOI: 10.1016/j.jbi.2024.104709

Lara J. Kanbar , Anagh Mishra , Alexander Osborn , Andrew Cifuentes , Jennifer Combs , Michael Sorter , Drew Barzman , Judith W. Dexheimer

Objectives

Natural language processing and machine learning have the potential to lead to biased predictions. We designed a novel Automated RIsk Assessment (ARIA) machine learning algorithm that assesses risk of violence and aggression in adolescents using natural language processing of transcribed student interviews. This work evaluated the possible sources of bias in the study design and the algorithm, tested how much of a prediction was explained by demographic covariates, and investigated the misclassifications based on demographic variables.

Methods

We recruited students 10–18 years of age and enrolled in middle or high schools in Ohio, Kentucky, Indiana, and Tennessee. The reference standard outcome was determined by a forensic psychiatrist as either a “high” or “low” risk level. ARIA used L2-regularized logistic regression to predict a risk level for each student using contextual and semantic features. We conducted three analyses: a PROBAST analysis of risk in study design; analysis of demographic variables as covariates; and a prediction analysis. Covariates were included in the linear regression analyses and comprised of race, sex, ethnicity, household education, annual household income, age at the time of visit, and utilization of public assistance.

Results

We recruited 412 students from 204 schools. ARIA performed with an AUC of 0.92, sensitivity of 71%, NPV of 77%, and specificity of 95%. Of these, 387 students with complete demographic information were included in the analysis. Individual linear regressions resulted in a coefficient of determination less than 0.08 across all demographic variables. When using all demographic variables to predict ARIA’s risk assessment score, the multiple linear regression model resulted in a coefficient of determination of 0.189. ARIA performed with a lower False Negative Rate (FNR) of 15.2% (CI [0 – 40]) for the Black subgroup and 12.7%, CI [0 – 41.4] for Other races, compared to an FNR of 26.1% (CI [14.1 – 41.8]) in the White subgroup.

Conclusions

Bias assessment is needed to address shortcomings within machine learning. In our work, student race, ethnicity, sex, use of public assistance, and annual household income did not explain ARIA’s risk assessment score of students. ARIA will continue to be evaluated regularly with increased subject recruitment.

目的：自然语言处理和机器学习有可能导致有偏差的预测。我们设计了一种新颖的自动风险评估（ARIA）机器学习算法，该算法通过对学生访谈记录进行自然语言处理来评估青少年的暴力和攻击风险。这项工作评估了研究设计和算法中可能存在的偏差来源，测试了人口统计学协变量对预测结果的解释程度，并调查了基于人口统计学变量的错误分类：我们招募了俄亥俄州、肯塔基州、印第安纳州和田纳西州 10-18 岁的初中或高中学生。参考标准结果由法医精神病学家确定为 "高 "或 "低 "风险水平。ARIA 采用 L2 规则化逻辑回归，利用上下文和语义特征预测每个学生的风险等级。我们进行了三项分析：研究设计中的风险 PROBAST 分析；作为协变量的人口统计学变量分析；以及预测分析。协变量包括种族、性别、民族、家庭教育程度、家庭年收入、就诊时的年龄以及公共援助的使用情况：我们从 204 所学校招募了 412 名学生。ARIA的AUC为0.92，灵敏度为71%，NPV为77%，特异度为95%。其中，387 名具有完整人口统计学信息的学生被纳入分析。在所有人口统计学变量中，单个线性回归的决定系数均小于 0.08。当使用所有人口统计学变量预测 ARIA 风险评估得分时，多元线性回归模型的决定系数为 0.189。黑人亚组的假阴性率（FNR）为 15.2%（CI [0 - 40]），其他种族的假阴性率（FNR）为 12.7%（CI [0 - 41.4]），而白人亚组的假阴性率（FNR）为 26.1%（CI [14.1 - 41.8]）：需要进行偏差评估，以解决机器学习中的不足。在我们的工作中，学生的种族、民族、性别、使用公共援助和家庭年收入并不能解释 ARIA 对学生的风险评估得分。我们将继续定期对 ARIA 进行评估，并增加实验对象的招募。

{"title":"Investigation of bias in the automated assessment of school violence","authors":"Lara J. Kanbar , Anagh Mishra , Alexander Osborn , Andrew Cifuentes , Jennifer Combs , Michael Sorter , Drew Barzman , Judith W. Dexheimer","doi":"10.1016/j.jbi.2024.104709","DOIUrl":"10.1016/j.jbi.2024.104709","url":null,"abstract":"<div><h3>Objectives</h3><p>Natural language processing and machine learning have the potential to lead to biased predictions. We designed a novel Automated RIsk Assessment (ARIA) machine learning algorithm that assesses risk of violence and aggression in adolescents using natural language processing of transcribed student interviews. This work evaluated the possible sources of bias in the study design and the algorithm, tested how much of a prediction was explained by demographic covariates, and investigated the misclassifications based on demographic variables.</p></div><div><h3>Methods</h3><p>We recruited students 10–18 years of age and enrolled in middle or high schools in Ohio, Kentucky, Indiana, and Tennessee. The reference standard outcome was determined by a forensic psychiatrist as either a “high” or “low” risk level. ARIA used L2-regularized logistic regression to predict a risk level for each student using contextual and semantic features. We conducted three analyses: a PROBAST analysis of risk in study design; analysis of demographic variables as covariates; and a prediction analysis. Covariates were included in the linear regression analyses and comprised of race, sex, ethnicity, household education, annual household income, age at the time of visit, and utilization of public assistance.</p></div><div><h3>Results</h3><p>We recruited 412 students from 204 schools. ARIA performed with an AUC of 0.92, sensitivity of 71%, NPV of 77%, and specificity of 95%. Of these, 387 students with complete demographic information were included in the analysis. Individual linear regressions resulted in a coefficient of determination less than 0.08 across all demographic variables. When using all demographic variables to predict ARIA’s risk assessment score, the multiple linear regression model resulted in a coefficient of determination of 0.189. ARIA performed with a lower False Negative Rate (FNR) of 15.2% (CI [0 – 40]) for the Black subgroup and 12.7%, CI [0 – 41.4] for Other races, compared to an FNR of 26.1% (CI [14.1 – 41.8]) in the White subgroup.</p></div><div><h3>Conclusions</h3><p>Bias assessment is needed to address shortcomings within machine learning. In our work, student race, ethnicity, sex, use of public assistance, and annual household income did not explain ARIA’s risk assessment score of students. ARIA will continue to be evaluated regularly with increased subject recruitment.</p></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"157 ","pages":"Article 104709"},"PeriodicalIF":4.0,"publicationDate":"2024-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141995770","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

On the role of the UMLS in supporting diagnosis generation proposed by Large Language Models 关于 UMLS 在支持大语言模型提出的诊断生成鉴别诊断中的作用。

IF 4 2区医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Biomedical Informatics

Pub Date : 2024-08-13 DOI: 10.1016/j.jbi.2024.104707

Majid Afshar , Yanjun Gao , Deepak Gupta , Emma Croxford , Dina Demner-Fushman

Objective:

Traditional knowledge-based and machine learning diagnostic decision support systems have benefited from integrating the medical domain knowledge encoded in the Unified Medical Language System (UMLS). The emergence of Large Language Models (LLMs) to supplant traditional systems poses questions of the quality and extent of the medical knowledge in the models’ internal knowledge representations and the need for external knowledge sources. The objective of this study is three-fold: to probe the diagnosis-related medical knowledge of popular LLMs, to examine the benefit of providing the UMLS knowledge to LLMs (grounding the diagnosis predictions), and to evaluate the correlations between human judgments and the UMLS-based metrics for generations by LLMs.

Methods:

We evaluated diagnoses generated by LLMs from consumer health questions and daily care notes in the electronic health records using the ConsumerQA and Problem Summarization datasets. Probing LLMs for the UMLS knowledge was performed by prompting the LLM to complete the diagnosis-related UMLS knowledge paths. Grounding the predictions was examined in an approach that integrated the UMLS graph paths and clinical notes in prompting the LLMs. The results were compared to prompting without the UMLS paths. The final experiments examined the alignment of different evaluation metrics, UMLS-based and non-UMLS, with human expert evaluation.

Results:

In probing the UMLS knowledge, GPT-3.5 significantly outperformed Llama2 and a simple baseline yielding an F1 score of 10.9% in completing one-hop UMLS paths for a given concept. Grounding diagnosis predictions with the UMLS paths improved the results for both models on both tasks, with the highest improvement (4%) in SapBERT score. There was a weak correlation between the widely used evaluation metrics (ROUGE and SapBERT) and human judgments.

Conclusion:

We found that while popular LLMs contain some medical knowledge in their internal representations, augmentation with the UMLS knowledge provides performance gains around diagnosis generation. The UMLS needs to be tailored for the task to improve the LLMs predictions. Finding evaluation metrics that are aligned with human judgments better than the traditional ROUGE and BERT-based scores remains an open research question.

目的：传统的基于知识和机器学习的诊断决策支持系统得益于整合了统一医学语言系统（UMLS）中编码的医学领域知识。大型语言模型（LLM）的出现取代了传统系统，这就提出了模型内部知识表征中医学知识的质量和范围以及对外部知识源的需求等问题。本研究的目的有三：探究流行 LLM 的诊断相关医学知识；研究向 LLM 提供 UMLS 知识的益处（为诊断预测提供基础）；评估人类判断与基于 UMLS 的 LLM 生成指标之间的相关性：我们使用消费者质量保证（ConsumerQA）和问题汇总（Problem Summarization）数据集，评估了 LLMs 根据消费者健康问题和电子健康记录中的日常护理记录生成的诊断。通过提示 LLM 完成与诊断相关的 UMLS 知识路径，对 LLM 的 UMLS 知识进行探测。在对 LLMs 进行提示时，采用了一种将 UMLS 图路径和临床笔记整合在一起的方法，对预测的基础进行了研究。实验结果与没有 UMLS 路径的提示进行了比较。最后的实验检验了不同评价指标（基于 UMLS 和非 UMLS）与人类专家评价的一致性：在探究 UMLS 知识方面，GPT-3.5 的表现明显优于 Llama2 和简单基线，在完成给定概念的一跳 UMLS 路径方面，GPT-3.5 的 F1 得分为 10.9%。以 UMLS 路径为基础的诊断预测提高了两个模型在两个任务中的结果，其中 SapBERT 分数的提高幅度最大（4%）。广泛使用的评价指标（ROUGE 和 SapBERT）与人类判断之间的相关性较弱：我们发现，虽然流行的 LLM 在其内部表示法中包含一些医学知识，但使用 UMLS 知识进行增强可提高诊断生成的性能。为了提高 LLMs 的预测能力，UMLS 需要针对任务进行定制。寻找比传统的基于 ROUGE 和 BERT 的分数更符合人类判断的评价指标，仍然是一个有待研究的问题。

{"title":"On the role of the UMLS in supporting diagnosis generation proposed by Large Language Models","authors":"Majid Afshar , Yanjun Gao , Deepak Gupta , Emma Croxford , Dina Demner-Fushman","doi":"10.1016/j.jbi.2024.104707","DOIUrl":"10.1016/j.jbi.2024.104707","url":null,"abstract":"<div><h3>Objective:</h3><p>Traditional knowledge-based and machine learning diagnostic decision support systems have benefited from integrating the medical domain knowledge encoded in the Unified Medical Language System (UMLS). The emergence of Large Language Models (LLMs) to supplant traditional systems poses questions of the quality and extent of the medical knowledge in the models’ internal knowledge representations and the need for external knowledge sources. The objective of this study is three-fold: to probe the diagnosis-related medical knowledge of popular LLMs, to examine the benefit of providing the UMLS knowledge to LLMs (grounding the diagnosis predictions), and to evaluate the correlations between human judgments and the UMLS-based metrics for generations by LLMs.</p></div><div><h3>Methods:</h3><p>We evaluated diagnoses generated by LLMs from consumer health questions and daily care notes in the electronic health records using the ConsumerQA and Problem Summarization datasets. Probing LLMs for the UMLS knowledge was performed by prompting the LLM to complete the diagnosis-related UMLS knowledge paths. Grounding the predictions was examined in an approach that integrated the UMLS graph paths and clinical notes in prompting the LLMs. The results were compared to prompting without the UMLS paths. The final experiments examined the alignment of different evaluation metrics, UMLS-based and non-UMLS, with human expert evaluation.</p></div><div><h3>Results:</h3><p>In probing the UMLS knowledge, GPT-3.5 significantly outperformed Llama2 and a simple baseline yielding an F1 score of 10.9% in completing one-hop UMLS paths for a given concept. Grounding diagnosis predictions with the UMLS paths improved the results for both models on both tasks, with the highest improvement (4%) in SapBERT score. There was a weak correlation between the widely used evaluation metrics (ROUGE and SapBERT) and human judgments.</p></div><div><h3>Conclusion:</h3><p>We found that while popular LLMs contain some medical knowledge in their internal representations, augmentation with the UMLS knowledge provides performance gains around diagnosis generation. The UMLS needs to be tailored for the task to improve the LLMs predictions. Finding evaluation metrics that are aligned with human judgments better than the traditional ROUGE and BERT-based scores remains an open research question.</p></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"157 ","pages":"Article 104707"},"PeriodicalIF":4.0,"publicationDate":"2024-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141982357","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Balancing the efforts of chart review and gains in PRS prediction accuracy: An empirical study 平衡病历审核的工作量与 PRS 预测准确性的提高：实证研究。

IF 4 2区医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Biomedical Informatics

Pub Date : 2024-08-10 DOI: 10.1016/j.jbi.2024.104705

Yuqing Lei , Adam Christian Naj , Hua Xu , Ruowang Li , Yong Chen

Objective

Phenotypic misclassification in genetic association analyses can impact the accuracy of PRS-based prediction models. The bias reduction method proposed by Tong et al. (2019) has demonstrated its efficacy in reducing the effects of bias on the estimation of association parameters between genotype and phenotype while minimizing variance by employing chart reviews on a subset of the data for validating phenotypes, however its improvement of subsequent PRS prediction accuracy remains unclear. Our study aims to fill this gap by assessing the performance of simulated PRS models and estimating the optimal number of chart reviews needed for validation.

Methods

To comprehensively assess the efficacy of the bias reduction method proposed by Tong et al. in enhancing the accuracy of PRS-based prediction models, we simulated each phenotype under different correlation structures (an independent model, a weakly correlated model, a strongly correlated model) and introduced error-prone phenotypes using two distinct error mechanisms (differential and non-differential phenotyping errors). To facilitate this, we used genotype and phenotype data from 12 case-control datasets in the Alzheimer’s Disease Genetics Consortium (ADGC) to produce simulated phenotypes. The evaluation included analyses across various misclassification rates of original phenotypes as well as quantities of validation set. Additionally, we determined the median threshold, identifying the minimal validation size required for a meaningful improvement in the accuracy of PRS-based predictions across a broad spectrum.

Results

This simulation study demonstrated that incorporating chart review does not universally guarantee enhanced performance of PRS-based prediction models. Specifically, in scenarios with minimal misclassification rates and limited validation sizes, PRS models utilizing debiased regression coefficients demonstrated inferior predictive capabilities compared to models using error-prone phenotypes. Put differently, the effectiveness of the bias reduction method is contingent upon the misclassification rates of phenotypes and the size of the validation set employed during chart reviews. Notably, when dealing with datasets featuring higher misclassification rates, the advantages of utilizing this bias reduction method become more evident, requiring a smaller validation set to achieve better performance.

Conclusion

This study highlights the importance of choosing an appropriate validation set size to balance between the efforts of chart review and the gain in PRS prediction accuracy. Consequently, our study establishes a valuable guidance for validation planning, across a diverse array of sensitivity and specificity combinations.

目的：遗传关联分析中的表型分类错误会影响基于 PRS 预测模型的准确性。Tong 等人（2019 年）提出的减少偏倚方法证明了其在减少偏倚对基因型和表型之间关联参数估计的影响方面的有效性，同时通过对验证表型的数据子集进行图表审查来最小化方差，但其对后续 PRS 预测准确性的改善效果仍不明确。我们的研究旨在通过评估模拟 PRS 模型的性能和估算验证所需的最佳图表审查数量来填补这一空白：为了全面评估 Tong 等人提出的减少偏倚方法在提高基于 PRS 预测模型准确性方面的功效，我们模拟了不同相关结构（独立模型、弱相关模型、强相关模型）下的每种表型，并使用两种不同的错误机制（差异表型错误和非差异表型错误）引入了易出错的表型。为此，我们使用了阿尔茨海默病遗传学联合会（ADGC）12 个病例对照数据集的基因型和表型数据来制作模拟表型。评估包括分析原始表型的各种误分类率以及验证集的数量。此外，我们还确定了中值阈值，确定了在广泛范围内有效提高基于 PRS 预测的准确性所需的最小验证规模：这项模拟研究表明，纳入病历审查并不能普遍保证基于 PRS 预测模型的性能得到提高。具体来说，在误分类率极低且验证规模有限的情况下，与使用易出错表型的模型相比，使用去偏回归系数的 PRS 模型的预测能力较差。换句话说，减少偏差方法的有效性取决于表型的误分类率和病历审核中使用的验证集的大小。值得注意的是，在处理误分类率较高的数据集时，使用这种减少偏差方法的优势会更加明显，需要较小的验证集来获得更好的性能：本研究强调了选择适当验证集大小的重要性，以在病历审核工作量和 PRS 预测准确率之间取得平衡。因此，我们的研究为各种灵敏度和特异性组合的验证规划提供了宝贵的指导。

{"title":"Balancing the efforts of chart review and gains in PRS prediction accuracy: An empirical study","authors":"Yuqing Lei , Adam Christian Naj , Hua Xu , Ruowang Li , Yong Chen","doi":"10.1016/j.jbi.2024.104705","DOIUrl":"10.1016/j.jbi.2024.104705","url":null,"abstract":"<div><h3>Objective</h3><p>Phenotypic misclassification in genetic association analyses can impact the accuracy of PRS-based prediction models. The bias reduction method proposed by Tong et al. (2019) has demonstrated its efficacy in reducing the effects of bias on the estimation of association parameters between genotype and phenotype while minimizing variance by employing chart reviews on a subset of the data for validating phenotypes, however its improvement of subsequent PRS prediction accuracy remains unclear. Our study aims to fill this gap by assessing the performance of simulated PRS models and estimating the optimal number of chart reviews needed for validation.</p></div><div><h3>Methods</h3><p>To comprehensively assess the efficacy of the bias reduction method proposed by Tong et al. in enhancing the accuracy of PRS-based prediction models, we simulated each phenotype under different correlation structures (an independent model, a weakly correlated model, a strongly correlated model) and introduced error-prone phenotypes using two distinct error mechanisms (differential and non-differential phenotyping errors). To facilitate this, we used genotype and phenotype data from 12 case-control datasets in the Alzheimer’s Disease Genetics Consortium (ADGC) to produce simulated phenotypes. The evaluation included analyses across various misclassification rates of original phenotypes as well as quantities of validation set. Additionally, we determined the median threshold, identifying the minimal validation size required for a meaningful improvement in the accuracy of PRS-based predictions across a broad spectrum.</p></div><div><h3>Results</h3><p>This simulation study demonstrated that incorporating chart review does not universally guarantee enhanced performance of PRS-based prediction models. Specifically, in scenarios with minimal misclassification rates and limited validation sizes, PRS models utilizing debiased regression coefficients demonstrated inferior predictive capabilities compared to models using error-prone phenotypes. Put differently, the effectiveness of the bias reduction method is contingent upon the misclassification rates of phenotypes and the size of the validation set employed during chart reviews. Notably, when dealing with datasets featuring higher misclassification rates, the advantages of utilizing this bias reduction method become more evident, requiring a smaller validation set to achieve better performance.</p></div><div><h3>Conclusion</h3><p>This study highlights the importance of choosing an appropriate validation set size to balance between the efforts of chart review and the gain in PRS prediction accuracy. Consequently, our study establishes a valuable guidance for validation planning, across a diverse array of sensitivity and specificity combinations.</p></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"157 ","pages":"Article 104705"},"PeriodicalIF":4.0,"publicationDate":"2024-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141971201","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A GPT-based EHR modeling system for unsupervised novel disease detection 基于 GPT 的电子病历建模系统，用于无监督新型疾病检测。

IF 4 2区医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Biomedical Informatics

Pub Date : 2024-08-08 DOI: 10.1016/j.jbi.2024.104706

Boran Hao , Yang Hu , William G. Adams , Sabrina A. Assoumou , Heather E. Hsu , Nahid Bhadelia , Ioannis Ch. Paschalidis

Objective

To develop an Artificial Intelligence (AI)-based anomaly detection model as a complement of an “astute physician” in detecting novel disease cases in a hospital and preventing emerging outbreaks.

Methods

Data included hospitalized patients (n = 120,714) at a safety-net hospital in Massachusetts. A novel Generative Pre-trained Transformer (GPT)-based clinical anomaly detection system was designed and further trained using Empirical Risk Minimization (ERM), which can model a hospitalized patient’s Electronic Health Records (EHR) and detect atypical patients. Methods and performance metrics, similar to the ones behind the recent Large Language Models (LLMs), were leveraged to capture the dynamic evolution of the patient’s clinical variables and compute an Out-Of-Distribution (OOD) anomaly score.

Results

In a completely unsupervised setting, hospitalizations for Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) infection could have been predicted by our GPT model at the beginning of the COVID-19 pandemic, with an Area Under the Receiver Operating Characteristic Curve (AUC) of 92.2 %, using 31 extracted clinical variables and a 3-day detection window. Our GPT achieves individual patient-level anomaly detection and mortality prediction AUC of 78.3 % and 94.7 %, outperforming traditional linear models by 6.6 % and 9 %, respectively. Different types of clinical trajectories of a SARS-CoV-2 infection are captured by our model to make interpretable detections, while a trend of over-pessimistic outcome prediction yields a more effective detection pathway. Furthermore, our comprehensive GPT model can potentially assist clinicians with forecasting patient clinical variables and developing personalized treatment plans.

Conclusion

This study demonstrates that an emerging outbreak can be accurately detected within a hospital, by using a GPT to model patient EHR time sequences and labeling them as anomalous when actual outcomes are not supported by the model. Such a GPT is also a comprehensive model with the functionality of generating future patient clinical variables, which can potentially assist clinicians in developing personalized treatment plans.

目的开发一种基于人工智能（AI）的异常检测模型，作为 "精明医生 "的辅助工具，检测医院中的新型疾病病例，预防新出现的疾病爆发：数据包括马萨诸塞州一家安全网医院的住院病人（n = 120,714）。设计了一种基于生成预训练变换器（GPT）的新型临床异常检测系统，并使用经验风险最小化（ERM）对其进行了进一步训练，该系统可对住院患者的电子健康记录（EHR）进行建模并检测出非典型患者。该系统采用了与最近的大型语言模型（LLM）类似的方法和性能指标，以捕捉患者临床变量的动态演变，并计算出异常分布（OOD）得分：在完全无监督的情况下，我们的GPT模型可以在COVID-19大流行初期预测严重急性呼吸系统综合症冠状病毒2（SARS-CoV-2）感染的住院情况，使用31个提取的临床变量和3天的检测窗口，接收者工作特征曲线下面积（AUC）为92.2%。我们的 GPT 在单个患者层面的异常检测和死亡率预测 AUC 分别达到 78.3% 和 94.7%，分别比传统线性模型高出 6.6% 和 9%。我们的模型捕捉到了 SARS-CoV-2 感染的不同类型临床轨迹，从而进行了可解释的检测，而过度悲观的结果预测趋势则产生了更有效的检测途径。此外，我们的综合 GPT 模型有可能帮助临床医生预测病人的临床变量，并制定个性化的治疗方案：本研究表明，通过使用 GPT 对患者电子病历时间序列建模，并在实际结果与模型不符时将其标记为异常，可以在医院内准确检测到新出现的疫情。这种 GPT 还是一种综合模型，具有生成未来病人临床变量的功能，可帮助临床医生制定个性化治疗方案。

{"title":"A GPT-based EHR modeling system for unsupervised novel disease detection","authors":"Boran Hao , Yang Hu , William G. Adams , Sabrina A. Assoumou , Heather E. Hsu , Nahid Bhadelia , Ioannis Ch. Paschalidis","doi":"10.1016/j.jbi.2024.104706","DOIUrl":"10.1016/j.jbi.2024.104706","url":null,"abstract":"<div><h3>Objective</h3><p>To develop an <em>Artificial Intelligence (AI)</em>-based anomaly detection model as a complement of an “astute physician” in detecting novel disease cases in a hospital and preventing emerging outbreaks<em>.</em></p></div><div><h3>Methods</h3><p>Data included hospitalized patients (n = 120,714) at a safety-net hospital in Massachusetts. A novel <em>Generative Pre-trained Transformer (GPT)</em>-based clinical anomaly detection system was designed and further trained using <em>Empirical Risk Minimization (ERM)</em>, which can model a hospitalized patient’s <em>Electronic Health Records (EHR)</em> and detect atypical patients. Methods and performance metrics, similar to the ones behind the recent <em>Large Language Models (LLMs)</em>, were leveraged to capture the dynamic evolution of the patient’s clinical variables and compute an <em>Out-Of-Distribution (OOD)</em> anomaly score.</p></div><div><h3>Results</h3><p>In a completely unsupervised setting, hospitalizations for <em>Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2)</em> infection could have been predicted by our GPT model at the beginning of the COVID-19 pandemic, with an Area Under the Receiver Operating Characteristic Curve (AUC) of 92.2 %, using 31 extracted clinical variables and a 3-day detection window. Our GPT achieves individual patient-level anomaly detection and mortality prediction AUC of 78.3 % and 94.7 %, outperforming traditional linear models by 6.6 % and 9 %, respectively. Different types of clinical trajectories of a SARS-CoV-2 infection are captured by our model to make interpretable detections, while a trend of over-pessimistic outcome prediction yields a more effective detection pathway. Furthermore, our comprehensive GPT model can potentially assist clinicians with forecasting patient clinical variables and developing personalized treatment plans.</p></div><div><h3>Conclusion</h3><p>This study demonstrates that an emerging outbreak can be accurately detected within a hospital, by using a GPT to model patient EHR time sequences and labeling them as anomalous when actual outcomes are not supported by the model. Such a GPT is also a comprehensive model with the functionality of generating future patient clinical variables, which can potentially assist clinicians in developing personalized treatment plans.</p></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"157 ","pages":"Article 104706"},"PeriodicalIF":4.0,"publicationDate":"2024-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141912806","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0