Xinmeng Zhang, Chao Yan, Yuyang Yang, Zhuohang Li, Yubo Feng, Bradley A Malin, You Chen
Electronic Health Record (EHR) audit log data are increasingly utilized for clinical tasks, from workflow modeling to predictive analyses of discharge events, adverse kidney outcomes, and hospital readmissions. These data encapsulate user-EHR interactions, reflecting both healthcare professionals' behavior and patients' health statuses. To harness this temporal information effectively, this study explores the application of Large Language Models (LLMs) to audit log data for clinical prediction tasks, focusing on discharge prediction. Using a year's worth of EHR data from Vanderbilt University Medical Center, we fine-tuned LLMs on 10,000 randomly selected training examples. Our findings reveal that LLaMA-2 70B, with an AUROC of 0.80 [0.77-0.82], outperforms both GPT-4 128K in a zero-shot setting, with an AUROC of 0.68 [0.65-0.71], and DeBERTa, with an AUROC of 0.78 [0.75-0.82]. Among the serialization methods evaluated, the first-occurrence approach, wherein only the initial appearance of each event in a sequence is retained, shows superior performance. Furthermore, for the fine-tuned LLaMA-2 70B, logit outputs yield a higher AUROC of 0.80 [0.77-0.82] than text outputs, with an AUROC of 0.69 [0.67-0.72]. This study underscores the potential of fine-tuned LLMs, particularly when combined with strategic sequence serialization, in advancing clinical prediction tasks.
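The abstract does not spell out how the first-occurrence serialization is implemented; a minimal sketch of the idea, with hypothetical audit-log event names, might look like this:

```python
def first_occurrence(events):
    """Keep only the first appearance of each event in an audit-log
    sequence, preserving the original order. This is a sketch of the
    'first-occurrence' serialization described above; the actual event
    vocabulary and text formatting used in the study are assumptions."""
    seen = set()
    kept = []
    for event in events:
        if event not in seen:
            seen.add(event)
            kept.append(event)
    return kept

# Example: a repetitive audit-log trace collapses to its distinct events.
trace = ["open_chart", "view_labs", "open_chart", "write_note", "view_labs"]
print(first_occurrence(trace))  # ['open_chart', 'view_labs', 'write_note']
```

Collapsing repeats this way shortens long interaction traces substantially, which matters when the serialized sequence must fit in a model's context window.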
Published as: "Optimizing Large Language Models for Discharge Prediction: Best Practices in Leveraging Electronic Health Record Audit Logs." AMIA Annual Symposium Proceedings 2024:1323-1331 (published 2025-05-22). Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099422/pdf/
Gary E Weissman, Rebecca A Hubbard, Blanca E Himes, Kelly L Goodman-O'Leary, Michael O Harhay, Jennifer C Ginestra, Rachel Kohn, Andrew J Admon, Stephanie Parks Taylor, Scott D Halpern
Many sepsis prediction models use the Sepsis-3 definition or its variants as a training label. However, among the few sepsis models ever deployed in practice, there is scant evidence that they offer clinically meaningful decision support at the bedside. As a potential mechanism to explain this limitation, we hypothesized that clinician-recommended treatment times for sepsis would diverge from the onset time defined by Sepsis-3. We conducted an electronic survey, using vignettes derived from eight real cases of sepsis, that was completed by 153 clinicians at three large and geographically diverse medical centers. After reviewing these vignettes, participants suggested antibiotic treatment start an average of 7.0 hours (95% confidence interval 5.3 to 8.8) before Sepsis-3-defined onset. Thus, predicting Sepsis-3 onset as a treatment prompt could lead to inappropriate and delayed treatment recommendations. Building predictive decision support systems that identify outcomes aligned with bedside decisions would increase their clinical utility.
Published as: "Sepsis Prediction Models are Trained on Labels that Diverge from Clinician-Recommended Treatment Times." AMIA Annual Symposium Proceedings 2024:1215-1224 (published 2025-05-22). Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099352/pdf/
Feng Chen, Manas Satish Bedmutha, Ray-Yuan Chung, Janice Sabin, Wanda Pratt, Brian R Wood, Nadir Weibel, Andrea L Hartzler, Trevor Cohen
Implicit bias can impede patient-provider interactions and lead to inequities in care. Raising awareness is key to reducing such bias, but its manifestations in the social dynamics of patient-provider communication are difficult to detect. In this study, we used automatic speech recognition (ASR) and natural language processing (NLP) to identify social signals in patient-provider interactions. We built an automated pipeline to predict social signals from audio recordings of 782 primary care visits that achieved 90.1% average accuracy across codes and exhibited fairness in its predictions for white and non-white patients. Applying this pipeline, we identified statistically significant differences in provider communication behavior toward white versus non-white patients. In particular, providers expressed more patient-centered behaviors towards white patients, including more warmth, engagement, and attentiveness. Our study underscores the potential of automated tools in identifying subtle communication signals that may be linked with bias and impact healthcare quality and equity.
Published as: "Toward Automated Detection of Biased Social Signals from the Content of Clinical Conversations." AMIA Annual Symposium Proceedings 2024:252-261 (published 2025-05-22). Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099337/pdf/
Causal inference, the task of estimating the causal effect of an exposure or interventional variable on an outcome from an observational dataset, requires precise and rigorous methods based on assumptions about the system under study. Such assumptions can be articulated as a causal diagram; however, the use of this technique in medicine is uncommon due to the challenges of causal diagram construction in high-dimensional settings. The recent introduction of cluster causal diagrams (C-DAGs) promises to ease the task of diagram construction by allowing the representation of some unknown or partially defined relationships. We evaluate the practical application of C-DAGs in simulated medical contexts. We estimate causal effects under varying sets of assumptions, determined by both causal diagrams and C-DAGs, and compare our results. Our findings show empirically similar results, with little discrepancy between causal effect sizes or variance across experimental runs, although estimation and efficiency challenges remain to be explored.
Tara V Anand, George Hripcsak. Published as: "Leveraging Cluster Causal Diagrams for Determining Causal Effects in Medicine." AMIA Annual Symposium Proceedings 2024:134-141 (published 2025-05-22). Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099438/pdf/
Guojun Tang, Jason E Black, Tyler S Williamson, Steve H Drew
The integration of Electronic Health Records (EHRs) and the application of machine learning present opportunities for enhancing the accuracy and accessibility of data-driven diabetes prediction. In particular, data-driven machine learning models can provide early identification of patients at high risk for diabetes, potentially leading to more effective therapeutic strategies and reduced healthcare costs. However, regulatory restrictions create barriers to developing centralized predictive models. This paper addresses these challenges by introducing a federated learning approach, which amalgamates predictive models without centralized data storage and processing, thus avoiding privacy issues. This marks the first application of federated learning to diabetes prediction using real clinical datasets in Canada, extracted from the Canadian Primary Care Sentinel Surveillance Network (CPCSSN), without cross-province patient data sharing. We address class-imbalance issues through downsampling techniques and compare federated learning performance against province-based and centralized models. Experimental results show that the federated MLP model achieves similar or higher performance than the model trained with the centralized approach, whereas the federated logistic regression model shows inferior performance compared to its centralized peer.
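The abstract does not describe the aggregation rule; the standard FedAvg scheme, which this kind of federated setup typically builds on, can be sketched as a size-weighted average of per-site model parameters (the flat list-of-floats parameter layout here is an illustrative assumption, not the paper's setup):

```python
def fed_avg(client_weights, client_sizes):
    """FedAvg sketch: combine per-client model parameters into a global
    model as a weighted average by local dataset size, so raw patient
    records never leave each site."""
    total = sum(client_sizes)
    n_params = len(client_weights[0])
    return [
        sum(w[i] * n for w, n in zip(client_weights, client_sizes)) / total
        for i in range(n_params)
    ]

# Two hypothetical provinces with 100 and 300 patients; the larger
# site dominates the averaged parameters.
global_w = fed_avg([[1.0, 0.0], [2.0, 4.0]], [100, 300])
print(global_w)  # [1.75, 3.0]
```

Only parameter vectors cross the site boundary, which is why this style of training sidesteps the data-sharing restrictions the abstract mentions.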
Published as: "Federated Diabetes Prediction in Canadian Adults Using Real-world Cross-Province Primary Care Data." AMIA Annual Symposium Proceedings 2024:1099-1108 (published 2025-05-22). Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099350/pdf/
This study uses the K-medoids clustering method to identify subtypes of Intensive Care Unit (ICU)-acquired acute kidney injury (AKI) patients based on serum electrolyte data. Clustering analysis identified three distinct AKI subtypes with different serum electrolyte characteristics. Descriptive analysis was then employed to characterize in-hospital mortality and the usage of renal replacement therapy, diuretics, and vasopressors in the three subtypes, and Chi-square tests were conducted to assess differences in prognosis and treatment among the identified subtypes. This study enables the subclassification of AKI patients in the ICU, helping ICU physicians make timely clinical decisions about AKI, and may ultimately contribute to improved patient outcomes.
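The study's features and distance metric are not given in the abstract; a minimal PAM-style K-medoids sketch on toy one-dimensional values (the electrolyte-like numbers and the absolute-difference distance are assumptions for illustration) shows the alternating assign/re-pick structure of the method:

```python
import random

def k_medoids(points, k, dist, n_iter=50, seed=0):
    """Minimal K-medoids (PAM-style) sketch. Unlike K-means, each
    cluster center is an actual data point, which keeps centers
    interpretable as representative patients."""
    rng = random.Random(seed)
    medoids = rng.sample(range(len(points)), k)
    for _ in range(n_iter):
        # Assign each point to its nearest medoid.
        clusters = {m: [] for m in medoids}
        for i, p in enumerate(points):
            nearest = min(medoids, key=lambda m: dist(p, points[m]))
            clusters[nearest].append(i)
        # Re-pick each medoid as the member minimizing total in-cluster distance.
        new_medoids = [
            min(idxs, key=lambda c: sum(dist(points[c], points[j]) for j in idxs))
            for idxs in clusters.values()
        ]
        if set(new_medoids) == set(medoids):
            break
        medoids = new_medoids
    return medoids, clusters

# Toy 1-D electrolyte-like values forming two obvious groups.
vals = [[3.0], [3.2], [3.1], [5.0], [5.2], [5.1]]
meds, groups = k_medoids(vals, 2, dist=lambda a, b: abs(a[0] - b[0]))
```

In practice one would use a maintained implementation (e.g. a library KMedoids class) on the full multi-electrolyte feature vectors rather than this sketch.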
Wentie Liu, Tongyue Shi, Haowei Xu, Huiying Zhao, Jianguo Hao, Guilan Kong. Published as: "Identifying acute kidney injury subtypes based on serum electrolyte data in ICU via K-medoids clustering." AMIA Annual Symposium Proceedings 2024:733-737 (published 2025-05-22). Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099402/pdf/
Diane Rizkallah, Neil L Greenberg, Rishabh Khurana, Vadivelan Palanisamy, Ben Alencherry, Carl Ammoury, Yezan Salam, Lisa Lamovsky, Haitham Fares, Robert Geschke, Richard Grimm, Christopher Nguyen, David Chen, Deborah H Kwon
Clinical reporting of cardiac magnetic resonance (CMR) imaging exams is commonly performed with a dictation approach, which requires great care to capture data that are both consistent and comprehensive. We sought to transform the reporting process by utilizing a structured report framework for reporting standardization, incorporating automated transfer of data from semi-automated segmentation tools for efficiency, and applying rule-based reporting requirements to improve quality and standardization. Interfaces between the applications used to schedule and protocol exams and to analyze the acquired images were created to bring the source information directly into the structured reporting environment. The physicians reporting CMR were surveyed to determine satisfaction and improved efficiency with the new process through self-reported reporting time. Quality improvement was assessed by examining the consistency of reported parameters with the inclusion of rule-based requirements. The designed structured reporting process, with automated measurements and rule-based requirements, resulted in significant improvement in report efficiency and quality.
Published as: "Impact of Automated Transfer of Semi-Automated Segmentation and Structured Report Rule Requirements on Cardiac MRI Report Quality, Standardization, and Efficiency." AMIA Annual Symposium Proceedings 2024:950-959 (published 2025-05-22). Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099410/pdf/
Suraj Sood, Jawad S Shah, Saeed Alqarn, Yugyung Lee
Building on the success of the Segment Anything Model (SAM) in image segmentation, "PathSAM: SAM for Pathological Images in Oral Cancer Detection" addresses the unique challenges associated with diagnosing oral cancer. Although SAM is versatile, its application to pathological images is hindered by their inherent complexity and variability. PathSAM advances beyond traditional deep-learning methods by delivering superior accuracy and detail in segmenting critical datasets like ORCA and OCDC, as demonstrated through both quantitative and qualitative evaluations. The integration of Large Language Models (LLMs) further enhances PathSAM by providing clear, interpretable segmentation results, facilitating accurate tumor identification, and improving communication between patients and healthcare providers. This innovation positions PathSAM as a valuable tool in medical diagnostics.
Published as: "PathSAM: Enhancing Oral Cancer Detection with Advanced Segmentation and Explainability." AMIA Annual Symposium Proceedings 2024:1069-1078 (published 2025-05-22). Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099372/pdf/
Hanna Kiani, Sohaib Hassan, Julian Z Genkins, Jasmine Bilir, Julia Kadie, Tran Le, Jo-Anne Suffoletto, Jonathan H Chen
Patient portal messages represent a unique source of clinical data: they capture the voice of the patient, provide a glimpse into care delivery between episodic synchronous appointments, and reflect variations in patient behavior and health literacy. There is little understanding of how best to apply modern natural language processing (NLP) approaches, such as large, pre-trained language models (LLMs), to patient messages. In this study, we explore different approaches to incorporating patient messages into an existing Emergency Department (ED) visit risk prediction model currently deployed at Stanford Health Care. With the addition of patient message frequencies to the baseline, we achieved an improved AUC of 0.77 and a gain in the F1 score. In future work, we aim to build upon these findings and test combination models that incorporate features of patient message content in addition to message frequencies.
Published as: "Improving Emergency Department Visit Risk Prediction: Exploring the Operational Utility of Applied Patient Portal Messages." AMIA Annual Symposium Proceedings 2024:610-619 (published 2025-05-22). Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099376/pdf/
Mapping electronic health record (EHR) data to common data models (CDMs) enables the standardization of clinical records, enhancing interoperability and enabling large-scale, multi-centered clinical investigations. Using two large publicly available datasets, we developed transformer-based natural language processing models to map medication-related concepts from the EHR at a large and diverse healthcare system to standard concepts in the OMOP CDM. We validated the model outputs against standard concepts manually mapped by clinicians. Our best model reached out-of-box accuracies of 96.5% in mapping the 200 most common drugs and 83.0% in mapping 200 random drugs in the EHR. For these tasks, this model outperformed a state-of-the-art large language model (SFR-Embedding-Mistral, 89.5% and 66.5% in accuracy for the two tasks), a widely used software for schema mapping (Usagi, 90.0% and 70.0% in accuracy), and direct string match (7.5% and 7.5% accuracy). Transformer-based deep learning models outperform existing approaches in the standardized mapping of EHR elements and can facilitate an end-to-end automated EHR transformation pipeline.
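At inference time, embedding-based schema mapping of this kind typically reduces to a nearest-neighbor search in embedding space. A toy sketch (the two-dimensional vectors and concept names are illustrative assumptions; the study's sentence-transformer encoder is assumed, not reproduced):

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    num = sum(a * b for a, b in zip(u, v))
    den = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return num / den

def map_to_standard(source_vec, standard):
    """Map one embedded EHR drug string to the OMOP standard concept
    whose embedding is most cosine-similar."""
    return max(standard, key=lambda name: cosine(source_vec, standard[name]))

# Toy embeddings: the source string sits closest to 'acetaminophen'.
concepts = {"acetaminophen": [0.9, 0.1], "ibuprofen": [0.1, 0.9]}
print(map_to_standard([0.8, 0.2], concepts))  # acetaminophen
```

This also makes the reported string-match baseline's weakness concrete: exact string comparison gets no credit for near-matches, while similarity in embedding space does.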
Xinyu Zhou, Lovedeep Singh Dhingra, Arya Aminorroaya, Philip Adejumo, Rohan Khera. Published as: "A Novel Sentence Transformer-based Natural Language Processing Approach for Schema Mapping of Electronic Health Records to the OMOP Common Data Model." AMIA Annual Symposium Proceedings 2024:1332-1339 (published 2025-05-22). Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099400/pdf/