首页 > 最新文献

JMIR Medical Informatics最新文献

英文 中文
Application of Information Link Control in Surgical Specimen Near-Miss Events in a South China Hospital: Nonrandomized Controlled Study. 信息链接控制在华南某医院手术标本近失事件中的应用:非随机对照研究
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-10-14 DOI: 10.2196/52722
Tingting Chen, Xiaofen Tang, Min Xu, Yue Jiang, Fengyan Zheng

Background: Information control is a promising approach for managing surgical specimens. However, there is limited research evidence on surgical near misses. This is particularly true in the closed loop of information control for each link.

Objective: A new model of surgical specimen process management is further constructed, and a safe operating room nursing practice environment is created by intercepting specimen near-miss events through information safety barriers.

Methods: In a large hospital in China, 84,289 surgical specimens collected in the conventional information specimen management mode from January to December 2021 were selected as the control group, and 99,998 surgical specimens collected in the information safety barrier control surgical specimen management mode from January to December 2022 were selected as the improvement group. The incidence of near misses, the qualified rate of pathological specimen fixation, and the average time required for specimen fixation were compared under the 2 management modes. The causes of 2 groups of near misses were analyzed and the near misses of information safety barrier control surgical specimens were studied.

Results: Under the information-based safety barrier control surgical specimen management model, the incidence of adverse events in surgical specimens was reduced, the reporting of near-miss events in surgical specimens was improved by 100%, the quality control quality management of surgical specimens was effectively improved, the pass rate of surgical pathology specimen fixation was improved, and the meantime for surgical specimen fixation was shortened, with differences considered statistically significant at P<.05.

Conclusions: Our research has developed a new mode of managing the surgical specimen process. This mode can prevent errors in approaching specimens by implementing information security barriers, thereby enhancing the quality of specimen management, ensuring the safety of medical procedures, and improving the quality of hospital services.

背景:信息控制是一种很有前景的手术标本管理方法。然而,有关手术险情的研究证据却很有限。在每个环节的信息控制闭环中尤其如此:进一步构建手术标本流程管理的新模式,通过信息安全屏障拦截标本近失事件,创造安全的手术室护理实践环境:在国内某大型医院选取2021年1月至12月常规信息标本管理模式下采集的84289例手术标本作为对照组,2022年1月至12月信息安全屏障控制手术标本管理模式下采集的99998例手术标本作为改进组。比较两种管理模式下的险情发生率、病理标本固定合格率和标本固定平均所需时间。分析了两组险情发生的原因,并对信息化安全屏障控制手术标本的险情进行了研究:结果:在信息化安全屏障控制手术标本管理模式下,手术标本不良事件发生率降低,手术标本近失事件报告率提高了100%,手术标本质控质量管理水平得到有效提高,手术病理标本固定合格率提高,手术标本固定时间缩短,PConclusions认为差异有统计学意义:我们的研究开发了一种新的手术标本流程管理模式。这种模式可以通过实施信息安全屏障来防止标本接近中的错误,从而提高标本管理的质量,确保医疗过程的安全,改善医院的服务质量。
{"title":"Application of Information Link Control in Surgical Specimen Near-Miss Events in a South China Hospital: Nonrandomized Controlled Study.","authors":"Tingting Chen, Xiaofen Tang, Min Xu, Yue Jiang, Fengyan Zheng","doi":"10.2196/52722","DOIUrl":"10.2196/52722","url":null,"abstract":"<p><strong>Background: </strong>Information control is a promising approach for managing surgical specimens. However, there is limited research evidence on surgical near misses. This is particularly true in the closed loop of information control for each link.</p><p><strong>Objective: </strong>A new model of surgical specimen process management is further constructed, and a safe operating room nursing practice environment is created by intercepting specimen near-miss events through information safety barriers.</p><p><strong>Methods: </strong>In a large hospital in China, 84,289 surgical specimens collected in the conventional information specimen management mode from January to December 2021 were selected as the control group, and 99,998 surgical specimens collected in the information safety barrier control surgical specimen management mode from January to December 2022 were selected as the improvement group. The incidence of near misses, the qualified rate of pathological specimen fixation, and the average time required for specimen fixation were compared under the 2 management modes. The causes of 2 groups of near misses were analyzed and the near misses of information safety barrier control surgical specimens were studied.</p><p><strong>Results: </strong>Under the information-based safety barrier control surgical specimen management model, the incidence of adverse events in surgical specimens was reduced, the reporting of near-miss events in surgical specimens was improved by 100%, the quality control quality management of surgical specimens was effectively improved, the pass rate of surgical pathology specimen fixation was improved, and the meantime for surgical specimen fixation was shortened, with differences considered statistically significant at P<.05.</p><p><strong>Conclusions: </strong>Our research has developed a new mode of managing the surgical specimen process. This mode can prevent errors in approaching specimens by implementing information security barriers, thereby enhancing the quality of specimen management, ensuring the safety of medical procedures, and improving the quality of hospital services.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"12 ","pages":"e52722"},"PeriodicalIF":3.1,"publicationDate":"2024-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11492967/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142482056","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Health Care Worker Usage of Large-Scale Health Information Exchanges in Japan: User-Level Audit Log Analysis Study. 日本医护人员使用大规模医疗信息交换的情况:用户级审计日志分析研究》。
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-10-09 DOI: 10.2196/56263
Jun Suzumoto, Yukiko Mori, Tomohiro Kuroda

Background: Over 200 health information exchanges (HIEs) are currently operational in Japan. The most common feature of HIEs is remote on-demand viewing or searching of aggregated patient health data from multiple institutions. However, the usage of this feature by individual users and institutions remains unknown.

Objective: This study aims to understand usage of the on-demand patient data viewing feature of large-scale HIEs by individual health care workers and institutions in Japan.

Methods: We conducted audit log analyses of large-scale HIEs. The research subjects were HIEs connected to over 100 institutions and with over 10,000 patients. Each health care worker's profile and audit log data for HIEs were collected. We conducted four types of analyses on the extracted audit log. First, we calculated the ratio of the number of days of active HIE use for each hospital-affiliated doctor account. Second, we calculated cumulative monthly usage days of HIEs by each institution in financial year (FY) 2021/22. Third, we calculated each facility type's monthly active institution ratio in FY2021/22. Fourth, we compared the monthly active institution ratio by medical institution for each HIE and the proportion of cumulative usage days by user type for each HIE.

Results: We identified 24 HIEs as candidates for data collection and we analyzed data from 7 HIEs. Among hospital doctors, 93.5% (7326/7833) had never used HIEs during the available period in FY2021/22, while 19 doctors used them at least 30% of days. The median (IQR) monthly active institution ratios were 0.482 (0.470-0.487) for hospitals, 0.243 (0.230-0.247) for medical clinics, and 0.030 (0.024-0.048) for dental clinics. In 51.9% (1781/3434) of hospitals, the cumulative monthly usage days of HIEs was 0, while in 26.8% (921/3434) of hospitals, it was between 1 and 10, and in 3% (103/3434) of hospitals, it was 100 or more. The median (IQR) monthly active institution ratio in medical institutions was 0.511 (0.487-0.529) for the most used HIE and 0.109 (0.0927-0.117) for the least used. The proportion of cumulative usage days of HIE by user type was complex for each HIE, and no consistent trends could be discerned.

Conclusions: In the large-scale HIEs surveyed in this study, the overall usage of the on-demand patient data viewing feature was low, consistent with past official reports. User-level analyses of audit logs revealed large disparities in the number of days of HIE use among health care workers and institutions. There were also large disparities in HIE use by facility type or HIE; the percentage of cumulative HIE usage days by user type also differed by HIE. This study indicates the need for further research into why there are large disparities in demand for HIEs in Japan as well as the need to design comprehensive audit logs that can be matched with other official datasets.

背景:日本目前有 200 多个健康信息交换系统(HIE)在运行。HIE 最常见的功能是远程按需查看或搜索来自多个机构的汇总病人健康数据。然而,个人用户和医疗机构对这一功能的使用情况仍不得而知:本研究旨在了解日本医护人员个人和机构对大型 HIE 的按需查看患者数据功能的使用情况:我们对大型 HIE 进行了审计日志分析。研究对象是与 100 多家机构连接、拥有 10,000 多名患者的 HIE。我们收集了每位医护人员的个人资料和 HIE 的审计日志数据。我们对提取的审计日志进行了四种分析。首先,我们计算了每个医院附属医生账户的 HIE 有效使用天数比率。其次,我们计算了 2021/22 财政年度(FY)各机构每月使用 HIE 的累计天数。第三,我们计算了 2021/22 财政年度各设施类型的每月活跃机构比率。第四,我们比较了各医疗机构在每个医疗信息基础设施中的每月活跃机构比率和各医疗信息基础设施中按用户类型划分的累计使用天数比例:我们确定了 24 个 HIE 作为数据收集的候选机构,并对 7 个 HIE 的数据进行了分析。在医院医生中,93.5%(7326/7833)的医生在 2021/22 财政年度的可用期间从未使用过 HIE,而 19 名医生至少有 30% 的天数使用过 HIE。医院每月活跃机构比率的中位数(IQR)为 0.482(0.470-0.487),医疗诊所为 0.243(0.230-0.247),牙科诊所为 0.030(0.024-0.048)。51.9%(1781/3434)的医院每月使用HIE的累计天数为0,26.8%(921/3434)的医院为1至10天,3%(103/3434)的医院为100天或以上。在医疗机构中,使用最多的 HIE 每月活跃机构比例的中位数(IQR)为 0.511(0.487-0.529),使用最少的为 0.109(0.0927-0.117)。按用户类型划分的 HIE 累计使用天数比例在每个 HIE 中都很复杂,无法发现一致的趋势:在本研究调查的大型 HIE 中,按需查看病人数据功能的总体使用率较低,这与过去的官方报告一致。对审计日志进行的用户层面分析表明,医护人员和医疗机构在使用 HIE 的天数上存在巨大差异。不同机构类型或 HIE 在使用 HIE 方面也存在巨大差异;不同 HIE 的用户类型在累计 HIE 使用天数中所占的百分比也不尽相同。这项研究表明,有必要进一步研究日本对 HIE 的需求存在巨大差异的原因,以及设计可与其他官方数据集进行匹配的全面审计日志的必要性。
{"title":"Health Care Worker Usage of Large-Scale Health Information Exchanges in Japan: User-Level Audit Log Analysis Study.","authors":"Jun Suzumoto, Yukiko Mori, Tomohiro Kuroda","doi":"10.2196/56263","DOIUrl":"10.2196/56263","url":null,"abstract":"<p><strong>Background: </strong>Over 200 health information exchanges (HIEs) are currently operational in Japan. The most common feature of HIEs is remote on-demand viewing or searching of aggregated patient health data from multiple institutions. However, the usage of this feature by individual users and institutions remains unknown.</p><p><strong>Objective: </strong>This study aims to understand usage of the on-demand patient data viewing feature of large-scale HIEs by individual health care workers and institutions in Japan.</p><p><strong>Methods: </strong>We conducted audit log analyses of large-scale HIEs. The research subjects were HIEs connected to over 100 institutions and with over 10,000 patients. Each health care worker's profile and audit log data for HIEs were collected. We conducted four types of analyses on the extracted audit log. First, we calculated the ratio of the number of days of active HIE use for each hospital-affiliated doctor account. Second, we calculated cumulative monthly usage days of HIEs by each institution in financial year (FY) 2021/22. Third, we calculated each facility type's monthly active institution ratio in FY2021/22. Fourth, we compared the monthly active institution ratio by medical institution for each HIE and the proportion of cumulative usage days by user type for each HIE.</p><p><strong>Results: </strong>We identified 24 HIEs as candidates for data collection and we analyzed data from 7 HIEs. Among hospital doctors, 93.5% (7326/7833) had never used HIEs during the available period in FY2021/22, while 19 doctors used them at least 30% of days. The median (IQR) monthly active institution ratios were 0.482 (0.470-0.487) for hospitals, 0.243 (0.230-0.247) for medical clinics, and 0.030 (0.024-0.048) for dental clinics. In 51.9% (1781/3434) of hospitals, the cumulative monthly usage days of HIEs was 0, while in 26.8% (921/3434) of hospitals, it was between 1 and 10, and in 3% (103/3434) of hospitals, it was 100 or more. The median (IQR) monthly active institution ratio in medical institutions was 0.511 (0.487-0.529) for the most used HIE and 0.109 (0.0927-0.117) for the least used. The proportion of cumulative usage days of HIE by user type was complex for each HIE, and no consistent trends could be discerned.</p><p><strong>Conclusions: </strong>In the large-scale HIEs surveyed in this study, the overall usage of the on-demand patient data viewing feature was low, consistent with past official reports. User-level analyses of audit logs revealed large disparities in the number of days of HIE use among health care workers and institutions. There were also large disparities in HIE use by facility type or HIE; the percentage of cumulative HIE usage days by user type also differed by HIE. This study indicates the need for further research into why there are large disparities in demand for HIEs in Japan as well as the need to design comprehensive audit logs that can be matched with other official datasets.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"12 ","pages":"e56263"},"PeriodicalIF":3.1,"publicationDate":"2024-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11481819/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142395601","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing the Functionalities of Personal Health Record Systems: Empirical Study Based on the HL7 Personal Health Record System Functional Model Release 1. 增强个人健康记录系统的功能:基于 HL7 个人健康记录系统功能模型第 1 版的实证研究。
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-10-09 DOI: 10.2196/56735
Teng Cao, Zhi Chen, Masaharu Nakayama

Background: The increasing demand for personal health record (PHR) systems is driven by individuals' desire to actively manage their health care. However, the limited functionality of current PHR systems has affected users' willingness to adopt them, leading to lower-than-expected usage rates. The HL7 (Health Level Seven) PHR System Functional Model (PHR-S FM) was proposed to address this issue, outlining all possible functionalities in PHR systems. Although the PHR-S FM provides a comprehensive theoretical framework, its practical effectiveness and applicability have not been fully explored.

Objective: This study aimed to design and develop a tethered PHR prototype in accordance with the guidelines of the PHR-S FM. It sought to explore the feasibility of applying the PHR-S FM in PHR systems by comparing the prototype with the results of previous research.

Methods: The PHR-S FM profile was defined to meet broad clinical data management requirements based on previous research. We designed and developed a PHR prototype as a web application using the Fast Healthcare Interoperability Resources R4 (FHIR) and Logical Observation Identifiers Names and Codes (LOINC) coding system for interoperability and data consistency. We validated the prototype using the Synthea dataset, which provided realistic synthetic medical records. In addition, we compared the results produced by the prototype with those of previous studies to evaluate the feasibility and implementation of the PHR-S FM framework.

Results: The PHR prototype was developed based on the PHR-S FM profile. We verified its functionality by demonstrating its ability to synchronize data with the FHIR server, effectively managing and displaying various health data types. Validation using the Synthea dataset confirmed the prototype's accuracy, achieving 100% coverage across 1157 data items. A comparison with the findings of previous studies indicated the feasibility of implementing the PHR-S FM and highlighted areas for future research and improvements.

Conclusions: The results of this study offer valuable insights into the potential for practical application and broad adoption of the PHR-S FM in real-world health care settings.

背景:个人健康记录(PHR)系统的需求日益增长,其原因是个人希望主动管理自己的医疗保健。然而,目前的个人健康记录系统功能有限,影响了用户的使用意愿,导致使用率低于预期。为了解决这一问题,人们提出了 HL7(健康七级)个人健康记录系统功能模型(PHR-S FM),概述了个人健康记录系统中所有可能的功能。虽然 PHR-S FM 提供了一个全面的理论框架,但其实际有效性和适用性尚未得到充分探讨:本研究旨在根据 PHR-S FM 的指导方针设计和开发一个系留式 PHR 原型。本研究旨在根据 PHR-S FM 的指导方针设计和开发一个系留式个人健康记录仪原型,并通过将该原型与之前的研究成果进行比较,探索在个人健康记录仪系统中应用 PHR-S FM 的可行性:方法:PHR-S FM 配置文件是在先前研究的基础上为满足广泛的临床数据管理要求而定义的。我们使用快速医疗互操作性资源 R4(FHIR)和逻辑观察标识符名称和代码(LOINC)编码系统设计并开发了一个 PHR 原型,作为一个网络应用程序,以实现互操作性和数据一致性。我们使用 Synthea 数据集对原型进行了验证,该数据集提供了真实的合成医疗记录。此外,我们还将原型产生的结果与之前的研究结果进行了比较,以评估 PHR-S FM 框架的可行性和实施情况:PHR 原型是基于 PHR-S FM 配置文件开发的。我们通过演示其与 FHIR 服务器同步数据、有效管理和显示各种健康数据类型的能力来验证其功能。使用 Synthea 数据集进行的验证证实了原型的准确性,在 1157 个数据项中实现了 100% 的覆盖率。与以往研究结果的比较表明了实施 PHR-S FM 的可行性,并强调了未来研究和改进的领域:本研究的结果为 PHR-S FM 在现实医疗环境中的实际应用和广泛采用提供了宝贵的见解。
{"title":"Enhancing the Functionalities of Personal Health Record Systems: Empirical Study Based on the HL7 Personal Health Record System Functional Model Release 1.","authors":"Teng Cao, Zhi Chen, Masaharu Nakayama","doi":"10.2196/56735","DOIUrl":"10.2196/56735","url":null,"abstract":"<p><strong>Background: </strong>The increasing demand for personal health record (PHR) systems is driven by individuals' desire to actively manage their health care. However, the limited functionality of current PHR systems has affected users' willingness to adopt them, leading to lower-than-expected usage rates. The HL7 (Health Level Seven) PHR System Functional Model (PHR-S FM) was proposed to address this issue, outlining all possible functionalities in PHR systems. Although the PHR-S FM provides a comprehensive theoretical framework, its practical effectiveness and applicability have not been fully explored.</p><p><strong>Objective: </strong>This study aimed to design and develop a tethered PHR prototype in accordance with the guidelines of the PHR-S FM. It sought to explore the feasibility of applying the PHR-S FM in PHR systems by comparing the prototype with the results of previous research.</p><p><strong>Methods: </strong>The PHR-S FM profile was defined to meet broad clinical data management requirements based on previous research. We designed and developed a PHR prototype as a web application using the Fast Healthcare Interoperability Resources R4 (FHIR) and Logical Observation Identifiers Names and Codes (LOINC) coding system for interoperability and data consistency. We validated the prototype using the Synthea dataset, which provided realistic synthetic medical records. In addition, we compared the results produced by the prototype with those of previous studies to evaluate the feasibility and implementation of the PHR-S FM framework.</p><p><strong>Results: </strong>The PHR prototype was developed based on the PHR-S FM profile. We verified its functionality by demonstrating its ability to synchronize data with the FHIR server, effectively managing and displaying various health data types. Validation using the Synthea dataset confirmed the prototype's accuracy, achieving 100% coverage across 1157 data items. A comparison with the findings of previous studies indicated the feasibility of implementing the PHR-S FM and highlighted areas for future research and improvements.</p><p><strong>Conclusions: </strong>The results of this study offer valuable insights into the potential for practical application and broad adoption of the PHR-S FM in real-world health care settings.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"12 ","pages":"e56735"},"PeriodicalIF":3.1,"publicationDate":"2024-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11481820/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142395600","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Use of SNOMED CT in Large Language Models: Scoping Review. 在大型语言模型中使用 SNOMED CT:范围审查。
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-10-07 DOI: 10.2196/62924
Eunsuk Chang, Sumi Sung
<p><strong>Background: </strong>Large language models (LLMs) have substantially advanced natural language processing (NLP) capabilities but often struggle with knowledge-driven tasks in specialized domains such as biomedicine. Integrating biomedical knowledge sources such as SNOMED CT into LLMs may enhance their performance on biomedical tasks. However, the methodologies and effectiveness of incorporating SNOMED CT into LLMs have not been systematically reviewed.</p><p><strong>Objective: </strong>This scoping review aims to examine how SNOMED CT is integrated into LLMs, focusing on (1) the types and components of LLMs being integrated with SNOMED CT, (2) which contents of SNOMED CT are being integrated, and (3) whether this integration improves LLM performance on NLP tasks.</p><p><strong>Methods: </strong>Following the PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews) guidelines, we searched ACM Digital Library, ACL Anthology, IEEE Xplore, PubMed, and Embase for relevant studies published from 2018 to 2023. Studies were included if they incorporated SNOMED CT into LLM pipelines for natural language understanding or generation tasks. Data on LLM types, SNOMED CT integration methods, end tasks, and performance metrics were extracted and synthesized.</p><p><strong>Results: </strong>The review included 37 studies. Bidirectional Encoder Representations from Transformers and its biomedical variants were the most commonly used LLMs. Three main approaches for integrating SNOMED CT were identified: (1) incorporating SNOMED CT into LLM inputs (28/37, 76%), primarily using concept descriptions to expand training corpora; (2) integrating SNOMED CT into additional fusion modules (5/37, 14%); and (3) using SNOMED CT as an external knowledge retriever during inference (5/37, 14%). The most frequent end task was medical concept normalization (15/37, 41%), followed by entity extraction or typing and classification. While most studies (17/19, 89%) reported performance improvements after SNOMED CT integration, only a small fraction (19/37, 51%) provided direct comparisons. The reported gains varied widely across different metrics and tasks, ranging from 0.87% to 131.66%. However, some studies showed either no improvement or a decline in certain performance metrics.</p><p><strong>Conclusions: </strong>This review demonstrates diverse approaches for integrating SNOMED CT into LLMs, with a focus on using concept descriptions to enhance biomedical language understanding and generation. While the results suggest potential benefits of SNOMED CT integration, the lack of standardized evaluation methods and comprehensive performance reporting hinders definitive conclusions about its effectiveness. Future research should prioritize consistent reporting of performance comparisons and explore more sophisticated methods for incorporating SNOMED CT's relational structure into LLMs. In addition, the biomedical NLP commun
背景:大型语言模型(LLMs)具有非常先进的自然语言处理(NLP)能力,但在处理生物医学等专业领域的知识驱动型任务时却往往力不从心。将 SNOMED CT 等生物医学知识源整合到 LLM 中可以提高它们在生物医学任务中的性能。然而,将 SNOMED CT 纳入 LLMs 的方法和效果尚未得到系统回顾:本范围综述旨在研究如何将 SNOMED CT 整合到 LLM 中,重点关注:(1)与 SNOMED CT 整合的 LLM 的类型和组成部分;(2)SNOMED CT 的哪些内容被整合;以及(3)这种整合是否提高了 LLM 在 NLP 任务中的表现:按照 PRISMA-ScR(系统综述和元分析的首选报告项目,范围综述的扩展)指南,我们检索了 ACM 数字图书馆、ACL 文集、IEEE Xplore、PubMed 和 Embase,以查找 2018 年至 2023 年发表的相关研究。如果研究将 SNOMED CT 纳入了用于自然语言理解或生成任务的 LLM 管道,则纳入这些研究。对 LLM 类型、SNOMED CT 整合方法、终端任务和性能指标的数据进行了提取和综合:综述包括 37 项研究。来自变换器的双向编码器表示法及其生物医学变体是最常用的 LLM。确定了整合 SNOMED CT 的三种主要方法:(1) 将 SNOMED CT 纳入 LLM 输入(28/37,76%),主要使用概念描述来扩展训练语料库;(2) 将 SNOMED CT 纳入额外的融合模块(5/37,14%);(3) 在推理过程中将 SNOMED CT 用作外部知识检索器(5/37,14%)。最常见的最终任务是医学概念规范化(15/37,41%),其次是实体提取或类型化和分类。虽然大多数研究(17/19,89%)都报告了集成 SNOMED CT 后性能的提高,但只有一小部分研究(19/37,51%)提供了直接比较。在不同的指标和任务中,所报告的提高幅度差别很大,从 0.87% 到 131.66% 不等。然而,一些研究表明,某些性能指标要么没有改善,要么有所下降:本综述展示了将 SNOMED CT 整合到 LLM 中的各种方法,重点是使用概念描述来增强生物医学语言的理解和生成。虽然研究结果表明 SNOMED CT 整合具有潜在的优势,但由于缺乏标准化的评估方法和全面的性能报告,因此无法对其有效性得出明确的结论。未来的研究应优先考虑性能比较的一致性报告,并探索将 SNOMED CT 的关系结构纳入 LLM 的更复杂方法。此外,生物医学 NLP 界应开发标准化的评估框架,以更好地评估本体集成对 LLM 性能的影响。
{"title":"Use of SNOMED CT in Large Language Models: Scoping Review.","authors":"Eunsuk Chang, Sumi Sung","doi":"10.2196/62924","DOIUrl":"10.2196/62924","url":null,"abstract":"&lt;p&gt;&lt;strong&gt;Background: &lt;/strong&gt;Large language models (LLMs) have substantially advanced natural language processing (NLP) capabilities but often struggle with knowledge-driven tasks in specialized domains such as biomedicine. Integrating biomedical knowledge sources such as SNOMED CT into LLMs may enhance their performance on biomedical tasks. However, the methodologies and effectiveness of incorporating SNOMED CT into LLMs have not been systematically reviewed.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Objective: &lt;/strong&gt;This scoping review aims to examine how SNOMED CT is integrated into LLMs, focusing on (1) the types and components of LLMs being integrated with SNOMED CT, (2) which contents of SNOMED CT are being integrated, and (3) whether this integration improves LLM performance on NLP tasks.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Methods: &lt;/strong&gt;Following the PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews) guidelines, we searched ACM Digital Library, ACL Anthology, IEEE Xplore, PubMed, and Embase for relevant studies published from 2018 to 2023. Studies were included if they incorporated SNOMED CT into LLM pipelines for natural language understanding or generation tasks. Data on LLM types, SNOMED CT integration methods, end tasks, and performance metrics were extracted and synthesized.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Results: &lt;/strong&gt;The review included 37 studies. Bidirectional Encoder Representations from Transformers and its biomedical variants were the most commonly used LLMs. Three main approaches for integrating SNOMED CT were identified: (1) incorporating SNOMED CT into LLM inputs (28/37, 76%), primarily using concept descriptions to expand training corpora; (2) integrating SNOMED CT into additional fusion modules (5/37, 14%); and (3) using SNOMED CT as an external knowledge retriever during inference (5/37, 14%). The most frequent end task was medical concept normalization (15/37, 41%), followed by entity extraction or typing and classification. While most studies (17/19, 89%) reported performance improvements after SNOMED CT integration, only a small fraction (19/37, 51%) provided direct comparisons. The reported gains varied widely across different metrics and tasks, ranging from 0.87% to 131.66%. However, some studies showed either no improvement or a decline in certain performance metrics.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Conclusions: &lt;/strong&gt;This review demonstrates diverse approaches for integrating SNOMED CT into LLMs, with a focus on using concept descriptions to enhance biomedical language understanding and generation. While the results suggest potential benefits of SNOMED CT integration, the lack of standardized evaluation methods and comprehensive performance reporting hinders definitive conclusions about its effectiveness. Future research should prioritize consistent reporting of performance comparisons and explore more sophisticated methods for incorporating SNOMED CT's relational structure into LLMs. In addition, the biomedical NLP commun","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"12 ","pages":"e62924"},"PeriodicalIF":3.1,"publicationDate":"2024-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11494256/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142382572","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparative Study to Evaluate the Accuracy of Differential Diagnosis Lists Generated by Gemini Advanced, Gemini, and Bard for a Case Report Series Analysis: Cross-Sectional Study. 评估 Gemini Advanced、Gemini 和 Bard 为病例报告系列分析生成的鉴别诊断列表准确性的比较研究:横断面研究。
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-10-02 DOI: 10.2196/63010
Takanobu Hirosawa, Yukinori Harada, Kazuki Tokumasu, Takahiro Ito, Tomoharu Suzuki, Taro Shimizu

Background: Generative artificial intelligence (GAI) systems by Google have recently been updated from Bard to Gemini and Gemini Advanced as of December 2023. Gemini is a basic, free-to-use model after a user's login, while Gemini Advanced operates on a more advanced model requiring a fee-based subscription. These systems have the potential to enhance medical diagnostics. However, the impact of these updates on comprehensive diagnostic accuracy remains unknown.

Objective: This study aimed to compare the accuracy of the differential diagnosis lists generated by Gemini Advanced, Gemini, and Bard across comprehensive medical fields using case report series.

Methods: We identified a case report series with relevant final diagnoses published in the American Journal Case Reports from January 2022 to March 2023. After excluding nondiagnostic cases and patients aged 10 years and younger, we included the remaining case reports. After refining the case parts as case descriptions, we input the same case descriptions into Gemini Advanced, Gemini, and Bard to generate the top 10 differential diagnosis lists. In total, 2 expert physicians independently evaluated whether the final diagnosis was included in the lists and its ranking. Any discrepancies were resolved by another expert physician. Bonferroni correction was applied to adjust the P values for the number of comparisons among 3 GAI systems, setting the corrected significance level at P value <.02.

Results: In total, 392 case reports were included. The inclusion rates of the final diagnosis within the top 10 differential diagnosis lists were 73% (286/392) for Gemini Advanced, 76.5% (300/392) for Gemini, and 68.6% (269/392) for Bard. The top diagnoses matched the final diagnoses in 31.6% (124/392) for Gemini Advanced, 42.6% (167/392) for Gemini, and 31.4% (123/392) for Bard. Gemini demonstrated higher diagnostic accuracy than Bard both within the top 10 differential diagnosis lists (P=.02) and as the top diagnosis (P=.001). In addition, Gemini Advanced achieved significantly lower accuracy than Gemini in identifying the most probable diagnosis (P=.002).

Conclusions: The results of this study suggest that Gemini outperformed Bard in diagnostic accuracy following the model update. However, Gemini Advanced requires further refinement to optimize its performance for future artificial intelligence-enhanced diagnostics. These findings should be interpreted cautiously and considered primarily for research purposes, as these GAI systems have not been adjusted for medical diagnostics nor approved for clinical use.

背景介绍最近,谷歌的生成式人工智能(GAI)系统从 Bard 升级为 Gemini 和 Gemini Advanced,截止日期为 2023 年 12 月。Gemini 是用户登录后免费使用的基本模式,而 Gemini Advanced 则是需要付费订阅的高级模式。这些系统具有提高医疗诊断水平的潜力。然而,这些更新对综合诊断准确性的影响仍是未知数:本研究旨在利用病例报告系列比较 Gemini Advanced、Gemini 和 Bard 生成的鉴别诊断列表在综合医疗领域的准确性:我们确定了 2022 年 1 月至 2023 年 3 月期间发表在《美国病例报告杂志》(American Journal Case Reports)上的具有相关最终诊断的病例报告系列。在排除非诊断性病例和年龄在 10 岁及以下的患者后,我们纳入了剩余的病例报告。将病例部分细化为病例描述后,我们将相同的病例描述输入到 Gemini Advanced、Gemini 和 Bard 中,生成前 10 位鉴别诊断列表。共有两名专家医师独立评估最终诊断是否包含在列表中及其排名。任何差异均由另一位专家医师解决。根据 3 个 GAI 系统之间的比较次数,对 P 值进行了 Bonferroni 校正,将校正后的显著性水平设定为 P 值 结果:共纳入 392 份病例报告。最终诊断在前 10 个鉴别诊断列表中的纳入率分别为:Gemini Advanced 73%(286/392)、Gemini 76.5%(300/392)和 Bard 68.6%(269/392)。Gemini Advanced 有 31.6%(124/392)、Gemini 有 42.6%(167/392)和 Bard 有 31.4%(123/392)的最高诊断与最终诊断相吻合。在前 10 个鉴别诊断列表中(P=.02),Gemini 的诊断准确率高于 Bard(P=.001)。此外,在确定最可能的诊断方面,Gemini Advanced 的准确性明显低于 Gemini(P=.002):本研究结果表明,模型更新后,双子座在诊断准确性方面优于巴德。然而,Gemini Advanced 还需要进一步改进,以优化其在未来人工智能增强诊断中的表现。由于这些 GAI 系统尚未针对医疗诊断进行调整,也未被批准用于临床,因此应谨慎解读这些研究结果,并将其主要用于研究目的。
{"title":"Comparative Study to Evaluate the Accuracy of Differential Diagnosis Lists Generated by Gemini Advanced, Gemini, and Bard for a Case Report Series Analysis: Cross-Sectional Study.","authors":"Takanobu Hirosawa, Yukinori Harada, Kazuki Tokumasu, Takahiro Ito, Tomoharu Suzuki, Taro Shimizu","doi":"10.2196/63010","DOIUrl":"10.2196/63010","url":null,"abstract":"<p><strong>Background: </strong>Generative artificial intelligence (GAI) systems by Google have recently been updated from Bard to Gemini and Gemini Advanced as of December 2023. Gemini is a basic, free-to-use model after a user's login, while Gemini Advanced operates on a more advanced model requiring a fee-based subscription. These systems have the potential to enhance medical diagnostics. However, the impact of these updates on comprehensive diagnostic accuracy remains unknown.</p><p><strong>Objective: </strong>This study aimed to compare the accuracy of the differential diagnosis lists generated by Gemini Advanced, Gemini, and Bard across comprehensive medical fields using case report series.</p><p><strong>Methods: </strong>We identified a case report series with relevant final diagnoses published in the American Journal Case Reports from January 2022 to March 2023. After excluding nondiagnostic cases and patients aged 10 years and younger, we included the remaining case reports. After refining the case parts as case descriptions, we input the same case descriptions into Gemini Advanced, Gemini, and Bard to generate the top 10 differential diagnosis lists. In total, 2 expert physicians independently evaluated whether the final diagnosis was included in the lists and its ranking. Any discrepancies were resolved by another expert physician. Bonferroni correction was applied to adjust the P values for the number of comparisons among 3 GAI systems, setting the corrected significance level at P value <.02.</p><p><strong>Results: </strong>In total, 392 case reports were included. The inclusion rates of the final diagnosis within the top 10 differential diagnosis lists were 73% (286/392) for Gemini Advanced, 76.5% (300/392) for Gemini, and 68.6% (269/392) for Bard. The top diagnoses matched the final diagnoses in 31.6% (124/392) for Gemini Advanced, 42.6% (167/392) for Gemini, and 31.4% (123/392) for Bard. Gemini demonstrated higher diagnostic accuracy than Bard both within the top 10 differential diagnosis lists (P=.02) and as the top diagnosis (P=.001). In addition, Gemini Advanced achieved significantly lower accuracy than Gemini in identifying the most probable diagnosis (P=.002).</p><p><strong>Conclusions: </strong>The results of this study suggest that Gemini outperformed Bard in diagnostic accuracy following the model update. However, Gemini Advanced requires further refinement to optimize its performance for future artificial intelligence-enhanced diagnostics. These findings should be interpreted cautiously and considered primarily for research purposes, as these GAI systems have not been adjusted for medical diagnostics nor approved for clinical use.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"12 ","pages":"e63010"},"PeriodicalIF":3.1,"publicationDate":"2024-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11483254/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142367689","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Addressing Information Biases Within Electronic Health Record Data to Improve the Examination of Epidemiologic Associations With Diabetes Prevalence Among Young Adults: Cross-Sectional Study. 解决电子健康记录数据中的信息偏差,改进对年轻人糖尿病患病率流行病学关联的研究:横断面研究
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-10-01 DOI: 10.2196/58085
Sarah Conderino, Rebecca Anthopolos, Sandra S Albrecht, Shannon M Farley, Jasmin Divers, Andrea R Titus, Lorna E Thorpe

Background: Electronic health records (EHRs) are increasingly used for epidemiologic research to advance public health practice. However, key variables are susceptible to missing data or misclassification within EHRs, including demographic information or disease status, which could affect the estimation of disease prevalence or risk factor associations.

Objective: In this paper, we applied methods from the literature on missing data and causal inference to assess whether we could mitigate information biases when estimating measures of association between potential risk factors and diabetes among a patient population of New York City young adults.

Methods: We estimated the odds ratio (OR) for diabetes by race or ethnicity and asthma status using EHR data from NYU Langone Health. Methods from the missing data and causal inference literature were then applied to assess the ability to control for misclassification of health outcomes in the EHR data. We compared EHR-based associations with associations observed from 2 national health surveys, the Behavioral Risk Factor Surveillance System (BRFSS) and the National Health and Nutrition Examination Survey, representing traditional public health surveillance systems.

Results: Observed EHR-based associations between race or ethnicity and diabetes were comparable to health survey-based estimates, but the association between asthma and diabetes was significantly overestimated (OREHR 3.01, 95% CI 2.86-3.18 vs ORBRFSS 1.23, 95% CI 1.09-1.40). Missing data and causal inference methods reduced information biases in these estimates, yielding relative differences from traditional estimates below 50% (ORMissingData 1.79, 95% CI 1.67-1.92 and ORCausal 1.42, 95% CI 1.34-1.51).

Conclusions: Findings suggest that without bias adjustment, EHR analyses may yield biased measures of association, driven in part by subgroup differences in health care use. However, applying missing data or causal inference frameworks can help control for and, importantly, characterize residual information biases in these estimates.

背景:电子健康记录(EHR)越来越多地用于流行病学研究,以促进公共卫生实践。然而,电子健康记录中的关键变量容易出现数据缺失或分类错误,包括人口统计信息或疾病状态,这可能会影响对疾病流行率或风险因素关联的估计:在本文中,我们应用了文献中有关缺失数据和因果推断的方法,以评估在估算纽约市年轻成人患者群体中潜在风险因素与糖尿病之间的关联时,我们是否能减轻信息偏差:我们利用纽约大学朗贡医疗中心的电子病历数据,按种族或民族以及哮喘状况估算了糖尿病的几率比(OR)。然后应用缺失数据和因果推断文献中的方法来评估控制 EHR 数据中健康结果分类错误的能力。我们将基于电子病历的关联与代表传统公共卫生监测系统的行为风险因素监测系统(BRFSS)和国家健康与营养调查(National Health and Nutrition Examination Survey)这两项国家健康调查中观察到的关联进行了比较:基于电子病历观察到的种族或民族与糖尿病之间的关系与基于健康调查的估计值相当,但哮喘与糖尿病之间的关系被明显高估(OREHR 3.01,95% CI 2.86-3.18 vs ORBRFSS 1.23,95% CI 1.09-1.40)。缺失数据和因果推断方法减少了这些估计值的信息偏差,与传统估计值的相对差异低于 50%(ORMissingData 1.79,95% CI 1.67-1.92 和 ORCausal 1.42,95% CI 1.34-1.51):研究结果表明,如果不进行偏倚调整,电子病历分析可能会产生偏倚的关联测量,部分原因是亚组在医疗保健使用方面存在差异。然而,应用缺失数据或因果推断框架有助于控制这些估计值中的残余信息偏差,重要的是,还有助于描述这些偏差的特征。
{"title":"Addressing Information Biases Within Electronic Health Record Data to Improve the Examination of Epidemiologic Associations With Diabetes Prevalence Among Young Adults: Cross-Sectional Study.","authors":"Sarah Conderino, Rebecca Anthopolos, Sandra S Albrecht, Shannon M Farley, Jasmin Divers, Andrea R Titus, Lorna E Thorpe","doi":"10.2196/58085","DOIUrl":"10.2196/58085","url":null,"abstract":"<p><strong>Background: </strong>Electronic health records (EHRs) are increasingly used for epidemiologic research to advance public health practice. However, key variables are susceptible to missing data or misclassification within EHRs, including demographic information or disease status, which could affect the estimation of disease prevalence or risk factor associations.</p><p><strong>Objective: </strong>In this paper, we applied methods from the literature on missing data and causal inference to assess whether we could mitigate information biases when estimating measures of association between potential risk factors and diabetes among a patient population of New York City young adults.</p><p><strong>Methods: </strong>We estimated the odds ratio (OR) for diabetes by race or ethnicity and asthma status using EHR data from NYU Langone Health. Methods from the missing data and causal inference literature were then applied to assess the ability to control for misclassification of health outcomes in the EHR data. We compared EHR-based associations with associations observed from 2 national health surveys, the Behavioral Risk Factor Surveillance System (BRFSS) and the National Health and Nutrition Examination Survey, representing traditional public health surveillance systems.</p><p><strong>Results: </strong>Observed EHR-based associations between race or ethnicity and diabetes were comparable to health survey-based estimates, but the association between asthma and diabetes was significantly overestimated (OREHR 3.01, 95% CI 2.86-3.18 vs ORBRFSS 1.23, 95% CI 1.09-1.40). Missing data and causal inference methods reduced information biases in these estimates, yielding relative differences from traditional estimates below 50% (ORMissingData 1.79, 95% CI 1.67-1.92 and ORCausal 1.42, 95% CI 1.34-1.51).</p><p><strong>Conclusions: </strong>Findings suggest that without bias adjustment, EHR analyses may yield biased measures of association, driven in part by subgroup differences in health care use. However, applying missing data or causal inference frameworks can help control for and, importantly, characterize residual information biases in these estimates.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"12 ","pages":"e58085"},"PeriodicalIF":3.1,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11460830/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142367688","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Disambiguating Clinical Abbreviations by One-to-All Classification: Algorithm Development and Validation Study. 通过 "一对一 "分类消除临床缩略语的歧义:算法开发与验证研究。
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-10-01 DOI: 10.2196/56955
Sheng-Feng Sung, Ya-Han Hu, Chong-Yan Chen

Background: Electronic medical records store extensive patient data and serve as a comprehensive repository, including textual medical records like surgical and imaging reports. Their utility in clinical decision support systems is substantial, but the widespread use of ambiguous and unstandardized abbreviations in clinical documents poses challenges for natural language processing in clinical decision support systems. Efficient abbreviation disambiguation methods are needed for effective information extraction.

Objective: This study aims to enhance the one-to-all (OTA) framework for clinical abbreviation expansion, which uses a single model to predict multiple abbreviation meanings. The objective is to improve OTA by developing context-candidate pairs and optimizing word embeddings in Bidirectional Encoder Representations From Transformers (BERT), evaluating the model's efficacy in expanding clinical abbreviations using real data.

Methods: Three datasets were used: Medical Subject Headings Word Sense Disambiguation, University of Minnesota, and Chia-Yi Christian Hospital from Ditmanson Medical Foundation Chia-Yi Christian Hospital. Texts containing polysemous abbreviations were preprocessed and formatted for BERT. The study involved fine-tuning pretrained models, ClinicalBERT and BlueBERT, generating dataset pairs for training and testing based on Huang et al's method.

Results: BlueBERT achieved macro- and microaccuracies of 95.41% and 95.16%, respectively, on the Medical Subject Headings Word Sense Disambiguation dataset. It improved macroaccuracy by 0.54%-1.53% compared to two baselines, long short-term memory and deepBioWSD with random embedding. On the University of Minnesota dataset, BlueBERT recorded macro- and microaccuracies of 98.40% and 98.22%, respectively. Against the baselines of Word2Vec + support vector machine and BioWordVec + support vector machine, BlueBERT demonstrated a macroaccuracy improvement of 2.61%-4.13%.

Conclusions: This research preliminarily validated the effectiveness of the OTA method for abbreviation disambiguation in medical texts, demonstrating the potential to enhance both clinical staff efficiency and research effectiveness.

背景:电子病历存储了大量患者数据,是一个综合性的资料库,其中包括手术和成像报告等文本医疗记录。它们在临床决策支持系统中的作用非常大,但临床文档中广泛使用含糊不清和未标准化的缩写,这给临床决策支持系统中的自然语言处理带来了挑战。为了有效提取信息,需要高效的缩写消歧方法:本研究旨在改进用于临床缩写扩展的一对全(OTA)框架,该框架使用单一模型预测多个缩写的含义。其目的是通过开发上下文候选对和优化双向编码器变换器表征(BERT)中的词嵌入来改进 OTA,并使用真实数据评估该模型在扩展临床缩写方面的功效:方法:使用了三个数据集:方法:使用了三个数据集:医学主题词表词义消歧、明尼苏达大学和 Ditmanson 医学基金会嘉义基督教医院。包含多义缩写的文本经过预处理和格式化后用于 BERT。研究包括微调预训练模型、ClinicalBERT 和 BlueBERT,根据 Huang 等人的方法生成用于训练和测试的数据集对:在医学主题词词义消歧数据集上,BlueBERT 的宏观准确率和微观准确率分别达到 95.41% 和 95.16%。与两个基线(长短期记忆和随机嵌入的 deepBioWSD)相比,它的宏观准确率提高了 0.54%-1.53%。在明尼苏达大学的数据集上,BlueBERT 的宏观准确率和微观准确率分别达到了 98.40% 和 98.22%。与 Word2Vec + 支持向量机和 BioWordVec + 支持向量机的基线相比,BlueBERT 的宏观准确率提高了 2.61%-4.13% :这项研究初步验证了 OTA 方法在医学文本中进行缩写消歧的有效性,显示了其提高临床工作人员效率和研究效果的潜力。
{"title":"Disambiguating Clinical Abbreviations by One-to-All Classification: Algorithm Development and Validation Study.","authors":"Sheng-Feng Sung, Ya-Han Hu, Chong-Yan Chen","doi":"10.2196/56955","DOIUrl":"10.2196/56955","url":null,"abstract":"<p><strong>Background: </strong>Electronic medical records store extensive patient data and serve as a comprehensive repository, including textual medical records like surgical and imaging reports. Their utility in clinical decision support systems is substantial, but the widespread use of ambiguous and unstandardized abbreviations in clinical documents poses challenges for natural language processing in clinical decision support systems. Efficient abbreviation disambiguation methods are needed for effective information extraction.</p><p><strong>Objective: </strong>This study aims to enhance the one-to-all (OTA) framework for clinical abbreviation expansion, which uses a single model to predict multiple abbreviation meanings. The objective is to improve OTA by developing context-candidate pairs and optimizing word embeddings in Bidirectional Encoder Representations From Transformers (BERT), evaluating the model's efficacy in expanding clinical abbreviations using real data.</p><p><strong>Methods: </strong>Three datasets were used: Medical Subject Headings Word Sense Disambiguation, University of Minnesota, and Chia-Yi Christian Hospital from Ditmanson Medical Foundation Chia-Yi Christian Hospital. Texts containing polysemous abbreviations were preprocessed and formatted for BERT. The study involved fine-tuning pretrained models, ClinicalBERT and BlueBERT, generating dataset pairs for training and testing based on Huang et al's method.</p><p><strong>Results: </strong>BlueBERT achieved macro- and microaccuracies of 95.41% and 95.16%, respectively, on the Medical Subject Headings Word Sense Disambiguation dataset. It improved macroaccuracy by 0.54%-1.53% compared to two baselines, long short-term memory and deepBioWSD with random embedding. On the University of Minnesota dataset, BlueBERT recorded macro- and microaccuracies of 98.40% and 98.22%, respectively. Against the baselines of Word2Vec + support vector machine and BioWordVec + support vector machine, BlueBERT demonstrated a macroaccuracy improvement of 2.61%-4.13%.</p><p><strong>Conclusions: </strong>This research preliminarily validated the effectiveness of the OTA method for abbreviation disambiguation in medical texts, demonstrating the potential to enhance both clinical staff efficiency and research effectiveness.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"12 ","pages":"e56955"},"PeriodicalIF":3.1,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11460304/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142333446","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Practical Aspects of Using Large Language Models to Screen Abstracts for Cardiovascular Drug Development: Cross-Sectional Study. 使用大型语言模型筛选心血管药物开发摘要的实用方面:横断面研究。
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-09-30 DOI: 10.2196/64143
Jay G Ronquillo, Jamie Ye, Donal Gorman, Adina R Lemeshow, Stephen J Watt

Unlabelled: Cardiovascular drug development requires synthesizing relevant literature about indications, mechanisms, biomarkers, and outcomes. This short study investigates the performance, cost, and prompt engineering trade-offs of 3 large language models accelerating the literature screening process for cardiovascular drug development applications.

无标签:心血管药物开发需要综合有关适应症、机制、生物标记物和结果的相关文献。这项简短的研究调查了 3 个大型语言模型的性能、成本和及时工程权衡,这些模型可加速心血管药物开发应用的文献筛选过程。
{"title":"Practical Aspects of Using Large Language Models to Screen Abstracts for Cardiovascular Drug Development: Cross-Sectional Study.","authors":"Jay G Ronquillo, Jamie Ye, Donal Gorman, Adina R Lemeshow, Stephen J Watt","doi":"10.2196/64143","DOIUrl":"10.2196/64143","url":null,"abstract":"<p><strong>Unlabelled: </strong>Cardiovascular drug development requires synthesizing relevant literature about indications, mechanisms, biomarkers, and outcomes. This short study investigates the performance, cost, and prompt engineering trade-offs of 3 large language models accelerating the literature screening process for cardiovascular drug development applications.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"12 ","pages":"e64143"},"PeriodicalIF":3.1,"publicationDate":"2024-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11469161/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142376312","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Toward Better Semantic Interoperability of Data Element Repositories in Medicine: Analysis Study. 实现医学数据元素库更好的语义互操作性:分析研究。
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-09-30 DOI: 10.2196/60293
Zhengyong Hu, Anran Wang, Yifan Duan, Jiayin Zhou, Wanfei Hu, Sizhu Wu

Background: Data element repositories facilitate high-quality medical data sharing by standardizing data and enhancing semantic interoperability. However, the application of repositories is confined to specific projects and institutions.

Objective: This study aims to explore potential issues and promote broader application of data element repositories within the medical field by evaluating and analyzing typical repositories.

Methods: Following the inclusion of 5 data element repositories through a literature review, a novel analysis framework consisting of 7 dimensions and 36 secondary indicators was constructed and used for evaluation and analysis.

Results: The study's results delineate the unique characteristics of different repositories and uncover specific issues in their construction. These issues include the absence of data reuse protocols and insufficient information regarding the application scenarios and efficacy of data elements. The repositories fully comply with only 45% (9/20) of the subprinciples for Findable and Reusable in the FAIR principle, while achieving a 90% (19/20 subprinciples) compliance rate for Accessible and 67% (10/15 subprinciples) for Interoperable.

Conclusions: The recommendations proposed in this study address the issues to improve the construction and application of repositories, offering valuable insights to data managers, computer experts, and other pertinent stakeholders.

背景:数据元素资源库通过数据标准化和增强语义互操作性,促进了高质量的医疗数据共享。然而,存储库的应用仅限于特定的项目和机构:本研究旨在通过评估和分析典型的数据元素库,探讨潜在的问题并促进数据元素库在医疗领域的更广泛应用:方法:通过文献综述纳入 5 个数据元素库后,构建了由 7 个维度和 36 个二级指标组成的新型分析框架,并用于评估和分析:研究结果:研究结果描述了不同资源库的独特性,并揭示了其建设过程中的具体问题。这些问题包括缺乏数据再利用协议,以及有关数据元素的应用场景和功效的信息不足。这些资料库只完全符合 FAIR 原则中可查找和可重用子原则的 45%(9/20),而符合可访问原则的比例为 90%(19/20 子原则),符合可互操作原则的比例为 67%(10/15 子原则):本研究提出的建议解决了改进资料库建设和应用的问题,为数据管理人员、计算机专家和其他相关利益方提供了有价值的见解。
{"title":"Toward Better Semantic Interoperability of Data Element Repositories in Medicine: Analysis Study.","authors":"Zhengyong Hu, Anran Wang, Yifan Duan, Jiayin Zhou, Wanfei Hu, Sizhu Wu","doi":"10.2196/60293","DOIUrl":"10.2196/60293","url":null,"abstract":"<p><strong>Background: </strong>Data element repositories facilitate high-quality medical data sharing by standardizing data and enhancing semantic interoperability. However, the application of repositories is confined to specific projects and institutions.</p><p><strong>Objective: </strong>This study aims to explore potential issues and promote broader application of data element repositories within the medical field by evaluating and analyzing typical repositories.</p><p><strong>Methods: </strong>Following the inclusion of 5 data element repositories through a literature review, a novel analysis framework consisting of 7 dimensions and 36 secondary indicators was constructed and used for evaluation and analysis.</p><p><strong>Results: </strong>The study's results delineate the unique characteristics of different repositories and uncover specific issues in their construction. These issues include the absence of data reuse protocols and insufficient information regarding the application scenarios and efficacy of data elements. The repositories fully comply with only 45% (9/20) of the subprinciples for Findable and Reusable in the FAIR principle, while achieving a 90% (19/20 subprinciples) compliance rate for Accessible and 67% (10/15 subprinciples) for Interoperable.</p><p><strong>Conclusions: </strong>The recommendations proposed in this study address the issues to improve the construction and application of repositories, offering valuable insights to data managers, computer experts, and other pertinent stakeholders.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"12 ","pages":"e60293"},"PeriodicalIF":3.1,"publicationDate":"2024-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11474123/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142333466","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Implementation of the Observational Medical Outcomes Partnership Model in Electronic Medical Record Systems: Evaluation Study Using Factor Analysis and Decision-Making Trial and Evaluation Laboratory-Best-Worst Methods. 在电子病历系统中实施观察性医疗成果合作模式:使用因素分析和决策试验及评估实验室--最佳--最差方法的评估研究。
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-09-27 DOI: 10.2196/58498
Ming Luo, Yu Gu, Feilong Zhou, Shaohong Chen

Background: Electronic medical record (EMR) systems are essential in health care for collecting and storing patient medical data. They provide critical information to doctors and caregivers, facilitating improved decision-making and patient care. Despite their significance, optimizing EMR systems is crucial for enhancing health care quality. Implementing the Observational Medical Outcomes Partnership (OMOP) shared data model represents a promising approach to improve EMR performance and overall health care outcomes.

Objective: This study aims to evaluate the effects of implementing the OMOP shared data model in EMR systems and to assess its impact on enhancing health care quality.

Methods: In this study, 3 distinct methodologies are used to explore various aspects of health care information systems. First, factor analysis is utilized to investigate the correlations between EMR systems and attitudes toward OMOP. Second, the best-worst method (BWM) is applied to determine the weights of criteria and subcriteria. Lastly, the decision-making trial and evaluation laboratory technique is used to illustrate the interactions and interdependencies among the identified criteria.

Results: In this research, we evaluated the AliHealth EMR system by surveying 98 users and practitioners to assess its effectiveness and user satisfaction. The study reveals that among all components, "EMR resolution" holds the highest importance with a weight of 0.31007783, highlighting its significant role in the evaluation. Conversely, "EMR ease of use" has the lowest weight of 0.1860467, indicating that stakeholders prioritize the resolution aspect over ease of use in their assessment of EMR systems.

Conclusions: The findings highlight that stakeholders prioritize certain aspects of EMR systems, with "EMR resolution" being the most valued component.

背景:电子病历系统(EMR)是医疗保健领域收集和存储病人医疗数据的重要工具。它们为医生和护理人员提供重要信息,有助于改进决策和病人护理。尽管其意义重大,但优化电子病历系统对提高医疗质量至关重要。实施观察性医疗结果伙伴关系(OMOP)共享数据模型是提高 EMR 性能和整体医疗结果的一种可行方法:本研究旨在评估在 EMR 系统中实施 OMOP 共享数据模型的效果,并评估其对提高医疗质量的影响:本研究采用三种不同的方法探讨医疗信息系统的各个方面。首先,利用因子分析来研究 EMR 系统与对 OMOP 的态度之间的相关性。其次,采用最佳-最差法(BWM)确定标准和次级标准的权重。最后,使用决策试验和评估实验室技术来说明已确定标准之间的相互作用和相互依存关系:在这项研究中,我们通过对 98 名用户和从业人员进行调查,评估了阿里健康 EMR 系统的有效性和用户满意度。研究显示,在所有组成部分中,"电子病历解析度 "的重要性最高,权重为 0.31007783,突出了其在评价中的重要作用。相反,"电子病历易用性 "的权重最低,为 0.1860467,这表明利益相关者在评估电子病历系统时,优先考虑的是分辨率而不是易用性:研究结果突出表明,利益相关者优先考虑电子病历系统的某些方面,其中 "电子病历分辨率 "是最受重视的组成部分。
{"title":"Implementation of the Observational Medical Outcomes Partnership Model in Electronic Medical Record Systems: Evaluation Study Using Factor Analysis and Decision-Making Trial and Evaluation Laboratory-Best-Worst Methods.","authors":"Ming Luo, Yu Gu, Feilong Zhou, Shaohong Chen","doi":"10.2196/58498","DOIUrl":"10.2196/58498","url":null,"abstract":"<p><strong>Background: </strong>Electronic medical record (EMR) systems are essential in health care for collecting and storing patient medical data. They provide critical information to doctors and caregivers, facilitating improved decision-making and patient care. Despite their significance, optimizing EMR systems is crucial for enhancing health care quality. Implementing the Observational Medical Outcomes Partnership (OMOP) shared data model represents a promising approach to improve EMR performance and overall health care outcomes.</p><p><strong>Objective: </strong>This study aims to evaluate the effects of implementing the OMOP shared data model in EMR systems and to assess its impact on enhancing health care quality.</p><p><strong>Methods: </strong>In this study, 3 distinct methodologies are used to explore various aspects of health care information systems. First, factor analysis is utilized to investigate the correlations between EMR systems and attitudes toward OMOP. Second, the best-worst method (BWM) is applied to determine the weights of criteria and subcriteria. Lastly, the decision-making trial and evaluation laboratory technique is used to illustrate the interactions and interdependencies among the identified criteria.</p><p><strong>Results: </strong>In this research, we evaluated the AliHealth EMR system by surveying 98 users and practitioners to assess its effectiveness and user satisfaction. The study reveals that among all components, \"EMR resolution\" holds the highest importance with a weight of 0.31007783, highlighting its significant role in the evaluation. Conversely, \"EMR ease of use\" has the lowest weight of 0.1860467, indicating that stakeholders prioritize the resolution aspect over ease of use in their assessment of EMR systems.</p><p><strong>Conclusions: </strong>The findings highlight that stakeholders prioritize certain aspects of EMR systems, with \"EMR resolution\" being the most valued component.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"12 ","pages":"e58498"},"PeriodicalIF":3.1,"publicationDate":"2024-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11470222/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142333447","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
JMIR Medical Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1