首页 > 最新文献

Journal of the American Medical Informatics Association最新文献

英文 中文
Opportunities for the informatics community to advance learning health systems. 信息学界推进学习型医疗系统的机遇。
IF 4.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-01-01 DOI: 10.1093/jamia/ocae281
Melissa A Gunderson, Peter Embí, Charles P Friedman, Genevieve B Melton

Objectives: There is rapidly growing interest in learning health systems (LHSs) nationally and globally. While the critical role of informatics is recognized, the informatics community has been relatively slow to formalize LHS as a priority area.

Materials and methods: We compiled results from a short survey of LHS leaders and American Medical Informatics Association (AMIA) members, discussion from an LHS reception at the AMIA annual meeting, and a follow-up survey to inform priorities at the intersection of LHS and informatics.

Results: We present opportunities between informatics and LHS which fell into themes of: Understanding and Context, Shared Resources, Collaboration, Education, Data, Evaluation, and Patient Centeredness. Immediate LHS informatics priorities identified include establishing informatics LHS forum(s), case reports of LHS informatics successes and failures, LHS informatics education resources, and improved understanding of LHS principles in informatics.

Conclusion: Increased informatics and LHS alignment is critical for advancing this transformative national priority.

目的:在国内和全球范围内,人们对学习型医疗系统(LHS)的兴趣与日俱增。虽然信息学的关键作用已得到认可,但信息学界将学习型医疗系统正式列为优先领域的进程却相对缓慢:我们汇编了一项针对地方卫生系统领导者和美国医学信息学协会(AMIA)成员的简短调查、美国医学信息学协会年会地方卫生系统招待会的讨论结果,以及一项后续调查,以了解地方卫生系统和信息学交叉领域的优先事项:结果:我们介绍了信息学和地方卫生系统之间的机遇,这些机遇的主题包括结果:我们介绍了信息学与地方保健服务之间的机遇,这些机遇的主题包括:理解和背景、共享资源、合作、教育、数据、评估和以患者为中心。已确定的长者健康服务信息学当务之急包括建立长者健康服务信息学论坛、长者健康服务信息学成功与失败案例报告、长者健康服务信息学教育资源,以及提高对信息学中长者健康服务原则的理解:结论:加强信息学与长者健康服务的协调对于推进这一变革性的国家优先事项至关重要。
{"title":"Opportunities for the informatics community to advance learning health systems.","authors":"Melissa A Gunderson, Peter Embí, Charles P Friedman, Genevieve B Melton","doi":"10.1093/jamia/ocae281","DOIUrl":"10.1093/jamia/ocae281","url":null,"abstract":"<p><strong>Objectives: </strong>There is rapidly growing interest in learning health systems (LHSs) nationally and globally. While the critical role of informatics is recognized, the informatics community has been relatively slow to formalize LHS as a priority area.</p><p><strong>Materials and methods: </strong>We compiled results from a short survey of LHS leaders and American Medical Informatics Association (AMIA) members, discussion from an LHS reception at the AMIA annual meeting, and a follow-up survey to inform priorities at the intersection of LHS and informatics.</p><p><strong>Results: </strong>We present opportunities between informatics and LHS which fell into themes of: Understanding and Context, Shared Resources, Collaboration, Education, Data, Evaluation, and Patient Centeredness. Immediate LHS informatics priorities identified include establishing informatics LHS forum(s), case reports of LHS informatics successes and failures, LHS informatics education resources, and improved understanding of LHS principles in informatics.</p><p><strong>Conclusion: </strong>Increased informatics and LHS alignment is critical for advancing this transformative national priority.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":"253-257"},"PeriodicalIF":4.7,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11648723/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142696018","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Just another tool in their repertoire: uncovering insights into public and patient perspectives on clinicians' use of machine learning in perioperative care. 这只是他们的另一个工具:揭示公众和患者对临床医生在围手术期护理中使用机器学习的看法。
IF 4.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-01-01 DOI: 10.1093/jamia/ocae257
Xiomara T Gonzalez, Karen Steger-May, Joanna Abraham

Objectives: Successful implementation of machine learning-augmented clinical decision support systems (ML-CDSS) in perioperative care requires the prioritization of patient-centric approaches to ensure alignment with societal expectations. We assessed general public and surgical patient attitudes and perspectives on ML-CDSS use in perioperative care.

Materials and methods: A sequential explanatory study was conducted. Stage 1 collected public opinions through a survey. Stage 2 ascertained surgical patients' experiences and attitudes via focus groups and interviews.

Results: For Stage 1, a total of 281 respondents' (140 males [49.8%]) data were considered. Among participants without ML awareness, males were almost three times more likely than females to report more acceptance (OR = 2.97; 95% CI, 1.36-6.49) and embrace (OR = 2.74; 95% CI, 1.23-6.09) of ML-CDSS use by perioperative teams. Males were almost twice as likely as females to report more acceptance across all perioperative phases with ORs ranging from 1.71 to 2.07. In Stage 2, insights from 10 surgical patients revealed unanimous agreement that ML-CDSS should primarily serve a supportive function. The pre- and post-operative phases were identified explicitly as forums where ML-CDSS can enhance care delivery. Patients requested for education on ML-CDSS's role in their care to be disseminated by surgeons across multiple platforms.

Discussion and conclusion: The general public and surgical patients are receptive to ML-CDSS use throughout their perioperative care provided its role is auxiliary to perioperative teams. However, the integration of ML-CDSS into perioperative workflows presents unique challenges for healthcare settings. Insights from this study can inform strategies to support large-scale implementation and adoption of ML-CDSS by patients in all perioperative phases. Key strategies to promote the feasibility and acceptability of ML-CDSS include clinician-led discussions about ML-CDSS's role in perioperative care, established metrics to evaluate the clinical utility of ML-CDSS, and patient education.

目的:在围手术期护理中成功实施机器学习增强型临床决策支持系统(ML-CDSS)需要优先考虑以患者为中心的方法,以确保符合社会期望。我们评估了公众和手术患者对在围手术期护理中使用机器学习辅助临床决策支持系统的态度和观点:我们开展了一项顺序解释性研究。第一阶段通过调查收集公众意见。第二阶段通过焦点小组和访谈了解手术患者的经历和态度:第一阶段共考虑了 281 名受访者(140 名男性 [49.8%])的数据。在没有 ML 意识的参与者中,男性报告围手术期团队更接受(OR = 2.97;95% CI,1.36-6.49)和拥护(OR = 2.74;95% CI,1.23-6.09)ML-CDSS 的可能性几乎是女性的三倍。在所有围手术期阶段,男性对 ML-CDSS 的接受度几乎是女性的两倍,OR 值从 1.71 到 2.07 不等。在第二阶段,从 10 名手术患者那里了解到的情况表明,他们一致认为 ML-CDSS 应主要发挥辅助功能。术前和术后阶段被明确认定为 ML-CDSS 可以加强护理服务的场所。患者要求外科医生通过多个平台传播有关 ML-CDSS 在其护理中的作用的教育:只要 ML-CDSS 的作用是辅助围手术期团队,公众和手术患者都能接受在围手术期护理中使用 ML-CDSS。然而,将 ML-CDSS 整合到围术期工作流程中给医疗机构带来了独特的挑战。本研究的启示可为支持大规模实施 ML-CDSS 并让患者在围手术期各阶段采用 ML-CDSS 的策略提供参考。促进 ML-CDSS 可行性和可接受性的关键策略包括:由临床医生主导讨论 ML-CDSS 在围手术期护理中的作用、建立评估 ML-CDSS 临床效用的标准以及开展患者教育。
{"title":"Just another tool in their repertoire: uncovering insights into public and patient perspectives on clinicians' use of machine learning in perioperative care.","authors":"Xiomara T Gonzalez, Karen Steger-May, Joanna Abraham","doi":"10.1093/jamia/ocae257","DOIUrl":"10.1093/jamia/ocae257","url":null,"abstract":"<p><strong>Objectives: </strong>Successful implementation of machine learning-augmented clinical decision support systems (ML-CDSS) in perioperative care requires the prioritization of patient-centric approaches to ensure alignment with societal expectations. We assessed general public and surgical patient attitudes and perspectives on ML-CDSS use in perioperative care.</p><p><strong>Materials and methods: </strong>A sequential explanatory study was conducted. Stage 1 collected public opinions through a survey. Stage 2 ascertained surgical patients' experiences and attitudes via focus groups and interviews.</p><p><strong>Results: </strong>For Stage 1, a total of 281 respondents' (140 males [49.8%]) data were considered. Among participants without ML awareness, males were almost three times more likely than females to report more acceptance (OR = 2.97; 95% CI, 1.36-6.49) and embrace (OR = 2.74; 95% CI, 1.23-6.09) of ML-CDSS use by perioperative teams. Males were almost twice as likely as females to report more acceptance across all perioperative phases with ORs ranging from 1.71 to 2.07. In Stage 2, insights from 10 surgical patients revealed unanimous agreement that ML-CDSS should primarily serve a supportive function. The pre- and post-operative phases were identified explicitly as forums where ML-CDSS can enhance care delivery. Patients requested for education on ML-CDSS's role in their care to be disseminated by surgeons across multiple platforms.</p><p><strong>Discussion and conclusion: </strong>The general public and surgical patients are receptive to ML-CDSS use throughout their perioperative care provided its role is auxiliary to perioperative teams. However, the integration of ML-CDSS into perioperative workflows presents unique challenges for healthcare settings. Insights from this study can inform strategies to support large-scale implementation and adoption of ML-CDSS by patients in all perioperative phases. Key strategies to promote the feasibility and acceptability of ML-CDSS include clinician-led discussions about ML-CDSS's role in perioperative care, established metrics to evaluate the clinical utility of ML-CDSS, and patient education.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":"150-162"},"PeriodicalIF":4.7,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11648718/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142479226","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reducing diagnostic delays in acute hepatic porphyria using health records data and machine learning. 利用健康记录数据和机器学习减少急性肝性卟啉症的诊断延误。
IF 4.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-01-01 DOI: 10.1093/jamia/ocae141
Balu Bhasuran, Katharina Schmolly, Yuvraaj Kapoor, Nanditha Lakshmi Jayakumar, Raymond Doan, Jigar Amin, Stephen Meninger, Nathan Cheng, Robert Deering, Karl Anderson, Simon W Beaven, Bruce Wang, Vivek A Rudrapatna

Background: Acute hepatic porphyria (AHP) is a group of rare but treatable conditions associated with diagnostic delays of 15 years on average. The advent of electronic health records (EHR) data and machine learning (ML) may improve the timely recognition of rare diseases like AHP. However, prediction models can be difficult to train given the limited case numbers, unstructured EHR data, and selection biases intrinsic to healthcare delivery. We sought to train and characterize models for identifying patients with AHP.

Methods: This diagnostic study used structured and notes-based EHR data from 2 centers at the University of California, UCSF (2012-2022) and UCLA (2019-2022). The data were split into 2 cohorts (referral and diagnosis) and used to develop models that predict (1) who will be referred for testing of acute porphyria, among those who presented with abdominal pain (a cardinal symptom of AHP), and (2) who will test positive, among those referred. The referral cohort consisted of 747 patients referred for testing and 99 849 contemporaneous patients who were not. The diagnosis cohort consisted of 72 confirmed AHP cases and 347 patients who tested negative. The case cohort was 81% female and 6-75 years old at the time of diagnosis. Candidate models used a range of architectures. Feature selection was semi-automated and incorporated publicly available data from knowledge graphs. Our primary outcome was the F-score on an outcome-stratified test set.

Results: The best center-specific referral models achieved an F-score of 86%-91%. The best diagnosis model achieved an F-score of 92%. To further test our model, we contacted 372 current patients who lack an AHP diagnosis but were predicted by our models as potentially having it (≥10% probability of referral, ≥50% of testing positive). However, we were only able to recruit 10 of these patients for biochemical testing, all of whom were negative. Nonetheless, post hoc evaluations suggested that these models could identify 71% of cases earlier than their diagnosis date, saving 1.2 years.

Conclusions: ML can reduce diagnostic delays in AHP and other rare diseases. Robust recruitment strategies and multicenter coordination will be needed to validate these models before they can be deployed.

背景:急性肝卟啉症(AHP)是一组罕见但可治疗的疾病,平均诊断延迟时间长达 15 年。电子健康记录(EHR)数据和机器学习(ML)的出现可能会改善对 AHP 等罕见疾病的及时识别。然而,由于病例数量有限、电子病历数据不结构化以及医疗服务固有的选择偏差,预测模型可能很难训练。我们试图训练和描述识别 AHP 患者的模型:这项诊断研究使用了加州大学旧金山分校(2012-2022 年)和加州大学洛杉矶分校(2019-2022 年)两个中心的结构化和基于笔记的电子病历数据。这些数据被分为两个队列(转诊和诊断),并用于建立模型,预测:(1) 在出现腹痛(AHP 的主要症状)的患者中,哪些人会被转诊接受急性卟啉症检测;(2) 在转诊患者中,哪些人会检测呈阳性。转诊队列由 747 名转诊患者和 99 849 名未转诊的同期患者组成。诊断队列包括 72 例确诊的 AHP 病例和 347 例检测呈阴性的患者。病例群中 81% 为女性,诊断时年龄为 6-75 岁。候选模型采用了一系列架构。特征选择是半自动化的,并结合了知识图谱中的公开数据。我们的主要结果是结果分层测试集上的 F 分数:结果:最佳中心特定转诊模型的 F 分数达到了 86%-91%。最佳诊断模型的 F 分数为 92%。为了进一步测试我们的模型,我们联系了 372 名目前没有 AHP 诊断但被我们的模型预测为可能有 AHP 诊断的患者(转诊概率≥10%,测试阳性概率≥50%)。然而,我们只能招募其中的 10 名患者进行生化检测,结果全部为阴性。尽管如此,事后评估表明,这些模型可以在诊断日期之前发现 71% 的病例,节省了 1.2 年的时间:结论:ML 可以减少 AHP 和其他罕见病的诊断延误。在部署这些模型之前,还需要强有力的招募策略和多中心协调来验证它们。
{"title":"Reducing diagnostic delays in acute hepatic porphyria using health records data and machine learning.","authors":"Balu Bhasuran, Katharina Schmolly, Yuvraaj Kapoor, Nanditha Lakshmi Jayakumar, Raymond Doan, Jigar Amin, Stephen Meninger, Nathan Cheng, Robert Deering, Karl Anderson, Simon W Beaven, Bruce Wang, Vivek A Rudrapatna","doi":"10.1093/jamia/ocae141","DOIUrl":"10.1093/jamia/ocae141","url":null,"abstract":"<p><strong>Background: </strong>Acute hepatic porphyria (AHP) is a group of rare but treatable conditions associated with diagnostic delays of 15 years on average. The advent of electronic health records (EHR) data and machine learning (ML) may improve the timely recognition of rare diseases like AHP. However, prediction models can be difficult to train given the limited case numbers, unstructured EHR data, and selection biases intrinsic to healthcare delivery. We sought to train and characterize models for identifying patients with AHP.</p><p><strong>Methods: </strong>This diagnostic study used structured and notes-based EHR data from 2 centers at the University of California, UCSF (2012-2022) and UCLA (2019-2022). The data were split into 2 cohorts (referral and diagnosis) and used to develop models that predict (1) who will be referred for testing of acute porphyria, among those who presented with abdominal pain (a cardinal symptom of AHP), and (2) who will test positive, among those referred. The referral cohort consisted of 747 patients referred for testing and 99 849 contemporaneous patients who were not. The diagnosis cohort consisted of 72 confirmed AHP cases and 347 patients who tested negative. The case cohort was 81% female and 6-75 years old at the time of diagnosis. Candidate models used a range of architectures. Feature selection was semi-automated and incorporated publicly available data from knowledge graphs. Our primary outcome was the F-score on an outcome-stratified test set.</p><p><strong>Results: </strong>The best center-specific referral models achieved an F-score of 86%-91%. The best diagnosis model achieved an F-score of 92%. To further test our model, we contacted 372 current patients who lack an AHP diagnosis but were predicted by our models as potentially having it (≥10% probability of referral, ≥50% of testing positive). However, we were only able to recruit 10 of these patients for biochemical testing, all of whom were negative. Nonetheless, post hoc evaluations suggested that these models could identify 71% of cases earlier than their diagnosis date, saving 1.2 years.</p><p><strong>Conclusions: </strong>ML can reduce diagnostic delays in AHP and other rare diseases. Robust recruitment strategies and multicenter coordination will be needed to validate these models before they can be deployed.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":"63-70"},"PeriodicalIF":4.7,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11648717/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141472084","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Barriers to obtaining and using interoperable information among non-federal acute care hospitals. 非联邦急症护理医院获取和使用互操作信息的障碍。
IF 4.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-01-01 DOI: 10.1093/jamia/ocae263
Jordan Everson, Chelsea Richwine

Objective: To understand barriers to obtaining and using interoperable information at US hospitals.

Materials and methods: Using 2023 nationally representative survey data on US hospitals (N = 2420), we examined major and minor barriers to exchanging information with other organizations, and how barriers vary by hospital characteristics and methods used to obtain information. Using a series of regression models, we examined how hospital experiences with barriers relate to routine use of information at responding hospitals.

Results: In 2023, most hospitals experienced at least one minor (81%) or major (62%) barrier to exchange, with the most common major barriers relating to different vendors and exchange partners' capabilities. Higher-resourced hospitals and those often using network-based exchange tended to experience more minor barriers whereas lower-resourced hospitals and those often using mail/fax or direct access to outside electronic health records experienced more major barriers. In multivariate regression, hospitals indicating "Patient matching" and "Costs to exchange" were a major or minor barrier had the strongest independent negative association with the likelihood of reporting providers at their hospital frequently use information from outside organizations.

Discussion: Despite progress in interoperable exchange, various barriers remain. The prevalence of barriers varied by hospital type and methods used, with barriers more often preventing exchange for lower-resourced hospitals and those using outdated exchange methods.

Conclusion: While several technical and policy efforts are underway to address prevalent barriers, it will be important to monitor whether efforts are successful in ensuring information from outside organizations can be seamlessly exchanged and used to inform patient care.

目的:了解美国医院获取和使用可互操作信息的障碍:了解美国医院获取和使用可互操作信息的障碍:利用 2023 年美国医院的全国代表性调查数据(N = 2420),我们研究了与其他组织交换信息的主要障碍和次要障碍,以及不同医院的特点和获取信息的方法所造成的障碍差异。通过一系列回归模型,我们研究了医院遇到的障碍与受访医院日常信息使用的关系:2023 年,大多数医院在交换信息时至少遇到过一次轻微(81%)或严重(62%)的障碍,其中最常见的严重障碍与不同供应商和交换合作伙伴的能力有关。资源较多的医院和经常使用网络交换的医院往往遇到更多的小障碍,而资源较少的医院和经常使用邮件/传真或直接访问外部电子病历的医院则遇到更多的大障碍。在多变量回归中,表示 "患者匹配 "和 "交换成本 "是主要或次要障碍的医院与报告其医院的医疗服务提供者经常使用外部机构信息的可能性呈最强的独立负相关:尽管在互操作性交换方面取得了进展,但仍存在各种障碍。障碍的普遍程度因医院类型和使用的方法而异,资源较少的医院和使用过时的交换方法的医院更常因障碍而无法交换信息:尽管目前正在开展一些技术和政策方面的工作来解决普遍存在的障碍,但重要的是要监测这些工作是否能成功确保来自外部机构的信息能够无缝交换并用于为患者护理提供信息。
{"title":"Barriers to obtaining and using interoperable information among non-federal acute care hospitals.","authors":"Jordan Everson, Chelsea Richwine","doi":"10.1093/jamia/ocae263","DOIUrl":"10.1093/jamia/ocae263","url":null,"abstract":"<p><strong>Objective: </strong>To understand barriers to obtaining and using interoperable information at US hospitals.</p><p><strong>Materials and methods: </strong>Using 2023 nationally representative survey data on US hospitals (N = 2420), we examined major and minor barriers to exchanging information with other organizations, and how barriers vary by hospital characteristics and methods used to obtain information. Using a series of regression models, we examined how hospital experiences with barriers relate to routine use of information at responding hospitals.</p><p><strong>Results: </strong>In 2023, most hospitals experienced at least one minor (81%) or major (62%) barrier to exchange, with the most common major barriers relating to different vendors and exchange partners' capabilities. Higher-resourced hospitals and those often using network-based exchange tended to experience more minor barriers whereas lower-resourced hospitals and those often using mail/fax or direct access to outside electronic health records experienced more major barriers. In multivariate regression, hospitals indicating \"Patient matching\" and \"Costs to exchange\" were a major or minor barrier had the strongest independent negative association with the likelihood of reporting providers at their hospital frequently use information from outside organizations.</p><p><strong>Discussion: </strong>Despite progress in interoperable exchange, various barriers remain. The prevalence of barriers varied by hospital type and methods used, with barriers more often preventing exchange for lower-resourced hospitals and those using outdated exchange methods.</p><p><strong>Conclusion: </strong>While several technical and policy efforts are underway to address prevalent barriers, it will be important to monitor whether efforts are successful in ensuring information from outside organizations can be seamlessly exchanged and used to inform patient care.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":"20-27"},"PeriodicalIF":4.7,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11648725/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142479221","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The number of patient scheduled hours resulting in a 40-hour work week by physician specialty and setting: a cross-sectional study using electronic health record event log data. 按医生专业和工作环境划分的每周 40 小时工作时间所安排的病人小时数:利用电子健康记录事件日志数据进行的横断面研究。
IF 4.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-01-01 DOI: 10.1093/jamia/ocae266
Christine A Sinsky, Lisa Rotenstein, A Jay Holmgren, Nate C Apathy

Objective: To quantify how many patient scheduled hours would result in a 40-h work week (PSH40) for ambulatory physicians and to determine how PSH40 varies by specialty and practice type.

Methods: We calculated PSH40 for 186 188 ambulatory physicians across 395 organizations from November 2021 through April 2022 stratified by specialty.

Results: Median PSH40 for the sample was 33.2 h (IQR: 28.7-36.5). PSH40 was lowest in infectious disease (26.2, IQR: 21.6-31.1), geriatrics (27.2, IQR: 21.5-32.0) and hematology (28.6, IQR: 23.6-32.6) and highest in plastic surgery (35.7, IQR: 32.8-37.7), pain medicine (35.8, IQR: 32.6-37.9) and sports medicine (36.0, IQR: 33.3-38.1).

Discussion: Health system leaders and physicians will benefit from data driven and transparent discussions about work hour expectations. The PSH40 measure can also be used to quantify the impact of variations in the clinical care environment on the in-person ambulatory patient care time available to physicians.

Conclusions: PSH40 is a novel measure that can be generated from vendor-derived metrics and used by operational leaders to inform work expectations. It can also support research into the impact of changes in the care environment on physicians' workload and capacity.

目的量化非住院医师每周 40 小时工作时间(PSH40)所需的病人预定小时数,并确定不同专业和执业类型的 PSH40 有何差异:我们计算了 2021 年 11 月至 2022 年 4 月期间 395 家机构中 186 188 名非住院医师的 PSH40,并按专业进行了分层:样本的 PSH40 中位数为 33.2 小时(IQR:28.7-36.5)。传染病科(26.2,IQR:21.6-31.1)、老年病科(27.2,IQR:21.5-32.0)和血液科(28.6,IQR:23.6-32.6)的 PSH40 最低,整形外科(35.7,IQR:32.8-37.7)、疼痛科(35.8,IQR:32.6-37.9)和运动医学科(36.0,IQR:33.3-38.1)的 PSH40 最高:讨论:医疗系统领导和医生将受益于数据驱动和透明的工时预期讨论。PSH40测量方法还可用于量化临床护理环境的变化对医生门诊病人护理时间的影响:PSH40 是一种新颖的衡量标准,可从供应商提供的指标中生成,并由业务领导者用于告知工作预期。它还能支持研究医疗环境的变化对医生工作量和能力的影响。
{"title":"The number of patient scheduled hours resulting in a 40-hour work week by physician specialty and setting: a cross-sectional study using electronic health record event log data.","authors":"Christine A Sinsky, Lisa Rotenstein, A Jay Holmgren, Nate C Apathy","doi":"10.1093/jamia/ocae266","DOIUrl":"10.1093/jamia/ocae266","url":null,"abstract":"<p><strong>Objective: </strong>To quantify how many patient scheduled hours would result in a 40-h work week (PSH40) for ambulatory physicians and to determine how PSH40 varies by specialty and practice type.</p><p><strong>Methods: </strong>We calculated PSH40 for 186 188 ambulatory physicians across 395 organizations from November 2021 through April 2022 stratified by specialty.</p><p><strong>Results: </strong>Median PSH40 for the sample was 33.2 h (IQR: 28.7-36.5). PSH40 was lowest in infectious disease (26.2, IQR: 21.6-31.1), geriatrics (27.2, IQR: 21.5-32.0) and hematology (28.6, IQR: 23.6-32.6) and highest in plastic surgery (35.7, IQR: 32.8-37.7), pain medicine (35.8, IQR: 32.6-37.9) and sports medicine (36.0, IQR: 33.3-38.1).</p><p><strong>Discussion: </strong>Health system leaders and physicians will benefit from data driven and transparent discussions about work hour expectations. The PSH40 measure can also be used to quantify the impact of variations in the clinical care environment on the in-person ambulatory patient care time available to physicians.</p><p><strong>Conclusions: </strong>PSH40 is a novel measure that can be generated from vendor-derived metrics and used by operational leaders to inform work expectations. It can also support research into the impact of changes in the care environment on physicians' workload and capacity.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":"235-240"},"PeriodicalIF":4.7,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11648726/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142479237","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Secure messaging telehealth billing in the digital age: moving beyond time-based metrics. 数字时代的安全信息远程医疗计费:超越基于时间的指标。
IF 4.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-01-01 DOI: 10.1093/jamia/ocae250
Dong-Gil Ko, Umberto Tachinardi, Eric J Warm

Objective: We proposed adopting billing models for secure messaging (SM) telehealth services that move beyond time-based metrics, focusing on the complexity and clinical expertise involved in patient care.

Materials and methods: We trained 8 classification machine learning (ML) models using providers' electronic health record (EHR) audit log data for patient-initiated non-urgent messages. Mixed effect modeling (MEM) analyzed significance.

Results: Accuracy and area under the receiver operating characteristics curve scores generally exceeded 0.85, demonstrating robust performance. MEM showed that knowledge domains significantly influenced SM billing, explaining nearly 40% of the variance.

Discussion: This study demonstrates that ML models using EHR audit log data can improve and predict billing in SM telehealth services, supporting billing models that reflect clinical complexity and expertise rather than time-based metrics.

Conclusion: Our research highlights the need for SM billing models beyond time-based metrics, using EHR audit log data to capture the true value of clinical work.

目的:我们建议对安全信息传送(SM)远程医疗服务采用计费模式:我们建议对安全信息(SM)远程医疗服务采用计费模式,这种模式超越了基于时间的衡量标准,侧重于患者护理所涉及的复杂性和临床专业知识:我们使用医疗服务提供者的电子健康记录(EHR)审计日志数据,针对患者发起的非紧急信息训练了 8 个分类机器学习(ML)模型。混合效应建模(MEM)分析了显著性:结果:准确率和接收者工作特征曲线下面积得分普遍超过 0.85,显示出强大的性能。混合效应模型显示,知识域对 SM 计费有显著影响,解释了近 40% 的方差:本研究表明,使用 EHR 审计日志数据的 ML 模型可以改进和预测 SM 远程医疗服务的计费,支持反映临床复杂性和专业知识而非基于时间指标的计费模型:我们的研究强调,除了基于时间的指标外,还需要使用电子病历审计日志数据来捕捉临床工作的真正价值,从而建立 SM 计费模型。
{"title":"Secure messaging telehealth billing in the digital age: moving beyond time-based metrics.","authors":"Dong-Gil Ko, Umberto Tachinardi, Eric J Warm","doi":"10.1093/jamia/ocae250","DOIUrl":"10.1093/jamia/ocae250","url":null,"abstract":"<p><strong>Objective: </strong>We proposed adopting billing models for secure messaging (SM) telehealth services that move beyond time-based metrics, focusing on the complexity and clinical expertise involved in patient care.</p><p><strong>Materials and methods: </strong>We trained 8 classification machine learning (ML) models using providers' electronic health record (EHR) audit log data for patient-initiated non-urgent messages. Mixed effect modeling (MEM) analyzed significance.</p><p><strong>Results: </strong>Accuracy and area under the receiver operating characteristics curve scores generally exceeded 0.85, demonstrating robust performance. MEM showed that knowledge domains significantly influenced SM billing, explaining nearly 40% of the variance.</p><p><strong>Discussion: </strong>This study demonstrates that ML models using EHR audit log data can improve and predict billing in SM telehealth services, supporting billing models that reflect clinical complexity and expertise rather than time-based metrics.</p><p><strong>Conclusion: </strong>Our research highlights the need for SM billing models beyond time-based metrics, using EHR audit log data to capture the true value of clinical work.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":"230-234"},"PeriodicalIF":4.7,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11648735/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142331360","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A machine learning framework to adjust for learning effects in medical device safety evaluation. 在医疗器械安全评估中调整学习效果的机器学习框架。
IF 4.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-01-01 DOI: 10.1093/jamia/ocae273
Jejo D Koola, Karthik Ramesh, Jialin Mao, Minyoung Ahn, Sharon E Davis, Usha Govindarajulu, Amy M Perkins, Dax Westerman, Henry Ssemaganda, Theodore Speroff, Lucila Ohno-Machado, Craig R Ramsay, Art Sedrakyan, Frederic S Resnic, Michael E Matheny

Objectives: Traditional methods for medical device post-market surveillance often fail to accurately account for operator learning effects, leading to biased assessments of device safety. These methods struggle with non-linearity, complex learning curves, and time-varying covariates, such as physician experience. To address these limitations, we sought to develop a machine learning (ML) framework to detect and adjust for operator learning effects.

Materials and methods: A gradient-boosted decision tree ML method was used to analyze synthetic datasets that replicate the complexity of clinical scenarios involving high-risk medical devices. We designed this process to detect learning effects using a risk-adjusted cumulative sum method, quantify the excess adverse event rate attributable to operator inexperience, and adjust for these alongside patient factors in evaluating device safety signals. To maintain integrity, we employed blinding between data generation and analysis teams. Synthetic data used underlying distributions and patient feature correlations based on clinical data from the Department of Veterans Affairs between 2005 and 2012. We generated 2494 synthetic datasets with widely varying characteristics including number of patient features, operators and institutions, and the operator learning form. Each dataset contained a hypothetical study device, Device B, and a reference device, Device A. We evaluated accuracy in identifying learning effects and identifying and estimating the strength of the device safety signal. Our approach also evaluated different clinically relevant thresholds for safety signal detection.

Results: Our framework accurately identified the presence or absence of learning effects in 93.6% of datasets and correctly determined device safety signals in 93.4% of cases. The estimated device odds ratios' 95% confidence intervals were accurately aligned with the specified ratios in 94.7% of datasets. In contrast, a comparative model excluding operator learning effects significantly underperformed in detecting device signals and in accuracy. Notably, our framework achieved 100% specificity for clinically relevant safety signal thresholds, although sensitivity varied with the threshold applied.

Discussion: A machine learning framework, tailored for the complexities of post-market device evaluation, may provide superior performance compared to standard parametric techniques when operator learning is present.

Conclusion: Demonstrating the capacity of ML to overcome complex evaluative challenges, our framework addresses the limitations of traditional statistical methods in current post-market surveillance processes. By offering a reliable means to detect and adjust for learning effects, it may significantly improve medical device safety evaluation.

目标:医疗器械上市后监督的传统方法往往无法准确考虑操作者的学习效应,导致对器械安全性的评估出现偏差。这些方法难以应对非线性、复杂的学习曲线和时变协变量(如医生经验)。为了解决这些局限性,我们试图开发一种机器学习(ML)框架来检测和调整操作者的学习效果:我们使用梯度提升决策树 ML 方法来分析合成数据集,这些数据集复制了涉及高风险医疗设备的临床场景的复杂性。我们设计了这一流程,以使用风险调整累积和法检测学习效应,量化因操作者经验不足而导致的超额不良事件率,并在评估器械安全信号时将这些因素与患者因素一并考虑。为了保持完整性,我们在数据生成和分析团队之间采用了盲法。合成数据使用了基于退伍军人事务部 2005 年至 2012 年临床数据的基础分布和患者特征相关性。我们生成了 2494 个合成数据集,这些数据集的特征千差万别,包括患者特征数量、操作者和机构以及操作者学习形式。我们评估了识别学习效应以及识别和估计设备安全信号强度的准确性。我们的方法还评估了安全信号检测的不同临床相关阈值:我们的框架在 93.6% 的数据集中准确识别了学习效应的存在与否,并在 93.4% 的案例中正确确定了设备安全信号。在 94.7% 的数据集中,估计设备几率的 95% 置信区间与指定几率准确一致。相比之下,排除了操作员学习效应的比较模型在检测设备信号和准确性方面明显表现不佳。值得注意的是,我们的框架对临床相关安全信号阈值的特异性达到了 100%,但灵敏度随应用的阈值而变化:讨论:针对上市后设备评估的复杂性而定制的机器学习框架,在操作者学习的情况下,可能比标准参数技术提供更优越的性能:我们的框架展示了机器学习克服复杂评估挑战的能力,解决了当前上市后监督流程中传统统计方法的局限性。通过提供检测和调整学习效应的可靠方法,它可以显著改善医疗设备的安全性评估。
{"title":"A machine learning framework to adjust for learning effects in medical device safety evaluation.","authors":"Jejo D Koola, Karthik Ramesh, Jialin Mao, Minyoung Ahn, Sharon E Davis, Usha Govindarajulu, Amy M Perkins, Dax Westerman, Henry Ssemaganda, Theodore Speroff, Lucila Ohno-Machado, Craig R Ramsay, Art Sedrakyan, Frederic S Resnic, Michael E Matheny","doi":"10.1093/jamia/ocae273","DOIUrl":"10.1093/jamia/ocae273","url":null,"abstract":"<p><strong>Objectives: </strong>Traditional methods for medical device post-market surveillance often fail to accurately account for operator learning effects, leading to biased assessments of device safety. These methods struggle with non-linearity, complex learning curves, and time-varying covariates, such as physician experience. To address these limitations, we sought to develop a machine learning (ML) framework to detect and adjust for operator learning effects.</p><p><strong>Materials and methods: </strong>A gradient-boosted decision tree ML method was used to analyze synthetic datasets that replicate the complexity of clinical scenarios involving high-risk medical devices. We designed this process to detect learning effects using a risk-adjusted cumulative sum method, quantify the excess adverse event rate attributable to operator inexperience, and adjust for these alongside patient factors in evaluating device safety signals. To maintain integrity, we employed blinding between data generation and analysis teams. Synthetic data used underlying distributions and patient feature correlations based on clinical data from the Department of Veterans Affairs between 2005 and 2012. We generated 2494 synthetic datasets with widely varying characteristics including number of patient features, operators and institutions, and the operator learning form. Each dataset contained a hypothetical study device, Device B, and a reference device, Device A. We evaluated accuracy in identifying learning effects and identifying and estimating the strength of the device safety signal. Our approach also evaluated different clinically relevant thresholds for safety signal detection.</p><p><strong>Results: </strong>Our framework accurately identified the presence or absence of learning effects in 93.6% of datasets and correctly determined device safety signals in 93.4% of cases. The estimated device odds ratios' 95% confidence intervals were accurately aligned with the specified ratios in 94.7% of datasets. In contrast, a comparative model excluding operator learning effects significantly underperformed in detecting device signals and in accuracy. Notably, our framework achieved 100% specificity for clinically relevant safety signal thresholds, although sensitivity varied with the threshold applied.</p><p><strong>Discussion: </strong>A machine learning framework, tailored for the complexities of post-market device evaluation, may provide superior performance compared to standard parametric techniques when operator learning is present.</p><p><strong>Conclusion: </strong>Demonstrating the capacity of ML to overcome complex evaluative challenges, our framework addresses the limitations of traditional statistical methods in current post-market surveillance processes. By offering a reliable means to detect and adjust for learning effects, it may significantly improve medical device safety evaluation.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":"206-217"},"PeriodicalIF":4.7,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11648715/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142548633","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluating gradient-based explanation methods for neural network ECG analysis using heatmaps. 利用热图评估基于梯度的神经网络心电图分析解释方法。
IF 4.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-01-01 DOI: 10.1093/jamia/ocae280
Andrea Marheim Storås, Steffen Mæland, Jonas L Isaksen, Steven Alexander Hicks, Vajira Thambawita, Claus Graff, Hugo Lewi Hammer, Pål Halvorsen, Michael Alexander Riegler, Jørgen K Kanters

Objective: Evaluate popular explanation methods using heatmap visualizations to explain the predictions of deep neural networks for electrocardiogram (ECG) analysis and provide recommendations for selection of explanations methods.

Materials and methods: A residual deep neural network was trained on ECGs to predict intervals and amplitudes. Nine commonly used explanation methods (Saliency, Deconvolution, Guided backpropagation, Gradient SHAP, SmoothGrad, Input × gradient, DeepLIFT, Integrated gradients, GradCAM) were qualitatively evaluated by medical experts and objectively evaluated using a perturbation-based method.

Results: No single explanation method consistently outperformed the other methods, but some methods were clearly inferior. We found considerable disagreement between the human expert evaluation and the objective evaluation by perturbation.

Discussion: The best explanation method depended on the ECG measure. To ensure that future explanations of deep neural networks for medical data analyses are useful to medical experts, data scientists developing new explanation methods should collaborate tightly with domain experts. Because there is no explanation method that performs best in all use cases, several methods should be applied.

Conclusion: Several explanation methods should be used to determine the most suitable approach.

目的:使用热图可视化评估流行的解释方法,以解释用于心电图(ECG)分析的深度神经网络的预测,并为解释方法的选择提供建议:在心电图上训练残差深度神经网络,以预测间隔和振幅。医学专家对九种常用解释方法(Saliency、Deconvolution、Guided backpropagation、Gradient SHAP、SmoothGrad、Input × gradient、DeepLIFT、Integrated gradients、GradCAM)进行了定性评估,并使用基于扰动的方法进行了客观评估:结果:没有一种解释方法的性能始终优于其他方法,但有些方法的性能明显不如其他方法。我们发现专家评价和扰动客观评价之间存在很大分歧:讨论:最佳解释方法取决于心电图测量。为确保未来用于医学数据分析的深度神经网络解释对医学专家有用,开发新解释方法的数据科学家应与领域专家密切合作。由于没有一种解释方法能在所有使用情况下都表现最佳,因此应采用多种方法:结论:应采用多种解释方法来确定最合适的方法。
{"title":"Evaluating gradient-based explanation methods for neural network ECG analysis using heatmaps.","authors":"Andrea Marheim Storås, Steffen Mæland, Jonas L Isaksen, Steven Alexander Hicks, Vajira Thambawita, Claus Graff, Hugo Lewi Hammer, Pål Halvorsen, Michael Alexander Riegler, Jørgen K Kanters","doi":"10.1093/jamia/ocae280","DOIUrl":"10.1093/jamia/ocae280","url":null,"abstract":"<p><strong>Objective: </strong>Evaluate popular explanation methods using heatmap visualizations to explain the predictions of deep neural networks for electrocardiogram (ECG) analysis and provide recommendations for selection of explanations methods.</p><p><strong>Materials and methods: </strong>A residual deep neural network was trained on ECGs to predict intervals and amplitudes. Nine commonly used explanation methods (Saliency, Deconvolution, Guided backpropagation, Gradient SHAP, SmoothGrad, Input × gradient, DeepLIFT, Integrated gradients, GradCAM) were qualitatively evaluated by medical experts and objectively evaluated using a perturbation-based method.</p><p><strong>Results: </strong>No single explanation method consistently outperformed the other methods, but some methods were clearly inferior. We found considerable disagreement between the human expert evaluation and the objective evaluation by perturbation.</p><p><strong>Discussion: </strong>The best explanation method depended on the ECG measure. To ensure that future explanations of deep neural networks for medical data analyses are useful to medical experts, data scientists developing new explanation methods should collaborate tightly with domain experts. Because there is no explanation method that performs best in all use cases, several methods should be applied.</p><p><strong>Conclusion: </strong>Several explanation methods should be used to determine the most suitable approach.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":"79-88"},"PeriodicalIF":4.7,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11648713/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142591550","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Extracting social support and social isolation information from clinical psychiatry notes: comparing a rule-based natural language processing system and a large language model. 从临床精神病学笔记中提取社会支持和社会隔离信息:比较基于规则的自然语言处理系统和大型语言模型。
IF 4.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-01-01 DOI: 10.1093/jamia/ocae260
Braja Gopal Patra, Lauren A Lepow, Praneet Kasi Reddy Jagadeesh Kumar, Veer Vekaria, Mohit Manoj Sharma, Prakash Adekkanattu, Brian Fennessy, Gavin Hynes, Isotta Landi, Jorge A Sanchez-Ruiz, Euijung Ryu, Joanna M Biernacka, Girish N Nadkarni, Ardesheer Talati, Myrna Weissman, Mark Olfson, J John Mann, Yiye Zhang, Alexander W Charney, Jyotishman Pathak

Objectives: Social support (SS) and social isolation (SI) are social determinants of health (SDOH) associated with psychiatric outcomes. In electronic health records (EHRs), individual-level SS/SI is typically documented in narrative clinical notes rather than as structured coded data. Natural language processing (NLP) algorithms can automate the otherwise labor-intensive process of extraction of such information.

Materials and methods: Psychiatric encounter notes from Mount Sinai Health System (MSHS, n = 300) and Weill Cornell Medicine (WCM, n = 225) were annotated to create a gold-standard corpus. A rule-based system (RBS) involving lexicons and a large language model (LLM) using FLAN-T5-XL were developed to identify mentions of SS and SI and their subcategories (eg, social network, instrumental support, and loneliness).

Results: For extracting SS/SI, the RBS obtained higher macroaveraged F1-scores than the LLM at both MSHS (0.89 versus 0.65) and WCM (0.85 versus 0.82). For extracting the subcategories, the RBS also outperformed the LLM at both MSHS (0.90 versus 0.62) and WCM (0.82 versus 0.81).

Discussion and conclusion: Unexpectedly, the RBS outperformed the LLMs across all metrics. An intensive review demonstrates that this finding is due to the divergent approach taken by the RBS and LLM. The RBS was designed and refined to follow the same specific rules as the gold-standard annotations. Conversely, the LLM was more inclusive with categorization and conformed to common English-language understanding. Both approaches offer advantages, although additional replication studies are warranted.

目的:社会支持(SS)和社会隔离(SI社会支持(SS)和社会隔离(SI)是与精神疾病结果相关的健康社会决定因素(SDOH)。在电子健康记录(EHR)中,个人层面的社会支持/社会隔离通常记录在叙述性临床笔记中,而非结构化编码数据。自然语言处理(NLP)算法可以自动完成提取此类信息的劳动密集型过程:对西奈山医疗系统(MSHS,n = 300)和威尔康奈尔医学中心(WCM,n = 225)的精神病就诊记录进行注释,以创建黄金标准语料库。使用 FLAN-T5-XL 开发了一个基于规则的系统 (RBS),其中包括词典和大语言模型 (LLM),用于识别 SS 和 SI 及其子类别(如社交网络、工具支持和孤独感):在提取 SS/SI 时,RBS 在 MSHS(0.89 对 0.65)和 WCM(0.85 对 0.82)的宏观平均 F1 分数均高于 LLM。在提取子类别方面,RBS 在 MSHS(0.90 对 0.62)和 WCM(0.82 对 0.81)上的表现也优于 LLM:出乎意料的是,RBS 在所有指标上都优于 LLM。深入研究表明,这一发现是由于 RBS 和 LLM 采用了不同的方法。RBS 的设计和改进遵循了与黄金标准注释相同的特定规则。相反,LLM 在分类方面更具包容性,符合英语的一般理解。这两种方法都具有优势,不过还需要进行更多的重复研究。
{"title":"Extracting social support and social isolation information from clinical psychiatry notes: comparing a rule-based natural language processing system and a large language model.","authors":"Braja Gopal Patra, Lauren A Lepow, Praneet Kasi Reddy Jagadeesh Kumar, Veer Vekaria, Mohit Manoj Sharma, Prakash Adekkanattu, Brian Fennessy, Gavin Hynes, Isotta Landi, Jorge A Sanchez-Ruiz, Euijung Ryu, Joanna M Biernacka, Girish N Nadkarni, Ardesheer Talati, Myrna Weissman, Mark Olfson, J John Mann, Yiye Zhang, Alexander W Charney, Jyotishman Pathak","doi":"10.1093/jamia/ocae260","DOIUrl":"10.1093/jamia/ocae260","url":null,"abstract":"<p><strong>Objectives: </strong>Social support (SS) and social isolation (SI) are social determinants of health (SDOH) associated with psychiatric outcomes. In electronic health records (EHRs), individual-level SS/SI is typically documented in narrative clinical notes rather than as structured coded data. Natural language processing (NLP) algorithms can automate the otherwise labor-intensive process of extraction of such information.</p><p><strong>Materials and methods: </strong>Psychiatric encounter notes from Mount Sinai Health System (MSHS, n = 300) and Weill Cornell Medicine (WCM, n = 225) were annotated to create a gold-standard corpus. A rule-based system (RBS) involving lexicons and a large language model (LLM) using FLAN-T5-XL were developed to identify mentions of SS and SI and their subcategories (eg, social network, instrumental support, and loneliness).</p><p><strong>Results: </strong>For extracting SS/SI, the RBS obtained higher macroaveraged F1-scores than the LLM at both MSHS (0.89 versus 0.65) and WCM (0.85 versus 0.82). For extracting the subcategories, the RBS also outperformed the LLM at both MSHS (0.90 versus 0.62) and WCM (0.82 versus 0.81).</p><p><strong>Discussion and conclusion: </strong>Unexpectedly, the RBS outperformed the LLMs across all metrics. An intensive review demonstrates that this finding is due to the divergent approach taken by the RBS and LLM. The RBS was designed and refined to follow the same specific rules as the gold-standard annotations. Conversely, the LLM was more inclusive with categorization and conformed to common English-language understanding. Both approaches offer advantages, although additional replication studies are warranted.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":"218-226"},"PeriodicalIF":4.7,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11648716/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142479224","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correction to: Measuring interpersonal firearm violence: natural language processing methods to address limitations in criminal charge data. 更正:衡量人际火器暴力:解决刑事指控数据局限性的自然语言处理方法。
IF 4.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-01-01 DOI: 10.1093/jamia/ocae268
{"title":"Correction to: Measuring interpersonal firearm violence: natural language processing methods to address limitations in criminal charge data.","authors":"","doi":"10.1093/jamia/ocae268","DOIUrl":"10.1093/jamia/ocae268","url":null,"abstract":"","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":"264"},"PeriodicalIF":4.7,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11648706/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142479223","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of the American Medical Informatics Association
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1