首页 > 最新文献

Journal of the American Medical Informatics Association最新文献

英文 中文
Evaluating gradient-based explanation methods for neural network ECG analysis using heatmaps. 利用热图评估基于梯度的神经网络心电图分析解释方法。
IF 4.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-06 DOI: 10.1093/jamia/ocae280
Andrea Marheim Storås, Steffen Mæland, Jonas L Isaksen, Steven Alexander Hicks, Vajira Thambawita, Claus Graff, Hugo Lewi Hammer, Pål Halvorsen, Michael Alexander Riegler, Jørgen K Kanters

Objective: Evaluate popular explanation methods using heatmap visualizations to explain the predictions of deep neural networks for electrocardiogram (ECG) analysis and provide recommendations for selection of explanations methods.

Materials and methods: A residual deep neural network was trained on ECGs to predict intervals and amplitudes. Nine commonly used explanation methods (Saliency, Deconvolution, Guided backpropagation, Gradient SHAP, SmoothGrad, Input × gradient, DeepLIFT, Integrated gradients, GradCAM) were qualitatively evaluated by medical experts and objectively evaluated using a perturbation-based method.

Results: No single explanation method consistently outperformed the other methods, but some methods were clearly inferior. We found considerable disagreement between the human expert evaluation and the objective evaluation by perturbation.

Discussion: The best explanation method depended on the ECG measure. To ensure that future explanations of deep neural networks for medical data analyses are useful to medical experts, data scientists developing new explanation methods should collaborate tightly with domain experts. Because there is no explanation method that performs best in all use cases, several methods should be applied.

Conclusion: Several explanation methods should be used to determine the most suitable approach.

目的:使用热图可视化评估流行的解释方法,以解释用于心电图(ECG)分析的深度神经网络的预测,并为解释方法的选择提供建议:在心电图上训练残差深度神经网络,以预测间隔和振幅。医学专家对九种常用解释方法(Saliency、Deconvolution、Guided backpropagation、Gradient SHAP、SmoothGrad、Input × gradient、DeepLIFT、Integrated gradients、GradCAM)进行了定性评估,并使用基于扰动的方法进行了客观评估:结果:没有一种解释方法的性能始终优于其他方法,但有些方法的性能明显不如其他方法。我们发现专家评价和扰动客观评价之间存在很大分歧:讨论:最佳解释方法取决于心电图测量。为确保未来用于医学数据分析的深度神经网络解释对医学专家有用,开发新解释方法的数据科学家应与领域专家密切合作。由于没有一种解释方法能在所有使用情况下都表现最佳,因此应采用多种方法:结论:应采用多种解释方法来确定最合适的方法。
{"title":"Evaluating gradient-based explanation methods for neural network ECG analysis using heatmaps.","authors":"Andrea Marheim Storås, Steffen Mæland, Jonas L Isaksen, Steven Alexander Hicks, Vajira Thambawita, Claus Graff, Hugo Lewi Hammer, Pål Halvorsen, Michael Alexander Riegler, Jørgen K Kanters","doi":"10.1093/jamia/ocae280","DOIUrl":"https://doi.org/10.1093/jamia/ocae280","url":null,"abstract":"<p><strong>Objective: </strong>Evaluate popular explanation methods using heatmap visualizations to explain the predictions of deep neural networks for electrocardiogram (ECG) analysis and provide recommendations for selection of explanations methods.</p><p><strong>Materials and methods: </strong>A residual deep neural network was trained on ECGs to predict intervals and amplitudes. Nine commonly used explanation methods (Saliency, Deconvolution, Guided backpropagation, Gradient SHAP, SmoothGrad, Input × gradient, DeepLIFT, Integrated gradients, GradCAM) were qualitatively evaluated by medical experts and objectively evaluated using a perturbation-based method.</p><p><strong>Results: </strong>No single explanation method consistently outperformed the other methods, but some methods were clearly inferior. We found considerable disagreement between the human expert evaluation and the objective evaluation by perturbation.</p><p><strong>Discussion: </strong>The best explanation method depended on the ECG measure. To ensure that future explanations of deep neural networks for medical data analyses are useful to medical experts, data scientists developing new explanation methods should collaborate tightly with domain experts. Because there is no explanation method that performs best in all use cases, several methods should be applied.</p><p><strong>Conclusion: </strong>Several explanation methods should be used to determine the most suitable approach.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.7,"publicationDate":"2024-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142591550","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Trending in the right direction: critical access hospitals increased adoption of advanced electronic health record functions from 2018 to 2023. 趋势方向正确:2018 年至 2023 年,关键通道医院增加了先进电子病历功能的采用。
IF 4.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-06 DOI: 10.1093/jamia/ocae267
Nate C Apathy, A Jay Holmgren, Paige Nong, Julia Adler-Milstein, Jordan Everson

Objectives: We analyzed trends in adoption of advanced patient engagement and clinical data analytics functionalities among critical access hospitals (CAHs) and non-CAHs to assess how historical gaps have changed.

Materials and methods: We used 2014, 2018, and 2023 data from the American Hospital Association Annual Survey IT Supplement to measure differences in adoption rates (ie, the "adoption gap") of patient engagement and clinical data analytics functionalities across CAHs and non-CAHs. We measured changes over time in CAH and non-CAH adoption of 6 "core" clinical data analytics functionalities, 5 "core" patient engagement functionalities, 5 new patient engagement functionalities, and 3 bulk data export use cases. We constructed 2 composite measures for core functionalities and analyzed adoption for other functionalities individually.

Results: Core functionality adoption increased from 21% of CAHs in 2014 to 56% in 2023 for clinical data analytics and 18% to 49% for patient engagement. The CAH adoption gap in both domains narrowed from 2018 to 2023 (both P < .01). More than 90% of all hospitals had adopted viewing and downloading electronic data and clinical notes by 2023. The largest CAH adoption gaps in 2023 were for Fast Healthcare Interoperability Resources (FHIR) bulk export use cases (eg, analytics and reporting: 63% of CAHs, 81% of non-CAHs, P < .001).

Discussion: Adoption of advanced electronic health record functionalities has increased for CAHs and non-CAHs, and some adoption gaps have been closed since 2018. However, CAHs may continue to struggle with clinical data analytics and FHIR-based functionalities.

Conclusion: Some crucial patient engagement functionalities have reached near-universal adoption; however, policymakers should consider programs to support CAHs in closing remaining adoption gaps.

目的:我们分析了重症监护医院(CAH)和非重症监护医院采用先进的患者参与和临床数据分析功能的趋势:我们分析了关键通道医院(CAH)和非CAH采用先进的患者参与和临床数据分析功能的趋势,以评估历史差距的变化情况:我们使用《美国医院协会年度调查 IT 补充报告》中的 2014 年、2018 年和 2023 年数据来衡量重症监护医院和非重症监护医院在患者参与和临床数据分析功能采用率方面的差异(即 "采用差距")。我们测量了 CAH 和非 CAH 对 6 个 "核心 "临床数据分析功能、5 个 "核心 "患者参与功能、5 个新的患者参与功能和 3 个批量数据导出用例的采用率随时间的变化情况。我们为核心功能构建了两个综合衡量标准,并对其他功能的采用情况进行了单独分析:采用核心功能的 CAH 在临床数据分析方面的比例从 2014 年的 21% 提高到 2023 年的 56%,在患者参与方面的比例从 18% 提高到 49%。从 2018 年到 2023 年,CAH 在这两个领域的采用率差距均有所缩小(均为 P 讨论):CAH 和非 CAH 对高级电子病历功能的采用率有所提高,自 2018 年以来,一些采用率差距已经缩小。然而,CAH 可能会继续在临床数据分析和基于 FHIR 的功能方面苦苦挣扎:一些重要的患者参与功能已接近普遍采用;然而,政策制定者应考虑制定计划,支持 CAH 缩小剩余的采用差距。
{"title":"Trending in the right direction: critical access hospitals increased adoption of advanced electronic health record functions from 2018 to 2023.","authors":"Nate C Apathy, A Jay Holmgren, Paige Nong, Julia Adler-Milstein, Jordan Everson","doi":"10.1093/jamia/ocae267","DOIUrl":"https://doi.org/10.1093/jamia/ocae267","url":null,"abstract":"<p><strong>Objectives: </strong>We analyzed trends in adoption of advanced patient engagement and clinical data analytics functionalities among critical access hospitals (CAHs) and non-CAHs to assess how historical gaps have changed.</p><p><strong>Materials and methods: </strong>We used 2014, 2018, and 2023 data from the American Hospital Association Annual Survey IT Supplement to measure differences in adoption rates (ie, the \"adoption gap\") of patient engagement and clinical data analytics functionalities across CAHs and non-CAHs. We measured changes over time in CAH and non-CAH adoption of 6 \"core\" clinical data analytics functionalities, 5 \"core\" patient engagement functionalities, 5 new patient engagement functionalities, and 3 bulk data export use cases. We constructed 2 composite measures for core functionalities and analyzed adoption for other functionalities individually.</p><p><strong>Results: </strong>Core functionality adoption increased from 21% of CAHs in 2014 to 56% in 2023 for clinical data analytics and 18% to 49% for patient engagement. The CAH adoption gap in both domains narrowed from 2018 to 2023 (both P < .01). More than 90% of all hospitals had adopted viewing and downloading electronic data and clinical notes by 2023. The largest CAH adoption gaps in 2023 were for Fast Healthcare Interoperability Resources (FHIR) bulk export use cases (eg, analytics and reporting: 63% of CAHs, 81% of non-CAHs, P < .001).</p><p><strong>Discussion: </strong>Adoption of advanced electronic health record functionalities has increased for CAHs and non-CAHs, and some adoption gaps have been closed since 2018. However, CAHs may continue to struggle with clinical data analytics and FHIR-based functionalities.</p><p><strong>Conclusion: </strong>Some crucial patient engagement functionalities have reached near-universal adoption; however, policymakers should consider programs to support CAHs in closing remaining adoption gaps.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.7,"publicationDate":"2024-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142591482","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The role of routine and structured social needs data collection in improving care in US hospitals. 常规和结构化社会需求数据收集在改善美国医院护理方面的作用。
IF 4.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-06 DOI: 10.1093/jamia/ocae279
Chelsea Richwine, Vaishali Patel, Jordan Everson, Bradley Iott

Objectives: To understand how health-related social needs (HRSN) data are collected at US hospitals and implications for use.

Materials and methods: Using 2023 nationally representative survey data on US hospitals (N = 2775), we described hospitals' routine and structured collection and use of HRSN data and examined the relationship between methods of data collection and specific uses. Multivariate logistic regression was used to identify characteristics associated with data collection and use and understand how methods of data collection relate to use.

Results: In 2023, 88% of hospitals collected HRSN data (64% routinely, 72% structured). While hospitals commonly used data for internal purposes (eg, discharge planning, 79%), those that collected data routinely and in a structured format (58%) used data for purposes involving coordination or exchange with other organizations (eg, making referrals, 74%) at higher rates than hospitals that collected data but not routinely or in a non-structured format (eg, 93% vs 67% for referrals, P< .05). In multivariate regression, routine and structured data collection was positively associated with all uses of data examined. Hospital location, ownership, system-affiliation, value-based care participation, and critical access designation were associated with HRSN data collection, but only system-affiliation was consistently (positively) associated with use.

Discussion: While most hospitals screen for social needs, fewer collect data routinely and in a structured format that would facilitate downstream use. Routine and structured data collection was associated with greater use, particularly for secondary purposes.

Conclusion: Routine and structured screening may result in more actionable data that facilitates use for various purposes that support patient care and improve community and population health, indicating the importance of continuing efforts to increase routine screening and standardize HRSN data collection.

目的:了解美国医院如何收集与健康相关的社会需求(HRSN)数据及其使用意义:了解美国医院如何收集与健康相关的社会需求(HRSN)数据及其对使用的影响:利用 2023 年美国医院的全国代表性调查数据(N = 2775),我们描述了医院对 HRSN 数据的常规和结构化收集与使用情况,并研究了数据收集方法与具体使用之间的关系。我们使用多变量逻辑回归来确定与数据收集和使用相关的特征,并了解数据收集方法与使用之间的关系:2023 年,88% 的医院收集了 HRSN 数据(64% 为常规数据,72% 为结构化数据)。虽然医院通常将数据用于内部目的(如出院计划,79%),但那些常规收集数据并采用结构化格式的医院(58%)将数据用于与其他组织协调或交流的目的(如转诊,74%),其使用率高于那些未常规收集数据或采用非结构化格式的医院(如转诊,93% vs 67%,P< .05)。在多变量回归中,常规和结构化的数据收集与数据的所有用途均呈正相关。医院位置、所有权、系统隶属关系、基于价值的护理参与度和关键准入指定与 HRSN 数据收集有关,但只有系统隶属关系与数据使用持续(正)相关:讨论:虽然大多数医院都会对社会需求进行筛查,但以常规和结构化格式收集数据以方便下游使用的医院较少。常规和结构化的数据收集与更大程度的使用有关,尤其是用于次要目的:常规和结构化筛查可能会产生更多可操作的数据,便于用于支持患者护理、改善社区和人口健康的各种目的,这表明继续努力增加常规筛查和规范 HRSN 数据收集的重要性。
{"title":"The role of routine and structured social needs data collection in improving care in US hospitals.","authors":"Chelsea Richwine, Vaishali Patel, Jordan Everson, Bradley Iott","doi":"10.1093/jamia/ocae279","DOIUrl":"https://doi.org/10.1093/jamia/ocae279","url":null,"abstract":"<p><strong>Objectives: </strong>To understand how health-related social needs (HRSN) data are collected at US hospitals and implications for use.</p><p><strong>Materials and methods: </strong>Using 2023 nationally representative survey data on US hospitals (N = 2775), we described hospitals' routine and structured collection and use of HRSN data and examined the relationship between methods of data collection and specific uses. Multivariate logistic regression was used to identify characteristics associated with data collection and use and understand how methods of data collection relate to use.</p><p><strong>Results: </strong>In 2023, 88% of hospitals collected HRSN data (64% routinely, 72% structured). While hospitals commonly used data for internal purposes (eg, discharge planning, 79%), those that collected data routinely and in a structured format (58%) used data for purposes involving coordination or exchange with other organizations (eg, making referrals, 74%) at higher rates than hospitals that collected data but not routinely or in a non-structured format (eg, 93% vs 67% for referrals, P< .05). In multivariate regression, routine and structured data collection was positively associated with all uses of data examined. Hospital location, ownership, system-affiliation, value-based care participation, and critical access designation were associated with HRSN data collection, but only system-affiliation was consistently (positively) associated with use.</p><p><strong>Discussion: </strong>While most hospitals screen for social needs, fewer collect data routinely and in a structured format that would facilitate downstream use. Routine and structured data collection was associated with greater use, particularly for secondary purposes.</p><p><strong>Conclusion: </strong>Routine and structured screening may result in more actionable data that facilitates use for various purposes that support patient care and improve community and population health, indicating the importance of continuing efforts to increase routine screening and standardize HRSN data collection.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.7,"publicationDate":"2024-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142591563","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Is ChatGPT worthy enough for provisioning clinical decision support? ChatGPT 是否足以提供临床决策支持?
IF 4.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-05 DOI: 10.1093/jamia/ocae282
Partha Pratim Ray
{"title":"Is ChatGPT worthy enough for provisioning clinical decision support?","authors":"Partha Pratim Ray","doi":"10.1093/jamia/ocae282","DOIUrl":"https://doi.org/10.1093/jamia/ocae282","url":null,"abstract":"","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.7,"publicationDate":"2024-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142583648","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correction to: Artificial intelligence for optimizing recruitment and retention in clinical trials: a scoping review. 更正:人工智能优化临床试验的招募和保留:范围综述。
IF 4.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-05 DOI: 10.1093/jamia/ocae283
{"title":"Correction to: Artificial intelligence for optimizing recruitment and retention in clinical trials: a scoping review.","authors":"","doi":"10.1093/jamia/ocae283","DOIUrl":"https://doi.org/10.1093/jamia/ocae283","url":null,"abstract":"","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.7,"publicationDate":"2024-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142583537","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A GIS software-based method to identify public health data belonging to address-defined communities. 一种基于地理信息系统软件的方法,用于识别属于地址定义社区的公共卫生数据。
IF 4.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-01 DOI: 10.1093/jamia/ocae235
Amanda M Lam, Mariana C Singletary, Theresa Cullen

Objective: This communication presents the results of defining a tribal health jurisdiction by a combination of tribal affiliation (TA) and case address.

Materials and methods: Through a county-tribal partnership, Geographic Information System (GIS) software and custom code were used to extract tribal data from county data by identifying reservation addresses in county extracts of COVID-19 case records from December 30, 2019, to December 31, 2022 (n = 374 653) and COVID-19 vaccination records from December 1, 2020, to April 18, 2023 (n = 2 355 058).

Results: The tool identified 1.91 times as many case records and 3.76 times as many vaccination records as filtering by TA alone.

Discussion and conclusion: This method of identifying communities by patient address, in combination with TA and enrollment, can help tribal health jurisdictions attain equitable access to public health data, when done in partnership with a data sharing agreement. This methodology has potential applications for other populations underrepresented in public health and clinical research.

目的本通报介绍了通过部落隶属关系和病例地址组合定义部落卫生管辖区的结果:通过县与部落合作,使用 GIS 软件和自定义代码从县数据中提取部落数据,方法是在县提取的 2019 年 12 月 30 日至 2022 年 12 月 31 日的 COVID-19 病例记录(n = 374,653 个)和 2020 年 12 月 1 日至 2023 年 4 月 18 日的 COVID-19 疫苗接种记录(n = 2,355,058 个)中识别保留地地址:结果:该工具识别出的病例记录和疫苗接种记录分别是通过部落隶属关系筛选出的病例记录和疫苗接种记录的 1.91 倍和 3.76 倍:这种通过患者地址识别社区的方法与部落隶属关系和注册情况相结合,如果与数据共享协议合作,可以帮助部落卫生辖区公平地获取公共卫生数据。这种方法还有可能应用于其他在公共卫生和临床研究中代表性不足的人群。
{"title":"A GIS software-based method to identify public health data belonging to address-defined communities.","authors":"Amanda M Lam, Mariana C Singletary, Theresa Cullen","doi":"10.1093/jamia/ocae235","DOIUrl":"10.1093/jamia/ocae235","url":null,"abstract":"<p><strong>Objective: </strong>This communication presents the results of defining a tribal health jurisdiction by a combination of tribal affiliation (TA) and case address.</p><p><strong>Materials and methods: </strong>Through a county-tribal partnership, Geographic Information System (GIS) software and custom code were used to extract tribal data from county data by identifying reservation addresses in county extracts of COVID-19 case records from December 30, 2019, to December 31, 2022 (n = 374 653) and COVID-19 vaccination records from December 1, 2020, to April 18, 2023 (n = 2 355 058).</p><p><strong>Results: </strong>The tool identified 1.91 times as many case records and 3.76 times as many vaccination records as filtering by TA alone.</p><p><strong>Discussion and conclusion: </strong>This method of identifying communities by patient address, in combination with TA and enrollment, can help tribal health jurisdictions attain equitable access to public health data, when done in partnership with a data sharing agreement. This methodology has potential applications for other populations underrepresented in public health and clinical research.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":"2716-2721"},"PeriodicalIF":4.7,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11491637/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142057033","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A machine learning framework to adjust for learning effects in medical device safety evaluation. 在医疗器械安全评估中调整学习效果的机器学习框架。
IF 4.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-10-29 DOI: 10.1093/jamia/ocae273
Jejo D Koola, Karthik Ramesh, Jialin Mao, Minyoung Ahn, Sharon E Davis, Usha Govindarajulu, Amy M Perkins, Dax Westerman, Henry Ssemaganda, Theodore Speroff, Lucila Ohno-Machado, Craig R Ramsay, Art Sedrakyan, Frederic S Resnic, Michael E Matheny

Objectives: Traditional methods for medical device post-market surveillance often fail to accurately account for operator learning effects, leading to biased assessments of device safety. These methods struggle with non-linearity, complex learning curves, and time-varying covariates, such as physician experience. To address these limitations, we sought to develop a machine learning (ML) framework to detect and adjust for operator learning effects.

Materials and methods: A gradient-boosted decision tree ML method was used to analyze synthetic datasets that replicate the complexity of clinical scenarios involving high-risk medical devices. We designed this process to detect learning effects using a risk-adjusted cumulative sum method, quantify the excess adverse event rate attributable to operator inexperience, and adjust for these alongside patient factors in evaluating device safety signals. To maintain integrity, we employed blinding between data generation and analysis teams. Synthetic data used underlying distributions and patient feature correlations based on clinical data from the Department of Veterans Affairs between 2005 and 2012. We generated 2494 synthetic datasets with widely varying characteristics including number of patient features, operators and institutions, and the operator learning form. Each dataset contained a hypothetical study device, Device B, and a reference device, Device A. We evaluated accuracy in identifying learning effects and identifying and estimating the strength of the device safety signal. Our approach also evaluated different clinically relevant thresholds for safety signal detection.

Results: Our framework accurately identified the presence or absence of learning effects in 93.6% of datasets and correctly determined device safety signals in 93.4% of cases. The estimated device odds ratios' 95% confidence intervals were accurately aligned with the specified ratios in 94.7% of datasets. In contrast, a comparative model excluding operator learning effects significantly underperformed in detecting device signals and in accuracy. Notably, our framework achieved 100% specificity for clinically relevant safety signal thresholds, although sensitivity varied with the threshold applied.

Discussion: A machine learning framework, tailored for the complexities of post-market device evaluation, may provide superior performance compared to standard parametric techniques when operator learning is present.

Conclusion: Demonstrating the capacity of ML to overcome complex evaluative challenges, our framework addresses the limitations of traditional statistical methods in current post-market surveillance processes. By offering a reliable means to detect and adjust for learning effects, it may significantly improve medical device safety evaluation.

目标:医疗器械上市后监督的传统方法往往无法准确考虑操作者的学习效应,导致对器械安全性的评估出现偏差。这些方法难以应对非线性、复杂的学习曲线和时变协变量(如医生经验)。为了解决这些局限性,我们试图开发一种机器学习(ML)框架来检测和调整操作者的学习效果:我们使用梯度提升决策树 ML 方法来分析合成数据集,这些数据集复制了涉及高风险医疗设备的临床场景的复杂性。我们设计了这一流程,以使用风险调整累积和法检测学习效应,量化因操作者经验不足而导致的超额不良事件率,并在评估器械安全信号时将这些因素与患者因素一并考虑。为了保持完整性,我们在数据生成和分析团队之间采用了盲法。合成数据使用了基于退伍军人事务部 2005 年至 2012 年临床数据的基础分布和患者特征相关性。我们生成了 2494 个合成数据集,这些数据集的特征千差万别,包括患者特征数量、操作者和机构以及操作者学习形式。我们评估了识别学习效应以及识别和估计设备安全信号强度的准确性。我们的方法还评估了安全信号检测的不同临床相关阈值:我们的框架在 93.6% 的数据集中准确识别了学习效应的存在与否,并在 93.4% 的案例中正确确定了设备安全信号。在 94.7% 的数据集中,估计设备几率的 95% 置信区间与指定几率准确一致。相比之下,排除了操作员学习效应的比较模型在检测设备信号和准确性方面明显表现不佳。值得注意的是,我们的框架对临床相关安全信号阈值的特异性达到了 100%,但灵敏度随应用的阈值而变化:讨论:针对上市后设备评估的复杂性而定制的机器学习框架,在操作者学习的情况下,可能比标准参数技术提供更优越的性能:我们的框架展示了机器学习克服复杂评估挑战的能力,解决了当前上市后监督流程中传统统计方法的局限性。通过提供检测和调整学习效应的可靠方法,它可以显著改善医疗设备的安全性评估。
{"title":"A machine learning framework to adjust for learning effects in medical device safety evaluation.","authors":"Jejo D Koola, Karthik Ramesh, Jialin Mao, Minyoung Ahn, Sharon E Davis, Usha Govindarajulu, Amy M Perkins, Dax Westerman, Henry Ssemaganda, Theodore Speroff, Lucila Ohno-Machado, Craig R Ramsay, Art Sedrakyan, Frederic S Resnic, Michael E Matheny","doi":"10.1093/jamia/ocae273","DOIUrl":"https://doi.org/10.1093/jamia/ocae273","url":null,"abstract":"<p><strong>Objectives: </strong>Traditional methods for medical device post-market surveillance often fail to accurately account for operator learning effects, leading to biased assessments of device safety. These methods struggle with non-linearity, complex learning curves, and time-varying covariates, such as physician experience. To address these limitations, we sought to develop a machine learning (ML) framework to detect and adjust for operator learning effects.</p><p><strong>Materials and methods: </strong>A gradient-boosted decision tree ML method was used to analyze synthetic datasets that replicate the complexity of clinical scenarios involving high-risk medical devices. We designed this process to detect learning effects using a risk-adjusted cumulative sum method, quantify the excess adverse event rate attributable to operator inexperience, and adjust for these alongside patient factors in evaluating device safety signals. To maintain integrity, we employed blinding between data generation and analysis teams. Synthetic data used underlying distributions and patient feature correlations based on clinical data from the Department of Veterans Affairs between 2005 and 2012. We generated 2494 synthetic datasets with widely varying characteristics including number of patient features, operators and institutions, and the operator learning form. Each dataset contained a hypothetical study device, Device B, and a reference device, Device A. We evaluated accuracy in identifying learning effects and identifying and estimating the strength of the device safety signal. Our approach also evaluated different clinically relevant thresholds for safety signal detection.</p><p><strong>Results: </strong>Our framework accurately identified the presence or absence of learning effects in 93.6% of datasets and correctly determined device safety signals in 93.4% of cases. The estimated device odds ratios' 95% confidence intervals were accurately aligned with the specified ratios in 94.7% of datasets. In contrast, a comparative model excluding operator learning effects significantly underperformed in detecting device signals and in accuracy. Notably, our framework achieved 100% specificity for clinically relevant safety signal thresholds, although sensitivity varied with the threshold applied.</p><p><strong>Discussion: </strong>A machine learning framework, tailored for the complexities of post-market device evaluation, may provide superior performance compared to standard parametric techniques when operator learning is present.</p><p><strong>Conclusion: </strong>Demonstrating the capacity of ML to overcome complex evaluative challenges, our framework addresses the limitations of traditional statistical methods in current post-market surveillance processes. By offering a reliable means to detect and adjust for learning effects, it may significantly improve medical device safety evaluation.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.7,"publicationDate":"2024-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142548633","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Real-world federated learning in radiology: hurdles to overcome and benefits to gain. 放射学中的真实世界联合学习:需要克服的障碍和获得的益处。
IF 4.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-10-25 DOI: 10.1093/jamia/ocae259
Markus Ralf Bujotzek, Ünal Akünal, Stefan Denner, Peter Neher, Maximilian Zenk, Eric Frodl, Astha Jaiswal, Moon Kim, Nicolai R Krekiehn, Manuel Nickel, Richard Ruppel, Marcus Both, Felix Döllinger, Marcel Opitz, Thorsten Persigehl, Jens Kleesiek, Tobias Penzkofer, Klaus Maier-Hein, Andreas Bucher, Rickmer Braren

Objective: Federated Learning (FL) enables collaborative model training while keeping data locally. Currently, most FL studies in radiology are conducted in simulated environments due to numerous hurdles impeding its translation into practice. The few existing real-world FL initiatives rarely communicate specific measures taken to overcome these hurdles. To bridge this significant knowledge gap, we propose a comprehensive guide for real-world FL in radiology. Minding efforts to implement real-world FL, there is a lack of comprehensive assessments comparing FL to less complex alternatives in challenging real-world settings, which we address through extensive benchmarking.

Materials and methods: We developed our own FL infrastructure within the German Radiological Cooperative Network (RACOON) and demonstrated its functionality by training FL models on lung pathology segmentation tasks across six university hospitals. Insights gained while establishing our FL initiative and running the extensive benchmark experiments were compiled and categorized into the guide.

Results: The proposed guide outlines essential steps, identified hurdles, and implemented solutions for establishing successful FL initiatives conducting real-world experiments. Our experimental results prove the practical relevance of our guide and show that FL outperforms less complex alternatives in all evaluation scenarios.

Discussion and conclusion: Our findings justify the efforts required to translate FL into real-world applications by demonstrating advantageous performance over alternative approaches. Additionally, they emphasize the importance of strategic organization, robust management of distributed data and infrastructure in real-world settings. With the proposed guide, we are aiming to aid future FL researchers in circumventing pitfalls and accelerating translation of FL into radiological applications.

目标:联合学习(FL)可以在本地保存数据的同时进行协作模型训练。目前,放射学领域的大多数联机学习研究都是在模拟环境中进行的,这是因为有许多障碍阻碍了联机学习在实践中的应用。现有的少数几个真实世界 FL 计划很少介绍为克服这些障碍而采取的具体措施。为了弥补这一巨大的知识差距,我们提出了放射学真实世界 FL 综合指南。在我们努力实施真实世界 FL 的过程中,缺乏将 FL 与具有挑战性的真实世界环境中复杂性较低的替代方案进行比较的全面评估,而我们通过广泛的基准测试解决了这一问题:我们在德国放射学合作网络(RACOON)内开发了自己的 FL 基础设施,并在六所大学医院的肺部病理分割任务中对 FL 模型进行了训练,从而展示了其功能。在建立 FL 计划和进行大量基准实验的过程中,我们获得了一些启发,并将其汇编和归类到指南中:结果:所提出的指南概述了在真实世界实验中建立成功的 FL 计划的基本步骤、确定的障碍和实施的解决方案。我们的实验结果证明了指南的实用性,并表明在所有评估场景中,FL 都优于复杂度较低的替代方案:我们的研究结果证明,将 FL 转化为实际应用所需的努力是正确的,因为它比其他方法更具优势。此外,这些研究结果还强调了在现实世界中对分布式数据和基础设施进行战略性组织和稳健管理的重要性。我们提出的指南旨在帮助未来的 FL 研究人员规避陷阱,加快 FL 在放射学应用中的转化。
{"title":"Real-world federated learning in radiology: hurdles to overcome and benefits to gain.","authors":"Markus Ralf Bujotzek, Ünal Akünal, Stefan Denner, Peter Neher, Maximilian Zenk, Eric Frodl, Astha Jaiswal, Moon Kim, Nicolai R Krekiehn, Manuel Nickel, Richard Ruppel, Marcus Both, Felix Döllinger, Marcel Opitz, Thorsten Persigehl, Jens Kleesiek, Tobias Penzkofer, Klaus Maier-Hein, Andreas Bucher, Rickmer Braren","doi":"10.1093/jamia/ocae259","DOIUrl":"https://doi.org/10.1093/jamia/ocae259","url":null,"abstract":"<p><strong>Objective: </strong>Federated Learning (FL) enables collaborative model training while keeping data locally. Currently, most FL studies in radiology are conducted in simulated environments due to numerous hurdles impeding its translation into practice. The few existing real-world FL initiatives rarely communicate specific measures taken to overcome these hurdles. To bridge this significant knowledge gap, we propose a comprehensive guide for real-world FL in radiology. Minding efforts to implement real-world FL, there is a lack of comprehensive assessments comparing FL to less complex alternatives in challenging real-world settings, which we address through extensive benchmarking.</p><p><strong>Materials and methods: </strong>We developed our own FL infrastructure within the German Radiological Cooperative Network (RACOON) and demonstrated its functionality by training FL models on lung pathology segmentation tasks across six university hospitals. Insights gained while establishing our FL initiative and running the extensive benchmark experiments were compiled and categorized into the guide.</p><p><strong>Results: </strong>The proposed guide outlines essential steps, identified hurdles, and implemented solutions for establishing successful FL initiatives conducting real-world experiments. Our experimental results prove the practical relevance of our guide and show that FL outperforms less complex alternatives in all evaluation scenarios.</p><p><strong>Discussion and conclusion: </strong>Our findings justify the efforts required to translate FL into real-world applications by demonstrating advantageous performance over alternative approaches. Additionally, they emphasize the importance of strategic organization, robust management of distributed data and infrastructure in real-world settings. With the proposed guide, we are aiming to aid future FL researchers in circumventing pitfalls and accelerating translation of FL into radiological applications.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.7,"publicationDate":"2024-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142512054","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Equitable community-based participatory research engagement with communities of color drives All of Us Wisconsin genomic research priorities. 与有色人种社区开展以社区为基础的公平参与式研究,推动了 "我们威斯康星人 "基因组研究的优先事项。
IF 4.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-10-23 DOI: 10.1093/jamia/ocae265
Suma K Thareja, Xin Yang, Paramita Basak Upama, Aziz Abdullah, Shary Pérez Torres, Linda Jackson Cocroft, Michael Bubolz, Kari McGaughey, Xuelin Lou, Sailaja Kamaraju, Sheikh Iqbal Ahamed, Praveen Madiraju, Anne E Kwitek, Jeffrey Whittle, Zeno Franco

Objective: The NIH All of Us Research Program aims to advance personalized medicine by not only linking patient records, surveys, and genomic data but also engaging with participants, particularly from groups traditionally underrepresented in biomedical research (UBR). This study details how the dialogue between scientists and community members, including many from communities of color, shaped local research priorities.

Materials and methods: We recruited area quantitative, basic, and clinical scientists as well as community members from our Community and Participant Advisory Boards with a predetermined interest in All of Us research as members of a Special Interest Group (SIG). An expert community engagement scientist facilitated 6 SIG meetings over the year, explicitly fostering openness and flexibility during conversations. We qualitatively analyzed discussions using a social movement framework tailored for community-based participatory research (CBPR) mobilization.

Results: The SIG evolved through CBPR stages of emergence, coalescence, momentum, and maintenance/integration. Researchers prioritized community needs above personal academic interests while community members kept discussions focused on tangible return of value to communities. One key outcome includes SIG-driven shifts in programmatic and research priorities of the All of Us Research Program in Southeastern Wisconsin. One major challenge was building equitable conversations that balanced scientific rigor and community understanding.

Discussion: Our approach allowed for a rich dialogue to emerge. Points of connection and disconnection between community members and scientists offered important guidance for emerging areas of genomic inquiry.

Conclusion: Our study presents a robust foundation for future efforts to engage diverse communities in CBPR, particularly on healthcare concerns affecting UBR communities.

目标:美国国立卫生研究院(NIH)的 "我们所有人研究计划"(All of Us Research Program)旨在推动个性化医疗的发展,该计划不仅要将患者记录、调查和基因组数据联系起来,还要让参与者参与进来,尤其是那些传统上在生物医学研究领域代表性不足的群体(UBR)。本研究详细介绍了科学家与社区成员(包括许多来自有色人种社区的成员)之间的对话是如何影响当地研究重点的:我们从社区和参与者咨询委员会中招募了地区定量、基础和临床科学家以及对 "我们所有人 "研究有兴趣的社区成员,作为特别兴趣小组(SIG)的成员。在这一年中,一位社区参与科学家专家主持了 6 次 SIG 会议,明确提出要在对话中培养开放性和灵活性。我们使用为社区参与式研究(CBPR)动员量身定制的社会运动框架对讨论进行了定性分析:结果:SIG 经历了 CBPR 的兴起、凝聚、动力和维持/整合阶段。研究人员将社区需求置于个人学术利益之上,而社区成员则将讨论重点放在对社区的实际价值回报上。其中一项重要成果包括,在 SIG 的推动下,威斯康星州东南部的 "我们大家 "研究计划的计划和研究重点发生了变化。一个主要挑战是建立公平的对话,平衡科学的严谨性和社区的理解:我们的方法使丰富的对话得以出现。社区成员与科学家之间的联系点和脱节点为基因组研究的新兴领域提供了重要指导:我们的研究为今后让不同社区参与 CBPR,特别是影响 UBR 社区的医疗保健问题奠定了坚实的基础。
{"title":"Equitable community-based participatory research engagement with communities of color drives All of Us Wisconsin genomic research priorities.","authors":"Suma K Thareja, Xin Yang, Paramita Basak Upama, Aziz Abdullah, Shary Pérez Torres, Linda Jackson Cocroft, Michael Bubolz, Kari McGaughey, Xuelin Lou, Sailaja Kamaraju, Sheikh Iqbal Ahamed, Praveen Madiraju, Anne E Kwitek, Jeffrey Whittle, Zeno Franco","doi":"10.1093/jamia/ocae265","DOIUrl":"https://doi.org/10.1093/jamia/ocae265","url":null,"abstract":"<p><strong>Objective: </strong>The NIH All of Us Research Program aims to advance personalized medicine by not only linking patient records, surveys, and genomic data but also engaging with participants, particularly from groups traditionally underrepresented in biomedical research (UBR). This study details how the dialogue between scientists and community members, including many from communities of color, shaped local research priorities.</p><p><strong>Materials and methods: </strong>We recruited area quantitative, basic, and clinical scientists as well as community members from our Community and Participant Advisory Boards with a predetermined interest in All of Us research as members of a Special Interest Group (SIG). An expert community engagement scientist facilitated 6 SIG meetings over the year, explicitly fostering openness and flexibility during conversations. We qualitatively analyzed discussions using a social movement framework tailored for community-based participatory research (CBPR) mobilization.</p><p><strong>Results: </strong>The SIG evolved through CBPR stages of emergence, coalescence, momentum, and maintenance/integration. Researchers prioritized community needs above personal academic interests while community members kept discussions focused on tangible return of value to communities. One key outcome includes SIG-driven shifts in programmatic and research priorities of the All of Us Research Program in Southeastern Wisconsin. One major challenge was building equitable conversations that balanced scientific rigor and community understanding.</p><p><strong>Discussion: </strong>Our approach allowed for a rich dialogue to emerge. Points of connection and disconnection between community members and scientists offered important guidance for emerging areas of genomic inquiry.</p><p><strong>Conclusion: </strong>Our study presents a robust foundation for future efforts to engage diverse communities in CBPR, particularly on healthcare concerns affecting UBR communities.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.7,"publicationDate":"2024-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142512053","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Extracting social support and social isolation information from clinical psychiatry notes: comparing a rule-based natural language processing system and a large language model. 从临床精神病学笔记中提取社会支持和社会隔离信息:比较基于规则的自然语言处理系统和大型语言模型。
IF 4.7 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-10-18 DOI: 10.1093/jamia/ocae260
Braja Gopal Patra, Lauren A Lepow, Praneet Kasi Reddy Jagadeesh Kumar, Veer Vekaria, Mohit Manoj Sharma, Prakash Adekkanattu, Brian Fennessy, Gavin Hynes, Isotta Landi, Jorge A Sanchez-Ruiz, Euijung Ryu, Joanna M Biernacka, Girish N Nadkarni, Ardesheer Talati, Myrna Weissman, Mark Olfson, J John Mann, Yiye Zhang, Alexander W Charney, Jyotishman Pathak

Objectives: Social support (SS) and social isolation (SI) are social determinants of health (SDOH) associated with psychiatric outcomes. In electronic health records (EHRs), individual-level SS/SI is typically documented in narrative clinical notes rather than as structured coded data. Natural language processing (NLP) algorithms can automate the otherwise labor-intensive process of extraction of such information.

Materials and methods: Psychiatric encounter notes from Mount Sinai Health System (MSHS, n = 300) and Weill Cornell Medicine (WCM, n = 225) were annotated to create a gold-standard corpus. A rule-based system (RBS) involving lexicons and a large language model (LLM) using FLAN-T5-XL were developed to identify mentions of SS and SI and their subcategories (eg, social network, instrumental support, and loneliness).

Results: For extracting SS/SI, the RBS obtained higher macroaveraged F1-scores than the LLM at both MSHS (0.89 versus 0.65) and WCM (0.85 versus 0.82). For extracting the subcategories, the RBS also outperformed the LLM at both MSHS (0.90 versus 0.62) and WCM (0.82 versus 0.81).

Discussion and conclusion: Unexpectedly, the RBS outperformed the LLMs across all metrics. An intensive review demonstrates that this finding is due to the divergent approach taken by the RBS and LLM. The RBS was designed and refined to follow the same specific rules as the gold-standard annotations. Conversely, the LLM was more inclusive with categorization and conformed to common English-language understanding. Both approaches offer advantages, although additional replication studies are warranted.

目的:社会支持(SS)和社会隔离(SI社会支持(SS)和社会隔离(SI)是与精神疾病结果相关的健康社会决定因素(SDOH)。在电子健康记录(EHR)中,个人层面的社会支持/社会隔离通常记录在叙述性临床笔记中,而非结构化编码数据。自然语言处理(NLP)算法可以自动完成提取此类信息的劳动密集型过程:对西奈山医疗系统(MSHS,n = 300)和威尔康奈尔医学中心(WCM,n = 225)的精神病就诊记录进行注释,以创建黄金标准语料库。使用 FLAN-T5-XL 开发了一个基于规则的系统 (RBS),其中包括词典和大语言模型 (LLM),用于识别 SS 和 SI 及其子类别(如社交网络、工具支持和孤独感):在提取 SS/SI 时,RBS 在 MSHS(0.89 对 0.65)和 WCM(0.85 对 0.82)的宏观平均 F1 分数均高于 LLM。在提取子类别方面,RBS 在 MSHS(0.90 对 0.62)和 WCM(0.82 对 0.81)上的表现也优于 LLM:出乎意料的是,RBS 在所有指标上都优于 LLM。深入研究表明,这一发现是由于 RBS 和 LLM 采用了不同的方法。RBS 的设计和改进遵循了与黄金标准注释相同的特定规则。相反,LLM 在分类方面更具包容性,符合英语的一般理解。这两种方法都具有优势,不过还需要进行更多的重复研究。
{"title":"Extracting social support and social isolation information from clinical psychiatry notes: comparing a rule-based natural language processing system and a large language model.","authors":"Braja Gopal Patra, Lauren A Lepow, Praneet Kasi Reddy Jagadeesh Kumar, Veer Vekaria, Mohit Manoj Sharma, Prakash Adekkanattu, Brian Fennessy, Gavin Hynes, Isotta Landi, Jorge A Sanchez-Ruiz, Euijung Ryu, Joanna M Biernacka, Girish N Nadkarni, Ardesheer Talati, Myrna Weissman, Mark Olfson, J John Mann, Yiye Zhang, Alexander W Charney, Jyotishman Pathak","doi":"10.1093/jamia/ocae260","DOIUrl":"https://doi.org/10.1093/jamia/ocae260","url":null,"abstract":"<p><strong>Objectives: </strong>Social support (SS) and social isolation (SI) are social determinants of health (SDOH) associated with psychiatric outcomes. In electronic health records (EHRs), individual-level SS/SI is typically documented in narrative clinical notes rather than as structured coded data. Natural language processing (NLP) algorithms can automate the otherwise labor-intensive process of extraction of such information.</p><p><strong>Materials and methods: </strong>Psychiatric encounter notes from Mount Sinai Health System (MSHS, n = 300) and Weill Cornell Medicine (WCM, n = 225) were annotated to create a gold-standard corpus. A rule-based system (RBS) involving lexicons and a large language model (LLM) using FLAN-T5-XL were developed to identify mentions of SS and SI and their subcategories (eg, social network, instrumental support, and loneliness).</p><p><strong>Results: </strong>For extracting SS/SI, the RBS obtained higher macroaveraged F1-scores than the LLM at both MSHS (0.89 versus 0.65) and WCM (0.85 versus 0.82). For extracting the subcategories, the RBS also outperformed the LLM at both MSHS (0.90 versus 0.62) and WCM (0.82 versus 0.81).</p><p><strong>Discussion and conclusion: </strong>Unexpectedly, the RBS outperformed the LLMs across all metrics. An intensive review demonstrates that this finding is due to the divergent approach taken by the RBS and LLM. The RBS was designed and refined to follow the same specific rules as the gold-standard annotations. Conversely, the LLM was more inclusive with categorization and conformed to common English-language understanding. Both approaches offer advantages, although additional replication studies are warranted.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.7,"publicationDate":"2024-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142479224","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of the American Medical Informatics Association
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1