首页 > 最新文献

Journal of the American Medical Informatics Association最新文献

英文 中文
Does it save me money? The economic impact of mobile health interventions on medical expenditure of diabetic patients. 它能帮我省钱吗?流动医疗干预对糖尿病患者医疗支出的经济影响。
IF 4.6 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-11-18 DOI: 10.1093/jamia/ocaf199
Xinying Liu, Upkar Varshney, Peiwei Li

Objective: This study evaluates the impact of self-management and support of m-health applications on medication adherence (MA) and the corresponding long-term medical expenditures among patients with Type 2 Diabetes (T2D), using an analytic framework generalizable to other chronic conditions.

Materials and methods: A systematic review and meta-analysis of randomized controlled trials were conducted to estimate the synthesized effect of m-health interventions on MA. These results were integrated into a Markov state-transition model to simulate patient transitions among three adherence levels over a 10-year horizon. Medical expenditure data by adherence level were derived from the Medical Expenditure Panel Survey (MEPS). Monte Carlo simulation was applied to assess uncertainty and estimate individual- and population-level cost outcomes under baseline and intervention scenarios.

Results: The meta-analysis showed a significant positive effect of m-health on MA (standardized mean difference = 0.21, 95% CI: 0.14-0.28). Patients in the intervention scenario experienced an average cost reduction of $4400 over 10 years. At the population level, a cohort of 10 000 patients using m-health tools would yield projected direct medical cost savings of $44 million.

Discussion: This study demonstrates the potential of m-health interventions to improve patient behavior and generate substantial long-term cost savings. By linking behavioral health data to downstream cost outcomes, the study adds to the growing evidence base for informatics-driven population health strategies.

Conclusion: Our study underscores the importance of integrating digital support tools into chronic disease care and informs policy decisions aimed at integrating health informatics innovations.

目的:本研究使用可推广到其他慢性疾病的分析框架,评估移动健康应用程序的自我管理和支持对2型糖尿病(T2D)患者服药依从性(MA)和相应的长期医疗支出的影响。材料和方法:对随机对照试验进行了系统回顾和荟萃分析,以估计移动健康干预措施对MA的综合影响。这些结果被整合到一个马尔可夫状态转换模型中,以模拟患者在10年内三个依从性水平之间的转换。按依从性水平划分的医疗支出数据来自医疗支出小组调查(MEPS)。应用蒙特卡罗模拟来评估不确定性,并估计基线和干预情景下个人和人群水平的成本结果。结果:meta分析显示移动健康对MA有显著的积极影响(标准化平均差异= 0.21,95% CI: 0.14-0.28)。在干预方案中,患者在10年内平均减少了4400美元的费用。在人口层面,使用移动医疗工具的1万名患者预计将节省4 400万美元的直接医疗费用。讨论:本研究证明了移动医疗干预在改善患者行为和产生大量长期成本节约方面的潜力。通过将行为健康数据与下游成本结果联系起来,该研究为信息学驱动的人口健康战略增加了越来越多的证据基础。结论:我们的研究强调了将数字支持工具整合到慢性病护理中的重要性,并为旨在整合健康信息学创新的政策决策提供信息。
{"title":"Does it save me money? The economic impact of mobile health interventions on medical expenditure of diabetic patients.","authors":"Xinying Liu, Upkar Varshney, Peiwei Li","doi":"10.1093/jamia/ocaf199","DOIUrl":"https://doi.org/10.1093/jamia/ocaf199","url":null,"abstract":"<p><strong>Objective: </strong>This study evaluates the impact of self-management and support of m-health applications on medication adherence (MA) and the corresponding long-term medical expenditures among patients with Type 2 Diabetes (T2D), using an analytic framework generalizable to other chronic conditions.</p><p><strong>Materials and methods: </strong>A systematic review and meta-analysis of randomized controlled trials were conducted to estimate the synthesized effect of m-health interventions on MA. These results were integrated into a Markov state-transition model to simulate patient transitions among three adherence levels over a 10-year horizon. Medical expenditure data by adherence level were derived from the Medical Expenditure Panel Survey (MEPS). Monte Carlo simulation was applied to assess uncertainty and estimate individual- and population-level cost outcomes under baseline and intervention scenarios.</p><p><strong>Results: </strong>The meta-analysis showed a significant positive effect of m-health on MA (standardized mean difference = 0.21, 95% CI: 0.14-0.28). Patients in the intervention scenario experienced an average cost reduction of $4400 over 10 years. At the population level, a cohort of 10 000 patients using m-health tools would yield projected direct medical cost savings of $44 million.</p><p><strong>Discussion: </strong>This study demonstrates the potential of m-health interventions to improve patient behavior and generate substantial long-term cost savings. By linking behavioral health data to downstream cost outcomes, the study adds to the growing evidence base for informatics-driven population health strategies.</p><p><strong>Conclusion: </strong>Our study underscores the importance of integrating digital support tools into chronic disease care and informs policy decisions aimed at integrating health informatics innovations.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.6,"publicationDate":"2025-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145543509","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Development of a robust corpus for automated evaluation of online health information in Chinese using the DISCERN scale. 基于DISCERN量表的中文在线健康信息自动评估健壮语料库的开发。
IF 4.6 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-11-12 DOI: 10.1093/jamia/ocaf175
Ting E, Xingxi Li, Jun Liang, Junhao Ma, Qichuan Fang, Shanli Chen, Jianbo Lei, Christopher G Chute

Objective: To develop the first comprehensive, standardized annotated corpus of Chinese online health information (OHI) using the full 16-item DISCERN instrument and to establish a reliable annotation process that supports automated quality assessment.

Materials and methods: We assembled 510 web-sourced articles on breast cancer, arthritis, and depression. All the articles were independently annotated by three trained raters using the DISCERN scale. Annotation followed a four-step workflow: data collection and preprocessing, rater training, iterative annotation, and quality control. Raters calibrated through consensus sessions and calibration articles. The Dawid-Skene model aggregated individual annotations into final consensus scores. Original five-point ratings were retained and binarized (scores 1-3 as low quality, 4-5 as high quality) to enable both fine-grained and coarse evaluation for machine learning.

Results: Initial annotation of a 60-article pilot produced low agreement (mean Krippendorff's α ≈ 0.022) due to subjective variability. Successive calibration exercises improved agreement markedly, culminating in a corpus-wide Krippendorff's α of 0.834. Consensus scores correlated strongly with individual rater scores, confirming annotation robustness. The dual-scale design yielded a relatively balanced distribution of labels across topics, with roughly equal representation of low- and high-quality articles, and preserved granularity for detailed DISCERN analysis.

Discussion: Our iterative calibration approach and consensus modeling effectively addressed the subjective ambiguity inherent in quality assessment. The binary and five-class labeling strategies facilitate flexible downstream applications, allowing automated systems to perform both broad filtering and nuanced quality differentiation. The high inter-rater reliability demonstrates that rigorous training and consensus methods can overcome domain-specific annotation challenges.

Conclusion: The resulting Chinese OHI corpus, annotated via a standardized DISCERN framework and refined through iterative calibration, provides a robust benchmark for training and evaluating machine learning models. This resource lays the foundation for scalable, reliable automated quality assessment of OHI in Chinese public health settings.

目的:利用完整的16项DISCERN仪器开发首个全面、标准化的中国在线健康信息(OHI)注释语料库,并建立一个可靠的注释过程,支持自动质量评估。材料和方法:我们收集了510篇来自网络的关于乳腺癌、关节炎和抑郁症的文章。所有文章均由三位训练有素的评分员使用DISCERN量表独立注释。注释遵循四步工作流程:数据收集和预处理、分级训练、迭代注释和质量控制。评分员通过共识会议和校准文章进行校准。david - skene模型将单个注释汇总为最终的共识分数。保留原始的五点评分并进行二值化(得分1-3为低质量,得分4-5为高质量),以实现对机器学习的细粒度和粗粒度评估。结果:由于主观可变性,60篇试点文章的初始注释产生了低一致性(平均Krippendorff α≈0.022)。连续的校准练习显著提高了一致性,最终在整个语料库的Krippendorff α为0.834。共识分数与个体评分者得分密切相关,证实了注释的稳健性。双尺度设计产生了跨主题的相对平衡的标签分布,低质量和高质量文章的表示大致相等,并保留了详细的DISCERN分析的粒度。讨论:我们的迭代校准方法和共识建模有效地解决了质量评估中固有的主观模糊性。二进制和五类标签策略促进灵活的下游应用,允许自动化系统执行广泛的过滤和细致入微的质量差异化。高可靠性表明严格的训练和共识方法可以克服特定领域注释的挑战。结论:由此产生的中文OHI语料库,通过标准化的DISCERN框架进行标注,并通过迭代校准进行改进,为训练和评估机器学习模型提供了一个强大的基准。该资源为中国公共卫生机构中可扩展的、可靠的OHI自动化质量评估奠定了基础。
{"title":"Development of a robust corpus for automated evaluation of online health information in Chinese using the DISCERN scale.","authors":"Ting E, Xingxi Li, Jun Liang, Junhao Ma, Qichuan Fang, Shanli Chen, Jianbo Lei, Christopher G Chute","doi":"10.1093/jamia/ocaf175","DOIUrl":"https://doi.org/10.1093/jamia/ocaf175","url":null,"abstract":"<p><strong>Objective: </strong>To develop the first comprehensive, standardized annotated corpus of Chinese online health information (OHI) using the full 16-item DISCERN instrument and to establish a reliable annotation process that supports automated quality assessment.</p><p><strong>Materials and methods: </strong>We assembled 510 web-sourced articles on breast cancer, arthritis, and depression. All the articles were independently annotated by three trained raters using the DISCERN scale. Annotation followed a four-step workflow: data collection and preprocessing, rater training, iterative annotation, and quality control. Raters calibrated through consensus sessions and calibration articles. The Dawid-Skene model aggregated individual annotations into final consensus scores. Original five-point ratings were retained and binarized (scores 1-3 as low quality, 4-5 as high quality) to enable both fine-grained and coarse evaluation for machine learning.</p><p><strong>Results: </strong>Initial annotation of a 60-article pilot produced low agreement (mean Krippendorff's α ≈ 0.022) due to subjective variability. Successive calibration exercises improved agreement markedly, culminating in a corpus-wide Krippendorff's α of 0.834. Consensus scores correlated strongly with individual rater scores, confirming annotation robustness. The dual-scale design yielded a relatively balanced distribution of labels across topics, with roughly equal representation of low- and high-quality articles, and preserved granularity for detailed DISCERN analysis.</p><p><strong>Discussion: </strong>Our iterative calibration approach and consensus modeling effectively addressed the subjective ambiguity inherent in quality assessment. The binary and five-class labeling strategies facilitate flexible downstream applications, allowing automated systems to perform both broad filtering and nuanced quality differentiation. The high inter-rater reliability demonstrates that rigorous training and consensus methods can overcome domain-specific annotation challenges.</p><p><strong>Conclusion: </strong>The resulting Chinese OHI corpus, annotated via a standardized DISCERN framework and refined through iterative calibration, provides a robust benchmark for training and evaluating machine learning models. This resource lays the foundation for scalable, reliable automated quality assessment of OHI in Chinese public health settings.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.6,"publicationDate":"2025-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145507842","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improving patient understanding of radiology reports using generative artificial intelligence: a vignette study of 2000 US adults. 使用生成人工智能提高患者对放射学报告的理解:一项对2000名美国成年人的小研究。
IF 4.6 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-11-12 DOI: 10.1093/jamia/ocaf187
Aurelia Huiwen Chen, Robert S Rudin, David Michael Levine, Ateev Mehrotra

Objectives: Patients value access to their medical reports on patient portals, but the terminology in those reports can cause confusion and anxiety. Can the artificial intelligence (AI) simplification of radiology reports into plain language improve patient comprehension?

Materials and methods: Twenty original radiology reports (breast imaging, chest X-ray) were simplified into plain language using ChatGPT-4 using a customized prompt for each report type. For each report, clinicians created a gold standard of key findings and appropriate follow-up. In August 2024, a national sample of 2000 US adults reviewed 2 randomly assigned reports, 1 original and 1 AI-generated plain language. Participants answered questions focused on comprehension of key findings, follow-up, confidence, anxiety, and preferences. Comprehension and follow-up were compared to the gold standard. We compared patient accuracy for original vs AI-generated plain language reports.

Results: Participants (mean age 48 years) were 62.3% female. Compared to original reports, participants shown AI-generated plain language reports had higher accuracy in comprehension (68.0% vs 58.0%; marginal difference 10.8% [95% CI, 7.8%-13.8%]) and follow-up (64.5% vs 58.4%; marginal difference 6.8% [95% CI, 4.1%-9.4%]). Improvements were larger among participants aged >44 years and with less than college education. With plain language reports, participants reported higher confidence in their answers and lower anxiety. Despite these improvements, 60.0% of participants preferred the original report over the plain language version.

Discussion: Integrating AI simplification into patient portals may be helpful, but trust concerns remain.

Conclusion: AI simplification improved patient comprehension and confidence. Further research is needed to address patient resistance to AI simplification.

目的:患者重视在患者门户网站上访问他们的医疗报告,但这些报告中的术语可能会引起混淆和焦虑。人工智能(AI)将放射学报告简化为简单的语言,能否提高患者的理解力?材料和方法:使用ChatGPT-4将20份原始放射学报告(乳房成像、胸部x线片)简化为简单的语言,并为每种报告类型定制提示。对于每一份报告,临床医生都制定了关键发现和适当随访的黄金标准。2024年8月,一个由2000名美国成年人组成的全国样本审查了两份随机分配的报告,一份是原创的,一份是人工智能生成的通俗语言。参与者回答的问题集中在对关键发现的理解、后续行动、信心、焦虑和偏好。理解和随访与金标准进行比较。我们比较了原始报告和人工智能生成的简单语言报告的患者准确性。结果:参与者(平均年龄48岁)中女性占62.3%。与原始报告相比,参与者显示人工智能生成的简单语言报告在理解和随访方面具有更高的准确性(68.0%对58.0%;边际差10.8% [95% CI, 7.8%-13.8%])。年龄在40到44岁之间、大学学历以下的参与者的改善幅度更大。通过简单的语言报告,参与者对自己的答案更有信心,焦虑感更低。尽管有这些改进,60.0%的参与者更喜欢原始报告而不是简单的语言版本。讨论:将人工智能简化到患者门户中可能会有所帮助,但信任问题仍然存在。结论:人工智能简化提高了患者的理解力和信心。需要进一步的研究来解决患者对人工智能简化的抵制。
{"title":"Improving patient understanding of radiology reports using generative artificial intelligence: a vignette study of 2000 US adults.","authors":"Aurelia Huiwen Chen, Robert S Rudin, David Michael Levine, Ateev Mehrotra","doi":"10.1093/jamia/ocaf187","DOIUrl":"https://doi.org/10.1093/jamia/ocaf187","url":null,"abstract":"<p><strong>Objectives: </strong>Patients value access to their medical reports on patient portals, but the terminology in those reports can cause confusion and anxiety. Can the artificial intelligence (AI) simplification of radiology reports into plain language improve patient comprehension?</p><p><strong>Materials and methods: </strong>Twenty original radiology reports (breast imaging, chest X-ray) were simplified into plain language using ChatGPT-4 using a customized prompt for each report type. For each report, clinicians created a gold standard of key findings and appropriate follow-up. In August 2024, a national sample of 2000 US adults reviewed 2 randomly assigned reports, 1 original and 1 AI-generated plain language. Participants answered questions focused on comprehension of key findings, follow-up, confidence, anxiety, and preferences. Comprehension and follow-up were compared to the gold standard. We compared patient accuracy for original vs AI-generated plain language reports.</p><p><strong>Results: </strong>Participants (mean age 48 years) were 62.3% female. Compared to original reports, participants shown AI-generated plain language reports had higher accuracy in comprehension (68.0% vs 58.0%; marginal difference 10.8% [95% CI, 7.8%-13.8%]) and follow-up (64.5% vs 58.4%; marginal difference 6.8% [95% CI, 4.1%-9.4%]). Improvements were larger among participants aged >44 years and with less than college education. With plain language reports, participants reported higher confidence in their answers and lower anxiety. Despite these improvements, 60.0% of participants preferred the original report over the plain language version.</p><p><strong>Discussion: </strong>Integrating AI simplification into patient portals may be helpful, but trust concerns remain.</p><p><strong>Conclusion: </strong>AI simplification improved patient comprehension and confidence. Further research is needed to address patient resistance to AI simplification.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.6,"publicationDate":"2025-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145534861","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PhenoFit: a framework for determining computable phenotyping algorithm fitness for purpose and reuse. PhenoFit:一个框架,用于确定可计算的表型算法适合的目的和重用。
IF 4.6 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-11-12 DOI: 10.1093/jamia/ocaf195
Laura K Wiley, Luke V Rasmussen, Rebecca T Levinson, Jennnifer Malinowski, Sheila M Manemann, Melissa P Wilson, Martin Chapman, Jennifer A Pacheco, Theresa L Walunas, Justin B Starren, Suzette J Bielinski, Rachel L Richesson

Background: Computational phenotyping from electronic health records (EHRs) is essential for clinical research, decision support, and quality/population health assessment, but the proliferation of algorithms for the same conditions makes it difficult to identify which algorithm is most appropriate for reuse.

Objective: To develop a framework for assessing phenotyping algorithm fitness for purpose and reuse.

Fitness for purpose: Phenotyping algorithms are fit for purpose when they identify the intended population with performance characteristics appropriate for the intended application.

Fitness for reuse: Phenotyping algorithms are fit for reuse when the algorithm is implementable and generalizable-that is, it identifies the same intended population with similar performance characteristics when applied to a new setting.

Conclusions: The PhenoFit framework provides a structured approach to evaluate and adapt phenotyping algorithms for new contexts increasing efficiency and consistency of identifying patient populations from EHRs.

背景:来自电子健康记录(EHRs)的计算表型对于临床研究、决策支持和质量/人群健康评估至关重要,但针对相同条件的算法的激增使得难以确定哪种算法最适合重用。目的:开发一个框架,评估表型算法适合的目的和重用。适合目的:当表型算法确定具有适合预期应用程序的性能特征的预期种群时,它们是适合目的的。适合重用:当表现型算法具有可实现性和通用性时,表现型算法就适合重用——也就是说,当应用于新设置时,它可以识别具有相似性能特征的相同预期种群。结论:PhenoFit框架提供了一种结构化的方法来评估和调整表型算法,以适应新的环境,从而提高了从电子病历中识别患者群体的效率和一致性。
{"title":"PhenoFit: a framework for determining computable phenotyping algorithm fitness for purpose and reuse.","authors":"Laura K Wiley, Luke V Rasmussen, Rebecca T Levinson, Jennnifer Malinowski, Sheila M Manemann, Melissa P Wilson, Martin Chapman, Jennifer A Pacheco, Theresa L Walunas, Justin B Starren, Suzette J Bielinski, Rachel L Richesson","doi":"10.1093/jamia/ocaf195","DOIUrl":"https://doi.org/10.1093/jamia/ocaf195","url":null,"abstract":"<p><strong>Background: </strong>Computational phenotyping from electronic health records (EHRs) is essential for clinical research, decision support, and quality/population health assessment, but the proliferation of algorithms for the same conditions makes it difficult to identify which algorithm is most appropriate for reuse.</p><p><strong>Objective: </strong>To develop a framework for assessing phenotyping algorithm fitness for purpose and reuse.</p><p><strong>Fitness for purpose: </strong>Phenotyping algorithms are fit for purpose when they identify the intended population with performance characteristics appropriate for the intended application.</p><p><strong>Fitness for reuse: </strong>Phenotyping algorithms are fit for reuse when the algorithm is implementable and generalizable-that is, it identifies the same intended population with similar performance characteristics when applied to a new setting.</p><p><strong>Conclusions: </strong>The PhenoFit framework provides a structured approach to evaluate and adapt phenotyping algorithms for new contexts increasing efficiency and consistency of identifying patient populations from EHRs.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.6,"publicationDate":"2025-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145507875","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A novel, standardised approach to balancing effectiveness, efficiency and utility of surveillance AI prediction models for hospitalised patients using sepsis prediction as an exemplar. 以败血症预测为例,平衡住院患者监测人工智能预测模型的有效性、效率和实用性的一种新颖的标准化方法。
IF 4.6 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-11-11 DOI: 10.1093/jamia/ocaf192
Anton H van der Vegt, Victoria K Campbell, Robert Webb, Balasubramanian Venkatesh, Paul J Lane, Kathryn Wilks, Steven McPhail, Michael Rice, Tara Isaacs, Ahmad Abdel-Hafez, Stephen Whebell, Adam Irwin, Rudolf J Schnetler, Amith Shetty, Ian A Scott

Objective: To introduce a novel, standardised approach to evaluating AI prediction models in balancing effectiveness, efficiency and utility, using a sepsis prediction model case study.

Materials and methods: Retrospective patient data from electronic medical records of 7 public hospitals was used to retrain and evaluate a machine learning sepsis prediction model. Four conventional metrics-area under the receiver operating curve (AUROC), sensitivity, positive predictive value, and specificity-were compared with a novel graphical display integrating metrics of predictive accuracy (effectiveness), alert burden (efficiency) and lead time of alerts relative to clinical events (utility) for different alert thresholds.

Results: The dataset comprised 977,506 inpatient admissions. The novel methodology produced a plot of four vertically aligned graphs that enables decision-makers to identify an alert threshold that optimally balances effectiveness, efficiency and utility (EEU) at the level of an entire admission, and which differs from that derived using conventional metrics.

Discussion: Conventional evaluation metrics do not consider alert timing relative to clinical events and are often applied to different evaluation datasets (sample and admission level), introducing bias and confusion. In contrast, the EEU methodology (i) generates admission level evaluations at different alert thresholds; (ii) measures alert timing relative to clinical events; and (iii) provides a visual display that enables identification of the alert threshold that optimally balances EEU factors.

Conclusion: Evaluations of prediction models for adverse events in hospitalised patients should incorporate the EEU approach in assessing model suitability and selecting alert thresholds.

目的:通过脓毒症预测模型的案例研究,介绍一种新的、标准化的方法来评估人工智能预测模型在平衡有效性、效率和实用性方面的作用。材料和方法:利用7家公立医院电子病历中的回顾性患者数据,对机器学习脓毒症预测模型进行再训练和评估。将四个常规指标——受试者工作曲线下面积(AUROC)、敏感性、阳性预测值和特异性——与一种新的图形显示进行比较,该图形显示集成了不同警报阈值的预测准确性(有效性)、警报负担(效率)和相对于临床事件的警报提前时间(效用)指标。结果:数据集包括977,506例住院患者。该新方法生成了一个由四个垂直排列的图形组成的图,使决策者能够确定一个警报阈值,该阈值与使用传统指标得出的阈值不同,可以在整个入井水平上最佳地平衡有效性、效率和效用(EEU)。讨论:传统的评估指标不考虑与临床事件相关的预警时间,并且经常应用于不同的评估数据集(样本和入院水平),导致偏差和混淆。相比之下,EEU方法(i)在不同的警报阈值下生成录取水平评估;(ii)测量与临床事件相关的警报时间;(iii)提供视觉显示,能够识别最佳平衡EEU因素的警报阈值。结论:住院患者不良事件预测模型的评估应结合EEU方法来评估模型的适用性和选择预警阈值。
{"title":"A novel, standardised approach to balancing effectiveness, efficiency and utility of surveillance AI prediction models for hospitalised patients using sepsis prediction as an exemplar.","authors":"Anton H van der Vegt, Victoria K Campbell, Robert Webb, Balasubramanian Venkatesh, Paul J Lane, Kathryn Wilks, Steven McPhail, Michael Rice, Tara Isaacs, Ahmad Abdel-Hafez, Stephen Whebell, Adam Irwin, Rudolf J Schnetler, Amith Shetty, Ian A Scott","doi":"10.1093/jamia/ocaf192","DOIUrl":"https://doi.org/10.1093/jamia/ocaf192","url":null,"abstract":"<p><strong>Objective: </strong>To introduce a novel, standardised approach to evaluating AI prediction models in balancing effectiveness, efficiency and utility, using a sepsis prediction model case study.</p><p><strong>Materials and methods: </strong>Retrospective patient data from electronic medical records of 7 public hospitals was used to retrain and evaluate a machine learning sepsis prediction model. Four conventional metrics-area under the receiver operating curve (AUROC), sensitivity, positive predictive value, and specificity-were compared with a novel graphical display integrating metrics of predictive accuracy (effectiveness), alert burden (efficiency) and lead time of alerts relative to clinical events (utility) for different alert thresholds.</p><p><strong>Results: </strong>The dataset comprised 977,506 inpatient admissions. The novel methodology produced a plot of four vertically aligned graphs that enables decision-makers to identify an alert threshold that optimally balances effectiveness, efficiency and utility (EEU) at the level of an entire admission, and which differs from that derived using conventional metrics.</p><p><strong>Discussion: </strong>Conventional evaluation metrics do not consider alert timing relative to clinical events and are often applied to different evaluation datasets (sample and admission level), introducing bias and confusion. In contrast, the EEU methodology (i) generates admission level evaluations at different alert thresholds; (ii) measures alert timing relative to clinical events; and (iii) provides a visual display that enables identification of the alert threshold that optimally balances EEU factors.</p><p><strong>Conclusion: </strong>Evaluations of prediction models for adverse events in hospitalised patients should incorporate the EEU approach in assessing model suitability and selecting alert thresholds.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.6,"publicationDate":"2025-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145490781","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Supporting electronic health record data usage in research for teams with varying data science and clinical knowledge: a food service analogy approach. 支持具有不同数据科学和临床知识的团队在研究中使用电子健康记录数据:一种食品服务类比方法。
IF 4.6 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-11-10 DOI: 10.1093/jamia/ocaf188
Tanja Magoc, Leigh Anne Tang, Khoa A Nguyen, Christopher A Harle

Objective: To guide research data services (RDS) teams in managing researcher variability (eg, differing deadlines, funding, expertise) when honest-brokering data, we present a framework based on operations management principles and a food service analogy.

Materials and methods: Our framework describes 4 data service offerings with different levels of efficiency and service customization: vending machine, fast food, custom meal, and personal chef. We describe examples from 2 institutions.

Results: Vending machine and fast food are efficient but less customizable, making them better-suited for researchers with limited funding or time. Custom meal and personal chef are less efficient but more customized, making them well suited for better-resourced researchers.

Discussion: Efficiency and service tradeoffs should be balanced to align with demand and institutional goals. RDS teams can overcome such tradeoffs through uncompromised reduction or low-cost accommodation approaches.

Conclusion: Our framework can be applied by RDS teams in their design and implementation of data services.

目的:为了指导研究数据服务(RDS)团队在诚实代理数据时管理研究人员的差异性(例如,不同的截止日期、资金、专业知识),我们提出了一个基于运营管理原则和食品服务类比的框架。材料和方法:我们的框架描述了4种具有不同效率水平和服务定制的数据服务产品:自动售货机、快餐、定制餐和私人厨师。我们描述了两个机构的例子。结果:自动贩卖机和快餐效率很高,但可定制性较差,因此更适合资金或时间有限的研究人员。定制餐和私人厨师效率较低,但更具定制性,使它们非常适合资源更丰富的研究人员。讨论:应平衡效率和服务之间的权衡,使之与需求和机构目标保持一致。RDS团队可以通过不妥协的减少或低成本住宿方法来克服这种权衡。结论:RDS团队可以将我们的框架应用于数据服务的设计和实现。
{"title":"Supporting electronic health record data usage in research for teams with varying data science and clinical knowledge: a food service analogy approach.","authors":"Tanja Magoc, Leigh Anne Tang, Khoa A Nguyen, Christopher A Harle","doi":"10.1093/jamia/ocaf188","DOIUrl":"https://doi.org/10.1093/jamia/ocaf188","url":null,"abstract":"<p><strong>Objective: </strong>To guide research data services (RDS) teams in managing researcher variability (eg, differing deadlines, funding, expertise) when honest-brokering data, we present a framework based on operations management principles and a food service analogy.</p><p><strong>Materials and methods: </strong>Our framework describes 4 data service offerings with different levels of efficiency and service customization: vending machine, fast food, custom meal, and personal chef. We describe examples from 2 institutions.</p><p><strong>Results: </strong>Vending machine and fast food are efficient but less customizable, making them better-suited for researchers with limited funding or time. Custom meal and personal chef are less efficient but more customized, making them well suited for better-resourced researchers.</p><p><strong>Discussion: </strong>Efficiency and service tradeoffs should be balanced to align with demand and institutional goals. RDS teams can overcome such tradeoffs through uncompromised reduction or low-cost accommodation approaches.</p><p><strong>Conclusion: </strong>Our framework can be applied by RDS teams in their design and implementation of data services.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.6,"publicationDate":"2025-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145483639","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Assessing genetic counseling efficiency with natural language processing. 用自然语言处理评估遗传咨询的效率。
IF 4.6 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-11-10 DOI: 10.1093/jamia/ocaf190
Michelle H Nguyen, Carolyn D Applegate, Brittney Murray, Ayah Zirikly, Crystal Tichnell, Catherine Gordon, Lisa R Yanek, Cynthia A James, Casey Overby Taylor

Objective: To build natural language processing (NLP) strategies to characterize measures of genetic counseling (GC) efficiency and classify measures according to phase of GC (pre- or post-genetic testing).

Materials and methods: This study selected and annotated 800 GC notes from 7 clinical specialties in a large academic medical center for NLP model development and validation. The NLP approaches extracted GC efficiency measures, including direct and indirect time and GC phase. The models were then applied to 24 102 GC notes collected from January 2016 through December 2023.

Results: NLP approaches performed well (F1 scores of 0.95 and 0.90 for direct time in GC and GC phase classification, respectively). Our findings showed median direct time in GC of 50 minutes, with significant differences in direct time distributions observed across clinical specialties, time periods (2016-2019 or 2020-2023), delivery modes (in person or telehealth), and GC phase.

Discussion: As referrals to GC increase, there is increasing pressure to improve efficiency. Our NLP strategy was used to generate and summarize real-world evidence of GC time for 7 clinical specialties. These approaches enable future research on the impact of interventions intended to improve GC efficiency.

Conclusion: This work demonstrated the practical value of NLP to provide a useful and scalable strategy to generate real world evidence of GC efficiency. Principles presented in this work may also be valuable for health services research in other practice areas.

目的:建立自然语言处理(NLP)策略来表征遗传咨询(GC)效率的测度,并根据GC的阶段(基因检测前或基因检测后)对测度进行分类。材料和方法:本研究选取某大型学术医学中心7个临床专科的800份GC笔记进行注释,用于NLP模型的开发和验证。NLP方法提取气相色谱效率指标,包括直接和间接时间和气相色谱阶段。然后将这些模型应用于2016年1月至2023年12月收集的24102张纸币。结果:NLP方法在GC和GC相分类中表现良好(直接时间F1得分分别为0.95和0.90)。我们的研究结果显示,GC的中位直接时间为50分钟,在临床专科、时间段(2016-2019年或2020-2023年)、交付模式(亲自或远程医疗)和GC阶段的直接时间分布存在显著差异。讨论:随着对GC的引用增加,提高效率的压力也越来越大。我们的NLP策略用于生成和总结7个临床专科GC时间的真实证据。这些方法使未来研究旨在提高GC效率的干预措施的影响成为可能。结论:这项工作证明了NLP的实用价值,它提供了一种有用的、可扩展的策略来生成GC效率的真实世界证据。这项工作中提出的原则也可能对其他实践领域的卫生服务研究有价值。
{"title":"Assessing genetic counseling efficiency with natural language processing.","authors":"Michelle H Nguyen, Carolyn D Applegate, Brittney Murray, Ayah Zirikly, Crystal Tichnell, Catherine Gordon, Lisa R Yanek, Cynthia A James, Casey Overby Taylor","doi":"10.1093/jamia/ocaf190","DOIUrl":"10.1093/jamia/ocaf190","url":null,"abstract":"<p><strong>Objective: </strong>To build natural language processing (NLP) strategies to characterize measures of genetic counseling (GC) efficiency and classify measures according to phase of GC (pre- or post-genetic testing).</p><p><strong>Materials and methods: </strong>This study selected and annotated 800 GC notes from 7 clinical specialties in a large academic medical center for NLP model development and validation. The NLP approaches extracted GC efficiency measures, including direct and indirect time and GC phase. The models were then applied to 24 102 GC notes collected from January 2016 through December 2023.</p><p><strong>Results: </strong>NLP approaches performed well (F1 scores of 0.95 and 0.90 for direct time in GC and GC phase classification, respectively). Our findings showed median direct time in GC of 50 minutes, with significant differences in direct time distributions observed across clinical specialties, time periods (2016-2019 or 2020-2023), delivery modes (in person or telehealth), and GC phase.</p><p><strong>Discussion: </strong>As referrals to GC increase, there is increasing pressure to improve efficiency. Our NLP strategy was used to generate and summarize real-world evidence of GC time for 7 clinical specialties. These approaches enable future research on the impact of interventions intended to improve GC efficiency.</p><p><strong>Conclusion: </strong>This work demonstrated the practical value of NLP to provide a useful and scalable strategy to generate real world evidence of GC efficiency. Principles presented in this work may also be valuable for health services research in other practice areas.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.6,"publicationDate":"2025-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12743353/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145483649","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automated detection of stigmatizing language in Electronic Health Records (EHRs) using a multi-stage transfer learning approach. 使用多阶段迁移学习方法自动检测电子健康记录(EHRs)中的污名语言。
IF 4.6 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-11-09 DOI: 10.1093/jamia/ocaf193
Liyang Xue, A M Muntasir Rahman, Charles R Senteio, Vivek K Singh

Objective: Stigmatizing language (SL) in Electronic Health Records (EHRs) can perpetuate biases and negatively impact patient care. This study introduces a novel method for automatically detecting such language to improve healthcare documentation practices.

Materials and methods: We developed a multi-stage transfer learning framework integrating semantic, syntactic, and task adaptation using three datasets: hate speech, clinical phenotypes, and stigmatizing language. Experiments were conducted on stigmatizing language dataset which consists of 4,129 de-identified EHR notes (72.7% stigmatizing, 27.3% non-stigmatizing), split 80/20 for training and testing. Longformer, BERT, and ClinicalBERT models were evaluated, and model performance was assessed on 35 randomized subsets of the test set (each comprising 70% of test data). The Wilcoxon-Mann-Whitney test was used to evaluate statistical significance, with Bonferroni correction applied to control for multiple hypothesis testing. Baseline models included zero-shot and few-shot GPT-4o, Support Vector Machine, Random Forest, Logistic Regression, and Multinomial Naive Bayes.

Results: The proposed framework achieved the highest accuracy, with fully adapted Longformer reaching 89.83%. Performance improvements remained statistically significant after Bonferroni correction compared to all baselines (p < .05). The framework demonstrated robust gains across different stigmatizing language types.

Discussion: This study underscores the value of domain-adaptive NLP for detecting stigmatizing language in EHRs. The multi-stage transfer learning framework effectively captures subtle biases often missed by conventional models, enabling more objective and respectful clinical documentation.

Conclusion: This framework offers a statistically validated, high-performing framework for detecting stigmatizing language in EHRs, supporting responsible AI and promoting equity in clinical care.

目的:电子健康记录(EHRs)中的污名化语言(SL)会使偏见持续存在,并对患者护理产生负面影响。本研究介绍了一种自动检测此类语言的新方法,以改善医疗保健文档实践。材料和方法:我们开发了一个整合语义、句法和任务适应的多阶段迁移学习框架,使用三个数据集:仇恨言论、临床表型和污名化语言。在4129个去识别的EHR笔记(72.7%污名化,27.3%非污名化)组成的污名化语言数据集上进行实验,分成80/20进行训练和测试。对Longformer、BERT和ClinicalBERT模型进行评估,并在测试集的35个随机子集(每个子集包含70%的测试数据)上评估模型的性能。采用Wilcoxon-Mann-Whitney检验评价统计学显著性,采用Bonferroni校正进行多假设检验。基线模型包括零射击和少射击gpt - 40,支持向量机,随机森林,逻辑回归和多项朴素贝叶斯。结果:所提出的框架准确率最高,完全适应的Longformer准确率达到89.83%。与所有基线相比,Bonferroni校正后的性能改善仍具有统计学意义(p < 0.05)。该框架在不同的污名化语言类型中显示出强劲的收益。讨论:本研究强调了领域自适应自然语言处理在电子病历中检测污名化语言的价值。多阶段迁移学习框架有效地捕获了传统模型经常遗漏的微妙偏见,从而实现更客观和尊重的临床记录。结论:该框架提供了一个经过统计验证的高性能框架,用于检测电子病历中的污名化语言,支持负责任的人工智能和促进临床护理的公平性。
{"title":"Automated detection of stigmatizing language in Electronic Health Records (EHRs) using a multi-stage transfer learning approach.","authors":"Liyang Xue, A M Muntasir Rahman, Charles R Senteio, Vivek K Singh","doi":"10.1093/jamia/ocaf193","DOIUrl":"https://doi.org/10.1093/jamia/ocaf193","url":null,"abstract":"<p><strong>Objective: </strong>Stigmatizing language (SL) in Electronic Health Records (EHRs) can perpetuate biases and negatively impact patient care. This study introduces a novel method for automatically detecting such language to improve healthcare documentation practices.</p><p><strong>Materials and methods: </strong>We developed a multi-stage transfer learning framework integrating semantic, syntactic, and task adaptation using three datasets: hate speech, clinical phenotypes, and stigmatizing language. Experiments were conducted on stigmatizing language dataset which consists of 4,129 de-identified EHR notes (72.7% stigmatizing, 27.3% non-stigmatizing), split 80/20 for training and testing. Longformer, BERT, and ClinicalBERT models were evaluated, and model performance was assessed on 35 randomized subsets of the test set (each comprising 70% of test data). The Wilcoxon-Mann-Whitney test was used to evaluate statistical significance, with Bonferroni correction applied to control for multiple hypothesis testing. Baseline models included zero-shot and few-shot GPT-4o, Support Vector Machine, Random Forest, Logistic Regression, and Multinomial Naive Bayes.</p><p><strong>Results: </strong>The proposed framework achieved the highest accuracy, with fully adapted Longformer reaching 89.83%. Performance improvements remained statistically significant after Bonferroni correction compared to all baselines (p < .05). The framework demonstrated robust gains across different stigmatizing language types.</p><p><strong>Discussion: </strong>This study underscores the value of domain-adaptive NLP for detecting stigmatizing language in EHRs. The multi-stage transfer learning framework effectively captures subtle biases often missed by conventional models, enabling more objective and respectful clinical documentation.</p><p><strong>Conclusion: </strong>This framework offers a statistically validated, high-performing framework for detecting stigmatizing language in EHRs, supporting responsible AI and promoting equity in clinical care.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.6,"publicationDate":"2025-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145483670","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A scoping review of models to identify transgender patients in electronic health records. 电子健康记录中识别跨性别患者模型的范围审查。
IF 4.6 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-11-04 DOI: 10.1093/jamia/ocaf185
Robert A Becker, Jhansi U L Kolli, Colin G Walsh

Objective: Electronic health records (EHRs) lack a widely adopted standard for recording transgender and gender diverse (TGD) status, complicating research on TGD health. Computational models have been developed to identify TGD individuals in EHRs; however, gaps remain in understanding which components contribute to stronger phenotyping approaches. This scoping review evaluates EHR-based models for identifying TGD individuals, focusing on identifier types, performance, external validation, and ethical reporting to guide best practices.

Materials and methods: We searched PubMed, CINAHL, Web of Science, and Embase for peer-reviewed articles published before January 2024, following PRISMA-ScR guidelines. Included studies used EHR data to identify TGD individuals, verified TGD status, reported or allowed calculation of positive predictive value (PPV), and listed identifiers. Two authors screened and extracted data. We categorized models by data type and logic (structured, unstructured, and multimodal), summarized PPV distributions, and synthesized author-reported ethical considerations.

Results: Fourteen studies describing 50 models met inclusion criteria. Models using TGD-related diagnostic codes alone (n = 11) or requiring both structured and unstructured data (n = 6) showed the highest mean PPVs (85.3% and 97.1%). Models validated on larger confirmed TGD cohorts reported more stable performance, but external validation was rare. Most studies minimally addressed ethics; only 3 described protective measures or stakeholder engagement.

Discussion: Phenotyping of TGD individuals in EHR data remains heterogeneous in design and ethical transparency. Reported PPVs should be interpreted cautiously, as performance is influenced by study design, sample size, and verification methods.

Conclusions: Our recommendations emphasize the components that strengthen phenotyping approaches-identifier choice, multimodal intersection logic, validation practices, and ethical safeguards-rather than endorsing any single model.

目的:电子健康档案(Electronic health records, EHRs)缺乏广泛采用的跨性别和性别多样性(TGD)状态记录标准,使TGD健康研究复杂化。已经开发了计算模型来识别电子病历中的TGD个体;然而,在了解哪些成分有助于更强的表型方法方面仍然存在差距。此范围审查评估了用于识别TGD个体的基于ehr的模型,重点关注标识符类型、性能、外部验证和道德报告,以指导最佳实践。材料和方法:我们按照PRISMA-ScR指南,检索PubMed、CINAHL、Web of Science和Embase,检索2024年1月之前发表的同行评议文章。纳入的研究使用EHR数据来识别TGD个体,验证TGD状态,报告或允许计算阳性预测值(PPV),并列出标识符。两位作者筛选和提取数据。我们根据数据类型和逻辑(结构化、非结构化和多模态)对模型进行了分类,总结了PPV分布,并综合了作者报告的伦理考虑。结果:14项研究描述50个模型符合纳入标准。单独使用tgd相关诊断代码(n = 11)或同时需要结构化和非结构化数据(n = 6)的模型显示最高的平均ppv(85.3%和97.1%)。在更大的TGD队列中验证的模型报告了更稳定的性能,但外部验证很少。大多数研究很少涉及伦理问题;只有3个描述了保护措施或利益相关者参与。讨论:电子病历数据中TGD个体的表型在设计和伦理透明度方面仍然存在异质性。报告的ppv应谨慎解释,因为性能受研究设计、样本量和验证方法的影响。结论:我们的建议强调加强表型方法的组成部分——标识符选择、多模态交叉逻辑、验证实践和伦理保障——而不是支持任何单一模型。
{"title":"A scoping review of models to identify transgender patients in electronic health records.","authors":"Robert A Becker, Jhansi U L Kolli, Colin G Walsh","doi":"10.1093/jamia/ocaf185","DOIUrl":"https://doi.org/10.1093/jamia/ocaf185","url":null,"abstract":"<p><strong>Objective: </strong>Electronic health records (EHRs) lack a widely adopted standard for recording transgender and gender diverse (TGD) status, complicating research on TGD health. Computational models have been developed to identify TGD individuals in EHRs; however, gaps remain in understanding which components contribute to stronger phenotyping approaches. This scoping review evaluates EHR-based models for identifying TGD individuals, focusing on identifier types, performance, external validation, and ethical reporting to guide best practices.</p><p><strong>Materials and methods: </strong>We searched PubMed, CINAHL, Web of Science, and Embase for peer-reviewed articles published before January 2024, following PRISMA-ScR guidelines. Included studies used EHR data to identify TGD individuals, verified TGD status, reported or allowed calculation of positive predictive value (PPV), and listed identifiers. Two authors screened and extracted data. We categorized models by data type and logic (structured, unstructured, and multimodal), summarized PPV distributions, and synthesized author-reported ethical considerations.</p><p><strong>Results: </strong>Fourteen studies describing 50 models met inclusion criteria. Models using TGD-related diagnostic codes alone (n = 11) or requiring both structured and unstructured data (n = 6) showed the highest mean PPVs (85.3% and 97.1%). Models validated on larger confirmed TGD cohorts reported more stable performance, but external validation was rare. Most studies minimally addressed ethics; only 3 described protective measures or stakeholder engagement.</p><p><strong>Discussion: </strong>Phenotyping of TGD individuals in EHR data remains heterogeneous in design and ethical transparency. Reported PPVs should be interpreted cautiously, as performance is influenced by study design, sample size, and verification methods.</p><p><strong>Conclusions: </strong>Our recommendations emphasize the components that strengthen phenotyping approaches-identifier choice, multimodal intersection logic, validation practices, and ethical safeguards-rather than endorsing any single model.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.6,"publicationDate":"2025-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145446502","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Retraction and replacement of: Electronic connectivity between hospital pairs: impact on emergency department-related utilization. 撤销和更换:医院对之间的电子连接:对急诊科相关利用的影响。
IF 4.6 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-11-01 DOI: 10.1093/jamia/ocaf158
{"title":"Retraction and replacement of: Electronic connectivity between hospital pairs: impact on emergency department-related utilization.","authors":"","doi":"10.1093/jamia/ocaf158","DOIUrl":"10.1093/jamia/ocaf158","url":null,"abstract":"","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":"32 11","pages":"1789"},"PeriodicalIF":4.6,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12626204/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145551556","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of the American Medical Informatics Association
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1