首页 > 最新文献

JMIR Medical Informatics最新文献

英文 中文
Patient Attitudes Toward Ambient Voice Technology: Preimplementation Patient Survey in an Academic Medical Center. 患者对环境语音技术的态度:在学术医疗中心实施前的患者调查。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-11-27 DOI: 10.2196/77901
Gary Leiserowitz, Jeff Mansfield, Scott MacDonald, Melissa Jost
<p><strong>Background: </strong>Many institutions are in various stages of deploying an artificial intelligence (AI) scribe system for clinic electronic health record (EHR) documentation. In anticipation of the University of California, Davis Health's deployment of an AI scribe program, we surveyed current patients about their perceptions of this technology to inform a patient-centered implementation.</p><p><strong>Objective: </strong>We assessed patient perceptions about current clinician EHR documentation practices before implementation of the AI scribe program, and preconceptions regarding the AI scribe's introduction.</p><p><strong>Methods: </strong>We conducted a descriptive preimplementation survey as a quality improvement study. A convenience sample of 9171 patients (aged ≥18 years) who had a clinic visit within the previous year, was recruited via an email postvisit survey. Patient-identified demographics (age, gender, and race and ethnicity) were collected. The survey included rating scales on questions related to the patient perception of the AI scribe program, plus open-ended comments. Data were collated to analyze patient perceptions of including AI Scribe technology in a clinician visit.</p><p><strong>Results: </strong>In total, 1893 patients completed the survey (20% response rate), with partial responses from another 549. Sixty-three percent (n=1205) of the respondents were female, and most were 51 years and older (87%, n=1649). Most patients identified themselves as White (69%, n=1312), multirace (8%, n=154), Latinx (7%, n=130), and Black (2%, n=42). The respondents were not representative of the overall clinic populations and skewed more toward being female, ages 50 years and older, and White in comparison. Patients reacted to the current EHR documentation system, with 71% (n=1349) feeling heard or sometimes heard, but 23% (n=416) expressed frustrations that their physician focused too much on typing into the computer. When asked about their anticipated response to the use of an AI scribe, 48% (n=904) were favorable, 33% (n=630) were neutral, and 19% (n=359) were unfavorable. Younger patients (ages 18-30 years) expressed more skepticism than those aged 51 years and older. Further, 42% (655/1567) of positive comments received indicated this technology could improve human interaction during their visits. Comments supported that the use of an AI scribe would enhance patient experience by allowing the clinician to focus on the patient. However, when asked about concerns regarding the AI scribe, 39% (515/1330) and 15% (203/1330) of comments expressed concerns about documentation accuracy and privacy, respectively. Providing previsit patient education and obtaining permission were viewed as very important.</p><p><strong>Conclusions: </strong>This patient survey showed that respondents are generally open to the use of an AI scribe program for EHR documentation to allow the clinician to focus on the patient during the actual encounter ra
背景:许多机构正处于为诊所电子健康记录(EHR)文档部署人工智能(AI)抄写系统的不同阶段。考虑到加州大学戴维斯健康中心(University of California, Davis Health)部署的人工智能记录程序,我们调查了当前患者对这项技术的看法,以告知以患者为中心的实施。目的:在实施人工智能抄写员计划之前,我们评估了患者对当前临床医生电子病历记录实践的看法,以及对人工智能抄写员引入的先入之见。方法:我们进行了一项描述性的实施前调查作为质量改进研究。通过电子邮件访后调查,选取9171例在前一年就诊的患者(年龄≥18岁)作为方便样本。收集患者确定的人口统计数据(年龄、性别、种族和民族)。该调查包括对患者对人工智能抄写程序的看法相关问题的评分量表,以及开放式评论。整理数据以分析患者对在临床医生访问中使用AI Scribe技术的看法。结果:共有1893名患者完成了调查(20%的有效率),另有549名患者部分应答。受访者中女性占63% (n=1205), 51岁及以上的占87% (n= 1649)。大多数患者认为自己是白人(69%,n=1312)、多种族(8%,n=154)、拉丁裔(7%,n=130)和黑人(2%,n=42)。受访者并不能代表整个诊所的人群,相比之下,他们更倾向于50岁及以上的女性和白人。患者对当前的EHR文件系统的反应是,71% (n=1349)的患者感觉被听到或有时被听到,但23% (n=416)的患者对他们的医生过于专注于在电脑上打字表示失望。当被问及他们对使用AI抄写器的预期反应时,48% (n=904)表示赞成,33% (n=630)表示中立,19% (n=359)表示不赞成。年轻患者(18-30岁)比51岁及以上的患者表达更多的怀疑。此外,收到的42%(655/1567)的积极评论表明,这项技术可以改善他们访问期间的人际互动。评论认为,使用人工智能抄写员可以让临床医生专注于患者,从而提高患者体验。然而,当被问及对AI抄写员的担忧时,39%(515/1330)和15%(203/1330)的评论分别表达了对文档准确性和隐私性的担忧。在就诊前对患者进行教育并获得许可是非常重要的。结论:该患者调查显示,受访者通常对使用人工智能抄写程序进行电子病历记录持开放态度,以便临床医生在实际遇到患者时专注于患者,而不是计算机。在使用人工智能之前提供患者教育和征得患者同意是获得患者信任的重要组成部分。考虑到低回复率和非代表性,对结果保持谨慎是适当的。
{"title":"Patient Attitudes Toward Ambient Voice Technology: Preimplementation Patient Survey in an Academic Medical Center.","authors":"Gary Leiserowitz, Jeff Mansfield, Scott MacDonald, Melissa Jost","doi":"10.2196/77901","DOIUrl":"10.2196/77901","url":null,"abstract":"&lt;p&gt;&lt;strong&gt;Background: &lt;/strong&gt;Many institutions are in various stages of deploying an artificial intelligence (AI) scribe system for clinic electronic health record (EHR) documentation. In anticipation of the University of California, Davis Health's deployment of an AI scribe program, we surveyed current patients about their perceptions of this technology to inform a patient-centered implementation.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Objective: &lt;/strong&gt;We assessed patient perceptions about current clinician EHR documentation practices before implementation of the AI scribe program, and preconceptions regarding the AI scribe's introduction.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Methods: &lt;/strong&gt;We conducted a descriptive preimplementation survey as a quality improvement study. A convenience sample of 9171 patients (aged ≥18 years) who had a clinic visit within the previous year, was recruited via an email postvisit survey. Patient-identified demographics (age, gender, and race and ethnicity) were collected. The survey included rating scales on questions related to the patient perception of the AI scribe program, plus open-ended comments. Data were collated to analyze patient perceptions of including AI Scribe technology in a clinician visit.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Results: &lt;/strong&gt;In total, 1893 patients completed the survey (20% response rate), with partial responses from another 549. Sixty-three percent (n=1205) of the respondents were female, and most were 51 years and older (87%, n=1649). Most patients identified themselves as White (69%, n=1312), multirace (8%, n=154), Latinx (7%, n=130), and Black (2%, n=42). The respondents were not representative of the overall clinic populations and skewed more toward being female, ages 50 years and older, and White in comparison. Patients reacted to the current EHR documentation system, with 71% (n=1349) feeling heard or sometimes heard, but 23% (n=416) expressed frustrations that their physician focused too much on typing into the computer. When asked about their anticipated response to the use of an AI scribe, 48% (n=904) were favorable, 33% (n=630) were neutral, and 19% (n=359) were unfavorable. Younger patients (ages 18-30 years) expressed more skepticism than those aged 51 years and older. Further, 42% (655/1567) of positive comments received indicated this technology could improve human interaction during their visits. Comments supported that the use of an AI scribe would enhance patient experience by allowing the clinician to focus on the patient. However, when asked about concerns regarding the AI scribe, 39% (515/1330) and 15% (203/1330) of comments expressed concerns about documentation accuracy and privacy, respectively. Providing previsit patient education and obtaining permission were viewed as very important.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Conclusions: &lt;/strong&gt;This patient survey showed that respondents are generally open to the use of an AI scribe program for EHR documentation to allow the clinician to focus on the patient during the actual encounter ra","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e77901"},"PeriodicalIF":3.8,"publicationDate":"2025-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12699246/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145643431","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Risk Prediction of Major Adverse Cardiovascular Events Within One Year After Percutaneous Coronary Intervention in Patients With Acute Coronary Syndrome: Machine Learning-Based Time-to-Event Analysis. 急性冠脉综合征患者经皮冠状动脉介入治疗后一年内主要不良心血管事件的风险预测:基于机器学习的事件时间分析。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-11-27 DOI: 10.2196/81778
Hong-Jae Choi, Changhee Lee, Hack-Lyoung Kim, Youn-Jung Son

Background: Patients with acute coronary syndrome (ACS) who undergo percutaneous coronary intervention (PCI) remain at high risk for major adverse cardiovascular events (MACE). Conventional risk scores may not capture dynamic or nonlinear changes in postdischarge MACE risk, whereas machine learning (ML) approaches can improve predictive performance. However, few ML models have incorporated time-to-event analysis to reflect changes in MACE risk over time.

Objective: This study aimed to develop a time-to-event ML model for predicting MACE after PCI in patients with ACS and to identify the risk factors with time-varying contributions.

Methods: We analyzed electronic health records of 3159 patients with ACS who underwent PCI at a tertiary hospital in South Korea between 2008 and 2020. Six time-to-event ML models were developed using 54 variables. Model performance was evaluated using the time-dependent concordance index and Brier score. Variable importance was assessed using permutation importance and visualized with partial dependence plots to identify variables contributing to MACE risk over time.

Results: During a median follow-up of 3.8 years, 626 (19.8%) patients experienced MACE. The best-performing model achieved a time-dependent concordance index of 0.743 at day 30 and 0.616 at 1 year. Time-dependent Brier scores increased and remained stable across all ML models. Key predictors included contrast volume, age, medication adherence, coronary artery disease severity, and glomerular filtration rate. Contrast volume ≥300 mL, age ≥60 years, and medication adherence score ≥30 were associated with early postdischarge risk, whereas coronary artery disease severity and glomerular filtration rate became more influential beyond 60 days.

Conclusions: The proposed time-to-event ML model effectively captured dynamic risk patterns after PCI and identified key predictors with time-varying effects. These findings may support individualized postdischarge management and early intervention strategies to prevent MACE in high-risk patients.

背景:急性冠脉综合征(ACS)患者接受经皮冠状动脉介入治疗(PCI)后仍有发生重大心血管不良事件(MACE)的高风险。传统的风险评分可能无法捕捉出院后MACE风险的动态或非线性变化,而机器学习(ML)方法可以提高预测性能。然而,很少有机器学习模型结合了事件时间分析来反映MACE风险随时间的变化。目的:本研究旨在建立一个预测ACS患者PCI术后MACE的时间-事件ML模型,并确定具有时变贡献的危险因素。方法:我们分析了2008年至2020年在韩国一家三级医院接受PCI治疗的3159例ACS患者的电子健康记录。使用54个变量开发了6个时间到事件的ML模型。使用时间相关的一致性指数和Brier评分来评估模型的性能。变量重要性评估使用排列重要性和可视化的部分依赖图,以确定影响MACE风险的变量随着时间的推移。结果:在中位随访3.8年期间,626例(19.8%)患者经历了MACE。表现最好的模型在第30天和第1年的时间相关一致性指数分别为0.743和0.616。时间相关的Brier评分在所有ML模型中增加并保持稳定。主要预测因素包括造影剂体积、年龄、药物依从性、冠状动脉疾病严重程度和肾小球滤过率。造影剂体积≥300 mL、年龄≥60岁、药物依从性评分≥30与早期出院后风险相关,而冠状动脉疾病严重程度和肾小球滤过率在60天后的影响更大。结论:提出的时间-事件ML模型有效地捕获了PCI术后的动态风险模式,并确定了具有时变效应的关键预测因子。这些发现可能支持个体化的出院后管理和早期干预策略,以预防高危患者的MACE。
{"title":"Risk Prediction of Major Adverse Cardiovascular Events Within One Year After Percutaneous Coronary Intervention in Patients With Acute Coronary Syndrome: Machine Learning-Based Time-to-Event Analysis.","authors":"Hong-Jae Choi, Changhee Lee, Hack-Lyoung Kim, Youn-Jung Son","doi":"10.2196/81778","DOIUrl":"10.2196/81778","url":null,"abstract":"<p><strong>Background: </strong>Patients with acute coronary syndrome (ACS) who undergo percutaneous coronary intervention (PCI) remain at high risk for major adverse cardiovascular events (MACE). Conventional risk scores may not capture dynamic or nonlinear changes in postdischarge MACE risk, whereas machine learning (ML) approaches can improve predictive performance. However, few ML models have incorporated time-to-event analysis to reflect changes in MACE risk over time.</p><p><strong>Objective: </strong>This study aimed to develop a time-to-event ML model for predicting MACE after PCI in patients with ACS and to identify the risk factors with time-varying contributions.</p><p><strong>Methods: </strong>We analyzed electronic health records of 3159 patients with ACS who underwent PCI at a tertiary hospital in South Korea between 2008 and 2020. Six time-to-event ML models were developed using 54 variables. Model performance was evaluated using the time-dependent concordance index and Brier score. Variable importance was assessed using permutation importance and visualized with partial dependence plots to identify variables contributing to MACE risk over time.</p><p><strong>Results: </strong>During a median follow-up of 3.8 years, 626 (19.8%) patients experienced MACE. The best-performing model achieved a time-dependent concordance index of 0.743 at day 30 and 0.616 at 1 year. Time-dependent Brier scores increased and remained stable across all ML models. Key predictors included contrast volume, age, medication adherence, coronary artery disease severity, and glomerular filtration rate. Contrast volume ≥300 mL, age ≥60 years, and medication adherence score ≥30 were associated with early postdischarge risk, whereas coronary artery disease severity and glomerular filtration rate became more influential beyond 60 days.</p><p><strong>Conclusions: </strong>The proposed time-to-event ML model effectively captured dynamic risk patterns after PCI and identified key predictors with time-varying effects. These findings may support individualized postdischarge management and early intervention strategies to prevent MACE in high-risk patients.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e81778"},"PeriodicalIF":3.8,"publicationDate":"2025-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12699253/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145643476","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unsupervised Coverage Sampling to Enhance Clinical Chart Review Coverage for Computable Phenotype Development: Simulation and Empirical Study. 无监督覆盖抽样提高临床图表审查覆盖可计算表型发展:模拟和实证研究。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-11-27 DOI: 10.2196/72068
Zigui Wang, Jillian H Hurst, Chuan Hong, Benjamin Alan Goldstein

Background: Developing computable phenotypes (CP) based on electronic health records (EHR) data requires "gold-standard" labels for the outcome of interest. To generate these labels, clinicians typically chart-review a subset of patient charts. Charts to be reviewed are most often randomly sampled from the larger set of patients of interest. However, random sampling may fail to capture the diversity of the patient population, particularly if smaller subpopulations exist among those with the condition of interest. This can lead to poorly performing and biased CPs.

Objective: This study aimed to propose an unsupervised sampling approach designed to better capture a diverse patient cohort and improve the information coverage of chart review samples.

Methods: Our coverage sampling method starts by clustering by the patient population of interest. We then perform a stratified sampling from each cluster to ensure even representation for the chart review sample. We introduce a novel metric, nearest neighbor distance, to evaluate the coverage of the generated sample. To evaluate our method, we first conducted a simulation study to model and compare the performance of random versus our proposed coverage sampling. We varied the size and number of subpopulations within the larger cohort. Finally, we apply our approach to a real-world data set to develop a CP for hospitalization due to COVID-19. We evaluate the different sampling strategies based on the information coverage as well as the performance of the learned CP, using the area under the receiver operator characteristic curve.

Results: Our simulation studies show that the unsupervised coverage sampling approach provides broader coverage of patient populations compared to random sampling. When there are no underlying subpopulations, both random and coverage perform equally well for CP development. When there are subgroups, coverage sampling achieves area under the receiver operating characteristic curve gains of approximately 0.03-0.05 over random sampling. In the real-world application, the approach also outperformed random sampling, generating both a more representative sample and an area under the receiver operating characteristic curve improvement of 0.02 (95% CI -0.08 to 0.04).

Conclusions: The proposed coverage sampling method is an easy-to-implement approach that produces a chart review sample that is more representative of the source population. This allows one to learn a CP that has better performance both for subpopulations and the overall cohort. Studies that aim to develop CPs should consider alternative strategies other than randomly sampling patient charts.

背景:基于电子健康记录(EHR)数据开发可计算表型(CP)需要对感兴趣的结果进行“金标准”标签。为了生成这些标签,临床医生通常会对患者图表的一个子集进行图表审查。要审查的图表通常是从感兴趣的较大患者组中随机抽取的。然而,随机抽样可能无法捕获患者群体的多样性,特别是如果在那些有兴趣的条件中存在较小的亚群。这可能导致cp表现不佳和有偏见。目的:本研究旨在提出一种无监督抽样方法,旨在更好地捕获多样化的患者队列,并提高图表回顾样本的信息覆盖率。方法:我们的覆盖抽样方法从感兴趣的患者群体聚类开始。然后,我们从每个集群中执行分层抽样,以确保图表审查样本的均匀表示。我们引入了一种新的度量,最近邻距离,来评估生成样本的覆盖率。为了评估我们的方法,我们首先进行了模拟研究,对随机抽样和我们建议的覆盖抽样的性能进行了建模和比较。我们在更大的队列中改变了亚种群的大小和数量。最后,我们将我们的方法应用于现实世界的数据集,以制定因COVID-19住院的CP。我们利用接收算子特征曲线下的面积,根据信息覆盖率和学习到的CP的性能来评估不同的采样策略。结果:我们的模拟研究表明,与随机抽样相比,无监督覆盖抽样方法提供了更广泛的患者群体覆盖。当没有潜在的亚群时,对于CP的发展,随机和覆盖都表现得同样好。当存在子组时,覆盖抽样比随机抽样实现了接受者工作特征曲线下面积增益约0.03-0.05。在实际应用中,该方法也优于随机抽样,产生了更具代表性的样本,并且接收者工作特征曲线下的面积提高了0.02 (95% CI -0.08至0.04)。结论:建议的覆盖抽样方法是一种易于实施的方法,它产生的图表审查样本更能代表源人群。这允许人们学习一种对亚群体和整体群体都有更好表现的CP。旨在发展CPs的研究应考虑其他策略,而不是随机抽样患者图表。
{"title":"Unsupervised Coverage Sampling to Enhance Clinical Chart Review Coverage for Computable Phenotype Development: Simulation and Empirical Study.","authors":"Zigui Wang, Jillian H Hurst, Chuan Hong, Benjamin Alan Goldstein","doi":"10.2196/72068","DOIUrl":"10.2196/72068","url":null,"abstract":"<p><strong>Background: </strong>Developing computable phenotypes (CP) based on electronic health records (EHR) data requires \"gold-standard\" labels for the outcome of interest. To generate these labels, clinicians typically chart-review a subset of patient charts. Charts to be reviewed are most often randomly sampled from the larger set of patients of interest. However, random sampling may fail to capture the diversity of the patient population, particularly if smaller subpopulations exist among those with the condition of interest. This can lead to poorly performing and biased CPs.</p><p><strong>Objective: </strong>This study aimed to propose an unsupervised sampling approach designed to better capture a diverse patient cohort and improve the information coverage of chart review samples.</p><p><strong>Methods: </strong>Our coverage sampling method starts by clustering by the patient population of interest. We then perform a stratified sampling from each cluster to ensure even representation for the chart review sample. We introduce a novel metric, nearest neighbor distance, to evaluate the coverage of the generated sample. To evaluate our method, we first conducted a simulation study to model and compare the performance of random versus our proposed coverage sampling. We varied the size and number of subpopulations within the larger cohort. Finally, we apply our approach to a real-world data set to develop a CP for hospitalization due to COVID-19. We evaluate the different sampling strategies based on the information coverage as well as the performance of the learned CP, using the area under the receiver operator characteristic curve.</p><p><strong>Results: </strong>Our simulation studies show that the unsupervised coverage sampling approach provides broader coverage of patient populations compared to random sampling. When there are no underlying subpopulations, both random and coverage perform equally well for CP development. When there are subgroups, coverage sampling achieves area under the receiver operating characteristic curve gains of approximately 0.03-0.05 over random sampling. In the real-world application, the approach also outperformed random sampling, generating both a more representative sample and an area under the receiver operating characteristic curve improvement of 0.02 (95% CI -0.08 to 0.04).</p><p><strong>Conclusions: </strong>The proposed coverage sampling method is an easy-to-implement approach that produces a chart review sample that is more representative of the source population. This allows one to learn a CP that has better performance both for subpopulations and the overall cohort. Studies that aim to develop CPs should consider alternative strategies other than randomly sampling patient charts.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e72068"},"PeriodicalIF":3.8,"publicationDate":"2025-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12661603/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145643472","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Artificial Intelligence-Based Computerized Digit Vigilance Test in Community-Dwelling Older Adults: Development and Validation Study. 基于人工智能的社区老年人手指警觉性计算机化测试:开发与验证研究。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-11-26 DOI: 10.2196/73038
Gong-Hong Lin, Dorothy Bai, Yi-Jing Huang, Shih-Chieh Lee, Mai Thi Thuy Vu, Tsu-Hsien Chiu

Background: The Computerized Digit Vigilance Test (CDVT) is a well-established measure of sustained attention. However, the CDVT only measures the total reaction time and response accuracy and fails to capture other crucial attentional features such as the eye blink rate, yawns, head movements, and eye movements. Omitting such features might provide an incomplete representative picture of sustained attention.

Objective: This study aimed to develop an artificial intelligence (AI)-based Computerized Digit Vigilance Test (AI-CDVT) for older adults.

Methods: Participants were assessed by the CDVT with video recordings capturing their head and face. The Montreal Cognitive Assessment (MoCA), Stroop Color Word Test (SCW), and Color Trails Test (CTT) were also administered. The AI-CDVT was developed in three steps: (1) retrieving attentional features using OpenFace AI software (CMU MultiComp Lab), (2) establishing an AI-based scoring model with the Extreme Gradient Boosting regressor, and (3) assessing the AI-CDVT's validity by Pearson r values and test-retest reliability by intraclass correlation coefficients (ICCs).

Results: In total, 153 participants were included. Pearson r values of the AI-CDVT with the MoCA were -0.42, -0.31 with the SCW, and 0.46-0.61 with the CTT. The ICC of the AI-CDVT was 0.78.

Conclusions: We developed an AI-CDVT, which leveraged AI to extract attentional features from video recordings and integrated them to generate a comprehensive attention score. Our findings demonstrated good validity and test-retest reliability for the AI-CDVT, suggesting its potential as a reliable and valid tool for assessing sustained attention in older adults.

背景:计算机数字警觉性测试(CDVT)是一种公认的持续注意的测量方法。然而,CDVT只测量总反应时间和反应准确性,而不能捕捉到其他关键的注意力特征,如眨眼频率、打哈欠、头部运动和眼球运动。忽略这些特征可能会提供一个不完整的持续关注的代表性图像。目的:本研究旨在开发一种基于人工智能(AI)的老年人计算机手指警觉性测试(AI- cdvt)。方法:对参与者进行CDVT评估,录像记录他们的头部和面部。同时进行蒙特利尔认知评估(MoCA)、Stroop颜色单词测试(SCW)和颜色轨迹测试(CTT)。AI- cdvt的开发分为三个步骤:(1)使用OpenFace人工智能软件(CMU MultiComp Lab)检索注意特征;(2)使用极端梯度增强回归器建立基于人工智能的评分模型;(3)使用Pearson r值评估AI- cdvt的效度,使用类内相关系数(ICCs)评估AI- cdvt的重测信度。结果:共纳入153名受试者。AI-CDVT与MoCA的Pearson r值为-0.42,与SCW的Pearson r值为-0.31,与CTT的Pearson r值为0.46-0.61。AI-CDVT的ICC为0.78。结论:我们开发了一个AI- cdvt,它利用AI从视频记录中提取注意力特征,并将它们整合起来生成一个综合的注意力评分。我们的研究结果表明,AI-CDVT具有良好的效度和重测信度,这表明它有可能成为评估老年人持续注意力的可靠和有效的工具。
{"title":"Artificial Intelligence-Based Computerized Digit Vigilance Test in Community-Dwelling Older Adults: Development and Validation Study.","authors":"Gong-Hong Lin, Dorothy Bai, Yi-Jing Huang, Shih-Chieh Lee, Mai Thi Thuy Vu, Tsu-Hsien Chiu","doi":"10.2196/73038","DOIUrl":"10.2196/73038","url":null,"abstract":"<p><strong>Background: </strong>The Computerized Digit Vigilance Test (CDVT) is a well-established measure of sustained attention. However, the CDVT only measures the total reaction time and response accuracy and fails to capture other crucial attentional features such as the eye blink rate, yawns, head movements, and eye movements. Omitting such features might provide an incomplete representative picture of sustained attention.</p><p><strong>Objective: </strong>This study aimed to develop an artificial intelligence (AI)-based Computerized Digit Vigilance Test (AI-CDVT) for older adults.</p><p><strong>Methods: </strong>Participants were assessed by the CDVT with video recordings capturing their head and face. The Montreal Cognitive Assessment (MoCA), Stroop Color Word Test (SCW), and Color Trails Test (CTT) were also administered. The AI-CDVT was developed in three steps: (1) retrieving attentional features using OpenFace AI software (CMU MultiComp Lab), (2) establishing an AI-based scoring model with the Extreme Gradient Boosting regressor, and (3) assessing the AI-CDVT's validity by Pearson r values and test-retest reliability by intraclass correlation coefficients (ICCs).</p><p><strong>Results: </strong>In total, 153 participants were included. Pearson r values of the AI-CDVT with the MoCA were -0.42, -0.31 with the SCW, and 0.46-0.61 with the CTT. The ICC of the AI-CDVT was 0.78.</p><p><strong>Conclusions: </strong>We developed an AI-CDVT, which leveraged AI to extract attentional features from video recordings and integrated them to generate a comprehensive attention score. Our findings demonstrated good validity and test-retest reliability for the AI-CDVT, suggesting its potential as a reliable and valid tool for assessing sustained attention in older adults.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e73038"},"PeriodicalIF":3.8,"publicationDate":"2025-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12670460/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145656612","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Predicting 30-Days Hospital Readmission for Patients with Heart Failure Using Electronic Health Record Embeddings: Comparative Evaluation. 使用电子健康记录嵌入预测心力衰竭患者30天再入院:比较评估
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-11-25 DOI: 10.2196/73020
Prabin Shakya, Ayush Khaneja, Kavishwar B Wagholikar

Background: Heart failure (HF) is a public health concern with a wider impact on quality of life and cost of care. One of the major challenges in HF is the higher rate of unplanned readmissions and suboptimal performance of models to predict the readmissions. Hence, in this study, we implemented embeddings-based approaches to generate features for improving model performance.

Objective: The objective of this study was to evaluate and compare the effectiveness of different feature embedding approaches for improving the prediction of unplanned readmissions in patients with heart failure.

Methods: We compared three embedding approaches including word2vec on terminology codes and concept unique identifier (CUIs) and BERT on descriptive text of concept with baseline (one hot-encoding). We compared area under the receiver operating characteristic (AUROC) and F1-scores for the logistic regression, eXtream gradient-boosting (XGBoost) and artificial neural network (ANN) models using these embedding approaches. The model was tested on the heart failure cohort (N=21,031) identified using least restrictive phenotyping methods from MIMIC-IV dataset.

Results: We found that the embedding approaches significantly improved the performance of the prediction models. The XGBoost performed better for all approaches. The word2vec embeddings (0.65) trained on the dataset outperformed embeddings from pre-trained BERT model (0.59) using descriptive text.

Conclusions: Embedding methods, particularly word2vec trained on electronic health record data, can better discriminate HF readmission cases compared to both one-hot encoding and pre-trained BERT embeddings on concept descriptions making it a viable approach of automation feature selection. The observed AUROC improvement (0.65 vs 0.54) may support more effective risk stratification and targeted clinical interventions.

背景:心力衰竭(HF)是一个公共卫生问题,对生活质量和护理成本有更广泛的影响。心衰的主要挑战之一是较高的非计划再入院率和预测再入院模型的次优性能。因此,在本研究中,我们实现了基于嵌入的方法来生成特征以提高模型性能。目的:本研究的目的是评估和比较不同特征嵌入方法在改善心衰患者意外再入院预测方面的有效性。方法:采用word2vec方法对术语编码和概念唯一标识符(gui)进行嵌入,BERT方法对具有基线的概念描述文本进行嵌入(一种热编码)。我们比较了使用这些嵌入方法的逻辑回归、极端梯度增强(XGBoost)和人工神经网络(ANN)模型的接收者操作特征(AUROC)下的面积和f1得分。该模型在使用MIMIC-IV数据集中限制性最小的表型方法确定的心力衰竭队列(N= 21031)中进行了测试。结果:我们发现嵌入方法显著提高了预测模型的性能。XGBoost在所有方法中都表现得更好。在数据集上训练的word2vec嵌入(0.65)优于使用描述性文本的预训练BERT模型的嵌入(0.59)。结论:与单热编码和概念描述的预训练BERT嵌入相比,嵌入方法,特别是在电子健康记录数据上训练的word2vec,可以更好地区分HF再入院病例,使其成为一种可行的自动化特征选择方法。观察到的AUROC改善(0.65 vs 0.54)可能支持更有效的风险分层和有针对性的临床干预。
{"title":"Predicting 30-Days Hospital Readmission for Patients with Heart Failure Using Electronic Health Record Embeddings: Comparative Evaluation.","authors":"Prabin Shakya, Ayush Khaneja, Kavishwar B Wagholikar","doi":"10.2196/73020","DOIUrl":"10.2196/73020","url":null,"abstract":"<p><strong>Background: </strong>Heart failure (HF) is a public health concern with a wider impact on quality of life and cost of care. One of the major challenges in HF is the higher rate of unplanned readmissions and suboptimal performance of models to predict the readmissions. Hence, in this study, we implemented embeddings-based approaches to generate features for improving model performance.</p><p><strong>Objective: </strong>The objective of this study was to evaluate and compare the effectiveness of different feature embedding approaches for improving the prediction of unplanned readmissions in patients with heart failure.</p><p><strong>Methods: </strong>We compared three embedding approaches including word2vec on terminology codes and concept unique identifier (CUIs) and BERT on descriptive text of concept with baseline (one hot-encoding). We compared area under the receiver operating characteristic (AUROC) and F1-scores for the logistic regression, eXtream gradient-boosting (XGBoost) and artificial neural network (ANN) models using these embedding approaches. The model was tested on the heart failure cohort (N=21,031) identified using least restrictive phenotyping methods from MIMIC-IV dataset.</p><p><strong>Results: </strong>We found that the embedding approaches significantly improved the performance of the prediction models. The XGBoost performed better for all approaches. The word2vec embeddings (0.65) trained on the dataset outperformed embeddings from pre-trained BERT model (0.59) using descriptive text.</p><p><strong>Conclusions: </strong>Embedding methods, particularly word2vec trained on electronic health record data, can better discriminate HF readmission cases compared to both one-hot encoding and pre-trained BERT embeddings on concept descriptions making it a viable approach of automation feature selection. The observed AUROC improvement (0.65 vs 0.54) may support more effective risk stratification and targeted clinical interventions.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e73020"},"PeriodicalIF":3.8,"publicationDate":"2025-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12646029/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145607477","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hypertension Medication Recommendation via Synergistic and Selective Modeling of Heterogeneous Medical Entities: Development and Evaluation Study of a New Model. 基于异质医疗实体协同选择模型的高血压药物推荐:新模型的开发与评价研究。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-11-25 DOI: 10.2196/74170
Ke Zhang, Zhichang Zhang, Yali Liang, Wei Wang, Xia Wang

Background: Electronic health records (EHRs) contain comprehensive information regarding diagnoses, clinical procedures, and prescribed medications. This makes them a valuable resource for developing automated hypertension medication recommendation systems. Within this field, existing research has used machine learning approaches, leveraging demographic characteristics and basic clinical indicators, or deep learning techniques, which extract patterns from EHR data, to predict optimal medications or improve the accuracy of recommendations for common antihypertensive medication categories. However, these methodologies have significant limitations. They rarely adequately characterize the synergistic relationships among heterogeneous medical entities, such as the interplay between comorbid conditions, laboratory results, and specific antihypertensive agents. Furthermore, given the chronic and fluctuating nature of hypertension, effective medication recommendations require dynamic adaptation to disease progression over time. However, current approaches either lack rigorous temporal modeling of EHR data or fail to effectively integrate temporal dynamics with interentity relationships, resulting in the generation of recommendations that are not clinically appropriate due to the neglect of these critical factors.

Objective: This study aims to overcome the challenges in existing methods and introduce a novel model for hypertension medication recommendation that leverages the synergy and selectivity of heterogeneous medical entities.

Methods: First, we used patient EHR data to construct both heterogeneous and homogeneous graphs. The interentity synergies were captured using a multihead graph attention mechanism to enhance entity-level representations. Next, a bidirectional temporal selection mechanism calculated selective coefficients between current and historical visit records and aggregated them to form refined visit-level representations. Finally, medication recommendation probabilities were determined based on these comprehensive patient representations.

Results: Experimental evaluations on the real-world datasets Medical Information Mart for Intensive Care (MIMIC)-III v1.4 and MIMIC-IV v2.2 demonstrated that the proposed model achieved Jaccard similarity coefficients of 58.01% and 55.82%, respectively; areas under the curve of precision-recall of 83.56% and 80.69%, respectively; and F1-scores of 68.95% and 64.83%, respectively, outperforming the baseline models.

Conclusions: The findings indicate the superior efficacy of the introduced model in medication recommendation, highlighting its potential to enhance clinical decision-making in the management of hypertension. The code for the model has been released on GitHub.

背景:电子健康记录(EHRs)包含有关诊断、临床程序和处方药物的全面信息。这使得它们成为开发自动化高血压药物推荐系统的宝贵资源。在这一领域,现有的研究已经使用机器学习方法,利用人口统计学特征和基本临床指标,或深度学习技术,从电子病历数据中提取模式,预测最佳药物或提高常见抗高血压药物类别推荐的准确性。然而,这些方法有很大的局限性。它们很少充分描述异质医学实体之间的协同关系,例如合并症、实验室结果和特定抗高血压药物之间的相互作用。此外,鉴于高血压的慢性和波动性,有效的药物建议需要随着时间的推移动态适应疾病的进展。然而,目前的方法要么缺乏对电子病历数据的严格时间建模,要么未能有效地将时间动态与实体间关系结合起来,导致由于忽视这些关键因素而产生的建议在临床上不合适。目的:本研究旨在克服现有方法的挑战,引入一种利用异质医学实体协同作用和选择性的高血压药物推荐新模型。方法:首先,我们使用患者电子病历数据构建异质图和同质图。使用多头图注意机制捕获实体间的协同作用,以增强实体级表示。其次,双向时间选择机制计算当前和历史访问记录之间的选择系数,并将其聚合以形成精细化的访问级别表示。最后,根据这些综合的患者陈述确定药物推荐概率。结果:在icu Medical Information Mart (MIMIC)-III v1.4和MIMIC- iv v2.2真实数据集上的实验评估表明,所提模型的Jaccard相似系数分别达到58.01%和55.82%;精密度-召回率曲线下面积分别为83.56%和80.69%;和f1得分分别为68.95%和64.83%,优于基线模型。结论:所引入的模型在药物推荐方面具有较好的效果,可为高血压治疗的临床决策提供参考。该模型的代码已经在GitHub上发布。
{"title":"Hypertension Medication Recommendation via Synergistic and Selective Modeling of Heterogeneous Medical Entities: Development and Evaluation Study of a New Model.","authors":"Ke Zhang, Zhichang Zhang, Yali Liang, Wei Wang, Xia Wang","doi":"10.2196/74170","DOIUrl":"10.2196/74170","url":null,"abstract":"<p><strong>Background: </strong>Electronic health records (EHRs) contain comprehensive information regarding diagnoses, clinical procedures, and prescribed medications. This makes them a valuable resource for developing automated hypertension medication recommendation systems. Within this field, existing research has used machine learning approaches, leveraging demographic characteristics and basic clinical indicators, or deep learning techniques, which extract patterns from EHR data, to predict optimal medications or improve the accuracy of recommendations for common antihypertensive medication categories. However, these methodologies have significant limitations. They rarely adequately characterize the synergistic relationships among heterogeneous medical entities, such as the interplay between comorbid conditions, laboratory results, and specific antihypertensive agents. Furthermore, given the chronic and fluctuating nature of hypertension, effective medication recommendations require dynamic adaptation to disease progression over time. However, current approaches either lack rigorous temporal modeling of EHR data or fail to effectively integrate temporal dynamics with interentity relationships, resulting in the generation of recommendations that are not clinically appropriate due to the neglect of these critical factors.</p><p><strong>Objective: </strong>This study aims to overcome the challenges in existing methods and introduce a novel model for hypertension medication recommendation that leverages the synergy and selectivity of heterogeneous medical entities.</p><p><strong>Methods: </strong>First, we used patient EHR data to construct both heterogeneous and homogeneous graphs. The interentity synergies were captured using a multihead graph attention mechanism to enhance entity-level representations. Next, a bidirectional temporal selection mechanism calculated selective coefficients between current and historical visit records and aggregated them to form refined visit-level representations. Finally, medication recommendation probabilities were determined based on these comprehensive patient representations.</p><p><strong>Results: </strong>Experimental evaluations on the real-world datasets Medical Information Mart for Intensive Care (MIMIC)-III v1.4 and MIMIC-IV v2.2 demonstrated that the proposed model achieved Jaccard similarity coefficients of 58.01% and 55.82%, respectively; areas under the curve of precision-recall of 83.56% and 80.69%, respectively; and F1-scores of 68.95% and 64.83%, respectively, outperforming the baseline models.</p><p><strong>Conclusions: </strong>The findings indicate the superior efficacy of the introduced model in medication recommendation, highlighting its potential to enhance clinical decision-making in the management of hypertension. The code for the model has been released on GitHub.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e74170"},"PeriodicalIF":3.8,"publicationDate":"2025-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12646553/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145607501","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Bilingual On-Premises AI Agent for Clinical Drafting: Implementation Report of Seamless Electronic Health Records Integration in the Y-KNOT Project. 用于临床起草的双语本地AI代理:Y-KNOT项目中电子病历无缝集成的实施报告。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-11-24 DOI: 10.2196/76848
Hanjae Kim, So-Yeon Lee, Seng Chan You, Sookyung Huh, Jai-Eun Kim, Sung-Tae Kim, Dong-Ryul Ko, Ji Hoon Kim, Jae Hoon Lee, Joon Seok Lim, Moo Suk Park, Kang Young Lee

Background: Large language models (LLMs) have shown promise in reducing clinical documentation burden, yet their real-world implementation remains rare. Especially in South Korea, hospitals face several unique challenges, such as strict data sovereignty requirements and operating in environments where English is not the primary language for documentation. Therefore, we initiated the Your-Knowledgeable Navigator of Treatment (Y-KNOT) project, aimed at developing an on-premises bilingual LLM-based artificial intelligence (AI) agent system integrated with electronic health records (EHRs) for automated clinical drafting.

Objective: We present the Y-KNOT project and provide insights into implementing AI-assisted clinical drafting tools within constraints of health care system.

Methods: This project involved multiple stakeholders and encompassed three simultaneous processes: LLM development, clinical co-development, and EHR integration. We developed a foundation LLM by pretraining Llama3-8B with Korean and English medical corpora. During the clinical co-development phase, the LLM was instruction-tuned for specific documentation tasks through iterative cycles that aligned physicians' clinical requirements, hospital data availability, documentation standards, and technical feasibility. The EHR integration phase focused on seamless AI agent incorporation into clinical workflows, involving document standardization, trigger points definition, and user interaction optimization.

Unlabelled: The resulting system processes emergency department discharge summaries and preanesthetic assessments while maintaining existing clinical workflows. The drafting process is automatically triggered by specific events, such as scheduled batch jobs, with medical records automatically fed into the LLM as input. The agent is built on premises, locating all the architecture inside the hospital.

Conclusions: The Y-KNOT project demonstrates the first seamless integration of an AI agent into an EHR system for clinical drafting. In collaboration with various clinical and administrative teams, we could promptly implement an LLM while addressing key challenges of data security, bilingual requirements, and workflow integration. Our experience highlights a practical and scalable approach to utilizing LLM-based AI agents for other health care institutions, paving the way for broader adoption of LLM-based solutions.

背景:大型语言模型(llm)在减少临床文档负担方面表现出了希望,但它们在现实世界中的实现仍然很少。特别是在韩国,医院面临着一些独特的挑战,例如严格的数据主权要求,以及在英语不是主要文档语言的环境中运营。因此,我们启动了your - knowledge Navigator of Treatment (Y-KNOT)项目,旨在开发一种基于本地双语法学硕士的人工智能(AI)代理系统,该系统与电子健康记录(EHRs)集成,用于自动临床起草。目的:我们提出了Y-KNOT项目,并提供了在卫生保健系统的限制下实施人工智能辅助临床起草工具的见解。方法:该项目涉及多个利益相关者,包括三个同步过程:LLM开发、临床联合开发和EHR整合。我们通过韩语和英语医学语料库对Llama3-8B进行预训练,开发了基础LLM。在临床共同开发阶段,LLM通过迭代周期调整特定的文档任务,使医生的临床需求、医院数据可用性、文档标准和技术可行性保持一致。EHR集成阶段的重点是将AI代理无缝整合到临床工作流程中,包括文档标准化、触发点定义和用户交互优化。无标签:由此产生的系统处理急诊科出院摘要和麻醉前评估,同时保持现有的临床工作流程。起草过程由特定事件(如计划的批处理作业)自动触发,并自动将医疗记录作为输入输入到LLM中。代理是建立在现场,定位所有的建筑在医院内。结论:Y-KNOT项目首次展示了人工智能代理与EHR系统的无缝集成,用于临床起草。通过与各个临床和管理团队的合作,我们可以在解决数据安全、双语要求和工作流程集成等关键挑战的同时,迅速实施法学硕士。我们的经验强调了一种实用且可扩展的方法,可以为其他医疗机构利用基于法学硕士的人工智能代理,为更广泛地采用基于法学硕士的解决方案铺平道路。
{"title":"A Bilingual On-Premises AI Agent for Clinical Drafting: Implementation Report of Seamless Electronic Health Records Integration in the Y-KNOT Project.","authors":"Hanjae Kim, So-Yeon Lee, Seng Chan You, Sookyung Huh, Jai-Eun Kim, Sung-Tae Kim, Dong-Ryul Ko, Ji Hoon Kim, Jae Hoon Lee, Joon Seok Lim, Moo Suk Park, Kang Young Lee","doi":"10.2196/76848","DOIUrl":"10.2196/76848","url":null,"abstract":"<p><strong>Background: </strong>Large language models (LLMs) have shown promise in reducing clinical documentation burden, yet their real-world implementation remains rare. Especially in South Korea, hospitals face several unique challenges, such as strict data sovereignty requirements and operating in environments where English is not the primary language for documentation. Therefore, we initiated the Your-Knowledgeable Navigator of Treatment (Y-KNOT) project, aimed at developing an on-premises bilingual LLM-based artificial intelligence (AI) agent system integrated with electronic health records (EHRs) for automated clinical drafting.</p><p><strong>Objective: </strong>We present the Y-KNOT project and provide insights into implementing AI-assisted clinical drafting tools within constraints of health care system.</p><p><strong>Methods: </strong>This project involved multiple stakeholders and encompassed three simultaneous processes: LLM development, clinical co-development, and EHR integration. We developed a foundation LLM by pretraining Llama3-8B with Korean and English medical corpora. During the clinical co-development phase, the LLM was instruction-tuned for specific documentation tasks through iterative cycles that aligned physicians' clinical requirements, hospital data availability, documentation standards, and technical feasibility. The EHR integration phase focused on seamless AI agent incorporation into clinical workflows, involving document standardization, trigger points definition, and user interaction optimization.</p><p><strong>Unlabelled: </strong>The resulting system processes emergency department discharge summaries and preanesthetic assessments while maintaining existing clinical workflows. The drafting process is automatically triggered by specific events, such as scheduled batch jobs, with medical records automatically fed into the LLM as input. The agent is built on premises, locating all the architecture inside the hospital.</p><p><strong>Conclusions: </strong>The Y-KNOT project demonstrates the first seamless integration of an AI agent into an EHR system for clinical drafting. In collaboration with various clinical and administrative teams, we could promptly implement an LLM while addressing key challenges of data security, bilingual requirements, and workflow integration. Our experience highlights a practical and scalable approach to utilizing LLM-based AI agents for other health care institutions, paving the way for broader adoption of LLM-based solutions.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e76848"},"PeriodicalIF":3.8,"publicationDate":"2025-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12643392/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145598072","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Large Language Models in Critical Care Medicine: Scoping Review. 重症医学中的大型语言模型:范围综述。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-11-24 DOI: 10.2196/76326
Tongyue Shi, Jun Ma, Zihan Yu, Haowei Xu, Rongxin Yang, Minqi Xiong, Meirong Xiao, Yilin Li, Huiying Zhao, Guilan Kong

Background: With the rapid development of artificial intelligence, large language models (LLMs) have shown strong capabilities in natural language understanding, reasoning, and generation, attracting much research interest in applying LLMs to health and medicine. Critical care medicine (CCM) provides diagnosis and treatment for patients with critical illness who often require intensive monitoring and interventions in intensive care units (ICUs). Whether LLMs can be applied to CCM, and whether they can operate as ICU experts in assisting clinical decision-making rather than "stochastic parrots," remains uncertain.

Objective: This scoping review aims to provide a panoramic portrait of the application of LLMs in CCM, identifying the advantages, challenges, and future potential of LLMs in this field.

Methods: This study was conducted in accordance with the PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews) guidelines. Literature was searched across 7 databases, including PubMed, Embase, Scopus, Web of Science, CINAHL, IEEE Xplore, and ACM Digital Library, from the first available paper to August 22, 2025.

Results: From an initial 2342 retrieved papers, 41 were selected for final review. LLMs played an important role in CCM through the following 3 main channels: clinical decision support, medical documentation and reporting, and medical education and doctor-patient communication. Compared to traditional artificial intelligence models, LLMs have advantages in handling unstructured data and do not require manual feature engineering. Meanwhile, applying LLMs to CCM has faced challenges, including hallucinations and poor interpretability, sensitivity to prompts, bias and alignment challenges, and privacy and ethical issues.

Conclusions: Although LLMs are not yet ICU experts, they have the potential to become valuable tools in CCM, helping to improve patient outcomes and optimize health care delivery. Future research should enhance model reliability and interpretability, improve model training and deployment scalability, integrate up-to-date medical knowledge, and strengthen privacy and ethical guidelines, paving the way for LLMs to fully realize their impact in critical care.

Trial registration: OSF Registries yn328; https://osf.io/yn328/.

背景:随着人工智能的快速发展,大型语言模型(large language models, llm)在自然语言理解、推理和生成方面表现出了强大的能力,引起了人们对将llm应用于健康和医学领域的研究兴趣。重症监护医学(CCM)为重症患者提供诊断和治疗,这些患者通常需要在重症监护病房(icu)进行强化监测和干预。法学硕士是否可以应用于CCM,以及他们是否可以作为ICU专家协助临床决策,而不是“随机鹦鹉”,仍然不确定。目的:本综述旨在提供法学硕士在CCM中的应用全景,确定法学硕士在该领域的优势、挑战和未来潜力。方法:本研究按照PRISMA-ScR(系统评价和荟萃分析扩展范围评价的首选报告项目)指南进行。从第一篇论文到2025年8月22日,在PubMed、Embase、Scopus、Web of Science、CINAHL、IEEE explore和ACM数字图书馆等7个数据库中检索文献。结果:从最初的2342篇检索论文中,有41篇入选最终审稿。法学硕士在CCM中主要通过以下3个渠道发挥重要作用:临床决策支持、医学文献报告、医学教育和医患沟通。与传统的人工智能模型相比,llm在处理非结构化数据方面具有优势,并且不需要人工特征工程。与此同时,法学硕士在CCM中的应用也面临着挑战,包括幻觉和较差的可解释性、对提示的敏感性、偏见和一致性挑战,以及隐私和道德问题。结论:虽然法学硕士还不是ICU专家,但他们有潜力成为CCM中有价值的工具,帮助改善患者的预后并优化医疗服务。未来的研究应提高模型的可靠性和可解释性,提高模型训练和部署的可扩展性,整合最新的医学知识,加强隐私和道德准则,为法学硕士充分发挥其在重症监护中的作用铺平道路。试验注册:OSF registryyn328;https://osf.io/yn328/。
{"title":"Large Language Models in Critical Care Medicine: Scoping Review.","authors":"Tongyue Shi, Jun Ma, Zihan Yu, Haowei Xu, Rongxin Yang, Minqi Xiong, Meirong Xiao, Yilin Li, Huiying Zhao, Guilan Kong","doi":"10.2196/76326","DOIUrl":"10.2196/76326","url":null,"abstract":"<p><strong>Background: </strong>With the rapid development of artificial intelligence, large language models (LLMs) have shown strong capabilities in natural language understanding, reasoning, and generation, attracting much research interest in applying LLMs to health and medicine. Critical care medicine (CCM) provides diagnosis and treatment for patients with critical illness who often require intensive monitoring and interventions in intensive care units (ICUs). Whether LLMs can be applied to CCM, and whether they can operate as ICU experts in assisting clinical decision-making rather than \"stochastic parrots,\" remains uncertain.</p><p><strong>Objective: </strong>This scoping review aims to provide a panoramic portrait of the application of LLMs in CCM, identifying the advantages, challenges, and future potential of LLMs in this field.</p><p><strong>Methods: </strong>This study was conducted in accordance with the PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews) guidelines. Literature was searched across 7 databases, including PubMed, Embase, Scopus, Web of Science, CINAHL, IEEE Xplore, and ACM Digital Library, from the first available paper to August 22, 2025.</p><p><strong>Results: </strong>From an initial 2342 retrieved papers, 41 were selected for final review. LLMs played an important role in CCM through the following 3 main channels: clinical decision support, medical documentation and reporting, and medical education and doctor-patient communication. Compared to traditional artificial intelligence models, LLMs have advantages in handling unstructured data and do not require manual feature engineering. Meanwhile, applying LLMs to CCM has faced challenges, including hallucinations and poor interpretability, sensitivity to prompts, bias and alignment challenges, and privacy and ethical issues.</p><p><strong>Conclusions: </strong>Although LLMs are not yet ICU experts, they have the potential to become valuable tools in CCM, helping to improve patient outcomes and optimize health care delivery. Future research should enhance model reliability and interpretability, improve model training and deployment scalability, integrate up-to-date medical knowledge, and strengthen privacy and ethical guidelines, paving the way for LLMs to fully realize their impact in critical care.</p><p><strong>Trial registration: </strong>OSF Registries yn328; https://osf.io/yn328/.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e76326"},"PeriodicalIF":3.8,"publicationDate":"2025-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12778902/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145598044","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The State of Practice About Security in Telemedicine Systems in Chile: Exploratory Study. 智利远程医疗系统安全实践现状:探索性研究。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-11-21 DOI: 10.2196/77395
Gaston Marquez, Michelle Pacheco, Priscilla Vergara, Felix Liberona, May Chomalí, Eric Rojas

Background: Information security within telemedicine systems is essential to advancing the digital transformation of health care. Telemedicine encompasses diverse modalities, including teleconsultation, telehealth, and remote patient monitoring, all of which depend on digital platforms, secured communication networks, and internet-connected devices. Although these systems have progressed in aligning with information security standards and regulations, there remains a shortage of comprehensive, practice-oriented studies evaluating which aspects of security are effectively addressed and which remain insufficiently managed, particularly within the Chilean context.

Objective: This study aims to examine how effectively telemedicine systems in Chile address the core security attributes of confidentiality, availability, and integrity.

Methods: Data were analyzed from an evaluation tool designed to assess the quality of telemedicine systems in Chile. Over a 6-year period, 25 telemedicine systems from different providers were assessed, and an in-depth examination of how companies manage key information security subcharacteristics within their systems was undertaken.

Results: The findings indicate that 52% (n=13) of telemedicine systems optimally implement cryptographic techniques to protect confidentiality. In contrast, 44% (n=11) lack robust strategies for adapting to, recovering from, and mitigating security-related incidents. Fault tolerance mechanisms are frequently integrated to minimize service disruption caused by system failures. However, the prioritization of data integrity varies: while some companies treat it as a critical requirement, others assign it limited importance.

Conclusions: This study offers an understanding of the security priorities and practices adopted by telemedicine providers. It highlights a prevailing tendency to prioritize security measures over usability, underscoring the need for a balanced approach that safeguards patient information while supporting efficient clinical workflows.

背景:远程医疗系统中的信息安全对于推进医疗保健的数字化转型至关重要。远程医疗包括多种模式,包括远程会诊、远程保健和远程患者监护,所有这些都依赖于数字平台、安全的通信网络和连接互联网的设备。尽管这些系统在与信息安全标准和法规保持一致方面取得了进展,但仍然缺乏全面的、以实践为导向的研究,评估安全的哪些方面得到了有效解决,哪些方面仍然管理不足,特别是在智利的情况下。目的:本研究旨在研究智利远程医疗系统如何有效地解决机密性、可用性和完整性的核心安全属性。方法:利用智利远程医疗系统质量评估工具对数据进行分析。在6年的时间里,对来自不同供应商的25个远程医疗系统进行了评估,并对公司如何管理其系统内的关键信息安全子特征进行了深入研究。结果:研究结果表明,52% (n=13)的远程医疗系统采用了最佳的加密技术来保护机密性。相比之下,44% (n=11)的企业缺乏适应、恢复和减轻安全相关事件的稳健策略。经常集成容错机制,以尽量减少系统故障造成的服务中断。然而,数据完整性的优先级各不相同:一些公司将其视为关键需求,而另一些公司则将其赋予有限的重要性。结论:本研究提供了对远程医疗提供商采用的安全优先级和实践的理解。它强调了优先考虑安全措施而不是可用性的普遍趋势,强调需要一种平衡的方法来保护患者信息,同时支持有效的临床工作流程。
{"title":"The State of Practice About Security in Telemedicine Systems in Chile: Exploratory Study.","authors":"Gaston Marquez, Michelle Pacheco, Priscilla Vergara, Felix Liberona, May Chomalí, Eric Rojas","doi":"10.2196/77395","DOIUrl":"10.2196/77395","url":null,"abstract":"<p><strong>Background: </strong>Information security within telemedicine systems is essential to advancing the digital transformation of health care. Telemedicine encompasses diverse modalities, including teleconsultation, telehealth, and remote patient monitoring, all of which depend on digital platforms, secured communication networks, and internet-connected devices. Although these systems have progressed in aligning with information security standards and regulations, there remains a shortage of comprehensive, practice-oriented studies evaluating which aspects of security are effectively addressed and which remain insufficiently managed, particularly within the Chilean context.</p><p><strong>Objective: </strong>This study aims to examine how effectively telemedicine systems in Chile address the core security attributes of confidentiality, availability, and integrity.</p><p><strong>Methods: </strong>Data were analyzed from an evaluation tool designed to assess the quality of telemedicine systems in Chile. Over a 6-year period, 25 telemedicine systems from different providers were assessed, and an in-depth examination of how companies manage key information security subcharacteristics within their systems was undertaken.</p><p><strong>Results: </strong>The findings indicate that 52% (n=13) of telemedicine systems optimally implement cryptographic techniques to protect confidentiality. In contrast, 44% (n=11) lack robust strategies for adapting to, recovering from, and mitigating security-related incidents. Fault tolerance mechanisms are frequently integrated to minimize service disruption caused by system failures. However, the prioritization of data integrity varies: while some companies treat it as a critical requirement, others assign it limited importance.</p><p><strong>Conclusions: </strong>This study offers an understanding of the security priorities and practices adopted by telemedicine providers. It highlights a prevailing tendency to prioritize security measures over usability, underscoring the need for a balanced approach that safeguards patient information while supporting efficient clinical workflows.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e77395"},"PeriodicalIF":3.8,"publicationDate":"2025-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12638033/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145574951","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Creation and Implementation of an Electronic Sexual Assault Record at the Geneva University Hospital. 在日内瓦大学医院创建和实施电子性侵犯记录。
IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2025-11-20 DOI: 10.2196/66764
Sara Cottler-Casanova, Laurène Rimondi, Monique Lamuela Naulin, Tony Fracasso, Jasmine Abdulcadir

Background: In Switzerland, sexual assault reports have historically been documented on paper, which limited standardization, completeness, and challenges to produce reliable statistics.

Objective: This study describes the development and implementation of an Electronic Sexual Assault Record (eSAR) within Geneva University Hospitals' Electronic Medical Record (EMR) system, with the aim of improving data quality, documentation, and multidisciplinary coordination.

Methods: The eSAR was developed by a multidisciplinary team including forensic doctors, gynecologists, nurses (clinical and informatics), epidemiologists, and IT specialists. Its structure was based on existing hospital protocols and international recommendations. Variables were defined as "essential" or "highly recommended," with structured fields to ensure completeness and comparability. Confidentiality was safeguarded through restricted access and regular audits.

Results: The eSAR was launched in June 2022 and revised in 2023 after user feedback and training. Since implementation, 382 reports have been completed. Data quality improved substantially, with major reductions in missing information. The system also streamlined workflows and strengthened collaboration across specialties.

Conclusions: The eSAR improved documentation and data reliability, providing a replicable model for standardized sexual assault reporting in Switzerland.

背景:在瑞士,性侵犯报告历来以书面形式记录,这限制了标准化、完整性和产生可靠统计数据的挑战。目的:本研究描述了日内瓦大学医院电子病历(EMR)系统中电子性侵犯记录(eSAR)的开发和实施,旨在提高数据质量、文件记录和多学科协调。方法:eSAR是由包括法医、妇科医生、护士(临床和信息学)、流行病学家和IT专家在内的多学科团队开发的。其结构以现有的医院规程和国际建议为基础。变量被定义为“必要的”或“强烈推荐的”,具有结构化字段以确保完整性和可比性。通过限制接触和定期审计来保障机密性。结果:eSAR于2022年6月上线,经过用户反馈和培训,于2023年进行修订。自执行以来,已完成382份报告。数据质量大大提高,缺失信息大大减少。该系统还简化了工作流程,加强了各专业之间的协作。结论:eSAR提高了文件和数据的可靠性,为瑞士的标准化性侵犯报告提供了可复制的模式。
{"title":"Creation and Implementation of an Electronic Sexual Assault Record at the Geneva University Hospital.","authors":"Sara Cottler-Casanova, Laurène Rimondi, Monique Lamuela Naulin, Tony Fracasso, Jasmine Abdulcadir","doi":"10.2196/66764","DOIUrl":"10.2196/66764","url":null,"abstract":"<p><strong>Background: </strong>In Switzerland, sexual assault reports have historically been documented on paper, which limited standardization, completeness, and challenges to produce reliable statistics.</p><p><strong>Objective: </strong>This study describes the development and implementation of an Electronic Sexual Assault Record (eSAR) within Geneva University Hospitals' Electronic Medical Record (EMR) system, with the aim of improving data quality, documentation, and multidisciplinary coordination.</p><p><strong>Methods: </strong>The eSAR was developed by a multidisciplinary team including forensic doctors, gynecologists, nurses (clinical and informatics), epidemiologists, and IT specialists. Its structure was based on existing hospital protocols and international recommendations. Variables were defined as \"essential\" or \"highly recommended,\" with structured fields to ensure completeness and comparability. Confidentiality was safeguarded through restricted access and regular audits.</p><p><strong>Results: </strong>The eSAR was launched in June 2022 and revised in 2023 after user feedback and training. Since implementation, 382 reports have been completed. Data quality improved substantially, with major reductions in missing information. The system also streamlined workflows and strengthened collaboration across specialties.</p><p><strong>Conclusions: </strong>The eSAR improved documentation and data reliability, providing a replicable model for standardized sexual assault reporting in Switzerland.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e66764"},"PeriodicalIF":3.8,"publicationDate":"2025-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12634005/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145566438","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
JMIR Medical Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1