首页 > 最新文献

JAMIA Open最新文献

英文 中文
Augmenting large language models to predict social determinants of mental health in opioid use disorder using patient clinical notes. 利用患者临床记录增强大型语言模型来预测阿片类药物使用障碍中心理健康的社会决定因素。
IF 3.4 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-11-27 eCollection Date: 2025-12-01 DOI: 10.1093/jamiaopen/ooaf142
Madhavi Pagare, Deva Sai Kumar Bheesetti, Inyene Essien-Aleksi, Mohammad Arif Ul Alam

Objective: Identifying social determinants of mental health (SDOMH) in patients with opioid use disorder (OUD) is crucial for estimating risk and enabling early intervention. Extracting such data from unstructured clinical notes is challenging due to annotation complexity and requires advanced natural language processing (NLP) techniques. We propose the Human-in-the-Loop Large Language Model Interaction for Annotation (HLLIA) framework, combined with a Multilevel Hierarchical Clinical-Longformer Embedding (MHCLE) algorithm, to annotate and predict SDOMH variables.

Materials and methods: We utilized 2636 annotated discharge summaries from the Medical Information Mart for Intensive Care (MIMIC-IV) dataset. High-quality annotations were ensured via a human-in-the-loop approach, refined using large language models (LLMs). The MHCLE algorithm performed multi-label classification of 13 SDOMH variables and was evaluated against baseline models, including RoBERTa, Bio_ClinicalBERT, ClinicalBERT, and ClinicalBigBird.

Results: The MHCLE model achieved superior performance with 96.29% accuracy and a 95.41% F1score, surpassing baseline models. Training-testing policies P1, P2, and P3 yielded accuracies of 98.49%, 90.10%, and 89.04%, respectively, highlighting the importance of human intervention in refining LLM annotations.

Discussion and conclusion: Integrating the MHCLE model with the HLLIA framework offers an effective approach for predicting SDOMH factors from clinical notes, advancing NLP in OUD care. It highlights the importance of human oversight and sets a benchmark for future research.

目的:确定阿片类药物使用障碍(OUD)患者心理健康(SDOMH)的社会决定因素对于估计风险和实现早期干预至关重要。由于注释的复杂性,从非结构化的临床记录中提取此类数据具有挑战性,并且需要先进的自然语言处理(NLP)技术。我们提出了人在循环大语言模型交互注释(hlia)框架,结合多层分层临床-长前嵌入(MHCLE)算法,来注释和预测SDOMH变量。材料和方法:我们利用重症医疗信息集市(MIMIC-IV)数据集中的2636份带注释的出院摘要。通过使用大型语言模型(llm)进行改进的human-in-the-loop方法确保了高质量的注释。MHCLE算法对13个SDOMH变量进行多标签分类,并根据基线模型进行评估,包括RoBERTa、Bio_ClinicalBERT、ClinicalBERT和ClinicalBigBird。结果:MHCLE模型的准确率为96.29%,f1评分为95.41%,优于基线模型。训练测试策略P1、P2和P3的准确率分别为98.49%、90.10%和89.04%,突出了人类干预在精炼LLM注释中的重要性。讨论与结论:将MHCLE模型与hlia框架相结合,为从临床记录中预测SDOMH因素提供了有效的方法,促进了OUD护理中的NLP。它强调了人类监督的重要性,并为未来的研究设定了基准。
{"title":"Augmenting large language models to predict social determinants of mental health in opioid use disorder using patient clinical notes.","authors":"Madhavi Pagare, Deva Sai Kumar Bheesetti, Inyene Essien-Aleksi, Mohammad Arif Ul Alam","doi":"10.1093/jamiaopen/ooaf142","DOIUrl":"10.1093/jamiaopen/ooaf142","url":null,"abstract":"<p><strong>Objective: </strong>Identifying social determinants of mental health (SDOMH) in patients with opioid use disorder (OUD) is crucial for estimating risk and enabling early intervention. Extracting such data from unstructured clinical notes is challenging due to annotation complexity and requires advanced natural language processing (NLP) techniques. We propose the Human-in-the-Loop Large Language Model Interaction for Annotation (HLLIA) framework, combined with a Multilevel Hierarchical Clinical-Longformer Embedding (MHCLE) algorithm, to annotate and predict SDOMH variables.</p><p><strong>Materials and methods: </strong>We utilized 2636 annotated discharge summaries from the Medical Information Mart for Intensive Care (MIMIC-IV) dataset. High-quality annotations were ensured via a human-in-the-loop approach, refined using large language models (LLMs). The MHCLE algorithm performed multi-label classification of 13 SDOMH variables and was evaluated against baseline models, including RoBERTa, Bio_ClinicalBERT, ClinicalBERT, and ClinicalBigBird.</p><p><strong>Results: </strong>The MHCLE model achieved superior performance with 96.29% accuracy and a 95.41% F1score, surpassing baseline models. Training-testing policies P1, P2, and P3 yielded accuracies of 98.49%, 90.10%, and 89.04%, respectively, highlighting the importance of human intervention in refining LLM annotations.</p><p><strong>Discussion and conclusion: </strong>Integrating the MHCLE model with the HLLIA framework offers an effective approach for predicting SDOMH factors from clinical notes, advancing NLP in OUD care. It highlights the importance of human oversight and sets a benchmark for future research.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 6","pages":"ooaf142"},"PeriodicalIF":3.4,"publicationDate":"2025-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12664681/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145649331","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Span-based annotation framework for LLM-based clinical named entity recognition: development and validation using Korean emergency department notes. 基于法学硕士的临床命名实体识别的基于span的注释框架:使用韩国急诊科笔记的开发和验证。
IF 3.4 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-11-26 eCollection Date: 2025-12-01 DOI: 10.1093/jamiaopen/ooaf157
Eun Hye Jang, Javier Aguirre, Sangji Lee, Hyeyoon Moon, Won Chul Cha

Objective: This study aims to develop and validate of a span-based annotation framework for clinical named entity recognition (NER) using large language models (LLMs) based on Korean emergency department clinical notes.

Materials and methods: Two datasets with the same entity types but different annotation spans (word- vs phrase-level) were constructed, with the phrase-level dataset further was expanded into a doubled version. A Korean language-specific LLM was fine-tuned on each dataset, producing three variants that were compared with two baseline models, few-shot LLM and fine-tuned small language model (SLM). The final variant fine-tuned on the doubled phrase-level dataset was further evaluated against a human annotator.

Results: In all experimental settings, three variants outperformed the baselines by achieving the highest F1 scores across all metrics. The final variant achieved F1 scores exceeding 0.80 across all averaging strategies and evaluation metrics, including token-based, span-based exact, and span-based partial evaluations demonstrating its robustness applicable in a practical setting.

Discussion: While prompt engineering with few-shot is widely adopted for LLM-based clinical NER, our results proved that supervised fine-tuning (SFT) is consistently superior. The final variant outperformed the human annotator, emphasizing its potential as an automatic labeling tool.

Conclusion: This study introduced a novel span-based annotation framework for LLM-based clinical NER verified by three independent experiments. In multilingual and real-world clinical settings, LLMs have proven in handling complex entity spans that include word-level and phrase-level annotations, particularly for long and attribute-rich entities.

目的:本研究旨在利用基于韩国急诊科临床记录的大型语言模型(llm),开发并验证基于跨度的临床命名实体识别(NER)注释框架。材料和方法:构建了两个实体类型相同但标注跨度不同(词级和短语级)的数据集,并将短语级数据集进一步扩展为双版本。针对每个数据集对特定于韩语的LLM进行了微调,产生了三个变体,这些变体与两个基线模型(少量LLM和微调的小语言模型(SLM))进行了比较。在双短语级数据集上微调的最终变体针对人类注释器进行了进一步评估。结果:在所有实验设置中,三个变体通过在所有指标中获得最高的F1分数而优于基线。最终的变体在所有平均策略和评估指标(包括基于令牌的、基于跨度的精确评估和基于跨度的部分评估)中获得了超过0.80的F1分数,证明了它在实际环境中的稳健性。讨论:虽然基于法学硕士的临床NER广泛采用少量镜头的快速工程,但我们的研究结果证明,监督微调(SFT)始终是优越的。最终的变体优于人类注释器,强调了其作为自动标记工具的潜力。结论:本研究为基于llm的临床NER引入了一种新的基于span的注释框架,并通过三个独立实验验证。在多语言和现实世界的临床环境中,法学硕士已经证明可以处理复杂的实体跨度,包括单词级和短语级注释,特别是对于长且属性丰富的实体。
{"title":"Span-based annotation framework for LLM-based clinical named entity recognition: development and validation using Korean emergency department notes.","authors":"Eun Hye Jang, Javier Aguirre, Sangji Lee, Hyeyoon Moon, Won Chul Cha","doi":"10.1093/jamiaopen/ooaf157","DOIUrl":"10.1093/jamiaopen/ooaf157","url":null,"abstract":"<p><strong>Objective: </strong>This study aims to develop and validate of a span-based annotation framework for clinical named entity recognition (NER) using large language models (LLMs) based on Korean emergency department clinical notes.</p><p><strong>Materials and methods: </strong>Two datasets with the same entity types but different annotation spans (word- vs phrase-level) were constructed, with the phrase-level dataset further was expanded into a doubled version. A Korean language-specific LLM was fine-tuned on each dataset, producing three variants that were compared with two baseline models, few-shot LLM and fine-tuned small language model (SLM). The final variant fine-tuned on the doubled phrase-level dataset was further evaluated against a human annotator.</p><p><strong>Results: </strong>In all experimental settings, three variants outperformed the baselines by achieving the highest F1 scores across all metrics. The final variant achieved F1 scores exceeding 0.80 across all averaging strategies and evaluation metrics, including token-based, span-based exact, and span-based partial evaluations demonstrating its robustness applicable in a practical setting.</p><p><strong>Discussion: </strong>While prompt engineering with few-shot is widely adopted for LLM-based clinical NER, our results proved that supervised fine-tuning (SFT) is consistently superior. The final variant outperformed the human annotator, emphasizing its potential as an automatic labeling tool.</p><p><strong>Conclusion: </strong>This study introduced a novel span-based annotation framework for LLM-based clinical NER verified by three independent experiments. In multilingual and real-world clinical settings, LLMs have proven in handling complex entity spans that include word-level and phrase-level annotations, particularly for long and attribute-rich entities.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 6","pages":"ooaf157"},"PeriodicalIF":3.4,"publicationDate":"2025-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12657458/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145649357","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Artificial intelligence-generated draft replies to patient messages in pediatrics. 人工智能生成的儿科病人信息回复草稿。
IF 3.4 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-11-22 eCollection Date: 2025-12-01 DOI: 10.1093/jamiaopen/ooaf159
April S Liang, Shivam Vedak, Alex Dussaq, Dong-Han Yao, Joshua A Villarreal, Sijo Thomas, Nicholas Chen, Tanya Townsend, Natalie M Pageler, Keith Morse

Objectives: This study describes the utilization and experiences of artificial intelligence (AI)-generated draft responses to patient messages in pediatric ambulatory clinicians and contextualizes their experiences in relation to those of adult specialty clinicians.

Materials and methods: A prospective pilot was conducted from September 2023 to August 2024 in 2 pediatric clinics (General Pediatric and Adolescent Medicine) and 2 obstetric clinics (Reproductive Endocrinology and Infertility and General Obstetrics) within an academic health system in Northern California. Participants included physician, nurse, and medical assistant volunteers. The intervention involved a feature utilizing large language models embedded in the electronic health record to generate draft responses. Proportion of AI-generated draft used was collected, as were prepilot and follow-up surveys.

Results: A total of 61 clinicians (26 pediatric, 35 obstetric) enrolled, with 46 (75%) completing both surveys. Pediatric clinicians utilized 13.3% (95% CI, 12.3%-14.4%) of AI-generated drafts, and usage rates when responding to patients vs their proxies was similar (15% vs 12.9%, P = .24). Despite using AI-generated drafts significantly less than obstetric clinicians (18.3% [17.2%-19.5%], P < .0001), pediatric clinicians reported a significant reduction in perceived task load (NASA Task Load Index: 59.9-50.9, P = .04) and were more likely to recommend the tool (LTR: 7.0 vs 5.2, P = .04).

Discussion and conclusion: Pediatric clinicians used AI-generated drafts at a rate within previously reported ranges in adult specialties and experienced utility. These findings suggest this tool has potential for enhancing efficiency and reducing task load in pediatric care.

目的:本研究描述了人工智能(AI)生成的患者信息回复草案在儿科门诊临床医生中的应用和经验,并将他们的经验与成人专科临床医生的经验联系起来。材料和方法:前瞻性试点于2023年9月至2024年8月在北加州学术卫生系统内的2家儿科诊所(普通儿科和青少年医学)和2家产科诊所(生殖内分泌和不孕症和普通产科)进行。参与者包括医生、护士和医疗助理志愿者。干预措施包括利用嵌入在电子健康记录中的大型语言模型来生成回复草稿的功能。收集使用人工智能生成的草稿的比例,以及预试点和后续调查。结果:共有61名临床医生(26名儿科医生,35名产科医生)入组,其中46名(75%)完成了两项调查。儿科临床医生使用了13.3% (95% CI, 12.3%-14.4%)的人工智能生成的草稿,在回应患者和他们的代理时,使用率相似(15%对12.9%,P = .24)。尽管使用人工智能生成的草稿明显少于产科医生(18.3% [17.2%-19.5%]),P P =。04)并且更有可能推荐该工具(LTR: 7.0 vs 5.2, P = .04)。讨论和结论:儿科临床医生使用人工智能生成的草稿的比率在先前报道的成人专科和经验丰富的实用范围内。这些发现表明,该工具具有提高儿科护理效率和减少任务负荷的潜力。
{"title":"Artificial intelligence-generated draft replies to patient messages in pediatrics.","authors":"April S Liang, Shivam Vedak, Alex Dussaq, Dong-Han Yao, Joshua A Villarreal, Sijo Thomas, Nicholas Chen, Tanya Townsend, Natalie M Pageler, Keith Morse","doi":"10.1093/jamiaopen/ooaf159","DOIUrl":"10.1093/jamiaopen/ooaf159","url":null,"abstract":"<p><strong>Objectives: </strong>This study describes the utilization and experiences of artificial intelligence (AI)-generated draft responses to patient messages in pediatric ambulatory clinicians and contextualizes their experiences in relation to those of adult specialty clinicians.</p><p><strong>Materials and methods: </strong>A prospective pilot was conducted from September 2023 to August 2024 in 2 pediatric clinics (General Pediatric and Adolescent Medicine) and 2 obstetric clinics (Reproductive Endocrinology and Infertility and General Obstetrics) within an academic health system in Northern California. Participants included physician, nurse, and medical assistant volunteers. The intervention involved a feature utilizing large language models embedded in the electronic health record to generate draft responses. Proportion of AI-generated draft used was collected, as were prepilot and follow-up surveys.</p><p><strong>Results: </strong>A total of 61 clinicians (26 pediatric, 35 obstetric) enrolled, with 46 (75%) completing both surveys. Pediatric clinicians utilized 13.3% (95% CI, 12.3%-14.4%) of AI-generated drafts, and usage rates when responding to patients vs their proxies was similar (15% vs 12.9%, <i>P</i> = .24). Despite using AI-generated drafts significantly less than obstetric clinicians (18.3% [17.2%-19.5%], <i>P</i> < .0001), pediatric clinicians reported a significant reduction in perceived task load (NASA Task Load Index: 59.9-50.9, <i>P</i> = .04) and were more likely to recommend the tool (LTR: 7.0 vs 5.2, <i>P</i> = .04).</p><p><strong>Discussion and conclusion: </strong>Pediatric clinicians used AI-generated drafts at a rate within previously reported ranges in adult specialties and experienced utility. These findings suggest this tool has potential for enhancing efficiency and reducing task load in pediatric care.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 6","pages":"ooaf159"},"PeriodicalIF":3.4,"publicationDate":"2025-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12643547/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145606516","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mapping the overdose crisis: 6 locations using open medical examiner data. 绘制过量危机:使用公开法医数据的6个地点。
IF 3.4 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-10-30 eCollection Date: 2025-10-01 DOI: 10.1093/jamiaopen/ooaf140
Daniel R Harris, Nicholas Anthony, Kelly A Keyes, Chris Delcher

Objective: Medical examiners and coroners (ME/C) oversee medicolegal death investigations which determine causes of death and other contextual factors that may have influenced a death. We utilize open data releases from ME/C offices covering 6 different geographic areas to demonstrate the strengths and limitations of ME/C data for forensic epidemiology research.

Materials and methods: We use our novel geoPIPE tool to establish a pipeline that (a) automates ingesting open data releases, (b) geocodes records where possible to yield a spatial component, (c) enhances data with variables useful for overdose research, such as flagging substances contributing to each death, and (d) publishes the enriched data to our open repository. We use results from this pipeline to highlight similarities and differences of overdose data across different sources.

Results: Text processing to extract drugs contributing to each death yielded compatible data across all locations. Conversely, geospatial analyses are sometimes incompatible due to differences in available geographic resolution, which range from fine-grain latitude and longitude coordinates to larger regions identified by zip codes. Our pipeline pushes weekly results to an open repository.

Discussion: Open ME/C data are highly useful for research on substance use disorders; our visualizations demonstrate the ability to contextualize overdose data within and across specific geographic regions. Furthermore, the spatial component of our results enables clustering of overdose events and accessibility studies for resources related to preventing overdose deaths.

Conclusions: Given the utility to public health researchers, we advocate that other ME/C offices explore releasing open data and for policy makers to support and fund transparency efforts.

目的:法医和验尸官(ME/C)监督法医死亡调查,确定死亡原因和其他可能影响死亡的背景因素。我们利用来自6个不同地理区域的ME/C办公室的公开数据来展示ME/C数据在法医流行病学研究中的优势和局限性。材料和方法:我们使用我们的新型geoPIPE工具来建立一个管道,该管道(a)自动获取开放数据发布,(b)在可能的情况下对记录进行地理编码,以产生空间成分,(c)使用对过量研究有用的变量增强数据,例如标记导致每次死亡的物质,以及(d)将丰富的数据发布到我们的开放存储库。我们使用该管道的结果来突出不同来源的过量数据的相似性和差异性。结果:通过文本处理提取导致每次死亡的药物产生了所有地点的兼容数据。相反,地理空间分析有时不兼容,因为可用的地理分辨率不同,从细粒度的纬度和经度坐标到由邮政编码标识的较大区域。我们的管道将每周的结果推送到一个开放的存储库。讨论:开放ME/C数据对物质使用障碍的研究非常有用;我们的可视化展示了在特定地理区域内和跨区域进行过量数据背景化的能力。此外,我们的研究结果的空间组成部分使过量事件聚类和预防过量死亡相关资源的可及性研究成为可能。结论:考虑到对公共卫生研究人员的效用,我们主张其他ME/C办公室探索发布开放数据,并为政策制定者支持和资助透明度工作。
{"title":"Mapping the overdose crisis: 6 locations using open medical examiner data.","authors":"Daniel R Harris, Nicholas Anthony, Kelly A Keyes, Chris Delcher","doi":"10.1093/jamiaopen/ooaf140","DOIUrl":"10.1093/jamiaopen/ooaf140","url":null,"abstract":"<p><strong>Objective: </strong>Medical examiners and coroners (ME/C) oversee medicolegal death investigations which determine causes of death and other contextual factors that may have influenced a death. We utilize open data releases from ME/C offices covering 6 different geographic areas to demonstrate the strengths and limitations of ME/C data for forensic epidemiology research.</p><p><strong>Materials and methods: </strong>We use our novel geoPIPE tool to establish a pipeline that (a) automates ingesting open data releases, (b) geocodes records where possible to yield a spatial component, (c) enhances data with variables useful for overdose research, such as flagging substances contributing to each death, and (d) publishes the enriched data to our open repository. We use results from this pipeline to highlight similarities and differences of overdose data across different sources.</p><p><strong>Results: </strong>Text processing to extract drugs contributing to each death yielded compatible data across all locations. Conversely, geospatial analyses are sometimes incompatible due to differences in available geographic resolution, which range from fine-grain latitude and longitude coordinates to larger regions identified by zip codes. Our pipeline pushes weekly results to an open repository.</p><p><strong>Discussion: </strong>Open ME/C data are highly useful for research on substance use disorders; our visualizations demonstrate the ability to contextualize overdose data within and across specific geographic regions. Furthermore, the spatial component of our results enables clustering of overdose events and accessibility studies for resources related to preventing overdose deaths.</p><p><strong>Conclusions: </strong>Given the utility to public health researchers, we advocate that other ME/C offices explore releasing open data and for policy makers to support and fund transparency efforts.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 5","pages":"ooaf140"},"PeriodicalIF":3.4,"publicationDate":"2025-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12574786/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145431958","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Assessing the acceptability and usability of MedSafer, a patient-centered electronic deprescribing tool. 评估以患者为中心的电子处方工具MedSafer的可接受性和可用性。
IF 3.4 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-10-30 eCollection Date: 2025-10-01 DOI: 10.1093/jamiaopen/ooaf141
Jimin J Lee, Eva Filosa, Tiphaine Pierson, Ninh Khuong, Camille Gagnon, Jennie Herbin, Soham Rej, Claire Godard-Sebillotte, Robyn Tamblyn, Todd C Lee, Emily G McDonald

Background: Deprescribing is the clinically supervised process of stopping or reducing medications that are no longer beneficial. MedSafer is an electronic decision support tool that guides healthcare providers (HCPs) through the deprescribing process. We recently developed a novel patient-facing version of the software, allowing patients and caregivers to generate a personalized deprescribing report to bring to their prescriber.

Objective: The study aimed to evaluate the usability and acceptability of MedSafer among older adults, caregivers, and community HCPs (physicians, nurse practitioners and pharmacists).

Method: A mixed-methods feasibility study was conducted with a convenience sample of 100 older adults/caregivers, and 25 healthcare practitioners. Participants were invited to test MedSafer and answer telephone or electronic surveys via RedCap. The Extended Technology Acceptance Model (TAM2) and System Usability Scale (SUS) were used for evaluation. A semi-structured interview was also conducted with a subset of participants (5 per group) who were selected on a volunteer basis, and thematic analysis was used following Braun & Clarke's approach.

Results: Healthcare providers scored more favorably on TAM2 constructs such as perceived usefulness (PU) (median: 4.25 for HCPs; 3.75 for caregivers; 3.00 for patients), and SUS compared to patients and caregivers (mean: 79.50 for HCPs; 52.95 for caregivers; 55.75 for patients). Thematic analysis revealed that participants recognized MedSafer as an empowering tool but noted the need for some usability improvements.

Conclusion: MedSafer is a promising tool to support deprescribing conversations. Enhancing usability, accessibility, and patient education may improve adoption.

背景:开处方是在临床监督下停止或减少不再有益的药物的过程。MedSafer是一个电子决策支持工具,指导医疗保健提供者(HCPs)通过处方过程。我们最近开发了一种新的面向患者的软件版本,允许患者和护理人员生成个性化的处方报告,并提交给他们的处方医生。目的:本研究旨在评估MedSafer在老年人、护理人员和社区HCPs(医生、执业护士和药剂师)中的可用性和可接受性。方法:采用混合方法对100名老年人/护理人员和25名医疗从业人员进行可行性研究。参与者被邀请测试MedSafer,并通过RedCap回答电话或电子调查。采用扩展技术接受模型(TAM2)和系统可用性量表(SUS)进行评价。在志愿者的基础上,对一部分参与者(每组5人)进行了半结构化访谈,并采用了Braun & Clarke的方法进行了主题分析。结果:与患者和护理人员相比,医疗保健提供者在TAM2结构如感知有用性(PU) (HCPs的中位数:4.25;护理人员的中位数:3.75;患者的中位数:3.00)和SUS上得分更高(HCPs的平均值:79.50;护理人员的平均值:52.95;患者的平均值:55.75)。专题分析显示,与会者认识到MedSafer是一种赋权工具,但指出需要改进一些可用性。结论:MedSafer是一个很有前途的工具来支持处方对话。增强可用性、可访问性和患者教育可能会提高采用率。
{"title":"Assessing the acceptability and usability of MedSafer, a patient-centered electronic deprescribing tool.","authors":"Jimin J Lee, Eva Filosa, Tiphaine Pierson, Ninh Khuong, Camille Gagnon, Jennie Herbin, Soham Rej, Claire Godard-Sebillotte, Robyn Tamblyn, Todd C Lee, Emily G McDonald","doi":"10.1093/jamiaopen/ooaf141","DOIUrl":"10.1093/jamiaopen/ooaf141","url":null,"abstract":"<p><strong>Background: </strong>Deprescribing is the clinically supervised process of stopping or reducing medications that are no longer beneficial. MedSafer is an electronic decision support tool that guides healthcare providers (HCPs) through the deprescribing process. We recently developed a novel patient-facing version of the software, allowing patients and caregivers to generate a personalized deprescribing report to bring to their prescriber.</p><p><strong>Objective: </strong>The study aimed to evaluate the usability and acceptability of MedSafer among older adults, caregivers, and community HCPs (physicians, nurse practitioners and pharmacists).</p><p><strong>Method: </strong>A mixed-methods feasibility study was conducted with a convenience sample of 100 older adults/caregivers, and 25 healthcare practitioners. Participants were invited to test MedSafer and answer telephone or electronic surveys via RedCap. The Extended Technology Acceptance Model (TAM2) and System Usability Scale (SUS) were used for evaluation. A semi-structured interview was also conducted with a subset of participants (5 per group) who were selected on a volunteer basis, and thematic analysis was used following Braun & Clarke's approach.</p><p><strong>Results: </strong>Healthcare providers scored more favorably on TAM2 constructs such as perceived usefulness (PU) (median: 4.25 for HCPs; 3.75 for caregivers; 3.00 for patients), and SUS compared to patients and caregivers (mean: 79.50 for HCPs; 52.95 for caregivers; 55.75 for patients). Thematic analysis revealed that participants recognized MedSafer as an empowering tool but noted the need for some usability improvements.</p><p><strong>Conclusion: </strong>MedSafer is a promising tool to support deprescribing conversations. Enhancing usability, accessibility, and patient education may improve adoption.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 5","pages":"ooaf141"},"PeriodicalIF":3.4,"publicationDate":"2025-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12574792/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145432489","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluating the Impact of Electronic Health Record to Electronic Data Capture Technology on Workflow Efficiency: a Site Perspective. 评估电子健康记录对电子数据捕获技术对工作流程效率的影响:现场视角。
IF 3.4 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-10-30 eCollection Date: 2025-10-01 DOI: 10.1093/jamiaopen/ooaf139
Anna Patruno, Michael-Owen Panzarella, Michael Buckley, Milena Silverman, Evelyn Salazar, Renata Panchal, Joseph Lengfellner, Alexia Iasonos, Maryam Garza, Byeong Yeob Choi, Meredith Zozus, Stephanie Terzulli, Paul Sabbatini

Introduction: Clinical trial data is still predominantly manually entered by site staff into Electronic Data Capture (EDC) systems. This process of abstracting and manually transcribing patient data is time-consuming, inefficient and error prone. Use of Electronic Health Record to Electronic Data Capture (EHR-To-EDC) technologies that digitize this process would improve these inefficiencies.

Objectives: This study measured the impact of EHR-To-EDC technology on the data entry workflow of clinical trial data managers. The primary objective was to compare the speed and accuracy of the EHR-To-EDC enabled data entry method to the traditional, manual method. The secondary objective was to measure end user satisfaction.

Materials and methods: Five data managers ranging in experience from 9 months to over 2 years, were assigned an investigator-initiated, Memorial Sloan Kettering-sponsored oncology study within their disease area of expertise. Each data manager performed one-hour of manual data entry, and a week later, one-hour of data entry using IgniteData's EHR-To-EDC solution, Archer, on a predetermined set of patients, timepoints and data domains (labs, vitals). The data entered into the EDC were compared side-by-side and used to evaluate the speed and accuracy of the EHR-To-EDC enabled method versus traditional, manual data entry. A user satisfaction survey using a 5-point Likert scale was used to collect feedback regarding the selected platform's learnability, ease of use, perceived time savings, perceived efficiency, and preference over the manual method.

Results: The EHR-To-EDC method resulted in 58% more data entered versus the manual method (difference, 1745 data points; manual, 3023 data points; EHR-To-EDC, 4768 data points). The number of data entry errors was reduced by 99% (manual, 100 data points; EHR-To-EDC, 1 data point). Regarding user satisfaction, data managers either agreed or strongly agreed that the EHR-To-EDC workflow was easy to learn (5/5), easy to use (4.6/5), saved time (5/5), was more efficient (4.8/5), and preferred it over the manual entry workflow (4/5).

Conclusion: EHR-To-EDC enabled data entry increases data manager productivity, reduces errors and is preferred by data managers over manual data entry.

临床试验数据仍然主要由现场工作人员手动输入电子数据采集(EDC)系统。这种提取和手动转录患者数据的过程耗时、低效且容易出错。使用电子健康记录到电子数据捕获(EHR-To-EDC)技术将这一过程数字化,将改善这些低效率。目的:本研究测量了EHR-To-EDC技术对临床试验数据管理人员数据输入工作流程的影响。主要目的是比较EHR-To-EDC支持的数据输入方法与传统的手动方法的速度和准确性。第二个目标是衡量最终用户满意度。材料和方法:5名经验从9个月到2年以上的数据管理人员被分配到一项由研究者发起的、由纪念斯隆凯特林资助的肿瘤研究中,该研究是在他们的疾病专业领域内进行的。每个数据管理人员手动输入一小时的数据,一周后,使用IgniteData的EHR-To-EDC解决方案Archer,对一组预定的患者、时间点和数据域(实验室、生命体征)进行一小时的数据输入。将输入EDC的数据进行并排比较,并用于评估EHR-To-EDC启用方法与传统的手动数据输入方法的速度和准确性。使用5点李克特量表进行用户满意度调查,以收集有关所选平台的易学性、易用性、感知时间节省、感知效率以及相对于手动方法的偏好的反馈。结果:EHR-To-EDC方法比手工方法多输入58%的数据(差异,1745个数据点;手工,3023个数据点;EHR-To-EDC, 4768个数据点)。数据输入错误的数量减少了99%(手动,100个数据点;EHR-To-EDC, 1个数据点)。在用户满意度方面,数据管理人员同意或强烈同意EHR-To-EDC工作流程易学(5/5),易于使用(4.6/5),节省时间(5/5),效率更高(4.8/5),并且更喜欢它而不是手动输入工作流程(4/5)。结论:EHR-To-EDC支持的数据输入提高了数据管理人员的工作效率,减少了错误,是数据管理人员的首选,而不是手动数据输入。
{"title":"Evaluating the Impact of Electronic Health Record to Electronic Data Capture Technology on Workflow Efficiency: a Site Perspective.","authors":"Anna Patruno, Michael-Owen Panzarella, Michael Buckley, Milena Silverman, Evelyn Salazar, Renata Panchal, Joseph Lengfellner, Alexia Iasonos, Maryam Garza, Byeong Yeob Choi, Meredith Zozus, Stephanie Terzulli, Paul Sabbatini","doi":"10.1093/jamiaopen/ooaf139","DOIUrl":"10.1093/jamiaopen/ooaf139","url":null,"abstract":"<p><strong>Introduction: </strong>Clinical trial data is still predominantly manually entered by site staff into Electronic Data Capture (EDC) systems. This process of abstracting and manually transcribing patient data is time-consuming, inefficient and error prone. Use of Electronic Health Record to Electronic Data Capture (EHR-To-EDC) technologies that digitize this process would improve these inefficiencies.</p><p><strong>Objectives: </strong>This study measured the impact of EHR-To-EDC technology on the data entry workflow of clinical trial data managers. The primary objective was to compare the speed and accuracy of the EHR-To-EDC enabled data entry method to the traditional, manual method. The secondary objective was to measure end user satisfaction.</p><p><strong>Materials and methods: </strong>Five data managers ranging in experience from 9 months to over 2 years, were assigned an investigator-initiated, Memorial Sloan Kettering-sponsored oncology study within their disease area of expertise. Each data manager performed one-hour of manual data entry, and a week later, one-hour of data entry using IgniteData's EHR-To-EDC solution, Archer, on a predetermined set of patients, timepoints and data domains (labs, vitals). The data entered into the EDC were compared side-by-side and used to evaluate the speed and accuracy of the EHR-To-EDC enabled method versus traditional, manual data entry. A user satisfaction survey using a 5-point Likert scale was used to collect feedback regarding the selected platform's learnability, ease of use, perceived time savings, perceived efficiency, and preference over the manual method.</p><p><strong>Results: </strong>The EHR-To-EDC method resulted in 58% more data entered versus the manual method (difference, 1745 data points; manual, 3023 data points; EHR-To-EDC, 4768 data points). The number of data entry errors was reduced by 99% (manual, 100 data points; EHR-To-EDC, 1 data point). Regarding user satisfaction, data managers either agreed or strongly agreed that the EHR-To-EDC workflow was easy to learn (5/5), easy to use (4.6/5), saved time (5/5), was more efficient (4.8/5), and preferred it over the manual entry workflow (4/5).</p><p><strong>Conclusion: </strong>EHR-To-EDC enabled data entry increases data manager productivity, reduces errors and is preferred by data managers over manual data entry.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 5","pages":"ooaf139"},"PeriodicalIF":3.4,"publicationDate":"2025-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12574785/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145431546","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Engaging end-users to develop a novel algorithm to process electronic medication adherence monitoring device data. 吸引终端用户开发一种新的算法来处理电子药物依从性监测设备数据。
IF 3.4 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-10-30 eCollection Date: 2025-10-01 DOI: 10.1093/jamiaopen/ooaf135
Meghan E McGrady, Kevin A Hommel, Constance A Mara, Gabriella Breen, Michal Kouril

Objective: To engage end-users to develop and evaluate an algorithm to convert electronic adherence monitoring device (EAMD) output into the adherence data required for analyses.

Materials and methods: This study included 4 phases. First, process mapping interviews and focus groups were conducted to identify rules for EAMD data processing and user needs. Second, algorithm parameters required to compute daily adherence values were defined and coded in an R package (OncMAP). Third, algorithm-produced data were compared to manually recoded data to evaluate the algorithm's sensitivity, specificity, and accuracy. Finally, pilot testing was conducted to obtain feedback on the perceived value/benefit of the algorithm and features that should be considered during software development.

Results: EAMD data processing rules were identified and coded in an R application. The algorithm correctly classified all complete observations with 100% sensitivity and specificity. The receiver operating characteristic curve analysis yielded an area under the curve of 1.00. All pilot testing participants expressed interest in using the algorithm (Net Promoter Score = 71%) but identified several features essential for inclusion in the software package to ensure widespread adoption.

Discussion: The decision rules implemented to process EAMD actuation data can be parameterized to develop an algorithm to automate this process. The algorithm demonstrated high sensitivity, specificity, and accuracy. End-users were enthusiastic about the product and provided insights to inform the development of a software package including the algorithm.

Conclusion: A rule-based algorithm can accurately process EAMD actuation data and has the potential to improve the rigor and pace of adherence science.

目的:吸引终端用户开发和评估一种算法,将电子依从性监测设备(EAMD)的输出转换为分析所需的依从性数据。材料与方法:本研究分为4期。首先,通过流程映射访谈和焦点小组来确定EAMD数据处理规则和用户需求。其次,在R包(OncMAP)中定义和编码计算每日附着值所需的算法参数。第三,将算法生成的数据与人工编码的数据进行比较,以评估算法的敏感性、特异性和准确性。最后,进行了试点测试,以获得关于算法的感知价值/收益的反馈,以及在软件开发过程中应该考虑的功能。结果:在R应用程序中识别并编码了EAMD数据处理规则。该算法以100%的灵敏度和特异性对所有完整的观测结果进行正确分类。受试者工作特征曲线分析得出曲线下面积为1.00。所有试点测试参与者都表示有兴趣使用该算法(净推荐值= 71%),但确定了软件包中包含的几个必要功能,以确保广泛采用。讨论:可以对用于处理EAMD驱动数据的决策规则进行参数化,以开发一种算法来实现该过程的自动化。该算法具有较高的灵敏度、特异性和准确性。最终用户对产品充满热情,并提供了见解,以告知包括算法在内的软件包的开发。结论:基于规则的算法能够准确地处理EAMD驱动数据,有可能提高依从性科学的严谨性和速度。
{"title":"Engaging end-users to develop a novel algorithm to process electronic medication adherence monitoring device data.","authors":"Meghan E McGrady, Kevin A Hommel, Constance A Mara, Gabriella Breen, Michal Kouril","doi":"10.1093/jamiaopen/ooaf135","DOIUrl":"10.1093/jamiaopen/ooaf135","url":null,"abstract":"<p><strong>Objective: </strong>To engage end-users to develop and evaluate an algorithm to convert electronic adherence monitoring device (EAMD) output into the adherence data required for analyses.</p><p><strong>Materials and methods: </strong>This study included 4 phases. First, process mapping interviews and focus groups were conducted to identify rules for EAMD data processing and user needs. Second, algorithm parameters required to compute daily adherence values were defined and coded in an R package (OncMAP). Third, algorithm-produced data were compared to manually recoded data to evaluate the algorithm's sensitivity, specificity, and accuracy. Finally, pilot testing was conducted to obtain feedback on the perceived value/benefit of the algorithm and features that should be considered during software development.</p><p><strong>Results: </strong>EAMD data processing rules were identified and coded in an R application. The algorithm correctly classified all complete observations with 100% sensitivity and specificity. The receiver operating characteristic curve analysis yielded an area under the curve of 1.00. All pilot testing participants expressed interest in using the algorithm (Net Promoter Score = 71%) but identified several features essential for inclusion in the software package to ensure widespread adoption.</p><p><strong>Discussion: </strong>The decision rules implemented to process EAMD actuation data can be parameterized to develop an algorithm to automate this process. The algorithm demonstrated high sensitivity, specificity, and accuracy. End-users were enthusiastic about the product and provided insights to inform the development of a software package including the algorithm.</p><p><strong>Conclusion: </strong>A rule-based algorithm can accurately process EAMD actuation data and has the potential to improve the rigor and pace of adherence science.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 5","pages":"ooaf135"},"PeriodicalIF":3.4,"publicationDate":"2025-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12574791/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145432476","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Meeting clinical recruitment milestones in an academic center: a data-driven, visual approach. 在学术中心满足临床招聘里程碑:数据驱动的可视化方法。
IF 3.4 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-10-30 eCollection Date: 2025-10-01 DOI: 10.1093/jamiaopen/ooaf125
Anna E Burns, John Tumberger, Mariah Brewe, Michael Bartkoski, Stephani L Stancil

Objectives: Describing the development of a visual dashboard leveraging available tools for efficient recruitment for patient centered clinical trials in resource constrained settings.

Materials and methods: A real-time, visual dashboard was developed, facilitating interactive visualizations, detailed analyses, and data quality control. Daily automated REDCap data retrieval occurred via an R program using REDCap API and output was integrated into Power BI. An interrupted time series analysis was conducted evaluating effects of dashboard on clinical trial recruitment metrics.

Results: The visual dashboard displayed key recruitment metrics, including individual participant progression and recruitment trends over time. Interrupted time series analysis showed improvements in screening rates upon implementation. The mean time to study completion decreased by 19 days following implementation.

Discussion: Customizable metrics offer comprehensive view of recruitment data and granularity, identifying actionable issues, enhancing study timeliness and completion.

Conclusion: Clinical trials of all budgets can integrate dashboards for real-time monitoring and data driven improvements to promote more timely completion.

目的:描述在资源受限的情况下,利用可用工具有效招募以患者为中心的临床试验的可视化仪表板的开发。材料和方法:开发了实时可视化仪表板,促进交互式可视化、详细分析和数据质量控制。通过使用REDCap API的R程序进行每日自动REDCap数据检索,并将输出集成到Power BI中。进行了中断时间序列分析,评估仪表板对临床试验招募指标的影响。结果:可视化仪表板显示了关键的招聘指标,包括个人参与者的进步和招聘趋势。中断时间序列分析显示,实施后筛查率有所提高。研究完成的平均时间在实施后减少了19天。讨论:可定制的指标提供招聘数据和粒度的综合视图,确定可操作的问题,提高学习的及时性和完成度。结论:所有预算的临床试验都可以整合仪表板进行实时监测和数据驱动改进,以促进更及时的完成。
{"title":"Meeting clinical recruitment milestones in an academic center: a data-driven, visual approach.","authors":"Anna E Burns, John Tumberger, Mariah Brewe, Michael Bartkoski, Stephani L Stancil","doi":"10.1093/jamiaopen/ooaf125","DOIUrl":"10.1093/jamiaopen/ooaf125","url":null,"abstract":"<p><strong>Objectives: </strong>Describing the development of a visual dashboard leveraging available tools for efficient recruitment for patient centered clinical trials in resource constrained settings.</p><p><strong>Materials and methods: </strong>A real-time, visual dashboard was developed, facilitating interactive visualizations, detailed analyses, and data quality control. Daily automated REDCap data retrieval occurred via an R program using REDCap API and output was integrated into Power BI. An interrupted time series analysis was conducted evaluating effects of dashboard on clinical trial recruitment metrics.</p><p><strong>Results: </strong>The visual dashboard displayed key recruitment metrics, including individual participant progression and recruitment trends over time. Interrupted time series analysis showed improvements in screening rates upon implementation. The mean time to study completion decreased by 19 days following implementation.</p><p><strong>Discussion: </strong>Customizable metrics offer comprehensive view of recruitment data and granularity, identifying actionable issues, enhancing study timeliness and completion.</p><p><strong>Conclusion: </strong>Clinical trials of all budgets can integrate dashboards for real-time monitoring and data driven improvements to promote more timely completion.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 5","pages":"ooaf125"},"PeriodicalIF":3.4,"publicationDate":"2025-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12574790/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145431951","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Human-centered design of an artificial intelligence monitoring system: the Vanderbilt Algorithmovigilance Monitoring and Operations System. 以人为本的人工智能监控系统设计:范德比尔特算法监控和操作系统。
IF 3.4 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-10-30 eCollection Date: 2025-10-01 DOI: 10.1093/jamiaopen/ooaf136
Megan E Salwei, Sharon E Davis, Carrie Reale, Laurie L Novak, Colin G Walsh, Russ Beebe, Scott Nelson, Sameer Sundrani, Susannah Rose, Adam Wright, Michael Ripperger, Peter Shave, Peter Embí

Objectives: As the use of artificial intelligence (AI) in healthcare is rapidly expanding, there is also growing recognition of the need for ongoing monitoring of AI after implementation, called algorithmovigilance. Yet, there remain few systems that support systematic monitoring and governance of AI used across a health system. In this study, we identify end-user needs for a novel AI monitoring system-the Vanderbilt Algorithmovigilance Monitoring and Operations System (VAMOS)-using human-centered design (HCD).

Materials and methods: We assembled a multidisciplinary team to plan AI monitoring and governance at Vanderbilt University Medical Center. We then conducted 9 participatory design sessions with diverse stakeholders to develop prototypes of VAMOS. Once we had a working prototype, we conducted 8 formative design interviews with key stakeholders to gather feedback on the system. We analyzed the interviews using a rapid qualitative analysis approach and revised the mock-ups. We then conducted a multidisciplinary heuristic evaluation to identify further improvements to the tool.

Results: Through an iterative, HCD process that engaged diverse end-users, we identified key components needed in AI monitoring systems. We identified specific data views and functionality required by end users across several user interfaces including a performance monitoring dashboard, accordion snapshots, and model-specific pages.

Discussion: We distilled general design requirements for systems to support AI monitoring throughout its lifecycle. One important consideration is how to support teams of health system leaders, clinical experts, and technical personnel that are distributed across the organization as they monitor and respond to algorithm deterioration.

Conclusion: VAMOS aims to support systematic and proactive monitoring of AI tools in healthcare organizations. Our findings and recommendations can support the design of AI monitoring systems to support health systems, improve quality of care, and ensure patient safety.

目标:随着人工智能(AI)在医疗保健领域的应用迅速扩大,人们也越来越认识到需要在实施后对AI进行持续监测,称为算法警戒。然而,支持对整个卫生系统使用的人工智能进行系统监测和治理的系统仍然很少。在本研究中,我们确定了终端用户对一种新型人工智能监测系统的需求——Vanderbilt算法警戒监测和操作系统(VAMOS)——采用以人为本的设计(HCD)。材料和方法:我们组建了一个多学科团队,在范德比尔特大学医学中心规划人工智能监测和治理。然后,我们与不同的利益相关者进行了9次参与式设计会议,以开发VAMOS的原型。一旦我们有了一个可工作的原型,我们与关键的利益相关者进行了8次形成性的设计访谈,以收集关于系统的反馈。我们使用快速定性分析方法分析访谈并修改模型。然后,我们进行了多学科启发式评估,以确定该工具的进一步改进。结果:通过一个迭代的HCD过程,让不同的终端用户参与进来,我们确定了人工智能监控系统所需的关键组件。我们确定了最终用户跨多个用户界面所需的特定数据视图和功能,包括性能监视仪表板、手风琴快照和特定于模型的页面。讨论:我们提炼了系统的一般设计需求,以便在整个生命周期中支持AI监控。一个重要的考虑是如何支持分布在整个组织的卫生系统领导、临床专家和技术人员团队监测和应对算法恶化。结论:VAMOS旨在支持医疗机构中人工智能工具的系统和主动监测。我们的研究结果和建议可以支持人工智能监测系统的设计,以支持卫生系统,提高护理质量并确保患者安全。
{"title":"Human-centered design of an artificial intelligence monitoring system: the Vanderbilt Algorithmovigilance Monitoring and Operations System.","authors":"Megan E Salwei, Sharon E Davis, Carrie Reale, Laurie L Novak, Colin G Walsh, Russ Beebe, Scott Nelson, Sameer Sundrani, Susannah Rose, Adam Wright, Michael Ripperger, Peter Shave, Peter Embí","doi":"10.1093/jamiaopen/ooaf136","DOIUrl":"10.1093/jamiaopen/ooaf136","url":null,"abstract":"<p><strong>Objectives: </strong>As the use of artificial intelligence (AI) in healthcare is rapidly expanding, there is also growing recognition of the need for ongoing monitoring of AI after implementation, called <i>algorithmovigilance</i>. Yet, there remain few systems that support systematic monitoring and governance of AI used across a health system. In this study, we identify end-user needs for a novel AI monitoring system-the Vanderbilt Algorithmovigilance Monitoring and Operations System (VAMOS)-using human-centered design (HCD).</p><p><strong>Materials and methods: </strong>We assembled a multidisciplinary team to plan AI monitoring and governance at Vanderbilt University Medical Center. We then conducted 9 participatory design sessions with diverse stakeholders to develop prototypes of VAMOS. Once we had a working prototype, we conducted 8 formative design interviews with key stakeholders to gather feedback on the system. We analyzed the interviews using a rapid qualitative analysis approach and revised the mock-ups. We then conducted a multidisciplinary heuristic evaluation to identify further improvements to the tool.</p><p><strong>Results: </strong>Through an iterative, HCD process that engaged diverse end-users, we identified key components needed in AI monitoring systems. We identified specific data views and functionality required by end users across several user interfaces including a performance monitoring dashboard, accordion snapshots, and model-specific pages.</p><p><strong>Discussion: </strong>We distilled general design requirements for systems to support AI monitoring throughout its lifecycle. One important consideration is how to support teams of health system leaders, clinical experts, and technical personnel that are distributed across the organization as they monitor and respond to algorithm deterioration.</p><p><strong>Conclusion: </strong>VAMOS aims to support systematic and proactive monitoring of AI tools in healthcare organizations. Our findings and recommendations can support the design of AI monitoring systems to support health systems, improve quality of care, and ensure patient safety.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 5","pages":"ooaf136"},"PeriodicalIF":3.4,"publicationDate":"2025-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12574793/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145431970","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automated survey collection with LLM-based conversational agents. 使用基于llm的会话代理自动收集调查问卷。
IF 3.4 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-10-30 eCollection Date: 2025-10-01 DOI: 10.1093/jamiaopen/ooaf103
Kurmanbek Kaiyrbekov, Nicholas J Dobbins, Sean D Mooney

Objectives: Phone surveys are crucial for collecting health data but are expensive, time-consuming, and difficult to scale. To overcome these limitations, we propose a survey collection approach powered by conversational Large Language Models (LLMs).

Materials and methods: Our framework leverages an LLM-powered conversational agent to conduct surveys and transcribe conversations, along with an LLM (GPT-4o) to extract responses from the transcripts. We evaluated the framework's performance by analyzing transcription errors, the accuracy of inferred survey responses, and participant experiences across 40 survey responses collected from a convenience sample of 8 individuals, each adopting the role of five LLM-generated personas.

Results: GPT-4o extracted responses to survey questions with an average accuracy of 98%, despite an average transcription word error rate of 7.7%. Participants reported occasional errors by the conversational agent but praised its ability to demonstrate comprehension and maintain engaging conversations.

Discussion and conclusion: Our study showcases the potential of LLM agents to enable scalable, AI-powered phone surveys, reducing human effort and advancing healthcare data collection.

目的:电话调查对收集健康数据至关重要,但昂贵、耗时且难以规模化。为了克服这些限制,我们提出了一种由会话式大型语言模型(llm)提供支持的调查收集方法。材料和方法:我们的框架利用LLM支持的会话代理来进行调查和转录会话,以及LLM (gpt - 40)从转录中提取响应。我们通过分析转录错误、推断调查回答的准确性以及从8个人的便利样本中收集的40份调查回答的参与者体验来评估该框架的性能,每个人都采用法学硕士生成的5个角色。结果:gpt - 40提取调查问题答案的平均准确率为98%,尽管平均转录词错误率为7.7%。参与者报告了对话代理偶尔出现的错误,但赞扬了它展示理解能力和保持对话吸引力的能力。讨论和结论:我们的研究展示了LLM代理在实现可扩展的、人工智能驱动的电话调查、减少人力和推进医疗保健数据收集方面的潜力。
{"title":"Automated survey collection with LLM-based conversational agents.","authors":"Kurmanbek Kaiyrbekov, Nicholas J Dobbins, Sean D Mooney","doi":"10.1093/jamiaopen/ooaf103","DOIUrl":"10.1093/jamiaopen/ooaf103","url":null,"abstract":"<p><strong>Objectives: </strong>Phone surveys are crucial for collecting health data but are expensive, time-consuming, and difficult to scale. To overcome these limitations, we propose a survey collection approach powered by conversational Large Language Models (LLMs).</p><p><strong>Materials and methods: </strong>Our framework leverages an LLM-powered conversational agent to conduct surveys and transcribe conversations, along with an LLM (GPT-4o) to extract responses from the transcripts. We evaluated the framework's performance by analyzing transcription errors, the accuracy of inferred survey responses, and participant experiences across 40 survey responses collected from a convenience sample of 8 individuals, each adopting the role of five LLM-generated personas.</p><p><strong>Results: </strong>GPT-4o extracted responses to survey questions with an average accuracy of 98%, despite an average transcription word error rate of 7.7%. Participants reported occasional errors by the conversational agent but praised its ability to demonstrate comprehension and maintain engaging conversations.</p><p><strong>Discussion and conclusion: </strong>Our study showcases the potential of LLM agents to enable scalable, AI-powered phone surveys, reducing human effort and advancing healthcare data collection.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 5","pages":"ooaf103"},"PeriodicalIF":3.4,"publicationDate":"2025-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12574787/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145432479","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
JAMIA Open
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1