首页 > 最新文献

JMIR Medical Informatics最新文献

英文 中文
Examining Linguistic Differences in Electronic Health Records for Diverse Patients With Diabetes: Natural Language Processing Analysis. 研究不同糖尿病患者电子健康记录中的语言差异:自然语言处理分析。
IF 3.2 3区 医学 Q1 Health Professions Pub Date : 2024-05-23 DOI: 10.2196/50428
Isabel Bilotta, Scott Tonidandel, Winston R Liaw, Eden King, Diana N Carvajal, Ayana Taylor, Julie Thamby, Yang Xiang, Cui Tao, Michael Hansen

Background: Individuals from minoritized racial and ethnic backgrounds experience pernicious and pervasive health disparities that have emerged, in part, from clinician bias.

Objective: We used a natural language processing approach to examine whether linguistic markers in electronic health record (EHR) notes differ based on the race and ethnicity of the patient. To validate this methodological approach, we also assessed the extent to which clinicians perceive linguistic markers to be indicative of bias.

Methods: In this cross-sectional study, we extracted EHR notes for patients who were aged 18 years or older; had more than 5 years of diabetes diagnosis codes; and received care between 2006 and 2014 from family physicians, general internists, or endocrinologists practicing in an urban, academic network of clinics. The race and ethnicity of patients were defined as White non-Hispanic, Black non-Hispanic, or Hispanic or Latino. We hypothesized that Sentiment Analysis and Social Cognition Engine (SEANCE) components (ie, negative adjectives, positive adjectives, joy words, fear and disgust words, politics words, respect words, trust verbs, and well-being words) and mean word count would be indicators of bias if racial differences emerged. We performed linear mixed effects analyses to examine the relationship between the outcomes of interest (the SEANCE components and word count) and patient race and ethnicity, controlling for patient age. To validate this approach, we asked clinicians to indicate the extent to which they thought variation in the use of SEANCE language domains for different racial and ethnic groups was reflective of bias in EHR notes.

Results: We examined EHR notes (n=12,905) of Black non-Hispanic, White non-Hispanic, and Hispanic or Latino patients (n=1562), who were seen by 281 physicians. A total of 27 clinicians participated in the validation study. In terms of bias, participants rated negative adjectives as 8.63 (SD 2.06), fear and disgust words as 8.11 (SD 2.15), and positive adjectives as 7.93 (SD 2.46) on a scale of 1 to 10, with 10 being extremely indicative of bias. Notes for Black non-Hispanic patients contained significantly more negative adjectives (coefficient 0.07, SE 0.02) and significantly more fear and disgust words (coefficient 0.007, SE 0.002) than those for White non-Hispanic patients. The notes for Hispanic or Latino patients included significantly fewer positive adjectives (coefficient -0.02, SE 0.007), trust verbs (coefficient -0.009, SE 0.004), and joy words (coefficient -0.03, SE 0.01) than those for White non-Hispanic patients.

Conclusions: This approach may enable physicians and researchers to identify and mitigate bias in medical interactions, with the goal of reducing health disparities stemming from bias.

背景:来自少数种族和民族背景的人在健康方面存在着严重而普遍的差异,这些差异部分源于临床医生的偏见:我们使用自然语言处理方法来研究电子健康记录(EHR)笔记中的语言标记是否因患者的种族和民族而有所不同。为了验证这种方法,我们还评估了临床医生认为语言标记在多大程度上表明了偏见:在这项横断面研究中,我们提取了年龄在 18 岁或 18 岁以上、有 5 年以上糖尿病诊断代码、在 2006 年至 2014 年期间接受过家庭医生、普通内科医生或内分泌科医生治疗的患者的电子病历记录。患者的种族和民族被定义为非西班牙裔白人、非西班牙裔黑人、西班牙裔或拉丁裔。我们假设,如果出现种族差异,情感分析和社会认知引擎(SEANCE)的成分(即负面形容词、正面形容词、欢乐词、恐惧和厌恶词、政治词、尊重词、信任动词和幸福词)和平均字数将成为偏见的指标。我们进行了线性混合效应分析,以检验相关结果(SEANCE 成分和字数)与患者种族和民族之间的关系,同时控制患者年龄。为了验证这种方法,我们请临床医生说明他们认为不同种族和族裔群体在使用 SEANCE 语言域方面的差异在多大程度上反映了电子病历笔记中的偏差:我们检查了由 281 名医生诊治的非西班牙裔黑人、非西班牙裔白人、西班牙裔或拉丁裔患者(n=1562)的电子病历记录(n=12905)。共有 27 名临床医生参与了验证研究。在偏差方面,参与者对负面形容词的评分为 8.63(标度 2.06),对恐惧和厌恶词的评分为 8.11(标度 2.15),对正面形容词的评分为 7.93(标度 2.46),评分范围从 1 到 10,10 为极度偏差。黑人非西班牙裔患者的笔记中包含的负面形容词(系数 0.07,SE 0.02)以及恐惧和厌恶词(系数 0.007,SE 0.002)明显多于白人非西班牙裔患者的笔记。西班牙裔或拉丁裔患者的笔记中包含的积极形容词(系数-0.02,SE 0.007)、信任动词(系数-0.009,SE 0.004)和欢乐词(系数-0.03,SE 0.01)明显少于非西班牙裔白人患者:这种方法可以帮助医生和研究人员识别并减轻医疗互动中的偏差,从而减少因偏差造成的健康差异。
{"title":"Examining Linguistic Differences in Electronic Health Records for Diverse Patients With Diabetes: Natural Language Processing Analysis.","authors":"Isabel Bilotta, Scott Tonidandel, Winston R Liaw, Eden King, Diana N Carvajal, Ayana Taylor, Julie Thamby, Yang Xiang, Cui Tao, Michael Hansen","doi":"10.2196/50428","DOIUrl":"10.2196/50428","url":null,"abstract":"<p><strong>Background: </strong>Individuals from minoritized racial and ethnic backgrounds experience pernicious and pervasive health disparities that have emerged, in part, from clinician bias.</p><p><strong>Objective: </strong>We used a natural language processing approach to examine whether linguistic markers in electronic health record (EHR) notes differ based on the race and ethnicity of the patient. To validate this methodological approach, we also assessed the extent to which clinicians perceive linguistic markers to be indicative of bias.</p><p><strong>Methods: </strong>In this cross-sectional study, we extracted EHR notes for patients who were aged 18 years or older; had more than 5 years of diabetes diagnosis codes; and received care between 2006 and 2014 from family physicians, general internists, or endocrinologists practicing in an urban, academic network of clinics. The race and ethnicity of patients were defined as White non-Hispanic, Black non-Hispanic, or Hispanic or Latino. We hypothesized that Sentiment Analysis and Social Cognition Engine (SEANCE) components (ie, negative adjectives, positive adjectives, joy words, fear and disgust words, politics words, respect words, trust verbs, and well-being words) and mean word count would be indicators of bias if racial differences emerged. We performed linear mixed effects analyses to examine the relationship between the outcomes of interest (the SEANCE components and word count) and patient race and ethnicity, controlling for patient age. To validate this approach, we asked clinicians to indicate the extent to which they thought variation in the use of SEANCE language domains for different racial and ethnic groups was reflective of bias in EHR notes.</p><p><strong>Results: </strong>We examined EHR notes (n=12,905) of Black non-Hispanic, White non-Hispanic, and Hispanic or Latino patients (n=1562), who were seen by 281 physicians. A total of 27 clinicians participated in the validation study. In terms of bias, participants rated negative adjectives as 8.63 (SD 2.06), fear and disgust words as 8.11 (SD 2.15), and positive adjectives as 7.93 (SD 2.46) on a scale of 1 to 10, with 10 being extremely indicative of bias. Notes for Black non-Hispanic patients contained significantly more negative adjectives (coefficient 0.07, SE 0.02) and significantly more fear and disgust words (coefficient 0.007, SE 0.002) than those for White non-Hispanic patients. The notes for Hispanic or Latino patients included significantly fewer positive adjectives (coefficient -0.02, SE 0.007), trust verbs (coefficient -0.009, SE 0.004), and joy words (coefficient -0.03, SE 0.01) than those for White non-Hispanic patients.</p><p><strong>Conclusions: </strong>This approach may enable physicians and researchers to identify and mitigate bias in medical interactions, with the goal of reducing health disparities stemming from bias.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2024-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11137426/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141089296","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine Learning Models for Parkinson Disease: Systematic Review 帕金森病的机器学习模型:系统回顾
IF 3.2 3区 医学 Q1 Health Professions Pub Date : 2024-05-17 DOI: 10.2196/50117
Thasina Tabashum, Robert Cooper Snyder, Megan K O'Brien, Mark V Albert
Background: With the increasing availability of data, computing resources, and easier-to-use software libraries, machine learning (ML) is increasingly utilized in disease detection and prediction, including Parkinson’s disease (PD). Despite the large number of studies published every year, very few ML systems have been adopted for real-world subject use. Objective: In particular, a lack of external validity may result in poor performance of these systems in clinical practice. Additional methodological issues in ML design and reporting can also hinder clinical adoption, even for applications that would benefit from such data-driven systems. To sample the current ML practices in PD applications, we conducted a systematic review of studies in 2020 and 2021 that use ML models to diagnose PD or to track PD progression. Methods: We conducted a systematic literature review in accordance with PRISMA in PubMed between January 2020 - April 2021, using the exact string “Parkinson’s” AND (“ML” OR “prediction” OR “classification” OR “detection” or “artificial intelligence” OR “AI”), resulting in 1085 publications from the search results. After a search query and review, we found 113 publications that used ML for the classification or regression-based prediction of PD or PD-related symptoms. Results: Only 25.7% of studies used a hold-out test set to avoid potentially inflated accuracies, and approximately half of the studies without a hold-out test set did not state this as a potential concern. Surprisingly, 38.9% of studies did not report on how or if models were tuned, and an additional 27.4% used ad hoc model tuning, which is generally frowned upon in ML model optimization. Only 15% of studies performed direct comparisons of results with other models, severely limiting the interpretation of results. Conclusions: This review highlights the notable limitations of current ML systems and techniques that may contribute to a gap between reported performance in research and the real-life applicability of ML models aiming to detect and predict diseases such as PD.
背景:随着数据、计算资源和易于使用的软件库的可用性不断提高,机器学习(ML)越来越多地被用于疾病检测和预测,包括帕金森病(PD)。尽管每年都有大量研究成果发表,但很少有 ML 系统被实际应用。目标:特别是,缺乏外部有效性可能会导致这些系统在临床实践中表现不佳。此外,ML 设计和报告中的其他方法问题也会阻碍临床应用,即使是那些能从此类数据驱动型系统中获益的应用。为了对目前帕金森病应用中的 ML 实践进行抽样调查,我们对 2020 年和 2021 年使用 ML 模型诊断帕金森病或跟踪帕金森病进展的研究进行了系统回顾。方法:2020 年 1 月至 2021 年 4 月期间,我们按照 PRISMA 在 PubMed 上进行了系统性文献综述,使用精确字符串 "帕金森病 "和("ML "或 "预测 "或 "分类 "或 "检测 "或 "人工智能 "或 "AI"),从搜索结果中获得了 1085 篇出版物。经过搜索查询和审查,我们找到了 113 篇使用 ML 对帕金森病或帕金森病相关症状进行分类或回归预测的文献。结果发现只有 25.7% 的研究使用了暂缓测试集以避免可能出现的误差,而在没有使用暂缓测试集的研究中,约有一半的研究没有将此作为潜在的关注点。令人惊讶的是,有 38.9% 的研究没有报告如何或是否对模型进行了调整,另有 27.4% 的研究使用了特别的模型调整,而这在 ML 模型优化中通常是不受欢迎的。只有 15% 的研究将结果与其他模型进行了直接比较,这严重限制了对结果的解释。结论:本综述强调了当前 ML 系统和技术的显著局限性,这些局限性可能导致研究报告的性能与旨在检测和预测 PD 等疾病的 ML 模型的实际应用性之间存在差距。
{"title":"Machine Learning Models for Parkinson Disease: Systematic Review","authors":"Thasina Tabashum, Robert Cooper Snyder, Megan K O'Brien, Mark V Albert","doi":"10.2196/50117","DOIUrl":"https://doi.org/10.2196/50117","url":null,"abstract":"Background: With the increasing availability of data, computing resources, and easier-to-use software libraries, machine learning (ML) is increasingly utilized in disease detection and prediction, including Parkinson’s disease (PD). Despite the large number of studies published every year, very few ML systems have been adopted for real-world subject use. Objective: In particular, a lack of external validity may result in poor performance of these systems in clinical practice. Additional methodological issues in ML design and reporting can also hinder clinical adoption, even for applications that would benefit from such data-driven systems. To sample the current ML practices in PD applications, we conducted a systematic review of studies in 2020 and 2021 that use ML models to diagnose PD or to track PD progression. Methods: We conducted a systematic literature review in accordance with PRISMA in PubMed between January 2020 - April 2021, using the exact string “Parkinson’s” AND (“ML” OR “prediction” OR “classification” OR “detection” or “artificial intelligence” OR “AI”), resulting in 1085 publications from the search results. After a search query and review, we found 113 publications that used ML for the classification or regression-based prediction of PD or PD-related symptoms. Results: Only 25.7% of studies used a hold-out test set to avoid potentially inflated accuracies, and approximately half of the studies without a hold-out test set did not state this as a potential concern. Surprisingly, 38.9% of studies did not report on how or if models were tuned, and an additional 27.4% used ad hoc model tuning, which is generally frowned upon in ML model optimization. Only 15% of studies performed direct comparisons of results with other models, severely limiting the interpretation of results. Conclusions: This review highlights the notable limitations of current ML systems and techniques that may contribute to a gap between reported performance in research and the real-life applicability of ML models aiming to detect and predict diseases such as PD.","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2024-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141064119","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Use of Generative AI for Scientific Literature Searches for Systematic Reviews: ChatGPT and Microsoft Bing AI Performance Evaluation 在系统综述的科学文献检索中使用生成式人工智能:ChatGPT 和微软必应人工智能性能评估
IF 3.2 3区 医学 Q1 Health Professions Pub Date : 2024-05-14 DOI: 10.2196/51187
Yong Nam Gwon, Jae Heon Kim, Hyun Soo Chung, Eun Jee Jung, Joey Chun, Serin Lee, Sung Ryul Shim
Background: A large language model (LLM) is a type of artificial intelligence (AI) model that opens up great possibilities for healthcare practice, research, and education, although scholars have highlighted that there is a need to proactively address current issues regarding its use. One of the best-known LLMs is ChatGPT. Objective: This study aims to explore the potential of ChatGPT as a real-time literature search tool for systematic reviews and clinical decision support system (CDSS). Methods: The search results of a systematic review study on the treatment of Peyronie's Disease published by human experts were selected as a benchmark, and the literature search formula of the study was applied to ChatGPT and Microsoft Bing to compare with human researchers. To determine the accuracy of the retrieved literature, we graded it as A, B, C, and F for only those cases where actual literature exists. Results: The benchmark human researcher's randomized controlled trial search results were 24. ChatGPT collected 1287 literature search results through 639 questions, and 7 of them were exactly matched, and Microsoft Bing collected 48 literature search results through 223 questions, and 19 of them were exactly matched with human search results. Conclusions: This is the first study to compare artificial intelligence (AI) and conventional human systematic review methods as a real-time literature collection tool for evidence-based medicine. The results suggest that the use of ChatGPT as a tool for real-time evidence generation is not yet accurate and feasible. Therefore, researchers should be cautious about using such AI.
背景:大型语言模型(LLM)是人工智能(AI)模型的一种,它为医疗实践、研究和教育提供了巨大的可能性,不过学者们也强调,需要积极解决当前在使用 LLM 方面存在的问题。最著名的人工智能模型之一是 ChatGPT。研究目的本研究旨在探索 ChatGPT 作为系统综述和临床决策支持系统 (CDSS) 的实时文献检索工具的潜力。研究方法选取人类专家发表的关于佩罗尼氏病治疗的系统综述研究的检索结果作为基准,将该研究的文献检索公式应用于 ChatGPT 和 Microsoft Bing,与人类研究人员进行比较。为了确定检索到的文献的准确性,我们仅对存在实际文献的情况将其分为 A、B、C 和 F 四个等级。结果基准人类研究人员的随机对照试验搜索结果为 24。ChatGPT 通过 639 个问题收集了 1287 条文献检索结果,其中 7 条完全匹配;微软必应通过 223 个问题收集了 48 条文献检索结果,其中 19 条与人类检索结果完全匹配。研究结论这是第一项将人工智能(AI)和传统的人类系统综述方法作为循证医学实时文献收集工具进行比较的研究。结果表明,使用 ChatGPT 作为实时证据生成工具尚不准确和可行。因此,研究人员应谨慎使用此类人工智能。
{"title":"The Use of Generative AI for Scientific Literature Searches for Systematic Reviews: ChatGPT and Microsoft Bing AI Performance Evaluation","authors":"Yong Nam Gwon, Jae Heon Kim, Hyun Soo Chung, Eun Jee Jung, Joey Chun, Serin Lee, Sung Ryul Shim","doi":"10.2196/51187","DOIUrl":"https://doi.org/10.2196/51187","url":null,"abstract":"Background: A large language model (LLM) is a type of artificial intelligence (AI) model that opens up great possibilities for healthcare practice, research, and education, although scholars have highlighted that there is a need to proactively address current issues regarding its use. One of the best-known LLMs is ChatGPT. Objective: This study aims to explore the potential of ChatGPT as a real-time literature search tool for systematic reviews and clinical decision support system (CDSS). Methods: The search results of a systematic review study on the treatment of Peyronie's Disease published by human experts were selected as a benchmark, and the literature search formula of the study was applied to ChatGPT and Microsoft Bing to compare with human researchers. To determine the accuracy of the retrieved literature, we graded it as A, B, C, and F for only those cases where actual literature exists. Results: The benchmark human researcher's randomized controlled trial search results were 24. ChatGPT collected 1287 literature search results through 639 questions, and 7 of them were exactly matched, and Microsoft Bing collected 48 literature search results through 223 questions, and 19 of them were exactly matched with human search results. Conclusions: This is the first study to compare artificial intelligence (AI) and conventional human systematic review methods as a real-time literature collection tool for evidence-based medicine. The results suggest that the use of ChatGPT as a tool for real-time evidence generation is not yet accurate and feasible. Therefore, researchers should be cautious about using such AI.","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2024-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140927429","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Ventilator-Associated Pneumonia Prediction Models Based on AI: Scoping Review 基于人工智能的呼吸机相关肺炎预测模型:范围界定综述
IF 3.2 3区 医学 Q1 Health Professions Pub Date : 2024-05-14 DOI: 10.2196/57026
Jinbo Zhang, Pingping Yang, Lu Zeng, Shan Li, Jiamei Zhou
Background: Ventilator-associated pneumonia (VAP) is a serious complication of mechanical ventilation therapy that affects patient treatment and prognosis. Owing to its excellent data mining capabilities, artificial intelligence (AI) has been increasingly used to predict VAP. Objective: This article reviews the prediction models for VAP based on AI, providing a reference for the early identification of high-risk groups in future clinical practice. Methods: A scoping review was conducted in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) extension guidelines. The Wanfang, Chinese BioMedical Literature Database, Cochrane Library, Web of Science, PubMed, MEDLINE, and Embase databases were searched to identify relevant articles. Study selection and data extraction were independently conducted by two reviewers. The data extracted from the included studies were synthesized narratively. Results: From 137 publications, 11 were included in the scoping review. The included studies reported the use of AI for predicting VAP. All of the 11 studies predicted VAP occurrence, and studies on VAP prognosis were excluded. Further, these studies used text data, and none of them involved imaging data. Public databases were used as the primary data choice for model building (6/11, 55 %), whereas the remaining studies had sample sizes smaller than 1000. Machine learning is the primary algorithm for studying the VAP prediction models. However, deep learning and large language models are not used to construct VAP prediction models. Random forest is the most commonly used algorithm (5/11, 45 %). All studies are internal validations, and none of them address how the model is used. Conclusions: This review presents an overview of studies based on AI used to predict and diagnose VAP. AI models have better predictive performance than traditional methods and are expected to provide an indispensable tool for the risk prediction of VAP in the future. However, the current research is in the stage of model construction and validation, and the implementation and guidance for the clinical prediction of VAP require further research.
背景:呼吸机相关肺炎(VAP)是机械通气治疗的一种严重并发症,会影响患者的治疗和预后。人工智能(AI)具有出色的数据挖掘能力,因此越来越多地被用于预测 VAP。目的:本文综述了基于人工智能的 VAP 预测模型,为今后临床实践中早期识别高危人群提供参考。方法根据《系统综述和荟萃分析首选报告项目》(Preferred Reporting Items for Systematic Reviews and Meta-Analyses,PRISMA)扩展指南进行范围综述。检索了万方数据库、中国生物医学文献数据库、Cochrane图书馆、Web of Science、PubMed、MEDLINE和Embase数据库,以确定相关文章。研究选择和数据提取由两名审稿人独立完成。从纳入的研究中提取的数据进行了叙述性综合。结果:在 137 篇出版物中,有 11 篇被纳入范围界定审查。纳入的研究报告了使用人工智能预测 VAP 的情况。这 11 项研究均预测了 VAP 的发生,而关于 VAP 预后的研究则被排除在外。此外,这些研究使用的都是文本数据,没有一项涉及影像学数据。公共数据库是建立模型的主要数据选择(6/11,55%),而其余研究的样本量均小于 1000。机器学习是研究 VAP 预测模型的主要算法。然而,深度学习和大型语言模型并未用于构建 VAP 预测模型。随机森林是最常用的算法(5/11,45%)。所有研究都是内部验证,没有一项研究涉及如何使用模型。结论:本综述概述了基于人工智能预测和诊断 VAP 的研究。与传统方法相比,人工智能模型具有更好的预测性能,有望在未来为 VAP 的风险预测提供不可或缺的工具。然而,目前的研究还处于模型构建和验证阶段,如何实施并指导 VAP 的临床预测还需要进一步研究。
{"title":"Ventilator-Associated Pneumonia Prediction Models Based on AI: Scoping Review","authors":"Jinbo Zhang, Pingping Yang, Lu Zeng, Shan Li, Jiamei Zhou","doi":"10.2196/57026","DOIUrl":"https://doi.org/10.2196/57026","url":null,"abstract":"Background: Ventilator-associated pneumonia (VAP) is a serious complication of mechanical ventilation therapy that affects patient treatment and prognosis. Owing to its excellent data mining capabilities, artificial intelligence (AI) has been increasingly used to predict VAP. Objective: This article reviews the prediction models for VAP based on AI, providing a reference for the early identification of high-risk groups in future clinical practice. Methods: A scoping review was conducted in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) extension guidelines. The Wanfang, Chinese BioMedical Literature Database, Cochrane Library, Web of Science, PubMed, MEDLINE, and Embase databases were searched to identify relevant articles. Study selection and data extraction were independently conducted by two reviewers. The data extracted from the included studies were synthesized narratively. Results: From 137 publications, 11 were included in the scoping review. The included studies reported the use of AI for predicting VAP. All of the 11 studies predicted VAP occurrence, and studies on VAP prognosis were excluded. Further, these studies used text data, and none of them involved imaging data. Public databases were used as the primary data choice for model building (6/11, 55 %), whereas the remaining studies had sample sizes smaller than 1000. Machine learning is the primary algorithm for studying the VAP prediction models. However, deep learning and large language models are not used to construct VAP prediction models. Random forest is the most commonly used algorithm (5/11, 45 %). All studies are internal validations, and none of them address how the model is used. Conclusions: This review presents an overview of studies based on AI used to predict and diagnose VAP. AI models have better predictive performance than traditional methods and are expected to provide an indispensable tool for the risk prediction of VAP in the future. However, the current research is in the stage of model construction and validation, and the implementation and guidance for the clinical prediction of VAP require further research.","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2024-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140927373","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Role of Large Language Models in Transforming Emergency Medicine: Scoping Review. 大型语言模型在急诊医学变革中的作用:范围审查。
IF 3.2 3区 医学 Q1 Health Professions Pub Date : 2024-05-10 DOI: 10.2196/53787
Carl Preiksaitis, Nicholas Ashenburg, Gabrielle Bunney, Andrew Chu, Rana Kabeer, Fran Riley, Ryan Ribeira, Christian Rose

Background: Artificial intelligence (AI), more specifically large language models (LLMs), holds significant potential in revolutionizing emergency care delivery by optimizing clinical workflows and enhancing the quality of decision-making. Although enthusiasm for integrating LLMs into emergency medicine (EM) is growing, the existing literature is characterized by a disparate collection of individual studies, conceptual analyses, and preliminary implementations. Given these complexities and gaps in understanding, a cohesive framework is needed to comprehend the existing body of knowledge on the application of LLMs in EM.

Objective: Given the absence of a comprehensive framework for exploring the roles of LLMs in EM, this scoping review aims to systematically map the existing literature on LLMs' potential applications within EM and identify directions for future research. Addressing this gap will allow for informed advancements in the field.

Methods: Using PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews) criteria, we searched Ovid MEDLINE, Embase, Web of Science, and Google Scholar for papers published between January 2018 and August 2023 that discussed LLMs' use in EM. We excluded other forms of AI. A total of 1994 unique titles and abstracts were screened, and each full-text paper was independently reviewed by 2 authors. Data were abstracted independently, and 5 authors performed a collaborative quantitative and qualitative synthesis of the data.

Results: A total of 43 papers were included. Studies were predominantly from 2022 to 2023 and conducted in the United States and China. We uncovered four major themes: (1) clinical decision-making and support was highlighted as a pivotal area, with LLMs playing a substantial role in enhancing patient care, notably through their application in real-time triage, allowing early recognition of patient urgency; (2) efficiency, workflow, and information management demonstrated the capacity of LLMs to significantly boost operational efficiency, particularly through the automation of patient record synthesis, which could reduce administrative burden and enhance patient-centric care; (3) risks, ethics, and transparency were identified as areas of concern, especially regarding the reliability of LLMs' outputs, and specific studies highlighted the challenges of ensuring unbiased decision-making amidst potentially flawed training data sets, stressing the importance of thorough validation and ethical oversight; and (4) education and communication possibilities included LLMs' capacity to enrich medical training, such as through using simulated patient interactions that enhance communication skills.

Conclusions: LLMs have the potential to fundamentally transform EM, enhancing clinical decision-making, optimizing workflows, and improving patient outc

背景:人工智能(AI),更具体地说是大型语言模型(LLMs),通过优化临床工作流程和提高决策质量,在彻底改变急诊护理服务方面具有巨大潜力。虽然将 LLMs 融入急诊医学(EM)的热情日益高涨,但现有文献的特点是收集了大量不同的单项研究、概念分析和初步实施。鉴于这些复杂性和认识上的差距,我们需要一个具有凝聚力的框架来理解现有的关于在急诊医学中应用 LLM 的知识体系:鉴于目前还没有一个全面的框架来探讨有限法律知识在电磁学中的作用,本范围综述旨在系统地梳理有关有限法律知识在电磁学中潜在应用的现有文献,并确定未来研究的方向。填补这一空白将有助于在知情的情况下推进该领域的研究:使用 PRISMA-ScR(系统综述和 Meta 分析的首选报告项目,范围综述的扩展)标准,我们检索了 Ovid MEDLINE、Embase、Web of Science 和 Google Scholar 在 2018 年 1 月至 2023 年 8 月期间发表的讨论 LLMs 在 EM 中应用的论文。我们排除了其他形式的人工智能。共筛选出 1994 篇独特的标题和摘要,每篇全文均由两名作者独立审阅。5位作者对数据进行了定量和定性的综合分析:结果:共纳入 43 篇论文。研究时间主要集中在 2022 年至 2023 年,研究地点主要在美国和中国。我们发现了四大主题:(1) 临床决策和支持是一个关键领域,LLM 在加强患者护理方面发挥着重要作用,特别是通过应用于实时分诊,可以及早识别患者的紧急状况;(2) 效率、工作流程和信息管理表明,LLM 能够显著提高运营效率,特别是通过病历合成自动化,可以减轻行政负担,加强以患者为中心的护理;(3) 风险、伦理和透明度被认为是值得关注的领域,特别是在 LLM 输出的可靠性方面,具体研究强调了在可能存在缺陷的培训数据集中确保无偏见决策所面临的挑战,强调了彻底验证和伦理监督的重要性;以及 (4) 教育和交流的可能性包括 LLM 丰富医学培训的能力,例如通过使用模拟病人互动来提高交流技能。结论:LLMs 有可能从根本上改变医学教育,加强临床决策,优化工作流程,改善患者预后。本综述通过确定关键研究领域为未来的进步奠定了基础:LLM 应用的前瞻性验证、建立负责任使用的标准、了解提供者和患者的看法以及提高医生的人工智能素养。将 LLM 有效地整合到电磁学中需要各方的共同努力和全面的评估,以确保这些技术能够安全、有效地应用。
{"title":"The Role of Large Language Models in Transforming Emergency Medicine: Scoping Review.","authors":"Carl Preiksaitis, Nicholas Ashenburg, Gabrielle Bunney, Andrew Chu, Rana Kabeer, Fran Riley, Ryan Ribeira, Christian Rose","doi":"10.2196/53787","DOIUrl":"10.2196/53787","url":null,"abstract":"<p><strong>Background: </strong>Artificial intelligence (AI), more specifically large language models (LLMs), holds significant potential in revolutionizing emergency care delivery by optimizing clinical workflows and enhancing the quality of decision-making. Although enthusiasm for integrating LLMs into emergency medicine (EM) is growing, the existing literature is characterized by a disparate collection of individual studies, conceptual analyses, and preliminary implementations. Given these complexities and gaps in understanding, a cohesive framework is needed to comprehend the existing body of knowledge on the application of LLMs in EM.</p><p><strong>Objective: </strong>Given the absence of a comprehensive framework for exploring the roles of LLMs in EM, this scoping review aims to systematically map the existing literature on LLMs' potential applications within EM and identify directions for future research. Addressing this gap will allow for informed advancements in the field.</p><p><strong>Methods: </strong>Using PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews) criteria, we searched Ovid MEDLINE, Embase, Web of Science, and Google Scholar for papers published between January 2018 and August 2023 that discussed LLMs' use in EM. We excluded other forms of AI. A total of 1994 unique titles and abstracts were screened, and each full-text paper was independently reviewed by 2 authors. Data were abstracted independently, and 5 authors performed a collaborative quantitative and qualitative synthesis of the data.</p><p><strong>Results: </strong>A total of 43 papers were included. Studies were predominantly from 2022 to 2023 and conducted in the United States and China. We uncovered four major themes: (1) clinical decision-making and support was highlighted as a pivotal area, with LLMs playing a substantial role in enhancing patient care, notably through their application in real-time triage, allowing early recognition of patient urgency; (2) efficiency, workflow, and information management demonstrated the capacity of LLMs to significantly boost operational efficiency, particularly through the automation of patient record synthesis, which could reduce administrative burden and enhance patient-centric care; (3) risks, ethics, and transparency were identified as areas of concern, especially regarding the reliability of LLMs' outputs, and specific studies highlighted the challenges of ensuring unbiased decision-making amidst potentially flawed training data sets, stressing the importance of thorough validation and ethical oversight; and (4) education and communication possibilities included LLMs' capacity to enrich medical training, such as through using simulated patient interactions that enhance communication skills.</p><p><strong>Conclusions: </strong>LLMs have the potential to fundamentally transform EM, enhancing clinical decision-making, optimizing workflows, and improving patient outc","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2024-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11127144/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140905207","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluation of SNOMED CT Grouper Accuracy and Coverage in Organizing the Electronic Health Record Problem List by Clinical System: Observational Study 评估 SNOMED CT 分类器在按临床系统整理电子健康记录问题列表时的准确性和覆盖范围:观察研究
IF 3.2 3区 医学 Q1 Health Professions Pub Date : 2024-05-09 DOI: 10.2196/51274
Rashaud Senior, Timothy Tsai, William Ratliff, Lisa Nadler, Suresh Balu, Elizabeth Malcolm, Eugenia McPeek Hinz
Background: The Problem List (PL) is often poorly organized which makes the use for clinical care more challenging over time. Objective: To measure the accuracy of diagnoses sorting for PL system/conditions groupers based on SNOMED-CT concepts mapped to ICD-10 codes. Methods: We developed 21 system/condition-based groupers using SNOMED-CT hierarchal concepts refined with Boolean logic to re-organize the ICD-10-based PL in our electronic health record (EHR). We extracted the PL from a convenience sample of 50 patients divided across age and sex in a deidentified format for evaluation. Two clinicians independently determined whether a PL diagnosis was correctly attributed to a system/condition grouper. Discrepancies were discussed and, if no consensus was reached, were adjudicated by a third clinician. Descriptive statistics and Cohen’s kappa statistic for interrater reliability were calculated. Results: Our 50-patient sample had a total of 869 diagnoses (range 4–59; median 12, IQR 9-23.75). The reviewers initially agreed on 821 placements. Of the remaining 48 items, 16 required adjudication, leading to a final count of 787 True Positives and 37 True Negatives. We determined PL diagnoses were grouped with Sensitivity 97.6%, Specificity 58.7%, Positive Predictive Value 96.8%, and F1 Score 0.972. After discussion, the calculated kappa statistic was 0.9, confirming “near perfect” agreement. Conclusions: We successfully developed a structured methodology to organize diagnoses on the problem list that supports clinical review.
背景:问题清单(PL)通常组织不佳,因此随着时间的推移,在临床护理中使用问题清单变得更具挑战性。目的根据与 ICD-10 编码相对应的 SNOMED-CT 概念,测量 PL 系统/病症分组器诊断分类的准确性。方法:我们开发了 21 个基于系统/病症的分组器,使用 SNOMED-CT 层次化概念和布尔逻辑来重新组织电子病历 (EHR) 中基于 ICD-10 的 PL。我们从 50 名不同年龄和性别的患者中抽取了方便抽样的 PL 进行评估。两名临床医生独立确定 PL 诊断是否正确归属于系统/病症分组。对差异进行讨论,如果无法达成共识,则由第三位临床医生裁定。计算描述性统计数字和科恩卡帕(Cohen's kappa)统计数字,以衡量检查者之间的可靠性。结果:我们的 50 例患者样本共有 869 项诊断(范围 4-59;中位数 12,IQR 9-23.75)。评审员最初就 821 项诊断达成一致。在剩余的 48 个项目中,有 16 个需要裁定,最终得出 787 个 "真阳性 "和 37 个 "真阴性"。我们确定 PL 诊断分组的灵敏度为 97.6%,特异度为 58.7%,阳性预测值为 96.8%,F1 得分为 0.972。经过讨论,计算得出的卡帕统计量为 0.9,证实了 "近乎完美 "的一致性。结论:我们成功地开发出了一种结构化方法来组织问题清单上的诊断,从而为临床审查提供支持。
{"title":"Evaluation of SNOMED CT Grouper Accuracy and Coverage in Organizing the Electronic Health Record Problem List by Clinical System: Observational Study","authors":"Rashaud Senior, Timothy Tsai, William Ratliff, Lisa Nadler, Suresh Balu, Elizabeth Malcolm, Eugenia McPeek Hinz","doi":"10.2196/51274","DOIUrl":"https://doi.org/10.2196/51274","url":null,"abstract":"Background: The Problem List (PL) is often poorly organized which makes the use for clinical care more challenging over time. Objective: To measure the accuracy of diagnoses sorting for PL system/conditions groupers based on SNOMED-CT concepts mapped to ICD-10 codes. Methods: We developed 21 system/condition-based groupers using SNOMED-CT hierarchal concepts refined with Boolean logic to re-organize the ICD-10-based PL in our electronic health record (EHR). We extracted the PL from a convenience sample of 50 patients divided across age and sex in a deidentified format for evaluation. Two clinicians independently determined whether a PL diagnosis was correctly attributed to a system/condition grouper. Discrepancies were discussed and, if no consensus was reached, were adjudicated by a third clinician. Descriptive statistics and Cohen’s kappa statistic for interrater reliability were calculated. Results: Our 50-patient sample had a total of 869 diagnoses (range 4–59; median 12, IQR 9-23.75). The reviewers initially agreed on 821 placements. Of the remaining 48 items, 16 required adjudication, leading to a final count of 787 True Positives and 37 True Negatives. We determined PL diagnoses were grouped with Sensitivity 97.6%, Specificity 58.7%, Positive Predictive Value 96.8%, and F1 Score 0.972. After discussion, the calculated kappa statistic was 0.9, confirming “near perfect” agreement. Conclusions: We successfully developed a structured methodology to organize diagnoses on the problem list that supports clinical review.","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2024-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140927505","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Implementable Prediction of Pressure Injuries in Hospitalized Adults: Model Development and Validation. 住院成人压伤的可实施预测:模型开发与验证
IF 3.2 3区 医学 Q1 Health Professions Pub Date : 2024-05-08 DOI: 10.2196/51842
Thomas J Reese, Henry J Domenico, Antonio Hernandez, Daniel W Byrne, Ryan P Moore, Jessica B Williams, Brian J Douthit, Elise Russo, Allison B McCoy, Catherine H Ivory, Bryan D Steitz, Adam Wright

Background: Numerous pressure injury prediction models have been developed using electronic health record data, yet hospital-acquired pressure injuries (HAPIs) are increasing, which demonstrates the critical challenge of implementing these models in routine care.

Objective: To help bridge the gap between development and implementation, we sought to create a model that was feasible, broadly applicable, dynamic, actionable, and rigorously validated and then compare its performance to usual care (ie, the Braden scale).

Methods: We extracted electronic health record data from 197,991 adult hospital admissions with 51 candidate features. For risk prediction and feature selection, we used logistic regression with a least absolute shrinkage and selection operator (LASSO) approach. To compare the model with usual care, we used the area under the receiver operating curve (AUC), Brier score, slope, intercept, and integrated calibration index. The model was validated using a temporally staggered cohort.

Results: A total of 5458 HAPIs were identified between January 2018 and July 2022. We determined 22 features were necessary to achieve a parsimonious and highly accurate model. The top 5 features included tracheostomy, edema, central line, first albumin measure, and age. Our model achieved higher discrimination than the Braden scale (AUC 0.897, 95% CI 0.893-0.901 vs AUC 0.798, 95% CI 0.791-0.803).

Conclusions: We developed and validated an accurate prediction model for HAPIs that surpassed the standard-of-care risk assessment and fulfilled necessary elements for implementation. Future work includes a pragmatic randomized trial to assess whether our model improves patient outcomes.

背景:利用电子健康记录数据开发了许多压力损伤预测模型,但医院获得性压力损伤(HAPIs)却在不断增加,这表明在常规护理中实施这些模型是一项严峻的挑战:为了帮助弥合开发与实施之间的差距,我们试图创建一个可行、广泛适用、动态、可操作并经过严格验证的模型,然后将其性能与常规护理(即布莱登量表)进行比较:我们提取了 197,991 例成人住院患者的电子健康记录数据,其中包含 51 个候选特征。在进行风险预测和特征选择时,我们使用了逻辑回归和最小绝对收缩与选择算子(LASSO)方法。为了将该模型与常规护理进行比较,我们使用了接收者操作曲线下面积(AUC)、布赖尔评分、斜率、截距和综合校准指数。该模型使用时间错开的队列进行了验证:2018 年 1 月至 2022 年 7 月期间,共识别出 5458 例 HAPI。我们确定有 22 个特征是建立一个简约且高度准确的模型所必需的。前 5 个特征包括气管切开、水肿、中心管、首次白蛋白测量和年龄。我们的模型比布莱登量表具有更高的区分度(AUC 0.897,95% CI 0.893-0.901 vs AUC 0.798,95% CI 0.791-0.803):我们开发并验证了一个准确的 HAPIs 预测模型,该模型超越了标准护理风险评估,满足了实施的必要条件。未来的工作包括开展一项实用随机试验,以评估我们的模型是否能改善患者的预后。
{"title":"Implementable Prediction of Pressure Injuries in Hospitalized Adults: Model Development and Validation.","authors":"Thomas J Reese, Henry J Domenico, Antonio Hernandez, Daniel W Byrne, Ryan P Moore, Jessica B Williams, Brian J Douthit, Elise Russo, Allison B McCoy, Catherine H Ivory, Bryan D Steitz, Adam Wright","doi":"10.2196/51842","DOIUrl":"10.2196/51842","url":null,"abstract":"<p><strong>Background: </strong>Numerous pressure injury prediction models have been developed using electronic health record data, yet hospital-acquired pressure injuries (HAPIs) are increasing, which demonstrates the critical challenge of implementing these models in routine care.</p><p><strong>Objective: </strong>To help bridge the gap between development and implementation, we sought to create a model that was feasible, broadly applicable, dynamic, actionable, and rigorously validated and then compare its performance to usual care (ie, the Braden scale).</p><p><strong>Methods: </strong>We extracted electronic health record data from 197,991 adult hospital admissions with 51 candidate features. For risk prediction and feature selection, we used logistic regression with a least absolute shrinkage and selection operator (LASSO) approach. To compare the model with usual care, we used the area under the receiver operating curve (AUC), Brier score, slope, intercept, and integrated calibration index. The model was validated using a temporally staggered cohort.</p><p><strong>Results: </strong>A total of 5458 HAPIs were identified between January 2018 and July 2022. We determined 22 features were necessary to achieve a parsimonious and highly accurate model. The top 5 features included tracheostomy, edema, central line, first albumin measure, and age. Our model achieved higher discrimination than the Braden scale (AUC 0.897, 95% CI 0.893-0.901 vs AUC 0.798, 95% CI 0.791-0.803).</p><p><strong>Conclusions: </strong>We developed and validated an accurate prediction model for HAPIs that surpassed the standard-of-care risk assessment and fulfilled necessary elements for implementation. Future work includes a pragmatic randomized trial to assess whether our model improves patient outcomes.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2024-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11094428/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140900549","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Comparison of Synthetic Data Generation Techniques for Control Group Survival Data in Oncology Clinical Trials: Simulation Study. 肿瘤临床试验中对照组生存数据的合成数据生成技术比较:模拟研究
IF 3.2 3区 医学 Q1 Health Professions Pub Date : 2024-05-08 DOI: 10.2196/55118
Ippei Akiya, Takuma Ishihara, Keiichi Yamamoto

Background: Synthetic patient data (SPD) generation for survival analysis in oncology trials holds significant potential for accelerating clinical development. Various machine learning methods, including classification and regression trees (CART), random forest (RF), Bayesian network (BN), and CTGAN, have been employed for this purpose, but their performance in reflecting actual patient survival data remains under investigation.

Objective: The aim of this study was to determine the most suitable SPD generation method for oncology trials, specifically focusing on both progression free survival (PFS) and overall survival (OS), which are the primary evaluation endpoints in oncology trials. To achieve this goal, we conducted a comparative simulation of 4 generation methods: CART, RF, BN, and the CTGAN, and the performance of each method was evaluated.

Methods: Using multiple clinical trial datasets, 1000 datasets were generated by using each method for each clinical trial dataset and evaluated as follows: 1) median survival time (MST) of PFS and OS, 2) hazard ratio distance (HRD), which indicates the similarity between the actual survival function and a synthetic survival function, and 3) visual analysis of Kaplan‒Meier (KM) plots. Each method's ability to mimic the statistical properties of real patient data was evaluated from these multiple angles.

Results: In most simulation cases, CART demonstrated the high percentages of MSTs of synthetic data falling within the range of 95% confidence interval (CI) of the MST of actual data. These percentages ranged from 88.8% to 98.0% for PFS and from 60.8% to 96.1% for OS. In the evaluation of HRD, CART demonstrated that HRD values were concentrated at approximately 0.9. Conversely, for the other methods, no consistent trend was observed for either PFS or OS. The reason why CART demonstrated better similarity than RF was that CART caused overfitting and RF, which is a kind of ensemble learning, prevented it. In SPD generation, the statistical properties close to the actual data should be the focus, not a well-generalized prediction model. Both the BN and CTGAN methods cannot accurately reflect the statistical properties of the actual data because small datasets are not suitable.

Conclusions: As a method for generating SPD for survival data from small datasets, such as clinical trial data, CART demonstrated to be the most effective method compared to RF, BN, and CTGAN. Additionally, it is possible to improve CART-based generation methods by incorporating feature engineering and other methods in future work.

背景:在肿瘤试验中生成用于生存分析的合成患者数据(SPD)对于加速临床开发具有巨大潜力。为此,人们采用了多种机器学习方法,包括分类和回归树(CART)、随机森林(RF)、贝叶斯网络(BN)和 CTGAN,但这些方法在反映实际患者生存数据方面的性能仍有待研究:本研究的目的是确定最适合肿瘤试验的 SPD 生成方法,尤其关注肿瘤试验的主要评价终点--无进展生存期(PFS)和总生存期(OS)。为了实现这一目标,我们对 4 种生成方法进行了比较模拟:方法:使用多个临床试验数据集,针对每个临床试验数据集使用每种方法生成 1000 个数据集,并进行如下评估:1)PFS 和 OS 的平均生存时间(MST);2)危险比距离(HRD),表示实际生存函数与合成生存函数之间的相似性;3)Kaplan-Meier(KM)图的视觉分析。从这些角度对每种方法模拟真实患者数据统计特性的能力进行了评估:在大多数模拟病例中,CART 显示的 MSTS 在 MSTA 95% 置信区间 (CI) 范围内的百分比很高。PFS和OS的百分比分别为88.8%和98.0%,OS的百分比分别为60.8%和96.1%。在 HRD 评估中,CART 显示 HRD 值集中在 0.9 左右。相反,对于其他方法,PFS 和 OS 均未观察到一致的趋势。CART 比 RF 显示出更好的相似性,原因在于 CART 会导致过度拟合,而 RF 作为一种集合学习,可以防止过度拟合。在生成 SPD 时,重点应放在与实际数据相近的统计特性上,而不是一个通用的预测模型。BN和CTGAN方法都无法准确反映实际数据的统计特性,因为小数据集并不适合:作为从临床试验数据等小数据集生成生存数据 SPD 的方法,与 RF、BN 和 CTGAN 相比,CART 被证明是最有效的方法。此外,在未来的工作中,还可以通过结合特征工程和其他方法来改进基于 CART 的生成方法:
{"title":"A Comparison of Synthetic Data Generation Techniques for Control Group Survival Data in Oncology Clinical Trials: Simulation Study.","authors":"Ippei Akiya, Takuma Ishihara, Keiichi Yamamoto","doi":"10.2196/55118","DOIUrl":"10.2196/55118","url":null,"abstract":"<p><strong>Background: </strong>Synthetic patient data (SPD) generation for survival analysis in oncology trials holds significant potential for accelerating clinical development. Various machine learning methods, including classification and regression trees (CART), random forest (RF), Bayesian network (BN), and CTGAN, have been employed for this purpose, but their performance in reflecting actual patient survival data remains under investigation.</p><p><strong>Objective: </strong>The aim of this study was to determine the most suitable SPD generation method for oncology trials, specifically focusing on both progression free survival (PFS) and overall survival (OS), which are the primary evaluation endpoints in oncology trials. To achieve this goal, we conducted a comparative simulation of 4 generation methods: CART, RF, BN, and the CTGAN, and the performance of each method was evaluated.</p><p><strong>Methods: </strong>Using multiple clinical trial datasets, 1000 datasets were generated by using each method for each clinical trial dataset and evaluated as follows: 1) median survival time (MST) of PFS and OS, 2) hazard ratio distance (HRD), which indicates the similarity between the actual survival function and a synthetic survival function, and 3) visual analysis of Kaplan‒Meier (KM) plots. Each method's ability to mimic the statistical properties of real patient data was evaluated from these multiple angles.</p><p><strong>Results: </strong>In most simulation cases, CART demonstrated the high percentages of MSTs of synthetic data falling within the range of 95% confidence interval (CI) of the MST of actual data. These percentages ranged from 88.8% to 98.0% for PFS and from 60.8% to 96.1% for OS. In the evaluation of HRD, CART demonstrated that HRD values were concentrated at approximately 0.9. Conversely, for the other methods, no consistent trend was observed for either PFS or OS. The reason why CART demonstrated better similarity than RF was that CART caused overfitting and RF, which is a kind of ensemble learning, prevented it. In SPD generation, the statistical properties close to the actual data should be the focus, not a well-generalized prediction model. Both the BN and CTGAN methods cannot accurately reflect the statistical properties of the actual data because small datasets are not suitable.</p><p><strong>Conclusions: </strong>As a method for generating SPD for survival data from small datasets, such as clinical trial data, CART demonstrated to be the most effective method compared to RF, BN, and CTGAN. Additionally, it is possible to improve CART-based generation methods by incorporating feature engineering and other methods in future work.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2024-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140900554","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Electronic Health Record–Integrated Application for Standardizing Care and Monitoring Patients With Autosomal Dominant Polycystic Kidney Disease Enrolled in a Tolvaptan Clinic: Design and Implementation Study 电子健康记录集成应用程序,用于规范护理和监测加入托伐普坦诊所的常染色体显性多囊肾患者:设计与实施研究
IF 3.2 3区 医学 Q1 Health Professions Pub Date : 2024-05-01 DOI: 10.2196/50164
Maroun Chedid, Fouad T Chebib, Erin Dahlen, Theodore Mueller, Theresa Schnell, Melissa Gay, Musab Hommos, Sundararaman Swaminathan, Arvind Garg, Michael Mao, Brigid Amberg, Kirk Balderes, Karen F Johnson, Alyssa Bishop, Jackqueline Kay Vaughn, Marie Hogan, Vicente Torres, Rajeev Chaudhry, Ziad Zoghby
Background: Tolvaptan is the only FDA-approved drug to slow the progression of autosomal dominant polycystic kidney disease (ADPKD) but requires strict clinical monitoring due to potential serious adverse events. Objective: We share our experience in developing and implementing an electronic health record (EHR) based application to monitor patients with ADPKD initiated on Tolvaptan. Methods: The application was developed in collaboration with clinical informatics based on our clinical protocol with frequent laboratory test monitoring to detect early drug-related toxicity. The application streamlines clinical workflow and enables our nursing team to take appropriate actions in real-time to prevent drug-related serious adverse events. We retrospectively analyzed the characteristics of enrolled patients. Results: As of September 2022, 214 patients were enrolled in the Tolvaptan program across all Mayo Clinic sites. Of these, 126 were enrolled in the “Tolvaptan monitoring application” and 88 in the “Past Tolvaptan patients’ application”. The mean age at enrollment was 43.1±9.9 years. A total of 20 (9.3%) developed liver toxicity but only 5 (2.3%) had to discontinue the drug. The two EHR-based applications allow consolidation of all necessary patient information and real-time data management at the individual or population level. This approach facilitates efficient staff workflow, monitoring of drug-related adverse events, and timely prescription renewal. Conclusions: Our study highlights the feasibility of integrating digital applications into the EHR workflow, to facilitate efficient and safe care delivery for patients enrolled in a Tolvaptan program. This workflow needs further validation but could be extended to other healthcare systems managing chronic disease requiring drug monitoring.
背景介绍托伐普坦(Tolvaptan)是美国食品及药物管理局(FDA)批准的唯一一种用于延缓常染色体显性多囊肾病(ADPKD)进展的药物,但由于其潜在的严重不良反应,需要严格的临床监测。目标:我们分享了在开发和实施基于电子健康记录(EHR)的应用程序以监测开始服用托伐普坦的 ADPKD 患者方面的经验。方法:该应用是我们与临床信息学合作开发的,以临床方案为基础,通过频繁的实验室检测监测来检测早期药物相关毒性。该应用程序简化了临床工作流程,使我们的护理团队能够实时采取适当措施,预防与药物相关的严重不良事件。我们回顾性分析了入组患者的特征。结果截至 2022 年 9 月,梅奥诊所所有医疗点共有 214 名患者加入了托伐普坦项目。其中,126 人加入了 "托伐普坦监测申请",88 人加入了 "既往托伐普坦患者申请"。注册时的平均年龄为 43.1±9.9 岁。共有 20 人(9.3%)出现肝毒性,但只有 5 人(2.3%)不得不停药。这两款基于电子病历的应用软件可以整合所有必要的患者信息,并在个人或群体层面进行实时数据管理。这种方法有助于提高员工工作流程的效率,监测与药物相关的不良事件,并及时更新处方。结论:我们的研究强调了将数字应用程序整合到电子病历工作流程中的可行性,以促进为加入托伐普坦计划的患者提供高效、安全的护理服务。这一工作流程需要进一步验证,但可以推广到其他需要药物监测的慢性病管理医疗系统。
{"title":"An Electronic Health Record–Integrated Application for Standardizing Care and Monitoring Patients With Autosomal Dominant Polycystic Kidney Disease Enrolled in a Tolvaptan Clinic: Design and Implementation Study","authors":"Maroun Chedid, Fouad T Chebib, Erin Dahlen, Theodore Mueller, Theresa Schnell, Melissa Gay, Musab Hommos, Sundararaman Swaminathan, Arvind Garg, Michael Mao, Brigid Amberg, Kirk Balderes, Karen F Johnson, Alyssa Bishop, Jackqueline Kay Vaughn, Marie Hogan, Vicente Torres, Rajeev Chaudhry, Ziad Zoghby","doi":"10.2196/50164","DOIUrl":"https://doi.org/10.2196/50164","url":null,"abstract":"Background: Tolvaptan is the only FDA-approved drug to slow the progression of autosomal dominant polycystic kidney disease (ADPKD) but requires strict clinical monitoring due to potential serious adverse events. Objective: We share our experience in developing and implementing an electronic health record (EHR) based application to monitor patients with ADPKD initiated on Tolvaptan. Methods: The application was developed in collaboration with clinical informatics based on our clinical protocol with frequent laboratory test monitoring to detect early drug-related toxicity. The application streamlines clinical workflow and enables our nursing team to take appropriate actions in real-time to prevent drug-related serious adverse events. We retrospectively analyzed the characteristics of enrolled patients. Results: As of September 2022, 214 patients were enrolled in the Tolvaptan program across all Mayo Clinic sites. Of these, 126 were enrolled in the “Tolvaptan monitoring application” and 88 in the “Past Tolvaptan patients’ application”. The mean age at enrollment was 43.1±9.9 years. A total of 20 (9.3%) developed liver toxicity but only 5 (2.3%) had to discontinue the drug. The two EHR-based applications allow consolidation of all necessary patient information and real-time data management at the individual or population level. This approach facilitates efficient staff workflow, monitoring of drug-related adverse events, and timely prescription renewal. Conclusions: Our study highlights the feasibility of integrating digital applications into the EHR workflow, to facilitate efficient and safe care delivery for patients enrolled in a Tolvaptan program. This workflow needs further validation but could be extended to other healthcare systems managing chronic disease requiring drug monitoring.","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140833238","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Semantic Interoperability of Electronic Health Records: Systematic Review of Alternative Approaches for Enhancing Patient Information Availability 电子健康记录的语义互操作性:提高患者信息可用性的替代方法系统回顾
IF 3.2 3区 医学 Q1 Health Professions Pub Date : 2024-04-25 DOI: 10.2196/53535
Sari Palojoki, Lasse Lehtonen, Riikka Vuokko
Background: Semantic interoperability facilitates the exchange of and access to health data that are being documented in Electronic Health Records (EHRs) with various semantic features. The main goals of semantic interoperability development entails patient data availability and use in diverse EHRs without loss of meaning. Internationally, there are current initiatives that aim to enhance semantic development of EHR data, and consequently, availability of patient data. Interoperability between health information systems is among the core goals of proposal for a regulation on the European Health Data Space and the WHO Global strategy on digital health. Objective: To achieve integrated health data ecosystems, stakeholders need to overcome challenges of implementing semantic interoperability elements. To research the available scientific evidence on the development of semantic interoperability, we defined the following research questions: What are the key elements of and approaches for building semantic interoperability integrated in EHRs? What kinds of goals are driving the development? What kinds of clinical benefits are perceived following this development? Methods: Our research questions focused on key aspects and approaches for semantic interoperability and on possible clinical and semantic benefits in EHR context of these choices. For that purpose, we performed a systematic literature review in PubMed by defining our study framework based on previous research. Results: Our analysis consisted of 14 studies where data models, ontologies, terminologies, classifications, and standards were applied for building interoperability. All articles reported clinical benefits of the selected approach to enhancing semantic interoperability. We identified three main categories for this purpose: increasing availability of data for clinicians (n = 6), increasing quality of care (n = 4) and enhancing clinical data use and re-use for varied purposes (n = 4). Regarding semantic development goals, data harmonization and developing semantic interoperability between different EHRs was the largest category (n = 8). Enhancing health data quality through standardization (n = 5) and developing EHR integrated tools based on interoperable data (n = 1) were the other identified categories. The results were closely coupled with the need to build usable and computable data out of heterogeneous medical information that is accessible through various EHRs and databases, e.g., registers. Conclusions: When heading towards semantic harmonization of clinical data, more experiences and analyses are needed to assess how applicable the chosen solutions are for semantic interoperability of health care data. Instead of promoting a single approach, semantic interoperability should be assessed through several levels of semantic requirements A dual- or multi-model approach is possibly usable to address different semantic interoperability issues during development. The objectives of semanti
背景:语义互操作性有助于交换和访问电子健康记录(EHR)中记录的具有各种语义特征的健康数据。语义互操作性开发的主要目标是在不失去意义的情况下,在不同的电子病历中提供和使用病人数据。目前,国际上有一些旨在加强电子健康记录数据语义开发的倡议,从而提高患者数据的可用性。医疗信息系统之间的互操作性是欧洲健康数据空间法规提案和世界卫生组织全球数字健康战略的核心目标之一。目标:为了实现集成的健康数据生态系统,利益相关者需要克服实施语义互操作性要素的挑战。为了研究有关语义互操作性发展的现有科学证据,我们确定了以下研究问题:在电子病历中集成语义互操作性的关键要素和方法是什么?推动发展的目标是什么?这种发展带来了哪些临床益处?研究方法:我们的研究问题主要集中在语义互操作性的关键方面和方法,以及这些选择在电子病历中可能带来的临床和语义益处。为此,我们在先前研究的基础上确定了研究框架,在 PubMed 上进行了系统的文献综述。结果我们的分析包括 14 项研究,这些研究应用了数据模型、本体、术语、分类和标准来建立互操作性。所有文章都报告了所选方法对提高语义互操作性的临床益处。我们为此确定了三个主要类别:提高临床医生的数据可用性(6 篇)、提高医疗质量(4 篇)以及提高临床数据的使用和重复使用(4 篇)。关于语义开发目标,数据协调和开发不同电子病历之间的语义互操作性是最大的类别(n = 8)。通过标准化提高健康数据质量(n = 5)和开发基于互操作数据的电子健康记录集成工具(n = 1)是其他已确定的类别。这些结果与从可通过各种电子健康记录和数据库(如登记册)获取的异构医疗信息中建立可用和可计算数据的需求密切相关。结论在实现临床数据的语义统一时,需要更多的经验和分析来评估所选择的解决方案对医疗数据语义互操作性的适用性。在开发过程中,可以采用双模型或多模型方法来解决不同的语义互操作性问题,而不是推广单一的方法。语义互操作性的目标是在分散和互不关联的临床护理环境中实现的。因此,提高临床数据可用性的方法应经过充分准备、深思熟虑和论证,以实现经济上可持续的长期成果。临床试验:无
{"title":"Semantic Interoperability of Electronic Health Records: Systematic Review of Alternative Approaches for Enhancing Patient Information Availability","authors":"Sari Palojoki, Lasse Lehtonen, Riikka Vuokko","doi":"10.2196/53535","DOIUrl":"https://doi.org/10.2196/53535","url":null,"abstract":"Background: Semantic interoperability facilitates the exchange of and access to health data that are being documented in Electronic Health Records (EHRs) with various semantic features. The main goals of semantic interoperability development entails patient data availability and use in diverse EHRs without loss of meaning. Internationally, there are current initiatives that aim to enhance semantic development of EHR data, and consequently, availability of patient data. Interoperability between health information systems is among the core goals of proposal for a regulation on the European Health Data Space and the WHO Global strategy on digital health. Objective: To achieve integrated health data ecosystems, stakeholders need to overcome challenges of implementing semantic interoperability elements. To research the available scientific evidence on the development of semantic interoperability, we defined the following research questions: What are the key elements of and approaches for building semantic interoperability integrated in EHRs? What kinds of goals are driving the development? What kinds of clinical benefits are perceived following this development? Methods: Our research questions focused on key aspects and approaches for semantic interoperability and on possible clinical and semantic benefits in EHR context of these choices. For that purpose, we performed a systematic literature review in PubMed by defining our study framework based on previous research. Results: Our analysis consisted of 14 studies where data models, ontologies, terminologies, classifications, and standards were applied for building interoperability. All articles reported clinical benefits of the selected approach to enhancing semantic interoperability. We identified three main categories for this purpose: increasing availability of data for clinicians (n = 6), increasing quality of care (n = 4) and enhancing clinical data use and re-use for varied purposes (n = 4). Regarding semantic development goals, data harmonization and developing semantic interoperability between different EHRs was the largest category (n = 8). Enhancing health data quality through standardization (n = 5) and developing EHR integrated tools based on interoperable data (n = 1) were the other identified categories. The results were closely coupled with the need to build usable and computable data out of heterogeneous medical information that is accessible through various EHRs and databases, e.g., registers. Conclusions: When heading towards semantic harmonization of clinical data, more experiences and analyses are needed to assess how applicable the chosen solutions are for semantic interoperability of health care data. Instead of promoting a single approach, semantic interoperability should be assessed through several levels of semantic requirements A dual- or multi-model approach is possibly usable to address different semantic interoperability issues during development. The objectives of semanti","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2024-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140804363","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
JMIR Medical Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1