首页 > 最新文献

JMIR Medical Informatics最新文献

英文 中文
Predicting Pain Response to a Remote Musculoskeletal Care Program for Low Back Pain Management: Development of a Prediction Tool. 预测腰背痛患者对远程肌肉骨骼护理计划的疼痛反应:开发预测工具
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-11-19 DOI: 10.2196/64806
Anabela C Areias, Robert G Moulder, Maria Molinos, Dora Janela, Virgílio Bento, Carolina Moreira, Vijay Yanamadala, Steven P Cohen, Fernando Dias Correia, Fabíola Costa

Background: Low back pain (LBP) presents with diverse manifestations, necessitating personalized treatment approaches that recognize various phenotypes within the same diagnosis, which could be achieved through precision medicine. Although prediction strategies have been explored, including those employing artificial intelligence (AI), they still lack scalability and real-time capabilities. Digital care programs (DCPs) facilitate seamless data collection through the Internet of Things and cloud storage, creating an ideal environment for developing and implementing an AI predictive tool to assist clinicians in dynamically optimizing treatment.

Objective: This study aims to develop an AI tool that continuously assists physical therapists in predicting an individual's potential for achieving clinically significant pain relief by the end of the program. A secondary aim was to identify predictors of pain nonresponse to guide treatment adjustments.

Methods: Data collected actively (eg, demographic and clinical information) and passively in real-time (eg, range of motion, exercise performance, and socioeconomic data from public data sources) from 6125 patients enrolled in a remote digital musculoskeletal intervention program were stored in the cloud. Two machine learning techniques, recurrent neural networks (RNNs) and light gradient boosting machine (LightGBM), continuously analyzed session updates up to session 7 to predict the likelihood of achieving significant pain relief at the program end. Model performance was assessed using the area under the receiver operating characteristic curve (ROC-AUC), precision-recall curves, specificity, and sensitivity. Model explainability was assessed using SHapley Additive exPlanations values.

Results: At each session, the model provided a prediction about the potential of being a pain responder, with performance improving over time (P<.001). By session 7, the RNN achieved an ROC-AUC of 0.70 (95% CI 0.65-0.71), and the LightGBM achieved an ROC-AUC of 0.71 (95% CI 0.67-0.72). Both models demonstrated high specificity in scenarios prioritizing high precision. The key predictive features were pain-associated domains, exercise performance, motivation, and compliance, informing continuous treatment adjustments to maximize response rates.

Conclusions: This study underscores the potential of an AI predictive tool within a DCP to enhance the management of LBP, supporting physical therapists in redirecting care pathways early and throughout the treatment course. This approach is particularly important for addressing the heterogeneous phenotypes observed in LBP.

Trial registration: ClinicalTrials.gov NCT04092946; https://clinicaltrials.gov/ct2/show/NCT04092946 and NCT05417685; https://clinicaltrials.gov/ct2/show/NCT05417685.

背景:腰背痛(LBP)的表现多种多样,因此需要个性化的治疗方法来识别同一诊断中的各种表型,这可以通过精准医疗来实现。虽然人们已经探索出了一些预测策略,包括采用人工智能(AI)的策略,但这些策略仍然缺乏可扩展性和实时性。数字护理方案(DCP)通过物联网和云存储促进了无缝数据收集,为开发和实施人工智能预测工具创造了理想的环境,以协助临床医生动态优化治疗:本研究旨在开发一种人工智能工具,持续协助物理治疗师预测个人在项目结束时实现临床显著疼痛缓解的潜力。次要目的是确定疼痛无反应的预测因素,以指导治疗调整:从 6125 名参加远程数字肌肉骨骼干预计划的患者中主动收集的数据(如人口统计学和临床信息)和实时被动收集的数据(如运动范围、运动表现和来自公共数据来源的社会经济数据)都存储在云中。两种机器学习技术--递归神经网络(RNNs)和轻梯度提升机(LightGBM)--持续分析了直至第 7 次的会话更新,以预测在项目结束时疼痛得到明显缓解的可能性。使用接收者操作特征曲线下面积(ROC-AUC)、精确度-召回曲线、特异性和灵敏度评估模型性能。使用 SHapley Additive exPlanations 值评估模型的可解释性:结果:在每次治疗过程中,模型都能对疼痛反应者的潜力做出预测,而且随着时间的推移,模型的性能也在不断提高(PC 结论:这项研究强调了疼痛反应模型的潜力:这项研究强调了在 DCP 中使用人工智能预测工具的潜力,该工具可加强对 LBP 的管理,支持物理治疗师在早期和整个治疗过程中调整护理路径。这种方法对于解决枸杞多糖症中观察到的异质性表型尤为重要:ClinicalTrials.gov NCT04092946; https://clinicaltrials.gov/ct2/show/NCT04092946 和 NCT05417685; https://clinicaltrials.gov/ct2/show/NCT05417685。
{"title":"Predicting Pain Response to a Remote Musculoskeletal Care Program for Low Back Pain Management: Development of a Prediction Tool.","authors":"Anabela C Areias, Robert G Moulder, Maria Molinos, Dora Janela, Virgílio Bento, Carolina Moreira, Vijay Yanamadala, Steven P Cohen, Fernando Dias Correia, Fabíola Costa","doi":"10.2196/64806","DOIUrl":"https://doi.org/10.2196/64806","url":null,"abstract":"<p><strong>Background: </strong>Low back pain (LBP) presents with diverse manifestations, necessitating personalized treatment approaches that recognize various phenotypes within the same diagnosis, which could be achieved through precision medicine. Although prediction strategies have been explored, including those employing artificial intelligence (AI), they still lack scalability and real-time capabilities. Digital care programs (DCPs) facilitate seamless data collection through the Internet of Things and cloud storage, creating an ideal environment for developing and implementing an AI predictive tool to assist clinicians in dynamically optimizing treatment.</p><p><strong>Objective: </strong>This study aims to develop an AI tool that continuously assists physical therapists in predicting an individual's potential for achieving clinically significant pain relief by the end of the program. A secondary aim was to identify predictors of pain nonresponse to guide treatment adjustments.</p><p><strong>Methods: </strong>Data collected actively (eg, demographic and clinical information) and passively in real-time (eg, range of motion, exercise performance, and socioeconomic data from public data sources) from 6125 patients enrolled in a remote digital musculoskeletal intervention program were stored in the cloud. Two machine learning techniques, recurrent neural networks (RNNs) and light gradient boosting machine (LightGBM), continuously analyzed session updates up to session 7 to predict the likelihood of achieving significant pain relief at the program end. Model performance was assessed using the area under the receiver operating characteristic curve (ROC-AUC), precision-recall curves, specificity, and sensitivity. Model explainability was assessed using SHapley Additive exPlanations values.</p><p><strong>Results: </strong>At each session, the model provided a prediction about the potential of being a pain responder, with performance improving over time (P<.001). By session 7, the RNN achieved an ROC-AUC of 0.70 (95% CI 0.65-0.71), and the LightGBM achieved an ROC-AUC of 0.71 (95% CI 0.67-0.72). Both models demonstrated high specificity in scenarios prioritizing high precision. The key predictive features were pain-associated domains, exercise performance, motivation, and compliance, informing continuous treatment adjustments to maximize response rates.</p><p><strong>Conclusions: </strong>This study underscores the potential of an AI predictive tool within a DCP to enhance the management of LBP, supporting physical therapists in redirecting care pathways early and throughout the treatment course. This approach is particularly important for addressing the heterogeneous phenotypes observed in LBP.</p><p><strong>Trial registration: </strong>ClinicalTrials.gov NCT04092946; https://clinicaltrials.gov/ct2/show/NCT04092946 and NCT05417685; https://clinicaltrials.gov/ct2/show/NCT05417685.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"12 ","pages":"e64806"},"PeriodicalIF":3.1,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142677926","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Medication Prescription Policy for US Veterans With Metastatic Castration-Resistant Prostate Cancer: Causal Machine Learning Approach. 美国退伍军人转移性阉割抗性前列腺癌患者的药物处方政策:因果机器学习方法。
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-11-19 DOI: 10.2196/59480
Deepika Gopukumar, Nirup Menon, Martin W Schoen
<p><strong>Background: </strong>Prostate cancer is the second leading cause of death among American men. If detected and treated at an early stage, prostate cancer is often curable. However, an advanced stage such as metastatic castration-resistant prostate cancer (mCRPC) has a high risk of mortality. Multiple treatment options exist, the most common included docetaxel, abiraterone, and enzalutamide. Docetaxel is a cytotoxic chemotherapy, whereas abiraterone and enzalutamide are androgen receptor pathway inhibitors (ARPI). ARPIs are preferred over docetaxel due to lower toxicity. No study has used machine learning with patients' demographics, test results, and comorbidities to identify heterogeneous treatment rules that might improve the survival duration of patients with mCRPC.</p><p><strong>Objective: </strong>This study aimed to measure patient-level heterogeneity in the association of medication prescribed with overall survival duration (in the form of follow-up days) and arrive at a set of medication prescription rules using patient demographics, test results, and comorbidities.</p><p><strong>Methods: </strong>We excluded patients with mCRPC who were on docetaxel, cabaxitaxel, mitoxantrone, and sipuleucel-T either before or after the prescription of an ARPI. We included only the African American and white populations. In total, 2886 identified veterans treated for mCRPC who were prescribed either abiraterone or enzalutamide as the first line of treatment from 2014 to 2017, with follow-up until 2020, were analyzed. We used causal survival forests for analysis. The unit level of analysis was the patient. The primary outcome of this study was follow-up days indicating survival duration while on the first-line medication. After estimating the treatment effect, a prescription policy tree was constructed.</p><p><strong>Results: </strong>For 2886 veterans, enzalutamide is associated with an average of 59.94 (95% CI 35.60-84.28) more days of survival than abiraterone. The increase in overall survival duration for the 2 drugs varied across patient demographics, test results, and comorbidities. Two data-driven subgroups of patients were identified by ranking them on their augmented inverse-propensity weighted (AIPW) scores. The average AIPW scores for the 2 subgroups were 19.36 (95% CI -16.93 to 55.65) and 100.68 (95% CI 62.46-138.89). Based on visualization and t test, the AIPW score for low and high subgroups was significant (P=.003), thereby supporting heterogeneity. The analysis resulted in a set of prescription rules for the 2 ARPIs based on a few covariates available to the physicians at the time of prescription.</p><p><strong>Conclusions: </strong>This study of 2886 veterans showed evidence of heterogeneity and that survival days may be improved for certain patients with mCRPC based on the medication prescribed. Findings suggest that prescription rules based on the patient characteristics, laboratory test results, and comorbidities available to
背景:前列腺癌是导致美国男性死亡的第二大原因。如果在早期发现并治疗,前列腺癌通常是可以治愈的。但是,晚期前列腺癌(如转移性抗性前列腺癌)的死亡风险很高。目前有多种治疗方案,最常见的包括多西他赛、阿比特龙和恩杂鲁胺。多西他赛是一种细胞毒性化疗,而阿比特龙和恩扎鲁胺则是雄激素受体通路抑制剂(ARPI)。由于毒性较低,与多西他赛相比,ARPIs更受青睐。目前还没有研究利用机器学习患者的人口统计学特征、检查结果和合并症来识别异质性治疗规则,从而改善mCRPC患者的生存期:本研究旨在测量处方药物与总生存期(以随访天数的形式表示)相关性的患者层面异质性,并利用患者人口统计学、检验结果和合并症得出一套处方药物规则:我们排除了在开具 ARPI 处方之前或之后使用多西他赛、卡巴西他赛、米托蒽醌和西普利昔单抗的 mCRPC 患者。我们仅纳入了非裔美国人和白人。我们总共分析了2886名在2014年至2017年期间接受过阿比特龙或恩杂鲁胺一线治疗的mCRPC退伍军人,他们的随访将持续到2020年。我们采用因果生存森林进行分析。分析单位为患者。本研究的主要结果是随访天数,表示在一线药物治疗期间的存活时间。估计治疗效果后,构建了处方政策树:在2886名退伍军人中,恩杂鲁胺比阿比特龙的平均生存天数多59.94天(95% CI 35.60-84.28)。这两种药物增加的总生存期因患者人口统计学、检测结果和合并症而异。通过对患者的增强反倾向加权(AIPW)得分进行排序,确定了两个数据驱动的患者亚组。两个亚组的平均 AIPW 得分为 19.36(95% CI -16.93-55.65)和 100.68(95% CI 62.46-138.89)。根据可视化和 t 检验,低分组和高分组的 AIPW 评分具有显著性(P=.003),因此支持异质性。分析结果显示,根据医生在开处方时掌握的几个协变量,为 2 种 ARPI 制定了一套处方规则:这项对2886名退伍军人进行的研究显示了异质性的证据,某些mCRPC患者的生存天数可能会根据处方药物的不同而有所改善。研究结果表明,根据医生在开处方时掌握的患者特征、实验室检查结果和合并症制定处方规则,可以提供个性化的治疗决策,从而提高患者的生存率。
{"title":"Medication Prescription Policy for US Veterans With Metastatic Castration-Resistant Prostate Cancer: Causal Machine Learning Approach.","authors":"Deepika Gopukumar, Nirup Menon, Martin W Schoen","doi":"10.2196/59480","DOIUrl":"https://doi.org/10.2196/59480","url":null,"abstract":"&lt;p&gt;&lt;strong&gt;Background: &lt;/strong&gt;Prostate cancer is the second leading cause of death among American men. If detected and treated at an early stage, prostate cancer is often curable. However, an advanced stage such as metastatic castration-resistant prostate cancer (mCRPC) has a high risk of mortality. Multiple treatment options exist, the most common included docetaxel, abiraterone, and enzalutamide. Docetaxel is a cytotoxic chemotherapy, whereas abiraterone and enzalutamide are androgen receptor pathway inhibitors (ARPI). ARPIs are preferred over docetaxel due to lower toxicity. No study has used machine learning with patients' demographics, test results, and comorbidities to identify heterogeneous treatment rules that might improve the survival duration of patients with mCRPC.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Objective: &lt;/strong&gt;This study aimed to measure patient-level heterogeneity in the association of medication prescribed with overall survival duration (in the form of follow-up days) and arrive at a set of medication prescription rules using patient demographics, test results, and comorbidities.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Methods: &lt;/strong&gt;We excluded patients with mCRPC who were on docetaxel, cabaxitaxel, mitoxantrone, and sipuleucel-T either before or after the prescription of an ARPI. We included only the African American and white populations. In total, 2886 identified veterans treated for mCRPC who were prescribed either abiraterone or enzalutamide as the first line of treatment from 2014 to 2017, with follow-up until 2020, were analyzed. We used causal survival forests for analysis. The unit level of analysis was the patient. The primary outcome of this study was follow-up days indicating survival duration while on the first-line medication. After estimating the treatment effect, a prescription policy tree was constructed.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Results: &lt;/strong&gt;For 2886 veterans, enzalutamide is associated with an average of 59.94 (95% CI 35.60-84.28) more days of survival than abiraterone. The increase in overall survival duration for the 2 drugs varied across patient demographics, test results, and comorbidities. Two data-driven subgroups of patients were identified by ranking them on their augmented inverse-propensity weighted (AIPW) scores. The average AIPW scores for the 2 subgroups were 19.36 (95% CI -16.93 to 55.65) and 100.68 (95% CI 62.46-138.89). Based on visualization and t test, the AIPW score for low and high subgroups was significant (P=.003), thereby supporting heterogeneity. The analysis resulted in a set of prescription rules for the 2 ARPIs based on a few covariates available to the physicians at the time of prescription.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Conclusions: &lt;/strong&gt;This study of 2886 veterans showed evidence of heterogeneity and that survival days may be improved for certain patients with mCRPC based on the medication prescribed. Findings suggest that prescription rules based on the patient characteristics, laboratory test results, and comorbidities available to","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"12 ","pages":"e59480"},"PeriodicalIF":3.1,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142677923","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data Ownership in the AI-Powered Integrative Health Care Landscape. 人工智能驱动的综合医疗保健领域的数据所有权。
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-11-19 DOI: 10.2196/57754
Shuimei Liu, L Raymond Guo

In the rapidly advancing landscape of artificial intelligence (AI) within integrative health care (IHC), the issue of data ownership has become pivotal. This study explores the intricate dynamics of data ownership in the context of IHC and the AI era, presenting the novel Collaborative Healthcare Data Ownership (CHDO) framework. The analysis delves into the multifaceted nature of data ownership, involving patients, providers, researchers, and AI developers, and addresses challenges such as ambiguous consent, attribution of insights, and international inconsistencies. Examining various ownership models, including privatization and communization postulates, as well as distributed access control, data trusts, and blockchain technology, the study assesses their potential and limitations. The proposed CHDO framework emphasizes shared ownership, defined access and control, and transparent governance, providing a promising avenue for responsible and collaborative AI integration in IHC. This comprehensive analysis offers valuable insights into the complex landscape of data ownership in IHC and the AI era, potentially paving the way for ethical and sustainable advancements in data-driven health care.

在综合医疗保健(IHC)领域人工智能(AI)快速发展的背景下,数据所有权问题变得至关重要。本研究探讨了 IHC 和人工智能时代背景下数据所有权的复杂动态,提出了新颖的协作式医疗保健数据所有权(CHDO)框架。该分析深入探讨了数据所有权的多面性,涉及患者、医疗服务提供者、研究人员和人工智能开发人员,并解决了诸如模糊同意、见解归属和国际不一致性等挑战。研究探讨了各种所有权模式,包括私有化和共用化假设,以及分布式访问控制、数据信托和区块链技术,并评估了它们的潜力和局限性。拟议的 CHDO 框架强调共享所有权、明确的访问和控制以及透明的治理,为负责任、协作性地将人工智能整合到 IHC 中提供了一条大有可为的途径。这项全面的分析为了解 IHC 和人工智能时代数据所有权的复杂情况提供了宝贵的见解,有可能为数据驱动的医疗保健的道德和可持续发展铺平道路。
{"title":"Data Ownership in the AI-Powered Integrative Health Care Landscape.","authors":"Shuimei Liu, L Raymond Guo","doi":"10.2196/57754","DOIUrl":"10.2196/57754","url":null,"abstract":"<p><p>In the rapidly advancing landscape of artificial intelligence (AI) within integrative health care (IHC), the issue of data ownership has become pivotal. This study explores the intricate dynamics of data ownership in the context of IHC and the AI era, presenting the novel Collaborative Healthcare Data Ownership (CHDO) framework. The analysis delves into the multifaceted nature of data ownership, involving patients, providers, researchers, and AI developers, and addresses challenges such as ambiguous consent, attribution of insights, and international inconsistencies. Examining various ownership models, including privatization and communization postulates, as well as distributed access control, data trusts, and blockchain technology, the study assesses their potential and limitations. The proposed CHDO framework emphasizes shared ownership, defined access and control, and transparent governance, providing a promising avenue for responsible and collaborative AI integration in IHC. This comprehensive analysis offers valuable insights into the complex landscape of data ownership in IHC and the AI era, potentially paving the way for ethical and sustainable advancements in data-driven health care.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"12 ","pages":"e57754"},"PeriodicalIF":3.1,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142670042","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Task-Specific Transformer-Based Language Models in Health Care: Scoping Review. 医疗保健中基于特定任务转换器的语言模型:范围审查。
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-11-18 DOI: 10.2196/49724
Ha Na Cho, Tae Joon Jun, Young-Hak Kim, Heejun Kang, Imjin Ahn, Hansle Gwon, Yunha Kim, Jiahn Seo, Heejung Choi, Minkyoung Kim, Jiye Han, Gaeun Kee, Seohyun Park, Soyoung Ko

Background: Transformer-based language models have shown great potential to revolutionize health care by advancing clinical decision support, patient interaction, and disease prediction. However, despite their rapid development, the implementation of transformer-based language models in health care settings remains limited. This is partly due to the lack of a comprehensive review, which hinders a systematic understanding of their applications and limitations. Without clear guidelines and consolidated information, both researchers and physicians face difficulties in using these models effectively, resulting in inefficient research efforts and slow integration into clinical workflows.

Objective: This scoping review addresses this gap by examining studies on medical transformer-based language models and categorizing them into 6 tasks: dialogue generation, question answering, summarization, text classification, sentiment analysis, and named entity recognition.

Methods: We conducted a scoping review following the Cochrane scoping review protocol. A comprehensive literature search was performed across databases, including Google Scholar and PubMed, covering publications from January 2017 to September 2024. Studies involving transformer-derived models in medical tasks were included. Data were categorized into 6 key tasks.

Results: Our key findings revealed both advancements and critical challenges in applying transformer-based models to health care tasks. For example, models like MedPIR involving dialogue generation show promise but face privacy and ethical concerns, while question-answering models like BioBERT improve accuracy but struggle with the complexity of medical terminology. The BioBERTSum summarization model aids clinicians by condensing medical texts but needs better handling of long sequences.

Conclusions: This review attempted to provide a consolidated understanding of the role of transformer-based language models in health care and to guide future research directions. By addressing current challenges and exploring the potential for real-world applications, we envision significant improvements in health care informatics. Addressing the identified challenges and implementing proposed solutions can enable transformer-based language models to significantly improve health care delivery and patient outcomes. Our review provides valuable insights for future research and practical applications, setting the stage for transformative advancements in medical informatics.

背景:基于转换器的语言模型通过推进临床决策支持、患者互动和疾病预测,已显示出彻底改变医疗保健的巨大潜力。然而,尽管基于转换器的语言模型发展迅速,但在医疗环境中的应用仍然有限。部分原因在于缺乏全面的综述,这阻碍了对其应用和局限性的系统了解。由于没有明确的指导原则和综合信息,研究人员和医生在有效使用这些模型方面都面临着困难,导致研究工作效率低下,与临床工作流程的整合缓慢:本范围综述通过研究基于医学转换器的语言模型,并将其分为对话生成、问题解答、总结、文本分类、情感分析和命名实体识别等 6 项任务,填补了这一空白:我们按照 Cochrane 范围综述协议进行了范围综述。我们在谷歌学术和PubMed等数据库中进行了全面的文献检索,涵盖了2017年1月至2024年9月期间的出版物。纳入了涉及医疗任务中变压器衍生模型的研究。数据分为 6 个关键任务:我们的主要发现揭示了将基于变压器的模型应用于医疗任务的进步和关键挑战。例如,MedPIR 等涉及对话生成的模型显示了前景,但面临隐私和伦理方面的问题,而 BioBERT 等问题解答模型提高了准确性,但在复杂的医学术语方面却举步维艰。BioBERTSum 摘要模型通过压缩医学文本来帮助临床医生,但需要更好地处理长序列:本综述试图提供对基于转换器的语言模型在医疗保健中的作用的综合理解,并为未来的研究方向提供指导。通过应对当前的挑战和探索现实世界的应用潜力,我们预计医疗信息学将得到显著改善。应对已发现的挑战并实施建议的解决方案,可使基于转换器的语言模型显著改善医疗服务的提供和患者的治疗效果。我们的综述为未来的研究和实际应用提供了宝贵的见解,为医疗信息学的变革性进步奠定了基础。
{"title":"Task-Specific Transformer-Based Language Models in Health Care: Scoping Review.","authors":"Ha Na Cho, Tae Joon Jun, Young-Hak Kim, Heejun Kang, Imjin Ahn, Hansle Gwon, Yunha Kim, Jiahn Seo, Heejung Choi, Minkyoung Kim, Jiye Han, Gaeun Kee, Seohyun Park, Soyoung Ko","doi":"10.2196/49724","DOIUrl":"10.2196/49724","url":null,"abstract":"<p><strong>Background: </strong>Transformer-based language models have shown great potential to revolutionize health care by advancing clinical decision support, patient interaction, and disease prediction. However, despite their rapid development, the implementation of transformer-based language models in health care settings remains limited. This is partly due to the lack of a comprehensive review, which hinders a systematic understanding of their applications and limitations. Without clear guidelines and consolidated information, both researchers and physicians face difficulties in using these models effectively, resulting in inefficient research efforts and slow integration into clinical workflows.</p><p><strong>Objective: </strong>This scoping review addresses this gap by examining studies on medical transformer-based language models and categorizing them into 6 tasks: dialogue generation, question answering, summarization, text classification, sentiment analysis, and named entity recognition.</p><p><strong>Methods: </strong>We conducted a scoping review following the Cochrane scoping review protocol. A comprehensive literature search was performed across databases, including Google Scholar and PubMed, covering publications from January 2017 to September 2024. Studies involving transformer-derived models in medical tasks were included. Data were categorized into 6 key tasks.</p><p><strong>Results: </strong>Our key findings revealed both advancements and critical challenges in applying transformer-based models to health care tasks. For example, models like MedPIR involving dialogue generation show promise but face privacy and ethical concerns, while question-answering models like BioBERT improve accuracy but struggle with the complexity of medical terminology. The BioBERTSum summarization model aids clinicians by condensing medical texts but needs better handling of long sequences.</p><p><strong>Conclusions: </strong>This review attempted to provide a consolidated understanding of the role of transformer-based language models in health care and to guide future research directions. By addressing current challenges and exploring the potential for real-world applications, we envision significant improvements in health care informatics. Addressing the identified challenges and implementing proposed solutions can enable transformer-based language models to significantly improve health care delivery and patient outcomes. Our review provides valuable insights for future research and practical applications, setting the stage for transformative advancements in medical informatics.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"12 ","pages":"e49724"},"PeriodicalIF":3.1,"publicationDate":"2024-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142670048","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploring the Potential of Claude 3 Opus in Renal Pathological Diagnosis: Performance Evaluation. 探索 Claude 3 Opus 在肾脏病理诊断中的潜力:性能评估。
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-11-15 DOI: 10.2196/65033
Xingyuan Li, Ke Liu, Yanlin Lang, Zhonglin Chai, Fang Liu

Background: Artificial intelligence (AI) has shown great promise in assisting medical diagnosis, but its application in renal pathology remains limited.

Objective: We evaluated the performance of an advanced AI language model, Claude 3 Opus (Anthropic), in generating diagnostic descriptions for renal pathological images.

Methods: We carefully curated a dataset of 100 representative renal pathological images from the Diagnostic Atlas of Renal Pathology (3rd edition). The image selection aimed to cover a wide spectrum of common renal diseases, ensuring a balanced and comprehensive dataset. Claude 3 Opus generated diagnostic descriptions for each image, which were scored by 2 pathologists on clinical relevance, accuracy, fluency, completeness, and overall value.

Results: Claude 3 Opus achieved a high mean score in language fluency (3.86) but lower scores in clinical relevance (1.75), accuracy (1.55), completeness (2.01), and overall value (1.75). Performance varied across disease types. Interrater agreement was substantial for relevance (κ=0.627) and overall value (κ=0.589) and moderate for accuracy (κ=0.485) and completeness (κ=0.458).

Conclusions: Claude 3 Opus shows potential in generating fluent renal pathology descriptions but needs improvement in accuracy and clinical value. The AI's performance varied across disease types. Addressing the limitations of single-source data and incorporating comparative analyses with other AI approaches are essential steps for future research. Further optimization and validation are needed for clinical applications.

背景:人工智能(AI)在辅助医疗诊断方面前景广阔,但在肾脏病理学方面的应用仍然有限:我们评估了高级人工智能语言模型 Claude 3 Opus(Anthropic)在生成肾脏病理图像诊断描述方面的性能:我们从《肾脏病理诊断图谱》(第 3 版)中精心挑选了 100 幅具有代表性的肾脏病理图像数据集。图像的选择旨在涵盖广泛的常见肾脏疾病,确保数据集的均衡性和全面性。Claude 3 Opus 为每张图像生成诊断描述,由两名病理学家根据临床相关性、准确性、流畅性、完整性和整体价值进行评分:Claude 3 Opus 在语言流畅性方面获得了较高的平均分(3.86),但在临床相关性(1.75)、准确性(1.55)、完整性(2.01)和总体价值(1.75)方面得分较低。不同疾病类型的评分结果各不相同。在相关性(κ=0.627)和总体价值(κ=0.589)方面,相互之间的一致性很高,在准确性(κ=0.485)和完整性(κ=0.458)方面,相互之间的一致性处于中等水平:Claude 3 Opus 在生成流畅的肾脏病理描述方面显示出潜力,但在准确性和临床价值方面有待提高。人工智能在不同疾病类型中的表现各不相同。解决单一来源数据的局限性并结合与其他人工智能方法的比较分析是未来研究的关键步骤。临床应用还需要进一步优化和验证。
{"title":"Exploring the Potential of Claude 3 Opus in Renal Pathological Diagnosis: Performance Evaluation.","authors":"Xingyuan Li, Ke Liu, Yanlin Lang, Zhonglin Chai, Fang Liu","doi":"10.2196/65033","DOIUrl":"10.2196/65033","url":null,"abstract":"<p><strong>Background: </strong>Artificial intelligence (AI) has shown great promise in assisting medical diagnosis, but its application in renal pathology remains limited.</p><p><strong>Objective: </strong>We evaluated the performance of an advanced AI language model, Claude 3 Opus (Anthropic), in generating diagnostic descriptions for renal pathological images.</p><p><strong>Methods: </strong>We carefully curated a dataset of 100 representative renal pathological images from the Diagnostic Atlas of Renal Pathology (3rd edition). The image selection aimed to cover a wide spectrum of common renal diseases, ensuring a balanced and comprehensive dataset. Claude 3 Opus generated diagnostic descriptions for each image, which were scored by 2 pathologists on clinical relevance, accuracy, fluency, completeness, and overall value.</p><p><strong>Results: </strong>Claude 3 Opus achieved a high mean score in language fluency (3.86) but lower scores in clinical relevance (1.75), accuracy (1.55), completeness (2.01), and overall value (1.75). Performance varied across disease types. Interrater agreement was substantial for relevance (κ=0.627) and overall value (κ=0.589) and moderate for accuracy (κ=0.485) and completeness (κ=0.458).</p><p><strong>Conclusions: </strong>Claude 3 Opus shows potential in generating fluent renal pathology descriptions but needs improvement in accuracy and clinical value. The AI's performance varied across disease types. Addressing the limitations of single-source data and incorporating comparative analyses with other AI approaches are essential steps for future research. Further optimization and validation are needed for clinical applications.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"12 ","pages":"e65033"},"PeriodicalIF":3.1,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142640117","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unintended Consequences of Data Sharing Under the Meaningful Use Program. 在 "有意义使用计划 "下数据共享的意外后果。
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-11-14 DOI: 10.2196/52675
Irmgard Ursula Willcockson, Ignacio Herman Valdes

Unlabelled: Interoperability has been designed to improve the quality and efficiency of health care. It allows the Centers for Medicare and Medicaid Services to collect data on quality measures as a part of the Meaningful Use program. Covered providers who fail to provide data have lower rates of reimbursement. Unintended consequences also arise at each step of the data collection process: (1) providers are not reimbursed for the extra time required to generate data; (2) patients do not have control over when and how their data are provided to or used by the government; and (3) large datasets increase the chances of an accidental data breach or intentional hacker attack. After detailing the issues, we describe several solutions, including an appropriate data use review board, which is designed to oversee certain aspects of the process and ensure accountability and transparency.

无标签:互操作性旨在提高医疗质量和效率。作为 "有意义使用 "计划的一部分,它允许医疗保险和医疗补助服务中心收集质量测量数据。未能提供数据的医疗服务提供者将获得较低的报销比例。数据收集过程的每一步都会产生意想不到的后果:(1) 医疗服务提供者生成数据所需的额外时间得不到补偿;(2) 患者无法控制政府何时以及如何提供或使用他们的数据;(3) 大型数据集增加了意外数据泄露或蓄意黑客攻击的几率。在详细阐述了这些问题后,我们介绍了几种解决方案,其中包括适当的数据使用审查委员会,该委员会旨在监督该过程的某些方面,并确保问责制和透明度。
{"title":"Unintended Consequences of Data Sharing Under the Meaningful Use Program.","authors":"Irmgard Ursula Willcockson, Ignacio Herman Valdes","doi":"10.2196/52675","DOIUrl":"https://doi.org/10.2196/52675","url":null,"abstract":"<p><strong>Unlabelled: </strong>Interoperability has been designed to improve the quality and efficiency of health care. It allows the Centers for Medicare and Medicaid Services to collect data on quality measures as a part of the Meaningful Use program. Covered providers who fail to provide data have lower rates of reimbursement. Unintended consequences also arise at each step of the data collection process: (1) providers are not reimbursed for the extra time required to generate data; (2) patients do not have control over when and how their data are provided to or used by the government; and (3) large datasets increase the chances of an accidental data breach or intentional hacker attack. After detailing the issues, we describe several solutions, including an appropriate data use review board, which is designed to oversee certain aspects of the process and ensure accountability and transparency.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"12 ","pages":"e52675"},"PeriodicalIF":3.1,"publicationDate":"2024-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142633523","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing Bias Assessment for Complex Term Groups in Language Embedding Models: Quantitative Comparison of Methods. 加强语言嵌入模型中复杂词组的偏差评估:方法的定量比较。
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-11-12 DOI: 10.2196/60272
Magnus Gray, Mariofanna Milanova, Leihong Wu

Background: Artificial intelligence (AI) is rapidly being adopted to build products and aid in the decision-making process across industries. However, AI systems have been shown to exhibit and even amplify biases, causing a growing concern among people worldwide. Thus, investigating methods of measuring and mitigating bias within these AI-powered tools is necessary.

Objective: In natural language processing applications, the word embedding association test (WEAT) is a popular method of measuring bias in input embeddings, a common area of measure bias in AI. However, certain limitations of the WEAT have been identified (ie, their nonrobust measure of bias and their reliance on predefined and limited groups of words or sentences), which may lead to inadequate measurements and evaluations of bias. Thus, this study takes a new approach at modifying this popular measure of bias, with a focus on making it more robust and applicable in other domains.

Methods: In this study, we introduce the SD-WEAT, which is a modified version of the WEAT that uses the SD of multiple permutations of the WEATs to calculate bias in input embeddings. With the SD-WEAT, we evaluated the biases and stability of several language embedding models, including Global Vectors for Word Representation (GloVe), Word2Vec, and bidirectional encoder representations from transformers (BERT).

Results: This method produces results comparable to those of the WEAT, with strong correlations between the methods' bias scores or effect sizes (r=0.786) and P values (r=0.776), while addressing some of its largest limitations. More specifically, the SD-WEAT is more accessible, as it removes the need to predefine attribute groups, and because the SD-WEAT measures bias over multiple runs rather than one, it reduces the impact of outliers and sample size. Furthermore, the SD-WEAT was found to be more consistent and reliable than its predecessor.

Conclusions: Thus, the SD-WEAT shows promise for robustly measuring bias in the input embeddings fed to AI language models.

背景:人工智能(AI)正被各行各业迅速用于制造产品和辅助决策过程。然而,人工智能系统已被证明会表现出甚至放大偏见,这引起了全世界人们越来越多的关注。因此,有必要研究在这些人工智能驱动的工具中测量和减轻偏见的方法:在自然语言处理应用中,词嵌入关联测试(WEAT)是测量输入嵌入偏差的常用方法,也是人工智能测量偏差的常见领域。然而,WEAT 的某些局限性已被发现(即其对偏差的非稳健测量及其对预定义和有限的单词或句子组的依赖),这可能会导致对偏差的测量和评估不充分。因此,本研究采用了一种新方法来修改这种流行的偏差测量方法,重点是使其更加稳健并适用于其他领域:在本研究中,我们引入了SD-WEAT,它是WEAT的一个改进版本,使用WEAT多重排列的SD来计算输入嵌入中的偏差。利用SD-WEAT,我们评估了几种语言嵌入模型的偏差和稳定性,包括词表示的全局向量(GloVe)、Word2Vec和来自变换器的双向编码器表示(BERT):该方法得出的结果与 WEAT 的结果相当,方法的偏差分数或效应大小(r=0.786)和 P 值(r=0.776)之间具有很强的相关性,同时解决了其最大的一些局限性。更具体地说,SD-WEAT 更易于使用,因为它无需预先定义属性组,而且由于 SD-WEAT 是通过多次运行而不是一次运行来测量偏差的,因此它减少了异常值和样本大小的影响。此外,SD-WEAT 比其前身更一致、更可靠:因此,SD-WEAT有望稳健地测量人工智能语言模型输入嵌入的偏差。
{"title":"Enhancing Bias Assessment for Complex Term Groups in Language Embedding Models: Quantitative Comparison of Methods.","authors":"Magnus Gray, Mariofanna Milanova, Leihong Wu","doi":"10.2196/60272","DOIUrl":"https://doi.org/10.2196/60272","url":null,"abstract":"<p><strong>Background: </strong>Artificial intelligence (AI) is rapidly being adopted to build products and aid in the decision-making process across industries. However, AI systems have been shown to exhibit and even amplify biases, causing a growing concern among people worldwide. Thus, investigating methods of measuring and mitigating bias within these AI-powered tools is necessary.</p><p><strong>Objective: </strong>In natural language processing applications, the word embedding association test (WEAT) is a popular method of measuring bias in input embeddings, a common area of measure bias in AI. However, certain limitations of the WEAT have been identified (ie, their nonrobust measure of bias and their reliance on predefined and limited groups of words or sentences), which may lead to inadequate measurements and evaluations of bias. Thus, this study takes a new approach at modifying this popular measure of bias, with a focus on making it more robust and applicable in other domains.</p><p><strong>Methods: </strong>In this study, we introduce the SD-WEAT, which is a modified version of the WEAT that uses the SD of multiple permutations of the WEATs to calculate bias in input embeddings. With the SD-WEAT, we evaluated the biases and stability of several language embedding models, including Global Vectors for Word Representation (GloVe), Word2Vec, and bidirectional encoder representations from transformers (BERT).</p><p><strong>Results: </strong>This method produces results comparable to those of the WEAT, with strong correlations between the methods' bias scores or effect sizes (r=0.786) and P values (r=0.776), while addressing some of its largest limitations. More specifically, the SD-WEAT is more accessible, as it removes the need to predefine attribute groups, and because the SD-WEAT measures bias over multiple runs rather than one, it reduces the impact of outliers and sample size. Furthermore, the SD-WEAT was found to be more consistent and reliable than its predecessor.</p><p><strong>Conclusions: </strong>Thus, the SD-WEAT shows promise for robustly measuring bias in the input embeddings fed to AI language models.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"12 ","pages":"e60272"},"PeriodicalIF":3.1,"publicationDate":"2024-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142640107","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Clinical Decision Support to Increase Emergency Department Naloxone Coprescribing: Implementation Report. 临床决策支持增加急诊科纳洛酮处方:实施报告。
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-11-06 DOI: 10.2196/58276
Stuart W Sommers, Heather J Tolle, Katy E Trinkley, Christine G Johnston, Caitlin L Dietsche, Stephanie V Eldred, Abraham T Wick, Jason A Hoppe
<p><strong>Background: </strong>Coprescribing naloxone with opioid analgesics is a Centers for Disease Control and Prevention (CDC) best practice to mitigate the risk of fatal opioid overdose, yet coprescription by emergency medicine clinicians is rare, occurring less than 5% of the time it is indicated. Clinical decision support (CDS) has been associated with increased naloxone prescribing; however, key CDS design characteristics and pragmatic outcome measures necessary to understand replicability and effectiveness have not been reported.</p><p><strong>Objective: </strong>This study aimed to rigorously evaluate and quantify the impact of CDS designed to improve emergency department (ED) naloxone coprescribing. We hypothesized CDS would increase naloxone coprescribing and the number of naloxone prescriptions filled by patients discharged from EDs in a large health care system.</p><p><strong>Methods: </strong>Following user-centered design principles, we designed and implemented a fully automated, interruptive, electronic health record-based CDS to nudge clinicians to coprescribe naloxone with high-risk opioid prescriptions. "High-risk" opioid prescriptions were defined as any opioid analgesic prescription ≥90 total morphine milligram equivalents per day or for patients with a prior diagnosis of opioid use disorder or opioid overdose. The Reach, Effectiveness, Adoption, Implementation, and Maintenance (RE-AIM) framework was used to evaluate pragmatic CDS outcomes of reach, effectiveness, adoption, implementation, and maintenance. Effectiveness was the primary outcome of interest and was assessed by (1) constructing a Bayesian structural time-series model of the number of ED visits with naloxone coprescriptions before and after CDS implementation and (2) calculating the percentage of naloxone prescriptions associated with CDS that were filled at an outpatient pharmacy. Mann-Kendall tests were used to evaluate longitudinal trends in CDS adoption. All outcomes were analyzed in R (version 4.2.2; R Core Team).</p><p><strong>Unlabelled: </strong>Between November 2019 and July 2023, there were 1,994,994 ED visits. CDS reached clinicians in 0.83% (16,566/1,994,994) of all visits and 15.99% (16,566/103,606) of ED visits where an opioid was prescribed at discharge. Clinicians adopted CDS, coprescribing naloxone in 34.36% (6613/19,246) of alerts. CDS was effective, increasing naloxone coprescribing from baseline by 18.1 (95% CI 17.9-18.3) coprescriptions per week or 2,327% (95% CI 3390-3490). Patients filled 43.80% (1989/4541) of naloxone coprescriptions. The CDS was implemented simultaneously at every ED and no adaptations were made to CDS postimplementation. CDS was maintained beyond the study period and maintained its effect, with adoption increasing over time (τ=0.454; P<.001).</p><p><strong>Conclusions: </strong>Our findings advance the evidence that electronic health record-based CDS increases the number of naloxone coprescriptions and improves the dis
背景:纳洛酮与阿片类镇痛药同时处方是美国疾病控制与预防中心(CDC)降低阿片类药物过量致死风险的最佳做法,但急诊科临床医生很少同时处方纳洛酮,只有不到5%的情况下需要同时处方纳洛酮。临床决策支持(CDS)与纳洛酮处方的增加有关;然而,了解可复制性和有效性所必需的临床决策支持的关键设计特征和实用结果测量方法尚未见报道:本研究旨在严格评估和量化旨在改善急诊科(ED)纳洛酮共同处方的 CDS 的影响。我们假设,在一个大型医疗保健系统中,CDS 将增加纳洛酮的共同处方量以及急诊科出院病人的纳洛酮处方数量:按照以用户为中心的设计原则,我们设计并实施了一种基于电子健康记录的全自动、中断式 CDS,以促使临床医生在开具高风险阿片类药物处方时同时开具纳洛酮处方。"高风险 "阿片类药物处方的定义是:任何阿片类镇痛药处方的总吗啡毫克当量每天≥90 毫克,或处方中的患者曾被诊断为阿片类药物使用障碍或阿片类药物过量。我们采用了 "普及、有效性、采用、实施和维持"(RE-AIM)框架来评估 CDS 在普及、有效性、采用、实施和维持方面的实际效果。有效性是主要的评估结果,其评估方法是:(1)构建一个贝叶斯结构时间序列模型,计算实施 CDS 前后使用纳洛酮处方的急诊就诊次数;(2)计算与 CDS 相关的纳洛酮处方中在门诊药房配药的百分比。Mann-Kendall 检验用于评估采用 CDS 的纵向趋势。所有结果均使用 R(4.2.2 版;R 核心团队)进行分析:2019年11月至2023年7月期间,共有1,994,994次急诊就诊。在所有就诊者中,有 0.83% (16,566/1,994,994)的临床医生采用了 CDS;在出院时开具阿片类药物处方的 ED 就诊者中,有 15.99% (16,566/103,606)的临床医生采用了 CDS。临床医生采用了 CDS,在 34.36% (6613/19246)的警报中共同处方了纳洛酮。CDS 效果显著,纳洛酮处方量比基线每周增加了 18.1(95% CI 17.9-18.3)份,即增加了 2327%(95% CI 3390-3490)。患者开出的纳洛酮处方占 43.80%(1989/4541)。CDS 在每个急诊室同时实施,实施后未对 CDS 进行调整。电子病历系统在研究期结束后继续使用并保持其效果,采用率随时间推移而增加(τ=0.454;PC结论:我们的研究结果进一步证明,基于电子健康记录的 CDS 增加了纳洛酮处方的数量,并改善了纳洛酮的分布。我们的时间序列分析对世俗趋势进行了控制,有力地证明了最小中断性 CDS 显著改善了过程结果。
{"title":"Clinical Decision Support to Increase Emergency Department Naloxone Coprescribing: Implementation Report.","authors":"Stuart W Sommers, Heather J Tolle, Katy E Trinkley, Christine G Johnston, Caitlin L Dietsche, Stephanie V Eldred, Abraham T Wick, Jason A Hoppe","doi":"10.2196/58276","DOIUrl":"10.2196/58276","url":null,"abstract":"&lt;p&gt;&lt;strong&gt;Background: &lt;/strong&gt;Coprescribing naloxone with opioid analgesics is a Centers for Disease Control and Prevention (CDC) best practice to mitigate the risk of fatal opioid overdose, yet coprescription by emergency medicine clinicians is rare, occurring less than 5% of the time it is indicated. Clinical decision support (CDS) has been associated with increased naloxone prescribing; however, key CDS design characteristics and pragmatic outcome measures necessary to understand replicability and effectiveness have not been reported.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Objective: &lt;/strong&gt;This study aimed to rigorously evaluate and quantify the impact of CDS designed to improve emergency department (ED) naloxone coprescribing. We hypothesized CDS would increase naloxone coprescribing and the number of naloxone prescriptions filled by patients discharged from EDs in a large health care system.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Methods: &lt;/strong&gt;Following user-centered design principles, we designed and implemented a fully automated, interruptive, electronic health record-based CDS to nudge clinicians to coprescribe naloxone with high-risk opioid prescriptions. \"High-risk\" opioid prescriptions were defined as any opioid analgesic prescription ≥90 total morphine milligram equivalents per day or for patients with a prior diagnosis of opioid use disorder or opioid overdose. The Reach, Effectiveness, Adoption, Implementation, and Maintenance (RE-AIM) framework was used to evaluate pragmatic CDS outcomes of reach, effectiveness, adoption, implementation, and maintenance. Effectiveness was the primary outcome of interest and was assessed by (1) constructing a Bayesian structural time-series model of the number of ED visits with naloxone coprescriptions before and after CDS implementation and (2) calculating the percentage of naloxone prescriptions associated with CDS that were filled at an outpatient pharmacy. Mann-Kendall tests were used to evaluate longitudinal trends in CDS adoption. All outcomes were analyzed in R (version 4.2.2; R Core Team).&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Unlabelled: &lt;/strong&gt;Between November 2019 and July 2023, there were 1,994,994 ED visits. CDS reached clinicians in 0.83% (16,566/1,994,994) of all visits and 15.99% (16,566/103,606) of ED visits where an opioid was prescribed at discharge. Clinicians adopted CDS, coprescribing naloxone in 34.36% (6613/19,246) of alerts. CDS was effective, increasing naloxone coprescribing from baseline by 18.1 (95% CI 17.9-18.3) coprescriptions per week or 2,327% (95% CI 3390-3490). Patients filled 43.80% (1989/4541) of naloxone coprescriptions. The CDS was implemented simultaneously at every ED and no adaptations were made to CDS postimplementation. CDS was maintained beyond the study period and maintained its effect, with adoption increasing over time (τ=0.454; P&lt;.001).&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Conclusions: &lt;/strong&gt;Our findings advance the evidence that electronic health record-based CDS increases the number of naloxone coprescriptions and improves the dis","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"12 ","pages":"e58276"},"PeriodicalIF":3.1,"publicationDate":"2024-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11560079/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142592335","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Electronic Health Record Data Quality and Performance Assessments: Scoping Review. 电子健康记录数据质量和性能评估:范围审查。
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-11-06 DOI: 10.2196/58130
Yordan P Penev, Timothy R Buchanan, Matthew M Ruppert, Michelle Liu, Ramin Shekouhi, Ziyuan Guan, Jeremy Balch, Tezcan Ozrazgat-Baslanti, Benjamin Shickel, Tyler J Loftus, Azra Bihorac

Background: Electronic health records (EHRs) have an enormous potential to advance medical research and practice through easily accessible and interpretable EHR-derived databases. Attainability of this potential is limited by issues with data quality (DQ) and performance assessment.

Objective: This review aims to streamline the current best practices on EHR DQ and performance assessments as a replicable standard for researchers in the field.

Methods: PubMed was systematically searched for original research articles assessing EHR DQ and performance from inception until May 7, 2023.

Results: Our search yielded 26 original research articles. Most articles had 1 or more significant limitations, including incomplete or inconsistent reporting (n=6, 30%), poor replicability (n=5, 25%), and limited generalizability of results (n=5, 25%). Completeness (n=21, 81%), conformance (n=18, 69%), and plausibility (n=16, 62%) were the most cited indicators of DQ, while correctness or accuracy (n=14, 54%) was most cited for data performance, with context-specific supplementation by recency (n=7, 27%), fairness (n=6, 23%), stability (n=4, 15%), and shareability (n=2, 8%) assessments. Artificial intelligence-based techniques, including natural language data extraction, data imputation, and fairness algorithms, were demonstrated to play a rising role in improving both dataset quality and performance.

Conclusions: This review highlights the need for incentivizing DQ and performance assessments and their standardization. The results suggest the usefulness of artificial intelligence-based techniques for enhancing DQ and performance to unlock the full potential of EHRs to improve medical research and practice.

背景:电子健康记录(EHR)具有巨大潜力,可通过易于访问和解释的 EHR 衍生数据库推动医学研究和实践。但由于数据质量(DQ)和性能评估方面的问题,这一潜力的实现受到了限制:本综述旨在简化当前电子病历数据质量和性能评估的最佳实践,为该领域的研究人员提供一个可复制的标准:方法:系统检索了 PubMed 上从开始到 2023 年 5 月 7 日评估电子病历质量和性能的原创研究文章:结果:我们搜索到 26 篇原创研究文章。大多数文章存在一个或多个重大局限性,包括报告不完整或不一致(6 篇,占 30%)、可复制性差(5 篇,占 25%)以及结果的推广性有限(5 篇,占 25%)。完整性(n=21,81%)、一致性(n=18,69%)和可信度(n=16,62%)是被引用最多的 DQ 指标,而正确性或准确性(n=14,54%)则是被引用最多的数据性能指标,并根据具体情况辅以重复性(n=7,27%)、公平性(n=6,23%)、稳定性(n=4,15%)和可共享性(n=2,8%)评估。基于人工智能的技术,包括自然语言数据提取、数据估算和公平性算法,在提高数据集质量和性能方面发挥了越来越重要的作用:本综述强调了激励数据质量和性能评估及其标准化的必要性。结果表明,基于人工智能的技术在提高数据质量和性能方面非常有用,可以充分释放电子病历在改善医学研究和实践方面的潜力。
{"title":"Electronic Health Record Data Quality and Performance Assessments: Scoping Review.","authors":"Yordan P Penev, Timothy R Buchanan, Matthew M Ruppert, Michelle Liu, Ramin Shekouhi, Ziyuan Guan, Jeremy Balch, Tezcan Ozrazgat-Baslanti, Benjamin Shickel, Tyler J Loftus, Azra Bihorac","doi":"10.2196/58130","DOIUrl":"10.2196/58130","url":null,"abstract":"<p><strong>Background: </strong>Electronic health records (EHRs) have an enormous potential to advance medical research and practice through easily accessible and interpretable EHR-derived databases. Attainability of this potential is limited by issues with data quality (DQ) and performance assessment.</p><p><strong>Objective: </strong>This review aims to streamline the current best practices on EHR DQ and performance assessments as a replicable standard for researchers in the field.</p><p><strong>Methods: </strong>PubMed was systematically searched for original research articles assessing EHR DQ and performance from inception until May 7, 2023.</p><p><strong>Results: </strong>Our search yielded 26 original research articles. Most articles had 1 or more significant limitations, including incomplete or inconsistent reporting (n=6, 30%), poor replicability (n=5, 25%), and limited generalizability of results (n=5, 25%). Completeness (n=21, 81%), conformance (n=18, 69%), and plausibility (n=16, 62%) were the most cited indicators of DQ, while correctness or accuracy (n=14, 54%) was most cited for data performance, with context-specific supplementation by recency (n=7, 27%), fairness (n=6, 23%), stability (n=4, 15%), and shareability (n=2, 8%) assessments. Artificial intelligence-based techniques, including natural language data extraction, data imputation, and fairness algorithms, were demonstrated to play a rising role in improving both dataset quality and performance.</p><p><strong>Conclusions: </strong>This review highlights the need for incentivizing DQ and performance assessments and their standardization. The results suggest the usefulness of artificial intelligence-based techniques for enhancing DQ and performance to unlock the full potential of EHRs to improve medical research and practice.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"12 ","pages":"e58130"},"PeriodicalIF":3.1,"publicationDate":"2024-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11559435/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142592338","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Leveraging Artificial Intelligence and Data Science for Integration of Social Determinants of Health in Emergency Medicine: Scoping Review. 利用人工智能和数据科学将健康的社会决定因素纳入急诊医学:范围审查。
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-10-30 DOI: 10.2196/57124
Ethan E Abbott, Donald Apakama, Lynne D Richardson, Lili Chan, Girish N Nadkarni

Background: Social determinants of health (SDOH) are critical drivers of health disparities and patient outcomes. However, accessing and collecting patient-level SDOH data can be operationally challenging in the emergency department (ED) clinical setting, requiring innovative approaches.

Objective: This scoping review examines the potential of AI and data science for modeling, extraction, and incorporation of SDOH data specifically within EDs, further identifying areas for advancement and investigation.

Methods: We conducted a standardized search for studies published between 2015 and 2022, across Medline (Ovid), Embase (Ovid), CINAHL, Web of Science, and ERIC databases. We focused on identifying studies using AI or data science related to SDOH within emergency care contexts or conditions. Two specialized reviewers in emergency medicine (EM) and clinical informatics independently assessed each article, resolving discrepancies through iterative reviews and discussion. We then extracted data covering study details, methodologies, patient demographics, care settings, and principal outcomes.

Results: Of the 1047 studies screened, 26 met the inclusion criteria. Notably, 9 out of 26 (35%) studies were solely concentrated on ED patients. Conditions studied spanned broad EM complaints and included sepsis, acute myocardial infarction, and asthma. The majority of studies (n=16) explored multiple SDOH domains, with homelessness/housing insecurity and neighborhood/built environment predominating. Machine learning (ML) techniques were used in 23 of 26 studies, with natural language processing (NLP) being the most commonly used approach (n=11). Rule-based NLP (n=5), deep learning (n=2), and pattern matching (n=4) were the most commonly used NLP techniques. NLP models in the reviewed studies displayed significant predictive performance with outcomes, with F1-scores ranging between 0.40 and 0.75 and specificities nearing 95.9%.

Conclusions: Although in its infancy, the convergence of AI and data science techniques, especially ML and NLP, with SDOH in EM offers transformative possibilities for better usage and integration of social data into clinical care and research. With a significant focus on the ED and notable NLP model performance, there is an imperative to standardize SDOH data collection, refine algorithms for diverse patient groups, and champion interdisciplinary synergies. These efforts aim to harness SDOH data optimally, enhancing patient care and mitigating health disparities. Our research underscores the vital need for continued investigation in this domain.

背景:健康的社会决定因素(SDOH)是造成健康差异和患者预后的关键因素。然而,在急诊科(ED)的临床环境中,获取和收集患者层面的 SDOH 数据在操作上具有挑战性,需要创新的方法:本范围综述探讨了人工智能和数据科学在急诊科建模、提取和整合 SDOH 数据方面的潜力,进一步确定了需要推进和调查的领域:我们在 Medline (Ovid)、Embase (Ovid)、CINAHL、Web of Science 和 ERIC 数据库中对 2015 年至 2022 年间发表的研究进行了标准化检索。我们重点识别了在急诊环境或条件下使用人工智能或数据科学进行的与 SDOH 相关的研究。急诊医学(EM)和临床信息学的两位专业审稿人对每篇文章进行独立评估,通过反复审阅和讨论解决差异。然后,我们提取了涵盖研究细节、方法、患者人口统计学、护理环境和主要结果的数据:在筛选出的 1047 篇研究中,有 26 篇符合纳入标准。值得注意的是,26 项研究中有 9 项(35%)仅针对急诊室患者。研究的病症涉及广泛的急诊主诉,包括败血症、急性心肌梗死和哮喘。大多数研究(16 项)探讨了多个 SDOH 领域,其中以无家可归/住房无保障和邻里/建筑环境为主。26 项研究中有 23 项使用了机器学习(ML)技术,其中自然语言处理(NLP)是最常用的方法(11 项)。基于规则的 NLP(5 项)、深度学习(2 项)和模式匹配(4 项)是最常用的 NLP 技术。综述研究中的 NLP 模型对结果具有显著的预测性能,F1 分数介于 0.40 和 0.75 之间,特异性接近 95.9%:人工智能和数据科学技术(尤其是 ML 和 NLP)与 EM 中的 SDOH 的融合虽然还处于起步阶段,但它为更好地利用社会数据并将其整合到临床护理和研究中提供了变革性的可能性。随着人们对急诊室的极大关注和 NLP 模型的显著表现,规范 SDOH 数据收集、完善针对不同患者群体的算法以及倡导跨学科协同合作已势在必行。这些努力旨在优化 SDOH 数据的利用,加强对患者的护理,减少健康差异。我们的研究强调了在这一领域继续开展调查的迫切需要。
{"title":"Leveraging Artificial Intelligence and Data Science for Integration of Social Determinants of Health in Emergency Medicine: Scoping Review.","authors":"Ethan E Abbott, Donald Apakama, Lynne D Richardson, Lili Chan, Girish N Nadkarni","doi":"10.2196/57124","DOIUrl":"10.2196/57124","url":null,"abstract":"<p><strong>Background: </strong>Social determinants of health (SDOH) are critical drivers of health disparities and patient outcomes. However, accessing and collecting patient-level SDOH data can be operationally challenging in the emergency department (ED) clinical setting, requiring innovative approaches.</p><p><strong>Objective: </strong>This scoping review examines the potential of AI and data science for modeling, extraction, and incorporation of SDOH data specifically within EDs, further identifying areas for advancement and investigation.</p><p><strong>Methods: </strong>We conducted a standardized search for studies published between 2015 and 2022, across Medline (Ovid), Embase (Ovid), CINAHL, Web of Science, and ERIC databases. We focused on identifying studies using AI or data science related to SDOH within emergency care contexts or conditions. Two specialized reviewers in emergency medicine (EM) and clinical informatics independently assessed each article, resolving discrepancies through iterative reviews and discussion. We then extracted data covering study details, methodologies, patient demographics, care settings, and principal outcomes.</p><p><strong>Results: </strong>Of the 1047 studies screened, 26 met the inclusion criteria. Notably, 9 out of 26 (35%) studies were solely concentrated on ED patients. Conditions studied spanned broad EM complaints and included sepsis, acute myocardial infarction, and asthma. The majority of studies (n=16) explored multiple SDOH domains, with homelessness/housing insecurity and neighborhood/built environment predominating. Machine learning (ML) techniques were used in 23 of 26 studies, with natural language processing (NLP) being the most commonly used approach (n=11). Rule-based NLP (n=5), deep learning (n=2), and pattern matching (n=4) were the most commonly used NLP techniques. NLP models in the reviewed studies displayed significant predictive performance with outcomes, with F1-scores ranging between 0.40 and 0.75 and specificities nearing 95.9%.</p><p><strong>Conclusions: </strong>Although in its infancy, the convergence of AI and data science techniques, especially ML and NLP, with SDOH in EM offers transformative possibilities for better usage and integration of social data into clinical care and research. With a significant focus on the ED and notable NLP model performance, there is an imperative to standardize SDOH data collection, refine algorithms for diverse patient groups, and champion interdisciplinary synergies. These efforts aim to harness SDOH data optimally, enhancing patient care and mitigating health disparities. Our research underscores the vital need for continued investigation in this domain.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"12 ","pages":"e57124"},"PeriodicalIF":3.1,"publicationDate":"2024-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11539921/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142549217","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
JMIR Medical Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1