首页 > 最新文献

JMIR Medical Informatics最新文献

英文 中文
Reference Hallucination Score for Medical Artificial Intelligence Chatbots: Development and Usability Study. 医疗人工智能聊天机器人的参考幻觉评分:开发与可用性研究
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-07-31 DOI: 10.2196/54345
Fadi Aljamaan, Mohamad-Hani Temsah, Ibraheem Altamimi, Ayman Al-Eyadhy, Amr Jamal, Khalid Alhasan, Tamer A Mesallam, Mohamed Farahat, Khalid H Malki

Background: Artificial intelligence (AI) chatbots have recently gained use in medical practice by health care practitioners. Interestingly, the output of these AI chatbots was found to have varying degrees of hallucination in content and references. Such hallucinations generate doubts about their output and their implementation.

Objective: The aim of our study was to propose a reference hallucination score (RHS) to evaluate the authenticity of AI chatbots' citations.

Methods: Six AI chatbots were challenged with the same 10 medical prompts, requesting 10 references per prompt. The RHS is composed of 6 bibliographic items and the reference's relevance to prompts' keywords. RHS was calculated for each reference, prompt, and type of prompt (basic vs complex). The average RHS was calculated for each AI chatbot and compared across the different types of prompts and AI chatbots.

Results: Bard failed to generate any references. ChatGPT 3.5 and Bing generated the highest RHS (score=11), while Elicit and SciSpace generated the lowest RHS (score=1), and Perplexity generated a middle RHS (score=7). The highest degree of hallucination was observed for reference relevancy to the prompt keywords (308/500, 61.6%), while the lowest was for reference titles (169/500, 33.8%). ChatGPT and Bing had comparable RHS (β coefficient=-0.069; P=.32), while Perplexity had significantly lower RHS than ChatGPT (β coefficient=-0.345; P<.001). AI chatbots generally had significantly higher RHS when prompted with scenarios or complex format prompts (β coefficient=0.486; P<.001).

Conclusions: The variation in RHS underscores the necessity for a robust reference evaluation tool to improve the authenticity of AI chatbots. Further, the variations highlight the importance of verifying their output and citations. Elicit and SciSpace had negligible hallucination, while ChatGPT and Bing had critical hallucination levels. The proposed AI chatbots' RHS could contribute to ongoing efforts to enhance AI's general reliability in medical research.

背景介绍人工智能(AI)聊天机器人最近开始被医疗从业人员用于医疗实践。有趣的是,人们发现这些人工智能聊天机器人的输出内容和参考资料存在不同程度的幻觉。这种幻觉让人对它们的输出和实施产生怀疑:我们的研究旨在提出参考幻觉评分(RHS),以评估人工智能聊天机器人引文的真实性:方法:六个人工智能聊天机器人接受了同样的 10 条医学提示,每条提示要求 10 篇参考文献。RHS由6个书目项目和参考文献与提示关键词的相关性组成。针对每个参考文献、提示和提示类型(基本与复杂)计算 RHS。计算每个人工智能聊天机器人的平均 RHS,并对不同类型的提示和人工智能聊天机器人进行比较:Bard 未能生成任何引用。ChatGPT 3.5 和 Bing 生成了最高的 RHS(得分=11),而 Elicit 和 SciSpace 生成了最低的 RHS(得分=1),Perplexity 生成了中等的 RHS(得分=7)。参考文献与提示关键词相关性的幻觉程度最高(308/500,61.6%),而参考文献标题的幻觉程度最低(169/500,33.8%)。ChatGPT 和 Bing 的 RHS 相当(β 系数=-0.069;P=.32),而 Perplexity 的 RHS 明显低于 ChatGPT(β 系数=-0.345;PC 结论:RHS的差异突出表明,有必要使用可靠的参考评估工具来提高人工智能聊天机器人的真实性。此外,这些差异还凸显了验证其产出和引用的重要性。Elicit 和 SciSpace 的幻觉几乎可以忽略不计,而 ChatGPT 和 Bing 的幻觉则达到了临界水平。所建议的人工智能聊天机器人的 RHS 可以为当前提高人工智能在医学研究中的总体可靠性做出贡献。
{"title":"Reference Hallucination Score for Medical Artificial Intelligence Chatbots: Development and Usability Study.","authors":"Fadi Aljamaan, Mohamad-Hani Temsah, Ibraheem Altamimi, Ayman Al-Eyadhy, Amr Jamal, Khalid Alhasan, Tamer A Mesallam, Mohamed Farahat, Khalid H Malki","doi":"10.2196/54345","DOIUrl":"10.2196/54345","url":null,"abstract":"<p><strong>Background: </strong>Artificial intelligence (AI) chatbots have recently gained use in medical practice by health care practitioners. Interestingly, the output of these AI chatbots was found to have varying degrees of hallucination in content and references. Such hallucinations generate doubts about their output and their implementation.</p><p><strong>Objective: </strong>The aim of our study was to propose a reference hallucination score (RHS) to evaluate the authenticity of AI chatbots' citations.</p><p><strong>Methods: </strong>Six AI chatbots were challenged with the same 10 medical prompts, requesting 10 references per prompt. The RHS is composed of 6 bibliographic items and the reference's relevance to prompts' keywords. RHS was calculated for each reference, prompt, and type of prompt (basic vs complex). The average RHS was calculated for each AI chatbot and compared across the different types of prompts and AI chatbots.</p><p><strong>Results: </strong>Bard failed to generate any references. ChatGPT 3.5 and Bing generated the highest RHS (score=11), while Elicit and SciSpace generated the lowest RHS (score=1), and Perplexity generated a middle RHS (score=7). The highest degree of hallucination was observed for reference relevancy to the prompt keywords (308/500, 61.6%), while the lowest was for reference titles (169/500, 33.8%). ChatGPT and Bing had comparable RHS (β coefficient=-0.069; P=.32), while Perplexity had significantly lower RHS than ChatGPT (β coefficient=-0.345; P<.001). AI chatbots generally had significantly higher RHS when prompted with scenarios or complex format prompts (β coefficient=0.486; P<.001).</p><p><strong>Conclusions: </strong>The variation in RHS underscores the necessity for a robust reference evaluation tool to improve the authenticity of AI chatbots. Further, the variations highlight the importance of verifying their output and citations. Elicit and SciSpace had negligible hallucination, while ChatGPT and Bing had critical hallucination levels. The proposed AI chatbots' RHS could contribute to ongoing efforts to enhance AI's general reliability in medical research.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11325115/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141861823","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unsupervised Feature Selection to Identify Important ICD-10 and ATC Codes for Machine Learning on a Cohort of Patients With Coronary Heart Disease: Retrospective Study 通过无监督特征选择来识别重要的 ICD-10 和 ATC 编码,以便对冠心病患者队列进行机器学习:回顾性研究
IF 3.2 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-07-26 DOI: 10.2196/52896
Peyman Ghasemi, Joon Lee
Background: The application of machine learning in healthcare often necessitates the use of hierarchical codes such as the International Classification of Diseases (ICD) and Anatomical Therapeutic Chemical (ATC) systems. These codes classify diseases and medications respectively, thereby forming extensive data dimensions. Unsupervised feature selection tackles the "curse of dimensionality" and helps to improve the accuracy and performance of supervised learning models by reducing the number of irrelevant or redundant features and avoiding overfitting. Techniques for unsupervised feature selection, such as filter, wrapper, and embedded methods, are implemented to select the most important features with the most intrinsic information. However, they face challenges due to the sheer volume of ICD/ATC codes and the hierarchical structures of these systems. Objective: The objective of this study was to compare several unsupervised feature selection methods for ICD and ATC code databases of coronary artery disease patients in different aspects of performance and complexity and select the best set of features representing these patients. Methods: We compared several unsupervised feature selection methods for two ICD and one ATC code databases of 51,506 coronary artery disease patients in Alberta, Canada. Specifically, we employed Laplacian Score, Unsupervised Feature Selection for Multi-Cluster Data, Autoencoder Inspired Unsupervised Feature Selection, Principal Feature Analysis, and Concrete Autoencoders with and without ICD/ATC tree weight adjustment to select the 100 best features from over 9,000 ICD and 2,000 ATC codes. We assessed the selected features based on their ability to reconstruct the initial feature space and predict 90-day mortality following discharge. We also compared the complexity of selected features by mean code level in ICD/ATC tree and the interpretability of the features in the mortality prediction task using Shapley analysis. Results: In feature space reconstruction and mortality prediction, the Concrete Autoencoder-based methods outperformed other techniques. A weight-adjusted Concrete Autoencoder variant, particularly, demonstrated improved reconstruction accuracy and significant predictive performance enhancement, confirmed by DeLong's and McNemar's tests (P<.05). Concrete Autoencoders preferred more general codes and they consistently reconstructed all features accurately. Additionally, features selected by weight-adjusted Concrete Autoencoders yielded higher Shapley values in mortality prediction compared to most alternatives. Conclusions: This study scrutinized five feature selection methods in ICD/ATC code datasets in an unsupervised context. Our findings underscore the superiority of the Concrete Autoencoder method in selecting salient features that represent the entire dataset, offering a potential asset for subsequent machine learning research. We also present a novel weight adjustment approach for the Concrete Autoenc
背景:机器学习在医疗保健领域的应用通常需要使用分级代码,如国际疾病分类(ICD)和解剖治疗化学(ATC)系统。这些代码分别对疾病和药物进行分类,从而形成了广泛的数据维度。无监督特征选择可以解决 "维度诅咒 "问题,通过减少无关或冗余特征的数量,避免过度拟合,从而帮助提高有监督学习模型的准确性和性能。无监督特征选择技术,如过滤法、包装法和嵌入法等,都是为了选择具有最多内在信息的最重要特征。然而,由于 ICD/ATC 代码的庞大数量和这些系统的分层结构,这些技术面临着挑战。研究目的本研究的目的是比较几种针对冠心病患者 ICD 和 ATC 代码数据库的无监督特征选择方法在不同方面的性能和复杂性,并选出代表这些患者的最佳特征集。方法:我们针对加拿大艾伯塔省 51,506 名冠心病患者的两个 ICD 和一个 ATC 代码数据库,比较了几种无监督特征选择方法。具体来说,我们采用了拉普拉卡方评分法、多集群数据无监督特征选择法、自动编码器启发无监督特征选择法、主特征分析法以及带有或不带有 ICD/ATC 树权重调整的混凝土自动编码器,从 9000 多个 ICD 和 2000 多个 ATC 代码中选出了 100 个最佳特征。我们根据所选特征重建初始特征空间和预测出院后 90 天死亡率的能力对其进行了评估。我们还通过 ICD/ATC 树中的平均代码级别比较了所选特征的复杂性,并使用 Shapley 分析比较了死亡率预测任务中特征的可解释性。结果:在特征空间重建和死亡率预测方面,基于具体自动编码器的方法优于其他技术。特别是经过权重调整的混凝土自动编码器变体,其重建准确性得到了提高,预测性能也有显著增强,这一点已通过 DeLong 检验和 McNemar 检验得到证实(P<.05)。具体自动编码器更倾向于使用更通用的代码,而且它们始终能准确地重建所有特征。此外,与大多数替代方法相比,权重调整后的具体自动编码器选择的特征在死亡率预测中产生了更高的 Shapley 值。结论本研究在无监督的情况下仔细研究了 ICD/ATC 代码数据集中的五种特征选择方法。我们的研究结果强调了混凝土自动编码器方法在选择代表整个数据集的突出特征方面的优越性,为后续的机器学习研究提供了潜在的资产。我们还针对 ICD/ATC 代码数据集提出了一种新颖的具体自动编码器权重调整方法,以增强所选特征的通用性和可解释性。
{"title":"Unsupervised Feature Selection to Identify Important ICD-10 and ATC Codes for Machine Learning on a Cohort of Patients With Coronary Heart Disease: Retrospective Study","authors":"Peyman Ghasemi, Joon Lee","doi":"10.2196/52896","DOIUrl":"https://doi.org/10.2196/52896","url":null,"abstract":"Background: The application of machine learning in healthcare often necessitates the use of hierarchical codes such as the International Classification of Diseases (ICD) and Anatomical Therapeutic Chemical (ATC) systems. These codes classify diseases and medications respectively, thereby forming extensive data dimensions. Unsupervised feature selection tackles the \"curse of dimensionality\" and helps to improve the accuracy and performance of supervised learning models by reducing the number of irrelevant or redundant features and avoiding overfitting. Techniques for unsupervised feature selection, such as filter, wrapper, and embedded methods, are implemented to select the most important features with the most intrinsic information. However, they face challenges due to the sheer volume of ICD/ATC codes and the hierarchical structures of these systems. Objective: The objective of this study was to compare several unsupervised feature selection methods for ICD and ATC code databases of coronary artery disease patients in different aspects of performance and complexity and select the best set of features representing these patients. Methods: We compared several unsupervised feature selection methods for two ICD and one ATC code databases of 51,506 coronary artery disease patients in Alberta, Canada. Specifically, we employed Laplacian Score, Unsupervised Feature Selection for Multi-Cluster Data, Autoencoder Inspired Unsupervised Feature Selection, Principal Feature Analysis, and Concrete Autoencoders with and without ICD/ATC tree weight adjustment to select the 100 best features from over 9,000 ICD and 2,000 ATC codes. We assessed the selected features based on their ability to reconstruct the initial feature space and predict 90-day mortality following discharge. We also compared the complexity of selected features by mean code level in ICD/ATC tree and the interpretability of the features in the mortality prediction task using Shapley analysis. Results: In feature space reconstruction and mortality prediction, the Concrete Autoencoder-based methods outperformed other techniques. A weight-adjusted Concrete Autoencoder variant, particularly, demonstrated improved reconstruction accuracy and significant predictive performance enhancement, confirmed by DeLong's and McNemar's tests (P<.05). Concrete Autoencoders preferred more general codes and they consistently reconstructed all features accurately. Additionally, features selected by weight-adjusted Concrete Autoencoders yielded higher Shapley values in mortality prediction compared to most alternatives. Conclusions: This study scrutinized five feature selection methods in ICD/ATC code datasets in an unsupervised context. Our findings underscore the superiority of the Concrete Autoencoder method in selecting salient features that represent the entire dataset, offering a potential asset for subsequent machine learning research. We also present a novel weight adjustment approach for the Concrete Autoenc","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2024-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141770157","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Maturity Assessment of District Health Information System Version 2 Implementation in Ethiopia: Current Status and Improvement Pathways. 埃塞俄比亚地区卫生信息系统第 2 版实施成熟度评估:现状与改进途径》。
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-07-26 DOI: 10.2196/50375
Tesfahun Melese Yilma, Asefa Taddese, Adane Mamuye, Berhanu Fikadie Endehabtu, Yibeltal Alemayehu, Asaye Senay, Dawit Daka, Loko Abraham, Rabeal Tadesse, Gemechis Melkamu, Naod Wendrad, Oli Kaba, Mesoud Mohammed, Wubshet Denboba, Dawit Birhan, Amanuel Biru, Binyam Tilahun

Background: Although Ethiopia has made remarkable progress in the uptake of the District Health Information System version 2 (DHIS2) for national aggregate data reporting, there has been no comprehensive assessment of the maturity level of the system.

Objective: This study aims to assess the maturity level of DHIS2 implementation in Ethiopia and propose a road map that could guide the progress toward a higher level of maturity. We also aim to assess the current maturity status, implementation gaps, and future directions of DHIS2 implementation in Ethiopia. The assessment focused on digital health system governance, skilled human resources, information and communication technology (ICT) infrastructure, interoperability, and data quality and use.

Methods: A collaborative assessment was conducted with the engagement of key stakeholders through consultative workshops using the Stages of Continuous Improvement tool to measure maturity levels in 5 core domains, 13 components, and 39 subcomponents. A 5-point scale (1=emerging, 2=repeatable, 3=defined, 4=managed, and 5=optimized) was used to measure the DHIS2 implementation maturity level.

Results: The national DHIS2 implementation's maturity level is currently at the defined stage (score=2.81) and planned to move to the manageable stage (score=4.09) by 2025. The domain-wise maturity score indicated that except for ICT infrastructure, which is at the repeatable stage (score=2.14), the remaining 4 domains are at the defined stage (score=3). The development of a standardized and basic DHIS2 process at the national level, the development of a 10-year strategic plan to guide the implementation of digital health systems including DHIS2, and the presence of the required competencies at the facility level to accomplish specific DHIS2-related tasks are the major strength of the Ministry of Health of Ethiopia so far. The lack of workforce competency guidelines to support the implementation of DHIS2; the unavailability of core competencies (knowledge, skills, and abilities) required to accomplish DHIS2 tasks at all levels of the health system; and ICT infrastructures such as communication network and internet connectivity at the district, zonal, and regional levels are the major hindrances to effective DHIS2 implementation in the country.

Conclusions: On the basis of the Stages of Continuous Improvement maturity model toolkit, the implementation status of DHIS2 in Ethiopia is at the defined stage, with the ICT infrastructure domain being at the lowest stage as compared to the other 4 domains. By 2025, the maturity status is planned to move from the defined stage to the managed stage by improving the identified gaps. Various action points are suggested to address the identified gaps and reach the stated maturity level. The responsible body, necessary resources, and methods of verification required to reac

背景:尽管埃塞俄比亚在采用地区卫生信息系统第 2 版(DHIS2)进行全国综合数据报告方面取得了显著进展,但对该系统的成熟度却没有进行过全面评估:本研究旨在评估埃塞俄比亚实施 DHIS2 的成熟度,并提出一个路线图,以指导向更高成熟度迈进。我们还旨在评估埃塞俄比亚实施 DHIS2 系统的当前成熟度、实施差距和未来方向。评估的重点是数字医疗系统治理、熟练人力资源、信息和通信技术(ICT)基础设施、互操作性以及数据质量和使用:方法:在主要利益相关者的参与下,通过咨询研讨会使用 "持续改进阶段 "工具进行了合作评估,以衡量 5 个核心领域、13 个组成部分和 39 个子组成部分的成熟度。采用五点量表(1=萌芽、2=可重复、3=已定义、4=已管理、5=已优化)来衡量 DHIS2 的实施成熟度:结果:全国 DHIS2 实施的成熟度目前处于确定阶段(得分=2.81),计划到 2025 年进入可管理阶段(得分=4.09)。按领域划分的成熟度得分表明,除信息和通信技术基础设施处于可重复阶段(得分=2.14)外,其余 4 个领域均处于确定阶段(得分=3)。迄今为止,埃塞俄比亚卫生部的主要优势包括:在国家层面制定了标准化的 DHIS2 基本流程;制定了十年战略计划,以指导包括 DHIS2 在内的数字卫生系统的实施;在设施层面具备完成 DHIS2 相关具体任务所需的能力。缺乏支持 DHIS2 实施的劳动力能力指南;缺乏各级卫生系统完成 DHIS2 任务所需的核心能力(知识、技能和能力);以及信息和通信技术基础设施,如地区、分区和区域一级的通信网络和互联网连接,是该国有效实施 DHIS2 的主要障碍:根据 "持续改进阶段 "成熟度模型工具包,埃塞俄比亚 DHIS2 的实施状况处于既定阶段,与其他 4 个领域相比,信息和通信技术基础设施领域处于最低阶段。计划到 2025 年,通过改进已发现的差距,将成熟度从确定阶段提升到管理阶段。为解决已发现的差距并达到既定的成熟度水平,提出了各种行动要点。还列出了达到指定成熟度所需的负责机构、必要资源和验证方法。
{"title":"Maturity Assessment of District Health Information System Version 2 Implementation in Ethiopia: Current Status and Improvement Pathways.","authors":"Tesfahun Melese Yilma, Asefa Taddese, Adane Mamuye, Berhanu Fikadie Endehabtu, Yibeltal Alemayehu, Asaye Senay, Dawit Daka, Loko Abraham, Rabeal Tadesse, Gemechis Melkamu, Naod Wendrad, Oli Kaba, Mesoud Mohammed, Wubshet Denboba, Dawit Birhan, Amanuel Biru, Binyam Tilahun","doi":"10.2196/50375","DOIUrl":"10.2196/50375","url":null,"abstract":"<p><strong>Background: </strong>Although Ethiopia has made remarkable progress in the uptake of the District Health Information System version 2 (DHIS2) for national aggregate data reporting, there has been no comprehensive assessment of the maturity level of the system.</p><p><strong>Objective: </strong>This study aims to assess the maturity level of DHIS2 implementation in Ethiopia and propose a road map that could guide the progress toward a higher level of maturity. We also aim to assess the current maturity status, implementation gaps, and future directions of DHIS2 implementation in Ethiopia. The assessment focused on digital health system governance, skilled human resources, information and communication technology (ICT) infrastructure, interoperability, and data quality and use.</p><p><strong>Methods: </strong>A collaborative assessment was conducted with the engagement of key stakeholders through consultative workshops using the Stages of Continuous Improvement tool to measure maturity levels in 5 core domains, 13 components, and 39 subcomponents. A 5-point scale (1=emerging, 2=repeatable, 3=defined, 4=managed, and 5=optimized) was used to measure the DHIS2 implementation maturity level.</p><p><strong>Results: </strong>The national DHIS2 implementation's maturity level is currently at the defined stage (score=2.81) and planned to move to the manageable stage (score=4.09) by 2025. The domain-wise maturity score indicated that except for ICT infrastructure, which is at the repeatable stage (score=2.14), the remaining 4 domains are at the defined stage (score=3). The development of a standardized and basic DHIS2 process at the national level, the development of a 10-year strategic plan to guide the implementation of digital health systems including DHIS2, and the presence of the required competencies at the facility level to accomplish specific DHIS2-related tasks are the major strength of the Ministry of Health of Ethiopia so far. The lack of workforce competency guidelines to support the implementation of DHIS2; the unavailability of core competencies (knowledge, skills, and abilities) required to accomplish DHIS2 tasks at all levels of the health system; and ICT infrastructures such as communication network and internet connectivity at the district, zonal, and regional levels are the major hindrances to effective DHIS2 implementation in the country.</p><p><strong>Conclusions: </strong>On the basis of the Stages of Continuous Improvement maturity model toolkit, the implementation status of DHIS2 in Ethiopia is at the defined stage, with the ICT infrastructure domain being at the lowest stage as compared to the other 4 domains. By 2025, the maturity status is planned to move from the defined stage to the managed stage by improving the identified gaps. Various action points are suggested to address the identified gaps and reach the stated maturity level. The responsible body, necessary resources, and methods of verification required to reac","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11316158/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141768262","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Impact of Large Language Models on Medical Education and Teaching Adaptations 大语言模型对医学教育和教学调整的影响
IF 3.2 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-07-25 DOI: 10.2196/55933
Li Zhui, Nina Yhap, Liu Liping, Wang Zhengjie, Xiong Zhonghao, Yuan Xiaoshu, Cui Hong, Liu Xuexiu, Ren Wei
This viewpoint article explores the transformative impact of Chat Generative Pre-trained Transformer (ChatGPT) on medical education, highlighting its opportunities and challenges. ChatGPT, a product of OpenAI, leverages advanced deep learning models to offer diverse applications, including enhancing teaching efficiency, facilitating personalized learning, reinforcing clinical skills training, improving medical teaching assessment, enhancing efficiency in medical research, and supporting continuing medical education. While presenting promising opportunities, the integration of ChatGPT in medical education raises concerns about response accuracy, overreliance, lack of emotional intelligence, and privacy and data security risks. The article underscores the imperative need to carefully address these challenges, outlining future pathways to bolster medical information accuracy, fortify privacy and data security, and promote synergy between ChatGPT and other artificial intelligence technologies in medical education. It highlights the adaptability and transformative significance of educators amid the widespread integration of ChatGPT in medical education. Educators must consistently uphold a leadership role, guiding students in the ethical and effective use of ChatGPT, nurturing independent thinking, and honing critical reasoning skills. Safeguarding the quality and integrity of medical education in this dynamic technological era remains paramount.
这篇观点文章探讨了 Chat Generative Pre-trained Transformer(ChatGPT)对医学教育的变革性影响,强调了其机遇和挑战。ChatGPT 是 OpenAI 的产品,利用先进的深度学习模型提供多样化的应用,包括提高教学效率、促进个性化学习、强化临床技能培训、改进医学教学评估、提高医学研究效率以及支持继续医学教育。虽然 ChatGPT 在医学教育中的应用前景广阔,但也引发了人们对响应准确性、过度依赖、缺乏情商以及隐私和数据安全风险等问题的担忧。文章强调了认真应对这些挑战的迫切需要,概述了提高医疗信息准确性、加强隐私和数据安全以及促进 ChatGPT 和其他人工智能技术在医学教育中的协同作用的未来途径。报告强调了在医学教育中广泛整合 ChatGPT 的过程中教育工作者的适应性和变革意义。教育者必须始终坚持发挥领导作用,指导学生以合乎道德的方式有效使用 ChatGPT,培养学生的独立思考能力和批判性推理能力。在这个充满活力的技术时代,保障医学教育的质量和完整性仍然至关重要。
{"title":"Impact of Large Language Models on Medical Education and Teaching Adaptations","authors":"Li Zhui, Nina Yhap, Liu Liping, Wang Zhengjie, Xiong Zhonghao, Yuan Xiaoshu, Cui Hong, Liu Xuexiu, Ren Wei","doi":"10.2196/55933","DOIUrl":"https://doi.org/10.2196/55933","url":null,"abstract":"This viewpoint article explores the transformative impact of Chat Generative Pre-trained Transformer (ChatGPT) on medical education, highlighting its opportunities and challenges. ChatGPT, a product of OpenAI, leverages advanced deep learning models to offer diverse applications, including enhancing teaching efficiency, facilitating personalized learning, reinforcing clinical skills training, improving medical teaching assessment, enhancing efficiency in medical research, and supporting continuing medical education. While presenting promising opportunities, the integration of ChatGPT in medical education raises concerns about response accuracy, overreliance, lack of emotional intelligence, and privacy and data security risks. The article underscores the imperative need to carefully address these challenges, outlining future pathways to bolster medical information accuracy, fortify privacy and data security, and promote synergy between ChatGPT and other artificial intelligence technologies in medical education. It highlights the adaptability and transformative significance of educators amid the widespread integration of ChatGPT in medical education. Educators must consistently uphold a leadership role, guiding students in the ethical and effective use of ChatGPT, nurturing independent thinking, and honing critical reasoning skills. Safeguarding the quality and integrity of medical education in this dynamic technological era remains paramount.","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141770158","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hjernetegn.dk-The Danish Central Nervous System Tumor Awareness Initiative Digital Decision Support Tool: Design and Implementation Report. Hjernetegn.dk-丹麦中枢神经系统肿瘤认知倡议数字决策支持工具:设计和实施报告。
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-07-25 DOI: 10.2196/58886
Kathrine Synne Weile, René Mathiasen, Jeanette Falck Winther, Henrik Hasle, Louise Tram Henriksen

Background: Childhood tumors in the central nervous system (CNS) have longer diagnostic delays than other pediatric tumors. Vague presenting symptoms pose a challenge in the diagnostic process; it has been indicated that patients and parents may be hesitant to seek help, and health care professionals (HCPs) may lack awareness and knowledge about clinical presentation. To raise awareness among HCPs, the Danish CNS tumor awareness initiative hjernetegn.dk was launched.

Objective: This study aims to present the learnings from designing and implementing a decision support tool for HCPs to reduce diagnostic delay in childhood CNS tumors. The aims also include decisions regarding strategies for dissemination and use of social media, and an evaluation of the digital impact 6 months after launch.

Methods: The phases of developing and implementing the tool include participatory co-creation workshops, designing the website and digital platforms, and implementing a press and media strategy. The digital impact of hjernetegn.dk was evaluated through website analytics and social media engagement.

Implementation (results): hjernetegn.dk was launched in August 2023. The results after 6 months exceeded key performance indicators. The analysis showed a high number of website visitors and engagement, with a plateau reached 3 months after the initial launch. The LinkedIn campaign and Google Search strategy also generated a high number of impressions and clicks.

Conclusions: The findings suggest that the initiative has been successfully integrated, raising awareness and providing a valuable tool for HCPs in diagnosing childhood CNS tumors. The study highlights the importance of interdisciplinary collaboration, co-creation, and ongoing community management, as well as broad dissemination strategies when introducing a digital support tool.

背景:与其他儿科肿瘤相比,儿童中枢神经系统(CNS)肿瘤的诊断延迟时间更长。模糊的症状给诊断过程带来了挑战;有研究表明,患者和家长可能会对寻求帮助犹豫不决,而医疗保健专业人员(HCPs)可能缺乏对临床表现的认识和知识。为了提高医护人员对中枢神经系统肿瘤的认识,丹麦发起了中枢神经系统肿瘤认识行动 hjernetegn.dk:本研究旨在介绍在设计和实施决策支持工具以减少儿童中枢神经系统肿瘤诊断延误方面的经验。目的还包括有关传播和使用社交媒体的战略决策,以及对推出 6 个月后的数字影响进行评估:该工具的开发和实施阶段包括参与式共同创造研讨会、设计网站和数字平台,以及实施新闻和媒体战略。通过网站分析和社交媒体参与来评估 hjernetegn.dk 的数字影响。实施(结果):hjernetegn.dk 于 2023 年 8 月推出。6 个月后的结果超过了关键绩效指标。分析表明,网站访问量和参与度都很高,在首次推出 3 个月后达到了一个高峰。LinkedIn 活动和谷歌搜索战略也产生了大量的印象和点击:研究结果表明,该倡议已成功整合,提高了人们的认识,并为保健医生诊断儿童中枢神经系统肿瘤提供了宝贵的工具。该研究强调了跨学科合作、共同创造和持续社区管理的重要性,以及在引入数字支持工具时采取广泛传播策略的重要性。
{"title":"Hjernetegn.dk-The Danish Central Nervous System Tumor Awareness Initiative Digital Decision Support Tool: Design and Implementation Report.","authors":"Kathrine Synne Weile, René Mathiasen, Jeanette Falck Winther, Henrik Hasle, Louise Tram Henriksen","doi":"10.2196/58886","DOIUrl":"10.2196/58886","url":null,"abstract":"<p><strong>Background: </strong>Childhood tumors in the central nervous system (CNS) have longer diagnostic delays than other pediatric tumors. Vague presenting symptoms pose a challenge in the diagnostic process; it has been indicated that patients and parents may be hesitant to seek help, and health care professionals (HCPs) may lack awareness and knowledge about clinical presentation. To raise awareness among HCPs, the Danish CNS tumor awareness initiative hjernetegn.dk was launched.</p><p><strong>Objective: </strong>This study aims to present the learnings from designing and implementing a decision support tool for HCPs to reduce diagnostic delay in childhood CNS tumors. The aims also include decisions regarding strategies for dissemination and use of social media, and an evaluation of the digital impact 6 months after launch.</p><p><strong>Methods: </strong>The phases of developing and implementing the tool include participatory co-creation workshops, designing the website and digital platforms, and implementing a press and media strategy. The digital impact of hjernetegn.dk was evaluated through website analytics and social media engagement.</p><p><strong>Implementation (results): </strong>hjernetegn.dk was launched in August 2023. The results after 6 months exceeded key performance indicators. The analysis showed a high number of website visitors and engagement, with a plateau reached 3 months after the initial launch. The LinkedIn campaign and Google Search strategy also generated a high number of impressions and clicks.</p><p><strong>Conclusions: </strong>The findings suggest that the initiative has been successfully integrated, raising awareness and providing a valuable tool for HCPs in diagnosing childhood CNS tumors. The study highlights the importance of interdisciplinary collaboration, co-creation, and ongoing community management, as well as broad dissemination strategies when introducing a digital support tool.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11310640/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141763011","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Introducing Attribute Association Graphs to Facilitate Medical Data Exploration: Development and Evaluation Using Epidemiological Study Data. 引入属性关联图以促进医学数据探索:利用流行病学研究数据进行开发和评估。
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-07-24 DOI: 10.2196/49865
Louis Bellmann, Alexander Johannes Wiederhold, Leona Trübe, Raphael Twerenbold, Frank Ückert, Karl Gottfried

Background: Interpretability and intuitive visualization facilitate medical knowledge generation through big data. In addition, robustness to high-dimensional and missing data is a requirement for statistical approaches in the medical domain. A method tailored to the needs of physicians must meet all the abovementioned criteria.

Objective: This study aims to develop an accessible tool for visual data exploration without the need for programming knowledge, adjusting complex parameterizations, or handling missing data. We sought to use statistical analysis using the setting of disease and control cohorts familiar to clinical researchers. We aimed to guide the user by identifying and highlighting data patterns associated with disease and reveal relations between attributes within the data set.

Methods: We introduce the attribute association graph, a novel graph structure designed for visual data exploration using robust statistical metrics. The nodes capture frequencies of participant attributes in disease and control cohorts as well as deviations between groups. The edges represent conditional relations between attributes. The graph is visualized using the Neo4j (Neo4j, Inc) data platform and can be interactively explored without the need for technical knowledge. Nodes with high deviations between cohorts and edges of noticeable conditional relationship are highlighted to guide the user during the exploration. The graph is accompanied by a dashboard visualizing variable distributions. For evaluation, we applied the graph and dashboard to the Hamburg City Health Study data set, a large cohort study conducted in the city of Hamburg, Germany. All data structures can be accessed freely by researchers, physicians, and patients. In addition, we developed a user test conducted with physicians incorporating the System Usability Scale, individual questions, and user tasks.

Results: We evaluated the attribute association graph and dashboard through an exemplary data analysis of participants with a general cardiovascular disease in the Hamburg City Health Study data set. All results extracted from the graph structure and dashboard are in accordance with findings from the literature, except for unusually low cholesterol levels in participants with cardiovascular disease, which could be induced by medication. In addition, 95% CIs of Pearson correlation coefficients were calculated for all associations identified during the data analysis, confirming the results. In addition, a user test with 10 physicians assessing the usability of the proposed methods was conducted. A System Usability Scale score of 70.5% and average successful task completion of 81.4% were reported.

Conclusions: The proposed attribute association graph and dashboard enable intuitive visual data exploration. They are robust to high-dimensional as well as missing data and require no pa

背景:可解释性和直观可视化有助于通过大数据生成医学知识。此外,对高维数据和缺失数据的鲁棒性也是医学领域对统计方法的要求。针对医生需求量身定制的方法必须满足上述所有标准:本研究旨在开发一种无需编程知识、无需调整复杂参数或处理缺失数据的可视化数据探索工具。我们试图使用临床研究人员熟悉的疾病和对照队列进行统计分析。我们的目标是通过识别和突出与疾病相关的数据模式来引导用户,并揭示数据集中属性之间的关系:我们引入了属性关联图,这是一种新颖的图结构,旨在利用稳健的统计指标进行可视化数据探索。图中的节点代表疾病组和对照组中参与者属性的频率以及组间偏差。边代表属性之间的条件关系。该图通过 Neo4j(Neo4j 公司)数据平台实现可视化,无需技术知识即可进行交互式探索。队列间偏差较大的节点和条件关系明显的边缘会突出显示,以便在探索过程中为用户提供指导。该图还配有一个仪表盘,可直观显示变量分布。为了进行评估,我们将图表和仪表盘应用于汉堡市健康研究数据集,这是一项在德国汉堡市进行的大型队列研究。研究人员、医生和患者均可自由访问所有数据结构。此外,我们还结合系统可用性量表、个人问题和用户任务,对医生进行了用户测试:我们通过对汉堡市健康研究数据集中患有一般心血管疾病的参与者进行示范性数据分析,对属性关联图和仪表板进行了评估。从图表结构和仪表板中提取的所有结果都与文献研究结果一致,但患有心血管疾病的参与者胆固醇水平异常低的情况除外,这可能是药物引起的。此外,还计算了数据分析过程中发现的所有关联的皮尔逊相关系数的 95% CIs,证实了分析结果。此外,还对 10 名医生进行了用户测试,以评估建议方法的可用性。系统可用性量表得分率为 70.5%,平均成功完成任务率为 81.4%:结论:所提出的属性关联图和仪表盘可实现直观的可视化数据探索。结论:提议的属性关联图和仪表盘可实现直观的可视化数据探索,对高维数据和缺失数据具有鲁棒性,且无需参数化。临床医生的可用性已通过用户测试得到证实,统计结果的有效性已通过文献中的关联和标准统计推断得到证实。
{"title":"Introducing Attribute Association Graphs to Facilitate Medical Data Exploration: Development and Evaluation Using Epidemiological Study Data.","authors":"Louis Bellmann, Alexander Johannes Wiederhold, Leona Trübe, Raphael Twerenbold, Frank Ückert, Karl Gottfried","doi":"10.2196/49865","DOIUrl":"10.2196/49865","url":null,"abstract":"<p><strong>Background: </strong>Interpretability and intuitive visualization facilitate medical knowledge generation through big data. In addition, robustness to high-dimensional and missing data is a requirement for statistical approaches in the medical domain. A method tailored to the needs of physicians must meet all the abovementioned criteria.</p><p><strong>Objective: </strong>This study aims to develop an accessible tool for visual data exploration without the need for programming knowledge, adjusting complex parameterizations, or handling missing data. We sought to use statistical analysis using the setting of disease and control cohorts familiar to clinical researchers. We aimed to guide the user by identifying and highlighting data patterns associated with disease and reveal relations between attributes within the data set.</p><p><strong>Methods: </strong>We introduce the attribute association graph, a novel graph structure designed for visual data exploration using robust statistical metrics. The nodes capture frequencies of participant attributes in disease and control cohorts as well as deviations between groups. The edges represent conditional relations between attributes. The graph is visualized using the Neo4j (Neo4j, Inc) data platform and can be interactively explored without the need for technical knowledge. Nodes with high deviations between cohorts and edges of noticeable conditional relationship are highlighted to guide the user during the exploration. The graph is accompanied by a dashboard visualizing variable distributions. For evaluation, we applied the graph and dashboard to the Hamburg City Health Study data set, a large cohort study conducted in the city of Hamburg, Germany. All data structures can be accessed freely by researchers, physicians, and patients. In addition, we developed a user test conducted with physicians incorporating the System Usability Scale, individual questions, and user tasks.</p><p><strong>Results: </strong>We evaluated the attribute association graph and dashboard through an exemplary data analysis of participants with a general cardiovascular disease in the Hamburg City Health Study data set. All results extracted from the graph structure and dashboard are in accordance with findings from the literature, except for unusually low cholesterol levels in participants with cardiovascular disease, which could be induced by medication. In addition, 95% CIs of Pearson correlation coefficients were calculated for all associations identified during the data analysis, confirming the results. In addition, a user test with 10 physicians assessing the usability of the proposed methods was conducted. A System Usability Scale score of 70.5% and average successful task completion of 81.4% were reported.</p><p><strong>Conclusions: </strong>The proposed attribute association graph and dashboard enable intuitive visual data exploration. They are robust to high-dimensional as well as missing data and require no pa","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11306949/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141753505","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Telehealth Uptake Among Hispanic People During COVID-19: Retrospective Observational Study. COVID-19 期间西语裔人群对远程医疗的接受程度:回顾性观察研究
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-07-24 DOI: 10.2196/57717
Di Shang, Cynthia Williams, Hera Culiqi

Background: The Hispanic community represents a sizeable community that experiences inequities in the US health care system. As the system has moved toward digital health platforms, evaluating the potential impact on Hispanic communities is critical.

Objective: The study aimed to investigate demographic, socioeconomic, and behavioral factors contributing to low telehealth use in Hispanic communities.

Methods: We used a retrospective observation study design to examine the study objectives. The COVID-19 Research Database Consortium provided the Analytics IQ PeopleCore consumer data and Office Alley claims data. The study period was from March 2020 to April 2021. Multiple logistic regression was used to determine the odds of using telehealth services.

Results: We examined 3,478,287 unique Hispanic patients, 16.6% (577,396) of whom used telehealth. Results suggested that patients aged between 18 and 44 years were more likely to use telehealth (odds ratio [OR] 1.07, 95% CI 1.05-1.1; P<.001) than patients aged older than 65 years. Across all age groups, patients with high incomes were at least 20% more likely to use telehealth than patients with lower incomes (P<.001); patients who had a primary care physician (P=.01), exhibited high medical usage (P<.001), or were interested in exercise (P=.03) were more likely to use telehealth; patients who had unhealthy behaviors such as smoking and alcohol consumption were less likely to use telehealth (P<.001). Male patients were less likely than female patients to use telehealth among patients aged 65 years and older (OR 0.94, 95% CI 0.93-0.95; P<.001), while male patients aged between 18 and 44 years were more likely to use telehealth (OR 1.05, 95% CI 1.03-1.07; P<.001). Among patients younger than 65 years, full-time employment was positively associated with telehealth use (P<.001). Patients aged between 18 and 44 years with high school or less education were 2% less likely to use telehealth (OR 0.98, 95% CI 0.97-0.99; P=.005). Results also revealed a positive association with using WebMD (WebMD LLC) among patients aged older than 44 years (P<.001), while there was a negative association with electronic prescriptions among those who were aged between 18 and 44 years (P=.009) and aged between 45 and 64 years (P=.004).

Conclusions: This study demonstrates that telehealth use among Hispanic communities is dependent upon factors such as age, gender, education, socioeconomic status, current health care engagement, and health behaviors. To address these challenges, we advocate for interdisciplinary approaches that involve medical professionals, insurance providers, and community-based services actively engaging with Hispanic communities and promoting telehealth use. We propose the following recommendations: enhance access to health insurance, improve access to primary care providers, and allocate fiscal a

背景:拉美裔社区是美国医疗保健系统中存在不公平现象的一个相当大的群体。随着医疗系统向数字医疗平台发展,评估其对西班牙裔社区的潜在影响至关重要:本研究旨在调查导致西班牙裔社区远程医疗使用率低的人口、社会经济和行为因素:我们采用回顾性观察研究设计来探讨研究目标。COVID-19 研究数据库联盟提供了 Analytics IQ PeopleCore 消费者数据和 Office Alley 索赔数据。研究时间为 2020 年 3 月至 2021 年 4 月。采用多元逻辑回归法确定使用远程医疗服务的几率:我们研究了 3,478,287 名独特的西班牙裔患者,其中 16.6% (577,396 人)使用了远程医疗。结果表明,年龄在 18-44 岁之间的患者更有可能使用远程保健服务(几率比 [OR] 1.07,95% CI 1.05-1.1;PC 结论:这项研究表明,远程保健服务在西班牙裔患者中的使用率较高:本研究表明,西语裔社区使用远程医疗取决于年龄、性别、教育程度、社会经济地位、当前医疗保健参与度和健康行为等因素。为了应对这些挑战,我们主张采用跨学科的方法,让医疗专业人员、保险提供商和社区服务机构积极参与拉美裔社区的活动,促进远程保健的使用。我们提出以下建议:提高医疗保险的可及性,改善初级保健提供者的可及性,分配财政和教育资源以支持远程保健的使用。随着远程保健对医疗服务的影响越来越大,专业人员必须促进人们使用所有可用的渠道来获得医疗服务。
{"title":"Telehealth Uptake Among Hispanic People During COVID-19: Retrospective Observational Study.","authors":"Di Shang, Cynthia Williams, Hera Culiqi","doi":"10.2196/57717","DOIUrl":"10.2196/57717","url":null,"abstract":"<p><strong>Background: </strong>The Hispanic community represents a sizeable community that experiences inequities in the US health care system. As the system has moved toward digital health platforms, evaluating the potential impact on Hispanic communities is critical.</p><p><strong>Objective: </strong>The study aimed to investigate demographic, socioeconomic, and behavioral factors contributing to low telehealth use in Hispanic communities.</p><p><strong>Methods: </strong>We used a retrospective observation study design to examine the study objectives. The COVID-19 Research Database Consortium provided the Analytics IQ PeopleCore consumer data and Office Alley claims data. The study period was from March 2020 to April 2021. Multiple logistic regression was used to determine the odds of using telehealth services.</p><p><strong>Results: </strong>We examined 3,478,287 unique Hispanic patients, 16.6% (577,396) of whom used telehealth. Results suggested that patients aged between 18 and 44 years were more likely to use telehealth (odds ratio [OR] 1.07, 95% CI 1.05-1.1; P<.001) than patients aged older than 65 years. Across all age groups, patients with high incomes were at least 20% more likely to use telehealth than patients with lower incomes (P<.001); patients who had a primary care physician (P=.01), exhibited high medical usage (P<.001), or were interested in exercise (P=.03) were more likely to use telehealth; patients who had unhealthy behaviors such as smoking and alcohol consumption were less likely to use telehealth (P<.001). Male patients were less likely than female patients to use telehealth among patients aged 65 years and older (OR 0.94, 95% CI 0.93-0.95; P<.001), while male patients aged between 18 and 44 years were more likely to use telehealth (OR 1.05, 95% CI 1.03-1.07; P<.001). Among patients younger than 65 years, full-time employment was positively associated with telehealth use (P<.001). Patients aged between 18 and 44 years with high school or less education were 2% less likely to use telehealth (OR 0.98, 95% CI 0.97-0.99; P=.005). Results also revealed a positive association with using WebMD (WebMD LLC) among patients aged older than 44 years (P<.001), while there was a negative association with electronic prescriptions among those who were aged between 18 and 44 years (P=.009) and aged between 45 and 64 years (P=.004).</p><p><strong>Conclusions: </strong>This study demonstrates that telehealth use among Hispanic communities is dependent upon factors such as age, gender, education, socioeconomic status, current health care engagement, and health behaviors. To address these challenges, we advocate for interdisciplinary approaches that involve medical professionals, insurance providers, and community-based services actively engaging with Hispanic communities and promoting telehealth use. We propose the following recommendations: enhance access to health insurance, improve access to primary care providers, and allocate fiscal a","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11289584/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141763013","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prediction of In-Hospital Cardiac Arrest in the Intensive Care Unit: Machine Learning-Based Multimodal Approach. 重症监护病房院内心脏骤停预测:基于机器学习的多模态方法。
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-07-23 DOI: 10.2196/49142
Hsin-Ying Lee, Po-Chih Kuo, Frank Qian, Chien-Hung Li, Jiun-Ruey Hu, Wan-Ting Hsu, Hong-Jie Jhou, Po-Huang Chen, Cho-Hao Lee, Chin-Hua Su, Po-Chun Liao, I-Ju Wu, Chien-Chang Lee

Background: Early identification of impending in-hospital cardiac arrest (IHCA) improves clinical outcomes but remains elusive for practicing clinicians.

Objective: We aimed to develop a multimodal machine learning algorithm based on ensemble techniques to predict the occurrence of IHCA.

Methods: Our model was developed by the Multiparameter Intelligent Monitoring of Intensive Care (MIMIC)-IV database and validated in the Electronic Intensive Care Unit Collaborative Research Database (eICU-CRD). Baseline features consisting of patient demographics, presenting illness, and comorbidities were collected to train a random forest model. Next, vital signs were extracted to train a long short-term memory model. A support vector machine algorithm then stacked the results to form the final prediction model.

Results: Of 23,909 patients in the MIMIC-IV database and 10,049 patients in the eICU-CRD database, 452 and 85 patients, respectively, had IHCA. At 13 hours in advance of an IHCA event, our algorithm had already demonstrated an area under the receiver operating characteristic curve of 0.85 (95% CI 0.815-0.885) in the MIMIC-IV database. External validation with the eICU-CRD and National Taiwan University Hospital databases also presented satisfactory results, showing area under the receiver operating characteristic curve values of 0.81 (95% CI 0.763-0.851) and 0.945 (95% CI 0.934-0.956), respectively.

Conclusions: Using only vital signs and information available in the electronic medical record, our model demonstrates it is possible to detect a trajectory of clinical deterioration up to 13 hours in advance. This predictive tool, which has undergone external validation, could forewarn and help clinicians identify patients in need of assessment to improve their overall prognosis.

背景:早期识别即将发生的院内心脏骤停(IHCA)可改善临床预后,但对于临床医生来说仍难以捉摸:我们旨在开发一种基于集合技术的多模态机器学习算法,以预测 IHCA 的发生:我们的模型由重症监护多参数智能监测(MIMIC)-IV 数据库开发,并在重症监护室合作研究电子数据库(eICU-CRD)中进行了验证。收集的基线特征包括患者人口统计学特征、主诉疾病和合并症,用于训练随机森林模型。接着,提取生命体征来训练长短期记忆模型。然后,支持向量机算法将结果叠加,形成最终预测模型:在 MIMIC-IV 数据库的 23909 名患者和 eICU-CRD 数据库的 10049 名患者中,分别有 452 名和 85 名患者发生了 IHCA。在 IHCA 事件发生前 13 小时,我们的算法在 MIMIC-IV 数据库中的接收器操作特征曲线下面积已达到 0.85(95% CI 0.815-0.885)。eICU-CRD和台湾大学医院数据库的外部验证结果也令人满意,接收器操作特征曲线下面积值分别为0.81(95% CI 0.763-0.851)和0.945(95% CI 0.934-0.956):我们的模型仅使用生命体征和电子病历中的信息,就能提前 13 小时发现临床恶化的轨迹。这一预测工具已经过外部验证,可以提前预警并帮助临床医生识别需要评估的患者,从而改善他们的整体预后。
{"title":"Prediction of In-Hospital Cardiac Arrest in the Intensive Care Unit: Machine Learning-Based Multimodal Approach.","authors":"Hsin-Ying Lee, Po-Chih Kuo, Frank Qian, Chien-Hung Li, Jiun-Ruey Hu, Wan-Ting Hsu, Hong-Jie Jhou, Po-Huang Chen, Cho-Hao Lee, Chin-Hua Su, Po-Chun Liao, I-Ju Wu, Chien-Chang Lee","doi":"10.2196/49142","DOIUrl":"10.2196/49142","url":null,"abstract":"<p><strong>Background: </strong>Early identification of impending in-hospital cardiac arrest (IHCA) improves clinical outcomes but remains elusive for practicing clinicians.</p><p><strong>Objective: </strong>We aimed to develop a multimodal machine learning algorithm based on ensemble techniques to predict the occurrence of IHCA.</p><p><strong>Methods: </strong>Our model was developed by the Multiparameter Intelligent Monitoring of Intensive Care (MIMIC)-IV database and validated in the Electronic Intensive Care Unit Collaborative Research Database (eICU-CRD). Baseline features consisting of patient demographics, presenting illness, and comorbidities were collected to train a random forest model. Next, vital signs were extracted to train a long short-term memory model. A support vector machine algorithm then stacked the results to form the final prediction model.</p><p><strong>Results: </strong>Of 23,909 patients in the MIMIC-IV database and 10,049 patients in the eICU-CRD database, 452 and 85 patients, respectively, had IHCA. At 13 hours in advance of an IHCA event, our algorithm had already demonstrated an area under the receiver operating characteristic curve of 0.85 (95% CI 0.815-0.885) in the MIMIC-IV database. External validation with the eICU-CRD and National Taiwan University Hospital databases also presented satisfactory results, showing area under the receiver operating characteristic curve values of 0.81 (95% CI 0.763-0.851) and 0.945 (95% CI 0.934-0.956), respectively.</p><p><strong>Conclusions: </strong>Using only vital signs and information available in the electronic medical record, our model demonstrates it is possible to detect a trajectory of clinical deterioration up to 13 hours in advance. This predictive tool, which has undergone external validation, could forewarn and help clinicians identify patients in need of assessment to improve their overall prognosis.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11287234/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141763012","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Uncovering Harmonization Potential in Health Care Data Through Iterative Refinement of Fast Healthcare Interoperability Resources Profiles Based on Retrospective Discrepancy Analysis: Case Study. 基于回顾性差异分析,通过迭代完善快速医疗互操作性资源档案,发掘医疗数据的协调潜力:案例研究。
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-07-23 DOI: 10.2196/57005
Lorenz Rosenau, Paul Behrend, Joshua Wiedekopf, Julian Gruendner, Josef Ingenerf

Background: Cross-institutional interoperability between health care providers remains a recurring challenge worldwide. The German Medical Informatics Initiative, a collaboration of 37 university hospitals in Germany, aims to enable interoperability between partner sites by defining Fast Healthcare Interoperability Resources (FHIR) profiles for the cross-institutional exchange of health care data, the Core Data Set (CDS). The current CDS and its extension modules define elements representing patients' health care records. All university hospitals in Germany have made significant progress in providing routine data in a standardized format based on the CDS. In addition, the central research platform for health, the German Portal for Medical Research Data feasibility tool, allows medical researchers to query the available CDS data items across many participating hospitals.

Objective: In this study, we aimed to evaluate a novel approach of combining the current top-down generated FHIR profiles with the bottom-up generated knowledge gained by the analysis of respective instance data. This allowed us to derive options for iteratively refining FHIR profiles using the information obtained from a discrepancy analysis.

Methods: We developed an FHIR validation pipeline and opted to derive more restrictive profiles from the original CDS profiles. This decision was driven by the need to align more closely with the specific assumptions and requirements of the central feasibility platform's search ontology. While the original CDS profiles offer a generic framework adaptable for a broad spectrum of medical informatics use cases, they lack the specificity to model the nuanced criteria essential for medical researchers. A key example of this is the necessity to represent specific laboratory codings and values interdependencies accurately. The validation results allow us to identify discrepancies between the instance data at the clinical sites and the profiles specified by the feasibility platform and addressed in the future.

Results: A total of 20 university hospitals participated in this study. Historical factors, lack of harmonization, a wide range of source systems, and case sensitivity of coding are some of the causes for the discrepancies identified. While in our case study, Conditions, Procedures, and Medications have a high degree of uniformity in the coding of instance data due to legislative requirements for billing in Germany, we found that laboratory values pose a significant data harmonization challenge due to their interdependency between coding and value.

Conclusions: While the CDS achieves interoperability, different challenges for federated data access arise, requiring more specificity in the profiles to make assumptions on the instance data. We further argue that further harmonization of the instance data can significantly lower required

背景:医疗服务提供者之间的跨机构互操作性仍然是全球范围内经常面临的挑战。德国医疗信息学倡议是德国 37 家大学医院的合作项目,旨在通过定义用于跨机构交换医疗保健数据的快速医疗保健互操作性资源(FHIR)配置文件,即核心数据集(CDS),来实现合作医院之间的互操作性。当前的 CDS 及其扩展模块定义了代表患者医疗记录的元素。德国所有大学医院在以 CDS 为基础的标准化格式提供常规数据方面都取得了重大进展。此外,健康中央研究平台--德国医学研究数据门户网站可行性工具允许医学研究人员查询许多参与医院的可用 CDS 数据项:在这项研究中,我们旨在评估一种将当前自上而下生成的 FHIR 配置文件与通过分析各自实例数据获得的自下而上生成的知识相结合的新方法。这样,我们就能利用差异分析获得的信息,得出迭代完善 FHIR 配置文件的方案:我们开发了一个 FHIR 验证管道,并选择从原始 CDS 配置文件中提取限制性更强的配置文件。之所以做出这一决定,是因为需要更紧密地与中央可行性平台搜索本体的具体假设和要求保持一致。虽然原始 CDS 配置文件提供了一个通用框架,可适用于广泛的医学信息学用例,但它们缺乏具体性,无法模拟医学研究人员所必需的细微标准。这方面的一个重要例子就是必须准确地表示特定的实验室编码和值之间的相互依存关系。通过验证结果,我们可以发现临床站点的实例数据与可行性平台指定的配置文件之间的差异,并在今后加以解决:共有 20 家大学医院参与了这项研究。历史因素、缺乏统一、源系统范围广以及编码的病例敏感性是造成差异的部分原因。在我们的案例研究中,由于德国计费的立法要求,条件、程序和药物在实例数据编码方面具有高度的统一性,但我们发现,由于编码和价值之间的相互依存关系,实验室价值对数据协调构成了重大挑战:结论:虽然 CDS 实现了互操作性,但联合数据访问也面临着不同的挑战,需要更具体的配置文件才能对实例数据做出假设。我们还认为,进一步协调实例数据可以大大减少所需的追溯协调工作。我们认识到,差异不可能仅在临床现场得到解决;因此,我们的研究结果具有广泛的影响,需要各利益相关方在多个层面采取行动。
{"title":"Uncovering Harmonization Potential in Health Care Data Through Iterative Refinement of Fast Healthcare Interoperability Resources Profiles Based on Retrospective Discrepancy Analysis: Case Study.","authors":"Lorenz Rosenau, Paul Behrend, Joshua Wiedekopf, Julian Gruendner, Josef Ingenerf","doi":"10.2196/57005","DOIUrl":"10.2196/57005","url":null,"abstract":"<p><strong>Background: </strong>Cross-institutional interoperability between health care providers remains a recurring challenge worldwide. The German Medical Informatics Initiative, a collaboration of 37 university hospitals in Germany, aims to enable interoperability between partner sites by defining Fast Healthcare Interoperability Resources (FHIR) profiles for the cross-institutional exchange of health care data, the Core Data Set (CDS). The current CDS and its extension modules define elements representing patients' health care records. All university hospitals in Germany have made significant progress in providing routine data in a standardized format based on the CDS. In addition, the central research platform for health, the German Portal for Medical Research Data feasibility tool, allows medical researchers to query the available CDS data items across many participating hospitals.</p><p><strong>Objective: </strong>In this study, we aimed to evaluate a novel approach of combining the current top-down generated FHIR profiles with the bottom-up generated knowledge gained by the analysis of respective instance data. This allowed us to derive options for iteratively refining FHIR profiles using the information obtained from a discrepancy analysis.</p><p><strong>Methods: </strong>We developed an FHIR validation pipeline and opted to derive more restrictive profiles from the original CDS profiles. This decision was driven by the need to align more closely with the specific assumptions and requirements of the central feasibility platform's search ontology. While the original CDS profiles offer a generic framework adaptable for a broad spectrum of medical informatics use cases, they lack the specificity to model the nuanced criteria essential for medical researchers. A key example of this is the necessity to represent specific laboratory codings and values interdependencies accurately. The validation results allow us to identify discrepancies between the instance data at the clinical sites and the profiles specified by the feasibility platform and addressed in the future.</p><p><strong>Results: </strong>A total of 20 university hospitals participated in this study. Historical factors, lack of harmonization, a wide range of source systems, and case sensitivity of coding are some of the causes for the discrepancies identified. While in our case study, Conditions, Procedures, and Medications have a high degree of uniformity in the coding of instance data due to legislative requirements for billing in Germany, we found that laboratory values pose a significant data harmonization challenge due to their interdependency between coding and value.</p><p><strong>Conclusions: </strong>While the CDS achieves interoperability, different challenges for federated data access arise, requiring more specificity in the profiles to make assumptions on the instance data. We further argue that further harmonization of the instance data can significantly lower required","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11303887/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141749840","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Construction of a Multi-Label Classifier for Extracting Multiple Incident Factors From Medication Incident Reports in Residential Care Facilities: Natural Language Processing Approach. 从养老院用药事故报告中提取多种事故因素的多标签分类器的构建:自然语言处理方法
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-07-23 DOI: 10.2196/58141
Hayato Kizaki, Hiroki Satoh, Sayaka Ebara, Satoshi Watabe, Yasufumi Sawada, Shungo Imai, Satoko Hori

Background: Medication safety in residential care facilities is a critical concern, particularly when nonmedical staff provide medication assistance. The complex nature of medication-related incidents in these settings, coupled with the psychological impact on health care providers, underscores the need for effective incident analysis and preventive strategies. A thorough understanding of the root causes, typically through incident-report analysis, is essential for mitigating medication-related incidents.

Objective: We aimed to develop and evaluate a multilabel classifier using natural language processing to identify factors contributing to medication-related incidents using incident report descriptions from residential care facilities, with a focus on incidents involving nonmedical staff.

Methods: We analyzed 2143 incident reports, comprising 7121 sentences, from residential care facilities in Japan between April 1, 2015, and March 31, 2016. The incident factors were annotated using sentences based on an established organizational factor model and previous research findings. The following 9 factors were defined: procedure adherence, medicine, resident, resident family, nonmedical staff, medical staff, team, environment, and organizational management. To assess the label criteria, 2 researchers with relevant medical knowledge annotated a subset of 50 reports; the interannotator agreement was measured using Cohen κ. The entire data set was subsequently annotated by 1 researcher. Multiple labels were assigned to each sentence. A multilabel classifier was developed using deep learning models, including 2 Bidirectional Encoder Representations From Transformers (BERT)-type models (Tohoku-BERT and a University of Tokyo Hospital BERT pretrained with Japanese clinical text: UTH-BERT) and an Efficiently Learning Encoder That Classifies Token Replacements Accurately (ELECTRA), pretrained on Japanese text. Both sentence- and report-level training were performed; the performance was evaluated by the F1-score and exact match accuracy through 5-fold cross-validation.

Results: Among all 7121 sentences, 1167, 694, 2455, 23, 1905, 46, 195, 1104, and 195 included "procedure adherence," "medicine," "resident," "resident family," "nonmedical staff," "medical staff," "team," "environment," and "organizational management," respectively. Owing to limited labels, "resident family" and "medical staff" were omitted from the model development process. The interannotator agreement values were higher than 0.6 for each label. A total of 10, 278, and 1855 reports contained no, 1, and multiple labels, respectively. The models trained using the report data outperformed those trained using sentences, with macro F1-scores of 0.744, 0.675, and 0.735 for Tohoku-BERT, UTH-BERT, and ELECTRA, respectively. The report-trained models also demonstrated better exact match accuracy

背景:住院护理设施中的用药安全是一个至关重要的问题,尤其是在非医务人员提供用药协助的情况下。在这些环境中,与用药相关的事故性质复杂,加上对医护人员的心理影响,凸显了有效的事故分析和预防策略的必要性。通常情况下,通过事故报告分析来透彻了解根本原因,对于减少用药相关事故至关重要:我们的目标是开发并评估一种使用自然语言处理的多标签分类器,该分类器可利用寄宿式护理机构的事故报告描述识别导致药物相关事故的因素,重点关注涉及非医务人员的事故:我们分析了 2015 年 4 月 1 日至 2016 年 3 月 31 日期间来自日本养老机构的 2143 份事故报告,其中包含 7121 个句子。根据已建立的组织因素模型和以往的研究成果,使用句子对事件因素进行了注释。定义了以下 9 个因素:程序遵守、医疗、居民、居民家庭、非医疗人员、医疗人员、团队、环境和组织管理。为了评估标注标准,两名具有相关医学知识的研究人员对 50 份报告的子集进行了标注;标注者之间的一致性采用 Cohen κ 进行测量。随后,由一名研究人员对整个数据集进行标注。每个句子都有多个标签。使用深度学习模型开发了多标签分类器,其中包括 2 个双向编码器表征转换器(BERT)型模型(Tohoku-BERT 和东京大学医院 BERT,使用日语临床文本进行预训练:UTH-BERT)和以日语文本为基础进行预训练的 "可准确分类标记替换的高效学习编码器"(ELECTRA)。对句子和报告进行了训练;通过 5 倍交叉验证,以 F1 分数和精确匹配准确率来评估性能:在所有 7121 个句子中,分别有 1167、694、2455、23、1905、46、195、1104 和 195 个句子包含 "遵守程序"、"医学"、"居民"、"居民家庭"、"非医务人员"、"医务人员"、"团队"、"环境 "和 "组织管理"。由于标签有限,"居民家庭 "和 "医务人员 "在模型开发过程中被省略。每个标签的注释者间一致值均高于 0.6。分别有 10 份、278 份和 1855 份报告没有、1 份和多个标签。使用报告数据训练的模型优于使用句子训练的模型,Tohoku-BERT、UTH-BERT 和 ELECTRA 的宏观 F1 分数分别为 0.744、0.675 和 0.735。经过报告训练的模型也表现出更高的精确匹配准确率,Tohoku-BERT、UTH-BERT 和 ELECTRA 的准确匹配准确率分别为 0.411、0.389 和 0.399。值得注意的是,即使只分析包含多个标签的报告,准确率也是一致的:我们在研究中开发的多标签分类器证明了它在利用养老院的事故报告识别与用药相关事故有关的各种因素方面的潜力。因此,该分类器可促进对事故因素的及时分析,从而有助于风险管理和预防策略的制定。
{"title":"Construction of a Multi-Label Classifier for Extracting Multiple Incident Factors From Medication Incident Reports in Residential Care Facilities: Natural Language Processing Approach.","authors":"Hayato Kizaki, Hiroki Satoh, Sayaka Ebara, Satoshi Watabe, Yasufumi Sawada, Shungo Imai, Satoko Hori","doi":"10.2196/58141","DOIUrl":"10.2196/58141","url":null,"abstract":"<p><strong>Background: </strong>Medication safety in residential care facilities is a critical concern, particularly when nonmedical staff provide medication assistance. The complex nature of medication-related incidents in these settings, coupled with the psychological impact on health care providers, underscores the need for effective incident analysis and preventive strategies. A thorough understanding of the root causes, typically through incident-report analysis, is essential for mitigating medication-related incidents.</p><p><strong>Objective: </strong>We aimed to develop and evaluate a multilabel classifier using natural language processing to identify factors contributing to medication-related incidents using incident report descriptions from residential care facilities, with a focus on incidents involving nonmedical staff.</p><p><strong>Methods: </strong>We analyzed 2143 incident reports, comprising 7121 sentences, from residential care facilities in Japan between April 1, 2015, and March 31, 2016. The incident factors were annotated using sentences based on an established organizational factor model and previous research findings. The following 9 factors were defined: procedure adherence, medicine, resident, resident family, nonmedical staff, medical staff, team, environment, and organizational management. To assess the label criteria, 2 researchers with relevant medical knowledge annotated a subset of 50 reports; the interannotator agreement was measured using Cohen κ. The entire data set was subsequently annotated by 1 researcher. Multiple labels were assigned to each sentence. A multilabel classifier was developed using deep learning models, including 2 Bidirectional Encoder Representations From Transformers (BERT)-type models (Tohoku-BERT and a University of Tokyo Hospital BERT pretrained with Japanese clinical text: UTH-BERT) and an Efficiently Learning Encoder That Classifies Token Replacements Accurately (ELECTRA), pretrained on Japanese text. Both sentence- and report-level training were performed; the performance was evaluated by the F<sub>1</sub>-score and exact match accuracy through 5-fold cross-validation.</p><p><strong>Results: </strong>Among all 7121 sentences, 1167, 694, 2455, 23, 1905, 46, 195, 1104, and 195 included \"procedure adherence,\" \"medicine,\" \"resident,\" \"resident family,\" \"nonmedical staff,\" \"medical staff,\" \"team,\" \"environment,\" and \"organizational management,\" respectively. Owing to limited labels, \"resident family\" and \"medical staff\" were omitted from the model development process. The interannotator agreement values were higher than 0.6 for each label. A total of 10, 278, and 1855 reports contained no, 1, and multiple labels, respectively. The models trained using the report data outperformed those trained using sentences, with macro F<sub>1</sub>-scores of 0.744, 0.675, and 0.735 for Tohoku-BERT, UTH-BERT, and ELECTRA, respectively. The report-trained models also demonstrated better exact match accuracy","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11303886/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141749839","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
JMIR Medical Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1