首页 > 最新文献

JMIR Medical Informatics最新文献

英文 中文
Advancing Accuracy in Multimodal Medical Tasks Through Bootstrapped Language-Image Pretraining (BioMedBLIP): Performance Evaluation Study. 通过引导式语言图像预训练(BioMedBLIP)提高多模态医疗任务的准确性:性能评估研究。
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-08-05 DOI: 10.2196/56627
Usman Naseem, Surendrabikram Thapa, Anum Masood

Background: Medical image analysis, particularly in the context of visual question answering (VQA) and image captioning, is crucial for accurate diagnosis and educational purposes.

Objective: Our study aims to introduce BioMedBLIP models, fine-tuned for VQA tasks using specialized medical data sets such as Radiology Objects in Context and Medical Information Mart for Intensive Care-Chest X-ray, and evaluate their performance in comparison to the state of the art (SOTA) original Bootstrapping Language-Image Pretraining (BLIP) model.

Methods: We present 9 versions of BioMedBLIP across 3 downstream tasks in various data sets. The models are trained on a varying number of epochs. The findings indicate the strong overall performance of our models. We proposed BioMedBLIP for the VQA generation model, VQA classification model, and BioMedBLIP image caption model. We conducted pretraining in BLIP using medical data sets, producing an adapted BLIP model tailored for medical applications.

Results: In VQA generation tasks, BioMedBLIP models outperformed the SOTA on the Semantically-Labeled Knowledge-Enhanced (SLAKE) data set, VQA in Radiology (VQA-RAD), and Image Cross-Language Evaluation Forum data sets. In VQA classification, our models consistently surpassed the SOTA on the SLAKE data set. Our models also showed competitive performance on the VQA-RAD and PathVQA data sets. Similarly, in image captioning tasks, our model beat the SOTA, suggesting the importance of pretraining with medical data sets. Overall, in 20 different data sets and task combinations, our BioMedBLIP excelled in 15 (75%) out of 20 tasks. BioMedBLIP represents a new SOTA in 15 (75%) out of 20 tasks, and our responses were rated higher in all 20 tasks (P<.005) in comparison to SOTA models.

Conclusions: Our BioMedBLIP models show promising performance and suggest that incorporating medical knowledge through pretraining with domain-specific medical data sets helps models achieve higher performance. Our models thus demonstrate their potential to advance medical image analysis, impacting diagnosis, medical education, and research. However, data quality, task-specific variability, computational resources, and ethical considerations should be carefully addressed. In conclusion, our models represent a contribution toward the synergy of artificial intelligence and medicine. We have made BioMedBLIP freely available, which will help in further advancing research in multimodal medical tasks.

背景:医学图像分析,尤其是在视觉问题解答(VQA)和图像字幕方面,对于准确诊断和教育目的至关重要:我们的研究旨在引入 BioMedBLIP 模型,该模型针对 VQA 任务使用专业医疗数据集(如放射学上下文对象和重症监护-胸透医疗信息市场)进行了微调,并与最新技术(SOTA)的原始引导语言-图像预训练(BLIP)模型进行了性能评估:方法:我们在各种数据集中展示了 9 个版本的 BioMedBLIP,涉及 3 个下游任务。这些模型在不同数量的epochs上进行了训练。研究结果表明,我们的模型具有很强的整体性能。我们为 VQA 生成模型、VQA 分类模型和 BioMedBLIP 图像标题模型提出了 BioMedBLIP。我们使用医疗数据集对BLIP进行了预训练,生成了一个为医疗应用量身定制的BLIP模型:在VQA生成任务中,BioMedBLIP模型在语义标注知识增强(SLAKE)数据集、放射学VQA(VQA-RAD)和图像跨语言评估论坛数据集上的表现优于SOTA。在 VQA 分类中,我们的模型在 SLAKE 数据集上的表现一直超过了 SOTA。我们的模型在 VQA-RAD 和 PathVQA 数据集上的表现也很有竞争力。同样,在图像标题任务中,我们的模型也击败了 SOTA,这表明使用医疗数据集进行预训练的重要性。总之,在 20 个不同的数据集和任务组合中,我们的 BioMedBLIP 在 15 个任务(75%)中表现出色。在 20 项任务中,BioMedBLIP 在 15 项(75%)任务中代表了一种新的 SOTA,而且我们的回答在所有 20 项任务中的评分都更高(PConclusions:我们的 BioMedBLIP 模型表现出了良好的性能,并表明通过使用特定领域的医疗数据集进行预训练来融入医疗知识有助于模型获得更高的性能。因此,我们的模型证明了其在推进医学图像分析、影响诊断、医学教育和研究方面的潜力。然而,数据质量、特定任务的可变性、计算资源和伦理方面的考虑因素都应谨慎处理。总之,我们的模型为人工智能与医学的协同发展做出了贡献。我们免费提供了 BioMedBLIP,这将有助于进一步推动多模态医疗任务的研究。
{"title":"Advancing Accuracy in Multimodal Medical Tasks Through Bootstrapped Language-Image Pretraining (BioMedBLIP): Performance Evaluation Study.","authors":"Usman Naseem, Surendrabikram Thapa, Anum Masood","doi":"10.2196/56627","DOIUrl":"10.2196/56627","url":null,"abstract":"<p><strong>Background: </strong>Medical image analysis, particularly in the context of visual question answering (VQA) and image captioning, is crucial for accurate diagnosis and educational purposes.</p><p><strong>Objective: </strong>Our study aims to introduce BioMedBLIP models, fine-tuned for VQA tasks using specialized medical data sets such as Radiology Objects in Context and Medical Information Mart for Intensive Care-Chest X-ray, and evaluate their performance in comparison to the state of the art (SOTA) original Bootstrapping Language-Image Pretraining (BLIP) model.</p><p><strong>Methods: </strong>We present 9 versions of BioMedBLIP across 3 downstream tasks in various data sets. The models are trained on a varying number of epochs. The findings indicate the strong overall performance of our models. We proposed BioMedBLIP for the VQA generation model, VQA classification model, and BioMedBLIP image caption model. We conducted pretraining in BLIP using medical data sets, producing an adapted BLIP model tailored for medical applications.</p><p><strong>Results: </strong>In VQA generation tasks, BioMedBLIP models outperformed the SOTA on the Semantically-Labeled Knowledge-Enhanced (SLAKE) data set, VQA in Radiology (VQA-RAD), and Image Cross-Language Evaluation Forum data sets. In VQA classification, our models consistently surpassed the SOTA on the SLAKE data set. Our models also showed competitive performance on the VQA-RAD and PathVQA data sets. Similarly, in image captioning tasks, our model beat the SOTA, suggesting the importance of pretraining with medical data sets. Overall, in 20 different data sets and task combinations, our BioMedBLIP excelled in 15 (75%) out of 20 tasks. BioMedBLIP represents a new SOTA in 15 (75%) out of 20 tasks, and our responses were rated higher in all 20 tasks (P<.005) in comparison to SOTA models.</p><p><strong>Conclusions: </strong>Our BioMedBLIP models show promising performance and suggest that incorporating medical knowledge through pretraining with domain-specific medical data sets helps models achieve higher performance. Our models thus demonstrate their potential to advance medical image analysis, impacting diagnosis, medical education, and research. However, data quality, task-specific variability, computational resources, and ethical considerations should be carefully addressed. In conclusion, our models represent a contribution toward the synergy of artificial intelligence and medicine. We have made BioMedBLIP freely available, which will help in further advancing research in multimodal medical tasks.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"12 ","pages":"e56627"},"PeriodicalIF":3.1,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11333867/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141891171","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Research on Traditional Chinese Medicine: Domain Knowledge Graph Completion and Quality Evaluation. 中医药研究:领域知识图谱的完成与质量评价。
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-08-02 DOI: 10.2196/55090
Chang Liu, Zhan Li, Jianmin Li, Yiqian Qu, Ying Chang, Qing Han, Lingyong Cao, Shuyuan Lin

Background: Knowledge graphs (KGs) can integrate domain knowledge into a traditional Chinese medicine (TCM) intelligent syndrome differentiation model. However, the quality of current KGs in the TCM domain varies greatly, related to the lack of knowledge graph completion (KGC) and evaluation methods.

Objective: This study aims to investigate KGC and evaluation methods tailored for TCM domain knowledge.

Methods: In the KGC phase, according to the characteristics of TCM domain knowledge, we proposed a 3-step "entity-ontology-path" completion approach. This approach uses path reasoning, ontology rule reasoning, and association rules. In the KGC quality evaluation phase, we proposed a 3-dimensional evaluation framework that encompasses completeness, accuracy, and usability, using quantitative metrics such as complex network analysis, ontology reasoning, and graph representation. Furthermore, we compared the impact of different graph representation models on KG usability.

Results: In the KGC phase, 52, 107, 27, and 479 triples were added by outlier analysis, rule-based reasoning, association rules, and path-based reasoning, respectively. In addition, rule-based reasoning identified 14 contradictory triples. In the KGC quality evaluation phase, in terms of completeness, KG had higher density and lower sparsity after completion, and there were no contradictory rules within the KG. In terms of accuracy, KG after completion was more consistent with prior knowledge. In terms of usability, the mean reciprocal ranking, mean rank, and hit rate of the first N tail entities predicted by the model (Hits@N) of the TransE, RotatE, DistMult, and ComplEx graph representation models all showed improvement after KGC. Among them, the RotatE model achieved the best representation.

Conclusions: The 3-step completion approach can effectively improve the completeness, accuracy, and availability of KGs, and the 3-dimensional evaluation framework can be used for comprehensive KGC evaluation. In the TCM field, the RotatE model performed better at KG representation.

背景:知识图谱(KGs)可以将领域知识整合到中医(TCM)智能综合征分型模型中。然而,目前中医领域的知识图谱质量参差不齐,这与缺乏知识图谱完善(KGC)和评估方法有关:本研究旨在探讨针对中医领域知识的 KGC 和评估方法:在 KGC 阶段,根据中医药领域知识的特点,我们提出了 "实体-本体-路径 "三步完成法。该方法使用了路径推理、本体规则推理和关联规则。在 KGC 质量评估阶段,我们提出了一个包括完整性、准确性和可用性的三维评估框架,并使用了复杂网络分析、本体推理和图表示等定量指标。此外,我们还比较了不同图形表示模型对 KG 可用性的影响:在 KGC 阶段,通过离群点分析、基于规则的推理、关联规则和基于路径的推理,分别添加了 52、107、27 和 479 个三元组。此外,基于规则的推理还发现了 14 个相互矛盾的三元组。在 KGC 质量评估阶段,就完备性而言,完成后的 KG 密度较高,稀疏性较低,而且 KG 中没有相互矛盾的规则。在准确性方面,完成后的 KG 与先前的知识更加一致。在可用性方面,TransE、RotatE、DistMult 和 ComplEx 图表示模型预测的前 N 个尾部实体的平均倒数排序、平均排序和命中率(Hits@N)在 KGC 后都有所改善。结论:结论:三步完成法能有效提高 KG 的完整性、准确性和可用性,三维评价框架可用于全面的 KGC 评价。在中医领域,RotatE 模型的 KG 表示性能更好。
{"title":"Research on Traditional Chinese Medicine: Domain Knowledge Graph Completion and Quality Evaluation.","authors":"Chang Liu, Zhan Li, Jianmin Li, Yiqian Qu, Ying Chang, Qing Han, Lingyong Cao, Shuyuan Lin","doi":"10.2196/55090","DOIUrl":"10.2196/55090","url":null,"abstract":"<p><strong>Background: </strong>Knowledge graphs (KGs) can integrate domain knowledge into a traditional Chinese medicine (TCM) intelligent syndrome differentiation model. However, the quality of current KGs in the TCM domain varies greatly, related to the lack of knowledge graph completion (KGC) and evaluation methods.</p><p><strong>Objective: </strong>This study aims to investigate KGC and evaluation methods tailored for TCM domain knowledge.</p><p><strong>Methods: </strong>In the KGC phase, according to the characteristics of TCM domain knowledge, we proposed a 3-step \"entity-ontology-path\" completion approach. This approach uses path reasoning, ontology rule reasoning, and association rules. In the KGC quality evaluation phase, we proposed a 3-dimensional evaluation framework that encompasses completeness, accuracy, and usability, using quantitative metrics such as complex network analysis, ontology reasoning, and graph representation. Furthermore, we compared the impact of different graph representation models on KG usability.</p><p><strong>Results: </strong>In the KGC phase, 52, 107, 27, and 479 triples were added by outlier analysis, rule-based reasoning, association rules, and path-based reasoning, respectively. In addition, rule-based reasoning identified 14 contradictory triples. In the KGC quality evaluation phase, in terms of completeness, KG had higher density and lower sparsity after completion, and there were no contradictory rules within the KG. In terms of accuracy, KG after completion was more consistent with prior knowledge. In terms of usability, the mean reciprocal ranking, mean rank, and hit rate of the first N tail entities predicted by the model (Hits@N) of the TransE, RotatE, DistMult, and ComplEx graph representation models all showed improvement after KGC. Among them, the RotatE model achieved the best representation.</p><p><strong>Conclusions: </strong>The 3-step completion approach can effectively improve the completeness, accuracy, and availability of KGs, and the 3-dimensional evaluation framework can be used for comprehensive KGC evaluation. In the TCM field, the RotatE model performed better at KG representation.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"12 ","pages":"e55090"},"PeriodicalIF":3.1,"publicationDate":"2024-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11329848/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141879923","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reference Hallucination Score for Medical Artificial Intelligence Chatbots: Development and Usability Study. 医疗人工智能聊天机器人的参考幻觉评分:开发与可用性研究
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-07-31 DOI: 10.2196/54345
Fadi Aljamaan, Mohamad-Hani Temsah, Ibraheem Altamimi, Ayman Al-Eyadhy, Amr Jamal, Khalid Alhasan, Tamer A Mesallam, Mohamed Farahat, Khalid H Malki

Background: Artificial intelligence (AI) chatbots have recently gained use in medical practice by health care practitioners. Interestingly, the output of these AI chatbots was found to have varying degrees of hallucination in content and references. Such hallucinations generate doubts about their output and their implementation.

Objective: The aim of our study was to propose a reference hallucination score (RHS) to evaluate the authenticity of AI chatbots' citations.

Methods: Six AI chatbots were challenged with the same 10 medical prompts, requesting 10 references per prompt. The RHS is composed of 6 bibliographic items and the reference's relevance to prompts' keywords. RHS was calculated for each reference, prompt, and type of prompt (basic vs complex). The average RHS was calculated for each AI chatbot and compared across the different types of prompts and AI chatbots.

Results: Bard failed to generate any references. ChatGPT 3.5 and Bing generated the highest RHS (score=11), while Elicit and SciSpace generated the lowest RHS (score=1), and Perplexity generated a middle RHS (score=7). The highest degree of hallucination was observed for reference relevancy to the prompt keywords (308/500, 61.6%), while the lowest was for reference titles (169/500, 33.8%). ChatGPT and Bing had comparable RHS (β coefficient=-0.069; P=.32), while Perplexity had significantly lower RHS than ChatGPT (β coefficient=-0.345; P<.001). AI chatbots generally had significantly higher RHS when prompted with scenarios or complex format prompts (β coefficient=0.486; P<.001).

Conclusions: The variation in RHS underscores the necessity for a robust reference evaluation tool to improve the authenticity of AI chatbots. Further, the variations highlight the importance of verifying their output and citations. Elicit and SciSpace had negligible hallucination, while ChatGPT and Bing had critical hallucination levels. The proposed AI chatbots' RHS could contribute to ongoing efforts to enhance AI's general reliability in medical research.

背景介绍人工智能(AI)聊天机器人最近开始被医疗从业人员用于医疗实践。有趣的是,人们发现这些人工智能聊天机器人的输出内容和参考资料存在不同程度的幻觉。这种幻觉让人对它们的输出和实施产生怀疑:我们的研究旨在提出参考幻觉评分(RHS),以评估人工智能聊天机器人引文的真实性:方法:六个人工智能聊天机器人接受了同样的 10 条医学提示,每条提示要求 10 篇参考文献。RHS由6个书目项目和参考文献与提示关键词的相关性组成。针对每个参考文献、提示和提示类型(基本与复杂)计算 RHS。计算每个人工智能聊天机器人的平均 RHS,并对不同类型的提示和人工智能聊天机器人进行比较:Bard 未能生成任何引用。ChatGPT 3.5 和 Bing 生成了最高的 RHS(得分=11),而 Elicit 和 SciSpace 生成了最低的 RHS(得分=1),Perplexity 生成了中等的 RHS(得分=7)。参考文献与提示关键词相关性的幻觉程度最高(308/500,61.6%),而参考文献标题的幻觉程度最低(169/500,33.8%)。ChatGPT 和 Bing 的 RHS 相当(β 系数=-0.069;P=.32),而 Perplexity 的 RHS 明显低于 ChatGPT(β 系数=-0.345;PC 结论:RHS的差异突出表明,有必要使用可靠的参考评估工具来提高人工智能聊天机器人的真实性。此外,这些差异还凸显了验证其产出和引用的重要性。Elicit 和 SciSpace 的幻觉几乎可以忽略不计,而 ChatGPT 和 Bing 的幻觉则达到了临界水平。所建议的人工智能聊天机器人的 RHS 可以为当前提高人工智能在医学研究中的总体可靠性做出贡献。
{"title":"Reference Hallucination Score for Medical Artificial Intelligence Chatbots: Development and Usability Study.","authors":"Fadi Aljamaan, Mohamad-Hani Temsah, Ibraheem Altamimi, Ayman Al-Eyadhy, Amr Jamal, Khalid Alhasan, Tamer A Mesallam, Mohamed Farahat, Khalid H Malki","doi":"10.2196/54345","DOIUrl":"10.2196/54345","url":null,"abstract":"<p><strong>Background: </strong>Artificial intelligence (AI) chatbots have recently gained use in medical practice by health care practitioners. Interestingly, the output of these AI chatbots was found to have varying degrees of hallucination in content and references. Such hallucinations generate doubts about their output and their implementation.</p><p><strong>Objective: </strong>The aim of our study was to propose a reference hallucination score (RHS) to evaluate the authenticity of AI chatbots' citations.</p><p><strong>Methods: </strong>Six AI chatbots were challenged with the same 10 medical prompts, requesting 10 references per prompt. The RHS is composed of 6 bibliographic items and the reference's relevance to prompts' keywords. RHS was calculated for each reference, prompt, and type of prompt (basic vs complex). The average RHS was calculated for each AI chatbot and compared across the different types of prompts and AI chatbots.</p><p><strong>Results: </strong>Bard failed to generate any references. ChatGPT 3.5 and Bing generated the highest RHS (score=11), while Elicit and SciSpace generated the lowest RHS (score=1), and Perplexity generated a middle RHS (score=7). The highest degree of hallucination was observed for reference relevancy to the prompt keywords (308/500, 61.6%), while the lowest was for reference titles (169/500, 33.8%). ChatGPT and Bing had comparable RHS (β coefficient=-0.069; P=.32), while Perplexity had significantly lower RHS than ChatGPT (β coefficient=-0.345; P<.001). AI chatbots generally had significantly higher RHS when prompted with scenarios or complex format prompts (β coefficient=0.486; P<.001).</p><p><strong>Conclusions: </strong>The variation in RHS underscores the necessity for a robust reference evaluation tool to improve the authenticity of AI chatbots. Further, the variations highlight the importance of verifying their output and citations. Elicit and SciSpace had negligible hallucination, while ChatGPT and Bing had critical hallucination levels. The proposed AI chatbots' RHS could contribute to ongoing efforts to enhance AI's general reliability in medical research.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"12 ","pages":"e54345"},"PeriodicalIF":3.1,"publicationDate":"2024-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11325115/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141861823","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unsupervised Feature Selection to Identify Important ICD-10 and ATC Codes for Machine Learning on a Cohort of Patients With Coronary Heart Disease: Retrospective Study 通过无监督特征选择来识别重要的 ICD-10 和 ATC 编码,以便对冠心病患者队列进行机器学习:回顾性研究
IF 3.2 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-07-26 DOI: 10.2196/52896
Peyman Ghasemi, Joon Lee
Background: The application of machine learning in healthcare often necessitates the use of hierarchical codes such as the International Classification of Diseases (ICD) and Anatomical Therapeutic Chemical (ATC) systems. These codes classify diseases and medications respectively, thereby forming extensive data dimensions. Unsupervised feature selection tackles the "curse of dimensionality" and helps to improve the accuracy and performance of supervised learning models by reducing the number of irrelevant or redundant features and avoiding overfitting. Techniques for unsupervised feature selection, such as filter, wrapper, and embedded methods, are implemented to select the most important features with the most intrinsic information. However, they face challenges due to the sheer volume of ICD/ATC codes and the hierarchical structures of these systems. Objective: The objective of this study was to compare several unsupervised feature selection methods for ICD and ATC code databases of coronary artery disease patients in different aspects of performance and complexity and select the best set of features representing these patients. Methods: We compared several unsupervised feature selection methods for two ICD and one ATC code databases of 51,506 coronary artery disease patients in Alberta, Canada. Specifically, we employed Laplacian Score, Unsupervised Feature Selection for Multi-Cluster Data, Autoencoder Inspired Unsupervised Feature Selection, Principal Feature Analysis, and Concrete Autoencoders with and without ICD/ATC tree weight adjustment to select the 100 best features from over 9,000 ICD and 2,000 ATC codes. We assessed the selected features based on their ability to reconstruct the initial feature space and predict 90-day mortality following discharge. We also compared the complexity of selected features by mean code level in ICD/ATC tree and the interpretability of the features in the mortality prediction task using Shapley analysis. Results: In feature space reconstruction and mortality prediction, the Concrete Autoencoder-based methods outperformed other techniques. A weight-adjusted Concrete Autoencoder variant, particularly, demonstrated improved reconstruction accuracy and significant predictive performance enhancement, confirmed by DeLong's and McNemar's tests (P<.05). Concrete Autoencoders preferred more general codes and they consistently reconstructed all features accurately. Additionally, features selected by weight-adjusted Concrete Autoencoders yielded higher Shapley values in mortality prediction compared to most alternatives. Conclusions: This study scrutinized five feature selection methods in ICD/ATC code datasets in an unsupervised context. Our findings underscore the superiority of the Concrete Autoencoder method in selecting salient features that represent the entire dataset, offering a potential asset for subsequent machine learning research. We also present a novel weight adjustment approach for the Concrete Autoenc
背景:机器学习在医疗保健领域的应用通常需要使用分级代码,如国际疾病分类(ICD)和解剖治疗化学(ATC)系统。这些代码分别对疾病和药物进行分类,从而形成了广泛的数据维度。无监督特征选择可以解决 "维度诅咒 "问题,通过减少无关或冗余特征的数量,避免过度拟合,从而帮助提高有监督学习模型的准确性和性能。无监督特征选择技术,如过滤法、包装法和嵌入法等,都是为了选择具有最多内在信息的最重要特征。然而,由于 ICD/ATC 代码的庞大数量和这些系统的分层结构,这些技术面临着挑战。研究目的本研究的目的是比较几种针对冠心病患者 ICD 和 ATC 代码数据库的无监督特征选择方法在不同方面的性能和复杂性,并选出代表这些患者的最佳特征集。方法:我们针对加拿大艾伯塔省 51,506 名冠心病患者的两个 ICD 和一个 ATC 代码数据库,比较了几种无监督特征选择方法。具体来说,我们采用了拉普拉卡方评分法、多集群数据无监督特征选择法、自动编码器启发无监督特征选择法、主特征分析法以及带有或不带有 ICD/ATC 树权重调整的混凝土自动编码器,从 9000 多个 ICD 和 2000 多个 ATC 代码中选出了 100 个最佳特征。我们根据所选特征重建初始特征空间和预测出院后 90 天死亡率的能力对其进行了评估。我们还通过 ICD/ATC 树中的平均代码级别比较了所选特征的复杂性,并使用 Shapley 分析比较了死亡率预测任务中特征的可解释性。结果:在特征空间重建和死亡率预测方面,基于具体自动编码器的方法优于其他技术。特别是经过权重调整的混凝土自动编码器变体,其重建准确性得到了提高,预测性能也有显著增强,这一点已通过 DeLong 检验和 McNemar 检验得到证实(P<.05)。具体自动编码器更倾向于使用更通用的代码,而且它们始终能准确地重建所有特征。此外,与大多数替代方法相比,权重调整后的具体自动编码器选择的特征在死亡率预测中产生了更高的 Shapley 值。结论本研究在无监督的情况下仔细研究了 ICD/ATC 代码数据集中的五种特征选择方法。我们的研究结果强调了混凝土自动编码器方法在选择代表整个数据集的突出特征方面的优越性,为后续的机器学习研究提供了潜在的资产。我们还针对 ICD/ATC 代码数据集提出了一种新颖的具体自动编码器权重调整方法,以增强所选特征的通用性和可解释性。
{"title":"Unsupervised Feature Selection to Identify Important ICD-10 and ATC Codes for Machine Learning on a Cohort of Patients With Coronary Heart Disease: Retrospective Study","authors":"Peyman Ghasemi, Joon Lee","doi":"10.2196/52896","DOIUrl":"https://doi.org/10.2196/52896","url":null,"abstract":"Background: The application of machine learning in healthcare often necessitates the use of hierarchical codes such as the International Classification of Diseases (ICD) and Anatomical Therapeutic Chemical (ATC) systems. These codes classify diseases and medications respectively, thereby forming extensive data dimensions. Unsupervised feature selection tackles the \"curse of dimensionality\" and helps to improve the accuracy and performance of supervised learning models by reducing the number of irrelevant or redundant features and avoiding overfitting. Techniques for unsupervised feature selection, such as filter, wrapper, and embedded methods, are implemented to select the most important features with the most intrinsic information. However, they face challenges due to the sheer volume of ICD/ATC codes and the hierarchical structures of these systems. Objective: The objective of this study was to compare several unsupervised feature selection methods for ICD and ATC code databases of coronary artery disease patients in different aspects of performance and complexity and select the best set of features representing these patients. Methods: We compared several unsupervised feature selection methods for two ICD and one ATC code databases of 51,506 coronary artery disease patients in Alberta, Canada. Specifically, we employed Laplacian Score, Unsupervised Feature Selection for Multi-Cluster Data, Autoencoder Inspired Unsupervised Feature Selection, Principal Feature Analysis, and Concrete Autoencoders with and without ICD/ATC tree weight adjustment to select the 100 best features from over 9,000 ICD and 2,000 ATC codes. We assessed the selected features based on their ability to reconstruct the initial feature space and predict 90-day mortality following discharge. We also compared the complexity of selected features by mean code level in ICD/ATC tree and the interpretability of the features in the mortality prediction task using Shapley analysis. Results: In feature space reconstruction and mortality prediction, the Concrete Autoencoder-based methods outperformed other techniques. A weight-adjusted Concrete Autoencoder variant, particularly, demonstrated improved reconstruction accuracy and significant predictive performance enhancement, confirmed by DeLong's and McNemar's tests (P&lt;.05). Concrete Autoencoders preferred more general codes and they consistently reconstructed all features accurately. Additionally, features selected by weight-adjusted Concrete Autoencoders yielded higher Shapley values in mortality prediction compared to most alternatives. Conclusions: This study scrutinized five feature selection methods in ICD/ATC code datasets in an unsupervised context. Our findings underscore the superiority of the Concrete Autoencoder method in selecting salient features that represent the entire dataset, offering a potential asset for subsequent machine learning research. We also present a novel weight adjustment approach for the Concrete Autoenc","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"61 1","pages":""},"PeriodicalIF":3.2,"publicationDate":"2024-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141770157","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Maturity Assessment of District Health Information System Version 2 Implementation in Ethiopia: Current Status and Improvement Pathways. 埃塞俄比亚地区卫生信息系统第 2 版实施成熟度评估:现状与改进途径》。
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-07-26 DOI: 10.2196/50375
Tesfahun Melese Yilma, Asefa Taddese, Adane Mamuye, Berhanu Fikadie Endehabtu, Yibeltal Alemayehu, Asaye Senay, Dawit Daka, Loko Abraham, Rabeal Tadesse, Gemechis Melkamu, Naod Wendrad, Oli Kaba, Mesoud Mohammed, Wubshet Denboba, Dawit Birhan, Amanuel Biru, Binyam Tilahun
<p><strong>Background: </strong>Although Ethiopia has made remarkable progress in the uptake of the District Health Information System version 2 (DHIS2) for national aggregate data reporting, there has been no comprehensive assessment of the maturity level of the system.</p><p><strong>Objective: </strong>This study aims to assess the maturity level of DHIS2 implementation in Ethiopia and propose a road map that could guide the progress toward a higher level of maturity. We also aim to assess the current maturity status, implementation gaps, and future directions of DHIS2 implementation in Ethiopia. The assessment focused on digital health system governance, skilled human resources, information and communication technology (ICT) infrastructure, interoperability, and data quality and use.</p><p><strong>Methods: </strong>A collaborative assessment was conducted with the engagement of key stakeholders through consultative workshops using the Stages of Continuous Improvement tool to measure maturity levels in 5 core domains, 13 components, and 39 subcomponents. A 5-point scale (1=emerging, 2=repeatable, 3=defined, 4=managed, and 5=optimized) was used to measure the DHIS2 implementation maturity level.</p><p><strong>Results: </strong>The national DHIS2 implementation's maturity level is currently at the defined stage (score=2.81) and planned to move to the manageable stage (score=4.09) by 2025. The domain-wise maturity score indicated that except for ICT infrastructure, which is at the repeatable stage (score=2.14), the remaining 4 domains are at the defined stage (score=3). The development of a standardized and basic DHIS2 process at the national level, the development of a 10-year strategic plan to guide the implementation of digital health systems including DHIS2, and the presence of the required competencies at the facility level to accomplish specific DHIS2-related tasks are the major strength of the Ministry of Health of Ethiopia so far. The lack of workforce competency guidelines to support the implementation of DHIS2; the unavailability of core competencies (knowledge, skills, and abilities) required to accomplish DHIS2 tasks at all levels of the health system; and ICT infrastructures such as communication network and internet connectivity at the district, zonal, and regional levels are the major hindrances to effective DHIS2 implementation in the country.</p><p><strong>Conclusions: </strong>On the basis of the Stages of Continuous Improvement maturity model toolkit, the implementation status of DHIS2 in Ethiopia is at the defined stage, with the ICT infrastructure domain being at the lowest stage as compared to the other 4 domains. By 2025, the maturity status is planned to move from the defined stage to the managed stage by improving the identified gaps. Various action points are suggested to address the identified gaps and reach the stated maturity level. The responsible body, necessary resources, and methods of verification required to reac
背景:尽管埃塞俄比亚在采用地区卫生信息系统第 2 版(DHIS2)进行全国综合数据报告方面取得了显著进展,但对该系统的成熟度却没有进行过全面评估:本研究旨在评估埃塞俄比亚实施 DHIS2 的成熟度,并提出一个路线图,以指导向更高成熟度迈进。我们还旨在评估埃塞俄比亚实施 DHIS2 系统的当前成熟度、实施差距和未来方向。评估的重点是数字医疗系统治理、熟练人力资源、信息和通信技术(ICT)基础设施、互操作性以及数据质量和使用:方法:在主要利益相关者的参与下,通过咨询研讨会使用 "持续改进阶段 "工具进行了合作评估,以衡量 5 个核心领域、13 个组成部分和 39 个子组成部分的成熟度。采用五点量表(1=萌芽、2=可重复、3=已定义、4=已管理、5=已优化)来衡量 DHIS2 的实施成熟度:结果:全国 DHIS2 实施的成熟度目前处于确定阶段(得分=2.81),计划到 2025 年进入可管理阶段(得分=4.09)。按领域划分的成熟度得分表明,除信息和通信技术基础设施处于可重复阶段(得分=2.14)外,其余 4 个领域均处于确定阶段(得分=3)。迄今为止,埃塞俄比亚卫生部的主要优势包括:在国家层面制定了标准化的 DHIS2 基本流程;制定了十年战略计划,以指导包括 DHIS2 在内的数字卫生系统的实施;在设施层面具备完成 DHIS2 相关具体任务所需的能力。缺乏支持 DHIS2 实施的劳动力能力指南;缺乏各级卫生系统完成 DHIS2 任务所需的核心能力(知识、技能和能力);以及信息和通信技术基础设施,如地区、分区和区域一级的通信网络和互联网连接,是该国有效实施 DHIS2 的主要障碍:根据 "持续改进阶段 "成熟度模型工具包,埃塞俄比亚 DHIS2 的实施状况处于既定阶段,与其他 4 个领域相比,信息和通信技术基础设施领域处于最低阶段。计划到 2025 年,通过改进已发现的差距,将成熟度从确定阶段提升到管理阶段。为解决已发现的差距并达到既定的成熟度水平,提出了各种行动要点。还列出了达到指定成熟度所需的负责机构、必要资源和验证方法。
{"title":"Maturity Assessment of District Health Information System Version 2 Implementation in Ethiopia: Current Status and Improvement Pathways.","authors":"Tesfahun Melese Yilma, Asefa Taddese, Adane Mamuye, Berhanu Fikadie Endehabtu, Yibeltal Alemayehu, Asaye Senay, Dawit Daka, Loko Abraham, Rabeal Tadesse, Gemechis Melkamu, Naod Wendrad, Oli Kaba, Mesoud Mohammed, Wubshet Denboba, Dawit Birhan, Amanuel Biru, Binyam Tilahun","doi":"10.2196/50375","DOIUrl":"10.2196/50375","url":null,"abstract":"&lt;p&gt;&lt;strong&gt;Background: &lt;/strong&gt;Although Ethiopia has made remarkable progress in the uptake of the District Health Information System version 2 (DHIS2) for national aggregate data reporting, there has been no comprehensive assessment of the maturity level of the system.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Objective: &lt;/strong&gt;This study aims to assess the maturity level of DHIS2 implementation in Ethiopia and propose a road map that could guide the progress toward a higher level of maturity. We also aim to assess the current maturity status, implementation gaps, and future directions of DHIS2 implementation in Ethiopia. The assessment focused on digital health system governance, skilled human resources, information and communication technology (ICT) infrastructure, interoperability, and data quality and use.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Methods: &lt;/strong&gt;A collaborative assessment was conducted with the engagement of key stakeholders through consultative workshops using the Stages of Continuous Improvement tool to measure maturity levels in 5 core domains, 13 components, and 39 subcomponents. A 5-point scale (1=emerging, 2=repeatable, 3=defined, 4=managed, and 5=optimized) was used to measure the DHIS2 implementation maturity level.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Results: &lt;/strong&gt;The national DHIS2 implementation's maturity level is currently at the defined stage (score=2.81) and planned to move to the manageable stage (score=4.09) by 2025. The domain-wise maturity score indicated that except for ICT infrastructure, which is at the repeatable stage (score=2.14), the remaining 4 domains are at the defined stage (score=3). The development of a standardized and basic DHIS2 process at the national level, the development of a 10-year strategic plan to guide the implementation of digital health systems including DHIS2, and the presence of the required competencies at the facility level to accomplish specific DHIS2-related tasks are the major strength of the Ministry of Health of Ethiopia so far. The lack of workforce competency guidelines to support the implementation of DHIS2; the unavailability of core competencies (knowledge, skills, and abilities) required to accomplish DHIS2 tasks at all levels of the health system; and ICT infrastructures such as communication network and internet connectivity at the district, zonal, and regional levels are the major hindrances to effective DHIS2 implementation in the country.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Conclusions: &lt;/strong&gt;On the basis of the Stages of Continuous Improvement maturity model toolkit, the implementation status of DHIS2 in Ethiopia is at the defined stage, with the ICT infrastructure domain being at the lowest stage as compared to the other 4 domains. By 2025, the maturity status is planned to move from the defined stage to the managed stage by improving the identified gaps. Various action points are suggested to address the identified gaps and reach the stated maturity level. The responsible body, necessary resources, and methods of verification required to reac","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"12 ","pages":"e50375"},"PeriodicalIF":3.1,"publicationDate":"2024-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11316158/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141768262","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Impact of Large Language Models on Medical Education and Teaching Adaptations 大语言模型对医学教育和教学调整的影响
IF 3.2 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-07-25 DOI: 10.2196/55933
Li Zhui, Nina Yhap, Liu Liping, Wang Zhengjie, Xiong Zhonghao, Yuan Xiaoshu, Cui Hong, Liu Xuexiu, Ren Wei
This viewpoint article explores the transformative impact of Chat Generative Pre-trained Transformer (ChatGPT) on medical education, highlighting its opportunities and challenges. ChatGPT, a product of OpenAI, leverages advanced deep learning models to offer diverse applications, including enhancing teaching efficiency, facilitating personalized learning, reinforcing clinical skills training, improving medical teaching assessment, enhancing efficiency in medical research, and supporting continuing medical education. While presenting promising opportunities, the integration of ChatGPT in medical education raises concerns about response accuracy, overreliance, lack of emotional intelligence, and privacy and data security risks. The article underscores the imperative need to carefully address these challenges, outlining future pathways to bolster medical information accuracy, fortify privacy and data security, and promote synergy between ChatGPT and other artificial intelligence technologies in medical education. It highlights the adaptability and transformative significance of educators amid the widespread integration of ChatGPT in medical education. Educators must consistently uphold a leadership role, guiding students in the ethical and effective use of ChatGPT, nurturing independent thinking, and honing critical reasoning skills. Safeguarding the quality and integrity of medical education in this dynamic technological era remains paramount.
这篇观点文章探讨了 Chat Generative Pre-trained Transformer(ChatGPT)对医学教育的变革性影响,强调了其机遇和挑战。ChatGPT 是 OpenAI 的产品,利用先进的深度学习模型提供多样化的应用,包括提高教学效率、促进个性化学习、强化临床技能培训、改进医学教学评估、提高医学研究效率以及支持继续医学教育。虽然 ChatGPT 在医学教育中的应用前景广阔,但也引发了人们对响应准确性、过度依赖、缺乏情商以及隐私和数据安全风险等问题的担忧。文章强调了认真应对这些挑战的迫切需要,概述了提高医疗信息准确性、加强隐私和数据安全以及促进 ChatGPT 和其他人工智能技术在医学教育中的协同作用的未来途径。报告强调了在医学教育中广泛整合 ChatGPT 的过程中教育工作者的适应性和变革意义。教育者必须始终坚持发挥领导作用,指导学生以合乎道德的方式有效使用 ChatGPT,培养学生的独立思考能力和批判性推理能力。在这个充满活力的技术时代,保障医学教育的质量和完整性仍然至关重要。
{"title":"Impact of Large Language Models on Medical Education and Teaching Adaptations","authors":"Li Zhui, Nina Yhap, Liu Liping, Wang Zhengjie, Xiong Zhonghao, Yuan Xiaoshu, Cui Hong, Liu Xuexiu, Ren Wei","doi":"10.2196/55933","DOIUrl":"https://doi.org/10.2196/55933","url":null,"abstract":"This viewpoint article explores the transformative impact of Chat Generative Pre-trained Transformer (ChatGPT) on medical education, highlighting its opportunities and challenges. ChatGPT, a product of OpenAI, leverages advanced deep learning models to offer diverse applications, including enhancing teaching efficiency, facilitating personalized learning, reinforcing clinical skills training, improving medical teaching assessment, enhancing efficiency in medical research, and supporting continuing medical education. While presenting promising opportunities, the integration of ChatGPT in medical education raises concerns about response accuracy, overreliance, lack of emotional intelligence, and privacy and data security risks. The article underscores the imperative need to carefully address these challenges, outlining future pathways to bolster medical information accuracy, fortify privacy and data security, and promote synergy between ChatGPT and other artificial intelligence technologies in medical education. It highlights the adaptability and transformative significance of educators amid the widespread integration of ChatGPT in medical education. Educators must consistently uphold a leadership role, guiding students in the ethical and effective use of ChatGPT, nurturing independent thinking, and honing critical reasoning skills. Safeguarding the quality and integrity of medical education in this dynamic technological era remains paramount.","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"66 1","pages":""},"PeriodicalIF":3.2,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141770158","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hjernetegn.dk-The Danish Central Nervous System Tumor Awareness Initiative Digital Decision Support Tool: Design and Implementation Report. Hjernetegn.dk-丹麦中枢神经系统肿瘤认知倡议数字决策支持工具:设计和实施报告。
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-07-25 DOI: 10.2196/58886
Kathrine Synne Weile, René Mathiasen, Jeanette Falck Winther, Henrik Hasle, Louise Tram Henriksen

Background: Childhood tumors in the central nervous system (CNS) have longer diagnostic delays than other pediatric tumors. Vague presenting symptoms pose a challenge in the diagnostic process; it has been indicated that patients and parents may be hesitant to seek help, and health care professionals (HCPs) may lack awareness and knowledge about clinical presentation. To raise awareness among HCPs, the Danish CNS tumor awareness initiative hjernetegn.dk was launched.

Objective: This study aims to present the learnings from designing and implementing a decision support tool for HCPs to reduce diagnostic delay in childhood CNS tumors. The aims also include decisions regarding strategies for dissemination and use of social media, and an evaluation of the digital impact 6 months after launch.

Methods: The phases of developing and implementing the tool include participatory co-creation workshops, designing the website and digital platforms, and implementing a press and media strategy. The digital impact of hjernetegn.dk was evaluated through website analytics and social media engagement.

Implementation (results): hjernetegn.dk was launched in August 2023. The results after 6 months exceeded key performance indicators. The analysis showed a high number of website visitors and engagement, with a plateau reached 3 months after the initial launch. The LinkedIn campaign and Google Search strategy also generated a high number of impressions and clicks.

Conclusions: The findings suggest that the initiative has been successfully integrated, raising awareness and providing a valuable tool for HCPs in diagnosing childhood CNS tumors. The study highlights the importance of interdisciplinary collaboration, co-creation, and ongoing community management, as well as broad dissemination strategies when introducing a digital support tool.

背景:与其他儿科肿瘤相比,儿童中枢神经系统(CNS)肿瘤的诊断延迟时间更长。模糊的症状给诊断过程带来了挑战;有研究表明,患者和家长可能会对寻求帮助犹豫不决,而医疗保健专业人员(HCPs)可能缺乏对临床表现的认识和知识。为了提高医护人员对中枢神经系统肿瘤的认识,丹麦发起了中枢神经系统肿瘤认识行动 hjernetegn.dk:本研究旨在介绍在设计和实施决策支持工具以减少儿童中枢神经系统肿瘤诊断延误方面的经验。目的还包括有关传播和使用社交媒体的战略决策,以及对推出 6 个月后的数字影响进行评估:该工具的开发和实施阶段包括参与式共同创造研讨会、设计网站和数字平台,以及实施新闻和媒体战略。通过网站分析和社交媒体参与来评估 hjernetegn.dk 的数字影响。实施(结果):hjernetegn.dk 于 2023 年 8 月推出。6 个月后的结果超过了关键绩效指标。分析表明,网站访问量和参与度都很高,在首次推出 3 个月后达到了一个高峰。LinkedIn 活动和谷歌搜索战略也产生了大量的印象和点击:研究结果表明,该倡议已成功整合,提高了人们的认识,并为保健医生诊断儿童中枢神经系统肿瘤提供了宝贵的工具。该研究强调了跨学科合作、共同创造和持续社区管理的重要性,以及在引入数字支持工具时采取广泛传播策略的重要性。
{"title":"Hjernetegn.dk-The Danish Central Nervous System Tumor Awareness Initiative Digital Decision Support Tool: Design and Implementation Report.","authors":"Kathrine Synne Weile, René Mathiasen, Jeanette Falck Winther, Henrik Hasle, Louise Tram Henriksen","doi":"10.2196/58886","DOIUrl":"10.2196/58886","url":null,"abstract":"<p><strong>Background: </strong>Childhood tumors in the central nervous system (CNS) have longer diagnostic delays than other pediatric tumors. Vague presenting symptoms pose a challenge in the diagnostic process; it has been indicated that patients and parents may be hesitant to seek help, and health care professionals (HCPs) may lack awareness and knowledge about clinical presentation. To raise awareness among HCPs, the Danish CNS tumor awareness initiative hjernetegn.dk was launched.</p><p><strong>Objective: </strong>This study aims to present the learnings from designing and implementing a decision support tool for HCPs to reduce diagnostic delay in childhood CNS tumors. The aims also include decisions regarding strategies for dissemination and use of social media, and an evaluation of the digital impact 6 months after launch.</p><p><strong>Methods: </strong>The phases of developing and implementing the tool include participatory co-creation workshops, designing the website and digital platforms, and implementing a press and media strategy. The digital impact of hjernetegn.dk was evaluated through website analytics and social media engagement.</p><p><strong>Implementation (results): </strong>hjernetegn.dk was launched in August 2023. The results after 6 months exceeded key performance indicators. The analysis showed a high number of website visitors and engagement, with a plateau reached 3 months after the initial launch. The LinkedIn campaign and Google Search strategy also generated a high number of impressions and clicks.</p><p><strong>Conclusions: </strong>The findings suggest that the initiative has been successfully integrated, raising awareness and providing a valuable tool for HCPs in diagnosing childhood CNS tumors. The study highlights the importance of interdisciplinary collaboration, co-creation, and ongoing community management, as well as broad dissemination strategies when introducing a digital support tool.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"12 ","pages":"e58886"},"PeriodicalIF":3.1,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11310640/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141763011","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Introducing Attribute Association Graphs to Facilitate Medical Data Exploration: Development and Evaluation Using Epidemiological Study Data. 引入属性关联图以促进医学数据探索:利用流行病学研究数据进行开发和评估。
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-07-24 DOI: 10.2196/49865
Louis Bellmann, Alexander Johannes Wiederhold, Leona Trübe, Raphael Twerenbold, Frank Ückert, Karl Gottfried
<p><strong>Background: </strong>Interpretability and intuitive visualization facilitate medical knowledge generation through big data. In addition, robustness to high-dimensional and missing data is a requirement for statistical approaches in the medical domain. A method tailored to the needs of physicians must meet all the abovementioned criteria.</p><p><strong>Objective: </strong>This study aims to develop an accessible tool for visual data exploration without the need for programming knowledge, adjusting complex parameterizations, or handling missing data. We sought to use statistical analysis using the setting of disease and control cohorts familiar to clinical researchers. We aimed to guide the user by identifying and highlighting data patterns associated with disease and reveal relations between attributes within the data set.</p><p><strong>Methods: </strong>We introduce the attribute association graph, a novel graph structure designed for visual data exploration using robust statistical metrics. The nodes capture frequencies of participant attributes in disease and control cohorts as well as deviations between groups. The edges represent conditional relations between attributes. The graph is visualized using the Neo4j (Neo4j, Inc) data platform and can be interactively explored without the need for technical knowledge. Nodes with high deviations between cohorts and edges of noticeable conditional relationship are highlighted to guide the user during the exploration. The graph is accompanied by a dashboard visualizing variable distributions. For evaluation, we applied the graph and dashboard to the Hamburg City Health Study data set, a large cohort study conducted in the city of Hamburg, Germany. All data structures can be accessed freely by researchers, physicians, and patients. In addition, we developed a user test conducted with physicians incorporating the System Usability Scale, individual questions, and user tasks.</p><p><strong>Results: </strong>We evaluated the attribute association graph and dashboard through an exemplary data analysis of participants with a general cardiovascular disease in the Hamburg City Health Study data set. All results extracted from the graph structure and dashboard are in accordance with findings from the literature, except for unusually low cholesterol levels in participants with cardiovascular disease, which could be induced by medication. In addition, 95% CIs of Pearson correlation coefficients were calculated for all associations identified during the data analysis, confirming the results. In addition, a user test with 10 physicians assessing the usability of the proposed methods was conducted. A System Usability Scale score of 70.5% and average successful task completion of 81.4% were reported.</p><p><strong>Conclusions: </strong>The proposed attribute association graph and dashboard enable intuitive visual data exploration. They are robust to high-dimensional as well as missing data and require no pa
背景:可解释性和直观可视化有助于通过大数据生成医学知识。此外,对高维数据和缺失数据的鲁棒性也是医学领域对统计方法的要求。针对医生需求量身定制的方法必须满足上述所有标准:本研究旨在开发一种无需编程知识、无需调整复杂参数或处理缺失数据的可视化数据探索工具。我们试图使用临床研究人员熟悉的疾病和对照队列进行统计分析。我们的目标是通过识别和突出与疾病相关的数据模式来引导用户,并揭示数据集中属性之间的关系:我们引入了属性关联图,这是一种新颖的图结构,旨在利用稳健的统计指标进行可视化数据探索。图中的节点代表疾病组和对照组中参与者属性的频率以及组间偏差。边代表属性之间的条件关系。该图通过 Neo4j(Neo4j 公司)数据平台实现可视化,无需技术知识即可进行交互式探索。队列间偏差较大的节点和条件关系明显的边缘会突出显示,以便在探索过程中为用户提供指导。该图还配有一个仪表盘,可直观显示变量分布。为了进行评估,我们将图表和仪表盘应用于汉堡市健康研究数据集,这是一项在德国汉堡市进行的大型队列研究。研究人员、医生和患者均可自由访问所有数据结构。此外,我们还结合系统可用性量表、个人问题和用户任务,对医生进行了用户测试:我们通过对汉堡市健康研究数据集中患有一般心血管疾病的参与者进行示范性数据分析,对属性关联图和仪表板进行了评估。从图表结构和仪表板中提取的所有结果都与文献研究结果一致,但患有心血管疾病的参与者胆固醇水平异常低的情况除外,这可能是药物引起的。此外,还计算了数据分析过程中发现的所有关联的皮尔逊相关系数的 95% CIs,证实了分析结果。此外,还对 10 名医生进行了用户测试,以评估建议方法的可用性。系统可用性量表得分率为 70.5%,平均成功完成任务率为 81.4%:结论:所提出的属性关联图和仪表盘可实现直观的可视化数据探索。结论:提议的属性关联图和仪表盘可实现直观的可视化数据探索,对高维数据和缺失数据具有鲁棒性,且无需参数化。临床医生的可用性已通过用户测试得到证实,统计结果的有效性已通过文献中的关联和标准统计推断得到证实。
{"title":"Introducing Attribute Association Graphs to Facilitate Medical Data Exploration: Development and Evaluation Using Epidemiological Study Data.","authors":"Louis Bellmann, Alexander Johannes Wiederhold, Leona Trübe, Raphael Twerenbold, Frank Ückert, Karl Gottfried","doi":"10.2196/49865","DOIUrl":"10.2196/49865","url":null,"abstract":"&lt;p&gt;&lt;strong&gt;Background: &lt;/strong&gt;Interpretability and intuitive visualization facilitate medical knowledge generation through big data. In addition, robustness to high-dimensional and missing data is a requirement for statistical approaches in the medical domain. A method tailored to the needs of physicians must meet all the abovementioned criteria.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Objective: &lt;/strong&gt;This study aims to develop an accessible tool for visual data exploration without the need for programming knowledge, adjusting complex parameterizations, or handling missing data. We sought to use statistical analysis using the setting of disease and control cohorts familiar to clinical researchers. We aimed to guide the user by identifying and highlighting data patterns associated with disease and reveal relations between attributes within the data set.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Methods: &lt;/strong&gt;We introduce the attribute association graph, a novel graph structure designed for visual data exploration using robust statistical metrics. The nodes capture frequencies of participant attributes in disease and control cohorts as well as deviations between groups. The edges represent conditional relations between attributes. The graph is visualized using the Neo4j (Neo4j, Inc) data platform and can be interactively explored without the need for technical knowledge. Nodes with high deviations between cohorts and edges of noticeable conditional relationship are highlighted to guide the user during the exploration. The graph is accompanied by a dashboard visualizing variable distributions. For evaluation, we applied the graph and dashboard to the Hamburg City Health Study data set, a large cohort study conducted in the city of Hamburg, Germany. All data structures can be accessed freely by researchers, physicians, and patients. In addition, we developed a user test conducted with physicians incorporating the System Usability Scale, individual questions, and user tasks.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Results: &lt;/strong&gt;We evaluated the attribute association graph and dashboard through an exemplary data analysis of participants with a general cardiovascular disease in the Hamburg City Health Study data set. All results extracted from the graph structure and dashboard are in accordance with findings from the literature, except for unusually low cholesterol levels in participants with cardiovascular disease, which could be induced by medication. In addition, 95% CIs of Pearson correlation coefficients were calculated for all associations identified during the data analysis, confirming the results. In addition, a user test with 10 physicians assessing the usability of the proposed methods was conducted. A System Usability Scale score of 70.5% and average successful task completion of 81.4% were reported.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Conclusions: &lt;/strong&gt;The proposed attribute association graph and dashboard enable intuitive visual data exploration. They are robust to high-dimensional as well as missing data and require no pa","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"12 ","pages":"e49865"},"PeriodicalIF":3.1,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11306949/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141753505","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Telehealth Uptake Among Hispanic People During COVID-19: Retrospective Observational Study. COVID-19 期间西语裔人群对远程医疗的接受程度:回顾性观察研究
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-07-24 DOI: 10.2196/57717
Di Shang, Cynthia Williams, Hera Culiqi
<p><strong>Background: </strong>The Hispanic community represents a sizeable community that experiences inequities in the US health care system. As the system has moved toward digital health platforms, evaluating the potential impact on Hispanic communities is critical.</p><p><strong>Objective: </strong>The study aimed to investigate demographic, socioeconomic, and behavioral factors contributing to low telehealth use in Hispanic communities.</p><p><strong>Methods: </strong>We used a retrospective observation study design to examine the study objectives. The COVID-19 Research Database Consortium provided the Analytics IQ PeopleCore consumer data and Office Alley claims data. The study period was from March 2020 to April 2021. Multiple logistic regression was used to determine the odds of using telehealth services.</p><p><strong>Results: </strong>We examined 3,478,287 unique Hispanic patients, 16.6% (577,396) of whom used telehealth. Results suggested that patients aged between 18 and 44 years were more likely to use telehealth (odds ratio [OR] 1.07, 95% CI 1.05-1.1; P<.001) than patients aged older than 65 years. Across all age groups, patients with high incomes were at least 20% more likely to use telehealth than patients with lower incomes (P<.001); patients who had a primary care physician (P=.01), exhibited high medical usage (P<.001), or were interested in exercise (P=.03) were more likely to use telehealth; patients who had unhealthy behaviors such as smoking and alcohol consumption were less likely to use telehealth (P<.001). Male patients were less likely than female patients to use telehealth among patients aged 65 years and older (OR 0.94, 95% CI 0.93-0.95; P<.001), while male patients aged between 18 and 44 years were more likely to use telehealth (OR 1.05, 95% CI 1.03-1.07; P<.001). Among patients younger than 65 years, full-time employment was positively associated with telehealth use (P<.001). Patients aged between 18 and 44 years with high school or less education were 2% less likely to use telehealth (OR 0.98, 95% CI 0.97-0.99; P=.005). Results also revealed a positive association with using WebMD (WebMD LLC) among patients aged older than 44 years (P<.001), while there was a negative association with electronic prescriptions among those who were aged between 18 and 44 years (P=.009) and aged between 45 and 64 years (P=.004).</p><p><strong>Conclusions: </strong>This study demonstrates that telehealth use among Hispanic communities is dependent upon factors such as age, gender, education, socioeconomic status, current health care engagement, and health behaviors. To address these challenges, we advocate for interdisciplinary approaches that involve medical professionals, insurance providers, and community-based services actively engaging with Hispanic communities and promoting telehealth use. We propose the following recommendations: enhance access to health insurance, improve access to primary care providers, and allocate fiscal a
背景:拉美裔社区是美国医疗保健系统中存在不公平现象的一个相当大的群体。随着医疗系统向数字医疗平台发展,评估其对西班牙裔社区的潜在影响至关重要:本研究旨在调查导致西班牙裔社区远程医疗使用率低的人口、社会经济和行为因素:我们采用回顾性观察研究设计来探讨研究目标。COVID-19 研究数据库联盟提供了 Analytics IQ PeopleCore 消费者数据和 Office Alley 索赔数据。研究时间为 2020 年 3 月至 2021 年 4 月。采用多元逻辑回归法确定使用远程医疗服务的几率:我们研究了 3,478,287 名独特的西班牙裔患者,其中 16.6% (577,396 人)使用了远程医疗。结果表明,年龄在 18-44 岁之间的患者更有可能使用远程保健服务(几率比 [OR] 1.07,95% CI 1.05-1.1;PC 结论:这项研究表明,远程保健服务在西班牙裔患者中的使用率较高:本研究表明,西语裔社区使用远程医疗取决于年龄、性别、教育程度、社会经济地位、当前医疗保健参与度和健康行为等因素。为了应对这些挑战,我们主张采用跨学科的方法,让医疗专业人员、保险提供商和社区服务机构积极参与拉美裔社区的活动,促进远程保健的使用。我们提出以下建议:提高医疗保险的可及性,改善初级保健提供者的可及性,分配财政和教育资源以支持远程保健的使用。随着远程保健对医疗服务的影响越来越大,专业人员必须促进人们使用所有可用的渠道来获得医疗服务。
{"title":"Telehealth Uptake Among Hispanic People During COVID-19: Retrospective Observational Study.","authors":"Di Shang, Cynthia Williams, Hera Culiqi","doi":"10.2196/57717","DOIUrl":"10.2196/57717","url":null,"abstract":"&lt;p&gt;&lt;strong&gt;Background: &lt;/strong&gt;The Hispanic community represents a sizeable community that experiences inequities in the US health care system. As the system has moved toward digital health platforms, evaluating the potential impact on Hispanic communities is critical.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Objective: &lt;/strong&gt;The study aimed to investigate demographic, socioeconomic, and behavioral factors contributing to low telehealth use in Hispanic communities.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Methods: &lt;/strong&gt;We used a retrospective observation study design to examine the study objectives. The COVID-19 Research Database Consortium provided the Analytics IQ PeopleCore consumer data and Office Alley claims data. The study period was from March 2020 to April 2021. Multiple logistic regression was used to determine the odds of using telehealth services.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Results: &lt;/strong&gt;We examined 3,478,287 unique Hispanic patients, 16.6% (577,396) of whom used telehealth. Results suggested that patients aged between 18 and 44 years were more likely to use telehealth (odds ratio [OR] 1.07, 95% CI 1.05-1.1; P&lt;.001) than patients aged older than 65 years. Across all age groups, patients with high incomes were at least 20% more likely to use telehealth than patients with lower incomes (P&lt;.001); patients who had a primary care physician (P=.01), exhibited high medical usage (P&lt;.001), or were interested in exercise (P=.03) were more likely to use telehealth; patients who had unhealthy behaviors such as smoking and alcohol consumption were less likely to use telehealth (P&lt;.001). Male patients were less likely than female patients to use telehealth among patients aged 65 years and older (OR 0.94, 95% CI 0.93-0.95; P&lt;.001), while male patients aged between 18 and 44 years were more likely to use telehealth (OR 1.05, 95% CI 1.03-1.07; P&lt;.001). Among patients younger than 65 years, full-time employment was positively associated with telehealth use (P&lt;.001). Patients aged between 18 and 44 years with high school or less education were 2% less likely to use telehealth (OR 0.98, 95% CI 0.97-0.99; P=.005). Results also revealed a positive association with using WebMD (WebMD LLC) among patients aged older than 44 years (P&lt;.001), while there was a negative association with electronic prescriptions among those who were aged between 18 and 44 years (P=.009) and aged between 45 and 64 years (P=.004).&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Conclusions: &lt;/strong&gt;This study demonstrates that telehealth use among Hispanic communities is dependent upon factors such as age, gender, education, socioeconomic status, current health care engagement, and health behaviors. To address these challenges, we advocate for interdisciplinary approaches that involve medical professionals, insurance providers, and community-based services actively engaging with Hispanic communities and promoting telehealth use. We propose the following recommendations: enhance access to health insurance, improve access to primary care providers, and allocate fiscal a","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"12 ","pages":"e57717"},"PeriodicalIF":3.1,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11289584/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141763013","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Uncovering Harmonization Potential in Health Care Data Through Iterative Refinement of Fast Healthcare Interoperability Resources Profiles Based on Retrospective Discrepancy Analysis: Case Study. 基于回顾性差异分析,通过迭代完善快速医疗互操作性资源档案,发掘医疗数据的协调潜力:案例研究。
IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-07-23 DOI: 10.2196/57005
Lorenz Rosenau, Paul Behrend, Joshua Wiedekopf, Julian Gruendner, Josef Ingenerf
<p><strong>Background: </strong>Cross-institutional interoperability between health care providers remains a recurring challenge worldwide. The German Medical Informatics Initiative, a collaboration of 37 university hospitals in Germany, aims to enable interoperability between partner sites by defining Fast Healthcare Interoperability Resources (FHIR) profiles for the cross-institutional exchange of health care data, the Core Data Set (CDS). The current CDS and its extension modules define elements representing patients' health care records. All university hospitals in Germany have made significant progress in providing routine data in a standardized format based on the CDS. In addition, the central research platform for health, the German Portal for Medical Research Data feasibility tool, allows medical researchers to query the available CDS data items across many participating hospitals.</p><p><strong>Objective: </strong>In this study, we aimed to evaluate a novel approach of combining the current top-down generated FHIR profiles with the bottom-up generated knowledge gained by the analysis of respective instance data. This allowed us to derive options for iteratively refining FHIR profiles using the information obtained from a discrepancy analysis.</p><p><strong>Methods: </strong>We developed an FHIR validation pipeline and opted to derive more restrictive profiles from the original CDS profiles. This decision was driven by the need to align more closely with the specific assumptions and requirements of the central feasibility platform's search ontology. While the original CDS profiles offer a generic framework adaptable for a broad spectrum of medical informatics use cases, they lack the specificity to model the nuanced criteria essential for medical researchers. A key example of this is the necessity to represent specific laboratory codings and values interdependencies accurately. The validation results allow us to identify discrepancies between the instance data at the clinical sites and the profiles specified by the feasibility platform and addressed in the future.</p><p><strong>Results: </strong>A total of 20 university hospitals participated in this study. Historical factors, lack of harmonization, a wide range of source systems, and case sensitivity of coding are some of the causes for the discrepancies identified. While in our case study, Conditions, Procedures, and Medications have a high degree of uniformity in the coding of instance data due to legislative requirements for billing in Germany, we found that laboratory values pose a significant data harmonization challenge due to their interdependency between coding and value.</p><p><strong>Conclusions: </strong>While the CDS achieves interoperability, different challenges for federated data access arise, requiring more specificity in the profiles to make assumptions on the instance data. We further argue that further harmonization of the instance data can significantly lower required
背景:医疗服务提供者之间的跨机构互操作性仍然是全球范围内经常面临的挑战。德国医疗信息学倡议是德国 37 家大学医院的合作项目,旨在通过定义用于跨机构交换医疗保健数据的快速医疗保健互操作性资源(FHIR)配置文件,即核心数据集(CDS),来实现合作医院之间的互操作性。当前的 CDS 及其扩展模块定义了代表患者医疗记录的元素。德国所有大学医院在以 CDS 为基础的标准化格式提供常规数据方面都取得了重大进展。此外,健康中央研究平台--德国医学研究数据门户网站可行性工具允许医学研究人员查询许多参与医院的可用 CDS 数据项:在这项研究中,我们旨在评估一种将当前自上而下生成的 FHIR 配置文件与通过分析各自实例数据获得的自下而上生成的知识相结合的新方法。这样,我们就能利用差异分析获得的信息,得出迭代完善 FHIR 配置文件的方案:我们开发了一个 FHIR 验证管道,并选择从原始 CDS 配置文件中提取限制性更强的配置文件。之所以做出这一决定,是因为需要更紧密地与中央可行性平台搜索本体的具体假设和要求保持一致。虽然原始 CDS 配置文件提供了一个通用框架,可适用于广泛的医学信息学用例,但它们缺乏具体性,无法模拟医学研究人员所必需的细微标准。这方面的一个重要例子就是必须准确地表示特定的实验室编码和值之间的相互依存关系。通过验证结果,我们可以发现临床站点的实例数据与可行性平台指定的配置文件之间的差异,并在今后加以解决:共有 20 家大学医院参与了这项研究。历史因素、缺乏统一、源系统范围广以及编码的病例敏感性是造成差异的部分原因。在我们的案例研究中,由于德国计费的立法要求,条件、程序和药物在实例数据编码方面具有高度的统一性,但我们发现,由于编码和价值之间的相互依存关系,实验室价值对数据协调构成了重大挑战:结论:虽然 CDS 实现了互操作性,但联合数据访问也面临着不同的挑战,需要更具体的配置文件才能对实例数据做出假设。我们还认为,进一步协调实例数据可以大大减少所需的追溯协调工作。我们认识到,差异不可能仅在临床现场得到解决;因此,我们的研究结果具有广泛的影响,需要各利益相关方在多个层面采取行动。
{"title":"Uncovering Harmonization Potential in Health Care Data Through Iterative Refinement of Fast Healthcare Interoperability Resources Profiles Based on Retrospective Discrepancy Analysis: Case Study.","authors":"Lorenz Rosenau, Paul Behrend, Joshua Wiedekopf, Julian Gruendner, Josef Ingenerf","doi":"10.2196/57005","DOIUrl":"10.2196/57005","url":null,"abstract":"&lt;p&gt;&lt;strong&gt;Background: &lt;/strong&gt;Cross-institutional interoperability between health care providers remains a recurring challenge worldwide. The German Medical Informatics Initiative, a collaboration of 37 university hospitals in Germany, aims to enable interoperability between partner sites by defining Fast Healthcare Interoperability Resources (FHIR) profiles for the cross-institutional exchange of health care data, the Core Data Set (CDS). The current CDS and its extension modules define elements representing patients' health care records. All university hospitals in Germany have made significant progress in providing routine data in a standardized format based on the CDS. In addition, the central research platform for health, the German Portal for Medical Research Data feasibility tool, allows medical researchers to query the available CDS data items across many participating hospitals.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Objective: &lt;/strong&gt;In this study, we aimed to evaluate a novel approach of combining the current top-down generated FHIR profiles with the bottom-up generated knowledge gained by the analysis of respective instance data. This allowed us to derive options for iteratively refining FHIR profiles using the information obtained from a discrepancy analysis.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Methods: &lt;/strong&gt;We developed an FHIR validation pipeline and opted to derive more restrictive profiles from the original CDS profiles. This decision was driven by the need to align more closely with the specific assumptions and requirements of the central feasibility platform's search ontology. While the original CDS profiles offer a generic framework adaptable for a broad spectrum of medical informatics use cases, they lack the specificity to model the nuanced criteria essential for medical researchers. A key example of this is the necessity to represent specific laboratory codings and values interdependencies accurately. The validation results allow us to identify discrepancies between the instance data at the clinical sites and the profiles specified by the feasibility platform and addressed in the future.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Results: &lt;/strong&gt;A total of 20 university hospitals participated in this study. Historical factors, lack of harmonization, a wide range of source systems, and case sensitivity of coding are some of the causes for the discrepancies identified. While in our case study, Conditions, Procedures, and Medications have a high degree of uniformity in the coding of instance data due to legislative requirements for billing in Germany, we found that laboratory values pose a significant data harmonization challenge due to their interdependency between coding and value.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Conclusions: &lt;/strong&gt;While the CDS achieves interoperability, different challenges for federated data access arise, requiring more specificity in the profiles to make assumptions on the instance data. We further argue that further harmonization of the instance data can significantly lower required","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"12 ","pages":"e57005"},"PeriodicalIF":3.1,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11303887/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141749840","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
JMIR Medical Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1