首页 > 最新文献

International Journal of Medical Informatics最新文献

英文 中文
Healthy nutrition and weight management for a positive pregnancy experience in the antenatal period: Comparison of responses from artificial intelligence models on nutrition during pregnancy 产前健康营养和体重管理可带来积极的孕期体验:比较人工智能模型对孕期营养的反应。
IF 3.7 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-07 DOI: 10.1016/j.ijmedinf.2024.105663
Emine Karacan

Background

As artificial intelligence AI-supported applications become integral to web-based information-seeking, assessing their impact on healthy nutrition and weight management during the antenatal period is crucial.

Objective

This study was conducted to evaluate both the quality and semantic similarity of responses created by AI models to the most frequently asked questions about healthy nutrition and weight management during the antenatal period, based on existing clinical knowledge.

Methods

In this study, a cross-sectional assessment design was used to explore data from 3 AI models (GPT-4, MedicalGPT, Med-PaLM). We directed the most frequently asked questions about nutrition during pregnancy, obtained from the American College of Obstetricians and Gynecologists (ACOG) to each model in a new and single session on October 21, 2023, without any prior conversation. Immediately after, instructions were given to the AI models to generate responses to these questions. The responses created by AI models were evaluated using the Grading of Recommendations Assessment, Development, and Evaluation (GRADE) scale. Additionally, to assess the semantic similarity between answers to 31 pregnancy nutrition-related frequently asked questions sourced from the ACOG and responses from AI models we evaluated cosine similarity using both WORD2VEC and BioLORD-2023.

Results

Med-PaLM outperformed GPT-4 and MedicalGPT in response quality (mean = 3.93), demonstrating superior clinical accuracy over both GPT-4 (p = 0.016) and MedicalGPT (p = 0.001). GPT-4 had higher quality than MedicalGPT (p = 0.027).
The semantic similarity between ACOG and Med-PaLM is higher with WORD2VEC (0.92) compared to BioLORD-2023 (0.81), showing a difference of +0.11. The similarity scores for ACOG–MedicalGPT and ACOG–GPT-4 are similar across both models, with minimal differences of −0.01. Overall, WORD2VEC has a slightly higher average similarity (0.82) than BioLORD-2023 (0.79), with a difference of +0.03.

Conclusions

Despite the superior performance of Med-PaLM, there is a need for further evidence-based research and improvement in the integration of AI in healthcare due to varying AI model performances.
背景:随着人工智能AI支持的应用程序成为网络信息搜索的组成部分,评估它们对产前健康营养和体重管理的影响至关重要:本研究以现有临床知识为基础,评估人工智能模型对产前健康营养和体重管理方面最常见问题所做回答的质量和语义相似性:本研究采用横断面评估设计,对 3 个人工智能模型(GPT-4、MedicalGPT、Med-PaLM)的数据进行研究。2023 年 10 月 21 日,我们将从美国妇产科医师学会(ACOG)获得的有关孕期营养的最常见问题,在没有任何事先交谈的情况下,以新的单次会议形式传授给每个模型。紧接着,向人工智能模型发出指令,让其生成对这些问题的回答。人工智能模型生成的回答采用建议评估、开发和评价分级法(GRADE)进行评估。此外,为了评估来自 ACOG 的 31 个妊娠营养相关常见问题的回答与人工智能模型的回答之间的语义相似性,我们使用 WORD2VEC 和 BioLORD-2023 评估了余弦相似性:Med-PaLM 的回答质量(平均值 = 3.93)优于 GPT-4 和 MedicalGPT,临床准确性优于 GPT-4 (p = 0.016) 和 MedicalGPT (p = 0.001)。GPT-4 的质量高于 MedicalGPT(p = 0.027)。ACOG 和 Med-PaLM 的语义相似度 WORD2VEC(0.92)高于 BioLORD-2023(0.81),两者相差+0.11。两个模型中 ACOG-MedicalGPT 和 ACOG-GPT-4 的相似度得分相似,差异极小,均为 -0.01。总体而言,WORD2VEC 的平均相似度(0.82)略高于 BioLORD-2023(0.79),差异为 +0.03:尽管 Med-PaLM 的性能优越,但由于人工智能模型的性能参差不齐,在将人工智能整合到医疗保健领域方面仍需要进一步的循证研究和改进。
{"title":"Healthy nutrition and weight management for a positive pregnancy experience in the antenatal period: Comparison of responses from artificial intelligence models on nutrition during pregnancy","authors":"Emine Karacan","doi":"10.1016/j.ijmedinf.2024.105663","DOIUrl":"10.1016/j.ijmedinf.2024.105663","url":null,"abstract":"<div><h3>Background</h3><div>As artificial intelligence AI-supported applications become integral to web-based information-seeking, assessing their impact on healthy nutrition and weight management during the antenatal period is crucial.</div></div><div><h3>Objective</h3><div>This study was conducted to evaluate both the quality and semantic similarity of responses created by AI models to the most frequently asked questions about healthy nutrition and weight management during the antenatal period, based on existing clinical knowledge.</div></div><div><h3>Methods</h3><div>In this study, a cross-sectional assessment design was used to explore data from 3 AI models (GPT-4, MedicalGPT, Med-PaLM). We directed the most frequently asked questions about nutrition during pregnancy, obtained from the American College of Obstetricians and Gynecologists (ACOG) to each model in a new and single session on October 21, 2023, without any prior conversation. Immediately after, instructions were given to the AI models to generate responses to these questions. The responses created by AI models were evaluated using the Grading of Recommendations Assessment, Development, and Evaluation (GRADE) scale. Additionally, to assess the semantic similarity between answers to 31 pregnancy nutrition-related frequently asked questions sourced from the ACOG and responses from AI models we evaluated cosine similarity using both WORD2VEC and BioLORD-2023.</div></div><div><h3>Results</h3><div>Med-PaLM outperformed GPT-4 and MedicalGPT in response quality (mean = 3.93), demonstrating superior clinical accuracy over both GPT-4 (p = 0.016) and MedicalGPT (p = 0.001). GPT-4 had higher quality than MedicalGPT (p = 0.027).</div><div>The semantic similarity between ACOG and Med-PaLM is higher with WORD2VEC (0.92) compared to BioLORD-2023 (0.81), showing a difference of +0.11. The similarity scores for ACOG–MedicalGPT and ACOG–GPT-4 are similar across both models, with minimal differences of −0.01. Overall, WORD2VEC has a slightly higher average similarity (0.82) than BioLORD-2023 (0.79), with a difference of +0.03.</div></div><div><h3>Conclusions</h3><div>Despite the superior performance of Med-PaLM, there is a need for further evidence-based research and improvement in the integration of AI in healthcare due to varying AI model performances.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"193 ","pages":"Article 105663"},"PeriodicalIF":3.7,"publicationDate":"2024-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142633152","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A machine-learning-based algorithm for bone marrow cell differential counting 基于机器学习的骨髓细胞差异计数算法。
IF 3.7 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-07 DOI: 10.1016/j.ijmedinf.2024.105692
Ta-Chuan Yu , Cheng-Kun Yang , Wei-Han Hsu , Cheng-An Hsu , Hsiao-Chun Wang , Hsin-Jung Hsiao , Hsiao-Ling Chao , Han-Peng Hsieh , Jia-Rong Wu , Yen-Chun Tsai , Yi-Mei Chiang , Poshing Lee , Che-Pin Lin , Ling-Ping Chen , Yung-Chuan Sung , Ya-Yun Yang , Chin-Ling Yu , Chih-Kang Lin , Chia-Pin Kang , Che-Wei Chang , Wen-Chien Chou

Background

Differential counting (DC) of different cell types in bone marrow (BM) aspiration smears is crucial for diagnosing hematological diseases. However, a clinically applicable method for automatic DC has yet to be developed.

Objective

This study developed and validated an artificial intelligence (AI)-based algorithm for identifying and classifying nucleated cells in BM smears.

Methods

In the development phase, a mask region–based convolutional neural network (Mask R-CNN)-based AI model was trained to detect and classify individual BM cells. We used a large data set of expert-annotated images representing a variety of disease categories. The BM slides were stained with Liu’s stain or Wright–Giemsa stain. Consensus meetings were held to ensure experts from different institutes applied consistent criteria in classifying cells. Subsequently, the performance of the AI algorithm in identifying cell images and determining cell ratios was evaluated using a multinational clinical dataset.

Results

The AI model was trained on 542 slides (85.1 % stained with Liu’s stain and 14.9 % with Wright–Giemsa stain) containing 597,222 annotated cells. It achieved an accuracy of 0.94 for the testing dataset containing 26,170 cells. The performance of the AI model was further validated using another multinational real-world dataset (data obtained from three centers in Taiwan and one in the United States) comprising 200,639 cells. The AI model achieved an accuracy of 0.881 in classifying individual cells and demonstrated high precision in classifying blasts (0.927), bands and polymorphonuclear neutrophils (0.955), plasma cells (0.930), and lymphocytes (0.789). When the differential counting percentage of each cell type was assessed, a strong correlation (ρ > 0.8) between the AI and manual methods was observed for most cell categories.

Conclusions

In this study, an AI algorithm was developed and clinically validated using large, multinational datasets. Our algorithm can locate and classify BM cells simultaneously and has potential clinical applicability for automating BM differential counting.
背景:骨髓(BM)抽吸涂片中不同类型细胞的鉴别计数(DC)对于诊断血液病至关重要。然而,目前尚未开发出一种适用于临床的自动 DC 方法:本研究开发并验证了一种基于人工智能(AI)的算法,用于识别和分类骨髓涂片中的有核细胞:在开发阶段,我们训练了一个基于掩膜区域卷积神经网络(Mask R-CNN)的人工智能模型,以检测单个血液涂片细胞并对其进行分类。我们使用了一个大型数据集,其中包含专家标注的代表各种疾病类别的图像。用刘氏染色法或 Wright-Giemsa 染色法对 BM 切片进行染色。我们召开了共识会议,以确保来自不同机构的专家在对细胞进行分类时采用一致的标准。随后,使用跨国临床数据集评估了人工智能算法在识别细胞图像和确定细胞比例方面的性能:结果:人工智能模型是在包含 597,222 个注释细胞的 542 张幻灯片(85.1% 用刘氏染色法染色,14.9% 用赖特-吉氏染色法染色)上进行训练的。在包含 26,170 个细胞的测试数据集上,该模型的准确率达到了 0.94。人工智能模型的性能通过另一个包含 200 639 个细胞的跨国真实数据集(数据来自台湾的三个中心和美国的一个中心)得到了进一步验证。人工智能模型对单个细胞进行分类的准确率为 0.881,在对胚泡(0.927)、带状和多形核中性粒细胞(0.955)、浆细胞(0.930)和淋巴细胞(0.789)进行分类时表现出很高的精确度。在评估每种细胞类型的差异计数百分比时,大多数细胞类别的人工智能和手动方法之间都有很强的相关性(ρ > 0.8):本研究开发了一种人工智能算法,并利用大型跨国数据集进行了临床验证。我们的算法可以同时定位和分类骨髓细胞,具有临床应用潜力,可实现骨髓细胞差异计数的自动化。
{"title":"A machine-learning-based algorithm for bone marrow cell differential counting","authors":"Ta-Chuan Yu ,&nbsp;Cheng-Kun Yang ,&nbsp;Wei-Han Hsu ,&nbsp;Cheng-An Hsu ,&nbsp;Hsiao-Chun Wang ,&nbsp;Hsin-Jung Hsiao ,&nbsp;Hsiao-Ling Chao ,&nbsp;Han-Peng Hsieh ,&nbsp;Jia-Rong Wu ,&nbsp;Yen-Chun Tsai ,&nbsp;Yi-Mei Chiang ,&nbsp;Poshing Lee ,&nbsp;Che-Pin Lin ,&nbsp;Ling-Ping Chen ,&nbsp;Yung-Chuan Sung ,&nbsp;Ya-Yun Yang ,&nbsp;Chin-Ling Yu ,&nbsp;Chih-Kang Lin ,&nbsp;Chia-Pin Kang ,&nbsp;Che-Wei Chang ,&nbsp;Wen-Chien Chou","doi":"10.1016/j.ijmedinf.2024.105692","DOIUrl":"10.1016/j.ijmedinf.2024.105692","url":null,"abstract":"<div><h3>Background</h3><div>Differential counting (DC) of different cell types in bone marrow (BM) aspiration smears is crucial for diagnosing hematological diseases. However, a clinically applicable method for automatic DC has yet to be developed.</div></div><div><h3>Objective</h3><div>This study developed and validated an artificial intelligence (AI)-based algorithm for identifying and classifying nucleated cells in BM smears.</div></div><div><h3>Methods</h3><div>In the development phase, a mask region–based convolutional neural network (Mask R-CNN)-based AI model was trained to detect and classify individual BM cells. We used a large data set of expert-annotated images representing a variety of disease categories. The BM slides were stained with Liu’s stain or Wright–Giemsa stain. Consensus meetings were held to ensure experts from different institutes applied consistent criteria in classifying cells. Subsequently, the performance of the AI algorithm in identifying cell images and determining cell ratios was evaluated using a multinational clinical dataset.</div></div><div><h3>Results</h3><div>The AI model was trained on 542 slides (85.1 % stained with Liu’s stain and 14.9 % with Wright–Giemsa stain) containing 597,222 annotated cells. It achieved an accuracy of 0.94 for the testing dataset containing 26,170 cells. The performance of the AI model was further validated using another multinational real-world dataset (data obtained from three centers in Taiwan and one in the United States) comprising 200,639 cells. The AI model achieved an accuracy of 0.881 in classifying individual cells and demonstrated high precision in classifying blasts (0.927), bands and polymorphonuclear neutrophils (0.955), plasma cells (0.930), and lymphocytes (0.789). When the differential counting percentage of each cell type was assessed, a strong correlation (ρ &gt; 0.8) between the AI and manual methods was observed for most cell categories.</div></div><div><h3>Conclusions</h3><div>In this study, an AI algorithm was developed and clinically validated using large, multinational datasets. Our algorithm can locate and classify BM cells simultaneously and has potential clinical applicability for automating BM differential counting.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"194 ","pages":"Article 105692"},"PeriodicalIF":3.7,"publicationDate":"2024-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142645226","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Supporting the care to breast cancer patients with unique needs: Evidence from online community members’ responses 为有特殊需求的乳腺癌患者提供护理支持:来自在线社区成员回复的证据。
IF 3.7 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-06 DOI: 10.1016/j.ijmedinf.2024.105695
Anqi Xu , Yuanyuan Gao

Background

Breast cancer is the most common cancer diagnosed in women globally. Online cancer communities (OCCs) provide platforms for breast cancer patients to connect, share experiences, and support each other. These communities facilitate discussions on a range of health- and non-health-related topics. However, posts discussing unique topics may receive varying levels of attention and support. This study aims to devise a method for identifying and supporting such posts, enhancing community response and support strategies.

Methods

We propose a Uniqueness Score Extraction Framework to compute health- and non-health-related uniqueness scores for online community posts. The framework utilizes deep learning-based natural language processing models to identify the topics discussed in OCCs and calculates the health- and non-health-related uniqueness scores of a post based on the uniqueness of the topics identified by the BERTopic model. We further employ econometric models to assess how the uniqueness scores of posts affect community members’ responses to those posts.

Results

Our study reveals that posts with a higher concentration of unique health-related topics in OCCs elicit quicker, more frequent, but shorter responses. Conversely, posts containing more unique non-health-related topics in the entire post prompt faster and longer responses, unless these topics become overly dominant, in which case the number of replies decreases, and response times are prolonged.

Conclusion

Our research develops a framework to identify posts with high uniqueness scores in OCCs, and sheds light on community member responses to these discussions. The findings indicate that while members are supportive, particularly regarding health-related topics, the post-content’s nature and focus greatly affect their engagement. These discoveries could enhance our understanding of community dynamics in OCCs, offering valuable implications for researchers, OCC facilitators, and medical professionals in supporting patients within online platforms.
背景:乳腺癌是全球妇女最常见的癌症。在线癌症社区(OCC)为乳腺癌患者提供了相互联系、分享经验和相互支持的平台。这些社区促进了一系列健康和非健康相关话题的讨论。然而,讨论独特话题的帖子可能会受到不同程度的关注和支持。本研究旨在设计一种识别和支持此类帖子的方法,以加强社区响应和支持策略:我们提出了一个独特性分数提取框架,用于计算在线社区帖子中与健康和非健康相关的独特性分数。该框架利用基于深度学习的自然语言处理模型来识别在线社区帖子中讨论的话题,并根据 BERTopic 模型所识别话题的独特性来计算帖子的健康和非健康相关独特性得分。我们进一步采用计量经济学模型来评估帖子的独特性得分如何影响社区成员对这些帖子的反应:我们的研究表明,OCC 中与健康相关的独特主题越集中的帖子,引起的回复越快、越频繁,但回复时间越短。相反,整篇帖子中包含更多与健康无关的独特话题的帖子会引起更快、更长时间的回复,除非这些话题变得过于重要,在这种情况下,回复数量会减少,回复时间也会延长:我们的研究建立了一个框架,用于识别 OCC 中独特性得分较高的帖子,并揭示了社区成员对这些讨论的反应。研究结果表明,虽然成员们都很支持讨论,尤其是与健康相关的话题,但帖子内容的性质和重点在很大程度上影响了他们的参与度。这些发现可以加深我们对 OCC 中社区动态的了解,为研究人员、OCC 促进者和医疗专业人员在网络平台上为患者提供支持提供有价值的启示。
{"title":"Supporting the care to breast cancer patients with unique needs: Evidence from online community members’ responses","authors":"Anqi Xu ,&nbsp;Yuanyuan Gao","doi":"10.1016/j.ijmedinf.2024.105695","DOIUrl":"10.1016/j.ijmedinf.2024.105695","url":null,"abstract":"<div><h3>Background</h3><div>Breast cancer is the most common cancer diagnosed in women globally. Online cancer communities (OCCs) provide platforms for breast cancer patients to connect, share experiences, and support each other. These communities facilitate discussions on a range of health- and non-health-related topics. However, posts discussing unique topics may receive varying levels of attention and support. This study aims to devise a method for identifying and supporting such posts, enhancing community response and support strategies.</div></div><div><h3>Methods</h3><div>We propose a Uniqueness Score Extraction Framework to compute health- and non-health-related uniqueness scores for online community posts. The framework utilizes deep learning-based natural language processing models to identify the topics discussed in OCCs and calculates the health- and non-health-related uniqueness scores of a post based on the uniqueness of the topics identified by the BERTopic model. We further employ econometric models to assess how the uniqueness scores of posts affect community members’ responses to those posts.</div></div><div><h3>Results</h3><div>Our study reveals that posts with a higher concentration of unique health-related topics in OCCs elicit quicker, more frequent, but shorter responses. Conversely, posts containing more unique non-health-related topics in the entire post prompt faster and longer responses, unless these topics become overly dominant, in which case the number of replies decreases, and response times are prolonged.</div></div><div><h3>Conclusion</h3><div>Our research develops a framework to identify posts with high uniqueness scores in OCCs, and sheds light on community member responses to these discussions. The findings indicate that while members are supportive, particularly regarding health-related topics, the post-content’s nature and focus greatly affect their engagement. These discoveries could enhance our understanding of community dynamics in OCCs, offering valuable implications for researchers, OCC facilitators, and medical professionals in supporting patients within online platforms.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"193 ","pages":"Article 105695"},"PeriodicalIF":3.7,"publicationDate":"2024-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142640311","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhanced NSCLC subtyping and staging through attention-augmented multi-task deep learning: A novel diagnostic tool 通过注意力增强型多任务深度学习增强 NSCLC 亚型和分期:新型诊断工具
IF 3.7 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-05 DOI: 10.1016/j.ijmedinf.2024.105694
Runhuang Yang , Weiming Li , Siqi Yu , Zhiyuan Wu , Haiping Zhang , Xiangtong Liu , Lixin Tao , Xia Li , Jian Huang , Xiuhua Guo

Objectives

The objective of this study is to develop a novel multi-task learning approach with attention encoders for classifying histologic subtypes and clinical stages of non-small cell lung cancer (NSCLC), with superior performance compared to currently popular deep-learning models.

Material and methods

Data were collected from six publicly available datasets in The Cancer Imaging Archive (TCIA). Following the inclusion and exclusion criteria, a total of 4548 CT slices from 758 cases were allocated. We evaluated multiple multi-task learning models that integrate attention mechanisms to resolve challenges in NSCLC subtype classification and clinical staging. These models utilized convolution-based modules in their shared layers for feature extraction, while the task layers were dedicated to histological subtype classification and staging. Each branch sequentially processed features through convolution-based and attention-based modules prior to classification.

Results

Our study evaluated 758 NSCLC patients (mean age, 66.2 years ± 10.3; 473 men), spanning ADC and SCC cases. In the classification of histological subtypes and clinical staging of NSCLC, the MobileNet-based multi-task learning model enhanced with attention mechanisms (MN-MTL-A) demonstrated superior performance, achieving Area Under the Curve (AUC) scores of 0.963 (95 % CI: 0.943, 0.981) and 0.966 (95 % CI: 0.945, 0.982) for each task, respectively. The model significantly surpassed its counterparts lacking attention mechanisms and those configured for single-task learning, as evidenced by P-values of 0.01 or less for both tasks, according to DeLong’s test.

Conclusions

The integration of attention encoder blocks into our multi-task learning network significantly enhanced the accuracy of NSCLC histological subtyping and clinical staging. Given the reduced reliance on precise radiologist annotation, our proposed model shows promising potential for clinical application.
研究目的本研究的目的是开发一种新颖的多任务学习方法,利用注意力编码器对非小细胞肺癌(NSCLC)的组织学亚型和临床分期进行分类,与目前流行的深度学习模型相比,该方法具有更优越的性能:数据来自癌症成像档案(TCIA)中的六个公开数据集。根据纳入和排除标准,共分配了来自 758 个病例的 4548 张 CT 切片。我们评估了多个多任务学习模型,这些模型整合了注意力机制,以解决 NSCLC 亚型分类和临床分期方面的难题。这些模型的共享层利用基于卷积的模块进行特征提取,而任务层则专门用于组织学亚型分类和分期。在分类之前,每个分支通过基于卷积的模块和基于注意力的模块依次处理特征:我们的研究评估了 758 例 NSCLC 患者(平均年龄为 66.2 岁 ± 10.3;男性 473 例),其中包括 ADC 和 SCC 病例。在对 NSCLC 的组织学亚型和临床分期进行分类时,基于 MobileNet 的多任务学习模型(MN-MTL-A)在注意力机制的增强下表现出色,每项任务的曲线下面积(AUC)分别达到 0.963(95 % CI:0.943, 0.981)和 0.966(95 % CI:0.945, 0.982)。根据 DeLong 检验,该模型在两项任务中的 P 值均小于或等于 0.01,明显优于缺乏注意力机制的同类模型和为单任务学习配置的同类模型:结论:在我们的多任务学习网络中整合注意力编码器块,能显著提高 NSCLC 组织学亚型和临床分期的准确性。由于减少了对放射科医生精确注释的依赖,我们提出的模型显示出临床应用的巨大潜力。
{"title":"Enhanced NSCLC subtyping and staging through attention-augmented multi-task deep learning: A novel diagnostic tool","authors":"Runhuang Yang ,&nbsp;Weiming Li ,&nbsp;Siqi Yu ,&nbsp;Zhiyuan Wu ,&nbsp;Haiping Zhang ,&nbsp;Xiangtong Liu ,&nbsp;Lixin Tao ,&nbsp;Xia Li ,&nbsp;Jian Huang ,&nbsp;Xiuhua Guo","doi":"10.1016/j.ijmedinf.2024.105694","DOIUrl":"10.1016/j.ijmedinf.2024.105694","url":null,"abstract":"<div><h3>Objectives</h3><div>The objective of this study is to develop a novel multi-task learning approach with attention encoders for classifying histologic subtypes and clinical stages of non-small cell lung cancer (NSCLC), with superior performance compared to currently popular deep-learning models.</div></div><div><h3>Material and methods</h3><div>Data were collected from six publicly available datasets in The Cancer Imaging Archive (TCIA). Following the inclusion and exclusion criteria, a total of 4548 CT slices from 758 cases were allocated. We evaluated multiple multi-task learning models that integrate attention mechanisms to resolve challenges in NSCLC subtype classification and clinical staging. These models utilized convolution-based modules in their shared layers for feature extraction, while the task layers were dedicated to histological subtype classification and staging. Each branch sequentially processed features through convolution-based and attention-based modules prior to classification.</div></div><div><h3>Results</h3><div>Our study evaluated 758 NSCLC patients (mean age, 66.2 years ± 10.3; 473 men), spanning ADC and SCC cases. In the classification of histological subtypes and clinical staging of NSCLC, the MobileNet-based multi-task learning model enhanced with attention mechanisms (MN-MTL-A) demonstrated superior performance, achieving Area Under the Curve (AUC) scores of 0.963 (95 % CI: 0.943, 0.981) and 0.966 (95 % CI: 0.945, 0.982) for each task, respectively. The model significantly surpassed its counterparts lacking attention mechanisms and those configured for single-task learning, as evidenced by P-values of 0.01 or less for both tasks, according to DeLong’s test.</div></div><div><h3>Conclusions</h3><div>The integration of attention encoder blocks into our multi-task learning network significantly enhanced the accuracy of NSCLC histological subtyping and clinical staging. Given the reduced reliance on precise radiologist annotation, our proposed model shows promising potential for clinical application.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"193 ","pages":"Article 105694"},"PeriodicalIF":3.7,"publicationDate":"2024-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142607205","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Decoding the black box: Explainable AI (XAI) for cancer diagnosis, prognosis, and treatment planning-A state-of-the art systematic review 解码黑箱:用于癌症诊断、预后和治疗规划的可解释人工智能(XAI)--最新系统综述。
IF 3.7 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-04 DOI: 10.1016/j.ijmedinf.2024.105689
Yusuf Abas Mohamed , Bee Ee Khoo , Mohd Shahrimie Mohd Asaari , Mohd Ezane Aziz , Fattah Rahiman Ghazali

Objective

Explainable Artificial Intelligence (XAI) is increasingly recognized as a crucial tool in cancer care, with significant potential to enhance diagnosis, prognosis, and treatment planning. However, the holistic integration of XAI across all stages of cancer care remains underexplored. This review addresses this gap by systematically evaluating the role of XAI in these critical areas, identifying key challenges and emerging trends.

Materials and methods

Following the PRISMA guidelines, a comprehensive literature search was conducted across Scopus and Web of Science, focusing on publications from January 2020 to May 2024. After rigorous screening and quality assessment, 69 studies were selected for in-depth analysis.

Results

The review identified critical gaps in the application of XAI within cancer care, notably the exclusion of clinicians in 83% of studies, which raises concerns about real-world applicability and may lead to explanations that are technically sound but clinically irrelevant. Additionally, 87% of studies lacked rigorous evaluation of XAI explanations, compromising their reliability in clinical practice. The dominance of post-hoc visual methods like SHAP, LIME and Grad-CAM reflects a trend toward explanations that may be inherently flawed due to specific input perturbations and simplifying assumptions. The lack of formal evaluation metrics and standardization constrains broader XAI adoption in clinical settings, creating a disconnect between AI development and clinical integration. Moreover, translating XAI insights into actionable clinical decisions remains challenging due to the absence of clear guidelines for integrating these tools into clinical workflows.

Conclusion

This review highlights the need for greater clinician involvement, standardized XAI evaluation metrics, clinician-centric interfaces, context-aware XAI systems, and frameworks for integrating XAI into clinical workflows for informed clinical decision-making and improved outcomes in cancer care.
目的:可解释人工智能(XAI)越来越被认为是癌症治疗的重要工具,在提高诊断、预后和治疗计划方面具有巨大潜力。然而,在癌症治疗的各个阶段对 XAI 进行整体整合的研究仍然不足。本综述通过系统评估 XAI 在这些关键领域的作用,确定关键挑战和新兴趋势,弥补了这一空白:按照 PRISMA 指南,在 Scopus 和 Web of Science 上进行了全面的文献检索,重点是 2020 年 1 月至 2024 年 5 月期间的出版物。经过严格筛选和质量评估后,选出 69 项研究进行深入分析:综述发现了 XAI 在癌症治疗中应用的关键差距,特别是 83% 的研究排除了临床医生,这引起了人们对现实世界适用性的担忧,可能会导致技术上合理但临床上不相关的解释。此外,87% 的研究缺乏对 XAI 解释的严格评估,从而影响了其在临床实践中的可靠性。SHAP、LIME 和 Grad-CAM 等事后视觉方法的主导地位反映了一种趋势,即由于特定的输入扰动和简化假设,解释可能存在固有缺陷。缺乏正式的评估指标和标准化限制了 XAI 在临床环境中的广泛应用,造成了人工智能开发与临床整合之间的脱节。此外,由于缺乏将这些工具整合到临床工作流程中的明确指南,因此将 XAI 见解转化为可操作的临床决策仍具有挑战性:本综述强调了加强临床医生参与、标准化 XAI 评估指标、以临床医生为中心的界面、情境感知 XAI 系统以及将 XAI 整合到临床工作流程中的框架的必要性,以便在癌症护理中做出明智的临床决策并改善疗效。
{"title":"Decoding the black box: Explainable AI (XAI) for cancer diagnosis, prognosis, and treatment planning-A state-of-the art systematic review","authors":"Yusuf Abas Mohamed ,&nbsp;Bee Ee Khoo ,&nbsp;Mohd Shahrimie Mohd Asaari ,&nbsp;Mohd Ezane Aziz ,&nbsp;Fattah Rahiman Ghazali","doi":"10.1016/j.ijmedinf.2024.105689","DOIUrl":"10.1016/j.ijmedinf.2024.105689","url":null,"abstract":"<div><h3>Objective</h3><div>Explainable Artificial Intelligence (XAI) is increasingly recognized as a crucial tool in cancer care, with significant potential to enhance diagnosis, prognosis, and treatment planning. However, the holistic integration of XAI across all stages of cancer care remains underexplored. This review addresses this gap by systematically evaluating the role of XAI in these critical areas, identifying key challenges and emerging trends.</div></div><div><h3>Materials and methods</h3><div>Following the PRISMA guidelines, a comprehensive literature search was conducted across Scopus and Web of Science, focusing on publications from January 2020 to May 2024. After rigorous screening and quality assessment, 69 studies were selected for in-depth analysis.</div></div><div><h3>Results</h3><div>The review identified critical gaps in the application of XAI within cancer care, notably the exclusion of clinicians in 83% of studies, which raises concerns about real-world applicability and may lead to explanations that are technically sound but clinically irrelevant. Additionally, 87% of studies lacked rigorous evaluation of XAI explanations, compromising their reliability in clinical practice. The dominance of post-hoc visual methods like SHAP, LIME and Grad-CAM reflects a trend toward explanations that may be inherently flawed due to specific input perturbations and simplifying assumptions. The lack of formal evaluation metrics and standardization constrains broader XAI adoption in clinical settings, creating a disconnect between AI development and clinical integration. Moreover, translating XAI insights into actionable clinical decisions remains challenging due to the absence of clear guidelines for integrating these tools into clinical workflows.</div></div><div><h3>Conclusion</h3><div>This review highlights the need for greater clinician involvement, standardized XAI evaluation metrics, clinician-centric interfaces, context-aware XAI systems, and frameworks for integrating XAI into clinical workflows for informed clinical decision-making and improved outcomes in cancer care.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"193 ","pages":"Article 105689"},"PeriodicalIF":3.7,"publicationDate":"2024-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142633141","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Accuracy of machine learning in diagnosing microsatellite instability in gastric cancer: A systematic review and meta-analysis 机器学习诊断胃癌微卫星不稳定性的准确性:系统综述与荟萃分析。
IF 3.7 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-02 DOI: 10.1016/j.ijmedinf.2024.105685
Yuou Ying , Ruyi Ju , Jieyi Wang , Wenkai Li , Yuan Ji , Zhenyu Shi , Jinhan Chen , Mingxian Chen

Background

Significant challenges persist in the early identification of microsatellite instability (MSI) within current clinical practice. In recent years, with the growing utilization of machine learning (ML) in the diagnosis and management of gastric cancer (GC), numerous researchers have explored the effectiveness of ML methodologies in detecting MSI. Nevertheless, the predictive value of these approaches still lacks comprehensive evidence. Accordingly, this study was carried out to consolidate the accuracy of ML in the prompt detection of MSI in GC.

Methods

PubMed, the Cochrane Library, the Web of Science, and Embase were retrieved up to March 20, 2024. The risk of bias in the encompassed studies was evaluated utilizing a risk assessment tool for predictive models. Models were then subjected to subgroup analysis based on the modeling variables.

Results

A total of 12 studies, encompassing 11,912 patients with GC, satisfied the predefined inclusion criteria. ML models established in these studies were primarily based on pathological images, clinical features, and radiomics. The results suggested that in the validation sets, the pathological image-based models had a synthesized c-index of 0.86 [95 % CI (0.83–0.89)], with sensitivity and specificity being 0.86 [95 % CI (0.76–0.92)] and 0.83 [95 % CI (0.78–0.87)], respectively; radiomics feature-based models achieved respective values of 0.87 [95 % CI (0.81–0.92)], 0.77 [95 % CI (0.70–0.83)] and 0.81 [95 % CI (0.74–0.87)]; radiomics feature-based models + clinical feature-based models achieved respective values of 0.87 [95 % CI (0.81–0.93)], 0.78 [95 % CI (0.70–0.84)] and 0.79 [95 % CI (0.69–0.86)].

Conclusions

ML has demonstrated optimal performance in detecting MSI in GC and could serve as a prospective early adjunctive detection tool for MSI in GC. Future research should contemplate minimally invasive or non-invasive, readily collectible, and efficient predictors to augment the predictive accuracy of ML.
背景:在目前的临床实践中,早期识别微卫星不稳定性(MSI)仍然面临着巨大挑战。近年来,随着机器学习(ML)在胃癌(GC)诊断和管理中的应用日益广泛,许多研究人员探索了 ML 方法在检测 MSI 方面的有效性。然而,这些方法的预测价值仍缺乏全面的证据。因此,本研究旨在巩固 ML 在及时检测 GC 中 MSI 方面的准确性:方法:检索了截至 2024 年 3 月 20 日的 PubMed、Cochrane 图书馆、Web of Science 和 Embase。利用预测模型风险评估工具对所含研究的偏倚风险进行评估。然后根据建模变量对模型进行分组分析:共有 12 项研究符合预定的纳入标准,涵盖 11,912 名 GC 患者。这些研究建立的 ML 模型主要基于病理图像、临床特征和放射组学。结果表明,在验证集中,基于病理图像的模型的综合 c 指数为 0.86 [95 % CI (0.83-0.89)],灵敏度和特异度分别为 0.86 [95 % CI (0.76-0.92)] 和 0.83 [95 % CI (0.78-0.87)];基于放射组学特征的模型的综合 c 指数分别为 0.87[95%CI(0.81-0.92)]、0.77[95%CI(0.70-0.83)]和0.81[95%CI(0.74-0.87)];基于放射组学特征的模型+基于临床特征的模型分别达到0.87[95%CI(0.81-0.93)]、0.78[95%CI(0.70-0.84)]和0.79[95%CI(0.69-0.86)]:ML在检测GC中的MSI方面表现出最佳性能,可作为GC中MSI的前瞻性早期辅助检测工具。未来的研究应考虑采用微创或无创、易于收集和高效的预测指标来提高ML的预测准确性。
{"title":"Accuracy of machine learning in diagnosing microsatellite instability in gastric cancer: A systematic review and meta-analysis","authors":"Yuou Ying ,&nbsp;Ruyi Ju ,&nbsp;Jieyi Wang ,&nbsp;Wenkai Li ,&nbsp;Yuan Ji ,&nbsp;Zhenyu Shi ,&nbsp;Jinhan Chen ,&nbsp;Mingxian Chen","doi":"10.1016/j.ijmedinf.2024.105685","DOIUrl":"10.1016/j.ijmedinf.2024.105685","url":null,"abstract":"<div><h3>Background</h3><div>Significant challenges persist in the early identification of microsatellite instability (MSI) within current clinical practice. In recent years, with the growing utilization of machine learning (ML) in the diagnosis and management of gastric cancer (GC), numerous researchers have explored the effectiveness of ML methodologies in detecting MSI. Nevertheless, the predictive value of these approaches still lacks comprehensive evidence. Accordingly, this study was carried out to consolidate the accuracy of ML in the prompt detection of MSI in GC.</div></div><div><h3>Methods</h3><div>PubMed, the Cochrane Library, the Web of Science, and Embase were retrieved up to March 20, 2024. The risk of bias in the encompassed studies was evaluated utilizing a risk assessment tool for predictive models. Models were then subjected to subgroup analysis based on the modeling variables.</div></div><div><h3>Results</h3><div>A total of 12 studies, encompassing 11,912 patients with GC, satisfied the predefined inclusion criteria. ML models established in these studies were primarily based on pathological images, clinical features, and radiomics. The results suggested that in the validation sets, the pathological image-based models had a synthesized c-index of 0.86 [95 % CI (0.83–0.89)], with sensitivity and specificity being 0.86 [95 % CI (0.76–0.92)] and 0.83 [95 % CI (0.78–0.87)], respectively; radiomics feature-based models achieved respective values of 0.87 [95 % CI (0.81–0.92)], 0.77 [95 % CI (0.70–0.83)] and 0.81 [95 % CI (0.74–0.87)]; radiomics feature-based models + clinical feature-based models achieved respective values of 0.87 [95 % CI (0.81–0.93)], 0.78 [95 % CI (0.70–0.84)] and 0.79 [95 % CI (0.69–0.86)].</div></div><div><h3>Conclusions</h3><div>ML has demonstrated optimal performance in detecting MSI in GC and could serve as a prospective early adjunctive detection tool for MSI in GC. Future research should contemplate minimally invasive or non-invasive, readily collectible, and efficient predictors to augment the predictive accuracy of ML.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"193 ","pages":"Article 105685"},"PeriodicalIF":3.7,"publicationDate":"2024-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142607203","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Tracking provenance in clinical data warehouses for quality management 跟踪临床数据仓库中的出处,促进质量管理
IF 3.7 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-02 DOI: 10.1016/j.ijmedinf.2024.105690
Marco Johns, Lena Baum, Fabian Prasser

Introduction

Data provenance, which documents the origin, history, and transformations of data, can enhance the reproducibility of processing workflows and help to address errors and quality issues. In this work, we focus on tracking and utilizing provenance information as part of quality management in Extract-Transform-Load (ETL) processes used to build clinical data warehouses.

Methods

We designed and implemented a framework that automatically tracks how data flows through an ETL process and detects errors and quality problems during processing. This information is then reported against an Application Programming Interface (API) that stores the issues along with contextual information on their location within the data being transformed and the overall workflow. We further designed a dashboard that supports health data engineers with inspecting the encountered issues and tracing them back to their root causes.

Results

The framework was implemented in Java using the Spring Framework and integrated into ETL processes for Informatics for Integrating Biology and the Bedside (i2b2). The dashboard was realized using Grafana. We evaluated our approach on three different ETL processes for real-world datasets used to integrate them into our i2b2 clinical data warehouse. Using the provenance dashboard, we were able to identify frequent error patterns and link them to specific data points from the sources as well as ETL process steps. Provenance tracking increased the execution times of loading processes with an impact depending on the number of identified issues.

Conclusions

Provenance tracking can be a valuable tool for implementing continuous quality management for ETL processes. Relevant information can be collected from existing ETL workloads using dedicated APIs and visualized through dashboards, which support the identification of frequent patterns of problems together with their root causes, providing valuable information for improvements.
导言数据出处记录了数据的来源、历史和转换,可以提高处理工作流程的可重复性,并有助于解决错误和质量问题。我们设计并实施了一个框架,该框架可自动跟踪数据如何在 ETL 流程中流动,并检测处理过程中的错误和质量问题。然后根据应用程序接口(API)报告这些信息,应用程序接口会存储这些问题以及它们在正在转换的数据中的位置和整个工作流程的上下文信息。我们还设计了一个仪表盘,支持健康数据工程师检查遇到的问题并追溯其根源。结果该框架使用 Spring 框架在 Java 中实现,并集成到了生物与床边整合信息学(i2b2)的 ETL 流程中。仪表盘使用 Grafana 实现。我们在三个不同的 ETL 流程中对我们的方法进行了评估,这些流程用于将真实世界的数据集集成到我们的 i2b2 临床数据仓库中。通过使用出处仪表板,我们能够识别出经常出现的错误模式,并将它们与数据源中的特定数据点以及 ETL 流程步骤联系起来。出处跟踪增加了加载流程的执行时间,其影响取决于已识别问题的数量。可以使用专用的 API 从现有的 ETL 工作负载中收集相关信息,并通过仪表盘将其可视化,从而支持识别经常出现的问题模式及其根本原因,为改进工作提供有价值的信息。
{"title":"Tracking provenance in clinical data warehouses for quality management","authors":"Marco Johns,&nbsp;Lena Baum,&nbsp;Fabian Prasser","doi":"10.1016/j.ijmedinf.2024.105690","DOIUrl":"10.1016/j.ijmedinf.2024.105690","url":null,"abstract":"<div><h3>Introduction</h3><div>Data provenance, which documents the origin, history, and transformations of data, can enhance the reproducibility of processing workflows and help to address errors and quality issues. In this work, we focus on tracking and utilizing provenance information as part of quality management in Extract-Transform-Load (ETL) processes used to build clinical data warehouses.</div></div><div><h3>Methods</h3><div>We designed and implemented a framework that automatically tracks how data flows through an ETL process and detects errors and quality problems during processing. This information is then reported against an Application Programming Interface (API) that stores the issues along with contextual information on their location within the data being transformed and the overall workflow. We further designed a dashboard that supports health data engineers with inspecting the encountered issues and tracing them back to their root causes.</div></div><div><h3>Results</h3><div>The framework was implemented in Java using the Spring Framework and integrated into ETL processes for Informatics for Integrating Biology and the Bedside (i2b2). The dashboard was realized using Grafana. We evaluated our approach on three different ETL processes for real-world datasets used to integrate them into our i2b2 clinical data warehouse. Using the provenance dashboard, we were able to identify frequent error patterns and link them to specific data points from the sources as well as ETL process steps. Provenance tracking increased the execution times of loading processes with an impact depending on the number of identified issues.</div></div><div><h3>Conclusions</h3><div>Provenance tracking can be a valuable tool for implementing continuous quality management for ETL processes. Relevant information can be collected from existing ETL workloads using dedicated APIs and visualized through dashboards, which support the identification of frequent patterns of problems together with their root causes, providing valuable information for improvements.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"193 ","pages":"Article 105690"},"PeriodicalIF":3.7,"publicationDate":"2024-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142586204","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Application of the openEHR reference model for PGHD: A case study on the DH-Convener initiative 将开放式电子病历参考模型应用于 PGHD:关于 DH-Convener 计划的案例研究
IF 3.7 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-02 DOI: 10.1016/j.ijmedinf.2024.105686
Somayeh Abedian , Sten Hanke , Rada Hussein

Objectives

Patient-Generated Health Data (PGHD) is increasingly influential in therapy and diagnostic decisions. PGHD should be integrated into electronic health records (EHR) to maximize its utility. This study evaluates the openEHR Reference Model (RM) compatibility with the DH-Convener initiative’s modules (Data Collection Module and Data Connector Module) as a potential concept for standardizing PGHD across wearable health devices, focusing on achieving interoperability.

Materials and Methods

The study analyzes various types of PGHD, assessing the data formats and structures used by wearable tools. We evaluate openEHR RM specification with our initiative, DH-Convenor, focusing on PGHD semantic interoperability challenges. We evaluated current Archetypes and Templates that are now created and exist on openEHR Clinical Knowledge Management (CKM) and mapped them to our requirements. The DH-Convener modules are examined for their compatibility in standardizing PGHD integration into openEHR clinical workflows and compared with other existing standards for flexibility, scalability, and interoperability.

Results

The findings indicate that the diversity in data formats across wearable tools and openEHR shows strong potential as unifying data models based on the DH-Convener’s modules. It supports a wide range of PGHD types in existing archetypes and aligns well with our initiative’s requirements for storing PGHD, enabling more seamless integration into EHR systems.

Discussion

Integrating PGHD into EHR is crucial for personalized healthcare, but inconsistent device formats hinder interoperability. The DH-Convener leverages openEHR to provide a strong solution, though stakeholder collaboration remains essential. Our initiative demonstrates openEHR’s ability to ensure consistency, particularly in Europe.

Conclusion

We aligned the openEHR layers with the DH-Convener modules, demonstrating openEHR’s compatibility for storing PGHD and supporting interoperability goals, such as standardized storage and seamless data transfer to Austria’s national EHR. Future efforts should prioritize promoting these models and ensuring their adaptability to emerging wearable devices.
目的患者生成的健康数据 (PGHD) 对治疗和诊断决策的影响越来越大。应将 PGHD 整合到电子健康记录 (EHR) 中,以最大限度地发挥其效用。本研究评估了 openEHR 参考模型 (RM) 与 DH-Convener 计划模块(数据收集模块和数据连接器模块)的兼容性,将其作为实现可穿戴健康设备 PGHD 标准化的潜在概念,重点关注实现互操作性。我们评估了 openEHR RM 规范和我们的倡议 DH-Convenor,重点关注 PGHD 语义互操作性方面的挑战。我们评估了目前在 openEHR 临床知识管理 (CKM) 上创建和存在的原型和模板,并将其与我们的要求进行了映射。我们检查了 DH-Convener 模块在将 PGHD 标准化集成到 openEHR 临床工作流中的兼容性,并就灵活性、可扩展性和互操作性与其他现有标准进行了比较。结果研究结果表明,可穿戴工具和 openEHR 数据格式的多样性显示出基于 DH-Convener 模块的统一数据模型的强大潜力。它支持现有原型中广泛的 PGHD 类型,并与我们的倡议对存储 PGHD 的要求非常吻合,从而能够更无缝地集成到 EHR 系统中。讨论将 PGHD 集成到 EHR 中对个性化医疗保健至关重要,但不一致的设备格式阻碍了互操作性。DH-Convener 利用 openEHR 提供了一个强大的解决方案,但利益相关者的合作仍然至关重要。结论我们将 openEHR 层与 DH-Convener 模块进行了统一,证明 openEHR 在存储 PGHD 和支持互操作性目标(如标准化存储和将数据无缝传输到奥地利国家 EHR)方面具有兼容性。未来的工作应优先考虑推广这些模式,并确保它们能适应新兴的可穿戴设备。
{"title":"Application of the openEHR reference model for PGHD: A case study on the DH-Convener initiative","authors":"Somayeh Abedian ,&nbsp;Sten Hanke ,&nbsp;Rada Hussein","doi":"10.1016/j.ijmedinf.2024.105686","DOIUrl":"10.1016/j.ijmedinf.2024.105686","url":null,"abstract":"<div><h3>Objectives</h3><div>Patient-Generated Health Data (PGHD) is increasingly influential in therapy and diagnostic decisions. PGHD should be integrated into electronic health records (EHR) to maximize its utility. This study evaluates the openEHR Reference Model (RM) compatibility with the DH-Convener initiative’s modules (Data Collection Module and Data Connector Module) as a potential concept for standardizing PGHD across wearable health devices, focusing on achieving interoperability.</div></div><div><h3>Materials and Methods</h3><div>The study analyzes various types of PGHD, assessing the data formats and structures used by wearable tools. We evaluate openEHR RM specification with our initiative, DH-Convenor, focusing on PGHD semantic interoperability challenges. We evaluated current Archetypes and Templates that are now created and exist on openEHR Clinical Knowledge Management (CKM) and mapped them to our requirements. The DH-Convener modules are examined for their compatibility in standardizing PGHD integration into openEHR clinical workflows and compared with other existing standards for flexibility, scalability, and interoperability.</div></div><div><h3>Results</h3><div>The findings indicate that the diversity in data formats across wearable tools and openEHR shows strong potential as unifying data models based on the DH-Convener’s modules. It supports a wide range of PGHD types in existing archetypes and aligns well with our initiative’s requirements for storing PGHD, enabling more seamless integration into EHR systems.</div></div><div><h3>Discussion</h3><div>Integrating PGHD into EHR is crucial for personalized healthcare, but inconsistent device formats hinder interoperability. The DH-Convener leverages openEHR to provide a strong solution, though stakeholder collaboration remains essential. Our initiative demonstrates openEHR’s ability to ensure consistency, particularly in Europe.</div></div><div><h3>Conclusion</h3><div>We aligned the openEHR layers with the DH-Convener modules, demonstrating openEHR’s compatibility for storing PGHD and supporting interoperability goals, such as standardized storage and seamless data transfer to Austria’s national EHR. Future efforts should prioritize promoting these models and ensuring their adaptability to emerging wearable devices.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"193 ","pages":"Article 105686"},"PeriodicalIF":3.7,"publicationDate":"2024-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142586203","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Healthcare professionals’ cross-organizational access to electronic health records: A scoping review 医疗保健专业人员跨组织访问电子健康记录:范围界定审查
IF 3.7 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-01 DOI: 10.1016/j.ijmedinf.2024.105688
Øivind Skeidsvoll Solvang , Sonja Cassidy , Conceição Granja , Terje Solvoll

Background

Cross-organizational access to shared electronic health records can enhance integrated, people-centered health services. However, a gap remains between these potential benefits and the limited support currently offered by electronic health records. The Valkyrie research project aims to bridge this gap by developing a technical prototype of an architecture to promote healthcare service coordination.

Objective

To inform the Valkyrie project, we aimed to evaluate approaches for healthcare professionals’ access to electronic health records across healthcare providers and identify factors influencing the success and failure of these approaches.

Materials and methods

Using the Joanna Briggs Institute guidance for scoping reviews, searches were conducted in six research databases and grey literature, without limitations on year or language. Papers selected for full-text review were analyzed, and data was extracted using standardized forms that reflected the population, concept, and context framework and the categorization model used in the qualitative analysis of the barriers and facilitators reported in the included papers.

Results

Among the 290 identified papers, five were deemed eligible for full-text review. The included papers were heterogeneous in country, year of publication, study setting, implementation level, and access approaches to electronic health records, highlighting various techniques, from federated to centralized, for accessing shared electronic health records.

Discussion and conclusion

The review did not identify one single superior access approach. However, a hybrid approach incorporating components from the different approaches combined with emerging technologies may benefit the Valkyrie project. The key facilitators were identified as improved information quality and flexible and easy access. In contrast, lack of trust and poor information quality were significant barriers to successful cross-organizational access to electronic health records. Future research should explore alternative access approaches, considering information quality, user training, and collegial trust across healthcare providers.
背景跨机构访问共享电子健康记录可以加强以人为本的综合医疗服务。然而,这些潜在的好处与电子健康记录目前提供的有限支持之间仍存在差距。为了给 Valkyrie 项目提供信息,我们旨在评估医疗保健专业人员跨医疗保健提供方访问电子健康记录的方法,并确定影响这些方法成功与失败的因素。对选中进行全文审阅的论文进行了分析,并使用标准化表格提取数据,这些表格反映了人口、概念和背景框架,以及对所收录论文中报告的障碍和促进因素进行定性分析时使用的分类模型。 结果在所确定的 290 篇论文中,有 5 篇被认为符合全文审阅的条件。被收录的论文在国家、发表年份、研究背景、实施水平和电子病历访问方法方面各不相同,突出了从联合到集中等各种访问共享电子病历的技术。不过,将不同方法的组成部分与新兴技术相结合的混合方法可能会使瓦尔基里项目受益。关键的促进因素被认为是信息质量的提高和灵活便捷的访问。相比之下,缺乏信任和信息质量差是成功跨组织访问电子病历的主要障碍。未来的研究应考虑信息质量、用户培训和医疗服务提供者之间的同事信任,探索其他访问方法。
{"title":"Healthcare professionals’ cross-organizational access to electronic health records: A scoping review","authors":"Øivind Skeidsvoll Solvang ,&nbsp;Sonja Cassidy ,&nbsp;Conceição Granja ,&nbsp;Terje Solvoll","doi":"10.1016/j.ijmedinf.2024.105688","DOIUrl":"10.1016/j.ijmedinf.2024.105688","url":null,"abstract":"<div><h3>Background</h3><div>Cross-organizational access to shared electronic health records can enhance integrated, people-centered health services. However, a gap remains between these potential benefits and the limited support currently offered by electronic health records. The Valkyrie research project aims to bridge this gap by developing a technical prototype of an architecture to promote healthcare service coordination.</div></div><div><h3>Objective</h3><div>To inform the Valkyrie project, we aimed to evaluate approaches for healthcare professionals’ access to electronic health records across healthcare providers and identify factors influencing the success and failure of these approaches.</div></div><div><h3>Materials and methods</h3><div>Using the Joanna Briggs Institute guidance for scoping reviews, searches were conducted in six research databases and grey literature, without limitations on year or language. Papers selected for full-text review were analyzed, and data was extracted using standardized forms that reflected the population, concept, and context framework and the categorization model used in the qualitative analysis of the barriers and facilitators reported in the included papers.</div></div><div><h3>Results</h3><div>Among the 290 identified papers, five were deemed eligible for full-text review. The included papers were heterogeneous in country, year of publication, study setting, implementation level, and access approaches to electronic health records, highlighting various techniques, from federated to centralized, for accessing shared electronic health records.</div></div><div><h3>Discussion and conclusion</h3><div>The review did not identify one single superior access approach. However, a hybrid approach incorporating components from the different approaches combined with emerging technologies may benefit the Valkyrie project. The key facilitators were identified as improved information quality and flexible and easy access. In contrast, lack of trust and poor information quality were significant barriers to successful cross-organizational access to electronic health records. Future research should explore alternative access approaches, considering information quality, user training, and collegial trust across healthcare providers.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"193 ","pages":"Article 105688"},"PeriodicalIF":3.7,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142593198","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Acute myocardial infarction risk prediction in emergency chest pain patients: An external validation study 急诊胸痛患者急性心肌梗死风险预测:外部验证研究
IF 3.7 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-01 DOI: 10.1016/j.ijmedinf.2024.105683
Ching-Hung Chang , Phung-Anh Nguyen , Chien-Cheng Huang , Chung-Feng Liu , Septi Melisa , Chia-Jung Chen , Chien-Chin Hsu , Hung-Jung Lin , Min-Huei Hsu , Chun-Ming Shih , Ju-Chi Liu , Hung-Yu Yang , Jason C. Hsu

Background

Chest pain is a common symptom that presents to the emergency department (ED), and its causes range from minor illnesses to serious diseases such as acute coronary syndrome. Accurate and timely diagnosis is essential for the efficient management and treatment of these patients.

Objective

This study aims to expand on a model previously developed by the Chi Mei Medical Group (CMMG) Emergency Department in 2020 to predict adverse cardiac events in patients with chest pain. The main goal is to evaluate the accuracy and generalizability of the model through external validation using data from other hospitals.

Methods

The initial model for this study was developed using data from three CMMG-affiliated hospitals in southern Taiwan. We utilized four supervised machine learning algorithms, namely random forest, logistic regression, support-vector clustering, and K-nearest neighbor, to predict the risk of acute myocardial infarction within a one month for emergency chest pain patients. The study used the model with the best area under the curve (AUC), recall and precision for external validation. The external validated data source was data collected from three hospitals associated with Taipei Medical University (TMU) in northern Taiwan. Results: The original best model constructed by CMMG exhibited an AUC of 0.822, an accuracy of 0.740, a recall of 0.741, a precision of 0.566, a specificity of 0.740, and an NPV of 0.861. Subsequently, during the external validation phase, CMMG’s top-performing model demonstrated acceptable validation result with TMU’s data, achieving an AUC of 0.63, an accuracy of 0.661, a recall of 0.593, a precision of 0.243, a specificity of 0.691, and an NPV of 0.900. While the results indicate that the model’s performance varied across different datasets and are not outstanding, the model is still acceptable for clinical application as a preliminary decision-support tool.

Conclusion

This study highlights the importance of external validation to confirm the applicability of the previously developed predictive model in other hospital settings. Although the model shows potential in assessing chest pain patients in the ED, its broad clinical application requires further validation to ensure it can improve patient outcomes and optimize healthcare resource allocation.
背景胸痛是急诊科(ED)常见的症状,其原因从轻微疾病到严重疾病(如急性冠状动脉综合征)不等。本研究旨在扩展奇美医疗集团(CMMG)急诊科于 2020 年开发的预测胸痛患者不良心脏事件的模型。主要目的是通过使用其他医院的数据进行外部验证,评估模型的准确性和可推广性。方法本研究的初始模型是使用奇美医疗集团在台湾南部的三家附属医院的数据开发的。我们使用了四种有监督的机器学习算法,即随机森林、逻辑回归、支持向量聚类和 K 最近邻,来预测急诊胸痛患者在一个月内发生急性心肌梗死的风险。研究采用了曲线下面积(AUC)、召回率和精确度最佳的模型进行外部验证。外部验证数据来源于台湾北部台北医学大学的三家附属医院。结果由 CMMG 构建的原始最佳模型的 AUC 为 0.822,准确度为 0.740,召回率为 0.741,精确度为 0.566,特异性为 0.740,净现值为 0.861。随后,在外部验证阶段,CMMG 表现最出色的模型通过屯门大学的数据获得了可接受的验证结果,AUC 为 0.63,准确率为 0.661,召回率为 0.593,精确度为 0.243,特异性为 0.691,净现值为 0.900。虽然结果表明该模型在不同数据集上的表现各不相同,并不突出,但该模型作为初步的决策支持工具仍可用于临床应用。虽然该模型在评估急诊室胸痛患者方面显示出了潜力,但其广泛的临床应用还需要进一步验证,以确保它能改善患者预后并优化医疗资源分配。
{"title":"Acute myocardial infarction risk prediction in emergency chest pain patients: An external validation study","authors":"Ching-Hung Chang ,&nbsp;Phung-Anh Nguyen ,&nbsp;Chien-Cheng Huang ,&nbsp;Chung-Feng Liu ,&nbsp;Septi Melisa ,&nbsp;Chia-Jung Chen ,&nbsp;Chien-Chin Hsu ,&nbsp;Hung-Jung Lin ,&nbsp;Min-Huei Hsu ,&nbsp;Chun-Ming Shih ,&nbsp;Ju-Chi Liu ,&nbsp;Hung-Yu Yang ,&nbsp;Jason C. Hsu","doi":"10.1016/j.ijmedinf.2024.105683","DOIUrl":"10.1016/j.ijmedinf.2024.105683","url":null,"abstract":"<div><h3>Background</h3><div>Chest pain is a common symptom that presents to the emergency department (ED), and its causes range from minor illnesses to serious diseases such as acute coronary syndrome. Accurate and timely diagnosis is essential for the efficient management and treatment of these patients.</div></div><div><h3>Objective</h3><div>This study aims to expand on a model previously developed by the Chi Mei Medical Group (CMMG) Emergency Department in 2020 to predict adverse cardiac events in patients with chest pain. The main goal is to evaluate the accuracy and generalizability of the model through external validation using data from other hospitals.</div></div><div><h3>Methods</h3><div>The initial model for this study was developed using data from three CMMG-affiliated hospitals in southern Taiwan. We utilized four supervised machine learning algorithms, namely random forest, logistic regression, support-vector clustering, and K-nearest neighbor, to predict the risk of acute myocardial infarction within a one month for emergency chest pain patients. The study used the model with the best area under the curve (AUC), recall and precision for external validation. The external validated data source was data collected from three hospitals associated with Taipei Medical University (TMU) in northern Taiwan. <strong>Results:</strong> The original best model constructed by CMMG exhibited an AUC of 0.822, an accuracy of 0.740, a recall of 0.741, a precision of 0.566, a specificity of 0.740, and an NPV of 0.861. Subsequently, during the external validation phase, CMMG’s top-performing model demonstrated acceptable validation result with TMU’s data, achieving an AUC of 0.63, an accuracy of 0.661, a recall of 0.593, a precision of 0.243, a specificity of 0.691, and an NPV of 0.900. While the results indicate that the model’s performance varied across different datasets and are not outstanding, the model is still acceptable for clinical application as a preliminary decision-support tool.</div></div><div><h3>Conclusion</h3><div>This study highlights the importance of external validation to confirm the applicability of the previously developed predictive model in other hospital settings. Although the model shows potential in assessing chest pain patients in the ED, its broad clinical application requires further validation to ensure it can improve patient outcomes and optimize healthcare resource allocation.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"193 ","pages":"Article 105683"},"PeriodicalIF":3.7,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142586728","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
International Journal of Medical Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1