首页 > 最新文献

International Journal of Medical Informatics最新文献

英文 中文
Deep learning-driven ultrasound equipment quality assessment with ATS-539 phantom data 利用 ATS-539 模型数据进行深度学习驱动的超声设备质量评估。
IF 3.7 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-13 DOI: 10.1016/j.ijmedinf.2024.105698
Dong Hoon Jang , Ji Won Heo , Kyu Hong Lee , Ro Woon Lee , Tae Ran Ahn , Hyun Gyu Lee

Introduction

Ultrasound equipment provides real-time visualization of internal organs, essential for early disease detection and diagnosis. However, poor-quality ultrasound images can compromise diagnostic accuracy and increase the risk of misdiagnosis. Quality assessments are often subjective, relying on the evaluator's experience and interpretation, which can vary.

Methods

This study introduces a two-stage deep learning framework designed to objectively assess ultrasound image quality using phantom data across three key parameters: ‘Dead zone’, ‘Axial/lateral resolution’, and ‘Gray scale and dynamic range’. Stage 1 automatically extracts regions of interest for each parameter, while Stage 2 employs detection or classification models to evaluate image quality within these regions. To generate an overall equipment quality score, a logistic regression model combines the weighted results from each parameter.

Results

The classification model demonstrated high performance across datasets, achieving AUC scores of 98.6% for ‘Dead zone’, 87.7% for ‘Axial/lateral resolution’, and 96.0% for ‘Gray scale and dynamic range’. Further analysis using guideline-compliant images of individual devices showed AUC scores of 98.2%, 92.8%, and 100%, respectively. These findings highlight deep learning's potential for quantitative and objective assessments of ultrasound image quality. Ultimately, this framework provides a streamlined approach to quality management, enabling consistent quality control and efficient scoring-based evaluation of ultrasound equipment.
简介超声波设备可实时显示内脏器官,对早期疾病检测和诊断至关重要。然而,质量差的超声波图像会影响诊断的准确性,增加误诊的风险。质量评估通常是主观的,依赖于评估者的经验和解释,而这些经验和解释可能各不相同:本研究引入了一个分两个阶段的深度学习框架,旨在使用模型数据客观评估超声图像质量,包括三个关键参数:"死区"、"轴向/侧向分辨率 "以及 "灰度和动态范围"。第一阶段自动提取每个参数的关注区域,第二阶段则采用检测或分类模型来评估这些区域内的图像质量。为了得出设备质量的总分,一个逻辑回归模型综合了每个参数的加权结果:分类模型在各种数据集上都表现出很高的性能,"死区 "的 AUC 得分为 98.6%,"轴向/侧向分辨率 "的 AUC 得分为 87.7%,"灰度和动态范围 "的 AUC 得分为 96.0%。使用符合指南要求的单个设备图像进行的进一步分析显示,AUC 分数分别为 98.2%、92.8% 和 100%。这些发现凸显了深度学习在定量客观评估超声图像质量方面的潜力。最终,该框架提供了一种简化的质量管理方法,能够对超声设备进行一致的质量控制和高效的评分评估。
{"title":"Deep learning-driven ultrasound equipment quality assessment with ATS-539 phantom data","authors":"Dong Hoon Jang ,&nbsp;Ji Won Heo ,&nbsp;Kyu Hong Lee ,&nbsp;Ro Woon Lee ,&nbsp;Tae Ran Ahn ,&nbsp;Hyun Gyu Lee","doi":"10.1016/j.ijmedinf.2024.105698","DOIUrl":"10.1016/j.ijmedinf.2024.105698","url":null,"abstract":"<div><h3>Introduction</h3><div>Ultrasound equipment provides real-time visualization of internal organs, essential for early disease detection and diagnosis. However, poor-quality ultrasound images can compromise diagnostic accuracy and increase the risk of misdiagnosis. Quality assessments are often subjective, relying on the evaluator's experience and interpretation, which can vary.</div></div><div><h3>Methods</h3><div>This study introduces a two-stage deep learning framework designed to objectively assess ultrasound image quality using phantom data across three key parameters: ‘Dead zone’, ‘Axial/lateral resolution’, and ‘Gray scale and dynamic range’. Stage 1 automatically extracts regions of interest for each parameter, while Stage 2 employs detection or classification models to evaluate image quality within these regions. To generate an overall equipment quality score, a logistic regression model combines the weighted results from each parameter.</div></div><div><h3>Results</h3><div>The classification model demonstrated high performance across datasets, achieving AUC scores of 98.6% for ‘Dead zone’, 87.7% for ‘Axial/lateral resolution’, and 96.0% for ‘Gray scale and dynamic range’. Further analysis using guideline-compliant images of individual devices showed AUC scores of 98.2%, 92.8%, and 100%, respectively. These findings highlight deep learning's potential for quantitative and objective assessments of ultrasound image quality. Ultimately, this framework provides a streamlined approach to quality management, enabling consistent quality control and efficient scoring-based evaluation of ultrasound equipment.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"193 ","pages":"Article 105698"},"PeriodicalIF":3.7,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142633144","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Usefulness of self-guided digital services among mental health patients: The role of health confidence and sociodemographic characteristics 精神疾病患者对自助式数字服务的实用性:健康信心和社会人口特征的作用
IF 3.7 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-12 DOI: 10.1016/j.ijmedinf.2024.105693
Iiris Hörhammer , Johanna Suvanto , Maarit Kinnunen , Sari Kujala

Background

Remote services provided via telephone or the internet have become an essential part of mental health provision. Alongside services involving healthcare personnel (HCP), self-guided digital services hold great promise for improved self-management and health outcomes without increasing the burden on HCP. Therefore, better understanding of patients’ use and experienced benefits of these services are needed. This study investigated how health confidence and sociodemographic background are associated with mental health patients’ experiences of self-guided digital services.

Methods

This cross-sectional survey study was performed in 2022 at a Finnish Mental Health and Substance Abuse Services (MHSAS) unit of a regional public service provider that serves a population of about 163 000 people. All patients who had visited the unit up to 6 months before the study were invited to respond to an online survey on their experiences with the remote MHSAS. We report the average subjective usefulness of telephone, guided digital and self-guided digital services. Regression models were fitted to study the associations of patient characteristics with use of any digital service, and with experienced usefulness of self-guided digital services.

Findings

The respondents (n = 438) rated the usefulness of telephone, guided digital and self-guided digital services similarly (4.0/5.0, 3.9/5.0, and 3.9/5.0, respectively). Health confidence was associated with not using digital services at all as well as with high perceived usefulness of self-guided services. While elderly patients were more likely to avoid using digital services, age was not associated with experienced usefulness of self-guided digital services. No association between unemployment status and experiences of digital services was found.

Conclusions

Different types of remote services are perceived as beneficial by mental health patients. To ensure effectiveness and equity, patients’ health confidence should be considered when directing them to self-guided services. Elderly mental health patients who use digital services are equally able as younger patients to benefit from self-guided services.
背景通过电话或互联网提供的远程服务已成为心理健康服务的重要组成部分。除了有医护人员(HCP)参与的服务外,自我指导的数字服务在不增加医护人员负担的情况下,在改善自我管理和健康结果方面大有可为。因此,我们需要更好地了解患者对这些服务的使用情况和体验到的益处。本研究调查了健康信心和社会人口背景与精神疾病患者对自助式数字化服务的体验之间的关系。方法这项横断面调查研究于 2022 年在芬兰一家地区性公共服务提供商的精神健康和药物滥用服务机构(MHSAS)进行,该机构服务的人口约为 163000 人。所有在研究开始前6个月内到访过该机构的患者都受邀参加了一项在线调查,以了解他们对远程心理健康与药物滥用服务机构的体验。我们报告了电话、数字导诊和自助数字导诊服务的平均主观有用性。研究结果受访者(n = 438)对电话、引导式数字服务和引导式自助数字服务的实用性评价相似(分别为 4.0/5.0、3.9/5.0 和 3.9/5.0)。健康信心与完全不使用数字服务以及自我指导服务的高感知有用性相关。虽然老年患者更有可能避免使用数字服务,但年龄与自助式数字服务的有用性无关。结论精神疾病患者认为不同类型的远程服务都是有益的。为确保有效性和公平性,在引导患者使用自助服务时应考虑到他们的健康信心。使用数字服务的老年精神疾病患者与年轻患者一样能够从自助服务中获益。
{"title":"Usefulness of self-guided digital services among mental health patients: The role of health confidence and sociodemographic characteristics","authors":"Iiris Hörhammer ,&nbsp;Johanna Suvanto ,&nbsp;Maarit Kinnunen ,&nbsp;Sari Kujala","doi":"10.1016/j.ijmedinf.2024.105693","DOIUrl":"10.1016/j.ijmedinf.2024.105693","url":null,"abstract":"<div><h3>Background</h3><div>Remote services provided via telephone or the internet have become an essential part of mental health provision. Alongside services involving healthcare personnel (HCP), self-guided digital services hold great promise for improved self-management and health outcomes without increasing the burden on HCP. Therefore, better understanding of patients’ use and experienced benefits of these services are needed. This study investigated how health confidence and sociodemographic background are associated with mental health patients’ experiences of self-guided digital services.</div></div><div><h3>Methods</h3><div>This cross-sectional survey study was performed in 2022 at a Finnish Mental Health and Substance Abuse Services (MHSAS) unit of a regional public service provider that serves a population of about 163<!--> <!-->000 people. All patients who had visited the unit up to 6 months before the study were invited to respond to an online survey on their experiences with the remote MHSAS. We report the average subjective usefulness of telephone, guided digital and self-guided digital services. Regression models were fitted to study the associations of patient characteristics with use of any digital service, and with experienced usefulness of self-guided digital services.</div></div><div><h3>Findings</h3><div>The respondents (n = 438) rated the usefulness of telephone, guided digital and self-guided digital services similarly (4.0/5.0, 3.9/5.0, and 3.9/5.0, respectively). Health confidence was associated with not using digital services at all as well as with high perceived usefulness of self-guided services. While elderly patients were more likely to avoid using digital services, age was not associated with experienced usefulness of self-guided digital services. No association between unemployment status and experiences of digital services was found.</div></div><div><h3>Conclusions</h3><div>Different types of remote services are perceived as beneficial by mental health patients. To ensure effectiveness and equity, patients’ health confidence should be considered when directing them to self-guided services. Elderly mental health patients who use digital services are equally able as younger patients to benefit from self-guided services.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"194 ","pages":"Article 105693"},"PeriodicalIF":3.7,"publicationDate":"2024-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142656282","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data-driven explainable machine learning for personalized risk classification of myasthenic crisis 用于肌无力危机个性化风险分类的数据驱动可解释机器学习。
IF 3.7 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-12 DOI: 10.1016/j.ijmedinf.2024.105679
Sivan Bershan , Andreas Meisel , Philipp Mergenthaler

Objective

Myasthenic crisis (MC) is a critical progression of Myasthenia gravis (MG), requiring intensive care treatment and invasive therapies. Classifying patients at high-risk for MC facilitates treatment decisions such as changes in medication or the need for mechanical ventilation and helps prevent disease progression by decreasing treatment-induced stress on the patient. Here, we investigated whether it is possible to reliably classify MG patients into groups at low or high risk of MC based entirely on routine medical data using explainable machine learning (ML).

Methods

In this single-center pseudo-prospective cohort study, we investigated the precision of ML models trained with real-world routine clinical data to identify MG patients at risk for MC, and identified explainable distinctive features for the groups. 51 MG patients, including 13 MC, were used for model training based on real-world clinical data available from the hospital management system. Patients were classified to high or low risk for MC using Lasso regression or random forest ML models.

Results

The mean cross-validated AUC classifying MG patients as high or low risk for MC based on simple or compound features derived from real-world clinical data showed a predictive accuracy of 68.8% for a regularized Lasso regression and 76.5% for a random forest model. Studying feature importance across 5100 model runs identified explainable features to distinguish MG patients at high or low risk for MC. Feature importance scores suggested that multimorbidity may play a role in risk classification.

Conclusion

This study establishes feasibility and proof-of-concept for risk classification of MC based on real-world routine clinical data using ML with explainable features and variance control at the point of care. Future research on ML-based prediction of MC should include multi-center, multinational data collection, more in-depth data per patient, more patients, and an attention-based ML model to include free-text.
目的:肌无力危象(MC)是重症肌无力症(MG)的一个重要进展,需要重症监护治疗和侵入性疗法。对MC高危患者进行分类有助于做出治疗决定,如更换药物或是否需要机械通气,并通过减少治疗对患者造成的压力来预防疾病进展。在此,我们利用可解释的机器学习(ML)研究了是否有可能完全根据常规医疗数据将 MG 患者可靠地分为 MC 低风险或高风险组:在这项单中心伪前瞻性队列研究中,我们研究了使用真实世界常规临床数据训练的ML模型识别MG患者MC风险的精确度,并确定了各组可解释的显著特征。根据医院管理系统提供的真实世界临床数据,对 51 名 MG 患者(包括 13 名 MC)进行了模型训练。使用 Lasso 回归或随机森林 ML 模型将患者划分为 MC 高风险或低风险:根据真实世界临床数据中的简单或复合特征将 MG 患者划分为 MC 高风险或低风险的交叉验证 AUC 平均值显示,正则化 Lasso 回归的预测准确率为 68.8%,随机森林模型的预测准确率为 76.5%。通过对 5100 次模型运行的特征重要性进行研究,确定了可用于区分 MC 高风险或低风险 MG 患者的可解释特征。特征重要性得分表明,多病性可能在风险分类中发挥作用:本研究基于真实世界的常规临床数据,利用具有可解释特征的 ML 和护理点的方差控制,建立了 MC 风险分类的可行性和概念验证。基于 ML 的 MC 预测的未来研究应包括多中心、跨国数据收集、每个患者更深入的数据、更多的患者以及基于注意力的 ML 模型(包括自由文本)。
{"title":"Data-driven explainable machine learning for personalized risk classification of myasthenic crisis","authors":"Sivan Bershan ,&nbsp;Andreas Meisel ,&nbsp;Philipp Mergenthaler","doi":"10.1016/j.ijmedinf.2024.105679","DOIUrl":"10.1016/j.ijmedinf.2024.105679","url":null,"abstract":"<div><h3>Objective</h3><div>Myasthenic crisis (MC) is a critical progression of Myasthenia gravis (MG), requiring intensive care treatment and invasive therapies. Classifying patients at high-risk for MC facilitates treatment decisions such as changes in medication or the need for mechanical ventilation and helps prevent disease progression by decreasing treatment-induced stress on the patient. Here, we investigated whether it is possible to reliably classify MG patients into groups at low or high risk of MC based entirely on routine medical data using explainable machine learning (ML).</div></div><div><h3>Methods</h3><div>In this single-center pseudo-prospective cohort study, we investigated the precision of ML models trained with real-world routine clinical data to identify MG patients at risk for MC, and identified explainable distinctive features for the groups. 51 MG patients, including 13 MC, were used for model training based on real-world clinical data available from the hospital management system. Patients were classified to high or low risk for MC using Lasso regression or random forest ML models.</div></div><div><h3>Results</h3><div>The mean cross-validated AUC classifying MG patients as high or low risk for MC based on simple or compound features derived from real-world clinical data showed a predictive accuracy of 68.8% for a regularized Lasso regression and 76.5% for a random forest model. Studying feature importance across 5100 model runs identified explainable features to distinguish MG patients at high or low risk for MC. Feature importance scores suggested that multimorbidity may play a role in risk classification.</div></div><div><h3>Conclusion</h3><div>This study establishes feasibility and proof-of-concept for risk classification of MC based on real-world routine clinical data using ML with explainable features and variance control at the point of care. Future research on ML-based prediction of MC should include multi-center, multinational data collection, more in-depth data per patient, more patients, and an attention-based ML model to include free-text.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"194 ","pages":"Article 105679"},"PeriodicalIF":3.7,"publicationDate":"2024-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142683779","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Electronic health records and data exchange in the WHO European region: A subregional analysis of achievements, challenges, and prospects 世卫组织欧洲地区的电子病历和数据交换:对成就、挑战和前景的次区域分析
IF 3.7 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-10 DOI: 10.1016/j.ijmedinf.2024.105687
Roberto Tornero Costa , Keyrellous Adib , Nagui Salama , Stefania Davia , Antonio Martínez Millana , Vicente Traver , Karapet Davtyan
<div><div>Electronic health record (EHR) systems are powerful tools that enhance healthcare quality. They improve efficiency, enable data exchange, and ensure authorized access to patient information. In 2022, the World Health Organization Regional Office for Europe (WHO EURO) conducted a survey to assess the digital health capabilities of the 53 Member States. This article provides a sub-regional analysis of the status of EHR systems and major barriers to their implementation, their readiness for information sharing, and the access and re-use of EHR data.</div><div>Generally, EHR implementation and national data exchange are at an advanced stage in the region, though achievements and challenges vary across subregions. While more Member States in the Eastern, Western, and Southern Europe subregions reported having centralized national EHR systems, the situation is more diverse in Northern Europe and the Asian subregions, where both centralized and decentralized EHR systems are in use. Significant barriers to EHR implementation, including funding, technical capacity, competing priorities, and lack of interoperability standards are frequently cited, while others like demand, knowledge or acceptance challenges are not reported as significant. Significant barriers were reported the most by the Central Asia subregion, while barriers had least significance in Western Europe. Five out of the six subregions reported a wide adoption of national strategies and have dedicated agencies to ensure interoperability and secure data exchange. However, only 29 Member States have established legal requirements for healthcare providers to adopt EHR systems that conform to national standards for both clinical terminology and electronic messaging, with this being most notable in Western, Eastern, and Northern Europe, and the lowest percentage of Member States in Central Asia. All Member States of the six sub regions have passed privacy and data protection legislation. The use of EHR data is widely regulated, with only five remaining Member States of WHO Europe to develop EHR legislation distributed across subregions (Southern Europe, Western and Central Asia).</div><div>Looking ahead, Member States are encouraged to define national legislation governing EHR systems and their use, while ensuring the interconnectivity of the local and regional EHR systems. Sustainable funding should be allocated to the development and maintenance of these systems. Efforts should also focus on creating comprehensive roadmaps for the full implementation of health data standards, addressing interoperability at local and regional levels, and developing quality management systems for testing and certification. Additionally, monitoring and evaluation should be conducted to assess whether EHRs are contributing to national health objectives. Finally, engaging patients and intersectoral partners will be key to developing a more patient-centered approach, ensuring that EHR systems meet patient ne
电子病历(EHR)系统是提高医疗质量的强大工具。它们提高了效率,实现了数据交换,并确保对患者信息的授权访问。2022 年,世界卫生组织欧洲区域办事处(WHO EURO)开展了一项调查,以评估 53 个成员国的数字医疗能力。本文对电子健康记录系统的现状及其实施的主要障碍、其信息共享的准备情况以及电子健康记录数据的获取和再利用进行了次区域分析。总体而言,该区域的电子健康记录实施和国家数据交换处于高级阶段,但各次区域的成就和挑战各不相同。东欧、西欧和南欧次区域有更多的会员国报告拥有集中式国家电子健康记录系统,而北欧和亚洲次区域的情况则更为多样,集中式和分散式电子健康记录系统都在使用。实施电子健康记录系统的重大障碍,包括资金、技术能力、相互竞争的优先事项和缺乏互操作性标准等,经常被提及,而其他障碍,如需求、知识或接受方面的挑战,则没有被报告为重大障碍。中亚次区域报告的重大障碍最多,而西欧次区域报告的障碍最少。在六个次区域中,有五个次区域报告广泛采用了国家战略,并设立了专门机构来确保互操作性和安全的数据交换。然而,只有 29 个会员国制定了法律规定,要求医疗服务提供者采用符合国家临床术语和电子信息标准的电子病历系统,这在西欧、东欧和北欧最为明显,而中亚的会员国比例最低。六个分地区的所有会员国都通过了隐私和数据保护立法。电子健康记录数据的使用受到广泛监管,世卫组织欧洲分区域(南欧、西亚和中亚)仅剩五个会员国制定了电子健康记录立法。展望未来,鼓励会员国制定管理电子健康记录系统及其使用的国家立法,同时确保地方和区域电子健康记录系统的互联互通。应为这些系统的开发和维护划拨可持续的资金。还应重点制定全面实施健康数据标准的综合路线图,解决地方和区域一级的互操作性问题,并开发用于测试和认证的质量管理系统。此外,应开展监测和评估,以评估电子健康记录是否有助于实现国家健康目标。最后,让患者和跨部门合作伙伴参与进来,将是制定更加以患者为中心的方法,确保电子健康记录系统满足患者需求和期望的关键。
{"title":"Electronic health records and data exchange in the WHO European region: A subregional analysis of achievements, challenges, and prospects","authors":"Roberto Tornero Costa ,&nbsp;Keyrellous Adib ,&nbsp;Nagui Salama ,&nbsp;Stefania Davia ,&nbsp;Antonio Martínez Millana ,&nbsp;Vicente Traver ,&nbsp;Karapet Davtyan","doi":"10.1016/j.ijmedinf.2024.105687","DOIUrl":"10.1016/j.ijmedinf.2024.105687","url":null,"abstract":"&lt;div&gt;&lt;div&gt;Electronic health record (EHR) systems are powerful tools that enhance healthcare quality. They improve efficiency, enable data exchange, and ensure authorized access to patient information. In 2022, the World Health Organization Regional Office for Europe (WHO EURO) conducted a survey to assess the digital health capabilities of the 53 Member States. This article provides a sub-regional analysis of the status of EHR systems and major barriers to their implementation, their readiness for information sharing, and the access and re-use of EHR data.&lt;/div&gt;&lt;div&gt;Generally, EHR implementation and national data exchange are at an advanced stage in the region, though achievements and challenges vary across subregions. While more Member States in the Eastern, Western, and Southern Europe subregions reported having centralized national EHR systems, the situation is more diverse in Northern Europe and the Asian subregions, where both centralized and decentralized EHR systems are in use. Significant barriers to EHR implementation, including funding, technical capacity, competing priorities, and lack of interoperability standards are frequently cited, while others like demand, knowledge or acceptance challenges are not reported as significant. Significant barriers were reported the most by the Central Asia subregion, while barriers had least significance in Western Europe. Five out of the six subregions reported a wide adoption of national strategies and have dedicated agencies to ensure interoperability and secure data exchange. However, only 29 Member States have established legal requirements for healthcare providers to adopt EHR systems that conform to national standards for both clinical terminology and electronic messaging, with this being most notable in Western, Eastern, and Northern Europe, and the lowest percentage of Member States in Central Asia. All Member States of the six sub regions have passed privacy and data protection legislation. The use of EHR data is widely regulated, with only five remaining Member States of WHO Europe to develop EHR legislation distributed across subregions (Southern Europe, Western and Central Asia).&lt;/div&gt;&lt;div&gt;Looking ahead, Member States are encouraged to define national legislation governing EHR systems and their use, while ensuring the interconnectivity of the local and regional EHR systems. Sustainable funding should be allocated to the development and maintenance of these systems. Efforts should also focus on creating comprehensive roadmaps for the full implementation of health data standards, addressing interoperability at local and regional levels, and developing quality management systems for testing and certification. Additionally, monitoring and evaluation should be conducted to assess whether EHRs are contributing to national health objectives. Finally, engaging patients and intersectoral partners will be key to developing a more patient-centered approach, ensuring that EHR systems meet patient ne","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"194 ","pages":"Article 105687"},"PeriodicalIF":3.7,"publicationDate":"2024-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142655800","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Balancing accuracy and Interpretability: An R package assessing complex relationships beyond the Cox model and applications to clinical prediction 平衡准确性与可解释性:评估 Cox 模型之外的复杂关系并应用于临床预测的 R 软件包。
IF 3.7 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-10 DOI: 10.1016/j.ijmedinf.2024.105700
Diana Shamsutdinova , Daniel Stamate , Daniel Stahl

Background

Accurate and interpretable models are essential for clinical decision-making, where predictions can directly impact patient care. Machine learning (ML) survival methods can handle complex multidimensional data and achieve high accuracy but require post-hoc explanations. Traditional models such as the Cox Proportional Hazards Model (Cox-PH) are less flexible, but fast, stable, and intrinsically transparent. Moreover, ML does not always outperform Cox-PH in clinical settings, warranting a diligent model validation. We aimed to develop a set of R functions to help explore the limits of Cox-PH compared to the tree-based and deep learning survival models for clinical prediction modelling, employing ensemble learning and nested cross-validation.

Methods

We developed a set of R functions, publicly available as the package “survcompare”. It supports Cox-PH and Cox-Lasso, and Survival Random Forest (SRF) and DeepHit are the ML alternatives, along with the ensemble methods integrating Cox-PH with SRF or DeepHit designed to isolate the marginal value of ML. The package performs a repeated nested cross-validation and tests for statistical significance of the ML’s superiority using the survival-specific performance metrics, the concordance index, time-dependent AUC-ROC and calibration slope.
To get practical insights, we applied this methodology to clinical and simulated datasets with varying complexities and sizes.

Results

In simulated data with non-linearities or interactions, ML models outperformed Cox-PH at sample sizes ≥ 500. ML superiority was also observed in imaging and high-dimensional clinical data. However, for tabular clinical data, the performance gains of ML were minimal; in some cases, regularised Cox-Lasso recovered much of the ML’s performance advantage with significantly faster computations. Ensemble methods combining Cox-PH and ML predictions were instrumental in quantifying Cox-PH’s limits and improving ML calibration. Traditional models like Cox-PH or Cox-Lasso should not be overlooked while developing clinical predictive models from tabular data or data of limited size.

Conclusion

Our package offers researchers a framework and practical tool for evaluating the accuracy-interpretability trade-off, helping make informed decisions about model selection.
背景:准确且可解释的模型对于临床决策至关重要,因为预测会直接影响到患者护理。机器学习(ML)生存方法可以处理复杂的多维数据并获得高准确性,但需要事后解释。考克斯比例危害模型(Cox-PH)等传统模型灵活性较差,但速度快、稳定性好,而且本质上是透明的。此外,在临床环境中,ML 并不总是优于 Cox-PH,因此需要对模型进行认真的验证。我们的目标是开发一套 R 函数,利用集合学习和嵌套交叉验证,帮助探索 Cox-PH 与基于树和深度学习的生存模型相比在临床预测建模方面的局限性:我们开发了一套 R 函数,作为 "survcompare "软件包公开发布。它支持 Cox-PH 和 Cox-Lasso,生存随机森林(SRF)和 DeepHit 是 ML 的替代方法,以及将 Cox-PH 与 SRF 或 DeepHit 整合在一起的集合方法,旨在分离 ML 的边际价值。该软件包执行重复嵌套交叉验证,并使用生存特定性能指标、一致性指数、随时间变化的 AUC-ROC 和校准斜率检验 ML 优越性的统计显著性。为了获得实用的见解,我们将这种方法应用于具有不同复杂性和规模的临床和模拟数据集:结果:在具有非线性或交互作用的模拟数据中,当样本量≥ 500 时,ML 模型优于 Cox-PH。在成像和高维临床数据中也观察到了 ML 的优越性。然而,在表格临床数据中,ML 的性能提升微乎其微;在某些情况下,正则化 Cox-Lasso 恢复了 ML 的大部分性能优势,而且计算速度明显更快。结合 Cox-PH 和 ML 预测的集合方法有助于量化 Cox-PH 的局限性并改进 ML 校准。在利用表格数据或规模有限的数据开发临床预测模型时,不应忽视 Cox-PH 或 Cox-Lasso 等传统模型:我们的软件包为研究人员提供了评估准确性-可解释性权衡的框架和实用工具,有助于在模型选择方面做出明智的决策。
{"title":"Balancing accuracy and Interpretability: An R package assessing complex relationships beyond the Cox model and applications to clinical prediction","authors":"Diana Shamsutdinova ,&nbsp;Daniel Stamate ,&nbsp;Daniel Stahl","doi":"10.1016/j.ijmedinf.2024.105700","DOIUrl":"10.1016/j.ijmedinf.2024.105700","url":null,"abstract":"<div><h3>Background</h3><div>Accurate and interpretable models are essential for clinical decision-making, where predictions can directly impact patient care. Machine learning (ML) survival methods can handle complex multidimensional data and achieve high accuracy but require post-hoc explanations. Traditional models such as the Cox Proportional Hazards Model (Cox-PH) are less flexible, but fast, stable, and intrinsically transparent. Moreover, ML does not always outperform Cox-PH in clinical settings, warranting a diligent model validation. We aimed to develop a set of R functions to help explore the limits of Cox-PH compared to the tree-based and deep learning survival models for clinical prediction modelling, employing ensemble learning and nested cross-validation.</div></div><div><h3>Methods</h3><div>We developed a set of R functions, publicly available as the package “survcompare”. It supports Cox-PH and Cox-Lasso, and Survival Random Forest (SRF) and DeepHit are the ML alternatives, along with the ensemble methods integrating Cox-PH with SRF or DeepHit designed to isolate the marginal value of ML. The package performs a repeated nested cross-validation and tests for statistical significance of the ML’s superiority using the survival-specific performance metrics, the concordance index, time-dependent AUC-ROC and calibration slope.</div><div>To get practical insights, we applied this methodology to clinical and simulated datasets with varying complexities and sizes.</div></div><div><h3>Results</h3><div>In simulated data with non-linearities or interactions, ML models outperformed Cox-PH at sample sizes ≥ 500. ML superiority was also observed in imaging and high-dimensional clinical data. However, for tabular clinical data, the performance gains of ML were minimal; in some cases, regularised Cox-Lasso recovered much of the ML’s performance advantage with significantly faster computations. Ensemble methods combining Cox-PH and ML predictions were instrumental in quantifying Cox-PH’s limits and improving ML calibration. Traditional models like Cox-PH or Cox-Lasso should not be overlooked while developing clinical predictive models from tabular data or data of limited size.</div></div><div><h3>Conclusion</h3><div>Our package offers researchers a framework and practical tool for evaluating the accuracy-interpretability trade-off, helping make informed decisions about model selection.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"194 ","pages":"Article 105700"},"PeriodicalIF":3.7,"publicationDate":"2024-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142640312","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A method and validation for auditing e-Health applications based on reusable software security requirements specifications
IF 3.7 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-10 DOI: 10.1016/j.ijmedinf.2024.105699
Carlos M. Mejía-Granda, José L. Fernández-Alemán, Juan M. Carrillo de Gea, José A. García-Berná

Objective

This article deals with the complex process of obtaining security requirements for e-Health applications. It introduces a tailored audit and validation methodology particularly designed for e-Health applications. Additionally, it presents a comprehensive security catalog derived from primary sources such as law, guides, standards, best practices, and a systematic literature review. This catalog is characterized by its continuous improvement, clarity, completeness, consistency, verifiability, modifiability, and traceability.

Methods

The authors reviewed electronic health security literature and gathered primary sources of law, guides, standards, and best practices. They organized the catalog according to the ISO/IEC/IEEE 29148:2018 standard and proposed a methodology to ensure its reusability. Moreover, the authors proposed SEC-AM as an audit method. The applicability of the catalog was validated through the audit method, which was conducted on a prominent medical application, OpenEMR.

Results

The proposed method and validation for auditing e-Health Applications through the catalog provided a comprehensive framework for developing or evaluating new applications. Through the audit of OpenEMR, several security vulnerabilities were identified, such as DDOs, XSS, JSONi, and CMDi, resulting in a “Secure” classification of OpenEMR with a compliance rate of 66.97%.

Conclusion

The study demonstrates the proposed catalog’s feasibility and effectiveness in enhancing health software security. The authors suggest continuous improvement by incorporating new regulations, knowledge from additional sources, and addressing emerging zero-day vulnerabilities. This approach is crucial for providing practical, safe, and quality medical care amidst increasing cyber threats in the healthcare industry.
本文论述了获取电子医疗应用程序安全要求的复杂过程。文章介绍了专门为电子医疗应用程序设计的审计和验证方法。此外,文章还介绍了从法律、指南、标准、最佳实践等主要来源和系统文献综述中得出的综合安全目录。该目录的特点是持续改进、清晰、完整、一致、可验证、可修改和可追溯。方法作者回顾了电子医疗安全文献,收集了法律、指南、标准和最佳实践等主要来源。他们根据 ISO/IEC/IEEE 29148:2018 标准整理了目录,并提出了确保其可重用性的方法。此外,作者还提出了 SEC-AM 作为审计方法。结果提出的通过目录审核电子医疗应用程序的方法和验证为开发或评估新应用程序提供了一个全面的框架。通过对 OpenEMR 的审计,发现了几个安全漏洞,如 DDOs、XSS、JSONi 和 CMDi,结果 OpenEMR 被归类为 "安全",符合率为 66.97%。作者建议通过纳入新法规、其他来源的知识和解决新出现的零日漏洞来不断改进。这种方法对于在医疗行业网络威胁不断增加的情况下提供实用、安全和优质的医疗服务至关重要。
{"title":"A method and validation for auditing e-Health applications based on reusable software security requirements specifications","authors":"Carlos M. Mejía-Granda,&nbsp;José L. Fernández-Alemán,&nbsp;Juan M. Carrillo de Gea,&nbsp;José A. García-Berná","doi":"10.1016/j.ijmedinf.2024.105699","DOIUrl":"10.1016/j.ijmedinf.2024.105699","url":null,"abstract":"<div><h3>Objective</h3><div>This article deals with the complex process of obtaining security requirements for e-Health applications. It introduces a tailored audit and validation methodology particularly designed for e-Health applications. Additionally, it presents a comprehensive security catalog derived from primary sources such as law, guides, standards, best practices, and a systematic literature review. This catalog is characterized by its continuous improvement, clarity, completeness, consistency, verifiability, modifiability, and traceability.</div></div><div><h3>Methods</h3><div>The authors reviewed electronic health security literature and gathered primary sources of law, guides, standards, and best practices. They organized the catalog according to the ISO/IEC/IEEE 29148:2018 standard and proposed a methodology to ensure its reusability. Moreover, the authors proposed SEC-AM as an audit method. The applicability of the catalog was validated through the audit method, which was conducted on a prominent medical application, OpenEMR.</div></div><div><h3>Results</h3><div>The proposed method and validation for auditing e-Health Applications through the catalog provided a comprehensive framework for developing or evaluating new applications. Through the audit of OpenEMR, several security vulnerabilities were identified, such as DDOs, XSS, JSONi, and CMDi, resulting in a “Secure” classification of OpenEMR with a compliance rate of 66.97%.</div></div><div><h3>Conclusion</h3><div>The study demonstrates the proposed catalog’s feasibility and effectiveness in enhancing health software security. The authors suggest continuous improvement by incorporating new regulations, knowledge from additional sources, and addressing emerging zero-day vulnerabilities. This approach is crucial for providing practical, safe, and quality medical care amidst increasing cyber threats in the healthcare industry.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"194 ","pages":"Article 105699"},"PeriodicalIF":3.7,"publicationDate":"2024-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142704836","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine learning adjusted sequential CUSUM-analyses are superior to cross-sectional analysis of excess mortality after surgery 经机器学习调整的连续 CUSUM 分析优于手术后超额死亡率的横断面分析。
IF 3.7 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-09 DOI: 10.1016/j.ijmedinf.2024.105684
Florian Bösch , Stina Schild-Suhren , Elif Yilmaz , Michael Ghadimi , Athanasios Karampalis , Nikolaus Börner , Markus Bo Schoenberg

Background

The assessment of clinical outcome quality, particularly in surgery, is crucial for healthcare improvement. Traditional cross-sectional analyses often fall short in timely and systematic identification of clinical quality issues. This study explores the efficacy of machine learning adjusted sequential CUSUM (Cumulative Sum) analyses in monitoring post-surgical mortality.

Material and methods

Utilizing the Global Open Source Severity of Illness Score (GOSSIS) dataset involving 91,714 patient records from 147 hospitals, this study involved the development of a machine learning model for mortality using a modified LightGBM algorithm. With this, sequential and cross sectional quality monitoring was simulated and compared.

Results

The modified LightGBM model demonstrated superior predictive accuracy (ROC AUC of 0.88). Simulations revealed that the AI risk-adjusted CUSUM required fewer patient outcome alterations to detect atypical trends compared to standard methods.

Conclusion

The AI risk-adjusted CUSUM analysis represents a significant advancement in monitoring clinical outcome quality in healthcare, especially in surgery. Its ability to detect minor discrepancies in mortality rates with greater sensitivity and specificity positions it as a valuable tool for healthcare providers. This approach could lead to earlier interventions and improved patient care.
背景:评估临床结果质量,尤其是外科手术的临床结果质量,对于改善医疗服务至关重要。传统的横断面分析往往不能及时、系统地识别临床质量问题。本研究探讨了机器学习调整后的连续 CUSUM(累积总和)分析在监测手术后死亡率方面的功效:本研究利用全球开放源疾病严重程度评分(GOSSIS)数据集(涉及来自 147 家医院的 91,714 份患者记录),使用改进的 LightGBM 算法开发了一个死亡率机器学习模型。利用该模型,模拟并比较了连续和横截面质量监测:结果:改进后的 LightGBM 模型显示出卓越的预测准确性(ROC AUC 为 0.88)。模拟结果显示,与标准方法相比,人工智能风险调整后的 CUSUM 需要更少的患者结果改变来检测非典型趋势:人工智能风险调整 CUSUM 分析代表了医疗保健临床结果质量监控的重大进步,尤其是在外科领域。它能以更高的灵敏度和特异性发现死亡率中的微小差异,是医疗服务提供者的重要工具。这种方法可以尽早采取干预措施,改善患者护理。
{"title":"Machine learning adjusted sequential CUSUM-analyses are superior to cross-sectional analysis of excess mortality after surgery","authors":"Florian Bösch ,&nbsp;Stina Schild-Suhren ,&nbsp;Elif Yilmaz ,&nbsp;Michael Ghadimi ,&nbsp;Athanasios Karampalis ,&nbsp;Nikolaus Börner ,&nbsp;Markus Bo Schoenberg","doi":"10.1016/j.ijmedinf.2024.105684","DOIUrl":"10.1016/j.ijmedinf.2024.105684","url":null,"abstract":"<div><h3>Background</h3><div>The assessment of clinical outcome quality, particularly in surgery, is crucial for healthcare improvement. Traditional cross-sectional analyses often fall short in timely and systematic identification of clinical quality issues. This study explores the efficacy of machine learning adjusted sequential CUSUM (Cumulative Sum) analyses in monitoring post-surgical mortality.</div></div><div><h3>Material and methods</h3><div>Utilizing the Global Open Source Severity of Illness Score (GOSSIS) dataset involving 91,714 patient records from 147 hospitals, this study involved the development of a machine learning model for mortality using a modified LightGBM algorithm. With this, sequential and cross sectional quality monitoring was simulated and compared.</div></div><div><h3>Results</h3><div>The modified LightGBM model demonstrated superior predictive accuracy (ROC AUC of 0.88). Simulations revealed that the AI risk-adjusted CUSUM required fewer patient outcome alterations to detect atypical trends compared to standard methods.</div></div><div><h3>Conclusion</h3><div>The AI risk-adjusted CUSUM analysis represents a significant advancement in monitoring clinical outcome quality in healthcare, especially in surgery. Its ability to detect minor discrepancies in mortality rates with greater sensitivity and specificity positions it as a valuable tool for healthcare providers. This approach could lead to earlier interventions and improved patient care.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"193 ","pages":"Article 105684"},"PeriodicalIF":3.7,"publicationDate":"2024-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142640309","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Counterfactual reasoning using causal Bayesian networks as a healthcare governance tool 利用因果贝叶斯网络进行反事实推理,作为医疗治理工具。
IF 3.7 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-09 DOI: 10.1016/j.ijmedinf.2024.105681
Evangelia Kyrimi , Somayyeh Mossadegh , Jared M. Wohlgemut , Rebecca S. Stoner , Nigel R.M. Tai , William Marsh

Background

Healthcare governance (HG) is a quality assurance processes that aims to maintain and improve clinical practice. Clinical decisions are routinely reviewed after the outcome is known to learn lessons for the future. When the outcome is positive, then practice is praised, but when practice is suboptimal, the area for improvement is highlighted. This process requires counterfactual reasoning, where we predict what would have happened given both what happened and the possible different decisions. Causal models that capture the mechanisms that generate events can support counterfactual reasoning.

Objective

This study is an initial attempt to show how counterfactual reasoning with causal Bayesian networks (CBNs) can be used as a HG tool to assess what would have happened if treatments other than those occurred had been selected.

Methods

Motivated by the Defence Medical Services (DMS) mortality and morbidity (M&M) review meeting, in this paper we (1) extended the use of counterfactual reasoning in CBNs to review decisions, where the alternative treatment strategies and its effect belong to different stages of care, (2) placed counterfactual reasoning in a specific clinical context to examine how it can be used as a HG tool.

Results

Using three realistic examples, we demonstrated how the proposed counterfactual reasoning can be used to assist the DMS M&M review meetings.

Conclusions

Useful lessons can be learned by assessing decisions after they are made. M&M review meetings are fruitful ground for counterfactual reasoning. The use of a clinical decision support tool that can assist clinicians in assessing counterfactual probabilities will be beneficial.
背景:医疗治理(HG)是一种质量保证程序,旨在维护和改进临床实践。在得知结果后,会对临床决策进行例行审查,以便为未来吸取经验教训。如果结果是积极的,那么临床实践就会受到表扬,但如果临床实践不尽人意,就会强调需要改进的地方。这一过程需要进行反事实推理,即我们预测在已发生的事情和可能的不同决策下会发生什么。捕捉事件产生机制的因果模型可以为反事实推理提供支持:本研究是一次初步尝试,旨在展示如何利用因果贝叶斯网络(CBN)进行反事实推理,将其作为一种 HG 工具,用于评估如果选择的治疗方法与已发生的治疗方法不同,将会发生什么情况:受国防医疗服务(DMS)死亡率和发病率(M&M)审查会议的启发,我们在本文中(1)将因果贝叶斯网络中的反事实推理扩展到审查决策中,在审查决策中,替代治疗策略及其效果属于不同的护理阶段;(2)将反事实推理置于特定的临床环境中,以研究如何将其用作一种健康治理工具:结果:通过三个真实的例子,我们展示了所提出的反事实推理如何用于协助 DMS M&M 评审会议:结论:通过在决策做出后对其进行评估,可以吸取有用的经验教训。M&M 评审会议是进行反事实推理的有益场所。使用临床决策支持工具协助临床医生评估反事实概率将大有裨益。
{"title":"Counterfactual reasoning using causal Bayesian networks as a healthcare governance tool","authors":"Evangelia Kyrimi ,&nbsp;Somayyeh Mossadegh ,&nbsp;Jared M. Wohlgemut ,&nbsp;Rebecca S. Stoner ,&nbsp;Nigel R.M. Tai ,&nbsp;William Marsh","doi":"10.1016/j.ijmedinf.2024.105681","DOIUrl":"10.1016/j.ijmedinf.2024.105681","url":null,"abstract":"<div><h3>Background</h3><div>Healthcare governance (HG) is a quality assurance processes that aims to maintain and improve clinical practice. Clinical decisions are routinely reviewed after the outcome is known to learn lessons for the future. When the outcome is positive, then practice is praised, but when practice is suboptimal, the area for improvement is highlighted. This process requires counterfactual reasoning, where we predict what would have happened given both what happened and the possible different decisions. Causal models that capture the mechanisms that generate events can support counterfactual reasoning.</div></div><div><h3>Objective</h3><div>This study is an initial attempt to show how counterfactual reasoning with causal Bayesian networks (CBNs) can be used as a HG tool to assess what would have happened if treatments other than those occurred had been selected.</div></div><div><h3>Methods</h3><div>Motivated by the Defence Medical Services (DMS) mortality and morbidity (M&amp;M) review meeting, in this paper we (1) extended the use of counterfactual reasoning in CBNs to review decisions, where the alternative treatment strategies and its effect belong to different stages of care, (2) placed counterfactual reasoning in a specific clinical context to examine how it can be used as a HG tool.</div></div><div><h3>Results</h3><div>Using three realistic examples, we demonstrated how the proposed counterfactual reasoning can be used to assist the DMS M&amp;M review meetings.</div></div><div><h3>Conclusions</h3><div>Useful lessons can be learned by assessing decisions after they are made. M&amp;M review meetings are fruitful ground for counterfactual reasoning. The use of a clinical decision support tool that can assist clinicians in assessing counterfactual probabilities will be beneficial.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"193 ","pages":"Article 105681"},"PeriodicalIF":3.7,"publicationDate":"2024-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142633138","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Healthy nutrition and weight management for a positive pregnancy experience in the antenatal period: Comparison of responses from artificial intelligence models on nutrition during pregnancy 产前健康营养和体重管理可带来积极的孕期体验:比较人工智能模型对孕期营养的反应。
IF 3.7 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-07 DOI: 10.1016/j.ijmedinf.2024.105663
Emine Karacan

Background

As artificial intelligence AI-supported applications become integral to web-based information-seeking, assessing their impact on healthy nutrition and weight management during the antenatal period is crucial.

Objective

This study was conducted to evaluate both the quality and semantic similarity of responses created by AI models to the most frequently asked questions about healthy nutrition and weight management during the antenatal period, based on existing clinical knowledge.

Methods

In this study, a cross-sectional assessment design was used to explore data from 3 AI models (GPT-4, MedicalGPT, Med-PaLM). We directed the most frequently asked questions about nutrition during pregnancy, obtained from the American College of Obstetricians and Gynecologists (ACOG) to each model in a new and single session on October 21, 2023, without any prior conversation. Immediately after, instructions were given to the AI models to generate responses to these questions. The responses created by AI models were evaluated using the Grading of Recommendations Assessment, Development, and Evaluation (GRADE) scale. Additionally, to assess the semantic similarity between answers to 31 pregnancy nutrition-related frequently asked questions sourced from the ACOG and responses from AI models we evaluated cosine similarity using both WORD2VEC and BioLORD-2023.

Results

Med-PaLM outperformed GPT-4 and MedicalGPT in response quality (mean = 3.93), demonstrating superior clinical accuracy over both GPT-4 (p = 0.016) and MedicalGPT (p = 0.001). GPT-4 had higher quality than MedicalGPT (p = 0.027).
The semantic similarity between ACOG and Med-PaLM is higher with WORD2VEC (0.92) compared to BioLORD-2023 (0.81), showing a difference of +0.11. The similarity scores for ACOG–MedicalGPT and ACOG–GPT-4 are similar across both models, with minimal differences of −0.01. Overall, WORD2VEC has a slightly higher average similarity (0.82) than BioLORD-2023 (0.79), with a difference of +0.03.

Conclusions

Despite the superior performance of Med-PaLM, there is a need for further evidence-based research and improvement in the integration of AI in healthcare due to varying AI model performances.
背景:随着人工智能AI支持的应用程序成为网络信息搜索的组成部分,评估它们对产前健康营养和体重管理的影响至关重要:本研究以现有临床知识为基础,评估人工智能模型对产前健康营养和体重管理方面最常见问题所做回答的质量和语义相似性:本研究采用横断面评估设计,对 3 个人工智能模型(GPT-4、MedicalGPT、Med-PaLM)的数据进行研究。2023 年 10 月 21 日,我们将从美国妇产科医师学会(ACOG)获得的有关孕期营养的最常见问题,在没有任何事先交谈的情况下,以新的单次会议形式传授给每个模型。紧接着,向人工智能模型发出指令,让其生成对这些问题的回答。人工智能模型生成的回答采用建议评估、开发和评价分级法(GRADE)进行评估。此外,为了评估来自 ACOG 的 31 个妊娠营养相关常见问题的回答与人工智能模型的回答之间的语义相似性,我们使用 WORD2VEC 和 BioLORD-2023 评估了余弦相似性:Med-PaLM 的回答质量(平均值 = 3.93)优于 GPT-4 和 MedicalGPT,临床准确性优于 GPT-4 (p = 0.016) 和 MedicalGPT (p = 0.001)。GPT-4 的质量高于 MedicalGPT(p = 0.027)。ACOG 和 Med-PaLM 的语义相似度 WORD2VEC(0.92)高于 BioLORD-2023(0.81),两者相差+0.11。两个模型中 ACOG-MedicalGPT 和 ACOG-GPT-4 的相似度得分相似,差异极小,均为 -0.01。总体而言,WORD2VEC 的平均相似度(0.82)略高于 BioLORD-2023(0.79),差异为 +0.03:尽管 Med-PaLM 的性能优越,但由于人工智能模型的性能参差不齐,在将人工智能整合到医疗保健领域方面仍需要进一步的循证研究和改进。
{"title":"Healthy nutrition and weight management for a positive pregnancy experience in the antenatal period: Comparison of responses from artificial intelligence models on nutrition during pregnancy","authors":"Emine Karacan","doi":"10.1016/j.ijmedinf.2024.105663","DOIUrl":"10.1016/j.ijmedinf.2024.105663","url":null,"abstract":"<div><h3>Background</h3><div>As artificial intelligence AI-supported applications become integral to web-based information-seeking, assessing their impact on healthy nutrition and weight management during the antenatal period is crucial.</div></div><div><h3>Objective</h3><div>This study was conducted to evaluate both the quality and semantic similarity of responses created by AI models to the most frequently asked questions about healthy nutrition and weight management during the antenatal period, based on existing clinical knowledge.</div></div><div><h3>Methods</h3><div>In this study, a cross-sectional assessment design was used to explore data from 3 AI models (GPT-4, MedicalGPT, Med-PaLM). We directed the most frequently asked questions about nutrition during pregnancy, obtained from the American College of Obstetricians and Gynecologists (ACOG) to each model in a new and single session on October 21, 2023, without any prior conversation. Immediately after, instructions were given to the AI models to generate responses to these questions. The responses created by AI models were evaluated using the Grading of Recommendations Assessment, Development, and Evaluation (GRADE) scale. Additionally, to assess the semantic similarity between answers to 31 pregnancy nutrition-related frequently asked questions sourced from the ACOG and responses from AI models we evaluated cosine similarity using both WORD2VEC and BioLORD-2023.</div></div><div><h3>Results</h3><div>Med-PaLM outperformed GPT-4 and MedicalGPT in response quality (mean = 3.93), demonstrating superior clinical accuracy over both GPT-4 (p = 0.016) and MedicalGPT (p = 0.001). GPT-4 had higher quality than MedicalGPT (p = 0.027).</div><div>The semantic similarity between ACOG and Med-PaLM is higher with WORD2VEC (0.92) compared to BioLORD-2023 (0.81), showing a difference of +0.11. The similarity scores for ACOG–MedicalGPT and ACOG–GPT-4 are similar across both models, with minimal differences of −0.01. Overall, WORD2VEC has a slightly higher average similarity (0.82) than BioLORD-2023 (0.79), with a difference of +0.03.</div></div><div><h3>Conclusions</h3><div>Despite the superior performance of Med-PaLM, there is a need for further evidence-based research and improvement in the integration of AI in healthcare due to varying AI model performances.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"193 ","pages":"Article 105663"},"PeriodicalIF":3.7,"publicationDate":"2024-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142633152","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A machine-learning-based algorithm for bone marrow cell differential counting 基于机器学习的骨髓细胞差异计数算法。
IF 3.7 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-11-07 DOI: 10.1016/j.ijmedinf.2024.105692
Ta-Chuan Yu , Cheng-Kun Yang , Wei-Han Hsu , Cheng-An Hsu , Hsiao-Chun Wang , Hsin-Jung Hsiao , Hsiao-Ling Chao , Han-Peng Hsieh , Jia-Rong Wu , Yen-Chun Tsai , Yi-Mei Chiang , Poshing Lee , Che-Pin Lin , Ling-Ping Chen , Yung-Chuan Sung , Ya-Yun Yang , Chin-Ling Yu , Chih-Kang Lin , Chia-Pin Kang , Che-Wei Chang , Wen-Chien Chou

Background

Differential counting (DC) of different cell types in bone marrow (BM) aspiration smears is crucial for diagnosing hematological diseases. However, a clinically applicable method for automatic DC has yet to be developed.

Objective

This study developed and validated an artificial intelligence (AI)-based algorithm for identifying and classifying nucleated cells in BM smears.

Methods

In the development phase, a mask region–based convolutional neural network (Mask R-CNN)-based AI model was trained to detect and classify individual BM cells. We used a large data set of expert-annotated images representing a variety of disease categories. The BM slides were stained with Liu’s stain or Wright–Giemsa stain. Consensus meetings were held to ensure experts from different institutes applied consistent criteria in classifying cells. Subsequently, the performance of the AI algorithm in identifying cell images and determining cell ratios was evaluated using a multinational clinical dataset.

Results

The AI model was trained on 542 slides (85.1 % stained with Liu’s stain and 14.9 % with Wright–Giemsa stain) containing 597,222 annotated cells. It achieved an accuracy of 0.94 for the testing dataset containing 26,170 cells. The performance of the AI model was further validated using another multinational real-world dataset (data obtained from three centers in Taiwan and one in the United States) comprising 200,639 cells. The AI model achieved an accuracy of 0.881 in classifying individual cells and demonstrated high precision in classifying blasts (0.927), bands and polymorphonuclear neutrophils (0.955), plasma cells (0.930), and lymphocytes (0.789). When the differential counting percentage of each cell type was assessed, a strong correlation (ρ > 0.8) between the AI and manual methods was observed for most cell categories.

Conclusions

In this study, an AI algorithm was developed and clinically validated using large, multinational datasets. Our algorithm can locate and classify BM cells simultaneously and has potential clinical applicability for automating BM differential counting.
背景:骨髓(BM)抽吸涂片中不同类型细胞的鉴别计数(DC)对于诊断血液病至关重要。然而,目前尚未开发出一种适用于临床的自动 DC 方法:本研究开发并验证了一种基于人工智能(AI)的算法,用于识别和分类骨髓涂片中的有核细胞:在开发阶段,我们训练了一个基于掩膜区域卷积神经网络(Mask R-CNN)的人工智能模型,以检测单个血液涂片细胞并对其进行分类。我们使用了一个大型数据集,其中包含专家标注的代表各种疾病类别的图像。用刘氏染色法或 Wright-Giemsa 染色法对 BM 切片进行染色。我们召开了共识会议,以确保来自不同机构的专家在对细胞进行分类时采用一致的标准。随后,使用跨国临床数据集评估了人工智能算法在识别细胞图像和确定细胞比例方面的性能:结果:人工智能模型是在包含 597,222 个注释细胞的 542 张幻灯片(85.1% 用刘氏染色法染色,14.9% 用赖特-吉氏染色法染色)上进行训练的。在包含 26,170 个细胞的测试数据集上,该模型的准确率达到了 0.94。人工智能模型的性能通过另一个包含 200 639 个细胞的跨国真实数据集(数据来自台湾的三个中心和美国的一个中心)得到了进一步验证。人工智能模型对单个细胞进行分类的准确率为 0.881,在对胚泡(0.927)、带状和多形核中性粒细胞(0.955)、浆细胞(0.930)和淋巴细胞(0.789)进行分类时表现出很高的精确度。在评估每种细胞类型的差异计数百分比时,大多数细胞类别的人工智能和手动方法之间都有很强的相关性(ρ > 0.8):本研究开发了一种人工智能算法,并利用大型跨国数据集进行了临床验证。我们的算法可以同时定位和分类骨髓细胞,具有临床应用潜力,可实现骨髓细胞差异计数的自动化。
{"title":"A machine-learning-based algorithm for bone marrow cell differential counting","authors":"Ta-Chuan Yu ,&nbsp;Cheng-Kun Yang ,&nbsp;Wei-Han Hsu ,&nbsp;Cheng-An Hsu ,&nbsp;Hsiao-Chun Wang ,&nbsp;Hsin-Jung Hsiao ,&nbsp;Hsiao-Ling Chao ,&nbsp;Han-Peng Hsieh ,&nbsp;Jia-Rong Wu ,&nbsp;Yen-Chun Tsai ,&nbsp;Yi-Mei Chiang ,&nbsp;Poshing Lee ,&nbsp;Che-Pin Lin ,&nbsp;Ling-Ping Chen ,&nbsp;Yung-Chuan Sung ,&nbsp;Ya-Yun Yang ,&nbsp;Chin-Ling Yu ,&nbsp;Chih-Kang Lin ,&nbsp;Chia-Pin Kang ,&nbsp;Che-Wei Chang ,&nbsp;Wen-Chien Chou","doi":"10.1016/j.ijmedinf.2024.105692","DOIUrl":"10.1016/j.ijmedinf.2024.105692","url":null,"abstract":"<div><h3>Background</h3><div>Differential counting (DC) of different cell types in bone marrow (BM) aspiration smears is crucial for diagnosing hematological diseases. However, a clinically applicable method for automatic DC has yet to be developed.</div></div><div><h3>Objective</h3><div>This study developed and validated an artificial intelligence (AI)-based algorithm for identifying and classifying nucleated cells in BM smears.</div></div><div><h3>Methods</h3><div>In the development phase, a mask region–based convolutional neural network (Mask R-CNN)-based AI model was trained to detect and classify individual BM cells. We used a large data set of expert-annotated images representing a variety of disease categories. The BM slides were stained with Liu’s stain or Wright–Giemsa stain. Consensus meetings were held to ensure experts from different institutes applied consistent criteria in classifying cells. Subsequently, the performance of the AI algorithm in identifying cell images and determining cell ratios was evaluated using a multinational clinical dataset.</div></div><div><h3>Results</h3><div>The AI model was trained on 542 slides (85.1 % stained with Liu’s stain and 14.9 % with Wright–Giemsa stain) containing 597,222 annotated cells. It achieved an accuracy of 0.94 for the testing dataset containing 26,170 cells. The performance of the AI model was further validated using another multinational real-world dataset (data obtained from three centers in Taiwan and one in the United States) comprising 200,639 cells. The AI model achieved an accuracy of 0.881 in classifying individual cells and demonstrated high precision in classifying blasts (0.927), bands and polymorphonuclear neutrophils (0.955), plasma cells (0.930), and lymphocytes (0.789). When the differential counting percentage of each cell type was assessed, a strong correlation (ρ &gt; 0.8) between the AI and manual methods was observed for most cell categories.</div></div><div><h3>Conclusions</h3><div>In this study, an AI algorithm was developed and clinically validated using large, multinational datasets. Our algorithm can locate and classify BM cells simultaneously and has potential clinical applicability for automating BM differential counting.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"194 ","pages":"Article 105692"},"PeriodicalIF":3.7,"publicationDate":"2024-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142645226","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
International Journal of Medical Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1