Background: Anomalies in healthcare refer to deviation from the norm of unusual or unexpected patterns or activities related to patients, diseases or medical centres. Detecting these anomalies is crucial for timely interventions and efficient decision-making, helping to identify issues like operational inefficiencies, fraud and emerging health complications.
Objectives: This study presents a novel method for detecting both collective and individual anomalies in healthcare data through time series analysis using unsupervised machine learning. The dual-strategy approach leverages two methodologies: a 'practice centre-based approach' which monitors changes across different practice centres and a 'process-based approach' which focuses on identifying anomalies within individual centres. The former allows for early detection of systemic issues, while the latter highlights specific irregularities within a centre's operations.
Methods: The study utilised a dataset over 500,000 medical records from multiple GP practice centres in the UK collected between 2018-2023. Data are clustered using DBSCAN to identify collective anomalies from deviations from linear trends in consecutive two-month scatterplots. Individual anomalies are identified by examining the SOM-clustered time series of various medical processes within a specific practice centre, where graphs show deviation from the typical pattern.
Findings: Our approach addresses some challenges posed by the complexity and sensitivity of healthcare data by not requiring personal information. The method offers accurate visual representations making the data accessible and interpretable for non-technical users. Unlike traditional methods focusing solely on subsequence anomalies, our technique analyses the collective behaviour across multiple time series providing a more comprehensive perspective.
Conclusion: This study underscores the importance of integrating unsupervised anomaly detection with clinical expertise to ensure that statistically anomalous patterns align with clinical relevance. The dual-strategy clustering method holds significant potential for enabling timely interventions, proactively identifying potential crises, and ultimately contributing to better decision-making and operational efficiency within the healthcare sector.
{"title":"Smart data-driven medical decisions through collective and individual anomaly detection in healthcare time series.","authors":"Farbod Khanizadeh, Alireza Ettefaghian, George Wilson, Amirali Shirazibeheshti, Tarek Radwan, Cristina Luca","doi":"10.1016/j.ijmedinf.2024.105696","DOIUrl":"https://doi.org/10.1016/j.ijmedinf.2024.105696","url":null,"abstract":"<p><strong>Background: </strong>Anomalies in healthcare refer to deviation from the norm of unusual or unexpected patterns or activities related to patients, diseases or medical centres. Detecting these anomalies is crucial for timely interventions and efficient decision-making, helping to identify issues like operational inefficiencies, fraud and emerging health complications.</p><p><strong>Objectives: </strong>This study presents a novel method for detecting both collective and individual anomalies in healthcare data through time series analysis using unsupervised machine learning. The dual-strategy approach leverages two methodologies: a 'practice centre-based approach' which monitors changes across different practice centres and a 'process-based approach' which focuses on identifying anomalies within individual centres. The former allows for early detection of systemic issues, while the latter highlights specific irregularities within a centre's operations.</p><p><strong>Methods: </strong>The study utilised a dataset over 500,000 medical records from multiple GP practice centres in the UK collected between 2018-2023. Data are clustered using DBSCAN to identify collective anomalies from deviations from linear trends in consecutive two-month scatterplots. Individual anomalies are identified by examining the SOM-clustered time series of various medical processes within a specific practice centre, where graphs show deviation from the typical pattern.</p><p><strong>Findings: </strong>Our approach addresses some challenges posed by the complexity and sensitivity of healthcare data by not requiring personal information. The method offers accurate visual representations making the data accessible and interpretable for non-technical users. Unlike traditional methods focusing solely on subsequence anomalies, our technique analyses the collective behaviour across multiple time series providing a more comprehensive perspective.</p><p><strong>Conclusion: </strong>This study underscores the importance of integrating unsupervised anomaly detection with clinical expertise to ensure that statistically anomalous patterns align with clinical relevance. The dual-strategy clustering method holds significant potential for enabling timely interventions, proactively identifying potential crises, and ultimately contributing to better decision-making and operational efficiency within the healthcare sector.</p>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"194 ","pages":"105696"},"PeriodicalIF":3.7,"publicationDate":"2024-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142683780","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-16DOI: 10.1016/j.ijmedinf.2024.105704
Xiao Luo, Xin Cui, Rui Wang, Yi Cheng, Ronghui Zhu, Yaoyong Tai, Cheng Wu, Jia He
Background: Stroke recurrence readmission poses an additional burden on both patients and healthcare systems. Risk stratification aims to accurately divide patients into groups to provide targeted interventions at reducing readmission. To accurately predict short and intermediate-term risks of readmission and provide information for further temporal risk stratification, we developed and validated an interpretable machine learning risk scoring tool.
Methods: In this retrospective study, all stroke admission episodes from January 1st 2015 to December 31st 2019 were obtained from the Shanghai Health and Health Development Research Centre database, which covers medical records of all patients hospitalized in 436 medical institutes in Shanghai. The outcome was time to stroke recurrence readmission within 90 days post discharge. The Score for Stroke Recurrence Readmission Prediction (SSRRP) tool was derived via an interpretable machine learning-based system for time-to-event outcomes. SSRRP as six-variable survival score includes sequelae, length of stay, type of stroke, random plasma glucose, medical expense payment, and number of hospitalizations.
Results: A total of 339,212 S admission episodes were finally included in the whole cohort. Among them, 217,393 episodes were included in the training dataset, 54,347 episodes in the internal validation dataset, and 67,472 in the temporal validation dataset. Readmission within 90 days was documented in 33922(9.97 %) episodes, with a median time to emergency readmission of 19 days (Interquartile range: 8-43). In the temporal validation dataset, the SSRRP achieved an integrated area under the curve of 0.730(95 % CI, 0.724-0.737). In addition, SSRRP demonstrated good calibration and clinical benefit rate.
Conclusions: In this retrospective cohort study, the SSRRP, a parsimonious and point-based scoring tool, was developed to predict the risk of recurrent readmission for stroke. It also provided accurate information on the time to stroke readmission, enabling further temporal risk stratification and informed clinical decision-making.
{"title":"An interpretable machine learning scoring tool for estimating time to recurrence readmissions in stroke patients.","authors":"Xiao Luo, Xin Cui, Rui Wang, Yi Cheng, Ronghui Zhu, Yaoyong Tai, Cheng Wu, Jia He","doi":"10.1016/j.ijmedinf.2024.105704","DOIUrl":"https://doi.org/10.1016/j.ijmedinf.2024.105704","url":null,"abstract":"<p><strong>Background: </strong>Stroke recurrence readmission poses an additional burden on both patients and healthcare systems. Risk stratification aims to accurately divide patients into groups to provide targeted interventions at reducing readmission. To accurately predict short and intermediate-term risks of readmission and provide information for further temporal risk stratification, we developed and validated an interpretable machine learning risk scoring tool.</p><p><strong>Methods: </strong>In this retrospective study, all stroke admission episodes from January 1st 2015 to December 31st 2019 were obtained from the Shanghai Health and Health Development Research Centre database, which covers medical records of all patients hospitalized in 436 medical institutes in Shanghai. The outcome was time to stroke recurrence readmission within 90 days post discharge. The Score for Stroke Recurrence Readmission Prediction (SSRRP) tool was derived via an interpretable machine learning-based system for time-to-event outcomes. SSRRP as six-variable survival score includes sequelae, length of stay, type of stroke, random plasma glucose, medical expense payment, and number of hospitalizations.</p><p><strong>Results: </strong>A total of 339,212 S admission episodes were finally included in the whole cohort. Among them, 217,393 episodes were included in the training dataset, 54,347 episodes in the internal validation dataset, and 67,472 in the temporal validation dataset. Readmission within 90 days was documented in 33922(9.97 %) episodes, with a median time to emergency readmission of 19 days (Interquartile range: 8-43). In the temporal validation dataset, the SSRRP achieved an integrated area under the curve of 0.730(95 % CI, 0.724-0.737). In addition, SSRRP demonstrated good calibration and clinical benefit rate.</p><p><strong>Conclusions: </strong>In this retrospective cohort study, the SSRRP, a parsimonious and point-based scoring tool, was developed to predict the risk of recurrent readmission for stroke. It also provided accurate information on the time to stroke readmission, enabling further temporal risk stratification and informed clinical decision-making.</p>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"194 ","pages":"105704"},"PeriodicalIF":3.7,"publicationDate":"2024-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142677758","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-16DOI: 10.1016/j.ijmedinf.2024.105707
Anas Abu-Doleh, Isam F Abu-Qasmieh, Hiam H Al-Quran, Ihssan S Masad, Lamis R Banyissa, Marwa Alhaj Ahmad
Background: Autism Spectrum Disorder (ASD) is a neurodevelopmental condition that affects social interactions and behavior. Accurate and early diagnosis of ASD is still challenging even with the improvements in neuroimaging technology and machine learning algorithms. It's challenging because of the wide range of symptoms, delayed appearance of symptoms, and the subjective nature of diagnosis. In this study, the aim is to enhance ASD recognition by focusing on brain subcortical regions, which are critical for understanding ASD pathology.
Methodology: First, subcortical structures were extracted from a collection of brain MRI datasets using sophisticated processing steps. Next, a 3D autoencoder was trained on these 3D images to help identify brain regions related to ASD. Two distinct feature selection methods were then applied to the features extracted from the encoder. The highest-ranked features were iteratively selected and increased to reconstruct a specific percentage of the brain that represents the most relevant parts for ASD. Finally, a Siamese Convolutional Neural Network (SCNN) was employed as the classifier model.
Results: The 3D autoencoder stage helped in identifying and reconstructing the significant subcortical regions related to ASD. Based on the studied dataset, high agreement in regions like the Putamen and Pallidum indicated the critical nature of these structures in distinguishing Autism from controls cases. Subsequently, applying SCNN on these selected subcortical regions yielded promising results. For example, using the classifier on the output regions identified by the Mutual Information (MI) features selection method achieved the highest accuracy of 0.66.
Conclusions: This study shows that using a two-stage model involving autoencoder and SCNN can notably improve the classification of ASD from brain MRI volumetric images. Applying an iterative feature extraction approach allowed to achieve a more accurate identification of ASD-related brain areas. This two-stage approach not only improved classification performance but also enhanced the interpretability of the neuroimaging data.
{"title":"Recognition of autism in subcortical brain volumetric images using autoencoding-based region selection method and Siamese Convolutional Neural Network.","authors":"Anas Abu-Doleh, Isam F Abu-Qasmieh, Hiam H Al-Quran, Ihssan S Masad, Lamis R Banyissa, Marwa Alhaj Ahmad","doi":"10.1016/j.ijmedinf.2024.105707","DOIUrl":"https://doi.org/10.1016/j.ijmedinf.2024.105707","url":null,"abstract":"<p><strong>Background: </strong>Autism Spectrum Disorder (ASD) is a neurodevelopmental condition that affects social interactions and behavior. Accurate and early diagnosis of ASD is still challenging even with the improvements in neuroimaging technology and machine learning algorithms. It's challenging because of the wide range of symptoms, delayed appearance of symptoms, and the subjective nature of diagnosis. In this study, the aim is to enhance ASD recognition by focusing on brain subcortical regions, which are critical for understanding ASD pathology.</p><p><strong>Methodology: </strong>First, subcortical structures were extracted from a collection of brain MRI datasets using sophisticated processing steps. Next, a 3D autoencoder was trained on these 3D images to help identify brain regions related to ASD. Two distinct feature selection methods were then applied to the features extracted from the encoder. The highest-ranked features were iteratively selected and increased to reconstruct a specific percentage of the brain that represents the most relevant parts for ASD. Finally, a Siamese Convolutional Neural Network (SCNN) was employed as the classifier model.</p><p><strong>Results: </strong>The 3D autoencoder stage helped in identifying and reconstructing the significant subcortical regions related to ASD. Based on the studied dataset, high agreement in regions like the Putamen and Pallidum indicated the critical nature of these structures in distinguishing Autism from controls cases. Subsequently, applying SCNN on these selected subcortical regions yielded promising results. For example, using the classifier on the output regions identified by the Mutual Information (MI) features selection method achieved the highest accuracy of 0.66.</p><p><strong>Conclusions: </strong>This study shows that using a two-stage model involving autoencoder and SCNN can notably improve the classification of ASD from brain MRI volumetric images. Applying an iterative feature extraction approach allowed to achieve a more accurate identification of ASD-related brain areas. This two-stage approach not only improved classification performance but also enhanced the interpretability of the neuroimaging data.</p>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"194 ","pages":"105707"},"PeriodicalIF":3.7,"publicationDate":"2024-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142677760","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A major concern for cancer survivors after treatment is the Fear of Cancer Recurrence (FCR), which is the fear that cancer will reappear or progress. This fear can be exacerbated by medical uncertainty about the future, leading to harmful obsession and having a negative impact on quality of life. This study aims to develop a predictive Machine Learning (ML) model using healthcare reimbursement data to better predict FCR and understand the factors influencing FCR in women with breast cancer five years after their diagnosis.
Materials and Methods
We used data from the VICAN (VIe après le CANcer) survey to propose an interpretable model to identify patients at risk of developing clinical FCR. The reimbursement data for each patient were analyzed beyond the first two years of treatment, excluding the initial phase influenced by the cancer curative therapeutic process. Data experiments were conducted, including the use of algorithms such as Random Forest, Support Vector Machines, Gradient Boosting, eXtreme Gradient Boosting, and Multilayer Perceptron. The AUC was used to choose the optimal model.
Results
The dataset is composed of 918 patients classified regarding FCR. The experimental design incorporated classification levels of medications, biological and medical procedures. Subsequently, data was generated for two experiments, facilitating examination at the ultimate healthcare classification level in Experiment 1, while Experiment 2 targeted the penultimate classification level. Overall, the best-performing model achieved an AUC of 66%. This demonstrates some effectiveness of the algorithms in detecting patients at risk of developing clinical FCR and encourages further investigations to enhance the model's performance and assess its generalizability.
Conclusion
ML applied to reimbursement data has shown promise in predicting FCR, aiding in the identification of patients at risk of developing it. The results pave the way for personalized prevention and intervention strategies, representing a significant advancement in postcancer care focusing on the needs of survivors.
目的:癌症幸存者在接受治疗后最担心的问题是 "癌症复发恐惧"(Fear of Cancer Recurrence,FCR),即害怕癌症再次复发或恶化。这种恐惧会因医学上对未来的不确定性而加剧,导致有害的痴迷,并对生活质量产生负面影响。本研究旨在利用医疗报销数据开发一个预测性机器学习(ML)模型,以更好地预测乳腺癌女性患者在确诊五年后的FCR,并了解影响FCR的因素:我们利用 VICAN(VIe après le CANcer)调查的数据提出了一个可解释的模型,用于识别有临床 FCR 风险的患者。我们对每位患者治疗头两年的报销数据进行了分析,其中不包括受癌症治愈治疗过程影响的初始阶段。进行了数据实验,包括使用随机森林、支持向量机、梯度提升、极端梯度提升和多层感知器等算法。使用 AUC 来选择最佳模型:数据集由 918 名按 FCR 分类的患者组成。实验设计包括药物、生物和医疗程序的分类级别。随后,产生了两个实验的数据,实验 1 在最终的医疗保健分类级别进行检查,而实验 2 则针对倒数第二个分类级别。总体而言,表现最好的模型的 AUC 达到了 66%。这表明算法在检测有临床 FCR 风险的患者方面具有一定的有效性,并鼓励进一步研究以提高模型的性能并评估其通用性:应用于报销数据的 ML 在预测 FCR 方面显示出了前景,有助于识别有患 FCR 风险的患者。这些结果为个性化的预防和干预策略铺平了道路,代表了以幸存者需求为重点的癌症后护理领域的一大进步。
{"title":"Predicting Fear of Breast Cancer Recurrence in women five years after diagnosis using Machine Learning and healthcare reimbursement data from the French nationwide VICAN survey","authors":"Mamoudou Koume , Lorène Seguin , Julien Mancini , Marc-Karim Bendiane , Anne-Déborah Bouhnik , Raquel Urena","doi":"10.1016/j.ijmedinf.2024.105705","DOIUrl":"10.1016/j.ijmedinf.2024.105705","url":null,"abstract":"<div><h3>Objective</h3><div>A major concern for cancer survivors after treatment is the Fear of Cancer Recurrence (FCR), which is the fear that cancer will reappear or progress. This fear can be exacerbated by medical uncertainty about the future, leading to harmful obsession and having a negative impact on quality of life. This study aims to develop a predictive Machine Learning (ML) model using healthcare reimbursement data to better predict FCR and understand the factors influencing FCR in women with breast cancer five years after their diagnosis.</div></div><div><h3>Materials and Methods</h3><div>We used data from the VICAN (VIe après le CANcer) survey to propose an interpretable model to identify patients at risk of developing clinical FCR. The reimbursement data for each patient were analyzed beyond the first two years of treatment, excluding the initial phase influenced by the cancer curative therapeutic process. Data experiments were conducted, including the use of algorithms such as Random Forest, Support Vector Machines, Gradient Boosting, eXtreme Gradient Boosting, and Multilayer Perceptron. The AUC was used to choose the optimal model.</div></div><div><h3>Results</h3><div>The dataset is composed of 918 patients classified regarding FCR. The experimental design incorporated classification levels of medications, biological and medical procedures. Subsequently, data was generated for two experiments, facilitating examination at the ultimate healthcare classification level in Experiment 1, while Experiment 2 targeted the penultimate classification level. Overall, the best-performing model achieved an AUC of 66%. This demonstrates some effectiveness of the algorithms in detecting patients at risk of developing clinical FCR and encourages further investigations to enhance the model's performance and assess its generalizability.</div></div><div><h3>Conclusion</h3><div>ML applied to reimbursement data has shown promise in predicting FCR, aiding in the identification of patients at risk of developing it. The results pave the way for personalized prevention and intervention strategies, representing a significant advancement in postcancer care focusing on the needs of survivors.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"193 ","pages":"Article 105705"},"PeriodicalIF":3.7,"publicationDate":"2024-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142640310","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Background: Heart failure with preserved ejection fraction (HFpEF) is associated with elevated rates of readmission and mortality. Accurate prediction of readmission risk is essential for optimizing healthcare resources and enhancing patient outcomes.
Methods: We conducted a retrospective cohort study utilizing HFpEF patient data from two institutions: the First Affiliated Hospital Zhejiang University School of Medicine for model development and internal validation, and the Affiliated Hospital of Xuzhou Medical University for external validation. A machine learning (ML) model was developed and validated using 53 variables to predict the risk of readmission within one year. The model's performance was assessed using several metrics, including the area under the receiver operating characteristic curve (AUC), accuracy, sensitivity, specificity, F1 score, model training time, model prediction time and brier score. SHAP (SHapley Additive exPlanations) analysis was employed to enhance model interpretability, and a dynamic nomogram was constructed to visualize the predictive model.
Results: Among the 766 HFpEF patients included in the study, 203 (26.5%) were readmitted within one year. The LightGBM model exhibited the highest predictive performance, with an AUC of 0.88 (95% confidence interval (CI):0.84-0.91), an accuracy of 0.79, a sensitivity of 0.81, and a specificity of 0.78. Key predictors included the E/e' ratio, NYHA classification, LVEF, age, BNP levels, MLR, history of atrial fibrillation (AF), use of ACEI/ARB/ARNI, and history of myocardial infarction (MI). External validation also demonstrated strong predictive performance, with an AUC of 0.87 (95%CI:0.83-0.91).
Conclusions: The LightGBM model exhibited robust performance in predicting one-year readmission risk among HFpEF patients, providing a valuable tool for clinicians to identify high-risk individuals and implement timely interventions.
{"title":"Development and validation of a machine learning model to predict the risk of readmission within one year in HFpEF patients: Short title: Prediction of HFpEF readmission.","authors":"Yue Hu, Fanghui Ma, Mengjie Hu, Binbing Shi, Defeng Pan, Jingjing Ren","doi":"10.1016/j.ijmedinf.2024.105703","DOIUrl":"https://doi.org/10.1016/j.ijmedinf.2024.105703","url":null,"abstract":"<p><strong>Background: </strong>Heart failure with preserved ejection fraction (HFpEF) is associated with elevated rates of readmission and mortality. Accurate prediction of readmission risk is essential for optimizing healthcare resources and enhancing patient outcomes.</p><p><strong>Methods: </strong>We conducted a retrospective cohort study utilizing HFpEF patient data from two institutions: the First Affiliated Hospital Zhejiang University School of Medicine for model development and internal validation, and the Affiliated Hospital of Xuzhou Medical University for external validation. A machine learning (ML) model was developed and validated using 53 variables to predict the risk of readmission within one year. The model's performance was assessed using several metrics, including the area under the receiver operating characteristic curve (AUC), accuracy, sensitivity, specificity, F1 score, model training time, model prediction time and brier score. SHAP (SHapley Additive exPlanations) analysis was employed to enhance model interpretability, and a dynamic nomogram was constructed to visualize the predictive model.</p><p><strong>Results: </strong>Among the 766 HFpEF patients included in the study, 203 (26.5%) were readmitted within one year. The LightGBM model exhibited the highest predictive performance, with an AUC of 0.88 (95% confidence interval (CI):0.84-0.91), an accuracy of 0.79, a sensitivity of 0.81, and a specificity of 0.78. Key predictors included the E/e' ratio, NYHA classification, LVEF, age, BNP levels, MLR, history of atrial fibrillation (AF), use of ACEI/ARB/ARNI, and history of myocardial infarction (MI). External validation also demonstrated strong predictive performance, with an AUC of 0.87 (95%CI:0.83-0.91).</p><p><strong>Conclusions: </strong>The LightGBM model exhibited robust performance in predicting one-year readmission risk among HFpEF patients, providing a valuable tool for clinicians to identify high-risk individuals and implement timely interventions.</p>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"194 ","pages":"105703"},"PeriodicalIF":3.7,"publicationDate":"2024-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142689805","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-14DOI: 10.1016/j.ijmedinf.2024.105691
Gizell Green
Introduction: In the digital age, electronic health literacy (eHealth literacy) has become crucial for maintaining and improving health outcomes. As the population ages, developing and validating tools that accurately measure eHealth literacy levels among older adults in different cultures is essential.
Objectives: This study aimed to validate the Hebrew version of the electronic Health Literacy scale among Israelis aged 65 and older by assessing its psychometric properties, including content validity, construct validity, age-based convergent validity, internal consistency reliability, and test-retest reliability.
Methods: A sample of 628 Israelis aged 65 and older was recruited using convenience sampling. Participants completed an online survey consisting of the HeHEALS, demographic questions, items related to participants' use of online health information sources, and measures of self-rated health, satisfaction with health, and perceived health compared to others. Psychometric properties were assessed using various statistical analyses.
Results: The HeHEALS demonstrated good content validity, construct validity, age-based convergent validity, internal consistency reliability, and test-retest reliability. Exploratory factor analysis supported a unidimensional structure of the HeHEALS. Significant positive correlations were found between HeHEALS and education, income, and subjective health measures. Users of online health information sources had significantly higher electronic health literacy scores than non-users.
Conclusions: The HeHEALS is a valid and reliable tool for assessing eHealth literacy among older adults in Israel, with potential applications in research and practice to promote digital health equity.
{"title":"Electronic Health Literacy among Older Adults: Development and Psychometric Validation of the Hebrew Version of the Electronic Health Literacy Questionnaire.","authors":"Gizell Green","doi":"10.1016/j.ijmedinf.2024.105691","DOIUrl":"https://doi.org/10.1016/j.ijmedinf.2024.105691","url":null,"abstract":"<p><strong>Introduction: </strong>In the digital age, electronic health literacy (eHealth literacy) has become crucial for maintaining and improving health outcomes. As the population ages, developing and validating tools that accurately measure eHealth literacy levels among older adults in different cultures is essential.</p><p><strong>Objectives: </strong>This study aimed to validate the Hebrew version of the electronic Health Literacy scale among Israelis aged 65 and older by assessing its psychometric properties, including content validity, construct validity, age-based convergent validity, internal consistency reliability, and test-retest reliability.</p><p><strong>Methods: </strong>A sample of 628 Israelis aged 65 and older was recruited using convenience sampling. Participants completed an online survey consisting of the HeHEALS, demographic questions, items related to participants' use of online health information sources, and measures of self-rated health, satisfaction with health, and perceived health compared to others. Psychometric properties were assessed using various statistical analyses.</p><p><strong>Results: </strong>The HeHEALS demonstrated good content validity, construct validity, age-based convergent validity, internal consistency reliability, and test-retest reliability. Exploratory factor analysis supported a unidimensional structure of the HeHEALS. Significant positive correlations were found between HeHEALS and education, income, and subjective health measures. Users of online health information sources had significantly higher electronic health literacy scores than non-users.</p><p><strong>Conclusions: </strong>The HeHEALS is a valid and reliable tool for assessing eHealth literacy among older adults in Israel, with potential applications in research and practice to promote digital health equity.</p>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"194 ","pages":"105691"},"PeriodicalIF":3.7,"publicationDate":"2024-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142689806","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-14DOI: 10.1016/j.ijmedinf.2024.105702
Cansu Alptekin, Jared M Wohlgemut, Zane B Perkins, William Marsh, Nigel R M Tai, Barbaros Yet
Introduction: Both predictions and performance of clinical predictive models can be presented with various verbal and visual representations. This study aims to investigate how different risk and performance presentations for probabilistic predictions affect clinical users' judgement and preferences.
Methods: We use a clinical Bayesian Network (BN) model that has been developed for predicting the risk of Trauma Induced Coagulopathy (TIC). Three patient scenarios with different levels of TIC risk were shown to trauma care clinicians. The prediction and discriminatory performance of TIC BN were shown with each scenario using different charts in a random order. Bar charts, icon arrays and gauge charts were used for presenting the prediction. Receiver operating characteristic curves, true and false positive rate curves and icon arrays were used for presenting the performance. Risk judgement for patient scenarios, perceived accuracy for the predictions and the model, and preferences for charts were elicited using an online survey.
Results: A total of 25 clinicians evaluated 75 BN predictions. The choice of risk charts was associated with the risk score in the borderline medium-risk scenario. The choice of risk and performance charts interacts with the perceived accuracy of the predictions and model in the high and low-risk scenarios, respectively. The participants had varying but persistent preferences regarding risk presentation charts. Icon arrays were preferred for performance presentations.
Conclusions: The choice of presenting predictions and the performance of predictive models can affect risk and performance interpretation. Clinical predictive models should offer the flexibility of presenting predictions with different illustrations.
{"title":"Presenting predictions and performance of probabilistic models for clinical decision support in trauma care.","authors":"Cansu Alptekin, Jared M Wohlgemut, Zane B Perkins, William Marsh, Nigel R M Tai, Barbaros Yet","doi":"10.1016/j.ijmedinf.2024.105702","DOIUrl":"https://doi.org/10.1016/j.ijmedinf.2024.105702","url":null,"abstract":"<p><strong>Introduction: </strong>Both predictions and performance of clinical predictive models can be presented with various verbal and visual representations. This study aims to investigate how different risk and performance presentations for probabilistic predictions affect clinical users' judgement and preferences.</p><p><strong>Methods: </strong>We use a clinical Bayesian Network (BN) model that has been developed for predicting the risk of Trauma Induced Coagulopathy (TIC). Three patient scenarios with different levels of TIC risk were shown to trauma care clinicians. The prediction and discriminatory performance of TIC BN were shown with each scenario using different charts in a random order. Bar charts, icon arrays and gauge charts were used for presenting the prediction. Receiver operating characteristic curves, true and false positive rate curves and icon arrays were used for presenting the performance. Risk judgement for patient scenarios, perceived accuracy for the predictions and the model, and preferences for charts were elicited using an online survey.</p><p><strong>Results: </strong>A total of 25 clinicians evaluated 75 BN predictions. The choice of risk charts was associated with the risk score in the borderline medium-risk scenario. The choice of risk and performance charts interacts with the perceived accuracy of the predictions and model in the high and low-risk scenarios, respectively. The participants had varying but persistent preferences regarding risk presentation charts. Icon arrays were preferred for performance presentations.</p><p><strong>Conclusions: </strong>The choice of presenting predictions and the performance of predictive models can affect risk and performance interpretation. Clinical predictive models should offer the flexibility of presenting predictions with different illustrations.</p>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"194 ","pages":"105702"},"PeriodicalIF":3.7,"publicationDate":"2024-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142696081","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-13DOI: 10.1016/j.ijmedinf.2024.105698
Dong Hoon Jang , Ji Won Heo , Kyu Hong Lee , Ro Woon Lee , Tae Ran Ahn , Hyun Gyu Lee
Introduction
Ultrasound equipment provides real-time visualization of internal organs, essential for early disease detection and diagnosis. However, poor-quality ultrasound images can compromise diagnostic accuracy and increase the risk of misdiagnosis. Quality assessments are often subjective, relying on the evaluator's experience and interpretation, which can vary.
Methods
This study introduces a two-stage deep learning framework designed to objectively assess ultrasound image quality using phantom data across three key parameters: ‘Dead zone’, ‘Axial/lateral resolution’, and ‘Gray scale and dynamic range’. Stage 1 automatically extracts regions of interest for each parameter, while Stage 2 employs detection or classification models to evaluate image quality within these regions. To generate an overall equipment quality score, a logistic regression model combines the weighted results from each parameter.
Results
The classification model demonstrated high performance across datasets, achieving AUC scores of 98.6% for ‘Dead zone’, 87.7% for ‘Axial/lateral resolution’, and 96.0% for ‘Gray scale and dynamic range’. Further analysis using guideline-compliant images of individual devices showed AUC scores of 98.2%, 92.8%, and 100%, respectively. These findings highlight deep learning's potential for quantitative and objective assessments of ultrasound image quality. Ultimately, this framework provides a streamlined approach to quality management, enabling consistent quality control and efficient scoring-based evaluation of ultrasound equipment.
{"title":"Deep learning-driven ultrasound equipment quality assessment with ATS-539 phantom data","authors":"Dong Hoon Jang , Ji Won Heo , Kyu Hong Lee , Ro Woon Lee , Tae Ran Ahn , Hyun Gyu Lee","doi":"10.1016/j.ijmedinf.2024.105698","DOIUrl":"10.1016/j.ijmedinf.2024.105698","url":null,"abstract":"<div><h3>Introduction</h3><div>Ultrasound equipment provides real-time visualization of internal organs, essential for early disease detection and diagnosis. However, poor-quality ultrasound images can compromise diagnostic accuracy and increase the risk of misdiagnosis. Quality assessments are often subjective, relying on the evaluator's experience and interpretation, which can vary.</div></div><div><h3>Methods</h3><div>This study introduces a two-stage deep learning framework designed to objectively assess ultrasound image quality using phantom data across three key parameters: ‘Dead zone’, ‘Axial/lateral resolution’, and ‘Gray scale and dynamic range’. Stage 1 automatically extracts regions of interest for each parameter, while Stage 2 employs detection or classification models to evaluate image quality within these regions. To generate an overall equipment quality score, a logistic regression model combines the weighted results from each parameter.</div></div><div><h3>Results</h3><div>The classification model demonstrated high performance across datasets, achieving AUC scores of 98.6% for ‘Dead zone’, 87.7% for ‘Axial/lateral resolution’, and 96.0% for ‘Gray scale and dynamic range’. Further analysis using guideline-compliant images of individual devices showed AUC scores of 98.2%, 92.8%, and 100%, respectively. These findings highlight deep learning's potential for quantitative and objective assessments of ultrasound image quality. Ultimately, this framework provides a streamlined approach to quality management, enabling consistent quality control and efficient scoring-based evaluation of ultrasound equipment.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"193 ","pages":"Article 105698"},"PeriodicalIF":3.7,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142633144","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-12DOI: 10.1016/j.ijmedinf.2024.105693
Iiris Hörhammer , Johanna Suvanto , Maarit Kinnunen , Sari Kujala
Background
Remote services provided via telephone or the internet have become an essential part of mental health provision. Alongside services involving healthcare personnel (HCP), self-guided digital services hold great promise for improved self-management and health outcomes without increasing the burden on HCP. Therefore, better understanding of patients’ use and experienced benefits of these services are needed. This study investigated how health confidence and sociodemographic background are associated with mental health patients’ experiences of self-guided digital services.
Methods
This cross-sectional survey study was performed in 2022 at a Finnish Mental Health and Substance Abuse Services (MHSAS) unit of a regional public service provider that serves a population of about 163 000 people. All patients who had visited the unit up to 6 months before the study were invited to respond to an online survey on their experiences with the remote MHSAS. We report the average subjective usefulness of telephone, guided digital and self-guided digital services. Regression models were fitted to study the associations of patient characteristics with use of any digital service, and with experienced usefulness of self-guided digital services.
Findings
The respondents (n = 438) rated the usefulness of telephone, guided digital and self-guided digital services similarly (4.0/5.0, 3.9/5.0, and 3.9/5.0, respectively). Health confidence was associated with not using digital services at all as well as with high perceived usefulness of self-guided services. While elderly patients were more likely to avoid using digital services, age was not associated with experienced usefulness of self-guided digital services. No association between unemployment status and experiences of digital services was found.
Conclusions
Different types of remote services are perceived as beneficial by mental health patients. To ensure effectiveness and equity, patients’ health confidence should be considered when directing them to self-guided services. Elderly mental health patients who use digital services are equally able as younger patients to benefit from self-guided services.
{"title":"Usefulness of self-guided digital services among mental health patients: The role of health confidence and sociodemographic characteristics","authors":"Iiris Hörhammer , Johanna Suvanto , Maarit Kinnunen , Sari Kujala","doi":"10.1016/j.ijmedinf.2024.105693","DOIUrl":"10.1016/j.ijmedinf.2024.105693","url":null,"abstract":"<div><h3>Background</h3><div>Remote services provided via telephone or the internet have become an essential part of mental health provision. Alongside services involving healthcare personnel (HCP), self-guided digital services hold great promise for improved self-management and health outcomes without increasing the burden on HCP. Therefore, better understanding of patients’ use and experienced benefits of these services are needed. This study investigated how health confidence and sociodemographic background are associated with mental health patients’ experiences of self-guided digital services.</div></div><div><h3>Methods</h3><div>This cross-sectional survey study was performed in 2022 at a Finnish Mental Health and Substance Abuse Services (MHSAS) unit of a regional public service provider that serves a population of about 163<!--> <!-->000 people. All patients who had visited the unit up to 6 months before the study were invited to respond to an online survey on their experiences with the remote MHSAS. We report the average subjective usefulness of telephone, guided digital and self-guided digital services. Regression models were fitted to study the associations of patient characteristics with use of any digital service, and with experienced usefulness of self-guided digital services.</div></div><div><h3>Findings</h3><div>The respondents (n = 438) rated the usefulness of telephone, guided digital and self-guided digital services similarly (4.0/5.0, 3.9/5.0, and 3.9/5.0, respectively). Health confidence was associated with not using digital services at all as well as with high perceived usefulness of self-guided services. While elderly patients were more likely to avoid using digital services, age was not associated with experienced usefulness of self-guided digital services. No association between unemployment status and experiences of digital services was found.</div></div><div><h3>Conclusions</h3><div>Different types of remote services are perceived as beneficial by mental health patients. To ensure effectiveness and equity, patients’ health confidence should be considered when directing them to self-guided services. Elderly mental health patients who use digital services are equally able as younger patients to benefit from self-guided services.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"194 ","pages":"Article 105693"},"PeriodicalIF":3.7,"publicationDate":"2024-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142656282","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-12DOI: 10.1016/j.ijmedinf.2024.105679
Sivan Bershan, Andreas Meisel, Philipp Mergenthaler
Objective: Myasthenic crisis (MC) is a critical progression of Myasthenia gravis (MG), requiring intensive care treatment and invasive therapies. Classifying patients at high-risk for MC facilitates treatment decisions such as changes in medication or the need for mechanical ventilation and helps prevent disease progression by decreasing treatment-induced stress on the patient. Here, we investigated whether it is possible to reliably classify MG patients into groups at low or high risk of MC based entirely on routine medical data using explainable machine learning (ML).
Methods: In this single-center pseudo-prospective cohort study, we investigated the precision of ML models trained with real-world routine clinical data to identify MG patients at risk for MC, and identified explainable distinctive features for the groups. 51 MG patients, including 13 MC, were used for model training based on real-world clinical data available from the hospital management system. Patients were classified to high or low risk for MC using Lasso regression or random forest ML models.
Results: The mean cross-validated AUC classifying MG patients as high or low risk for MC based on simple or compound features derived from real-world clinical data showed a predictive accuracy of 68.8% for a regularized Lasso regression and 76.5% for a random forest model. Studying feature importance across 5100 model runs identified explainable features to distinguish MG patients at high or low risk for MC. Feature importance scores suggested that multimorbidity may play a role in risk classification.
Conclusion: This study establishes feasibility and proof-of-concept for risk classification of MC based on real-world routine clinical data using ML with explainable features and variance control at the point of care. Future research on ML-based prediction of MC should include multi-center, multinational data collection, more in-depth data per patient, more patients, and an attention-based ML model to include free-text.
目的:肌无力危象(MC)是重症肌无力症(MG)的一个重要进展,需要重症监护治疗和侵入性疗法。对MC高危患者进行分类有助于做出治疗决定,如更换药物或是否需要机械通气,并通过减少治疗对患者造成的压力来预防疾病进展。在此,我们利用可解释的机器学习(ML)研究了是否有可能完全根据常规医疗数据将 MG 患者可靠地分为 MC 低风险或高风险组:在这项单中心伪前瞻性队列研究中,我们研究了使用真实世界常规临床数据训练的ML模型识别MG患者MC风险的精确度,并确定了各组可解释的显著特征。根据医院管理系统提供的真实世界临床数据,对 51 名 MG 患者(包括 13 名 MC)进行了模型训练。使用 Lasso 回归或随机森林 ML 模型将患者划分为 MC 高风险或低风险:根据真实世界临床数据中的简单或复合特征将 MG 患者划分为 MC 高风险或低风险的交叉验证 AUC 平均值显示,正则化 Lasso 回归的预测准确率为 68.8%,随机森林模型的预测准确率为 76.5%。通过对 5100 次模型运行的特征重要性进行研究,确定了可用于区分 MC 高风险或低风险 MG 患者的可解释特征。特征重要性得分表明,多病性可能在风险分类中发挥作用:本研究基于真实世界的常规临床数据,利用具有可解释特征的 ML 和护理点的方差控制,建立了 MC 风险分类的可行性和概念验证。基于 ML 的 MC 预测的未来研究应包括多中心、跨国数据收集、每个患者更深入的数据、更多的患者以及基于注意力的 ML 模型(包括自由文本)。
{"title":"Data-driven explainable machine learning for personalized risk classification of myasthenic crisis.","authors":"Sivan Bershan, Andreas Meisel, Philipp Mergenthaler","doi":"10.1016/j.ijmedinf.2024.105679","DOIUrl":"https://doi.org/10.1016/j.ijmedinf.2024.105679","url":null,"abstract":"<p><strong>Objective: </strong>Myasthenic crisis (MC) is a critical progression of Myasthenia gravis (MG), requiring intensive care treatment and invasive therapies. Classifying patients at high-risk for MC facilitates treatment decisions such as changes in medication or the need for mechanical ventilation and helps prevent disease progression by decreasing treatment-induced stress on the patient. Here, we investigated whether it is possible to reliably classify MG patients into groups at low or high risk of MC based entirely on routine medical data using explainable machine learning (ML).</p><p><strong>Methods: </strong>In this single-center pseudo-prospective cohort study, we investigated the precision of ML models trained with real-world routine clinical data to identify MG patients at risk for MC, and identified explainable distinctive features for the groups. 51 MG patients, including 13 MC, were used for model training based on real-world clinical data available from the hospital management system. Patients were classified to high or low risk for MC using Lasso regression or random forest ML models.</p><p><strong>Results: </strong>The mean cross-validated AUC classifying MG patients as high or low risk for MC based on simple or compound features derived from real-world clinical data showed a predictive accuracy of 68.8% for a regularized Lasso regression and 76.5% for a random forest model. Studying feature importance across 5100 model runs identified explainable features to distinguish MG patients at high or low risk for MC. Feature importance scores suggested that multimorbidity may play a role in risk classification.</p><p><strong>Conclusion: </strong>This study establishes feasibility and proof-of-concept for risk classification of MC based on real-world routine clinical data using ML with explainable features and variance control at the point of care. Future research on ML-based prediction of MC should include multi-center, multinational data collection, more in-depth data per patient, more patients, and an attention-based ML model to include free-text.</p>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"194 ","pages":"105679"},"PeriodicalIF":3.7,"publicationDate":"2024-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142683779","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}